Query lcl|NC_019719.1_cdsid_YP_007112279.1 [gene=F848_gp03] [protein=portal protein] [protein_id=YP_007112279.1] [location=2059..3333]
Match_columns 424
No_of_seqs 145 out of 1019
Neff 9.5
Searched_HMMs 1612
Date Thu Nov 7 16:33:29 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_3 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_3_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:1884 Length: 424 # 100.0 4E-124 3E-127 696.9 46.8 424 1-424 1-424 (424)
2 protein:vir:189 Length: 424 # 100.0 5E-123 3E-126 690.9 46.8 424 1-424 1-424 (424)
3 protein:vir:4337 Length: 434 # 100.0 3E-101 2E-104 571.8 43.3 419 1-424 1-421 (434)
4 protein:vir:81072 Length: 432 100.0 4E-100 3E-103 565.3 44.2 410 8-424 1-421 (432)
5 protein:vir:100150 Length: 437 100.0 1E-99 6E-103 563.4 44.0 412 8-424 1-424 (437)
6 protein:vir:10362 Length: 432 100.0 1.5E-99 9E-103 562.4 44.3 410 8-424 1-421 (432)
7 protein:vir:4509 Length: 424 # 100.0 2.2E-99 1E-102 561.4 45.2 415 1-422 1-424 (424)
8 protein:vir:97060 Length: 432 100.0 2.4E-99 2E-102 561.2 44.8 410 8-424 1-421 (432)
9 protein:vir:105064 Length: 421 100.0 1.8E-99 1E-102 562.0 43.3 403 16-424 1-412 (421)
10 protein:vir:105002 Length: 432 100.0 1.4E-98 9E-102 557.0 45.0 406 14-424 1-423 (432)
11 protein:vir:107605 Length: 432 100.0 1.4E-98 9E-102 557.0 45.0 406 14-424 1-423 (432)
12 protein:vir:102855 Length: 432 100.0 1.4E-98 9E-102 557.0 45.0 406 14-424 1-423 (432)
13 protein:vir:5737 Length: 419 # 100.0 1.1E-98 7E-102 557.6 44.0 403 14-424 1-408 (419)
14 protein:vir:1431 Length: 419 # 100.0 1E-98 7E-102 557.7 43.1 402 15-424 1-407 (419)
15 protein:vir:4454 Length: 414 # 100.0 4.5E-98 3E-101 554.3 43.5 402 14-424 1-407 (414)
16 protein:vir:102080 Length: 429 100.0 6.3E-98 4E-101 553.5 43.8 406 14-424 1-420 (429)
17 protein:vir:81152 Length: 411 100.0 6.9E-98 4E-101 553.2 43.2 399 14-423 1-411 (411)
18
protein:vir:1266 Length: 416 # 100.0 2.2E-97 1E-100 550.5 42.2 399 15-424 1-406 (416) 19 protein:vir:80333 Length: 419 100.0 3.7E-97 2E-100 549.3 42.7 402 15-424 1-407 (419) 20 protein:vir:483 Length: 413 # 100.0 4.7E-97 3E-100 548.7 42.9 401 15-424 1-406 (413) 21 protein:vir:93610 Length: 454 100.0 4.1E-97 3E-100 549.0 42.4 401 16-424 1-415 (454) 22 protein:vir:1380 Length: 422 # 100.0 1.6E-96 1E-99 545.8 43.5 402 14-424 1-421 (422) 23 protein:vir:102118 Length: 409 100.0 1.5E-96 9E-100 545.9 42.5 398 16-424 1-408 (409) 24 protein:vir:100249 Length: 431 100.0 1.8E-96 1.1E-99 545.5 42.7 400 14-424 1-431 (431) 25 protein:vir:6240 Length: 457 # 100.0 2E-95 1.3E-98 539.7 43.3 406 14-424 1-425 (457) 26 protein:vir:1326 Length: 457 # 100.0 2.7E-95 1.7E-98 539.0 43.0 406 14-424 1-425 (457) 27 protein:vir:2683 Length: 412 # 100.0 5.5E-95 3.4E-98 537.3 42.9 401 8-424 1-406 (412) 28 protein:vir:98396 Length: 441 100.0 2.7E-94 1.7E-97 533.5 43.7 411 1-424 4-430 (441) 29 protein:vir:9408 Length: 441 # 100.0 4.1E-94 2.5E-97 532.6 43.6 411 1-424 4-437 (441) 30 protein:vir:79984 Length: 441 100.0 4.1E-94 2.5E-97 532.6 43.6 411 1-424 4-437 (441) 31 protein:vir:8418 Length: 409 # 100.0 7E-94 4.3E-97 531.3 44.2 397 14-424 1-404 (409) 32 protein:vir:93943 Length: 409 100.0 9.9E-94 6.1E-97 530.5 43.3 398 11-424 1-403 (409) 33 protein:vir:94426 Length: 409 100.0 2.1E-93 1.3E-96 528.6 42.9 398 11-424 1-403 (409) 34 protein:vir:96980 Length: 409 100.0 2.4E-93 1.5E-96 528.3 43.0 398 11-424 1-403 (409) 35 protein:vir:4598 Length: 416 # 100.0 2.1E-93 1.3E-96 528.7 42.2 394 14-424 1-412 (416) 36 protein:vir:81095 Length: 416 100.0 2.1E-93 1.3E-96 528.7 42.2 394 14-424 1-412 (416) 37 protein:vir:101648 Length: 518 100.0 3.9E-93 2.4E-96 527.2 43.2 396 21-424 1-423 (518) 38 protein:vir:7853 Length: 518 # 100.0 5E-93 3.1E-96 526.6 43.1 396 21-424 1-423 (518) 39 protein:vir:3868 Length: 417 # 100.0 4.9E-92 3E-95 521.2 41.1 390 21-424 1-400 (417) 40 protein:vir:81218 Length: 423 100.0 4.5E-91 
2.8E-94 515.9 43.7 403 14-422 1-423 (423) 41 protein:vir:9702 Length: 406 # 100.0 9.1E-91 5.6E-94 514.2 41.6 388 21-424 1-402 (406) 42 protein:vir:101647 Length: 460 100.0 1.6E-90 1E-93 512.8 41.4 404 16-424 1-459 (460) 43 protein:vir:8317 Length: 409 # 100.0 4.5E-88 2.8E-91 499.5 40.4 374 14-406 1-409 (409) 44 protein:vir:94666 Length: 723 100.0 1.4E-87 8.9E-91 496.7 39.8 380 28-424 1-408 (723) 45 protein:vir:95378 Length: 406 100.0 2.9E-86 1.8E-89 489.5 42.7 391 14-424 1-401 (406) 46 protein:vir:960 Length: 413 # 100.0 1.7E-86 1.1E-89 490.8 40.6 402 1-424 4-413 (413) 47 protein:vir:80134 Length: 403 100.0 2.6E-86 1.6E-89 489.7 40.8 388 14-424 1-398 (403) 48 protein:vir:102727 Length: 945 100.0 1.1E-85 7E-89 486.3 41.4 419 1-424 55-524 (945) 49 protein:vir:8100 Length: 466 # 100.0 7.2E-85 4.5E-88 481.9 41.6 401 14-424 1-466 (466) 50 protein:vir:6210 Length: 394 # 100.0 6E-84 3.7E-87 476.9 39.8 384 14-424 1-390 (394) 51 protein:vir:104259 Length: 403 100.0 2E-83 1.2E-86 474.0 40.4 386 14-424 1-399 (403) 52 protein:vir:9359 Length: 348 # 100.0 1.2E-83 7.3E-87 475.2 38.8 337 74-424 1-342 (348) 53 protein:vir:3843 Length: 397 # 100.0 5E-83 3.1E-86 471.8 39.4 380 14-424 1-387 (397) 54 protein:vir:100187 Length: 385 100.0 1.9E-82 1.2E-85 468.6 39.5 376 14-422 1-385 (385) 55 protein:vir:100882 Length: 383 100.0 2.1E-81 1.3E-84 462.9 39.8 376 14-417 1-383 (383) 56 protein:vir:80796 Length: 574 100.0 5.4E-81 3.4E-84 460.6 39.8 420 1-424 20-493 (574) 57 protein:vir:1082 Length: 359 # 100.0 5E-81 3.1E-84 460.8 38.0 352 14-396 1-359 (359) 58 protein:vir:7407 Length: 392 # 100.0 1.2E-79 7.3E-83 453.3 38.5 363 16-409 1-392 (392) 59 protein:vir:80644 Length: 551 100.0 6E-79 3.7E-82 449.4 40.9 409 10-424 1-516 (551) 60 protein:vir:95965 Length: 385 100.0 4.1E-79 2.6E-82 450.3 37.4 373 14-421 1-385 (385) 61 protein:vir:4854 Length: 386 # 100.0 5.8E-79 3.6E-82 449.5 37.6 373 14-424 1-385 (386) 62 protein:vir:101289 Length: 395 100.0 7.9E-79 4.9E-82 448.8 37.8 372 14-424 1-384 (395) 
63 protein:vir:9507 Length: 395 # 100.0 7.9E-79 4.9E-82 448.8 37.8 372 14-424 1-384 (395) 64 protein:vir:100650 Length: 395 100.0 7.9E-79 4.9E-82 448.8 37.8 372 14-424 1-384 (395) 65 protein:vir:100691 Length: 535 100.0 2.1E-78 1.3E-81 446.4 39.3 418 1-424 13-490 (535) 66 protein:vir:4995 Length: 384 # 100.0 6.3E-79 3.9E-82 449.3 35.1 367 14-402 1-384 (384) 67 protein:vir:94002 Length: 378 100.0 1.2E-78 7.6E-82 447.7 35.3 356 14-424 1-370 (378) 68 protein:vir:93867 Length: 378 100.0 2.5E-78 1.6E-81 446.0 35.9 356 14-424 1-370 (378) 69 protein:vir:1023 Length: 392 # 100.0 8.7E-78 5.4E-81 443.1 38.7 363 16-409 1-392 (392) 70 protein:vir:3989 Length: 392 # 100.0 8.7E-78 5.4E-81 443.1 38.7 363 16-409 1-392 (392) 71 protein:vir:63755 Length: 547 100.0 1.5E-77 9.3E-81 441.8 39.9 406 14-424 1-512 (547) 72 protein:vir:1661 Length: 378 # 100.0 3.6E-78 2.2E-81 445.2 36.3 356 14-424 1-370 (378) 73 protein:vir:4089 Length: 395 # 100.0 1.3E-77 7.9E-81 442.2 38.2 379 14-424 1-391 (395) 74 protein:vir:78310 Length: 376 100.0 4E-77 2.5E-80 439.4 36.6 366 14-423 1-376 (376) 75 protein:vir:96579 Length: 576 100.0 2.7E-76 1.7E-79 434.9 40.8 417 1-424 14-495 (576) 76 protein:vir:95599 Length: 563 100.0 1.4E-75 8.6E-79 431.0 40.9 414 1-424 14-523 (563) 77 protein:vir:99312 Length: 563 100.0 1.4E-75 8.6E-79 431.0 40.9 414 1-424 14-523 (563) 78 protein:vir:4194 Length: 540 # 100.0 4.8E-76 3E-79 433.5 37.4 391 1-424 1-441 (540) 79 protein:vir:98643 Length: 395 100.0 2.8E-76 1.7E-79 434.8 36.0 380 14-423 1-395 (395) 80 protein:vir:4156 Length: 542 # 100.0 1.2E-75 7.2E-79 431.4 38.6 390 16-424 1-443 (542) 81 protein:vir:4952 Length: 386 # 100.0 1.5E-75 9.3E-79 430.8 38.9 378 14-421 1-386 (386) 82 protein:vir:9641 Length: 395 # 100.0 2.8E-76 1.7E-79 434.8 34.8 377 14-423 1-395 (395) 83 protein:vir:3153 Length: 467 # 100.0 2.2E-75 1.3E-78 429.9 39.3 371 53-424 1-442 (467) 84 protein:vir:94869 Length: 378 100.0 8.7E-76 5.4E-79 432.1 36.8 356 14-424 1-370 (378) 85 protein:vir:858 Length: 378 # 
100.0 3E-75 1.9E-78 429.1 36.6 355 14-424 1-370 (378) 86 protein:vir:4828 Length: 382 # 100.0 2.3E-75 1.4E-78 429.8 35.0 369 14-421 1-382 (382) 87 protein:vir:99452 Length: 651 100.0 1.4E-70 8.5E-74 403.6 33.9 413 1-424 1-535 (651) 88 protein:vir:79772 Length: 648 100.0 1.4E-68 8.5E-72 392.6 38.9 408 1-424 1-491 (648) 89 protein:vir:78641 Length: 278 100.0 1.4E-62 8.4E-66 359.8 32.5 273 74-360 1-278 (278) 90 protein:vir:79150 Length: 368 100.0 1.2E-59 7.4E-63 343.6 28.9 347 8-376 1-368 (368) 91 protein:vir:103971 Length: 376 100.0 3.6E-57 2.2E-60 330.0 33.3 327 1-367 26-376 (376) 92 protein:vir:100328 Length: 346 100.0 3.2E-57 2E-60 330.3 31.9 323 8-365 1-346 (346) 93 protein:vir:267 Length: 348 # 100.0 1.2E-56 7.6E-60 327.1 31.7 332 1-371 1-348 (348) 94 protein:vir:79207 Length: 351 100.0 2.2E-56 1.4E-59 325.7 32.5 327 1-367 1-351 (351) 95 protein:vir:78191 Length: 351 100.0 3.2E-56 2E-59 324.8 32.4 327 1-367 1-351 (351) 96 protein:vir:98567 Length: 340 100.0 5.2E-56 3.2E-59 323.7 32.1 324 8-364 1-340 (340) 97 protein:vir:78749 Length: 337 100.0 8.9E-56 5.5E-59 322.4 29.3 322 1-361 1-337 (337) 98 protein:vir:1150 Length: 350 # 100.0 4E-55 2.5E-58 318.8 31.9 324 8-360 1-350 (350) 99 protein:vir:6058 Length: 344 # 100.0 6E-55 3.8E-58 317.8 32.2 325 8-365 1-344 (344) 100 protein:vir:5691 Length: 344 # 100.0 5.2E-55 3.2E-58 318.2 30.8 327 8-365 1-344 (344) 101 protein:vir:3743 Length: 345 # 100.0 3.3E-54 2E-57 313.8 33.0 327 1-362 1-345 (345) 102 protein:vir:2013 Length: 344 # 100.0 1.3E-54 8.2E-58 316.0 30.4 323 8-365 1-344 (344) 103 protein:vir:3780 Length: 345 # 100.0 7.5E-54 4.6E-57 311.8 31.0 331 1-362 1-345 (345) 104 protein:vir:4698 Length: 251 # 100.0 5.5E-51 3.4E-54 296.1 25.8 242 14-268 1-251 (251) 105 protein:vir:98853 Length: 219 100.0 1.7E-45 1.1E-48 266.0 22.9 208 153-364 1-219 (219) 106 protein:vir:5249 Length: 437 # 99.9 1.6E-26 9.8E-30 162.0 29.1 387 10-424 1-435 (437) 107 protein:vir:107742 Length: 537 99.9 2.2E-22 1.4E-25 139.3 29.9 404 1-424 41-533 
(537) 108 protein:vir:99853 Length: 488 99.9 1E-21 6.4E-25 135.6 29.0 395 1-424 1-409 (488) 109 protein:vir:94049 Length: 532 99.8 3.7E-21 2.3E-24 132.6 26.6 404 1-424 1-508 (532) 110 protein:vir:103860 Length: 528 99.8 8.8E-20 5.4E-23 125.1 32.1 392 17-424 1-443 (528) 111 protein:vir:99232 Length: 526 99.8 8E-20 5E-23 125.3 31.2 402 1-424 12-448 (526) 112 protein:vir:99563 Length: 862 99.8 3.1E-20 1.9E-23 127.5 28.6 401 1-424 50-553 (862) 113 protein:vir:108215 Length: 469 99.8 1.9E-19 1.2E-22 123.3 32.5 402 1-424 1-455 (469) 114 protein:vir:96068 Length: 765 99.8 3.9E-20 2.4E-23 127.0 28.5 397 1-424 30-533 (765) 115 protein:vir:79233 Length: 526 99.8 4.9E-19 3E-22 121.0 31.0 402 1-424 12-441 (526) 116 protein:vir:107662 Length: 427 99.8 3E-19 1.9E-22 122.1 26.3 373 8-421 1-427 (427) 117 protein:vir:79063 Length: 491 99.8 1.6E-17 1E-20 112.6 33.8 388 1-424 1-423 (491) 118 protein:vir:107880 Length: 491 99.8 1.4E-17 8.8E-21 113.0 32.6 387 1-424 15-423 (491) 119 protein:vir:79647 Length: 435 99.8 1.2E-18 7.2E-22 118.9 25.9 378 1-424 5-433 (435) 120 protein:vir:104338 Length: 422 99.8 1.9E-18 1.2E-21 117.7 26.9 373 10-422 1-422 (422) 121 protein:vir:1986 Length: 512 # 99.8 1.4E-17 8.5E-21 113.0 30.8 397 1-424 12-440 (512) 122 protein:vir:80040 Length: 461 99.7 6.3E-18 3.9E-21 114.9 27.0 396 1-423 1-461 (461) 123 protein:vir:77981 Length: 448 99.7 3.6E-16 2.2E-19 105.3 29.1 397 1-424 1-437 (448) 124 protein:vir:79511 Length: 448 99.7 6.5E-16 4E-19 103.8 30.0 402 1-424 5-441 (448) 125 protein:vir:389 Length: 530 # 99.7 2.5E-16 1.5E-19 106.1 27.3 413 8-424 1-530 (530) 126 protein:vir:79538 Length: 502 99.7 8.2E-16 5.1E-19 103.3 28.4 406 14-424 1-498 (502) 127 protein:vir:96738 Length: 505 99.6 5E-16 3.1E-19 104.5 26.6 412 1-424 1-502 (505) 128 protein:vir:3420 Length: 533 # 99.6 1E-14 6.4E-18 97.3 28.2 419 1-424 1-531 (533) 129 protein:vir:95254 Length: 488 99.6 2.7E-14 1.6E-17 95.0 30.2 409 1-424 1-471 (488) 130 protein:vir:98816 Length: 446 99.6 1.1E-14 6.8E-18 97.1 
27.9 373 6-400 1-446 (446) 131 protein:vir:95542 Length: 548 99.5 5.9E-15 3.7E-18 98.6 23.1 405 14-424 1-539 (548) 132 protein:vir:10321 Length: 495 99.5 1.3E-14 7.8E-18 96.8 24.5 405 8-424 1-491 (495) 133 protein:vir:78161 Length: 355 99.5 1.9E-13 1.2E-16 90.3 25.6 287 131-424 1-325 (355) 134 protein:vir:6382 Length: 553 # 99.5 1.5E-13 9.6E-17 90.8 24.9 408 1-424 1-549 (553) 135 protein:vir:105782 Length: 449 99.4 5.7E-14 3.6E-17 93.2 21.6 386 1-424 1-449 (449) 136 protein:vir:78589 Length: 695 99.4 2.2E-13 1.4E-16 90.0 23.5 395 1-424 60-546 (695) 137 protein:vir:101541 Length: 694 99.4 2.7E-13 1.6E-16 89.5 23.4 396 1-424 59-545 (694) 138 protein:vir:3648 Length: 695 # 99.4 2.4E-13 1.5E-16 89.8 23.1 395 1-424 60-546 (695) 139 protein:vir:106716 Length: 698 99.4 1.8E-13 1.1E-16 90.5 22.3 396 1-424 60-549 (698) 140 protein:vir:102426 Length: 631 99.2 5.8E-12 3.6E-15 82.2 18.4 405 8-424 1-516 (631) 141 protein:vir:106491 Length: 646 99.1 9.7E-11 6E-14 75.5 21.7 396 21-424 1-487 (646) 142 protein:vir:8654 Length: 629 # 99.1 5.6E-11 3.5E-14 76.8 20.3 413 1-424 1-508 (629) 143 protein:vir:99088 Length: 629 99.1 6E-11 3.7E-14 76.6 20.0 413 1-424 1-508 (629) 144 protein:vir:97900 Length: 639 99.0 1.8E-10 1.1E-13 74.0 19.1 405 1-424 1-514 (639) 145 protein:vir:107517 Length: 639 99.0 1.8E-10 1.1E-13 74.0 19.1 405 1-424 1-514 (639) 146 protein:vir:102602 Length: 456 99.0 1.6E-09 1E-12 68.7 22.4 391 8-424 1-455 (456) 147 protein:vir:105819 Length: 456 99.0 1.6E-09 1E-12 68.7 22.4 391 8-424 1-455 (456) 148 protein:vir:7987 Length: 456 # 98.9 3.2E-09 2E-12 67.1 21.1 393 8-424 1-455 (456) 149 protein:vir:106027 Length: 629 98.9 2.4E-09 1.5E-12 67.8 20.4 402 1-424 1-504 (629) 150 protein:vir:98444 Length: 434 98.9 2.3E-08 1.4E-11 62.5 25.4 357 46-424 1-431 (434) 151 protein:vir:104082 Length: 485 98.7 1E-07 6.3E-11 58.9 23.5 394 1-424 1-485 (485) 152 protein:vir:7768 Length: 484 # 98.7 1.4E-07 8.7E-11 58.2 24.5 396 1-424 1-479 (484) 153 protein:vir:38 Length: 496 # N 98.5 
3.4E-07 2.1E-10 56.0 24.4 385 16-421 1-496 (496) 154 protein:vir:5839 Length: 533 # 98.5 5.2E-07 3.3E-10 55.0 23.9 407 1-424 1-523 (533) 155 protein:vir:2427 Length: 485 # 98.4 8.3E-07 5.1E-10 53.9 23.8 391 1-424 8-482 (485) 156 protein:vir:98883 Length: 517 98.4 9.8E-07 6.1E-10 53.5 22.0 388 14-424 1-516 (517) 157 protein:vir:94742 Length: 409 98.3 1.4E-06 8.4E-10 52.7 28.6 347 8-396 1-409 (409) 158 protein:vir:1634 Length: 409 # 98.3 1.6E-06 9.9E-10 52.4 28.1 347 8-396 1-409 (409) 159 protein:vir:9751 Length: 422 # 98.3 1.9E-06 1.2E-09 51.9 24.9 360 8-423 1-422 (422) 160 protein:vir:1587 Length: 508 # 98.3 2E-06 1.2E-09 51.9 22.6 387 14-424 1-506 (508) 161 protein:vir:5961 Length: 503 # 98.3 2E-06 1.2E-09 51.8 31.9 394 1-424 20-499 (503) 162 protein:vir:2500 Length: 501 # 98.3 2E-06 1.3E-09 51.8 23.8 387 1-424 16-501 (501) 163 protein:vir:3028 Length: 500 # 98.3 2E-06 1.3E-09 51.8 23.2 391 14-424 1-499 (500) 164 protein:vir:9815 Length: 500 # 98.3 2E-06 1.3E-09 51.8 23.2 391 14-424 1-499 (500) 165 protein:vir:8184 Length: 474 # 98.2 2.4E-06 1.5E-09 51.4 22.5 403 1-424 2-473 (474) 166 protein:vir:4782 Length: 522 # 98.2 2.5E-06 1.6E-09 51.3 24.0 386 14-422 1-522 (522) 167 protein:vir:103219 Length: 201 98.2 7.6E-08 4.7E-11 59.6 11.9 181 227-422 1-201 (201) 168 protein:vir:80959 Length: 499 98.2 3.2E-06 2E-09 50.7 26.2 389 12-421 1-499 (499) 169 protein:vir:95806 Length: 440 98.2 3.4E-06 2.1E-09 50.6 23.7 389 6-424 1-435 (440) 170 protein:vir:4073 Length: 279 # 98.2 4.1E-08 2.5E-11 61.1 9.7 252 58-399 1-279 (279) 171 protein:vir:9568 Length: 410 # 98.1 4.5E-06 2.8E-09 49.9 27.8 349 8-420 1-410 (410) 172 protein:vir:4223 Length: 486 # 98.1 5.1E-06 3.2E-09 49.6 26.2 388 1-424 8-484 (486) 173 protein:vir:99916 Length: 504 98.1 5.4E-06 3.4E-09 49.5 27.5 403 1-424 6-496 (504) 174 protein:vir:2341 Length: 488 # 98.0 6.1E-06 3.8E-09 49.2 26.7 397 1-424 1-488 (488) 175 protein:vir:1236 Length: 483 # 98.0 7.7E-06 4.8E-09 48.6 26.3 387 1-424 20-480 (483) 176 
protein:vir:78227 Length: 480 97.9 1.1E-05 6.7E-09 47.8 23.9 389 10-424 1-473 (480) 177 protein:vir:96240 Length: 511 97.9 1.1E-05 6.7E-09 47.8 25.1 397 1-424 24-507 (511) 178 protein:vir:96494 Length: 501 97.9 1.2E-05 7.6E-09 47.5 24.1 406 1-424 1-496 (501) 179 protein:vir:96839 Length: 474 97.9 1.3E-05 8.3E-09 47.3 26.4 384 1-422 13-474 (474) 180 protein:vir:95113 Length: 474 97.9 1.4E-05 8.7E-09 47.2 28.7 382 1-424 20-472 (474) 181 protein:vir:4898 Length: 502 # 97.9 1.5E-05 9.1E-09 47.1 25.0 408 1-424 20-498 (502) 182 protein:vir:9306 Length: 511 # 97.8 1.5E-05 9.4E-09 47.0 23.6 397 1-424 26-507 (511) 183 protein:vir:103951 Length: 511 97.8 1.6E-05 1E-08 46.9 24.9 394 1-424 20-498 (511) 184 protein:vir:93747 Length: 472 97.8 1.6E-05 1E-08 46.8 28.4 387 1-424 11-468 (472) 185 protein:vir:97171 Length: 512 97.8 2.2E-05 1.4E-08 46.1 22.7 395 1-424 24-508 (512) 186 protein:vir:96366 Length: 511 97.7 2.6E-05 1.6E-08 45.8 22.4 397 1-424 20-507 (511) 187 protein:vir:78805 Length: 511 97.7 2.6E-05 1.6E-08 45.8 22.4 397 1-424 20-507 (511) 188 protein:vir:78537 Length: 480 97.7 2.8E-05 1.7E-08 45.5 23.5 386 10-424 1-473 (480) 189 protein:vir:94101 Length: 474 97.7 3E-05 1.9E-08 45.4 26.7 401 1-424 1-473 (474) 190 protein:vir:105889 Length: 474 97.7 3E-05 1.9E-08 45.4 26.7 401 1-424 1-473 (474) 191 protein:vir:94805 Length: 492 97.7 3.2E-05 2E-08 45.2 26.3 387 1-424 21-486 (492) 192 protein:vir:80680 Length: 441 97.6 4.2E-05 2.6E-08 44.6 28.2 373 1-424 1-433 (441) 193 protein:vir:79703 Length: 505 97.6 4.2E-05 2.6E-08 44.6 25.7 384 14-424 1-504 (505) 194 protein:vir:3964 Length: 453 # 97.6 4.4E-05 2.7E-08 44.5 24.4 394 1-424 2-452 (453) 195 protein:vir:99781 Length: 511 97.6 4.6E-05 2.9E-08 44.3 24.0 393 1-424 20-506 (511) 196 protein:vir:97447 Length: 474 97.5 5.4E-05 3.4E-08 44.0 27.6 386 1-424 5-470 (474) 197 protein:vir:94498 Length: 474 97.5 5.4E-05 3.4E-08 44.0 27.6 386 1-424 5-470 (474) 198 protein:vir:97336 Length: 492 97.5 5.6E-05 3.4E-08 43.9 27.1 387 1-424 
29-486 (492) 199 protein:vir:95899 Length: 474 97.5 6.4E-05 4E-08 43.6 25.5 386 1-424 9-471 (474) 200 protein:vir:96266 Length: 474 97.5 6.4E-05 4E-08 43.6 25.5 386 1-424 9-471 (474) 201 protein:vir:733 Length: 453 # 97.4 6.8E-05 4.2E-08 43.4 24.8 391 1-424 3-452 (453) 202 protein:vir:99522 Length: 470 97.4 7.2E-05 4.5E-08 43.3 26.6 382 1-422 11-470 (470) 203 protein:vir:78907 Length: 518 97.4 7.4E-05 4.6E-08 43.2 24.8 392 14-423 1-518 (518) 204 protein:vir:99072 Length: 479 97.4 7.5E-05 4.7E-08 43.2 26.4 394 1-424 2-470 (479) 205 protein:vir:79043 Length: 479 97.4 8.6E-05 5.3E-08 42.9 27.6 391 1-424 7-479 (479) 206 protein:vir:106639 Length: 481 97.3 8.8E-05 5.5E-08 42.8 26.1 399 1-423 21-481 (481) 207 protein:vir:107112 Length: 478 97.3 9.9E-05 6.2E-08 42.5 29.1 387 1-424 1-477 (478) 208 protein:vir:2732 Length: 501 # 97.2 0.00012 7.7E-08 42.0 27.9 395 1-424 41-493 (501) 209 protein:vir:3609 Length: 452 # 97.2 0.00013 8.1E-08 41.9 26.0 386 1-424 3-451 (452) 210 protein:vir:105292 Length: 478 97.2 0.00015 9.2E-08 41.6 27.1 387 1-424 1-477 (478) 211 protein:vir:96179 Length: 468 96.9 0.00029 1.8E-07 40.0 26.4 380 1-420 17-468 (468) 212 protein:vir:97376 Length: 320 96.6 7E-05 4.4E-08 43.4 10.3 306 14-403 1-320 (320) 213 protein:vir:106571 Length: 499 96.6 0.00048 3E-07 38.8 28.0 382 1-424 5-488 (499) 214 protein:vir:105461 Length: 470 96.6 0.00049 3E-07 38.7 23.8 377 8-421 1-470 (470) 215 protein:vir:9871 Length: 429 # 96.5 0.00055 3.4E-07 38.5 24.4 374 1-424 1-429 (429) 216 protein:vir:102950 Length: 471 96.4 0.00061 3.8E-07 38.2 23.5 376 8-420 1-471 (471) 217 protein:vir:78083 Length: 537 96.4 0.00063 3.9E-07 38.1 28.9 397 1-424 1-521 (537) 218 protein:vir:94546 Length: 506 96.3 0.00074 4.6E-07 37.8 21.4 396 1-424 6-498 (506) 219 protein:vir:104500 Length: 537 95.8 0.0015 9.3E-07 36.1 23.5 408 1-424 1-536 (537) 220 protein:vir:103177 Length: 533 95.6 0.0017 1.1E-06 35.7 20.4 399 17-424 1-513 (533) 221 protein:vir:104892 Length: 558 93.4 0.0078 4.8E-06 32.1 22.7 
399 17-424 1-541 (558) 222 protein:vir:9922 Length: 489 # 91.8 0.014 8.8E-06 30.7 25.1 394 1-424 2-489 (489) 223 protein:vir:7208 Length: 524 # 91.0 0.018 1.1E-05 30.1 21.3 385 8-401 1-524 (524) 224 protein:vir:103458 Length: 524 90.7 0.02 1.2E-05 29.9 21.3 385 8-401 1-524 (524) 225 protein:vir:106999 Length: 564 90.6 0.02 1.2E-05 29.9 21.7 406 1-424 1-549 (564) 226 protein:vir:94709 Length: 522 89.5 0.026 1.6E-05 29.3 20.9 371 1-424 1-475 (522) 227 protein:vir:106282 Length: 521 88.0 0.035 2.2E-05 28.6 22.9 402 8-424 1-519 (521) 228 protein:vir:94956 Length: 452 87.2 0.04 2.5E-05 28.2 25.1 374 8-424 1-450 (452) 229 protein:vir:98265 Length: 524 83.0 0.072 4.5E-05 26.8 26.7 385 8-401 1-524 (524) 230 protein:vir:108049 Length: 524 82.1 0.08 5E-05 26.6 22.8 383 10-401 1-524 (524) 231 protein:vir:101806 Length: 516 79.2 0.11 6.6E-05 25.9 22.9 402 8-424 1-515 (516) 232 protein:vir:101189 Length: 516 79.2 0.11 6.6E-05 25.9 22.9 402 8-424 1-515 (516) 233 protein:vir:5665 Length: 511 # 78.9 0.11 6.8E-05 25.9 22.4 402 8-423 1-511 (511) 234 protein:vir:100598 Length: 516 78.2 0.12 7.2E-05 25.7 23.5 404 8-424 1-515 (516) 235 protein:vir:95149 Length: 501 77.8 0.12 7.5E-05 25.6 27.9 394 1-424 1-495 (501) 236 protein:vir:6896 Length: 523 # 77.7 0.12 7.5E-05 25.6 21.5 381 8-401 1-523 (523) 237 protein:vir:102668 Length: 547 74.8 0.15 9.6E-05 25.0 17.7 352 8-424 1-486 (547) 238 protein:vir:81017 Length: 521 71.7 0.19 0.00012 24.5 23.9 402 10-424 1-520 (521) 239 protein:vir:102330 Length: 451 66.6 0.26 0.00016 23.8 27.0 363 24-422 1-451 (451) 240 protein:vir:80453 Length: 535 65.3 0.28 0.00018 23.6 23.1 393 1-424 1-527 (535) 241 protein:vir:6596 Length: 521 # 60.2 0.38 0.00023 22.9 24.6 402 10-424 1-520 (521) 242 protein:vir:95315 Length: 559 50.7 0.6 0.00037 21.8 17.4 356 1-424 1-483 (559) 243 protein:vir:100039 Length: 522 50.5 0.61 0.00038 21.8 19.6 364 8-424 1-463 (522) 244 protein:vir:107404 Length: 555 38.2 1.1 0.00067 20.4 19.2 357 1-424 1-492 (555) 245 protein:vir:107822 
Length: 555 38.2 1.1 0.00067 20.4 19.2 357 1-424 1-492 (555) 246 protein:vir:98506 Length: 555 38.2 1.1 0.00067 20.4 19.2 357 1-424 1-492 (555) 247 protein:vir:2198 Length: 536 # 37.1 1.1 0.0007 20.3 19.3 393 1-424 1-531 (536) 248 protein:vir:10447 Length: 536 33.1 1.4 0.00086 19.8 18.9 393 1-424 1-531 (536) 249 protein:vir:78696 Length: 542 23.3 2.3 0.0014 18.6 15.7 373 8-424 1-477 (542) 250 protein:vir:1538 Length: 535 # 20.8 2.7 0.0017 18.2 20.0 382 1-424 1-480 (535) 251 protein:vir:94572 Length: 535 20.2 2.8 0.0017 18.1 18.1 380 1-424 1-476 (535) No 1 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=100.00 E-value=4.3e-124 Score=696.95 Aligned_cols=424 Identities=100% Similarity=1.488 Sum_probs=410.8 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) ||||||||++|||+|||++|++||.+++...+...+..++++..++.++..|+.+.|+++++||+||++||++||++||+ T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~ 80 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) T ss_pred CCCCcceEeecCCCchHHHHHhhhcccccccccccccccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCceE Confidence 99999999999999999999999999888888888888888888888999999999999999999999999999999999 Q ss_pred EEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC Q lcl|NC_019719. 
81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +|+.++++..++...+||++++|+.+||++||+++||+.++.+++++||+|++++|+.+|.+++|||++|.+|++..+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~V~v~~~~~ 160 (424) T protein:vir:18 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) T ss_pred EEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcCC Confidence 99999988877777899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEecCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019719. 161 KVVYRYQRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 161 ~~~~~~~~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~ 240 (424) ...|.|..++..+.|+++||||+|+++.++++|+||+..++.++.++.+++++..++|+||++|++|++++.+..+++++ T Consensus 161 ~~~y~~~~~g~~~~~~~~eIih~r~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~ 240 (424) T protein:vir:18 161 KVVYRYQRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) T ss_pred eEEEEEEeCCeEEEeccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999887799999 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHH Q lcl|NC_019719. 
241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 241 ~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~ 320 (424) +++++.|++..++.|+|++++|++|++|++++++++|+||+|++++++++||++|||||++||..++++++++|+|++.+ T Consensus 241 ~~~~~~~~~~~~g~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~ 320 (424) T protein:vir:18 241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) T ss_pred HHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999888889999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCe Q lcl|NC_019719. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDV 400 (424) Q Consensus 321 ~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~ 400 (424) .|+++||.|++.+||++|+++|+++.++.+++++||+++++++|.+++++.+.+++++|+||+||+|+++|+||+||||+ T Consensus 321 ~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGD~ 400 (424) T protein:vir:18 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDV 400 (424) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Confidence 99999999999999999999999999888899999999999999999999999999999999999999999999999999 Q ss_pred eeecccccchhhccccCCCcccCC Q lcl|NC_019719. 
401 AMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 401 ~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) +++++|++|++.++++++++++|| T Consensus 401 ~~~~~n~~~l~~~~~~~~p~~~ga 424 (424) T protein:vir:18 401 AMRQSQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred eeeccCccchHhhhccCCCccCCC Confidence 999999999999999999999999 No 2 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=100.00 E-value=5.3e-123 Score=690.92 Aligned_cols=424 Identities=98% Similarity=1.475 Sum_probs=409.6 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) ||||||||+++||+|||++|++||.++....+.......++...++.++..++.+.|+++++|++||++||++||++||+ T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~ 80 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) T ss_pred CCCCccccccCCCCchHHHHHhhccccccccccchhhccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCceE Confidence 99999999999999999999999999888888777777788888888899999999999999999999999999999999 Q ss_pred EEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC Q lcl|NC_019719. 
81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +|+.+++|..++...+||++++|+.+||++||+++||+.++.+++++||+|++++|+..|++++|||++|.+|++..+++ T Consensus 81 vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~ 160 (424) T protein:vir:18 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) T ss_pred EEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcCC Confidence 99999988877777899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEecCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019719. 161 KVVYRYQRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 161 ~~~~~~~~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~ 240 (424) ..+|.|..++..+.|+++||||+|+++.++++|+||+..++.++.++.+++++..++|+||++|+++++++....+++++ T Consensus 161 ~~~y~~~~~g~~~~~~~~eVihir~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~ 240 (424) T protein:vir:18 161 KVVYRYQRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) T ss_pred eEEEEEEeCCeEEEeccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999887789999 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHH Q lcl|NC_019719. 
241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 241 ~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~ 320 (424) +++++.|+++.++.|+|++++|++|++|+++++++.|+||+|++++++++||++|||||++||..++++++++|.|++.+ T Consensus 241 ~~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~ 320 (424) T protein:vir:18 241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) T ss_pred HHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999988889999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCe Q lcl|NC_019719. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDV 400 (424) Q Consensus 321 ~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~ 400 (424) .|+++||.|++++||++|+++|+++.++.+++++||+++++++|.+++++.+++++++|+||+||+|+++|+||+||||+ T Consensus 321 ~f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~ 400 (424) T protein:vir:18 321 GFLQYTLQPYISRWENSIQRWLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGGDV 400 (424) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Confidence 99999999999999999999999999888899999999999999999999999999999999999999999999999999 Q ss_pred eeecccccchhhccccCCCcccCC Q lcl|NC_019719. 
401 AMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 401 ~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) +++++|++|++.++++++++++|| T Consensus 401 ~~~~~n~~~l~~~~~~~~~~~n~a 424 (424) T protein:vir:18 401 AMRQAQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred eeeccCccchhhhhccCCccccCC Confidence 999999999999999999999999 No 3 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=100.00 E-value=2.9e-101 Score=571.76 Aligned_cols=419 Identities=31% Similarity=0.516 Sum_probs=355.0 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |-+--..+--....+.-+.+.+|..... ..........+.+..+.++..|+.+.++++++|++||++||++||++||+ T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~g~~~~~g~~v~~~~al~~~~V~~~i~~ia~~ia~lp~~ 78 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSSLFGWGGKTI--RLTDGAFWSQFLGRESSSGKKVTVDKAMKLSAVWACVRLISTSVAGLPLG 78 (434) T ss_pred Cccchhhhhhhcccccchhhhccccccc--ccCchHHHHHHhcCCccCCceechhhhhccHHHHHHHHHHHHhhhhCceE Confidence 3222111111111111111111110000 00111112233344556788899999999999999999999999999999 Q ss_pred EEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC Q lcl|NC_019719. 
81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +|+++.+|...+ ..+||++++|+.+||++||+++||+.++.+++++||+|+++.++ .|++++|+||+|.+|++..+.+ T Consensus 79 ~~~~~~~g~~~~-~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-~G~~~~L~~l~p~~v~~~~~~~ 156 (434) T protein:vir:43 79 VYERKADGSRVD-ARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-AGRPAALDFLLPSRVDLECDEN 156 (434) T ss_pred EEEEcCCCcccc-ccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEcCC Confidence 999988776544 57899999999999999999999999999999999999999887 6999999999999999988755 Q ss_pred ce-EE-EEEecCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH Q lcl|NC_019719. 161 KV-VY-RYQRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 161 ~~-~~-~~~~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~ 238 (424) +. .| .+..++..+.|+++||||+|+++.|+++|+||+..+..++.+..+++++..++|+||++|+++++++..+ +++ T Consensus 157 g~~~y~~~~~~g~~~~~~~~eVih~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l-~~e 235 (434) T protein:vir:43 157 GRLKYFYTTKKGARREIERTNMLHIPAFTLDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRIL-QPA 235 (434) T ss_pred CeEEEEEEecCceEEEEccccEEEecCcCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCC-CHH Confidence 43 33 3445567789999999999999999999999999999999999999999999999999999999998776 677 Q ss_pred HHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH Q lcl|NC_019719. 
239 QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ 318 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~ 318 (424) +.++++++|+++.++.|+|+++++++|++|+++++++.|+||+|++++++++||++|||||++||..+.++.+++|.|++ T Consensus 236 ~~~~~r~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~ 315 (434) T protein:vir:43 236 QREEFREYVKSVSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQ 315 (434) T ss_pred HHHHHHHHHHHhcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHH Confidence 88889999999999999999999999999999999999999999999999999999999999999988888888899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019719. 319 NLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 319 ~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~g 398 (424) .+.|+++||.|++.+||++||++|+++.++.+++++||+++++++|.+++++.+.+++++|+||+||+|+++|+||+||| T Consensus 316 ~~~f~~~~L~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~~gg 395 (434) T protein:vir:43 316 MLAFLTFSISSITNQIQQCVNKRLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPELPGG 395 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 99999999999999999999999999988778899999999999999999999999999999999999999999999999 Q ss_pred CeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 
399 DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |++++|+|++|++..++++.++.--+ T Consensus 396 D~~~~~~n~~~~~~~~~~~~~~~~~~ 421 (434) T protein:vir:43 396 DILTVQSNLVPIDQLGQSNKSQAVRA 421 (434) T ss_pred CeEeeccCccchhhhhccCCCcchhh Confidence 99999999999988876554443211 No 4 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=100.00 E-value=4.3e-100 Score=565.33 Aligned_cols=410 Identities=30% Similarity=0.506 Sum_probs=358.9 Q ss_pred ccCCCCCchHHHHHhhccCcccCcccccccc-------cccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQT-------GPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |-=|+.=|||+|++.+|.+........+... ..+....+.++..|+.+.|+++++||+||++||++||++|++ T Consensus 1 ~~~~~~mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~ 80 (432) T protein:vir:81 1 MPDEKKLGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLT 80 (432) T ss_pred CCchhhcchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHhhhhCcee Confidence 7778889999999999887654322211111 122333455788899999999999999999999999999999 Q ss_pred EEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC Q lcl|NC_019719. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +|++..+|.. 
...+||++++|+.+||++||+++||+.++.+++++||||++++++ +|++.+||||+|..|++..+.+ T Consensus 81 ~y~~~~~g~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l~~~~v~v~~~~~ 157 (432) T protein:vir:81 81 MYMRTPDGRK--EAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDPK 157 (432) T ss_pred eEEecCCcce--ecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEECCC Confidence 9999877643 346799999999999999999999999999999999999999987 5999999999999999998754 Q ss_pred c-eEEEE-EecCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH Q lcl|NC_019719. 161 K-VVYRY-QRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 161 ~-~~~~~-~~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~ 238 (424) + ..|.+ ..++....|+++||||+|+++.|+++|+||+..+..++.++.+++++..++|+||++|+++++++..+ +++ T Consensus 158 g~~~y~~~~~~g~~~~~~~~~iih~r~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l-~~e 236 (432) T protein:vir:81 158 GNTAYRYRRTDGQMIDIPKQQIWKIMGYSLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFL-TDD 236 (432) T ss_pred CcEEEEEEecCceEEEEccccEEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCC-CHH Confidence 3 44554 44667789999999999999999999999999999999999999999999999999999999998776 666 Q ss_pred HHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCc-cchhHHH Q lcl|NC_019719. 
239 QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS-WGSGIEQ 317 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~-~~~n~e~ 317 (424) +++++++.+ .+..|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..+.+++ +++|+|+ T Consensus 237 ~~~~~~~~~---~~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq 313 (432) T protein:vir:81 237 QYDSFAKKV---SGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES 313 (432) T ss_pred HHHHHHHHH---hhhhcCCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHH Confidence 666665544 466788999999999999999999999999999999999999999999999999887665 4578999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Q lcl|NC_019719. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 318 ~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~ 397 (424) +.+.|+++||.|++++||++|+++|+++.++.+++++||+++|+++|.+++++.+++++++||||+||+|+++|+||+|| T Consensus 314 ~~~~f~~~tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g 393 (432) T protein:vir:81 314 QQLGFLTMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG 393 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 99999999999999999999999999998887899999999999999999999999999999999999999999999998 Q ss_pred CC-eeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 
398 GD-VAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 gd-~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |+ .+++++|++|++..+.+..+++.++ T Consensus 394 ~~~~~~~~~~~~pl~~~~~~~~~~~~~~ 421 (432) T protein:vir:81 394 NAAVLTVQSAMVPLDSIGLQASPEPASG 421 (432) T ss_pred CcceEeecCcccchhhhccCCCCCCCCC Confidence 64 5568999999988877666554444 No 5 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=100.00 E-value=9.7e-100 Score=563.42 Aligned_cols=412 Identities=33% Similarity=0.585 Sum_probs=356.1 Q ss_pred cc--CCCCCchHH-HHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_019719. 8 ID--LRTNNGWWA-RLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFET 84 (424) Q Consensus 8 ~~--~~~~~G~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~ 84 (424) |+ +++.-+-+. .+++||.... ..........+.......+..|+.+.|+++++||+||++||++||++||++|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~~~--s~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~lp~~~~~~ 78 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGVPI--SLTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATLPLNLYQT 78 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCCcc--cCCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhhCceeEEEE Confidence 65 444444332 3456664432 222233334455666678888999999999999999999999999999999999 Q ss_pred cccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc-eE Q lcl|NC_019719. 85 DQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-VV 163 (424) Q Consensus 85 ~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~-~~ 163 (424) +.+|... ...+|++.++|+.+||++||+++||+.++.+++++||+|++++|+ .|.+++|||++|..|++..+.++ .. 
T Consensus 79 ~~~g~~~-~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~-~g~~~~L~~l~p~~v~i~~~~~g~~~ 156 (437) T protein:vir:10 79 KPDGTRV-LAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRS-AGVLIGLELMLPQRTTVKRLTSGALQ 156 (437) T ss_pred cCCCcee-eccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEECCCCeEE Confidence 8877544 457899999999999999999999999999999999999999999 59999999999999999886543 34 Q ss_pred EEEE-ecCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHH Q lcl|NC_019719. 164 YRYQ-RDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQ 242 (424) Q Consensus 164 ~~~~-~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~ 242 (424) |.|. .++....|+++||||+|+++.|+++|+||+..+..++.+..+++++..++|+||++|+++|+.+..+ ++++.++ T Consensus 157 y~~~~~~g~~~~~~~~dIih~r~~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l-~~e~~~~ 235 (437) T protein:vir:10 157 YTYRNVDGTVSTLAEDDVFHVRGFSLDGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQIL-QKEKRAE 235 (437) T ss_pred EEEEecCceEEEEccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCC-CHHHHHH Confidence 5443 4667788999999999999999999999999999999999999999999999999999999998776 6677788 Q ss_pred HHHHHHHHh-CCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHH Q lcl|NC_019719. 243 VEENFKEIA-GGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG 321 (424) Q Consensus 243 ~~~~~~~~~-~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~ 321 (424) +++.|++.+ |..|+|+++++++|++|+++++++.|+||+|++++++++||++|||||.+||+.++++++++|+|++.+. 
T Consensus 236 ~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~ 315 (437) T protein:vir:10 236 IRTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLG 315 (437) T ss_pred HHHHHHHHhcCccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHH Confidence 888887755 5578999999999999999999999999999999999999999999999999999988888899999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCee Q lcl|NC_019719. 322 FLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVA 401 (424) Q Consensus 322 ~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~ 401 (424) |+++||.|++.+||++|+++||++.++..++++||+++++++|.+++++.+++++++|+||+||+|+++|+||+||||++ T Consensus 316 f~~~tl~P~~~~ie~~l~~kll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg~~~ 395 (437) T protein:vir:10 316 FLTFTLRPWLTRIEQAARRSLLRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMGGNAAV 395 (437) T ss_pred HHHHHHHHHHHHHHHHHHhhccCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcce Confidence 99999999999999999999999988877889999999999999999999999999999999999999999999987765 Q ss_pred -eecccccchhhccccCCCccc-----CC Q lcl|NC_019719. 
402 -MRQSQYVPITDLGTNKEPRNN-----GA 424 (424) Q Consensus 402 -~~~~n~~~~~~~~~~~~~~~~-----ga 424 (424) ++++|++|++..+++..+..+ |+ T Consensus 396 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (437) T protein:vir:10 396 LTVQSALLPIDKLGEHTTATAAQDALKAW 424 (437) T ss_pred EeecCcccchhhccCcCCCcchhcccccc Confidence 479999999876654322111 00 No 6 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=100.00 E-value=1.5e-99 Score=562.44 Aligned_cols=410 Identities=30% Similarity=0.505 Sum_probs=355.0 Q ss_pred ccCCCCCchHHHHHhhccCcccCccccccccc-------ccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTG-------PVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |-=-.--|+|+|+++.|.++.+..-....... .+....+..+..|+.+.|+++++||+||++||++||++||+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~ 80 (432) T protein:vir:10 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLT 80 (432) T ss_pred CCCCcccchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHHHHhhhhCcee Confidence 22222349999999999886543221111111 12333455788899999999999999999999999999999 Q ss_pred EEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC Q lcl|NC_019719. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +|+++.+|.. 
...+||++++|+.+||++||+++||+.++.+++++||||++++|+ +|++.+||||+|.+|++..+.+ T Consensus 81 ~y~~~~~g~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~~~~v~v~~~~~ 157 (432) T protein:vir:10 81 MYMRTPDGRK--EAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTK 157 (432) T ss_pred EEEecCCCcc--cccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEEcCC Confidence 9999887743 356799999999999999999999999999999999999999997 5999999999999999998654 Q ss_pred -ceEEEE-EecCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH Q lcl|NC_019719. 161 -KVVYRY-QRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 161 -~~~~~~-~~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~ 238 (424) ...|.+ ..++..+.|+++||||+|+++.++++|+||+..+..++.++.+++++..++|+||++|++|++++..+ +++ T Consensus 158 g~~~y~~~~~~g~~~~~~~~~iih~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l-~~e 236 (432) T protein:vir:10 158 GNTAYRYRRTDGQMIDIPKQQIWKIMGYSLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFL-TDD 236 (432) T ss_pred CcEEEEEEecCceEEEEcCccEEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCC-CHH Confidence 344544 34667789999999999999999999999999999999999999999999999999999999998776 666 Q ss_pred HHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCc-cchhHHH Q lcl|NC_019719. 
239 QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS-WGSGIEQ 317 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~-~~~n~e~ 317 (424) +++++++.+ .+..|+|++++|++|++|++++++++|+||+|++++++++||++|||||++||..+.+++ +++|.|+ T Consensus 237 ~~~~~~~~~---~~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~ 313 (432) T protein:vir:10 237 QYDSFAKKV---SGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES 313 (432) T ss_pred HHHHHHHHH---hhhhhCCCceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHH Confidence 666666554 456788999999999999999999999999999999999999999999999999887655 4578999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Q lcl|NC_019719. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 318 ~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~ 397 (424) +.+.|+++||.|++++||++|+++|+++.++.+++++||+++++++|.+++++.+++++++||||+||+|+++|+||++| T Consensus 314 ~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~g 393 (432) T protein:vir:10 314 QQLGFLSMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG 393 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 99999999999999999999999999998887899999999999999999999999999999999999999999999998 Q ss_pred CC-eeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 
398 GD-VAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 gd-~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |+ .+++++|++|++..+++..+++.++ T Consensus 394 ~~~~~~~~~~~~pl~~~~~~~~~~~~~~ 421 (432) T protein:vir:10 394 NAAVLTVQSAMVPLDSIGLQASPEPASG 421 (432) T ss_pred CcceEeecCcccchhhhcccCCCCCCCC Confidence 75 4558999999998877665555444 No 7 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=100.00 E-value=2.2e-99 Score=561.45 Aligned_cols=415 Identities=23% Similarity=0.378 Sum_probs=352.2 Q ss_pred CCCCcccccCCCC-CchHHHHHhhccCcccCccccccccccccc-ccccCcccccHHHHhhhHHHHHHHHHHHHhhccCc Q lcl|NC_019719. 1 MEEPKYTIDLRTN-NGWWARLQSWFVGGRLVTPNQGSQTGPVSA-HGHLGDSSINDERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~~~~~~~~~~~-~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) |----.| .|-. .|..--+.++|++....+|........+.. ....++..|+.+.|+++++|++||++||++||++| T Consensus 1 ~~~~~~~--~~~~~~~~~~~~~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~al~~~~v~~cv~~Ia~~iA~lp 78 (424) T protein:vir:45 1 MLYCWWA--HWLWPEGGRVLLDALFRSKSLENPSTPITGDAVDTDGLFRADVYVSPETAMKLAAVYSCIYVLSSSLAQMP 78 (424) T ss_pred CeeEeee--ceecCcchhHHHHhhccccCCCCCccccchhhhhhhccccCCceechHHhhccHHHHHHHHHHHHHHhhCc Confidence 2211111 1222 233333333455544444443322222222 22345778999999999999999999999999999 Q ss_pred eEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEc Q lcl|NC_019719. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~ 158 (424) +++|++.+++ . 
+...+||++++|+.+||++||+++||+.++.+++++||+|++++|+..|.+++|+|++|..|++..+ T Consensus 79 ~~v~~~~~~~-~-~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~~~ 156 (424) T protein:vir:45 79 LHVMRRHKGK-V-EPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMNT 156 (424) T ss_pred eEEEEecCCc-e-eecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEEc Confidence 9999886433 3 3456899999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCceEEEEEecCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH Q lcl|NC_019719. 159 GKKVVYRYQRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~ 238 (424) ++...|.+...+....|+++||||+|+++.++++|+||+..+..++.++.++++++.++|+||++|++|++++..+ +++ T Consensus 157 ~~~~~y~~~~~~~~~~~~~~eVih~r~~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l-~~e 235 (424) T protein:vir:45 157 GGRYTYGLYNEYGAFAISPDDMIHIRALGNNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGL-NKE 235 (424) T ss_pred CCeEEEEEEecCceEEECcccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCC-CHH Confidence 9888888888877889999999999999999999999999999999999999999999999999999999999876 778 Q ss_pred HHHHHHHHHHHHhCC--cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH Q lcl|NC_019719. 
239 QRSQVEENFKEIAGG--PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 239 ~~~~~~~~~~~~~~~--~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) +.+++++.|++.+++ +|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||+.++++ ++|+| T Consensus 236 ~~~~~~~~~~~~~~g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e 313 (424) T protein:vir:45 236 SWGWLKDQWQKASQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKAT--FSNIS 313 (424) T ss_pred HHHHHHHHHHHHhccccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHH Confidence 888898888876654 58999999999999999999999999999999999999999999999999887665 45999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccCccccc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC Q lcl|NC_019719. 317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPL 395 (424) Q Consensus 317 ~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~ 395 (424) ++.++|+++||.|++.+||++||++|+++.++. +++++||+++++++|.+++++.+++++++|+||+||+|+++|+||+ T Consensus 314 q~~~~f~~~tL~P~~~~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi 393 (424) T protein:vir:45 314 AQAIQFVRYTMMPWVTNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPV 393 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 999999999999999999999999999988864 6899999999999999999999999999999999999999999999 Q ss_pred CCCCeeeecccccchhhcccc----CCCccc Q lcl|NC_019719. 396 PGGDVAMRQSQYVPITDLGTN----KEPRNN 422 (424) Q Consensus 396 ~~gd~~~~~~n~~~~~~~~~~----~~~~~~ 422 (424) ||||++++|+|+.+....... 
++.+++ T Consensus 394 ~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~ 424 (424) T protein:vir:45 394 EGLDEMLVSVNAANPAGDFKPPKNDEGKTNE 424 (424) T ss_pred CCcceeeecccccccccccCCCCCCCCCCCC Confidence 999999999999875432111 111111 No 8 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=100.00 E-value=2.4e-99 Score=561.23 Aligned_cols=410 Identities=29% Similarity=0.505 Sum_probs=354.1 Q ss_pred ccCCCCCchHHHHHhhccCcccCcccccccc-------cccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQT-------GPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |-=-.--|||+|++++|.+.....-...... ..+....+.+|..|+.+.|+++++||+||++||++||++||+ T Consensus 1 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~~~v~~Ia~~ia~lp~~ 80 (432) T protein:vir:97 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAVAAMPLM 80 (432) T ss_pred CCCcccCchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHhhhcchHHHHHHHHHHHhhccCceE Confidence 2212223999999999987654322111111 112333456788899999999999999999999999999999 Q ss_pred EEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC Q lcl|NC_019719. 
81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +|++..+|..+ ..+||++++|+.+||++||+++||+.++.+++++||||++++++ +|++.+||||+|.+|++..+.+ T Consensus 81 ~y~~~~~g~~~--~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~p~~v~v~~~~~ 157 (432) T protein:vir:97 81 MYMRTPDGRKE--AVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTK 157 (432) T ss_pred EEEecCCCccc--ccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEEcCC Confidence 99998877433 46899999999999999999999999999999999999999997 5999999999999999988654 Q ss_pred -ceEEEEE-ecCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH Q lcl|NC_019719. 161 -KVVYRYQ-RDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 161 -~~~~~~~-~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~ 238 (424) ...|.+. .++....|+++||||+|+++.|+++|+||+..+..++.+..+++++..++|+||++|++|++++..+ +++ T Consensus 158 g~~~y~~~~~~g~~~~~~~~~iih~r~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~l-~~e 236 (432) T protein:vir:97 158 GNTAYRYRRTDGQMIDIPRQQIWKIMGYSLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFL-TDD 236 (432) T ss_pred CcEEEEEEecCceEEEEccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCCC-CHH Confidence 4455544 4567788999999999999999999999999999999999999999999999999999999999876 566 Q ss_pred HHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCc-cchhHHH Q lcl|NC_019719. 239 QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS-WGSGIEQ 317 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~-~~~n~e~ 317 (424) +++++++. 
+.+..|+|++++|++|++|++++++++|+||+|++++++++||++|||||++||..+.+++ +++|+|+ T Consensus 237 ~~~~~~~~---~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~ 313 (432) T protein:vir:97 237 QYDSFSKK---VSGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES 313 (432) T ss_pred HHHHHHHH---HhhhhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHH Confidence 66655544 4466788999999999999999999999999999999999999999999999998877655 4578999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Q lcl|NC_019719. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 318 ~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~ 397 (424) +.+.|+++||.|++++||++|+++|+++.++.+++++||+++|+++|.+++++.+.+++++||||+||+|+++|+||++| T Consensus 314 ~~~~f~~~tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~g 393 (432) T protein:vir:97 314 QQLGFLTMTLSPWLRRIEQSIALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG 393 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 99999999999999999999999999998877889999999999999999999999999999999999999999999998 Q ss_pred CCe-eeecccccchhhccccCCCcccCC Q lcl|NC_019719. 398 GDV-AMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 gd~-~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |+. 
++++.|++|++..+.+..++++++ T Consensus 394 ~~~~~~~~~~~~pl~~~~~~~~~~~~~~ 421 (432) T protein:vir:97 394 NAAVLTVQSAMVPLDSIGLQASPEPASG 421 (432) T ss_pred CcceEeecccccchhhhcccCCCCCCCC Confidence 765 458999999998877665555444 No 9 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=100.00 E-value=1.8e-99 Score=562.00 Aligned_cols=403 Identities=26% Similarity=0.424 Sum_probs=346.2 Q ss_pred hHHHHHhhccCcccCccccccccc---ccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcccc Q lcl|NC_019719. 16 WWARLQSWFVGGRLVTPNQGSQTG---PVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 16 ~~~~l~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~ 92 (424) || +..+|++...+......+.. .+....+.+|..|+++.++++++||+||++||++||++||++|+++++|+.+ T Consensus 1 m~--~~~~~~~~~~~~s~~~~w~~~~~~~~~~~~~~g~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~g~~~- 77 (421) T protein:vir:10 1 MF--IPQMFEGKKRSVSGGGFWEAMLGGVRSSHSKAGVMITPETALALSAVRACVTLLAESVAQLPVELYRRDKNGGRQ- 77 (421) T ss_pred CC--CcchhcccccccCcchhhHHHhhhhccCcccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCcee- Confidence 22 23334433322111111111 1223344567889999999999999999999999999999999998887654 Q ss_pred ccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEEecCce Q lcl|NC_019719. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEY 172 (424) Q Consensus 93 ~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~ 172 (424) ...+||++++|+.+||++||+++||+.++.+++++||||++++|+.+|+|++||||+|.+|++..++++..|.+ ..... 
T Consensus 78 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~g~~~y~-~~~~g 156 (421) T protein:vir:10 78 RATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGPDGMPYYE-IPEIG 156 (421) T ss_pred ecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECCCceEEEE-EcCCC Confidence 45689999999999999999999999999999999999999999999999999999999999988876654332 33334 Q ss_pred EEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCC---CCHHHHHHHHHHHHH Q lcl|NC_019719. 173 ADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV---LTEQQRSQVEENFKE 249 (424) Q Consensus 173 ~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~---~~~~~~~~~~~~~~~ 249 (424) ..+++++|||+|+++.++++|+||+..+..++.+..++++++.++|+||++|+|+|+.+... .++++.+++++.|++ T Consensus 157 ~~~~~~eiih~~~~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~ 236 (421) T protein:vir:10 157 ETLPMRMMHHVKVFSLDGYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTD 236 (421) T ss_pred cEEchhhEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHH Confidence 57999999999999999999999999999999999999999999999999999999987654 377888888888888 Q ss_pred HhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHH Q lcl|NC_019719. 250 IAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ 328 (424) Q Consensus 250 ~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~ 328 (424) .+++ .|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..+.++ ++|+|++.+.|+++||. 
T Consensus 237 ~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e~~~~~f~~~tl~ 314 (421) T protein:vir:10 237 RYSGINNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKAT--NNNIEHQGLQFVMYTLL 314 (421) T ss_pred HhcCccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCc--cccHHHHHHHHHHHHHH Confidence 7654 68899999999999999999999999999999999999999999999999887765 45999999999999999 Q ss_pred HHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccccc Q lcl|NC_019719. 329 PYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYV 408 (424) Q Consensus 329 P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~ 408 (424) |++.+||++||++|+++.++.+++++||++.++++|.+++++.+++++++|+||+||+|+++|+||+||||++++|+|++ T Consensus 315 P~~~~ie~~ln~kL~~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~ 394 (421) T protein:vir:10 315 AWLKRHEGALQRDLLLPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPIAGGDKYLTPLNMV 394 (421) T ss_pred HHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccc Confidence 99999999999999999888788999999999999999999999999999999999999999999999999999999999 Q ss_pred chhhccccC--CCcccCC Q lcl|NC_019719. 409 PITDLGTNK--EPRNNGA 424 (424) Q Consensus 409 ~~~~~~~~~--~~~~~ga 424 (424) +++.....+ +..+.+| T Consensus 395 ~~~~~~~~~~~~~~~~~~ 412 (421) T protein:vir:10 395 DSAQIIPGDKKPTAQQMA 412 (421) T ss_pred cccccccCCCCcccccCc Confidence 887654322 2222333 No 10 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=100.00 E-value=1.4e-98 Score=557.04 Aligned_cols=406 Identities=23% Similarity=0.394 Sum_probs=352.8 Q ss_pred CchHHHHHhhccCcccCcccccccc---ccc-c-cccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccC Q lcl|NC_019719. 
14 NGWWARLQSWFVGGRLVTPNQGSQT---GPV-S-AHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~---~~~-~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~ 88 (424) =|||+|++++|...+...+...... ..+ . .....++..++.+.++++++||+||++||++||++||++|++++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 6999999998864433222111111 111 1 1123456778999999999999999999999999999999997766 Q ss_pred ccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc------- Q lcl|NC_019719. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK------- 161 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~------- 161 (424) .. ...+|++.++|+.+||++||+++||+.++.+++++||+|++++|+..|++++|||++|.+|++..++.. T Consensus 81 ~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~ 158 (432) T protein:vir:10 81 IQ--RGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTK 158 (432) T ss_pred ee--eccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccce Confidence 43 357899999999999999999999999999999999999999999999999999999999999876432 Q ss_pred eEEEEEecCceEEecHhHeeEeccC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019719. 162 VVYRYQRDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 162 ~~~~~~~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~ 240 (424) .+|.+..++....|+++||||+|+. +.++++|+||+..+..++.+..++++++.++|+||+.|+++|+.+..+ ++++. 
T Consensus 159 ~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l-~~e~~ 237 (432) T protein:vir:10 159 MWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDL-NEDAK 237 (432) T ss_pred EEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCC-CHHHH Confidence 3566777788889999999999964 678999999999999999999999999999999999999999998876 66777 Q ss_pred HHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHH Q lcl|NC_019719. 241 SQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN 319 (424) Q Consensus 241 ~~~~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~ 319 (424) +++++.|++.+++ .|+|+++++++|++|+++++++.|+||+|.+++++++||++|||||++||..++++ ++|+|++. T Consensus 238 ~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~s~~e~~~ 315 (432) T protein:vir:10 238 KVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQ 315 (432) T ss_pred HHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHH Confidence 8888888886655 78999999999999999999999999999999999999999999999999877765 45899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccCccccc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019719. 320 LGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 320 ~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~g 398 (424) ++|+++||+|++.+||++|+++|+++.++. 
+++++||+++|+++|.+++++.+++++++|++|+||+|+++|+||+||| T Consensus 316 ~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~gg 395 (432) T protein:vir:10 316 QQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGG 395 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 999999999999999999999999988864 6889999999999999999999999999999999999999999999999 Q ss_pred Ceeeecccccchhhcccc--CCCcccCC Q lcl|NC_019719. 399 DVAMRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~--~~~~~~ga 424 (424) |++++|+|++|++.+++. +++++++- T Consensus 396 D~~~~~~n~~~~~~~~~~~~k~~~~~~~ 423 (432) T protein:vir:10 396 DRLLVNGNMLPIDMAGQAYLKGGDTNGE 423 (432) T ss_pred CeEeecccccchhhccccccCCCCCCCC Confidence 999999999999876432 11111111 No 11 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=100.00 E-value=1.4e-98 Score=557.04 Aligned_cols=406 Identities=23% Similarity=0.394 Sum_probs=352.8 Q ss_pred CchHHHHHhhccCcccCcccccccc---ccc-c-cccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccC Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQT---GPV-S-AHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~---~~~-~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~ 88 (424) =|||+|++++|...+...+...... ..+ . 
.....++..++.+.++++++||+||++||++||++||++|++++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 6999999998864433222111111 111 1 1123456778999999999999999999999999999999997766 Q ss_pred ccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc------- Q lcl|NC_019719. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK------- 161 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~------- 161 (424) .. ...+|++.++|+.+||++||+++||+.++.+++++||+|++++|+..|++++|||++|.+|++..++.. T Consensus 81 ~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~ 158 (432) T protein:vir:10 81 IQ--RGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTK 158 (432) T ss_pred ee--eccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccce Confidence 43 357899999999999999999999999999999999999999999999999999999999999876432 Q ss_pred eEEEEEecCceEEecHhHeeEeccC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019719. 162 VVYRYQRDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 162 ~~~~~~~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~ 240 (424) .+|.+..++....|+++||||+|+. +.++++|+||+..+..++.+..++++++.++|+||+.|+++|+.+..+ ++++. 
T Consensus 159 ~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l-~~e~~ 237 (432) T protein:vir:10 159 MWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDL-NEDAK 237 (432) T ss_pred EEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCC-CHHHH Confidence 3566777788889999999999964 678999999999999999999999999999999999999999998876 66777 Q ss_pred HHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHH Q lcl|NC_019719. 241 SQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN 319 (424) Q Consensus 241 ~~~~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~ 319 (424) +++++.|++.+++ .|+|+++++++|++|+++++++.|+||+|.+++++++||++|||||++||..++++ ++|+|++. T Consensus 238 ~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~s~~e~~~ 315 (432) T protein:vir:10 238 KVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQ 315 (432) T ss_pred HHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHH Confidence 8888888886655 78999999999999999999999999999999999999999999999999877765 45899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccCccccc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019719. 320 LGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 320 ~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~g 398 (424) ++|+++||+|++.+||++|+++|+++.++. 
+++++||+++|+++|.+++++.+++++++|++|+||+|+++|+||+||| T Consensus 316 ~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~gg 395 (432) T protein:vir:10 316 QQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGG 395 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 999999999999999999999999988864 6889999999999999999999999999999999999999999999999 Q ss_pred Ceeeecccccchhhcccc--CCCcccCC Q lcl|NC_019719. 399 DVAMRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~--~~~~~~ga 424 (424) |++++|+|++|++.+++. +++++++- T Consensus 396 D~~~~~~n~~~~~~~~~~~~k~~~~~~~ 423 (432) T protein:vir:10 396 DRLLVNGNMLPIDMAGQAYLKGGDTNGE 423 (432) T ss_pred CeEeecccccchhhccccccCCCCCCCC Confidence 999999999999876432 11111111 No 12 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=100.00 E-value=1.4e-98 Score=557.04 Aligned_cols=406 Identities=23% Similarity=0.394 Sum_probs=352.8 Q ss_pred CchHHHHHhhccCcccCcccccccc---ccc-c-cccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccC Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQT---GPV-S-AHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~---~~~-~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~ 88 (424) =|||+|++++|...+...+...... ..+ . 
.....++..++.+.++++++||+||++||++||++||++|++++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 6999999998864433222111111 111 1 1123456778999999999999999999999999999999997766 Q ss_pred ccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc------- Q lcl|NC_019719. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK------- 161 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~------- 161 (424) .. ...+|++.++|+.+||++||+++||+.++.+++++||+|++++|+..|++++|||++|.+|++..++.. T Consensus 81 ~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~ 158 (432) T protein:vir:10 81 IQ--RGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTK 158 (432) T ss_pred ee--eccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccce Confidence 43 357899999999999999999999999999999999999999999999999999999999999876432 Q ss_pred eEEEEEecCceEEecHhHeeEeccC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019719. 162 VVYRYQRDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 162 ~~~~~~~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~ 240 (424) .+|.+..++....|+++||||+|+. +.++++|+||+..+..++.+..++++++.++|+||+.|+++|+.+..+ ++++. 
T Consensus 159 ~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l-~~e~~ 237 (432) T protein:vir:10 159 MWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDL-NEDAK 237 (432) T ss_pred EEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCC-CHHHH Confidence 3566777788889999999999964 678999999999999999999999999999999999999999998876 66777 Q ss_pred HHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHH Q lcl|NC_019719. 241 SQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN 319 (424) Q Consensus 241 ~~~~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~ 319 (424) +++++.|++.+++ .|+|+++++++|++|+++++++.|+||+|.+++++++||++|||||++||..++++ ++|+|++. T Consensus 238 ~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~s~~e~~~ 315 (432) T protein:vir:10 238 KVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQ 315 (432) T ss_pred HHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHH Confidence 8888888886655 78999999999999999999999999999999999999999999999999877765 45899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccCccccc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019719. 320 LGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 320 ~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~g 398 (424) ++|+++||+|++.+||++|+++|+++.++. 
+++++||+++|+++|.+++++.+++++++|++|+||+|+++|+||+||| T Consensus 316 ~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~gg 395 (432) T protein:vir:10 316 QQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGG 395 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 999999999999999999999999988864 6889999999999999999999999999999999999999999999999 Q ss_pred Ceeeecccccchhhcccc--CCCcccCC Q lcl|NC_019719. 399 DVAMRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~--~~~~~~ga 424 (424) |++++|+|++|++.+++. +++++++- T Consensus 396 D~~~~~~n~~~~~~~~~~~~k~~~~~~~ 423 (432) T protein:vir:10 396 DRLLVNGNMLPIDMAGQAYLKGGDTNGE 423 (432) T ss_pred CeEeecccccchhhccccccCCCCCCCC Confidence 999999999999876432 11111111 No 13 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=100.00 E-value=1.1e-98 Score=557.56 Aligned_cols=403 Identities=27% Similarity=0.430 Sum_probs=348.6 Q ss_pred CchHHHHHhhccCcccCc-ccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVT-PNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~ 92 (424) =||++.+ +++.... 
.........+....+.+|..++.+.++++++|++||++||++||++||++|+++++|+.+ T Consensus 1 m~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~~- 75 (419) T protein:vir:57 1 MFIPQFW----KGRPSENRVNWQVVPGGMRSSSSQAGVIITPETALALSAVRACVTLLAESVAQLPCVLYRRTENGGRE- 75 (419) T ss_pred Ccchhhh----ccCCccccccccccccccccccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCcee- Confidence 2333333 3322111 111222233445567788899999999999999999999999999999999999887644 Q ss_pred ccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEEecCce Q lcl|NC_019719. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEY 172 (424) Q Consensus 93 ~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~ 172 (424) ...+|++.++|+.+||++||+++||+.++.+++++||+|++++|+.+|.+++|||++|++|++..++++..| |...+.. T Consensus 76 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~g~~~-y~~~~~~ 154 (419) T protein:vir:57 76 IAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPDGMPY-YDIPSIG 154 (419) T ss_pred ccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCCceEE-EEEcCCc Confidence 457899999999999999999999999999999999999999999999999999999999999988766543 3344455 Q ss_pred EEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC---CCCHHHHHHHHHHHHH Q lcl|NC_019719. 173 ADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK---VLTEQQRSQVEENFKE 249 (424) Q Consensus 173 ~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~---~~~~~~~~~~~~~~~~ 249 (424) ..+++++|||+|+++.|+++|+||+..+..++.+..++++++.++|+||++|+++|+.+.. 
..++++.+++++.|.+ T Consensus 155 ~~~~~~~vih~r~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~ 234 (419) T protein:vir:57 155 EILPMRMVHHIKSFSLDGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTE 234 (419) T ss_pred eEEchhhEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHH Confidence 6799999999999999999999999999999999999999999999999999999998653 3467888888888877 Q ss_pred HhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHH Q lcl|NC_019719. 250 IAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ 328 (424) Q Consensus 250 ~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~ 328 (424) .+++ .|+|+++++++|++|++++++++|+||+|++++++++||++|||||.+||..+.++ ++|+|++.+.|+++||+ T Consensus 235 ~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e~~~~~f~~~~l~ 312 (419) T protein:vir:57 235 RYGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKST--NNNIEHQGLQYVIYTML 312 (419) T ss_pred HhccccccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCc--cccHHHHHHHHHHHHHH Confidence 6654 68999999999999999999999999999999999999999999999999877664 55999999999999999 Q ss_pred HHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccccc Q lcl|NC_019719. 
329 PYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYV 408 (424) Q Consensus 329 P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~ 408 (424) |++.+||++|+++|+++.++.+++++||+++++++|.+++++.+++++++|+||+||+|+++|+||+||||++++|+|++ T Consensus 313 P~~~~ie~~l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~ 392 (419) T protein:vir:57 313 AILKRHESAMMRDLLLPSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPIPGGDKYLTPLNMV 392 (419) T ss_pred HHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccccc Confidence 99999999999999999888889999999999999999999999999999999999999999999999999999999999 Q ss_pred chhhccccCCCcccCC Q lcl|NC_019719. 409 PITDLGTNKEPRNNGA 424 (424) Q Consensus 409 ~~~~~~~~~~~~~~ga 424 (424) ++..+.+..++..+.- T Consensus 393 ~~~~~~~~~~~~~~~~ 408 (419) T protein:vir:57 393 DSKALTGIGKATPQQL 408 (419) T ss_pred cccccccccCCCcccC Confidence 8876555332222211 No 14 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=100.00 E-value=1e-98 Score=557.74 Aligned_cols=402 Identities=25% Similarity=0.439 Sum_probs=345.6 Q ss_pred chHHHHHhhccCcccCccccccc-ccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 15 GWWARLQSWFVGGRLVTPNQGSQ-TGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 15 G~~~~l~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =||+|.+....+.. ......+ ...++...+.++..|+.+.|+++++|++||++||++||++||++|+++.++. .. 
T Consensus 1 ~~~~r~~~~~~~~~--~~~~~~~~~~~~g~~~s~~~~~vt~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~--~~ 76 (419) T protein:vir:14 1 MFFSRQLLSNLGQT--QMSAGGWVSALLGSSRSDSGQVVTPASALALTVLQNCVTLLAESIAQLPIELYERSGEDR--KP 76 (419) T ss_pred Cccccccccccccc--ccCcchhhHHhhcCCCccCCcccchHHhhccHHHHHHHHHHHHhhccCceEEEEecCCcc--cc Confidence 24444322211111 1111112 2233445566788899999999999999999999999999999999876653 34 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEEecCceE Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ..+|+++++|+.+||++||+++||+.++.+++++||+|++++|+.+|.+++|||++|.+|++..++++..++...+ .. T Consensus 77 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~~~~~y~~~~--~~ 154 (419) T protein:vir:14 77 ATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSDLKPVYRVRG--SD 154 (419) T ss_pred ccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCceEEEEEcc--Cc Confidence 5789999999999999999999999999999999999999999999999999999999999988766543322222 23 Q ss_pred EecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC---CHHHHHHHHHHHHHH Q lcl|NC_019719. 174 DFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL---TEQQRSQVEENFKEI 250 (424) Q Consensus 174 ~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~---~~~~~~~~~~~~~~~ 250 (424) .++.++|+|+|+++.++++|+||+..+..++.+..++++++.++|+||++|+|+|+.+.... ++++.+++++.|++. 
T Consensus 155 ~~~~~~i~h~~~~~~dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (419) T protein:vir:14 155 PMPQRLVHHVRWMSINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAK 234 (419) T ss_pred ccchhheeEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHH Confidence 47899999999999999999999999999999999999999999999999999999886553 678888899999887 Q ss_pred hCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHH Q lcl|NC_019719. 251 AGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 251 ~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P 329 (424) +++ .|+|+++++++|++|+++++++.|+||+|.+++++++||++|||||++||..++++ ++|+|++++.|+++||.| T Consensus 235 ~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t--~s~~E~~~~~f~~~~L~P 312 (419) T protein:vir:14 235 FGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERAT--FSNIEHQSLQFVIYTLLP 312 (419) T ss_pred hcCccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCC--cccHHHHHHHHHHHHHHH Confidence 665 68899999999999999999999999999999999999999999999999876664 458999999999999999 Q ss_pred HHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccc Q lcl|NC_019719. 
330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 330 ~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~~ 409 (424) ++.+||++|+++|+++.++.+++++||+++|+++|.+++++.+++++++|+||+||+|+++|+||+||||++++|+|+++ T Consensus 313 ~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD~~~~~~n~~~ 392 (419) T protein:vir:14 313 WVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGDIYLSPMNMVD 392 (419) T ss_pred HHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccc Confidence 99999999999999998888899999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccCCCcccCC Q lcl|NC_019719. 410 ITDLGTNKEPRNNGA 424 (424) Q Consensus 410 ~~~~~~~~~~~~~ga 424 (424) ++...+.+.++.+.+ T Consensus 393 ~~~~~~~~~~~~~~~ 407 (419) T protein:vir:14 393 ASKPQQLPVGKSEPT 407 (419) T ss_pred ccccccccCCCCCCc Confidence 887665433322221 No 15 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=100.00 E-value=4.5e-98 Score=554.26 Aligned_cols=402 Identities=28% Similarity=0.488 Sum_probs=345.8 Q ss_pred CchHHHHHhhccCcccCccccccc-cccc-ccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQ-TGPV-SAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~ 91 (424) =|||++|+.. ........... ...+ ......++..|+.+.++++++|++||++||++||++||++|+.++++. 
T Consensus 1 Mg~f~~lf~r---~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~-- 75 (414) T protein:vir:44 1 MVFFSGLFQR---KSDAPVTTPAELADAIGLSYDTYTGKQISSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSLK-- 75 (414) T ss_pred Cchhhhhhcc---CccCcccchhhHhHhhccCccccCCceechhhhhccHHHHHHHHHHHHHhccCceEEEEecCCce-- Confidence 5778777433 22222211111 1111 233456788899999999999999999999999999999999876654 Q ss_pred cccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc-eEEEEE-ec Q lcl|NC_019719. 92 KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-VVYRYQ-RD 169 (424) Q Consensus 92 ~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~-~~~~~~-~~ 169 (424) +...+|+++++|+.+||++||+++||+.++.+++++||||++++++ .|.+.+||||+|.+|++..++.+ ..|.+. .+ T Consensus 76 ~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~~~y~~~~~~ 154 (414) T protein:vir:44 76 QRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNSSWEPVYQVTFPD 154 (414) T ss_pred eecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECCCCcEEEEEEecC Confidence 3457899999999999999999999999999999999999999887 69999999999999998877543 344443 45 Q ss_pred CceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHH Q lcl|NC_019719. 
170 SEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKE 249 (424) Q Consensus 170 ~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~ 249 (424) +....|+++||||+|+++.|+++|+||+..+..++.+..++++++.++|+||++|+++++++..+ ++++.+++++.|++ T Consensus 155 g~~~~~~~~evih~~~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l-~~e~~~~~~~~~~~ 233 (414) T protein:vir:44 155 GSTDVLSQEDIWHVRTLTLDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTL-SDQAYERLKKDFEE 233 (414) T ss_pred ceEEEEccccEEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC-CHHHHHHHHHHHHH Confidence 56788999999999999999999999999999999999999999999999999999999999876 67778888888877 Q ss_pred HhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHH Q lcl|NC_019719. 250 IAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ 328 (424) Q Consensus 250 ~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~ 328 (424) .+++ +|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.+.|+++||+ T Consensus 234 ~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t--~~n~e~~~~~~~~~~l~ 311 (414) T protein:vir:44 234 RHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRAT--FNNIEELGLGFINYSLV 311 (414) T ss_pred HhcCccccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHHHHHHHHHHH Confidence 6654 78999999999999999999999999999999999999999999999999876654 56999999999999999 Q ss_pred HHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccccc Q lcl|NC_019719. 
329 PYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYV 408 (424) Q Consensus 329 P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~ 408 (424) |++++||++||++|+++.++.+++++||+++|+++|.+++++.+++++++||||+||+|+++|+||+||||++++|.|++ T Consensus 312 P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~ 391 (414) T protein:vir:44 312 PYLTRIEQRINTGLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMT 391 (414) T ss_pred HHHHHHHHHHHhhcCCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeccccccc Confidence 99999999999999999888788999999999999999999999999999999999999999999999999999999998 Q ss_pred chhhccccCCCcccCC Q lcl|NC_019719. 409 PITDLGTNKEPRNNGA 424 (424) Q Consensus 409 ~~~~~~~~~~~~~~ga 424 (424) +........+++.+.+ T Consensus 392 ~~~~~~~~~~~~~~~~ 407 (414) T protein:vir:44 392 TKPSDGSKAGKQKDNA 407 (414) T ss_pred ccCCccccCCCCCCCC Confidence 7654433322222222 No 16 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=100.00 E-value=6.3e-98 Score=553.48 Aligned_cols=406 Identities=22% Similarity=0.384 Sum_probs=350.2 Q ss_pred CchHHHHHhhccCcccCccccccccccc--ccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPV--SAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~ 91 (424) =|||++++++++..+............+ ......++..++.+.++++++|++||++||++||++||++|++.+++. 
T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~l~~~~~~~~~~~~-- 78 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYGI-- 78 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHhcCCCCcceechhhhhccHHHHHHHHHHHHhhccCceEEEEecCCce-- Confidence 5788888776654321111110011111 111234566788999999999999999999999999999999876664 Q ss_pred cccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc-------eEE Q lcl|NC_019719. 92 KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-------VVY 164 (424) Q Consensus 92 ~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~-------~~~ 164 (424) +...+|++.++|+.+||++||+++||+.++.+++++||+|++++|+..|.+++|||++|++|++..++.. .+| T Consensus 79 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~~~~~~~ 158 (429) T protein:vir:10 79 QRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWY 158 (429) T ss_pred eeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEEE Confidence 3467899999999999999999999999999999999999999999999999999999999999887532 346 Q ss_pred EEEecCceEEecHhHeeEeccC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019719. 165 RYQRDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 165 ~~~~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~ 243 (424) .+..++..+.|+++||||+|+. 
+.++++|+||+..+..++.+..+++++..++|+||+.|+++++.+..+ ++++.+++ T Consensus 159 ~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l-~~e~~~~~ 237 (429) T protein:vir:10 159 VVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDL-NEDAKKVF 237 (429) T ss_pred EEccCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCC-CHHHHHHH Confidence 6777778889999999999964 678999999999999999999999999999999999999999998776 66778888 Q ss_pred HHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHH Q lcl|NC_019719. 244 EENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF 322 (424) Q Consensus 244 ~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~ 322 (424) ++.|++.+++ .|+|+++++++|++|++++.++.|+|++|.+++++++||++|||||.+||+.++++ ++|+|++.++| T Consensus 238 ~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~sn~e~~~~~f 315 (429) T protein:vir:10 238 RENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQQQF 315 (429) T ss_pred HHHHHHHhccccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHHHHH Confidence 8988886654 78999999999999999999999999999999999999999999999999877765 45999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccCccccc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCee Q lcl|NC_019719. 323 LQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVA 401 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~ 401 (424) ++.||.|++.+||++||++|+++.++. 
+++++||++.+++.|.+++++.+++++++|+||+||+|+++|+||+||||++ T Consensus 316 ~~~~l~P~~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~ 395 (429) T protein:vir:10 316 YTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRL 395 (429) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCee Confidence 999999999999999999999988864 6899999999999999999999999999999999999999999999999999 Q ss_pred eecccccchhhcccc--CCCcccCC Q lcl|NC_019719. 402 MRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~--~~~~~~ga 424 (424) ++|+|++|++.+++. +++++++. T Consensus 396 ~~~~n~~~~d~~~~~~~k~g~~~~~ 420 (429) T protein:vir:10 396 LVNGNMLPIDMAGQAYLKGGDTNGE 420 (429) T ss_pred eecccccchhhccccccCCCCCCCC Confidence 999999999876432 22222222 No 17 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=100.00 E-value=6.9e-98 Score=553.24 Aligned_cols=399 Identities=23% Similarity=0.397 Sum_probs=350.4 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =|||+|++++|.+...... . ..+. ...+.++..++.+.++++++|++||++||++||++|+++|++++++.. . 
T Consensus 1 MG~~~~~~~~~~~~~~~~~---~-~~~~-~~~~~g~~~~~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~--~ 73 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNETVD---M-TNPL-LLQWLGVDPDTPRNQLSEATYFACLKILSESLGKLPLKMYQKTERGIV--K 73 (411) T ss_pred CchHHHHHhhccCcccccc---c-chHH-HHHHhcCcccChhhhhccHHHHHHHHHHHHhHhhCceeEEEecCCcee--e Confidence 6999999988876543221 1 1122 223455677889999999999999999999999999999998877644 3 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc-------eEEEE Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-------VVYRY 166 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~-------~~~~~ 166 (424) ..+|+++++|+.+||++||+++||+.++.+++++||||++++|+ .|.+.+|||++|++|++..++.+ .+|.| T Consensus 74 ~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 152 (411) T protein:vir:81 74 SDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS-GPQLQALWILPSQYVTIVVDDRGLLGEKNAIWYRY 152 (411) T ss_pred ecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCceEEEEEECCceEEEEEcCcccccccceEEEEE Confidence 46799999999999999999999999999999999999999998 58999999999999999987653 23444 Q ss_pred E--ecCceEEecHhHeeEecc-CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019719. 167 Q--RDSEYADFSQKEIFHLKG-FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 167 ~--~~~~~~~~~~~evih~r~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~ 243 (424) . 
.++....|+++||||+|+ ++.++++|+||+..+..++.+..+++++..++|+||+.|+++|+++..+ ++++++++ T Consensus 153 ~~~~~g~~~~~~~~eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l-~~e~~~~~ 231 (411) T protein:vir:81 153 NDPYDGKMYVFRNDEILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDL-NQEARDRL 231 (411) T ss_pred EecCCceEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC-CHHHHHHH Confidence 3 356678899999999995 5678999999999999999999999999999999999999999998776 67778888 Q ss_pred HHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHH Q lcl|NC_019719. 244 EENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF 322 (424) Q Consensus 244 ~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~ 322 (424) ++.|++.+++ +|+|+++++++|++|+++++++.|+||+|++++++++||++|||||.+||..++++ ++|+|++.++| T Consensus 232 ~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~~n~e~~~~~f 309 (411) T protein:vir:81 232 VKGFEQFANGSKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSS--YASAEAQNLAF 309 (411) T ss_pred HHHHHHHhcCccccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--chhHHHHHHHH Confidence 8888886654 68999999999999999999999999999999999999999999999999887665 45999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccCccccc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCee Q lcl|NC_019719. 323 LQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVA 401 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~ 401 (424) +++||.|++++||++|+++|+++.++. 
+++++||+++++++|.+++++.+++++++|+||+||+|+++|+||+||||++ T Consensus 310 ~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~ggD~~ 389 (411) T protein:vir:81 310 YVDTLLYVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDYGNNL 389 (411) T ss_pred HHHHHHHHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCee Confidence 999999999999999999999988864 6889999999999999999999999999999999999999999999999999 Q ss_pred eecccccchhhccccCCCcccC Q lcl|NC_019719. 402 MRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~g 423 (424) ++++|++|++.++++.+...++ T Consensus 390 ~~~~n~~pl~~~~~~~~kgGd~ 411 (411) T protein:vir:81 390 MANGNYIPLSMLGANYGKGGDS 411 (411) T ss_pred eeccCccchhhhhhhhccCCCC Confidence 9999999998877653321122 No 18 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=100.00 E-value=2.2e-97 Score=550.49 Aligned_cols=399 Identities=23% Similarity=0.378 Sum_probs=349.2 Q ss_pred chHHHHHhhccCcccCc-cc---ccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcc Q lcl|NC_019719. 15 GWWARLQSWFVGGRLVT-PN---QGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 15 G~~~~l~~~~~~~~~~~-~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~ 90 (424) =||+++ |.++.... .. .......++...+.++..|+.+.++++++||+||++||++||++||++|++++++.. 
T Consensus 1 m~~~~~---f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l~~~~~~~~~~~~~ 77 (416) T protein:vir:12 1 MLLERM---FEKRSGSSDHEDGFNNILLNMFGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAKLPIHTYKRTDGGIE 77 (416) T ss_pred Cccchh---cccccCccccCccchhHHHHhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCccc Confidence 355555 33322211 11 112233455555677888999999999999999999999999999999998765543 Q ss_pred ccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcC--CceEEEEEe Q lcl|NC_019719. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG--KKVVYRYQR 168 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~--~~~~~~~~~ 168 (424) ...+||++++|+.+||++||+++||+.++.+++++||||+++.|+..|.+.+||||+|.+|++..+. +..+|.+.. T Consensus 78 --~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~ 155 (416) T protein:vir:12 78 --RKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVHPTTGMLWYQTVL 155 (416) T ss_pred --cccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEeCCCcEEEEEEec Confidence 3467999999999999999999999999999999999999999999999999999999999987653 445677777 Q ss_pred cCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHH Q lcl|NC_019719. 
169 DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK 248 (424) Q Consensus 169 ~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~ 248 (424) ++..+.|+++||||+|+++.++++|+||+.++..++.+..+++++..++|+||+.|++|++.+..+ ++++.+++++.|+ T Consensus 156 ~g~~~~~~~~eiih~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~-~~e~~~~~~~~~~ 234 (416) T protein:vir:12 156 NGKAIELYDYEVLHFKGLSTDGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAFL-DEKPKENVRKEWK 234 (416) T ss_pred CCeEEEecCccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCC-CHHHHHHHHHHHH Confidence 888899999999999999999999999999999999999999999999999999999999998765 7788888999888 Q ss_pred HHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHH Q lcl|NC_019719. 249 EIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ 328 (424) Q Consensus 249 ~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~ 328 (424) ... ++++++++++|++|++++++++|+||+|.+++++++||++|||||++||...+++ ++|.|++.++|+++||. T Consensus 235 ~~~---~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e~~~~~f~~~~l~ 309 (416) T protein:vir:12 235 RVN---KVENIAIIDYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKAT--FSNIEHQSIEYVRNTLQ 309 (416) T ss_pred HHh---cCCCeeecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCC--cccHHHHHHHHHHHHHH Confidence 765 4678999999999999999999999999999999999999999999999877665 45999999999999999 Q ss_pred HHHHHHHHHHHhhccCcccc-ccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccc Q lcl|NC_019719. 
329 PYISRWENSIQRWLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQY 407 (424) Q Consensus 329 P~~~~ie~~l~~~l~~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~ 407 (424) |++.+||++|+++|+++.++ .+++++||+++++++|.+++++.+.+++++|+||+||+|+++|+||+||||++++|+|+ T Consensus 310 P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~n~ 389 (416) T protein:vir:12 310 PWIVNFEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGDKYISSLNY 389 (416) T ss_pred HHHHHHHHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccc Confidence 99999999999999998876 46899999999999999999999999999999999999999999999999999999999 Q ss_pred cchhhccccCCCcccCC Q lcl|NC_019719. 408 VPITDLGTNKEPRNNGA 424 (424) Q Consensus 408 ~~~~~~~~~~~~~~~ga 424 (424) ++++...+++..+.+++ T Consensus 390 ~~~~~~~~~~~~~~~~~ 406 (416) T protein:vir:12 390 VFLDFLEEYQRLKAGGA 406 (416) T ss_pred ccccccchhhccccccc Confidence 99987765544332222 No 19 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=100.00 E-value=3.7e-97 Score=549.26 Aligned_cols=402 Identities=25% Similarity=0.443 Sum_probs=344.5 Q ss_pred chHHHHHhhccCcccCccccccc-ccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 15 GWWARLQSWFVGGRLVTPNQGSQ-TGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 15 G~~~~l~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =||++... +......+....+ ...++...+.++..|+++.++++++|++||++||++||++||++|++++++. +. 
T Consensus 1 m~~~~~~~--~~~~~~~~~~~~~~~~~~g~~~s~~~~~v~~~~al~~~~v~~cv~~ia~~ia~lp~~~~~~~~~~~--~~ 76 (419) T protein:vir:80 1 MFFSRQLL--SNLGQTQPGSGGWVSALLGSARSEAGQVVTPASALSLTVLQNCVTLLAESIAQLPVELYERSGDDR--KP 76 (419) T ss_pred CCcccccc--cccCcCCCCcchhhHHhhcccccccCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEecCCCc--cc Confidence 12222211 1111112211111 2233444567788999999999999999999999999999999999987764 34 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEEecCceE Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ..+|+++++|+.+||++||+++||+.++.+++++||||++++|+.+|++.+|||++|++|++..++++..+ |...+ .. T Consensus 77 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~~~~~~~~-y~~~~-~~ 154 (419) T protein:vir:80 77 ATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMKGPDLKPM-YRVAG-AD 154 (419) T ss_pred ccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCceEE-EEEcC-cc Confidence 57899999999999999999999999999999999999999999999999999999999999988765432 22222 23 Q ss_pred EecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCC---CCHHHHHHHHHHHHHH Q lcl|NC_019719. 174 DFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV---LTEQQRSQVEENFKEI 250 (424) Q Consensus 174 ~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~---~~~~~~~~~~~~~~~~ 250 (424) .+++++|+|+|+++.|+++|+||+..+..++.+..+++++..++|+||+.|+++|+.+... .++++.+++++.|++. 
T Consensus 155 ~~~~~~i~h~~~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (419) T protein:vir:80 155 PLPQRLVHHVRWMSINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAK 234 (419) T ss_pred ccchhheEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHH Confidence 5899999999999999999999999999999999999999999999999999999987543 3677888899999887 Q ss_pred hCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHH Q lcl|NC_019719. 251 AGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 251 ~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P 329 (424) +++ .|+|+++++++|++|++++.++.|+||+|.+++++++||++|||||.+||..++++ ++|+|++.+.|+++||.| T Consensus 235 ~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t--~~n~e~~~~~f~~~~l~P 312 (419) T protein:vir:80 235 FGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERAT--FSNIEHQSLQFVIYTLLP 312 (419) T ss_pred hcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCC--cccHHHHHHHHHHHHHHH Confidence 665 68899999999999999999999999999999999999999999999999876665 458999999999999999 Q ss_pred HHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccc Q lcl|NC_019719. 
330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 330 ~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~~ 409 (424) ++.+||++|+++|+++.++.+++++||+++++++|.+++++.+++++++|+||+||+|+++|+||+||||++++|+|+++ T Consensus 313 ~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gGD~~~~~~n~~~ 392 (419) T protein:vir:80 313 WVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGDIYLSPMNMVD 392 (419) T ss_pred HHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccc Confidence 99999999999999998888899999999999999999999999999999999999999999999999999999999998 Q ss_pred hhhccccCCCcccCC Q lcl|NC_019719. 410 ITDLGTNKEPRNNGA 424 (424) Q Consensus 410 ~~~~~~~~~~~~~ga 424 (424) ++.....+.++.+-+ T Consensus 393 ~~~~~~~~~~~~~~~ 407 (419) T protein:vir:80 393 ASKPQPIPMGKTEPT 407 (419) T ss_pred ccccccccCCCCCch Confidence 877655443333333 No 20 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=100.00 E-value=4.7e-97 Score=548.71 Aligned_cols=401 Identities=28% Similarity=0.507 Sum_probs=348.6 Q ss_pred chHHHHHhhccCcccCcccccc-cccccc-cccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcccc Q lcl|NC_019719. 15 GWWARLQSWFVGGRLVTPNQGS-QTGPVS-AHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 15 G~~~~l~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~ 92 (424) =||+ ++|++.......... ....+. .....++..|+.+.|+++++|++||++||+++|++|+++|+.++++.. 
T Consensus 1 ~~f~---~~f~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~-- 75 (413) T protein:vir:48 1 MFFS---GLFQRKSDAPVTTPAELAEAIGLSYDTYTGKRISSQRAMRLTAVYSCVRVLAESVGMLPCSLYKISGTLKT-- 75 (413) T ss_pred Cccc---hhhccCccCCccchHHHHHhhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcce-- Confidence 1233 334443322222211 112222 233567888999999999999999999999999999999998765543 Q ss_pred ccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCce-EEEEE-ecC Q lcl|NC_019719. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV-VYRYQ-RDS 170 (424) Q Consensus 93 ~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~-~~~~~-~~~ 170 (424) ...+|++.++|+.+||++||+++||+.++.+++++||+|++++|+ .|++.+|||++|++|++..+++.. .|.+. .++ T Consensus 76 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~~~y~~~~~~g 154 (413) T protein:vir:48 76 RVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPIDPGCVEPKLNSQWQPVYQVTFPDG 154 (413) T ss_pred eecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEEcCceEEEEEcCCceEEEEEEecCc Confidence 456899999999999999999999999999999999999999987 589999999999999998876543 44443 456 Q ss_pred ceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019719. 171 EYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 171 ~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) ....|+++||||+|+++.++++|+||+..+..++.+..++++++.++|+||+.|+++++++..+ ++++.+++++.|++. 
T Consensus 155 ~~~~~~~~evih~~~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~-~~e~~~~~~~~~~~~ 233 (413) T protein:vir:48 155 SVDVLTQDEIWHVRTLTLDGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKL-TPDAYERLKKDFEER 233 (413) T ss_pred eEEEEccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCC-CHHHHHHHHHHHHHH Confidence 6778999999999999999999999999999999999999999999999999999999999876 667788888888876 Q ss_pred hCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHH Q lcl|NC_019719. 251 AGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 251 ~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P 329 (424) +++ .|+|+++++++|++|++++.+++|+||+|.+++++++||++|||||.+||+.++++ ++|.|++.+.|+++||.| T Consensus 234 ~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~~n~e~~~~~f~~~~i~P 311 (413) T protein:vir:48 234 HTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRAT--FNNIEELGLGFINYSLVP 311 (413) T ss_pred hcCccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCC--cccHHHHHHHHHHHHHHH Confidence 654 78999999999999999999999999999999999999999999999999876654 559999999999999999 Q ss_pred HHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccc Q lcl|NC_019719. 
330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 330 ~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~~ 409 (424) ++++||++||++|+++.++.+++++||+++|+++|.+++++.+++++++|+||+||+|+++|+||+||||++++|+|+++ T Consensus 312 ~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~ 391 (413) T protein:vir:48 312 YLTRIEQRINTGLVRESKQGKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMTT 391 (413) T ss_pred HHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeccccccc Confidence 99999999999999998887899999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccCCCcccCC Q lcl|NC_019719. 410 ITDLGTNKEPRNNGA 424 (424) Q Consensus 410 ~~~~~~~~~~~~~ga 424 (424) +...+++++++.+++ T Consensus 392 ~~~~~~~~~~~~~~~ 406 (413) T protein:vir:48 392 SPSAGDDNGKKKESG 406 (413) T ss_pred cccccccCCCCCCCC Confidence 988877665555555 No 21 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=100.00 E-value=4.1e-97 Score=549.02 Aligned_cols=401 Identities=23% Similarity=0.289 Sum_probs=340.8 Q ss_pred hHHHHHhhccCcccCcccccccc-ccc------ccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccC Q lcl|NC_019719. 
16 WWARLQSWFVGGRLVTPNQGSQT-GPV------SAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 16 ~~~~l~~~~~~~~~~~~~~~~~~-~~~------~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~ 88 (424) ||+.|+..........+.....+ ..+ ....+.++..|+++.++++++|++||++||++||++||++|+++.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~g 80 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADPEAVLSFHAVFACISLISQDIAKMRLRLMQTDAQG 80 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEeccCC Confidence 66655332222222222211111 111 12345677889999999999999999999999999999999998877 Q ss_pred ccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC-ceEEEEE Q lcl|NC_019719. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK-KVVYRYQ 167 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~-~~~~~~~ 167 (424) ...+ ...|+ .++|+.+||++||+++||+.++.+++++||+|++++|+.+|.+.+|||++|++|++..+++ ..+|.+. T Consensus 81 ~~~~-~~~~~-~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~y~~~ 158 (454) T protein:vir:93 81 IRRE-TRRGD-IARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADDGEVFYRIT 158 (454) T ss_pred ccch-hhhHH-HHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCCCcEEEEEE Confidence 6544 34454 4556679999999999999999999999999999999999999999999999999988754 4556665 Q ss_pred ec-----CceEEecHhHeeEecc-CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHH Q lcl|NC_019719. 168 RD-----SEYADFSQKEIFHLKG-FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRS 241 (424) Q Consensus 168 ~~-----~~~~~~~~~evih~r~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~ 241 (424) .. 
+....|+++||||+|+ ++.++++|+||+..+..++.+..+++++..++|+||++|+++++++..+ ++++.+ T Consensus 159 ~~~~~~~~~~~~~~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l-~~e~~~ 237 (454) T protein:vir:93 159 PDRNCGITEAVTVPAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGSI-TEENAK 237 (454) T ss_pred eccccccceeEEecCcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCC-CHHHHH Confidence 43 2356899999999995 5678999999999999999999999999999999999999999998765 677889 Q ss_pred HHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHH Q lcl|NC_019719. 242 QVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG 321 (424) Q Consensus 242 ~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~ 321 (424) ++++.|++.+++.|+|++++|++|++|+++++++.|+||+|++++++++||++|||||.+||..++++ ++|+|++.+. T Consensus 238 ~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e~~~~~ 315 (454) T protein:vir:93 238 KLKSNWDSGYTGENAGKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPS--SDNVEALEQQ 315 (454) T ss_pred HHHHHHHHHhcccccCCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCc--chhHHHHHHH Confidence 99999999998899999999999999999999999999999999999999999999999999877664 4699999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCee Q lcl|NC_019719. 
322 FLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVA 401 (424) Q Consensus 322 ~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~ 401 (424) |+++||.|++..||++||++|+++.+ ++++||+++|+++|.+++++.+.+++++|+||+||+|+++|+||+||||++ T Consensus 316 f~~~~l~P~~~~ie~~ln~~L~~~~~---~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~ 392 (454) T protein:vir:93 316 YYSQCLQTLIESIELLLDEALETGEN---ESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDAL 392 (454) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCC---cEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCee Confidence 99999999999999999999987643 579999999999999999999999999999999999999999999999999 Q ss_pred eecccccchhhccccCCCcccCC Q lcl|NC_019719. 402 MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++++|+.+++..++.+..++..+ T Consensus 393 ~~~~~~~~~~~~~~~~~~~~~~~ 415 (454) T protein:vir:93 393 YLQQQNYSLEALSRRDAREDPFA 415 (454) T ss_pred eeccCccchHhhhccCcccCCCC Confidence 99999999877665433322111 No 22 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=100.00 E-value=1.6e-96 Score=545.84 Aligned_cols=402 Identities=25% Similarity=0.401 Sum_probs=349.3 Q ss_pred CchHHHHHhhccCcccCcccc-------cccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQ-------GSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQ 86 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~ 86 (424) =|||++|++...+........ 
......+..++...+..++.+.++++++|++||++||++||++|+++|+..+ T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~lp~~~~~~~~ 80 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGKLSLKIYKDKE 80 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEecCc Confidence 689999877655443221111 1112223344445566788999999999999999999999999999997543 Q ss_pred cCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc----- Q lcl|NC_019719. 87 NDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK----- 161 (424) Q Consensus 87 ~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~----- 161 (424) . ..+|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|++++|+|++|.+|++..++++ T Consensus 81 ~------~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~~~~~ 154 (422) T protein:vir:13 81 E------YKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNFLSSL 154 (422) T ss_pred c------cccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcceecc Confidence 2 34689999999999999999999999999999999999999999999999999999999999887654 Q ss_pred --eEEEEE-ecCceEEecHhHeeEeccC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH Q lcl|NC_019719. 162 --VVYRYQ-RDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 162 --~~~~~~-~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~ 237 (424) .+|.+. 
.++....++++||||++.+ +.++++|+||+..+..++.+..++++++.++|+||++|+|++++++.+ ++ T Consensus 155 ~~~~y~~~~~~g~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l-~~ 233 (422) T protein:vir:13 155 SKVWYVVTDKNGKEHKLLPDEMLHFIGDITLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVGDL-DE 233 (422) T ss_pred ceEEEEEEeCCCeEEEEcccceEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCC-CH Confidence 334443 4567788999999999864 678999999999999999999999999999999999999999998776 67 Q ss_pred HHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH Q lcl|NC_019719. 238 QQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) ++.+++++.|++.+++ +|+|+++++++|++|++++.+++|+||+|.+++++++||++|||||.+||+.++++ ++|+| T Consensus 234 e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~--~sn~e 311 (422) T protein:vir:13 234 KAKKIFKKEFESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERAT--FNNLT 311 (422) T ss_pred HHHHHHHHHHHHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHH Confidence 7788888888887655 68899999999999999999999999999999999999999999999999887765 45899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccCccccc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC Q lcl|NC_019719. 317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPL 395 (424) Q Consensus 317 ~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~ 395 (424) ++.++|+++||.|++.+||++|+++|+++.++. 
+++++||+++++++|.+++++.+++++++|+||+||+|+++|+||+ T Consensus 312 ~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~ 391 (422) T protein:vir:13 312 EQQKDFYVTTLQSSLTVYEQEIQDKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPV 391 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 999999999999999999999999999998864 6899999999999999999999999999999999999999999999 Q ss_pred CCCCeeeecccccchhhcccc-CCCcccCC Q lcl|NC_019719. 396 PGGDVAMRQSQYVPITDLGTN-KEPRNNGA 424 (424) Q Consensus 396 ~~gd~~~~~~n~~~~~~~~~~-~~~~~~ga 424 (424) ||||++++|+|++|++.++++ +++.+.|+ T Consensus 392 ~ggD~~~~~~n~~~l~~~~~~~~~~g~~~g 421 (422) T protein:vir:13 392 EGGDRLLVNGNMIPIEMAGEQYKKGGEKGG 421 (422) T ss_pred CCcCeeeeccCccchhhcccccccCCCcCC Confidence 999999999999999887754 23333344 No 23 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=100.00 E-value=1.5e-96 Score=545.91 Aligned_cols=398 Identities=25% Similarity=0.413 Sum_probs=339.3 Q ss_pred hHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccc Q lcl|NC_019719. 16 WWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDL 95 (424) Q Consensus 16 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~ 95 (424) || |+..|................+. ....++..++.+.|+++++|++||++||++||++||++|++.+ +. +... 
T Consensus 1 m~--f~~~~~~~~~~~~~~~~~~~~~~-g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~-~~--~~~~ 74 (409) T protein:vir:10 1 ML--FRKGFKNQSQEISIDDKKILEWL-GINPSETYVNGKSCLKQATVFGCIRILSDNISKLPIKIYQKKD-GI--KRVP 74 (409) T ss_pred Cc--ccccccCcCCCCCCChHHHHHHh-cCCcCcceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecC-Ce--eecc Confidence 22 12222222211111111111111 1244577889999999999999999999999999999998643 32 2346 Q ss_pred cchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc-------eEEEEE- Q lcl|NC_019719. 96 SNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-------VVYRYQ- 167 (424) Q Consensus 96 ~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~-------~~~~~~- 167 (424) +||++++|+.+||++||+++||+.++.+++++||||++++|+..|.+++|||++|.+|++..++.+ ..|.+. T Consensus 75 ~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~~~y~~~~ 154 (409) T protein:vir:10 75 DHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLNSENNVWYLYTD 154 (409) T ss_pred CchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCccccccceEEEEEEe Confidence 799999999999999999999999999999999999999999999999999999999999887543 234443 Q ss_pred ecCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHH Q lcl|NC_019719. 
168 RDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENF 247 (424) Q Consensus 168 ~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~ 247 (424) ..+....|+++||||+|+++.++++|+||+..+..++.+..++++++.++|+||++|++|++++..+ ++++.+++++.| T Consensus 155 ~~g~~~~~~~~evih~r~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l-~~e~~~~~~~~~ 233 (409) T protein:vir:10 155 DLGQRHKFMSDEILHFKGLTADGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGDL-NPEAEEVFKENF 233 (409) T ss_pred CCceeEEeccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCC-CHHHHHHHHHHH Confidence 4456788999999999999999999999999999999999999999999999999999999998765 677888899999 Q ss_pred HHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 248 KEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 248 ~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) ++.+++ .|+|+++++++|++|++++.++.|+||+|++++++++||++|||||.+||..++++ ++|+|++.++|+++| T Consensus 234 ~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~--~~~~e~~~~~f~~~~ 311 (409) T protein:vir:10 234 ERMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRAT--HSNITEQNREFYIDT 311 (409) T ss_pred HHHhccccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCc--cccHHHHHHHHHHHH Confidence 886655 68899999999999999999999999999999999999999999999999877664 569999999999999 Q ss_pred HHHHHHHHHHHHHhhccCcccc-ccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecc Q lcl|NC_019719. 
327 LQPYISRWENSIQRWLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQS 405 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~ 405 (424) |.|++++||++||++|+++.++ .+++++||+++++++|.+++++.+.+++++|+||+||+|+++|+||+||||++++|+ T Consensus 312 l~P~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD~~~~~~ 391 (409) T protein:vir:10 312 LQSILNMYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGDVLLING 391 (409) T ss_pred HHHHHHHHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecc Confidence 9999999999999999998876 468999999999999999999999999999999999999999999999999999999 Q ss_pred cccchhhccccCCCcccCC Q lcl|NC_019719. 406 QYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 406 n~~~~~~~~~~~~~~~~ga 424 (424) |++|++..+++. .++|- T Consensus 392 n~~~~~~~~~~~--~kgGe 408 (409) T protein:vir:10 392 NMIPVKMAGEQY--SKGGE 408 (409) T ss_pred Cccchhhccccc--cccCC Confidence 999998876522 12222 No 24 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=100.00 E-value=1.8e-96 Score=545.55 Aligned_cols=400 Identities=25% Similarity=0.413 Sum_probs=339.3 Q ss_pred CchHHHHHhhccCcccCccc----ccc-----------c-------ccccccccccCcccccHHHHhhhHHHHHHHHHHH Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPN----QGS-----------Q-------TGPVSAHGHLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~----~~~-----------~-------~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia 71 (424) =|+|++|++.........+. ... . 
...+......++..++.+.|+++++|++||++|| T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~ci~~Ia 80 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETRALRNMAVLRCVTLIS 80 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhhhhccHHHHHHHHHHH Confidence 58888876543221111111 000 0 0011223355677889999999999999999999 Q ss_pred HhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCc Q lcl|NC_019719. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSA 151 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~ 151 (424) ++||++|+++|++++. ++...+||++++|+.+||++||+++||+.++.+++++||+|++++|+. |.+++|+|++|. T Consensus 81 ~~iA~lp~~v~~~~~~---~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~L~pl~~~ 156 (431) T protein:vir:10 81 GTIGMLPMNLISSDDS---KQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSG-NRPIRLIPMDRG 156 (431) T ss_pred HhhccCceEEEEecCc---eeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CceEEEEEEcCc Confidence 9999999999987432 234678999999999999999999999999999999999999999985 899999999999 Q ss_pred eEEEEEcCC-ceEEEEE-ecCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE Q lcl|NC_019719. 152 NMDVKLVGK-KVVYRYQ-RDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) Q Consensus 152 ~v~~~~~~~-~~~~~~~-~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~ 229 (424) +|++..+.+ ..+|.+. 
.++....|+++||||+|+++.|+++|+||+..+..++.++.+++++..++|+||++|++||+ T Consensus 157 ~v~~~~~~~~~~~y~~~~~~g~~~~~~~~dViHir~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 236 (431) T protein:vir:10 157 SAKGRLTSTWQIVYDYTTPTGDKIELPAREVFHLRDLSIDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIE 236 (431) T ss_pred eeEEEEcCCCeEEEEEEeCCceEEEEchhhEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEe Confidence 999877644 4445544 45677889999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhC-CcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCC Q lcl|NC_019719. 230 TGEKVLTEQQRSQVEENFKEIAG-GPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS 308 (424) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~ 308 (424) ++..+ ++++.+++++.|++.++ .+|+|+++++++|++|++++++++|+||+|++++++++||++|||||++||+.+++ T Consensus 237 ~~~~l-s~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~ 315 (431) T protein:vir:10 237 VPKEL-SDNAYGRMKASVQENHTGSENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTS 315 (431) T ss_pred cCCCC-CHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCC Confidence 99765 67778888888877655 57999999999999999999999999999999999999999999999999987655 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCC----CCCHH Q lcl|NC_019719. 
309 TSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAG----LRTIN 384 (424) Q Consensus 309 ~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g----~~T~N 384 (424) +++|+|++.++|+++||.|++.+||++||++|+++.++.+++++||+++|+++|.+++++.+++++..| |||+| T Consensus 316 --t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~N 393 (431) T protein:vir:10 316 --WGSGIEQLAIFFIQYGLSHWFVSWEQAAARAFLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQN 393 (431) T ss_pred --ccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHH Confidence 456999999999999999999999999999999988887899999999999999999999999999765 59999 Q ss_pred HHHHHhCCCCCCC--CCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 385 EMRRTDNLPPLPG--GDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 385 E~R~~~G~~p~~~--gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |+|+++|+||+++ ||++++|.|..+.++..+. |. ++ T Consensus 394 E~R~~~gl~p~~~~~gD~~~~p~n~~~~~~~~~~--p~--~~ 431 (431) T protein:vir:10 394 EVREMLDLPRADDPVADQLRNPMTQKQKGSGDEP--PA--TT 431 (431) T ss_pred HHHHHhCCCCCCCccccceecccccccCCCCCCC--CC--CC Confidence 9999999999955 9999999998876543221 11 11 No 25 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=100.00 E-value=2e-95 Score=539.70 Aligned_cols=406 Identities=26% Similarity=0.426 Sum_probs=340.2 Q ss_pred CchHHHHHhhccCcccCcccccccc--c---ccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccC Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQT--G---PVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~--~---~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~ 88 (424) =|||++|+++..+..........+. . 
........+++.|+.+.|+++++||+||++||++||++||++|++.+++ T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGT 80 (457) T ss_pred CchhhhhhccccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHhhCceEEEEecCCc Confidence 6999998765444332111111111 1 1123334678889999999999999999999999999999999876433 Q ss_pred ccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc-----eE Q lcl|NC_019719. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-----VV 163 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~-----~~ 163 (424) . +.. +|+.+..|+.+||++||+++||+.++.+++++||||+++.++ .|.+.+||||+|.+|++..+... .+ T Consensus 81 ~--~~~-~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~ 156 (457) T protein:vir:62 81 R--KEI-DTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVMVDGLRRKVF 156 (457) T ss_pred c--ccc-cchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEeccCCccceeE Confidence 2 233 445444455689999999999999999999999999998665 68999999999999998765321 12 Q ss_pred EEE--EecCc---eEEecHhHeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH Q lcl|NC_019719. 164 YRY--QRDSE---YADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 164 ~~~--~~~~~---~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~ 237 (424) +.| ...+. 
...|+++||||+|+++.++ ++|+||+.++..++.+..++++++.++|+||++|++||+++..+ ++ T Consensus 157 ~~y~~~~~g~~~~~~~~~~~eiih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l-s~ 235 (457) T protein:vir:62 157 EAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTM-SE 235 (457) T ss_pred EEEEEccCCceeEEEeeCccceEEecCCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCC-CH Confidence 223 22222 2568999999999998877 89999999999999999999999999999999999999999776 77 Q ss_pred HHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH Q lcl|NC_019719. 238 QQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) ++.+++++.|++.+++ .|+|++++|++|++|++++++++|+||+|++++++++||++|||||.+||..++++++++|.| T Consensus 236 e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e 315 (457) T protein:vir:62 236 EGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLA 315 (457) T ss_pred HHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHH Confidence 7888899999887654 689999999999999999999999999999999999999999999999999999988888999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC Q lcl|NC_019719. 
317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP 396 (424) Q Consensus 317 ~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~ 396 (424) ++.++|+++||.|++++||++||++|+++.+..+++++||+++++++|.+++++.+.+++++|+||+||+|+++|+||+| T Consensus 316 q~~~~f~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ 395 (457) T protein:vir:62 316 EQNIAFTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLP 395 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 99999999999999999999999999999887778999999999999999999999999999999999999999999999 Q ss_pred CC--CeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 397 GG--DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 397 ~g--d~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) || |++++|+|+.+++...+.+......+ T Consensus 396 ~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~ 425 (457) T protein:vir:62 396 DGLGEKYRVPLNLGEIGEEPEPEPAPAPPA 425 (457) T ss_pred CCCcceeeeccccccccccccccccCCCcc Confidence 87 99999999998876544322111111 No 26 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=100.00 E-value=2.7e-95 Score=539.01 Aligned_cols=406 Identities=26% Similarity=0.439 Sum_probs=345.4 Q ss_pred CchHHHHHhhccCcccCccccccc----cc-ccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccC Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQ----TG-PVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~----~~-~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~ 88 (424) =|||++|++++.+........... .. 
........++..|+.+.|+++++||+||++||++||++|+++|++..++ T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGS 80 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 699999988766543222111111 11 1123345678899999999999999999999999999999999986544 Q ss_pred ccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc-----eE Q lcl|NC_019719. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-----VV 163 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~-----~~ 163 (424) . +....|++..+|+.+|| +||+++||+.++.+++++||+|+++.++ .|.+++||||+|.+|++..+... .+ T Consensus 81 ~--~~~~~~~l~~~ln~~~n-~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~ 156 (457) T protein:vir:13 81 R--KEIVTPEWLDYPNAEPG-GMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVDGLRRKVF 156 (457) T ss_pred c--cccccchHHHhccccCC-CCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCCCccceeE Confidence 3 34567899999986666 7999999999999999999999999776 58999999999999998765322 12 Q ss_pred --EEEEecCc---eEEecHhHeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH Q lcl|NC_019719. 164 --YRYQRDSE---YADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 164 --~~~~~~~~---~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~ 237 (424) |.+..++. 
...|+++||||+|+++.++ ++|+||+..+..+|.+..++++++.++|+||++|++||+++..+ ++ T Consensus 157 ~~y~~~~~~~~~~~~~~~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l-s~ 235 (457) T protein:vir:13 157 EAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTM-SE 235 (457) T ss_pred EEEEEecCCceeeEEeeCccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCC-CH Confidence 33333332 2468999999999998876 89999999999999999999999999999999999999998776 67 Q ss_pred HHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH Q lcl|NC_019719. 238 QQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) ++.+++++.|++.+++ +|+|++++|++|++|+++++++.|+||+|++++++++||++|||||++||..++++++++|.| T Consensus 236 e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e 315 (457) T protein:vir:13 236 EGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLA 315 (457) T ss_pred HHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHH Confidence 8888899989887654 789999999999999999999999999999999999999999999999999999888888999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC Q lcl|NC_019719. 
317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP 396 (424) Q Consensus 317 ~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~ 396 (424) ++.++|+++||.|++++||++|+++|+++.++.+++++||+++|+++|.+++++.+.+++++|+||+||+|+++|+||+| T Consensus 316 q~~~~f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ 395 (457) T protein:vir:13 316 EQNIAFTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLP 395 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCccccCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 99999999999999999999999999999888778899999999999999999999999999999999999999999999 Q ss_pred CC--CeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 397 GG--DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 397 ~g--d~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) || |++++|+|+.+++...+.+......| T Consensus 396 ~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~ 425 (457) T protein:vir:13 396 DGLGEKYRVPLNLGEVGEEPEPEPAPAPPA 425 (457) T ss_pred CCcccceeeccccccccccccccccCCCCC Confidence 86 99999999998866444322111111 No 27 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=100.00 E-value=5.5e-95 Score=537.33 Aligned_cols=401 Identities=20% Similarity=0.291 Sum_probs=344.2 Q ss_pred ccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEeccc Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~ 87 (424) |.|-.+.|++.|++..+.......+.... 
..+..+...++..++.+.++++|+|++||++||++||++||++|++++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~- 77 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWIDQSTSKL--YDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK- 77 (412) T ss_pred CccchhhhhhhhhhhhHhhhhhccccccc--ccccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEeeccc- Confidence 99999999999998776555433332211 123344445666788999999999999999999999999999998653 Q ss_pred CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc--eEEE Q lcl|NC_019719. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK--VVYR 165 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~--~~~~ 165 (424) ..+|++.++|+.+||++||+++||+.++.+++++||+|++++|+.+|.+.+|+|++|.+|++..+.+. .+|. T Consensus 78 ------~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~ 151 (412) T protein:vir:26 78 ------VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYS 151 (412) T ss_pred ------cccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEE Confidence 24689999999999999999999999999999999999999999999999999999999999877543 3444 Q ss_pred EEe-cCceEEecHhHeeEeccC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019719. 166 YQR-DSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 166 ~~~-~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~ 243 (424) +.. ++....|+++||||+|++ +.++++|+||+..+..++.++.+++++. 
++.++..++++++.+..+ ++++.+++ T Consensus 152 ~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~~~~~--~~~~~~~~~~i~~~~~~l-~~e~~~~~ 228 (412) T protein:vir:26 152 IHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNV-GKEKRQQV 228 (412) T ss_pred EEcCCceEEEEccccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEecCCCC-CHHHHHHH Confidence 433 355678999999999986 5688999999999999999999999885 455555566777776665 77788889 Q ss_pred HHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHH Q lcl|NC_019719. 244 EENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 244 ~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~ 323 (424) ++.|++..+ ++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||+.+++ +++|+|++.+.|+ T Consensus 229 ~~~~~~~~~--~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~--~~sn~e~~~~~f~ 304 (412) T protein:vir:26 229 LEDFKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT--NFAKNEELNRFYL 304 (412) T ss_pred HHHHHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC--CcccHHHHHHHHH Confidence 999988765 578899999999999999999999999999999999999999999999976544 5669999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccCccccc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeee Q lcl|NC_019719. 324 QYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAM 402 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~ 402 (424) ++||.|++.+||++|+++|+++.++. 
+++++||++++++.|.+++++.+++++++|++|+||+|+++|+||+||||+++ T Consensus 305 ~~~l~P~~~~ie~~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~ 384 (412) T protein:vir:26 305 QHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPL 384 (412) T ss_pred HHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeee Confidence 99999999999999999999998874 68899999999999999999999999999999999999999999999999999 Q ss_pred ecccccchhhccccCCCcccCC Q lcl|NC_019719. 403 RQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 403 ~~~n~~~~~~~~~~~~~~~~ga 424 (424) +++|++|++...+.+...++|. T Consensus 385 ~~~n~~~~~~~~~~~~~~~gG~ 406 (412) T protein:vir:26 385 ISGDLYPIDTPLELRKSLKGGD 406 (412) T ss_pred ecccccccccchhhcccccCCC Confidence 9999999987665443333332 No 28 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=100.00 E-value=2.7e-94 Score=533.55 Aligned_cols=411 Identities=18% Similarity=0.277 Sum_probs=334.7 Q ss_pred CCCCcccccCCCCCch--HHHHHhhccCcccCc---ccccc--cccccccccccCcccccHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGW--WARLQSWFVGGRLVT---PNQGS--QTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~--~~~l~~~~~~~~~~~---~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 73 (424) ....-|-+++..|.-= .-.+++.|.+.+... +..+. 
.......+...++..++.+.|+++++||+||++||++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~acv~~Ia~~ 83 (441) T protein:vir:98 4 YNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASD 83 (441) T ss_pred ecCccceeccccccchhhhhhccccccccccccccCCCcchHHHHHHhhcccccCccccchhhhhccHHHHHHHHHHHHh Confidence 2223455565554411 111223333322211 11111 1111222334456678999999999999999999999 Q ss_pred hccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceE Q lcl|NC_019719. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) ||++|+++|+.+ +....|+++++|+.+||++||+++||+.++.+++++||||++++|+.+|+|++|||++|++| T Consensus 84 iA~lpl~~~~~~------~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v 157 (441) T protein:vir:98 84 LARMPIRVTVNG------QINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEI 157 (441) T ss_pred hccCceEEecCC------cccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcee Confidence 999999998643 23457999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcCCce-EEEEEe-----cCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCcee Q lcl|NC_019719. 154 DVKLVGKKV-VYRYQR-----DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQI 227 (424) Q Consensus 154 ~~~~~~~~~-~~~~~~-----~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~v 227 (424) ++..++++. +|.+.. 
.+..+.++++||||+|+++.++++|+||+..+..++.+..++++++.++|+||++|+|+ T Consensus 158 ~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~dviHir~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gi 237 (441) T protein:vir:98 158 ELKLDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGI 237 (441) T ss_pred EEEECCCCcEEEEEEEeccCcceeeEEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEE Confidence 998876543 333321 22357899999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCCCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC Q lcl|NC_019719. 228 LSTGEKVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 228 l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~ 306 (424) |+++....++++++++++.|++.+++ +|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||... T Consensus 238 l~~~~~~~~~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~ 317 (441) T protein:vir:98 238 LKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET 317 (441) T ss_pred EEeCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC Confidence 99998888888888899888876654 78999999999999999999999999999999999999999999999998633 Q ss_pred CCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHH Q lcl|NC_019719. 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEM 386 (424) Q Consensus 307 ~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~ 386 (424) . + .+.+++...|. 
+||+|++.+||++|+++|+++.+ +++++||+++|++.|.+++++.+++++++|+||+||+ T Consensus 318 ~-~---~s~~q~~~~y~-~tl~P~~~~ie~~ln~~L~~~~~--~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~ 390 (441) T protein:vir:98 318 A-N---MSITDANLDYL-STLKPYITCVCAELNFKFNDEYV--NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEI 390 (441) T ss_pred C-C---ccHHHHHHHHH-HHHHHHHHHHHHHHHhhcccccc--CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHH Confidence 2 2 24566666665 69999999999999999987653 5689999999999999999999999999999999999 Q ss_pred HHHhCCCCCCCCC--eeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 387 RRTDNLPPLPGGD--VAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 387 R~~~G~~p~~~gd--~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |+++|+||+|||| .+++|+|++|++..++.+..+.++. T Consensus 391 R~~~gl~pi~gGd~~~~~~~~n~~~~~~~~~~q~~~~~~~ 430 (441) T protein:vir:98 391 RQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRAT 430 (441) T ss_pred HHHhCCCCCCCCCcceEeeccccccccccccccccccccc Confidence 9999999999988 5889999999988765433222211 No 29 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=100.00 E-value=4.1e-94 Score=532.58 Aligned_cols=411 Identities=18% Similarity=0.270 Sum_probs=331.9 Q ss_pred CCCCcccccCCCCCc--hHHHHHhhccCcccC---cccccc--cccccccccccCcccccHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019719. 1 MEEPKYTIDLRTNNG--WWARLQSWFVGGRLV---TPNQGS--QTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G--~~~~l~~~~~~~~~~---~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 73 (424) ....-|-|++-.|.- -.-.+++.|.+.+.. .+..+. 
....+..+...++..++.+.|+++++||+||++||++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~Ia~~ 83 (441) T protein:vir:94 4 YNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASD 83 (441) T ss_pred ccCccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHHHHh Confidence 122234444433320 000122233332211 111111 1111223334456678899999999999999999999 Q ss_pred hccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceE Q lcl|NC_019719. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) ||++|+++|+.+ +....|+++++|+.+||++||+++||+.++.+++++||||++++|+..|+|++|+|++|++| T Consensus 84 iA~lp~~~~~~~------~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v 157 (441) T protein:vir:94 84 LARMPIRVTVNG------QINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEI 157 (441) T ss_pred hccCceeeecCc------cccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcee Confidence 999999998643 23457999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcCCce-EEEEEe-----cCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCcee Q lcl|NC_019719. 154 DVKLVGKKV-VYRYQR-----DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQI 227 (424) Q Consensus 154 ~~~~~~~~~-~~~~~~-----~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~v 227 (424) ++..++++. +|.+.. 
.+..+.++++||||+|+++.++++|+||+..+..++.+..+++++..++|+||++|+|| T Consensus 158 ~v~~d~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~~~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi 237 (441) T protein:vir:94 158 ELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGI 237 (441) T ss_pred EEEECCCccEEEEEEEeccCCceeEEEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEE Confidence 999876544 333321 22356899999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCCCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC Q lcl|NC_019719. 228 LSTGEKVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 228 l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~ 306 (424) |++++...++++++++++.|++.+++ .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||... T Consensus 238 l~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~ 317 (441) T protein:vir:94 238 LKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET 317 (441) T ss_pred EEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC Confidence 99998888888888999888886654 78999999999999999999999999999999999999999999999998633 Q ss_pred CCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHH Q lcl|NC_019719. 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEM 386 (424) Q Consensus 307 ~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~ 386 (424) . + .+.+++...| .+||.|++.+||++|+++|+++. 
.+++++||+++|++.|.+++++.+++++++|+||+||+ T Consensus 318 ~-~---~s~~q~~~~~-~~tl~P~~~~ie~eln~kl~~~~--~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~ 390 (441) T protein:vir:94 318 A-N---MSITDANLDY-LSTLKPYITCVCAELNFKFNDEY--VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEI 390 (441) T ss_pred C-C---ccHHHHHHHH-HHHHHHHHHHHHHHHhhhccccc--cCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHH Confidence 2 2 2445665554 57999999999999999998654 35789999999999999999999999999999999999 Q ss_pred HHHhCCCCCCCCC--eeeecccccchhhccccCCCccc-------CC Q lcl|NC_019719. 387 RRTDNLPPLPGGD--VAMRQSQYVPITDLGTNKEPRNN-------GA 424 (424) Q Consensus 387 R~~~G~~p~~~gd--~~~~~~n~~~~~~~~~~~~~~~~-------ga 424 (424) |+++|+||+|||| .+++++|++|++..++.+..+.+ |+ T Consensus 391 R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgG 437 (441) T protein:vir:94 391 RQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGG 437 (441) T ss_pred HHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCC Confidence 9999999999988 58899999999887553322221 11 No 30 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=100.00 E-value=4.1e-94 Score=532.58 Aligned_cols=411 Identities=18% Similarity=0.270 Sum_probs=331.9 Q ss_pred CCCCcccccCCCCCc--hHHHHHhhccCcccC---cccccc--cccccccccccCcccccHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019719. 1 MEEPKYTIDLRTNNG--WWARLQSWFVGGRLV---TPNQGS--QTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G--~~~~l~~~~~~~~~~---~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 73 (424) ....-|-|++-.|.- -.-.+++.|.+.+.. .+..+. 
....+..+...++..++.+.|+++++||+||++||++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~Ia~~ 83 (441) T protein:vir:79 4 YNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASD 83 (441) T ss_pred ccCccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHHHHh Confidence 122234444433320 000122233332211 111111 1111223334456678899999999999999999999 Q ss_pred hccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceE Q lcl|NC_019719. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) ||++|+++|+.+ +....|+++++|+.+||++||+++||+.++.+++++||||++++|+..|+|++|+|++|++| T Consensus 84 iA~lp~~~~~~~------~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v 157 (441) T protein:vir:79 84 LARMPIRVTVNG------QINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEI 157 (441) T ss_pred hccCceeeecCc------cccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcee Confidence 999999998643 23457999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcCCce-EEEEEe-----cCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCcee Q lcl|NC_019719. 154 DVKLVGKKV-VYRYQR-----DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQI 227 (424) Q Consensus 154 ~~~~~~~~~-~~~~~~-----~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~v 227 (424) ++..++++. +|.+.. 
.+..+.++++||||+|+++.++++|+||+..+..++.+..+++++..++|+||++|+|| T Consensus 158 ~v~~d~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~~~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi 237 (441) T protein:vir:79 158 ELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGI 237 (441) T ss_pred EEEECCCccEEEEEEEeccCCceeEEEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEE Confidence 999876544 333321 22356899999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCCCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC Q lcl|NC_019719. 228 LSTGEKVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 228 l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~ 306 (424) |++++...++++++++++.|++.+++ .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||... T Consensus 238 l~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~ 317 (441) T protein:vir:79 238 LKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET 317 (441) T ss_pred EEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC Confidence 99998888888888999888886654 78999999999999999999999999999999999999999999999998633 Q ss_pred CCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHH Q lcl|NC_019719. 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEM 386 (424) Q Consensus 307 ~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~ 386 (424) . + .+.+++...| .+||.|++.+||++|+++|+++. 
.+++++||+++|++.|.+++++.+++++++|+||+||+ T Consensus 318 ~-~---~s~~q~~~~~-~~tl~P~~~~ie~eln~kl~~~~--~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~ 390 (441) T protein:vir:79 318 A-N---MSITDANLDY-LSTLKPYITCVCAELNFKFNDEY--VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEI 390 (441) T ss_pred C-C---ccHHHHHHHH-HHHHHHHHHHHHHHHhhhccccc--cCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHH Confidence 2 2 2445665554 57999999999999999998654 35789999999999999999999999999999999999 Q ss_pred HHHhCCCCCCCCC--eeeecccccchhhccccCCCccc-------CC Q lcl|NC_019719. 387 RRTDNLPPLPGGD--VAMRQSQYVPITDLGTNKEPRNN-------GA 424 (424) Q Consensus 387 R~~~G~~p~~~gd--~~~~~~n~~~~~~~~~~~~~~~~-------ga 424 (424) |+++|+||+|||| .+++++|++|++..++.+..+.+ |+ T Consensus 391 R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgG 437 (441) T protein:vir:79 391 RQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGG 437 (441) T ss_pred HHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCC Confidence 9999999999988 58899999999887553322221 11 No 31 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=100.00 E-value=7e-94 Score=531.31 Aligned_cols=397 Identities=26% Similarity=0.456 Sum_probs=336.4 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =|+|+|++....... ..................+..++.+.|+++++|++||++||++||++||++|+.++++. 
T Consensus 1 Mgl~~~~f~~~~~~~--~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~---- 74 (409) T protein:vir:84 1 MSLFTRIFSGPSEER--TLTKISGIPSPAEDWAMHGDRPGANSAMTLGAFYACVTLLADTVASLSIDAYRKKDNVR---- 74 (409) T ss_pred CchhhhhhcCCCccc--ccccccccccccchhhccCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcc---- Confidence 577887743322111 11111111111223345677889999999999999999999999999999999765542 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCCCCceeeEEeecCceEEEEEc--CCceEEEEEecC Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNSAGDVISLLPLQSANMDVKLV--GKKVVYRYQRDS 170 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~-r~~~G~~~~l~~l~~~~v~~~~~--~~~~~~~~~~~~ 170 (424) ...|+++++|+.+||++||+++||+.++.+++++||+|+++. ++..|.+.+|||++|.+|++... ....++.+.... T Consensus 75 ~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~~~~~~~~~~ 154 (409) T protein:vir:84 75 IPVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDGDWIEPVYRI 154 (409) T ss_pred cccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcceEEEEEecC Confidence 356999999999999999999999999999999999999986 78889999999999999998754 334444444444 Q ss_pred ceEEecHhHeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHH Q lcl|NC_019719. 
171 EYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKE 249 (424) Q Consensus 171 ~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~ 249 (424) ....|+++||||+|+++.++ ++|+||+..+..++.+..++++++.++|+||++|+|+|+.+..+ ++++.+++++.|.+ T Consensus 155 ~g~~~~~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l-~~e~~~~~~~~~~~ 233 (409) T protein:vir:84 155 DGKVVPNHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDADL-TPDQVKQTQKQWIQ 233 (409) T ss_pred CceEEchhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCC-CHHHHHHHHHHHHH Confidence 45679999999999888776 68999999999999999999999999999999999999998776 66777777777766 Q ss_pred HhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHH Q lcl|NC_019719. 250 IAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 250 ~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P 329 (424) .. .|+|+++++++|++|+++++++.|+||+|.+++++++||++|||||++||..++++++++|.|++.+.|+++||.| T Consensus 234 ~~--~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P 311 (409) T protein:vir:84 234 SH--HNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLP 311 (409) T ss_pred Hh--ccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHH Confidence 54 4678999999999999999999999999999999999999999999999999888888889999999999999999 Q ss_pred HHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccc Q lcl|NC_019719. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 330 ~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~~ 409 (424) ++++||++|+++|. 
.+++++||+++|+++|.+++++.+.+++++|+||+||+|+++|+||+||||++++|+|++| T Consensus 312 ~~~~ie~~l~~~L~-----~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ggD~~~~~~n~~~ 386 (409) T protein:vir:84 312 WLRCIEQALDTFLP-----RGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPEGDIHLQPMNFVP 386 (409) T ss_pred HHHHHHHHHHHhcc-----CCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccc Confidence 99999999999883 2568999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhcccc---CCCcccCC Q lcl|NC_019719. 410 ITDLGTN---KEPRNNGA 424 (424) Q Consensus 410 ~~~~~~~---~~~~~~ga 424 (424) ++..... ++++.+++ T Consensus 387 ~~~~~~~~~~~~~~~~~~ 404 (409) T protein:vir:84 387 LGYVPPEEPAQEPQPNSA 404 (409) T ss_pred cccCCccccCcCCCCCCc Confidence 9775443 23333333 No 32 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=100.00 E-value=9.9e-94 Score=530.46 Aligned_cols=398 Identities=20% Similarity=0.290 Sum_probs=337.9 Q ss_pred CCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcc Q lcl|NC_019719. 11 RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 11 ~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~ 90 (424) -.+.+++.|+++.+.......+..... .+..+...++..++.+.|+++++|++||++||++||++||++|++++. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~~~~~--- 75 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWIDQSTSKLY--DFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKV--- 75 (409) T ss_pred CCccchhhhhhhhhhhhhhcccccccc--ccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeecccc--- Confidence 457789999888765543332222111 122222334556788899999999999999999999999999986542 Q ss_pred ccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc--eEEEEE- Q lcl|NC_019719. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK--VVYRYQ- 167 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~--~~~~~~- 167 (424) .+|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|.+.+|||++|++|++..+.+. .+|.+. T Consensus 76 ----~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~ 151 (409) T protein:vir:93 76 ----VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHA 151 (409) T ss_pred ----ccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEc Confidence 4689999999999999999999999999999999999999999999999999999999999876543 344443 Q ss_pred ecCceEEecHhHeeEeccC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019719. 168 RDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 168 ~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~ 246 (424) .++....|+++||||+|++ +.++++|+||+.++..++.++.+++++. ++.++..++++++.+..+ ++++.+++++. 
T Consensus 152 ~~g~~~~~~~~eVih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~~l-~~e~~~~~~~~ 228 (409) T protein:vir:93 152 ATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNV-GKEKRQQVLED 228 (409) T ss_pred CCceEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEecCCCC-CHHHHHHHHHH Confidence 3456778999999999975 6788999999999999999999998885 455555566777776655 77888889999 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) |++.++ ++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||+.+++ +++|+|++.+.|++.| T Consensus 229 ~~~~~~--~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~--~~sn~e~~~~~f~~~~ 304 (409) T protein:vir:93 229 FKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT--NFAKNEELNRFYLQHT 304 (409) T ss_pred HHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC--CcccHHHHHHHHHHHH Confidence 988764 578899999999999999999999999999999999999999999999976555 4569999999999999 Q ss_pred HHHHHHHHHHHHHhhccCccccc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecc Q lcl|NC_019719. 327 LQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQS 405 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~ 405 (424) |.|++++||++|+++|+++.++. 
+++++||++++++.|.+++++.+++++++|++|+||+|+++|+||+||||++++++ T Consensus 305 l~P~~~~ie~~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~ 384 (409) T protein:vir:93 305 LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISG 384 (409) T ss_pred HHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecc Confidence 99999999999999999998874 58899999999999999999999999999999999999999999999999999999 Q ss_pred cccchhhccccCCCcccCC Q lcl|NC_019719. 406 QYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 406 n~~~~~~~~~~~~~~~~ga 424 (424) |++|++...++++..++|. T Consensus 385 n~~~~~~~~~~~~~~~gG~ 403 (409) T protein:vir:93 385 DLYPIDTPLELRKSLKGGD 403 (409) T ss_pred cccccccchhhcccccCCC Confidence 9999987765544333333 No 33 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=100.00 E-value=2.1e-93 Score=528.63 Aligned_cols=398 Identities=20% Similarity=0.289 Sum_probs=336.8 Q ss_pred CCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcc Q lcl|NC_019719. 11 RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 11 ~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~ 90 (424) -.+..++.|+++.+.......+.... ..+..+...+...++.+.|+++++|++||++||++||++||++|++.+. 
T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~~~~--~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~~--- 75 (409) T protein:vir:94 1 MAKENIVTRIKKKLIDNWIDQSASKL--YDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKV--- 75 (409) T ss_pred CcccccchhhhhHHhhhhhcCCcccc--cccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeecccc--- Confidence 45677888888876554433222111 1122223334555788999999999999999999999999999986542 Q ss_pred ccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC--ceEEEEE- Q lcl|NC_019719. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQ- 167 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~--~~~~~~~- 167 (424) .+|++.++|+.+||++||+++||+.++.+++++||+|++++|+.+|.+++|||++|++|++..+.+ ..+|.+. T Consensus 76 ----~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~ 151 (409) T protein:vir:94 76 ----VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHA 151 (409) T ss_pred ----cchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEc Confidence 468999999999999999999999999999999999999999999999999999999999887654 3344443 Q ss_pred ecCceEEecHhHeeEeccC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019719. 168 RDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 168 ~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~ 246 (424) .++....|+++||||+|++ +.++++|+||+..+..++++..+++++. ++.++..++++++.+..+ ++++.+++++. 
T Consensus 152 ~~g~~~~~~~~dvih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~~l-~~e~~~~~~~~ 228 (409) T protein:vir:94 152 ATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNV-GKEKRQQVLED 228 (409) T ss_pred CCceEEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCeeEEecCCCC-CHHHHHHHHHH Confidence 3456778999999999976 5688999999999999999999999885 444555556677766655 77888899999 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) |++.++ ++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||+.+++ +++|+|++.+.|+++| T Consensus 229 ~~~~~~--~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~--~~sn~e~~~~~f~~~~ 304 (409) T protein:vir:94 229 FKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT--NFAKNEELNRFYLQHT 304 (409) T ss_pred HHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC--CcccHHHHHHHHHHHH Confidence 988774 678999999999999999999999999999999999999999999999976555 5569999999999999 Q ss_pred HHHHHHHHHHHHHhhccCccccc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecc Q lcl|NC_019719. 327 LQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQS 405 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~ 405 (424) |.|++++||++|+++|+++.++. 
+++++||++++++.|.+++++.+++++++|+||+||+|+++|+||+||||++++++ T Consensus 305 l~P~~~~ie~~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~ 384 (409) T protein:vir:94 305 LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISG 384 (409) T ss_pred HHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeEeecc Confidence 99999999999999999998874 68899999999999999999999999999999999999999999999999999999 Q ss_pred cccchhhccccCCCcccCC Q lcl|NC_019719. 406 QYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 406 n~~~~~~~~~~~~~~~~ga 424 (424) |++|++...+.+...++|. T Consensus 385 n~~~~~~~~~~~~~~kGG~ 403 (409) T protein:vir:94 385 DLYPIDTPLELRKSLKGGD 403 (409) T ss_pred cccccccchhhcccccCCC Confidence 9999987655443333332 No 34 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=100.00 E-value=2.4e-93 Score=528.32 Aligned_cols=398 Identities=20% Similarity=0.289 Sum_probs=335.9 Q ss_pred CCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcc Q lcl|NC_019719. 11 RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 11 ~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~ 90 (424) -.+.++|.|+++.+.......+..... 
.+..+...+...++.+.|+++++|++||++||++||++||++|++.+ T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~~~~~--~~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~lp~~~~~~~~---- 74 (409) T protein:vir:96 1 MAKENIVTRIKKKLIDNWIDQSASKLY--DFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK---- 74 (409) T ss_pred CccccchhhhhhHHhhhhhcccccccc--ccccccCccccccchhhHhhhHHHHHHHHHHHHhhhhCceEEeeccc---- Confidence 467889999998865554433222111 12222233445578889999999999999999999999999998653 Q ss_pred ccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc--eEEEEE- Q lcl|NC_019719. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK--VVYRYQ- 167 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~--~~~~~~- 167 (424) ..+|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|||++|.+|++..+++. .+|.+. T Consensus 75 ---~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~ 151 (409) T protein:vir:96 75 ---VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHA 151 (409) T ss_pred ---ccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEc Confidence 24699999999999999999999999999999999999999999999999999999999999876543 344443 Q ss_pred ecCceEEecHhHeeEeccC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019719. 168 RDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 168 ~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~ 246 (424) .++....|+++||||+|++ +.++++|+||+..+..++.+..+++++. ++.++..++++++.+.. .++++.+++++. 
T Consensus 152 ~~g~~~~~~~~evih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~~-l~~e~~~~~~~~ 228 (409) T protein:vir:96 152 ATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSN-VSTEKRQQVLED 228 (409) T ss_pred CCceEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCceeEEecCCC-CCHHHHHHHHHH Confidence 3456788999999999975 6788999999999999999999998875 34444444556666555 477788889999 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) |++.+. ++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||+.+++ +++|+|++.+.|+++| T Consensus 229 ~~~~~~--n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~--~~s~~e~~~~~f~~~~ 304 (409) T protein:vir:96 229 FKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNT--NFAKNEELNRFYLQHT 304 (409) T ss_pred HHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC--CcccHHHHHHHHHHHH Confidence 988774 678999999999999999999999999999999999999999999999986655 4569999999999999 Q ss_pred HHHHHHHHHHHHHhhccCccccc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecc Q lcl|NC_019719. 327 LQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQS 405 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~ 405 (424) |.|++++||++|+++|+++.++. 
+++++||++++++.|.+++++.+++++++|+||+||+|+++|+||+||||++++++ T Consensus 305 l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD~~~~~~ 384 (409) T protein:vir:96 305 LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISG 384 (409) T ss_pred HHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcceeeecc Confidence 99999999999999999998874 68899999999999999999999999999999999999999999999999999999 Q ss_pred cccchhhccccCCCcccCC Q lcl|NC_019719. 406 QYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 406 n~~~~~~~~~~~~~~~~ga 424 (424) |++|++...+.+...++|. T Consensus 385 n~~~~~~~~~~~~~~~gG~ 403 (409) T protein:vir:96 385 DLYPIDTPLELRKSLKGGD 403 (409) T ss_pred cccccccchhhcccccCCC Confidence 9999977655443333332 No 35 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=100.00 E-value=2.1e-93 Score=528.70 Aligned_cols=394 Identities=18% Similarity=0.270 Sum_probs=328.1 Q ss_pred CchHHHHHhhccCcccCcccccc--cccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGS--QTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~ 91 (424) =|||++. .++....+.... ....+..+....+..++.+.|+++++||+||++||+++|++||++|+.+. 
T Consensus 1 Mg~f~~~----~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~----- 71 (416) T protein:vir:45 1 MGIFYKN----EKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQ----- 71 (416) T ss_pred CCccccc----ccccccCCCcchhHHHHHhccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCcc----- Confidence 3444332 111111111111 11112233455677889999999999999999999999999999986432 Q ss_pred cccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCce-EEEEEe-- Q lcl|NC_019719. 92 KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV-VYRYQR-- 168 (424) Q Consensus 92 ~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~-~~~~~~-- 168 (424) ....|+++++|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|||++|++|++..++++. +|.+.. T Consensus 72 -~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~ 150 (416) T protein:vir:45 72 -INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRID 150 (416) T ss_pred -ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEec Confidence 3457899999999999999999999999999999999999999999999999999999999999876544 333321 Q ss_pred ---cCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019719. 
169 ---DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 169 ---~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~ 245 (424) .+..+.++++||||+|+++.++++|+||+..+..++.+..++++++.++|+||++|++|++++....++++++++++ T Consensus 151 ~~~~~~~~~~~~~evihir~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~ 230 (416) T protein:vir:45 151 SNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRARE 230 (416) T ss_pred CCCceeEEEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHH Confidence 22346899999999999999999999999999999999999999999999999999999999988888888899998 Q ss_pred HHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHH Q lcl|NC_019719. 246 NFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ 324 (424) Q Consensus 246 ~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~ 324 (424) .|++.+++ .|+|+++++++|++|++++.+++|+||+|.+++++++||++|||||.+||.... + .+.+++... +. T Consensus 231 ~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~---~~~~~~~~~-~~ 305 (416) T protein:vir:45 231 EFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-N---MSITDANLD-YL 305 (416) T ss_pred HHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-C---ccHHHHHHH-HH Confidence 88887665 789999999999999999999999999999999999999999999999986332 2 134555554 56 Q ss_pred HHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC--eee Q lcl|NC_019719. 
325 YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD--VAM 402 (424) Q Consensus 325 ~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd--~~~ 402 (424) +||.|++.+||++|+++|+++.+ +++++||+++|++.|.+++++.+++++++|+||+||+|+++|+||+|||| +++ T Consensus 306 ~~l~P~~~~ie~~ln~~l~~~~~--~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~ 383 (416) T protein:vir:45 306 STLKPYITCVCAELNFKFNDEYV--NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHR 383 (416) T ss_pred HHHHHHHHHHHHHHhhhcccccc--CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEe Confidence 79999999999999999987653 57899999999999999999999999999999999999999999999987 688 Q ss_pred ecccccchhhccccCCCcccC-------C Q lcl|NC_019719. 403 RQSQYVPITDLGTNKEPRNNG-------A 424 (424) Q Consensus 403 ~~~n~~~~~~~~~~~~~~~~g-------a 424 (424) +++|++|++...+.+..+.+. + T Consensus 384 ~~~n~~~~~~~~~~~~~~~~~~~~~~kgG 412 (416) T protein:vir:45 384 VDLNHVNIELVDEYQMNKSRATDKKLKGG 412 (416) T ss_pred ecccccccccccccCcccccccccccCCC Confidence 999999998776543333222 2 No 36 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=100.00 E-value=2.1e-93 Score=528.70 Aligned_cols=394 Identities=18% Similarity=0.270 Sum_probs=328.1 Q ss_pred CchHHHHHhhccCcccCcccccc--cccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGS--QTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~ 91 (424) =|||++. .++....+.... ....+..+....+..++.+.|+++++||+||++||+++|++||++|+.+. 
T Consensus 1 Mg~f~~~----~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~----- 71 (416) T protein:vir:81 1 MGIFYKN----EKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQ----- 71 (416) T ss_pred CCccccc----ccccccCCCcchhHHHHHhccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCcc----- Confidence 3444332 111111111111 11112233455677889999999999999999999999999999986432 Q ss_pred cccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCce-EEEEEe-- Q lcl|NC_019719. 92 KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV-VYRYQR-- 168 (424) Q Consensus 92 ~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~-~~~~~~-- 168 (424) ....|+++++|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|||++|++|++..++++. +|.+.. T Consensus 72 -~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~ 150 (416) T protein:vir:81 72 -INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRID 150 (416) T ss_pred -ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEec Confidence 3457899999999999999999999999999999999999999999999999999999999999876544 333321 Q ss_pred ---cCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019719. 
169 ---DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 169 ---~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~ 245 (424) .+..+.++++||||+|+++.++++|+||+..+..++.+..++++++.++|+||++|++|++++....++++++++++ T Consensus 151 ~~~~~~~~~~~~~evihir~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~ 230 (416) T protein:vir:81 151 SNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRARE 230 (416) T ss_pred CCCceeEEEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHH Confidence 22346899999999999999999999999999999999999999999999999999999999988888888899998 Q ss_pred HHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHH Q lcl|NC_019719. 246 NFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ 324 (424) Q Consensus 246 ~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~ 324 (424) .|++.+++ .|+|+++++++|++|++++.+++|+||+|.+++++++||++|||||.+||.... + .+.+++... +. T Consensus 231 ~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~---~~~~~~~~~-~~ 305 (416) T protein:vir:81 231 EFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-N---MSITDANLD-YL 305 (416) T ss_pred HHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-C---ccHHHHHHH-HH Confidence 88887665 789999999999999999999999999999999999999999999999986332 2 134555554 56 Q ss_pred HHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC--eee Q lcl|NC_019719. 
325 YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD--VAM 402 (424) Q Consensus 325 ~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd--~~~ 402 (424) +||.|++.+||++|+++|+++.+ +++++||+++|++.|.+++++.+++++++|+||+||+|+++|+||+|||| +++ T Consensus 306 ~~l~P~~~~ie~~ln~~l~~~~~--~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~ 383 (416) T protein:vir:81 306 STLKPYITCVCAELNFKFNDEYV--NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHR 383 (416) T ss_pred HHHHHHHHHHHHHHhhhcccccc--CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEe Confidence 79999999999999999987653 57899999999999999999999999999999999999999999999987 688 Q ss_pred ecccccchhhccccCCCcccC-------C Q lcl|NC_019719. 403 RQSQYVPITDLGTNKEPRNNG-------A 424 (424) Q Consensus 403 ~~~n~~~~~~~~~~~~~~~~g-------a 424 (424) +++|++|++...+.+..+.+. + T Consensus 384 ~~~n~~~~~~~~~~~~~~~~~~~~~~kgG 412 (416) T protein:vir:81 384 VDLNHVNIELVDEYQMNKSRATDKKLKGG 412 (416) T ss_pred ecccccccccccccCcccccccccccCCC Confidence 999999998776543333222 2 No 37 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=100.00 E-value=3.9e-93 Score=527.20 Aligned_cols=396 Identities=17% Similarity=0.266 Sum_probs=327.8 Q ss_pred HhhccCcccCcccccccc----cccccccccCcc------cccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcc Q lcl|NC_019719. 21 QSWFVGGRLVTPNQGSQT----GPVSAHGHLGDS------SINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 21 ~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~------~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~ 90 (424) +-.-.|...++|...... ..+.+ .+..+. .+....++++++|++||++||++||++||++|+++.++.. 
T Consensus 1 ~~~~~~~~~~~p~~~e~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~~~~~~~~ 79 (518) T protein:vir:10 1 MLLANGQTLSAPAMAELSPQMQDSYYY-APAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) T ss_pred CcccCceeecCchhhhhhhhhhccccc-ccccceecccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEEEEcCCCce Confidence 111223333444321111 11111 112222 2334568899999999999999999999999999877654 Q ss_pred ccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC--ceEEEEEe Q lcl|NC_019719. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQR 168 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~--~~~~~~~~ 168 (424) . ..+|++ .+|+.+||++||+++||+.++.+++++||+|++++|+.+|.+++|||++|.+|++..+.. ...|.|.. T Consensus 80 ~--~~~~~~-~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~~~~~~~y~~~~ 156 (518) T protein:vir:10 80 E--ESDTGY-AKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQA 156 (518) T ss_pred e--ccchHH-HHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcCCCCEEEEEEEe Confidence 3 345665 456679999999999999999999999999999999999999999999999999988754 34455543 Q ss_pred c----CceEEecHhHeeEeccCCCCcc-ccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019719. 169 D----SEYADFSQKEIFHLKGFGFTGL-VGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 169 ~----~~~~~~~~~evih~r~~~~~~~-~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~ 243 (424) . +..+.|+++||||+|+++.++. 
+|+||+..+..++.+..++++++.++|+||++|+||++.+..+ ++++++++ T Consensus 157 ~~~~~~~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~l-s~e~~~~~ 235 (518) T protein:vir:10 157 GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRL-SEAAQQRL 235 (518) T ss_pred cCCccceEEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCC-CHHHHHHH Confidence 2 3457899999999999998885 8999999999999999999999999999999999999999776 66778889 Q ss_pred HHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHH Q lcl|NC_019719. 244 EENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF 322 (424) Q Consensus 244 ~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~ 322 (424) ++.|++.+++ .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.+.| T Consensus 236 k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t--~sn~eq~~~~f 313 (518) T protein:vir:10 236 REQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRAT--FSNISAQMRAF 313 (518) T ss_pred HHHHHHHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCC--chhHHHHHHHH Confidence 9988886655 79999999999999999999999999999999999999999999999999887765 45999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCe Q lcl|NC_019719. 
323 LQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDV 400 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~ 400 (424) +++||.|++.+||++|+++|++..++ .++++||+++++++|.+++++.+++++++||||+||+|+++|+||++ |||+ T Consensus 314 ~~~tL~P~l~~ie~~ln~~L~~~~~~-~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~ 392 (518) T protein:vir:10 314 YRDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADE 392 (518) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccC-CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCe Confidence 99999999999999999999987664 57899999999999999999999999999999999999999999985 8999 Q ss_pred eeecccccchhhccccC--CCcc-----cCC Q lcl|NC_019719. 401 AMRQSQYVPITDLGTNK--EPRN-----NGA 424 (424) Q Consensus 401 ~~~~~n~~~~~~~~~~~--~~~~-----~ga 424 (424) +++++|++|++...++. +++. .++ T Consensus 393 ~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~ 423 (518) T protein:vir:10 393 LYANSALQPLGATPDGAVEGEEAPAPKRPAS 423 (518) T ss_pred eeecccceecccccccccCCCCCCCCCCCCc Confidence 99999999886543211 0000 000 No 38 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=100.00 E-value=5e-93 Score=526.63 Aligned_cols=396 Identities=17% Similarity=0.262 Sum_probs=327.5 Q ss_pred HhhccCcccCcccccc----cccccccccccCcc------cccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcc Q lcl|NC_019719. 21 QSWFVGGRLVTPNQGS----QTGPVSAHGHLGDS------SINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 21 ~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~------~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~ 90 (424) +-.-+|...+.|.... ....+. +.+..+. .+....|+++++|++||++||++||++||++|+++.++.. 
T Consensus 1 ~~~~~~~~~~~p~~~~~~~~~~~~~~-~~~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~~~~~~~~ 79 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSPQMQDSYY-YAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) T ss_pred CcccCceeeccchhhhhhhhhhhccc-ccceeceecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEEEEcCCccc Confidence 1112233333443211 111111 1222232 2334568899999999999999999999999998776644 Q ss_pred ccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC--ceEEEEEe Q lcl|NC_019719. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQR 168 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~--~~~~~~~~ 168 (424) + ..+|+ ..+|+.+||++||+++||+.++.+++++||+|++++|+..|.+++||||+|.+|++..+.+ ...|.|.. T Consensus 80 ~--~~~~~-~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~~~~~y~~~~ 156 (518) T protein:vir:78 80 E--EHDTG-YAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQA 156 (518) T ss_pred c--ccchH-HHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCCCEEEEEEEe Confidence 3 23444 4556679999999999999999999999999999999999999999999999999988753 34455543 Q ss_pred c----CceEEecHhHeeEeccCCCCcc-ccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019719. 169 D----SEYADFSQKEIFHLKGFGFTGL-VGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 169 ~----~~~~~~~~~evih~r~~~~~~~-~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~ 243 (424) . +..+.|+++||||+|++++++. 
+|+||+..+..++.+..++++++.++|+||++|+++|+.+..+ ++++.+++ T Consensus 157 ~~~~~~~~~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~l-s~e~~~~~ 235 (518) T protein:vir:78 157 GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRL-SPEAQQRL 235 (518) T ss_pred cCCccceeEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCC-CHHHHHHH Confidence 2 3457899999999999998885 7999999999999999999999999999999999999998776 67778889 Q ss_pred HHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHH Q lcl|NC_019719. 244 EENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF 322 (424) Q Consensus 244 ~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~ 322 (424) ++.|++.+++ .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.+.| T Consensus 236 k~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st--~sn~e~~~~~f 313 (518) T protein:vir:78 236 REQFDRAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRAT--FSNISAQMRAF 313 (518) T ss_pred HHHHHHHhcCcccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCC--chhHHHHHHHH Confidence 9988876655 78999999999999999999999999999999999999999999999999887764 56999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCe Q lcl|NC_019719. 
323 LQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDV 400 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~ 400 (424) +++||.|++.+||++||++|++..++ +++++||+++|+++|.+++++.+.+++++|+||+||+|+++|+||++ |||+ T Consensus 314 ~~~tL~P~~~~ie~eln~~L~~~~~~-~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD~ 392 (518) T protein:vir:78 314 YRDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADE 392 (518) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccC-cceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCce Confidence 99999999999999999999987664 56899999999999999999999999999999999999999999996 7999 Q ss_pred eeecccccchhhccccC--CCcc-----cCC Q lcl|NC_019719. 401 AMRQSQYVPITDLGTNK--EPRN-----NGA 424 (424) Q Consensus 401 ~~~~~n~~~~~~~~~~~--~~~~-----~ga 424 (424) +++++|++|++...++. +++. .++ T Consensus 393 ~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~ 423 (518) T protein:vir:78 393 LYANSALQPLGATPDGAVEGEEAPAPKRPAS 423 (518) T ss_pred eeecccceecccccccccCCCCCCCCCCCCc Confidence 99999999986543211 0000 000 No 39 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=100.00 E-value=4.9e-92 Score=521.19 Aligned_cols=390 Identities=16% Similarity=0.204 Sum_probs=321.7 Q ss_pred HhhccCcccCccccccc-ccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchh Q lcl|NC_019719. 21 QSWFVGGRLVTPNQGSQ-TGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPL 99 (424) Q Consensus 21 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l 99 (424) |.+|++........+.. ...........+.. +...|+++++||+||++||++||++|+++|++..++. 
...|++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~~~Al~~~~V~~cv~~ia~~iA~lp~~~~~~~~~~~----~~~~~~ 75 (417) T protein:vir:38 1 MKLFRGLATEVDPHWADHLLDSGVIPSFRGGY-LGISALRNSDVLTAVSIVSGDVSRFPLVITDSSTDEV----IDLANI 75 (417) T ss_pred CccccccccCCCccchhhhcccccccccCCce-echhhcccHHHHHHHHHHHHhhccCeeEEEEcCCcce----eccchH Confidence 33333322111111111 11112222233333 3456899999999999999999999999998866543 456899 Q ss_pred hhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-CceeeEEeecCceEEEEEcC-CceEEEEEe--cCceEEe Q lcl|NC_019719. 100 ARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-GDVISLLPLQSANMDVKLVG-KKVVYRYQR--DSEYADF 175 (424) Q Consensus 100 ~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~-G~~~~l~~l~~~~v~~~~~~-~~~~~~~~~--~~~~~~~ 175 (424) .++|+.+||++||+++||+.++.+++++||+|++++|+.. |.+..|+|++|.+|.+..++ +...|.|.. ++....+ T Consensus 76 ~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~~~~y~~~~~~~~~~~~~ 155 (417) T protein:vir:38 76 EYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPDNIIYRFTPYNSSMQKVC 155 (417) T ss_pred HHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCCeEEEEEEEcCCcEEEEe Confidence 9999999999999999999999999999999999999865 67999999999999997764 444555543 3445678 Q ss_pred cHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcc Q lcl|NC_019719. 
176 SQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPV 255 (424) Q Consensus 176 ~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (424) +++||||||+++.|+++|+||+.++..++.++.++++++.++|+||++|+++++.+..+ ++++.+++++.|++.+++.| T Consensus 156 ~~~dviH~r~~~~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~l-~~e~~~~~~~~~~~~~~g~n 234 (417) T protein:vir:38 156 GFEDVIHWKFFSYDTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKESRL-SAEARQKIREDFERAQAGAD 234 (417) T ss_pred cCcceEEecCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCC-CHHHHHHHHHHHHHHhcccc Confidence 99999999999999999999999999999999999999999999999999999988776 67778999999999888889 Q ss_pred cCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 256 KKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWE 335 (424) Q Consensus 256 ~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie 335 (424) +|+++++++|++|++++++++|+||+|.+++++++||++|||||++||. +.+++|++++.++|+++||.|++++|| T Consensus 235 ~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~----~~~~s~~e~~~~~~~~~tl~P~~~~ie 310 (417) T protein:vir:38 235 AGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRLAQ----NSPNQSVKQLADDYIRNDLPFYFEPIT 310 (417) T ss_pred cCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCC----CCcchhHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999974 234568999999999999999999999 Q ss_pred HHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC--Ceeeecccccchhhc Q lcl|NC_019719. 336 NSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG--DVAMRQSQYVPITDL 413 (424) Q Consensus 336 ~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~g--d~~~~~~n~~~~~~~ 413 (424) ++|+++|+++.++.+++++||++.+.+.+ .+.+++++++|+||+||+|+++|+||+||| |++++|+|+++++.. 
T Consensus 311 ~~l~~~Ll~~~~~~~~~~~fd~~~l~~~~----~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~ 386 (417) T protein:vir:38 311 SEFELKLLDDAQRHQYCIGFDTKSVNGLP----IADVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQK 386 (417) T ss_pred HHHHhhhcChhhcccceEEechhhhhHHH----HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccccccccc Confidence 99999999988877788999988875433 344778999999999999999999999886 889999999999865 Q ss_pred cccCCCccc---CC Q lcl|NC_019719. 414 GTNKEPRNN---GA 424 (424) Q Consensus 414 ~~~~~~~~~---ga 424 (424) .+.+.++.. |+ T Consensus 387 ~~~~~~~~~~~kgg 400 (417) T protein:vir:38 387 EAYQAEHAAELKGG 400 (417) T ss_pred cccccccccccCCC Confidence 442211100 01 No 40 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=100.00 E-value=4.5e-91 Score=515.88 Aligned_cols=403 Identities=18% Similarity=0.278 Sum_probs=331.1 Q ss_pred CchHHHHHhhccCcccCccccccccccc-ccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPV-SAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~ 92 (424) =|||++|..+. ...+.+......+.+ ......+...+....++++|+|++||++||++||++|+++|++..+|..++ T Consensus 1 Mg~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~~~~ 78 (423) T protein:vir:81 1 MGFLQKLGLAP--SVVATPEPIELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDGGRER 78 (423) T ss_pred CchhHhhcccc--ccccCccccccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCCceee Confidence 68888874322 222222221112222 222222333344566678999999999999999999999999887776544 Q ss_pred ccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC--CceeeEEeecCceEEEEEcC---CceEEEEE Q lcl|NC_019719. 
93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA--GDVISLLPLQSANMDVKLVG---KKVVYRYQ 167 (424) Q Consensus 93 ~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~--G~~~~l~~l~~~~v~~~~~~---~~~~~~~~ 167 (424) ..+|++.+||. +||++||+++||+.++.+++++||+|+++.|+.. +.+..|+|+++..+.+.... +...|.+. T Consensus 79 -~~~~~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~~~~~~~~~~Y~~~ 156 (423) T protein:vir:81 79 -VREGHLARVCK-LANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRAYKDGWGSLDYIII 156 (423) T ss_pred -eccchHHHHhh-cCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeeeccCCCcceEEEEE Confidence 46789998886 8999999999999999999999999999998753 46778888888888776542 23344443 Q ss_pred ----ecCceEEecHhHeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC----CCCHH Q lcl|NC_019719. 168 ----RDSEYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK----VLTEQ 238 (424) Q Consensus 168 ----~~~~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~----~~~~~ 238 (424) .++....++++||||+|.++.++ .+|+||+..++.++....++++++.++|+||++|+++|+.+.. ..+++ T Consensus 157 ~~~~~~g~~~~~~~~evih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e 236 (423) T protein:vir:81 157 ESGDNDGRSVKVPGERVIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAE 236 (423) T ss_pred EecCCCceEEEEcccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHH Confidence 24566889999999999888776 5799999999999999999999999999999999999987643 24778 Q ss_pred HHHHHHHHHHHHh--CCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH Q lcl|NC_019719. 
239 QRSQVEENFKEIA--GGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 239 ~~~~~~~~~~~~~--~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) +++++++.|++.+ +.+|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+| T Consensus 237 ~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t--~sn~e 314 (423) T protein:vir:81 237 SRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNAN--YSNVR 314 (423) T ss_pred HHHHHHHHHHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCC--cccHH Confidence 8888999888865 4578899999999999999999999999999999999999999999999999877664 55899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccCcccc--ccceeeecchhhhccCHHHHHHHHHHHHh-CCCCCHHHHHHHhCCC Q lcl|NC_019719. 317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDV--GRIHAEHNLDGLLRGDSASRAAFMKAMGE-AGLRTINEMRRTDNLP 393 (424) Q Consensus 317 ~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~-~g~~T~NE~R~~~G~~ 393 (424) ++.++|+++||.|++.+||++|+++|+++.+. .+++++||.++|+++|.+++++.+++++. +||||+||+|+++|+| T Consensus 315 ~~~~~f~~~~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~ 394 (423) T protein:vir:81 315 EFRKALYGDNLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMDNLP 394 (423) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCC Confidence 99999999999999999999999999998764 46899999999999999999999999885 6999999999999999 Q ss_pred CCCCCCeeeecccccchhhccccCCCccc Q lcl|NC_019719. 
394 PLPGGDVAMRQSQYVPITDLGTNKEPRNN 422 (424) Q Consensus 394 p~~~gd~~~~~~n~~~~~~~~~~~~~~~~ 422 (424) |+||||++++|.|+.+.+......+..+- T Consensus 395 p~~gGD~~~~p~n~~~~~~~~~~~~~~~t 423 (423) T protein:vir:81 395 SIDGGDDLARPLNTEFGDSEDAPGEEVET 423 (423) T ss_pred CCCCcceeecccccccCccCCCCCCCCCC Confidence 99999999999999886543322222222 No 41 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=100.00 E-value=9.1e-91 Score=514.23 Aligned_cols=388 Identities=15% Similarity=0.189 Sum_probs=325.2 Q ss_pred HhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhh Q lcl|NC_019719. 21 QSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLA 100 (424) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~ 100 (424) |+||++..............+ ........++...|+++++||+||++||++||++|+++++.+.. ...+|++. T Consensus 1 m~~f~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~Al~~~~V~~~i~~Ia~~iA~lp~~~~~~~g~-----~~~~~~~~ 73 (406) T protein:vir:97 1 MSFFQPLGTSKVSYDDYISSV--LAGDVSQKYLGVSALKNSDILTATSIIAGDIARFPLVKKDVNGD-----IIHDEDIN 73 (406) T ss_pred CccccccCCCCCCcchHHHHH--hcCCCCcccccchhhccHHHHHHHHHHHHhhhhCeeEEEecCcc-----ccccchHH Confidence 777766543322111111111 11122234555679999999999999999999999988765432 34579999 Q ss_pred hhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC-CCceeeEEeecCceEEEEEcCC-ceEEEEE--ecCceEEec Q lcl|NC_019719. 101 RLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS-AGDVISLLPLQSANMDVKLVGK-KVVYRYQ--RDSEYADFS 176 (424) Q Consensus 101 ~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~-~G~~~~l~~l~~~~v~~~~~~~-~~~~~~~--~~~~~~~~~ 176 (424) ++|+.+||++||+++||+.++.+|+++||||++++|+. .|.+.+|+|++|++|++..+++ ..+|.+. 
.++....++ T Consensus 74 ~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~~~~~y~~~~~~~~~~~~~~ 153 (406) T protein:vir:97 74 YLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDNHEIVYTFTDMLTAKQVKCF 153 (406) T ss_pred HHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCCceEEEEEEecCCceEEEEc Confidence 99999999999999999999999999999999999985 6899999999999999987654 4455554 456678899 Q ss_pred HhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCccc Q lcl|NC_019719. 177 QKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVK 256 (424) Q Consensus 177 ~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 256 (424) ++||||+|+++.++++|+||+.++..++.++.+++++..++|+||+.|++++..+.. .++++.+++++.|++..++.|+ T Consensus 154 ~~evih~r~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~~~-l~~e~~~~~~~~~~~~~~g~n~ 232 (406) T protein:vir:97 154 AHDVIHWKFFSHDTILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKGAQ-LSGDARQRARQEFEKMREGSVG 232 (406) T ss_pred cccEEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecCCC-CCHHHHHHHHHHHHHHhccccc Confidence 999999999999999999999999999999999999999999999988877666554 5788889999999999998999 Q ss_pred CcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 257 KRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWEN 336 (424) Q Consensus 257 g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~ 336 (424) |+++++++|++|+++++++.|+||+|.+++++++||++|||||.+||.. 
.+++|.+++.+.|++.||.|++++||+ T Consensus 233 g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~----~~~~~~e~~~~~f~~~~l~P~~~~ie~ 308 (406) T protein:vir:97 233 GSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVN----SPNQSVAQLMEDYVTNDLPFYFDAITS 308 (406) T ss_pred CceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCC----CCcchHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999853 234588999999999999999999999 Q ss_pred HHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeeeecccccchhhcc Q lcl|NC_019719. 337 SIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAMRQSQYVPITDLG 414 (424) Q Consensus 337 ~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~--gd~~~~~~n~~~~~~~~ 414 (424) +|+++|+++.++..++++||++++ .+.+++.+.+++++|+||+||+|+++|+||+++ ||++++|+|++|++... T Consensus 309 ~l~~kll~~~~~~~~~i~fd~~~~----~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~ 384 (406) T protein:vir:97 309 ELGLKTLNDKDRRLYHIEFDTRSV----TGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDRYQSSLNYVFLDKKE 384 (406) T ss_pred HHhhhhcChhhccceeEEEecCcc----chhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeEeeccCccchhccc Confidence 999999999888778899997654 556677888999999999999999999999965 99999999999998764 Q ss_pred ccC--------CCcccCC Q lcl|NC_019719. 415 TNK--------EPRNNGA 424 (424) Q Consensus 415 ~~~--------~~~~~ga 424 (424) +.+ +++++|- T Consensus 385 ~~~~~~~~~~~gg~~~~~ 402 (406) T protein:vir:97 385 EYQDKVGIKGKGGEVNAE 402 (406) T ss_pred ccccccccccCCCCCCCC Confidence 322 2222222 No 42 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=100.00 E-value=1.6e-90 Score=512.83 Aligned_cols=404 Identities=15% Similarity=0.156 Sum_probs=333.6 Q ss_pred hHHHHHhhccCccc-Ccccccccc---cccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccc Q lcl|NC_019719. 
16 WWARLQSWFVGGRL-VTPNQGSQT---GPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 16 ~~~~l~~~~~~~~~-~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~ 91 (424) +++.+...+++... .++....+. ++.....+.++..++.+.|+++|+||+||++||++||++||++|+...++..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~~g~~~ 80 (460) T protein:vir:10 1 MANRIIRALRELTGLDNKFNDAFIKYIGQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIKVVKDTKAYQ 80 (460) T ss_pred CchhHHHHHhhhhccCCCchHHHHHhhccccCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEEeccCCccch Confidence 66655555544322 222111111 11112233466778899999999999999999999999999999998887532 Q ss_pred c-------------------------ccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC----CCce Q lcl|NC_019719. 92 K-------------------------VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS----AGDV 142 (424) Q Consensus 92 ~-------------------------~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~----~G~~ 142 (424) + ....+++..+|+.+||++||+++||+.++.+++++||||++++|+. .|.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~~~~G~~ 160 (460) T protein:vir:10 81 QLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDGINAGVP 160 (460) T ss_pred hhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCccCcee Confidence 1 2234455667888999999999999999999999999999999964 4789 Q ss_pred eeEEeecCceEEEEEcCCce---------EEEEEecCceEEecHhHeeEeccCCC------CccccCchHHHHHHHHHHH Q lcl|NC_019719. 143 ISLLPLQSANMDVKLVGKKV---------VYRYQRDSEYADFSQKEIFHLKGFGF------TGLVGLSPIAFACKSAGVA 207 (424) Q Consensus 143 ~~l~~l~~~~v~~~~~~~~~---------~~~~~~~~~~~~~~~~evih~r~~~~------~~~~G~s~~~~~~~~i~~~ 207 (424) .+||||+|++|++..++++. .|.+..++....|+++||||||+++. ++++|+||+..++.++.+. 
T Consensus 161 ~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~ 240 (460) T protein:vir:10 161 SQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQFIEFNEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQ 240 (460) T ss_pred EEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCceeEEecccceEEEecCCCCcccccCccccccHHHHHHHHHHHH Confidence 99999999999999876542 24445677788999999999997654 3589999999999999999 Q ss_pred HHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHH Q lcl|NC_019719. 208 VAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKF 286 (424) Q Consensus 208 ~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~ 286 (424) .+++++..++|+||+.|+++++.+..+ ++++.+++++.|++.+++ +|+|+++++++|++|+++++++.|+||+|.+++ T Consensus 241 ~~~~~~~~~~f~ng~~~~~i~~~~~~l-~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~ 319 (460) T protein:vir:10 241 NSTIDNNVKTMQNGGVFGFIHGGSTGL-TQPQADSLKQRLTEMDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKY 319 (460) T ss_pred HHHHHHHHHHHhcCCCcceeeecCCCC-CHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHH Confidence 999999999999999999998877665 677888888888886654 689999999999999999999999999999999 Q ss_pred HHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccc-cceeeecchhh--hcc Q lcl|NC_019719. 287 QVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGL--LRG 363 (424) Q Consensus 287 ~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l--~~~ 363 (424) ++++||++|||||++||..++++.+++|+|++.+.|+++||.|++.+||++||++|+++.++. +++++||++.+ ++. 
T Consensus 320 ~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~~ 399 (460) T protein:vir:10 320 DQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEMQT 399 (460) T ss_pred HHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhHHH Confidence 999999999999999999988888889999999999999999999999999999999988764 68899999887 333 Q ss_pred CHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC--CCCCeeeecccccchhhccccC-CCcccCC Q lcl|NC_019719. 364 DSASRAAFMKAMGEAGLRTINEMRRTDNLPPL--PGGDVAMRQSQYVPITDLGTNK-EPRNNGA 424 (424) Q Consensus 364 d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~--~~gd~~~~~~n~~~~~~~~~~~-~~~~~ga 424 (424) |.+ ....++++|++|+||+|+++|+||+ +|||++++|+|++|+++..++. ++.+|.. T Consensus 400 d~~----~~~~~~~~g~~T~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~~~~~~~~~~nq~ 459 (460) T protein:vir:10 400 DMV----AMASWLNTIPVTPNEIRIAMKYETLNQDGMDIVFMPSNKVRIDDVSNNLIDSAFNQN 459 (460) T ss_pred HHH----HHHHHHhCCCCCHHHHHHHhCCCCCCCCCCCeeeecccccchhhcccccCCCcccCC Confidence 333 4455788999999999999999998 5799999999999998876542 2222222 No 43 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=100.00 E-value=4.5e-88 Score=499.46 Aligned_cols=374 Identities=21% Similarity=0.272 Sum_probs=313.4 Q ss_pred CchHHHHHhh------------------------ccCcccCcccc-ccccccc------ccccccCcccccHHHHhhhHH Q lcl|NC_019719. 14 NGWWARLQSW------------------------FVGGRLVTPNQ-GSQTGPV------SAHGHLGDSSINDERILQIST 62 (424) Q Consensus 14 ~G~~~~l~~~------------------------~~~~~~~~~~~-~~~~~~~------~~~~~~~~~~~~~~~~~~~~~ 62 (424) =|||+++++. |++.......+ .....+. 
..+....+..++.+.++++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~ 80 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLIDV 80 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccchhhHhhhHH Confidence 7999999885 22222111110 1011111 122344677889999999999 Q ss_pred HHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEE-eeCCCCc Q lcl|NC_019719. 63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV-DRNSAGD 141 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~-~r~~~G~ 141 (424) ||+||++||++||++|+++|+.++. .+.+..+|+.+||++||+++||+.++.+|++ ||+|+++ .|+.+|. T Consensus 81 v~acV~~Ia~~iA~lpl~~~~~~~~--------~~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay~~~i~r~~~G~ 151 (409) T protein:vir:83 81 AWACIDLNASVLSSMPIYRMRNGRI--------IDSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEAFVLPMAHGSDGY 151 (409) T ss_pred HHHHHHHHHHhhccCceEEeeCCcc--------ccchhhhcccCCCCCCCHHHHHHHHHHHHhh-CCcEEEEEEECCCCc Confidence 9999999999999999999975432 2445668999999999999999999999987 9999975 5899999 Q ss_pred eeeEEeecCceEEEEEcCCceEEEEEecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019719. 142 VISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFAN 220 (424) Q Consensus 142 ~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 220 (424) |++|+||+|++|++..+.++.. 
.|..++ ...++||||+|+++ .++++|+||+..++.++.+..++++++.++|+| T Consensus 152 ~~~L~pl~p~~v~v~~~~~g~~-~y~~~~---~~~~~eiiHir~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~n 227 (409) T protein:vir:83 152 PIRFRVVPPWLVNVELKKGARR-EYRIGG---LNVTDEILHIRYQGNTADAHGHGPLESAAPRQVVIGLLQKYVQNLAET 227 (409) T ss_pred EEEEEEECCcceEEEEcCCceE-EEEEcc---ccCccceEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 9999999999999988766542 233332 23468999999875 577899999999999999999999999999999 Q ss_pred cCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCcee-eecccChhHHHHHHHHHHHHHHHHHHhCCCH Q lcl|NC_019719. 221 GAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFST-SAIGVTPQDAEMMASRKFQVSELARFFGVPP 299 (424) Q Consensus 221 ~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~-~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~ 299 (424) |++|+|+++++..+ ++++.+++++.|+...++ |+|+++++++|+++ ++++++++|+||+|.+++++++||++||||| T Consensus 228 ga~p~gil~~~~~l-s~e~~~~~~~~~~~~~~~-nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp 305 (409) T protein:vir:83 228 GGVPLYWLGVERRL-SETEAVDLMDRWIESRSK-YAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPP 305 (409) T ss_pred CCCcceEeecCCCC-CHHHHHHHHHHHHHhhCC-ccCccceecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCH Confidence 99999999998776 677778888888876654 78999999999997 5699999999999999999999999999999 Q ss_pred HHhcCCCCCC-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhC Q lcl|NC_019719. 300 HLVGDVEKST-SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEA 378 (424) Q Consensus 300 ~~l~~~~~~~-~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~ 378 (424) ++||..++++ .+++|+|++.++|+++||.|++++||++|+++|+++. 
.+++||++.++++|.++|++.+++++++ T Consensus 306 ~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~----~~~~f~~~~llr~d~~~r~~~~~~~~~~ 381 (409) T protein:vir:83 306 FLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALDRWALPSP----QHLELNRDDYTRPSLVERATAYKIMIEA 381 (409) T ss_pred HHccCCCCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCC----cEEEeehhhhhccCHHHHHHHHHHHHhC Confidence 9999876644 3678999999999999999999999999999999763 4689999999999999999999999999 Q ss_pred CCCCHHHHHHHhCCCCCCCCCeeeeccc Q lcl|NC_019719. 379 GLRTINEMRRTDNLPPLPGGDVAMRQSQ 406 (424) Q Consensus 379 g~~T~NE~R~~~G~~p~~~gd~~~~~~n 406 (424) |+||+||+|+++||||++|||++.-.+- T Consensus 382 G~lT~NE~R~~~glpp~~ggd~l~~~gv 409 (409) T protein:vir:83 382 GVMEPNEARAMERLHSEAAAVRLSGGGV 409 (409) T ss_pred CCcCHHHHHHHhCCCCCCCCcccCCCCC Confidence 9999999999999999999998843222 No 44 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=100.00 E-value=1.4e-87 Score=496.69 Aligned_cols=380 Identities=19% Similarity=0.234 Sum_probs=310.1 Q ss_pred ccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCC Q lcl|NC_019719. 28 RLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSP 107 (424) Q Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~p 107 (424) -.+.| +..+.+..++......++.+.++++++||+||++||++||++||++|+.+. 
+....||++++|+.+| T Consensus 1 ~~~~~---~~~g~~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~-----~~~~~~~l~~lL~~~P 72 (723) T protein:vir:94 1 MTTFP---SGAGGWNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDG-----ELDELHPLSQLWNVMP 72 (723) T ss_pred Ccccc---cCCCccccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCC-----ccchhhHHHHHHhhCC Confidence 10111 111122233334455567889999999999999999999999999986532 2235699999999999 Q ss_pred CCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC---CCceeeEEeecCceEEEEEcCCc--------eEEEEE-ecCceEEe Q lcl|NC_019719. 108 NQYMTAQEFREAMTMQLCFYGNAYALVDRNS---AGDVISLLPLQSANMDVKLVGKK--------VVYRYQ-RDSEYADF 175 (424) Q Consensus 108 N~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~---~G~~~~l~~l~~~~v~~~~~~~~--------~~~~~~-~~~~~~~~ 175 (424) |++||+++||+.++.+++++||+|++++|+. .|.|.+|+++++..+.+....+. ..|.+. .++....| T Consensus 73 N~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y~~~~~~G~~~~~ 152 (723) T protein:vir:94 73 NRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGYVIERTDGVRVPV 152 (723) T ss_pred CCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEEEEEecCceeEEe Confidence 9999999999999999999999999999654 58899999999988777654332 123332 45667889 Q ss_pred cHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHh-CC Q lcl|NC_019719. 176 SQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIA-GG 253 (424) Q Consensus 176 ~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~-~~ 253 (424) +++||||||.++ .++++|+||+..+..++.+..++++++.++|+||++|++||+.+ . .++++.+++++.|++.+ |. 
T Consensus 153 ~~~dIiHir~~~~~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~-~-l~~e~~~~~~~~~~~~~~G~ 230 (723) T protein:vir:94 153 LADEMLWLRFSDPYDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVNLG-D-MDEQTFTKTVAAFRSQVEGV 230 (723) T ss_pred cccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC-C-CCHHHHHHHHHHHHHHhhch Confidence 999999999875 79999999999999999999999999999999999999999976 4 47788888888887755 55 Q ss_pred cccCcceecC----------CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHH Q lcl|NC_019719. 254 PVKKRLWILE----------AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 254 ~~~g~~~~l~----------~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~ 323 (424) .|+|++++|+ .|++|++++++++|+||+|++++++++||++|||||.+|++. .+++|.+++.+.|+ T Consensus 231 ~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~----st~sN~e~~~~~f~ 306 (723) T protein:vir:94 231 QNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLGG----STYENQAEAKAAVW 306 (723) T ss_pred hhcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCC----CCcccHHHHHHHHH Confidence 7999999986 589999999999999999999999999999999999999642 34568999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC--ee Q lcl|NC_019719. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD--VA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd--~~ 401 (424) ++||+|++++||++||++|++..+. 
.++++||...++++|.+++++.+..++++|+||+||+|+++|+||+|||| .+ T Consensus 307 ~~tL~P~~~~ie~~ln~~Ll~~~g~-~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~~~ 385 (723) T protein:vir:94 307 TETLIPQMEVMASITDLQLLPDIGW-TVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQMT 385 (723) T ss_pred HHHHHHHHHHHHHHHhHhhcccccC-ceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccce Confidence 9999999999999999999976543 36778888889999999999999999999999999999999999999987 33 Q ss_pred eecc--cccchhhccccCCCcccCC Q lcl|NC_019719. 402 MRQS--QYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~--n~~~~~~~~~~~~~~~~ga 424 (424) +.|. +++|.++.. ...+++++ T Consensus 386 ~~p~~~~~a~~~~~~--p~~~e~~~ 408 (723) T protein:vir:94 386 LTPYRAQFAPAPAPA--PAVEEGAA 408 (723) T ss_pred eccccccccCCCCCC--ccchhhhH Confidence 4543 444443322 11122222 No 45 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=100.00 E-value=2.9e-86 Score=489.53 Aligned_cols=391 Identities=17% Similarity=0.205 Sum_probs=325.8 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =|||++|+....+... .........+.......+..++...++++++|++||++||++||++||++|+..+++.. 
T Consensus 1 Mg~f~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~--- 75 (406) T protein:vir:95 1 MGLFDRWRRTKRKSKI--RADTGYVGLFMSGEDVSFLVPGYVRLSDNPEVRMAVHKIADLISSMTIYLMQNTEDGDI--- 75 (406) T ss_pred Ccchhhhccccccccc--cccchhhhhhccCcccCccccCHHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce--- Confidence 5888877544433221 11122223344444556666788899999999999999999999999999998876643 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCe--EEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEEecCc Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNA--YALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a--~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~ 171 (424) ...|++.++|+.+||++||+++||+.++.+++++|++ |+++.|+..|.+.+|||++|.+|++..+.++..|. .++ T Consensus 76 ~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i~~~~v~~~~~~~~~~~~--~~~- 152 (406) T protein:vir:95 76 RIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPLTPSKVNFLDTPDGYQVL--YGG- 152 (406) T ss_pred eecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEEcCceeEEEEcCCeEEEE--ecc- Confidence 3468999999999999999999999999999999765 66678999999999999999999999998864443 333 Q ss_pred eEEecHhHeeEecc--CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHH Q lcl|NC_019719. 
172 YADFSQKEIFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKE 249 (424) Q Consensus 172 ~~~~~~~evih~r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~ 249 (424) +.|+++||||+|+ .+.++++|+||+..+..++.+..++.+++.++|+||+.|+++++.+..+ ++++.+++++.|.+ T Consensus 153 -~~~~~~evih~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l-~~e~~~~~~~~~~~ 230 (406) T protein:vir:95 153 -QTFNYDEVLHFIYNPDPERPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDAAT-AELSSEEGRNAVFK 230 (406) T ss_pred -EEEchhHEEEeeccCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCC-CHHHHHHHHHHHHH Confidence 5799999999985 3567899999999999999999999999999999999999999999877 45555666666655 Q ss_pred -HhCCcccCcceecCCC-ceeeecc-cChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 250 -IAGGPVKKRLWILEAG-FSTSAIG-VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 250 -~~~~~~~g~~~~l~~g-~~~~~l~-~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) +.|..|+|++++++.| .+++++. ++++|+||+|.+++++++||++|||||++||... +.+++..+|+++| T Consensus 231 ~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~-------~~~~~~~~~~~~~ 303 (406) T protein:vir:95 231 KYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIGE-------FNRDEYNNFINST 303 (406) T ss_pred HhccccccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-------chHHHHHHHHHHH Confidence 5566789999888654 5667764 6899999999999999999999999999997432 4578889999999 Q ss_pred HHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccc Q lcl|NC_019719. 
327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQ 406 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n 406 (424) |.|++++||++|+++|+++.+ ++++||++++++.|.+++++.+.+++++||||+||+|+++|+||+||||++++|+| T Consensus 304 l~P~~~~ie~~l~~~l~~~~~---~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~~gd~~~~~~n 380 (406) T protein:vir:95 304 ILPIAKGIEQELTRKLLISPD---LYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKEGLSELVILEN 380 (406) T ss_pred HHHHHHHHHHHHHHhcCCCCC---cEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccC Confidence 999999999999999998754 57999999999999999999999999999999999999999999999999999999 Q ss_pred ccchhhcccc---CCCcccCC Q lcl|NC_019719. 407 YVPITDLGTN---KEPRNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~---~~~~~~ga 424 (424) ++|++..++. +++++++. T Consensus 381 ~~~~~~~~~~~~~k~g~~~~~ 401 (406) T protein:vir:95 381 YIPLDKIGDQSKLKGGDNSGA 401 (406) T ss_pred ccchhhcccccccCCCCCCCC Confidence 9999876543 33334444 No 46 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=100.00 E-value=1.7e-86 Score=490.76 Aligned_cols=402 Identities=16% Similarity=0.227 Sum_probs=320.6 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCccccccccccc-ccccccCccccc-HHHHhhhHHHHHHHHHHHHhhccCc Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPV-SAHGHLGDSSIN-DERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) |++-+ + .++-+||++-++-....+..... .....+. 
....+......+ ...++++++|++||++||++||++| T Consensus 4 ~~~~~-~---~~~m~~F~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cI~~ia~~ia~~~ 78 (413) T protein:vir:96 4 VSEIR-K---DKNLKFFNNKRSPTEESKAKDEI-PKAPQVVMTLPNFFKELISDGYTKLSDSPEVRMAVDCIADLVSNMT 78 (413) T ss_pred cchhh-h---hhcCCccccCCCcchhhhhhccc-cccccccccchhhHhhhccchhHHHhhchHHHHHHHHHHHhhccCc Confidence 44322 1 12224444322111100000000 0000010 001111111111 1336789999999999999999999 Q ss_pred eEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC-ceeeEEeecCceEEEEE Q lcl|NC_019719. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG-DVISLLPLQSANMDVKL 157 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G-~~~~l~~l~~~~v~~~~ 157 (424) |++|+++.++.. ..+|++.++|+.+||++||+++||+.++.+++++||||++++|+.+| .+.+|||++|.+|++.. T Consensus 79 ~~~~~~~~~~~~---~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~~l~~~~v~~~~ 155 (413) T protein:vir:96 79 IQLMQNGETGDK---RIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLTPISPYKVTFNV 155 (413) T ss_pred eEEEEecCCCcc---ccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEEEecCceeEEEE Confidence 999998876643 34699999999999999999999999999999999999999999887 57899999999999999 Q ss_pred cCCceEEEEEecCceEEecHhHeeEecc-CC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC Q lcl|NC_019719. 158 VGKKVVYRYQRDSEYADFSQKEIFHLKG-FG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL 235 (424) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~evih~r~-~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~ 235 (424) +++...|.+..++ ..++++||||+|. 
++ .++++|+||+.++..++.+..+++++..++|+||++|+++++++..+ T Consensus 156 ~~~~~~y~~~~~~--~~~~~~evih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l- 232 (413) T protein:vir:96 156 SDDDLDYSITFDN--KEYDPSTLLHFVLNPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSDS- 232 (413) T ss_pred cCCeEEEEEeecC--cEEchhhEEEEeccCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCC- Confidence 9888777776665 4789999999985 33 46789999999999999999999999999999999999999998776 Q ss_pred CHHHHHHHHHHHHHHhC-CcccCcceecCCCc-eeeec-ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccc Q lcl|NC_019719. 236 TEQQRSQVEENFKEIAG-GPVKKRLWILEAGF-STSAI-GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWG 312 (424) Q Consensus 236 ~~~~~~~~~~~~~~~~~-~~~~g~~~~l~~g~-~~~~l-~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~ 312 (424) ++++.+++++.|++.++ ..|+|+++++++|. ++.++ .++++|+||+|.+++++++||++|||||.+||... T Consensus 233 ~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~------ 306 (413) T protein:vir:96 233 DELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVGT------ 306 (413) T ss_pred CHHHHHHHHHHHHHHhcCccccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCc------ Confidence 66777888888877655 57899999997765 45565 46899999999999999999999999999997521 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCC Q lcl|NC_019719. 
313 SGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNL 392 (424) Q Consensus 313 ~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~ 392 (424) +.+++..+|+++||.|++++||++||++|+++ +++++||++++++.|.+++++.+++++++|+||+||+|+++|+ T Consensus 307 -~~~~~~~~~~~~~l~P~~~~ie~~ln~~ll~~----~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~ 381 (413) T protein:vir:96 307 -YNKDEFNNFINTKIMSIAQVIQQTYNKLIVEE----DMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWVGM 381 (413) T ss_pred -chHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC----CcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 34788899999999999999999999999875 4688999999999999999999999999999999999999999 Q ss_pred CCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 393 PPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 393 ~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ||+||||++++++|++|++..++++..+.+-- T Consensus 382 ~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~dt 413 (413) T protein:vir:96 382 PPDAEMDDLLVLENYLQQKDLVNQKKLIQDET 413 (413) T ss_pred CCCCCcceeeecccccchhhcccccCCCCCCC Confidence 99999999999999999988766543322222 No 47 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=100.00 E-value=2.6e-86 Score=489.75 Aligned_cols=388 Identities=18% Similarity=0.230 Sum_probs=314.7 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccH-HHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSIND-ERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~ 92 (424) =|||+. |++.....+.... ..+..........++. ..+..+|+|++||++||++||++|+++|++.++|.. 
T Consensus 1 Mg~~~~----f~~k~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~~-- 72 (403) T protein:vir:80 1 MGLFNF----FRRKTRSEPTNAI--SWFLTQEAYDTLAIPGYTRLSDNPEVRMAVHKIAELISSMTIHLMQNTDNGDI-- 72 (403) T ss_pred Cccccc----ccccccccccchh--hhhcccccccccccchhhhhhhhHHHHHHHHHHHHhhhhCceEEEEecCCcee-- Confidence 466653 3332222221111 1111111112222222 234568999999999999999999999998776643 Q ss_pred ccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHc--CCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEEecC Q lcl|NC_019719. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFY--GNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDS 170 (424) Q Consensus 93 ~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~--G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~ 170 (424) ...|++.++|+.+||++||+++||+.++.++++. ||||+++.++..|++.+||||+|.+|++..++++..+.|. T Consensus 73 -~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~~~~~~g~~~~y~--- 148 (403) T protein:vir:80 73 -RIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSFVDTDTGYQIWYQ--- 148 (403) T ss_pred -ecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEEEEcCCceEEEEe--- Confidence 3479999999999999999999999999999985 7899999999999999999999999999998887655543 Q ss_pred ceEEecHhHeeEecc--CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHH Q lcl|NC_019719. 171 EYADFSQKEIFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK 248 (424) Q Consensus 171 ~~~~~~~~evih~r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~ 248 (424) ...++++||||++. 
.+.++++|+||+..+..++....++++++.++|+||+.|++|++.+....+++..+..+++.+ T Consensus 149 -~~~~~~~eiih~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~ 227 (403) T protein:vir:80 149 -GKAYNYDEVLHFIVNPDPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFK 227 (403) T ss_pred -ecccchhhEEEEeccCCCcCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHHHHHHHHH Confidence 24689999999983 456789999999999999999999999999999999999999999988755555444445556 Q ss_pred HHhCCcccCcceecCCCc-eeeecc-cChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 249 EIAGGPVKKRLWILEAGF-STSAIG-VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 249 ~~~~~~~~g~~~~l~~g~-~~~~l~-~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) ++.++.++|++++++.+. +++++. ++++|+|++|.+++++.+||++|||||++||... +.++...+|+++| T Consensus 228 ~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-------~~~~~~~~f~~~~ 300 (403) T protein:vir:80 228 KYLEASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLLGVGK-------YDKDEYNNFINST 300 (403) T ss_pred HHhhhhhcCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC-------ccHHHHHHHHHHH Confidence 677778899999987664 455554 5889999999999999999999999999997532 2245667899999 Q ss_pred HHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccc Q lcl|NC_019719. 
327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQ 406 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n 406 (424) |.|++++||++|+++|+++.+ ++++||+++++++|.+++++.+.+++++||||+||+|+++|+||+||||++++++| T Consensus 301 l~P~~~~ie~~l~~kll~~~~---~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd~~~~~~n 377 (403) T protein:vir:80 301 ILPIAKGIEQELTRKLLISPD---LYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLSELVILEN 377 (403) T ss_pred HHHHHHHHHHHHHHhccCCCC---cEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeeccc Confidence 999999999999999998755 57899999999999999999999999999999999999999999999999999999 Q ss_pred ccchhhcccc---CCCcccCC Q lcl|NC_019719. 407 YVPITDLGTN---KEPRNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~---~~~~~~ga 424 (424) ++|++..+++ ++++++|+ T Consensus 378 ~~pl~~~~~~~~~k~ge~~~~ 398 (403) T protein:vir:80 378 YIPLDKIGDQNKLKGGEKGGA 398 (403) T ss_pred ccchhhccchhhccCCCCCCC Confidence 9999876553 34444454 No 48 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=100.00 E-value=1.1e-85 Score=486.28 Aligned_cols=419 Identities=15% Similarity=0.197 Sum_probs=323.8 Q ss_pred CCCCcccccCCCCCchHHH--HHhhccCcccC--------cccccccccccccccccCcccccHHHHhhhHHHHHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWAR--LQSWFVGGRLV--------TPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLI 70 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~--l~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~i 70 (424) -..--|.|-|-+.+-++++ +...|++.... 
.+................+..++.+.++++++|++||++| T Consensus 55 ~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f~~s~es~s~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~I 134 (945) T protein:vir:10 55 NSTVVYSIIIFRKNQVLKKEKIIVPYNHQEPPFKFNLFEYSPESLMYLPSISDPDAFFLINLFRKYRFNNDSKLIKVSEI 134 (945) T ss_pred cceeeeeeeeehhhhHHHhhcccccccccccchhhhhhhccCccceecccccCccceeeehhhhhhhhccHHHHHHHHHH Confidence 2233455555555544332 11112211110 0000000001111112334557788999999999999999 Q ss_pred HHhhccCceEEEEecccCcc----ccccccchhhhhhccCCCCCCCHHH----HHHHHHHHHHHcCCeEEEEeeCCCCce Q lcl|NC_019719. 71 STLTACLPLDVFETDQNDNR----KKVDLSNPLARLLRYSPNQYMTAQE----FREAMTMQLCFYGNAYALVDRNSAGDV 142 (424) Q Consensus 71 a~~ia~~~~~v~~~~~~~~~----~~~~~~~~l~~lL~~~pN~~~s~~~----f~~~~~~~~l~~G~a~~~~~r~~~G~~ 142 (424) |++||++|+++|++.++|.. ++....|++.++|+ +||++||+++ |++.++.+++++||+|++++|+.+|.+ T Consensus 135 A~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~-rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~i 213 (945) T protein:vir:10 135 PKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLE-RPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNL 213 (945) T ss_pred HhhhccCceEEEEecccCcccccccccccchHHHHHHh-CCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcE Confidence 99999999999998877753 34457899999997 9999999998 556788999999999999999999999 Q ss_pred eeEEeecCceEEEEEcCCce-E--EEEEecC-ceEEecHhHee-EeccCCCCcc---ccCchHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 143 ISLLPLQSANMDVKLVGKKV-V--YRYQRDS-EYADFSQKEIF-HLKGFGFTGL---VGLSPIAFACKSAGVAVAMEDQQ 214 (424) Q Consensus 143 ~~l~~l~~~~v~~~~~~~~~-~--~~~~~~~-~~~~~~~~evi-h~r~~~~~~~---~G~s~~~~~~~~i~~~~~~~~~~ 214 (424) ++|+|++|.+|++..++++. . |.+..++ ....++++|+| |+|+++.++. 
+|+||+.+++.++..+.++++++ T Consensus 214 i~L~pLdPs~Vti~~ddDG~~~y~Yv~~idG~~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~a 293 (945) T protein:vir:10 214 VAITPVDGTTIKPILSEDTGIVVGYVQEVDGAIVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGN 293 (945) T ss_pred EEEEEECCcceEEEEcCCCcEEEEEEEecCCceEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHH Confidence 99999999999998875542 2 3333343 34568888866 5677777764 59999999999999999999999 Q ss_pred HHHHh-ccCCCceeEEcCC---------CCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHH Q lcl|NC_019719. 215 RDFFA-NGAKSPQILSTGE---------KVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASR 284 (424) Q Consensus 215 ~~~~~-n~~~p~~vl~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~ 284 (424) .++|. ||++|+|+++++. +..++++.+++++.|++..++.++|+++++++|++++++++++.|+||+|++ T Consensus 294 ar~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG~NnG~piVLdeGmef~pLs~s~~DaQfLEsr 373 (945) T protein:vir:10 294 LDYYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMGDYTQVPILSGGKFTWIDFKGKRRDMQFKELA 373 (945) T ss_pred HHHHHhCCCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCCcccccceecCCCceEEEccCChhHHHHHHHH Confidence 99995 7889999998753 3347888999999999998888889999999999999999999999999999 Q ss_pred HHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccC Q lcl|NC_019719. 285 KFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGD 364 (424) Q Consensus 285 ~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d 364 (424) ++++++||++|||||++||..++++ ++|+|++...|+++||.|++.+||++||++|+...+... 
++|+++.+...| T Consensus 374 kfs~eeIArAFGVPP~lLG~~e~st--~SNiEqq~~~Fv~~tL~Pil~~IEqeLNrkLl~~~eg~~--i~fdFd~ldl~D 449 (945) T protein:vir:10 374 EFVARKICAVYQVSPQDVGILEGSN--KATAEVMASLTKAKGLEPLMATISKGFDEVVSEFRNEKD--IKLWFKEDDLEK 449 (945) T ss_pred HHHHHHHHHHhCCCHHHcccCCCCC--cchHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccCce--eEEEecchhccC Confidence 9999999999999999999876654 458999999999999999999999999999876555443 445555666678 Q ss_pred HHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecc-cccchhhccccC--------------CCcccCC Q lcl|NC_019719. 365 SASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQS-QYVPITDLGTNK--------------EPRNNGA 424 (424) Q Consensus 365 ~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~-n~~~~~~~~~~~--------------~~~~~ga 424 (424) .+++++.+++++++|+||+||+|+++|+||+||||+++++. |+.|.+...+.+ ++...|+ T Consensus 450 ~ksraEal~kli~sGiLTiNEvRe~lGLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGG 524 (945) T protein:vir:10 450 ERDWWNIIQGQLNTGFRSINEARMEKGLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGG 524 (945) T ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccccccccCCCCcccccCCCCCCCCCCC Confidence 99999999999999999999999999999999999999987 455654322110 0111111 No 49 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=100.00 E-value=7.2e-85 Score=481.88 Aligned_cols=401 Identities=15% Similarity=0.143 Sum_probs=322.9 Q ss_pred CchHHHHHhhccCcccCc-ccccc---------------cccc---------cccccccCcccccHHHHhhhHHHHHHHH Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVT-PNQGS---------------QTGP---------VSAHGHLGDSSINDERILQISTVWRCVS 68 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~-~~~~~---------------~~~~---------~~~~~~~~~~~~~~~~~~~~~~v~~~i~ 68 (424) =|||+||++.+.++.... +.... 
...+ ....+..++..|+.+.|+++++|++||+ T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~~~a~~~~~v~~~i~ 80 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLATQAYQANGPVFACML 80 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccchhhhhccHHHHHHHH Confidence 699999998876542111 10000 0000 0111223577789999999999999999 Q ss_pred HHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC--------C Q lcl|NC_019719. 69 LISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA--------G 140 (424) Q Consensus 69 ~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~--------G 140 (424) +||++||++||++|+..+.+ . +...+|+++.|+ .+||++||+++||+.++.+++++||||++++|+.. | T Consensus 81 ~Ia~~ia~lp~~~~~~~~~~-~-~~~~~~~~~~L~-~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~g 157 (466) T protein:vir:81 81 VRQLVFSSVRFRWQRLRDGK-P-SDTFGSRDLQIL-ETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWVD 157 (466) T ss_pred HHHHhhccCceEEEEecCCc-e-eeccccHHHHHh-hCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccCc Confidence 99999999999999876433 2 334667777655 59999999999999999999999999999999765 4 Q ss_pred ceeeEEeecCceEEEEEcCCc---eEEEEEecC-----ceEEecHhHeeEeccC--CCCccccCchHHHHHHHHHHHHHH Q lcl|NC_019719. 
141 DVISLLPLQSANMDVKLVGKK---VVYRYQRDS-----EYADFSQKEIFHLKGF--GFTGLVGLSPIAFACKSAGVAVAM 210 (424) Q Consensus 141 ~~~~l~~l~~~~v~~~~~~~~---~~~~~~~~~-----~~~~~~~~evih~r~~--~~~~~~G~s~~~~~~~~i~~~~~~ 210 (424) .+.+|+|++|.+|++..+.++ ..|.|..++ ....++++||||||++ +.++++|+||+..+.+++.+..++ T Consensus 158 ~~~~l~~l~~~~v~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~dviHir~~~~~~d~~~G~s~i~~~~~~i~~~~a~ 237 (466) T protein:vir:81 158 VVVEERMVRGGRGELGGGQLGWRKVGYLYTEGGRQSGNESVGFLAEDVVHFAPIPDPLASYRGMSWLTPILREIRADQAM 237 (466) T ss_pred ceeEEEEecCcceEEEEcCCCceEEEEEEEecCcccccceeeeccccEEEEcCCCCcccccccccHHHHHHHHHHHHHHH Confidence 589999999999999887553 245554443 4578999999999965 468999999999999999999999 Q ss_pred HHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhC-CcccCcceecCCCceeeecccChhHHHHHHHHHHHHH Q lcl|NC_019719. 211 EDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAG-GPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVS 289 (424) Q Consensus 211 ~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~ 289 (424) +++..++|+||+.|++|++++..+ ++++++++++.|++.++ ..|+|+++++++|++|++++++++|+||+|+++++++ T Consensus 238 ~~~~~~~f~ng~~p~gil~~~~~l-~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~ 316 (466) T protein:vir:81 238 SKHQAKFFDNGATVNLVIKHNPMA-DPAAVKKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGET 316 (466) T ss_pred HHHHHHHHhcCCCcceEEecCCCC-CHHHHHHHHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHH Confidence 999999999999999999998776 67788889888877654 5789999999999999999999999999999999999 Q ss_pred HHHHHhCCCHHHhcCCCC-CCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHH Q lcl|NC_019719. 
290 ELARFFGVPPHLVGDVEK-STSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASR 368 (424) Q Consensus 290 ~Ia~~fgVP~~~l~~~~~-~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~ 368 (424) +||++|||||++||..++ ++++++|+|++.+.|+++||.|++++||++|+++|++..++..++++||.++++++|.+++ T Consensus 317 ~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~llr~d~~~r 396 (466) T protein:vir:81 317 RIAAAAGVPPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNLSGCIGHVMPDMGPDVRLWYDADDVPFLREDEKDA 396 (466) T ss_pred HHHHHhCCCHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCcceEEEecchhhhccCHHHH Confidence 999999999999998765 4567889999999999999999999999999999999888777889999999999999988 Q ss_pred HHH-------HHHHHhCCCCCHHHHHHHhCCCCCCCCCeeee-cccccchhhcc------------ccCCCcccCC Q lcl|NC_019719. 369 AAF-------MKAMGEAGLRTINEMRRTDNLPPLPGGDVAMR-QSQYVPITDLG------------TNKEPRNNGA 424 (424) Q Consensus 369 ~~~-------~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~-~~n~~~~~~~~------------~~~~~~~~ga 424 (424) +++ +..++++|+ ||||+|+ ++++||.++. +.+..+++... ..+++++||- T Consensus 397 ~~~~~~~~~~~~~~~~~g~-t~nE~r~-----~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 397 ADIQKVRAETINTLITAGY-EPESVVA-----AVNSGDLRLLKHTGLTSVQLLPPGVSASASSDTPTSGGADDNGN 466 (466) T ss_pred HHHHHHHHHHHHHHHHcCC-Chhhccc-----cccCCccccccCCCcchhhhcccccccccCCCCcccCCCCcCCC Confidence 765 677889996 9999995 3456665443 33444433221 1122222222 No 50 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=100.00 E-value=6e-84 Score=476.85 Aligned_cols=384 Identities=14% Similarity=0.174 Sum_probs=316.4 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 
14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =|||++|++++.+.... .......++...+.++..++.+.++++++|++||++||++||++||++|+++.+ . T Consensus 1 MGl~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g~-----~ 72 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAEK---RGYLDNVLGKSIRYSGVYVTDSNILQSSDVYELLQDISNQMVLADIVVEDEFGN-----E 72 (394) T ss_pred CchhhhhhhhccCCCCc---hhhhhhhhhcccccCccccChhhhhccHHHHHHHHHHHHhhcccceEEEcCCCc-----c Confidence 69999998776433221 112334455566677888999999999999999999999999999999976432 2 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEEecCceE Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ..+|+++.|| .+||++||+++||+.++.+++++||+|+++.++..+.+ ..+.+..++... +.|..+ .+ T Consensus 73 ~~~~~~~~Ll-~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~~~--------~~~~~~~~~~~~-~~~~~~--~~ 140 (394) T protein:vir:62 73 IKDDIALQIL-RNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIHLA--------SNVFTELDDNLV-EHFNIG--GH 140 (394) T ss_pred cchhhHHHHh-ccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceeecc--------ccceEEECCceE-EEEeeC--CE Confidence 3568887766 59999999999999999999999999999976543322 344555555543 344443 46 Q ss_pred EecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhC Q lcl|NC_019719. 174 DFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAG 252 (424) Q Consensus 174 ~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~ 252 (424) .|+++||||+|+++.|+++|+||+..+..++....+++++..++|+||++|+++++++.... 
++++++++++.|++.++ T Consensus 141 ~~~~~eiih~r~~~~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (394) T protein:vir:62 141 EIPPCMIRHVKNIGADHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLE 220 (394) T ss_pred EechhheEEecCcCCCCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhc Confidence 79999999999999999999999999999999999999999999999999999999988765 34556777878877655 Q ss_pred -CcccCcceecCCCc--eeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHH Q lcl|NC_019719. 253 -GPVKKRLWILEAGF--STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 253 -~~~~g~~~~l~~g~--~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P 329 (424) ..|+|++++++.|. ++++++.+++|+||+|++++++++||++|||||.+||+.. ++|+|++.++|+++||.| T Consensus 221 g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-----~sn~e~~~~~~~~~~l~P 295 (394) T protein:vir:62 221 SIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELI-----KEDIEKAMMYIHNKAVRP 295 (394) T ss_pred cccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-----CcCHHHHHHHHHHHHHHH Confidence 47889999998776 5668888999999999999999999999999999998643 348899999999999999 Q ss_pred HHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC--CCCCeeeecccc Q lcl|NC_019719. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPL--PGGDVAMRQSQY 407 (424) Q Consensus 330 ~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~--~~gd~~~~~~n~ 407 (424) ++.+||++|+++|+++.++..++++||...++.. 
.++++.+.+++++|+||+||+|+++|+||+ |+||++++++|+ T Consensus 296 ~~~~ie~~l~~kll~~~~~~~~~~~fd~~~~~~~--~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~gd~~~~~~n~ 373 (394) T protein:vir:62 296 IMKNFEDHLSLLFYAQNSGKRIKFKINILDFVTY--SNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKESQAIYISNDV 373 (394) T ss_pred HHHHHHHHHhhhhcCccccCceEEEechhhhcCH--HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeccccc Confidence 9999999999999998887778888888887654 567889999999999999999999999999 679999999999 Q ss_pred cchhhccccCCCcccCC Q lcl|NC_019719. 408 VPITDLGTNKEPRNNGA 424 (424) Q Consensus 408 ~~~~~~~~~~~~~~~ga 424 (424) +|++.....+++.++|- T Consensus 374 ~~~~~~~~~~~~~kgge 390 (394) T protein:vir:62 374 TEIGKKEATDGSLGGGE 390 (394) T ss_pred ccccccccccccCCCCC Confidence 99977654433333333 No 51 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=100.00 E-value=2e-83 Score=473.97 Aligned_cols=386 Identities=19% Similarity=0.230 Sum_probs=318.3 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =||++.+..++..+...... ..+..... -.....+.++++++++|++||++||++||++||+++++......... T Consensus 1 mg~~~~~~~~~~~~~~~~~~----~~~~~~~~-~~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~~~~~~~ 75 (403) T protein:vir:10 1 MGFKSWITEKLNPGQRIIRD----MEPVSHRT-NRKPFTTGQAYSKIEILNRTANMVIDSAAECSYTVGDKYNIVTYANG 75 (403) T ss_pred Ccchhhhhhccchhhhhhhc----cccccccc-CCcccccHHHHHHHHHHHHHHHHHHHHHhhCceeEeecccccccccc Confidence 67777776666533221110 01111111 11222456889999999999999999999999999988765555555 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEEecCceE Q lcl|NC_019719. 
94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ...|++.++|+.+||++||+++||+.++.+++++||||+++.+ ..|+++++..|++..+.+...+.+..+ ... T Consensus 76 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~~------~~l~~l~~~~~~v~~~~~~~~~~~~~~-~~~ 148 (403) T protein:vir:10 76 VKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWDG------TSLYHVPAALMQVEADANKFIKKFIFN-NQI 148 (403) T ss_pred cccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEeC------ceeEeecCcceEEEEcCCceEEEEEec-Cce Confidence 6679999999999999999999999999999999999998753 268999999999998877766655443 346 Q ss_pred EecHhHeeEeccCCC-----CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHH Q lcl|NC_019719. 174 DFSQKEIFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK 248 (424) Q Consensus 174 ~~~~~evih~r~~~~-----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~ 248 (424) .+.++||+|+|.++. ++++|+||+.++..++.+..+++++..++|+||++|++|++.+..+ ++++.+++++.|+ T Consensus 149 ~~~~~eiih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l-~~e~~~~~~~~~~ 227 (403) T protein:vir:10 149 NYRVDEIIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDEIL-NKKLRERKQEELQ 227 (403) T ss_pred eecccceEEecccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCC-CHHHHHHHHHHHH Confidence 788999999996543 7899999999999999999999999999999999999999998776 6777888998888 Q ss_pred HHhC-CcccCcceecCCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHH Q lcl|NC_019719. 249 EIAG-GPVKKRLWILEAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 249 ~~~~-~~~~g~~~~l~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~ 325 (424) +.++ .+|+|+++++++|++|+++++ ++.|+||+|.+++++++||++|||||.+||.. 
+++|.|++.+.|+++ T Consensus 228 ~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~-----~~sn~e~~~~~f~~~ 302 (403) T protein:vir:10 228 LDYNPSTGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGG-----NNANIRPNIELFYYM 302 (403) T ss_pred HHhCCcccCcceeecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCC-----CCcCHHHHHHHHHHH Confidence 7665 578999999999999999985 57899999999999999999999999999742 345889999999999 Q ss_pred HHHHHHHHHHHHHHhhccCccccccceeeecchhh--hccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCee Q lcl|NC_019719. 326 TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGL--LRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVA 401 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l--~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~ 401 (424) ||.|++.+||++|+++|. ++++||++.+ ++.|.+++++.+++++++|+||+||+|+++|+||+| +||++ T Consensus 303 tl~P~~~~ie~~l~~~L~-------~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~~~~d~~ 375 (403) T protein:vir:10 303 TIIPMLNKLTSSLTFFFG-------YKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDDEQMNKI 375 (403) T ss_pred HHHHHHHHHHHHHHHhcC-------ceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCccccccc Confidence 999999999999999883 3567887755 889999999999999999999999999999999995 69999 Q ss_pred eecccccchhhc-cccCCCcccCC Q lcl|NC_019719. 402 MRQSQYVPITDL-GTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~-~~~~~~~~~ga 424 (424) ++|+|+...... 
..++.+..+++ T Consensus 376 ~~p~n~~~~~~~~~~~e~~~~~~~ 399 (403) T protein:vir:10 376 RIPANVAGSATGVSGQEGGRPKGS 399 (403) T ss_pred ccccccccccccCCCCcCCCCCCC Confidence 999999865442 33344444444 No 52 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=100.00 E-value=1.2e-83 Score=475.22 Aligned_cols=337 Identities=22% Similarity=0.329 Sum_probs=296.4 Q ss_pred hccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceE Q lcl|NC_019719. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) ||++||++|++++ ..+||++++|+.+||++||+++||+.++.+++++||||++++|+..|.+++|||++|.+| T Consensus 1 ia~lp~~~~~~~~-------~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~l~~~~v 73 (348) T protein:vir:93 1 MASLPLKMYEDYK-------VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVV 73 (348) T ss_pred CcccceEeEecCc-------CcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCCce Confidence 9999999998653 246999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcCCc--eEEEEE-ecCceEEecHhHeeEeccC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE Q lcl|NC_019719. 154 DVKLVGKK--VVYRYQ-RDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) Q Consensus 154 ~~~~~~~~--~~~~~~-~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~ 229 (424) ++..+++. .+|.+. .++..+.|+++||||+|++ +.++++|+||+..+..++.+..++++++.. 
.++..+.++++ T Consensus 74 ~~~~~~~~~~~~y~~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~--~~~~~~~~i~~ 151 (348) T protein:vir:93 74 EMLIENQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLT--EMQKPDSFMLK 151 (348) T ss_pred EEEEeCCCcEEEEEEEcCCCeEEEEccccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHH--hcCCCceeEEe Confidence 98876543 344443 3456788999999999986 468899999999999999999999988643 33334456666 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCC Q lcl|NC_019719. 230 TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST 309 (424) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~ 309 (424) .+.. .++++.+++++.|++.++ ++|+++++++|++|++++++++|+||+|++++++++||++|||||.+||+.+++ T Consensus 152 ~~~~-l~~e~~~~~~~~~~~~~~--n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~- 227 (348) T protein:vir:93 152 YGSN-VSTEKRQQVLEDFKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNT- 227 (348) T ss_pred cCCC-CCHHHHHHHHHHHHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC- Confidence 6655 477888889999988774 678999999999999999999999999999999999999999999999976655 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHH Q lcl|NC_019719. 310 SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRR 388 (424) Q Consensus 310 ~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~ 388 (424) +++|+|+++++|+++||.|+++.||++|+++|+++.++. 
+++++||.+++++.|.+++++.+++++++|++|+||+|+ T Consensus 228 -~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~ 306 (348) T protein:vir:93 228 -NFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIRE 306 (348) T ss_pred -CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHH Confidence 456999999999999999999999999999999998874 688999999999999999999999999999999999999 Q ss_pred HhCCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 389 TDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 389 ~~G~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++|+||+||||++++++|++|++...+.++...+|. T Consensus 307 ~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~gg~ 342 (348) T protein:vir:93 307 WEDLPPVEGGDKPLISGDLYPIDTPLELRKSLKGGD 342 (348) T ss_pred HhCCCCCCCcCeEeecccccccccchhhcccccCCC Confidence 999999999999999999999988766654443333 No 53 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=100.00 E-value=5e-83 Score=471.79 Aligned_cols=380 Identities=13% Similarity=0.128 Sum_probs=310.9 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =|||++++.. +............ 
...++..+..|+.+.++++++|++||++||++||++||++ T Consensus 1 M~~f~~~~~~----~~~~~~~~~~~~~-~~~~~~~~~~v~~~~al~~~~V~~~v~~ia~~ia~~p~~~------------ 63 (397) T protein:vir:38 1 MPLLKLNKSH----SQGFSLNDPDWVN-FLTGGEAQKYVSADTALKNSDIFSLIMQLSGDLAMVRYTS------------ 63 (397) T ss_pred Ccchhhhhcc----cCcccCCchhhhh-hhcCCcCCceechHHhhccHHHHHHHHHHHHHHhhCcccc------------ Confidence 3455543211 1111111111111 1223446778999999999999999999999999999964 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC--ceEEEEEe--- Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQR--- 168 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~--~~~~~~~~--- 168 (424) .|+..++|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|+|++|.+|++..+.+ ...|.+.. T Consensus 64 --~~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~~~y~~~~~~~ 141 (397) T protein:vir:38 64 --ESDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDGSGLIYNINFDEP 141 (397) T ss_pred --cccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEeccc Confidence 24567788899999999999999999999999999999999999999999999999999887643 34555543 Q ss_pred -cCceEEecHhHeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019719. 169 -DSEYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 169 -~~~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~ 246 (424) ++..+.|+++||||+|+++.++ ++|+||+.++..++.+..++++++.++|+||+.|+++++.+... ++++.+++++. 
T Consensus 142 ~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~-~~e~~~~~~~~ 220 (397) T protein:vir:38 142 AIGYMENVPAADVIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGG-LLDAETRIARS 220 (397) T ss_pred cccceeEecCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCC-CHHHHHHHHHH Confidence 3445789999999999988766 78999999999999999999999999999999999999999876 56677889999 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) ++...++.|+|+++++++|++|++++.++.|+||+|.+++.+++||++|||||.+||+...++ +|.| +...|+++| T Consensus 221 ~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~---~~~e-~~~~~~~~~ 296 (397) T protein:vir:38 221 KEISKQIHNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQ---SSIT-QISGQYAKS 296 (397) T ss_pred HHHHhcccccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc---cHHH-HHHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999999865433 3565 456788999 Q ss_pred HHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccc Q lcl|NC_019719. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQ 406 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n 406 (424) |+|++..||++||++|+++.+ |++..+++.|.+++++.+++++++|+||+||+|+++|++|++++|.+..... 
T Consensus 297 l~P~~~~ie~~ln~~l~~~~~-------~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~ 369 (397) T protein:vir:38 297 LNRYVQAIVGELNDKLHANIS-------ANIRFAIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKDLPDPEKE 369 (397) T ss_pred HHHHHHHHHHHHHHhccChhc-------ccccccccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccccccc Confidence 999999999999999998644 4555567889999999999999999999999999999999999997766555 Q ss_pred ccchhhccccCCCcccCC Q lcl|NC_019719. 407 YVPITDLGTNKEPRNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~~~~~~~ga 424 (424) ..+.......+++.+++. T Consensus 370 ~~~~~~~~~~~~g~~~~~ 387 (397) T protein:vir:38 370 PQQAIQLIQQEGGENDGN 387 (397) T ss_pred ccccccccccccCCCCCC Confidence 544433333333322222 No 54 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=100.00 E-value=1.9e-82 Score=468.55 Aligned_cols=376 Identities=15% Similarity=0.202 Sum_probs=312.5 Q ss_pred CchHHHHHhhccCcccCccccccccc-ccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTG-PVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~ 92 (424) =|||+++. +. +............. .....+...+..++.+.|+++++|++||++||++||++||++++ T Consensus 1 Mg~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~v~~--------- 69 (385) T protein:vir:10 1 MGLLTPRN-FN-KRKAKNMVYPSNPAFFTTTVGGMQLSYVSALSALQNTNVYSVINRIASDVASAHFKTEN--------- 69 (385) T ss_pred Cccccchh-cc-cccccccccccchhhhhhhccccCccccCHHHhhccHHHHHHHHHHHHHHhhCceeeec--------- Confidence 46776542 11 11111111111111 11222334566789999999999999999999999999999863 Q ss_pred ccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEEe--cC Q lcl|NC_019719. 
93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQR--DS 170 (424) Q Consensus 93 ~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~--~~ 170 (424) |+...+|+ +||++||+++||+.++.+++++||||++++|+ +.+++|+++.+|++..++++..|.+.. ++ T Consensus 70 ----~~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~----~~~~~p~~~~~v~~~~~~~~~~~~~~~~~~~ 140 (385) T protein:vir:10 70 ----TATLNRLE-SPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMGIVYTVLESNDR 140 (385) T ss_pred ----cchhhhhh-cCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCCceEEEEEcCCceEEEEEEcCCc Confidence 34455565 99999999999999999999999999999875 468999999999999988877766643 44 Q ss_pred ceEEecHhHeeEeccCC---CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHH Q lcl|NC_019719. 171 EYADFSQKEIFHLKGFG---FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENF 247 (424) Q Consensus 171 ~~~~~~~~evih~r~~~---~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~ 247 (424) ..+.|+++||||+|+++ .++++|+||+..+..++.+..++++++.++|+||++|+++++++....++++.+++++.| T Consensus 141 ~~~~~~~~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~ 220 (385) T protein:vir:10 141 PQMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEF 220 (385) T ss_pred eEEEEccccEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHH Confidence 56789999999999865 457899999999999999999999999999999999999999998888889999999999 Q ss_pred HHHhCCcccCcceecCCCceeeecccChhHHHHH-HHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 248 KEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMM-ASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 248 ~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~-e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) ++..++.|+|+++++++|++|++++.++.|+|++ |.+++++++||++|||||++||+.+.++.+++|.|++.. 
++..| T Consensus 221 ~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~-~~~~~ 299 (385) T protein:vir:10 221 EKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKA-TYLAN 299 (385) T ss_pred HHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHH-HHHHH Confidence 9999999999999999999999999999999975 999999999999999999999987777778888886644 55679 Q ss_pred HHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeeeec Q lcl|NC_019719. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAMRQ 404 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~--gd~~~~~ 404 (424) |.|+++.||++|+++|+++ .++||++++++.|.+++++.+++++++|+||+||+|+++|++|+|+ +|++..+ T Consensus 300 l~P~~~~ie~~l~~~l~~~------~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~~~~~~~~ 373 (385) T protein:vir:10 300 LNSYVNPIVDELRLKMNAP------DLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFKPL 373 (385) T ss_pred HHHHHHHHHHHHHHhhCCc------eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCCCccccCc Confidence 9999999999999999864 4789999999999999999999999999999999999999999964 5677766 Q ss_pred ccccchhhccccCCCccc Q lcl|NC_019719. 405 SQYVPITDLGTNKEPRNN 422 (424) Q Consensus 405 ~n~~~~~~~~~~~~~~~~ 422 (424) .+....++. + +| T Consensus 374 ~~~~~~g~~-----~-dn 385 (385) T protein:vir:10 374 TTQVKGGDE-----G-DN 385 (385) T ss_pred ccccCCCCC-----C-CC Confidence 664332111 1 11 No 55 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=100.00 E-value=2.1e-81 Score=462.94 Aligned_cols=376 Identities=15% Similarity=0.212 Sum_probs=315.7 Q ss_pred CchHHHHHhhccCcccCcccccccccc-cccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcccc Q lcl|NC_019719. 
14 NGWWARLQSWFVGGRLVTPNQGSQTGP-VSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~ 92 (424) =|||+++. |.+.............. ....+...+..++.++|+++++|++||++||+++|++||++++ T Consensus 1 Mg~~~~~~--~~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~--------- 69 (383) T protein:vir:10 1 MGLLTPKN--FSKRNAKNMVYPSNPAFFTTTVGGMQLSYVSALSALQNTNVYSVINRIASDVSSAHFKTEN--------- 69 (383) T ss_pred CCcccccc--cccccccccccccchhhhhhhccCccccccchhHhhcchHHHHHHHHHHHhhccCceeecc--------- Confidence 46666531 22222221111111111 1223344566788999999999999999999999999998863 Q ss_pred ccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEE--ecC Q lcl|NC_019719. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQ--RDS 170 (424) Q Consensus 93 ~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~--~~~ 170 (424) |+...+|+ +||++||+++||+.++.+++++||||++++|+ +.+++|+++.+|++..+.+..+|.+. .++ T Consensus 70 ----~~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~----~~~~~p~~~~~v~~~~~~~~~~~~~~~~~~~ 140 (383) T protein:vir:10 70 ----TATLNRLE-SPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMGIVYTVLESNDR 140 (383) T ss_pred ----cchhhhhh-CCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCcceEEEEEcCCceEEEEEEcCCc Confidence 34455665 99999999999999999999999999999875 46889999999999888777666554 345 Q ss_pred ceEEecHhHeeEeccCCC---CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHH Q lcl|NC_019719. 171 EYADFSQKEIFHLKGFGF---TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENF 247 (424) Q Consensus 171 ~~~~~~~~evih~r~~~~---~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~ 247 (424) ..+.|+++||||+|+++. 
++++|+||+.++..++.+..++++++.++|+||++|+++++++....++++.+++++.| T Consensus 141 ~~~~~~~~evih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~ 220 (383) T protein:vir:10 141 PKMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEF 220 (383) T ss_pred eEEEEcccceEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHH Confidence 678899999999997653 46799999999999999999999999999999999999999998888899999999999 Q ss_pred HHHhCCcccCcceecCCCceeeecccChhHHHHH-HHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 248 KEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMM-ASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 248 ~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~-e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) +++.++.|+|+++++++|++|++++.++.|+|++ |++++++++||++|||||++||..+.++.+++|.|++...| ..| T Consensus 221 ~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~~~~-~~~ 299 (383) T protein:vir:10 221 EKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATY-LAN 299 (383) T ss_pred HHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHHHHH-HHH Confidence 9999889999999999999999999999999975 89999999999999999999998777777788888876655 569 Q ss_pred HHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccc Q lcl|NC_019719. 
327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQ 406 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n 406 (424) |.|+++.||++|+++|+.+ +++||++.+++.|.+++++.+.+++++|+||+||+|+++|++|+|+||.+....+ T Consensus 300 l~P~~~~ie~~l~~~l~~~------~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~ 373 (383) T protein:vir:10 300 LNSYVNPIVDELRLKMNAP------DLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFKPL 373 (383) T ss_pred HHHHHHHHHHHHHHhhCCc------eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCcccccCCC Confidence 9999999999999999754 5789999999999999999999999999999999999999999999998777666 Q ss_pred ccchhhccccC Q lcl|NC_019719. 407 YVPITDLGTNK 417 (424) Q Consensus 407 ~~~~~~~~~~~ 417 (424) ..++.. +++| T Consensus 374 ~~~~~g-Gd~e 383 (383) T protein:vir:10 374 TNETKG-GDDK 383 (383) T ss_pred cccCCC-CCCC Confidence 665532 2222 No 56 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=100.00 E-value=5.4e-81 Score=460.63 Aligned_cols=420 Identities=11% Similarity=0.117 Sum_probs=312.4 Q ss_pred CCCCcccccCCCC-------CchH-HHHHhhccCcccCcccccccc--------------cccc---cccccC-cccccH Q lcl|NC_019719. 1 MEEPKYTIDLRTN-------NGWW-ARLQSWFVGGRLVTPNQGSQT--------------GPVS---AHGHLG-DSSIND 54 (424) Q Consensus 1 ~~~~~~~~~~~~~-------~G~~-~~l~~~~~~~~~~~~~~~~~~--------------~~~~---~~~~~~-~~~~~~ 54 (424) -+.-.|.|.+... .++= +.+.....+...+...+.... .+.. ....+. ...+.. 
T Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~iv~~ 99 (574) T protein:vir:80 20 RNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKKFGNNIILNA 99 (574) T ss_pred HhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCCcccHHHHHHhhccChhHHH Confidence 1111233322211 1100 011111111111110000000 0000 000111 111223 Q ss_pred HHHhhhHHHHHHHHHHHHhhccCceEEEEecccCc--cccccccchhhhhhcc---CCCCCC-CHHHHHHHHHHHHHHcC Q lcl|NC_019719. 55 ERILQISTVWRCVSLISTLTACLPLDVFETDQNDN--RKKVDLSNPLARLLRY---SPNQYM-TAQEFREAMTMQLCFYG 128 (424) Q Consensus 55 ~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~--~~~~~~~~~l~~lL~~---~pN~~~-s~~~f~~~~~~~~l~~G 128 (424) .-.+....|++|+.+|+.++|++||+||+++.++. .++....|++..+|.. .|||++ |+.+|++.++.+++++| T Consensus 100 ~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~G 179 (574) T protein:vir:80 100 IINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYD 179 (574) T ss_pred HHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcC Confidence 33445667889999999999999999998766542 3345677999998864 356665 78899999999999999 Q ss_pred CeEEEEeeCCCCceeeEEeecCceEEEEEcCCc-------eEEEEEecCceEEecHhHeeEeccCCC----CccccCchH Q lcl|NC_019719. 129 NAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-------VVYRYQRDSEYADFSQKEIFHLKGFGF----TGLVGLSPI 197 (424) Q Consensus 129 ~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~-------~~~~~~~~~~~~~~~~~evih~r~~~~----~~~~G~s~~ 197 (424) |+|++++|+.+|.|++||||+|.+|++..+..+ .+|++..++....|+++||||+++++. 
++.+|+||+ T Consensus 180 nayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~~~~~~~~y~~~~~g~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi 259 (574) T protein:vir:80 180 QVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGKLIKNGERFVQVIDNRIVAKFNERELAFAVRNPRADIEVGQYGYPEL 259 (574) T ss_pred CeEEEEEECCCCcEEEEEEEcCceeEEEEcCccccccCceEEEEEeCCceEEEEccccEEEEeccCCCCcccccccccHH Confidence 999999999999999999999999999877543 345666677788999999999986542 367899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhC-CcccCcc-eecCCCceeeecccC Q lcl|NC_019719. 198 AFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAG-GPVKKRL-WILEAGFSTSAIGVT 274 (424) Q Consensus 198 ~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~g~~-~~l~~g~~~~~l~~~ 274 (424) .++..++.++.++++++.++|+||++|+|||+++.+ ..++++.+++++.|++.++ ..|+|++ +++++|++|++++++ T Consensus 260 ~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s 339 (574) T protein:vir:80 260 EIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPS 339 (574) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCC Confidence 999999999999999999999999999999998754 3588889999999988655 4789886 555789999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCC--------ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcc Q lcl|NC_019719. 275 PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST--------SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK 346 (424) Q Consensus 275 ~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~--------~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~ 346 (424) +.|+||+|++++++++||++|||||++||..++++ .+++|+|++.+.|+++||.|++.+||++||++|++.. 
T Consensus 340 ~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~ 419 (574) T protein:vir:80 340 ANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAEF 419 (574) T ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhc Confidence 99999999999999999999999999999877654 3567999999999999999999999999999999876 Q ss_pred ccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 347 DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 347 ~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) +. .++++|+...++..+.. + .+..++.+||||+||+|+++|+||+||||++++|.|+++++.....+..+...+ T Consensus 420 ~~-~~~~~f~~~d~~~~~~~--~-~~~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~ 493 (574) T protein:vir:80 420 GE-KYQFQFRGGDLSAQLDK--L-KIIEQEGKVFRTVNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQRS 493 (574) T ss_pred CC-ceEEEecccchhhHHHH--H-HHHHHHhCCccCHHHHHHHhCCCCCCCCCEeeeccceeecccccccccCCccch Confidence 64 46788887666543222 2 234578899999999999999999999999999999999876543322211111 No 57 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=100.00 E-value=5e-81 Score=460.80 Aligned_cols=352 Identities=16% Similarity=0.209 Sum_probs=300.9 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =|||+++ .++....+..+ .........+..+..|+.+.|+++++||+||++||++||++|+. 
T Consensus 1 M~~~~~f----~~r~~~~~~~~-~~~~~~~~~~~~~~~v~~~~al~~~av~~cv~~ia~~ia~~p~~------------- 62 (359) T protein:vir:10 1 MSILNPF----ERRSSITPNNY-YPFMVQNGSIVPNSLVDATEALKNSDLYAVTSLISSDIAGTRFI------------- 62 (359) T ss_pred Ccccchh----hccccCCCCcc-hhhhhccccccCCcccCHHHhhcchHHHHHHHHHHHhhhcCccc------------- Confidence 3455433 22222122111 11222344566788899999999999999999999999999983 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEEe--cCc Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQR--DSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~--~~~ 171 (424) .|++.++|+.+||++||+++||+.++.+++++||+|++++|+.+|.+.+|||++|++|++..+++...|.+.. ++. T Consensus 63 --~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~y~~~~~~~~~ 140 (359) T protein:vir:10 63 --GNQVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDDTLTYEVNQFDDYP 140 (359) T ss_pred --cchHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCCeEEEEEEecCCce Confidence 3567777888999999999999999999999999999999999999999999999999999888887777653 456 Q ss_pred eEEecHhHeeEeccCC-----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019719. 172 YADFSQKEIFHLKGFG-----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 172 ~~~~~~~evih~r~~~-----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~ 246 (424) ...++++||||+|.++ .++++|+||+..+..++....+++++..++|+||++|+|+++++.+..++++++++++. 
T Consensus 141 ~~~~~~~evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~ 220 (359) T protein:vir:10 141 SAKYNASEMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKE 220 (359) T ss_pred EEEEcccceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHH Confidence 7889999999999765 37889999999999999999999999999999999999999998877789999999999 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) |++++++.|+|+++++++|++|+++++++.|+||+|.+++++++||++|||||++||+.++.+.++++.+++...|+..+ T Consensus 221 ~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~ 300 (359) T protein:vir:10 221 FEKANGGNNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRF 300 (359) T ss_pred HHHHhCccccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999988777778889999999999999 Q ss_pred HHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC Q lcl|NC_019719. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP 396 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~ 396 (424) |.|++..|+..|++++. 
++...+.+.|.......+.+++++|+||+||+|+++|++|+= T Consensus 301 l~p~~~~l~~~l~~~~~-----------~~~~~~~~~d~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 301 IEPLISELRIKCDSSIG-----------VDMSPITDYSNSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred HHHHHHHHHHHhhhhhc-----------ccchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 99999888888877653 333344444555556677889999999999999999999974 No 58 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=100.00 E-value=1.2e-79 Score=453.29 Aligned_cols=363 Identities=13% Similarity=0.146 Sum_probs=297.1 Q ss_pred hHHHHHhhccCcccCc----ccccc----cccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEeccc Q lcl|NC_019719. 16 WWARLQSWFVGGRLVT----PNQGS----QTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 16 ~~~~l~~~~~~~~~~~----~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~ 87 (424) |+-.|+++|++..... ..... ....+..+...++..|+.+.|+++++|++||++||++||++|++++++.. T Consensus 1 m~m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~~- 79 (392) T protein:vir:74 1 MILPILNFINQTNDPPEAGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN- 79 (392) T ss_pred CcchhhhhhhcccCcccccccccccccCchhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccchh- Confidence 4444555555432211 11111 11123344456678899999999999999999999999999999986542 Q ss_pred CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcC--CceEEE Q lcl|NC_019719. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG--KKVVYR 165 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~--~~~~~~ 165 (424) +.|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|+|++|++|++..+. +...|. 
T Consensus 80 -------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~y~ 146 (392) T protein:vir:74 80 -------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYN 146 (392) T ss_pred -------------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEE Confidence 23667999999999999999999999999999999999999999999999999998764 344565 Q ss_pred EEecC----ceEEecHhHeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019719. 166 YQRDS----EYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 166 ~~~~~----~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~ 240 (424) +...+ ....++++||||+|+++.++ ++|+||+.++..++.+..++++++.++|+||+.|+++++++.+...++ T Consensus 147 ~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~-- 224 (392) T protein:vir:74 147 ITFDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSD-- 224 (392) T ss_pred EEecCCccceeEEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchH-- Confidence 55433 35789999999999998887 789999999999999999999999999999999999999987654332 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHH Q lcl|NC_019719. 
241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 241 ~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~ 320 (424) +..+++.+.+.+..|+|+++++++|++|++++++++|+||+|++++++++||++|||||.+||+..+++ +.+++.+ T Consensus 225 ~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~----~~~e~~~ 300 (392) T protein:vir:74 225 KDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ----SSIQQIS 300 (392) T ss_pred HHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc----cHHHHHH Confidence 223444556777889999999999999999999999999999999999999999999999999765433 3457789 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh---------- Q lcl|NC_019719. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTD---------- 390 (424) Q Consensus 321 ~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~---------- 390 (424) +|+++||.|+++.||++|+++|++. ++||...+++.|.+.+++.+..++++|++|+||+|+++ T Consensus 301 ~~~~~~l~p~~~~ie~~l~~~l~~~-------~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~ 373 (392) T protein:vir:74 301 GMYASALNRYLRPAISELEYKLSDH-------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDL 373 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHhccch-------hcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCcccc Confidence 9999999999999999999999864 56888999999999999999999999999999999886 Q ss_pred ----CCCCCCCCCeeeecccccc Q lcl|NC_019719. 
391 ----NLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 391 ----G~~p~~~gd~~~~~~n~~~ 409 (424) |+||++|||+ .+.+| T Consensus 374 r~~enl~~~~~Gd~----~~p~p 392 (392) T protein:vir:74 374 PAPENTNKKTTGQS----NEPVP 392 (392) T ss_pred chhcCCCCCCCCCC----CCCCC Confidence 5555555542 12222 No 59 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=100.00 E-value=6e-79 Score=449.41 Aligned_cols=409 Identities=12% Similarity=0.107 Sum_probs=304.0 Q ss_pred CCCCCchHHHHHh-------hccCcccC--------------------ccccccccccccccccc------Cccccc--- Q lcl|NC_019719. 10 LRTNNGWWARLQS-------WFVGGRLV--------------------TPNQGSQTGPVSAHGHL------GDSSIN--- 53 (424) Q Consensus 10 ~~~~~G~~~~l~~-------~~~~~~~~--------------------~~~~~~~~~~~~~~~~~------~~~~~~--- 53 (424) +.+.-|||++++- ++. .... .........+....... .....+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~~~~~~ 79 (551) T protein:vir:80 1 MKNKLGLFESIRLVGVNKSDAVK-HIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQD 79 (551) T ss_pred CchhhhhHHHhhhccCChhhccc-ccccccceeeecccccHHHHHHhhccCcceeecccccceecCcccccCccccChhH Confidence 5566788888871 110 0000 00000111111100000 011111 Q ss_pred ----HHHHhhhHHHHHHHHHHHHhhccC-----------ceEEEEecccC--ccccccccchhhhhhccCCCCCC----- Q lcl|NC_019719. 54 ----DERILQISTVWRCVSLISTLTACL-----------PLDVFETDQND--NRKKVDLSNPLARLLRYSPNQYM----- 111 (424) Q Consensus 54 ----~~~~~~~~~v~~~i~~ia~~ia~~-----------~~~v~~~~~~~--~~~~~~~~~~l~~lL~~~pN~~~----- 111 (424) .+.+..+|+|++||++||+.||++ +|.+.-++.+. 
........+.+..+|+ +||+++ T Consensus 80 l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~-~pn~~~~p~~~ 158 (551) T protein:vir:80 80 LHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIE-KTGVDNDINRD 158 (551) T ss_pred HHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHH-hcCCCCCCccc Confidence 234566899999999999999984 44443222111 1111122234555554 898874 Q ss_pred CHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCce-------EEEEEecCceEEecHhHeeEec Q lcl|NC_019719. 112 TAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV-------VYRYQRDSEYADFSQKEIFHLK 184 (424) Q Consensus 112 s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~-------~~~~~~~~~~~~~~~~evih~r 184 (424) |+.+|++.++.+++++||+|++++|+.+|.|++||||+|.+|++..+.++. ++++..++....|+++||||++ T Consensus 159 s~~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~eiiH~~ 238 (551) T protein:vir:80 159 SFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAV 238 (551) T ss_pred hHHHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCccccccCceEEEEEeCCcEEEEEcccceEEec Confidence 888999999999999999999999999999999999999999998775442 3344445557789999999998 Q ss_pred cCCC----CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhC-CcccCc Q lcl|NC_019719. 185 GFGF----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAG-GPVKKR 258 (424) Q Consensus 185 ~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~g~ 258 (424) +++. ++.+|+||+.++..++.+..++++++.++|+||++|+|+|+++.. 
..++++.+++++.|+..++ ..|+|+ T Consensus 239 ~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~ 318 (551) T protein:vir:80 239 RNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQ 318 (551) T ss_pred ccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCc Confidence 6543 357899999999999999999999999999999999999998644 3578889999999988655 479999 Q ss_pred ceec-CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCC--------CccchhHHHHHHHHHHHHHHH Q lcl|NC_019719. 259 LWIL-EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS--------TSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 259 ~~~l-~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~--------~~~~~n~e~~~~~~~~~tl~P 329 (424) ++++ ++|++|+++++++.|+||+|++++++++||++|||||.+||...++ +.+++|+|++...|+++||+| T Consensus 319 ~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P 398 (551) T protein:vir:80 319 IPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQP 398 (551) T ss_pred cccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHH Confidence 7665 6899999999999999999999999999999999999999976553 346789999999999999999 Q ss_pred HHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCC-CCCCCeeeeccccc Q lcl|NC_019719. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPP-LPGGDVAMRQSQYV 408 (424) Q Consensus 330 ~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p-~~~gd~~~~~~n~~ 408 (424) ++.+||++|+++|++..+. . +.|+++.+...+..++++.+. ++.+|+||+||+|+++|+|| +||||+++.|.++. 
T Consensus 399 ~~~~ie~~ln~~L~~~~~~-~--~~f~f~~~~~~~~~~~~~~~~-~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~ 474 (551) T protein:vir:80 399 LLGFIEDFINKHIVAEFGD-K--YTFQFVGGDIKSELESVKILA-EKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQ 474 (551) T ss_pred HHHHHHHHHHhhhccccCC-c--eEEEeeccChhhHHHHHHHHH-HHhcCCcCHHHHHHHhCCCCCCCCCceeecccccc Confidence 9999999999999986553 2 445556777777777777654 66789999999999999998 79999999998887 Q ss_pred chhhccccCCCc--------------------------ccCC Q lcl|NC_019719. 409 PITDLGTNKEPR--------------------------NNGA 424 (424) Q Consensus 409 ~~~~~~~~~~~~--------------------------~~ga 424 (424) +++.....+.++ .+++ T Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 516 (551) T protein:vir:80 475 RIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGK 516 (551) T ss_pred cccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCcc Confidence 764432111100 0000 No 60 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=100.00 E-value=4.1e-79 Score=450.32 Aligned_cols=373 Identities=13% Similarity=0.107 Sum_probs=299.5 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =|||++|++. .. .+... . .......++.+.|+++++|++||++||+++|++||++|+++.. 
T Consensus 1 Mg~f~~~f~~---~~--~~~~~-----~---~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~------ 61 (385) T protein:vir:95 1 MGLFDSVFKR---HS--ELSWM-----Y---DLEFLQDKSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNTK------ 61 (385) T ss_pred Cchhhhhhcc---Cc--ccccc-----c---chhhhhccchhhhhhhHHHHHHHHHHHHHHcccceeeeecCcc------ Confidence 5778776432 11 11110 0 1112334567889999999999999999999999999975432 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEEecCceE Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ..|++.++|+.+||++||+++||+.++.+++++||||+++.++. +.+..++++.+..+....+.. ....+...+... T Consensus 62 -~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 138 (385) T protein:vir:95 62 -EKGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDEG-HFFVADDFEKEDELGLYSHRF-TNVLVNDFEFKR 138 (385) T ss_pred -ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEecCC-Ceeeccccccccccccccccc-eeeeecccceee Confidence 35899999999999999999999999999999999999887764 345556666666554433221 111222334457 Q ss_pred EecHhHeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC-CCCCHHHHHHHHHHHHHHh Q lcl|NC_019719. 174 DFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE-KVLTEQQRSQVEENFKEIA 251 (424) Q Consensus 174 ~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~-~~~~~~~~~~~~~~~~~~~ 251 (424) .++++||||+|+++.++ .+|.||+..+..++....++.. +++.|+++++++. 
...++++.+++++.|++.+ T Consensus 139 ~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~-------~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~ 211 (385) T protein:vir:95 139 VFTMDDVIYLKYNNQKLDAFSLGLFEDYGEIFGRMIDLQM-------LNNQIRGILKVDATKFYNKEKQKELQAYIDTLF 211 (385) T ss_pred eeccccEEEecCCCCCcccccchHHHHHHHHHHHHHHHHH-------hcCCCceEEEeCCccCCCHHHHHHHHHHHHHHh Confidence 89999999999988775 7899999999998876655432 2344788888864 3468888899999999887 Q ss_pred CC--cccCcceecCCCceeeeccc------ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHH Q lcl|NC_019719. 252 GG--PVKKRLWILEAGFSTSAIGV------TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 252 ~~--~~~g~~~~l~~g~~~~~l~~------~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~ 323 (424) ++ .+.++++++++|++++++++ ++.|+||+|.+++++++||++|||||.+|++ +++|.|++..+|+ T Consensus 212 ~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~------~~sn~e~~~~~~~ 285 (385) T protein:vir:95 212 DAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLG------EMADLEKTIESYL 285 (385) T ss_pred hhhhhcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcC------CCcCHHHHHHHHH Confidence 65 34556899999999999975 6679999999999999999999999999963 3558999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC--CCCCee Q lcl|NC_019719. 
324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPL--PGGDVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~--~~gd~~ 401 (424) ++||.|++.+||++|+++|+++.++..++++||++++++.|.+++++.+++++++|+||+||+|+++|+||+ ||||++ T Consensus 286 ~~~l~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~gd~~ 365 (385) T protein:vir:95 286 QFCINPLLRKIEAELNSKFFYQDEYLNDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPELDKF 365 (385) T ss_pred HHHHHHHHHHHHHHHHhhcCChhhcccceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 999999999999999999999988877789999999999999999999999999999999999999999998 689999 Q ss_pred eecccccchhhccccCCCcc Q lcl|NC_019719. 402 MRQSQYVPITDLGTNKEPRN 421 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~ 421 (424) ++|+|+++++...+.+...+ T Consensus 366 ~~~~n~~~~~~~kgge~~~e 385 (385) T protein:vir:95 366 IITKNLQSADAFKGGESNEE 385 (385) T ss_pred eecccceecccccCCCCCCC Confidence 99999999977543333323 No 61 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=100.00 E-value=5.8e-79 Score=449.49 Aligned_cols=373 Identities=13% Similarity=0.179 Sum_probs=303.4 Q ss_pred CchHHHHHhhccCcccCccccc----ccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQG----SQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDN 89 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~ 89 (424) =|||++++. ...+.+... ....+.....+.++..++.+.++++|+|++||++||++||++|+++|+.. 
T Consensus 1 M~~f~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~~~---- 72 (386) T protein:vir:48 1 MPIFNITNL----ATESPPISQGGFFDITDPDFLSTLNGSEWVSAESALRNSDLFSIINQLSNDLATVKLTASRKQ---- 72 (386) T ss_pred Ccccccccc----cccccccccccccccccchhcccccCCceechhhhhcchHHHHHHHHHHHhhccCceeeccch---- Confidence 345544321 121111111 11112223345678889999999999999999999999999999998532 Q ss_pred cccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC--ceEEEEE Q lcl|NC_019719. 90 RKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQ 167 (424) Q Consensus 90 ~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~--~~~~~~~ 167 (424) .+.|+.+||++||+++||+.++.+++++||+|++++|+.+|.+++|+|++|++|++..+.+ ..+|.+. T Consensus 73 ----------~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~ 142 (386) T protein:vir:48 73 ----------LQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGIYYNIT 142 (386) T ss_pred ----------hHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceEEEEEE Confidence 3357789999999999999999999999999999999999999999999999999987754 3455554 Q ss_pred ecC----ceEEecHhHeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHH Q lcl|NC_019719. 168 RDS----EYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQ 242 (424) Q Consensus 168 ~~~----~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~ 242 (424) ..+ ..+.|+++||||+|+++.++ ++|+||+..+..++.+..++++++.++|+||++|+++++.+... 
++++.++ T Consensus 143 ~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~-~~e~~~~ 221 (386) T protein:vir:48 143 FDDPRIPPKQHVPQGDVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGG-LLDFKTK 221 (386) T ss_pred ecCccccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCC-CHHHHHH Confidence 433 45679999999999988776 89999999999999999999999999999999999999999886 4555566 Q ss_pred HHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHH Q lcl|NC_019719. 243 VEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF 322 (424) Q Consensus 243 ~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~ 322 (424) +++.+.... .++|+++++++|++|++++.+++|+||+|++++++++||++|||||.+||+.. ++++++++.++| T Consensus 222 ~~~~~~~~~--~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~----~~~~~e~~~~~~ 295 (386) T protein:vir:48 222 LSRSRQAMK--QMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQG----DQQSSLEMSLDL 295 (386) T ss_pred HHHHHHHhh--cCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC----CcccHHHHHHHH Confidence 666665543 57899999999999999999999999999999999999999999999998532 345889999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCee- Q lcl|NC_019719. 323 LQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVA- 401 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~- 401 (424) ++.||.|+++.||++|+++|++.. ++|....+..|...++..+++++++|++|+||+|+++|++|++++|.. 
T Consensus 296 ~~~~l~P~~~~ie~~l~~~l~~~~-------~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~~~~~ 368 (386) T protein:vir:48 296 YNKAVSRYLRPFLSELSQKLSCDV-------DADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEILPKELPE 368 (386) T ss_pred HHHHHHHHHHHHHHHHHHhhcchh-------hcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCCccchh Confidence 999999999999999999998753 466667778888999999999999999999999999999999877744 Q ss_pred eecccccchhhccccCCCcccCC Q lcl|NC_019719. 402 MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ....|..|+.. ++ +||. T Consensus 369 ~~~~~~~~~~g-Gd-----~~~~ 385 (386) T protein:vir:48 369 GENPNKTTLKG-GE-----INGE 385 (386) T ss_pred hcCCCCCccCC-CC-----CCCC Confidence 33345555533 11 2222 No 62 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=100.00 E-value=7.9e-79 Score=448.78 Aligned_cols=372 Identities=14% Similarity=0.147 Sum_probs=300.3 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =|||++|+ ++... +.. ...+..+..++.+.++++++|++||++||+++|++||++|+++. T Consensus 1 Mg~f~~lf---~~~~~--~~~--------~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~------- 60 (395) T protein:vir:10 1 MSILEKIF---KTRKD--ITY--------MLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR------- 60 (395) T ss_pred Cchhhhhh---ccCcc--ccc--------cccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCc------- Confidence 46677763 32221 111 11123356677889999999999999999999999999996532 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCce--EEEEEecCc Q lcl|NC_019719. 
94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV--VYRYQRDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~--~~~~~~~~~ 171 (424) ...|++.++|+.+||++||+++||+.++.++++.|++|+++.++. .++++++..++........ .+.+...+. T Consensus 61 ~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (395) T protein:vir:10 61 IQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFKDVTVKDYTY 135 (395) T ss_pred cccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCcceeEEEEcCcee Confidence 356999999999999999999999999999999999998876553 3566666665554433322 233334445 Q ss_pred eEEecHhHeeEeccCCCC-ccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019719. 172 YADFSQKEIFHLKGFGFT-GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 172 ~~~~~~~evih~r~~~~~-~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) ...++++||||+|+++.+ ..+|.||+..+..++....+ .|.+++.++++|+.+....++++++++++.|+++ T Consensus 136 ~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~~-------~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~ 208 (395) T protein:vir:10 136 QRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMIG-------AQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKL 208 (395) T ss_pred eeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHHH-------HHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHH Confidence 678999999999987654 57899999999888766543 4667888999999998888999999999999999 Q ss_pred hCCcccCc--ceecCCCceeeecccChhHH-----HHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHH Q lcl|NC_019719. 
251 AGGPVKKR--LWILEAGFSTSAIGVTPQDA-----EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 251 ~~~~~~g~--~~~l~~g~~~~~l~~~~~d~-----~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~ 323 (424) .++.++++ ++++++|++|+++++++.++ ||+|++++++++||++|||||++|++ +++|.|++.++|+ T Consensus 209 ~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~------~~sn~e~~~~~~~ 282 (395) T protein:vir:10 209 FNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG------ETADLEKNTLVFE 282 (395) T ss_pred hccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------cccCHHHHHHHHH Confidence 98876665 55579999999999988765 99999999999999999999999973 3458899999999 Q ss_pred HHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC--Cee Q lcl|NC_019719. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG--DVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~g--d~~ 401 (424) ++||.|++.+||++|+++|+++.++.. +++||++.+++.|.+++++++.+++++||||+||+|+++|+||+||| |++ T Consensus 283 ~~~l~P~~~~ie~~l~~kL~~~~~~~~-~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~ 361 (395) T protein:vir:10 283 KFCLTPLLKKIQNELNAKLITQSMYLK-DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEY 361 (395) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhhcc-cceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 999999999999999999999877543 35899999999999999999999999999999999999999999875 999 Q ss_pred eecccccchhhccccCCCcccCC Q lcl|NC_019719. 
402 MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++|+|+++++...+++.+.+..+ T Consensus 362 ~~~~n~~~~~~~~~~~~~~~~~~ 384 (395) T protein:vir:10 362 LITKNYEKANSGENDEKEKDENT 384 (395) T ss_pred eeccccccccccccccCcccccc Confidence 99999999876544332221111 No 63 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=100.00 E-value=7.9e-79 Score=448.78 Aligned_cols=372 Identities=14% Similarity=0.147 Sum_probs=300.3 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =|||++|+ ++... +.. ...+..+..++.+.++++++|++||++||+++|++||++|+++. T Consensus 1 Mg~f~~lf---~~~~~--~~~--------~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~------- 60 (395) T protein:vir:95 1 MSILEKIF---KTRKD--ITY--------MLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR------- 60 (395) T ss_pred Cchhhhhh---ccCcc--ccc--------cccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCc------- Confidence 46677763 32221 111 11123356677889999999999999999999999999996532 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCce--EEEEEecCc Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV--VYRYQRDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~--~~~~~~~~~ 171 (424) ...|++.++|+.+||++||+++||+.++.++++.|++|+++.++. .++++++..++........ .+.+...+. 
T Consensus 61 ~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (395) T protein:vir:95 61 IQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFKDVTVKDYTY 135 (395) T ss_pred cccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCcceeEEEEcCcee Confidence 356999999999999999999999999999999999998876553 3566666665554433322 233334445 Q ss_pred eEEecHhHeeEeccCCCC-ccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019719. 172 YADFSQKEIFHLKGFGFT-GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 172 ~~~~~~~evih~r~~~~~-~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) ...++++||||+|+++.+ ..+|.||+..+..++....+ .|.+++.++++|+.+....++++++++++.|+++ T Consensus 136 ~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~~-------~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~ 208 (395) T protein:vir:95 136 QRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMIG-------AQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKL 208 (395) T ss_pred eeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHHH-------HHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHH Confidence 678999999999987654 57899999999888766543 4667888999999998888999999999999999 Q ss_pred hCCcccCc--ceecCCCceeeecccChhHH-----HHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHH Q lcl|NC_019719. 
251 AGGPVKKR--LWILEAGFSTSAIGVTPQDA-----EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 251 ~~~~~~g~--~~~l~~g~~~~~l~~~~~d~-----~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~ 323 (424) .++.++++ ++++++|++|+++++++.++ ||+|++++++++||++|||||++|++ +++|.|++.++|+ T Consensus 209 ~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~------~~sn~e~~~~~~~ 282 (395) T protein:vir:95 209 FNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG------ETADLEKNTLVFE 282 (395) T ss_pred hccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------cccCHHHHHHHHH Confidence 98876665 55579999999999988765 99999999999999999999999973 3458899999999 Q ss_pred HHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC--Cee Q lcl|NC_019719. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG--DVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~g--d~~ 401 (424) ++||.|++.+||++|+++|+++.++.. +++||++.+++.|.+++++++.+++++||||+||+|+++|+||+||| |++ T Consensus 283 ~~~l~P~~~~ie~~l~~kL~~~~~~~~-~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~ 361 (395) T protein:vir:95 283 KFCLTPLLKKIQNELNAKLITQSMYLK-DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEY 361 (395) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhhcc-cceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 999999999999999999999877543 35899999999999999999999999999999999999999999875 999 Q ss_pred eecccccchhhccccCCCcccCC Q lcl|NC_019719. 
402 MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++|+|+++++...+++.+.+..+ T Consensus 362 ~~~~n~~~~~~~~~~~~~~~~~~ 384 (395) T protein:vir:95 362 LITKNYEKANSGENDEKEKDENT 384 (395) T ss_pred eeccccccccccccccCcccccc Confidence 99999999876544332221111 No 64 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=100.00 E-value=7.9e-79 Score=448.78 Aligned_cols=372 Identities=14% Similarity=0.147 Sum_probs=300.3 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =|||++|+ ++... +.. ...+..+..++.+.++++++|++||++||+++|++||++|+++. T Consensus 1 Mg~f~~lf---~~~~~--~~~--------~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~------- 60 (395) T protein:vir:10 1 MSILEKIF---KTRKD--ITY--------MLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR------- 60 (395) T ss_pred Cchhhhhh---ccCcc--ccc--------cccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCc------- Confidence 46677763 32221 111 11123356677889999999999999999999999999996532 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCce--EEEEEecCc Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV--VYRYQRDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~--~~~~~~~~~ 171 (424) ...|++.++|+.+||++||+++||+.++.++++.|++|+++.++. .++++++..++........ .+.+...+. 
T Consensus 61 ~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (395) T protein:vir:10 61 IQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFKDVTVKDYTY 135 (395) T ss_pred cccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCcceeEEEEcCcee Confidence 356999999999999999999999999999999999998876553 3566666665554433322 233334445 Q ss_pred eEEecHhHeeEeccCCCC-ccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019719. 172 YADFSQKEIFHLKGFGFT-GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 172 ~~~~~~~evih~r~~~~~-~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) ...++++||||+|+++.+ ..+|.||+..+..++....+ .|.+++.++++|+.+....++++++++++.|+++ T Consensus 136 ~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~~-------~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~ 208 (395) T protein:vir:10 136 QRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMIG-------AQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKL 208 (395) T ss_pred eeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHHH-------HHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHH Confidence 678999999999987654 57899999999888766543 4667888999999998888999999999999999 Q ss_pred hCCcccCc--ceecCCCceeeecccChhHH-----HHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHH Q lcl|NC_019719. 
251 AGGPVKKR--LWILEAGFSTSAIGVTPQDA-----EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 251 ~~~~~~g~--~~~l~~g~~~~~l~~~~~d~-----~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~ 323 (424) .++.++++ ++++++|++|+++++++.++ ||+|++++++++||++|||||++|++ +++|.|++.++|+ T Consensus 209 ~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~------~~sn~e~~~~~~~ 282 (395) T protein:vir:10 209 FNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG------ETADLEKNTLVFE 282 (395) T ss_pred hccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------cccCHHHHHHHHH Confidence 98876665 55579999999999988765 99999999999999999999999973 3458899999999 Q ss_pred HHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC--Cee Q lcl|NC_019719. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG--DVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~g--d~~ 401 (424) ++||.|++.+||++|+++|+++.++.. +++||++.+++.|.+++++++.+++++||||+||+|+++|+||+||| |++ T Consensus 283 ~~~l~P~~~~ie~~l~~kL~~~~~~~~-~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~ 361 (395) T protein:vir:10 283 KFCLTPLLKKIQNELNAKLITQSMYLK-DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEY 361 (395) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhhcc-cceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 999999999999999999999877543 35899999999999999999999999999999999999999999875 999 Q ss_pred eecccccchhhccccCCCcccCC Q lcl|NC_019719. 
402 MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++|+|+++++...+++.+.+..+ T Consensus 362 ~~~~n~~~~~~~~~~~~~~~~~~ 384 (395) T protein:vir:10 362 LITKNYEKANSGENDEKEKDENT 384 (395) T ss_pred eeccccccccccccccCcccccc Confidence 99999999876544332221111 No 65 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=100.00 E-value=2.1e-78 Score=446.40 Aligned_cols=418 Identities=12% Similarity=0.138 Sum_probs=309.5 Q ss_pred CCCCccc--ccCCCCCc-hHHHHHhhccCcccC-ccccc---cccccccc---------------ccccCccc---ccHH Q lcl|NC_019719. 1 MEEPKYT--IDLRTNNG-WWARLQSWFVGGRLV-TPNQG---SQTGPVSA---------------HGHLGDSS---INDE 55 (424) Q Consensus 1 ~~~~~~~--~~~~~~~G-~~~~l~~~~~~~~~~-~~~~~---~~~~~~~~---------------~~~~~~~~---~~~~ 55 (424) |-.-|-+ +++++..+ ++++-. ..++... ..... ..+.+.+. .....+.. ...+ T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~~~~~~~~~~~~i~ 90 (535) T protein:vir:10 13 LSNKKSTSYIELGDYDKDIVNKAI--RPGRASARDTVDGIDIADGNVAGQYSVASISDVLSTKKLLKAYADNDIVQAIIR 90 (535) T ss_pred hhhhhhhhhHHHhhhhHHHHHhhh--hhhhhhhhccccccccccCCcccccccCccccccCHHHHHHHhccChhHHHHHH Confidence 2222221 22222221 111100 0000000 00000 00000000 00000111 1123 Q ss_pred HHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHH----HHHHHHHHHHHc-CCe Q lcl|NC_019719. 
56 RILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQE----FREAMTMQLCFY-GNA 130 (424) Q Consensus 56 ~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~----f~~~~~~~~l~~-G~a 130 (424) .+....++|+|+.++++.++++|+++++.+..+..++....|++.++|+.+||++|++++ |+++++.+++++ |++ T Consensus 91 t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~g~a 170 (535) T protein:vir:10 91 TRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQDQIN 170 (535) T ss_pred HHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhhCCce Confidence 455678889999999999999999999988877777778889999999999999998876 555667776655 589 Q ss_pred EEEEeeCCCCceeeEEeecCceEEEEEcCC-----ceEEEEEecCceEEecHhHeeEeccCCC----CccccCchHHHHH Q lcl|NC_019719. 131 YALVDRNSAGDVISLLPLQSANMDVKLVGK-----KVVYRYQRDSEYADFSQKEIFHLKGFGF----TGLVGLSPIAFAC 201 (424) Q Consensus 131 ~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~-----~~~~~~~~~~~~~~~~~~evih~r~~~~----~~~~G~s~~~~~~ 201 (424) |++++|+..|+|++||||+|.+|++..+.. ..+|.+..++....|+++||||+|+++. ++.+|+||+.++. T Consensus 171 y~~i~r~~~G~~~~L~~l~p~~V~v~~d~~~~~~~~~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~ 250 (535) T protein:vir:10 171 IERIFKNDSNELDHFNAVDASKVVISYSPRSKDQPRKFEQFVSETKSVKFSERNLTFINYWNLSDTDRRGYGYSPVEASI 250 (535) T ss_pred EEEEEECCCCcEEEEEEeCCceeEEEEcCccccCceEEEEEecCceeEEECcccEEEEeccCCCCcccccccccHHHHHH Confidence 999999999999999999999999987643 3456667777788999999999997653 3578999999999 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCceeEEcCCC---CCCHHHHHHHHHHHHHHhC-CcccCcceecC-CCceeeecccChh Q lcl|NC_019719. 202 KSAGVAVAMEDQQRDFFANGAKSPQILSTGEK---VLTEQQRSQVEENFKEIAG-GPVKKRLWILE-AGFSTSAIGVTPQ 276 (424) Q Consensus 202 ~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~---~~~~~~~~~~~~~~~~~~~-~~~~g~~~~l~-~g~~~~~l~~~~~ 276 (424) .++.+..++++++.++|+||++|+|||+++.. ..++++.+.+++.|+..++ ..|+|+++++. .|++|+++++++. 
T Consensus 251 ~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~ 330 (535) T protein:vir:10 251 PLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPILAAKDAKFVNMTQNSR 330 (535) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCcccccccccccCCCceEEecCCChh Confidence 99999999999999999999999999998754 3577889999999988665 46889876665 7999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcc----------chhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcc Q lcl|NC_019719. 277 DAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSW----------GSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK 346 (424) Q Consensus 277 d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~----------~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~ 346 (424) |+||+|++++++++||++|||||++||..++++++ .+++|++...|+++||.|++.+||++||++|++.. T Consensus 331 D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~~ 410 (535) T protein:vir:10 331 DMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVINDKIMRYV 410 (535) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 99999999999999999999999999998876653 35678899999999999999999999999999876 Q ss_pred ccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecc---cccchhhccccC--CCcc Q lcl|NC_019719. 347 DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQS---QYVPITDLGTNK--EPRN 421 (424) Q Consensus 347 ~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~---n~~~~~~~~~~~--~~~~ 421 (424) +. .++|+++.+++.|.++++++++.+. .|+||+||+|+++|+||+||||+++... +++.....++.. ++.+ T Consensus 411 ~~---~~~f~f~~l~~~d~~~r~~~~~~~~-~g~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p~~~~ 486 (535) T protein:vir:10 411 DT---DYRFSFTLGDAQDKLQEEQVWKLKL-ANGYFINEYRKDHGLKTVDGLDVPGFIGSAENFINATGFGQPNVPDSSD 486 (535) T ss_pred CC---eEEEEeccccccCHHHHHHHHHHHH-cCCCCHHHHHHHhCCCCCCCccccccccchhhcccccccccccCCCCCC Confidence 53 4567778999999999998887655 6789999999999999999999876543 222111111100 0000 Q ss_pred -cCC Q lcl|NC_019719. 
422 -NGA 424 (424) Q Consensus 422 -~ga 424 (424) .++ T Consensus 487 ~~~~ 490 (535) T protein:vir:10 487 DSGS 490 (535) T ss_pred Cccc Confidence 000 No 66 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=100.00 E-value=6.3e-79 Score=449.32 Aligned_cols=367 Identities=14% Similarity=0.197 Sum_probs=307.9 Q ss_pred CchHHHHHhhccCcccCccccc-c---cccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQG-S---QTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDN 89 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~-~---~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~ 89 (424) =|||+++. .+.. ..+... . ...+.....+.++..++.+.++++++|++||++||++||++|++++++.. T Consensus 1 Mglf~~~~---~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~V~~~i~~Ia~~ia~l~~~~~~~~~--- 73 (384) T protein:vir:49 1 MPIFNITN---LATE-SPPSNQDSFFDITDPEFLDALNGSEWVSAETALKNSDLFSIISQLSNDLATAKITTSRKQL--- 73 (384) T ss_pred Cccccccc---cCcc-cccccchhhccccchhhcccccCCceechhhhhccHHHHHHHHHHHHHHhhCceeeecchh--- Confidence 35555431 1111 111111 1 11111222345678899999999999999999999999999999986532 Q ss_pred cccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcC--CceEEEEE Q lcl|NC_019719. 90 RKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG--KKVVYRYQ 167 (424) Q Consensus 90 ~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~--~~~~~~~~ 167 (424) +.|+.+||++||+++||+.++.+++++||+|++++|+.+|.+++|+|++|.+|++..++ +..+|.+. 
T Consensus 74 -----------~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~ 142 (384) T protein:vir:49 74 -----------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQNGLYYNIT 142 (384) T ss_pred -----------hhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEE Confidence 23678999999999999999999999999999999999999999999999999987654 34456655 Q ss_pred ec----CceEEecHhHeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHH Q lcl|NC_019719. 168 RD----SEYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQ 242 (424) Q Consensus 168 ~~----~~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~ 242 (424) .. +..+.|+++||||+|+++.++ ++|+||+.++..++.+..++++++.++|+||+.|++++++++...+++.. T Consensus 143 ~~~~~~~~~~~~~~~eVih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~-- 220 (384) T protein:vir:49 143 FDDPRIPPKQHVPQGDILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKT-- 220 (384) T ss_pred ecCccccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHH-- Confidence 43 345789999999999988776 89999999999999999999999999999999999999999887655443 Q ss_pred HHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHH Q lcl|NC_019719. 
243 VEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF 322 (424) Q Consensus 243 ~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~ 322 (424) ++..+.+.+.+|+|+++++++|++|+++++++.|+||+|.+++++++||++|||||++||+..++++++++.+++...| T Consensus 221 -~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~ 299 (384) T protein:vir:49 221 -KQSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKA 299 (384) T ss_pred -HHHHHHHhcccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHH Confidence 3455566778899999999999999999999999999999999999999999999999999877778888999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccC---cc-ccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019719. 323 LQYTLQPYISRWENSIQRWLIP---AK-DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~l~~---~~-~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~g 398 (424) ++.++.|++..|+++|++++.. .. .....+++|+++.+++.|..++.+++..+.+.|+++ ||+|+.+|++|+||| T Consensus 300 i~~~l~pi~~~i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~-ne~r~~~~~~p~~gG 378 (384) T protein:vir:49 300 VSRFLRPFVSELSKKLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILP-KDLPEGETDSTLKGG 378 (384) T ss_pred HHHHHHHHHHHHHHHhchhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCC-hhHHHHcCCCCCCCC Confidence 9999999999999999998743 22 234577899999999999999999999999999986 999999999999986 Q ss_pred C--eee Q lcl|NC_019719. 
399 D--VAM 402 (424) Q Consensus 399 d--~~~ 402 (424) | +.+ T Consensus 379 d~~~~~ 384 (384) T protein:vir:49 379 ETNEQY 384 (384) T ss_pred CCCCCC Confidence 3 444 No 67 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=100.00 E-value=1.2e-78 Score=447.73 Aligned_cols=356 Identities=14% Similarity=0.106 Sum_probs=284.6 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccc-- Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK-- 91 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~-- 91 (424) =|||+++.++.++...... ..+.. . .+...+++.++|++||++||++||++|+++++..+++... T Consensus 1 Mg~f~~~~~~~~~~~~~~~------~~~~~---~----~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDT------QRVTA---W----QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CCccccchhcccccccCCc------ceeee---e----ccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCccccc Confidence 6888887765433222111 11111 1 1122356778999999999999999999998887766432 Q ss_pred -cccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeC-CCCceeeEEeecCceEEEEEcCCceEEEEEec Q lcl|NC_019719. 92 -KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN-SAGDVISLLPLQSANMDVKLVGKKVVYRYQRD 169 (424) Q Consensus 92 -~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~-~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 169 (424) ....+|++++||+.+||++||+++||+.++.+++++||+|++++++ ..|.+..++|. 
T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~l~p~--------------------- 126 (378) T protein:vir:94 68 LISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLDLLFA--------------------- 126 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEEEEec--------------------- Confidence 3346799999999999999999999999999999999999997654 45666666552 Q ss_pred CceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH---HHHHHHHH Q lcl|NC_019719. 170 SEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ---QRSQVEEN 246 (424) Q Consensus 170 ~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~---~~~~~~~~ 246 (424) +..++|+++||||+|++ .++..|+||+..+...+.. ++.+ +.++|+++++..+.++. .++++++. T Consensus 127 ~~~~~~~~~diiH~~~~-~~~~~g~s~l~~~~~~i~~----------~~~~-~~~~gil~~~~~l~~~~~~~~~~~~~~~ 194 (378) T protein:vir:94 127 DDKKEYKPEELVRLTSP-FYINEDTSILDNALASIQT----------KLEQ-GKLRGLLKINAFLDIDNTQEYREKALTT 194 (378) T ss_pred CCeeEeeeeeeEEecCc-CCccchhHHHHHHHHHHHH----------HHhc-ccccceeeeCCcCCHHHHHHHHHHHHHH Confidence 12346788999999965 5677899999988876643 2344 45899999987764432 34555566 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 
247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) ++...+++++|+++++++|++|+++++++.++|+ +.+++++++||++|||||.+|++ +++|++..+|+++| T Consensus 195 ~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVP~~~l~~--------~~se~~~~~f~~~t 265 (378) T protein:vir:94 195 IKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG--------TASQEQQIYFYNST 265 (378) T ss_pred HHHhhcccccccceecCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC--------ChHHHHHHHHHHHH Confidence 6666778889999999999999999999999997 66789999999999999999953 14578999999999 Q ss_pred HHHHHHHHHHHHHhhccCccccccc-------eeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC Q lcl|NC_019719. 327 LQPYISRWENSIQRWLIPAKDVGRI-------HAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD 399 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~-------~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd 399 (424) |.|++.+||++|+++||++.++... .++||++.++++|.+++++.+++++++||||+||+|+++|+||+|||| T Consensus 266 L~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD 345 (378) T protein:vir:94 266 IIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD 345 (378) T ss_pred HHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 9999999999999999998776432 378999999999999999999999999999999999999999999999 Q ss_pred eeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 
400 VAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 400 ~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++++|+|++|++..++++..++++- T Consensus 346 ~~~~~~n~~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:94 346 VYIANLNAVAVKNLSDLQGSRKDVT 370 (378) T ss_pred eeeecccccccccchhhcCCcCCCC Confidence 9999999999988776554333221 No 68 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=100.00 E-value=2.5e-78 Score=446.02 Aligned_cols=356 Identities=14% Similarity=0.110 Sum_probs=283.8 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcccc- Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK- 92 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~- 92 (424) =|||+++.++.++...... ..+.. + .+...+++.++|++||++||++||++||++|++.+++.... T Consensus 1 Mg~f~~~~~f~~~~~~~~~------~~~~~---~----~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:93 1 MNLFGKVVSFSRGKLNNDT------QRVTA---W----QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CccchhhhhhhccccCCCc------ceeee---c----ccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEccccccccc Confidence 6888888764332221111 11111 1 11233567889999999999999999999998877664332 Q ss_pred --ccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-CceeeEEeecCceEEEEEcCCceEEEEEec Q lcl|NC_019719. 93 --VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-GDVISLLPLQSANMDVKLVGKKVVYRYQRD 169 (424) Q Consensus 93 --~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~-G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 169 (424) ...+|++++||+.+||++||+++||+.++.+++++||+|++++++.. |.+..++|. 
T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~l~~~--------------------- 126 (378) T protein:vir:93 68 LISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFA--------------------- 126 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEec--------------------- Confidence 24679999999999999999999999999999999999999887643 555555442 Q ss_pred CceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH---HHHHHHHHH Q lcl|NC_019719. 170 SEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE---QQRSQVEEN 246 (424) Q Consensus 170 ~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~---~~~~~~~~~ 246 (424) +...+|+++||||+|++ .++..|.|++..+...+. .++.+| .++|+++++..+.++ +.++++++. T Consensus 127 ~~~~~~~~~diih~r~~-~~~~~~~s~l~~~~~~i~----------~~~~~~-~~~g~l~~~~~l~~~~~~~~~~~~~~~ 194 (378) T protein:vir:93 127 DDKKEYKTEELVRLTSP-FYINEDTSILDNALASIQ----------TKLEQG-KLRGLLKINAFLDIDNTQEYREKALTT 194 (378) T ss_pred CCeeEeccceeEEecCc-cccchhhHHHHHHHHHHH----------HHHhcC-cccceeeeCCcCCHHHHHHHHHHHHHH Confidence 23456889999999964 566778999887766542 345555 589999998776443 234455566 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 
247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) ++...+++++|+++++++|++|++++.+++|+|+ +.+++++++||++|||||.+|++ +++|++..+|+++| T Consensus 195 ~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~g--------~~~e~~~~~f~~~t 265 (378) T protein:vir:93 195 IKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG--------TATQEQQIYFYNST 265 (378) T ss_pred HHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC--------CcHHHHHHHHHHHH Confidence 6666778889999999999999999999999997 66789999999999999999953 24578999999999 Q ss_pred HHHHHHHHHHHHHhhccCcccccc-------ceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC Q lcl|NC_019719. 327 LQPYISRWENSIQRWLIPAKDVGR-------IHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD 399 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~-------~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd 399 (424) |.|++.+||++|+++||++.++.. ..++||++.++++|.+++++.+.+++++|+||+||+|+++|+||+|||| T Consensus 266 l~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD 345 (378) T protein:vir:93 266 IIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD 345 (378) T ss_pred HHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 999999999999999999887642 2378999999999999999999999999999999999999999999999 Q ss_pred eeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 
400 VAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 400 ~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++++|+|++|++..++++.++.+.- T Consensus 346 ~~~~~~n~~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:93 346 VYIANLNAVAVKNLSDLQGSRKDVT 370 (378) T ss_pred eeeeccccccccchhhhcCccCCCC Confidence 9999999999988876654433332 No 69 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=100.00 E-value=8.7e-78 Score=443.07 Aligned_cols=363 Identities=13% Similarity=0.148 Sum_probs=292.5 Q ss_pred hHHHHHhhccCcccCc----ccccc----cccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEeccc Q lcl|NC_019719. 16 WWARLQSWFVGGRLVT----PNQGS----QTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 16 ~~~~l~~~~~~~~~~~----~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~ 87 (424) |+-.|++++++..... +.... ....+......++..|+.+.++++++|++||++||++||++|++++++.. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~- 79 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN- 79 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchh- Confidence 3333333333322111 11111 11122333445677899999999999999999999999999999986532 Q ss_pred CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcC--CceEEE Q lcl|NC_019719. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG--KKVVYR 165 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~--~~~~~~ 165 (424) +.|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|+|++|.+|++..+. +..+|. 
T Consensus 80 -------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~ 146 (392) T protein:vir:10 80 -------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYN 146 (392) T ss_pred -------------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEE Confidence 23667999999999999999999999999999999999999999999999999988764 344555 Q ss_pred EEecC----ceEEecHhHeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019719. 166 YQRDS----EYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 166 ~~~~~----~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~ 240 (424) +...+ ....++++||||+|+++.++ ++|+||+.++..++.+..++++++.++|+||+.|+++++++.+...++. T Consensus 147 ~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~- 225 (392) T protein:vir:10 147 ITFDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDK- 225 (392) T ss_pred EEecCcccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHH- Confidence 55433 35789999999999998887 7999999999999999999999999999999999999999876533322 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHH Q lcl|NC_019719. 
241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 241 ~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~ 320 (424) ..+++.+++.+..++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||+...++ +.+++.+ T Consensus 226 -~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~----~~~~~~~ 300 (392) T protein:vir:10 226 -DKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ----SSIQQIS 300 (392) T ss_pred -HHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc----cHHHHHH Confidence 23344456677788999999999999999999999999999999999999999999999998754332 4467789 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh---------- Q lcl|NC_019719. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTD---------- 390 (424) Q Consensus 321 ~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~---------- 390 (424) +|+++||.|+++.||++|+++|++. ++||...+++.|...+++.+.+++++|++|+||+|+++ T Consensus 301 ~f~~~~l~P~~~~ie~~l~~~L~~~-------~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~ 373 (392) T protein:vir:10 301 GMYASALNRYLRPAISELEYKLSDH-------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDL 373 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccc-------ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcccc Confidence 9999999999999999999999864 46888888999999999999999999999999999987 Q ss_pred ----CCCCCCCCCeeeecccccc Q lcl|NC_019719. 391 ----NLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 391 ----G~~p~~~gd~~~~~~n~~~ 409 (424) |+||++|||. 
.+.+| T Consensus 374 r~~e~l~~~~~Gd~----~~p~p 392 (392) T protein:vir:10 374 PAPENTNKKTTGQS----NEPVP 392 (392) T ss_pred chhcCCCCCCCCCC----CCCCC Confidence 4555554442 11122 No 70 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=100.00 E-value=8.7e-78 Score=443.07 Aligned_cols=363 Identities=13% Similarity=0.148 Sum_probs=292.5 Q ss_pred hHHHHHhhccCcccCc----ccccc----cccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEeccc Q lcl|NC_019719. 16 WWARLQSWFVGGRLVT----PNQGS----QTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 16 ~~~~l~~~~~~~~~~~----~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~ 87 (424) |+-.|++++++..... +.... ....+......++..|+.+.++++++|++||++||++||++|++++++.. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~- 79 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN- 79 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchh- Confidence 3333333333322111 11111 11122333445677899999999999999999999999999999986532 Q ss_pred CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcC--CceEEE Q lcl|NC_019719. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG--KKVVYR 165 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~--~~~~~~ 165 (424) +.|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|+|++|.+|++..+. +..+|. 
T Consensus 80 -------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~ 146 (392) T protein:vir:39 80 -------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYN 146 (392) T ss_pred -------------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEE Confidence 23667999999999999999999999999999999999999999999999999988764 344555 Q ss_pred EEecC----ceEEecHhHeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019719. 166 YQRDS----EYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 166 ~~~~~----~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~ 240 (424) +...+ ....++++||||+|+++.++ ++|+||+.++..++.+..++++++.++|+||+.|+++++++.+...++. T Consensus 147 ~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~- 225 (392) T protein:vir:39 147 ITFDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDK- 225 (392) T ss_pred EEecCcccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHH- Confidence 55433 35789999999999998887 7999999999999999999999999999999999999999876533322 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHH Q lcl|NC_019719. 
241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 241 ~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~ 320 (424) ..+++.+++.+..++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||+...++ +.+++.+ T Consensus 226 -~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~----~~~~~~~ 300 (392) T protein:vir:39 226 -DKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ----SSIQQIS 300 (392) T ss_pred -HHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc----cHHHHHH Confidence 23344456677788999999999999999999999999999999999999999999999998754332 4467789 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh---------- Q lcl|NC_019719. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTD---------- 390 (424) Q Consensus 321 ~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~---------- 390 (424) +|+++||.|+++.||++|+++|++. ++||...+++.|...+++.+.+++++|++|+||+|+++ T Consensus 301 ~f~~~~l~P~~~~ie~~l~~~L~~~-------~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~ 373 (392) T protein:vir:39 301 GMYASALNRYLRPAISELEYKLSDH-------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDL 373 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccc-------ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcccc Confidence 9999999999999999999999864 46888888999999999999999999999999999987 Q ss_pred ----CCCCCCCCCeeeecccccc Q lcl|NC_019719. 391 ----NLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 391 ----G~~p~~~gd~~~~~~n~~~ 409 (424) |+||++|||. 
.+.+| T Consensus 374 r~~e~l~~~~~Gd~----~~p~p 392 (392) T protein:vir:39 374 PAPENTNKKTTGQS----NEPVP 392 (392) T ss_pred chhcCCCCCCCCCC----CCCCC Confidence 4555554442 11122 No 71 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=100.00 E-value=1.5e-77 Score=441.75 Aligned_cols=406 Identities=12% Similarity=0.110 Sum_probs=299.5 Q ss_pred CchHHHHHhhccCc--------------------------ccCcccccccccccc------cccccCccccc-------H Q lcl|NC_019719. 14 NGWWARLQSWFVGG--------------------------RLVTPNQGSQTGPVS------AHGHLGDSSIN-------D 54 (424) Q Consensus 14 ~G~~~~l~~~~~~~--------------------------~~~~~~~~~~~~~~~------~~~~~~~~~~~-------~ 54 (424) -|||++|+-.++.. +..+.....+..+.. ......+.... . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~ 80 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVL 80 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHH Confidence 34444444322200 000001111111111 00000111111 2 Q ss_pred HHHhhhHHHHHHHHHHHHhhccC-------------ceEEEEecccCccccccccchhhhhhccCCCCCC-----CHHHH Q lcl|NC_019719. 55 ERILQISTVWRCVSLISTLTACL-------------PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYM-----TAQEF 116 (424) Q Consensus 55 ~~~~~~~~v~~~i~~ia~~ia~~-------------~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~-----s~~~f 116 (424) +.+..+|+|++||++||+.||++ +++++.++..........-+.+..+|. 
+||+++ |+.+| T Consensus 81 ~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~-~pn~~~~p~~~s~~~f 159 (547) T protein:vir:63 81 KKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIE-KTGVDNDINRDSFSSF 159 (547) T ss_pred HHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHH-hhCCCCCCccchHHHH Confidence 34566899999999999999974 233332222222222223355666665 788774 88999 Q ss_pred HHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc-------eEEEEEecCceEEecHhHeeEeccCCC- Q lcl|NC_019719. 117 REAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-------VVYRYQRDSEYADFSQKEIFHLKGFGF- 188 (424) Q Consensus 117 ~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~-------~~~~~~~~~~~~~~~~~evih~r~~~~- 188 (424) ++.++.+++++||+|++++|+.+|.+++||||+|.+|++..+.++ .++++..++....|+++||||+|+++. T Consensus 160 ~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~eiih~r~n~~~ 239 (547) T protein:vir:63 160 VKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAVRNPRS 239 (547) T ss_pred HHHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCccccccCceEEEEEcCCcEEEEeccccEEEecccCCC Confidence 999999999999999999999999999999999999999876543 233444555667899999999997653 Q ss_pred ---CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhC-CcccCcceec- Q lcl|NC_019719. 189 ---TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAG-GPVKKRLWIL- 262 (424) Q Consensus 189 ---~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~g~~~~l- 262 (424) .+.+|+||+..+..++.+..++++++.++|+||++|+|+|+++.. 
..++++.+++++.|++.++ ..|+|+++++ T Consensus 240 ~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl~ 319 (547) T protein:vir:63 240 DIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVS 319 (547) T ss_pred CcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCccccccccccc Confidence 256899999999999999999999999999999999999998754 3588889999999988655 4789997665 Q ss_pred CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCC--------CccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 263 EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS--------TSWGSGIEQQNLGFLQYTLQPYISRW 334 (424) Q Consensus 263 ~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~--------~~~~~n~e~~~~~~~~~tl~P~~~~i 334 (424) ++|++|+++++++.|+||+|++++++++||++|||||++||...++ +.+++|+|++.+.|+++||+|++..| T Consensus 320 ~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~i 399 (547) T protein:vir:63 320 AEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFI 399 (547) T ss_pred CCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHH Confidence 6889999999999999999999999999999999999999976543 34678999999999999999999999 Q ss_pred HHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCC-CCCCCeeeecccccchhhc Q lcl|NC_019719. 335 ENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPP-LPGGDVAMRQSQYVPITDL 413 (424) Q Consensus 335 e~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p-~~~gd~~~~~~n~~~~~~~ 413 (424) |++||++|++..+. .+ .|+++.+...+..++++.+ +++.+|+||+||+|+++|+|| +||||+++.+.+..+++.. 
T Consensus 400 e~~ln~~L~~~~~~-~~--~~~f~~~~~~~~~~~~~~~-~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~ 475 (547) T protein:vir:63 400 EDFINKHIVAEFGD-KY--TFQFVGGDIKSELESVKIL-AEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQL 475 (547) T ss_pred HHHHHhhcccccCC-ce--EEEeeccccccHHHHHHHH-HHHhCCCcCHHHHHHHhCCCCCCCCCceeeccccccccccc Confidence 99999999976542 33 4555667777777777655 577889999999999999998 6999999999888776442 Q ss_pred cccCCCcc--------------------------cCC Q lcl|NC_019719. 414 GTNKEPRN--------------------------NGA 424 (424) Q Consensus 414 ~~~~~~~~--------------------------~ga 424 (424) ...+.+++ +++ T Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (547) T protein:vir:63 476 MQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGK 512 (547) T ss_pred ccccCCccccchhhccccccccCCCCCCCCCCCCCCc Confidence 21111100 000 No 72 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=100.00 E-value=3.6e-78 Score=445.17 Aligned_cols=356 Identities=14% Similarity=0.108 Sum_probs=284.7 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcccc- Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK- 92 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~- 92 (424) =|||+++.++.++...... ..+.. + .....+++.++|++||++||++||++||++|++.+++.... T Consensus 1 Mg~f~~~~~~~~~~~~~~~------~~~~~---~----~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~ 67 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLNNDT------QRVTA---W----QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CccchhhhhhhcccccCCc------ceeee---c----ccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEccccccccc Confidence 6888888775443322111 11111 1 11233567889999999999999999999998877664332 Q ss_pred --ccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-CceeeEEeecCceEEEEEcCCceEEEEEec Q lcl|NC_019719. 
93 --VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-GDVISLLPLQSANMDVKLVGKKVVYRYQRD 169 (424) Q Consensus 93 --~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~-G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 169 (424) ...+|+++++|+.+||++||+++||+.++.+++++||+|++++++.. |.+..++|. T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~~l~~~--------------------- 126 (378) T protein:vir:16 68 LISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFA--------------------- 126 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEec--------------------- Confidence 24579999999999999999999999999999999999999988754 555555442 Q ss_pred CceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH---HHHHHHHHH Q lcl|NC_019719. 170 SEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE---QQRSQVEEN 246 (424) Q Consensus 170 ~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~---~~~~~~~~~ 246 (424) +..+.|+++||||+|.+ .++..|.|++..+...+. .++.+ +.++|+++.+..+.++ +.++++++. T Consensus 127 ~~~~~~~~~diih~r~~-~~~~~~~s~l~~~~~~i~----------~~~~~-~~~~g~l~~~~~l~~~~~~~~~~~~~~~ 194 (378) T protein:vir:16 127 DDKKEYKPEELVRLTSP-FYINEDTSILDNALASIQ----------TKLEQ-GKLRGLLKINAFLDIDNTQEYREKALTT 194 (378) T ss_pred CCeeEecccceEEecCc-cCccchhHHHHHHHHHHH----------HHHhc-CccceeeEeCCcCCHHHHHHHHHHHHHH Confidence 22456789999999963 566788999888776553 33444 4688999998776443 334556666 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 
247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) ++...+++++|+++++++|++|+++++++.++|+ +.+++++++||++|||||.+|++ +++|++.++|+++| T Consensus 195 ~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~g--------~~~e~~~~~f~~~t 265 (378) T protein:vir:16 195 IKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG--------TASQEQQIYFYNST 265 (378) T ss_pred HHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC--------CchHHHHHHHHHHH Confidence 6666788899999999999999999999999997 55689999999999999999953 24578999999999 Q ss_pred HHHHHHHHHHHHHhhccCccccccc-------eeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC Q lcl|NC_019719. 327 LQPYISRWENSIQRWLIPAKDVGRI-------HAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD 399 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~-------~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd 399 (424) |.|++++||++|+++||++.++..+ .++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||+|||| T Consensus 266 l~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD 345 (378) T protein:vir:16 266 IIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD 345 (378) T ss_pred HHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 9999999999999999998876432 378999999999999999999999999999999999999999999999 Q ss_pred eeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 
400 VAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 400 ~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++++|+|++|++...+++.+++++- T Consensus 346 ~~~~~~n~~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:16 346 VYIANLNAVAVKNLSDLQGSRKDVT 370 (378) T ss_pred eEeeccccccccchhhhcCccCCCC Confidence 9999999999988776554433332 No 73 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=100.00 E-value=1.3e-77 Score=442.15 Aligned_cols=379 Identities=15% Similarity=0.114 Sum_probs=286.6 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =||++++++||.+....... +....+.....++.+.++++++|++||++||+++|++||+++++++ T Consensus 1 Mg~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~------- 66 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQRTLNL-------TDTVWCSIPSEKLKELSIKKWAIDSCANKIANTLSCAEVLTYEKGE------- 66 (395) T ss_pred CchHHHHHhhhccccccccc-------ccchhhccccccchhhhhhhHHHHHHHHHHHHHHhhCceeeccCCc------- Confidence 69999999999765432221 1112223344567788999999999999999999999999987532 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEEecC--c Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDS--E 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~--~ 171 (424) ...|+++++|+.+||++||+++||+.++.+++++||||+++.++. +++.++..+.........+..+...+ . 
T Consensus 67 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~------~~~~~~~~~~~~~~~~~~~~~v~~~~~~~ 140 (395) T protein:vir:40 67 EVRKKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEY------IYVADSFTKNDKSLYENTYTEVTLKDLTL 140 (395) T ss_pred cccchHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCc------eeecCCccccccccccceeeeeeecCcee Confidence 245899999999999999999999999999999999999998764 23333322221111111111222222 2 Q ss_pred eEEecHhHeeEeccCCCCccccCch-HHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019719. 172 YADFSQKEIFHLKGFGFTGLVGLSP-IAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 172 ~~~~~~~evih~r~~~~~~~~G~s~-~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) .+.|+++||||+|+.+..+...++. .......+... .....+.++.++.++++.+.. .++++.+++++.|++. T Consensus 141 ~~~~~~~evih~r~~~~~~~~~~~~l~~~~~~~~~~~-----~~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~~~~~~~ 214 (395) T protein:vir:40 141 KKEFKESEVLHLTLNNESIKSIIDGFYLLYGDLLTAA-----VNKYKKLNSRKIIVKLKAMFG-QTPEAEEKLRLMLSER 214 (395) T ss_pred eeeeccccEEEeecCCCCccccchhHHHHHHHHHHHH-----HHHHHhcCCCCceEEEecccC-CCHHHHHHHHHHHHHH Confidence 4679999999999765433222222 23222222211 222334455555555554544 4778888899999887 Q ss_pred hCC--cccCcceecCCCceeeecccChhHHHHHHHHHHHH---HHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHH Q lcl|NC_019719. 251 AGG--PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQV---SELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 251 ~~~--~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~---~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~ 325 (424) +++ .++++++++++|++|+++++++.|+||+|.+++.. 
++||++|||||.+|++ +++|.|++.+.|+++ T Consensus 215 ~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~~------~~sn~e~~~~~f~~~ 288 (395) T protein:vir:40 215 MKKFLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAKG------DTVGLSEQVNSFLMF 288 (395) T ss_pred HHHhhccCCceeecCCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------CCcCHHHHHHHHHHH Confidence 654 57788999999999999999999999999998875 7999999999999963 345889999999999 Q ss_pred HHHHHHHHHHHHHHhhccCccccc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeee Q lcl|NC_019719. 326 TLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAM 402 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~--gd~~~ 402 (424) ||.|++++||++|+++||++.++. +++++||+++++++|.+++++.+.+++++|+||+||+|+++|+||+++ ||+++ T Consensus 289 ~L~P~~~~ie~~l~~kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD~~~ 368 (395) T protein:vir:40 289 SINPIAEMFTDEGNRKFYGRDSVLERTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQERF 368 (395) T ss_pred HHHHHHHHHHHHHHHhcCChhhhcCCceEEEechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCceee Confidence 999999999999999999998874 689999999999999999999999999999999999999999999954 99999 Q ss_pred ecccccchhhcccc-CCCcccCC Q lcl|NC_019719. 403 RQSQYVPITDLGTN-KEPRNNGA 424 (424) Q Consensus 403 ~~~n~~~~~~~~~~-~~~~~~ga 424 (424) +|+|++|++...+. +++++++. 
T Consensus 369 ~~~n~~~~~~~~~~~kgge~~~~ 391 (395) T protein:vir:40 369 VTKNYAPLGENEEDLKGGDINEN 391 (395) T ss_pred eccccccccccccccCCCCCCCC Confidence 99999999865443 22222222 No 74 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=100.00 E-value=4e-77 Score=439.44 Aligned_cols=366 Identities=13% Similarity=0.119 Sum_probs=284.7 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =|||+++++.. .. . ... .....+..++.+.|+++++|++||++||+++|++||++|++. . T Consensus 1 Mg~f~~l~~~~---~~--~-~~~-------~~~~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~-------~ 60 (376) T protein:vir:78 1 MGFFSELFKRN---KE--I-EWM-------WDLDFLEDKTTKVYLKKMALNTCVKHIARTIAKSDFRLKNGE-------T 60 (376) T ss_pred CchhhhhhccC---Cc--c-ccc-------cchhhccccchhhhhhhHHHHHHHHHHHHhhcccceeecccc-------c Confidence 57888774331 11 0 000 011123457788899999999999999999999999998643 2 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEEecCceE Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ..+|+++++|+.+||++||+++||+.++.+++++||+|+++.|+..|.+.+++|+.+..+........ ......... 
T Consensus 61 ~~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~ 137 (376) T protein:vir:78 61 SVRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAFFPDVFEGV---TVKDYRYNR 137 (376) T ss_pred cccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccceeeeeeeee---eeecceeee Confidence 45799999999999999999999999999999999999999999999999999999887655433221 112222346 Q ss_pred EecHhHeeEeccCCCCccccCchH-HHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhC Q lcl|NC_019719. 174 DFSQKEIFHLKGFGFTGLVGLSPI-AFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAG 252 (424) Q Consensus 174 ~~~~~evih~r~~~~~~~~G~s~~-~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~ 252 (424) .++++||||+|+.+.++....+++ ..+... .......++.+++.+++++.......++++.+++++.|++.++ T Consensus 138 ~~~~~evih~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 211 (376) T protein:vir:78 138 NFSMDDVIFLEYGNERLSAFTDGMFEDYGEL------FGKMIRAQMRNFQIRGAVNFKMAGVADKDKQTKLQEYIDKVYA 211 (376) T ss_pred eeccccEEEeccCCCCchhhhhHHHHHHHHH------HHHHHHHHHhcCCCceeEEEccCCCCCHHHHHHHHHHHHHHhc Confidence 799999999997664433222222 222111 2222233334444434333333445678888999999998887 Q ss_pred Cc--ccCcceecCCCceeeecccChhH-----HHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHH Q lcl|NC_019719. 253 GP--VKKRLWILEAGFSTSAIGVTPQD-----AEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 253 ~~--~~g~~~~l~~g~~~~~l~~~~~d-----~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~ 325 (424) +. 
+.++++++++|++|+++++++.| +||+|++++++++||++|||||.+|++ +++|.|++.+.|+++ T Consensus 212 g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~------~~s~~e~~~~~f~~~ 285 (376) T protein:vir:78 212 SFNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHG------DMADLSNNMKAYMEY 285 (376) T ss_pred cccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCC------CCCCHHHHHHHHHHH Confidence 63 44568889999999999998865 499999999999999999999999973 345889999999999 Q ss_pred HHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC--Ceeee Q lcl|NC_019719. 326 TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG--DVAMR 403 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~g--d~~~~ 403 (424) ||.|++.+||++|+++|+++.+ ++++|+++.+++.|.+++++.+++++++||+|+||+|+++|+||+||| |++++ T Consensus 286 ~l~P~~~~ie~~l~~kll~~~~---~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~~d~~~~ 362 (376) T protein:vir:78 286 CIDPLTKKLEDELNAKLFTFSE---FLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPELDKYLI 362 (376) T ss_pred HHHHHHHHHHHHHHhhhCCccc---ceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeee Confidence 9999999999999999999765 457788899999999999999999999999999999999999999876 99999 Q ss_pred cccccchhhccccCCCcccC Q lcl|NC_019719. 404 QSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 404 ~~n~~~~~~~~~~~~~~~~g 423 (424) |+|++|+++.++ +| T Consensus 363 ~~n~~~~~~~~e------~g 376 (376) T protein:vir:78 363 TKNYQSADEGGE------DG 376 (376) T ss_pred ccCceehhcccc------CC Confidence 999999975432 33 No 75 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=100.00 E-value=2.7e-76 Score=434.86 Aligned_cols=417 Identities=13% Similarity=0.121 Sum_probs=298.1 Q ss_pred CCCC----cccccCCCCC--chHHHHHhhcc-CcccCcccccccccccc----cccccCcccc----------cHHHHhh Q lcl|NC_019719. 
1 MEEP----KYTIDLRTNN--GWWARLQSWFV-GGRLVTPNQGSQTGPVS----AHGHLGDSSI----------NDERILQ 59 (424) Q Consensus 1 ~~~~----~~~~~~~~~~--G~~~~l~~~~~-~~~~~~~~~~~~~~~~~----~~~~~~~~~~----------~~~~~~~ 59 (424) ..-+ .-|+.+ ++. +.|.++...-. ..+..+.....+..|+. +.......+. ..+.+.. T Consensus 14 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~~ 92 (576) T protein:vir:96 14 LGRDYEDIIDTVPI-DDGLQANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVLKQFGN 92 (576) T ss_pred ccCccccchhhhhc-ccChhHHHHHhhhhhhhhccccCCccchhhcceeeeeecCCCccccCcchhhhhhhHHHHHHhhc Confidence 1111 112222 221 34444422100 01111111222233321 1111111010 1133456 Q ss_pred hHHHHHHHHHHHHhhccC-------------ceEEEEecccCccccccccchhhhhh---ccCCCCC-CCHHHHHHHHHH Q lcl|NC_019719. 60 ISTVWRCVSLISTLTACL-------------PLDVFETDQNDNRKKVDLSNPLARLL---RYSPNQY-MTAQEFREAMTM 122 (424) Q Consensus 60 ~~~v~~~i~~ia~~ia~~-------------~~~v~~~~~~~~~~~~~~~~~l~~lL---~~~pN~~-~s~~~f~~~~~~ 122 (424) +|+|++||++||+.||++ ++.++..+.....+.....|++...| +..|||+ +|+.+||+.++. T Consensus 93 npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~ 172 (576) T protein:vir:96 93 NPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVR 172 (576) T ss_pred CHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHH Confidence 889999999999999973 33344333222222223333333332 2345555 589999999999 Q ss_pred HHHHcCCeEEEEeeC--CCCceeeEEeecCceEEEEEcCCceE-------EEEEecCceEEecHhHeeEec-cCCCC--- Q lcl|NC_019719. 123 QLCFYGNAYALVDRN--SAGDVISLLPLQSANMDVKLVGKKVV-------YRYQRDSEYADFSQKEIFHLK-GFGFT--- 189 (424) Q Consensus 123 ~~l~~G~a~~~~~r~--~~G~~~~l~~l~~~~v~~~~~~~~~~-------~~~~~~~~~~~~~~~evih~r-~~~~~--- 189 (424) +++++||+|++++++ ..|.+++||||+|.+|++..+.++.. 
+++..+.....|++++|||++ +++.+ T Consensus 173 dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~dii~~~~~~~~d~~~ 252 (576) T protein:vir:96 173 DTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVINKKVVASFTSREMAMGIRNPRTELSS 252 (576) T ss_pred HHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCceeeeeeEEEEecCCceEEEecccceEEEeecCCCCccc Confidence 999999999998854 45789999999999999998876543 223344566789999998765 44444 Q ss_pred ccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhC-CcccCc-ceecCCCc Q lcl|NC_019719. 190 GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAG-GPVKKR-LWILEAGF 266 (424) Q Consensus 190 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~g~-~~~l~~g~ 266 (424) +.+|+||+.++..++.+..++++++.++|+||++|+|||+.+.+ ..++++.+++++.|++.++ ..|+|+ ++++++|+ T Consensus 253 ~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~ 332 (576) T protein:vir:96 253 SGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDI 332 (576) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCc Confidence 67899999999999999999999999999999999999998765 3578889999999988665 468888 58899999 Q ss_pred eeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCC---------CccchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 
267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS---------TSWGSGIEQQNLGFLQYTLQPYISRWENS 337 (424) Q Consensus 267 ~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~---------~~~~~n~e~~~~~~~~~tl~P~~~~ie~~ 337 (424) +|+++++++.|+||+|++++++++||++|||||.+||..+.+ +.+++|+|++.+.|+++||+|++.+||++ T Consensus 333 ~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ 412 (576) T protein:vir:96 333 KFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDL 412 (576) T ss_pred eEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999999987654 33678999999999999999999999999 Q ss_pred HHhhccCccccccceeeecchhhhccCHHHHHHHHHHH--HhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchhhccc Q lcl|NC_019719. 338 IQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAM--GEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGT 415 (424) Q Consensus 338 l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~--~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~ 415 (424) |+++|++..+. .++++ +++.|.+++++.++.+ +.+|+||+||+|+++|+||+||||+++.|.++.+++.... T Consensus 413 ln~~Ll~~~~~-~~~~~-----f~r~d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~ 486 (576) T protein:vir:96 413 INTHIISEYSD-KYVFQ-----FVGGDTKSELDKIKILQEEVKTYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQ 486 (576) T ss_pred HHhhhchhccC-ceEEE-----eccCCHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCCCCcceecccccccccccccc Confidence 99999987543 23333 3567888888877654 5579999999999999999999999999999887755332 Q ss_pred cCCCcccCC Q lcl|NC_019719. 
416 NKEPRNNGA 424 (424) Q Consensus 416 ~~~~~~~ga 424 (424) ....+...+ T Consensus 487 ~~~~e~~~~ 495 (576) T protein:vir:96 487 KEQYEDTKQ 495 (576) T ss_pred CCCCCCccc Confidence 211111111 No 76 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=100.00 E-value=1.4e-75 Score=430.99 Aligned_cols=414 Identities=13% Similarity=0.170 Sum_probs=304.2 Q ss_pred CCCC--cccccCCCCCchHHHHHhh---------ccCcccCcccccccccccccc----cc-------cCcc---cccHH Q lcl|NC_019719. 1 MEEP--KYTIDLRTNNGWWARLQSW---------FVGGRLVTPNQGSQTGPVSAH----GH-------LGDS---SINDE 55 (424) Q Consensus 1 ~~~~--~~~~~~~~~~G~~~~l~~~---------~~~~~~~~~~~~~~~~~~~~~----~~-------~~~~---~~~~~ 55 (424) .+.- --++++ +.|+=.++... +.+.. ......+..++... .. .... ...-+ T Consensus 14 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~ 89 (563) T protein:vir:95 14 YGNNSTIAQVPI--DEGLQANIKKIEQDNKEYQDLTKSL--YGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLK 89 (563) T ss_pred cccccccceeec--cCChhhhHhhhhccchhHHHHHhhh--ccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHH Confidence 1111 112233 22322222211 11110 01111112222111 00 0000 01123 Q ss_pred HHhhhHHHHHHHHHHHHhhcc-------------CceEEEEecccCccccccccchhhhhhc---cCCCCC-CCHHHHHH Q lcl|NC_019719. 56 RILQISTVWRCVSLISTLTAC-------------LPLDVFETDQNDNRKKVDLSNPLARLLR---YSPNQY-MTAQEFRE 118 (424) Q Consensus 56 ~~~~~~~v~~~i~~ia~~ia~-------------~~~~v~~~~~~~~~~~~~~~~~l~~lL~---~~pN~~-~s~~~f~~ 118 (424) .+..+++|.+||+++++.||. +|+++++++..+..++....|++..+|. 
..|||+ +|+++||+ T Consensus 90 ~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~ 169 (563) T protein:vir:95 90 KFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCK 169 (563) T ss_pred HhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHH Confidence 344478889999999998885 6888887777666666666677766553 223333 58999999 Q ss_pred HHHHHHHHcCCeEEEEe--eCCCCceeeEEeecCceEEEEEcCCceE-------EEEEecCceEEecHhHeeEe-ccCCC Q lcl|NC_019719. 119 AMTMQLCFYGNAYALVD--RNSAGDVISLLPLQSANMDVKLVGKKVV-------YRYQRDSEYADFSQKEIFHL-KGFGF 188 (424) Q Consensus 119 ~~~~~~l~~G~a~~~~~--r~~~G~~~~l~~l~~~~v~~~~~~~~~~-------~~~~~~~~~~~~~~~evih~-r~~~~ 188 (424) .++.+++++||+|++++ |+..|++++||||+|.+|++..++++.. +++..+.....|++++|||+ ++++. T Consensus 170 ~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~ 249 (563) T protein:vir:95 170 KIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRT 249 (563) T ss_pred HHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCC Confidence 99999999999999865 7778999999999999999988766542 23334555678999998755 56655 Q ss_pred C---ccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCC-CCHHHHHHHHHHHHHHhCC-cccCcc-eec Q lcl|NC_019719. 189 T---GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV-LTEQQRSQVEENFKEIAGG-PVKKRL-WIL 262 (424) Q Consensus 189 ~---~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~g~~-~~l 262 (424) + +.+|+||+.++..++.+..++++++.++|+||++|+|||+++++. 
.++++.+++++.|++.+++ .|+|++ +++ T Consensus 250 d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl 329 (563) T protein:vir:95 250 ELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVM 329 (563) T ss_pred CcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEc Confidence 4 788999999999999999999999999999999999999987653 5888999999999986654 688886 789 Q ss_pred CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCC---------ccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 263 EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST---------SWGSGIEQQNLGFLQYTLQPYISR 333 (424) Q Consensus 263 ~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~---------~~~~n~e~~~~~~~~~tl~P~~~~ 333 (424) ++|++|+++++++.|+||+|++++++++||++|||||++||..++++ .+++|+|++.+.|+++||.|++.. T Consensus 330 ~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ 409 (563) T protein:vir:95 330 ADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRF 409 (563) T ss_pred CCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999876543 356789999999999999999999 Q ss_pred HHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHH--HHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchh Q lcl|NC_019719. 334 WENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMK--AMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPIT 411 (424) Q Consensus 334 ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~--~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~~~~ 411 (424) ||++|+++|++..+. .+ .|+ ++++|.+++++.+. 
+++++||||+||+|+++|+||+||||+++.|.++.+++ T Consensus 410 ie~~ln~~L~~~~~~-~~--~~~---f~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~ 483 (563) T protein:vir:95 410 IEDLVNRHIISEYGD-KY--TFQ---FVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTA 483 (563) T ss_pred HHHHHHhhhchhccc-cc--EEE---eccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeeccccccccc Confidence 999999999987553 23 333 46778888888765 46889999999999999999999999999999888775 Q ss_pred hccccCCCcc---------------------------cCC Q lcl|NC_019719. 412 DLGTNKEPRN---------------------------NGA 424 (424) Q Consensus 412 ~~~~~~~~~~---------------------------~ga 424 (424) .......... +++ T Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (563) T protein:vir:95 484 QLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSS 523 (563) T ss_pred ccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCC Confidence 4322111000 000 No 77 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=100.00 E-value=1.4e-75 Score=430.99 Aligned_cols=414 Identities=13% Similarity=0.170 Sum_probs=304.2 Q ss_pred CCCC--cccccCCCCCchHHHHHhh---------ccCcccCcccccccccccccc----cc-------cCcc---cccHH Q lcl|NC_019719. 1 MEEP--KYTIDLRTNNGWWARLQSW---------FVGGRLVTPNQGSQTGPVSAH----GH-------LGDS---SINDE 55 (424) Q Consensus 1 ~~~~--~~~~~~~~~~G~~~~l~~~---------~~~~~~~~~~~~~~~~~~~~~----~~-------~~~~---~~~~~ 55 (424) .+.- --++++ +.|+=.++... +.+.. ......+..++... .. .... 
...-+ T Consensus 14 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~ 89 (563) T protein:vir:99 14 YGNNSTIAQVPI--DEGLQANIKKIEQDNKEYQDLTKSL--YGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLK 89 (563) T ss_pred cccccccceeec--cCChhhhHhhhhccchhHHHHHhhh--ccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHH Confidence 1111 112233 22322222211 11110 01111112222111 00 0000 01123 Q ss_pred HHhhhHHHHHHHHHHHHhhcc-------------CceEEEEecccCccccccccchhhhhhc---cCCCCC-CCHHHHHH Q lcl|NC_019719. 56 RILQISTVWRCVSLISTLTAC-------------LPLDVFETDQNDNRKKVDLSNPLARLLR---YSPNQY-MTAQEFRE 118 (424) Q Consensus 56 ~~~~~~~v~~~i~~ia~~ia~-------------~~~~v~~~~~~~~~~~~~~~~~l~~lL~---~~pN~~-~s~~~f~~ 118 (424) .+..+++|.+||+++++.||. +|+++++++..+..++....|++..+|. ..|||+ +|+++||+ T Consensus 90 ~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~ 169 (563) T protein:vir:99 90 KFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCK 169 (563) T ss_pred HhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHH Confidence 344478889999999998885 6888887777666666666677766553 223333 58999999 Q ss_pred HHHHHHHHcCCeEEEEe--eCCCCceeeEEeecCceEEEEEcCCceE-------EEEEecCceEEecHhHeeEe-ccCCC Q lcl|NC_019719. 119 AMTMQLCFYGNAYALVD--RNSAGDVISLLPLQSANMDVKLVGKKVV-------YRYQRDSEYADFSQKEIFHL-KGFGF 188 (424) Q Consensus 119 ~~~~~~l~~G~a~~~~~--r~~~G~~~~l~~l~~~~v~~~~~~~~~~-------~~~~~~~~~~~~~~~evih~-r~~~~ 188 (424) .++.+++++||+|++++ |+..|++++||||+|.+|++..++++.. +++..+.....|++++|||+ ++++. 
T Consensus 170 ~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~ 249 (563) T protein:vir:99 170 KIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRT 249 (563) T ss_pred HHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCC Confidence 99999999999999865 7778999999999999999988766542 23334555678999998755 56655 Q ss_pred C---ccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCC-CCHHHHHHHHHHHHHHhCC-cccCcc-eec Q lcl|NC_019719. 189 T---GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV-LTEQQRSQVEENFKEIAGG-PVKKRL-WIL 262 (424) Q Consensus 189 ~---~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~g~~-~~l 262 (424) + +.+|+||+.++..++.+..++++++.++|+||++|+|||+++++. .++++.+++++.|++.+++ .|+|++ +++ T Consensus 250 d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl 329 (563) T protein:vir:99 250 ELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVM 329 (563) T ss_pred CcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEc Confidence 4 788999999999999999999999999999999999999987653 5888999999999986654 688886 789 Q ss_pred CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCC---------ccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 263 EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST---------SWGSGIEQQNLGFLQYTLQPYISR 333 (424) Q Consensus 263 ~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~---------~~~~n~e~~~~~~~~~tl~P~~~~ 333 (424) ++|++|+++++++.|+||+|++++++++||++|||||++||..++++ .+++|+|++.+.|+++||.|++.. 
T Consensus 330 ~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ 409 (563) T protein:vir:99 330 ADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRF 409 (563) T ss_pred CCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999876543 356789999999999999999999 Q ss_pred HHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHH--HHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchh Q lcl|NC_019719. 334 WENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMK--AMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPIT 411 (424) Q Consensus 334 ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~--~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~~~~ 411 (424) ||++|+++|++..+. .+ .|+ ++++|.+++++.+. +++++||||+||+|+++|+||+||||+++.|.++.+++ T Consensus 410 ie~~ln~~L~~~~~~-~~--~~~---f~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~ 483 (563) T protein:vir:99 410 IEDLVNRHIISEYGD-KY--TFQ---FVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTA 483 (563) T ss_pred HHHHHHhhhchhccc-cc--EEE---eccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeeccccccccc Confidence 999999999987553 23 333 46778888888765 46889999999999999999999999999999888775 Q ss_pred hccccCCCcc---------------------------cCC Q lcl|NC_019719. 412 DLGTNKEPRN---------------------------NGA 424 (424) Q Consensus 412 ~~~~~~~~~~---------------------------~ga 424 (424) .......... 
+++ T Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (563) T protein:vir:99 484 QLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSS 523 (563) T ss_pred ccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCC Confidence 4322111000 000 No 78 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=100.00 E-value=4.8e-76 Score=433.51 Aligned_cols=391 Identities=11% Similarity=0.078 Sum_probs=292.6 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccccccccccccCccccc----HHHHhhhHHHHHHHHHHHHhhcc Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSIN----DERILQISTVWRCVSLISTLTAC 76 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~v~~~i~~ia~~ia~ 76 (424) |-. |.|.+++..= +.++++. ...+......+.. ++ ...++ .+.+..+++|++||++||+.||+ T Consensus 1 ~~~--~~~~~~~~~~-~~~~~~~-------~~~~~~~~~~~~~--~~-~pp~~~~~La~~~~~n~~v~scI~~ia~~ia~ 67 (540) T protein:vir:41 1 MFN--YHLSIKSLEK-YRAIKGD-------TDSQALKEDRFEE--YV-EPKVHPLVLLSLLQVNPYHASACSIKANDILR 67 (540) T ss_pred CCC--cccChhhccc-hhhhhcc-------ccccccccCCCCc--cc-cCCCCHHHHHHHHHhcHHHHHHHHHHHHHHhc Confidence 322 2333332111 2222211 1111111111111 11 11122 35566789999999999999999 Q ss_pred CceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEE Q lcl|NC_019719. 77 LPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVK 156 (424) Q Consensus 77 ~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~ 156 (424) +|++++.+. +.+.. ..||++||+.+||+.++.+++++||||++++|+..|.+++|+||+|.+|++. 
T Consensus 68 ~~~~i~~~~-----------~~~~~---~lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~ 133 (540) T protein:vir:41 68 TGYLIDGDD-----------GGVEE---LLRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVH 133 (540) T ss_pred CCceEecCc-----------cchhh---hccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEe Confidence 999986432 22322 2499999999999999999999999999999999999999999999999988 Q ss_pred EcCCceEEE--------E-----------EecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 157 LVGKKVVYR--------Y-----------QRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRD 216 (424) Q Consensus 157 ~~~~~~~~~--------~-----------~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 216 (424) .++...+.. + ..+.....++++||||+|.++ .++++|+||+..+..++.+..++++++.+ T Consensus 134 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~ 213 (540) T protein:vir:41 134 RDGSRYMQTWDGIHVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPSILAMQKIDEYNYA 213 (540) T ss_pred EcCceeEeeecCceeeeeecccccceeeccccccceeecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 776543211 0 112234578999999999876 68899999999999999999999999999 Q ss_pred HHhccCCCceeEEcCCCCCCHH---------HHHHHHHHHHHHhCC--cccCcceecC------CCceeeecccChhHHH Q lcl|NC_019719. 
217 FFANGAKSPQILSTGEKVLTEQ---------QRSQVEENFKEIAGG--PVKKRLWILE------AGFSTSAIGVTPQDAE 279 (424) Q Consensus 217 ~~~n~~~p~~vl~~~~~~~~~~---------~~~~~~~~~~~~~~~--~~~g~~~~l~------~g~~~~~l~~~~~d~~ 279 (424) +|+||++|+++|++++...+++ .++.+++.|+....+ .|+|++++++ .|++|++++++++|+| T Consensus 214 ~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~q 293 (540) T protein:vir:41 214 FFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELS 293 (540) T ss_pred HHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcccceeEEecccchhHHH Confidence 9999999999999987665432 235566666654443 5889999984 7999999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchh Q lcl|NC_019719. 280 MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDG 359 (424) Q Consensus 280 ~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~ 359 (424) |+|++++++++||++|||||.+||..+.++++++|+|++.+.|+++||.|++++||++||++|++..+. +++++||.+. T Consensus 294 fle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~-~~~i~f~~~~ 372 (540) T protein:vir:41 294 FREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQEIVSSVLTDFIQLKLDP-GARFVFNEEI 372 (540) T ss_pred HHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC-ceEEEecchh Confidence 999999999999999999999999988888888899999999999999999999999999999876553 5789999999 Q ss_pred hhccCHHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCCCCCCeeeecccccchhhccccCC---Cccc-----CC Q lcl|NC_019719. 360 LLRGDSASRAAFMKAMGEAGLRTINEMRRTD-NLPPLPGGDVAMRQSQYVPITDLGTNKE---PRNN-----GA 424 (424) Q Consensus 360 l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~-G~~p~~~gd~~~~~~n~~~~~~~~~~~~---~~~~-----ga 424 (424) +++.|.++ .+.+++++|++|+||+|+.+ |++| ++|.++.|.|+...+..+..+. 
++.+ .+ T Consensus 373 ll~~D~~~---~~~~lv~~G~lT~NE~Re~L~g~e~--gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~ 441 (540) T protein:vir:41 373 LMESEFVH---NYALLVQCGVLTPSEVREKLFGLDG--GPDMFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYA 441 (540) T ss_pred hcchHHHH---HHHHHHhCCCCCHHHHHHHhCcCcC--CCcccccccccccccccccccccCCCCccccccccc Confidence 99886554 46678999999999999854 4443 4466777877765433221110 0100 11 No 79 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=100.00 E-value=2.8e-76 Score=434.78 Aligned_cols=380 Identities=13% Similarity=0.085 Sum_probs=289.9 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =|||++|.. ++..... .. . .......++.+.++++++|++||++||++||++||++|+.+.. . T Consensus 1 MGlf~~~~~----~~~~~~~-~~------~-~~~~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~-----~ 63 (395) T protein:vir:98 1 MGILDFFSF----KKSGTLS-DD------D-SGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKL-----T 63 (395) T ss_pred CcchhhhcC----CCccccc-cc------c-cchhhhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCc-----c Confidence 578887632 1111100 00 0 1111223567788999999999999999999999999976432 2 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEE--ecCc Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQ--RDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~--~~~~ 171 (424) ..+|++.++|+.+||++||+++||+.++.+++++||||++++++..+.+. +..+.........++... .... 
T Consensus 64 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~ 137 (395) T protein:vir:98 64 ENQKDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGIYVA------DSFTQDKKISGSQFKVSRVQGQTY 137 (395) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCceecC------CcccccccccCcccceeeecCcee Confidence 34689999999999999999999999999999999999999998643322 222222111111112221 2222 Q ss_pred eEEecHhHeeEeccCCCCcc-ccCchHHHHHHHHHH--HHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHH Q lcl|NC_019719. 172 YADFSQKEIFHLKGFGFTGL-VGLSPIAFACKSAGV--AVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK 248 (424) Q Consensus 172 ~~~~~~~evih~r~~~~~~~-~G~s~~~~~~~~i~~--~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~ 248 (424) ...++++||||+|+.+.++. ++.+++......+.. .........+++.++..+.+++.......++++.+..+++++ T Consensus 138 ~~~~~~~evih~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (395) T protein:vir:98 138 EKTFTFDQVIYLKNDNSDLMSKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQENSDGGRQSKSDKDFFK 217 (395) T ss_pred eeEecCccEEEecCCCCCccccccchhhhHHHHHHHHHHHHHHHHHHHHhhccccccccccccccCCcHHHHHHHHHHHH Confidence 46789999999998776653 333444444444433 334445567788888888888887777777788888888898 Q ss_pred HHhCCc--ccCcceecCCCceeeecccC------hhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHH Q lcl|NC_019719. 249 EIAGGP--VKKRLWILEAGFSTSAIGVT------PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 249 ~~~~~~--~~g~~~~l~~g~~~~~l~~~------~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~ 320 (424) ++.++. 
+.++++++++|++|++++++ +.++||.+.+++++++||++|||||++|++ +++|.|++.+ T Consensus 218 ~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~~------~~sn~e~~~~ 291 (395) T protein:vir:98 218 RTVEKIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHG------DIADNQKNYE 291 (395) T ss_pred HHHhhhhcCCcceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------CcccHHHHHH Confidence 888764 44568889999999999854 677899999999999999999999999963 3558999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--C Q lcl|NC_019719. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--G 398 (424) Q Consensus 321 ~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~--g 398 (424) +|+++||.|++.+||++|+++|+++.++.. ..+|+++++++.|.+++++.+++++++||+|+||+|+++|+||++| | T Consensus 292 ~f~~~tl~P~~~~ie~~l~~kll~~~~~~~-g~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~~g 370 (395) T protein:vir:98 292 LLLEGPIESLITNIVDGLEYAIFDKSETLQ-GSFIKVTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPDGLG 370 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCChhhhcC-cceeeehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC Confidence 999999999999999999999999887643 2357888999999999999999999999999999999999999976 9 Q ss_pred CeeeecccccchhhccccCCCcccC Q lcl|NC_019719. 399 DVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) |++++++|++|++..++...++++. 
T Consensus 371 D~~~~~~n~~~~~~~gge~~~~~~~ 395 (395) T protein:vir:98 371 KVLYMTKNYESVLERGGEVDEEVET 395 (395) T ss_pred ceeeecccceecccccCCCCCCCCC Confidence 9999999999998655444444444 No 80 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=100.00 E-value=1.2e-75 Score=431.39 Aligned_cols=390 Identities=12% Similarity=0.065 Sum_probs=291.0 Q ss_pred hHHHHH---hhccCcccCcccccccccccccccccCcccccH----HHHhhhHHHHHHHHHHHHhhccCceEEEEecccC Q lcl|NC_019719. 16 WWARLQ---SWFVGGRLVTPNQGSQTGPVSAHGHLGDSSIND----ERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 16 ~~~~l~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~ 88 (424) +|+..+ +.....................+..+....++. +.+..+++|++||++||++||++||+++.... T Consensus 1 ~~~~~~~i~s~~~~~~i~~~~~~s~~~~~~~~~~~~~pp~~~~~la~l~~~n~~v~scI~~ia~~IA~l~~~~~~~~~-- 78 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKREEVESQALGETRFEEYVEPKVNPLVLLSLLQVNPYHASACSIKANDIIRTGYILEGDDE-- 78 (542) T ss_pred CccccccccccccchhhhhccccccccccccCCccccCCCCHHHHHHHHhhcHHHHHHHHHHHHHHhhCceeeecccc-- Confidence 444222 211111100000000000111111111122333 33455899999999999999999999864321 Q ss_pred ccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEE- Q lcl|NC_019719. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQ- 167 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~- 167 (424) + .+++..||++||+++||+.++.+++++||||++++|+..|.+.+|+|++|.+|++..+++.....+. 
T Consensus 79 --------~---~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d~~~~~~~~~~ 147 (542) T protein:vir:41 79 --------G---VVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKDGSRYRQTWDG 147 (542) T ss_pred --------h---hhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEcCCeeEeeecC Confidence 1 1344569999999999999999999999999999999999999999999999999887654321111 Q ss_pred ------------------ecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeE Q lcl|NC_019719. 168 ------------------RDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQIL 228 (424) Q Consensus 168 ------------------~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl 228 (424) .+.....++++||||+|+++ .++++|+||+..+..++.+..++++++.++|+||++|++|| T Consensus 148 ~~~~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir~~~~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL 227 (542) T protein:vir:41 148 VNITHFKDYRYEGEINPETGEDQDSVGANELVFIHIPSPVCSYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVI 227 (542) T ss_pred CcceeEEeecccccccccccccccccCcccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEE Confidence 11123457889999999876 68899999999999999999999999999999999999999 Q ss_pred EcCCCC---------CCHHHHHHHHHHHHHHhCC--cccCcceecC------CCceeeecccChhHHHHHHHHHHHHHHH Q lcl|NC_019719. 229 STGEKV---------LTEQQRSQVEENFKEIAGG--PVKKRLWILE------AGFSTSAIGVTPQDAEMMASRKFQVSEL 291 (424) Q Consensus 229 ~~~~~~---------~~~~~~~~~~~~~~~~~~~--~~~g~~~~l~------~g~~~~~l~~~~~d~~~~e~~~~~~~~I 291 (424) ++++.. 
.++++.+.+++.|++.+.+ .|+|++++++ +|++|++++++++|++|+|++++++++| T Consensus 228 ~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~I 307 (542) T protein:vir:41 228 TVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDI 307 (542) T ss_pred EeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHH Confidence 987543 3567788888888876543 5788899984 7999999999999999999999999999 Q ss_pred HHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHH Q lcl|NC_019719. 292 ARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAF 371 (424) Q Consensus 292 a~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~ 371 (424) |++|||||.+||..++++++++|+|++.+.|+++||+|++++||++||++|+++.++ +++++|+.+.+++.|.. +. T Consensus 308 a~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~-~~~~~f~~~~ll~~d~~---~~ 383 (542) T protein:vir:41 308 AAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQNIISSILTDFFQVKFNP-KTRFKFNDETLLESDSV---RN 383 (542) T ss_pred HHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC-ceEEEecchhhcchHHH---HH Confidence 999999999999998888888899999999999999999999999999999887765 57889999999887644 45 Q ss_pred HHHHHhCCCCCHHHHHHHhCCCCCCCC-CeeeecccccchhhccccC--CC------cccCC Q lcl|NC_019719. 372 MKAMGEAGLRTINEMRRTDNLPPLPGG-DVAMRQSQYVPITDLGTNK--EP------RNNGA 424 (424) Q Consensus 372 ~~~~~~~g~~T~NE~R~~~G~~p~~~g-d~~~~~~n~~~~~~~~~~~--~~------~~~ga 424 (424) +..++++|++|+||+|+.+ +++|+| |.++.|.|.........++ +. 
+..-+ T Consensus 384 ~~~~v~~GilT~NE~Re~L--~g~~pgdd~~l~p~~~~~~~~~~~~~n~~~~~~~~~~k~~~ 443 (542) T protein:vir:41 384 CALLVQSGVLTPAEARERL--FGLDGGPDIFMVPSKGAAKSVKRQERNYEKNQIREIRKIYA 443 (542) T ss_pred HHHHHhCCCCCHHHHHHhh--CCCCCCCccccccccccccccccCCcCCCCCchhhhhhccc Confidence 6779999999999999753 344444 4455565543221111100 00 00000 No 81 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=100.00 E-value=1.5e-75 Score=430.80 Aligned_cols=378 Identities=12% Similarity=0.149 Sum_probs=296.2 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =|||++++....+.....+.............+.++..++.+.++++|+|++||++||++||++|+++++... T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~~~~~~~------- 73 (386) T protein:vir:49 1 MPIFNITNLATESPPINQESFFDIADSDFLASLNSSEWVSAENALKNSDLFSIISQLSNDLATAKITTSRKQL------- 73 (386) T ss_pred CchhhhhccCCCCcccchhhhhhhhhccccccccCCceechhhhhccHHHHHHHHHHHHHhhhCceeeccchh------- Confidence 4566665332221111111000011122233355678899999999999999999999999999999986542 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC--ceEEEEEe--- Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQR--- 168 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~--~~~~~~~~--- 168 (424) +.|+.+||++||+++||+.++.+++++||||++++|+.+|++++|||++|.+|++..+++ ...|.|.. 
T Consensus 74 -------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~~ 146 (386) T protein:vir:49 74 -------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNITFDDP 146 (386) T ss_pred -------hhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceEEEEEEEcCc Confidence 236789999999999999999999999999999999999999999999999999888654 34555543 Q ss_pred -cCceEEecHhHeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019719. 169 -DSEYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 169 -~~~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~ 246 (424) ++..+.|+++||||+|+++.++ ++|+||+.++..++.+..++.+++.++|+||+.|+++++++....+++ .+++++. T Consensus 147 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~-~~~~~~~ 225 (386) T protein:vir:49 147 HIAPKQHVPQNDILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDF-KTKVSRS 225 (386) T ss_pred cccceeEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHH-HHHHHHH Confidence 3456789999999999988776 899999999999999999999999999999999999999998875554 4555555 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) ++. +..++|+++++++|++|++++.++.|+||+|++++++++||++|||||.+||+...+++ +. 
++..+|+..+ T Consensus 226 ~~~--~~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~---~~-~~~~~~~~~~ 299 (386) T protein:vir:49 226 RQA--MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQS---SL-EMIYNIYFKS 299 (386) T ss_pred HHH--hccCCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccc---hH-HHHHHHHHHH Confidence 554 34688999999999999999999999999999999999999999999999987544332 33 3556889999 Q ss_pred HHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeee-cc Q lcl|NC_019719. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMR-QS 405 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~-~~ 405 (424) |.|++..|+++|+++|++ +++||.+.+++.|...++..+.+++++|++|+||+|++++..++...+.... .. T Consensus 300 i~~~l~~i~~~~~~~l~~-------~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~~~~~~~~ 372 (386) T protein:vir:49 300 VSRYLRPFVSEMSKKLSC-------EVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKELPDGKNP 372 (386) T ss_pred HHHHHHHHHHHHHHHhcc-------hhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCcCcchhcc Confidence 999999999999999865 3678999999999999999999999999999999999987665432221111 11 Q ss_pred cccchhhccccCCCcc Q lcl|NC_019719. 406 QYVPITDLGTNKEPRN 421 (424) Q Consensus 406 n~~~~~~~~~~~~~~~ 421 (424) +..++.. ++ ..++| T Consensus 373 ~~~~~~g-Gd-~~~~~ 386 (386) T protein:vir:49 373 NRTSLKG-GE-INEQD 386 (386) T ss_pred CCCCCCC-CC-CCCCC Confidence 1122211 11 11222 No 82 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=100.00 E-value=2.8e-76 Score=434.78 Aligned_cols=377 Identities=14% Similarity=0.102 Sum_probs=278.5 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccc Q lcl|NC_019719. 
14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =|||++|+...... .+. .... ..-..++.+.|+++++|++||++||++||++||++|++++. . T Consensus 1 Mgl~d~~~~~~~~~---~~~--------~~~~-~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~-----~ 63 (395) T protein:vir:96 1 MGILDFFSFKKSGT---LSD--------DDSG-STTSEKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKL-----T 63 (395) T ss_pred CcchhhhcCCCCcc---ccc--------cccc-cchhhhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCcc-----c Confidence 58887763321110 000 0000 01123567889999999999999999999999999976432 3 Q ss_pred cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCce-EEEEEecCce Q lcl|NC_019719. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV-VYRYQRDSEY 172 (424) Q Consensus 94 ~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~-~~~~~~~~~~ 172 (424) ..+|++.+||+.+||++||+++||+.++.+++++||+|+++.|+..+.+...++.. ....+... .+.+...... T Consensus 64 ~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~v~~~~~~~~ 138 (395) T protein:vir:96 64 ENQKDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGIYVADAFTQD-----KKLSGNKFKVSRVQGQTYE 138 (395) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCceecCCccccc-----cccccceeeeeeeccceee Confidence 45799999999999999999999999999999999999999998654333333221 11111111 1112222235 Q ss_pred EEecHhHeeEeccCCCCc-cccCchHHHHHHH------HHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019719. 173 ADFSQKEIFHLKGFGFTG-LVGLSPIAFACKS------AGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 173 ~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~------i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~ 245 (424) ..++++||||+|+.+.+. .++.+++...... +.....+.++..+++.+++.+.++++..+.... 
+..++ T Consensus 139 ~~~~~~dvih~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~ 214 (395) T protein:vir:96 139 KIFTFDQVIYLKNDNSDLMLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQENSDGGRQP----KSDKD 214 (395) T ss_pred eEeccCceEEecccCCccccccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccccceeeccCchhhH----HHHHH Confidence 679999999999876543 2333333333222 222233457888999999999999876655443 34445 Q ss_pred HHHHHhCCc--ccCcceecCCCceeeecccChhHHHHHHHHHHH------HHHHHHHhCCCHHHhcCCCCCCccchhHHH Q lcl|NC_019719. 246 NFKEIAGGP--VKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQ------VSELARFFGVPPHLVGDVEKSTSWGSGIEQ 317 (424) Q Consensus 246 ~~~~~~~~~--~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~------~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~ 317 (424) +++++.++. +.++++++++|++|++++.++.|+|++|.+++. .++||++|||||++|++ +++|.|+ T Consensus 215 ~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~~------~~sn~e~ 288 (395) T protein:vir:96 215 FFKRTIEKIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHG------DIADNQK 288 (395) T ss_pred HHHHHHHHhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------CCccHHH Confidence 555544432 345688899999999999999999999988776 58999999999999963 3458999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Q lcl|NC_019719. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 318 ~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~ 397 (424) +.++|+++||.|++.+||++|+++|+++.++.. 
..+|+++.++++|.+++++.+.+++++||||+||+|+++|+||+|| T Consensus 289 ~~~~f~~~~L~P~~~~ie~~l~~~Ll~~~e~~~-~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~ 367 (395) T protein:vir:96 289 NYELLLEGPIESLITNIVDGLEYAIFDKSETLE-GSFIKVTGLKNYDLFSISSQADKLISSGFVFIDEVREEIGLPELPD 367 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCChhhhcC-ceeEeecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 999999999999999999999999999877643 2457889999999999999999999999999999999999999976 Q ss_pred --CCeeeecccccchhhccccCCCcccC Q lcl|NC_019719. 398 --GDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 398 --gd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) ||++++|+|++|++..+...+++++. T Consensus 368 ~~gD~~~~~~N~~~~~~~gge~~~~~~~ 395 (395) T protein:vir:96 368 GLGKVLYMTKNYESVLERGGEVDEEVET 395 (395) T ss_pred CCCceeeecccceechhccCCCCCCCCC Confidence 99999999999997744433333333 No 83 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=100.00 E-value=2.2e-75 Score=429.92 Aligned_cols=371 Identities=17% Similarity=0.152 Sum_probs=294.9 Q ss_pred cHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCc--cccccccchhhhhhccCCCCCC--------CHHHHHHHHHH Q lcl|NC_019719. 53 NDERILQISTVWRCVSLISTLTACLPLDVFETDQNDN--RKKVDLSNPLARLLRYSPNQYM--------TAQEFREAMTM 122 (424) Q Consensus 53 ~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~--~~~~~~~~~l~~lL~~~pN~~~--------s~~~f~~~~~~ 122 (424) =.+.+..+++|++||++||++||++|++++.+..... ......++....++..+||+.| ++.+||+.++. 
T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 2334445899999999999999999999986543221 1112233334456777888765 66789999999 Q ss_pred HHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceE---------EEE----------------------EecCc Q lcl|NC_019719. 123 QLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVV---------YRY----------------------QRDSE 171 (424) Q Consensus 123 ~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~---------~~~----------------------~~~~~ 171 (424) +++++||+|++++|+..|.+++|+||+|.+|++..+....+ +.+ ...+. T Consensus 81 ~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDPVFVDADDGSTGT 160 (467) T ss_pred HHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccceeeeeeeeccccccc Confidence 99999999999999999999999999999999877654321 111 01234 Q ss_pred eEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019719. 172 YADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 172 ~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) ...++++||||+|.++ .++++|+||+.++..++.++.++++++.++|+||++|+|+++++....++++.+.+++.|+.. 
T Consensus 161 ~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~ 240 (467) T protein:vir:31 161 SVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGAELTEKGREEMRNLIEDN 240 (467) T ss_pred eeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHhh Confidence 5679999999999776 678999999999999999999999999999999999999999876667889999999999875 Q ss_pred h------------CCcccCcceecCCCceeeecc--------cChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCc Q lcl|NC_019719. 251 A------------GGPVKKRLWILEAGFSTSAIG--------VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS 310 (424) Q Consensus 251 ~------------~~~~~g~~~~l~~g~~~~~l~--------~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~ 310 (424) . |..+++++++++.|+++++++ .+++|+||++++++++++||++|||||.+||..+++++ T Consensus 241 ~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~ 320 (467) T protein:vir:31 241 NEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGAF 320 (467) T ss_pred hcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCc Confidence 5 345788899998887665554 36789999999999999999999999999998776554 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc-ccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHH Q lcl|NC_019719. 311 WGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRT 389 (424) Q Consensus 311 ~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~ 389 (424) ++|+|++.+.|+++||.|+++.||++||++|++..+. 
.+++++|+++.+++.|.+++++.++.++++|++|+||+|++ T Consensus 321 -~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~ 399 (467) T protein:vir:31 321 -STDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDE 399 (467) T ss_pred -ccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 3589999999999999999999999999999987765 46889999999999999999999999999999999999999 Q ss_pred hCCCCCCCCCeee-------ecccccchhhccccCCC-cccCC Q lcl|NC_019719. 390 DNLPPLPGGDVAM-------RQSQYVPITDLGTNKEP-RNNGA 424 (424) Q Consensus 390 ~G~~p~~~gd~~~-------~~~n~~~~~~~~~~~~~-~~~ga 424 (424) +|+||+++++.+- ++++..|.+..+++.++ .++-+ T Consensus 400 ~Gl~pi~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (467) T protein:vir:31 400 FGFEPFPEEHVYGGETLVAEVTGGSGPGGGIGDQIEQLVEDRA 442 (467) T ss_pred hCCCCCCcccccCCcccccccccccCCCCcccCcCCCCCCCcc Confidence 9999996543221 11222222222221111 11111 No 84 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=100.00 E-value=8.7e-76 Score=432.10 Aligned_cols=356 Identities=13% Similarity=0.090 Sum_probs=281.5 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccc-- Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK-- 91 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~-- 91 (424) =|+|+++++++......... . ....+....+++.++|++||++||++||++|+++|++...+... 
T Consensus 1 M~if~~~~~~~~~~~~~~~~--------~-----~~~~~~~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQ--------R-----VTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CchhHHhHhhhhcccccCcc--------e-----eeeeecchhhhhhHHHHHHHHHHHHhHhhCceeeeeeccccccccc Confidence 68999999887543322111 0 11122334467778999999999999999999999887665432 Q ss_pred -cccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCCCCceeeEEeecCceEEEEEcCCceEEEEEec Q lcl|NC_019719. 92 -KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRD 169 (424) Q Consensus 92 -~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~-r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 169 (424) +....||+.+||+.+||++||+++||+.++.+++++||||++++ ++..|.+..+++. . T Consensus 68 ~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~-------------------~- 127 (378) T protein:vir:94 68 LISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDLLFA-------------------N- 127 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEEEEe-------------------c- Confidence 33567999999999999999999999999999999999999854 4555666544432 1 Q ss_pred CceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH---HHHHHHHHH Q lcl|NC_019719. 170 SEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE---QQRSQVEEN 246 (424) Q Consensus 170 ~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~---~~~~~~~~~ 246 (424) ..+.|+++||+|+|.+...+ .+.+++..+...+ ...+++ +.++|+++.+..+.++ ++++++++. 
T Consensus 128 -~~~~~~~~dvih~~~~~~~~-~~~~~~~~~~~~~----------~~~~~~-~~~~g~l~~~~~l~~~~~~~~~e~~~~~ 194 (378) T protein:vir:94 128 -DKKEYKPEELVRLTSPFYIN-EDTSILDNALASI----------QTKLEQ-GKLRGLLKINAFLDIDNTQEYREKALAT 194 (378) T ss_pred -CcEEechhceeeecCcCCcc-cchhHHHHHHHHH----------HHHHhh-CCcccceeeCCcCCHHHHHHHHHHHHHH Confidence 23568899999999654221 2445555554432 233344 4578999998776443 345666777 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) ++++.++.++|+++++++|++|+++++++.++|+ +.++++.++||++|||||.+|++. ..|++.++|+++| T Consensus 195 ~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgvPp~~l~g~--------~~e~~~~~f~~~t 265 (378) T protein:vir:94 195 IKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLGT--------ATQEQQIYFYNST 265 (378) T ss_pred HHHhhcccccccceeccCCceEEEccCChHHhhH-HHHHHHHHHHHHHhCCCHHHhcCC--------chHHHHHHHHHHH Confidence 7777888899999999999999999999999996 778999999999999999999631 3478899999999 Q ss_pred HHHHHHHHHHHHHhhccCcccccc-------ceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC Q lcl|NC_019719. 327 LQPYISRWENSIQRWLIPAKDVGR-------IHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD 399 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~-------~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd 399 (424) |.|++.+||++|+++||++.++.. 
..++||++.++++|.+++++.+.+++++|+||+||+|+++|+||+|||| T Consensus 266 l~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~~ggd 345 (378) T protein:vir:94 266 IIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD 345 (378) T ss_pred HHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 999999999999999999877542 2367999999999999999999999999999999999999999999999 Q ss_pred eeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 400 VAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 400 ~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++++|+|++|++..++++..++++. T Consensus 346 ~~~~~~n~~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:94 346 VYIANLNAVAVKNLSDLQGNRKDVT 370 (378) T ss_pred eeeecccccchhcchhcccccCCCC Confidence 9999999999988887765555433 No 85 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=100.00 E-value=3e-75 Score=429.12 Aligned_cols=355 Identities=13% Similarity=0.090 Sum_probs=276.0 Q ss_pred CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccc-- Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK-- 91 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~-- 91 (424) =|||+++.++++........ + ....++...+++.++|++||++||++||++|+++|++..++... T Consensus 1 M~~f~k~~~~~~~~~~~~~~------~-------~~~~~~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:85 1 MNLFGKVVSFSRGKLNNDTQ------R-------VTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CchhhhhhhhhhcccccCCc------c-------eeeeeccchhhhhHHHHHHHHHHHHhHhhCceeEEEEecccccccc Confidence 68888887776543221110 0 01122334467889999999999999999999999988765433 Q ss_pred -cccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCCCCceeeEEeecCceEEEEEcCCceEEEEEec Q lcl|NC_019719. 
92 -KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRD 169 (424) Q Consensus 92 -~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~-r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 169 (424) +...+||+.++|+.+||++||+++||+.++.+++++||||++++ ++..|.+..+++. T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~--------------------- 126 (378) T protein:vir:85 68 LISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLDLLFA--------------------- 126 (378) T ss_pred ccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEEEEec--------------------- Confidence 23578999999999999999999999999999999999999864 4555655443332 Q ss_pred CceEEecHhHeeEeccC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH---HHHHHHH Q lcl|NC_019719. 170 SEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ---QRSQVEE 245 (424) Q Consensus 170 ~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~---~~~~~~~ 245 (424) ...+.+.++||||++.+ +.++ +.+.+..+... ...++.+ +.++|+++.+..+.++. .++.+++ T Consensus 127 ~~~~~~~~~dvih~~~~~~~~~--~~~~~~~a~~~----------~~~~~~~-~~~~g~l~~~~~l~~~~~~~~~~~~~~ 193 (378) T protein:vir:85 127 NDKKEYKPEELVRLVSPFYINE--DTSILDNALAS----------IQTKLEQ-GKLRGLLKINAFLDIDNTQEYREKALA 193 (378) T ss_pred CCCEEEcccceEEEecCcCccc--hhhHHHHHHHH----------HHHHHhc-CCcceEEEeCCcCCHHHHHHHHHHHHH Confidence 12346778999999843 3333 23334333332 2234444 46899999887764332 2344455 Q ss_pred HHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHH Q lcl|NC_019719. 
246 NFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 246 ~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~ 325 (424) .++.+.++.++|+++++++|++|+++++++.++++ +.++++.++||++|||||.+|++ ++.|++..+|+++ T Consensus 194 ~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~~--------s~~e~~~~~f~~~ 264 (378) T protein:vir:85 194 TIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIELIKSELLTGYFMNENILLG--------TATQEQQIYFYNS 264 (378) T ss_pred HHHHhhcccccccceecCCCceEEeccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC--------CchHHHHHHHHHH Confidence 56666788899999999999999999999999996 77899999999999999999953 2457899999999 Q ss_pred HHHHHHHHHHHHHHhhccCcccccc-c------eeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019719. 326 TLQPYISRWENSIQRWLIPAKDVGR-I------HAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~~-~------~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~g 398 (424) ||.|++.+||++|+++||++.++.. + .+.||++.++++|.+++++.+.+++++|+||+||+|+++|+||+||| T Consensus 265 tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~gG 344 (378) T protein:vir:85 265 TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGG 344 (378) T ss_pred HHHHHHHHHHHHHHhhcCChhhhhhhhhccccceeeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 9999999999999999999887642 2 36799999999999999999999999999999999999999999999 Q ss_pred CeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 399 DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |++++|+|++|++..++++.++++.. 
T Consensus 345 D~~~~~~N~~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:85 345 DIYIANLNAVAVKNLSDLQGSRKDVA 370 (378) T ss_pred CeEeecccccccccchhhcCccCCCC Confidence 99999999999988777654443322 No 86 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=100.00 E-value=2.3e-75 Score=429.79 Aligned_cols=369 Identities=12% Similarity=0.170 Sum_probs=286.4 Q ss_pred CchHHHHHhhccCcccCccccc-ccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQG-SQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~ 92 (424) =|||+++... ...+.... ..........+.++..++.+.++++++|++||++||++||++||++|+... T Consensus 1 Mg~f~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~------ 70 (382) T protein:vir:48 1 MPIFNLATES----PPDNQGGFFDVVDSDFLASLKGNEWVSAETALRNSDLFSIINQLSNDLATVKLITSRKKL------ 70 (382) T ss_pred CccccccccC----CcccccccccchhhhccccccCCcccchHhhhccHHHHHHHHHHHHhhccCceeeecchh------ Confidence 4556554221 11111111 111122233456778899999999999999999999999999999986532 Q ss_pred ccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC--ceEEEEEecC Q lcl|NC_019719. 
93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQRDS 170 (424) Q Consensus 93 ~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~--~~~~~~~~~~ 170 (424) +.|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|+|++|++|++..+.+ ..+|.+..++ T Consensus 71 --------~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~ 142 (382) T protein:vir:48 71 --------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGIYYNITFDD 142 (382) T ss_pred --------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCCeEEEEEEecC Confidence 247789999999999999999999999999999999999999999999999999987653 3456655433 Q ss_pred ----ceEEecHhHeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019719. 171 ----EYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 171 ----~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~ 245 (424) ..+.|+++||||+|+++.++ ++|+||+.++..++.+..+++++..++|+||+.|+++++++....++ +.+++++ T Consensus 143 ~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e-~~~~~~~ 221 (382) T protein:vir:48 143 PRIPPKQHVPQNDVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLD-FKTKLSR 221 (382) T ss_pred ccccceeEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChH-HHHHHHH Confidence 45789999999999988776 89999999999999999999999999999999999999998877554 4455555 Q ss_pred HHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHH Q lcl|NC_019719. 246 NFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 246 ~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~ 325 (424) .+.. +..++|+++++++|++|++++.++.|+||+|.+++++++||++|||||.+||+.+++ ++.+++.+.|++. 
T Consensus 222 ~~~~--~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~----~~~~~~~~~~~~~ 295 (382) T protein:vir:48 222 SRQA--MKQMQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQ----QSSLEMSSDLYSK 295 (382) T ss_pred HHHh--hccCCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc----ccHHHHHHHHHHH Confidence 5554 335789999999999999999999999999999999999999999999999875543 2567889999999 Q ss_pred HHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCC-----CCCCCCe Q lcl|NC_019719. 326 TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLP-----PLPGGDV 400 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~-----p~~~gd~ 400 (424) ||.|+++.|+++|+++|+++.+.. ....+..+.......+.+++++|++|+||+|+.++.. ++++++. T Consensus 296 ~l~p~~~~i~~~l~~~l~~~~~~~-------~~~~~~~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~~ 368 (382) T protein:vir:48 296 AVSRYLRPFLSELSQKLSCDVDAD-------IFPAVDPTGSNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPNGEN 368 (382) T ss_pred HHHHHHHHHHHHHHHHhcChhhhh-------hhhhhccchhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhhc Confidence 999999999999999999876532 2222233444555667788889999999999886432 2344443 Q ss_pred eeecccccchhhccccCCCcc Q lcl|NC_019719. 401 AMRQSQYVPITDLGTNKEPRN 421 (424) Q Consensus 401 ~~~~~n~~~~~~~~~~~~~~~ 421 (424) +.. ++.. ++ +.+++ T Consensus 369 ~~~-----~~~G-Gd-~~~~~ 382 (382) T protein:vir:48 369 PNS-----TLKG-GE-EDGQD 382 (382) T ss_pred CCC-----CCCC-CC-CCCCC Confidence 321 1211 11 11222 No 87 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=100.00 E-value=1.4e-70 Score=403.59 Aligned_cols=413 Identities=15% Similarity=0.161 Sum_probs=304.9 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcc----cCcccccccccccccccccCcccccH---HHHhh-hHHHHHHHHHHHH Q lcl|NC_019719. 
1 MEEPKYTIDLRTNNGWWARLQSWFVGGR----LVTPNQGSQTGPVSAHGHLGDSSIND---ERILQ-ISTVWRCVSLIST 72 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~-~~~v~~~i~~ia~ 72 (424) |..-|=+++-+. ++.--.+.+ .....++.....+......-.-.+++ ..... ++++++||+++++ T Consensus 1 ~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~L~~~~e~~~~~~~~i~~~~~ 73 (651) T protein:vir:99 1 MTDTTGETQETK-------VHVEGLGGEADLAKSPNSTQIPDHRIQSHNVGVNPPYNPDRLAAFLELNETLATGIRKKSR 73 (651) T ss_pred CCCccceeeeeE-------EEeecccccccccccccccccchhhhcccCCCCCCCCCHHHHHHHHhcChHHHHHHHHHhh Confidence 444332221110 000000000 00011111111111111111111233 33344 8999999999999 Q ss_pred hhccCceEEEEecc-cCcc-----ccc-----cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCc Q lcl|NC_019719. 73 LTACLPLDVFETDQ-NDNR-----KKV-----DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGD 141 (424) Q Consensus 73 ~ia~~~~~v~~~~~-~~~~-----~~~-----~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~ 141 (424) .||+++|.+..... ++.. .+. ...++.+..+...+|+.+|+.++++.++.|++.+|++|+.++++..|. T Consensus 74 ~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~g~ 153 (651) T protein:vir:99 74 YEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIEGR 153 (651) T ss_pred hhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCccc Confidence 99999998854221 1111 101 123455556667789999999999999999999999999999999999 Q ss_pred eeeEEeecCceEEEEEcCCc----------------------------------eE------------------------ Q lcl|NC_019719. 142 VISLLPLQSANMDVKLVGKK----------------------------------VV------------------------ 163 (424) Q Consensus 142 ~~~l~~l~~~~v~~~~~~~~----------------------------------~~------------------------ 163 (424) ++.|+++++..+++..++.. .. 
T Consensus 154 pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~ 233 (651) T protein:vir:99 154 PVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIR 233 (651) T ss_pred hhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEE Confidence 99999999987765432110 00 Q ss_pred --------------------EEEEecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019719. 164 --------------------YRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGA 222 (424) Q Consensus 164 --------------------~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~ 222 (424) |.+...+....++++||||||+++ .++++|+||+..+..++.++.++++++.++|+||+ T Consensus 234 ~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG~ 313 (651) T protein:vir:99 234 YREDEESEREPIFVDRETGDVTTGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRDFFDNDT 313 (651) T ss_pred eccCcceeeeeecccceeeeEEEcCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 011112223457899999999887 58999999999999999999999999999999999 Q ss_pred CCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC-----------CceeeecccCh-hHHHHHHHHHHHHHH Q lcl|NC_019719. 223 KSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-----------GFSTSAIGVTP-QDAEMMASRKFQVSE 290 (424) Q Consensus 223 ~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~-----------g~~~~~l~~~~-~d~~~~e~~~~~~~~ 290 (424) +|++||+++.+..++++.+++++.|++..+ |+|++++|+. |++|++++.++ +|+||+|++++++++ T Consensus 314 ~p~gil~~~~~~ls~e~~~~lr~~~~~~~~--nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~e 391 (651) T protein:vir:99 314 IPRMVIKVTGGELSEESKRDLRQMLNGLRE--ESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHE 391 (651) T ss_pred CCceEEEecCCCCCHHHHHHHHHHHHHHhc--cCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHHHHH Confidence 999999998776799999999999998775 6788998865 99999999877 599999999999999 Q ss_pred HHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccc-c--ceeeecchhhhccCHHH Q lcl|NC_019719. 
291 LARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG-R--IHAEHNLDGLLRGDSAS 367 (424) Q Consensus 291 Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~--~~~~fd~~~l~~~d~~~ 367 (424) ||++|||||.+||..++++ ++|+|++.+.|+++||+|++.+||++||++|+++.++. + ++++|+.+.+++.|.++ T Consensus 392 Ia~afgVPp~~lG~~~~~~--~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef~~~~llr~D~~~ 469 (651) T protein:vir:99 392 IAKVLEVPPVKIGVTDSAN--RSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALGVTDWTIEYELRGADQPKQEAQL 469 (651) T ss_pred HHHHhCCCHHHhccCCCCC--cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCceEEEEeccchhhhccHHH Confidence 9999999999999887654 55999999999999999999999999999999987753 3 56788889999999999 Q ss_pred HHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeeeecccccchhhccccC-------CCcccCC Q lcl|NC_019719. 368 RAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAMRQSQYVPITDLGTNK-------EPRNNGA 424 (424) Q Consensus 368 ~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~--gd~~~~~~n~~~~~~~~~~~-------~~~~~ga 424 (424) +++.+..++++||||+||+|+++|+||+++ ||..+.+.+...+....+.. .++++.. T Consensus 470 ~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~gge~~~~~~~~~~~~~ 535 (651) T protein:vir:99 470 AEQRVRAMRLAGVGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAGGGETEAVHEPPEENKI 535 (651) T ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccccccccccCCCCcccccCcccccc Confidence 999999999999999999999999999954 89888887766554322111 1111111 No 88 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=100.00 E-value=1.4e-68 Score=392.65 Aligned_cols=408 Identities=12% Similarity=0.064 Sum_probs=278.6 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCc-------------------------------cccccccc--cccccccc Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVT-------------------------------PNQGSQTG--PVSAHGHL 47 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~-------------------------------~~~~~~~~--~~~~~~~~ 47 (424) |.... 
=.+|||+|+...|+...-.. |....... .....+.. T Consensus 1 ~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~ 74 (648) T protein:vir:79 1 MARKV------WGRGFWSRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGG 74 (648) T ss_pred Cccch------hcchhhhhhhhhccCccccccccccccccccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcC Confidence 32211 24699999999998432110 00000000 00000001 Q ss_pred Cc-----cccc----HHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHH Q lcl|NC_019719. 48 GD-----SSIN----DERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFRE 118 (424) Q Consensus 48 ~~-----~~~~----~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~ 118 (424) ++ ..++ .+.+..+|.|++||++||++||++||.++.++.+. ...++.. ++..+||++||+++||+ T Consensus 75 g~~~~~epp~d~~~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~~-----~~~~~~~-~ll~rPn~~~t~~~f~~ 148 (648) T protein:vir:79 75 GGRDFEEPEFDFNEITSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPNA-----VEYIRMR-FTLMAEATQIPTNQLFI 148 (648) T ss_pred CccccccCCcCHHHHHHHHhcChHHHHHHHHHHHHHhhCcceEEecCCcc-----chhhHHH-HHhhccCCCCCHHHHHH Confidence 11 1122 24455799999999999999999999986554321 2223433 34569999999999999 Q ss_pred HHHHHHHHcCCeEEEEeeCCCCc---------------eeeEEeecCceEEEEEcCCce--EEEEEe--cCceEEecHhH Q lcl|NC_019719. 119 AMTMQLCFYGNAYALVDRNSAGD---------------VISLLPLQSANMDVKLVGKKV--VYRYQR--DSEYADFSQKE 179 (424) Q Consensus 119 ~~~~~~l~~G~a~~~~~r~~~G~---------------~~~l~~l~~~~v~~~~~~~~~--~~~~~~--~~~~~~~~~~e 179 (424) .++.+++++||||++++|+.+|. +.++||++|.+|++..+..+. .|.|.. 
++....|++++ T Consensus 149 ~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~~~Y~y~~~g~~~~~~~~~~d 228 (648) T protein:vir:79 149 EIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMIKGWQQEQEGQDKPQKFKPED 228 (648) T ss_pred HHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcCCCceeeeEEEecCCceeEEecCcc Confidence 99999999999999999999884 478999999999998876543 344443 33456789999 Q ss_pred eeEeccC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCc Q lcl|NC_019719. 180 IFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKR 258 (424) Q Consensus 180 vih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 258 (424) |||||++ +.++++|+||+.++..++.+..+++++..++|+||++|+++++++.+....+..+..++.+.....+.+.++ T Consensus 229 IIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~~~~~e~~k~~~e~~~~~~~~~~i~g 308 (648) T protein:vir:79 229 IVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLEQEGFGAEEGEVDLVRGEVENMDVEG 308 (648) T ss_pred EEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCccchHHHHHHHHHHHHhcccccccc Confidence 9999965 578999999999999999999999999999999999999999986443333444444444444443322222 Q ss_pred ceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 259 LWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSI 338 (424) Q Consensus 259 ~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l 338 (424) ..+....+.+.+. 
.+++|+||++++++++++||++|||||.+||..+++++ ++.+++ ..++..++.|++..++..+ T Consensus 309 g~v~~~~~~i~~~-~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~--stae~~-~~~~~~~i~~l~~~i~~~l 384 (648) T protein:vir:79 309 GMVTTERVNISSI-ASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTASR--STGDNL-SSDFKDRIKALQKVMATFI 384 (648) T ss_pred cccccceeecccc-CCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCccc--hHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 2222222233222 25689999999999999999999999999998765544 355544 4556777888776665554 Q ss_pred Hhh----ccCccc-----cccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC-eeeeccccc Q lcl|NC_019719. 339 QRW----LIPAKD-----VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD-VAMRQSQYV 408 (424) Q Consensus 339 ~~~----l~~~~~-----~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd-~~~~~~n~~ 408 (424) +.. ++.+.. ...++++|+++++++.|.+++++.+.+++++||||+||+|+++|+||+|+|+ ..++..++. T Consensus 385 e~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~~~ 464 (648) T protein:vir:79 385 NEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKLENQAVFLYEHNAISEDEMRELIGRDPVDDGEGRAKMHLQMV 464 (648) T ss_pred HHHHHHHHhhhhhccccccccceEEEeecccchhhHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCccccccccc Confidence 443 322221 1235689999999999999999999999999999999999999999998764 334445544 Q ss_pred chhhccccC----CC-------cccCC Q lcl|NC_019719. 409 PITDLGTNK----EP-------RNNGA 424 (424) Q Consensus 409 ~~~~~~~~~----~~-------~~~ga 424 (424) +........ 
.+ .++.+ T Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~a~~eg 491 (648) T protein:vir:79 465 TIAQATALAALAPTPAGGSSASASGDK 491 (648) T ss_pred cchhccccccCCCCCCCCCCCCccccc Confidence 432211100 00 00000 No 89 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=100.00 E-value=1.4e-62 Score=359.77 Aligned_cols=273 Identities=20% Similarity=0.291 Sum_probs=243.4 Q ss_pred hccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceE Q lcl|NC_019719. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) ||++||++|++++. .+|++.++|+.+||++||+.+||+.++.+++++||||++++|+.+|.+++|||++|++| T Consensus 1 ia~l~~~~~~~~~~-------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v 73 (278) T protein:vir:78 1 MASLPLKMYEDYKV-------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVV 73 (278) T ss_pred CccceeEEEecCcc-------cccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCcee Confidence 99999999976543 36899999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcCCc--eEEEEEe-cCceEEecHhHeeEeccC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE Q lcl|NC_019719. 154 DVKLVGKK--VVYRYQR-DSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) Q Consensus 154 ~~~~~~~~--~~~~~~~-~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~ 229 (424) ++..++++ .+|.+.. 
++....|+++||||+|++ +.++++|+||+..+..++....++++++...+.+ .|+++++ T Consensus 74 ~v~~~~~~~~~~y~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~~--~~~~i~~ 151 (278) T protein:vir:78 74 EMLIENQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQK--PDSFMLK 151 (278) T ss_pred EEEEcCCCceEEEEEEcCCceEEEEccccEEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHhcC--CCcEEEE Confidence 99887543 3444443 446788999999999977 4678999999999999999999999997766555 4788888 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCC Q lcl|NC_019719. 230 TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST 309 (424) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~ 309 (424) .+..+ ++++.+++++.|++..+ ++|+++++++|++++++++++.|++++|.+++++++||++|||||.+||..++++ T Consensus 152 ~~~~l-~~e~~~~~~~~~~~~~~--~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~ 228 (278) T protein:vir:78 152 YGSNV-GKEKRQQVLEDFKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTN 228 (278) T ss_pred eCCCC-CHHHHHHHHHHHHHHhc--cCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC Confidence 87765 67778889999988774 5789999999999999999999999999999999999999999999999877664 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc-ccceeeecchhh Q lcl|NC_019719. 
310 SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV-GRIHAEHNLDGL 360 (424) Q Consensus 310 ~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~-~~~~~~fd~~~l 360 (424) ++|++++.++|++.||+|+++.|+++||++|+++.++ .+++++||+++| T Consensus 229 --~sn~~~~~~~~~~~~l~P~~~~i~~~ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 229 --FAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred --cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhcCCceEEEecccC Confidence 5599999999999999999999999999999999886 469999999999 No 90 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=100.00 E-value=1.2e-59 Score=343.63 Aligned_cols=347 Identities=14% Similarity=0.166 Sum_probs=259.1 Q ss_pred ccCCCCCchHHHHHhhccCcccCc-cc-ccccccccccccccCccccc-HH------------HHhhhHHHHHHHHHHHH Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVT-PN-QGSQTGPVSAHGHLGDSSIN-DE------------RILQISTVWRCVSLIST 72 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~-~~------------~~~~~~~v~~~i~~ia~ 72 (424) |.=++++..-+.-....+...... +. .....+....++....+.+. .. .+++.|.-+.|+..+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~~~~ 80 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPTEHHTDRAAQAEVFSFGDPVEVLDRRELLDYVECMRMGQWYEPPMPWDGLARSFR 80 (368) T ss_pred CCccccccchhccCcccccccccCcchhhccccCceEEEEcCCceeecchhhHHHHHHHHhccchhccCcCHHHHHHHHh Confidence 444444433332222211111110 00 00111111111111112111 11 12222222333332222 Q ss_pred hhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCce Q lcl|NC_019719. 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSAN 152 (424) Q Consensus 73 ~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~ 152 (424) +-+ ... 
.....+|++..++ .+||++||+++|++ ++.+++++||||++++|+..|++++|+|++|.+ T Consensus 81 ~~~-----------~h~-~~~~~~~n~l~l~-~~Pn~~~t~~~f~~-l~~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~ 146 (368) T protein:vir:79 81 AAA-----------HHS-SAVYVKRNILVST-FIPHPLLSRATFER-LVLDWQVFGNAYLERRENVLGGTIRLDTPLAKY 146 (368) T ss_pred hcc-----------ccc-hhhhhhcchhhhh-cCCCcCCCHHHHHH-HHHHHhhcCCeEEEEEEcCCCCEEEEEEeCccc Confidence 211 111 1123457777655 59999999999976 678999999999999999999999999999999 Q ss_pred EEEEEcCCceEEEEEecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC Q lcl|NC_019719. 153 MDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG 231 (424) Q Consensus 153 v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~ 231 (424) |++..+++.. |++..++..+.|++++|||+|.++ .++++|+||+..+..++.+..++..|.+++|+||++|++||..+ T Consensus 147 v~~~~~~~~~-~~~~~~~~~~~~~~~dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~ 225 (368) T protein:vir:79 147 VRRGLDLNTY-FFVQNWQQPYTFAAGSVFHLQEPDINQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMT 225 (368) T ss_pred ceeeccCCEE-EEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC Confidence 9988776654 455566778899999999999876 57899999999999999999999999999999999999999987 Q ss_pred CCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC Q lcl|NC_019719. 
232 EKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~ 306 (424) +...++++.+.+++.|++..|..|+|+++++ ++|++|++++.+++|+||+|.+++++++||++|||||.+||..+ T Consensus 226 ~~~l~~e~~~~lk~~~~~~~G~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~ 305 (368) T protein:vir:79 226 DAAQKQEDVDTLREAMKSAKGPGNFRNLFMYAPNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIP 305 (368) T ss_pred CCCCCHHHHHHHHHHHHHhcCCcccCceeEecCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHccccC Confidence 6667899999999999999999999999998 67899999999999999999999999999999999999999988 Q ss_pred CCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHH Q lcl|NC_019719. 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMG 376 (424) Q Consensus 307 ~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~ 376 (424) +++.+++|+|++.+.|+++||.|+++.|| +++.+|.. .+++|+...|++.|.+.++....+-. T Consensus 306 ~~t~~~sn~e~~~~~f~~~~l~Pl~~~ie-~ln~~l~~------e~~rF~~~~l~~~D~~a~a~~~~rsa 368 (368) T protein:vir:79 306 NNTGGFGDVEKAAMVFARNEVKPLQDRLL-AINDWIGD------EVVRFAPYALGGHDQPAAAPGGQRSA 368 (368) T ss_pred CCCCccccHHHHHHHHHHHHHHHHHHHHH-HHHhccCc------ceeeechhHhhcccccccCCcccccC Confidence 88888899999999999999999999998 67877733 25789999999999998876322211 No 91 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=100.00 E-value=3.6e-57 Score=330.02 Aligned_cols=327 Identities=14% Similarity=0.179 Sum_probs=253.3 Q ss_pred CCCCcccccCCCCCchHHHHH---------hh-ccCcccCcccccccccccccccccCcc----cccH----HHHhhhHH Q lcl|NC_019719. 
1 MEEPKYTIDLRTNNGWWARLQ---------SW-FVGGRLVTPNQGSQTGPVSAHGHLGDS----SIND----ERILQIST 62 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~---------~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~----~~~~~~~~ 62 (424) |...|++-.-+... --..-. .+ |..+.+.-...+. ... ..++..+. .++. +....++. T Consensus 26 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~f~fg~p~~v~~~~~~-~~~--~~~~~~~~~~~pp~~~~~La~~~~~~~~ 101 (376) T protein:vir:10 26 MSKRRSRAPRTFAA-APNPSAGSAAPARAEVFTFDDPTPVMNRAEI-LDY--VECWSNGEWFEPPVSFAGLAKSFRASTH 101 (376) T ss_pred chhccCCCcccchh-hhhHhhhccCcceeEEEEcCCceeccCcchh-hhh--hhhhhcCceecCCCCHHHHHHHHhhhHH Confidence 66666653221111 000000 00 1111100000000 000 01111222 2333 33333566 Q ss_pred HHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce Q lcl|NC_019719. 63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV 142 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~ 142 (424) ..+||...++.+++ ..+||++||+.+|++. +.+++++||+|++++|+..|.+ T Consensus 102 h~s~l~~k~n~l~~---------------------------~~~Pnp~lT~~~f~~~-v~d~ll~Gnay~~~~rn~~G~~ 153 (376) T protein:vir:10 102 HSSALFFKANVLAS---------------------------TFRPHRWLSRHAFERW-ALDFLTFGNGYLERRRNMVGGT 153 (376) T ss_pred hhhhHHHHhHHHHh---------------------------ccCCCCCCCHHHHHHH-HHHHHhcCCeEEEEEECCCCCE Confidence 66677665554432 2479999999999865 5689999999999999999999 Q ss_pred eeEEeecCceEEEEEcCCceEEEEEecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019719. 
143 ISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANG 221 (424) Q Consensus 143 ~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~ 221 (424) ++|+|++|.+|++..+++..+ ++..++....|+++||||+|.++ .++++|+||+.++..++.+..++..|.+++|+|| T Consensus 154 ~~L~pl~~~~vr~~~d~~~~~-~~~~~~~~~~~~~~eViHir~~~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NG 232 (376) T protein:vir:10 154 LRLEPALAKYVRRKADFNGFV-YVNGWQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENG 232 (376) T ss_pred EEEEEeCCcceEEEeeCCeEE-EEEcCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 999999999999998877544 45556677889999999999887 4789999999999999999999999999999999 Q ss_pred CCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019719. 222 AKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFG 296 (424) Q Consensus 222 ~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fg 296 (424) ++|++||..++...++++.+.+++.|++..|..|+++++++ ++|+++++++.+++|+||+|.+++++++||++|| T Consensus 233 a~pggIl~~~d~~l~~e~~~~lr~~~~~~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~ 312 (376) T protein:vir:10 233 SHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHR 312 (376) T ss_pred CCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhC Confidence 99999999877667899999999999998888899998887 5789999999999999999999999999999999 Q ss_pred CCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHH Q lcl|NC_019719. 297 VPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSAS 367 (424) Q Consensus 297 VP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~ 367 (424) |||.++|..++++.+++|+|++.+.|+++||.|+++.||+ +|.+|..+ .++|+...|+++|.+. 
T Consensus 313 VPp~llGi~~~~t~~~sn~eq~~~~f~~~~L~Pl~~~iee-ln~~L~~~------~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 313 VPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLGEE------VVRFDDYEIPPAPVAA 376 (376) T ss_pred CCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcccc------ccccChhHhhcccccC Confidence 9999999998888888999999999999999999999984 88777332 4789999999999988 No 92 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=100.00 E-value=3.2e-57 Score=330.30 Aligned_cols=323 Identities=15% Similarity=0.218 Sum_probs=246.8 Q ss_pred ccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHH----------------HHH Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVS----------------LIS 71 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~----------------~ia 71 (424) |.=++++ ...........+....++....+.|- ...++..|+. -+| T Consensus 1 m~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~p~~~~-----~~~~~~~~~~~~~~~~~~~~pp~~~~~la 62 (346) T protein:vir:10 1 MKKQLRK-------------NLTQNDRLQPQAQTEIFSFGDPIPVL-----DRADILNYLECSAMYEKWYNPPMSFDGLA 62 (346) T ss_pred CCcccCC-------------CCCcccccccccCeEEEecCCcceec-----CchhHHHHHHHhhcCCceEecCCCHHHHH Confidence 2222111 11110000001111111111112211 1111111111 122 Q ss_pred HhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCc Q lcl|NC_019719. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSA 151 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~ 151 (424) +.+-..+.+- ..-....|.+..++. +||++||+.+|++ ++.+++++||||++++|+..|++++|+|++|. 
T Consensus 63 ~l~~~~~~h~--------~~i~~k~n~l~~l~~-~Pn~~~t~~~f~~-~~~d~ll~Gnay~~i~r~~~G~~~~L~pl~~~ 132 (346) T protein:vir:10 63 KSLRSSTHHE--------SAIITKANILLSTCE-VDSRYLSRRDLSS-FVKDYLVFGNAYFEVVRNRLGQVQRIESPLAK 132 (346) T ss_pred HHHHhhhhcc--------hhhhhhhhhHHHHHh-CCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEecCC Confidence 2222222210 011133567777664 8999999999987 56789999999999999999999999999999 Q ss_pred eEEEEEcCCceEEEE-EecCceEEecHhHeeEeccCCC-CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE Q lcl|NC_019719. 152 NMDVKLVGKKVVYRY-QRDSEYADFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) Q Consensus 152 ~v~~~~~~~~~~~~~-~~~~~~~~~~~~evih~r~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~ 229 (424) +|++..+++..+|.+ ..++....|+++||||+|.+++ ++++|+||+..+..++.+..+++.+.+++|+||++|++||+ T Consensus 133 ~v~~~~~~~~~~~~~~~~~g~~~~~~~~dIih~r~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~ 212 (346) T protein:vir:10 133 YVRKGLEAGQFYYVPQRFDHQEHEFAKGSIYHLLEPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFY 212 (346) T ss_pred ceEEEEcCCeEEEEEEccCCeEEEEecccEEEecCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 999988877765544 4567789999999999998875 78999999999999999999999999999999999999999 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-----CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcC Q lcl|NC_019719. 230 TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-----AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGD 304 (424) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~-----~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~ 304 (424) .++...++++.+.+++.|++..|+.|+++++++. .|+++++++.+++|+||+|.+++++++||++|||||.+||. 
T Consensus 213 ~~d~~l~~e~~~~i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~ 292 (346) T protein:vir:10 213 MSDASQKQEDVENIRQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGI 292 (346) T ss_pred eCCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcc Confidence 8776678999999999999999999999999885 47899999999999999999999999999999999999999 Q ss_pred CCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCH Q lcl|NC_019719. 305 VEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDS 365 (424) Q Consensus 305 ~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~ 365 (424) .++++.+++|+|++.+.|++++|.|+++.||+ ++.+|.. ..++|+..+|++.|. T Consensus 293 ~~~~~~~~s~~e~~~~~f~~~~l~P~~~~iee-~n~~L~~------e~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 293 IPNNTGGFGNVADAAEVFFITEIEPLQERLKE-FNQWLGQ------EVIKFKPSKLLQRTQ 346 (346) T ss_pred cCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhccc------ceeeechhhhcccCC Confidence 98888888899999999999999999999985 6666643 257899999999988 No 93 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=100.00 E-value=1.2e-56 Score=327.10 Aligned_cols=332 Identities=15% Similarity=0.141 Sum_probs=243.4 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccC--- Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACL--- 77 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~--- 77 (424) |.+....+.-++... ....++.. +. ++-.+...+++.|+.++.+..... 
T Consensus 1 ~~~~~~~~~~~~~~~------------------------~~~~~~~~-~~---p~~~~~~~~~~~~~~~~~~~~~~~~ep 52 (348) T protein:vir:26 1 MTEQLIHSHTTDGTE------------------------SKSVYSFD-PN---PEPVDTNSWMTRYCELFYNDFDDYWEP 52 (348) T ss_pred CCccccchhhccccC------------------------CceEEEec-CC---CeeecCcchHHHHHHHHhcCCCccccC Confidence 433332222111110 00000100 00 111223334444444444333221 Q ss_pred ceEE------EEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCc Q lcl|NC_019719. 78 PLDV------FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSA 151 (424) Q Consensus 78 ~~~v------~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~ 151 (424) |+.. ++...--..--....+-+.. ..+||++||..+|++. +.+++++||||++++|+..|++++|+|++|. T Consensus 53 p~~~~~La~l~~~n~~h~~~i~~k~N~l~~--~~~Pn~~~t~~~f~~~-~~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~ 129 (348) T protein:vir:26 53 PISLKGLAEIANANGYHGSLLKARANYVAG--RFMNGGGLPMYKMNSA-CWDYFGLGMSAFVKIRSYLKNVIALEPLPMV 129 (348) T ss_pred CCCHHHHHHHHhhhhhhhhhHhhhhhHHhh--cccCCCCCCHHHHHHH-HHHHHhcCCeEEEEEEcCCCcEEEEEEecCc Confidence 2111 00000000000000001111 2379999999999775 5699999999999999999999999999999 Q ss_pred eEEEEEcCCceEEEEEecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEc Q lcl|NC_019719. 152 NMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILST 230 (424) Q Consensus 152 ~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~ 230 (424) +|++..++. +|.+..++..+.|+++||||+|.++ .++++|+||+..+..++.+..++..+.+++|+||++|++||.. 
T Consensus 130 ~v~~~~d~~--~~~~~~~g~~~~f~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~ 207 (348) T protein:vir:26 130 HMRKRKNGD--FVQLLRNNEQKVFKAKDVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYA 207 (348) T ss_pred eeEeeecCc--EEEEEecCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEe Confidence 999987654 5667778888999999999999877 4789999999999999999999999999999999999999988 Q ss_pred CCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCC Q lcl|NC_019719. 231 GEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDV 305 (424) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~ 305 (424) +....++++++.+++.|++..|+.|.++++++ ++|+++++++.+++|+||++.+++++++||++|||||.++|.. T Consensus 208 ~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~ 287 (348) T protein:vir:26 208 TDPNLSEADEKALKEKIASSKGIGNFRSMFVNIPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGML 287 (348) T ss_pred cCCCCCHHHHHHHHHHHHHhcCcccccceeEEcCCCCccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHcccc Confidence 77667899999999999999888899999888 7899999999999999999999999999999999999999988 Q ss_pred CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhh-hccCHHHHHHH Q lcl|NC_019719. 306 EKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGL-LRGDSASRAAF 371 (424) Q Consensus 306 ~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l-~~~d~~~~~~~ 371 (424) ..++.+++|+|++.+.|+++||.|+++.||++||++|..+.+ .+++||++.. .+.|.. .. 
T Consensus 288 ~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~---~~~~fdl~~~~e~~~~~---a~ 348 (348) T protein:vir:26 288 PQQGANVPDPLKVSQVYDFYEVIPVCKRFMDAVNNDPEIPDN---LKLKFNLNPGVESANGS---AV 348 (348) T ss_pred CCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhhCCCCc---cEEEEecCcccccchhh---cC Confidence 777778889999999999999999999999999999865543 3567776632 222222 11 No 94 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=100.00 E-value=2.2e-56 Score=325.71 Aligned_cols=327 Identities=14% Similarity=0.183 Sum_probs=250.3 Q ss_pred CCCCcccccCCCCCchHHH---HH-----hhccCcccCccccccccc-ccc-cccccCcc----cccH----HHHhhhHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWAR---LQ-----SWFVGGRLVTPNQGSQTG-PVS-AHGHLGDS----SIND----ERILQIST 62 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~---l~-----~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~----~~~~----~~~~~~~~ 62 (424) |...|++=.-+. ..-=+. .. ..|.-+ +|..-.... .+. ...+..+. .++. +.+..++. T Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~---~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~ 76 (351) T protein:vir:79 1 MSKRRSRAPRTF-AAAPNPSAGSAAPARAEVFTFD---DPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTH 76 (351) T ss_pred CCCCCCCCCCCC-CCCCchhhhhcccceeEEEEcC---CceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHh Confidence 777666522111 000000 00 001000 111000000 000 01112222 2232 22223444 Q ss_pred HHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce Q lcl|NC_019719. 
63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV 142 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~ 142 (424) ..+||...++.+++ ..+||++||..+|++ ++.+++++||||++++|+..|.+ T Consensus 77 h~~~l~~k~n~l~~---------------------------~~~Pnp~~t~~~f~~-~v~d~ll~Gnay~~~~r~~~G~~ 128 (351) T protein:vir:79 77 HSSALFFKANVLAS---------------------------TFRPHRWLSRHAFER-WALDFLTFGNGYLERRRNMVGGT 128 (351) T ss_pred hhhhhhhhhhHHhh---------------------------cccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCE Confidence 45555444333322 247999999999975 66799999999999999999999 Q ss_pred eeEEeecCceEEEEEcCCceEEEEEecCceEEecHhHeeEeccCCC-CccccCchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019719. 143 ISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANG 221 (424) Q Consensus 143 ~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~ 221 (424) ++|+|++|.+|++..+++.. +++..++....|+++||||+|.++. ++++|+||+..+..++.+..++..|.+++|+|| T Consensus 129 ~~L~~l~~~~v~~~~~~~~~-~~~~~~g~~~~~~~~eIihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NG 207 (351) T protein:vir:79 129 LRLEPALAKYVRRKADFSGF-VYVNGWQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENG 207 (351) T ss_pred EEEEEeCCcceeeeecCCeE-EEEecCceEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 99999999999998887764 4555667788999999999998874 789999999999999999999999999999999 Q ss_pred CCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019719. 
222 AKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFG 296 (424) Q Consensus 222 ~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fg 296 (424) ++|++|++.+....++++.+.+++.|++..|..|+++++++ ++|+++++++.+++|+||+|.+++++++||++|| T Consensus 208 a~pg~il~~~~~~ls~e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~ 287 (351) T protein:vir:79 208 SHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHR 287 (351) T ss_pred CCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhC Confidence 99999999887667899999999999998888899998887 5789999999999999999999999999999999 Q ss_pred CCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHH Q lcl|NC_019719. 297 VPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSAS 367 (424) Q Consensus 297 VP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~ 367 (424) |||.++|..++++.+++|+|++.+.|+++||.|+++.||+ +|.+|.. ..++|+..+|+++|.+. T Consensus 288 VPp~llGi~~~~t~~~~n~e~~~~~f~~~~l~Pl~~~ie~-ln~~lg~------~~~~F~~~~llr~d~~a 351 (351) T protein:vir:79 288 VPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLGD------EVVTFDDYEIPPAPVAA 351 (351) T ss_pred CCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcCc------ceeeeChhhhccccccC Confidence 9999999988888888999999999999999999999985 7766622 25799999999999988 No 95 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=100.00 E-value=3.2e-56 Score=324.81 Aligned_cols=327 Identities=14% Similarity=0.194 Sum_probs=249.8 Q ss_pred CCCCcccccCCCCCchHHH---HH-----hhccCcccCccccccccc-ccc-cccccCcc----cccHH----HHhhhHH Q lcl|NC_019719. 
1 MEEPKYTIDLRTNNGWWAR---LQ-----SWFVGGRLVTPNQGSQTG-PVS-AHGHLGDS----SINDE----RILQIST 62 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~---l~-----~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~----~~~~~----~~~~~~~ 62 (424) |...|++=.-+. ..-=+. .. ..|.-+ +|..-.... .+. ...+..+. .++.. .+..++. T Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~---~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~ 76 (351) T protein:vir:78 1 MSKRRSRAPRTF-AAAPNPSAGSAAPARAEVFTFD---DPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTH 76 (351) T ss_pred CCCCCCCCCCCC-CCCCchhhhhcccceeEEEEcC---CceeecCcchhhhhhhhhccCceecCCCCHHHHHHHHhhhHh Confidence 776666522111 000000 00 001000 111000000 000 01112222 22322 2223444 Q ss_pred HHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce Q lcl|NC_019719. 63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV 142 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~ 142 (424) ..+||...++.+++ ..+||++||..+|++ ++.+++++||+|++++|+..|++ T Consensus 77 h~~~l~~k~n~l~~---------------------------~~~Pn~~~t~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~ 128 (351) T protein:vir:78 77 HSSALFFKANVLAS---------------------------TFRPHRWLSRHAFER-WALDFLTFGNGYLERRRNMVGGT 128 (351) T ss_pred hhhhhhhhhhHHhh---------------------------cccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCE Confidence 45555444433322 247999999999975 55789999999999999999999 Q ss_pred eeEEeecCceEEEEEcCCceEEEEEecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019719. 
143 ISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANG 221 (424) Q Consensus 143 ~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~ 221 (424) ++|+|+++.+|++..+++..+| +..++....|+++||||+|.++ .++++|+|++..+..++.+..++..|.+++|+|| T Consensus 129 ~~L~pl~~~~v~~~~~~~~~~~-~~~~~~~~~~~~~eVihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NG 207 (351) T protein:vir:78 129 LRLEPALAKYVRRKADFSGFVY-VNGWQERHEFAPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENG 207 (351) T ss_pred EEEEEecCcceEEeeeCCeEEE-EecCCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 9999999999999988876544 4456677889999999999887 5789999999999999999999999999999999 Q ss_pred CCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019719. 222 AKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFG 296 (424) Q Consensus 222 ~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fg 296 (424) ++|++||..++...++++.+.+++.|++..|..|+++++++ ++|+++++++.+++|+||+|.+++++++||++|| T Consensus 208 a~pggIl~~~~~~ls~e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~ 287 (351) T protein:vir:78 208 SHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHR 287 (351) T ss_pred CCCceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhC Confidence 99999999877667899999999999999888999999887 5789999999999999999999999999999999 Q ss_pred CCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHH Q lcl|NC_019719. 297 VPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSAS 367 (424) Q Consensus 297 VP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~ 367 (424) |||.++|..++++.+++|+|++.+.|++++|.|+++.||+ ++.+|.. .+++|+..+|+++|.+. 
T Consensus 288 VPp~llGi~~~~t~~~sn~e~~~~~f~~~~l~P~~~~iee-~n~~l~~------~~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 288 VPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLGD------EVVRFDDYEIPPAPVAA 351 (351) T ss_pred CCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcCc------cceecChhhhccccccC Confidence 9999999988888888999999999999999999999995 6666632 25899999999999988 No 96 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=100.00 E-value=5.2e-56 Score=323.67 Aligned_cols=324 Identities=17% Similarity=0.243 Sum_probs=241.1 Q ss_pred ccCCCCCchHHHHHhhccCcccC-ccccc---ccccccccccc---cC-cccccHHHHhhhHHHHHHHHHHHHhhcc--C Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLV-TPNQG---SQTGPVSAHGH---LG-DSSINDERILQISTVWRCVSLISTLTAC--L 77 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~-~~~~~---~~~~~~~~~~~---~~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~--~ 77 (424) |.=++.+ ....... .+... +...|...... .. .+......+++.|.-+.++..+.++-+. . T Consensus 1 m~~~~~~---------~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~a~~~h~s 71 (340) T protein:vir:98 1 MSKRKPR---------KAVAMTASAPQKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSFSGLAKSLRSAVHHSS 71 (340) T ss_pred CCCCCCC---------ccccccccCccceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHHhccccch Confidence 3322211 0000000 00000 00011100000 00 0000111122223333333333222211 1 Q ss_pred ceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEE Q lcl|NC_019719. 78 PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKL 157 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~ 157 (424) ++..+ .+.+.. ..+||++||..+|++ ++.+++++||+|++++|+..|++++|+|+++.+|++.. 
T Consensus 72 ~i~~k-------------~n~l~~--~~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~ 135 (340) T protein:vir:98 72 PIYVK-------------RNVLAS--TYIPHPLLSRQDFSR-FALDYLVFGNAFLEQRHSVTGQLIKLLTSPAKYTRRGV 135 (340) T ss_pred hhhhh-------------hhHHhh--ccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceEEEcc Confidence 11111 011111 238999999999975 55799999999999999999999999999999999987 Q ss_pred cCCceEEEEEecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC Q lcl|NC_019719. 158 VGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT 236 (424) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~ 236 (424) +++ .+|++..++....|+++||||+|.++ .++++|+||+..+..++.+..++..+.+++|+||++|++||.+++...+ T Consensus 136 ~~~-~~~~~~~~~~~~~~~~~eViHir~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~il~~~~~~ls 214 (340) T protein:vir:98 136 DDS-VFWFVENFTQPHEFAPDTVFHLLEPDINQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMYVTDPAQS 214 (340) T ss_pred cCc-EEEEEecCCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCC Confidence 765 45677778888899999999999876 5789999999999999999999999999999999999999998877678 Q ss_pred HHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcc Q lcl|NC_019719. 
237 EQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSW 311 (424) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~g~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~ 311 (424) +++.+++++.|++..|..|.++++++ ++|+++++++.+++|+||+|.+++++++||++|||||.++|..++++.+ T Consensus 215 ~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~ 294 (340) T protein:vir:98 215 ATDVESLRDAMRNSKGLGNFKNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGS 294 (340) T ss_pred HHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCc Confidence 99999999999998888899998888 6799999999999999999999999999999999999999998888888 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccC Q lcl|NC_019719. 312 GSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGD 364 (424) Q Consensus 312 ~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d 364 (424) ++|+|++.+.|+++||.|+++.||+ +|.+|..+ .++|+...|++.| T Consensus 295 ~sn~e~~~~~f~~~~l~Pl~~~iee-~n~~L~~e------~~rF~~~~l~~~d 340 (340) T protein:vir:98 295 LGDVEKVAKVFVRNELSPLQDRFRE-VNDWLGME------VIRFKEYTLDNPE 340 (340) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHH-HHhccccc------ccccCccccccCC Confidence 8899999999999999999999995 88887543 3679989999988 No 97 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=100.00 E-value=8.9e-56 Score=322.38 Aligned_cols=322 Identities=15% Similarity=0.144 Sum_probs=245.5 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhcc---C Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTAC---L 77 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~---~ 77 (424) |..+|.. .... ....+...++....+++ +...++..|+.+..+..+. . 
T Consensus 1 m~~~~~~-----------------------~~~~-~~~~~~~~~~~~~p~~~-----~~~~~~~~~~~~~~~~~~~~~~p 51 (337) T protein:vir:78 1 MTKRQQQ-----------------------PAQA-AASSPRPSVVFSMPEAI-----DPTAWMTDYTGVFYNPYGEYYQP 51 (337) T ss_pred CCCcccC-----------------------cccc-cccCceeEEEecCcccc-----cCcchhHhhhhhhhccCcceecC Confidence 2211111 1000 01111112222222222 3344566666665554442 2 Q ss_pred ceEEEEecccCccccccccch-hhhhhccCCCCCCCHH----HHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCce Q lcl|NC_019719. 78 PLDVFETDQNDNRKKVDLSNP-LARLLRYSPNQYMTAQ----EFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSAN 152 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~~~~~-l~~lL~~~pN~~~s~~----~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~ 152 (424) |+... +-.+- ...++ ...+|..+||+.++++ ++++.++.|++++||||++++|+..|++++|+|++|.+ T Consensus 52 P~~~~-----~La~l-~~~~~~h~~~L~~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~~~~L~pl~~~~ 125 (337) T protein:vir:78 52 PIDRK-----GLAKV-ARANAHHGAILMARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQVVGLHPLSSVY 125 (337) T ss_pred CCCHH-----HHHHH-hhcchhhhhHHHhhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCcEEEEEEeCCce Confidence 33211 00000 00011 1335778999877654 68999999999999999999999999999999999999 Q ss_pred EEEEEcCCceEEEEEecCceEEecHhHeeEeccCCC-CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC Q lcl|NC_019719. 153 MDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG 231 (424) Q Consensus 153 v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~ 231 (424) |++..++.. 
+++..++....|+++||||+|.+++ ++++|+||+..+..++.+..+++++.+++|+||++|++|++.+ T Consensus 126 v~~~~d~~~--~~~~~~~~~~~~~~~eIiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~ 203 (337) T protein:vir:78 126 LRRREDGCF--VYLQQGKPNLIYRPDDVIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYAT 203 (337) T ss_pred eEeeeCCeE--EEEEcCCceEEECCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC Confidence 998876543 3445566778999999999998874 7899999999999999999999999999999999999999988 Q ss_pred CCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC Q lcl|NC_019719. 232 EKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~ 306 (424) +...++++.+.+++.|++..|..|.++++++ ++|+++++++.+++|+||+|.+++++++||++|||||.++|... T Consensus 204 ~~~l~~e~~~~lk~~~~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~ 283 (337) T protein:vir:78 204 DPNMDDDTEEEMKEMIANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPALAGIIP 283 (337) T ss_pred CCCCCHHHHHHHHHHHHHhcCcccccceEEEcCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHccccc Confidence 7667899999999999998888888888887 67899999999999999999999999999999999999999877 Q ss_pred CC-CccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhh Q lcl|NC_019719. 307 KS-TSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLL 361 (424) Q Consensus 307 ~~-~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~ 361 (424) ++ +.+++|+|++.+.|+++||.|+++.||++++++|++.... 
..++++...++ T Consensus 284 ~~~~~~~~n~e~~~~~f~~~~L~P~~~~ie~~~n~~ll~~~~~--~~f~~~~~~~~ 337 (337) T protein:vir:78 284 TNGGGGLGDPEKYDATYARNEVLPLCELVQDAINSAGLPRALW--VTFRETIGAAV 337 (337) T ss_pred CCCcCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhcCChhhc--eeccccccccC Confidence 65 4566799999999999999999999999999988876543 24566777776 No 98 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=100.00 E-value=4e-55 Score=318.84 Aligned_cols=324 Identities=15% Similarity=0.195 Sum_probs=236.6 Q ss_pred ccCCCCCchHHHHHhhccCcccCccccccccccccc----------------ccccCcc----cccHHHHhhhHHHHHHH Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSA----------------HGHLGDS----SINDERILQISTVWRCV 67 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~----~~~~~~~~~~~~v~~~i 67 (424) |.=.++.+--..-.-.-.......+........|.. ..+..+. .++.....+ +. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~pp~~~~~la~------~~ 74 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEPPLSMEGLAK------SV 74 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHHHhhcCccccCCCCHHHHHH------HH Confidence 433333221111000000000000000000000000 0011111 122211100 00 Q ss_pred HHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEe Q lcl|NC_019719. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~ 147 (424) -++...+.++..++ +-+. 
...+||++||..+|++ ++.+++++||||++++|+..|++++|+| T Consensus 75 --~~~~~h~~~l~~k~-------------n~l~--~~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~~~rn~~G~~~~L~~ 136 (350) T protein:vir:11 75 --GSSVYLQSGLKFKR-------------NMLA--KTFIPHRLLSRATFEQ-FSLDWLTFGSAYLEQPRSRLGTRMPLQA 136 (350) T ss_pred --hhhhhhccchhhhh-------------hhhh--hcccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCCEEEEEE Confidence 01111111221110 1111 1248999999999986 5679999999999999999999999999 Q ss_pred ecCceEEEEEcCCceEEEEEecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_019719. 148 LQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQ 226 (424) Q Consensus 148 l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~ 226 (424) ++|.+|++..+++. +|++..++....|+++||||+|.++ .++++|+||+.++..++.+..++..|.+++|+||++|++ T Consensus 137 l~~~~vr~~~~~~~-~~~~~~~~~~~~~~~~eVihir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~g 215 (350) T protein:vir:11 137 PLAKYMRRGTDLET-FYQVRSWKDEHEFEKGSVIQLREADINQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSHAGF 215 (350) T ss_pred eCCceeEeeecCCe-EEEEeeCCeEEEECcccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCce Confidence 99999999887765 4667778888999999999999876 457999999999999999999999999999999999999 Q ss_pred eEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_019719. 
227 ILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHL 301 (424) Q Consensus 227 vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~ 301 (424) ||+.++...++++++.+++.|++..|..|+++++++ +.|+++++++.+++|+||+|.+++++++||++|||||.+ T Consensus 216 il~~~~~~ls~e~~~~l~~~~~~~~G~~N~~~~~v~~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~l 295 (350) T protein:vir:11 216 ILYMTDAAQNEEDIDALRTALKTAKGPGNFRNLFVYAPNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQL 295 (350) T ss_pred EEEecCCCCCHHHHHHHHHHHHHhcCccccCceeeecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHH Confidence 999887667899999999999999888999998887 468999999999999999999999999999999999999 Q ss_pred hcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhh Q lcl|NC_019719. 302 VGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGL 360 (424) Q Consensus 302 l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l 360 (424) +|..++++.+++|+|++.+.|+++||.|++++||+ +|.+|..+.. .+.+|++.+| T Consensus 296 lGi~~~~t~~~sn~e~~~~~f~~~~L~P~~~~ie~-ln~~l~~~~~---~F~~~~~~~l 350 (350) T protein:vir:11 296 MGVVPQNAGGFGSISDAAAVWASLELAPMQTRLQQ-VNEMIGEEVV---RFAQFDAPGL 350 (350) T ss_pred hcccCCCCCCcCCHHHHHHHHHHHHHHHHHHHHHH-HHhhcCcccc---ccCcccccCC Confidence 99988888888899999999999999999999995 7888754322 2456788887 No 99 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=100.00 E-value=6e-55 Score=317.83 Aligned_cols=325 Identities=17% Similarity=0.202 Sum_probs=237.2 Q ss_pred ccCCCCCchHHHHHhhccCcccCcccccccccccccc----------cccCcccccHHHHhhhHHHHHHHHHH--HHhhc Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAH----------GHLGDSSINDERILQISTVWRCVSLI--STLTA 75 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~v~~~i~~i--a~~ia 75 (424) |.=++++..-..-...-..... -..-+...|.... ....+.. +.-|.-+.++-.+ |+... 
T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~--~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~------~~pp~~~~~la~~~~a~~~h 72 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKMTASAPK--MEAFTFGEPVPVLDRRDILDYVECISNGRW------YEPPISFTGLAKSLRAAVHH 72 (344) T ss_pred CCcccCCCCCchHHhhcCCcCc--EEEEEcCCceeecCCcchhHHHHhhhcCcc------ccCCCCHHHHHHHHHhhhhh Confidence 5444444322211110000000 0000001111000 0111111 1111111111111 11112 Q ss_pred cCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEE Q lcl|NC_019719. 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDV 155 (424) Q Consensus 76 ~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~ 155 (424) +.++..++ +.+.. ..+||++||+.+| +.++.+++++||||++++|+..|++++|+|++|.+|++ T Consensus 73 ~~~i~~k~-------------n~l~~--~~~Pn~~~t~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~L~~l~~~~vr~ 136 (344) T protein:vir:60 73 SSPIYVKR-------------NILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRR 136 (344) T ss_pred ccchhhhh-------------hHHHh--hccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCcceEEE Confidence 22222211 11222 2489999999999 57889999999999999999999999999999999999 Q ss_pred EEcCCceEEEEEecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCC Q lcl|NC_019719. 156 KLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV 234 (424) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~ 234 (424) ..+++. +|++..++....|+++||||+|.++ .++++|+||+..+..++.+..+++.+.+++|+||++|++||+++... 
T Consensus 137 ~~~~~~-~~~v~~~~~~~~~~~~eIiHir~~~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~~~ 215 (344) T protein:vir:60 137 GVEEDV-YWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAV 215 (344) T ss_pred eecCCe-EEEEccCCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcC Confidence 888764 4566777788899999999999877 57899999999999999999999999999999999999999987666 Q ss_pred CCHHHHHHHHHHHHHHhCCcccCcceec------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCC Q lcl|NC_019719. 235 LTEQQRSQVEENFKEIAGGPVKKRLWIL------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS 308 (424) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~g~~~~l------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~ 308 (424) .++++.+.+++.|++..|+ ++++.+++ ++|+++++++.+++|+||+|.+++++++||++|||||.++|..+++ T Consensus 216 ls~e~~~~ik~~~~~~~g~-~~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~ 294 (344) T protein:vir:60 216 QDRNDIEMLRENMVKSKGR-NNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPEN 294 (344) T ss_pred CCHHHHHHHHHHHHHhcCC-CCCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCC Confidence 7889999999999987765 56777777 5799999999999999999999999999999999999999998888 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCH Q lcl|NC_019719. 309 TSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDS 365 (424) Q Consensus 309 ~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~ 365 (424) +.+++|+|++.+.|+++||.|++++|| ++|.+|..+ .++|+...+...|. 
T Consensus 295 t~~~~n~e~~~~~f~~~~L~Pl~~~~e-~ln~~lg~~------~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 295 VGSLGDIEKVAKVFVRNELIPLQDRIR-EINGWLGQE------VIRFKNYSLDTDNG 344 (344) T ss_pred CCccccHHHHHHHHHHHHHHHHHHHHH-HHHHhcCCc------ccccCccccCCCCC Confidence 888889999999999999999999998 588887432 24566666665655 No 100 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=100.00 E-value=5.2e-55 Score=318.19 Aligned_cols=327 Identities=16% Similarity=0.194 Sum_probs=233.3 Q ss_pred ccCCCCCchHHHHHhhccCccc------Cccccccccc-ccc-cccccCcccccHHHHhhhHHHHHHHHHH--HHhhccC Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRL------VTPNQGSQTG-PVS-AHGHLGDSSINDERILQISTVWRCVSLI--STLTACL 77 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~------~~~~~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~v~~~i~~i--a~~ia~~ 77 (424) |.=++.+---......-..... ..|..-.... .+. ...+..+..+ .=|.-+.++..+ |+..-+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~------~pp~~~~~la~~~~a~~~h~s 74 (344) T protein:vir:56 1 MSKKKGKTPQPAAKTMTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWY------EPPVSFTGLAKSLRAAVHHSS 74 (344) T ss_pred CCCCCCCCCchhhHHhhcCCCceEEEEcCCceeecCcchhhhHHHhhhcCccc------cCCCCHHHHHHHHhhhhhhCc Confidence 3332222000000000000000 0000000000 000 0001111111 111112222222 1111222 Q ss_pred ceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEE Q lcl|NC_019719. 78 PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKL 157 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~ 157 (424) ++..++ +-+.. ..+||++||+.+| +.++.+++++||||++++|+..|++++|+|+++.+|++.. 
T Consensus 75 ~i~~k~-------------n~l~~--~~~Pnp~~t~~~f-~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~ 138 (344) T protein:vir:56 75 PIYVKR-------------NILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGV 138 (344) T ss_pred cceehh-------------hhHHh--hcCCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEEee Confidence 333221 11221 2489999999999 6778899999999999999999999999999999999988 Q ss_pred cCCceEEEEEecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC Q lcl|NC_019719. 158 VGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT 236 (424) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~ 236 (424) +++. +|++..++....|+++||||+|.++ .++++|+||+..+..++.+..+++.+.+++|+||++|++||+++....+ T Consensus 139 ~~~~-~~~~~~~g~~~~~~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d~~ls 217 (344) T protein:vir:56 139 EEDV-YWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQD 217 (344) T ss_pred cCCE-EEEEecCCeEEEEcCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCC Confidence 7765 4566677788899999999999876 5789999999999999999999999999999999999999998776678 Q ss_pred HHHHHHHHHHHHHHhCCcccCcceec------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCc Q lcl|NC_019719. 237 EQQRSQVEENFKEIAGGPVKKRLWIL------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS 310 (424) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~g~~~~l------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~ 310 (424) +++.+.+++.|++..|+ +++++++| ++|+++++++.+++|+||+|.+++++++||++|||||.++|..+.++. 
T Consensus 218 ~e~~~~lk~~~~~~~g~-~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~ 296 (344) T protein:vir:56 218 RNDIEMLRENMVKSKGR-NNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVG 296 (344) T ss_pred HHHHHHHHHHHHHhcCC-CCccceEEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCC Confidence 89999999999987654 67888888 579999999999999999999999999999999999999999888888 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCH Q lcl|NC_019719. 311 WGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDS 365 (424) Q Consensus 311 ~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~ 365 (424) +++|+|++.+.|+++||.|+++.||+ ++.+|..+. ++|+.-.|...|- T Consensus 297 ~~~n~eq~~~~f~~~tL~Pl~~~ie~-~n~~l~~~~------~~F~~y~l~~~~~ 344 (344) T protein:vir:56 297 SLGDIEKVAKVFVRNELIPLQDRIRE-INGWIGQEV------IRFKNYSLDTDNG 344 (344) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHH-HHhhhcccc------ccCCCccccccCC Confidence 88899999999999999999999995 777875432 3454444433333 No 101 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=100.00 E-value=3.3e-54 Score=313.79 Aligned_cols=327 Identities=16% Similarity=0.159 Sum_probs=237.1 Q ss_pred CCCCccccc--CCCCCchHHHHHhhccCcccCccccc-cccccc-ccccccCcccccHHHH----hhhHHHHHHHHHHHH Q lcl|NC_019719. 1 MEEPKYTID--LRTNNGWWARLQSWFVGGRLVTPNQG-SQTGPV-SAHGHLGDSSINDERI----LQISTVWRCVSLIST 72 (424) Q Consensus 1 ~~~~~~~~~--~~~~~G~~~~l~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~~~~----~~~~~v~~~i~~ia~ 72 (424) |..-+-+-. -.+..+.-. ..|..+.+. +... .+.+.+ ...+.+....++.... 
..++...+||...++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~~y~~~~~~~~~~~~epp~~~~~la~~~~~~~~h~~~i~~k~n 76 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPIND---RTFSLSEIT-ASPALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHSRAN 76 (345) T ss_pred CCccccccchhhhcCCCceE---EEeecCCcc-cchhhcccceeeecCCccccCCCCHHHHHHHhhcchhhcchhhhhhh Confidence 221111100 011111000 001111110 0000 000000 0000011111221111 122333333333222 Q ss_pred hhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCce Q lcl|NC_019719. 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSAN 152 (424) Q Consensus 73 ~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~ 152 (424) .+ +...+||++||+.+|++ ++.+++++||||++++|+..|++++|+|++|.+ T Consensus 77 ~l---------------------------~~~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~ 128 (345) T protein:vir:37 77 MV---------------------------SATYEGGKALSKMEMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLY 128 (345) T ss_pred HH---------------------------hhccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEecCce Confidence 22 12348999999999975 457899999999999999999999999999999 Q ss_pred EEEEEcCCceEEE----EEecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCcee Q lcl|NC_019719. 153 MDVKLVGKKVVYR----YQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQI 227 (424) Q Consensus 153 v~~~~~~~~~~~~----~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~v 227 (424) |++..+++..++. 
+...+....|+++||||+|.++ .++++|+||+..+..++.+..++.+|.+++|+||++|++| T Consensus 129 vr~~~d~~~~~~~~~~~~~~~g~~~~~~~~eViHir~~~~~~~~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~I 208 (345) T protein:vir:37 129 LRVHKDGGYSYLMKKSLYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFI 208 (345) T ss_pred eEEeecCCeeEEEeeeeeccCceEEEEccccEEEEcCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceE Confidence 9998888765432 2234667889999999999876 4789999999999999999999999999999999999999 Q ss_pred EEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_019719. 228 LSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLV 302 (424) Q Consensus 228 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l 302 (424) +..+....++++.+.+++.|++..|+.|.+.++++ ++|+++++++.+++|+||++.+++++++||++|||||.++ T Consensus 209 l~~t~~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~li 288 (345) T protein:vir:37 209 LYSTDPDLTEEMEEEIARKISESKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLS 288 (345) T ss_pred EEeCCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHh Confidence 98877667899999999999998888776655555 5689999999999999999999999999999999999999 Q ss_pred cCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhc Q lcl|NC_019719. 
303 GDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLR 362 (424) Q Consensus 303 ~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~ 362 (424) |..++++.+++|+|++.+.|+++||.|++++||+++|+.+ + ....++++||..+|++ T Consensus 289 Gi~~~~t~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~~--e-~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 289 GIIPTNTGGLGDPLKYREVYHYDEVMPLQEIIAETINQDP--E-IKNLLKIKFREQNFAK 345 (345) T ss_pred ccccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHhhhhh--c-cCCcceEEECchhhcC Confidence 9988888888999999999999999999999999999742 1 1235788999999988 No 102 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=100.00 E-value=1.3e-54 Score=315.96 Aligned_cols=323 Identities=17% Similarity=0.215 Sum_probs=235.4 Q ss_pred ccCCCCCchHHHHHhhccCcccCcccc--ccccccccc----------ccccCcccccHHHHhhhHHHHHHHHHH--HHh Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQ--GSQTGPVSA----------HGHLGDSSINDERILQISTVWRCVSLI--STL 73 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~--~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~v~~~i~~i--a~~ 73 (424) |.=++..---..-...- ...+.. -+...|... .++..+..+ .=|.-+.++-.+ |+. T Consensus 1 ~~~~~~~~~~~~~~~~~----~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~------~pp~~~~~la~~~~a~~ 70 (344) T protein:vir:20 1 MSKKKGKTPQPAAKTMT----ASGPKMEAFTFGEPVPVLDRRDILDYVECISNGRWY------EPPVSFTGLAKSLRAAV 70 (344) T ss_pred CCcccCCCCcchhhhhh----ccCCceEEEEcCCceEecCcchhhhhhhhhhcCcee------cCCCCHHHHHHHHhhhh Confidence 33332220000000000 000000 000011000 001111111 111112222111 222 Q ss_pred hccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceE Q lcl|NC_019719. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) ..+.++..++ +-+.. 
..+||++||+.+| +.++.+++++||||++++|+..|++++|+|+++.+| T Consensus 71 ~h~~~i~~k~-------------n~l~~--~~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~v 134 (344) T protein:vir:20 71 HHSSPIYVKR-------------NILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYT 134 (344) T ss_pred hhCccceehh-------------hhHHH--hccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCCcee Confidence 2233333321 11221 2389999999999 577899999999999999999999999999999999 Q ss_pred EEEEcCCceEEEEEecCceEEecHhHeeEeccCCC-CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC Q lcl|NC_019719. 154 DVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE 232 (424) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~ 232 (424) ++..+++. +|++..++....|+++||||+|.++. ++++|+||+..+..++.+..+++.+.+++|+||++|++||+++. T Consensus 135 r~~~~~~~-~~~~~~~~~~~~~~~~eIiHir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d 213 (344) T protein:vir:20 135 RRGVEEDV-YWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTD 213 (344) T ss_pred EeeecCCE-EEEEccCCeEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC Confidence 99887765 45666777889999999999998874 78999999999999999999999999999999999999999877 Q ss_pred CCCCHHHHHHHHHHHHHHhCCcccCcceec------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC Q lcl|NC_019719. 
233 KVLTEQQRSQVEENFKEIAGGPVKKRLWIL------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~ 306 (424) ...++++.+.+++.|++..|+ ++++.+++ ++|+++++++.+++|+||+|.+++++++||++|||||.++|..+ T Consensus 214 ~~l~~e~~~~ik~~~~~~~g~-~n~r~l~l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~ 292 (344) T protein:vir:20 214 AVQDRNDIEMLRENMVKSKGR-NNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKP 292 (344) T ss_pred cCCCHHHHHHHHHHHHHhcCC-CCccceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhccCC Confidence 667899999999999987655 56777777 46999999999999999999999999999999999999999988 Q ss_pred CCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCH Q lcl|NC_019719. 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDS 365 (424) Q Consensus 307 ~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~ 365 (424) +++.+++|+|++.+.|++++|.|+++.|| +++.+|..+ .++|+...|...|. T Consensus 293 ~~t~~~~n~e~~~~~f~~~~l~P~~~~~e-~in~~lg~~------~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 293 ENVGSLGDIEKVAKVFVRNELIPLQDRIR-EINGWLGQE------VIRFKNYSLDTDND 344 (344) T ss_pred CCCCccccHHHHHHHHHHHHHHHHHHHHH-HHHHhcCCc------ccccCccccccCCC Confidence 88888889999999999999999999998 577777432 24576666665555 No 103 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=100.00 E-value=7.5e-54 Score=311.84 Aligned_cols=331 Identities=15% Similarity=0.123 Sum_probs=235.8 Q ss_pred CCCCcccccC--CCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHH--HHhhcc Q lcl|NC_019719. 1 MEEPKYTIDL--RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLI--STLTAC 76 (424) Q Consensus 1 ~~~~~~~~~~--~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~i--a~~ia~ 76 (424) |..-+.+-.= ++....- . ..|..+. |...........+....+. 
++.-|.-+.++-.+ |+...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~--~-~~f~~~~---~~~~~~~~y~~~~~~~~~~------~~epp~~~~~la~l~~~~~~h~ 68 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPIN--D-RTFSLNE---ISASPALDYVGIGFDENYN------CYLPPVNRHALAKLPHQNAQHG 68 (345) T ss_pred CCCCccccchhhcccCcce--e-EEeecCC---cccccchhhhhhhhcCCcc------ccCCCCCHHHHHHHhhcccccc Confidence 2111100000 0000000 0 0010000 0000000000000000111 11111112222222 122222 Q ss_pred CceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEE Q lcl|NC_019719. 77 LPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVK 156 (424) Q Consensus 77 ~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~ 156 (424) -.+..+ .+-+. ...+||++||+.+|++. +.+++++||||++++|+..|++++|+|++|.+|++. T Consensus 69 ~~i~~k-------------~n~l~--~~~~Pn~~lt~~~f~~~-~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~ 132 (345) T protein:vir:37 69 GILHSR-------------ANMVS--SLYEGGKALSRMDMRAL-CLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVR 132 (345) T ss_pred cceeee-------------chHHH--hhccCCCCCCHHHHHHH-HHHHHhcCCeEEEEEEcCCCcEEEEEEEcCceeEEE Confidence 222221 11222 22489999999999865 578999999999999999999999999999999999 Q ss_pred EcCCceEEEE----EecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC Q lcl|NC_019719. 157 LVGKKVVYRY----QRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG 231 (424) Q Consensus 157 ~~~~~~~~~~----~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~ 231 (424) .+++..++.. 
..++....|+++||||+|.++ .++++|+||+..+..++.+..++..|.+++|+||++|++||.++ T Consensus 133 ~d~~~~~~~~~~~~~~~g~~~~~~~~dVihir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~ 212 (345) T protein:vir:37 133 KDGGYSYLMKKSLYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYST 212 (345) T ss_pred EeCCeeEEEEEeEecCCceEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEec Confidence 8877654322 234567889999999999876 57899999999999999999999999999999999999999987 Q ss_pred CCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC Q lcl|NC_019719. 232 EKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~ 306 (424) ....++++.+.+++.|++..|..|.++++++ ++|+++++++.+++|+||+|.+++++++||++|||||.++|..+ T Consensus 213 d~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~p~g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~ 292 (345) T protein:vir:37 213 DPDLTEEMEEEIARKISESKGVGNFRSMFVNIANGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIP 292 (345) T ss_pred CCCCCHHHHHHHHHHHHHhcCcccccceEEEcCCCcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccC Confidence 6667899999999999998888888888777 67999999999999999999999999999999999999999988 Q ss_pred CCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhc Q lcl|NC_019719. 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLR 362 (424) Q Consensus 307 ~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~ 362 (424) .++.+++|+|++.+.|+++||.|+++.||+++|+.+.. 
.....++|+..+|.+ T Consensus 293 ~~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~~~~~---~~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 293 TNTGGLGDPLKYREVYHYDEVMPLQEIIAETINQDPEI---KNLLKIKFREQNFAK 345 (345) T ss_pred CCCCCcccHHHHHHHHHHHHHHHHHHHHHHHhhhhccC---CCcceEEecchhhcC Confidence 88888899999999999999999999999999974321 123567898888766 No 104 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=100.00 E-value=5.5e-51 Score=296.13 Aligned_cols=242 Identities=16% Similarity=0.227 Sum_probs=194.2 Q ss_pred CchHHHHHhhccCcccCcccccc--cccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGS--QTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~ 91 (424) =|||.+. + ++....+.... ....+.......+..++.+.|+++|+|++||++||++||++|+++|++.+ T Consensus 1 MglF~~~---~-~r~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~----- 71 (251) T protein:vir:46 1 MGIFYKN---E-KRDLQYNEDDLQMMVQTLPSFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQ----- 71 (251) T ss_pred CCccccc---c-ccccCCCccchhhhhhhhccccCcCcceechhhhhccHHHHHHHHHHHHhHhhCceEEeeCcc----- Confidence 2333322 1 11111111111 11122233444566789999999999999999999999999999997543 Q ss_pred cccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCce-EEEEE--- Q lcl|NC_019719. 92 KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV-VYRYQ--- 167 (424) Q Consensus 92 ~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~-~~~~~--- 167 (424) ...+||++++|+.+||++||+++||+.++.+++++||||++++|+.+|++++|+||+|.+|++..++++. .|.+. 
T Consensus 72 -~~~~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~ 150 (251) T protein:vir:46 72 -INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRID 150 (251) T ss_pred -ccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCceEEEEECCCCcEEEEEEEec Confidence 2356999999999999999999999999999999999999999999999999999999999999876543 33332 Q ss_pred --ecCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019719. 168 --RDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 168 --~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~ 245 (424) .++....|+++||||+|+++.|+++|+||+.++..++.++.+++++..++|+||++|+|+++++..+.++++++++++ T Consensus 151 ~~~~g~~~~~~~~diiH~r~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~~e~~~~~~~ 230 (251) T protein:vir:46 151 SNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRARE 230 (251) T ss_pred cCCcceeEEECCccEEEecCcCCCCeeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCCHHHHHHHHH Confidence 234567899999999999999999999999999999999999999999999999999999999988888888888988 Q ss_pred HHHHHhCC-cccCcceecCCCcee Q lcl|NC_019719. 246 NFKEIAGG-PVKKRLWILEAGFST 268 (424) Q Consensus 246 ~~~~~~~~-~~~g~~~~l~~g~~~ 268 (424) .|++.+++ +|+|++++ |++= T Consensus 231 ~~~~~~~g~~n~g~~~~---gm~~ 251 (251) T protein:vir:46 231 EFPKVLVELNKLGKLSY---SMNQ 251 (251) T ss_pred HHHHHhcCccccccccc---ccCC Confidence 88887665 68887665 3322 No 105 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=100.00 E-value=1.7e-45 Score=265.96 Aligned_cols=208 Identities=16% Similarity=0.197 Sum_probs=176.0 Q ss_pred EEEEEcCCceEEEEE-----ecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_019719. 
153 MDVKLVGKKVVYRYQ-----RDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQ 226 (424) Q Consensus 153 v~~~~~~~~~~~~~~-----~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~ 226 (424) |++..++. .+|.+. .++....++++||+|+|.++ .++++|+||+.+++.++.+..++++|+.++|+||++|+| T Consensus 1 ~r~~~dg~-~~y~~~~~~~~~~g~~~~~~~~eilH~r~~~~~~~~~Glspi~~a~~~i~~~~aa~~~~~~~f~Ng~~p~g 79 (219) T protein:vir:98 1 MRVCKDGN-YKYLMKKSLYDTKSEIYEYNKNDVIFIKLYDPMQQVYGSPDYVGGITSALLNSDATIFRRRYYSNGAHMGF 79 (219) T ss_pred CceeecCe-EEEEEecceecCCceeEEeccccEEEecCCCCCCCcceecHHHHHHHHHHHHHHHHHHHHHHHhcCCCCce Confidence 44444443 333332 23567889999999999877 689999999999999999999999999999999999999 Q ss_pred eEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_019719. 227 ILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHL 301 (424) Q Consensus 227 vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~ 301 (424) ||+++....++++++++++.|++..|+.|+++++++ +.|++|++++++++|+||+|++++++++||++|||||++ T Consensus 80 il~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fgVPp~~ 159 (219) T protein:vir:98 80 ILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEFANIKNISAQDVLTSHRFPPGL 159 (219) T ss_pred EEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHH Confidence 998887666888999999999998888887666665 568999999999999999999999999999999999999 Q ss_pred hcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccC Q lcl|NC_019719. 
302 VGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGD 364 (424) Q Consensus 302 l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d 364 (424) ||..+.++.+++|+|++.+.|+++||.|++++||++||.+++.+.+ .+++|+.+.+.-.+ T Consensus 160 lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~~~~~~---~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 160 SGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDYEIKSA---LKVNFKQPEKRDKN 219 (219) T ss_pred cccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCc---cEEeecCcccccCC Confidence 9988888888899999999999999999999999999998655443 24667655543333 No 106 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.93 E-value=1.6e-26 Score=162.03 Aligned_cols=387 Identities=10% Similarity=0.007 Sum_probs=231.8 Q ss_pred CCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCc Q lcl|NC_019719. 10 LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDN 89 (424) Q Consensus 10 ~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~ 89 (424) ..+-+|+.+...+. +. +...... .+.......+..+ ...|.+++.++++|+.+|+.+-+.++.+.-. +.+ T Consensus 1 ~~~~D~~~~~~~~~---g~---~~~~~~~-~~~~~~~~~~~~l-~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~--d~~ 70 (437) T protein:vir:52 1 MKFFDGIKSLALKL---GS---KQEQTYY-SPSLSLTDDLVQL-EALWRDNWIANKVCIKRPEDMVRNWREIYSN--DLN 70 (437) T ss_pred CchhhhhHhHHhcC---CC---cccccee-ecCccccccHHHH-HHHHHhCchhhHHhhcchHHhhcCCceEecC--CCC Confidence 34444555533211 11 1111111 0111111222222 2346678999999999999999999988531 111 Q ss_pred cccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC---------CceeeEEeecCceEEEEE--c Q lcl|NC_019719. 
90 RKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA---------GDVISLLPLQSANMDVKL--V 158 (424) Q Consensus 90 ~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~---------G~~~~l~~l~~~~v~~~~--~ 158 (424) .... ..+...+. +- ...+-+..++.+.-++|.|+++++.+.. |.+..+.++++++|.+.. + T Consensus 71 ~~~~---~~~~~~~~-~l----~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~ 142 (437) T protein:vir:52 71 SKQL---DLFTKFER-SL----KLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKD 142 (437) T ss_pred HHHH---HHHHHHHH-hh----cHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhcccccccc Confidence 1111 11222222 11 1234444555555589999999988753 678889999999887422 1 Q ss_pred --------CCceEEEEEecCceEEecHhHeeEeccCC----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_019719. 159 --------GKKVVYRYQRDSEYADFSQKEIFHLKGFG----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQ 226 (424) Q Consensus 159 --------~~~~~~~~~~~~~~~~~~~~evih~r~~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~ 226 (424) +....|.+..++..+.+.++.||||.... .+...|+|.++.+.+.+.....+......++.+...+ T Consensus 143 ~dp~s~~fg~p~~y~v~~~~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~-- 220 (437) T protein:vir:52 143 DDVLSPNFGRYSEYSILGGSQSITVHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKID-- 220 (437) T ss_pred ccccccccCcceEEEEecCCcceeEccceeEEecCccCCCccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCC-- Confidence 23345666666666789999999997532 3567899999999999999999988888877665443 Q ss_pred eEEcCCC--CCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcC Q lcl|NC_019719. 227 ILSTGEK--VLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGD 304 (424) Q Consensus 227 vl~~~~~--~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~ 304 (424) +++++.- .......+.+.+.++......+.+++++++.+.++++++.++.+. 
.+.......+||++++||.++|.+ T Consensus 221 v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~e~~~~~~sgl--~~~l~~~~~~iaaa~~iP~t~L~G 298 (437) T protein:vir:52 221 IFKIAGLSDKIAAGMENEVASVISAVQEIKSATNSLLLDAENEYDRKELTFTGL--KDLLTEFRNAVAGAADMPVTILFG 298 (437) T ss_pred ceecchHHHHhcCCcHHHHHHHHHHHHHhcCCCceEEEcCCcceEEEecCcCCH--HHHHHHHHHHHHHHhcCchhhhcC Confidence 3444421 111112334444555444445567899999999999998877665 477788899999999999999977 Q ss_pred CCCCCccchhHHHHHHHHHH-------HHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHH-------HHH Q lcl|NC_019719. 305 VEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSAS-------RAA 370 (424) Q Consensus 305 ~~~~~~~~~n~e~~~~~~~~-------~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~-------~~~ 370 (424) ...+..+ +.++..+.|+. .-+.|.++.+-..+-+..+.+.. ..+. |.++.|...+.++ +++ T Consensus 299 ~s~~Gla--sge~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~~-~~~~--~~f~pL~~~s~kekae~~~~~a~ 373 (437) T protein:vir:52 299 QSVSGLA--SGDEDIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGGLP-ADWW--FEFVPLTTVKQEQQINMLNTFAT 373 (437) T ss_pred cCccccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-Ccce--EEeCCcCCcCHHHHHHHHHHHHH Confidence 6555443 45666677776 45778887777766655544322 1234 4445777677554 445 Q ss_pred HHHHHHhCCCCCHHHHHHHhC----CCCCCCCCeeeecccccchhhccc--cC-CCc--ccCC Q lcl|NC_019719. 371 FMKAMGEAGLRTINEMRRTDN----LPPLPGGDVAMRQSQYVPITDLGT--NK-EPR--NNGA 424 (424) Q Consensus 371 ~~~~~~~~g~~T~NE~R~~~G----~~p~~~gd~~~~~~n~~~~~~~~~--~~-~~~--~~ga 424 (424) .+.+++++|+++++|+|+++. ++.++..|..-.. +..+.....+ .+ .+. 
..++ T Consensus 374 a~~~~~~~g~i~~~e~r~~L~~~g~~~~i~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 435 (437) T protein:vir:52 374 AANTLIQNGVLNEYQIANELRESGLFANISAEHIEELK-NADEFAGNFEEPEKMEGAQVQNSE 435 (437) T ss_pred HHHHHHhcCCCCHHHHHHHHHhcCCCCCCCcccccccc-CCCCCCCccCCCCCCCCCCCCCCC Confidence 688899999999999999872 3444433321111 1111100000 00 000 0011 No 107 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.88 E-value=2.2e-22 Score=139.30 Aligned_cols=404 Identities=11% Similarity=0.044 Sum_probs=217.7 Q ss_pred CCCCcccccCCCCC-chHHH---HHhhccCcccCc----------ccccccccccccccccCcccccHHHHhhhHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNN-GWWAR---LQSWFVGGRLVT----------PNQGSQTGPVSAHGHLGDSSINDERILQISTVWRC 66 (424) Q Consensus 1 ~~~~~~~~~~~~~~-G~~~~---l~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ 66 (424) +.-..--|.++-.. ++..+ +.+.+..-.... .........+.......+..+ ...|.+++.++++ T Consensus 41 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~a~Y~~~~l~r~i 119 (537) T protein:vir:10 41 QLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFIGHQM-CALIATHWLVNKA 119 (537) T ss_pred HhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhccccCCccHHH-HHHHHhCchhhhh Confidence 11111112221111 11111 000000000000 000000001111111122111 2345678999999 Q ss_pred HHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee-CCC------ Q lcl|NC_019719. 67 VSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR-NSA------ 139 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r-~~~------ 139 (424) |+++|+.+.+-++.+.-.+.+.. +......|...-+....+..|.+.+.+.. 
++|.+++++.- ..+ T Consensus 120 Vd~~A~d~~r~~~~i~~~~~~~~------~~~~~~~l~~~~~~l~~~~~l~~a~~~~r-lyG~~~i~i~v~~~D~~~~~~ 192 (537) T protein:vir:10 120 CSQMPRDAMRKGYKIISDDGNEL------DPKDAKFIDRYDRAFNIKKHAIQFVRKGR-IFGIRIALFKVDSPDPYYYEK 192 (537) T ss_pred hhhhhHHhhcCCceeecCCcccc------cHHHHHHHHHHHHHhhHHHHHHHHHHhcc-cccceEEEEeecCcCCccccc Confidence 99999999999988853322211 11122233333333333444444444444 57888877642 222 Q ss_pred ---------CceeeEEeecCceEEEEEc----CCc---eE---EEEEecCceEEecHhHeeEeccCCC-------Ccccc Q lcl|NC_019719. 140 ---------GDVISLLPLQSANMDVKLV----GKK---VV---YRYQRDSEYADFSQKEIFHLKGFGF-------TGLVG 193 (424) Q Consensus 140 ---------G~~~~l~~l~~~~v~~~~~----~~~---~~---~~~~~~~~~~~~~~~evih~r~~~~-------~~~~G 193 (424) |....|.+++|.++.+... .+. .+ -.|... .+.|.++.|+|+..... .++.| T Consensus 193 Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v~--g~~iH~SRli~f~g~~~p~~~~~~~~~~G 270 (537) T protein:vir:10 193 PFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLIN--GKKYHRSHLAIYINDEVVDFLKPSYIYGG 270 (537) T ss_pred ccccccccccceeEEEEechhhcccccchhhhccCCccccCCceeeeec--CeEecceeEEEecCCCCchhhhcccCccc Confidence 2345788888888775321 111 11 122222 34678999999965431 34579 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC-Cceeeecc Q lcl|NC_019719. 194 LSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-GFSTSAIG 272 (424) Q Consensus 194 ~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~-g~~~~~l~ 272 (424) .|.++.+.+.+............++........-+.....+.++++ +.+.++.+....+..++++++. 
+.++++++ T Consensus 271 ~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~---~~~r~~~~~~~r~n~g~~~id~e~e~~e~~~ 347 (537) T protein:vir:10 271 VPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQ---FDETMSWWTATRDNYQVRVVDKDNEDVVQID 347 (537) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHH---HHHHHHHHHhhcCCcceeEecCCCceeEEEe Confidence 9999999999999888888888877776554322222223344443 3333444333333456788876 58888888 Q ss_pred cChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHH------HHHHHHHHHHHHHHhhccCcc Q lcl|NC_019719. 273 VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY------TLQPYISRWENSIQRWLIPAK 346 (424) Q Consensus 273 ~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~------tl~P~~~~ie~~l~~~l~~~~ 346 (424) .+...+ .+........||.+.|||..+|.+...+..+ ++.+.....|+.. .|.|.++.+.+.+-+..+.+. T Consensus 348 ~~lsgl--~~~l~~~~~~iAa~~~IP~t~L~G~sp~Gln-atGe~D~~~yyd~I~~~Qe~l~p~l~~l~~ll~~~~~~~~ 424 (537) T protein:vir:10 348 TTLNDL--DKVIMNQYQLVCAIARTPAPKMLGTVPTGFN-STGDYEEASYHEECESTQDDMRPLIDRHHQLVCRSHLRKR 424 (537) T ss_pred ccCCCH--HHHHHHHHHHHHhhhCCCceeeccCCccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 776654 4778888889999999999977554322221 1234444555532 478999988887776665432 Q ss_pred ccccceeeecchhhhccCHHHHHH-------HHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeec--------------- Q lcl|NC_019719. 347 DVGRIHAEHNLDGLLRGDSASRAA-------FMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQ--------------- 404 (424) Q Consensus 347 ~~~~~~~~fd~~~l~~~d~~~~~~-------~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~--------------- 404 (424) ..+.|.++.|...|.+++++ .+.+++++|++++||+|+.++.+|..+-+.+... T Consensus 425 ----~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~~ 500 (537) T protein:vir:10 425 ----IRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDVDDEG 500 (537) T ss_pred ----cceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccccCCCChhhhhcccCCccC Confidence 13456667888888887765 4889999999999999999887653222211100 Q ss_pred ------------ccccchhhccc-cCCCcccCC Q lcl|NC_019719. 
405 ------------SQYVPITDLGT-NKEPRNNGA 424 (424) Q Consensus 405 ------------~n~~~~~~~~~-~~~~~~~ga 424 (424) ....+....++ .++++++|| T Consensus 501 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 533 (537) T protein:vir:10 501 KPVRIIEDQPAPSEMFGATSSGESANDPRDSGA 533 (537) T ss_pred CcCCCCCCCCCccccCCCCccccccCCCccCcc Confidence 00001111111 123333344 No 108 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=99.86 E-value=1e-21 Score=135.63 Aligned_cols=395 Identities=11% Similarity=-0.004 Sum_probs=248.8 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) ++.|--++++.|+.|+.+.+.+++.+... |... ++. ....+....-+..++.+.|.+|++.+...|.+++|. T Consensus 1 v~~~~l~~e~at~~~~~d~~~~~~~~l~~--~~~~----il~--~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~~~~w~ 72 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDITRPFISGLQV--PNDS----ILQ--RRGGNDLRVYEEILSDAQVKTVWGQRQLAVVSREWK 72 (488) T ss_pred CCccchhHHHHHHHhhhhhhccccCCCCC--CChH----HHH--hhccCCHHHHHHHhhChHHHHHHHHHHHHHhcCCce Confidence 88888888888888888877777655432 2211 110 111121112244567899999999999999999999 Q ss_pred EEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC---ceeeEEeecCceEEEEE Q lcl|NC_019719. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG---DVISLLPLQSANMDVKL 157 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G---~~~~l~~l~~~~v~~~~ 157 (424) |...+.+ ........-+...|. + ..+.++++.+. +.+++|-++.++++..+| .+..+.+.|+.++.+.. 
T Consensus 73 i~p~~~~--~~~~~~ae~v~~~l~-~----~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~ 144 (488) T protein:vir:99 73 VEAGGDR--PIDQAAAEHLEQQLQ-R----VGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRFRYDQ 144 (488) T ss_pred EEcCCCC--hHHHHHHHHHHHHHh-C----CCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccceeecC Confidence 9644322 111111223344443 3 35677777776 467899999999886543 46789999999888766 Q ss_pred cCCceEEEEEecCceEEecHh-H-eeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC Q lcl|NC_019719. 158 VGKKVVYRYQRDSEYADFSQK-E-IFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL 235 (424) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~-e-vih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~ 235 (424) +++..............++.. . |+|........++|.+.+..+.-.........++...|....|.|-.+.+.+.... T Consensus 145 ~~~l~~~~~~~~~~g~~lp~~~~~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a 224 (488) T protein:vir:99 145 DGGLRLLTPNNMFEGEPCPAPYFWHFSTGADNDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTA 224 (488) T ss_pred CCceEEeccCCCCCccccccCceEEEEeecCCCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCC Confidence 655433222222344555433 2 34444444556899999999999999999999999999999999988888775444 Q ss_pred CHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccC-hhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchh Q lcl|NC_019719. 236 TEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSG 314 (424) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n 314 (424) +++.++.+.+.+.++.. + ..+++|.|++++-+..+ .....|.++.++..++|+.+.-= ..+-.. 
.++.+++ T Consensus 225 ~~~ek~~l~~av~~~~~--~--~~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLG-qtlts~--~~~Gs~a- 296 (488) T protein:vir:99 225 TPEDKAKLLAALHAIQT--D--SAIIMPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVGLG-QVASTQ--GTPGRLG- 296 (488) T ss_pred CHHHHHHHHHHHHHHhc--C--cEEEecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHHhh-hhhccc--ccccchh- Confidence 66666777766666653 2 35666777766555432 22235788888889999887411 112121 1122222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc------ccceeeecchhhhccCHHHHHHHHHHHHhC-CC-CCHHHH Q lcl|NC_019719. 315 IEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV------GRIHAEHNLDGLLRGDSASRAAFMKAMGEA-GL-RTINEM 386 (424) Q Consensus 315 ~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~------~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~-g~-~T~NE~ 386 (424) ..+.........+.-.+..|+..||+.|+.+.-. ...++.|+.. ...|.+.+++.+.++++. |+ ++..++ T Consensus 297 ~~~vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~N~~~~~~p~~~~~~~--e~edl~~~a~~~~~l~~~~G~~i~~~~i 374 (488) T protein:vir:99 297 NDDLQADVRLDLVKADADLICESFNLGPARWLTEWNFPGAQPPRVYRVIE--EPEDITAKAERDEKVFRMSGFRPTRGYV 374 (488) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCCcCCceeEecCC--CcccHHHHHHHHHHHHhhcCCCCCHHHH Confidence 2334445667778888899999998877643211 1123455443 457888999999999986 64 788899 Q ss_pred HHHhCCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 387 RRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 387 R~~~G~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |+.+|+|+-..++....+..... ..+...+.+... 
T Consensus 375 ~e~~Gip~~~~~~~~~~~~~~~~---~~~~~~~~~~~~ 409 (488) T protein:vir:99 375 QETYGVEVESTQAEATAPTPSTE---FAEGDQPSDPAA 409 (488) T ss_pred HHHcCCCCcccccccccCCCccc---CCCCCCCCCchH Confidence 99999997655555443321111 111111111111 No 109 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.84 E-value=3.7e-21 Score=132.58 Aligned_cols=404 Identities=10% Similarity=0.045 Sum_probs=217.8 Q ss_pred CCC----CcccccCCCCCchHHHHHhhccCc--------ccCc-----c-ccccccc------------------ccccc Q lcl|NC_019719. 1 MEE----PKYTIDLRTNNGWWARLQSWFVGG--------RLVT-----P-NQGSQTG------------------PVSAH 44 (424) Q Consensus 1 ~~~----~~~~~~~~~~~G~~~~l~~~~~~~--------~~~~-----~-~~~~~~~------------------~~~~~ 44 (424) |.. |.-.|.. ...|.-.|.++.-..+ ...+ | ....... .+... T Consensus 1 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~ 79 (532) T protein:vir:94 1 MADTDPTPRPEITY-ATLQQAQRVDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEA 79 (532) T ss_pred CCCCCCCCCcceeh-hhhhhHhhhhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCcccccccccccccc Confidence 111 1111100 0112222222111000 0000 0 0000000 00000 Q ss_pred cccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHH Q lcl|NC_019719. 45 GHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQL 124 (424) Q Consensus 45 ~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~ 124 (424) ....+..+ ...|.+++.++++|+.+|+.+-+-.+.+.-.+++.. .......|...-..+ ...+-+..++... 
T Consensus 80 ~~~~~~~l-~a~Y~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~~------~~~~~~~i~~~~~~l-~v~~~l~~a~~~~ 151 (532) T protein:vir:94 80 TSWPGFPT-LALLAQLPEYRTMHETPADECVRAWGKITCSSKDEL------AADKATRITQKLEQY-NVRTLVRTVVIHD 151 (532) T ss_pred cccchHHH-HHHHHcCchhhhhhccchHHHhhCCceEeeCCcccc------chHHHHHHHHHHHhh-hHHHHHHHHHHhh Confidence 01111111 134567888999999999999999988854322211 111222222211112 2234444555555 Q ss_pred HHcCCeEEEEeeCC-------------------CCceeeEEeecCceEEEEEcC--Cc--------eEEEEEecCceEEe Q lcl|NC_019719. 125 CFYGNAYALVDRNS-------------------AGDVISLLPLQSANMDVKLVG--KK--------VVYRYQRDSEYADF 175 (424) Q Consensus 125 l~~G~a~~~~~r~~-------------------~G~~~~l~~l~~~~v~~~~~~--~~--------~~~~~~~~~~~~~~ 175 (424) .++|.+++++.-.. .|.+.+|.+++|.+|++.... +. ..|... . ...+ T Consensus 152 rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v~-~--g~~i 228 (532) T protein:vir:94 152 QAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFYKPDSWIAT-S--GKKI 228 (532) T ss_pred hcccceEEEEEeccCCccccccccccccccccccceeeEEEeechheecccccccccccccccCCceeEEEc-c--Ceee Confidence 67898888764322 233467889999988864321 11 122221 2 3467 Q ss_pred cHhHeeEeccCCC-------CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC-CCCCCHHHHHHHHHHH Q lcl|NC_019719. 176 SQKEIFHLKGFGF-------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG-EKVLTEQQRSQVEENF 247 (424) Q Consensus 176 ~~~evih~r~~~~-------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~-~~~~~~~~~~~~~~~~ 247 (424) .++.|+|+..... .+..|.|.++.+.+.+............+...... . ++++. 
......+..+.+.+.+ T Consensus 229 H~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~-~-v~k~~~a~~ls~~~~~~~~~r~ 306 (532) T protein:vir:94 229 HSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSM-T-NLATDMAQLLAPGGAQSLDARL 306 (532) T ss_pred ccceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCC-c-eeeechHHhhcchhHHHHHHHH Confidence 8999999975432 24579999999999999998888888776555333 2 23332 2222334456666666 Q ss_pred HHHhCCcccCcceecCC-CceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHH-- Q lcl|NC_019719. 248 KEIAGGPVKKRLWILEA-GFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ-- 324 (424) Q Consensus 248 ~~~~~~~~~g~~~~l~~-g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~-- 324 (424) +......+..++++++. +.+|++++.+..++ .+........||.+.|||..+|.+...+..+ ++.+.....|+. T Consensus 307 ~~~~~~~~n~g~~~id~~~e~~e~~~~~lsgl--~~~l~~~~~~iAaa~~IP~t~LfG~sp~Gln-stGe~D~~~yyd~I 383 (532) T protein:vir:94 307 QLFNLYRDNRNIGALDKGTEEIQQTNTPLSGL--DSLQAQSQEQMAAVSHIPLVKLLGITPNGLN-ASSDGEIRVWYDFI 383 (532) T ss_pred HHHHhhcCCccceEEcCCCceeEEEecccCCH--HHHHHHHHHHHHhHhCCCeeeeecCCccccc-ccchHHHHHHHHHH Confidence 65544444456788875 46888888777654 5678888889999999999987654433332 123444455554 Q ss_pred -----HHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHH-------HHHHHHHHhCCCCCHHHHHHHhCC Q lcl|NC_019719. 325 -----YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASR-------AAFMKAMGEAGLRTINEMRRTDNL 392 (424) Q Consensus 325 -----~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~-------~~~~~~~~~~g~~T~NE~R~~~G~ 392 (424) .-+.|.++.+-+.|-+..+.... ..++ |.++.|...+.+++ ++.+.+++++|++++||+|++++. 
T Consensus 384 ~s~Qe~~l~p~le~l~~~l~~s~~g~~~-~d~~--~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~ 460 (532) T protein:vir:94 384 AGYQATNLTPLMEWIIDLIQLSEYGQID-PGLA--WEWSPLMELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAA 460 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCC-CCce--EEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhc Confidence 44788888888888765543322 1234 44466777777665 455688999999999999999999 Q ss_pred CCCCCCCeeeecccc----------------cchhhccccCCCcccCC Q lcl|NC_019719. 393 PPLPGGDVAMRQSQY----------------VPITDLGTNKEPRNNGA 424 (424) Q Consensus 393 ~p~~~gd~~~~~~n~----------------~~~~~~~~~~~~~~~ga 424 (424) .|..+.+......+. .+.......+.+..+++ T Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (532) T protein:vir:94 461 DPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSE 508 (532) T ss_pred CCccccccccccccccccccchhhhhcccccCCCCCCCCCCCCCCCCC Confidence 887554322211110 01000000001111111 No 110 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=99.83 E-value=8.8e-20 Score=125.06 Aligned_cols=392 Identities=10% Similarity=-0.016 Sum_probs=228.4 Q ss_pred HHHHHhhccCcccCccccccc-------ccccc--------------ccc-ccCccccc----HHHHh-hhHHHHHHHHH Q lcl|NC_019719. 17 WARLQSWFVGGRLVTPNQGSQ-------TGPVS--------------AHG-HLGDSSIN----DERIL-QISTVWRCVSL 69 (424) Q Consensus 17 ~~~l~~~~~~~~~~~~~~~~~-------~~~~~--------------~~~-~~~~~~~~----~~~~~-~~~~v~~~i~~ 69 (424) +.+|..+++++.......... ...+. ... ...|.... .+.+. +.+.|.+|++. 
T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~D~~i~s~l~~ 80 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEERDAHLFAEMSK 80 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 222222222211111000000 00000 000 00111100 11122 57889999999 Q ss_pred HHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC---ceeeEE Q lcl|NC_019719. 70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG---DVISLL 146 (424) Q Consensus 70 ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G---~~~~l~ 146 (424) +...|.+++|.|.....+.. .......-+...|...|+ +.+++..+ .+.+++|-++.++++..+| .|..+. T Consensus 81 Rk~av~~~~w~I~p~~~~~~-~~~~~a~~v~~~l~~~~~----f~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~~~ 154 (528) T protein:vir:10 81 RKRAVLGLDWTIEPPRNASA-AEKADAEYLHELLLDLEG----IEDLMLDC-MDGVGHGYSAIELDWSLQGREWLPQAFD 154 (528) T ss_pred HHHHHhcCCceEecCCCCCH-HHHHHHHHHHHHHhCCcc----HHHHHHHH-HhhhhhcceeEEEEEeecCCceeEEEee Confidence 99999999999975433221 111112223444443232 33333333 3455799999999875543 477899 Q ss_pred eecCceEEEEEcCCceEEEEEec-CceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCC Q lcl|NC_019719. 147 PLQSANMDVKLVGKKVVYRYQRD-SEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKS 224 (424) Q Consensus 147 ~l~~~~v~~~~~~~~~~~~~~~~-~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p 224 (424) ++++.++.+..+++... .+..+ .....+++...++.++.. 
...++|.+.+..+.-.........++...|....|.| T Consensus 155 ~r~~~~f~~~~~~~~~l-~~~~~~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P 233 (528) T protein:vir:10 155 HRPQSWFQLNPDDQDEL-RLRDNSIAGEVLQPFGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLP 233 (528) T ss_pred eecccceeeccCCCcEE-eccCCCCCceeecCCCeEEEeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCC Confidence 99999888777665443 33333 345677777765555544 4558999999999999999999999999999999999 Q ss_pred ceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccC-hhHHHHHHHHHHHHHHHHHHhCCCHHHhc Q lcl|NC_019719. 225 PQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVG 303 (424) Q Consensus 225 ~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~ 303 (424) -.+.+.+.+. +++.++.+.+.+.++.+ + ..+++|.|++++-+..+ ..-..|.++.++..++|+.+. +-..+-. T Consensus 234 ~~igky~~~a-~~~ek~~L~~al~~i~~--~--~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LGqtlTs 307 (528) T protein:vir:10 234 IRLGKYPPGT-PDEEKVTLLRAVTGLGH--A--AAGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAI-LGGTLTS 307 (528) T ss_pred eEEEecCCCC-CHHHHHHHHHHHHHHhh--C--cEEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHH-hhhhhhc Confidence 8888888775 55556666666666653 2 34666777766555432 222347788888899998876 2122221 Q ss_pred C-CCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc----------ccceeeecchhhhccCHHHHHHHH Q lcl|NC_019719. 304 D-VEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV----------GRIHAEHNLDGLLRGDSASRAAFM 372 (424) Q Consensus 304 ~-~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~----------~~~~~~fd~~~l~~~d~~~~~~~~ 372 (424) . ..+.+++++- .+.........+.-.++.|+..||+.|+.+.-. ...++.|+. 
....|.+.+++.+ T Consensus 308 ~~~~g~~gS~Al-g~vh~~v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~--~e~eDl~~~a~~~ 384 (528) T protein:vir:10 308 QTSESGGGAYAL-GQVHNEVRHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDL--KDRADLAAMATSL 384 (528) T ss_pred cccccccchhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecC--CCcccHHHHHHHH Confidence 1 1111122211 233445667777788888999998877543211 112345544 4467888899999 Q ss_pred HHHHhCCC-CCHHHHHHHhCCCCCCCCCeeeecccccchhhccccCCCccc------CC Q lcl|NC_019719. 373 KAMGEAGL-RTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNN------GA 424 (424) Q Consensus 373 ~~~~~~g~-~T~NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~------ga 424 (424) .+++..|+ ++..++|+.+|+|.-..++.++.+....+.........+... ++ T Consensus 385 ~~L~~~G~~i~~~~i~e~~gip~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 443 (528) T protein:vir:10 385 PPLVKLGVQVPVNWVQEQLGIPLPANGEAVLGDQAGAGIAQLSRRPGPRIAALAQVIGP 443 (528) T ss_pred HHHHhCCCCCCHHHHHHHhCCCCCCCCcccccCCCcccccccCcccccccccccccccc Confidence 99999998 899999999999876666666554333222111111011000 00 No 111 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=99.82 E-value=8e-20 Score=125.29 Aligned_cols=402 Identities=8% Similarity=-0.032 Sum_probs=232.2 Q ss_pred CCCCcccccCCCC--CchHHHHHhhccCcccCcccccccccccccccccCccccc----HHHHh-hhHHHHHHHHHHHHh Q lcl|NC_019719. 1 MEEPKYTIDLRTN--NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSIN----DERIL-QISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~--~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~-~~~~v~~~i~~ia~~ 73 (424) ...+.- ++-|+- .|+.+.+......+ .+|.+.. ..+- ....|.... .+..+ +.+.|.+|++.+... 
T Consensus 12 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~g--ltp~~l~--~iLr--~a~~gd~~~~~~L~e~m~e~D~~i~s~l~~Rk~a 84 (526) T protein:vir:99 12 IRTQQL-REPQTSRLAGLAKEFAQHPAKG--LTPAKLA--RILV--EAEQGNLQAQAELFMDMEERDAHLFAEMSKRKRA 84 (526) T ss_pred cccccc-cchhhhhhhhhhhhhcccCcCC--CCHHHHH--HHHH--hhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHH Confidence 000000 111111 12222221111100 0111100 0000 000111100 11222 578999999999999 Q ss_pred hccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC---ceeeEEeecC Q lcl|NC_019719. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG---DVISLLPLQS 150 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G---~~~~l~~l~~ 150 (424) |.+++|.|.....+... ......-+...|+..| .+.+++..+. +.+++|-+..++++..+| .|..+.+.++ T Consensus 85 v~~~~w~I~p~~~~~~~-~~~~a~~v~~~l~~~~----~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~~~~l~~r~~ 158 (526) T protein:vir:99 85 ILGLDWAVEPPRNASAA-EKADADYLHELLLDLE----GLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAFHHRPQ 158 (526) T ss_pred HhCCCceEecCCCCCHH-HHHHHHHHHHHHhccc----CHHHHHHHHH-HhhhhcceeEEEEEeecCCceeEEEeeeecc Confidence 99999999754332211 1112223444454334 2555565555 466799999999875543 4778999999 Q ss_pred ceEEEEEcCCceEEEEEecCceEEecHhHeeEecc-CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE Q lcl|NC_019719. 
151 ANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKG-FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) Q Consensus 151 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~ 229 (424) .++.+..+++..............+++...+..++ .....++|.+.+..+.-.........++...|....|.|-.+.+ T Consensus 159 ~~f~~~~~~~~~l~~~~~~~~g~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igk 238 (526) T protein:vir:99 159 SWFQLNPEDQNELRLRDNSPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGK 238 (526) T ss_pred cceeeccCCCcEEEecCCCCCceeecCCCeEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEe Confidence 98888777665443333334566777775544444 44566899999999999999999999999999999999988888 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccC-hhHHHHHHHHHHHHHHHHHHhCCCHHHhcCC-CC Q lcl|NC_019719. 230 TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGDV-EK 307 (424) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~-~~ 307 (424) .+.+. ++++++.+.+.+.++.. + ..+++|.|++++-+..+ .....|.++.++..++|+.++ +-..+-... .+ T Consensus 239 y~~~a-~~~ek~~L~~av~~i~~--d--~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LGqtlTs~~~~g 312 (526) T protein:vir:99 239 YPPGT-ADEEKATLLRAVTGLGH--A--AAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-LGGTLTSTTSQS 312 (526) T ss_pred cCCCC-CHHHHHHHHHHHHHHhh--C--cEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-hhhhhccccccC Confidence 88775 55566666666666643 2 35667777766655532 222347788889999998875 222221111 11 Q ss_pred CCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc----------ccceeeecchhhhccCHHHHHHHHHHHHh Q lcl|NC_019719. 308 STSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV----------GRIHAEHNLDGLLRGDSASRAAFMKAMGE 377 (424) Q Consensus 308 ~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~----------~~~~~~fd~~~l~~~d~~~~~~~~~~~~~ 377 (424) +.++++. .+.........+.-.++.|+..||+.|+.+.-. ...+++|+. ....|.+.+++.+.+++. 
T Consensus 313 ~~gS~a~-g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~--~e~eDl~~~a~~~~~L~~ 389 (526) T protein:vir:99 313 GGGAFAL-GQVHNEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDL--REQADITSMAQSIPALVN 389 (526) T ss_pred cchhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCC--CCcccHHHHHHHHHHHHh Confidence 1122222 233345566677788888889998877643211 112345544 346788899999999999 Q ss_pred CCC-CCHHHHHHHhCCCCCCCCCeeeecccccchhhccccCCC----cccC-------C Q lcl|NC_019719. 378 AGL-RTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEP----RNNG-------A 424 (424) Q Consensus 378 ~g~-~T~NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~~~~~----~~~g-------a 424 (424) .|+ ++..++|+.+|+|.-.+++..+.+....+.......... ...+ + T Consensus 390 ~G~~i~~~~i~e~~Gip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (526) T protein:vir:99 390 VGLEIPSAWVYDKLGIPQPAKNEPVLRSAAQPAILSRQHGQRVAALATIVGPRYGDQQA 448 (526) T ss_pred CCCccCHHHHHHHhCCCCCCCcccccCCCCCCcccccccccccccccccccccCcchhh Confidence 997 899999999999865556655543222111110000000 0000 0 No 112 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.82 E-value=3.1e-20 Score=127.52 Aligned_cols=401 Identities=11% Similarity=0.031 Sum_probs=208.7 Q ss_pred CCCCcccccC-------CCCC---c------------------------hHHHHHhhccC-cccCccccc---ccccccc Q lcl|NC_019719. 1 MEEPKYTIDL-------RTNN---G------------------------WWARLQSWFVG-GRLVTPNQG---SQTGPVS 42 (424) Q Consensus 1 ~~~~~~~~~~-------~~~~---G------------------------~~~~l~~~~~~-~~~~~~~~~---~~~~~~~ 42 (424) -|+|.=+++- +... | ..+.+.+...+ +...+.... .....+. 
T Consensus 50 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~ 129 (862) T protein:vir:99 50 KEKPNPIIRSVKDFPFVEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWY 129 (862) T ss_pred cccCCCCCCcccccccccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccccccchhccccc Confidence 2222111110 0000 0 00111111100 000000000 0000000 Q ss_pred cccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHH Q lcl|NC_019719. 43 AHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTM 122 (424) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~ 122 (424) ......+..+ ...|.+++.++++|+++|+.+.+-.+.+.-..++... .......+...+. + ..-+..|. .++. T Consensus 130 ~~~~f~gyql-~alY~~~~larkiVd~pAeDatR~g~~I~~~~d~~e~-~~e~~~~ie~~~~-r---L~v~~~l~-eair 202 (862) T protein:vir:99 130 LSQGFIGHQA-CALIAQHWLVDKACSLAGEDAIRNGWHLKSLGEGEEI-DEESLEKFKAIDV-E---FKVKENLI-EFNR 202 (862) T ss_pred cccCcccHHH-HHHHHhCchhhhhhhhhhHHHhhCCceEeecCccccc-CHHHHHHHHHHHH-H---hhHHHHHH-HHHH Confidence 0001111111 2356678899999999999999999988643222111 1111111222221 1 11133333 3444 Q ss_pred HHHHcCCeEEEEe-eCCC---------------CceeeEEeecCceEEEEE----cCCc---eEE---EEEecCceEEec Q lcl|NC_019719. 123 QLCFYGNAYALVD-RNSA---------------GDVISLLPLQSANMDVKL----VGKK---VVY---RYQRDSEYADFS 176 (424) Q Consensus 123 ~~l~~G~a~~~~~-r~~~---------------G~~~~l~~l~~~~v~~~~----~~~~---~~~---~~~~~~~~~~~~ 176 (424) ..-++|.+++++. ...+ |.+..|..++|.++.+.. +.+. 
.++ .|...+ ..+- T Consensus 203 ~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGkP~~y~I~g--~~IH 280 (862) T protein:vir:99 203 FKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYEPEFWIISG--QKYH 280 (862) T ss_pred hcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEechhhhcccccccccccccccccCCceeeeecC--eeec Confidence 4445776666653 2222 344678888887776532 1111 111 122222 3577 Q ss_pred HhHeeEeccCCC-------CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC--CCCCHHHHHHHHHHH Q lcl|NC_019719. 177 QKEIFHLKGFGF-------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE--KVLTEQQRSQVEENF 247 (424) Q Consensus 177 ~~evih~r~~~~-------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~--~~~~~~~~~~~~~~~ 247 (424) ++.|||+..... .++.|+|.++.+...+.....+......++.+.... +++++. .+.+++ .+.+.+ T Consensus 281 ~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~--v~ktd~l~~l~~ed---~l~~r~ 355 (862) T protein:vir:99 281 RSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTT--AIHTDTAKAIANED---KFIQRL 355 (862) T ss_pred cceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccc--eeechhHhhhccHH---HHHHHH Confidence 889998865432 235799999999999999999998888888775533 344332 222322 233333 Q ss_pred HHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC-CCCccchhHHHHHHHHHH-- Q lcl|NC_019719. 248 KEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE-KSTSWGSGIEQQNLGFLQ-- 324 (424) Q Consensus 248 ~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~-~~~~~~~n~e~~~~~~~~-- 324 (424) +......+..++++++.+-+++.++.+..++ .+........||.+.+||..+|.+.. .+..+ +.++....||. 
T Consensus 356 ~~~~~~rdN~Gi~liD~eEe~e~ls~slSGL--~dll~~~~q~IAaas~IP~tiLfGqspaGlnA--TGE~D~~nYyD~I 431 (862) T protein:vir:99 356 MFWVRYRDNHAVKVLGTDETMEQFDTSLADF--DAVIMGQYQLVASIAKTPATKLLGTAPKGFNS--TGEFETISYHEEL 431 (862) T ss_pred HHHHhccCcceeEEecCCCceeEEecccCCh--HHHHHHHHHHHHhhhCCCceeecccCcccccC--chHHHHHHHHHHH Confidence 4333333345689999998999988877755 46777778899999999999776543 23222 33445566665 Q ss_pred -----HHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHH-------HHHHHhCCCCCHHHHHHHh-- Q lcl|NC_019719. 325 -----YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAF-------MKAMGEAGLRTINEMRRTD-- 390 (424) Q Consensus 325 -----~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~-------~~~~~~~g~~T~NE~R~~~-- 390 (424) .-|.|+++.+...+...+..+. .+ .|.++.|...+.+++++. ++.++++|+++++|+|+++ T Consensus 432 ~s~QE~~L~P~LerL~~li~~~lg~~~---d~--~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~ 506 (862) T protein:vir:99 432 ESIQEHVYMPFLQRHYLISRLSLGIQH---EI--DVVMEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRD 506 (862) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCC---cc--eEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHh Confidence 4577999999887765553222 23 344568888887776644 7899999999999999975 Q ss_pred ----CCCCCCCCCee----eecccccchhhcccc--C---CCcccCC Q lcl|NC_019719. 391 ----NLPPLPGGDVA----MRQSQYVPITDLGTN--K---EPRNNGA 424 (424) Q Consensus 391 ----G~~p~~~gd~~----~~~~n~~~~~~~~~~--~---~~~~~ga 424 (424) |++.++..|.. ..+.+.......+.. 
+ +....|+ T Consensus 507 ~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~~~ap~de~~aga 553 (862) T protein:vir:99 507 DKRSGYNRLTKEDAEETPGASPENLAAYQKAGAAQETASAKETQAGA 553 (862) T ss_pred cCCcCCCCCCcccccccCCCCcccccccccCCccccccccccccccc Confidence 44444322211 112222211111100 0 0001111 No 113 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=99.82 E-value=1.9e-19 Score=123.26 Aligned_cols=402 Identities=12% Similarity=0.014 Sum_probs=234.0 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCccccccccc--ccccccccCcccccHHHHh-hhHHHHHHHHHHHHhhccC Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTG--PVSAHGHLGDSSINDERIL-QISTVWRCVSLISTLTACL 77 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~-~~~~v~~~i~~ia~~ia~~ 77 (424) |-|- ...++-++=...+.+. ..... ...... ......+.....+ .+..+ +.+.|.+|++.+...|.++ T Consensus 1 ~~~~---~~~~~p~~~~g~~~~~----~~~~~-~~~~~~~e~~~~lr~~~~~~l-y~~m~e~D~~i~s~l~~rk~av~~~ 71 (469) T protein:vir:10 1 MTER---VKTAAPVSEAGYVFGS----GVVDG-WTVWDPFEQTPELQWPQSVAV-YSRMDNEDSRVTSLLEAISLPIRST 71 (469) T ss_pred CCCc---ccCCCCccchhhhhhc----ccccc-hhhccccccccccccccchHH-HHHHHhhChHHHHHHHHHHHHHhcC Confidence 2110 0001111111111100 00000 000000 0000000011112 23344 5899999999999999999 Q ss_pred ceEEEEecccCccccccccchhhhhhcc----CC--------CCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-----C Q lcl|NC_019719. 78 PLDVFETDQNDNRKKVDLSNPLARLLRY----SP--------NQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-----G 140 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~~~~~l~~lL~~----~p--------N~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~-----G 140 (424) +|.|...+.+.... .-+...|.. .+ +-..++.+++..++.+.+.+|-++.++++... 
| T Consensus 72 ~w~v~p~~~~~e~~-----~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG 146 (469) T protein:vir:10 72 PWRIRANGASDEVT-----EFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDG 146 (469) T ss_pred CceEecCCCCHHHH-----HHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCC Confidence 99996544322111 112222211 11 11236888888888888899999999987543 4 Q ss_pred --ceeeEEeecCceEEE---EEcCCceEEEE------------EecCceEEecHhHeeEeccCC-CCccccCchHHHHHH Q lcl|NC_019719. 141 --DVISLLPLQSANMDV---KLVGKKVVYRY------------QRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACK 202 (424) Q Consensus 141 --~~~~l~~l~~~~v~~---~~~~~~~~~~~------------~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~ 202 (424) .+..|.+.|+.++.. ..+++...+.. ..+.....+++...|+.++.. ...++|.|.+..+.- T Consensus 147 ~~~~~~l~~rp~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~~ 226 (469) T protein:vir:10 147 RFWLRKLAPRPQWTISKFNVAPDGGLESIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNKRPGQWQGKSILRSAYK 226 (469) T ss_pred ceeeeeeeecCcccceeeeeccCCceeeeeecCcccccccccccCCCCccccccCcEEEEEecCCCCCcccchhHHHHHH Confidence 366777778775532 22333222221 122234567777777666554 455899999999999 Q ss_pred HHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHH Q lcl|NC_019719. 203 SAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMA 282 (424) Q Consensus 203 ~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e 282 (424) .........++...|....+.|--+.+.+.+. 
+++.++.+.+....+..+.++ .++++.|++++-+..+.....|.+ T Consensus 227 ~~~fK~~~~~~w~~f~EryG~P~~vgky~~~a-~~~ek~~l~~a~~~~~~g~~a--~~iip~~~~ie~~ea~g~~~~~~~ 303 (469) T protein:vir:10 227 HWLLKDKLLRIEAATAERNGMGIPVGTASSAT-DEDEVRKMAALARSVRGGINA--GVGLAQGQILELLGVSGNLPDIRR 303 (469) T ss_pred HHHHHHHHHHHHHHHHHHcCCcceEEecCCCC-CHHHHHHHHHHHHHHhcCCce--EEEccCCceEEEeecCCCchHHHH Confidence 99999999999999999999998888887765 556677777788877765554 466788888877776655567888 Q ss_pred HHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcc-----ccccceeeecc Q lcl|NC_019719. 283 SRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-----DVGRIHAEHNL 357 (424) Q Consensus 283 ~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~-----~~~~~~~~fd~ 357 (424) +.++..++|+.+.--. .+-....++++ ...+.........+.-.++.|+..||+.|+.+. +....+.+|.+ T Consensus 304 li~~~d~~Isk~iLG~-tlTs~~~gGS~---a~~~vh~ev~~d~~~sDa~~i~~tln~~li~~l~~lN~g~~~~~P~~~~ 379 (469) T protein:vir:10 304 AIEGHDRSIALSGLAH-FLNLDGKGGSY---ALASVLEDPFTQAVHAYATSICRIANQHIIEDLVDINFGVDTPAPVLTF 379 (469) T ss_pred HHHHHHHHHHHHHhcc-cccccCccchh---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEe Confidence 9999999998876332 11111122222 234555667777888889999999998776532 11112223333 Q ss_pred hhhhccCHHHHHHHHHHHHhCCCC-----CHHHHHHHhCCCCCCCCCeeeecccc--cchhh-ccccCC--CcccCC Q lcl|NC_019719. 358 DGLLRGDSASRAAFMKAMGEAGLR-----TINEMRRTDNLPPLPGGDVAMRQSQY--VPITD-LGTNKE--PRNNGA 424 (424) Q Consensus 358 ~~l~~~d~~~~~~~~~~~~~~g~~-----T~NE~R~~~G~~p~~~gd~~~~~~n~--~~~~~-~~~~~~--~~~~ga 424 (424) +... .+.+..++.+++++..|++ +.+.+|+.+|+|+-+.++....+... .|... .+.... 
+....+ T Consensus 380 ~~~e-~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (469) T protein:vir:10 380 DPIG-SRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNLPSELNDTPSAEPEEPAAVPNQSAAPARTRSSGNADAR 455 (469) T ss_pred cCCC-CcHHHHHHHHHHHHhcCCccCccccHHHHHHHhCCCCCCCCcccccchhcccCCCCCccccccCCCCCcccc Confidence 3432 4567789999999999994 45789999999976555544332211 11100 000000 000000 No 114 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.82 E-value=3.9e-20 Score=127.01 Aligned_cols=397 Identities=10% Similarity=0.044 Sum_probs=208.3 Q ss_pred CCC----------------------------------CcccccCCCCCchHHHHHhhccCccc-Cccccccccccccccc Q lcl|NC_019719. 1 MEE----------------------------------PKYTIDLRTNNGWWARLQSWFVGGRL-VTPNQGSQTGPVSAHG 45 (424) Q Consensus 1 ~~~----------------------------------~~~~~~~~~~~G~~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~ 45 (424) -.+ ++..||=-.-.|..+.+.+...+... ..+.. ...+.... T Consensus 30 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 106 (765) T protein:vir:96 30 QHDPLDPMIKLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTM---LQDWYNSQ 106 (765) T ss_pred CCCCcccchhHHHHhhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchhhH---HHhhhccc Confidence 011 11222222222222222211111000 00000 00000011 Q ss_pred ccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHH Q lcl|NC_019719. 46 HLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLC 125 (424) Q Consensus 46 ~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l 125 (424) ...+..+ ...|.+++.+.++|+++|+.+-+-.+.+.-.+.+.. ......|...-... 
.-.+-+..++.+.- T Consensus 107 ~f~gyql-~alY~~~~l~rkiVd~pAeDa~R~g~~I~~~~~e~~-------~~~~~~l~~~~~rl-~v~~~l~ea~~~~R 177 (765) T protein:vir:96 107 GFIGYQA-CAIISQHWLVDKACSMSGEDAARNGWELKSDGRKLS-------DEQSALIARRDMEF-RVKDNLVELNRFKN 177 (765) T ss_pred CCccHHH-HHHHHhCchhhhhhhcchHHhhcCCceeecCccccC-------HHHHHHHHHHHHHh-hHHHHHHHHHHHhh Confidence 1111111 234667888999999999999998888742211111 11111221111111 23444555556666 Q ss_pred HcCCeEEEEeeC-CC---------------CceeeEEeecCceEEEEEc----CCc---eEE---EEEecCceEEecHhH Q lcl|NC_019719. 126 FYGNAYALVDRN-SA---------------GDVISLLPLQSANMDVKLV----GKK---VVY---RYQRDSEYADFSQKE 179 (424) Q Consensus 126 ~~G~a~~~~~r~-~~---------------G~~~~l~~l~~~~v~~~~~----~~~---~~~---~~~~~~~~~~~~~~e 179 (424) ++|.+|+++.-. .+ |....|..++|..+..... .+. .++ .|...+ ..|-++. T Consensus 178 lyGga~i~i~i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~i~g--~~IH~SR 255 (765) T protein:vir:96 178 VFGVRIALFVVESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQLTAESTADPSAEHFYEPDFWIISG--KKYHRSH 255 (765) T ss_pred hceeeEEEEEecccCcchhhccccccccccceeeEEEEechhhcccccchhccccccccccCcceeeeecC--ceeccce Confidence 788888776432 22 2345677777766665321 111 111 122222 3567888 Q ss_pred eeEeccCCC-------CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC--CCCCHHHHHHHHHHHHHH Q lcl|NC_019719. 180 IFHLKGFGF-------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE--KVLTEQQRSQVEENFKEI 250 (424) Q Consensus 180 vih~r~~~~-------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~--~~~~~~~~~~~~~~~~~~ 250 (424) |||+..... .+..|.|.++.+.+.|............++...... +++++. .+..+++ +.+.++.. 
T Consensus 256 li~~~g~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~--v~k~~~~~~l~~~~~---l~~r~~~~ 330 (765) T protein:vir:96 256 LVVVRGPQPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTS--TIHVDVEKAIANEDA---FNARLAFW 330 (765) T ss_pred EEEecCCCchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccc--eeeechHhhhccHHH---HHHHHHHH Confidence 999865432 345799999999999999988888888887765543 344332 2223332 33334443 Q ss_pred hCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHH------ Q lcl|NC_019719. 251 AGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ------ 324 (424) Q Consensus 251 ~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~------ 324 (424) ....+..++++++.+-+|+.++.+..++ .+........||.+.+||..+|.+...+..+ ++.+...+.||. T Consensus 331 ~~~r~n~g~~~id~ee~~e~~s~~lsgl--~d~l~~~~~~iAaas~IP~t~LfGqsp~Gln-ATGe~D~~nYyD~I~s~Q 407 (765) T protein:vir:96 331 IANRDNHGVKVIGIDETMEQFDTNLSDF--DSVIMNQYQLVAAIAKTPATKLLGTSPKGFN-ATGEHETISYHEELESIQ 407 (765) T ss_pred HHhcCCceeEEecCCcceeEEecccCCH--HHHHHHHHHHHHhhhCCCeeeeccCCccccc-CcchHHHHHHHHHHHHHH Confidence 3333345688899999999998887764 5778888999999999999888765422221 233445556665 Q ss_pred -HHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHH-------HHHHHHhCCCCCHHHHHHHhCCCCC- Q lcl|NC_019719. 325 -YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAA-------FMKAMGEAGLRTINEMRRTDNLPPL- 395 (424) Q Consensus 325 -~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~-------~~~~~~~~g~~T~NE~R~~~G~~p~- 395 (424) .-+.|.++.+-+.|-+.- ... ..+.|.++.|...+.+++++ .++.++++|++++||+|+.++.++. 
T Consensus 408 e~~l~p~le~L~~li~~s~----~i~-~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~ 482 (765) T protein:vir:96 408 EHIFDPLLERHYLLLAKSE----SID-VQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRS 482 (765) T ss_pred HHHHHHHHHHHHHHHHHhc----CCC-CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccC Confidence 456677666655554321 111 12445557888788776654 5888999999999999999865543 Q ss_pred -----CCCC----eeeecccccchhhccccC------CCccc-------CC Q lcl|NC_019719. 396 -----PGGD----VAMRQSQYVPITDLGTNK------EPRNN-------GA 424 (424) Q Consensus 396 -----~~gd----~~~~~~n~~~~~~~~~~~------~~~~~-------ga 424 (424) ++.+ ....|.+.......+++. +..+. |+ T Consensus 483 g~~~l~d~~~e~~~~~~pe~~~~~~~~~~~~~~~~~e~~~~~a~p~~~eg~ 533 (765) T protein:vir:96 483 GYNRLTDDQAETEPGMSPENLAELEKAGAQSAKAKGEAERAEAQAGAVEGA 533 (765) T ss_pred CCCCCCccccccccCCCccccccccCCCcccccccCccccccCCCCccCCC Confidence 2211 111111221111111100 00000 00 No 115 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=99.80 E-value=4.9e-19 Score=120.97 Aligned_cols=402 Identities=8% Similarity=-0.038 Sum_probs=233.9 Q ss_pred CCCCcccccCCCCC--chHHHHHhhccCcccCcccccccccccccccccCcccc----cHHHHh-hhHHHHHHHHHHHHh Q lcl|NC_019719. 1 MEEPKYTIDLRTNN--GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSI----NDERIL-QISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~--G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~-~~~~v~~~i~~ia~~ 73 (424) +. +.--++.|+-. |+.+++......+ .+|.+.. ..+- ....|... -.+..+ +.+.|.+|++.+... 
T Consensus 12 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~g--ltp~~l~--~il~--~a~~gd~~~~~~L~edm~e~D~~i~s~l~~Rk~a 84 (526) T protein:vir:79 12 IR-PQQLREPQTSRLAGLAKEFAQHPAKG--LTPAKLA--RILV--EAEQGNLQAQAELFMDMEERDAHLFAEMSKRKRA 84 (526) T ss_pred cC-ccccchhhhhhhhhhhhhcccCCCCC--cCHHHHH--HHHH--HhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHH Confidence 11 00012333311 3333322211111 0111100 0000 00011110 012222 578999999999999 Q ss_pred hccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC---ceeeEEeecC Q lcl|NC_019719. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG---DVISLLPLQS 150 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G---~~~~l~~l~~ 150 (424) |.+++|.|.....+... ......-+...|+..| .+.+++..+.. .+.+|-+..++++..+| .+..+.+.++ T Consensus 85 v~~~~w~I~p~~~~~~~-~~~~a~~v~~~l~~~~----~~~~~i~~~ld-A~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~ 158 (526) T protein:vir:79 85 ILGLDWAVEPPRNASAA-EKADADYLHELLLDLE----GLEDLLLDALD-GIGHGYSCIELEWALQGREWMPLAFHHRPQ 158 (526) T ss_pred HhCCCceEecCCCCChH-HHHHHHHHHHHHhccc----CHHHHHHHHHh-hhhhcceeEEEEEeecCCceeEEEeeeecc Confidence 99999999754333211 1112223444454333 25555555544 56799999999876543 4778999999 Q ss_pred ceEEEEEcCCceEEEEEecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE Q lcl|NC_019719. 151 ANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) Q Consensus 151 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~ 229 (424) .++.+..+++..............+++...+..++.. 
...++|.+.+..+.-.........++...|...-|.|-.+.+ T Consensus 159 ~~F~~~~~~~~~l~~~~~~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igk 238 (526) T protein:vir:79 159 SWFQLNPEDQNELRLRDNSPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGK 238 (526) T ss_pred cceEeccCCCcEEEecCCCCCceeecCCceEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEe Confidence 9888777666543322333456677877665555544 455899999999999999999999999999999999988888 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccC-hhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC-C Q lcl|NC_019719. 230 TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGDVE-K 307 (424) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~-~ 307 (424) .+.+. +++.++.+.+.+.++.. + ..+++|.|++++-+..+ .....|.++.++..++|+++. +-..+-.... + T Consensus 239 y~~~a-~~~ek~~L~~av~~i~~--d--a~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LGqtlTs~~~~g 312 (526) T protein:vir:79 239 YPPGT-ADEEKATLLRAVTGLGH--A--AAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-LGGTLTSTTSQS 312 (526) T ss_pred cCCCC-CHHHHHHHHHHHHHHhc--C--cEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-hhhhhccccccC Confidence 88775 55556666666666653 2 35777777766655532 233457888889999998875 2222221111 1 Q ss_pred CCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccc----------cceeeecchhhhccCHHHHHHHHHHHHh Q lcl|NC_019719. 308 STSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG----------RIHAEHNLDGLLRGDSASRAAFMKAMGE 377 (424) Q Consensus 308 ~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~----------~~~~~fd~~~l~~~d~~~~~~~~~~~~~ 377 (424) ++++++. .+........-+.-.+++|+..||+.|+.+.-.- ..++.|+. ....|.+.+++.+.+++. 
T Consensus 313 ~~gS~a~-g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~--~e~eDl~~~a~~~~~L~~ 389 (526) T protein:vir:79 313 GGGAFAL-GQVHNEVRHDILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDL--REQADITSMAQSIPALVN 389 (526) T ss_pred cchhhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCC--CCcccHHHHHHHHHHHHh Confidence 1222222 2334555667777888999999998776432111 12345544 346788899999999999 Q ss_pred CCC-CCHHHHHHHhCCCCCCCCCeeeecccccchhhccccC----CCcccCC Q lcl|NC_019719. 378 AGL-RTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNK----EPRNNGA 424 (424) Q Consensus 378 ~g~-~T~NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~~~----~~~~~ga 424 (424) .|+ ++..++|+.+|+|...+++.++.|............. .....++ T Consensus 390 ~G~~i~~~~i~e~~gip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (526) T protein:vir:79 390 VGLEIPSAWVYDKLGIPQPAKNEPVLRPAAQPAILSRQHGQRVAALATIVGP 441 (526) T ss_pred CCCcCCHHHHHHHhCCCCCCCchhhccccCCccccccccccccccccccccc Confidence 997 7999999999997655555554432111100000000 0000000 No 116 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.78 E-value=3e-19 Score=122.10 Aligned_cols=373 Identities=10% Similarity=0.058 Sum_probs=211.1 Q ss_pred ccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEeccc Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~ 87 (424) |+.-|..|.-+-+ .++...... .......+..+ ...|.+++.+.++|+.+|+.+.+..+.+.-. 
T Consensus 1 ~~~~~~d~~~~~~----~~~~~~~~~--------~~~~~~~~~~l-~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~--- 64 (427) T protein:vir:10 1 MKIVKHDGYNDIF----NGGADGSPK--------PFFMSDASYHV-GSFYNDNATAKRIVDVIPEEMVTAGFKMSGV--- 64 (427) T ss_pred CCccccchHHHHh----hcCCCCccc--------CccccCchHHH-HHHHHcCchhhhhhccchHHhhcCCccccCc--- Confidence 9999999997643 222111110 01111112222 2446678889999999999999998887421 Q ss_pred CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-e---------CCCCceeeEEeecCceEEEEE Q lcl|NC_019719. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-R---------NSAGDVISLLPLQSANMDVKL 157 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~-r---------~~~G~~~~l~~l~~~~v~~~~ 157 (424) +. +.... ..+..|+ -.+-+..++....++|.+++++. + ...|.+..|.+++++.+++.. T Consensus 65 ~~--~~~~~-~~~~~l~--------~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~ 133 (427) T protein:vir:10 65 KD--EKEFK-SLWDSYK--------LDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEK 133 (427) T ss_pred cH--HHHHH-HHHHHhh--------HHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccc Confidence 11 11111 1122222 23445555666667898888764 2 234678899999998887643 Q ss_pred c---------CCceEEEEEecC--ceEEecHhHeeEeccCC-------CCccccCchHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 158 V---------GKKVVYRYQRDS--EYADFSQKEIFHLKGFG-------FTGLVGLSPIA-FACKSAGVAVAMEDQQRDFF 218 (424) Q Consensus 158 ~---------~~~~~~~~~~~~--~~~~~~~~evih~r~~~-------~~~~~G~s~~~-~~~~~i~~~~~~~~~~~~~~ 218 (424) . +....|.+...+ ..+.+.++.|+|+.+.. ..+.+|.|++. 
.+.+.+............++ T Consensus 134 ~~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~ 213 (427) T protein:vir:10 134 RVTNARSPRYGEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDYDYCESLATQIL 213 (427) T ss_pred cccCccccccCcceEEEEecCCCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHH Confidence 2 123345554332 33678899999996543 24568999986 46787888877777777766 Q ss_pred hccCCCceeEEcCCC---CCCHHHHHHHHHHHHHHhC-CcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 219 ANGAKSPQILSTGEK---VLTEQQRSQVEENFKEIAG-GPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARF 294 (424) Q Consensus 219 ~n~~~p~~vl~~~~~---~~~~~~~~~~~~~~~~~~~-~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~ 294 (424) ...... +++++.- +..........+.+..... .++.+.+++...+-++++++.+...+ .+........||.+ T Consensus 214 ~k~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl--~~~~~~~~~~iaaa 289 (427) T protein:vir:10 214 RRKQQA--VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNSDISGV--PEFLSSKMDRIVSL 289 (427) T ss_pred HHhccc--cccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCCceeEEecccCCh--HHHHHHHHHHHHhh Confidence 554332 3343311 1122222222223333222 22344566666668899888777664 57788889999999 Q ss_pred hCCCHHHhcCCCCCCccchhHHHHHHHHHH-------HHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHH Q lcl|NC_019719. 295 FGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSAS 367 (424) Q Consensus 295 fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~-------~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~ 367 (424) .+||..+|.+...+..+ ++.+.....|+. .-+.|.++.+-+.+- .. 
..+.++| +.|...+.++ T Consensus 290 ~~IP~t~L~G~sp~Gln-stgd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~----~s---~~~~~~f--~pL~~~s~kE 359 (427) T protein:vir:10 290 SGIHEIIIKNKNVGGVS-ASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV----DE---EEWSIEF--EPLSVPSKKE 359 (427) T ss_pred hCCCeeeeccCCccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----cC---CCcEEEe--CCCCCCCHHH Confidence 99999988665544433 233444555555 345666555443332 11 1344455 5666666555 Q ss_pred -------HHHHHHHHHhCCCCCHHHHHHHh----CCCCCCCCCeeee--ccccc-chhhccccCCCcc Q lcl|NC_019719. 368 -------RAAFMKAMGEAGLRTINEMRRTD----NLPPLPGGDVAMR--QSQYV-PITDLGTNKEPRN 421 (424) Q Consensus 368 -------~~~~~~~~~~~g~~T~NE~R~~~----G~~p~~~gd~~~~--~~n~~-~~~~~~~~~~~~~ 421 (424) +++.+.+++++|+++++|+|+.+ +.+.+.+++..-. +.... +-...++.+.++| T Consensus 360 kaei~~~~a~a~~~~~~~gvi~~~e~r~~L~~~~~~~~~~~~~~~~~e~~~~~~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 360 ESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLEDEN 427 (427) T ss_pred HHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhhhccccCCCCccccccccchhcCCCCCCCCCCCCCC Confidence 45678889999999999999876 3444433332110 11110 1011111222222 No 117 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=99.77 E-value=1.6e-17 Score=112.62 Aligned_cols=388 Identities=10% Similarity=0.036 Sum_probs=224.5 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCccccccc--------cccccc--------ccccCcccccHHHHhhhHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQ--------TGPVSA--------HGHLGDSSINDERILQISTVW 64 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~--------~~~~~~--------~~~~~~~~~~~~~~~~~~~v~ 64 (424) |- .||+..-..++.......+..... ..+..+ ....++..-..+..++.+.|. 
T Consensus 1 ~~-----------~~i~~~~g~~~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~p~~~~il~~~~~~~~~y~~m~~D~~i~ 69 (491) T protein:vir:79 1 MS-----------KGLWVSPTEFVKFGEPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVYRELRADAHVG 69 (491) T ss_pred CC-----------CeeeCCCCCcccccccchhHHHHHhhhccccccccccccCcchhHHHhhccCCHHHHHHHhhChHHH Confidence 11 122222211111110000000000 000000 000111111234456789999 Q ss_pred HHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC---c Q lcl|NC_019719. 65 RCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG---D 141 (424) Q Consensus 65 ~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G---~ 141 (424) +|++.+...|.+++|.|.....+. + ....+...|. ++ .+.+++..+. +.+++|-++.++++..+| . T Consensus 70 s~l~~Rk~av~~~~w~i~~~~~~~---~--~a~~i~e~l~-~~----~~~~~i~~~l-da~~~G~s~~Ei~w~~~~g~~~ 138 (491) T protein:vir:79 70 GCVRRRKAAVKALEWGLDRGKAKS---R--VAKSIADVFA-DL----DLSRIATEML-DAVLYGYQPMEITWGKVGNYIV 138 (491) T ss_pred HHHHHHHHHHhCCCcEEecCCCCH---H--HHHHHHHHHh-cC----CHHHHHHHHH-HhhhhcceeEEEEEeecCCeee Confidence 999999999999999997544322 1 1123444453 33 4566666664 566799999999876543 4 Q ss_pred eeeEEeecCceEEEEEcCCceEEEEEecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019719. 142 VISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFAN 220 (424) Q Consensus 142 ~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 220 (424) |..+.+.|+.++.+..++...+...........+++...|+.++.. ...++|.+.+..+.-.........++...|... 
T Consensus 139 ~~~l~~r~~~~f~~d~~~~l~l~~~~~~~~g~~lp~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~ 218 (491) T protein:vir:79 139 PIDVVGKPADWFVYDPENQLRFRSKEHWVQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEK 218 (491) T ss_pred EEeeeeecccceeeccCCceEEeecCCCCCceeecCCCeEEEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHH Confidence 6789999999888776655433222223456778888888777654 455899999999999999999999999999999 Q ss_pred cCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccc--C-hhHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019719. 221 GAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGV--T-PQDAEMMASRKFQVSELARFFGV 297 (424) Q Consensus 221 ~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~--~-~~d~~~~e~~~~~~~~Ia~~fgV 297 (424) .|.|-.+.+.+.+. +++.++.+.+.+.++.+ + ..+++|.|++++-+.. . ..-..|.++.++..++|+.+. T Consensus 219 ~G~P~~igky~~~a-~~~ek~~l~~al~~~~~--~--a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i-- 291 (491) T protein:vir:79 219 YGSPMLVGKHPRSA-SDAETNLLLDRLEDMVQ--D--AVAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIAL-- 291 (491) T ss_pred cCCCeEEEecCCCC-CHHHHHHHHHHHHHHhc--C--eEEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHH-- Confidence 99998888887765 45555666666666543 2 3466677766655432 2 222347777888888888865 Q ss_pred CHHHhcCC--CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccc-c---ccceeeecchhhhccCHHHHHHH Q lcl|NC_019719. 298 PPHLVGDV--EKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKD-V---GRIHAEHNLDGLLRGDSASRAAF 371 (424) Q Consensus 298 P~~~l~~~--~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~-~---~~~~~~fd~~~l~~~d~~~~~~~ 371 (424) ||.+ .+++.+++.. +........-+.-.+..++..||+ |+.+.- . ......|.+.... .+.+.+++. 
T Consensus 292 ----LGqtlTt~~~gs~a~~-~vh~~v~~~i~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~e~e-e~~~~~a~~ 364 (491) T protein:vir:79 292 ----LGQNQTTEATSTRASA-QAGLEVTDDIRDGDKAIVVEAMNM-LIRWICDLNFDGAARPVFDMWEQE-QVDEIQAGR 364 (491) T ss_pred ----hhhhhccCcccchhhH-HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEeecCcC-chhHHHHHH Confidence 3322 1223333222 234445566666777778877775 443221 0 1112345444432 223568899 Q ss_pred HHHHHhCCC-CCHHHHHHHhCCCCCCCCCeeeecccccchhhc-----cccCCCcccCC Q lcl|NC_019719. 372 MKAMGEAGL-RTINEMRRTDNLPPLPGGDVAMRQSQYVPITDL-----GTNKEPRNNGA 424 (424) Q Consensus 372 ~~~~~~~g~-~T~NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~-----~~~~~~~~~ga 424 (424) +.++++.|+ ++.+++|+.+|+|+-+.++....+....+.... ........+.+ T Consensus 365 ~~~L~~~G~~i~~~~~~e~~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 423 (491) T protein:vir:79 365 DEKLTRAGARFTPAYFKRAYNLQDGDLDERPLPVSAVDAVGAASFAEFEAPDQDALDAA 423 (491) T ss_pred HHHHHhCCCccCHHHHHHHhCCCCCCCCccccCcCcccccccccccccCCCCCcchHHH Confidence 999999987 799999999999876555544332211111100 00011111111 No 118 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=99.76 E-value=1.4e-17 Score=112.95 Aligned_cols=387 Identities=10% Similarity=0.042 Sum_probs=226.0 Q ss_pred CCC--CcccccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCc Q lcl|NC_019719. 1 MEE--PKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~--~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) +.. +.-+-++++++.+.+.. ..+. ++.. ..++. 
....+..-.-+..++.+.|.+|++.+...|.+++ T Consensus 15 ~~~~~~~~~~~ia~~~~~~~~~----~~~~--~~~~---~~~iL--r~~~~~~~~y~~m~~D~~i~s~l~~Rk~av~~~~ 83 (491) T protein:vir:10 15 FGEPDKSLSSQIATRARSIDFF----ALGM--YLPN---PDPVL--KALGKDIRVYRELRADAHVGGCVRRRKAAVKALE 83 (491) T ss_pred cccCChHHHHHHHhhhcccccc----cccC--Cccc---hHHHH--HhcCCCHHHHHHHhhChHHHHHHHHHHHHHhCCC Confidence 111 11233344433222211 1110 0100 00000 0111111122445678999999999999999999 Q ss_pred eEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC---ceeeEEeecCceEEE Q lcl|NC_019719. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG---DVISLLPLQSANMDV 155 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G---~~~~l~~l~~~~v~~ 155 (424) |.|.....+.. ....+...|. ++ .+.+++..+. +.+++|.+..++++..+| .+..+.++|+.++.+ T Consensus 84 w~i~~~~~~~~-----~~e~v~e~l~-~~----~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~ 152 (491) T protein:vir:10 84 WGLDRGKAKSR-----VAKSIADVFA-DL----DLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVY 152 (491) T ss_pred cEEecCCCCHH-----HHHHHHHHHh-cC----CHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeeecccceee Confidence 99965433221 1123444453 33 4677777775 567899999999886543 466899999998887 Q ss_pred EEcCCceEEEEEe-cCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC Q lcl|NC_019719. 156 KLVGKKVVYRYQR-DSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK 233 (424) Q Consensus 156 ~~~~~~~~~~~~~-~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~ 233 (424) ..++... +.... ......+++...|+.++.. 
...++|.+.+..+.-.........++...|....+.|-.+.+.+.+ T Consensus 153 d~~~~l~-~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~ 231 (491) T protein:vir:10 153 DPENQLR-FRSKDHWMQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRS 231 (491) T ss_pred ccCCceE-EecCCCCCCcceecCCCEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCCC Confidence 6665433 33333 2455678888777776654 4558999999999999999999999999999999999888888777 Q ss_pred CCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccc--ChhH-HHHHHHHHHHHHHHHHHhCCCHHHhcCC--CCC Q lcl|NC_019719. 234 VLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGV--TPQD-AEMMASRKFQVSELARFFGVPPHLVGDV--EKS 308 (424) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~--~~~d-~~~~e~~~~~~~~Ia~~fgVP~~~l~~~--~~~ 308 (424) . +++.++.+.+.+.++.+ + ..+++|.|++++-+.. +... ..|.+..++..++|+.+. ||.+ .++ T Consensus 232 a-~~~ek~~l~~al~~~~~--~--a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i------LGqtlTt~~ 300 (491) T protein:vir:10 232 A-SDGEKNLLLDCLEDMVQ--D--AVAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIAL------LGQNQTTEA 300 (491) T ss_pred C-CHHHHHHHHHHHHHHhc--C--cEEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHH------hhhhcccCc Confidence 6 45556666666666643 2 3566777776655543 2222 247777888888888763 3322 122 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccc----cccceeeecchhhhccCHHHHHHHHHHHHhCCC-CCH Q lcl|NC_019719. 309 TSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKD----VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGL-RTI 383 (424) Q Consensus 309 ~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~----~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~-~T~ 383 (424) +.+++.. +........-+.-.+..++..+|+ |+.+.- ....+.+|.+... ..+.+.+++.+.+++..|+ ++. 
T Consensus 301 ~gs~a~~-~vh~~v~~di~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~~~-~e~~~~~a~~~~~L~~~G~~i~~ 377 (491) T protein:vir:10 301 TSTRASA-QAGLEVTDDIRDGDKAVVSEAMNM-LIRWICDLNFDGADRPVFDMWEQ-EQVDEIQAGRDQKLTQAGARFTP 377 (491) T ss_pred ccchhHH-HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEEecCc-CchhHHHHHHHHHHHhCCCcCCH Confidence 2233222 334445566666777788887774 543211 0011223333333 2334778999999999987 788 Q ss_pred HHHHHHhCCCCCCCCCeeeecccccch--hhcccc---CCCcccCC Q lcl|NC_019719. 384 NEMRRTDNLPPLPGGDVAMRQSQYVPI--TDLGTN---KEPRNNGA 424 (424) Q Consensus 384 NE~R~~~G~~p~~~gd~~~~~~n~~~~--~~~~~~---~~~~~~ga 424 (424) .++|+.+|+|+-+.++.........+. ...... ..+..+.+ T Consensus 378 ~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 423 (491) T protein:vir:10 378 AYFKRAYNLQDGDLDERPLPVSAVDTVGAASFAEFEAPDQDALDAA 423 (491) T ss_pred HHHHHHhCCCCCCcCccccccCCCCCcccccccccCCCCCCchHHH Confidence 999999999875444433221111111 000000 00000111 No 119 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.76 E-value=1.2e-18 Score=118.92 Aligned_cols=378 Identities=9% Similarity=0.034 Sum_probs=201.5 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |..-| +...+..|+-+.+ .+.. +.....+.......... -...|.+++.+.++|+.+|+.+-+-.+. 
T Consensus 5 m~~~~--~~~~~~D~~~~~~----~~~~------g~~~~~~~~~~~~~~~~-l~~~Y~~~~l~~~~Vd~~aed~~r~g~~ 71 (435) T protein:vir:79 5 MSDKV--KAITKEDGYNEIF----GSKD------GTFRPNAFYMQRAAFKA-LSQFYEEDGMARRIVDVIPEEMVTPGFK 71 (435) T ss_pred ccccc--ccchhhcchhhhh----cccc------cccccCcccCCcCCHHH-HHHHHhcCchhhhhhccchHHhhcCCce Confidence 43332 3333444433321 1100 00000001111111111 1234567888999999999999999888 Q ss_pred EEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCC---------CCceeeEEeecC Q lcl|NC_019719. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNS---------AGDVISLLPLQS 150 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~-r~~---------~G~~~~l~~l~~ 150 (424) +.-. +. +. .+...+. + . ...+-+..++....++|.+++++. ++. .|.+..+.++++ T Consensus 72 i~g~---~~--~~----~~~~~~~-~---l-~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~ 137 (435) T protein:vir:79 72 VDGV---KN--EK----SFKSRWD-E---L-RLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDR 137 (435) T ss_pred ecCC---Ch--HH----HHHHHHH-H---h-hHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeech Confidence 7311 11 11 1111121 1 1 123344455555567888877765 332 244568888888 Q ss_pred ceEEEEEc---------CCceEEEEEecC--ceEEecHhHeeEeccCC-------CCccccCchH-HHHHHHHHHHHHHH Q lcl|NC_019719. 151 ANMDVKLV---------GKKVVYRYQRDS--EYADFSQKEIFHLKGFG-------FTGLVGLSPI-AFACKSAGVAVAME 211 (424) Q Consensus 151 ~~v~~~~~---------~~~~~~~~~~~~--~~~~~~~~evih~r~~~-------~~~~~G~s~~-~~~~~~i~~~~~~~ 211 (424) .++++... +....|.+...+ .++.+.++.|||+.+.. .....|.|++ +.+.+.+....... 
T Consensus 138 ~~i~~~~~~~dp~sp~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~ 217 (435) T protein:vir:79 138 YQITIHERETNARSVRYGEPKLYKISPGGDIPEFFVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQ 217 (435) T ss_pred hhccchhhccCCcccccCcceEEEEecCCCCCceEEcceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHH Confidence 88875332 122345554332 35678999999996432 2456799998 57888888888888 Q ss_pred HHHHHHHhccCCCceeEEcCC---CCCCHHHHHHHHHHHHHHhCCc-ccCcceecCCCceeeecccChhHHHHHHHHHHH Q lcl|NC_019719. 212 DQQRDFFANGAKSPQILSTGE---KVLTEQQRSQVEENFKEIAGGP-VKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQ 287 (424) Q Consensus 212 ~~~~~~~~n~~~p~~vl~~~~---~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~ 287 (424) .....++...... +++++. ....+.......+.+....... +.+.+++...+.+++.++.+..++ .+..... T Consensus 218 ~~~~~l~~~~~~~--v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~e~~e~~~~~lsgl--~~~~~~~ 293 (435) T protein:vir:79 218 ELATQLLRRKQQA--VWKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDATDEEYEVLNSDVSGV--PEFLQEK 293 (435) T ss_pred HHHHHHHHHhcCc--cccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEecCCcceEEEecccCCH--HHHHHHH Confidence 8887766554432 233321 1222222333333333332222 234455555556788888777654 5778888 Q ss_pred HHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHH-------HHHHHHHHHHHHHHhhccCccccccceeeecchhh Q lcl|NC_019719. 288 VSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY-------TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGL 360 (424) Q Consensus 288 ~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~-------tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l 360 (424) ...||.+.|||..+|.+...+..+ ++.+.....|+.. -+.|.++.+-..+ ... ..+. 
|.++.| T Consensus 294 ~~~iaaa~~IP~t~L~G~s~~gln-stgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~li----~~s---~d~~--~~f~pL 363 (435) T protein:vir:79 294 IDRIVALTGIHEIIIKNKNTGGVS-ASQNTALETFYKLIDRKRVEDYKPILEFLLPFM----ISE---TEWS--IEFEPL 363 (435) T ss_pred HHHHHhhhCCCeeeeccCCccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hcC---CCCe--EEeCCC Confidence 999999999999888665544432 2334444555543 3445544433332 211 1233 444677 Q ss_pred hccCHHH-------HHHHHHHHHhCCCCCHHHHHHHh-CCCC---CCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 361 LRGDSAS-------RAAFMKAMGEAGLRTINEMRRTD-NLPP---LPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 361 ~~~d~~~-------~~~~~~~~~~~g~~T~NE~R~~~-G~~p---~~~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ...|.++ .++.+.+++++|+++++|+|+.+ ...+ +.+.+..-.+ ..++ .+.+++.++|. T Consensus 364 ~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~----~~~d-~~~~~~~e~g~ 433 (435) T protein:vir:79 364 SVPSDKDKAEIMAKNVESVVKLKAEQAINLKETRDTLRSICPDLKIMDNDNIELP----EPED-LDPEPGQEGGL 433 (435) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccccCCCCcccccCC----cccc-CCCCCCCCCCC Confidence 7777754 45667788999999999999877 2222 1111111111 0111 11122223333 No 120 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.76 E-value=1.9e-18 Score=117.71 Aligned_cols=373 Identities=11% Similarity=0.055 Sum_probs=205.7 Q ss_pred CCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCc Q lcl|NC_019719. 10 LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDN 89 (424) Q Consensus 10 ~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~ 89 (424) +-+-.|.-+-++ ++..... .+..........+ ...|.+++.++++|+.+|+.+-+-.|.+.-. +. 
T Consensus 1 ~~~~D~~~n~~~----gg~~~~~-------~~~~~~~~~~~~l-~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~~---~~ 65 (422) T protein:vir:10 1 MVKTDSYANIFL----GGSDGSE-------IYGSLQNQAPTIL-ASLYADNALVRRIIDTIPETALAAGFHIDGI---DD 65 (422) T ss_pred CccchhhHHHHc----CCCCCcc-------ccCcccccCHHHH-HHHHHhChhhHHHHhhhhHHHhcCCccccCC---CH Confidence 334446555432 2211000 0111111111111 2346678899999999999999988887311 11 Q ss_pred cccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee-C---------CCCceeeEEeecCceEEEEEc- Q lcl|NC_019719. 90 RKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR-N---------SAGDVISLLPLQSANMDVKLV- 158 (424) Q Consensus 90 ~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r-~---------~~G~~~~l~~l~~~~v~~~~~- 158 (424) +...... +..|+ ..+-+..++....++|.+++++.. + ..|.+..+.++++..|++... T Consensus 66 --~~~~~~~-~~~l~--------~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~ 134 (422) T protein:vir:10 66 --EPAFWSR-WDDLE--------MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTRE 134 (422) T ss_pred --HHHHHHH-HHHhh--------HHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcc Confidence 1111111 12222 234445555556678888888754 3 235577888999988876431 Q ss_pred --------CCceEEEEEecC--ceEEecHhHeeEeccCC-------CCccccCchHHH-HHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019719. 159 --------GKKVVYRYQRDS--EYADFSQKEIFHLKGFG-------FTGLVGLSPIAF-ACKSAGVAVAMEDQQRDFFAN 220 (424) Q Consensus 159 --------~~~~~~~~~~~~--~~~~~~~~evih~r~~~-------~~~~~G~s~~~~-~~~~i~~~~~~~~~~~~~~~n 220 (424) +....|.+..++ ..+.+-++.|||+.... ..+.+|.|++.. +.+.+.....+......++.. 
T Consensus 135 ~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~ 214 (422) T protein:vir:10 135 ENPRNARFGEPLTYRITTNESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKR 214 (422) T ss_pred cCccccccCcceEEEEecCCCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222345554432 33678899999996532 235689999986 678888888888888776655 Q ss_pred cCCCceeEEcCC---CCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019719. 221 GAKSPQILSTGE---KVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFG 296 (424) Q Consensus 221 ~~~p~~vl~~~~---~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fg 296 (424) .... +++++. ............++++..... .+.+.+++...+.++++++.+..++ .+.......+||.+.| T Consensus 215 ~~~~--v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl--~~~~~~~~~~iaaa~~ 290 (422) T protein:vir:10 215 KQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGI--DAFLDKKFDRIVALSG 290 (422) T ss_pred hccc--cccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEecccCCh--HHHHHHHHHHHHhhhC Confidence 4433 344332 112222233333333333322 2334455555678899988887764 5788888999999999 Q ss_pred CCHHHhcCCCCCCccchhHHHHHHHHHH-------HHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHH-- Q lcl|NC_019719. 297 VPPHLVGDVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSAS-- 367 (424) Q Consensus 297 VP~~~l~~~~~~~~~~~n~e~~~~~~~~-------~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~-- 367 (424) ||..+|.+...+..+ ++.++..+.|+. .-+.|.++.+-..+- .. 
..+.++| +.|...+.++ T Consensus 291 IP~t~L~G~s~~Gln-atgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~----~s---~~~~~~f--~pL~~~sekeka 360 (422) T protein:vir:10 291 IHEIILKNKNVGGVS-SSQNTALETFHKLVDRKRNAELLPILEFLIPFIV----NA---EEWSVEF--NPLAQESSKDKA 360 (422) T ss_pred CCeeeeccCCccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----cc---CCcEEEe--CCCCCCCHHHHH Confidence 999988665544432 233444455554 345565554433331 11 1344444 5676666664 Q ss_pred -----HHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC-Ceeeecccccchhhcc-ccCCCccc Q lcl|NC_019719. 368 -----RAAFMKAMGEAGLRTINEMRRTDNLPPLPGG-DVAMRQSQYVPITDLG-TNKEPRNN 422 (424) Q Consensus 368 -----~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~g-d~~~~~~n~~~~~~~~-~~~~~~~~ 422 (424) .++.+++++++|+++++|+|+.+--.....+ ..-..+.......... ..+++.+| T Consensus 361 ei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 361 EILEKNVNSIAALIAAGAMDIDEARDTLRTIAPEVKINDGSVETEVTISETSNDPLEVPTDD 422 (422) T ss_pred HHHHHHHHHHHHHHhcCCCCHHHHHHHhhhhcccccCCCCCCccccchhhcCCCCCCCCCCC Confidence 5566888999999999999988733221110 0001111111111111 11223333 No 121 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=99.75 E-value=1.4e-17 Score=113.02 Aligned_cols=397 Identities=10% Similarity=0.002 Sum_probs=228.5 Q ss_pred CCCCcccccCC--CCCchHHHHHhhccCcccCcccccccccccccccccCccc-----ccHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019719. 1 MEEPKYTIDLR--TNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSS-----INDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~--~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~v~~~i~~ia~~ 73 (424) +..+.- ++-| +-.|+++.+...+..+ .+|.+.. ..+- ....|.. +-.+-..+.+.|.+|++.+... 
T Consensus 12 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~g--ltp~~l~--~iL~--~a~~gd~~~~~~L~~dm~~~D~hi~s~l~~Rk~a 84 (512) T protein:vir:19 12 FDFDDE-MQSRSDELAMVMKRTQEHPSSG--VTPNRAA--QMLR--DAERGDLTAQADLAFDMEEKDTHLFSELSKRRLA 84 (512) T ss_pred cccccc-cccccchhcccchhhccccccC--CCHHHHH--HHHH--HhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHH Confidence 000000 0000 0012222221111100 0110000 0000 0001111 0112223578899999999999 Q ss_pred hccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC---CCceeeEEeecC Q lcl|NC_019719. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS---AGDVISLLPLQS 150 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~---~G~~~~l~~l~~ 150 (424) |.+++|.|....+.. ........-+...|...| .+.+++..+. +.+++|-+.+++++.. ...|..+.+.++ T Consensus 85 v~~~~w~I~p~~~~~-~~~~~~a~~v~~~l~~~~----~f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~ 158 (512) T protein:vir:19 85 IQALEWRIAPARDAS-AQEKKDADMLNEYLHDAA----WFEDALFDAG-DAILKGYSMQEIEWGWLGKMRVPVALHHRDP 158 (512) T ss_pred HhCCCceEecCCCCC-HHHHHHHHHHHHHHhcCC----CHHHHHHHHH-hhhhhcceeeeeEeeeeCCceeeeeeeeecc Confidence 999999996543221 111111122344454444 2555565554 4667999999988743 346778999999 Q ss_pred ceEEEEEcCCceEEEEEecCceEEecHhHeeEeccCC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE Q lcl|NC_019719. 151 ANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) Q Consensus 151 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~ 229 (424) .++.+..++...............+++...+..++.. 
...++|.+.+..+.-.........++...|....|.|-.+.+ T Consensus 159 ~~f~~~~~~~~~lr~~~~~~~G~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igk 238 (512) T protein:vir:19 159 ALFCANPDNLNELRLRDASYHGLELQPFGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGK 238 (512) T ss_pred ccceeccCCCcEEEecCCCCCceeecCCceEEEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEe Confidence 9888777665544333333455677777666555544 456899999999999999999999999999999999988888 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccC-hhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC-- Q lcl|NC_019719. 230 TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGDVE-- 306 (424) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~-- 306 (424) .+.+. +++.++.+.+.+.++.+ + ..+++|.|++++-+..+ .....|.++.++..++|+++. ||.+- T Consensus 239 y~~~a-~~~ek~~L~~al~~~~~--~--a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~i------LGqtlTs 307 (512) T protein:vir:19 239 YPTGS-TNREKATLMQAVMDIGR--R--AGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAI------LGGTLTT 307 (512) T ss_pred cCCCC-CHHHHHHHHHHHHHHhh--C--cEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHH------hhhhhcc Confidence 87775 45556666666666543 2 35667777766555432 233457888899999999873 33321 Q ss_pred --CCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc----------ccceeeecchhhhccCHHHHHHHHHH Q lcl|NC_019719. 307 --KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV----------GRIHAEHNLDGLLRGDSASRAAFMKA 374 (424) Q Consensus 307 --~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~----------~~~~~~fd~~~l~~~d~~~~~~~~~~ 374 (424) +++.+++ ..+.........+.-.++.|+..||+.|+.+.-. ...+++|+.. 
...|.+..++.+.+ T Consensus 308 ~~g~~Gs~a-~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~~--e~eDl~~~a~~~~~ 384 (512) T protein:vir:19 308 EAGDKGARS-LGEVHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFDTS--EAGDITALSDAIPK 384 (512) T ss_pred cccccchhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCC--ChhhHHHHHHHHHH Confidence 1222222 2344566677788889999999999888763311 1124455543 45677888888888 Q ss_pred HHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchhhccc------cCCCcccCC Q lcl|NC_019719. 375 MGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGT------NKEPRNNGA 424 (424) Q Consensus 375 ~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~------~~~~~~~ga 424 (424) +..+--++..++|+.+|+|.-..++....+....+-..... ...+..+.. T Consensus 385 l~~G~~i~~~~i~e~~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (512) T protein:vir:19 385 LAAGMRIPVSWIQEKLHIPQPVGDEAVFTIQPVVPDNGSQKEAALSAEDIPQEDDI 440 (512) T ss_pred HhcCCCCCHHHHHHHhCCCCCCCccccccCCCccccccccccccccccCCCchhhH Confidence 87544679999999999975444444433211111100000 000000000 No 122 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.74 E-value=6.3e-18 Score=114.88 Aligned_cols=396 Identities=12% Similarity=0.066 Sum_probs=208.8 Q ss_pred CCCCc--ccccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCc Q lcl|NC_019719. 1 MEEPK--YTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~~~--~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) |-.-. =..++.++..=...+......+...+ ......+.......... 
-...|..++.+.++|+.+|+.+-+-+ T Consensus 1 ~~~~~~a~~~~~~~~a~~~~~~~~~~g~~~~~d---~~~~~~~~~~~~~~~~~-l~~lY~~~~l~r~iVd~~a~d~~r~g 76 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVNRNDFMVGHGKANSRD---KLTRQTPGNGQKLDLKA-CENLYASNSIAMNIVDIISEDMVRAG 76 (461) T ss_pred CccchhhhhhhhhhhhhhhhHHHhhcCCcchhh---hhhccccCcccccCHHH-HHHHHHhCCccchhhccchHHhhcCC Confidence 21100 00011111111111211111111111 11111111111111111 12445667888999999999999988 Q ss_pred eEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee-CCC------------CceeeE Q lcl|NC_019719. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR-NSA------------GDVISL 145 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r-~~~------------G~~~~l 145 (424) +.+.-. +.+.. ..+...+. + . .-.+-+..++.+..++|.+++++.- +.+ +.+..+ T Consensus 77 ~~i~~~--~~~~~-----~~~~~~~~-~---l-~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~~~~ 144 (461) T protein:vir:80 77 WSLKTD--NKEMK-----KNIESKWR-K---L-KTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTIKSI 144 (461) T ss_pred eeeecC--CHHHH-----HHHHHHHH-H---h-hHHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCcccccccce Confidence 876321 11110 11222221 1 1 1234455556666789999988743 221 112233 Q ss_pred Eeec---CceEEEE---Ec------CCceEEEEEe-------------cCceEEecHhHeeEeccCCC-CccccCchHHH Q lcl|NC_019719. 146 LPLQ---SANMDVK---LV------GKKVVYRYQR-------------DSEYADFSQKEIFHLKGFGF-TGLVGLSPIAF 199 (424) Q Consensus 146 ~~l~---~~~v~~~---~~------~~~~~~~~~~-------------~~~~~~~~~~evih~r~~~~-~~~~G~s~~~~ 199 (424) ..|. +..+... .+ +....|.+.. +...+.+.++.|||+.+... +..+|.|.++. 
T Consensus 145 ~~l~~~~~~~i~~~~~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~~~~~~~G~S~le~ 224 (461) T protein:vir:80 145 PYINTFNTQKVTQLYLNQDMFSEHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLRFEGETKGRSIFES 224 (461) T ss_pred eEEEeccccccchhhhcccCcCcccccceEEEEeccccccccccccccCccceEEccccEEEecCCCCCccccCcchHHH Confidence 3332 2222211 11 1222344432 22346789999999987654 55789999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHH Q lcl|NC_019719. 200 ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDA 278 (424) Q Consensus 200 ~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~-~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~ 278 (424) +.+.+.....+......+..+...+ +++++.- ....+....+.+.++...+ ..++++++.+-+++.++.+..++ T Consensus 225 ~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~~~~~~---~~g~~~~d~~e~~e~~~~~lsgl 299 (461) T protein:vir:80 225 LYDIITVMDTSLWSVGQILYDFAFK--VYKTDDIDALNKDDKANLTAMLDFMFR---TEALAIIKGDEQLTKESTNVSGM 299 (461) T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCC--ceecchHHhhhchHHHHHHHHHHHhcC---CceEEEEcCCcceEEEecCcCCH Confidence 9999999988888888777665443 4454421 1122233344455555443 34588889888999998877765 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHH-------HHHHHHHHHHHHHHHhhccCccc---c Q lcl|NC_019719. 279 EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRWLIPAKD---V 348 (424) Q Consensus 279 ~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~-------~tl~P~~~~ie~~l~~~l~~~~~---~ 348 (424) .+..+.....||.+-+||..+|.+...+..+ +.++..+.|+. .-+.|+++.+-..+-+..+.... . 
T Consensus 300 --~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~a--sge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p 375 (461) T protein:vir:80 300 --KDLLDYGWDYLAGAVRMPKTVLKGQEAGTLT--GAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDP 375 (461) T ss_pred --HHHHHHHHHHHhhhhcCCeeeeecccCCccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCc Confidence 5788889999999999999988664443332 34545555543 34567777777666554332111 1 Q ss_pred ccceeeecchhhhccCHHHHHH-------HHHHHHhCCCCCHHHHHHHh----CCCCCCCCCeeeecccccchhhcc--c Q lcl|NC_019719. 349 GRIHAEHNLDGLLRGDSASRAA-------FMKAMGEAGLRTINEMRRTD----NLPPLPGGDVAMRQSQYVPITDLG--T 415 (424) Q Consensus 349 ~~~~~~fd~~~l~~~d~~~~~~-------~~~~~~~~g~~T~NE~R~~~----G~~p~~~gd~~~~~~n~~~~~~~~--~ 415 (424) ..+.+.|.++.|...|.+++++ .+.+++++|++|++|+|+.+ +++|. +...-......++.... . T Consensus 376 ~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 453 (461) T protein:vir:80 376 DSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENS--SKFSGDSAEIDKLAKLVYDA 453 (461) T ss_pred cccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCC--ccCCCCCchhhhhhhhcccc Confidence 1234556667888888777654 58889999999999999854 33332 11000001111111111 1 Q ss_pred cCCCcccC Q lcl|NC_019719. 416 NKEPRNNG 423 (424) Q Consensus 416 ~~~~~~~g 423 (424) .++.+.+| T Consensus 454 ~~~e~~~g 461 (461) T protein:vir:80 454 YAKKNADG 461 (461) T ss_pred ccccCCCC Confidence 22223333 No 123 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=99.68 E-value=3.6e-16 Score=105.28 Aligned_cols=397 Identities=12% Similarity=0.054 Sum_probs=224.7 Q ss_pred CC----CCc---------ccccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHH Q lcl|NC_019719. 
1 ME----EPK---------YTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCV 67 (424) Q Consensus 1 ~~----~~~---------~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i 67 (424) |- .|| -.++..+-.|....+.+....+..... . .+.. ....+..+ .+..+..+.|.+|+ T Consensus 1 m~kk~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~-~----~~iL--r~~~~~~l-y~~m~~D~hi~s~l 72 (448) T protein:vir:77 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDRE-F----DELL--QGKDGLLV-YHKMLSDGTVKNAL 72 (448) T ss_pred CCCCCCCCcccCCcccccchhhhhhhccchhhhcccccccccccc-h----hHhh--ccccchHH-HHHHhhChHHHHHH Confidence 11 111 112222222333333322211110000 0 0000 01111112 24456689999999 Q ss_pred HHHHHhhccCceEEEEecccCccccccccchhhhhhccCCC---CCCCHHHHHHHHHHHHHHcCCeEEEEeeC--CCCc- Q lcl|NC_019719. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPN---QYMTAQEFREAMTMQLCFYGNAYALVDRN--SAGD- 141 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN---~~~s~~~f~~~~~~~~l~~G~a~~~~~r~--~~G~- 141 (424) +.+...|.+++|.|-..+.+.... ....-+...|. .+. ...++.+++..+ .+.+.+|-+.+++++. .+|. T Consensus 73 ~~Rk~av~~~~w~v~p~~~~~~d~--~~ae~v~~~l~-~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~~~dg~~ 148 (448) T protein:vir:77 73 NYIFGRIRSAKWYVEPASTDPEDI--AIAAFIHAQLG-IDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKL 148 (448) T ss_pred HHHHHHHhcCCceEecCCCCHHHH--HHHHHHHHHhh-chhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEeecCCCce Confidence 999999999999986433322211 11112222232 221 123566777776 4778899999999874 3564 Q ss_pred -eeeEEeecCceE---EEEEcCCceEEEEEec--------CceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHH Q lcl|NC_019719. 142 -VISLLPLQSANM---DVKLVGKKVVYRYQRD--------SEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVA 209 (424) Q Consensus 142 -~~~l~~l~~~~v---~~~~~~~~~~~~~~~~--------~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~ 209 (424) +..|.+.++.++ .+..+++.. +....+ .....++...++|.++.....++|.+.+..+.-....... 
T Consensus 149 ~~~~l~~r~~~~~~~f~~~~~~~l~-~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~ 227 (448) T protein:vir:77 149 ILDKIVPIHPFNIDEVLYDEEGGPK-ALKLSGEVKGGSQFVNGLEIPIWKTVVFLHNDDGSFTGQSALRAAVPHWLAKRA 227 (448) T ss_pred eeccccccCCCccceeeeecCCceE-EEecCCcccccccCCCccccccceEEEEecCCcCCcccchHHHHHHHHHHHHHh Confidence 446777777543 333444332 222221 1234567788888876554558999999999999999999 Q ss_pred HHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHH Q lcl|NC_019719. 210 MEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQV 288 (424) Q Consensus 210 ~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~ 288 (424) ..++...|.+.-+.|--+.+.+.+.+ +++.++.+.+...+...+.+++ ++++.|++++-+..+....++.+..++.. T Consensus 228 ~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~--~iiP~g~~ie~~ea~~~~~~~~~~i~~~d 305 (448) T protein:vir:77 228 LILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHG--IILPDDWKFDTVDLKSAMPDAIPYLTYHD 305 (448) T ss_pred hHHHHHHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcCCceE--EEecCCceEEEEecCCCccCHHHHHHHHH Confidence 99999999999999988888876654 4566777777777776555554 66788887766655444455677888888 Q ss_pred HHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcc-----ccc--cceeeecchhhh Q lcl|NC_019719. 289 SELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-----DVG--RIHAEHNLDGLL 361 (424) Q Consensus 289 ~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~-----~~~--~~~~~fd~~~l~ 361 (424) ++|+.+..-. .+-.. .++.+++.............+.-.+++|++.||+.|+.+. +.. ..++.|+.. . 
T Consensus 306 ~~Isk~iLGq-tlTs~--~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lNfg~~~~~P~~~f~~~--e 380 (448) T protein:vir:77 306 AGIARALGID-FNTVQ--LNMGVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPGATRFPRLTFEME--E 380 (448) T ss_pred HHHHHHHhcc-ccccc--cccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCC--C Confidence 9998877443 22111 1222223333333356677778889999999998887533 111 135566544 3 Q ss_pred ccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchhhcccc-CCCcccCC Q lcl|NC_019719. 362 RGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTN-KEPRNNGA 424 (424) Q Consensus 362 ~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~~-~~~~~~ga 424 (424) ..|.+..++.+.+++ +-+|+.+|+|.-.+......+....+.....+. .+.....| T Consensus 381 ~eDl~~~a~~~~~l~-------~~~~~~~~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (448) T protein:vir:77 381 RNDFSAAANLMGMLI-------NAVKDSEDIPTELKALIDALPSKMRRALGVVDEVREAVRQPA 437 (448) T ss_pred hhhHHHHHHHhHHHH-------HHHHHHhcCCccCCcCCCCCchhcccccCCCCCCCchhhcch Confidence 467778888888876 458999999752222211112111111111111 11111111 No 124 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=99.67 E-value=6.5e-16 Score=103.84 Aligned_cols=402 Identities=11% Similarity=0.039 Sum_probs=224.3 Q ss_pred CCCCcccccCC---------CCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLR---------TNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 1 ~~~~~~~~~~~---------~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia 71 (424) +..|+=+..++ +-.|....+.+....+..... ..+.. .+..+..+ .+..+..+.|.+|++.+. 
T Consensus 5 ~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~-----~~~iL--r~~~~~~l-y~~m~~D~hi~s~l~~Rk 76 (448) T protein:vir:79 5 GRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDRE-----FDELL--QGKDGLLV-YHKMLSDGTVKNALNYIF 76 (448) T ss_pred CCCCccccCcccccccccchhhhhhhhhhcccccccccccc-----hhHhh--ccccchHH-HHHHhhChHHHHHHHHHH Confidence 22222211110 001222222211111100000 00000 00111111 244566899999999999 Q ss_pred HhhccCceEEEEecccCccccccccchhhhhhccCCCCC---CCHHHHHHHHHHHHHHcCCeEEEEeeC--CCCc--eee Q lcl|NC_019719. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQY---MTAQEFREAMTMQLCFYGNAYALVDRN--SAGD--VIS 144 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~---~s~~~f~~~~~~~~l~~G~a~~~~~r~--~~G~--~~~ 144 (424) ..|.+++|.|-..+.+....+ ...-+...|. .++.. .++.+++..+. +.+++|-+++++++. .+|. +.. T Consensus 77 ~av~~~~w~v~p~~~~~~~~~--~ae~v~~~l~-~~~~~~~~~~f~~~~~~~l-da~~~G~s~~Eivw~~~~~g~~~~~~ 152 (448) T protein:vir:79 77 GRIRSAKWYVEPASTDPEDIA--IAAFIHAQLG-IDDASVGKYPFGRLFAIYE-NAYIYGMAAGEIVLTLGADGKLILDK 152 (448) T ss_pred HHHhcCCceEecCCCCHHHHH--HHHHHHHHhh-hhhhhhccCCHHHHHHHHH-HhhhhcceeEEEEeeecCCCceeccc Confidence 999999999964333221111 1112222232 33222 23445454443 466899999999864 3564 456 Q ss_pred EEeecCce---EEEEEcCCceEEEEEe-------cCceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 145 LLPLQSAN---MDVKLVGKKVVYRYQR-------DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQ 214 (424) Q Consensus 145 l~~l~~~~---v~~~~~~~~~~~~~~~-------~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~ 214 (424) |.+.++.+ +.+..+++........ +.....++..-++|..+.....++|.+.+..+.-.........++. 
T Consensus 153 l~~r~~~~~~~f~~~~d~~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w 232 (448) T protein:vir:79 153 IVPIHPFNIDEVLYDEEGGPKALKLSGEVKGGSQFVSGLEIPIWKTVVFLHNDDGSFTGQSALRAAVPHWLAKRALILLI 232 (448) T ss_pred ccccCCccccceeeecCCceEEeecCCcccccccCCCccccccceEEEEecCccCCcccchhHHHHHHHHHHHHHHHHHH Confidence 77777763 3344444433322211 1123456778888886654445899999999999999999999999 Q ss_pred HHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 215 RDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELAR 293 (424) Q Consensus 215 ~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~ 293 (424) ..|....+.|--+.+.+.+.+ +++.++.+.+...+...+.+++ ++++.|++++-+.......++.+..++..++|+. T Consensus 233 ~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~--~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~~Isk 310 (448) T protein:vir:79 233 NHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHG--IILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIAR 310 (448) T ss_pred HHHHHHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHhcCCceE--EEecCCceEEEEecCCCcccHHHHHHHHHHHHHH Confidence 999999999988888876654 3566777777777776655554 6678888777666544445567788888888888 Q ss_pred HhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcc-----ccc--cceeeecchhhhccCHH Q lcl|NC_019719. 294 FFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-----DVG--RIHAEHNLDGLLRGDSA 366 (424) Q Consensus 294 ~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~-----~~~--~~~~~fd~~~l~~~d~~ 366 (424) +.-=. .+-.. .++.+++.............+.-.+++|+..||+.|+.+. +.. ..++.|+.. 
...|.+ T Consensus 311 ~iLGq-tlTs~--~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~lNfg~~~~~P~~~f~~~--e~~Dl~ 385 (448) T protein:vir:79 311 ALGID-FNTVQ--LNMGVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPSATRFPRLTFEME--ERNDFS 385 (448) T ss_pred HHhhh-hhccc--cccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCCCcEEEecCC--ChHHHH Confidence 76322 11111 1112222332333345567777888999999998877533 111 134555543 456778 Q ss_pred HHHHHHHHHHhCCCCCHHHHHHHhCCCC-CCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 367 SRAAFMKAMGEAGLRTINEMRRTDNLPP-LPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 367 ~~~~~~~~~~~~g~~T~NE~R~~~G~~p-~~~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ..++.+.+++..+-...+-+|+.+|+|. .|+ +....+...-+.. ...+.+.++-- T Consensus 386 ~~a~~~~~l~~~~~~~~~~~~~~~~~p~~~~~-~~~~a~~~~~~~~--~~~~~~~~~~~ 441 (448) T protein:vir:79 386 AAANLMGMLINAVKDSEDIPTELKALIDALPS-KMRRALGVVDEVR--EAVRQPADSRY 441 (448) T ss_pred HHHHHhhhhhccchhhHHHHHHhhcCCCCCCC-ccccccCCCCccc--ccccCCccccc Confidence 8899999999877555555788899984 343 2222111000000 01111111111 No 125 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.67 E-value=2.5e-16 Score=106.13 Aligned_cols=413 Identities=11% Similarity=-0.032 Sum_probs=225.1 Q ss_pred ccCCCCCc-----hHHHHHhhccCcccCcccccccccccc-------cccccCcccccHHHHhhhHHHHHHHHHHHHhhc Q lcl|NC_019719. 8 IDLRTNNG-----WWARLQSWFVGGRLVTPNQGSQTGPVS-------AHGHLGDSSINDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 8 ~~~~~~~G-----~~~~l~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia 75 (424) |+...--| -...+.+.-.++........ .+.+.. 
......-...+.+.+..++.+.+||+.+.+.+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~-~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvV 79 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGGGFGGQLR-GWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHIV 79 (530) T ss_pred CccceeecCccccchHHHhhhhcccCCCCCccc-ccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhh Confidence 33322222 12222222221110000000 000000 001111122345566778999999999888877 Q ss_pred cCceEEEEecc------cCccccccccc---hhhhhhccCCC------CCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC- Q lcl|NC_019719. 76 CLPLDVFETDQ------NDNRKKVDLSN---PLARLLRYSPN------QYMTAQEFREAMTMQLCFYGNAYALVDRNSA- 139 (424) Q Consensus 76 ~~~~~v~~~~~------~~~~~~~~~~~---~l~~lL~~~pN------~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~- 139 (424) .-.|.+.-+.+ ++.. ...... .++..+-..|+ ..++..++.+.++..++..|++|+.+.+... T Consensus 80 G~Gi~~~~~p~~~~l~~~~~~-~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~ 158 (530) T protein:vir:38 80 GSFFRLSYRPSWRYLGINEED-SRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDSDS 158 (530) T ss_pred CCCceeeeccchhhcCCCHhH-HHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeeccCC Confidence 77777653311 1110 011111 22333334444 3468899999999999999999999886544 Q ss_pred C--ceeeEEeecCceEEEEE--------------c--CCceEEEEE-e--cC----------ceEEecHhHeeEeccCC- Q lcl|NC_019719. 140 G--DVISLLPLQSANMDVKL--------------V--GKKVVYRYQ-R--DS----------EYADFSQKEIFHLKGFG- 187 (424) Q Consensus 140 G--~~~~l~~l~~~~v~~~~--------------~--~~~~~~~~~-~--~~----------~~~~~~~~evih~r~~~- 187 (424) | .+..|..|+|+.+.... | +....|.+. . .+ ....++..+|||+.... 
T Consensus 159 g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~~r 238 (530) T protein:vir:38 159 TRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEPME 238 (530) T ss_pred CCccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccChhHeEeeccccC Confidence 3 35678888887765321 1 111223222 1 11 12346677999997654 Q ss_pred CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC----------CHHHHHHHHHHHHHHhCC---- Q lcl|NC_019719. 188 FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL----------TEQQRSQVEENFKEIAGG---- 253 (424) Q Consensus 188 ~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~----------~~~~~~~~~~~~~~~~~~---- 253 (424) ....+|+|.+..+...+.......+....-.+-.+...++|+.+.+.. .+.....+........+. T Consensus 239 ~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (530) T protein:vir:38 239 DGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAA 318 (530) T ss_pred CCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchhhhhccccc Confidence 567999999999988887777666666555555666677776543321 111111111111111100 Q ss_pred ---cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHh-cCCCCCCccchhHH-----------HH Q lcl|NC_019719. 254 ---PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLV-GDVEKSTSWGSGIE-----------QQ 318 (424) Q Consensus 254 ---~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l-~~~~~~~~~~~n~e-----------~~ 318 (424) =..|.+..|..|.+++.+..+-...+|.+..+.....||+.+|||-+.| ++..+.| |+++. .. 
T Consensus 319 ~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~n--YSS~R~~~~e~~r~~~~~ 396 (530) T protein:vir:38 319 PVRLGGARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMS--YSTARASANESWAYFMGR 396 (530) T ss_pred ceeccCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccccc--HHHHHHHHHHHHHHHHHH Confidence 1245677889999999888775667888999999999999999999888 4444444 33333 33 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHhhccCccc---------cc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHH Q lcl|NC_019719. 319 NLGFLQYTLQPYISR-WENSIQRWLIPAKD---------VG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMR 387 (424) Q Consensus 319 ~~~~~~~tl~P~~~~-ie~~l~~~l~~~~~---------~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R 387 (424) +..|...-+.|+... ++.++....++-.. +. ...+.+-.-.....|+...++....++++|+.|.-|+- T Consensus 397 q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~ 476 (530) T protein:vir:38 397 RKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKEC 476 (530) T ss_pred HHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHH Confidence 334444455665554 34555444433110 00 11234444555667999999999999999999999988 Q ss_pred HHhCCCCCCC----------CCeeeec--ccccch---hhccccC--CCcccCC Q lcl|NC_019719. 388 RTDNLPPLPG----------GDVAMRQ--SQYVPI---TDLGTNK--EPRNNGA 424 (424) Q Consensus 388 ~~~G~~p~~~----------gd~~~~~--~n~~~~---~~~~~~~--~~~~~ga 424 (424) ++.|.++-+- .+++=++ ...... 
.....++ +...+|| T Consensus 477 a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~d~~~~a 530 (530) T protein:vir:38 477 AKRGDDYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAGVKKSNEEEQDGARAA 530 (530) T ss_pred HHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCCCCCCCCCCCCCCCCC Confidence 8888876421 1111111 111111 1111122 2222233 No 126 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.65 E-value=8.2e-16 Score=103.28 Aligned_cols=406 Identities=13% Similarity=0.022 Sum_probs=225.6 Q ss_pred CchHHHHHhhccCcccCc---ccc--ccc----ccccccccccCcc-------------cccHHHHhhhHHHHHHHHHHH Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVT---PNQ--GSQ----TGPVSAHGHLGDS-------------SINDERILQISTVWRCVSLIS 71 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~---~~~--~~~----~~~~~~~~~~~~~-------------~~~~~~~~~~~~v~~~i~~ia 71 (424) =+|++|+.++|....... ... ..+ ......+....+. ..+.+.+..++.+.+||+.+. T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 80 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLE 80 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 567777777765321110 000 000 0000000000100 122455667899999999777 Q ss_pred HhhccC-ceEEEEe--cccCccccccccch---hhhhhccC--CCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC---- Q lcl|NC_019719. 72 TLTACL-PLDVFET--DQNDNRKKVDLSNP---LARLLRYS--PNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA---- 139 (424) Q Consensus 72 ~~ia~~-~~~v~~~--~~~~~~~~~~~~~~---l~~lL~~~--pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~---- 139 (424) +.+-.. -+.+.-+ ..+....+ ..... ++..+-.. .+..++..++.+.++..++..|++|+.+++... 
T Consensus 81 ~nvVG~ggi~~~~~~~~~~~~~~~-~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~ 159 (502) T protein:vir:79 81 ERVVGKNGIIVEPHPVLRNGAIAR-DLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLT 159 (502) T ss_pred HhhccCCceeeeeccCCCChhHHH-HHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccC Confidence 766543 3333211 11111100 01111 22222222 223468899999999999999999999866443 Q ss_pred ---CceeeEEeecCceEEEE------------EcC--CceEEEEEe-------cCceEEecHhHeeEeccCC-CCccccC Q lcl|NC_019719. 140 ---GDVISLLPLQSANMDVK------------LVG--KKVVYRYQR-------DSEYADFSQKEIFHLKGFG-FTGLVGL 194 (424) Q Consensus 140 ---G~~~~l~~l~~~~v~~~------------~~~--~~~~~~~~~-------~~~~~~~~~~evih~r~~~-~~~~~G~ 194 (424) +.+..|..|+|+.+... .|. ....|.+.. ......+++++|+|+.... ....+|+ T Consensus 160 ~g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~~rvpA~~vlH~f~~~r~gQ~RGi 239 (502) T protein:vir:79 160 PSAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAERMLHLKFVRRLHQMRGT 239 (502) T ss_pred CCcccceEEEEecchhcCCCCCCCCeeEeeeEECCCCceEEEEEeecCCCCCcccceeEechhheEEeecccCCccccCC Confidence 24678899999777522 222 222233221 2244679999999997654 5678999 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcce-ecCCCceeeeccc Q lcl|NC_019719. 195 SPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLW-ILEAGFSTSAIGV 273 (424) Q Consensus 195 s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-~l~~g~~~~~l~~ 273 (424) |.+..+...+.......+......+-.+...++++.+.+...... ..-...-+.... -..|.++ .|..|.+++.+.. 
T Consensus 240 s~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~-~~~~~~~~~~~~-l~pG~i~~~L~pGe~i~~~~p 317 (502) T protein:vir:79 240 SLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPD-GNGSKENERELT-IQPGIIYDDLKPGEEIGMVKS 317 (502) T ss_pred chHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccc-cCCCCCcccccc-ccCCccccccCCCceeeeeCC Confidence 999999888877777666666555556667778876543211100 000000000011 1234454 4889999999887 Q ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHH-----------HHHHHHHHHHHHHH-HHHHHhh Q lcl|NC_019719. 274 TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL-----------GFLQYTLQPYISRW-ENSIQRW 341 (424) Q Consensus 274 ~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~-----------~~~~~tl~P~~~~i-e~~l~~~ 341 (424) +-....|.+..+.....||+.+|||.+.|.+--.+ +|+++-.... .|+..-++|+.+.+ +.++-.. T Consensus 318 ~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~--nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G 395 (502) T protein:vir:79 318 DRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYNG--TYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASG 395 (502) T ss_pred CCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC Confidence 65667788999999999999999999888653322 4555543333 34445566655543 4444433 Q ss_pred ccCcc---ccc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC----------CCeeeecccc Q lcl|NC_019719. 342 LIPAK---DVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG----------GDVAMRQSQY 407 (424) Q Consensus 342 l~~~~---~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~----------gd~~~~~~n~ 407 (424) .++-. ++. ...+.+-.......|+...++....++++|+.|.-|+-+..|.+|-+- .+++=++... T Consensus 396 ~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~ 475 (502) T protein:vir:79 396 VIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGRNPDDVKRRRKAEIDENRKLDLVFDT 475 (502) T ss_pred CCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCC Confidence 33211 111 123344445556679999999999999999999999988888877421 1111111111 Q ss_pred cc------hhhccccCCCcccCC Q lcl|NC_019719. 
408 VP------ITDLGTNKEPRNNGA 424 (424) Q Consensus 408 ~~------~~~~~~~~~~~~~ga 424 (424) .| .+...+.+++.+.++ T Consensus 476 ~~~~~~~~~~~~~~~~e~~~~~~ 498 (502) T protein:vir:79 476 DPASDKGGSSAATKRQEPQHTDD 498 (502) T ss_pred CCCCCCCCCCCCCCCCCCCCCCC Confidence 11 111122223333333 No 127 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.65 E-value=5e-16 Score=104.48 Aligned_cols=412 Identities=11% Similarity=0.002 Sum_probs=230.9 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCc--c-cc---ccccc-cccccc----cc-----------CcccccHHHHh Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVT--P-NQ---GSQTG-PVSAHG----HL-----------GDSSINDERIL 58 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~--~-~~---~~~~~-~~~~~~----~~-----------~~~~~~~~~~~ 58 (424) |....+.|.+--+. .++........ . .. ..... ....+. .. .-...+.+.+. T Consensus 1 ~~r~~~~~~~~dr~------i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~r 74 (505) T protein:vir:96 1 MKRAEKKPSLAQRM------VNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSI 74 (505) T ss_pred CCCCccccchhhcc------cchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHh Confidence 77777776655444 22211100000 0 00 00000 000000 00 01112345566 Q ss_pred hhHHHHHHHHHHHHhhcc-CceEEEEeccc--Cccccc--cccchhhhhhccCCCCC----CCHHHHHHHHHHHHHHcCC Q lcl|NC_019719. 59 QISTVWRCVSLISTLTAC-LPLDVFETDQN--DNRKKV--DLSNPLARLLRYSPNQY----MTAQEFREAMTMQLCFYGN 129 (424) Q Consensus 59 ~~~~v~~~i~~ia~~ia~-~~~~v~~~~~~--~~~~~~--~~~~~l~~lL~~~pN~~----~s~~~f~~~~~~~~l~~G~ 129 (424) .++.+.++|+.+.+.+-. ..+...-.... +...+. ..-..+++.+...+|.. 
++..++.+.++..++..|+ T Consensus 75 Nn~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE 154 (505) T protein:vir:96 75 NNPYAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGE 154 (505) T ss_pred cChHHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCc Confidence 789999999977766654 45554432211 110000 00112333444455533 5788899999999999999 Q ss_pred eEEEEeeCCCC-ceeeEEeecCceEEEEEc----------------C--CceEEEEEe---c----------CceEEecH Q lcl|NC_019719. 130 AYALVDRNSAG-DVISLLPLQSANMDVKLV----------------G--KKVVYRYQR---D----------SEYADFSQ 177 (424) Q Consensus 130 a~~~~~r~~~G-~~~~l~~l~~~~v~~~~~----------------~--~~~~~~~~~---~----------~~~~~~~~ 177 (424) +|+.+.+...+ .+..|..|+|+.+..-.+ . ....|.+.. + .....+++ T Consensus 155 ~f~~~~~~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~~~~rvpa 234 (505) T protein:vir:96 155 VLVREHRGYPNKWGYALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYAGQTYERVPA 234 (505) T ss_pred eEEEEeecCCCCcceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeecCCCccccccccccccccccCH Confidence 99988765433 466788888887743221 1 111232211 0 12355899 Q ss_pred hHeeEeccC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCccc Q lcl|NC_019719. 178 KEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVK 256 (424) Q Consensus 178 ~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 256 (424) ++|+|+..+ .....+|+|.+..+...+.......+......+=.+...++|+.+.+...+...+.-... ...=.. 
T Consensus 235 ~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~----~~~l~p 310 (505) T protein:vir:96 235 DEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPEDDQGEI----VEEVEA 310 (505) T ss_pred hHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCccccccCcc----ccccCC Confidence 999998654 456789999999998888777766666655555566677788765443222111100000 111134 Q ss_pred CcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhc-CCCCCCccchhHHH-----------HHHHHHH Q lcl|NC_019719. 257 KRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVG-DVEKSTSWGSGIEQ-----------QNLGFLQ 324 (424) Q Consensus 257 g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~-~~~~~~~~~~n~e~-----------~~~~~~~ 324 (424) |.+..|..|.+++.+..+-....|.+..+...+.||+.+|||.+.|. +..+.|| +.+-+ .+..|+. T Consensus 311 G~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nY--SS~R~~~~e~~r~~~~~q~~~~~ 388 (505) T protein:vir:96 311 GTYQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNF--SSLRSGELDERDLYKLLQFFVVT 388 (505) T ss_pred ceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccH--HHHHHHHHHHHHHHHHHHHHHHH Confidence 66888999999999988766678899999999999999999998884 3334443 33332 2334455 Q ss_pred HHHHHHHHHH-HHHHHhhccCccc--ccc-ceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--- Q lcl|NC_019719. 325 YTLQPYISRW-ENSIQRWLIPAKD--VGR-IHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--- 397 (424) Q Consensus 325 ~tl~P~~~~i-e~~l~~~l~~~~~--~~~-~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~--- 397 (424) .-++|+.+.+ +.++-...++-.. ... 
..+.+-.......|+...++....++++|+.|+-|+-++.|.++.+- T Consensus 389 ~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~~~v~~q 468 (505) T protein:vir:96 389 ELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDDPEDVFDE 468 (505) T ss_pred HHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHH Confidence 6677766653 4444444433211 111 23455455556679999999999999999999999988888877421 Q ss_pred -------CCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 398 -------GDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 -------gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) .+++=++.+..+........+.+++++ T Consensus 469 ~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~ 502 (505) T protein:vir:96 469 IAWEEQLMRDKGVNPTPPEQESKDATTDEEDDSA 502 (505) T ss_pred HHHHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCC Confidence 111111111111111111111111222 No 128 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.59 E-value=1e-14 Score=97.28 Aligned_cols=419 Identities=10% Similarity=0.007 Sum_probs=225.5 Q ss_pred CCCCccc--ccCCCCCchHHHHHhhccCcccCcccccccccccc-------cccccCcccccHHHHhhhHHHHHHHHHHH Q lcl|NC_019719. 1 MEEPKYT--IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVS-------AHGHLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 1 ~~~~~~~--~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~v~~~i~~ia 71 (424) |.-|-.- ...+-..-. ..+.+.-.++..... ....+.+.. ......-...+.+.+..++.+..||+.+. 
T Consensus 1 ~~~p~~~~~~~~~~~~~~-~~~~~y~~~a~~~~~-~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 78 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSL-REYAGYHGGGSGFGG-QLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQ 78 (533) T ss_pred CCCchhhhhhcccccchH-HHHHhhhhccCCCCC-cccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 3322111 000000000 001111111100000 000000000 00001111223455667899999999888 Q ss_pred HhhccCceEEEEecc------cCccccccccc---hhhhhhccCCC------CCCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_019719. 72 TLTACLPLDVFETDQ------NDNRKKVDLSN---PLARLLRYSPN------QYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) Q Consensus 72 ~~ia~~~~~v~~~~~------~~~~~~~~~~~---~l~~lL~~~pN------~~~s~~~f~~~~~~~~l~~G~a~~~~~r 136 (424) +.+-.-.|.+.-..+ ++. ....... .++..+-..++ ..++..++.+.++..++..|++|+.+.+ T Consensus 79 ~nvVG~Gi~~~~~p~~~~lg~~~~-~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~ 157 (533) T protein:vir:34 79 DHIVGSFFRLSHRPSWRYLGIGEE-EARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATW 157 (533) T ss_pred HHhhCCCceeeeccchhhcCCChh-HHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeee Confidence 887666776643211 111 0001111 22333333443 3458899999999999999999999876 Q ss_pred CCC-C--ceeeEEeecCceEEEEEc----------------CCceEEEEEe---cC----------ceEEecHhHeeEec Q lcl|NC_019719. 137 NSA-G--DVISLLPLQSANMDVKLV----------------GKKVVYRYQR---DS----------EYADFSQKEIFHLK 184 (424) Q Consensus 137 ~~~-G--~~~~l~~l~~~~v~~~~~----------------~~~~~~~~~~---~~----------~~~~~~~~evih~r 184 (424) ... | .+..|..|+|+.+..-.+ +....|.+.. ++ ....++..+|||+. 
T Consensus 158 ~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f 237 (533) T protein:vir:34 158 DTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVF 237 (533) T ss_pred ccCCCCccceEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeeccChhHeeeec Confidence 544 2 356788888877653221 1112233221 11 12336688999997 Q ss_pred cC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC----------CHHHHHHHHHHHHH---H Q lcl|NC_019719. 185 GF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL----------TEQQRSQVEENFKE---I 250 (424) Q Consensus 185 ~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~----------~~~~~~~~~~~~~~---~ 250 (424) .. .....+|+|.+..+...+.......+......+-.+...++++.+.+.. .+...+.+...... . T Consensus 238 ~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (533) T protein:vir:34 238 EPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAY 317 (533) T ss_pred cccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhc Confidence 55 4667999999999988888777766666555555666677777553311 11111111111111 1 Q ss_pred hCC----cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcC-CCCCCccchhH---------- Q lcl|NC_019719. 
251 AGG----PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGD-VEKSTSWGSGI---------- 315 (424) Q Consensus 251 ~~~----~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~-~~~~~~~~~n~---------- 315 (424) .++ =..|.+..|..|.+++.+..+-....|.+..+.....||+.+|||-+.|.+ ..+.|| +++ T Consensus 318 ~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nY--SS~R~~~~e~~r~ 395 (533) T protein:vir:34 318 YAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSY--STARASANESWAY 395 (533) T ss_pred cCcceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccH--HHHHHHHHHHHHH Confidence 110 124567788999999988877666788999999999999999999988843 334443 343 Q ss_pred -HHHHHHHHHHHHHHHHHHH-HHHHHhhccC-ccc--------cc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCH Q lcl|NC_019719. 316 -EQQNLGFLQYTLQPYISRW-ENSIQRWLIP-AKD--------VG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTI 383 (424) Q Consensus 316 -e~~~~~~~~~tl~P~~~~i-e~~l~~~l~~-~~~--------~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~ 383 (424) +..+..|...-++|+.+.+ +.++....++ +.. +. ...+.+-.......|+...++....++++|+.|. T Consensus 396 ~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~ 475 (533) T protein:vir:34 396 FMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTY 475 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCH Confidence 2333445555667766654 4444443332 110 01 1234555556667799999999999999999999 Q ss_pred HHHHHHhCCCCCCC----------CCeeeecccccc--hhh---ccccCCCcccCC Q lcl|NC_019719. 384 NEMRRTDNLPPLPG----------GDVAMRQSQYVP--ITD---LGTNKEPRNNGA 424 (424) Q Consensus 384 NE~R~~~G~~p~~~----------gd~~~~~~n~~~--~~~---~~~~~~~~~~ga 424 (424) -|+-++.|.++-+- .+++=++....+ ... 
..+.++++++++ T Consensus 476 ~~~~a~~G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~~~~~~~~~~~ 531 (533) T protein:vir:34 476 EKECAKRGDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSDSR 531 (533) T ss_pred HHHHHHcCCCHHHHHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCCCCCCCcccCC Confidence 99988888877421 122111111111 111 112223333333 No 129 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=99.59 E-value=2.7e-14 Score=95.01 Aligned_cols=409 Identities=11% Similarity=0.051 Sum_probs=213.1 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |-+-.- -...-==.||.+.++-+....... ....+.....+.....+ .+..++.+.|.+|++.+...|.+++|. T Consensus 1 ~~~~~~----~~~gl~p~rl~~i~~~~~~~~~~~-~~~~~~~~Lr~~~~~~l-y~~m~~D~hi~s~l~~Rk~av~~~~w~ 74 (488) T protein:vir:95 1 MADITE----TQESLPPFRMGEVGSLGLKVKNGR-IYEEPRQALRFPESIKT-FQLMMRDPAVAASVNIIKMFVRKVNWR 74 (488) T ss_pred CCCccc----cCCCCCHHHHHHHHHHhhccccch-hhccchhhhcccchHHH-HHHHhhChHHHHHHHHHHHHHhcCCce Confidence 322111 111111133433332221111100 00000000011111112 344566899999999999999999999 Q ss_pred EEEecccCcccc-ccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC-------------CCc--eee Q lcl|NC_019719. 81 VFETDQNDNRKK-VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS-------------AGD--VIS 144 (424) Q Consensus 81 v~~~~~~~~~~~-~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~-------------~G~--~~~ 144 (424) |...+......+ .....-+...|..- ..++.+++..+. +.+++|-+.+++++.. +|. +.. 
T Consensus 75 v~p~~~~~~d~~~~~~a~~v~~~l~~~---~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~ 150 (488) T protein:vir:95 75 FVPPKGKEQDPKMLERADFFNSLMDDM---EHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAK 150 (488) T ss_pred EecCCCCchhHHHHHHHHHHHHHHhcc---CccHHHHHHHHH-HhhcccceeeeeeeeccccccccccccccCCeeeeee Confidence 964432221111 00111222333221 234666777665 5678999999998753 232 556 Q ss_pred EEeecCc---eEEEEEcCCceEEEEE---------------ecCceEEecHhHeeEeccC-CCCccccCchHHHHHHHHH Q lcl|NC_019719. 145 LLPLQSA---NMDVKLVGKKVVYRYQ---------------RDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAG 205 (424) Q Consensus 145 l~~l~~~---~v~~~~~~~~~~~~~~---------------~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~ 205 (424) |.+.++. ++.+..+++....... .......+++...++.++. ....++|.+.+..+.-... T Consensus 151 i~~Rpq~~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~~~g~p~g~gLlr~~~w~~~ 230 (488) T protein:vir:95 151 LPIRNQSTLDKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDDEYGNPEGRSPLLNAYVPWK 230 (488) T ss_pred eeecCcccccceeeccCCCceeecccccccccccccccccccccccccccccceEEEeecCCCCccchhhHHHHHHHHHH Confidence 6666664 3333333332211100 0112345777776655544 4456899999999999999 Q ss_pred HHHHHHHHHHHHHhccCCCceeEEcCCC----CCCHHHHHH---HHHHHHHHhCCcccCcceecCCCceee--------- Q lcl|NC_019719. 206 VAVAMEDQQRDFFANGAKSPQILSTGEK----VLTEQQRSQ---VEENFKEIAGGPVKKRLWILEAGFSTS--------- 269 (424) Q Consensus 206 ~~~~~~~~~~~~~~n~~~p~~vl~~~~~----~~~~~~~~~---~~~~~~~~~~~~~~g~~~~l~~g~~~~--------- 269 (424) ......++...|....+.|--+...+.+ ..+++.... +++......++..+| ++++.|++.. 
T Consensus 231 fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag--~iiP~g~~~~~k~~~~e~~ 308 (488) T protein:vir:95 231 YKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAG--LIWPRYIDPDTKEDIFEFS 308 (488) T ss_pred HHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhh--eeeccccccccchhhhhhh Confidence 9999999999998875544444444322 223332222 222333333333344 4556555322 Q ss_pred ecccC-hhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcc-- Q lcl|NC_019719. 270 AIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-- 346 (424) Q Consensus 270 ~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~-- 346 (424) -++.. ..-..|.++.++..++|+.+.--.- |....++++++ ...+.........+.-.+++|++.||+.|+.+. T Consensus 309 l~~~~~~~~~~~~~li~~~d~~Isk~iLGqt--LT~~~~~~Gs~-Al~~vh~ev~~~i~~aDa~~i~~tln~~li~~l~~ 385 (488) T protein:vir:95 309 LVSRQGAKAYDTGSIIDRYSKQIMMAFMSDV--LAMGQSKYGSF-SLADSKTSLLAMSVDILLKQIKNVINRDLVAQTYA 385 (488) T ss_pred ccccccCCchhHHHHHHHHHHHHHHHHhccc--cccccCcchhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22221 2223466777788888888763321 11111112222 233455666777788889999999998877542 Q ss_pred ---ccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCH-----HHHHHHhCCCCCCCCCeeeecccccchhhccccCC Q lcl|NC_019719. 347 ---DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTI-----NEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKE 418 (424) Q Consensus 347 ---~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~-----NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~~~~ 418 (424) +....+.+|-++.....|.+..++.+.+++..|+.-+ +.+|+.+|+|+-+++.....+....+....++... T Consensus 386 ~Nfg~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~~~e~~~~~~~~~~~~~~~~~~~ 465 (488) T protein:vir:95 386 LNMWDDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPADESQPVSEKLSPNSQSRSGDGYK 465 (488) T ss_pred hcCCCCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCccccccCCCCCCCCCCcccC Confidence 1112223343444556788899999999999998765 57999999997655555544432222212111111 Q ss_pred CcccCC Q lcl|NC_019719. 
419 PRNNGA 424 (424) Q Consensus 419 ~~~~ga 424 (424) ...+++ T Consensus 466 ~~~~~~ 471 (488) T protein:vir:95 466 TAGEGT 471 (488) T ss_pred CCcccC Confidence 111111 No 130 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=99.58 E-value=1.1e-14 Score=97.13 Aligned_cols=373 Identities=12% Similarity=0.027 Sum_probs=213.6 Q ss_pred ccccCCCCC--chHHHHHhhccCcccCcccccccccccccccccCcc-----cccHHHHhhhHHHHHHHHHHHHhhccCc Q lcl|NC_019719. 6 YTIDLRTNN--GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDS-----SINDERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 6 ~~~~~~~~~--G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) ..|++||.- -+-..+...-.+..... + +..+-......++. .+..+-..+.+.|.+|++.+...|.+++ T Consensus 1 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~---g-~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~~ 76 (446) T protein:vir:98 1 MNMEVRNAPTPAIRRRTIYAMEHLGLAT---S-YLSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNKV 76 (446) T ss_pred CcccccCCCchhhhhhhhhccccchhhc---c-cCCcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcCC Confidence 567777732 12222211100000000 0 00000000111111 1112222357999999999999999999 Q ss_pred eEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-C--ce-------eeEEee Q lcl|NC_019719. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-G--DV-------ISLLPL 148 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~-G--~~-------~~l~~l 148 (424) |.|...+ ++. ..-+...|.... .++....+.+.+.+|-++.++++... 
| .| +.+.|+ T Consensus 77 w~V~p~~-----~~~--a~~v~~~l~~~~------~~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~~~~ 143 (446) T protein:vir:98 77 GPYQHGD-----KRI--KKFIDDQLRNRA------KTWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNYHPL 143 (446) T ss_pred ceecCcc-----HHH--HHHHHHHHhhcC------chhHHHHHHHHHhhCceeeeEEEeecccccccchhhccccccccc Confidence 9995321 111 123445554332 34455557888999999999987432 1 11 123333 Q ss_pred cCceEEEEEcCCc-----eE--------------E-------EEEecCceEEecHhHeeEeccCC-CCccccCchHHHHH Q lcl|NC_019719. 149 QSANMDVKLVGKK-----VV--------------Y-------RYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFAC 201 (424) Q Consensus 149 ~~~~v~~~~~~~~-----~~--------------~-------~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~ 201 (424) ++.. ....++.. .. + .....+....+|...+++.++.. ...++|.|.+..+. T Consensus 144 ~~r~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi~~~~~~~~~~p~G~gLlr~~~ 222 (446) T protein:vir:98 144 QVML-IANDNGRIVDGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRLFINYNTKGNNPWGTSCLTSVL 222 (446) T ss_pred ccee-eeccCCccccccccchhhcccccccCcccchhhhhhhhcccCcccccccccceEEEEecCCCCCccccchHHHHH Confidence 3321 11111100 00 0 00112233557888888887765 44589999999999 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHH--------HHHH-HHHHHHHhCCc-ccCcce---ecCCCcee Q lcl|NC_019719. 202 KSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ--------RSQV-EENFKEIAGGP-VKKRLW---ILEAGFST 268 (424) Q Consensus 202 ~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~--------~~~~-~~~~~~~~~~~-~~g~~~---~l~~g~~~ 268 (424) -.........++...|....+.|--+.+.+.+.++++. .+.. ++..++..... 
+++.++ .+|.|+++ T Consensus 223 w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g~ei 302 (446) T protein:vir:98 223 DYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQPVQV 302 (446) T ss_pred HHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCCceE Confidence 99999999999999999999999988888776543221 1112 22333333322 233232 23889888 Q ss_pred eecccChh-HHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccc Q lcl|NC_019719. 269 SAIGVTPQ-DAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKD 347 (424) Q Consensus 269 ~~l~~~~~-d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~ 347 (424) +-++.... ...|.+..++..++|+.+.....-.++...+.+.++ +..+.........++-.+++|++.||+.|+.+.- T Consensus 303 e~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~-ala~vh~~V~~d~~~aDa~~i~~tln~~Li~~l~ 381 (446) T protein:vir:98 303 GALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTG-RASEIQLELFDGKINSIFDTVIHAFTEQVIGNLI 381 (446) T ss_pred EeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 76654322 235888899999999999877644444332222222 2233344566677888999999999988764321 Q ss_pred c-----c-------cceeeecchhhhccCHHHHHHHHHHHHhCCCCCH---HHHHHHhCCCCCCCCCe Q lcl|NC_019719. 348 V-----G-------RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTI---NEMRRTDNLPPLPGGDV 400 (424) Q Consensus 348 ~-----~-------~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~---NE~R~~~G~~p~~~gd~ 400 (424) . . ..+++|++.+ ..|.+..++.+.++++.|+.++ +.+|+.+|+|+-+. 
|+ T Consensus 382 ~lNf~~~~~~~~~~~~~~~~~~~e--~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~-~~ 446 (446) T protein:vir:98 382 RLNFDPALYPLASNTGYITRLPGR--ATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAIS-ST 446 (446) T ss_pred HhCCCccccccccccccceeccCC--hhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCC-CC Confidence 0 0 0123444332 4677889999999999998765 45999999986422 22 No 131 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.54 E-value=5.9e-15 Score=98.58 Aligned_cols=405 Identities=13% Similarity=0.005 Sum_probs=218.6 Q ss_pred CchHHHHHhhccCcccCc-----cccccc----cc-ccccccc-c-----------CcccccHHHHhhhHHHHHHHHHHH Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVT-----PNQGSQ----TG-PVSAHGH-L-----------GDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~-----~~~~~~----~~-~~~~~~~-~-----------~~~~~~~~~~~~~~~v~~~i~~ia 71 (424) =+||+|+.++|....... .....+ .. ....+.. . .-...+.+.+..++.+..||+.+. T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 80 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLE 80 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 578888888775332110 000000 00 0000000 0 001123344566788999999876 Q ss_pred Hhhcc-CceEEEEe--cccCcccccccc---chhhhhhccCCC--CCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC---- Q lcl|NC_019719. 72 TLTAC-LPLDVFET--DQNDNRKKVDLS---NPLARLLRYSPN--QYMTAQEFREAMTMQLCFYGNAYALVDRNSA---- 139 (424) Q Consensus 72 ~~ia~-~~~~v~~~--~~~~~~~~~~~~---~~l~~lL~~~pN--~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~---- 139 (424) +.+-. .-+.+... ..++...+ ... ..++..+-..+. ..++..++.+.++..++..|++|+.+.+... 
T Consensus 81 ~nvVG~~G~~i~p~~l~~d~~~a~-~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~ 159 (548) T protein:vir:95 81 ERVVGGSGIGVEPLPLRLDGSVHA-ELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYT 159 (548) T ss_pred HhccCccccceeeeecCCCHHHHH-HHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeeccccccc Confidence 55543 22332221 11111000 000 112333333332 2367999999999999999999998876432 Q ss_pred ---CceeeEEeecCceEEEEEc-------------C--CceEEEEEe-----------cCceEEecHhHeeEeccCC-CC Q lcl|NC_019719. 140 ---GDVISLLPLQSANMDVKLV-------------G--KKVVYRYQR-----------DSEYADFSQKEIFHLKGFG-FT 189 (424) Q Consensus 140 ---G~~~~l~~l~~~~v~~~~~-------------~--~~~~~~~~~-----------~~~~~~~~~~evih~r~~~-~~ 189 (424) ..+..|..|+|+.+..-.+ . ....|.+.. ......+++++|+|+.... .. T Consensus 160 ~g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~~~~~~rvpA~~VlHif~~~r~g 239 (548) T protein:vir:95 160 FATSVPFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHLLKDHPGNLQTLGGSLAVKRVEAERIIHIAYRKRIG 239 (548) T ss_pred CCcccceEEEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEEeecCCCcccccccccceeeechhHheecccccCCc Confidence 2466888898877743221 1 111222211 1234679999999997554 56 Q ss_pred ccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcce-ecCCCcee Q lcl|NC_019719. 190 GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLW-ILEAGFST 268 (424) Q Consensus 190 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-~l~~g~~~ 268 (424) ..+|+|.+..+...+.......+......+=.+...++++.+.+...... .....-..... 
-..|.++ .|..|.++ T Consensus 240 Q~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~~--~~~~~~~~~~~-~~pG~iv~~L~pGe~i 316 (548) T protein:vir:95 240 QNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTVE--PGKDRKNRTIP-IAPGMVFDDLEPGEDV 316 (548) T ss_pred cccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccCC--CCccccccccc-ccCCccccccCCCcee Confidence 78999999999888877777666665555556666777776533211100 00000000011 1134444 47889898 Q ss_pred eecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHH-----------HHHHHHHHHHHHHHH-HH Q lcl|NC_019719. 269 SAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN-----------LGFLQYTLQPYISRW-EN 336 (424) Q Consensus 269 ~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~-----------~~~~~~tl~P~~~~i-e~ 336 (424) +.+..+-....|.+..+.....||+.+|||-+.|..-..+ +|+++-... ..|+..-++|+...+ +. T Consensus 317 ~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s~--nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~ 394 (548) T protein:vir:95 317 GMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYDG--TYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQM 394 (548) T ss_pred eecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccch--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8888765567789999999999999999999888553322 455544333 234455566655543 44 Q ss_pred HHHhhccC-cc--ccc-cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC----------CCeee Q lcl|NC_019719. 337 SIQRWLIP-AK--DVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG----------GDVAM 402 (424) Q Consensus 337 ~l~~~l~~-~~--~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~----------gd~~~ 402 (424) ++-.-.++ +. ++. 
...+.+-.-.....|+...++....++++|+.|.-|+-++.|.++-+- .+++= T Consensus 395 a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~G 474 (548) T protein:vir:95 395 YLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARARGRDPRELKKSRETEIKANRAAG 474 (548) T ss_pred HHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcC Confidence 44333332 11 111 123333334455679999999999999999999988877777766320 01111 Q ss_pred ecccccch----hhccc-----cCCC----------------------------------cccCC Q lcl|NC_019719. 403 RQSQYVPI----TDLGT-----NKEP----------------------------------RNNGA 424 (424) Q Consensus 403 ~~~n~~~~----~~~~~-----~~~~----------------------------------~~~ga 424 (424) ++....|. ....+ +++. +++|| T Consensus 475 L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 539 (548) T protein:vir:95 475 LVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLPVPGPDFPNESNNGGA 539 (548) T ss_pred CCCCCcccccccccccCCCCchhhhccccccccccchhHHhhccCCCCCcCCCCCCCcccccCCC Confidence 11111110 00000 0000 01111 No 132 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.54 E-value=1.3e-14 Score=96.80 Aligned_cols=405 Identities=11% Similarity=0.039 Sum_probs=210.7 Q ss_pred ccCCCCCchHHHHHh--------hccCcccCccccccccccccccc--------ccCcccccHHHHhhhHHHHHHHHHHH Q lcl|NC_019719. 8 IDLRTNNGWWARLQS--------WFVGGRLVTPNQGSQTGPVSAHG--------HLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 8 ~~~~~~~G~~~~l~~--------~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~v~~~i~~ia 71 (424) |.+--+ |+...-.. .+.+..... . ...+...+ ...-...+.+.+..++.+..||+.+. 
T Consensus 1 m~~~~~-~~~a~~~~~~~~~~~~~y~aa~~~~--~---~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 74 (495) T protein:vir:10 1 MNMTPS-GYQSLASGLLVPVGASAYEGASGGH--R---WQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWV 74 (495) T ss_pred CCcccc-cccccchhhhhHHHhhhhhccccCc--c---cCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 555433 44321110 111111100 0 00110000 00011123455666899999999888 Q ss_pred HhhccCceEEEEecccCccccccccchhhhhhccCC--CCCCCHHHHHHHHHHHHHHcCCeEEEEeeC--CCC--ceeeE Q lcl|NC_019719. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSP--NQYMTAQEFREAMTMQLCFYGNAYALVDRN--SAG--DVISL 145 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~p--N~~~s~~~f~~~~~~~~l~~G~a~~~~~r~--~~G--~~~~l 145 (424) +.+-.-.|+..-...+....+. -..++..+...+ ...++..++.+.++..++..|++|+.+... .+| .+..| T Consensus 75 ~~vVG~Gi~p~~~~~~~~~~~~--ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~l 152 (495) T protein:vir:10 75 AAAVGNGLTPRWRMKEQELRQE--LQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQL 152 (495) T ss_pred HhhcCCCcccccCCchHHHHHH--HHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEE Confidence 8876556554333222111111 112333333333 234688999999999999999999987654 333 46788 Q ss_pred EeecCceEEEEEc-------------------CCceEEEEEe-----------cCceEEecHhHeeEeccCCCCccccCc Q lcl|NC_019719. 146 LPLQSANMDVKLV-------------------GKKVVYRYQR-----------DSEYADFSQKEIFHLKGFGFTGLVGLS 195 (424) Q Consensus 146 ~~l~~~~v~~~~~-------------------~~~~~~~~~~-----------~~~~~~~~~~evih~r~~~~~~~~G~s 195 (424) ..|+|+.+..-.+ +....|.+.. 
......+++++|+|+........+|+| T Consensus 153 qliepd~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~~r~gQ~RGis 232 (495) T protein:vir:10 153 QIIEPDMLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAEHVLHVTVLTVRSDAGAP 232 (495) T ss_pred EEechhhcCCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechhheEeccccCCCcccCcc Confidence 8898888752111 1112233211 113466999999999644456689998 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHH-H--HHHHHHHHHHhCCcccCcceecCCCceeeecc Q lcl|NC_019719. 196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ-R--SQVEENFKEIAGGPVKKRLWILEAGFSTSAIG 272 (424) Q Consensus 196 ~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~-~--~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~ 272 (424) .+.. ...+.......+......+-.+...++++.+.+...... . ......-... ..-..|.+..|..|.+++.++ T Consensus 233 ~la~-i~~l~~l~~y~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~pG~i~~L~pGe~i~~~~ 310 (495) T protein:vir:10 233 WFQL-LLRLNELDQYEDAELVRKKTAALFAAFIQEATADSTGGPTIGQPKRSKGGKRI-TGLNPGTLQYLQPGQEVKFSN 310 (495) T ss_pred hhHH-HHHHHHhhHHHHHHHHHHHHhhhheeeeecCCCccccccccCccccccCcccc-eecCCceeeecCCCCeeeeeC Confidence 6543 333443333333333333344555667765432111000 0 0000000000 111345788899999999988 Q ss_pred cChhHHHHHHHHHHHHHHHHHHhCCCHHHhc-CCCCCCccchhHHHHHHH------------HHHHHHHHHHHHH-HHHH Q lcl|NC_019719. 273 VTPQDAEMMASRKFQVSELARFFGVPPHLVG-DVEKSTSWGSGIEQQNLG------------FLQYTLQPYISRW-ENSI 338 (424) Q Consensus 273 ~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~-~~~~~~~~~~n~e~~~~~------------~~~~tl~P~~~~i-e~~l 338 (424) .+-....|.+..+.....||+.+|||.+.|. +..+.|| +++-+.... 
++..-++|+.+.+ +.++ T Consensus 311 p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nY--SS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~ 388 (495) T protein:vir:10 311 PADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNY--SSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAV 388 (495) T ss_pred CCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7755667889999999999999999999884 4444443 344332222 2333444544443 3343 Q ss_pred HhhccC-ccc--ccc--ceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC----------CCeeee Q lcl|NC_019719. 339 QRWLIP-AKD--VGR--IHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG----------GDVAMR 403 (424) Q Consensus 339 ~~~l~~-~~~--~~~--~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~----------gd~~~~ 403 (424) -.-.++ +.. ... ..+++-.......|+...++....++++|+.|+-|+-++.|.++-+- .+++=+ T Consensus 389 l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl 468 (495) T protein:vir:10 389 ASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGYDMEELFDMISDANQLIDEYDL 468 (495) T ss_pred HcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCC Confidence 332222 111 111 12344445556679999999999999999999999888888877421 111111 Q ss_pred ccc--ccchhhccccCCCcccCC Q lcl|NC_019719. 404 QSQ--YVPITDLGTNKEPRNNGA 424 (424) Q Consensus 404 ~~n--~~~~~~~~~~~~~~~~ga 424 (424) +.+ ..++...+..+++..+.+ T Consensus 469 ~~~~~p~~~~~~~~~~~~~~~~~ 491 (495) T protein:vir:10 469 RLDSDPRYVNGSGAEQKSVMEAA 491 (495) T ss_pred CCCCCCCcCCCccCCCCCCCCCC Confidence 111 111112222222222222 No 133 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=99.47 E-value=1.9e-13 Score=90.29 Aligned_cols=287 Identities=14% Similarity=0.046 Sum_probs=171.3 Q ss_pred EEEEeeCCC-C--ceeeEEeecCceEE---EEEcCCceEEEEE--ecCceEEecHhHeeEeccCC-CCccccCchHHHHH Q lcl|NC_019719. 
131 YALVDRNSA-G--DVISLLPLQSANMD---VKLVGKKVVYRYQ--RDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFAC 201 (424) Q Consensus 131 ~~~~~r~~~-G--~~~~l~~l~~~~v~---~~~~~~~~~~~~~--~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~ 201 (424) +.++++... | .|..|.+.|+.++. +..+++...+... .+.....+++...|+.++.. ...++|.+.+..+. T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~~g~~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~ 80 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPDGGLVAIEQWGVFGKATVRIPVDRLVVFVNEREGANWLGQSLLRQAY 80 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccCCceeEEEecCCCCCCcceeccCCEEEEEeCCCCCCccchhhHHHHH Confidence 667776544 3 36778888887554 2333333222222 22345678888877666554 45589999999999 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCceeEEcCCC--CCC----------HHHHHHHHHHHHHHhCCcccCcceecCCCceee Q lcl|NC_019719. 202 KSAGVAVAMEDQQRDFFANGAKSPQILSTGEK--VLT----------EQQRSQVEENFKEIAGGPVKKRLWILEAGFSTS 269 (424) Q Consensus 202 ~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~--~~~----------~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~ 269 (424) -.........++...|....+.|--+...+.+ ..+ .+.++.+.+..+....+..+ .++++.|++++ T Consensus 81 w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a--~~iip~g~~ie 158 (355) T protein:vir:78 81 KNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAA--GGYIPHGANFT 158 (355) T ss_pred HHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcce--eEeecCCceEE Confidence 99999999999999999987544444444432 211 12233444444555444433 57788888877 Q ss_pred ecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcc--- Q lcl|NC_019719. 270 AIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK--- 346 (424) Q Consensus 270 ~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~--- 346 (424) -+..+....++.++.++..++|+.++.-. .+-....+++.++ ...+.........+.-.+..|++.||+.|+... 
T Consensus 159 ~~ea~g~~~~~~~~i~~~d~~Isk~iLGq-tlTs~~~~~gGS~-Alg~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l 236 (355) T protein:vir:78 159 LTGVQGKLPEMDGPIRYHDEQIARAVLAH-FLTLGGDKSTGSY-ALGDTFASFFTGSLNAVMKHIADVTQQHVVEDLVDQ 236 (355) T ss_pred EeecCCCcccHHHHHHHHHHHHHHHHhhh-hhccccCCccchh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 77665555668888899999999887544 2222111111222 223445577777788888889999988776532 Q ss_pred --cc--ccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHH-----HHHHHhCCCCCCCCCeeeecccc-cchhhcccc Q lcl|NC_019719. 347 --DV--GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTIN-----EMRRTDNLPPLPGGDVAMRQSQY-VPITDLGTN 416 (424) Q Consensus 347 --~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~N-----E~R~~~G~~p~~~gd~~~~~~n~-~~~~~~~~~ 416 (424) +. ...++.|+ ... .+.+..++.+.+++..|+.-++ .+|+.+|+|+-+.+++...+... .+....... T Consensus 237 N~~~~~~~P~~~~~--~~~-~~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~p~~~~~~~~~~~~~~~~~~~~~~ 313 (355) T protein:vir:78 237 NWGPEEPAPRLVPA--QLG-KEQPVTAEAIRALVECGAFTADPELEKDLRARYGLPAPAERDDGADAAAAKAAGRRRAKR 313 (355) T ss_pred cCCCCCCCCEEEec--CcC-hhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCCcccCCccccccccccccc Confidence 11 11234453 332 3556789999999999998764 47999999865555544433221 111111111 Q ss_pred CCCcccC----C Q lcl|NC_019719. 417 KEPRNNG----A 424 (424) Q Consensus 417 ~~~~~~g----a 424 (424) ..+...+ | T Consensus 314 ~~~~~~~~~~~a 325 (355) T protein:vir:78 314 LPGQRQGAALPS 325 (355) T ss_pred cCCccccccccc Confidence 1111111 1 No 134 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.46 E-value=1.5e-13 Score=90.82 Aligned_cols=408 Identities=12% Similarity=0.021 Sum_probs=216.4 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccc---ccc--c---c-cccccccc-------------CcccccHHHHh Q lcl|NC_019719. 
1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQ---GSQ--T---G-PVSAHGHL-------------GDSSINDERIL 58 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~---~~~--~---~-~~~~~~~~-------------~~~~~~~~~~~ 58 (424) |- +-.- |..+.+..+....... ..+ . . ....+... .-...+.+.+. T Consensus 1 m~------~~~~------r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~r 68 (553) T protein:vir:63 1 MT------KVTV------RKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMAD 68 (553) T ss_pred Cc------chhh------hhhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHh Confidence 10 0000 0001111110000000 000 0 0 00000000 01112345566 Q ss_pred hhHHHHHHHHHHHHhhccCceEEEEeccc----Ccccc--cccc---chhhhhhccCCC------CCCCHHHHHHHHHHH Q lcl|NC_019719. 59 QISTVWRCVSLISTLTACLPLDVFETDQN----DNRKK--VDLS---NPLARLLRYSPN------QYMTAQEFREAMTMQ 123 (424) Q Consensus 59 ~~~~v~~~i~~ia~~ia~~~~~v~~~~~~----~~~~~--~~~~---~~l~~lL~~~pN------~~~s~~~f~~~~~~~ 123 (424) .++.+.++|+.+.+.+-.-.|...-.... |...+ .... ..+++.+.+.++ ..++..++...++.. T Consensus 69 Nn~~a~~av~~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~ 148 (553) T protein:vir:63 69 NDGFTNGAVGYQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVG 148 (553) T ss_pred cChHHHHHHHHHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHH Confidence 68999999998887777667766433110 00000 0011 122333444443 345788999999999 Q ss_pred HHHcCCeEEEEeeCCC-C--ceeeEEeecCceEEEEEcC----------------CceEEEEEe---cC----------- Q lcl|NC_019719. 124 LCFYGNAYALVDRNSA-G--DVISLLPLQSANMDVKLVG----------------KKVVYRYQR---DS----------- 170 (424) Q Consensus 124 ~l~~G~a~~~~~r~~~-G--~~~~l~~l~~~~v~~~~~~----------------~~~~~~~~~---~~----------- 170 (424) ++..|++|+.+.+... | .+..|..|+|+.+....+. ....|.+.. +. 
T Consensus 149 ~~~dGE~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~ 228 (553) T protein:vir:63 149 YVKTGEVLATAEWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKW 228 (553) T ss_pred HHhCCceEEEeeeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCccccccccccce Confidence 9999999998875433 2 3567888888777543221 111222211 00 Q ss_pred ----ceEEecHhHeeEeccC-CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019719. 171 ----EYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 171 ----~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~ 245 (424) ....++..+|||+... .....+|+|.+..+...+.......+......+=.+...++|+.+.+. +...+.+.. T Consensus 229 ~r~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~~--~~~~~~~~~ 306 (553) T protein:vir:63 229 KFVQQSKPWGRRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELPP--EFIHSQMSG 306 (553) T ss_pred eeeccccccChhHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCh--hhhhhhccc Confidence 1235789999998655 456799999999998888777766666655555566667777755321 111111110 Q ss_pred ----------------HHHHHhCC-----cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhc- Q lcl|NC_019719. 246 ----------------NFKEIAGG-----PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVG- 303 (424) Q Consensus 246 ----------------~~~~~~~~-----~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~- 303 (424) .......+ =..|.+..|..|.+++.+..+-...+|.+..+.....||+.+|||.+.|. 
T Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~ 386 (553) T protein:vir:63 307 GSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTR 386 (553) T ss_pred ccccccccccccccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhh Confidence 00011100 12456888899999999887755678899999999999999999998774 Q ss_pred CCCCCCccchhHH-----------HHHHHHHHHHHHHHHHHH-HHHHHhhccC-cccc-cc-----------ceeeecch Q lcl|NC_019719. 304 DVEKSTSWGSGIE-----------QQNLGFLQYTLQPYISRW-ENSIQRWLIP-AKDV-GR-----------IHAEHNLD 358 (424) Q Consensus 304 ~~~~~~~~~~n~e-----------~~~~~~~~~tl~P~~~~i-e~~l~~~l~~-~~~~-~~-----------~~~~fd~~ 358 (424) +..+.|| +++- ..+..|+..-++|+.+.| +.++-...++ +... .. ..+.+-.- T Consensus 387 D~s~~nY--SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p 464 (553) T protein:vir:63 387 DFSKANY--SSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGA 464 (553) T ss_pred hcccccH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecC Confidence 3434444 3432 333345566667766654 3444332222 1111 10 12334444 Q ss_pred hhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC----------CCeeeecccccchhhcc---c------cCCC Q lcl|NC_019719. 359 GLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG----------GDVAMRQSQYVPITDLG---T------NKEP 419 (424) Q Consensus 359 ~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~----------gd~~~~~~n~~~~~~~~---~------~~~~ 419 (424) .....|+...++....++++|+.|.-|+-++.|.+|-+- .+++=++.+..+....+ . .... T Consensus 465 ~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~ 544 (553) T protein:vir:63 465 SQGQIDQLKETQAAVMRIDAGLSTYEREIARLGGDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGRDAATGIAEDPA 544 (553) T ss_pred CccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCcccCCCCCCCCC Confidence 555679999999999999999999998887778776321 11111111111110000 0 0000 Q ss_pred cccCC Q lcl|NC_019719. 
420 RNNGA 424 (424) Q Consensus 420 ~~~ga 424 (424) .++++ T Consensus 545 ~~~~~ 549 (553) T protein:vir:63 545 AAQTS 549 (553) T ss_pred CCCcc Confidence 11111 No 135 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=99.45 E-value=5.7e-14 Score=93.17 Aligned_cols=386 Identities=13% Similarity=0.108 Sum_probs=182.4 Q ss_pred CCCCcccc---------cC-CCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHH Q lcl|NC_019719. 1 MEEPKYTI---------DL-RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLI 70 (424) Q Consensus 1 ~~~~~~~~---------~~-~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~i 70 (424) |.+- -++ .+ ++|.|+.+-+.+. + +.+. .....+......+...+ ...|..++.+..+|+.+ T Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~rd~l~~~~~gl---g---~~r~-~~~~~~g~~~~~~~~~l-~~~Yr~~~ia~~iVd~~ 71 (449) T protein:vir:10 1 MTDK-LTLAVNHALNDARMARARMGLMVPTMGL---D---NKRH-SAWCEYGFPELVTYENL-YSLYRRGGIAHGAVEKL 71 (449) T ss_pred Cchh-hHHHHhhhcchhHHHHHHHHHHHHHhcC---C---cccc-hhhhhcCCcccCCHHHH-HHHHhcCchhHHHHHhh Confidence 4332 111 00 1223333322111 1 1111 11111111111111111 12345578889999999 Q ss_pred HHhhccCceEEEEecccCccc-cccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEE-eeCCC--------- Q lcl|NC_019719. 71 STLTACLPLDVFETDQNDNRK-KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV-DRNSA--------- 139 (424) Q Consensus 71 a~~ia~~~~~v~~~~~~~~~~-~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~-~r~~~--------- 139 (424) ++.+-.--..+....+....+ +...+..+.+++. ..-+..+.+..-+. .++|-+.+++ +++.. 
T Consensus 72 ~d~~~~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~-----~~~~~~l~ea~~~~-rl~Gga~i~i~v~d~~~l~~Pl~~~ 145 (449) T protein:vir:10 72 VGKCWQTNPEIIEGDDADDSEDETSWEKKSKQVFT-----NRLWRSFAEADRRR-LVGRYAGILLHIRDEKDWNLPATKG 145 (449) T ss_pred hhhhhhcCcccccCccccchhhhHHHHHHHHHHHH-----HHHHHHHHHHHHhh-hccCcEEEEEEecCCCCCCcccccC Confidence 986632211222222111110 0001111111111 00122222333333 3566666655 44432 Q ss_pred CceeeEEeecCceEEEEE---c------CCceEEEEEe-----cCceEEecHhHeeEeccCCCCccccCchHHHHHHHHH Q lcl|NC_019719. 140 GDVISLLPLQSANMDVKL---V------GKKVVYRYQR-----DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAG 205 (424) Q Consensus 140 G~~~~l~~l~~~~v~~~~---~------~~~~~~~~~~-----~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~ 205 (424) +.+..|.|+....+++.. | +.+.+|.+.. ...+..+-++.|+||-..+ ..|.|.++.+.+.+. T Consensus 146 ~~i~~i~v~~~~~i~~~~~~~dp~sp~yg~P~~y~v~~~~~g~~~~~~~iH~SRl~~~~~~~---~~g~~~L~~~yn~l~ 222 (449) T protein:vir:10 146 RGLQKVSVSWAGSLKVAEWDTGINSKTYGQPKLWKYTERLPNGSSRRVDIHPDRVFILGDYS---EDAIGFLEPAYNAFV 222 (449) T ss_pred cceeeEEeeccccCChhhhhcCCCCCCCCCceEEEEeeeccCCCccceeeccceeEeecCCC---CCChhHHHHHHHHhh Confidence 245556666554444321 1 2233444432 1234568888898885433 336777777766543 Q ss_pred HHHHHH-HHHHHHHhccCC-----------CceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccc Q lcl|NC_019719. 206 VAVAME-DQQRDFFANGAK-----------SPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGV 273 (424) Q Consensus 206 ~~~~~~-~~~~~~~~n~~~-----------p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~ 273 (424) ....+. .+...+++|..+ ..++.... +...++..+++.+..+.+..+.+ .+.++.+-+++.++. 
T Consensus 223 ~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~-~~~~e~~~~~~~~~~~~~~~~~~---~~~i~~~~d~~~~~~ 298 (449) T protein:vir:10 223 SLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLY-GVSIDELQDKFNEVAGEINRGND---VLMTTQGATVTPLVT 298 (449) T ss_pred hHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHh-hCCchHHHHHHHHHHHHHhccch---heeecCCcceEEEec Confidence 332222 122222222111 11111111 11223333444444444443322 445667778999888 Q ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHH------HHHHHHHHHHHHHHhhccCccc Q lcl|NC_019719. 274 TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY------TLQPYISRWENSIQRWLIPAKD 347 (424) Q Consensus 274 ~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~------tl~P~~~~ie~~l~~~l~~~~~ 347 (424) ++.+.. +.......+||.+-|||...|-+...+..+. + ++ ...|+.. -|.|.++.+-+.|-+.-+.... T Consensus 299 ~~sgl~--d~l~~~~q~iaaa~~IP~t~L~Gqsp~glns-t-~D-~~nyyd~i~~~Q~~l~p~le~l~~~l~~s~~g~~~ 373 (449) T protein:vir:10 299 SVADPT--ATYNVNLQTAAAGVDIPTRILIGNQQAERSS-T-ED-QKYFNARCQSRRVDLSFEIEDFCDKLIELKIIDAV 373 (449) T ss_pred ccCChh--HHHHHHHHHHHHHhCCCeeeeeccCcccccc-c-hh-HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCC Confidence 877653 5666678889999999999987766554442 2 32 3344332 3567777776666544332221 Q ss_pred cccceeeecchhhhccCHHHHHH-------HHHHHHhCC---CCCHHHHHHHhCCCCCCCCCeeeecccccchhhccccC Q lcl|NC_019719. 348 VGRIHAEHNLDGLLRGDSASRAA-------FMKAMGEAG---LRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNK 417 (424) Q Consensus 348 ~~~~~~~fd~~~l~~~d~~~~~~-------~~~~~~~~g---~~T~NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~~~ 417 (424) . .+.|.++.|...+.+++++ .+++++++| +++++|+|+.+|++|..+ +. .+-+...+.+ T Consensus 374 -~--d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~~~~~EiR~~~~~~~~~~-~~-------~~~e~~de~~ 442 (449) T protein:vir:10 374 -A--KKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPAFSREEIRTAAGYDNDDE-EP-------LGEEDGDEED 442 (449) T ss_pred -C--ceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCCcCHHHHHHHhcccCCCC-CC-------CCCCCCcccc Confidence 1 2456667888888887754 555666666 899999999999998532 21 1222222333 Q ss_pred CCcccCC Q lcl|NC_019719. 
418 EPRNNGA 424 (424) Q Consensus 418 ~~~~~ga 424 (424) +..+.+| T Consensus 443 ~~~d~~a 449 (449) T protein:vir:10 443 KATDSAA 449 (449) T ss_pred ccCCcCC Confidence 4444445 No 136 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=99.43 E-value=2.2e-13 Score=89.97 Aligned_cols=395 Identities=11% Similarity=0.040 Sum_probs=203.7 Q ss_pred CCCCcccccCCCCCchHHHHHhhcc-CcccCcccccccc--------ccccccccc-----CcccccHHHHhhhHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFV-GGRLVTPNQGSQT--------GPVSAHGHL-----GDSSINDERILQISTVWRC 66 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~-~~~~~~~~~~~~~--------~~~~~~~~~-----~~~~~~~~~~~~~~~v~~~ 66 (424) ..||.-...+-+ . |. ......|...... ......++. -|-. .-....+.|.+++| T Consensus 60 ~~~~~~~~~~~~-------~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~-~la~laQ~~eyr~~ 128 (695) T protein:vir:78 60 VAEPSPSLRLAR-------Q---FEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFP-TLVLLAQLPEYRAM 128 (695) T ss_pred ccCCCcccccce-------e---ceeccccCCccccchhhhhhcccccccccchhhhccCcchHH-HHHHHhhccchhhH Confidence 445543222111 0 11 0000111111100 000111111 1111 12345567889999 Q ss_pred HHHHHHhhccCceEEEE-eccc----------CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe Q lcl|NC_019719. 67 VSLISTLTACLPLDVFE-TDQN----------DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD 135 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~-~~~~----------~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~ 135 (424) +..|++.+..- |.-.. ...+ ++... ..+-.-..+|...-..+.-+..|.+.+.+..+ +|-+.+++. 
T Consensus 129 ~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~-~~d~dqi~~L~~e~erL~V~~~l~eaik~aRl-fGGa~~~i~ 205 (695) T protein:vir:78 129 HEVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAAS-TSDGDQLKQINDEIERLRIRDAVRTTVIHDQA-FGRAHPYFK 205 (695) T ss_pred HHHHHHHhhcc-cceeccccchhhhhhcccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-ccceEEEEE Confidence 99999988765 42211 1000 00000 00111223343333333334555555555555 555444443 Q ss_pred eCC-----------------CCceeeEEeecCceEEEEEcC----------CceEEEEEecCceEEecHhHeeEeccCCC Q lcl|NC_019719. 136 RNS-----------------AGDVISLLPLQSANMDVKLVG----------KKVVYRYQRDSEYADFSQKEIFHLKGFGF 188 (424) Q Consensus 136 r~~-----------------~G~~~~l~~l~~~~v~~~~~~----------~~~~~~~~~~~~~~~~~~~evih~r~~~~ 188 (424) -.. .|....|.+++|++|++...+ .+.+|.. .+ ..+-.+.++.+..... T Consensus 206 i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V--~G--~kIH~SRL~~f~g~pl 281 (695) T protein:vir:78 206 IKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWM--IG--TEVHATRLHTIVSRPV 281 (695) T ss_pred eccCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEEE--ec--eEEeeeeEEEecCCCc Confidence 221 244556888999988875421 1112222 22 2355566655543321 Q ss_pred -------CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC--CCCCCHHHHHHH--HHHHHHHhCCcccC Q lcl|NC_019719. 189 -------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG--EKVLTEQQRSQV--EENFKEIAGGPVKK 257 (424) Q Consensus 189 -------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~--~~~~~~~~~~~~--~~~~~~~~~~~~~g 257 (424) ..+.|+|..+.+.+.+.-..........+...- ...++ +++ ..+......+.. -+.++++. +| . 
T Consensus 282 Pd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~-~v~~l-k~dla~~L~~g~~~~l~~R~eli~~~R--sn-~ 356 (695) T protein:vir:78 282 GDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQF-SVSGI-LMDLAQALMPGANVDLSMRAELINRYR--DN-R 356 (695) T ss_pred hhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhh-hhHHH-HHHHHHhhcChhHHHHHHHHHHHHHhc--Cc-c Confidence 234699999999999888877777776665442 22322 111 112222222222 23334443 33 3 Q ss_pred cceecC-CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHH-------HHHHHH Q lcl|NC_019719. 258 RLWILE-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL-------QYTLQP 329 (424) Q Consensus 258 ~~~~l~-~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~-------~~tl~P 329 (424) ++++++ ..-+|.+.+.+...++ +........||.+-+||...|.+......+ ++.|...+.|| +.-|+| T Consensus 357 G~~llDk~~Eefeq~stslSGLd--dVi~qf~q~VAgaa~IPltkLfGqSPkGlN-ATGE~D~rnYYD~I~s~Qe~~L~p 433 (695) T protein:vir:78 357 NILFLDKATEEFFQFNTPLSGLD--ALQAQAQEQMSAVSHIPLIKLLGITPTGLN-ASSEGEIRVWYDYVRAYQRNALQQ 433 (695) T ss_pred ceEEEecCCcceEEEecccCCHH--HHHHHHHHHHHhhhcCchhhhhccCCcccc-ccchhhHHHHHHHHHHHHHHHHHH Confidence 577788 4788999987776654 667777889999999999988765554332 12333334444 457889 Q ss_pred HHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHH-------HHHHHHhCCCCCHHHHHHHhCCCCC------- Q lcl|NC_019719. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAA-------FMKAMGEAGLRTINEMRRTDNLPPL------- 395 (424) Q Consensus 330 ~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~-------~~~~~~~~g~~T~NE~R~~~G~~p~------- 395 (424) .++.+-+.+-+..|...+. .+.|.++.|..++.+++++ ....++..|+++++|+|..+.-+|- T Consensus 434 ~L~rl~~ii~rS~~G~idp---di~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~ 510 (695) T protein:vir:78 434 LMNDVIVMIQLSLFGAVDP---SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKL 510 (695) T ss_pred HHHHHHHHHHHHhcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCccccccc Confidence 9988888887777655432 3556667887777776554 4567888999999999999877652 Q ss_pred CCCCeeeecccc-cchh-----hccc-cCCCcccCC Q lcl|NC_019719. 
396 PGGDVAMRQSQY-VPIT-----DLGT-NKEPRNNGA 424 (424) Q Consensus 396 ~~gd~~~~~~n~-~~~~-----~~~~-~~~~~~~ga 424 (424) +-.|++-.+... ++.. ...+ ++.+..+++ T Consensus 511 D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 546 (695) T protein:vir:78 511 DANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGA 546 (695) T ss_pred ccccCCCcCccchhhhhHhhhcCcccccccCCCCCC Confidence 223444444322 1111 1111 111111111 No 137 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=99.42 E-value=2.7e-13 Score=89.53 Aligned_cols=396 Identities=10% Similarity=0.021 Sum_probs=203.6 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCccccccccc-------------ccccccccCcccccHHHHhhhHHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTG-------------PVSAHGHLGDSSINDERILQISTVWRCV 67 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~v~~~i 67 (424) .-||.-+ +=+.+++..- .....|....... .|.....+-|-. .-....+.|-+++|+ T Consensus 59 ~~~~~~~-------~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~-~la~laQ~~eyr~~~ 128 (694) T protein:vir:10 59 VAEPSPS-------LRLARQFEVD--VSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFP-TLVLLAQLPEYRAMH 128 (694) T ss_pred cCCCCcc-------hhhhhhcccc--ccCCCccccchhhhhhccCcccccchhhhhccCcchHH-HHHHHhhccchhhHH Confidence 3444421 1111121100 0011111111000 000001111111 123455678899999 Q ss_pred HHHHHhhccCceEEEE-eccc----------CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_019719. 68 SLISTLTACLPLDVFE-TDQN----------DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~-~~~~----------~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r 136 (424) ..|++.+..- |.-.. ...+ ++... 
..+-.-..+|...-..+.-+..|.+.+.+..+ +|-+.+++.- T Consensus 129 ~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~-~~d~dqi~~L~~e~erl~V~~~l~eaik~aRl-fGGa~~~i~I 205 (694) T protein:vir:10 129 EVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAAS-TSDGDQLKQINDEIERLRIRDAVRTTVIHDQA-FGRAHPYFKI 205 (694) T ss_pred HHHHHHhhcc-cceeccccchhhhhhcccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-ccceEEEEEe Confidence 9999988765 42211 1000 00000 00111223343333333334555555555555 5554444432 Q ss_pred CC-----------------CCceeeEEeecCceEEEEEcC----------CceEEEEEecCceEEecHhHeeEeccCCC- Q lcl|NC_019719. 137 NS-----------------AGDVISLLPLQSANMDVKLVG----------KKVVYRYQRDSEYADFSQKEIFHLKGFGF- 188 (424) Q Consensus 137 ~~-----------------~G~~~~l~~l~~~~v~~~~~~----------~~~~~~~~~~~~~~~~~~~evih~r~~~~- 188 (424) .. .|....|.+++|++|++...+ .+.+|.. .+ ..+-.+.++.+..... T Consensus 206 ~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V--~G--~~IH~SRL~~f~g~plP 281 (694) T protein:vir:10 206 KGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWM--IG--TEVHATRLHTIVSRPVG 281 (694) T ss_pred ecCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEEE--ec--eEEeeeeEEEecCCCch Confidence 11 244556888999988875421 1112222 22 2355566655543321 Q ss_pred ------CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC--CCCCCHHHHHHH--HHHHHHHhCCcccCc Q lcl|NC_019719. 189 ------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG--EKVLTEQQRSQV--EENFKEIAGGPVKKR 258 (424) Q Consensus 189 ------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~--~~~~~~~~~~~~--~~~~~~~~~~~~~g~ 258 (424) ..+.|+|..+.+...+.-..........+...- ...++ +++ ..+......+.. -+.++++. 
+| .+ T Consensus 282 d~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~-~v~~l-k~dla~~L~~g~~~~l~~R~eli~~~R--sn-~G 356 (694) T protein:vir:10 282 DMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQF-SVSGI-LMDLAQALMPGANVDLSMRAELINRYR--DN-RN 356 (694) T ss_pred hhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhh-hhHHH-HHHHHHhhcChhHHHHHHHHHHHHHhc--Cc-cc Confidence 234699999999999888877777776665442 22322 111 112222222222 23334443 33 35 Q ss_pred ceecC-CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHH-------HHHHHHH Q lcl|NC_019719. 259 LWILE-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL-------QYTLQPY 330 (424) Q Consensus 259 ~~~l~-~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~-------~~tl~P~ 330 (424) +++++ ..-+|.+.+.+...++ +........||.+-+||...|.+......+ ++.|...+.|| +.-|+|. T Consensus 357 ~~llDk~~Eefeq~stslSGLd--dVi~qf~q~VAgaa~IPltkLfGqSPkGlN-ATGE~D~rnYYD~I~s~Qe~~L~p~ 433 (694) T protein:vir:10 357 ILFLDKATEEFFQFNTPLSGLD--ALQAQAQEQMSAVSHIPLIKLLGITPTGLN-ASSEGEIRVWYDYVRAYQRNALQQL 433 (694) T ss_pred eEEEecCCcceEEEecccCCHH--HHHHHHHHHHHhhhcCchhhhhccCccccc-ccchhhHHHHHHHHHHHHHHHHHHH Confidence 77788 4788999987776654 667777889999999999988765554332 12333334444 4578899 Q ss_pred HHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHH-------HHHHHHhCCCCCHHHHHHHhCCCCC-------C Q lcl|NC_019719. 331 ISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAA-------FMKAMGEAGLRTINEMRRTDNLPPL-------P 396 (424) Q Consensus 331 ~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~-------~~~~~~~~g~~T~NE~R~~~G~~p~-------~ 396 (424) ++.+-+.+-+..|...+. 
.+.|.++.|..++.+++++ ....++..|+++++|+|..+.-+|- + T Consensus 434 L~rl~~ii~rS~~G~idp---~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D 510 (694) T protein:vir:10 434 MNDVIVMIQLSLFGAVDP---SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLD 510 (694) T ss_pred HHHHHHHHHHHhcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccc Confidence 988888887777655432 3556667787777766554 4567888999999999999877652 2 Q ss_pred CCCeeeecccc-cchh-----hccc-cCCCcccCC Q lcl|NC_019719. 397 GGDVAMRQSQY-VPIT-----DLGT-NKEPRNNGA 424 (424) Q Consensus 397 ~gd~~~~~~n~-~~~~-----~~~~-~~~~~~~ga 424 (424) -.|++-.+... ++.. ...+ ++.+..+|| T Consensus 511 ~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (694) T protein:vir:10 511 ANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGA 545 (694) T ss_pred cccCCCcCccchhhhhHhhhcCcccccccCCCCcc Confidence 23444444322 1111 1111 111112222 No 138 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=99.42 E-value=2.4e-13 Score=89.78 Aligned_cols=395 Identities=11% Similarity=0.039 Sum_probs=201.5 Q ss_pred CCCCcccccCCCCCchHHHHHhhcc-CcccCcccccccc--------ccccccccc-----CcccccHHHHhhhHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFV-GGRLVTPNQGSQT--------GPVSAHGHL-----GDSSINDERILQISTVWRC 66 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~-~~~~~~~~~~~~~--------~~~~~~~~~-----~~~~~~~~~~~~~~~v~~~ 66 (424) .-||.-...+-+ . |. ......|...... ......++. -|-. 
.-....+.|.+++| T Consensus 60 ~~~~~~~~~~~~-------~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~-~la~laQ~~eyr~~ 128 (695) T protein:vir:36 60 VVEPSPSLRLAR-------Q---FEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFP-TLVLLAQLPEYRAM 128 (695) T ss_pred ccCCCcccccce-------e---ceecccccCccccchhhhhhcccccccccchhhhccCcchHH-HHHHHhhccchhhH Confidence 334433221111 0 10 0000111111000 000111111 1111 12345567889999 Q ss_pred HHHHHHhhccCceEEEE-eccc----------CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe Q lcl|NC_019719. 67 VSLISTLTACLPLDVFE-TDQN----------DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD 135 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~-~~~~----------~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~ 135 (424) +..|++.+..- |.-.. ...+ ++.... .+-.-...|...-..+.-+..|.+.+.+..+ +|-+.+++. T Consensus 129 ~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~-~d~dqik~L~~e~erL~V~~~l~eaik~aRl-fGGa~~~i~ 205 (695) T protein:vir:36 129 HEVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAAST-SDGDQLKQINDEIERLRIRDAVRTTVIHDQA-FGRAHPYFK 205 (695) T ss_pred HHHHHHHhhcc-cceecccchhhhhhcccccccccccc-CchHHHHHHHHHHHHHHHHHHHHHHHHhhcc-ccceEEEEE Confidence 99999988765 42211 1000 000000 0001222333222223334445555555555 555544443 Q ss_pred eCC-----------------CCceeeEEeecCceEEEEEcC----------CceEEEEEecCceEEecHhHeeEeccCCC Q lcl|NC_019719. 136 RNS-----------------AGDVISLLPLQSANMDVKLVG----------KKVVYRYQRDSEYADFSQKEIFHLKGFGF 188 (424) Q Consensus 136 r~~-----------------~G~~~~l~~l~~~~v~~~~~~----------~~~~~~~~~~~~~~~~~~~evih~r~~~~ 188 (424) -.. .|....|.+++|++|++...+ .+.+|.. .+ ..+-.+.++.+..... 
T Consensus 206 i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V--~G--~kIH~SRL~~f~g~pl 281 (695) T protein:vir:36 206 IKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWM--IG--TEVHATRLHTIVSRPV 281 (695) T ss_pred eccCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEEE--ec--eEEeeeeEEEecCCCc Confidence 222 244556888999988875421 1112222 22 2355566655543321 Q ss_pred -------CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC--CCCCCHHHHH--HHHHHHHHHhCCcccC Q lcl|NC_019719. 189 -------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG--EKVLTEQQRS--QVEENFKEIAGGPVKK 257 (424) Q Consensus 189 -------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~--~~~~~~~~~~--~~~~~~~~~~~~~~~g 257 (424) ..+.|+|..+.+.+.+.-..........+...- ...++ +++ ..+......+ ..-+.++++. +| . T Consensus 282 Pd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~-~v~~l-k~dla~aL~~g~~~~l~~R~eli~~~R--sn-~ 356 (695) T protein:vir:36 282 GDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQF-SVSGI-LMDLAQALMPGANVDLSMRAELINRYR--DN-R 356 (695) T ss_pred hhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhh-hHHHH-HHHHHHhhcChhHHHHHHHHHHHHHhc--Cc-c Confidence 234699999999998888877777666665432 22222 111 1111222222 1223334443 33 3 Q ss_pred cceecC-CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHH-------HHHHHH Q lcl|NC_019719. 
258 RLWILE-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL-------QYTLQP 329 (424) Q Consensus 258 ~~~~l~-~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~-------~~tl~P 329 (424) ++++++ ..-+|.+.+.+...++ +........||.+-+||...|.+......+ ++.|...+.|| +.-|+| T Consensus 357 G~~llDk~~Eefeq~stslSGLd--dVi~qf~q~VAgaa~IPltkLfGqSPkGlN-ATGE~D~rnYYD~I~s~Qe~~L~p 433 (695) T protein:vir:36 357 NILFLDKATEEFFQFNTPLSGLD--ALQAQAQEQMSAVSHIPLIKLLGITPTGLN-ASSEGEIRVWYDYVRAYQRNALQQ 433 (695) T ss_pred ceEEEecCCcceEEEecccCCHH--HHHHHHHHHHHhhhcCchhhhhccCccccc-ccchhhHHHHHHHHHHHHHHHHHH Confidence 577788 4788999987776654 667777889999999999988765554332 12333334444 457889 Q ss_pred HHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHH-------HHHHHHhCCCCCHHHHHHHhCCCCC------- Q lcl|NC_019719. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAA-------FMKAMGEAGLRTINEMRRTDNLPPL------- 395 (424) Q Consensus 330 ~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~-------~~~~~~~~g~~T~NE~R~~~G~~p~------- 395 (424) .++.+-+.+-+..|...+. .+.|.++.|..++.+++++ ....++..|+++++|+|..+.-+|- T Consensus 434 ~L~rl~~ii~rS~~G~idp---di~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~ 510 (695) T protein:vir:36 434 LMNDVIVMIQLSLFGAVDP---SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKL 510 (695) T ss_pred HHHHHHHHHHHHhcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCccccccc Confidence 9988888887777655432 3556667887777776554 4567888999999999999877652 Q ss_pred CCCCeeeecccc-cchh-----hccc-cCCCcccCC Q lcl|NC_019719. 396 PGGDVAMRQSQY-VPIT-----DLGT-NKEPRNNGA 424 (424) Q Consensus 396 ~~gd~~~~~~n~-~~~~-----~~~~-~~~~~~~ga 424 (424) +-.|++-.+... ++.. 
...+ ++.+..+|| T Consensus 511 D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 546 (695) T protein:vir:36 511 DANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGA 546 (695) T ss_pred ccccCCCcCccchhhhhHhhhcCcccccccCCCCcc Confidence 223444444322 1111 1111 111222222 No 139 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=99.42 E-value=1.8e-13 Score=90.50 Aligned_cols=396 Identities=11% Similarity=0.035 Sum_probs=202.1 Q ss_pred CCCCcccccCCCCCchHHHHHhhcc-CcccCccccccccc--------cccccccc-----CcccccHHHHhhhHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFV-GGRLVTPNQGSQTG--------PVSAHGHL-----GDSSINDERILQISTVWRC 66 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~-~~~~~~~~~~~~~~--------~~~~~~~~-----~~~~~~~~~~~~~~~v~~~ 66 (424) ..||.-...+-+ . |. ......|....... .....++. -|-. .-....+.|.+++| T Consensus 60 ~~~~~~~~~~~~-------~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~-~la~laQ~~eyr~~ 128 (698) T protein:vir:10 60 VAEPSPSLRLAR-------Q---FEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFP-TLVLLAQLPEYRAM 128 (698) T ss_pred ccCCCccccccc-------c---ceeccccCCccccchhhhhhcccccccccchhhhccCcchHH-HHHHHhhccchhhH Confidence 445543222111 1 11 00011111111000 00111111 1111 12345567889999 Q ss_pred HHHHHHhhccCceEEEE-eccc----------CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe Q lcl|NC_019719. 67 VSLISTLTACLPLDVFE-TDQN----------DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD 135 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~-~~~~----------~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~ 135 (424) +.+|++.+..- |.-.. ...+ ++... ..+-.-..+|...-..+.-+..+.+.+.+..++-|. .+++. 
T Consensus 129 ~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~-~~d~dqi~~L~~e~erl~V~~~l~eai~~aRlfGGa-~~~i~ 205 (698) T protein:vir:10 129 HEVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAAS-TSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRA-HPYFK 205 (698) T ss_pred HHHHHHHhhcc-cceeccccchhhhhhcccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccce-EEEEE Confidence 99999988765 42211 1000 00000 001112223433333333355555566666654444 44432 Q ss_pred e-C----------------CCCceeeEEeecCceEEEEEcC----------CceEEEEEecCceEEecHhHeeEeccCCC Q lcl|NC_019719. 136 R-N----------------SAGDVISLLPLQSANMDVKLVG----------KKVVYRYQRDSEYADFSQKEIFHLKGFGF 188 (424) Q Consensus 136 r-~----------------~~G~~~~l~~l~~~~v~~~~~~----------~~~~~~~~~~~~~~~~~~~evih~r~~~~ 188 (424) - . ..|....|.+++|++|++...+ .+.+|.+ .+. .+-++.++.+..... T Consensus 206 I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V--~G~--~IH~SRL~~~vg~pv 281 (698) T protein:vir:10 206 IKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWM--IGS--EVHATRLHTIVSRPV 281 (698) T ss_pred eecCccccccccccccccccCccceeeeeecccccccchhhhccchhhccCCCceEEE--ecc--eecceeEEEecCCCc Confidence 2 2 1244556888999988875421 1112222 222 355666655543321 Q ss_pred -------CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEc-CCCCCCHHHHHHH--HHHHHHHhCCcccCc Q lcl|NC_019719. 189 -------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILST-GEKVLTEQQRSQV--EENFKEIAGGPVKKR 258 (424) Q Consensus 189 -------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~-~~~~~~~~~~~~~--~~~~~~~~~~~~~g~ 258 (424) ..+.|+|..+.+.+.+.-..........+...-. ..++... -..+......+.. -+.++.+. 
+| .+ T Consensus 282 pd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~-~~~l~~dla~aL~~g~~~~l~~R~eli~~~R--sn-~G 357 (698) T protein:vir:10 282 GDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFS-VSGILMDLAQALTPGANVDLSMRAELINRYR--DN-RN 357 (698) T ss_pred hhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHHhh-HHHHHHHHHHhcCChhhHHHHHHHHHHHHhc--Cc-cc Confidence 2346999999999998888777766666654322 2222110 0111122221222 23333343 33 35 Q ss_pred ceecC-CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHH-------HHHHHHH Q lcl|NC_019719. 259 LWILE-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL-------QYTLQPY 330 (424) Q Consensus 259 ~~~l~-~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~-------~~tl~P~ 330 (424) +++++ .+-+|++.+.+...++ +........||.+-+||...|.+......+ ++.|...+.|| +.-|+|. T Consensus 358 ~~llDk~~Eefeq~st~lSGLd--dVi~qf~q~VAgaa~IPltkLfGqSPkGlN-ATGE~D~rnYYD~I~s~Qe~~L~p~ 434 (698) T protein:vir:10 358 ILFLDKATEEFFQFNTPLSGLD--ALQAQAQEQMSAVSHIPLIKLLGITPTGLN-ASSEGEIRVWYDYVRAYQRNALQQL 434 (698) T ss_pred eEEEecCCcceEEEecCcCCHH--HHHHHHHHHHHhhhcCchhhhhccCCcccC-ccchhhHHHHHHHHHHHHHHHHHHH Confidence 77788 5788999987776654 667777889999999999988776554332 12333344444 4578899 Q ss_pred HHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHH-------HHHHHHhCCCCCHHHHHHHhCCCCC-------C Q lcl|NC_019719. 331 ISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAA-------FMKAMGEAGLRTINEMRRTDNLPPL-------P 396 (424) Q Consensus 331 ~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~-------~~~~~~~~g~~T~NE~R~~~G~~p~-------~ 396 (424) ++.+-+.+-+..|...+. 
.+.|.++.|..++.+++++ ....++..|+++++|+|+++.-+|- + T Consensus 435 L~rl~~ii~rS~~G~idp---~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d 511 (698) T protein:vir:10 435 MNDVIVMIQLSLFGAVDP---SIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLD 511 (698) T ss_pred HHHHHHHHHHHhcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccccC Confidence 999888887777655432 3556667888888776554 4567788999999999999876642 1 Q ss_pred CCCeeeecc-cccchhh--------ccccCCC-cccCC Q lcl|NC_019719. 397 GGDVAMRQS-QYVPITD--------LGTNKEP-RNNGA 424 (424) Q Consensus 397 ~gd~~~~~~-n~~~~~~--------~~~~~~~-~~~ga 424 (424) --|++..|. |.+.... .++..++ ..+|| T Consensus 512 ~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 549 (698) T protein:vir:10 512 ANDDPGAPADDDIDGVLTYVQRMAEGGDTGAPTAPGGA 549 (698) T ss_pred CcccCCCCCCCcchHHHhhhcCCcCCCCcccccccccc Confidence 123333332 2222111 1111111 12222 No 140 >protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741 Probab=99.19 E-value=5.8e-12 Score=82.19 Aligned_cols=405 Identities=12% Similarity=0.087 Sum_probs=219.5 Q ss_pred ccCCCCCchHHHHHhhccCcccCcccccc--ccccc-------ccccccCccc--cc--HHHHhhhHHHHHHHHHHHHhh Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGS--QTGPV-------SAHGHLGDSS--IN--DERILQISTVWRCVSLISTLT 74 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~--~~~~~-------~~~~~~~~~~--~~--~~~~~~~~~v~~~i~~ia~~i 74 (424) |---+..-+. .+.+|+.++.. ... .+.++ ...++.+... 
-+ -+-+-..+.++..+..|++++ T Consensus 1 ~~a~~~lr~~----rrpkg~~~a~~-r~L~aAs~~~~dpg~~~~~~~g~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~ 75 (631) T protein:vir:10 1 MAATQSLRLV----RRPKGGRPAPS-RALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSC 75 (631) T ss_pred CCcccceeee----ecCCCCCccch-hhhhhhhccccchhhhhhhhcCCcccchhhHHHHHHHHhhhhHHHHhhhhhhhh Confidence 2221211111 11222221111 000 00111 0111111110 00 122233577888999999999 Q ss_pred ccCceEEEEecccC-----cccccccc-chhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCCCCc------ Q lcl|NC_019719. 75 ACLPLDVFETDQND-----NRKKVDLS-NPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNSAGD------ 141 (424) Q Consensus 75 a~~~~~v~~~~~~~-----~~~~~~~~-~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~-r~~~G~------ 141 (424) +++.+..-+-+.+. ..++.... ....++...-+..-+...++++.+..++-+-|++|+.+. +..+|. T Consensus 76 sr~rL~as~idpDtg~ptg~iee~~~~~~~v~~~~~~i~gG~lgQ~~llkrl~~~ltV~GE~wiv~l~~p~~~~~~~pd~ 155 (631) T protein:vir:10 76 SRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDG 155 (631) T ss_pred ceeeeEeeeeccCCCCCccccccCCchhHHHHHHHHhcCCCcchHHHHHHHHHhheecccceEEEEEeccCcCCCCCccc Confidence 99999888777662 22221111 223334444566678899999999999999999999874 333221 Q ss_pred ----eeeEEeecCceEEEEEcCCceEEEEEecCceEEecHhHeeEec-c-CC-CCccccCchHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 142 ----VISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLK-G-FG-FTGLVGLSPIAFACKSAGVAVAMEDQQ 214 (424) Q Consensus 142 ----~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r-~-~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~ 214 (424) ..+++++....|.....+.+.-+....+..-.....-+++ || + ++ .....--||+.++...+.-.....+.. 
T Consensus 156 ~~r~~~~W~~vt~~ei~~~~~g~g~~v~lp~g~~h~~~~~~D~l-~RiW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i 234 (631) T protein:vir:10 156 SVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTDII-FRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTI 234 (631) T ss_pred ccccccceeeccHHHHhcccCcccceeecCCCCccceecCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHH Confidence 3356667666666555555444444433322222333433 33 1 22 234567799999888888777777666 Q ss_pred HHHHhccCCCceeEEcCCCCCC--------------------HHHHHHHHH-HHHHHhC----CcccCc--ceec--C-- Q lcl|NC_019719. 215 RDFFANGAKSPQILSTGEKVLT--------------------EQQRSQVEE-NFKEIAG----GPVKKR--LWIL--E-- 263 (424) Q Consensus 215 ~~~~~n~~~p~~vl~~~~~~~~--------------------~~~~~~~~~-~~~~~~~----~~~~g~--~~~l--~-- 263 (424) .+..+.-..-.||+-++..++= +-...++.+ .++.... .+.+.. ++++ + T Consensus 235 ~aaakSRl~gnGvlflP~els~P~~~~~~~~~~g~~v~~~~g~pa~~~l~~~l~q~a~tai~De~S~aA~vPii~~~p~E 314 (631) T protein:vir:10 235 ANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGE 314 (631) T ss_pred HHHHHHHHhhCceeEeccccccCCCCCCCCCcCCccCCccccchhHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechH Confidence 6666555556666655544321 112233322 2222221 111211 2222 2 Q ss_pred --CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC-CCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019719. 264 --AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE-KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR 340 (424) Q Consensus 264 --~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~-~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~ 340 (424) ++++.-.+... 
-+.--+++++..+..+|....|||+.|-+.+ ++|-+ +.-+....-++--|.|.+..|+++|++ T Consensus 315 ~i~~i~hlkf~~e-i~e~aiktR~daI~RlA~glDi~pE~LLGlGsd~NHW--sAWqI~dedVrlHI~P~l~lic~AlT~ 391 (631) T protein:vir:10 315 QIKDVKHIRFDNE-ITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHW--SAWQISDEDVQLHIAPVMEIFCQALTD 391 (631) T ss_pred HhcCeeEEeecCc-hhHHHHhhHHHHHHHHHhccCCchhhheeccCCccce--EEEEecccceeeecchHHHHHHHHHHh Confidence 22333333333 3344578999999999999999999887764 66554 223444456677799999999999999 Q ss_pred hccCcc------ccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC--------------- Q lcl|NC_019719. 341 WLIPAK------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD--------------- 399 (424) Q Consensus 341 ~l~~~~------~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd--------------- 399 (424) .+|.+- +...|-+.||.+.|.. | -++.+.+..+.+.|.+|-...|+.+|+.-..+-| T Consensus 392 q~Lrp~Le~eGvDp~kYvvW~DaS~Lt~-d-Pdr~deA~qa~drGAIt~eAlrk~lGf~eDd~yd~~t~e~~~~~a~~av 469 (631) T protein:vir:10 392 QILRVTLAREGIDPSKYVVWYDPSQLTI-D-PDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAV 469 (631) T ss_pred hHHHHHHHHhCCCHHHhEeeecCccccc-C-CCCcHHHHHHHHcCCcCHHHHHHHhcCchhcccCcCchHHHHHHHHHHh Confidence 887532 2235888999988743 2 3456667778899999999999999997533312 Q ss_pred ---eeeecccccch----------------hhccccC----CCcccCC Q lcl|NC_019719. 400 ---VAMRQSQYVPI----------------TDLGTNK----EPRNNGA 424 (424) Q Consensus 400 ---~~~~~~n~~~~----------------~~~~~~~----~~~~~ga 424 (424) --+. .++.|+ ...++.. ++.++|. 
T Consensus 470 ~~dpaLi-p~lApl~~~~~~~v~~P~~~a~~~~g~ed~~~~~~~~~g~ 516 (631) T protein:vir:10 470 SKDPTLI-PMLAPLIAGVLKQIEFPQQQAIDSGGNEDTSDADDLDDGE 516 (631) T ss_pred hcccCcc-hhhHHHHHHHhhhccCCCCCCCCCCCCCccccccccccCC Confidence 1111 111111 0000000 0111111 No 141 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=99.11 E-value=9.7e-11 Score=75.47 Aligned_cols=396 Identities=11% Similarity=0.031 Sum_probs=219.2 Q ss_pred HhhccCc-ccCccc-------cccc--cccccccc------ccCccc--cc--HHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_019719. 21 QSWFVGG-RLVTPN-------QGSQ--TGPVSAHG------HLGDSS--IN--DERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 21 ~~~~~~~-~~~~~~-------~~~~--~~~~~~~~------~~~~~~--~~--~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) +..++.. .+..|. .... +....... ..++.. -+ -+-+-..|.++..+..|++++|++.+. T Consensus 1 ~~~~rPk~~p~~p~~~~~arrr~LtaAsa~l~~~~~~~~kt~~~~~~~WQ~eAW~~~d~vpELry~vgW~~~a~SR~rL~ 80 (646) T protein:vir:10 1 MALLKPKSAPPEPFGAEVARRIALAGATAQVDLGASSSWKTWKFGNKDWQTEGWRLYDIIPEHHFLAGRIGDSVAQARLY 80 (646) T ss_pred CcccCCCCCCCCcccccccchhhhhhccccccCCCcceeecCCCcchhhhHHHHHHHhhhhhHhhHhhhhhhhhceeeee Confidence 2222211 111110 0000 00110000 011100 00 122223477888899999999999999 Q ss_pred EEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEE---eeCCCCceeeEEeecCceEEEEE Q lcl|NC_019719. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV---DRNSAGDVISLLPLQSANMDVKL 157 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~---~r~~~G~~~~l~~l~~~~v~~~~ 157 (424) .-+-++.|...-...++.+..+-...--.-.-..++++.+..++-+-|++|++. .....+--..++++..+.|.. 
T Consensus 81 aseiddtG~~tg~v~~~~v~~iv~~~~Gg~~gQ~qlLkr~~~~ltV~GE~wiv~~~~~~~~~~~~~~W~vvt~~Ev~~-- 158 (646) T protein:vir:10 81 VTEVDDTGEETGEVQDERIKRLAAVPLGTGSQRDDNLRLAGLDLAVGGECWIVGEGAATSPEAAEGSWFVVTGSAISR-- 158 (646) T ss_pred eeeecCCCCCcCccchHHHHHHhhhhccchhhHHHHHHHHHhheecccceEEeeccccCCCCCCccceeeecHHHhcc-- Confidence 888876666555555566665555444444456889999999999999999974 111122222455555555522 Q ss_pred cCCceEEEEEe---cCceEEecHhHeeEecc--CC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC Q lcl|NC_019719. 158 VGKKVVYRYQR---DSEYADFSQKEIFHLKG--FG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG 231 (424) Q Consensus 158 ~~~~~~~~~~~---~~~~~~~~~~evih~r~--~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~ 231 (424) .+++....-.. +..-+.+++.+++. |- ++ .....--||+.++...+.-.....+...+..+.-..-.||+-++ T Consensus 159 tg~~~~i~~p~~~~g~~~v~~~~~d~lv-RiW~P~Prr~~epDSpvra~l~~l~Ei~~lt~~I~aaakSRL~GnGvLfvP 237 (646) T protein:vir:10 159 TGDEIAVRRPQQRGGSKLVLVDGQDILI-RCWRPHPNDTDQADSFTRSAIVPLREIELLTKREFAELDSRLTGAGIMFLP 237 (646) T ss_pred CCCeeeeecCccCCCCCcceecCCceEE-EEecCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHHHHhcCceeeec Confidence 23332222111 22334455666643 42 22 23456789999998888877777777766666666666777665 Q ss_pred CCCC------CHHHHHHHHHHHH----HHhCC-cccCc--ceecCC-Cc------eeeecccC-hhHHHHHHHHHHHHHH Q lcl|NC_019719. 232 EKVL------TEQQRSQVEENFK----EIAGG-PVKKR--LWILEA-GF------STSAIGVT-PQDAEMMASRKFQVSE 290 (424) Q Consensus 232 ~~~~------~~~~~~~~~~~~~----~~~~~-~~~g~--~~~l~~-g~------~~~~l~~~-~~d~~~~e~~~~~~~~ 290 (424) ...+ ++.....+...+- ..+.. +.+.. ++++.. |- +++.+... .-+.--+++++..+.. 
T Consensus 238 ~e~s~p~~~~~~a~~~~l~~~l~qaa~tAi~De~S~aA~vPiia~~P~E~i~~~~~ik~l~f~~eite~aiktR~daI~R 317 (646) T protein:vir:10 238 EGVDFPRGEEDPAGLAGFMAYLQRAAAASMADQSRASAMVPIMATIPNEMMEHLDKIKPLTFWSELSAEITPMKDKAIAR 317 (646) T ss_pred cccccCCCCCCCcchhHHHHHHHHHHHhhhcCCCCccceeeeEEeeChHHHhhhhcceeeccCchhhHHHhhhHHHHHHH Confidence 5432 2222223333222 22221 11221 333221 11 33433332 2234457899999999 Q ss_pred HHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcc-------ccccceeeecchhhhcc Q lcl|NC_019719. 291 LARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-------DVGRIHAEHNLDGLLRG 363 (424) Q Consensus 291 Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~-------~~~~~~~~fd~~~l~~~ 363 (424) +|....|||+.|-+.+++|-++ .-+....-++ -|.|.+..|++.|++.+|.+- +...|-+.||.+.|.. T Consensus 318 lA~glDIppE~LLGlgd~NHWt--AWqI~de~vr-HI~P~l~~ic~AlT~~~Lrp~Le~eGi~dp~kyvvW~DaS~Lt~- 393 (646) T protein:vir:10 318 LASSAEIPGEVLTGIGDANHWT--AWLISDEGIR-WIRGYLGLIADALTRGFLRRALESMGVTNPERYAFAFDTSTLAS- 393 (646) T ss_pred HHhccCCchhheeeccccceee--eeeeccccch-hhhhHHHHHHHHHHhhHHHHHHHHcCCCChhHeEEeecCccccc- Confidence 9999999999998877776653 2344445555 699999999999999877532 2235788999988843 Q ss_pred CHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCe--e----------------eecc-------------cccc--h Q lcl|NC_019719. 364 DSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDV--A----------------MRQS-------------QYVP--I 410 (424) Q Consensus 364 d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~--~----------------~~~~-------------n~~~--~ 410 (424) | -++.+.+.++.+.|.+|-...|+.+|+.-.++=+. . +.|. .+.| + T Consensus 394 ~-pd~~deA~qa~drGAIt~eAlrk~~Gf~~dd~pt~~E~~~~~~~~~v~~~P~Lil~P~~qa~~~~P~~~~~~lpp~~~ 472 (646) T protein:vir:10 394 K-PNRLDEAIQLHERNLIKDEEVVKAGAFSVDQMPTVQERAVQILLGLVKTQPDLILDPAIQAALGLPAVQSVGLPPTAA 472 (646) T ss_pred C-CCCcHHHHHHHHcCCccHHHHHHHhcccccccCChHHHHHHHHHHHhcCCccccccchhhccccCCCcCccccCCccc Confidence 2 34566677788999999999999999863221111 0 0010 1111 1 Q ss_pred hhc-cccCCCcccCC Q lcl|NC_019719. 
411 TDL-GTNKEPRNNGA 424 (424) Q Consensus 411 ~~~-~~~~~~~~~ga 424 (424) +.. ++.++++++|+ T Consensus 473 ~~~dg~~~~~e~~g~ 487 (646) T protein:vir:10 473 QRTDGDLDDDESEGA 487 (646) T ss_pred ccccCCCCChhhcCC Confidence 111 12234445555 No 142 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=99.11 E-value=5.6e-11 Score=76.77 Aligned_cols=413 Identities=11% Similarity=0.058 Sum_probs=210.6 Q ss_pred CCCCcccccCCCCC-chHHHHHhhccCcccC-cccccccccccccccccCccccc-HHHHhhhHHHHHHHHHHHHhhccC Q lcl|NC_019719. 1 MEEPKYTIDLRTNN-GWWARLQSWFVGGRLV-TPNQGSQTGPVSAHGHLGDSSIN-DERILQISTVWRCVSLISTLTACL 77 (424) Q Consensus 1 ~~~~~~~~~~~~~~-G~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~v~~~i~~ia~~ia~~ 77 (424) |---...+--+-+. ---.|=.+.-....+. .|.+...-. ..++..+.-... -+-+-..+.++..+..|+++++++ T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks--~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~ 78 (629) T protein:vir:86 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKA--MGSSTRTDWQEDAWKAYDAVGELRYYVGWRSSSASRV 78 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhccccccchhhhh--cCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhcee Confidence 22111111111111 0001101111111110 111000000 000000000000 122223677888999999999999 Q ss_pred ceEEEEecccCccccccccch------hhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCc------e-ee Q lcl|NC_019719. 78 PLDVFETDQNDNRKKVDLSNP------LARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGD------V-IS 144 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~~~~~------l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~------~-~~ 144 (424) .+..-+-+.++.......+++ +.++...--..-+-..++++.+..++-+-|++|+.+.....|. 
+ .+ T Consensus 79 rL~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~~e 158 (629) T protein:vir:86 79 RLIASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPVPE 158 (629) T ss_pred eeEeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcchhh Confidence 999888876665444333332 2222332333345678999999999999999999987433332 2 23 Q ss_pred EEeecCceEEEEEcCCceEEEEEecCceEEecHhHeeEecc--CC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019719. 145 LLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKG--FG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANG 221 (424) Q Consensus 145 l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~--~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~ 221 (424) ++.+-++.|.-. .++.......+..-...+..+++ +|- ++ .....--||+.++...+.-.....+...+..+.- T Consensus 159 W~~vt~~ei~~~--~~~~~i~lP~g~~~e~~~~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaakSR 235 (629) T protein:vir:86 159 WLALTPEEVRAS--EKKTIIELPTGDKHEFRDGLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANASKSR 235 (629) T ss_pred heeechHHhhhc--cCceeeEcCCCCcceeeCCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHH Confidence 444544444322 22222333333333333444444 552 22 2345677898888888777766666655554444 Q ss_pred CCCceeEEcCCCCC----------------------CHHHHHHHHHHHH----HHhCC-cccCc--ceec--C----CCc Q lcl|NC_019719. 222 AKSPQILSTGEKVL----------------------TEQQRSQVEENFK----EIAGG-PVKKR--LWIL--E----AGF 266 (424) Q Consensus 222 ~~p~~vl~~~~~~~----------------------~~~~~~~~~~~~~----~~~~~-~~~g~--~~~l--~----~g~ 266 (424) ..-.||+-++...+ .+ ..+.+.+.+- ..+.. +.+.. 
++++ + +++ T Consensus 236 L~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~p-a~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~i~~i 314 (629) T protein:vir:86 236 LIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTP-AVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGELIKNV 314 (629) T ss_pred HhhCceeeeccCcccCccCCCCCCCCCCcccccccccc-hHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCe Confidence 44455543332211 11 2222333322 22221 11111 2222 2 223 Q ss_pred eeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC-CCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCc Q lcl|NC_019719. 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE-KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPA 345 (424) Q Consensus 267 ~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~-~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~ 345 (424) +.-.+... -+.--+++++..+..+|....|||+.|-+.+ ++|-+ ..-+....-++--|.|.+..|++.|++.+|.+ T Consensus 315 ~hlkf~~e-i~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHW--sAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp 391 (629) T protein:vir:86 315 THLKFDNQ-VTEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHW--SAWQIGDEDVRLHILPPVEMLCEAITNQVLRT 391 (629) T ss_pred eEEeecCc-hhHHHHhhHHHHHHHHHhccCCchhhheeccCCccce--EEEEecccceeeecchHHHHHHHHHHhhHHHH Confidence 33333333 3344578999999999999999999887764 66554 22344445667779999999999999987753 Q ss_pred c------ccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC-------------eeeeccc Q lcl|NC_019719. 346 K------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD-------------VAMRQSQ 406 (424) Q Consensus 346 ~------~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd-------------~~~~~~n 406 (424) - +...|-+.||.+.|.. 
| -++.+.+..+.+.|.+|-...|+.+|+.-..+-| .+....+ T Consensus 392 ~Le~eGiDp~kYvvW~DaS~Lt~-d-Pd~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~ 469 (629) T protein:vir:86 392 VLMREGIDPNAYVVWHDASQLTV-D-PDKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPN 469 (629) T ss_pred HHHHhCCCHHHhEeeecCccccc-C-CCCcHHHHHHHHcCCcCHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcc Confidence 2 2235888999988743 2 3456667778899999999999999996532221 1111111 Q ss_pred cc---------------chhh-----ccccC-CCcccCC Q lcl|NC_019719. 407 YV---------------PITD-----LGTNK-EPRNNGA 424 (424) Q Consensus 407 ~~---------------~~~~-----~~~~~-~~~~~ga 424 (424) ++ |... ..+++ ..+.+|+ T Consensus 470 Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~dE~sga 508 (629) T protein:vir:86 470 LLPTLAVLIPELADVEFPTPTVALPPAEEQDGDEEASGA 508 (629) T ss_pred hhhhhhhhhhhhcccccCccCCCCCccccCCCcccccCC Confidence 10 1000 00000 0111222 No 143 >protein:vir:99088 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655692;genbank:gi:109521770;genbank:GeneID:4157810 Probab=99.10 E-value=6e-11 Score=76.62 Aligned_cols=413 Identities=11% Similarity=0.057 Sum_probs=210.6 Q ss_pred CCCCcccccCCCCC-chHHHHHhhccCcccC-cccccccccccccccccCccccc-HHHHhhhHHHHHHHHHHHHhhccC Q lcl|NC_019719. 1 MEEPKYTIDLRTNN-GWWARLQSWFVGGRLV-TPNQGSQTGPVSAHGHLGDSSIN-DERILQISTVWRCVSLISTLTACL 77 (424) Q Consensus 1 ~~~~~~~~~~~~~~-G~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~v~~~i~~ia~~ia~~ 77 (424) |---...+--+-+. ---.|=.+.-....+. .|.+...-. ..++..+.-... 
-+-+-..+.++..+..|+++++++ T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks--~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~ 78 (629) T protein:vir:99 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKA--MGSSTRTDWQDDAWKAYDAVGELRYYVGWRSSSASRV 78 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhcccccchhhhhh--cCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhcee Confidence 22111111111111 0001101111111110 111000000 000000000000 122223677888999999999999 Q ss_pred ceEEEEecccCccccccccch------hhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCc------e-ee Q lcl|NC_019719. 78 PLDVFETDQNDNRKKVDLSNP------LARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGD------V-IS 144 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~~~~~------l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~------~-~~ 144 (424) .+..-+-+.++.......+++ +.++...--..-+-..++++.+..++-+-|++|+.+.....|. + .+ T Consensus 79 rL~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~~e 158 (629) T protein:vir:99 79 RLIASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPVPE 158 (629) T ss_pred eeEeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcchhh Confidence 999888876665444333332 2222332333345678999999999999999999987433332 2 23 Q ss_pred EEeecCceEEEEEcCCceEEEEEecCceEEecHhHeeEecc--CC-CCccccCchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019719. 145 LLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKG--FG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANG 221 (424) Q Consensus 145 l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~--~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~ 221 (424) ++.+-++.|.-. 
.++.......+..-...+..+++ +|- ++ .....--||+.++...+.-.....+...+..+.- T Consensus 159 W~~vt~~ei~~~--~~~~~i~lP~g~~~e~~~~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaakSR 235 (629) T protein:vir:99 159 WLALTPEEVRAS--EKKTIIELPTGDKHEFRDGLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANASKSR 235 (629) T ss_pred heeechHHhhhc--cCceeEEcCCCCccceeCCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHH Confidence 444544444322 22222333333333333444444 542 22 2345677898888888777766666665555444 Q ss_pred CCCceeEEcCCCCC----------------------CHHHHHHHHHHHH----HHhCC-cccCc--ceec--C----CCc Q lcl|NC_019719. 222 AKSPQILSTGEKVL----------------------TEQQRSQVEENFK----EIAGG-PVKKR--LWIL--E----AGF 266 (424) Q Consensus 222 ~~p~~vl~~~~~~~----------------------~~~~~~~~~~~~~----~~~~~-~~~g~--~~~l--~----~g~ 266 (424) ..-.||+-++...+ .+ ..+.+.+.+- ..+.. +.+.. ++++ + +++ T Consensus 236 L~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~p-a~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~i~~i 314 (629) T protein:vir:99 236 LIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTP-AVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGELIKNV 314 (629) T ss_pred HhhCceeEeccCcccCccCCCCCCCCCCcccccccccc-hHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCe Confidence 44455543332211 11 2222333322 22221 11111 2222 2 223 Q ss_pred eeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC-CCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCc Q lcl|NC_019719. 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE-KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPA 345 (424) Q Consensus 267 ~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~-~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~ 345 (424) +.-.+... 
-+.--+++++..+..+|....|||+.|-+.+ ++|-+ ..-+....-++--|.|.+..|+++|++.+|.+ T Consensus 315 ~hlkf~~e-i~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHW--sAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp 391 (629) T protein:vir:99 315 THLKFDNQ-VTEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHW--SAWQIGDEDVRLHILPPVEMLCEAITNQVLRT 391 (629) T ss_pred eEEeecCc-hhHHHHhhHHHHHHHHHhccCCchhhheeccCCccce--EEEEecccceeeecchhHHHHHHHHHhhHHHH Confidence 33333333 3344578999999999999999999887764 66554 22344445667779999999999999987753 Q ss_pred c------ccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC-------------eeeeccc Q lcl|NC_019719. 346 K------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD-------------VAMRQSQ 406 (424) Q Consensus 346 ~------~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd-------------~~~~~~n 406 (424) - +...|-+.||.+.|.. | -++.+.+..+.+.|.+|-...|+.+|+.-..+-| .+....+ T Consensus 392 ~Le~eGiDp~kYvvW~DaS~Lt~-d-Pd~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~ 469 (629) T protein:vir:99 392 VLMREGIDPNAYVVWHDASQLTV-D-PDKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPN 469 (629) T ss_pred HHHHhCCCHHHhEeeecCccccc-C-CCCcHHHHHHHHcCCccHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcc Confidence 2 2235888999988743 2 3456667778899999999999999996532221 1111111 Q ss_pred cc---------------chhh-----ccccC-CCcccCC Q lcl|NC_019719. 407 YV---------------PITD-----LGTNK-EPRNNGA 424 (424) Q Consensus 407 ~~---------------~~~~-----~~~~~-~~~~~ga 424 (424) ++ |... 
..+++ ..+.+|+ T Consensus 470 Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~dE~sga 508 (629) T protein:vir:99 470 LLPTLAVLIPELADVEFPTPTVALPPAEEQDGDEEASGA 508 (629) T ss_pred hhhhhhhhhhhhcccccCccCCCCCccccCCCcccccCC Confidence 10 1000 00000 0111222 No 144 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=99.00 E-value=1.8e-10 Score=73.98 Aligned_cols=405 Identities=11% Similarity=0.047 Sum_probs=208.5 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCccccccc---cccc-----ccccccCcccc------cHHHHhhhHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQ---TGPV-----SAHGHLGDSSI------NDERILQISTVWRC 66 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~---~~~~-----~~~~~~~~~~~------~~~~~~~~~~v~~~ 66 (424) |---.. |+..+.+++..+. .+... +.+. ....+..+... .-+.+-..+.++-. T Consensus 1 ma~~~l------------r~~rrpk~~p~~~-rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~ 67 (639) T protein:vir:97 1 MAATSL------------RVVRRPKGSAPAA-RRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYY 67 (639) T ss_pred CCccce------------eeeecCCCCCcch-hhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHH Confidence 211100 0111111111100 11000 0011 01111111110 01223335778889 Q ss_pred HHHHHHhhccCceEEEEecccCccc-------cccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCC Q lcl|NC_019719. 67 VSLISTLTACLPLDVFETDQNDNRK-------KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNS 138 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~~~~~~~~~-------~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~-r~~ 138 (424) +..|+++++++.+..-+-+.+.... +....+.+.+..+.--..-+-..++++.+..++-+-|++|+.++ +.. 
T Consensus 68 vgW~~~s~sr~rL~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~ 147 (639) T protein:vir:97 68 VSWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQE 147 (639) T ss_pred hhhhhhhhceeeeEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecC Confidence 9999999999999887777554321 11122333333332333445678899999999999999998754 444 Q ss_pred CCc------eeeEEe-ecCceEEEEEcCCceEEEEEecCceEEecHhHeeEecc--CC-CCccccCchHHHHHHHHHHHH Q lcl|NC_019719. 139 AGD------VISLLP-LQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKG--FG-FTGLVGLSPIAFACKSAGVAV 208 (424) Q Consensus 139 ~G~------~~~l~~-l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~--~~-~~~~~G~s~~~~~~~~i~~~~ 208 (424) ++. +.+-|. +....|. ...+++.-... .++...+|.-+.=+.+|- ++ .....--||+.++...+.-.. T Consensus 148 k~~~~~~~~~~~~W~vvs~~Ei~-~~~~~~~~i~l-PdG~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~ 225 (639) T protein:vir:97 148 KDPVTGLAAPRARWYAVTREEIK-SKAGETAEISL-PDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIE 225 (639) T ss_pred ccccCcccccccceeeeeHHHhc-ccCCCeeEeec-CCCCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHH Confidence 432 233333 3333332 11222221222 133333444333233442 22 233567788888888877776 Q ss_pred HHHHHHHHHHhccCCCceeEEcCCCCCCH------------------------HHHHHHHHHH----HHHhCC-cccCc- Q lcl|NC_019719. 209 AMEDQQRDFFANGAKSPQILSTGEKVLTE------------------------QQRSQVEENF----KEIAGG-PVKKR- 258 (424) Q Consensus 209 ~~~~~~~~~~~n~~~p~~vl~~~~~~~~~------------------------~~~~~~~~~~----~~~~~~-~~~g~- 258 (424) ...+...+..+.-..-.||+-++..++-+ ...+.+...+ ...+.. +.+.. 
T Consensus 226 ~~t~~i~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~ 305 (639) T protein:vir:97 226 RTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAY 305 (639) T ss_pred HhhhHHHHHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccce Confidence 66666655555444455555443322100 1122222222 222221 11111 Q ss_pred -ceecC----CCceeeecccC-hhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 259 -LWILE----AGFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYIS 332 (424) Q Consensus 259 -~~~l~----~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~ 332 (424) ++++. ..-+++.|... .-+.--+++++..+..+|....|||+.|-+..++|-+ ..-+....-++--|.|.+. T Consensus 306 vPiia~~p~E~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHW--sAWqI~dedvrlHI~P~l~ 383 (639) T protein:vir:97 306 IPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHW--SAWAIGDEDVQLHIKPVMD 383 (639) T ss_pred eeeeEeechHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccce--EEEEecccceeeecchhHH Confidence 22322 11244555542 2334457899999999999999999998777776655 2334445566777999999 Q ss_pred HHHHHHHhhccCcc------ccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC------- Q lcl|NC_019719. 333 RWENSIQRWLIPAK------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD------- 399 (424) Q Consensus 333 ~ie~~l~~~l~~~~------~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd------- 399 (424) .|++.|++.+|.+- +...|-+.||.+.|.. 
| -++.+.+.++.+.|.+|-.-.|+.+|+.-.++=| T Consensus 384 ~icdAlT~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~-d-Pd~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~ 461 (639) T protein:vir:97 384 LICQAIYNDILTPLLAREGIDPTKYILWYDASGLTS-D-PDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGC 461 (639) T ss_pred HHHHHHHhhHHHHHHHHhCCCHHHhEeeecCccccc-C-CCCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHH Confidence 99999999887532 2235888999988843 2 3456667778899999999999999986432211 Q ss_pred ------eeeecccc----------------------cchhhccccCCCcccCC Q lcl|NC_019719. 400 ------VAMRQSQY----------------------VPITDLGTNKEPRNNGA 424 (424) Q Consensus 400 ------~~~~~~n~----------------------~~~~~~~~~~~~~~~ga 424 (424) .+-.+..+ +....-.+..+++++|| T Consensus 462 ~~~A~~~V~~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga 514 (639) T protein:vir:97 462 REFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSGA 514 (639) T ss_pred HHHHHHHhcCCcchhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccCC Confidence 00000000 00000111122233333 No 145 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=99.00 E-value=1.8e-10 Score=73.98 Aligned_cols=405 Identities=11% Similarity=0.047 Sum_probs=208.5 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCccccccc---cccc-----ccccccCcccc------cHHHHhhhHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQ---TGPV-----SAHGHLGDSSI------NDERILQISTVWRC 66 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~---~~~~-----~~~~~~~~~~~------~~~~~~~~~~v~~~ 66 (424) |---.. |+..+.+++..+. .+... +.+. ....+..+... .-+.+-..+.++-. 
T Consensus 1 ma~~~l------------r~~rrpk~~p~~~-rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~ 67 (639) T protein:vir:10 1 MAATSL------------RVVRRPKGSAPAA-RRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYY 67 (639) T ss_pred CCccce------------eeeecCCCCCcch-hhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHH Confidence 211100 0111111111100 11000 0011 01111111110 01223335778889 Q ss_pred HHHHHHhhccCceEEEEecccCccc-------cccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCC Q lcl|NC_019719. 67 VSLISTLTACLPLDVFETDQNDNRK-------KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNS 138 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~~~~~~~~~-------~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~-r~~ 138 (424) +..|+++++++.+..-+-+.+.... +....+.+.+..+.--..-+-..++++.+..++-+-|++|+.++ +.. T Consensus 68 vgW~~~s~sr~rL~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~ 147 (639) T protein:vir:10 68 VSWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQE 147 (639) T ss_pred hhhhhhhhceeeeEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecC Confidence 9999999999999887777554321 11122333333332333445678899999999999999998754 444 Q ss_pred CCc------eeeEEe-ecCceEEEEEcCCceEEEEEecCceEEecHhHeeEecc--CC-CCccccCchHHHHHHHHHHHH Q lcl|NC_019719. 139 AGD------VISLLP-LQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKG--FG-FTGLVGLSPIAFACKSAGVAV 208 (424) Q Consensus 139 ~G~------~~~l~~-l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~--~~-~~~~~G~s~~~~~~~~i~~~~ 208 (424) ++. +.+-|. +....|. ...+++.-... .++...+|.-+.=+.+|- ++ .....--||+.++...+.-.. 
T Consensus 148 k~~~~~~~~~~~~W~vvs~~Ei~-~~~~~~~~i~l-PdG~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~ 225 (639) T protein:vir:10 148 KDPVTGLAAPRARWYAVTREEIK-SKAGETAEISL-PDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIE 225 (639) T ss_pred ccccCcccccccceeeeeHHHhc-ccCCCeeEeec-CCCCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHH Confidence 432 233333 3333332 11222221222 133333444333233442 22 233567788888888877776 Q ss_pred HHHHHHHHHHhccCCCceeEEcCCCCCCH------------------------HHHHHHHHHH----HHHhCC-cccCc- Q lcl|NC_019719. 209 AMEDQQRDFFANGAKSPQILSTGEKVLTE------------------------QQRSQVEENF----KEIAGG-PVKKR- 258 (424) Q Consensus 209 ~~~~~~~~~~~n~~~p~~vl~~~~~~~~~------------------------~~~~~~~~~~----~~~~~~-~~~g~- 258 (424) ...+...+..+.-..-.||+-++..++-+ ...+.+...+ ...+.. +.+.. T Consensus 226 ~~t~~i~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~ 305 (639) T protein:vir:10 226 RTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAY 305 (639) T ss_pred HhhhHHHHHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccce Confidence 66666655555444455555443322100 1122222222 222221 11111 Q ss_pred -ceecC----CCceeeecccC-hhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 259 -LWILE----AGFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYIS 332 (424) Q Consensus 259 -~~~l~----~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~ 332 (424) ++++. ..-+++.|... .-+.--+++++..+..+|....|||+.|-+..++|-+ ..-+....-++--|.|.+. 
T Consensus 306 vPiia~~p~E~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHW--sAWqI~dedvrlHI~P~l~ 383 (639) T protein:vir:10 306 IPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHW--SAWAIGDEDVQLHIKPVMD 383 (639) T ss_pred eeeeEeechHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccce--EEEEecccceeeecchhHH Confidence 22322 11244555542 2334457899999999999999999998777776655 2334445566777999999 Q ss_pred HHHHHHHhhccCcc------ccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC------- Q lcl|NC_019719. 333 RWENSIQRWLIPAK------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD------- 399 (424) Q Consensus 333 ~ie~~l~~~l~~~~------~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd------- 399 (424) .|++.|++.+|.+- +...|-+.||.+.|.. | -++.+.+.++.+.|.+|-.-.|+.+|+.-.++=| T Consensus 384 ~icdAlT~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~-d-Pd~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~ 461 (639) T protein:vir:10 384 LICQAIYNDILTPLLAREGIDPTKYILWYDASGLTS-D-PDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGC 461 (639) T ss_pred HHHHHHHhhHHHHHHHHhCCCHHHhEeeecCccccc-C-CCCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHH Confidence 99999999887532 2235888999988843 2 3456667778899999999999999986432211 Q ss_pred ------eeeecccc----------------------cchhhccccCCCcccCC Q lcl|NC_019719. 
400 ------VAMRQSQY----------------------VPITDLGTNKEPRNNGA 424 (424) Q Consensus 400 ------~~~~~~n~----------------------~~~~~~~~~~~~~~~ga 424 (424) .+-.+..+ +....-.+..+++++|| T Consensus 462 ~~~A~~~V~~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga 514 (639) T protein:vir:10 462 REFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSGA 514 (639) T ss_pred HHHHHHHhcCCcchhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccCC Confidence 00000000 00000111122233333 No 146 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.95 E-value=1.6e-09 Score=68.74 Aligned_cols=391 Identities=13% Similarity=0.058 Sum_probs=192.9 Q ss_pred ccCCCCCchHHHHHhhccCcccCcccccccccccccccccC--cccc-----cHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLG--DSSI-----NDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-----~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |.-.|-.-|+++|...+....... ......+.+..... +... ....-..+.+...+|+..+..+-.-|+. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~---~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~ 77 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRV---RLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGIT 77 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHH---HHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCee Confidence 777787788888876654332111 11111121111110 0000 1111123456778888888888888887 Q ss_pred EEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC Q lcl|NC_019719. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +-.. .+.+ ....+.+++. + | ....+...+..+++.+|.||..+-++.+|.+ .+..++|..+.+..|+. 
T Consensus 78 ~~~~-~d~~-----~~~~~~~i~~-~-N---~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~-~i~~~~p~~~~~i~d~~ 145 (456) T protein:vir:10 78 VGGS-ADSD-----LALRARRIWR-D-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPL 145 (456) T ss_pred cCCC-CCcc-----hHHHHHHHHH-h-c---ChhhHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEEccceeEEEEcCC Confidence 5211 1111 1122444443 2 2 2345556788999999999999988888876 46778888888777643 Q ss_pred ce-------EEEEEecCce----------------------------EEecHhH------eeEeccC----CCCccccCc Q lcl|NC_019719. 161 KV-------VYRYQRDSEY----------------------------ADFSQKE------IFHLKGF----GFTGLVGLS 195 (424) Q Consensus 161 ~~-------~~~~~~~~~~----------------------------~~~~~~e------vih~r~~----~~~~~~G~s 195 (424) .. .|....++.. .....+. .-|.-.. ..+...|+| T Consensus 146 ~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~g 225 (456) T protein:vir:10 146 QPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMG 225 (456) T ss_pred CCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCCCCc Confidence 21 0110000000 0000000 0011000 012345778 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC--CCCCHHHHHHH--HHHHHHHhCCcccCcceecCCCceeeec Q lcl|NC_019719. 196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE--KVLTEQQRSQV--EENFKEIAGGPVKKRLWILEAGFSTSAI 271 (424) Q Consensus 196 ~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~--~~~~~~~~~~~--~~~~~~~~~~~~~g~~~~l~~g~~~~~l 271 (424) .++.....++....+..-........+.|..++.-.. ....++.-..+ ...++.. .+.++.++.+.++.++ T Consensus 226 d~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~-----~~~~~~~~~~~~~~q~ 300 (456) T protein:vir:10 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAA-----PGALWELPPGVDIWES 300 (456) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhh-----ccccccCCCCcceEEe Confidence 7777666555554433322222223333433332110 00011111111 1112211 2356778888888877 Q ss_pred ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhc--c--Cccc Q lcl|NC_019719. 
272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWL--I--PAKD 347 (424) Q Consensus 272 ~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l--~--~~~~ 347 (424) .....+ .+.+..+....+|++.=++|++.++.... +.|+...+.....+...+ .=..+.|+..|.+.+ + -... T Consensus 301 ~~~~~~-~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N~Sg~Ai~~~~~~l~~k~-~~~~~~f~~~l~~~~rl~~~~~g~ 377 (456) T protein:vir:10 301 QANDFT-PMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEGAHNIEKGFLFKC-EDRLSIAKIGLEAILVKALQIEGE 377 (456) T ss_pred cccChh-HHHHHHHHHHHHHHhccCCChHHhccccc-ChHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCC Confidence 653322 47888999999999999999999986432 223222222222222211 112222222222211 1 0111 Q ss_pred cccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC----CCCeeeecccccchhhccccCCCcccC Q lcl|NC_019719. 348 VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP----GGDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 348 ~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~----~gd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) .....++..+......+..+.++.+.++++.|+.+..-+++++|+.+.+ ..+...... ......-.+.|.++| T Consensus 378 ~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~---~~~~~~~~~~~~~~~ 454 (456) T protein:vir:10 378 SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQI---TLFAGNPVQRPQEDG 454 (456) T ss_pred CcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHH---HHHhhhhhhcCCCCC Confidence 1112344444555677888999999999999999998888899987531 111110000 000011124456677 Q ss_pred C Q lcl|NC_019719. 
424 A 424 (424) Q Consensus 424 a 424 (424) + T Consensus 455 ~ 455 (456) T protein:vir:10 455 S 455 (456) T ss_pred C Confidence 7 No 147 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.95 E-value=1.6e-09 Score=68.74 Aligned_cols=391 Identities=13% Similarity=0.058 Sum_probs=192.9 Q ss_pred ccCCCCCchHHHHHhhccCcccCcccccccccccccccccC--cccc-----cHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLG--DSSI-----NDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-----~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |.-.|-.-|+++|...+....... ......+.+..... +... ....-..+.+...+|+..+..+-.-|+. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~---~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~ 77 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRV---RLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGIT 77 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHH---HHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCee Confidence 777787788888876654332111 11111121111110 0000 1111123456778888888888888887 Q ss_pred EEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC Q lcl|NC_019719. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +-.. .+.+ ....+.+++. + | ....+...+..+++.+|.||..+-++.+|.+ .+..++|..+.+..|+. 
T Consensus 78 ~~~~-~d~~-----~~~~~~~i~~-~-N---~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~-~i~~~~p~~~~~i~d~~ 145 (456) T protein:vir:10 78 VGGS-ADSD-----LALRARRIWR-D-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPL 145 (456) T ss_pred cCCC-CCcc-----hHHHHHHHHH-h-c---ChhhHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEEccceeEEEEcCC Confidence 5211 1111 1122444443 2 2 2345556788999999999999988888876 46778888888777643 Q ss_pred ce-------EEEEEecCce----------------------------EEecHhH------eeEeccC----CCCccccCc Q lcl|NC_019719. 161 KV-------VYRYQRDSEY----------------------------ADFSQKE------IFHLKGF----GFTGLVGLS 195 (424) Q Consensus 161 ~~-------~~~~~~~~~~----------------------------~~~~~~e------vih~r~~----~~~~~~G~s 195 (424) .. .|....++.. .....+. .-|.-.. ..+...|+| T Consensus 146 ~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~g 225 (456) T protein:vir:10 146 QPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMG 225 (456) T ss_pred CCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCCCCc Confidence 21 0110000000 0000000 0011000 012345778 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC--CCCCHHHHHHH--HHHHHHHhCCcccCcceecCCCceeeec Q lcl|NC_019719. 196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE--KVLTEQQRSQV--EENFKEIAGGPVKKRLWILEAGFSTSAI 271 (424) Q Consensus 196 ~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~--~~~~~~~~~~~--~~~~~~~~~~~~~g~~~~l~~g~~~~~l 271 (424) .++.....++....+..-........+.|..++.-.. ....++.-..+ ...++.. .+.++.++.+.++.++ T Consensus 226 d~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~-----~~~~~~~~~~~~~~q~ 300 (456) T protein:vir:10 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAA-----PGALWELPPGVDIWES 300 (456) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhh-----ccccccCCCCcceEEe Confidence 7777666555554433322222223333433332110 00011111111 1112211 2356778888888877 Q ss_pred ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhc--c--Cccc Q lcl|NC_019719. 
272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWL--I--PAKD 347 (424) Q Consensus 272 ~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l--~--~~~~ 347 (424) .....+ .+.+..+....+|++.=++|++.++.... +.|+...+.....+...+ .=..+.|+..|.+.+ + -... T Consensus 301 ~~~~~~-~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N~Sg~Ai~~~~~~l~~k~-~~~~~~f~~~l~~~~rl~~~~~g~ 377 (456) T protein:vir:10 301 QANDFT-PMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEGAHNIEKGFLFKC-EDRLSIAKIGLEAILVKALQIEGE 377 (456) T ss_pred cccChh-HHHHHHHHHHHHHHhccCCChHHhccccc-ChHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCC Confidence 653322 47888999999999999999999986432 223222222222222211 112222222222211 1 0111 Q ss_pred cccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC----CCCeeeecccccchhhccccCCCcccC Q lcl|NC_019719. 348 VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP----GGDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 348 ~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~----~gd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) .....++..+......+..+.++.+.++++.|+.+..-+++++|+.+.+ ..+...... ......-.+.|.++| T Consensus 378 ~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~---~~~~~~~~~~~~~~~ 454 (456) T protein:vir:10 378 SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQI---TLFAGNPVQRPQEDG 454 (456) T ss_pred CcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHH---HHHhhhhhhcCCCCC Confidence 1112344444555677888999999999999999998888899987531 111110000 000011124456677 Q ss_pred C Q lcl|NC_019719. 
424 A 424 (424) Q Consensus 424 a 424 (424) + T Consensus 455 ~ 455 (456) T protein:vir:10 455 S 455 (456) T ss_pred C Confidence 7 No 148 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.87 E-value=3.2e-09 Score=67.12 Aligned_cols=393 Identities=13% Similarity=0.047 Sum_probs=182.0 Q ss_pred ccCCCCCchHHHHHhhccCcccCccccccccccccccccc-------CcccccHHHHhhhHHHHHHHHHHHHhhccCceE Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHL-------GDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |.--|-.-++++|........... ......+.+.... ....-.......+.+...+|+..+..+-.-|+. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~---~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~ 77 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRV---RLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGIT 77 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHH---HHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCee Confidence 333333344444443332211100 0001111111100 000001111122346678889888888788887 Q ss_pred EEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCC Q lcl|NC_019719. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +. ...+.. ....+.+++. + | ....+...+..+.+.+|.||+.+-++.+|.+ .+..++|..+.+..++. 
T Consensus 78 ~~-~~~d~~-----~~~~~~~~~~-~-n---~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~-~i~~~~p~~~~~i~d~~ 145 (456) T protein:vir:79 78 VG-GSADSD-----LALRARRIWR-D-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPL 145 (456) T ss_pred cC-CCCCcc-----HHHHHHHHHH-h-c---ChhHHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEeccceeEEEEcCC Confidence 52 111111 1123444443 2 2 2345567788999999999999988989987 57888898888776642 Q ss_pred ce-----E--EEEEecCce---EE-------------------------------ecHhHeeEecc-C---CCCccccCc Q lcl|NC_019719. 161 KV-----V--YRYQRDSEY---AD-------------------------------FSQKEIFHLKG-F---GFTGLVGLS 195 (424) Q Consensus 161 ~~-----~--~~~~~~~~~---~~-------------------------------~~~~evih~r~-~---~~~~~~G~s 195 (424) .. . |....+... .. ....++-|.-. + ..+...|+| T Consensus 146 ~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~~~g 225 (456) T protein:vir:79 146 QPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMG 225 (456) T ss_pred CCCceEEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEecCCCCCc Confidence 21 0 110000000 00 00011111100 0 012235667 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC--CCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccc Q lcl|NC_019719. 196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE--KVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGV 273 (424) Q Consensus 196 ~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~ 273 (424) .++.....++.......-........+.|..++.-.. ....++.-..+ ...+.+.. ..+.++.++.+.++.++.. 
T Consensus 226 d~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i-~~~~~~~~--~~~~~~~~~~~~~~~q~~~ 302 (456) T protein:vir:79 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAI-DYASIFEA--APGALWELPPGVDIWESQT 302 (456) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCccccccccccccc-chhhhhhh--hccccccCCCCcceeeecc Confidence 6666555444433322221222222223333332110 00001110001 11111111 1235677788888877765 Q ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHh----hccCccccc Q lcl|NC_019719. 274 TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR----WLIPAKDVG 349 (424) Q Consensus 274 ~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~----~l~~~~~~~ 349 (424) +..+ .+.+..+....+|+..-++|++.++.... +.|+...+.....+...+- =....|...|.+ .+-...... T Consensus 303 ~~~~-~~~~~l~~~i~~i~~~t~~p~~~~~~~~~-N~Sg~Al~~~~~~l~~k~~-~~~~~f~~~l~~~~~l~~~~~g~~~ 379 (456) T protein:vir:79 303 NDFT-PMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEGAHNIEKGFLFKCE-DRLSIAKIGLEAILVKALQIEGESV 379 (456) T ss_pred cChH-HHHHHHHHHHHHHHhhcCCChhHhccccc-CcHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcCCCc Confidence 4332 37888999999999999999999986432 2233333332222222211 111122222221 110111111 Q ss_pred cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC----CCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 350 RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP----GGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 350 ~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~----~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ...++..+......+..+.++.+.++++.|+.+..-+++.+|+.+.+ ..+....-.+. .... 
--+.+.++|| T Consensus 380 ~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~~i~~~e~~r~~~e~~~-~~~~--~~~~~~~~~~ 455 (456) T protein:vir:79 380 EDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITL-FAGN--PVQRPQEDGS 455 (456) T ss_pred cccceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHHHHHHHHHHHHHHHHHH-Hhhh--HhhcCCCCCC Confidence 12334444455667888999999999999999998888899987631 11111110000 0111 1235666777 No 149 >protein:vir:106027 Length: 629 # NCBI annotation: gp9 # Family: family:all:2798 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654906;genbank:gi:109392362;genbank:GeneID:4157055 Probab=98.86 E-value=2.4e-09 Score=67.84 Aligned_cols=402 Identities=12% Similarity=0.086 Sum_probs=204.8 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccc---ccc----ccccccCcccc-c------HHHHhhhHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQT---GPV----SAHGHLGDSSI-N------DERILQISTVWRC 66 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~---~~~----~~~~~~~~~~~-~------~~~~~~~~~v~~~ 66 (424) |---...+- .+.. ..|.+.... .+. .......|... + -+-+-..+.++-. T Consensus 1 ma~~~lrv~------------rrpk----~~p~~r~l~aasqp~~P~~~~~~~~~g~~~~~~WQ~eAW~~~d~VgElryy 64 (629) T protein:vir:10 1 MAASTLRVS------------RRPK----GSPARRSLTAASQPMEPGRTPSRQVAGTVVRTSWQNEAWECMDLVGELRYY 64 (629) T ss_pred CCccceeEE------------ecCC----CccceeeeccccCCCCcchhhchhhhhhhhhhhhhHHHHHHHHhhhhHHHH Confidence 211111000 0000 011111100 000 00001111110 0 0112224667778 Q ss_pred HHHHHHhhccCceEEEEecccCccccccc--cchhhhh----hccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019719. 67 VSLISTLTACLPLDVFETDQNDNRKKVDL--SNPLARL----LRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~~~~~~~~~~~~~--~~~l~~l----L~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) +..++++++++.+..-+-+.++....... +||-... 
...--..-+-..++++.+..++-+-|+.|++++.-.++ T Consensus 65 vgW~~ss~Sr~rL~as~idpDtg~ptg~i~ed~p~~~~v~~~v~~iagG~lGqaqLlkr~~~~ltV~GE~~i~il~~~~~ 144 (629) T protein:vir:10 65 VGWRASSCSRVELIASELDPDTGKPTGGIRDDDPDGLRFLEIVKTMAGGPLGQAQLQKRAAECLTVPGEHRICLLDQGDK 144 (629) T ss_pred hhhhhhhheeeeEEEeeecCCCCCCccccccCchhHHHHHHHHHHhcCccchHHHHHHHHHhheeccCceEEEEeecCCC Confidence 88899999999998877776654333322 3333222 22223334567888999999999999999998754443 Q ss_pred ----cee-eEEeecCceEEEEEcCCceEEEEEecCceEEecHhHeeEecc--CC-CCccccCchHHHHHHHHHHHHHHHH Q lcl|NC_019719. 141 ----DVI-SLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKG--FG-FTGLVGLSPIAFACKSAGVAVAMED 212 (424) Q Consensus 141 ----~~~-~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~--~~-~~~~~G~s~~~~~~~~i~~~~~~~~ 212 (424) .+. ..+.+....|. ..+.+..---..++...+|..+.=+.+|- ++ .....--||+.++...+.-.....+ T Consensus 145 ~pd~~~r~~W~vVt~~Ei~--~kg~g~~~i~lpdg~~he~~~~~D~l~RvW~P~Prr~~e~DSpvra~l~~lrEi~r~tk 222 (629) T protein:vir:10 145 NPDGSVRHNWYVVTNDEVK--NKGAGKTDIELPDGTIHEYSKGRDVMFRVWNPRPRRAKEPDSPVRACLDSLREIIRTTK 222 (629) T ss_pred CCCcccccceeeecHHHhc--cccCceeEEEcCCCceeeeeCCCeeEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhh Confidence 333 23333333322 22222221112233344454333333332 22 2334567888888888777766666 Q ss_pred HHHHHHhccCCCceeEEcCCCCC-----------C----------HHHHHHHHHHH----HHHhCCc-ccCc--ceec-- Q lcl|NC_019719. 213 QQRDFFANGAKSPQILSTGEKVL-----------T----------EQQRSQVEENF----KEIAGGP-VKKR--LWIL-- 262 (424) Q Consensus 213 ~~~~~~~n~~~p~~vl~~~~~~~-----------~----------~~~~~~~~~~~----~~~~~~~-~~g~--~~~l-- 262 (424) ..++..+.-..-.||+-++...+ + ....+.+...+ ...+-.. .+.. 
++++ T Consensus 223 ~i~~aakSRL~gnGvlflP~e~slp~~~ap~~~~~Pg~~~p~~~g~aa~d~l~~~l~q~a~aAi~De~S~aA~vPiia~v 302 (629) T protein:vir:10 223 KIRNASKSRLIGNGVVFLPQELSLPRATAPVADNQPGAPVPIVDGVAAADELSNLLFQTAAAAVDDEDSQAALIPLLATV 302 (629) T ss_pred HhHHHHHhHHhhCceeEeccCcccccccCCCCCCCCcccccccCCCcchHHHHHHHHHHHHhhhcCCCCccceeeeEEee Confidence 66555554444455543332211 0 01222233322 2222211 1111 2222 Q ss_pred CC--CceeeecccC-hhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC-CCCccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 263 EA--GFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGDVE-KSTSWGSGIEQQNLGFLQYTLQPYISRWENSI 338 (424) Q Consensus 263 ~~--g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~-~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l 338 (424) ++ --+++.|... --+.--+++++..+..+|....|||+.|-+.+ ++|-+ +.-+....-++--|.|.+..|++.+ T Consensus 303 P~E~l~~ikhLkf~~eite~~iktR~daI~RlAmglDispErLLGlGsd~NHW--sAWqI~dedvrlHI~P~l~~ic~Ai 380 (629) T protein:vir:10 303 PGEHLQKIFHLKIGNEITEVEIKTRNDAIARLAMGLDVSPERLLGLGSNSNHW--SAWQIGDEDVQLHIKPVMEVLCAAI 380 (629) T ss_pred chHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCChhheeeccCCccce--eeEEecccceeeecchHHHHHHHHH Confidence 21 1244555442 22334578999999999999999999887764 55554 2234444556677899999999999 Q ss_pred HhhccCcc------ccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC-------------C Q lcl|NC_019719. 339 QRWLIPAK------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG-------------D 399 (424) Q Consensus 339 ~~~l~~~~------~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~g-------------d 399 (424) ++.+|.+- +...|-+.||.+.|.. 
|+ ++.+.+..+...|.+|-...|+.+|+.-.++= | T Consensus 381 t~~~Lrp~L~~eGiDp~~Yvvw~DaS~Lt~-dP-d~~deA~~a~drGaIt~eAlRr~lG~~~dd~y~~~t~~~~q~~A~~ 458 (629) T protein:vir:10 381 YREVLVATLRAEGIDPDRYVLWYDASGLTV-DP-DKTDEATAAKEQGAITHEAYRRYLGLADEDGYDLETLEGAQAWARD 458 (629) T ss_pred HhHHHHHHHHHhCCCHHHhEeeecCccccc-CC-CCcHHHHHHHHcCCccHHHHHHHhccccccCCCcCCcHHHHHHHHH Confidence 99877432 2235888999988743 33 35566677889999999999999999643221 1 Q ss_pred eeeeccccc----ch-----------------hhccccCCCcccCC Q lcl|NC_019719. 400 VAMRQSQYV----PI-----------------TDLGTNKEPRNNGA 424 (424) Q Consensus 400 ~~~~~~n~~----~~-----------------~~~~~~~~~~~~ga 424 (424) ....+..++ |+ ...++.+++.+++. T Consensus 459 ~v~~~P~Li~~~apll~~~l~~i~~P~p~~a~~~~~~~~~~~E~~~ 504 (629) T protein:vir:10 459 AIVADPSLIKVLAPLLTDELAEIDWPEPPAALPPGEDDQADEEQDT 504 (629) T ss_pred HhcCCCchhhhhhhhcCCccccccccCCCCcCCCCCcccCccccCC Confidence 111111111 11 01111111111111 No 150 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.85 E-value=2.3e-08 Score=62.50 Aligned_cols=357 Identities=10% Similarity=-0.008 Sum_probs=170.3 Q ss_pred ccCcccc-cHHHH---hhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHH Q lcl|NC_019719. 46 HLGDSSI-NDERI---LQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMT 121 (424) Q Consensus 46 ~~~~~~~-~~~~~---~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~ 121 (424) .+....- ..... ....+...||+.+++.+---.|.+ .++. ....+.+++. + | ........+. 
T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~~----~d~~-----~~~~~~~i~~-~-N---~~d~~~~~~~ 66 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFCGLIANASVHRLLALGVTG----PDGE-----PDTRASRWWQ-A-N---RLDSRQKLVW 66 (434) T ss_pred CCCCCccHHHHHhhhhhhccchHHHHHHHHhhhccCceec----CCCc-----hHHHHHHHHH-h-c---ChhHHHHHHH Confidence 0000000 00000 012345567777776543333321 1111 1233444443 2 2 2445666788 Q ss_pred HHHHHcCCeEEEEeeCCCCce------eeEEeecCceEEEEEcCCce------EEEE-EecCce---EEe---------- Q lcl|NC_019719. 122 MQLCFYGNAYALVDRNSAGDV------ISLLPLQSANMDVKLVGKKV------VYRY-QRDSEY---ADF---------- 175 (424) Q Consensus 122 ~~~l~~G~a~~~~~r~~~G~~------~~l~~l~~~~v~~~~~~~~~------~~~~-~~~~~~---~~~---------- 175 (424) .+.+.+|.+|+.+.++.++.. ..+..++|..+.+..|.... .|.. ...+.. ..+ T Consensus 67 ~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~ 146 (434) T protein:vir:98 67 RMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDIDGFGYARVFFDDTSFPYRTR 146 (434) T ss_pred HHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccCCceEEEEEEeCcEEEEEEe Confidence 899999999999987765432 23667889888877764321 1100 000000 000 Q ss_pred ----------c----------------Hh--HeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCcee Q lcl|NC_019719. 176 ----------S----------------QK--EIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQI 227 (424) Q Consensus 176 ----------~----------------~~--evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~v 227 (424) + -. =|+|+.+...-.-.|.|.++.....++..............-.+.|..+ T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~ 226 (434) T protein:vir:98 147 ERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKW 226 (434) T ss_pred eccccccccccccceecccccccccCCCCccceEEeccCCCcCcCCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhh Confidence 0 00 0334433321112588888777777777666655555444445556555 Q ss_pred EEcC--CCCCCHHHHHHHHHHHHHHhCCcccCcceecC-CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcC Q lcl|NC_019719. 
228 LSTG--EKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGD 304 (424) Q Consensus 228 l~~~--~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~-~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~ 304 (424) +.-. ....++ . ......++..... .++++.++ ++.++.++.....+ .+++..+..+..++..=++|++.++. T Consensus 227 i~G~~~~~~~~~-~-~~~~~~~~~~~~~--~~~i~~~~~~~~~~~q~~~~~~~-~~~~~l~~~i~~~~~~~~~p~~~~~~ 301 (434) T protein:vir:98 227 IKGHKFAKRTDP-A-TGMTVVDQPFVPS--PSAVWASEGENTQFGQLDATDLS-GFLKEHASDVRDMLTISQTPTYLYAT 301 (434) T ss_pred hcCCCccccccc-c-cccchhhhhhhcc--ccccccCCCCCceEEEecCcchH-HHHHHHHHHHHHHhcccCCCHHHhcc Confidence 5311 111111 1 1111222222221 23466665 35677777654333 37788888899999999999999985 Q ss_pred CCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhh--ccCc-cc--cccceeeecchhhhccCHHHHHHHHHHHHhCC Q lcl|NC_019719. 305 VEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRW--LIPA-KD--VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAG 379 (424) Q Consensus 305 ~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~--l~~~-~~--~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g 379 (424) .. ++.|+...+.+...+...+- =..+.|...|.+- |... .+ .....+++.+......+..+.++.+.+++..| T Consensus 302 ~~-~n~Sg~Al~~~~~~l~~k~~-~k~~~f~~~l~~~~rl~~~~~g~~~~~~~~~v~w~~~~~~s~~~~ada~~kl~~~g 379 (434) T protein:vir:98 302 DL-VNISADTIGALDILHVAKVR-EHIASFSEGLESVLALAAAQAGVPEDYTEAEVRWANPAHVTMAVKADAATKLKSIG 379 (434) T ss_pred cc-CChHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcCCChhheeeeEEecCCCCCCHHHHHHHHHHHHhcC Confidence 32 23332223222222222221 1223333333221 1100 01 11233455556667788999999999999988 Q ss_pred CCCHHHHHHHhCCCCCC------C--CCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 380 LRTINEMRRTDNLPPLP------G--GDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 380 ~~T~NE~R~~~G~~p~~------~--gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) + +..-+++++|+++-+ . 
.+...........+.......+.+++| T Consensus 380 ~-~~e~~~~~lg~~~~e~~r~~~e~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 431 (434) T protein:vir:98 380 Y-PLDVIAEELDESPARVRRIVAGAASQALLAASLLPAPGAPSAGNVPDSGGA 431 (434) T ss_pred C-cHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCcccCC Confidence 6 777778888887632 0 000000000000011112222333333 No 151 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=98.67 E-value=1e-07 Score=58.93 Aligned_cols=394 Identities=11% Similarity=0.078 Sum_probs=172.7 Q ss_pred CCCCcccccCCCCC-chHHHHHhhccCcccCcccccccccccccccccC--ccccc---HHHHhhhHHHHHHHHHHHHhh Q lcl|NC_019719. 1 MEEPKYTIDLRTNN-GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLG--DSSIN---DERILQISTVWRCVSLISTLT 74 (424) Q Consensus 1 ~~~~~~~~~~~~~~-G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~---~~~~~~~~~v~~~i~~ia~~i 74 (424) |.-|-=-|+--.+. =+++.|...+...... .......+.+-.... +..+. ...-....+...+|+..+..+ T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~r---~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l 77 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDSTQN---LKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQ 77 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhh Confidence 33222212111111 2344444443322211 011111111111100 00111 111112345566777766655 Q ss_pred ccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce-------eeEEe Q lcl|NC_019719. 75 ACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV-------ISLLP 147 (424) Q Consensus 75 a~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~-------~~l~~ 147 (424) ---++. ..++. .....+.+++. + | ....+...+..+++.+|.||+.+.++..+.. 
..+.+ T Consensus 78 ~~~g~~---~~~~~-----~~~~~~~~i~~-~-N---~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~ 144 (485) T protein:vir:10 78 AVEGFR---FGDAD-----EADEELWQWWQ-A-N---NLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRV 144 (485) T ss_pred ccccee---cCCCc-----hhHHHHHHHHH-h-c---CHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEE Confidence 322332 11111 11123444443 2 2 3456778889999999999999988765432 25777 Q ss_pred ecCceEEEEEcCCce------EEEEEecCc----eEEecHhHe-------------------------eEeccC-CCCcc Q lcl|NC_019719. 148 LQSANMDVKLVGKKV------VYRYQRDSE----YADFSQKEI-------------------------FHLKGF-GFTGL 191 (424) Q Consensus 148 l~~~~v~~~~~~~~~------~~~~~~~~~----~~~~~~~ev-------------------------ih~r~~-~~~~~ 191 (424) ++|..+.+..|+... .+.+...+. ...+.++.+ +++.+. ...+. T Consensus 145 ~~p~~~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~ 224 (485) T protein:vir:10 145 EPPTRMYAEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDL 224 (485) T ss_pred EccceeEEEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCcccEEEeccccccCCC Confidence 888888877663221 111111110 011222222 333322 12345 Q ss_pred ccCchHH----HHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC--CCCCHHHHHHHHHHHHHHhCCcccCcceecC-C Q lcl|NC_019719. 192 VGLSPIA----FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE--KVLTEQQRSQVEENFKEIAGGPVKKRLWILE-A 264 (424) Q Consensus 192 ~G~s~~~----~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~-~ 264 (424) +|.|-+. .+.+.+....+-......+|. .|..++.-.. ....+. ..-...++. ..+.++.++ + T Consensus 225 ~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a---~p~~~i~G~~~~~~~~~~--~~~~~~~~~-----~~~~i~~~~~~ 294 (485) T protein:vir:10 225 YGTSEITPELRSMTDAAARILMLMQATAELMG---VPQRLIFGIKPEEIGVDP--ETGQTLFDA-----YLARILAFEDA 294 (485) T ss_pred CCccchhHHHHHHHHHHHHHHHHHHHHHHhhc---chHHHHhcCCcccccccc--cccchhhhh-----cccceeccCCC Confidence 6777543 333333333333333334433 3444443110 000000 000111111 123466665 4 Q ss_pred CceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhh--- Q lcl|NC_019719. 
265 GFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRW--- 341 (424) Q Consensus 265 g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~--- 341 (424) +.++.++....-+ .+++..+.....++..=++|+..++....+..|+.........+...+- =....|...|.+. T Consensus 295 d~k~~q~~~~~~~-~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~-~k~~~f~~~l~~~~~l 372 (485) T protein:vir:10 295 EGKIQQFSAAELA-NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVE-RKNSIFGGAWEEAMRL 372 (485) T ss_pred CceEEeecccchH-HHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Confidence 5677776654333 3778888889999999999999998654433332223322222222221 1222222222211 Q ss_pred ---ccCcccc--ccceeeecchhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCC--CCCe------------ Q lcl|NC_019719. 342 ---LIPAKDV--GRIHAEHNLDGLLRGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP--GGDV------------ 400 (424) Q Consensus 342 ---l~~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g--~~T~NE~R~~~G~~p~~--~gd~------------ 400 (424) +....+. ....+++.+......+..+.++.+.+++++| +++..-+++.+|+.+-+ .... T Consensus 373 ~~~~~~~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~ 452 (485) T protein:vir:10 373 AYRMMKGGDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGL 452 (485) T ss_pred HHHHhCCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHH Confidence 1111111 1234555556666788999999999999866 88888889999887632 1110 Q ss_pred ---eeecccccchhhccc--------cCCCcccCC Q lcl|NC_019719. 401 ---AMRQSQYVPITDLGT--------NKEPRNNGA 424 (424) Q Consensus 401 ---~~~~~n~~~~~~~~~--------~~~~~~~ga 424 (424) +..+.... 
++..+ .+....+|| T Consensus 453 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 453 IGTMVDPNPTV--PGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred HHHhhccCCCC--CCCCCccccccCcCCCCCCCCC Confidence 00111100 00000 001112222 No 152 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=98.65 E-value=1.4e-07 Score=58.15 Aligned_cols=396 Identities=9% Similarity=0.051 Sum_probs=170.2 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccccccccccccC--cccccH---HHHhhhHHHHHHHHHHHHhhc Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLG--DSSIND---ERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---~~~~~~~~v~~~i~~ia~~ia 75 (424) |.-|-..-+=-+-.=+..+|...+....... ......+.+..... +..+.. .....+.+...+|+..+..+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl---~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~ 77 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDL---GDNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQE 77 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHH---HHHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHHHHHhhhc Confidence 3222111111110112233333332211100 01111111111110 111111 111223455667777776554 Q ss_pred cCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce-------eeEEee Q lcl|NC_019719. 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV-------ISLLPL 148 (424) Q Consensus 76 ~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~-------~~l~~l 148 (424) -..|.+ .+ +. .....+.+++. 
+ | ........+..+.+.+|.||+.+.++.+|.+ ..+.++ T Consensus 78 ~~g~~~---~~-~~----~~~~~l~~i~~-~-N---~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~ 144 (484) T protein:vir:77 78 LEGFRL---GG-AD----KADEQLWDWWQ-A-N---DLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVE 144 (484) T ss_pred cCceec---CC-cc----hhHHHHHHHHH-h-c---CHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEe Confidence 334432 11 11 11223444443 2 2 3456678889999999999999999888754 247778 Q ss_pred cCceEEEEEcCCce------EEEEEec-Cce---EEecHhH-------------------------eeEeccCC-CCccc Q lcl|NC_019719. 149 QSANMDVKLVGKKV------VYRYQRD-SEY---ADFSQKE-------------------------IFHLKGFG-FTGLV 192 (424) Q Consensus 149 ~~~~v~~~~~~~~~------~~~~~~~-~~~---~~~~~~e-------------------------vih~r~~~-~~~~~ 192 (424) +|..+.+..|+... .+.+... +.. ..+.++. |+++.+.. ..++. T Consensus 145 ~p~~~~~~~D~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~ 224 (484) T protein:vir:77 145 PPTNLYAQIDPRTRQVMRAIRAIEDEEGNEVIGATLYLPNNTVIWNREDGQWVQVANVAHNLEMVPVIPIPNRTRLSDLY 224 (484) T ss_pred ccceeEEEecCCCCceEEEEEEEEeecCCcEEEEEEEecCeEEEEEecCCceEeeccccCCCCCcceEEeccccccCccC Confidence 88888876664211 0111100 000 0111121 34444322 23456 Q ss_pred cCchHH----HHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH--HHHHHHHHHHhCCcccCcceecC-CC Q lcl|NC_019719. 193 GLSPIA----FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR--SQVEENFKEIAGGPVKKRLWILE-AG 265 (424) Q Consensus 193 G~s~~~----~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~--~~~~~~~~~~~~~~~~g~~~~l~-~g 265 (424) |.|.+. .+.+.+....+-......++ +.|..++.-- .. ++... ..-...++.. 
.+.++.++ ++ T Consensus 225 G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~---a~p~~~i~G~-~~-~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~ 294 (484) T protein:vir:77 225 GTTEITPELRSVTDAAARTLMLMQATAELM---GVPQRLLFGV-KG-EELGVDPETGQTLFDAY-----LARILAFEDHE 294 (484) T ss_pred CcccchHHHHHHHHHHHHHHHHHHHHHHhh---hhhHHHHhCC-Cc-chhcccccccchhhhhh-----hhhhcccCCCC Confidence 777554 33333333333333333443 3354444311 10 11000 0011112211 23456665 45 Q ss_pred ceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHh----h Q lcl|NC_019719. 266 FSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR----W 341 (424) Q Consensus 266 ~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~----~ 341 (424) .++.++....-+ -+++..+.....++..-++|++.+|+...+..|+.........+...+ .-....|...|.+ . T Consensus 295 ~~~~q~~~~~~e-~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka-~~k~~~f~~~l~~~~~l~ 372 (484) T protein:vir:77 295 SKAQQFSAAELR-NFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTV-ERKNKIFGGAWEQAMRVA 372 (484) T ss_pred ceeEeecCCChH-HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Confidence 777777654433 377888888999999999999999865433233222222222211111 1111112222211 1 Q ss_pred c--cCcccc--ccceeeecchhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCC--CCCeee------ecccc Q lcl|NC_019719. 342 L--IPAKDV--GRIHAEHNLDGLLRGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP--GGDVAM------RQSQY 407 (424) Q Consensus 342 l--~~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g--~~T~NE~R~~~G~~p~~--~gd~~~------~~~n~ 407 (424) + ....+. ....+++.+......+....++.+.+++++| +++..-+++++|+-+.+ ...... 
....+ T Consensus 373 ~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~ 452 (484) T protein:vir:77 373 YKVMNGGDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLM 452 (484) T ss_pred HHHhCCCCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHH Confidence 1 111111 1123455555666788889999999999876 88888888888885432 110000 00000 Q ss_pred cchhhc----------cccCCCcccCC Q lcl|NC_019719. 408 VPITDL----------GTNKEPRNNGA 424 (424) Q Consensus 408 ~~~~~~----------~~~~~~~~~ga 424 (424) .++... .+..++..+.+ T Consensus 453 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (484) T protein:vir:77 453 GTMFGTDPSGGGNPDNPETPEPQPNPA 479 (484) T ss_pred hhhccccccCCCCCCCCCcccccCCCc Confidence 011000 01111111111 No 153 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.53 E-value=3.4e-07 Score=56.02 Aligned_cols=385 Identities=11% Similarity=0.011 Sum_probs=175.1 Q ss_pred hHHHHHhhcc----CcccCcc------ccccc------ccccccccccCc--------------ccccHHHHhhhHHHHH Q lcl|NC_019719. 16 WWARLQSWFV----GGRLVTP------NQGSQ------TGPVSAHGHLGD--------------SSINDERILQISTVWR 65 (424) Q Consensus 16 ~~~~l~~~~~----~~~~~~~------~~~~~------~~~~~~~~~~~~--------------~~~~~~~~~~~~~v~~ 65 (424) +|++|+.+++ .-..... ..... .....+...+.| .... ...+....... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~-~~~~~~n~~k~ 79 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVN-RRQLSMNLPKV 79 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccc-cceeecchHHH Confidence 4444444332 2110000 00000 000000000111 0000 11122344556 Q ss_pred HHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeE Q lcl|NC_019719. 
66 CVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISL 145 (424) Q Consensus 66 ~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l 145 (424) +++..|+-+..=|..+--.+ . .....+.+++. .-....-...++.+...+|.+|+.+..+.+|.+ .+ T Consensus 80 i~~~~a~~l~~~p~~i~~~d--~-----~~~e~l~~~~~-----~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~-~i 146 (496) T protein:vir:38 80 TAKYMSKLLFNEKVKINIDD--K-----AAEEFVLNVLK-----TNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNV-KV 146 (496) T ss_pred HHHHHhhhhhCCcceEeeCC--h-----HHHHHHHHHHh-----ccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcE-EE Confidence 77777777766666542111 0 01112233332 123566677788899999999999999888765 46 Q ss_pred EeecCceEEEEEcCCceE------EEEEecC--------------c----------------eEEecHh----------- Q lcl|NC_019719. 146 LPLQSANMDVKLVGKKVV------YRYQRDS--------------E----------------YADFSQK----------- 178 (424) Q Consensus 146 ~~l~~~~v~~~~~~~~~~------~~~~~~~--------------~----------------~~~~~~~----------- 178 (424) -.++|..+.+...+.+.. ..+..++ . ...++.. T Consensus 147 ~~v~~~~~~P~~~~~~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~ 226 (496) T protein:vir:38 147 SFATADCMYPLSNDSENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVP 226 (496) T ss_pred EEEcccceEEEEecCCcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCcccccccccccccccee Confidence 667777776543322110 0000000 0 0001000 Q ss_pred ----H---eeEeccC-----CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE-----cCCCCCCHHHHH Q lcl|NC_019719. 179 ----E---IFHLKGF-----GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS-----TGEKVLTEQQRS 241 (424) Q Consensus 179 ----e---vih~r~~-----~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~-----~~~~~~~~~~~~ 241 (424) . +.|++.+ ..+.+.|+|.+..+...++....+.....+-|+.+ .+..++. ...+... +... 
T Consensus 227 ~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~-~~~i~v~~~~l~~~~~~~g-~~~~ 304 (496) T protein:vir:38 227 LPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLG-KKKVLVPSSFVKTAVNLDG-STTQ 304 (496) T ss_pred ecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhc-ccceecchHHhhccCCCCC-cccc Confidence 0 2233322 12346799999988888877766655555556653 3333331 0101000 0000 Q ss_pred HHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHH- Q lcl|NC_019719. 242 QVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL- 320 (424) Q Consensus 242 ~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~- 320 (424) ......+.+. .....-.+++..++.+......-++.+..+.....|+...|+||..++....+..+...+..... T Consensus 305 ~~~~~~~~~~----~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~ 380 (496) T protein:vir:38 305 YFDSTDEAFF----LYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSE 380 (496) T ss_pred CCCCccceEE----EeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHH Confidence 0000000000 00011122333466666666666788888888999999999999999875544332222211111 Q ss_pred ---------HHHHHHHHHHHHHHHHHHHhhcc-CccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh Q lcl|NC_019719. 321 ---------GFLQYTLQPYISRWENSIQRWLI-PAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTD 390 (424) Q Consensus 321 ---------~~~~~tl~P~~~~ie~~l~~~l~-~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~ 390 (424) ..++.+|..++..+-...+.... .........+.+.++.-+..|.++.++.+.+++.+|+++.-.+++.. 
T Consensus 381 l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~ 460 (496) T protein:vir:38 381 TYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRA 460 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhc Confidence 22233344444433322221111 11111223455555666678889999999999999999988887643 Q ss_pred -CCCCCCCCCeeeec-----ccccchhhccccCCCcc Q lcl|NC_019719. 391 -NLPPLPGGDVAMRQ-----SQYVPITDLGTNKEPRN 421 (424) Q Consensus 391 -G~~p~~~gd~~~~~-----~n~~~~~~~~~~~~~~~ 421 (424) |.+. +..++.+.- ....|.++.+...+.++ T Consensus 461 ~~~~d-~ea~~el~ri~~E~~~~~~~~d~~~~~~~~e 496 (496) T protein:vir:38 461 WNITE-AEADEWAEMLAKEKQAEMPNNDMNGIFGEEE 496 (496) T ss_pred CCCCh-HHHHHHHHHHHHhhhccCccccccCCCCCCC Confidence 4432 222111100 00011111111111111 No 154 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=98.47 E-value=5.2e-07 Score=55.02 Aligned_cols=407 Identities=11% Similarity=0.109 Sum_probs=188.2 Q ss_pred CCCCcccccCCCC---CchHHHHHhhccCccc---Ccccccccccccc----cccccCccccc--------HHHHhhhHH Q lcl|NC_019719. 1 MEEPKYTIDLRTN---NGWWARLQSWFVGGRL---VTPNQGSQTGPVS----AHGHLGDSSIN--------DERILQIST 62 (424) Q Consensus 1 ~~~~~~~~~~~~~---~G~~~~l~~~~~~~~~---~~~~~~~~~~~~~----~~~~~~~~~~~--------~~~~~~~~~ 62 (424) |..-|+-.+.-.. .-+.+.+.++ ..+.. +.+.......++. ....+++..-+ .+.++.+|. 
T Consensus 1 ~~~~~~w~~~de~~~~~~~~~~~~~~-~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma~~~pE 79 (533) T protein:vir:58 1 MPSLEKYKKLNEAVNFTNFLSPMYGM-GAPHGAGGSSMIPINMYHPFATAGYASRFYGGIEFNRFFLYDMYDRMDYTDPL 79 (533) T ss_pred CCCcchhhhhhHHHHHHHhhchhhcc-cCccCCCCCccccCCCCcchhhhhhhhhhhccccccHHHHHHHHHHhhccCcc Confidence 4333332222111 1233333332 11110 0111111111110 11112222212 233456799 Q ss_pred HHHHHHHHHHhhccC-----ceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee- Q lcl|NC_019719. 63 VWRCVSLISTLTACL-----PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR- 136 (424) Q Consensus 63 v~~~i~~ia~~ia~~-----~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r- 136 (424) |.+||+.|.+.+.-. |+.+-....+ ..+ .....+..+|+ ...--..++..|...|..|..++. T Consensus 80 Vd~AideIvneaiv~d~~~~pV~v~l~~~e--~s~-~iK~kI~~lld--------f~~~~~~~fR~WYVDGriy~Hkiik 148 (533) T protein:vir:58 80 ISTVLDIIADECTIPNENGNIVDVVTKDIE--LAK-AILSYLDYVIN--------IEKNAYPIIRNMIKYGDMFLHILEK 148 (533) T ss_pred hhhHHHhhhceeeEecCCCceeEeeccccc--ccH-HHHHHHHHHhc--------chhhhhHHHHhhhhcceeEEEeccC Confidence 999999999887643 3333211111 000 11112222222 222234556677789999988763 Q ss_pred CCCCceeeEEeecCceEEEEEcCCc--eEEEEE-------ecCceEEecHhHeeEeccC--CCCccccCchHHHHHHHHH Q lcl|NC_019719. 137 NSAGDVISLLPLQSANMDVKLVGKK--VVYRYQ-------RDSEYADFSQKEIFHLKGF--GFTGLVGLSPIAFACKSAG 205 (424) Q Consensus 137 ~~~G~~~~l~~l~~~~v~~~~~~~~--~~~~~~-------~~~~~~~~~~~evih~r~~--~~~~~~G~s~~~~~~~~i~ 205 (424) +..+-+.+|..|+|..|+...+..+ .+|.|. .+.....++.+.|+|+.+- ..++.+++|-+..+.+.+. 
T Consensus 149 ~~k~GI~elr~lDPr~i~~vr~~~t~~eyyvy~~~~~~~~s~~~~~kI~~daI~y~~SGl~d~~~~~iisyLhkAiKp~N 228 (533) T protein:vir:58 149 GSDGTIEKFQVVSPYIFSKRYNPETDTWYYVITDVYRNVVSGYFNEDIPEEDVIHFSHKIDTNFFPYGRSYLESARAIWN 228 (533) T ss_pred CcccchhhheecCCeeeEEEEeeccceEEEeecccccccccCccccccchhheeeeeeccccCCCCceehhhhHHHHHHH Confidence 3566788999999999988776422 233333 2234577899999999753 3456788899999977777 Q ss_pred HHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCC----cccCcc------e----ec-------- Q lcl|NC_019719. 206 VAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGG----PVKKRL------W----IL-------- 262 (424) Q Consensus 206 ~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~----~~~g~~------~----~l-------- 262 (424) ....++....-+----+.-+-|+..+-+.. +..+.+-++....++... .+.|.+ + .+ T Consensus 229 QLkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRR 308 (533) T protein:vir:58 229 QLRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRR 308 (533) T ss_pred HHHHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhccccc Confidence 766666665544333333344555554443 333444455555444321 122222 1 11 Q ss_pred --CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019719. 263 --EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR 340 (424) Q Consensus 263 --~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~ 340 (424) ..|.+++.|... .+.-++-..+..+.+.++++||.+-|+..++...+ +.+.-...- +..-|.-+-..|.+.|.. 
T Consensus 309 eGgrgTEI~TLpGg--~lgemeDV~YF~kkLy~ALnVP~sRl~~e~~fgr~-~eItRDEiK-F~KFI~rLR~rF~~ll~~ 384 (533) T protein:vir:58 309 GDRRAVEIDILQGS--KVDLAEDVEYMLNRLISALKVPKAFIGYEGDVNAK-NTLATQDIK-FNNTIKRIQGFFVEELER 384 (533) T ss_pred CCCccceeeecCCC--CCCcHHHHHHHHHHHHHHhCCCeeecCCCCCCccc-hhhhHHHHH-HHHHHHHHHHHHHHHHhc Confidence 135566666542 24445666777899999999999999765433221 111111111 444566666678888887 Q ss_pred hccCcccc--ccceeeecchhh----hccC-HHHHHHHHHHH---HhC-----CC--CCHHHHH------HHhCCCCC-- Q lcl|NC_019719. 341 WLIPAKDV--GRIHAEHNLDGL----LRGD-SASRAAFMKAM---GEA-----GL--RTINEMR------RTDNLPPL-- 395 (424) Q Consensus 341 ~l~~~~~~--~~~~~~fd~~~l----~~~d-~~~~~~~~~~~---~~~-----g~--~T~NE~R------~~~G~~p~-- 395 (424) .|....-. ..+.+.|..+.. .... ...|...+..+ ++. .+ || +|+. +..+..++ T Consensus 385 qLilk~iit~eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~t-dei~~q~e~ie~E~~~~~~~ 463 (533) T protein:vir:58 385 MVRMNKEFADQDFRLVMNRSNSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIP-YDLKPQEEVAEAAGGGGLFD 463 (533) T ss_pred ccccccCcchhheeeeeeccchHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCC-hhhhHHHHHHHHhhcCCCCC Confidence 77654322 123444443332 1111 11222222211 111 01 12 2222 22333322 Q ss_pred -CCCCeeeeccccc-----chhh--------ccccCCCcc-----------------cCC Q lcl|NC_019719. 396 -PGGDVAMRQSQYV-----PITD--------LGTNKEPRN-----------------NGA 424 (424) Q Consensus 396 -~~gd~~~~~~n~~-----~~~~--------~~~~~~~~~-----------------~ga 424 (424) ++-++-+.|.... |+.. .++..++.. 
+|| T Consensus 464 ~~~~~~e~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g~~ 523 (533) T protein:vir:58 464 TGGFGEETTPADFLGERGSPIESPRGRTEFDFGTEGGEELGGELNLGGAFEEFEEETGGG 523 (533) T ss_pred CCCcccccCCcccCccccCcccCCCChhhHhcccCCcccccccccccccchhhhhhcCCc Confidence 1111111221111 1111 011111111 111 No 155 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=98.40 E-value=8.3e-07 Score=53.93 Aligned_cols=391 Identities=12% Similarity=0.033 Sum_probs=163.7 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCccccccccccccccccc--Cccccc---HHHHhhhHHHHHHHHHHHHhhc Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHL--GDSSIN---DERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---~~~~~~~~~v~~~i~~ia~~ia 75 (424) .+++- |+ .-+++.|...+..... +.......+.+.... .+..+. +.....+.+...+|+..+..+. T Consensus 8 ~~~~~--~~----~~~~~~L~~~~~~~~~---r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~ 78 (485) T protein:vir:24 8 QEEIA--DP----AIARDEMVSAFEDQNQ---NLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAERQA 78 (485) T ss_pred CCccc--ch----HHHHHHHHHHHHHHHH---HHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhhhhc Confidence 11111 00 0111222222211100 000000111111100 001111 1111123445566666666554 Q ss_pred cCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce-------eeEEee Q lcl|NC_019719. 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV-------ISLLPL 148 (424) Q Consensus 76 ~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~-------~~l~~l 148 (424) --++.+ .. +. .....+.+++. . 
| ........+..+++.+|.||+++-++.++.+ ..+.++ T Consensus 79 ~~g~~~---~~-~~----~~~~~l~~i~~-~-N---~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~ 145 (485) T protein:vir:24 79 VEGFRL---GD-AD----EADEELWQWWQ-A-N---NLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVE 145 (485) T ss_pred cCceec---CC-Cc----hhHHHHHHHHH-h-c---ChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEe Confidence 444432 11 11 11123444443 2 2 3456678899999999999999988876543 257788 Q ss_pred cCceEEEEEcCCce------EEEEEecC-c---eEEecHhH-------------------------eeEeccC-CCCccc Q lcl|NC_019719. 149 QSANMDVKLVGKKV------VYRYQRDS-E---YADFSQKE-------------------------IFHLKGF-GFTGLV 192 (424) Q Consensus 149 ~~~~v~~~~~~~~~------~~~~~~~~-~---~~~~~~~e-------------------------vih~r~~-~~~~~~ 192 (424) +|..+.+..+.... .+.+...+ . ...+.++. |+|+++. ...+++ T Consensus 146 ~p~~~~~i~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~ 225 (485) T protein:vir:24 146 PPTRMYAEIDPRIGRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLY 225 (485) T ss_pred ccceeEEEeeCCcCceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCcccEEEeccCcccCCcC Confidence 88888877664321 11111110 0 01122222 2334322 134457 Q ss_pred cCchHHH-HHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH--HHHHHHHHHHHHhCCcccCcceecC-CCcee Q lcl|NC_019719. 193 GLSPIAF-ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ--QRSQVEENFKEIAGGPVKKRLWILE-AGFST 268 (424) Q Consensus 193 G~s~~~~-~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~--~~~~~~~~~~~~~~~~~~g~~~~l~-~g~~~ 268 (424) |.|-+.- +...++....+..-........+.|..++.-- .. ++. ..+.-...++. 
..+.++.++ ++.++ T Consensus 226 G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~-~~-~~~~~~~~~~~~~~~~-----~~~~i~~~~~~~~~~ 298 (485) T protein:vir:24 226 GTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGI-KP-EEIGVDPETGQTLFDA-----YLARILAFEDAEGKI 298 (485) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccC-Cc-cccccccccccchhhh-----cccceeccCCCCceE Confidence 7776542 23333332222222222223334455444311 10 000 00001111221 123456664 46677 Q ss_pred eecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHH----------HHHHHHHHHHHHHHHHH Q lcl|NC_019719. 269 SAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG----------FLQYTLQPYISRWENSI 338 (424) Q Consensus 269 ~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~----------~~~~tl~P~~~~ie~~l 338 (424) .++.....+ .+++..+.....++..=++|+..+|....+..|+......... .+...+.-.++.+.... T Consensus 299 ~q~~~~~~e-~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~ 377 (485) T protein:vir:24 299 QQFSAAELA-NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLM 377 (485) T ss_pred EeecccchH-HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 776654333 3677788888888888899999998654433332222211111 11222222222221111 Q ss_pred HhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCC--CCCeee-----ecc---- Q lcl|NC_019719. 339 QRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP--GGDVAM-----RQS---- 405 (424) Q Consensus 339 ~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g--~~T~NE~R~~~G~~p~~--~gd~~~-----~~~---- 405 (424) +. .-...+ ...+++.+......+..+.++.+.+++.+| +++..-+++++|+.+.+ ...... .+. 
T Consensus 378 ~~-~~~~~d--~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~ 454 (485) T protein:vir:24 378 KG-GDVPPD--MLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLLG 454 (485) T ss_pred cC-CCCccc--cceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHH Confidence 10 000111 123444445555677888999999998865 78777778888886532 111000 000 Q ss_pred ---cccchhh------ccccCCCcccCC Q lcl|NC_019719. 406 ---QYVPITD------LGTNKEPRNNGA 424 (424) Q Consensus 406 ---n~~~~~~------~~~~~~~~~~ga 424 (424) +..+... .....++..+|+ T Consensus 455 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~ 482 (485) T protein:vir:24 455 TMVDADPTVPGSPNPTPAPKPQPAIEGG 482 (485) T ss_pred hhcccCCCCCCCCCCCCCCCCccCCCCC Confidence 0000000 000111112222 No 156 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=98.37 E-value=9.8e-07 Score=53.52 Aligned_cols=388 Identities=12% Similarity=0.047 Sum_probs=172.3 Q ss_pred CchHHHHHhhccCcccC---------ccccccc------ccccccccccCccc--cc---------HHHHhhhHHHHHHH Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLV---------TPNQGSQ------TGPVSAHGHLGDSS--IN---------DERILQISTVWRCV 67 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~---------~~~~~~~------~~~~~~~~~~~~~~--~~---------~~~~~~~~~v~~~i 67 (424) -|+|+++++||++.... ....... .....+..++.|.. +. .+..++......+. T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~ 80 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSA 80 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHH Confidence 67888888887553110 0000000 00000001111111 00 00111112222333 Q ss_pred HHHHHhhccCceEEEEecccCcccccccc----chhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCcee Q lcl|NC_019719. 
68 SLISTLTACLPLDVFETDQNDNRKKVDLS----NPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVI 143 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~----~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~ 143 (424) +.+|+-+..=+..+.-.+.++........ ..+..++. -| .....++..+.+.+..|.+++.+..+.. . + T Consensus 81 ~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~--~n---~f~~~~~~~~e~a~a~G~~a~k~~~d~~-~-~ 153 (517) T protein:vir:98 81 DVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQ--HN---KFIKNLSDYLEPTFALGGLTVRPYVDNG-E-I 153 (517) T ss_pred HHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHH--hc---cHHHHHHHHHHHHhhhCCEEEEEEEeCC-e-e Confidence 33333332222222212211111110111 11222222 11 2344455566777778888887766643 2 2 Q ss_pred eEEeecCceEEEEEc-------------------CCceEEE----------------EEe-------cC---ceEEec-- Q lcl|NC_019719. 144 SLLPLQSANMDVKLV-------------------GKKVVYR----------------YQR-------DS---EYADFS-- 176 (424) Q Consensus 144 ~l~~l~~~~v~~~~~-------------------~~~~~~~----------------~~~-------~~---~~~~~~-- 176 (424) .+..+++.++.+... +...+|. |.+ +. ....++ T Consensus 154 ~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~ 233 (517) T protein:vir:98 154 EFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLE 233 (517) T ss_pred EEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCcccccccccc Confidence 355555655544221 1111111 000 00 000111 Q ss_pred ------HhH----------eeEeccCCC-----CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC Q lcl|NC_019719. 177 ------QKE----------IFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL 235 (424) Q Consensus 177 ------~~e----------vih~r~~~~-----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~ 235 (424) +.+ +.|++.+-. +.+.|+|....+...+......-.....-|+.|-. ..++ +.... 
T Consensus 234 ~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~-~i~v--p~~~l 310 (517) T protein:vir:98 234 ELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQR-TVFV--SDVML 310 (517) T ss_pred ccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCc-ceec--Chhhh Confidence 111 124443211 34679999998888887777666555566666543 3222 21111 Q ss_pred C-------HHH---HHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCC Q lcl|NC_019719. 236 T-------EQQ---RSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDV 305 (424) Q Consensus 236 ~-------~~~---~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~ 305 (424) . ... .+.-.+.+....+. +++-.++.++....+-++.+..+...+.|+...|+++..++.. T Consensus 311 ~~~~~~~g~~~~~~~d~~~~~y~~~~~~---------~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~ 381 (517) T protein:vir:98 311 RTVPDESGMPPPQVFDPDVNVYKSIRMG---------TDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFD 381 (517) T ss_pred ccccCCCCcccCCCCCcccceeeeccCC---------CCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCccccccc Confidence 0 000 00000000001100 1223467777777788899999999999999999999999986 Q ss_pred CCCCccchhHHHH--HHHHHHHHHHHHHHHHHHHHHh------------hccCccccccceeeecchhhhccCHHHHHHH Q lcl|NC_019719. 306 EKSTSWGSGIEQQ--NLGFLQYTLQPYISRWENSIQR------------WLIPAKDVGRIHAEHNLDGLLRGDSASRAAF 371 (424) Q Consensus 306 ~~~~~~~~n~e~~--~~~~~~~tl~P~~~~ie~~l~~------------~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~ 371 (424) .++..+ +.+. ...-.-.++.-+...++..|.. .++...-...+.+.+++++-+..|.++.++. T Consensus 382 ~~~~kT---ATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~ 458 (517) T protein:vir:98 382 GRSMKT---ATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRF 458 (517) T ss_pred cccccc---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHH Confidence 654432 2221 1122223333344444433332 1222211223456777778788999999999 Q ss_pred HHHHHhCCCCCHHHHHHHh-CCCCCCCCCeeeec-----ccccchhhccccCCCcccCC Q lcl|NC_019719. 
372 MKAMGEAGLRTINEMRRTD-NLPPLPGGDVAMRQ-----SQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 372 ~~~~~~~g~~T~NE~R~~~-G~~p~~~gd~~~~~-----~n~~~~~~~~~~~~~~~~ga 424 (424) ..+++.+|+|++-+++.++ |+.. +..++.+.. ....|.+....++++-.+-. T Consensus 459 ~~~~v~aG~ms~~~~i~~~~g~~e-eeA~~e~~~i~~E~~~~~~~~~~~~~~~~~~gd~ 516 (517) T protein:vir:98 459 YGQAKTFGFIPTVEAIQRIFKVPK-KTAEQWLEEIRKDQIELDPVTISQRAQKRMFGDE 516 (517) T ss_pred HHHHHhcCCCCHHHHHHHhCCCCh-HHHHHHHHHHHHhccccCCCCccccccCCCCCCC Confidence 9999999999999986654 7653 222111110 01111111111000000000 No 157 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=98.32 E-value=1.4e-06 Score=52.74 Aligned_cols=347 Identities=8% Similarity=0.005 Sum_probs=168.7 Q ss_pred ccCCCCCchHHHHHhhccCcccCcccccccccccccccccCc--ccccHH---HH-hhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGD--SSINDE---RI-LQISTVWRCVSLISTLTACLPLDV 81 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---~~-~~~~~v~~~i~~ia~~ia~~~~~v 81 (424) |+.+ .+++|...+....... ......+.+...... ..+.++ .+ +-..+..-+|+.++..+.=-.|. T Consensus 1 ~~~~----~i~~L~~~~~~~~~r~---~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~- 72 (409) T protein:vir:94 1 MTEK----GIGYLRFKLSVHKRRA---EMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFE- 72 (409) T ss_pred CCHH----HHHHHHHHHHHHhHHH---HHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcccCccc- Confidence 4433 2333433333222111 111111211111111 111111 11 11245556677666543322221 Q ss_pred EEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc Q lcl|NC_019719. 82 FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK 161 (424) Q Consensus 82 ~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~ 161 (424) . . +..+.+++. + | +.......+..+.+.+|.+|+.+..+.+|.| .+.+++|..+....|... 
T Consensus 73 --~-~---------d~~l~~i~~-~-N---~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~-~i~~~sp~~~~~i~D~~~ 134 (409) T protein:vir:94 73 --N-D---------DFTVNEIFE-E-N---NPDIFFDSAVLSSLIASCSFTYISKGENDAV-RLQVIEAVNATGIIDPIT 134 (409) T ss_pred --C-C---------chHHHHHHH-h-c---ChhHHHHHHHHHHHHhcceeEEEecCCCCce-EEEEeccceEEEEEecCC Confidence 1 1 112444443 2 2 2344556778889999999999999999976 677888988888776532 Q ss_pred e----EEEEEec---Cc---eEEecHhHe----------------------eEeccC-CCCccccCch----HHHHHHHH Q lcl|NC_019719. 162 V----VYRYQRD---SE---YADFSQKEI----------------------FHLKGF-GFTGLVGLSP----IAFACKSA 204 (424) Q Consensus 162 ~----~~~~~~~---~~---~~~~~~~ev----------------------ih~r~~-~~~~~~G~s~----~~~~~~~i 204 (424) . .|.+... +. ...+.++++ +++.+. ..+.++|.|. +..+.+.+ T Consensus 135 ~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~ 214 (409) T protein:vir:94 135 GLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNA 214 (409) T ss_pred CceeeeEEEEEecCCCceEEEEEEecCcEEEEEecCceeEeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHH Confidence 1 1111110 00 011222222 233221 2345677774 44555555 Q ss_pred HHHHHHHHHHHHHHhccCCCceeEE-cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-----CCceeeecccChhHH Q lcl|NC_019719. 205 GVAVAMEDQQRDFFANGAKSPQILS-TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-----AGFSTSAIGVTPQDA 278 (424) Q Consensus 205 ~~~~~~~~~~~~~~~n~~~p~~vl~-~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~-----~g~~~~~l~~~~~d~ 278 (424) .-...-......||.+ |..++. ...+. +..+.++.... +++.++ .+.++.++....-+ T Consensus 215 ~r~~~~~~~~~e~~a~---pqr~i~G~d~d~---~~~~~~~~~~~---------~i~~~~~d~dg~~~~v~q~~~~~l~- 278 (409) T protein:vir:94 215 KRTLERADVTAEFYSF---PQKYVTGLSDDA---EPMETWKATVS---------SMLQFTKDEDGDKPTLGQFTQPSMS- 278 (409) T ss_pred HHHHHHHHHHHHHhcC---hhheeEecCCCC---cccchhhhhHH---------HhhcCCCCCCCCCceEEecCCCChh- Confidence 5555555556666655 544443 22111 11222222222 344443 23456565543322 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHH---HHHHHHHHHHHh--hccCccc--c-cc Q lcl|NC_019719. 
279 EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ---PYISRWENSIQR--WLIPAKD--V-GR 350 (424) Q Consensus 279 ~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~---P~~~~ie~~l~~--~l~~~~~--~-~~ 350 (424) .|++..+.....+|..-++|++.+|....+.+|......+...+...+=. -+-..+++.+-. .+....+ . .. T Consensus 279 ~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~ 358 (409) T protein:vir:94 279 PFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQF 358 (409) T ss_pred HHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccccc Confidence 47899999999999999999999997655433333333333333222211 112222222211 1111111 0 11 Q ss_pred ceeeecchhhhccC---HHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCC Q lcl|NC_019719. 351 IHAEHNLDGLLRGD---SASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP 396 (424) Q Consensus 351 ~~~~fd~~~l~~~d---~~~~~~~~~~~~~~g--~~T~NE~R~~~G~~p~~ 396 (424) ..+++.+..+...+ ....++.+.+++++| +...+-+++++|+..-+ T Consensus 359 ~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 359 RKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred ccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 22333333333333 356678899999998 66678999999998755 No 158 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=98.29 E-value=1.6e-06 Score=52.36 Aligned_cols=347 Identities=8% Similarity=0.003 Sum_probs=167.7 Q ss_pred ccCCCCCchHHHHHhhccCcccCcccccccccccccccccCc--ccccHHH---H-hhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGD--SSINDER---I-LQISTVWRCVSLISTLTACLPLDV 81 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~---~-~~~~~v~~~i~~ia~~ia~~~~~v 81 (424) |+.+ .+++|...+....... ......+.+...... ..+.++. + +-..+..-+|+.++..+.=-.|. 
T Consensus 1 ~~~~----~i~~L~~~~~~~~~r~---~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~- 72 (409) T protein:vir:16 1 MTEK----GIGYLRFKLSVHKRRA---EMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFE- 72 (409) T ss_pred CCHH----HHHHHHHHHHHHhHHH---HHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhccccccc- Confidence 4433 2333333332222111 111111111111111 1111111 1 11244555666666544322221 Q ss_pred EEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc Q lcl|NC_019719. 82 FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK 161 (424) Q Consensus 82 ~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~ 161 (424) . . +..+.+++. + | +.......+..+.+.+|.+|+.+..+.+|.+ .+.+++|..+....|... T Consensus 73 --~-~---------d~~l~~i~~-~-N---~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~-~i~~~sP~~~~~i~D~~~ 134 (409) T protein:vir:16 73 --N-D---------DFTVNEIFE-E-N---NPDIFFDSTVLSALIASCSFTYISKGENDAV-RLQVIEATNATGIIDPIT 134 (409) T ss_pred --C-c---------chHHHHHHH-h-c---ChhHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEeeccc Confidence 1 1 112444443 2 2 2344556788889999999999999988875 677888888887766422 Q ss_pred e----EEEEEe---cCce---EEecHhH----------------------eeEeccC-CCCccccCc----hHHHHHHHH Q lcl|NC_019719. 162 V----VYRYQR---DSEY---ADFSQKE----------------------IFHLKGF-GFTGLVGLS----PIAFACKSA 204 (424) Q Consensus 162 ~----~~~~~~---~~~~---~~~~~~e----------------------vih~r~~-~~~~~~G~s----~~~~~~~~i 204 (424) . .+.+.. .+.. ..+.+++ |++|.+. ..+.++|.| |+..+.+.+ T Consensus 135 ~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~ 214 (409) T protein:vir:16 135 GLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNA 214 (409) T ss_pred ccceeeeEEEEecCCCceEEEEEEecCcEEEEEecCccccceecCCCCcceEEecccccccccCCccccchhHHHHHHHH Confidence 1 111110 0100 1112222 3333322 234567777 355556666 Q ss_pred HHHHHHHHHHHHHHhccCCCceeEE-cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-----CCceeeecccChhHH Q lcl|NC_019719. 
205 GVAVAMEDQQRDFFANGAKSPQILS-TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-----AGFSTSAIGVTPQDA 278 (424) Q Consensus 205 ~~~~~~~~~~~~~~~n~~~p~~vl~-~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~-----~g~~~~~l~~~~~d~ 278 (424) .-...-......||.+ |..++. ...+. ...+. |+.. .++++.++ .+.++.++....-+ T Consensus 215 ~r~~~~~~~~~e~~a~---pqr~i~G~d~d~---~~~~~----~~~~-----~~~i~~~~~d~~g~~~~v~q~~~~~l~- 278 (409) T protein:vir:16 215 KRTLERADVTAEFYSF---PQKYVTGLSDDA---EPMET----WKAT-----VSSMLQFTKDEDGDKPTLGQFTQPSMS- 278 (409) T ss_pred HHHHHHHHHHHHHhcC---hhheeEecCCCC---Cccch----hhhh-----hhHhhccCCCCCCCCceEEecCCCChh- Confidence 6666666666677654 544443 22111 11122 2221 12355553 23466666544333 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHH---HHHHHHHHHHHhhcc--Ccc-c-cc-c Q lcl|NC_019719. 279 EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ---PYISRWENSIQRWLI--PAK-D-VG-R 350 (424) Q Consensus 279 ~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~---P~~~~ie~~l~~~l~--~~~-~-~~-~ 350 (424) .|++..+.....+|..=++|++.+|....+-+|...+..+...+...+-. -+-..+++.+-.-+. ... + .. . T Consensus 279 ~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~ 358 (409) T protein:vir:16 279 PFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLREQF 358 (409) T ss_pred HHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhh Confidence 48899999999999999999999997654323322333333333222211 122222222211111 111 1 10 1 Q ss_pred ceeeecchhhh---ccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCC Q lcl|NC_019719. 351 IHAEHNLDGLL---RGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP 396 (424) Q Consensus 351 ~~~~fd~~~l~---~~d~~~~~~~~~~~~~~g--~~T~NE~R~~~G~~p~~ 396 (424) ..+++.+.... 
..+....++.+.|++++| ++.-+-+++++|+..-+ T Consensus 359 ~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 359 SKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred ccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 22333333332 233577889999999986 33457779999998654 No 159 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=98.26 E-value=1.9e-06 Score=51.91 Aligned_cols=360 Identities=10% Similarity=0.080 Sum_probs=167.0 Q ss_pred ccCCCCCchHHHHHhhccCcccCcccccccccccccccccCc--ccccHH--HHhh--hHHHHHHHHHHHHhhccCceEE Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGD--SSINDE--RILQ--ISTVWRCVSLISTLTACLPLDV 81 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~--~~~~--~~~v~~~i~~ia~~ia~~~~~v 81 (424) |+.+.-+.+++.+...... . ......+.+...... ..+..+ ...+ ..+...+|+.++..+-=-.|. T Consensus 1 m~~~~i~~L~~~~~~~~~r----~---~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~Gf~- 72 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTG----V---DKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRIIFREFT- 72 (422) T ss_pred CChHHHHHHHHHHHHHHHH----H---HHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhccccceee- Confidence 6666555555554433211 0 111111111111111 111111 1111 134455666655533222221 Q ss_pred EEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC-CCceeeEEeecCceEEEEEcCC Q lcl|NC_019719. 82 FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS-AGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 82 ~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~-~G~~~~l~~l~~~~v~~~~~~~ 160 (424) . . +..+.+.+. + |. .......+..+.+.+|.||+.+.++. +|.| .+.+++|..+....|.. 
T Consensus 73 --~-~---------d~~l~~~w~-~-N~---ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p-~i~~~sp~~~~~i~D~~ 134 (422) T protein:vir:97 73 --N-D---------DFNAWEIFK-A-NN---PDIFFDTAIQSALIASCCFVYIMPGAEDGLP-KMQVIEASKATGILDPT 134 (422) T ss_pred --C-C---------chhHHHHHH-h-cC---hHHHHHHHHHHHHHhcceeEEEeeCCCCCee-EEEEechhhEEEEEeCC Confidence 1 1 112455553 2 32 34445577888999999999998875 5655 68888999998877653 Q ss_pred ceE----E-EEE--ecCce---EEecHhH---------------------eeEeccC-CCCccccCchH----HHHHHHH Q lcl|NC_019719. 161 KVV----Y-RYQ--RDSEY---ADFSQKE---------------------IFHLKGF-GFTGLVGLSPI----AFACKSA 204 (424) Q Consensus 161 ~~~----~-~~~--~~~~~---~~~~~~e---------------------vih~r~~-~~~~~~G~s~~----~~~~~~i 204 (424) ... + .+. ..+.. ..++... |++|.+. ....++|.|.+ ..+.+.+ T Consensus 135 ~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~ 214 (422) T protein:vir:97 135 TFLLTEGYAILESDSNGNPTLEAYFTDKDIWYYPKKGKPYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAA 214 (422) T ss_pred CCcceeeEEEEEecCCCcEEEEEEEcCceEEEEcCCCccccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHH Confidence 210 1 111 11110 1111111 2333322 23456787744 3444444 Q ss_pred HHHHHHHHHHHHHHhccCCCceeEE-cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC-----CceeeecccChhHH Q lcl|NC_019719. 205 GVAVAMEDQQRDFFANGAKSPQILS-TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-----GFSTSAIGVTPQDA 278 (424) Q Consensus 205 ~~~~~~~~~~~~~~~n~~~p~~vl~-~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~-----g~~~~~l~~~~~d~ 278 (424) .-...-......|+.. |..++. .+.+. ...+. |+.. .++++.++. +.++.++..+.-+ T Consensus 215 ~r~~~~~~~~~e~~a~---pqr~i~G~d~d~---~~~~~----~~~~-----~~~i~~~~~de~~~~~~v~q~~~~~l~- 278 (422) T protein:vir:97 215 KRTLERAEVTAEFYSF---PQKYVLGMDPDA---KPMEK----WRAT-----VSTLLEISKDEDGDKPTVGQFTTASMA- 278 (422) T ss_pred HHHHHHHHHHHHHhcc---hhhhhcccCccc---ccCch----hhhh-----hhhhhccCCCCCCCcceeeecCCCChh- Confidence 4444444555555544 443432 11111 11111 2221 124555542 3466666544333 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHH---HHHHHHHHHHHhhc--cCccc--c-cc Q lcl|NC_019719. 
279 EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ---PYISRWENSIQRWL--IPAKD--V-GR 350 (424) Q Consensus 279 ~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~---P~~~~ie~~l~~~l--~~~~~--~-~~ 350 (424) .|++..+.....++..=++|++.+|....+.+|...+..+...+...+-. -+-..+++.+-..+ ....+ . .. T Consensus 279 ~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~ 358 (422) T protein:vir:97 279 PFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQF 358 (422) T ss_pred HHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhh Confidence 48899999999999999999999998665323333333333333222211 12222222222111 11111 0 01 Q ss_pred ceeeecchhhhccC---HHHHHHHHHHHHhC--CCCCHHHHHHHhCCCCCCCCCeeeecccccchhhccccCCCcccC Q lcl|NC_019719. 351 IHAEHNLDGLLRGD---SASRAAFMKAMGEA--GLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 351 ~~~~fd~~~l~~~d---~~~~~~~~~~~~~~--g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) ..+.+.+......+ ....++.+.+++++ |++..+-+++++|+...+. -... + ++.+-+| T Consensus 359 ~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~~---~~~~--------~---~~~~~d~ 422 (422) T protein:vir:97 359 MDTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGVKGADK---PIPA--------I---TEVTTDG 422 (422) T ss_pred ccceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCCCchhH---HHHH--------H---HhhhccC Confidence 12233333333444 45567777888888 7888899999999964211 0000 0 0011111 No 160 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.26 E-value=2e-06 Score=51.85 Aligned_cols=387 Identities=10% Similarity=0.024 Sum_probs=179.0 Q ss_pred CchHHHHHhhccCcccCc-c---------cccccc------------ccccc-ccccCcccc----cHHHHhhhHHHHHH Q lcl|NC_019719. 
14 NGWWARLQSWFVGGRLVT-P---------NQGSQT------------GPVSA-HGHLGDSSI----NDERILQISTVWRC 66 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~-~---------~~~~~~------------~~~~~-~~~~~~~~~----~~~~~~~~~~v~~~ 66 (424) =|||+|++++|++....- . ...... ..+.+ ..+...... ..+..........+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i 80 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTA 80 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHHH Confidence 799999999986532110 0 000000 00000 000100000 01111122334455 Q ss_pred HHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEE Q lcl|NC_019719. 67 VSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLL 146 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~ 146 (424) ++..|+-+..=|..+.-.+ +. ..+..+.++|. -| ....-.+..+.+.+..|.+++.+..+.++ +.+. T Consensus 81 ~~~~A~lv~~e~~~i~v~~-~~-----~~~e~l~~il~--~n---~f~~~~~~~~e~a~a~G~~~~k~~~d~~~--~~i~ 147 (508) T protein:vir:15 81 ARRIASVVFNEKAEIHVKD-NN-----EADKFLNDVLE--DN---DFKNKFEEALEKGVALGGFAMRPYIDGNH--IKIA 147 (508) T ss_pred HHHHHhhhhCCCceEEeCC-ch-----HHHHHHHHHHH--hc---cHHHHHHHHHHHHhhcCceEEEEEEeCCe--eEEE Confidence 5555555544343332111 10 00112333332 12 23444556677778888888877766432 3556 Q ss_pred eecCceEEEEE-cCCc------------------eEEE---E-E--ecCc----------------eEEecH-------- Q lcl|NC_019719. 147 PLQSANMDVKL-VGKK------------------VVYR---Y-Q--RDSE----------------YADFSQ-------- 177 (424) Q Consensus 147 ~l~~~~v~~~~-~~~~------------------~~~~---~-~--~~~~----------------~~~~~~-------- 177 (424) .++|..+.+.. +.+. .+|. + . .++. 
...++- T Consensus 148 ~v~ad~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~ 227 (508) T protein:vir:15 148 WVRADQFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKE 227 (508) T ss_pred EEcCCeeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccC Confidence 66666665532 2111 1111 0 0 0000 001110 Q ss_pred --hH----------eeEeccCC-----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC--CHH Q lcl|NC_019719. 178 --KE----------IFHLKGFG-----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL--TEQ 238 (424) Q Consensus 178 --~e----------vih~r~~~-----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~--~~~ 238 (424) ++ ..|++.+. .+.+.|+|....+...++.....-......|+. +.+..++. .... +++ T Consensus 228 l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~-~~~~i~v~--~~~l~~d~~ 304 (508) T protein:vir:15 228 LAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRL-GQKHIAVQ--PGMLRFDDE 304 (508) T ss_pred CCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHh-cccceeec--hHHhcCCCC Confidence 11 12343321 134679999999998888877776666666754 44444441 1110 111 Q ss_pred HHHHHHHHHHHHhCCcccCccee--cCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH Q lcl|NC_019719. 239 QRSQVEENFKEIAGGPVKKRLWI--LEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~~g~~~~--l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) ....+. ...+....+- .++|..++.++....+-++.+..+...+.|....|++|..++....+..+..-+. 
T Consensus 305 ~~~~~~-------~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~ 377 (508) T protein:vir:15 305 HKPTFD-------TEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVV 377 (508) T ss_pred CccccC-------CCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHH Confidence 000000 0011111111 1334557777777778888999999999999999999999987655433211111 Q ss_pred ----------HHHHHHHHHHHHHHHHHHHHHHHhhccCccc---------cccceeeecchhhhccCHHHHHHHHHHHHh Q lcl|NC_019719. 317 ----------QQNLGFLQYTLQPYISRWENSIQRWLIPAKD---------VGRIHAEHNLDGLLRGDSASRAAFMKAMGE 377 (424) Q Consensus 317 ----------~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~---------~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~ 377 (424) ......++.+|..++..|-......-+...+ ...+.+.++++.-+..|.++..+...+++. T Consensus 378 s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~ 457 (508) T protein:vir:15 378 SNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLA 457 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHh Confidence 1112223344444444433332211111111 112345677777778899999999999999 Q ss_pred CCCCCHHHHHHHh-CCCCCCCCCeeeec--ccccchhhccccCCCcccCC Q lcl|NC_019719. 378 AGLRTINEMRRTD-NLPPLPGGDVAMRQ--SQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 378 ~g~~T~NE~R~~~-G~~p~~~gd~~~~~--~n~~~~~~~~~~~~~~~~ga 424 (424) +|+|++-+++... |++. +..++.+.. 
.........+...++.+++- T Consensus 458 aGi~s~e~~i~~~~g~~d-eea~~el~ri~~E~~~~~~~~~~~~~~~g~~ 506 (508) T protein:vir:15 458 IGALSKQTFLQRNYGMTD-EQAAEELAKIQSEAPTDTFEGGRSAILNGGD 506 (508) T ss_pred cCCCCHHHHHHhcCCCCh-HHHHHHHHHHHHhccccCccccccccCCCCC Confidence 9999999987653 5543 111111110 00000000011111111111 No 161 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=98.25 E-value=2e-06 Score=51.83 Aligned_cols=394 Identities=9% Similarity=0.015 Sum_probs=177.4 Q ss_pred CCCCccccc--------C--CCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHH Q lcl|NC_019719. 1 MEEPKYTID--------L--RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLI 70 (424) Q Consensus 1 ~~~~~~~~~--------~--~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~i 70 (424) -|.++...+ + +.+..-+.++.....+.-........ ..............+..=+.++....+|+.. T Consensus 20 ~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~---~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~ 96 (503) T protein:vir:59 20 VESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRT---YYDAAGQQLVDDTKTNNRTSHAWHKLFVDQK 96 (503) T ss_pred hhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccch---hcccccccccccccccceeecchHHHHHHHH Confidence 011110000 0 00112223333333221110000000 0000000000000000112245677788888 Q ss_pred HHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecC Q lcl|NC_019719. 71 STLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQS 150 (424) Q Consensus 71 a~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~ 150 (424) +.-+..-|+.+-- .+. + .. ...+.+.. 
| ........+..+.+.+|.+|+.+-.+.+|++ .+..++| T Consensus 97 ~~yl~g~~~~~~~--~d~---~--~~-~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~-~i~~~~p 162 (503) T protein:vir:59 97 TQYLVGEPVTFTS--DNK---T--LL-EYVNELAD--D---DFDDILNETVKNMSNKGIEYWHPFVDEEGEF-DYVIFPA 162 (503) T ss_pred HhhhhcCCeeecc--CcH---H--HH-HHHHHHHh--c---CHHHHHHHHHHHHhhCCeEEEEEeecCCCce-EEEEEcc Confidence 8888888876521 111 1 11 12222322 2 3556667788899999999999999888876 5888888 Q ss_pred ceEEEEEcCCc---e-----EEEEEe-cCc----eEEecHhHeeEecc-------------------------------C Q lcl|NC_019719. 151 ANMDVKLVGKK---V-----VYRYQR-DSE----YADFSQKEIFHLKG-------------------------------F 186 (424) Q Consensus 151 ~~v~~~~~~~~---~-----~~~~~~-~~~----~~~~~~~evih~r~-------------------------------~ 186 (424) ..+.+..++.. . +|.... .+. ...+.++.+.+++. . T Consensus 163 ~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 242 (503) T protein:vir:59 163 EEMIVVYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRV 242 (503) T ss_pred ceeEEEEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCcc Confidence 88887766431 1 111111 110 01223333332211 0 Q ss_pred C----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec Q lcl|NC_019719. 187 G----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL 262 (424) Q Consensus 187 ~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l 262 (424) + .+...|.|.+..+...++....+.....+.+...+.|-.+++-......++....+ . 
.++++.+ T Consensus 243 Piv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~-------~----~~~~~~~ 311 (503) T protein:vir:59 243 PIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANL-------R----YHSVIKV 311 (503) T ss_pred ceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhh-------h----cccceec Confidence 0 12345777777777777666655555555566667776666532222112211111 1 1245556 Q ss_pred CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH----------HHHHHHHHHHHHHHHH Q lcl|NC_019719. 263 EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQPYIS 332 (424) Q Consensus 263 ~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~~~~~tl~P~~~ 332 (424) +++.+++.+........+....+...+.|...-++|..-... ..++.++...+ +.....+...|.-++. T Consensus 312 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~ 390 (503) T protein:vir:59 312 SGDGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPET-IGGGATGPALENLYALLDLKANMAERKIRAGLRLFFW 390 (503) T ss_pred cCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCccc-ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 665555555544444445566666677776666666321111 11222322222 1122233333333333 Q ss_pred HHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC--Ceeee------- Q lcl|NC_019719. 333 RWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG--DVAMR------- 403 (424) Q Consensus 333 ~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~g--d~~~~------- 403 (424) .|...++..-..... ....+.+.+..-+..|..+.++.+.+++.+|+++...+.++++.-+.|.. +..-. 
T Consensus 391 ~i~~~~~~~~~~~~~-~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~ 469 (503) T protein:vir:59 391 FFAEYLRNTGKGDFN-PDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARNPFVQDPEEELARIEEEMNQYAE 469 (503) T ss_pred HHHHHHHhccCcccc-cccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHh Confidence 333333321111111 11224555566677899999999999999999999888888766432210 00000 Q ss_pred -cccc---cchhhccccCCC-----cccCC Q lcl|NC_019719. 404 -QSQY---VPITDLGTNKEP-----RNNGA 424 (424) Q Consensus 404 -~~n~---~~~~~~~~~~~~-----~~~ga 424 (424) ..+. .+.....+++++ +++|+ T Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (503) T protein:vir:59 470 MQGNLLDDEGGDDDLEEDDPNAGAAESGGA 499 (503) T ss_pred hhccccCccCCCCCCCcCCCCCCcccCCCC Confidence 0000 000000111111 11111 No 162 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=98.25 E-value=2e-06 Score=51.81 Aligned_cols=387 Identities=10% Similarity=0.035 Sum_probs=164.0 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCccccccccccccccccc--CcccccHH--H---HhhhHHHHHHHHHHHHh Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHL--GDSSINDE--R---ILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~--~---~~~~~~v~~~i~~ia~~ 73 (424) ++.|.-+|+-+...=+...+........ ++.......+.+.... -+.....+ . -..+.+...+|+..+.. T Consensus 16 ~~~p~~~~~~~~~~~l~~~l~~~~~~~~---~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~ 92 (501) T protein:vir:25 16 VEFPEDSMSREQLGALVADMWRLHISER---QWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQN 92 (501) T ss_pred ccCCcccCChHHHHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhh Confidence 5666655544333222333322222111 1111111111111110 01111110 0 01123555677766654 Q ss_pred hccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceE Q lcl|NC_019719. 
74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) +---.|. .. ++. ....+..++. + |. ....-..+..+.+.+|.||+.+.++.+|. .+..++|..+ T Consensus 93 l~~~gf~---~~-d~~-----~~~~l~~i~~-~-N~---~d~~~~~~~~~a~i~G~ay~~v~~de~~~--~i~~~sp~~~ 156 (501) T protein:vir:25 93 LSVVGYR---NA-LAK-----ENDPAWEMWQ-R-NR---MDARQAEVHRPALTYGASYVTVTPTDEGP--VFRTRSPRQI 156 (501) T ss_pred hccccee---cC-Ccc-----chHHHHHHHH-h-cC---hhHHHHHHHHHHhhcCceEEEEecCCCCC--eEEEeccccE Confidence 4322222 21 111 1233444432 2 22 34555678888999999999999988884 4556788888 Q ss_pred EEEE-cCCc---eE--EEEE--e-c-Cc---eEEecHhH----------------------------------------- Q lcl|NC_019719. 154 DVKL-VGKK---VV--YRYQ--R-D-SE---YADFSQKE----------------------------------------- 179 (424) Q Consensus 154 ~~~~-~~~~---~~--~~~~--~-~-~~---~~~~~~~e----------------------------------------- 179 (424) .+.. |... .. ..+. . . +. ...+.+.. T Consensus 157 ~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (501) T protein:vir:25 157 LAVYADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEG 236 (501) T ss_pred EEEEecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCC Confidence 7554 3211 10 1110 0 0 00 00011111 Q ss_pred -----eeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCc Q lcl|NC_019719. 180 -----IFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGP 254 (424) Q Consensus 180 -----vih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (424) |+|+.+...-...|.|.++.....++..............-.+.|..++.- ...++ .+. ++. 
T Consensus 237 ~~~vPiv~f~N~~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G---~~~~~-~~~----~~~----- 303 (501) T protein:vir:25 237 KPVCPVVRFVNGRDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVISG---WTGSK-AEV----LKA----- 303 (501) T ss_pred ccceeeEeccCccccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhC---CCCCc-cch----hhh----- Confidence 222222111123477755544433333333332222223333335444321 11111 111 111 Q ss_pred ccCcceecC-CCceeeecccChhHH-HHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 255 VKKRLWILE-AGFSTSAIGVTPQDA-EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYIS 332 (424) Q Consensus 255 ~~g~~~~l~-~g~~~~~l~~~~~d~-~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~ 332 (424) ..++++.++ ++.++.++... ++ .+.+..+.....|+..-++|+..++....+. |+.........+...+ .=..+ T Consensus 304 ~~~~i~~~~~~~~~~~q~~~~--~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~-Sg~Al~~~~~~l~~ka-~~k~~ 379 (501) T protein:vir:25 304 SALRVWTFEDPEVKAQAFPPA--SVEPYNLILEEMLQHVAMVAQISPAQVTGKMINV-SAEALAAAEANQQRKL-AAKRE 379 (501) T ss_pred cccceeccCCCCceEEEeccc--ChHHHHHHHHHHHHHHHhhcCCChhhhccccCCh-HHHHHHHHHHHHHHHH-HHHHH Confidence 123566665 35666665432 33 3788999999999999999999998654332 3222222222222221 22223 Q ss_pred HHHHHHHhh--c----cCcccc-ccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHH-HhCCCCCC-------- Q lcl|NC_019719. 333 RWENSIQRW--L----IPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRR-TDNLPPLP-------- 396 (424) Q Consensus 333 ~ie~~l~~~--l----~~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~-~~G~~p~~-------- 396 (424) .|...|.+- | ....+. ....+++.+......+..+.++.+.++++.|+ +.-.+.. 
+.|+.+-+ T Consensus 380 ~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gi-s~et~~~~~~g~~~~~ie~~~~~~ 458 (501) T protein:vir:25 380 SFGESWEQLLRLAAEMDDDPDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGI-PIEHLLSMVPGMTQQTIQAIKDSL 458 (501) T ss_pred HHHHHHHHHHHHHHHHhCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCC-CHHHHHHHcCCCCHHHHHHHHHHH Confidence 333333321 1 111111 12345566667778889999999999998885 3333332 34665411 Q ss_pred ---CCCee---eecccccchhhccc-------cCC--CcccCC Q lcl|NC_019719. 397 ---GGDVA---MRQSQYVPITDLGT-------NKE--PRNNGA 424 (424) Q Consensus 397 ---~gd~~---~~~~n~~~~~~~~~-------~~~--~~~~ga 424 (424) ..+.. ..+....+...... +++ .-++|| T Consensus 459 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 459 RGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGGVNGNGGA 501 (501) T ss_pred HHHhHHHHHHHhhccCcCCCCCCCCCCCccccccccCCCCCCC Confidence 01100 00111111111110 111 112222 No 163 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.25 E-value=2e-06 Score=51.81 Aligned_cols=391 Identities=10% Similarity=0.019 Sum_probs=178.1 Q ss_pred CchHHHHHhhccCcccC---------ccccccccc------ccccccccCcc-----------cccHHHHhhhHHHHHHH Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLV---------TPNQGSQTG------PVSAHGHLGDS-----------SINDERILQISTVWRCV 67 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~---------~~~~~~~~~------~~~~~~~~~~~-----------~~~~~~~~~~~~v~~~i 67 (424) =|||+++++||++.... ......... .-.+...+.|. 
....+..........++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 68999999998763210 000000000 00000001110 00111222333344455 Q ss_pred HHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEe Q lcl|NC_019719. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..|+-+..=|..+.-. +. .....+.++|. -| ....-....+.+.+..|.+++.+..+. |. +.+.. T Consensus 81 ~~~A~lv~~e~~~i~~~---d~----~~~~~l~~il~--~n---~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~-~~I~~ 146 (500) T protein:vir:30 81 KKIASLVFNEQAEIKVD---DD----AANEFISETLK--ND---RFNKNFERYLESCLALGGLAMRPYVDG-DK-VRVAF 146 (500) T ss_pred HHHhhhhcCCcceEecC---Ch----HHHHHHHHHHh--hc---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEE Confidence 55555554433332111 10 01112223332 12 244555666777777888887776664 33 34566 Q ss_pred ecCceEEEEEcC-------------------CceEEE---E--EecCceE-----Ee------------c--------Hh Q lcl|NC_019719. 148 LQSANMDVKLVG-------------------KKVVYR---Y--QRDSEYA-----DF------------S--------QK 178 (424) Q Consensus 148 l~~~~v~~~~~~-------------------~~~~~~---~--~~~~~~~-----~~------------~--------~~ 178 (424) +++..+.+...+ +..+|. + ..++... -| + +. T Consensus 147 v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~ 226 (500) T protein:vir:30 147 VQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKD 226 (500) T ss_pred EcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCc Confidence 677766653221 111110 0 0001000 00 0 01 Q ss_pred H----------eeEeccCC-----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeE-----EcCCCCCCHH Q lcl|NC_019719. 
179 E----------IFHLKGFG-----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQIL-----STGEKVLTEQ 238 (424) Q Consensus 179 e----------vih~r~~~-----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl-----~~~~~~~~~~ 238 (424) + +.|++.+. .+.+.|+|....+...++.....-......++.|.. ..++ .......+.+ T Consensus 227 ~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~-~i~v~~~~l~~~~~~~~g~ 305 (500) T protein:vir:30 227 EAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQR-RVAVPESLTALTVRTTDGD 305 (500) T ss_pred ceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcc-eeeechHHhcccCCCCCcc Confidence 1 22444321 134679999999998888877776666667766443 3332 1111110100 Q ss_pred H-HHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH- Q lcl|NC_019719. 239 Q-RSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE- 316 (424) Q Consensus 239 ~-~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e- 316 (424) . ....-..-+..+..-+ .-..++..++.++....+-++.+..+...++|+...|+++..++....+..+..-+. T Consensus 306 ~~~~~~~d~~~~~~~~~~----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s 381 (500) T protein:vir:30 306 VVPRPRFESDQNVYIRMG----GRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVS 381 (500) T ss_pred ccCCcccCCCcceEEEcC----CCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHH Confidence 0 0000000000000000 001233457777777778889999999999999999999999987655432211110 Q ss_pred ---------HHHHHHHHHHHHHHHHHHHHHHHh-hccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHH Q lcl|NC_019719. 317 ---------QQNLGFLQYTLQPYISRWENSIQR-WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEM 386 (424) Q Consensus 317 ---------~~~~~~~~~tl~P~~~~ie~~l~~-~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~ 386 (424) ......++.+|.-++..|...... 
.++...-...+.+.++++.-...|.++.++...+++.+|+|+.-++ T Consensus 382 ~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~ 461 (500) T protein:vir:30 382 ENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMA 461 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHH Confidence 112233344444444444332221 1221111123445666676677899999999999999999999998 Q ss_pred HHHh-CCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 387 RRTD-NLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 387 R~~~-G~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) +.+. |++. +..++.+....-......+......+-.+ T Consensus 462 i~~~~g~~e-eea~~~l~~i~~E~~~~~~~~~~~~~~~g 499 (500) T protein:vir:30 462 IQKVLNVTE-EKAQEIAAEINTGIVDEINQQRTDTHLYG 499 (500) T ss_pred HHhcCCCCH-HHHHHHHHHHHHhccccCCCCCccccccC Confidence 7554 6542 11211111100000000000000001111 No 164 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.25 E-value=2e-06 Score=51.81 Aligned_cols=391 Identities=10% Similarity=0.019 Sum_probs=178.1 Q ss_pred CchHHHHHhhccCcccC---------ccccccccc------ccccccccCcc-----------cccHHHHhhhHHHHHHH Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLV---------TPNQGSQTG------PVSAHGHLGDS-----------SINDERILQISTVWRCV 67 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~---------~~~~~~~~~------~~~~~~~~~~~-----------~~~~~~~~~~~~v~~~i 67 (424) =|||+++++||++.... ......... .-.+...+.|. 
....+..........++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 68999999998763210 000000000 00000001110 00111222333344455 Q ss_pred HHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEe Q lcl|NC_019719. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..|+-+..=|..+.-. +. .....+.++|. -| ....-....+.+.+..|.+++.+..+. |. +.+.. T Consensus 81 ~~~A~lv~~e~~~i~~~---d~----~~~~~l~~il~--~n---~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~-~~I~~ 146 (500) T protein:vir:98 81 KKIASLVFNEQAEIKVD---DD----AANEFISETLK--ND---RFNKNFERYLESCLALGGLAMRPYVDG-DK-VRVAF 146 (500) T ss_pred HHHhhhhcCCcceEecC---Ch----HHHHHHHHHHh--hc---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEE Confidence 55555554433332111 10 01112223332 12 244555666777777888887776664 33 34566 Q ss_pred ecCceEEEEEcC-------------------CceEEE---E--EecCceE-----Ee------------c--------Hh Q lcl|NC_019719. 148 LQSANMDVKLVG-------------------KKVVYR---Y--QRDSEYA-----DF------------S--------QK 178 (424) Q Consensus 148 l~~~~v~~~~~~-------------------~~~~~~---~--~~~~~~~-----~~------------~--------~~ 178 (424) +++..+.+...+ +..+|. + ..++... -| + +. T Consensus 147 v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~ 226 (500) T protein:vir:98 147 VQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKD 226 (500) T ss_pred EcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCc Confidence 677766653221 111110 0 0001000 00 0 01 Q ss_pred H----------eeEeccCC-----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeE-----EcCCCCCCHH Q lcl|NC_019719. 
179 E----------IFHLKGFG-----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQIL-----STGEKVLTEQ 238 (424) Q Consensus 179 e----------vih~r~~~-----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl-----~~~~~~~~~~ 238 (424) + +.|++.+. .+.+.|+|....+...++.....-......++.|.. ..++ .......+.+ T Consensus 227 ~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~-~i~v~~~~l~~~~~~~~g~ 305 (500) T protein:vir:98 227 EAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQR-RVAVPESLTALTVRTTDGD 305 (500) T ss_pred ceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcc-eeeechHHhcccCCCCCcc Confidence 1 22444321 134679999999998888877776666667766443 3332 1111110100 Q ss_pred H-HHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH- Q lcl|NC_019719. 239 Q-RSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE- 316 (424) Q Consensus 239 ~-~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e- 316 (424) . ....-..-+..+..-+ .-..++..++.++....+-++.+..+...++|+...|+++..++....+..+..-+. T Consensus 306 ~~~~~~~d~~~~~~~~~~----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s 381 (500) T protein:vir:98 306 VVPRPRFESDQNVYIRMG----GRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVS 381 (500) T ss_pred ccCCcccCCCcceEEEcC----CCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHH Confidence 0 0000000000000000 001233457777777778889999999999999999999999987655432211110 Q ss_pred ---------HHHHHHHHHHHHHHHHHHHHHHHh-hccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHH Q lcl|NC_019719. 317 ---------QQNLGFLQYTLQPYISRWENSIQR-WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEM 386 (424) Q Consensus 317 ---------~~~~~~~~~tl~P~~~~ie~~l~~-~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~ 386 (424) ......++.+|.-++..|...... 
.++...-...+.+.++++.-...|.++.++...+++.+|+|+.-++ T Consensus 382 ~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~ 461 (500) T protein:vir:98 382 ENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMA 461 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHH Confidence 112233344444444444332221 1221111123445666676677899999999999999999999998 Q ss_pred HHHh-CCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 387 RRTD-NLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 387 R~~~-G~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) +.+. |++. +..++.+....-......+......+-.+ T Consensus 462 i~~~~g~~e-eea~~~l~~i~~E~~~~~~~~~~~~~~~g 499 (500) T protein:vir:98 462 IQKVLNVTE-EKAQEIAAEINTGIVDEINQQRTDTHLYG 499 (500) T ss_pred HHhcCCCCH-HHHHHHHHHHHHhccccCCCCCccccccC Confidence 7554 6542 11211111100000000000000001111 No 165 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=98.22 E-value=2.4e-06 Score=51.38 Aligned_cols=403 Identities=10% Similarity=-0.031 Sum_probs=177.1 Q ss_pred CCCCcccccCCC--CCchHHHHHhhccCcccCcccccccccccccccccCccccc-HHHH----hhhHHHHHHHHHHHHh Q lcl|NC_019719. 1 MEEPKYTIDLRT--NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSIN-DERI----LQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~--~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~----~~~~~v~~~i~~ia~~ 73 (424) ..--.++|.=-+ ..-++.+|...+......... ....+.+........++ +..+ .-..+...||+.++.. 
T Consensus 2 ~~~~~~~~~gl~~~~~~~~~~L~~~~~~~~~~~~~---~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~r 78 (474) T protein:vir:81 2 IQQQTVRIPSLSNDENALINGLLAQIENLRWKNLL---RTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALARR 78 (474) T ss_pred cCCCcCcCCCCChhHHHHHHHHHHHHHHHhhHHHH---HHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHhh Confidence 000001111000 113455555544433221111 11111111111111111 1111 1235556677777775 Q ss_pred hccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce-eeEEeecCce Q lcl|NC_019719. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV-ISLLPLQSAN 152 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~-~~l~~l~~~~ 152 (424) +.--.|.+ . ++. ..+..+.+++. +.+ .......+..+.+.+|.+|+.+..+.+|.+ ..+.+++|.. T Consensus 79 l~~~Gf~~---~-d~~----~~~~~l~~iw~-~N~----ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~ 145 (474) T protein:vir:81 79 CNLEGFVW---P-DGD----LDSLGGTEVVD-DNH----LLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASE 145 (474) T ss_pred hcccceEC---C-CCC----ccchHHHHHHH-hcC----hhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccce Confidence 55444432 1 111 11123444442 222 234556678889999999999998888865 4567888988 Q ss_pred EEEEEcCCceE-------EEEEecCce---EEecHhH-------------------------eeEeccC-CCCccccCch Q lcl|NC_019719. 153 MDVKLVGKKVV-------YRYQRDSEY---ADFSQKE-------------------------IFHLKGF-GFTGLVGLSP 196 (424) Q Consensus 153 v~~~~~~~~~~-------~~~~~~~~~---~~~~~~e-------------------------vih~r~~-~~~~~~G~s~ 196 (424) +....|..... +....++.. ..+.+++ |+++.+. ..+.++|.|. T Consensus 146 ~~~~~D~~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvPvV~~~n~~~~~~~~G~s~ 225 (474) T protein:vir:81 146 ATGEWNRRRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVPAQVLPYKPAPKRPFGQSR 225 (474) T ss_pred EEEEEeCCCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcceEEecccccccCcCCccc Confidence 88776642210 011111110 1111222 3333322 2344567764 Q ss_pred ----HHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE-cCCC-CCC--HHHHHHHHHHHHHHhCC-cccCcceecCCCce Q lcl|NC_019719. 
197 ----IAFACKSAGVAVAMEDQQRDFFANGAKSPQILS-TGEK-VLT--EQQRSQVEENFKEIAGG-PVKKRLWILEAGFS 267 (424) Q Consensus 197 ----~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~-~~~~-~~~--~~~~~~~~~~~~~~~~~-~~~g~~~~l~~g~~ 267 (424) +..+.+.+.-...-......|+.. |.-++. .... ..+ ......++....+...- .+..+......+.+ T Consensus 226 i~e~v~~l~da~~r~~~~~~~~~e~~a~---pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~ 302 (474) T protein:vir:81 226 ITKPMMGLQDAGVRELARREGHMDVFSY---PEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARAD 302 (474) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHhcc---hhheeecCChhhcccccccccchhhhhHHHHhcCCCccccccccccccc Confidence 344455555545555555666544 444432 1111 101 01122233333332211 11111122223456 Q ss_pred eeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC-CCCccchhHHHHHHHHHHHH---HHHHHHHHHHHHHhhcc Q lcl|NC_019719. 268 TSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE-KSTSWGSGIEQQNLGFLQYT---LQPYISRWENSIQRWLI 343 (424) Q Consensus 268 ~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~-~~~~~~~n~e~~~~~~~~~t---l~P~~~~ie~~l~~~l~ 343 (424) +.++....-+ -|++..+.....+|..-++|++.||... .+..|...+..+...+...+ -.-+-..+++.+-.-+. T Consensus 303 ~~q~~~a~l~-~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~ 381 (474) T protein:vir:81 303 VKQFPAASPD-AHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALA 381 (474) T ss_pred ccccCCCChh-HHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6666543222 3788999999999999999999998643 22233222322222222221 11122233332222111 Q ss_pred Ccc-----c--cccceeeecchhhhccCHHHHHHHHHHHHhCCC--CCHHHHHHHhCCCCCC---CCCeeeecccccchh Q lcl|NC_019719. 344 PAK-----D--VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGL--RTINEMRRTDNLPPLP---GGDVAMRQSQYVPIT 411 (424) Q Consensus 344 ~~~-----~--~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~--~T~NE~R~~~G~~p~~---~gd~~~~~~n~~~~~ 411 (424) -.. + ...+.++..+......+..+.++.+.+++++|. 
.+..=+++++|+.+-+ ..++.-.-....+++ T Consensus 382 i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~~ 461 (474) T protein:vir:81 382 MKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTLQ 461 (474) T ss_pred HhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHHH Confidence 111 0 011334444455667788999999999999873 3334568888998642 111111011122232 Q ss_pred hccccCCCcccCC Q lcl|NC_019719. 412 DLGTNKEPRNNGA 424 (424) Q Consensus 412 ~~~~~~~~~~~ga 424 (424) .+.. .+.....| T Consensus 462 ~l~~-~~~~~~~a 473 (474) T protein:vir:81 462 ALID-RSNNGATA 473 (474) T ss_pred HHHh-cCCCCCCC Confidence 2211 11111112 No 166 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=98.21 E-value=2.5e-06 Score=51.26 Aligned_cols=386 Identities=10% Similarity=0.033 Sum_probs=179.6 Q ss_pred CchHHHHHhhccCcccCcccc---cc------------cccccccccccCcc-----------cccHHHHhhhHHHHHHH Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQ---GS------------QTGPVSAHGHLGDS-----------SINDERILQISTVWRCV 67 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~---~~------------~~~~~~~~~~~~~~-----------~~~~~~~~~~~~v~~~i 67 (424) -|||+++++||++....-..+ .. ......+..++.+. ....+..........++ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTAS 80 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHH Confidence 789999999887543110000 00 00000000011110 00011122223334455 Q ss_pred HHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEe Q lcl|NC_019719. 
68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..|+-+..=|..+.-. + . .....+..+|. -| .....++..+...+..|.+++.+..+. |. +.+-. T Consensus 81 ~~~A~lv~~e~~~i~v~---d-~---~~~~~l~~~l~--~n---~f~~~~~~~~e~a~a~G~~a~k~~~d~-~~-~~i~~ 146 (522) T protein:vir:47 81 KKIASLVYNEQATITTK---N-E---ILQKFLDDMLT--ND---RFNKNFERYLESCLALGGLAMRPYIDG-DK-VRVAF 146 (522) T ss_pred HHHhhhhcCCcceeecC---C-h---HHHHHHHHHHh--hc---chHHHHHHHHHHhhccCCEEEEEEEcC-Cc-eEEEE Confidence 55555554433322111 1 0 11112333332 12 244555666777777787777766653 32 23444 Q ss_pred ecCceEEEEE-cC------------------CceEEE-----------------------EEe------c------CceE Q lcl|NC_019719. 148 LQSANMDVKL-VG------------------KKVVYR-----------------------YQR------D------SEYA 173 (424) Q Consensus 148 l~~~~v~~~~-~~------------------~~~~~~-----------------------~~~------~------~~~~ 173 (424) +++..+.+.. +. ...+|. |.+ . |..+ T Consensus 147 v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v 226 (522) T protein:vir:47 147 IQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRV 226 (522) T ss_pred EcCCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccc Confidence 4444444421 11 111111 000 0 0000 Q ss_pred Eec--------HhH----------eeEeccCC-----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCc----e Q lcl|NC_019719. 174 DFS--------QKE----------IFHLKGFG-----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSP----Q 226 (424) Q Consensus 174 ~~~--------~~e----------vih~r~~~-----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~----~ 226 (424) .+. +.+ ..|++.+. .+.+.|+|....+...++.....-.....-|+-|-..- . 
T Consensus 227 ~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~ 306 (522) T protein:vir:47 227 NLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEH 306 (522) T ss_pred cccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchH Confidence 000 111 22454331 13468999999998888777766666666666554321 1 Q ss_pred eEEcCCCCCCHH-H-HHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcC Q lcl|NC_019719. 227 ILSTGEKVLTEQ-Q-RSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGD 304 (424) Q Consensus 227 vl~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~ 304 (424) ++.......... . ..... ..+..+.+-+. -.+++-+++.++....+-++.+..+...+.|+...|+++..++. T Consensus 307 ~l~~~~~~~~g~~~~~~~fd-~~~~~f~~~~~----~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~ 381 (522) T protein:vir:47 307 LTQRQYQRPDGTIDFRPRFD-VEQNVYMQIGG----SSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTF 381 (522) T ss_pred HhccCCCCCCcccccccccC-cccceEeecCC----CCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCc Confidence 122211111110 0 00000 00111111000 01233457777777788899999999999999999999999987 Q ss_pred CCCCCccchhHHHH-------------HHHHHHHHHHHHHHHHHHHHHh-hccCccccccceeeecchhhhccCHHHHHH Q lcl|NC_019719. 305 VEKSTSWGSGIEQQ-------------NLGFLQYTLQPYISRWENSIQR-WLIPAKDVGRIHAEHNLDGLLRGDSASRAA 370 (424) Q Consensus 305 ~~~~~~~~~n~e~~-------------~~~~~~~tl~P~~~~ie~~l~~-~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~ 370 (424) ...+.. ++.+. ....++.+|.-++..+....+. 
.++...-...+.+.++++.-+..|.++..+ T Consensus 382 ~~~~~k---TAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~ 458 (522) T protein:vir:47 382 DGQGMK---TATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELD 458 (522) T ss_pred cccccc---cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHH Confidence 655432 22333 3344555555555555443332 122222223355677777777899999999 Q ss_pred HHHHHHhCCCCCHHHHHHHh-CCCCCCCCCeee----------ec--ccccchhhccccCCCccc Q lcl|NC_019719. 371 FMKAMGEAGLRTINEMRRTD-NLPPLPGGDVAM----------RQ--SQYVPITDLGTNKEPRNN 422 (424) Q Consensus 371 ~~~~~~~~g~~T~NE~R~~~-G~~p~~~gd~~~----------~~--~n~~~~~~~~~~~~~~~~ 422 (424) ...+++.+|+|++-+++... |+..- ..++.+ .| .+..+.......++++++ T Consensus 459 ~~~~~v~aG~~s~e~~i~~~~g~~ee-ea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 459 YWAKMVAAGFSTKKRAIGKTLNISGV-EAEKELNAINSELLPMNDAELAIYGMHDQNEEKADDKG 522 (522) T ss_pred HHHHHHhcCCCCHHHHHHhcCCCChH-HHHHHHHHHHHhhccCCCCCCCCCCCCCcccccCCCCC Confidence 99999999999999987653 65431 110000 00 111111111111112222 No 167 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=98.19 E-value=7.6e-08 Score=59.62 Aligned_cols=181 Identities=10% Similarity=0.032 Sum_probs=95.0 Q ss_pred eEEcCC---CCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_019719. 227 ILSTGE---KVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLV 302 (424) Q Consensus 227 vl~~~~---~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l 302 (424) |++.++ .+..++ ..++++++..... 
++.+.+++...+-+|+.++.+...+ .+........||.+-|||...| T Consensus 1 V~k~~~l~~~~~~~~--~~~~~r~~~~~~~~~~~~~~~ld~~~e~~e~~~~~lsGl--~d~l~~~~~~iaa~s~iP~t~L 76 (201) T protein:vir:10 1 MWKAKGLADLCDDSD--GAARLRLAQVDNNSGVGQAIGIDADSEEYNVLNSDIGGI--DTFLSQKFDRIVALSGIHEIIL 76 (201) T ss_pred CccchHHHHHhcCCh--HHHHHHHHHHHHhhhhhhhheeecCCcceeeeecCcCCh--HHHHHHHHHHHHhHhcCchhhh Confidence 444322 111111 1233333322211 1223344555556788888776654 4677788899999999999999 Q ss_pred cCCCCCCccchhHHHHHHHHH-------HHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHH------- Q lcl|NC_019719. 303 GDVEKSTSWGSGIEQQNLGFL-------QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASR------- 368 (424) Q Consensus 303 ~~~~~~~~~~~n~e~~~~~~~-------~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~------- 368 (424) .+...+..+. +.+.....|| +.-+.|.++.+-+-+ ..+. .+.|.++.|...+.+++ T Consensus 77 fG~sp~Glna-tge~d~~nyyd~i~~~Qe~~l~p~le~l~~~~----~~~~-----~~~~~f~pL~~~s~kekAei~~~~ 146 (201) T protein:vir:10 77 KGKNVGGVSA-SQNTALETFYGYVDRKRKAELLPLLEFLLPFI----VTEQ-----EWSVEFNPLSQVSDKDKSEILEKN 146 (201) T ss_pred cCCCCccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHhh----cCCC-----CceEeeCCCCCCCHHHHHHHHHHH Confidence 7766554432 2233333444 344566666544422 2222 23455577777776654 Q ss_pred HHHHHHHHhCCCCCHHHHHHHhCCCCCCCC--CeeeecccccchhhccccCCCccc Q lcl|NC_019719. 369 AAFMKAMGEAGLRTINEMRRTDNLPPLPGG--DVAMRQSQYVPITDLGTNKEPRNN 422 (424) Q Consensus 369 ~~~~~~~~~~g~~T~NE~R~~~G~~p~~~g--d~~~~~~n~~~~~~~~~~~~~~~~ 422 (424) ++.+.+++++|+++++|+|+.+--.+.-++ +... 
...............++++ T Consensus 147 a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~-~~~~~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 147 VNSVAALIAAGIIDADEARDTLRAISTEVKIGEGSI-QTEVVINESEDPLDVSANN 201 (201) T ss_pred HHHHHHHHHcCCCCHHHHHHHHHhcCCcCCCCCCCC-CccccccccCCCCCCCCCC Confidence 466778889999999999998755443221 1110 1011111111112233334 No 168 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.17 E-value=3.2e-06 Score=50.70 Aligned_cols=389 Identities=12% Similarity=-0.003 Sum_probs=175.4 Q ss_pred CCCchHHHHHhhccCcccC------ccccccc------ccccccccccCcc--------------cccHHHHhhhHHHHH Q lcl|NC_019719. 12 TNNGWWARLQSWFVGGRLV------TPNQGSQ------TGPVSAHGHLGDS--------------SINDERILQISTVWR 65 (424) Q Consensus 12 ~~~G~~~~l~~~~~~~~~~------~~~~~~~------~~~~~~~~~~~~~--------------~~~~~~~~~~~~v~~ 65 (424) ==.|+++++++++++-... ....... .....+...+.|. .. .+..+....... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~-~~~~~s~n~~~~ 79 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPV-NRRQLSMNLPKV 79 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCcc-ccceeecchHHH Confidence 1123333444444321100 0000000 0000000011110 00 111223344455 Q ss_pred HHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeE Q lcl|NC_019719. 66 CVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISL 145 (424) Q Consensus 66 ~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l 145 (424) +++..|+-+..=|..+--. +.+....|+. 
-...-....-...++.+.+..|.+|+.+..+.+|.+ .+ T Consensus 80 iv~~~a~~l~~ep~~i~~~-----------d~~~~e~l~~-~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~-~i 146 (499) T protein:vir:80 80 TAKYMSKLLFNEKVKINID-----------DETAEEFVLN-VLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNV-KV 146 (499) T ss_pred HHHHHHHhhhCCcceEeeC-----------CHHHHHHHHH-HHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcE-EE Confidence 6677777666655554211 1122222321 111112455566677788889999999998888765 46 Q ss_pred EeecCceEEEEEcCCc-e--------------EEE--------------EEec--------C--ceEEecHhH------- Q lcl|NC_019719. 146 LPLQSANMDVKLVGKK-V--------------VYR--------------YQRD--------S--EYADFSQKE------- 179 (424) Q Consensus 146 ~~l~~~~v~~~~~~~~-~--------------~~~--------------~~~~--------~--~~~~~~~~e------- 179 (424) -.++|..+.+...+.+ . +|. |... . ....++..+ T Consensus 147 ~~v~a~~~~Pi~~d~~~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~ 226 (499) T protein:vir:80 147 SFATADCMYPLSNDSENVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEP 226 (499) T ss_pred EEEcCCceEEEEecCCCeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCC Confidence 7777777776432211 1 010 0000 0 001111111 Q ss_pred -----------eeEeccCC-----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeE-----EcCCCCCCHH Q lcl|NC_019719. 180 -----------IFHLKGFG-----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQIL-----STGEKVLTEQ 238 (424) Q Consensus 180 -----------vih~r~~~-----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl-----~~~~~~~~~~ 238 (424) +.|+|.+- .+.+.|+|.+.-+...++..........+.|+.+. 
...++ ....+...+ T Consensus 227 ~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~-~~i~v~~~~l~~~~~~~g~- 304 (499) T protein:vir:80 227 VVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGK-KKVLVPSSFVKTAVNLDGS- 304 (499) T ss_pred ceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcc-cceecchhhhhccCCCCCC- Confidence 23444321 23467999999888888877766666666676543 33332 111111000 Q ss_pred HHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH Q lcl|NC_019719. 239 QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ 318 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~ 318 (424) .........+.+.. .....-+++-.++.++....+-++.+..+...++|....|+++..++....+..+...+... T Consensus 305 ~~~~~~~~~~~~~~----~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~ 380 (499) T protein:vir:80 305 TTQYFDSTDEAFFL----YQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSE 380 (499) T ss_pred cccCCCcccceeeE----eeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHH Confidence 00000000000000 00011122334677776666677888888999999999999999998755443321111111 Q ss_pred H----------HHHHHHHHHHHHHHHHHHHHhhccCcc-ccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHH Q lcl|NC_019719. 319 N----------LGFLQYTLQPYISRWENSIQRWLIPAK-DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMR 387 (424) Q Consensus 319 ~----------~~~~~~tl~P~~~~ie~~l~~~l~~~~-~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R 387 (424) . ...++.+|..++..|-...+....... 
......+.++++.-...|.++.++...+++.+|+|+.-.++ T Consensus 381 ~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l 460 (499) T protein:vir:80 381 KSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIAL 460 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHH Confidence 1 111222233333322222111111111 12234566666776778999999999999999999998887 Q ss_pred HHh-CCCCCCCCCeeeecc-----cccchhhccccCCCcc Q lcl|NC_019719. 388 RTD-NLPPLPGGDVAMRQS-----QYVPITDLGTNKEPRN 421 (424) Q Consensus 388 ~~~-G~~p~~~gd~~~~~~-----n~~~~~~~~~~~~~~~ 421 (424) ... |.+. +..++.+.-. ...|-.+.+...+..+ T Consensus 461 ~~~~~~~d-~ea~~el~~i~~E~~~~~~~~d~~g~~ge~e 499 (499) T protein:vir:80 461 QRAWNITE-AEADEWAEMLAKEKQAEIPNNDMTGIFGEEE 499 (499) T ss_pred hhcCCCCh-HHHHHHHHHHHHHhhcCCCCCCccccCCCCC Confidence 653 5432 2221111100 0001101001000001 No 169 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.16 E-value=3.4e-06 Score=50.57 Aligned_cols=389 Identities=8% Similarity=0.005 Sum_probs=172.5 Q ss_pred ccccCCCC-CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_019719. 6 YTIDLRTN-NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFET 84 (424) Q Consensus 6 ~~~~~~~~-~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~ 84 (424) +-.++... .-.++++..+..+.-. ....... .. ..+. +..-+.++.....|+..+.-+-+-|+.+.-. 
T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~-~~~~~~~---~~----~~~~---~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~ 69 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNF-SILSGHR---RL----DDEK---ADYRVRHKWGGYISSFATGYVIGNPVSIGVM 69 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCc-ccccccc---cc----cccC---CcceeecchHHHHHHhhhhheeccCceEeeC Confidence 11111111 1234444444433211 0000000 00 0000 0011234556667777777766666665221 Q ss_pred cccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc--- Q lcl|NC_019719. 85 DQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK--- 161 (424) Q Consensus 85 ~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~--- 161 (424) + .+.. + ....+.+.+. +. ........+..+.+.+|.+|+.+..+.+|.+ .+..++|..+.+..++.. T Consensus 70 ~-~~~~-~--~~~~l~~~~~-~n----~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~ 139 (440) T protein:vir:95 70 E-GGSA-D--QLSTIKDIEW-QN----DINALNSDLAFDASVYGRAYEYHFRDKDKVD-RVVLISPLEMFVIRDLTVEQN 139 (440) T ss_pred C-CccH-H--HHHHHHHHHH-hc----CHhHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCc Confidence 1 1111 1 1122333332 11 3445556778889999999999999988876 467788999888776432 Q ss_pred eEE---EEEecCc--eEEecHhHeeEeccC--------------------C----CCccccCchHHHHHHHHHHHHHHHH Q lcl|NC_019719. 162 VVY---RYQRDSE--YADFSQKEIFHLKGF--------------------G----FTGLVGLSPIAFACKSAGVAVAMED 212 (424) Q Consensus 162 ~~~---~~~~~~~--~~~~~~~evih~r~~--------------------~----~~~~~G~s~~~~~~~~i~~~~~~~~ 212 (424) ..+ .+..... ...+.++.+++.+.. + .+...|.|.+..+...++....+.. 
T Consensus 140 ~~~~i~~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~lida~~~~~s 219 (440) T protein:vir:95 140 IIAAVHLPIYADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWWNNRFRMGDYESEISLIDAYDAGQS 219 (440) T ss_pred eEEEEEEEEecCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEeeCCCCCCCchhhhHHHHHHHHHHHH Confidence 111 1111111 112333333332100 0 0123467777766666666665555 Q ss_pred HHHHHHhccCCCceeEEcCCC--CCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHH Q lcl|NC_019719. 213 QQRDFFANGAKSPQILSTGEK--VLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSE 290 (424) Q Consensus 213 ~~~~~~~n~~~p~~vl~~~~~--~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~ 290 (424) ...+..+..+.|-.+++.... ..+++....+++...-+.. ........+.+.+++.+........+....+...+. T Consensus 220 ~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~ 297 (440) T protein:vir:95 220 DTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLK--TGISTTGQQTTADASYIYKQYDVNGTEAYKNRLAND 297 (440) T ss_pred HHHHHHHHhhcceeeeecccccCCCCccchhhhhhccceecc--cccccccCCCCcceeEEeecCCHHHHHHHHHHHHHH Confidence 555555555667666653211 1122322222221111111 111122223344444444443444566778888999 Q ss_pred HHHHhCCCHHHhcCCCCCCccchhHHHH----------HHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhh Q lcl|NC_019719. 291 LARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGL 360 (424) Q Consensus 291 Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l 360 (424) |+..-++|..-.+... ++.|+...+.. 
....+...+.-+++.|...+...-- .......+.+.+..- T Consensus 298 i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~--~~~~~~~v~i~f~~~ 374 (440) T protein:vir:95 298 IHRFSRIPNLDDDRFN-STSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAING--PVIEANKLTFTFHPN 374 (440) T ss_pred HHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC--cccccccceEEeCCC Confidence 9999999974443322 22222222211 1122233333333333333322111 111122344555566 Q ss_pred hccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC-CCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 361 LRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP-GGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 361 ~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~-~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ...|..+.++.+.++ .|+++..-+.++++.-..+ +-.+...-.. .......++.+..+++ T Consensus 375 ~p~~~~~~ad~~~kl--~g~iS~et~~~~l~~~d~~~E~~ri~~E~~--~~~~~~~~~~~~~~~~ 435 (440) T protein:vir:95 375 IPQDVWTEIKAYIEA--GGEISQETLMENASFTDYKTEHSRILKQGG--SSDLEIGQIVGDADVG 435 (440) T ss_pred CCCCHHHHHHHHHHH--hccCcHHHHHHhCCCCCcHHHHHHHHHHHH--HhhhhHHhhccCCCCC Confidence 678889999999888 4789887777777653211 0000000000 0000000111111111 No 170 >protein:vir:4073 Length: 279 # NCBI annotation: minor structural protein # Family: family:all:11744 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043552;genbank:gi:9628686;genbank:GeneID:1261159 Probab=98.16 E-value=4.1e-08 Score=61.11 Aligned_cols=252 Identities=13% Similarity=0.111 Sum_probs=131.9 Q ss_pred hhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhh--------ccCCCCCCCHHHHHHHHHHHHHHcCC Q lcl|NC_019719. 58 LQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLL--------RYSPNQYMTAQEFREAMTMQLCFYGN 129 (424) Q Consensus 58 ~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL--------~~~pN~~~s~~~f~~~~~~~~l~~G~ 129 (424) |+. 
-.++..|.+++-..|.|- +|-.++| ..--|-..+-..-++.+....+ .| T Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~- 60 (279) T protein:vir:40 1 MSL----FNLSRRAEDVSFSTFTVQ--------------DPTTDLLLGKLLGLVSYFDNVDYSEASKLEDLFYWAL-QG- 60 (279) T ss_pred Ccc----cccchhhcccceeeeeec--------------CcchhHHHHHHHHHHHHhhcccchhhhhhhhhhhhhh-cc- Confidence 000 011222333332223221 2222222 2223333333332333322222 12 Q ss_pred eEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEE------------EecCceEEecHhHeeEeccCCCCccccCchH Q lcl|NC_019719. 130 AYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRY------------QRDSEYADFSQKEIFHLKGFGFTGLVGLSPI 197 (424) Q Consensus 130 a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~------------~~~~~~~~~~~~evih~r~~~~~~~~G~s~~ 197 (424) ..|-....++..+|.. ....+.+++|-+++..|- ++++|.-+- T Consensus 61 ---------------------~~~~~~~~~~~~~~~~~~~~d~fn~~vr~~~~~~vtVP~~Dv~Iie----NPlv~v~~e 115 (279) T protein:vir:40 61 ---------------------KEVYRVWYGGFKYYAQRVNADQFNIVVREPNRREVTIRTNDYEMLL----NPFYGANPQ 115 (279) T ss_pred ---------------------ceeehhhhhhHHHHHhhcCcchhhhheecCCcceeEeecchhhhhh----cchheeccc Confidence 2111111222211111 112223455555665553 334565543 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcc-cCcceecCCCceeeecccChh Q lcl|NC_019719. 198 AFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPV-KKRLWILEAGFSTSAIGVTPQ 276 (424) Q Consensus 198 ~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~l~~g~~~~~l~~~~~ 276 (424) + ...-+++. .+....++ .+.+..+++++++.+...++.+++.+..++++....+ -+|+.+++.|-+++++..+.. 
T Consensus 116 e-~~kM~~la--~nai~~KL-D~~~qIk~fIKTd~d~glee~kekaR~rIk~mlalAk~~nGityid~~ddItQL~kDYS 191 (279) T protein:vir:40 116 R-FGVMFGMA--SNGIGRRL-DSQAQIKIYWKTKVSSGLKEVWDRIRERLTQQQQLAREFNGVSVIGSDDDIKQIQPDYS 191 (279) T ss_pred h-hhHHHHHH--Hhhhhhhh-cccceeeeEEecCcchhHHHHHHHHHHHHHHHHHHHHhcCCeeeecCCceeEeeccccc Confidence 2 22222222 22233344 6777788999999887788889999999988877655 478999999999999998766 Q ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHh------hccCcccccc Q lcl|NC_019719. 277 DAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR------WLIPAKDVGR 350 (424) Q Consensus 277 d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~------~l~~~~~~~~ 350 (424) .. ..+-.++.++..+..+|||..+|-+. ..|++..+|+..+|.|++++.+-+|.. +.++...+ T Consensus 192 ts-lk~die~lkS~l~Sq~GinekIL~Gs--------AtE~q~iAyy~rtVePILkQyek~liY~~E~fv~y~ttta~-- 260 (279) T protein:vir:40 192 GS-LQNDANLAIEIALSEYGMPRELLYGQ--------SNEVTIIAFAIQKVLPLLKQHDKNIIFNQENFVAYISTTAK-- 260 (279) T ss_pred cc-cHHHHHHHHHHHHhhcCCchhhcccc--------CchhhhhhHHHhhHHHHHHHhcccccchhhhhhhhheeccc-- Confidence 55 35667888999999999999999542 347899999999999999997764432 22222111 Q ss_pred ceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC Q lcl|NC_019719. 
351 IHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD 399 (424) Q Consensus 351 ~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd 399 (424) .|.+ |-.-....-+|+ |.| T Consensus 261 ---------------------------gg~~--~s~~~~~~~~~~-~~~ 279 (279) T protein:vir:40 261 ---------------------------GGAI--ESKSSKRDSEPV-GND 279 (279) T ss_pred ---------------------------Cccc--ccccccccCCCC-CCC Confidence 1111 000011111222 111 No 171 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=98.10 E-value=4.5e-06 Score=49.88 Aligned_cols=349 Identities=9% Similarity=0.034 Sum_probs=165.1 Q ss_pred ccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccc--cHHH---H-hhhHHHHHHHHHHHHhhccCceEE Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSI--NDER---I-LQISTVWRCVSLISTLTACLPLDV 81 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~---~-~~~~~v~~~i~~ia~~ia~~~~~v 81 (424) |++- ..=..++..+..+ ........+ .++. . +-..+...+|+.++..+.=-.|. T Consensus 1 l~~~--~~r~~~~~~yY~g-----------------~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~- 60 (410) T protein:vir:95 1 MNLY--QSRVNLRYKHYAM-----------------QHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLIFRAFA- 60 (410) T ss_pred CCcc--hhhHHHHHHHhcC-----------------CCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhcccccc- Confidence 1110 0001112222221 111111111 1100 0 11244555666666543322221 Q ss_pred EEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc Q lcl|NC_019719. 82 FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK 161 (424) Q Consensus 82 ~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~ 161 (424) . . +..+.+++. . | +.......+..+.+.+|.||+.+..+.+|.+ .+.+++|..+....|... 
T Consensus 61 --~-~---------d~~l~~i~~-~-N---~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~-~i~~~sP~~~~~i~Dp~~ 122 (410) T protein:vir:95 61 --N-D---------DFNVTEIFD-R-N---NPDIFFDSAILSALIGSCSFVYISKGEDDEV-RLQVIESSNATGVIDPIT 122 (410) T ss_pred --C-C---------CchHHHHHh-h-c---ChHHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEEeCCC Confidence 1 1 123555443 2 2 2445556778899999999999999888876 678889998888776432 Q ss_pred e--E--EEEEe---cCc---eEEecHhHee---------------------EeccC-CCCccccCc----hHHHHHHHHH Q lcl|NC_019719. 162 V--V--YRYQR---DSE---YADFSQKEIF---------------------HLKGF-GFTGLVGLS----PIAFACKSAG 205 (424) Q Consensus 162 ~--~--~~~~~---~~~---~~~~~~~evi---------------------h~r~~-~~~~~~G~s----~~~~~~~~i~ 205 (424) . . +.+.. ++. ...+.++.++ +|.+. ..+..+|.| ++..+.+.+. T Consensus 123 ~~~~~al~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~ 202 (410) T protein:vir:95 123 GLLVEGYAVLARDDYNRPTLEAYFEPNATHFIPKDGEPYSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAK 202 (410) T ss_pred CceEEEEEEEEecCCCeEEEEEEEeCCcEEEEeeCCccccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHH Confidence 1 1 11111 111 1122333333 33221 123456766 4555555555 Q ss_pred HHHHHHHHHHHHHhccCCCceeEE-cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC-----CceeeecccChhHHH Q lcl|NC_019719. 206 VAVAMEDQQRDFFANGAKSPQILS-TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-----GFSTSAIGVTPQDAE 279 (424) Q Consensus 206 ~~~~~~~~~~~~~~n~~~p~~vl~-~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~-----g~~~~~l~~~~~d~~ 279 (424) -...-......||.+ |..++. .+.+. ...+ .|+.. .++++.++. +.++.++....-+ . T Consensus 203 r~~~~~~~~~e~~a~---pqr~i~G~d~d~---~~~~----~~~~~-----~~~i~~~~~~~~~~~~~v~q~~~~~l~-~ 266 (410) T protein:vir:95 203 RTLERADITAEFYSW---PQKYILGLDPDA---EPME----KWKAT-----VSSLLTISSSDKGVKPSVGQFTTASMS-P 266 (410) T ss_pred HHHHHHHHHHHHhcc---hhheeeccCCCC---CcCc----hhhhh-----hhhheeccCCCCCCcceEEecCCCChH-H Confidence 555555666677644 544443 11111 1111 12222 234566653 2466666543332 4 Q ss_pred HHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH---HHHHHHHHHHHHHhhc--cCccc--c-ccc Q lcl|NC_019719. 
280 MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT---LQPYISRWENSIQRWL--IPAKD--V-GRI 351 (424) Q Consensus 280 ~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t---l~P~~~~ie~~l~~~l--~~~~~--~-~~~ 351 (424) |++..+.....||..=++|++.+|....+.+|...+..+...+...+ -.-+-..+++.+-.-+ ..... . ... T Consensus 267 ~~~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~ 346 (410) T protein:vir:95 267 FTEQLRTAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFV 346 (410) T ss_pred HHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccc Confidence 88999999999999999999999976543233222222222222221 1112222332222111 11111 0 112 Q ss_pred eeeecch---hhhccCHHHHHHHHHHHHhC--CCCCHHHHHHHhCCCCCCCCCeeeecccccchhhccccCCCc Q lcl|NC_019719. 352 HAEHNLD---GLLRGDSASRAAFMKAMGEA--GLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPR 420 (424) Q Consensus 352 ~~~fd~~---~l~~~d~~~~~~~~~~~~~~--g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~~~~~~ 420 (424) .+++.+. .....+....++.+.+++++ |+.+..-+++++|+.+-+-.. . -+.. ....++ T Consensus 347 ~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~~~~~-~-------~~~e--~~~~g~ 410 (410) T protein:vir:95 347 RTAVKWEPLFEADANTMTMIGDGVVKLNQALPGYINAETIRDLTGIAGDMSAK-P-------VVSE--GGSNGE 410 (410) T ss_pred eeeEEeeecCCcchhhHHHHHHHHHHHHHhccCCccHHHHHHhcCCChHHHHH-H-------HHHH--HHhCCC Confidence 2333332 22233567788898899998 788888899999997532100 0 0000 000011 No 172 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=98.08 E-value=5.1e-06 Score=49.60 Aligned_cols=388 Identities=11% Similarity=0.066 Sum_probs=165.3 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccccccccccccC--cccccHH--H-HhhhHHHHHHHHHHHHhhc Q lcl|NC_019719. 
1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLG--DSSINDE--R-ILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~--~-~~~~~~v~~~i~~ia~~ia 75 (424) ++|+---- =+...|...+..... +.......+.+-.... +..+..+ . -....+...+|+..+..+- T Consensus 8 ~~e~~~~~------~~~~~l~~~~~~~~~---r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l~ 78 (486) T protein:vir:42 8 MEEIEDPA------VVREEMISAFEDASK---DLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQA 78 (486) T ss_pred CCCcccHH------HHHHHHHHHHHHHHH---HHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHhhhc Confidence 33322111 133344333322211 1111111111111111 1111110 1 1123455667777776654 Q ss_pred cCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce-------eeEEee Q lcl|NC_019719. 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV-------ISLLPL 148 (424) Q Consensus 76 ~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~-------~~l~~l 148 (424) -..+.+ ..+ . .....+.+++. . |. .......+..+++.+|.||+.+.++..|.. ..+.++ T Consensus 79 ~~g~~~---~~~-~----~~~~~~~~i~~-~-N~---~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~ 145 (486) T protein:vir:42 79 VEGFRL---GDA-D----EADEELWQWWQ-A-NN---LDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVE 145 (486) T ss_pred ccceec---CCC-c----hhHHHHHHHHH-h-cC---hhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEe Confidence 334432 111 1 11122444443 2 32 345567788999999999999988765432 356778 Q ss_pred cCceEEEEEcCCc--e----EEEEEecCce----EEecHhHe-------------------------eEeccC-CCCccc Q lcl|NC_019719. 149 QSANMDVKLVGKK--V----VYRYQRDSEY----ADFSQKEI-------------------------FHLKGF-GFTGLV 192 (424) Q Consensus 149 ~~~~v~~~~~~~~--~----~~~~~~~~~~----~~~~~~ev-------------------------ih~r~~-~~~~~~ 192 (424) +|..+.+..+... . .+.+...+.. ..+.++.+ +++++. 
...+.+ T Consensus 146 ~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~ 225 (486) T protein:vir:42 146 PPTRMHAEIDPRINRVSKAIRVAYDKEGNEIQAATLYTPMETIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLY 225 (486) T ss_pred cccceEEEEeCCCCCeEEEEEEEEecCCCeEEEEEEEcCCcEEEEEecCCcEEeecceecCCCCceEEEeccccccCCCC Confidence 8888887766321 1 0111111100 11222222 222221 123456 Q ss_pred cCchHH----HHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC--CCCCHHHHHHHHHHHHHHhCCcccCcceecC-CC Q lcl|NC_019719. 193 GLSPIA----FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE--KVLTEQQRSQVEENFKEIAGGPVKKRLWILE-AG 265 (424) Q Consensus 193 G~s~~~----~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~-~g 265 (424) |.|-+. .+.+.+....+-......+ .+.|..++.-.. ....+. ......++. ..++++.++ ++ T Consensus 226 G~s~i~~~v~~liDa~~~~~s~~~~~~e~---~a~p~~~i~G~~~~~~~~~~--~~~~~~~~~-----~~~~~~~~~~~~ 295 (486) T protein:vir:42 226 GTSEITPELRSMTDAAARILMLMQATAEL---MGVPQRLIFGIKPEEIGVDS--ETGQTLFDA-----YLARILAFEDAE 295 (486) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHh---hcchHHHhhcCCcccccccc--ccccchhhh-----hhchhcccCCCC Confidence 777544 3333333333322222333 333444443110 000000 001111211 123456554 45 Q ss_pred ceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHH----------HHHHHHHHHHHHH Q lcl|NC_019719. 266 FSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF----------LQYTLQPYISRWE 335 (424) Q Consensus 266 ~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~----------~~~tl~P~~~~ie 335 (424) .++.++.....+ .+++..+.....++..=++|+..++....+..|+.........+ +...+.-+++.+. 
T Consensus 296 ~~~~q~~~~~~e-~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~ 374 (486) T protein:vir:42 296 GKIQQFSAAELA-NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAY 374 (486) T ss_pred ceEEeecccCHH-HHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 667666544333 36788888889999999999999986543333332222222222 2222222222221 Q ss_pred HHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhC--CCCCHHHHHHHhCCCCCC--CCCeee-----ec-- Q lcl|NC_019719. 336 NSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEA--GLRTINEMRRTDNLPPLP--GGDVAM-----RQ-- 404 (424) Q Consensus 336 ~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~--g~~T~NE~R~~~G~~p~~--~gd~~~-----~~-- 404 (424) ...+..- .+.+ ...+++.+......+....++.+.+++++ |+++..-+++.+|+-+.+ ....+. .+ T Consensus 375 ~~~~~~~-~~~d--~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~ 451 (486) T protein:vir:42 375 RIMKGGD-VPPD--MLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLG 451 (486) T ss_pred HHhcCCC-cccc--ceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHH Confidence 1111000 0111 12344555555678888899999999986 688888888888885432 111000 00 Q ss_pred --------ccccchhhc----cccCCC-cccCC Q lcl|NC_019719. 405 --------SQYVPITDL----GTNKEP-RNNGA 424 (424) Q Consensus 405 --------~n~~~~~~~----~~~~~~-~~~ga 424 (424) .+-.+.... ...++. +..|| T Consensus 452 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (486) T protein:vir:42 452 LLGTMVDADPTVPGSPSPTAPPKPQPAIESSGG 484 (486) T ss_pred HHHHhhcCCCCCCCCCCCCCCCCCCcccCCCCC Confidence 000000000 000000 11111 No 173 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.07 E-value=5.4e-06 Score=49.45 Aligned_cols=403 Identities=10% Similarity=-0.001 Sum_probs=170.9 Q ss_pred CCCCcccccCCC--CC--chHHHHHhhccCcccCcccccccccccccccccCccccc-HHH----HhhhHHHHHHHHHHH Q lcl|NC_019719. 
1 MEEPKYTIDLRT--NN--GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSIN-DER----ILQISTVWRCVSLIS 71 (424) Q Consensus 1 ~~~~~~~~~~~~--~~--G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~----~~~~~~v~~~i~~ia 71 (424) -.--+||-.+.. +. -++++|...+..... +.......+.+........++ ++. ..-..+...||+.++ T Consensus 6 ~~~~~~~~~~~~l~~~e~~~i~~L~~~~~~~~~---r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~iVd~~a 82 (504) T protein:vir:99 6 TSASKFTFRIPELNDDVVDKVNGLYQQLVDRTP---RNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKAVDTLA 82 (504) T ss_pred CcccccccccCCCCHHHHHHHHHHHHHHHHHhH---HHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHHHHHHH Confidence 122334433321 11 344555544432221 111111112211111111111 111 112344455677766 Q ss_pred HhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCcee-eEEeecC Q lcl|NC_019719. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVI-SLLPLQS 150 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~-~l~~l~~ 150 (424) ..+---.|. ..+ +. .....+.+++. . |. .......+..+.+.+|.||+.+..+.+|.+. .+.+++| T Consensus 83 ~rl~~~Gf~---~~d-~~----~~~~~l~~i~~-~-N~---ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP 149 (504) T protein:vir:99 83 RRCNLESFV---WPD-GD----YGSIGGPDVWD-E-NF---FATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSA 149 (504) T ss_pred hhhccceee---CCC-CC----hhhHHHHHHHH-h-cC---hhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEecc Confidence 654322332 111 11 11123444442 2 32 3345678888999999999999999888764 5667899 Q ss_pred ceEEEEEcCCce----EEE---EEecCce---EEecHhH------------------------eeEeccC-CCCccccCc Q lcl|NC_019719. 151 ANMDVKLVGKKV----VYR---YQRDSEY---ADFSQKE------------------------IFHLKGF-GFTGLVGLS 195 (424) Q Consensus 151 ~~v~~~~~~~~~----~~~---~~~~~~~---~~~~~~e------------------------vih~r~~-~~~~~~G~s 195 (424) ..+.+..|+... .+. ....+.. ..+.++. |+++.+. 
..+..+|.| T Consensus 150 ~~~~~iyD~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~gvPvV~~~n~~~~~~~~G~s 229 (504) T protein:vir:99 150 MQATGEWNSRRNAMDSLLSITSRDAEGHPTGIALYEDGVTVTADMDDDGDWHADVRTHKLGVPVEVLPYKPREDRPLGSS 229 (504) T ss_pred ceeEEEEeCCCCceeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCceeeeccccCCCCcceEEecccccCccccCcc Confidence 988877764221 011 1111111 1122333 3333322 123456766 Q ss_pred hH----HHHHHHHHHHHHHHHHHHHHHhccCCCceeEE-cC-CCCCC--HHHHHHHHHHHHHHhCCcccCc-ceecCCCc Q lcl|NC_019719. 196 PI----AFACKSAGVAVAMEDQQRDFFANGAKSPQILS-TG-EKVLT--EQQRSQVEENFKEIAGGPVKKR-LWILEAGF 266 (424) Q Consensus 196 ~~----~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~-~~-~~~~~--~~~~~~~~~~~~~~~~~~~~g~-~~~l~~g~ 266 (424) .+ ..+.+.+.-...-......+|.. |..++. .. ....+ ......++....+...-..... .+.-.... T Consensus 230 ei~~~v~~l~Da~~~~~~~~~~~~e~~a~---p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 306 (504) T protein:vir:99 230 RITRPVMSLQQRALKGCIRMDGHADVYSF---PQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAARARA 306 (504) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHhcc---hhhhhccCCccccccccccccchhhhhhhhhhcCCCccccccccCccc Confidence 43 33444444444333444455444 333332 11 11000 0111222222222222111111 11111234 Q ss_pred eeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCC-CCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhh---- Q lcl|NC_019719. 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-STSWGSGIEQQNLGFLQYTLQPYISRWENSIQRW---- 341 (424) Q Consensus 267 ~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~-~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~---- 341 (424) ++.++....-+ .|++..+..+..|+..=++|++.||.... ++.|...++.+...+... 
+.-..+.|.+.|.+- T Consensus 307 ~~~q~~~~~l~-~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~k-a~~k~~~f~~~l~~~~rla 384 (504) T protein:vir:99 307 DVKQFPASSPQ-PHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAE-AEGATDDWSPAFRRSMIRA 384 (504) T ss_pred eeeecCCCChH-HHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 55555543222 47889999999999999999999986543 333332332222222222 222333333333221 Q ss_pred --ccCccc---cccceeeecchhhhccCHHHHHHHHHHHHhCCCCC--H-HHHHHHhCCCCCC--C---------C---- Q lcl|NC_019719. 342 --LIPAKD---VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRT--I-NEMRRTDNLPPLP--G---------G---- 398 (424) Q Consensus 342 --l~~~~~---~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T--~-NE~R~~~G~~p~~--~---------g---- 398 (424) +....+ .....+++.+......+..+.++.+.++++.|... + .-+.+++|+.+-+ . + T Consensus 385 ~~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~ 464 (504) T protein:vir:99 385 LAIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRALAERRRASSVSII 464 (504) T ss_pred HHHhcCCCccccccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHH Confidence 111111 11133444445666778889999999999988532 2 3345566776431 0 0 Q ss_pred Ceeeecccccch------hhccccCCCcccCC Q lcl|NC_019719. 399 DVAMRQSQYVPI------TDLGTNKEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~------~~~~~~~~~~~~ga 424 (424) +.+....+...- ...++...++.++| T Consensus 465 ~~l~~~~~~~~~~~~~~~~~~~e~a~~~~~~~ 496 (504) T protein:vir:99 465 EALNRRQQEAATAGEDQDQGAGEPPANEPPAA 496 (504) T ss_pred HHHhcccCCCCCCCCCCCcCCCCCCCCCCCcc Confidence 001000000000 00000011111111 No 174 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.04 E-value=6.1e-06 Score=49.16 Aligned_cols=397 Identities=11% Similarity=0.091 Sum_probs=167.6 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccccccccccccC--cccccH---HHHhhhHHHHHHHHHHHHhhc Q lcl|NC_019719. 
1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLG--DSSIND---ERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---~~~~~~~~v~~~i~~ia~~ia 75 (424) |- +...++- .-++++|...+........ .....+.+..... +..+.. ..-....+...+|+.+++.+- T Consensus 1 ~~-~~~~~d~---~~~i~~L~~~~~~~~~r~~---~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~ 73 (488) T protein:vir:23 1 MA-ETESIDP---EKLRDQLLDAFENKQNELK---SSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYVDAIAERQE 73 (488) T ss_pred CC-cccCCCH---HHHHHHHHHHHHHHHHHHH---HHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHHHHHHHhhh Confidence 21 1112221 1355555444433221111 1111111111110 111111 111224455667777776553 Q ss_pred cCceEEEEecc----cCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC--------Ccee Q lcl|NC_019719. 76 CLPLDVFETDQ----NDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA--------GDVI 143 (424) Q Consensus 76 ~~~~~v~~~~~----~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~--------G~~~ 143 (424) --.+.+-.... .+... .....+.+++. + | ........+..+.+.+|.||+.+.++.. |.+ T Consensus 74 ~~Gf~~~~~~~~~~~~~~d~--~~~~~l~~i~~-~-N---~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~- 145 (488) T protein:vir:23 74 LEGFRIPSANGEEPESGGEN--DPASELWDWWQ-A-N---NLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVP- 145 (488) T ss_pred ccceeccCCcccccccccch--hHHHHHHHHHH-h-c---ChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcc- Confidence 33333311111 01101 11123444442 2 2 3566677788999999999998866432 222 Q ss_pred eEEeecCceEEEEEcCCc--e----EEEEEecCc----eEEecHhH-------------------------eeEeccCC- Q lcl|NC_019719. 144 SLLPLQSANMDVKLVGKK--V----VYRYQRDSE----YADFSQKE-------------------------IFHLKGFG- 187 (424) Q Consensus 144 ~l~~l~~~~v~~~~~~~~--~----~~~~~~~~~----~~~~~~~e-------------------------vih~r~~~- 187 (424) .+.+++|..+.+..++.. . .|.+...+. ...+.++. |+++++.. 
T Consensus 146 ~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~ 225 (488) T protein:vir:23 146 LIRVEPPTALYAEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPVIPISNRTR 225 (488) T ss_pred eEEEeccceeEEEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcceEEeccccc Confidence 466778888777665321 1 111111110 01122222 33443321 Q ss_pred CCccccCchHH----HHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHH--HHHHHHHHHHHhCCcccCccee Q lcl|NC_019719. 188 FTGLVGLSPIA----FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ--RSQVEENFKEIAGGPVKKRLWI 261 (424) Q Consensus 188 ~~~~~G~s~~~----~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~--~~~~~~~~~~~~~~~~~g~~~~ 261 (424) ..+.+|.|-+. .+.+.+....+-......++. .|..++.-- .. ++.. ...-...++.. .++++. T Consensus 226 ~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a---~p~~~i~G~-~~-~~~~~~~~~~~~~~~~~-----~~~v~~ 295 (488) T protein:vir:23 226 LSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMA---IPQRLIFGA-KP-EELGINAETGQRMFDAY-----MARILA 295 (488) T ss_pred cCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhh---hHHHHHhCC-Cc-ccccccccccchhhhhh-----hhhhcc Confidence 33456777553 222333322222222233332 343333210 00 1000 00011122222 235666 Q ss_pred cCCC--ceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 262 LEAG--FSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQ 339 (424) Q Consensus 262 l~~g--~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~ 339 (424) +++| .++.++.....+ .+++..+..+..|+..=++|+..+|....+..|+.........+...+ .=....|...|. 
T Consensus 296 ~~~g~~~~~~q~~~~~~~-~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~-~~~~~~f~~~l~ 373 (488) T protein:vir:23 296 FEGGEGAHAEQFSAAELR-NFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKV-ERKNKIFGGAWE 373 (488) T ss_pred CCCCCCceeEecCCCChH-HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHH-HHHHHHHHHHHH Confidence 7666 456666544333 367888888999999999999999765443333222222222222221 111111122221 Q ss_pred hh------ccCccc--cccceeeecchhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCC--CCCeeee---- Q lcl|NC_019719. 340 RW------LIPAKD--VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP--GGDVAMR---- 403 (424) Q Consensus 340 ~~------l~~~~~--~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g--~~T~NE~R~~~G~~p~~--~gd~~~~---- 403 (424) +. +....+ .....+++.+......+....++.+.+++++| +++..-+++++|+-+.+ ..+.... T Consensus 374 ~~~~l~~~~~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~ 453 (488) T protein:vir:23 374 QAMRLAYKMVKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQK 453 (488) T ss_pred HHHHHHHHHhcCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHHH Confidence 11 111111 01123444444555677888999999999866 78888888999875432 1111000 Q ss_pred --cccccch------------hhccccCCCcccCC Q lcl|NC_019719. 
404 --QSQYVPI------------TDLGTNKEPRNNGA 424 (424) Q Consensus 404 --~~n~~~~------------~~~~~~~~~~~~ga 424 (424) ...+..+ ...++..+++.+.| T Consensus 454 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a 488 (488) T protein:vir:23 454 QGLGLIGSLYGASTPEGKPGEAPVGEPPAPEPDAA 488 (488) T ss_pred HHHHHHHHHhccCCCcccCCCCCCCCCCCCCCCCC Confidence 0000000 01112223334444 No 175 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=98.00 E-value=7.7e-06 Score=48.62 Aligned_cols=387 Identities=8% Similarity=-0.013 Sum_probs=173.6 Q ss_pred CCCCcccccCCCCC----chHHHHHhhccCcccCccccccccccccccc-----------ccCcccccHHHHhhhHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNN----GWWARLQSWFVGGRLVTPNQGSQTGPVSAHG-----------HLGDSSINDERILQISTVWR 65 (424) Q Consensus 1 ~~~~~~~~~~~~~~----G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~v~~ 65 (424) -+.-.+++.+.++. -++.++...+... .++.......+.+.. ........+..=+.++.... T Consensus 20 ~~~~~~~~~~~~~~e~~~~~i~~~i~~~~~~---~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ 96 (483) T protein:vir:12 20 TEIFDAIVRTNNKPETLEEMIVRYIKQHLEK---LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHAN 96 (483) T ss_pred hhhhhcccccCCchhhHHHHHHHHHHHHHHH---HHHHHHHHHHhccccccccccccccccccccccccccccccchHHH Confidence 11111222222222 1223332222111 000000000110000 00000000111123466677 Q ss_pred HHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeE Q lcl|NC_019719. 66 CVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISL 145 (424) Q Consensus 66 ~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l 145 (424) +|+..+.-+-.-|+.+-- .+.. ....+..++. 
| ........+..+.+.+|.+|..+-.+.+|.+ .+ T Consensus 97 Ivd~~~~~l~G~p~~~~~--~d~~-----~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~-~i 162 (483) T protein:vir:12 97 LVDQKVSYIVGKPIAFKH--TDDE-----VVKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KL 162 (483) T ss_pred HHHHHhhhhcccCceecc--CChH-----HHHHHHHHHh---c---cHHHHHHHHHHHHhhCCeEEEEEEEcCCCce-EE Confidence 888888877777776521 1110 1112333322 2 2344456677889999999999999988876 57 Q ss_pred EeecCceEEEEEcCCc---e---EEEEEecC--ceEEecHhHeeEecc----------------------CC-------- Q lcl|NC_019719. 146 LPLQSANMDVKLVGKK---V---VYRYQRDS--EYADFSQKEIFHLKG----------------------FG-------- 187 (424) Q Consensus 146 ~~l~~~~v~~~~~~~~---~---~~~~~~~~--~~~~~~~~evih~r~----------------------~~-------- 187 (424) ..++|..+.+..++.. . .+.|.... ....+.+..+.|+.. +. T Consensus 163 ~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 242 (483) T protein:vir:12 163 FRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIP 242 (483) T ss_pred EEEcccceEEEEcCCCCCceEEEEEEEEeecceEEEEEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEE Confidence 7889998888765321 1 11111111 111222233322210 00 Q ss_pred -CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCc Q lcl|NC_019719. 188 -FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGF 266 (424) Q Consensus 188 -~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~ 266 (424) .+...|.|.+..+...++....+.....+.+...+.|..+++.......++... .. ...+++.++.+. 
T Consensus 243 ~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~----~~-------~~~~~~~~~~~~ 311 (483) T protein:vir:12 243 FKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKR----LL-------RYYGAIKVSDNG 311 (483) T ss_pred ecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHH----hh-------hhccccccCCCC Confidence 012357777777777776666555555555555666766665322221111111 11 122355556666 Q ss_pred eeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH----------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQPYISRWEN 336 (424) Q Consensus 267 ~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~~~~~tl~P~~~~ie~ 336 (424) +.+.+.....+..+....+...+.|+..-++|..-..... ++.++...+ +.....+...+.-.++.|.. T Consensus 312 ~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~ 390 (483) T protein:vir:12 312 GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE 390 (483) T ss_pred cceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccc-cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6666655555566677788888899999999854332211 222222211 11122223333333333333 Q ss_pred HHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeeee-----cccccc Q lcl|NC_019719. 337 SIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAMR-----QSQYVP 409 (424) Q Consensus 337 ~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~~~-----~~n~~~ 409 (424) .+.. ..+. ..+.+.++.-+..|..+.++.+.++ .|+++..-+.++++.-+.+ +.++.-. ..+... T Consensus 391 ~~~~----~~~~--~~i~v~f~~~~p~~~~~~a~~~~kl--~GiiS~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~ 462 (483) T protein:vir:12 391 HFDI----KGEH--KDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPN 462 (483) T ss_pred HhcC----CCcc--ceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccc Confidence 2221 1122 2334444566678889999999988 4899998888888763321 1110000 000111 Q ss_pred hhhcccc---CCCcccCC Q lcl|NC_019719. 
410 ITDLGTN---KEPRNNGA 424 (424) Q Consensus 410 ~~~~~~~---~~~~~~ga 424 (424) +...... ++.+.+-+ T Consensus 463 ~~~~~~d~~~~~~~~~~~ 480 (483) T protein:vir:12 463 LDDGGADGAQQQERSNNK 480 (483) T ss_pred ccccccCCcccCCCCCcc Confidence 1111100 00011111 No 176 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=97.92 E-value=1.1e-05 Score=47.82 Aligned_cols=389 Identities=13% Similarity=0.041 Sum_probs=164.2 Q ss_pred CCCCCchHHHHHhhccCcccCccccccccccccccccc--CcccccHH---HHhhhHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_019719. 10 LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHL--GDSSINDE---RILQISTVWRCVSLISTLTACLPLDVFET 84 (424) Q Consensus 10 ~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~---~~~~~~~v~~~i~~ia~~ia~~~~~v~~~ 84 (424) +-|-+=++.+|..++...... .......+.+.... .+..+..+ .-....+...+|+..+..+--.++.+ T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r---~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~--- 74 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPN---LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI--- 74 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHH---HHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhccCceec--- Confidence 333344556555554322111 01111111111111 01111111 01123445567777666553333321 Q ss_pred cccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeC------CCCceeeEEeecCceEEEEEc Q lcl|NC_019719. 85 DQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN------SAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 85 ~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~------~~G~~~~l~~l~~~~v~~~~~ 158 (424) ..+. .....+..++. . 
| ........+..+.+.+|.||..+-++ .+|.+ .+..++|..+.+..| T Consensus 75 ~~d~-----~~~~~l~~i~~-~-N---~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~-~i~~~~p~~~~~~~D 143 (480) T protein:vir:78 75 SEDS-----EGLEELWNWWQ-A-N---DLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIP-LIRVESPLYMYAELD 143 (480) T ss_pred CCCc-----hhHHHHHHHHH-h-c---CHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCee-EEEEEcccceEEEEc Confidence 1111 11234555553 2 2 23566777889999999999988763 34444 467788888888776 Q ss_pred CCc---e----EEEEEecC--c---eEEecHhH-----------------------------eeEeccC-CCCccccCch Q lcl|NC_019719. 159 GKK---V----VYRYQRDS--E---YADFSQKE-----------------------------IFHLKGF-GFTGLVGLSP 196 (424) Q Consensus 159 ~~~---~----~~~~~~~~--~---~~~~~~~e-----------------------------vih~r~~-~~~~~~G~s~ 196 (424) ... . .|.+.... . ...+.++. |+++++. ..++.+|.|- T Consensus 144 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~ 223 (480) T protein:vir:78 144 PRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE 223 (480) T ss_pred CCCccceEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeecccccCCccCccc Confidence 421 1 01000000 0 00111122 2333322 1334577776 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-CCceeeecccC Q lcl|NC_019719. 197 IAF-ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-AGFSTSAIGVT 274 (424) Q Consensus 197 ~~~-~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~-~g~~~~~l~~~ 274 (424) +.- +...++..............-.+.|..++. +... ++...+.-...+.... +.++.++ ++.++.++... 
T Consensus 224 i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~-~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~ 296 (480) T protein:vir:78 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVTT-DELTNDGENTTLDIYY-----GRILTLASEAAKISEFKAA 296 (480) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhh-cCCc-cccccccccchhhhhh-----hhhccCCCCCceEEecCcc Confidence 542 333333322222222222222334554443 1111 1111111111122211 2334443 44667666654 Q ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhh------ccCcccc Q lcl|NC_019719. 275 PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRW------LIPAKDV 348 (424) Q Consensus 275 ~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~------l~~~~~~ 348 (424) ..+. +++..+..+..|+..=++|+..+|....+..|+.........+...+ .=....|...|.+. +...... T Consensus 297 ~~~~-~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka-~~~~~~f~~~l~~~~~l~~~~~g~~~~ 374 (480) T protein:vir:78 297 ELRN-FAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMA-ERKGRIFGGAWERAMRIAMQIMGREVT 374 (480) T ss_pred CHHH-HHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHcCCCcc Confidence 3333 67778888889999999999999865433233222222222222221 11222222222211 1111111 Q ss_pred -ccceeeecchhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCCC--CCe--------eeeccc--------c Q lcl|NC_019719. 349 -GRIHAEHNLDGLLRGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLPG--GDV--------AMRQSQ--------Y 407 (424) Q Consensus 349 -~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g--~~T~NE~R~~~G~~p~~~--gd~--------~~~~~n--------~ 407 (424) ....+++.+......+..+.++.+.+++.+| +++..-+++.+|+.+.+- .++ .+-... - T Consensus 375 ~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~ 454 (480) T protein:vir:78 375 EEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADA 454 (480) T ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccccCCC Confidence 1123444444555667788888888888866 677777778888765321 110 000000 0 Q ss_pred cchhhccccC-CC-cccCC Q lcl|NC_019719. 
408 VPITDLGTNK-EP-RNNGA 424 (424) Q Consensus 408 ~~~~~~~~~~-~~-~~~ga 424 (424) .+-+..++.. +. ...+| T Consensus 455 ~~~~~~~~~~~~~~~~~~~ 473 (480) T protein:vir:78 455 TPKPTVTETKTETQTSPSG 473 (480) T ss_pred CCCCCCCCCCCccccccCC Confidence 0000011110 00 11111 No 177 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=97.92 E-value=1.1e-05 Score=47.81 Aligned_cols=397 Identities=10% Similarity=0.022 Sum_probs=175.9 Q ss_pred CCCCcccc----cCCCC----------------CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhh Q lcl|NC_019719. 1 MEEPKYTI----DLRTN----------------NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQI 60 (424) Q Consensus 1 ~~~~~~~~----~~~~~----------------~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 60 (424) -..-.|.+ +.++. +--++++..+..+.-. .... ........-+..=+.. T Consensus 24 ~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~-i~~~----------~~~~~~~~~~~~ki~~ 92 (511) T protein:vir:96 24 EANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTK-NLVE----------LTRRKEEYMADNRVAH 92 (511) T ss_pred hhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCc-cccc----------cCcCcccccCcceeec Confidence 00000000 00110 1112222222222100 0000 0000000000111123 Q ss_pred HHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019719. 61 STVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) ......++..+.-+-+-|+.+-- .++. ....+..++. . 
-........+..+++.+|.+|.++-++.+| T Consensus 93 n~~k~Iv~~~~~yl~g~p~~~~~--~~~~-----~~~~l~~~~~-~----n~~~~~~~~~~~~~~i~G~a~~~vy~ded~ 160 (511) T protein:vir:96 93 DYASYISDFINGYFLGNPIQYQD--DDKD-----VLEAIEAFND-L----NDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (511) T ss_pred chHHHHHHHHHhhhccCCceeec--CchH-----HHHHHHHHHh-h----cCHHHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 45566777777777777777531 1111 1123333332 2 235566677888999999999999998888 Q ss_pred ceeeEEeecCceEEEEEcCCc---eE--E-EEEe---c-C--c----eEEecHhHeeEeccCC----------------- Q lcl|NC_019719. 141 DVISLLPLQSANMDVKLVGKK---VV--Y-RYQR---D-S--E----YADFSQKEIFHLKGFG----------------- 187 (424) Q Consensus 141 ~~~~l~~l~~~~v~~~~~~~~---~~--~-~~~~---~-~--~----~~~~~~~evih~r~~~----------------- 187 (424) .+ .+..++|..+.+..++.. .. + .|.. . . . ...+.++.+.+++... T Consensus 161 ~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 239 (511) T protein:vir:96 161 ET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHS 239 (511) T ss_pred ce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccc Confidence 65 677788988888765432 11 1 1110 0 0 0 0123444444432100 Q ss_pred ---------CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCC-cccC Q lcl|NC_019719. 188 ---------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGG-PVKK 257 (424) Q Consensus 188 ---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g 257 (424) .+...|.|.+..+...++....+..-..+.+...+.|-.++.-.......+.....+...-..... 
.-.+ T Consensus 240 ~~~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (511) T protein:vir:96 240 FERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADS 319 (511) T ss_pred CCceeeEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcccccccceeccccccccc Confidence 012357787777777777666665555555666666766665333322222111111110000000 0011 Q ss_pred cceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHH----------HHHHHHHH Q lcl|NC_019719. 258 RLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTL 327 (424) Q Consensus 258 ~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~----------~~~~~~tl 327 (424) .....+.+.+++.+........+....+...+.|...-++|..-.+... ++.|+....-.. ...+...+ T Consensus 320 ~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l 398 (511) T protein:vir:96 320 EGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGL 398 (511) T ss_pred ccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1122344556666665545556677888889999999999975443222 222322222111 12223333 Q ss_pred HHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CC------ Q lcl|NC_019719. 328 QPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GD------ 399 (424) Q Consensus 328 ~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~--gd------ 399 (424) .-.++.|...+..+.-.........+++.++.-+..|..+.++.+.++ .|+++...+.+++++-..|. 
.+ T Consensus 399 ~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~G~iS~et~l~~l~~v~D~~~E~~ri~~E~ 476 (511) T protein:vir:96 399 RRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDE 476 (511) T ss_pred HHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHH Confidence 333333333333211111111112334444555677888889988887 58999988888887643211 10 Q ss_pred --eee---ecccccchhhccccCCCccc-CC Q lcl|NC_019719. 400 --VAM---RQSQYVPITDLGTNKEPRNN-GA 424 (424) Q Consensus 400 --~~~---~~~n~~~~~~~~~~~~~~~~-ga 424 (424) ..- ......+-+...+.+++++. -+ T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (511) T protein:vir:96 477 KESIKKAQKGIYKDPRDINDDEQDDDTKDTV 507 (511) T ss_pred HHHHHHHhhccccCCCCCCCCCCCCcccccc Confidence 000 00000010000000011000 01 No 178 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=97.89 E-value=1.2e-05 Score=47.51 Aligned_cols=406 Identities=10% Similarity=0.030 Sum_probs=180.4 Q ss_pred CCCCccccc-CCCCC--------------------------chHHHHHhhccCcccCcccccccccccccccc-----c- Q lcl|NC_019719. 1 MEEPKYTID-LRTNN--------------------------GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGH-----L- 47 (424) Q Consensus 1 ~~~~~~~~~-~~~~~--------------------------G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-----~- 47 (424) ||.--|+-- +|.++ =++.++...+.... .++.......+.+..+ . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~--~~r~~~~~~yY~g~~~~i~~~~~ 78 (501) T protein:vir:96 1 MEQTLFTDSTGQERVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQ--APRIQELLDYARGENHDVLKSGR 78 (501) T ss_pred CceeeeeecccceeccccccchhHHhhhcccccccccCChHHHHHHHHHHHHHHH--HHHHHHHHHHhcCCCCcccCccc Confidence 332222210 11100 01122211111000 0000000000111000 0 Q ss_pred CcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHc Q lcl|NC_019719. 
48 GDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFY 127 (424) Q Consensus 48 ~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~ 127 (424) ......+..-+..+....+|+..+.-+-.-|+.+.-.+... .+.+...|. +....-........+..+++.+ T Consensus 79 ~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~~~~-------~~~~~~~l~-~~~~~n~~~~~~~~~~~~~~~~ 150 (501) T protein:vir:96 79 RKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNDD-------NSQNDDAIK-RIGRINDLDSLNRTLIRDLSQT 150 (501) T ss_pred cCccccccceeecchHHHHHHHHhhhhcccCeeEeeCCccc-------hhHHHHHHH-HHHHhcCHHHHHHHHHHHHhhc Confidence 00000011113356677788888887777787763322111 122223232 1211224566778899999999 Q ss_pred CCeEEEEeeCCCCceeeEEeecCceEEEEEcCCc---eE----EEEEec--Cc---eEEecHhHeeEeccC--------- Q lcl|NC_019719. 128 GNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK---VV----YRYQRD--SE---YADFSQKEIFHLKGF--------- 186 (424) Q Consensus 128 G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~---~~----~~~~~~--~~---~~~~~~~evih~r~~--------- 186 (424) |.+|+.+.++.+|.+ .+..++|..+.+..++.. .. |.+... +. ...+.++.+.++... T Consensus 151 G~a~~~v~~dedg~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~ 229 (501) T protein:vir:96 151 GRAYEVIYRSEYDET-RIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQSAKDVVEIYTDEHIYTLDASDDFNEISVT 229 (501) T ss_pred CeEEEEEEEcCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeecCCCcEEEEEEEcCCcEEEEeeCCCceecccc Confidence 999999999988876 577788998888776431 11 111101 00 011233333322110 Q ss_pred -------C----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcc Q lcl|NC_019719. 187 -------G----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPV 255 (424) Q Consensus 187 -------~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (424) + .+...|.|.+..+...++....+.....+.+...+.|-.++.-......++....++... ...... 
T Consensus 230 ~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~~~--~~~~~~ 307 (501) T protein:vir:96 230 THAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTR--LMQLKP 307 (501) T ss_pred ccCCCccceEEecCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhhhcC--eeeecc Confidence 0 123457888877777777666666555666666666766665332222222222221111 111111 Q ss_pred cCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH----------HHHHHHH Q lcl|NC_019719. 256 KKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQY 325 (424) Q Consensus 256 ~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~~~~~ 325 (424) .+.......+.+++-+.....+..+....+...+.|...-++|..-.+... ++.++...+.. ....+.. T Consensus 308 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~~~~ 386 (501) T protein:vir:96 308 PKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFS-GNTSGEALKYKLFGLDQDRVDTQSQFTK 386 (501) T ss_pred cccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111122334445555554444455667778888999999999965554332 22222222211 1122333 Q ss_pred HHHHHHHHHHHHHHhhccC-ccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CC---- Q lcl|NC_019719. 326 TLQPYISRWENSIQRWLIP-AKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GG---- 398 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~-~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~g---- 398 (424) .+.-.++.+...++..--. ..+. ..+.+.+...+..|..+.++.+.++. |+++..-+.++++.-..| .. 
T Consensus 387 ~l~~~~~li~~~~~~~~~~~~~d~--~~i~i~f~~~~p~n~~e~ad~~~kl~--g~iS~et~~~~l~~v~D~~~E~~ri~ 462 (501) T protein:vir:96 387 GLKRRYRLAARIGSLVNEFKDFDE--SLLKITFTPNLPKSLNEQVSILTGLG--GQVSQETALSLSGLVESPNEELDKIN 462 (501) T ss_pred HHHHHHHHHHHHHHhccccccccc--ccceEEeCCCCCcCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHH Confidence 3333333333333221110 1111 12344445666788899999999885 788887777777653211 11 Q ss_pred ------Ceeeecccccchhhcc--ccCCCcccCC Q lcl|NC_019719. 399 ------DVAMRQSQYVPITDLG--TNKEPRNNGA 424 (424) Q Consensus 399 ------d~~~~~~n~~~~~~~~--~~~~~~~~ga 424 (424) +.-..+..+.+..... ++++.+.+.. T Consensus 463 ~E~~~~~~~~~~~~~~~~~~~~~~~~~e~~~d~~ 496 (501) T protein:vir:96 463 KEMSEIDFKGYSNDFNEHVGKYTDEVKETHTDDF 496 (501) T ss_pred HHHHHhhccccccchhhcccccCCcCCCCCCCcc Confidence 1111112222221111 1111122222 No 179 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=97.87 E-value=1.3e-05 Score=47.30 Aligned_cols=384 Identities=8% Similarity=-0.019 Sum_probs=171.5 Q ss_pred CCCCcccccCCC--CCchHHHHHhhccCcccCcccccccccccccc-----------cccCcccccHHHHhhhHHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRT--NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAH-----------GHLGDSSINDERILQISTVWRCV 67 (424) Q Consensus 1 ~~~~~~~~~~~~--~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~v~~~i 67 (424) -|++-..|++.. ..-++.++...+.... +........+.+. ........-+..=+.++....+| T Consensus 13 ~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~---~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv 89 (474) T protein:vir:96 13 HERVVEQIKPKYETQEEMIIRLINDHKPKI---DDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWRMFTNYHQNLV 89 (474) T ss_pred hhhHHHHhhhccCChHHHHHHHHHHHHHHH---HHHHHHHHHhccCCcchhccchhcccccccccccchhcccchHHHHH Confidence 222222222222 2244444443322110 0000000000000 00000000011112345666788 Q ss_pred HHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEe Q lcl|NC_019719. 
68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..+.-+-.-|+.+--. +. .....+..++. | ........+..++..+|.+|..+..+.+|.+ .+.. T Consensus 90 d~~~~~l~g~p~~~~~~--d~-----~~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~-~i~~ 155 (474) T protein:vir:96 90 DQKVAYAVANPVTFSSD--DD-----KSLKTIQEVLN---H---KWDDKLVDILTAASNKGIEWLQPYIDENGEF-KTFR 155 (474) T ss_pred HhhhhhhcccCceeecC--ch-----HHHHHHHHHHh---c---CHHHHHHHHHHHHHhcCeeEEEEEecCCCce-EEEE Confidence 88887777777765211 11 11233444442 2 2344556667888899999999988888876 4888 Q ss_pred ecCceEEEEEcCCc---e---EEEEEecCc--eEEecHhHeeEecc-------------------------------CC- Q lcl|NC_019719. 148 LQSANMDVKLVGKK---V---VYRYQRDSE--YADFSQKEIFHLKG-------------------------------FG- 187 (424) Q Consensus 148 l~~~~v~~~~~~~~---~---~~~~~~~~~--~~~~~~~evih~r~-------------------------------~~- 187 (424) ++|..+.+..++.. . ...|...+. ...+..+.+.|++. .+ T Consensus 156 ~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 235 (474) T protein:vir:96 156 VPAEQAIPIWTNKERDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPF 235 (474) T ss_pred EcccceEEEEcCCCCCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeeccccccccccccccccccccCCCceeE Confidence 89999888766421 1 111111111 11122222222210 00 Q ss_pred ---CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC- Q lcl|NC_019719. 188 ---FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE- 263 (424) Q Consensus 188 ---~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~- 263 (424) .+...|.|.+......++....+.....+.+...+.|-.+++-.......+ ..... 
..++++.++ T Consensus 236 v~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~-------~~~~~----~~~~~i~~~~ 304 (474) T protein:vir:96 236 IPFKNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDE-------FMRNL----KYYKAINVDG 304 (474) T ss_pred EEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccc-------hhhhh----hcCceEEecC Confidence 022357787777777777766666666666666677766655322211111 11111 123566665 Q ss_pred CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHH----------HHHHHHHHHHHH Q lcl|NC_019719. 264 AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG----------FLQYTLQPYISR 333 (424) Q Consensus 264 ~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~----------~~~~tl~P~~~~ 333 (424) .|.+++.+........+.+..+...+.|+..-++|..-.... +++.++...+..... .+...+.-+++. T Consensus 305 ~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~ 383 (474) T protein:vir:96 305 DGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKF-GNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQY 383 (474) T ss_pred CCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCcccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 456666666554445567778888999999999996433221 222232222211111 122222222222 Q ss_pred HHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCee-----eeccc Q lcl|NC_019719. 334 WENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVA-----MRQSQ 406 (424) Q Consensus 334 ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~--gd~~-----~~~~n 406 (424) |..-+. ...+.....+.| +.-+..|..+.+ +.+..+|+++...++++++.-+.+. .+++ -.... T Consensus 384 i~~~~~----~~~~~~~i~i~f--~~~~p~~~~e~~---~~~~~ag~iS~et~~~~~~~v~d~~~E~~ri~~E~~e~~~~ 454 (474) T protein:vir:96 384 IIDFYK----LNIKVQDVEITF--NFNVMVNELEQS---QIGVQSQYLSKETVVTNHPWVDDPVAELERIEQDNIDFNKQ 454 (474) T ss_pred HHHHhC----CCcccceeeEEe--ccCCCcCHHHHH---HHHHhcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhc Confidence 211111 111112223344 333445555444 4566789999999998876532211 0000 00000 Q ss_pred ccch----hhccccCCCccc Q lcl|NC_019719. 
407 YVPI----TDLGTNKEPRNN 422 (424) Q Consensus 407 ~~~~----~~~~~~~~~~~~ 422 (424) ..+. ....++++.++| T Consensus 455 ~~~~~~~~~~~~~d~~~e~~ 474 (474) T protein:vir:96 455 LPPLEGDANGRAQDNESETN 474 (474) T ss_pred ccccccccccccCCCcccCC Confidence 0010 011111122222 No 180 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=97.86 E-value=1.4e-05 Score=47.18 Aligned_cols=382 Identities=10% Similarity=-0.016 Sum_probs=168.7 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCccccccccccccccc---------ccCcc--cccHHHHhhhHHHHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHG---------HLGDS--SINDERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~--~~~~~~~~~~~~v~~~i~~ 69 (424) ..++.+.+ +.-++.++...+.... +........+.+.. ...+. ..-+..-+.++....+|+. T Consensus 20 ~~~~~~~~----~~~~i~~~i~~~~~~~---~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~ 92 (474) T protein:vir:95 20 QLKPQFET----QEEMIIRLIDDHRKQL---DKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQ 92 (474) T ss_pred hhhhccCC----hHHHHHHHHHHHHHHH---HHHHHHHHHhcccCchhccccccccccccccccccceeccchHHHHHHH Confidence 11122211 1123333332221110 00000000000000 00000 0001111224556667888 Q ss_pred HHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeec Q lcl|NC_019719. 70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQ 149 (424) Q Consensus 70 ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~ 149 (424) .++-+-.-|+.+-- .+. .....+..++. 
| ........+..+.+.+|.+|+.+.++.+|++ .+..++ T Consensus 93 ~~~~l~g~p~~~~~--~d~-----~~~~~l~~~~~---n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~ 158 (474) T protein:vir:95 93 KVSYVASKPVTYSC--EDE-----SVLKIIHDVLD---T---RWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVP 158 (474) T ss_pred HHhhhccCCceecc--Cch-----HHHHHHHHHHh---c---cHHHHHHHHHHHHhhcCcEEEEEEecCCCce-EEEEEc Confidence 88777777776521 111 11122333332 2 2445566778889999999999988888875 577788 Q ss_pred CceEEEEEcCCc---e---EEEEEecC--ceEEecHhHeeEecc----------------------CC---------CCc Q lcl|NC_019719. 150 SANMDVKLVGKK---V---VYRYQRDS--EYADFSQKEIFHLKG----------------------FG---------FTG 190 (424) Q Consensus 150 ~~~v~~~~~~~~---~---~~~~~~~~--~~~~~~~~evih~r~----------------------~~---------~~~ 190 (424) |..+.+..++.. . .+.|...+ ....+.++++.+.+. ++ .+. T Consensus 159 p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn 238 (474) T protein:vir:95 159 AEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKNN 238 (474) T ss_pred ccceEEEEcCCCCCceEEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeecCC Confidence 888887765431 1 11111111 111222333322210 00 123 Q ss_pred cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeee Q lcl|NC_019719. 191 LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSA 270 (424) Q Consensus 191 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~ 270 (424) ..|.|.+..+...++....+.....+.++..+.|-.++.-......++... . ...++++.++++.+.+. 
T Consensus 239 ~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~-------~----~~~~~~i~~~~~~~~~~ 307 (474) T protein:vir:95 239 PEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMR-------G----LKYYKAINVDGDGGVET 307 (474) T ss_pred CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhh-------h----hhccceeeccCCCceeE Confidence 457787777777777666555555555566666766654322211111111 1 12345676777777766 Q ss_pred cccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH----------HHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019719. 271 IGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISRWENSIQR 340 (424) Q Consensus 271 l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ie~~l~~ 340 (424) +........+....+...+.|+..-++|..-.+. ..++.++...+.. ....+...+.-+++.|...+.. T Consensus 308 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~ 386 (474) T protein:vir:95 308 IQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDK-FGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNL 386 (474) T ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 6655555666778888899999999999522211 1122222122111 1122333333333333322211 Q ss_pred hccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeee-----ecccccchhh- Q lcl|NC_019719. 341 WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAM-----RQSQYVPITD- 412 (424) Q Consensus 341 ~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~~-----~~~n~~~~~~- 412 (424) ..+.....+.|+ .-...|..+.+ +.+.+.|+++...+.+++++-+.+ ..+++- .......... T Consensus 387 ----~~d~~~i~v~f~--~~~p~d~~e~a---~~~~~~g~iS~et~i~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~ 457 (474) T protein:vir:95 387 ----KMDVKDIEISFN--FNRMMNDAEQS---QIIAQSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDG 457 (474) T ss_pred ----CcccceeeEEec--cCCCcCHHHHH---HHHHhcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccccc Confidence 122222334443 33344544444 456667999998888887663321 100000 0000000000 Q ss_pred -cc--ccCCCcccCC Q lcl|NC_019719. 
413 -LG--TNKEPRNNGA 424 (424) Q Consensus 413 -~~--~~~~~~~~ga 424 (424) .. ++++..++-. T Consensus 458 ~~d~~~~~~~~~~~~ 472 (474) T protein:vir:95 458 GADGAQQQERSNDKE 472 (474) T ss_pred cCCCCcCCCCCccCC Confidence 00 1111111111 No 181 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=97.85 E-value=1.5e-05 Score=47.09 Aligned_cols=408 Identities=11% Similarity=0.037 Sum_probs=181.7 Q ss_pred CCCCc----ccccCCCCCc--hHHHHHhhccCcccC-cccccccccccccccc--c-Ccccc---cHHHHhhhHHHHHHH Q lcl|NC_019719. 1 MEEPK----YTIDLRTNNG--WWARLQSWFVGGRLV-TPNQGSQTGPVSAHGH--L-GDSSI---NDERILQISTVWRCV 67 (424) Q Consensus 1 ~~~~~----~~~~~~~~~G--~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~--~-~~~~~---~~~~~~~~~~v~~~i 67 (424) -..++ |.++.-.+.- .++.++.+....... .++-......+.+... . ..... .+..-...+.....| T Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Iv 99 (502) T protein:vir:48 20 RFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMIS 99 (502) T ss_pred ccChhHHhhhcccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccceeecchHHHHH Confidence 11111 2222222111 111122211111000 0000000001111000 0 00000 001112245667788 Q ss_pred HHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEe Q lcl|NC_019719. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..+.-+-.-|+.+.-.+... .+++...|. +....-........+..+++.+|.||+.+.++.+|.+ .+.. 
T Consensus 100 d~~~~yl~g~p~~~~~~d~~~-------~~~~~~~l~-~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~-~i~~ 170 (502) T protein:vir:48 100 KFKTGYLAGNPIRVEYDDNED-------NSQNDDAIK-RIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDET-RIKR 170 (502) T ss_pred HHHhhhhcccCeeEecCCccc-------hhHHHHHHH-HHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCce-EEEE Confidence 888888888888764322211 122333332 1112224667788899999999999999999988875 4677 Q ss_pred ecCceEEEEEcCC---ceE--EE-EE--ec-Cc---eEEecHhHeeEeccC-----------C---------CCccccCc Q lcl|NC_019719. 148 LQSANMDVKLVGK---KVV--YR-YQ--RD-SE---YADFSQKEIFHLKGF-----------G---------FTGLVGLS 195 (424) Q Consensus 148 l~~~~v~~~~~~~---~~~--~~-~~--~~-~~---~~~~~~~evih~r~~-----------~---------~~~~~G~s 195 (424) ++|..+.+..++. ... +. |. .. +. ...+.++.++++... . .+...|.| T Consensus 171 ~~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~s 250 (502) T protein:vir:48 171 LSPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNQHIYTLDASDSFNEISVTPHAFGTVPITEFLNNADGIG 250 (502) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEEeecCCcEEEEEEEeCCeEEEEEeCCceeeccceecCCCccceEEecCCCCCCC Confidence 8888888776532 111 11 11 11 10 112333333333210 0 12336888 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccCh Q lcl|NC_019719. 196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTP 275 (424) Q Consensus 196 ~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~ 275 (424) .+..+...++....+.....+.+...+.|-.++.-......++....+++...-... ..+..-..+.+.+++.+.... 
T Consensus 251 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~d~~~l~~~~ 328 (502) T protein:vir:48 251 DYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDMKRTRLMQLK--PPKSADGKEGTVKAEYLTKSY 328 (502) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhhhhcceeecc--ccccccccccCcceeEeeecC Confidence 888777777777766666666666666676666543332222222222111100000 000011123344555555444 Q ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcc-C Q lcl|NC_019719. 276 QDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISRWENSIQRWLI-P 344 (424) Q Consensus 276 ~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ie~~l~~~l~-~ 344 (424) .+..+....+...+.|+..-++|+...+... ++.|+...+-. ....+...+.-.++.+...+....- . T Consensus 329 ~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~-~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~ 407 (502) T protein:vir:48 329 DVSGAEAYKTRLNKDIHVFTNTPDMSDNHFS-GNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFK 407 (502) T ss_pred CHHHHHHHHHHHHHHHHHHhCCCCcCccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 4444556788889999999999975443322 22222222211 1123333333333333333332111 1 Q ss_pred ccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeee-----ecccccc-----h-h Q lcl|NC_019719. 345 AKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAM-----RQSQYVP-----I-T 411 (424) Q Consensus 345 ~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~~-----~~~n~~~-----~-~ 411 (424) ..+. ..+.+.+...+..|..+.++.+.++ .|+++..-+.+++++-..| +.+.+. ...+..+ . + T Consensus 408 ~~d~--~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 483 (502) T protein:vir:48 408 DFDE--SRLKITFTPNLPKSLYEQVSILNDL--GGQVSQETALSLSGLVENPTEELDKINEESSKIDFKGYPSYFYDNVG 483 (502) T ss_pred cccc--ccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhhhccccccccccc Confidence 1122 2234444666678889999999888 4789988888888763221 111000 0000000 0 0 Q ss_pred hcccc--CCCcccCC Q lcl|NC_019719. 
412 DLGTN--KEPRNNGA 424 (424) Q Consensus 412 ~~~~~--~~~~~~ga 424 (424) ...++ +++.++.. T Consensus 484 ~~~d~~~e~~~~~~~ 498 (502) T protein:vir:48 484 KYTDEVKETHTDDFE 498 (502) T ss_pred ccCCCccCCCCcCcC Confidence 00000 01111111 No 182 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=97.85 E-value=1.5e-05 Score=47.01 Aligned_cols=397 Identities=10% Similarity=0.019 Sum_probs=175.5 Q ss_pred CCCCcccc--cCCCCC----------------chHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHH Q lcl|NC_019719. 1 MEEPKYTI--DLRTNN----------------GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQIST 62 (424) Q Consensus 1 ~~~~~~~~--~~~~~~----------------G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 62 (424) -....|.. +.++.+ --++++..+..+.-... ... ....... -+..=+.++. T Consensus 26 n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il-~~~---------~~~~~~~-~~~~ki~~n~ 94 (511) T protein:vir:93 26 NVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNL-VEL---------TRRKEEY-MADNRVAHDY 94 (511) T ss_pred CCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccc-ccc---------CcCcccc-cCcceeecch Confidence 00001110 111111 11222222222211000 000 0000000 0001122355 Q ss_pred HHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce Q lcl|NC_019719. 63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV 142 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~ 142 (424) ....|+..+.-+-+-|+.+-- .+.. ....+..++. . 
.........+..+++.+|.||.++.++.+|.+ T Consensus 95 ~k~Iv~~~~~yl~g~p~~~~~--~d~~-----~~~~l~~~~~-~----n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~ 162 (511) T protein:vir:93 95 ASYISDFINGYFLGNPIQYQD--DDKD-----VLEVIEAFND-L----NDVESHNRSLGLDLSIYGKAYELMIRNQDDET 162 (511) T ss_pred HHHHHHHHhhhhcccCeeecc--CChH-----HHHHHHHHHh-h----cCHhHHHHHHHHHHHhcCeeEEEEEeCCCCce Confidence 666777777777777776521 1111 1122333332 1 24556677888899999999999999988876 Q ss_pred eeEEeecCceEEEEEcCCc---eEE--EEE-e----cC--c----eEEecHhHeeEeccCC------------------- Q lcl|NC_019719. 143 ISLLPLQSANMDVKLVGKK---VVY--RYQ-R----DS--E----YADFSQKEIFHLKGFG------------------- 187 (424) Q Consensus 143 ~~l~~l~~~~v~~~~~~~~---~~~--~~~-~----~~--~----~~~~~~~evih~r~~~------------------- 187 (424) .+..++|..+.+..++.. ..+ .|. . +. . ...+.++.+.+++... T Consensus 163 -~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g 241 (511) T protein:vir:93 163 -RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFE 241 (511) T ss_pred -EEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCCC Confidence 577889998887766421 111 111 0 00 0 1123445554432110 Q ss_pred -------CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCC-cccCcc Q lcl|NC_019719. 188 -------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGG-PVKKRL 259 (424) Q Consensus 188 -------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~ 259 (424) .+...|.|.+..+...++....+..-..+.+...+.|-.++.-......++..+..+...-..... .-.+.. 
T Consensus 242 ~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (511) T protein:vir:93 242 RMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEG 321 (511) T ss_pred ccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchhhcccccccceeccccccccccc Confidence 012357777777777777666655555555666666666655322222222111111110000000 000111 Q ss_pred eecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH----------HHHHHHHHHHH Q lcl|NC_019719. 260 WILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQP 329 (424) Q Consensus 260 ~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P 329 (424) .-.+.+.+++.+.....+..+....+...+.|...-++|..-.+... ++.|+....-. ....+..++.- T Consensus 322 ~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~ 400 (511) T protein:vir:93 322 RETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRR 400 (511) T ss_pred ccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22345556666655444555667788889999999999965443222 22232222211 11223333333 Q ss_pred HHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeee----- Q lcl|NC_019719. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAM----- 402 (424) Q Consensus 330 ~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~--gd~~~----- 402 (424) .++.|...+..+.-.........+++.++.-+..|..+.++.+.++ .|+++..-+.++++.-+.|. 
.+.+- T Consensus 401 ~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~ 478 (511) T protein:vir:93 401 RAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEKE 478 (511) T ss_pred HHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHH Confidence 3333333332211111111111234444566678888899988888 48899888888876643221 11000 Q ss_pred -ecccccch---hhcc---ccCCCcccCC Q lcl|NC_019719. 403 -RQSQYVPI---TDLG---TNKEPRNNGA 424 (424) Q Consensus 403 -~~~n~~~~---~~~~---~~~~~~~~ga 424 (424) ........ .... ++.+..++.+ T Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (511) T protein:vir:93 479 SIKKAQKGIYKDPRDINDDEQDDDTKDTV 507 (511) T ss_pred HHHHHhhhcccCCCCCCCCCCCCcccccc Confidence 00000000 0000 0111111111 No 183 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=97.83 E-value=1.6e-05 Score=46.85 Aligned_cols=394 Identities=10% Similarity=0.040 Sum_probs=175.8 Q ss_pred CCCCcccccC--------CC----------------CCchHHHHHhhccCcccCcccccccccccccccccCcccccHHH Q lcl|NC_019719. 1 MEEPKYTIDL--------RT----------------NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDER 56 (424) Q Consensus 1 ~~~~~~~~~~--------~~----------------~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 56 (424) -=.+++.+.+ +. +..-++++..+..+.-.. .... .......-+.. T Consensus 20 ~~~~~~n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i-~~~~----------~~~~~~~~~~~ 88 (511) T protein:vir:10 20 LFNDEANVVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKN-LVEL----------TRRKEEYMADN 88 (511) T ss_pred hhhhhhcCCccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcc-cccc----------CcccccccCcc Confidence 0001111100 00 011122222222221100 0000 00000000001 Q ss_pred HhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_019719. 
57 ILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) Q Consensus 57 ~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r 136 (424) =+..+.....|+..+.-+-+-|+.+--. ++. ....+..++. . | ........+..+++.+|.||.++.+ T Consensus 89 ki~~n~~k~Iv~~~~~yl~g~p~~~~~~--d~~-----~~~~l~~~~~-~-n---~~~~~~~~~~~~~~i~G~ay~~vy~ 156 (511) T protein:vir:10 89 RVAHDYASYISDFINGYFLGNPIQYQDD--DKD-----VLEAIEAFND-L-N---DVESHNRSLGLDLSIYGKAYEIMIR 156 (511) T ss_pred eeecchHHHHHHHHhhhhcccCceeecC--chH-----HHHHHHHHHh-h-c---CHHHHHHHHHHHHHhcCeeEEEEEe Confidence 1123555667777777777777775211 111 1123333332 2 2 3455667788899999999999999 Q ss_pred CCCCceeeEEeecCceEEEEEcCCc---eE--EEE-Ee----cC--c----eEEecHhHeeEeccCC------------- Q lcl|NC_019719. 137 NSAGDVISLLPLQSANMDVKLVGKK---VV--YRY-QR----DS--E----YADFSQKEIFHLKGFG------------- 187 (424) Q Consensus 137 ~~~G~~~~l~~l~~~~v~~~~~~~~---~~--~~~-~~----~~--~----~~~~~~~evih~r~~~------------- 187 (424) +.+|.+ .+..++|..+.+..++.. .. +++ .. +. . ...+.++.+.++.... T Consensus 157 dedg~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~ 235 (511) T protein:vir:10 157 NQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGF 235 (511) T ss_pred CCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCccccccccccc Confidence 988875 567788888887766432 11 111 10 00 0 0123444444432100 Q ss_pred -------------CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCc Q lcl|NC_019719. 188 -------------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGP 254 (424) Q Consensus 188 -------------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (424) .+...|.|.+..+...++....+..-..+.+...+.|-.++.-......++..+..+...-...... 
T Consensus 236 ~~~~~~~vPvv~f~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~ 315 (511) T protein:vir:10 236 ESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTV 315 (511) T ss_pred ccccCcceeEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhccchhccceeccccc Confidence 0123577777777777776665555555556666667666653333222221111111100000000 Q ss_pred -ccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH----------HHHHHHH Q lcl|NC_019719. 255 -VKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFL 323 (424) Q Consensus 255 -~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~~~ 323 (424) -.+.....+.|.+++.+.....+..+....+...+.|+..-++|..-.+... ++.|+.... ......+ T Consensus 316 ~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~f 394 (511) T protein:vir:10 316 YADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLF 394 (511) T ss_pred ccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHH Confidence 0111122344566666665555556677888888999999999864333222 222322221 1112223 Q ss_pred HHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCee Q lcl|NC_019719. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~ 401 (424) ...+.-.++.|...+....-.........+++.+..-+..|..+.++.+.++. 
|+++..-+.+++++-+.| ..+.+ T Consensus 395 ~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~--G~iS~et~~~~l~~v~d~~~E~~ri 472 (511) T protein:vir:10 395 TKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKI 472 (511) T ss_pred HHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHHh--ccCcHHHHHHhCCCCCCHHHHHHHH Confidence 33333333333333322111111111123455556667788999999999985 789988888887653321 11100 Q ss_pred e------ecccccchhhccccCCCcccCC Q lcl|NC_019719. 402 M------RQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~------~~~n~~~~~~~~~~~~~~~~ga 424 (424) - ....... .....++.+++. T Consensus 473 ~~E~~~~~~~~~~~---~~~~~~~~~~~~ 498 (511) T protein:vir:10 473 EEDEKESIKKAQKG---IYKDPRDINDDE 498 (511) T ss_pred HHHHHHHHHHHhhh---cccCCCCCCCCC Confidence 0 0000000 000111111111 No 184 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=97.82 E-value=1.6e-05 Score=46.80 Aligned_cols=387 Identities=7% Similarity=-0.027 Sum_probs=173.5 Q ss_pred CCCCcccccCCC--CCchHHHHHhhccCcccCcccccccccccccccc---------cCc--ccccHHHHhhhHHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRT--NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGH---------LGD--SSINDERILQISTVWRCV 67 (424) Q Consensus 1 ~~~~~~~~~~~~--~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~--~~~~~~~~~~~~~v~~~i 67 (424) |-.--+.+++.. ..-++.++...+.... +........+.+... ..+ ...-+..=+..+....+| T Consensus 11 ~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~---~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv 87 (472) T protein:vir:93 11 IFDAIVRTNNKPETLEEMIVRYIKQHLEKL---PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLV 87 (472) T ss_pred hhhceeeecCchhhHHHHHHHHHHHHHHHH---HHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHH Confidence 111112222211 1123333322221110 000000001101000 000 000001112246677788 Q ss_pred HHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEe Q lcl|NC_019719. 
68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..+.-+-.-|+.+-- .+. + ....+..++. | ........+..+.+.+|.||+.+..+.+|.+ .+.. T Consensus 88 d~~~~~l~g~~~~~~~--~d~---~--~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~-~i~~ 153 (472) T protein:vir:93 88 DQKVSYIVGKPIAFKH--TDD---E--VVKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFR 153 (472) T ss_pred HHHhhhhcccCeeecc--CCh---H--HHHHHHHHHh---c---cHHHHHHHHHHHHhhcCeEEEEEEECCCCce-EEEE Confidence 8888887777766521 111 0 1122333332 2 2345556678889999999999999888876 5777 Q ss_pred ecCceEEEEEcCCc---eE---EEEEec--CceEEecHhHeeEecc----------------------C-----C----C Q lcl|NC_019719. 148 LQSANMDVKLVGKK---VV---YRYQRD--SEYADFSQKEIFHLKG----------------------F-----G----F 188 (424) Q Consensus 148 l~~~~v~~~~~~~~---~~---~~~~~~--~~~~~~~~~evih~r~----------------------~-----~----~ 188 (424) ++|..+.+..++.. .. +.|... .....+.+..+.+++. + + . T Consensus 154 ~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~ 233 (472) T protein:vir:93 154 VPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK 233 (472) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEeecceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEec Confidence 88988888765321 11 011110 0111112222222110 0 0 0 Q ss_pred CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCcee Q lcl|NC_019719. 189 TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFST 268 (424) Q Consensus 189 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~ 268 (424) +...|.|.+..+...++....+.....+.+...+.|..++.........+.. ..+ + ..+++.++.+.+. 
T Consensus 234 nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~----~~~----~---~~~~~~~~~~~~~ 302 (472) T protein:vir:93 234 NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFK----RLL----R---YYGAIKVSDNGGV 302 (472) T ss_pred CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchhhH----HHH----h---hccccccCCCCcc Confidence 2345788787777777666655555555566667777666532222111111 111 1 2245556666666 Q ss_pred eecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHH----------HHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 269 SAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL----------GFLQYTLQPYISRWENSI 338 (424) Q Consensus 269 ~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~----------~~~~~tl~P~~~~ie~~l 338 (424) +.+.....+..+....+...+.|+..-++|..-.+... ++.++...+.... ..+...+.-+++.+...+ T Consensus 303 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~ 381 (472) T protein:vir:93 303 DTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF 381 (472) T ss_pred eeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 66655555666778888889999999999864433222 2222222211111 112222222222222222 Q ss_pred HhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeee-----ecccccchh Q lcl|NC_019719. 339 QRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAM-----RQSQYVPIT 411 (424) Q Consensus 339 ~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~~-----~~~n~~~~~ 411 (424) +. ..+.. .+.+.++.-+..|..+.++.+.++. |+++..-+.+++++-..+ ..+..- .......+. T Consensus 382 ~~----~~~~~--~i~v~f~~~~p~~~~~~~~~~~k~~--giis~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~ 453 (472) T protein:vir:93 382 DI----KGEHK--DVDISFNYNKVANTELQVQTAQQSM--GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLD 453 (472) T ss_pred CC----Ccccc--eeeEEeCCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccCcC Confidence 11 11222 3344445556678888888888874 789888887777663221 111000 001111111 Q ss_pred hcccc--CCCcccCC Q lcl|NC_019719. 
412 DLGTN--KEPRNNGA 424 (424) Q Consensus 412 ~~~~~--~~~~~~ga 424 (424) ..... ++.+..+- T Consensus 454 ~~~~d~~~~~~~~~~ 468 (472) T protein:vir:93 454 DGGADGAQQQERSNN 468 (472) T ss_pred cccCCCCCCCCCCCc Confidence 11110 01111111 No 185 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=97.75 E-value=2.2e-05 Score=46.09 Aligned_cols=395 Identities=11% Similarity=0.063 Sum_probs=175.0 Q ss_pred CCCCccccc---------------CCC-----CCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhh Q lcl|NC_019719. 1 MEEPKYTID---------------LRT-----NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQI 60 (424) Q Consensus 1 ~~~~~~~~~---------------~~~-----~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 60 (424) ...-.|.|. +.+ +..-++++..+..+.-.. .... .......-+..=+.. T Consensus 24 ~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i-~~~~----------~~~~~~~~~~~ki~~ 92 (512) T protein:vir:97 24 EANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN-LVEL----------TRRKEEYMADNRVAH 92 (512) T ss_pred ccccccccCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcc-cccc----------CcccccccCcceeec Confidence 111122221 000 011122222222221100 0000 000000000011123 Q ss_pred HHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019719. 61 STVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) +.....|+..+.-+-+-|+.+--. ++. ....+..++. . 
| ........+..+++.+|.+|.++.++.+| T Consensus 93 n~~k~Ivd~~~~yl~g~p~~~~~~--d~~-----~~~~l~~~~~-~-n---~~~~~~~~~~~~~~i~G~ay~~vy~ded~ 160 (512) T protein:vir:97 93 DYASYISDFINGYFLGNPIQCQDD--DKD-----VLEAIEAFND-L-N---DVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (512) T ss_pred chHHHHHHHHhhhhcccCceeccC--ChH-----HHHHHHHHHh-h-c---CHHHHHHHHHHHHHhcCeEEEEEEeCCCC Confidence 555667787777777777765311 111 1123333332 2 2 35566677888999999999999999888 Q ss_pred ceeeEEeecCceEEEEEcCCc---eE--EEEE-----ecC------ceEEecHhHeeEeccCC----------------- Q lcl|NC_019719. 141 DVISLLPLQSANMDVKLVGKK---VV--YRYQ-----RDS------EYADFSQKEIFHLKGFG----------------- 187 (424) Q Consensus 141 ~~~~l~~l~~~~v~~~~~~~~---~~--~~~~-----~~~------~~~~~~~~evih~r~~~----------------- 187 (424) .+ .+..++|..+.+..++.. .. ++|. .+. ....+.++.|.+++... T Consensus 161 ~~-~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 239 (512) T protein:vir:97 161 ET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHS 239 (512) T ss_pred ce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccc Confidence 75 577889998888776432 11 1111 000 01133455555542110 Q ss_pred ---------CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCC--ccc Q lcl|NC_019719. 188 ---------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGG--PVK 256 (424) Q Consensus 188 ---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 256 (424) .+...|.|.+..+...++....+..-..+.+...+.|-.++.-......++.........-..... .+. 
T Consensus 240 ~g~vPvv~~~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (512) T protein:vir:97 240 FERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENR 319 (512) T ss_pred CcccceEeecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhhhhhcccccccccchhhc Confidence 012357787777777777776666555555666666766665332222222111111111111100 111 Q ss_pred CcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH----------HHHHHHHH Q lcl|NC_019719. 257 KRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYT 326 (424) Q Consensus 257 g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~~~~~t 326 (424) ....-.+.|.+++.+........+....+...+.|...-++|..-.+... ++.|+....-. ....+... T Consensus 320 ~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~ 398 (512) T protein:vir:97 320 DTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKG 398 (512) T ss_pred ccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11122345566666665444555567788888999999999865443222 22222222111 11122222 Q ss_pred HHHHHHHHHHHHHhhcc--CccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCe-- Q lcl|NC_019719. 327 LQPYISRWENSIQRWLI--PAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDV-- 400 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~--~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~-- 400 (424) +.-.+..|...+...-- ...+.. .+++.++.-+..|..+.++.+.++. |+++..-+.++++.-+.| ..+. 
T Consensus 399 l~~~~~li~~~~~~~~~~~~~~d~~--~i~~~f~~~~p~~~~e~~~~~~kl~--giiS~et~~~~l~~v~d~~~E~eri~ 474 (512) T protein:vir:97 399 LRRRAKLLETILKNTRSIDANKDFN--TVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKIE 474 (512) T ss_pred HHHHHHHHHHHHHhcCCcccccccc--cceEEeCCCCCcCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHH Confidence 22222222222221110 011111 2344445556677888888888884 889998888887663321 1000 Q ss_pred ---------eeecccccchhhc-cccCCCcccCC Q lcl|NC_019719. 401 ---------AMRQSQYVPITDL-GTNKEPRNNGA 424 (424) Q Consensus 401 ---------~~~~~n~~~~~~~-~~~~~~~~~ga 424 (424) ...+....+-... .++++..++.+ T Consensus 475 ~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (512) T protein:vir:97 475 EDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTV 508 (512) T ss_pred HHHHHHHHHHhhcccCCCCCCCCCCCCCCccccc Confidence 0000000000000 00111111111 No 186 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=97.71 E-value=2.6e-05 Score=45.75 Aligned_cols=397 Identities=9% Similarity=0.018 Sum_probs=169.3 Q ss_pred CCCCccccc--------CCC----------------CCchHHHHHhhccCcccCcccccccccccccccccCcccccHHH Q lcl|NC_019719. 1 MEEPKYTID--------LRT----------------NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDER 56 (424) Q Consensus 1 ~~~~~~~~~--------~~~----------------~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 56 (424) -=.+++.+. .++ ++--++++..+..+.-.. .... ....... -+.. T Consensus 20 ~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i-l~~~---------~~~~~~~-~~~~ 88 (511) T protein:vir:96 20 LFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN-LVEL---------TRRKEEY-MADN 88 (511) T ss_pred hhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCcc-cccc---------Ccccccc-cCcc Confidence 000000000 000 011122222222221100 0000 0000000 0001 Q ss_pred HhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_019719. 
57 ILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) Q Consensus 57 ~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r 136 (424) =+..+.....|+..+.-+-+-|+.+-- .++. ....+..++. . | ....+...+..+++.+|.+|.++-+ T Consensus 89 ki~~n~~k~Iv~~~~~yl~g~p~~~~~--~d~~-----~~~~l~~~~~-~-n---~~~~~~~~~~~~~~~~G~a~~~vy~ 156 (511) T protein:vir:96 89 RVAHDYASYISDFINGYFLGNPIQYQD--DDKD-----VLEAIEAFND-L-N---DVESHNRSLGLDLSIYGKAYELMIR 156 (511) T ss_pred eeecchHHHHHHHHhhhhcccCceeec--CchH-----HHHHHHHHHh-h-c---ChhHHHHHHHHHHHhcCeeEEEEEe Confidence 112345566777777777777776521 1111 1123344332 2 2 3445667788899999999999999 Q ss_pred CCCCceeeEEeecCceEEEEEcCCc---eEE---EEEe----cC--c----eEEecHhHeeEeccCC------------- Q lcl|NC_019719. 137 NSAGDVISLLPLQSANMDVKLVGKK---VVY---RYQR----DS--E----YADFSQKEIFHLKGFG------------- 187 (424) Q Consensus 137 ~~~G~~~~l~~l~~~~v~~~~~~~~---~~~---~~~~----~~--~----~~~~~~~evih~r~~~------------- 187 (424) +.+|.+ .+..++|..+.+..++.. ..+ .|.. +. . ...+.++.+.+++... T Consensus 157 d~dg~~-~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~ 235 (511) T protein:vir:96 157 NQDDET-RLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSF 235 (511) T ss_pred CCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccc Confidence 988875 577788888888766422 111 1110 00 0 1133555555442110 Q ss_pred -------------CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCc Q lcl|NC_019719. 188 -------------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGP 254 (424) Q Consensus 188 -------------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (424) .+...|.|.+..+...++....+..-..+.+...+.|-.++.-......++.....+..+-...... 
T Consensus 236 ~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~ 315 (511) T protein:vir:96 236 ESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTV 315 (511) T ss_pred ccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccc Confidence 0123477777777777776665554444445555556655553322222221111111000000000 Q ss_pred c-cCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHH----------HHHH Q lcl|NC_019719. 255 V-KKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFL 323 (424) Q Consensus 255 ~-~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~----------~~~~ 323 (424) . .+.-.-.+.+.+++.+........+....+...+.|+..-++|..-.+... ++.|+....... ...+ T Consensus 316 ~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~f 394 (511) T protein:vir:96 316 YVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLF 394 (511) T ss_pred eeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 000011233444554554444555667788889999999999965443322 222222222111 1222 Q ss_pred HHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCee Q lcl|NC_019719. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~ 401 (424) ...+.-.++.|...+...--.........+++.+..-+..|..+.++.+.++. 
|+++..-+.+++++-+.+ +.+.+ T Consensus 395 ~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~v~d~~~El~ri 472 (511) T protein:vir:96 395 TKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKI 472 (511) T ss_pred HHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHH Confidence 22333333333332222111011111122344445666788888999998885 789887787777553211 11100 Q ss_pred e--------e---cccccchhhcc-ccCCCcccCC Q lcl|NC_019719. 402 M--------R---QSQYVPITDLG-TNKEPRNNGA 424 (424) Q Consensus 402 ~--------~---~~n~~~~~~~~-~~~~~~~~ga 424 (424) - . .....+-+... ++.+.+++.+ T Consensus 473 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (511) T protein:vir:96 473 EEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTV 507 (511) T ss_pred HHHHHHHHHHHhhccccCCCCCCCCCCCCCccCcc Confidence 0 0 00000000000 0111111111 No 187 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=97.71 E-value=2.6e-05 Score=45.75 Aligned_cols=397 Identities=9% Similarity=0.018 Sum_probs=169.3 Q ss_pred CCCCccccc--------CCC----------------CCchHHHHHhhccCcccCcccccccccccccccccCcccccHHH Q lcl|NC_019719. 1 MEEPKYTID--------LRT----------------NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDER 56 (424) Q Consensus 1 ~~~~~~~~~--------~~~----------------~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 56 (424) -=.+++.+. .++ ++--++++..+..+.-.. .... ....... -+.. T Consensus 20 ~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i-l~~~---------~~~~~~~-~~~~ 88 (511) T protein:vir:78 20 LFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN-LVEL---------TRRKEEY-MADN 88 (511) T ss_pred hhhhhhCCcccccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCcc-cccc---------Ccccccc-cCcc Confidence 000000000 000 011122222222221100 0000 0000000 0001 Q ss_pred HhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_019719. 
57 ILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) Q Consensus 57 ~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r 136 (424) =+..+.....|+..+.-+-+-|+.+-- .++. ....+..++. . | ....+...+..+++.+|.+|.++-+ T Consensus 89 ki~~n~~k~Iv~~~~~yl~g~p~~~~~--~d~~-----~~~~l~~~~~-~-n---~~~~~~~~~~~~~~~~G~a~~~vy~ 156 (511) T protein:vir:78 89 RVAHDYASYISDFINGYFLGNPIQYQD--DDKD-----VLEAIEAFND-L-N---DVESHNRSLGLDLSIYGKAYELMIR 156 (511) T ss_pred eeecchHHHHHHHHhhhhcccCceeec--CchH-----HHHHHHHHHh-h-c---ChhHHHHHHHHHHHhcCeeEEEEEe Confidence 112345566777777777777776521 1111 1123344332 2 2 3445667788899999999999999 Q ss_pred CCCCceeeEEeecCceEEEEEcCCc---eEE---EEEe----cC--c----eEEecHhHeeEeccCC------------- Q lcl|NC_019719. 137 NSAGDVISLLPLQSANMDVKLVGKK---VVY---RYQR----DS--E----YADFSQKEIFHLKGFG------------- 187 (424) Q Consensus 137 ~~~G~~~~l~~l~~~~v~~~~~~~~---~~~---~~~~----~~--~----~~~~~~~evih~r~~~------------- 187 (424) +.+|.+ .+..++|..+.+..++.. ..+ .|.. +. . ...+.++.+.+++... T Consensus 157 d~dg~~-~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~ 235 (511) T protein:vir:78 157 NQDDET-RLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSF 235 (511) T ss_pred CCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccc Confidence 988875 577788888888766422 111 1110 00 0 1133555555442110 Q ss_pred -------------CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCc Q lcl|NC_019719. 188 -------------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGP 254 (424) Q Consensus 188 -------------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (424) .+...|.|.+..+...++....+..-..+.+...+.|-.++.-......++.....+..+-...... 
T Consensus 236 ~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~ 315 (511) T protein:vir:78 236 ESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTV 315 (511) T ss_pred ccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccc Confidence 0123477777777777776665554444445555556655553322222221111111000000000 Q ss_pred c-cCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHH----------HHHH Q lcl|NC_019719. 255 V-KKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFL 323 (424) Q Consensus 255 ~-~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~----------~~~~ 323 (424) . .+.-.-.+.+.+++.+........+....+...+.|+..-++|..-.+... ++.|+....... ...+ T Consensus 316 ~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~f 394 (511) T protein:vir:78 316 YVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLF 394 (511) T ss_pred eeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 000011233444554554444555667788889999999999965443322 222222222111 1222 Q ss_pred HHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCee Q lcl|NC_019719. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~ 401 (424) ...+.-.++.|...+...--.........+++.+..-+..|..+.++.+.++. 
|+++..-+.+++++-+.+ +.+.+ T Consensus 395 ~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~v~d~~~El~ri 472 (511) T protein:vir:78 395 TKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKI 472 (511) T ss_pred HHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHH Confidence 22333333333332222111011111122344445666788888999998885 789887787777553211 11100 Q ss_pred e--------e---cccccchhhcc-ccCCCcccCC Q lcl|NC_019719. 402 M--------R---QSQYVPITDLG-TNKEPRNNGA 424 (424) Q Consensus 402 ~--------~---~~n~~~~~~~~-~~~~~~~~ga 424 (424) - . .....+-+... ++.+.+++.+ T Consensus 473 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (511) T protein:vir:78 473 EEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTV 507 (511) T ss_pred HHHHHHHHHHHhhccccCCCCCCCCCCCCCccCcc Confidence 0 0 00000000000 0111111111 No 188 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=97.69 E-value=2.8e-05 Score=45.54 Aligned_cols=386 Identities=12% Similarity=0.051 Sum_probs=164.3 Q ss_pred CCCCCchHHHHHhhccCcccCccccccccccccccccc--CcccccH---HHHhhhHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_019719. 10 LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHL--GDSSIND---ERILQISTVWRCVSLISTLTACLPLDVFET 84 (424) Q Consensus 10 ~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~---~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~ 84 (424) +-|-+=++.+|...+....... ......+.+.... .+..+.. 
..-....+...+|+..++.+---++.+ T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~---~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~--- 74 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNL---LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI--- 74 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHH---HHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhhccCceec--- Confidence 2233335555544432211100 0001111111110 0111111 111123445567777666553333322 Q ss_pred cccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeC------CCCceeeEEeecCceEEEEEc Q lcl|NC_019719. 85 DQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN------SAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 85 ~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~------~~G~~~~l~~l~~~~v~~~~~ 158 (424) .++. .....+..++. . | ........+..+.+.+|.||+.+.++ .+|.+ .+.+++|..+.+..| T Consensus 75 ~~d~-----~~~~~l~~i~~-~-N---~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~i~D 143 (480) T protein:vir:78 75 SEDS-----EGLEELWNWWQ-A-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELD 143 (480) T ss_pred CCCc-----hhHHHHHHHHH-h-c---CHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCee-EEEEEcccceEEEEc Confidence 1111 11234555553 2 2 24556678889999999999988753 34544 577888888888776 Q ss_pred CCc---e--E--EEEEecC-c----eEEecHhHe-----------------------------eEeccCC-CCccccCch Q lcl|NC_019719. 159 GKK---V--V--YRYQRDS-E----YADFSQKEI-----------------------------FHLKGFG-FTGLVGLSP 196 (424) Q Consensus 159 ~~~---~--~--~~~~~~~-~----~~~~~~~ev-----------------------------ih~r~~~-~~~~~G~s~ 196 (424) ... . . |.+.... . ...+.++.+ +|+.+.. 
.++.+|.|- T Consensus 144 ~~~~~~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sd 223 (480) T protein:vir:78 144 PRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE 223 (480) T ss_pred CCCccceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCccc Confidence 431 1 0 1111000 0 011122222 3333221 234567775 Q ss_pred HHH-H---HHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-CCceeeec Q lcl|NC_019719. 197 IAF-A---CKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-AGFSTSAI 271 (424) Q Consensus 197 ~~~-~---~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~-~g~~~~~l 271 (424) +.- + .+.+....+-......+| +.|..++. +... ++...+.-...+... .+.++.++ ++.++.++ T Consensus 224 i~~~i~~l~Da~~~~~s~~~~~~~~~---a~p~~~i~-G~~~-~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~ 293 (480) T protein:vir:78 224 ISPELRKVTDAASRTLMNLQSASQIL---GTPLRVIS-GVTT-DELTNDGENTTLDIY-----YGRILTLASEAAKISEF 293 (480) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhh---cchhhhhh-CCCc-cccccccccchhhhh-----hhhhccCCCCCceEEec Confidence 542 3 333333333322333333 34544443 1111 111111111112111 12344444 34567666 Q ss_pred ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhh------ccCc Q lcl|NC_019719. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRW------LIPA 345 (424) Q Consensus 272 ~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~------l~~~ 345 (424) .....+ .+++..+..+.+++..-++|+..+|....+.+|+.........+...+ .=....+...|.+. +... 
T Consensus 294 ~~~~~~-~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~-~~~~~~f~~~l~~~~rl~~~~~~~ 371 (480) T protein:vir:78 294 KAAELR-NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMA-ERKGRIFGGAWERAMRIAMQIMGR 371 (480) T ss_pred CccCHH-HHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHcCC Confidence 654333 367888889999999999999999864433222222222121211111 11111122222211 1111 Q ss_pred ccc-ccceeeecchhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCC--CCCeeeeccc-------------- Q lcl|NC_019719. 346 KDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP--GGDVAMRQSQ-------------- 406 (424) Q Consensus 346 ~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g--~~T~NE~R~~~G~~p~~--~gd~~~~~~n-------------- 406 (424) ... ....+++.+......+..+.++.+.+++.+| +++..-+++++|+.+.+ ..++...... T Consensus 372 ~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~ 451 (480) T protein:vir:78 372 EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQ 451 (480) T ss_pred CccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccCC Confidence 111 1233455555555677888898899988866 67776778888887532 1111000000 Q ss_pred --ccchhhcccc--CCCcccCC Q lcl|NC_019719. 407 --YVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 407 --~~~~~~~~~~--~~~~~~ga 424 (424) -.+-...++. +.....+| T Consensus 452 ~~~~~~~~~~~~~~~~~~~~~~ 473 (480) T protein:vir:78 452 ADATPKPTVTETKTETQTSPSG 473 (480) T ss_pred CccccCCCCCCCCCccCCCccc Confidence 0000011100 00111111 No 189 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=97.67 E-value=3e-05 Score=45.36 Aligned_cols=401 Identities=8% Similarity=0.001 Sum_probs=178.2 Q ss_pred CCCCcccccCCCCC---chHHHHHhhccCccc-Cccccccccc-----------------ccccccccCcccccHHHHhh Q lcl|NC_019719. 
1 MEEPKYTIDLRTNN---GWWARLQSWFVGGRL-VTPNQGSQTG-----------------PVSAHGHLGDSSINDERILQ 59 (424) Q Consensus 1 ~~~~~~~~~~~~~~---G~~~~l~~~~~~~~~-~~~~~~~~~~-----------------~~~~~~~~~~~~~~~~~~~~ 59 (424) |.--||-=++.... -++.++.....+... .......+.+ .+............+..=+. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 22222211111111 122333222111000 0000000000 00000000000000000122 Q ss_pred hHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC Q lcl|NC_019719. 60 ISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA 139 (424) Q Consensus 60 ~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~ 139 (424) ++....+|+..+.-+-+-|+.+--....... + .....+.+++. ..........+..+++.+|.||.++..+.+ T Consensus 81 ~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~-e-~~~~~l~~~~~-----~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~ 153 (474) T protein:vir:94 81 NSFDSEIVDTRVGYLHGVPVTYDLDENAEKN-E-KLKKFITNFAI-----RNSVDDEDSEIGKMAAICGYGARLAYIDTN 153 (474) T ss_pred cchHHHHHHhHhhheeccceeEeeCCCCcch-H-HHHHHHHHHHh-----hcCHhHHHHHHHHHHhhcCeEEEEEEeCCC Confidence 4556677888887777778775322111111 1 11112222222 124566777888999999999999988888 Q ss_pred CceeeEEeecCceEEEEEcCCce------EEEEEe--cCc---e-EEecHhHeeEeccC------------------C-- Q lcl|NC_019719. 140 GDVISLLPLQSANMDVKLVGKKV------VYRYQR--DSE---Y-ADFSQKEIFHLKGF------------------G-- 187 (424) Q Consensus 140 G~~~~l~~l~~~~v~~~~~~~~~------~~~~~~--~~~---~-~~~~~~evih~r~~------------------~-- 187 (424) |.+ .+..++|..+.+..++... +|.... ++. . ..+....+++++.. 
+ T Consensus 154 ~~~-~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 232 (474) T protein:vir:94 154 GDI-RIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLF 232 (474) T ss_pred Cee-EEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceE Confidence 875 5677888887776654321 111110 000 0 01112222222110 0 Q ss_pred --CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCC Q lcl|NC_019719. 188 --FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAG 265 (424) Q Consensus 188 --~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g 265 (424) .+...|.|.+..+...++....+.....+.+...+.|-.+++- ... +++....+ ...+.+.+.+.+ T Consensus 233 ~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g-~~~-~~~~~~~~----------~~~~~i~~~~~~ 300 (474) T protein:vir:94 233 GVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRG-MGM-SEEMIQET----------QKSGAFELFDKD 300 (474) T ss_pred EecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcc-CCC-Cchhhhhh----------hhcceeEecCCC Confidence 1233577777777666666655555555555555556655542 222 22221111 112334455666 Q ss_pred ceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH----------HHHHHHHHHHHHHHHHH Q lcl|NC_019719. 266 FSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISRWE 335 (424) Q Consensus 266 ~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ie 335 (424) .+++-+.....+..+....+...+.|...-++|..-.+... ++.++....-. ....+...+.-.++.|. 
T Consensus 301 ~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~ 379 (474) T protein:vir:94 301 MDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFN-GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVIL 379 (474) T ss_pred CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 67766665545555677888889999999999864433222 22232222211 11233444444444444 Q ss_pred HHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeee-----eccccc Q lcl|NC_019719. 336 NSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAM-----RQSQYV 408 (424) Q Consensus 336 ~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~~-----~~~n~~ 408 (424) ..++.+-....+.....+++.+..-+..|..+.++.+.++. |+++..-+.++++.-+.+ ..+++- ...... T Consensus 380 ~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~ 457 (474) T protein:vir:94 380 SALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK--GQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLP 457 (474) T ss_pred HHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcc Confidence 44433211111111123445556666788899999999884 899998888888763321 111100 000111 Q ss_pred chhhccccCCCcccCC Q lcl|NC_019719. 409 PITDLGTNKEPRNNGA 424 (424) Q Consensus 409 ~~~~~~~~~~~~~~ga 424 (424) .......+++++++.. T Consensus 458 ~~~~~~~~~~~~~~~s 473 (474) T protein:vir:94 458 DIDEGDANDKSQNNQS 473 (474) T ss_pred cccCCCcCCCCccccC Confidence 1111111111111111 No 190 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=97.67 E-value=3e-05 Score=45.36 Aligned_cols=401 Identities=8% Similarity=0.001 Sum_probs=178.2 Q ss_pred CCCCcccccCCCCC---chHHHHHhhccCccc-Cccccccccc-----------------ccccccccCcccccHHHHhh Q lcl|NC_019719. 
1 MEEPKYTIDLRTNN---GWWARLQSWFVGGRL-VTPNQGSQTG-----------------PVSAHGHLGDSSINDERILQ 59 (424) Q Consensus 1 ~~~~~~~~~~~~~~---G~~~~l~~~~~~~~~-~~~~~~~~~~-----------------~~~~~~~~~~~~~~~~~~~~ 59 (424) |.--||-=++.... -++.++.....+... .......+.+ .+............+..=+. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 22222211111111 122333222111000 0000000000 00000000000000000122 Q ss_pred hHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC Q lcl|NC_019719. 60 ISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA 139 (424) Q Consensus 60 ~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~ 139 (424) ++....+|+..+.-+-+-|+.+--....... + .....+.+++. ..........+..+++.+|.||.++..+.+ T Consensus 81 ~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~-e-~~~~~l~~~~~-----~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~ 153 (474) T protein:vir:10 81 NSFDSEIVDTRVGYLHGVPVTYDLDENAEKN-E-KLKKFITNFAI-----RNSVDDEDSEIGKMAAICGYGARLAYIDTN 153 (474) T ss_pred cchHHHHHHhHhhheeccceeEeeCCCCcch-H-HHHHHHHHHHh-----hcCHhHHHHHHHHHHhhcCeEEEEEEeCCC Confidence 4556677888887777778775322111111 1 11112222222 124566777888999999999999988888 Q ss_pred CceeeEEeecCceEEEEEcCCce------EEEEEe--cCc---e-EEecHhHeeEeccC------------------C-- Q lcl|NC_019719. 140 GDVISLLPLQSANMDVKLVGKKV------VYRYQR--DSE---Y-ADFSQKEIFHLKGF------------------G-- 187 (424) Q Consensus 140 G~~~~l~~l~~~~v~~~~~~~~~------~~~~~~--~~~---~-~~~~~~evih~r~~------------------~-- 187 (424) |.+ .+..++|..+.+..++... +|.... ++. . ..+....+++++.. 
+ T Consensus 154 ~~~-~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 232 (474) T protein:vir:10 154 GDI-RIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLF 232 (474) T ss_pred Cee-EEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceE Confidence 875 5677888887776654321 111110 000 0 01112222222110 0 Q ss_pred --CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCC Q lcl|NC_019719. 188 --FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAG 265 (424) Q Consensus 188 --~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g 265 (424) .+...|.|.+..+...++....+.....+.+...+.|-.+++- ... +++....+ ...+.+.+.+.+ T Consensus 233 ~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g-~~~-~~~~~~~~----------~~~~~i~~~~~~ 300 (474) T protein:vir:10 233 GVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRG-MGM-SEEMIQET----------QKSGAFELFDKD 300 (474) T ss_pred EecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcc-CCC-Cchhhhhh----------hhcceeEecCCC Confidence 1233577777777666666655555555555555556655542 222 22221111 112334455666 Q ss_pred ceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH----------HHHHHHHHHHHHHHHHH Q lcl|NC_019719. 266 FSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISRWE 335 (424) Q Consensus 266 ~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ie 335 (424) .+++-+.....+..+....+...+.|...-++|..-.+... ++.++....-. ....+...+.-.++.|. 
T Consensus 301 ~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~ 379 (474) T protein:vir:10 301 MDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFN-GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVIL 379 (474) T ss_pred CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 67766665545555677888889999999999864433222 22232222211 11233444444444444 Q ss_pred HHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeee-----eccccc Q lcl|NC_019719. 336 NSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAM-----RQSQYV 408 (424) Q Consensus 336 ~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~~-----~~~n~~ 408 (424) ..++.+-....+.....+++.+..-+..|..+.++.+.++. |+++..-+.++++.-+.+ ..+++- ...... T Consensus 380 ~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~ 457 (474) T protein:vir:10 380 SALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK--GQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLP 457 (474) T ss_pred HHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcc Confidence 44433211111111123445556666788899999999884 899998888888763321 111100 000111 Q ss_pred chhhccccCCCcccCC Q lcl|NC_019719. 409 PITDLGTNKEPRNNGA 424 (424) Q Consensus 409 ~~~~~~~~~~~~~~ga 424 (424) .......+++++++.. T Consensus 458 ~~~~~~~~~~~~~~~s 473 (474) T protein:vir:10 458 DIDEGDANDKSQNNQS 473 (474) T ss_pred cccCCCcCCCCccccC Confidence 1111111111111111 No 191 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=97.66 E-value=3.2e-05 Score=45.23 Aligned_cols=387 Identities=7% Similarity=-0.036 Sum_probs=172.9 Q ss_pred CC--------CCc--ccccC--CCCCchHHHHHhhccCcccCccccccccccccccc-----------ccCcccccHHHH Q lcl|NC_019719. 
1 ME--------EPK--YTIDL--RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHG-----------HLGDSSINDERI 57 (424) Q Consensus 1 ~~--------~~~--~~~~~--~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~ 57 (424) |+ +-+ +..+. .+..-++.++...+... .++.......+.+.. .......-+..= T Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~---~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~r 97 (492) T protein:vir:94 21 LYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEK---LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDR 97 (492) T ss_pred eecCccchhhhhhcccccCCchhhHHHHHHHHHHHHHHH---HHHHHHHHHHhccccccccccccccccccccccccccc Confidence 10 000 00010 11112223332221110 000000000110000 000000001111 Q ss_pred hhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeC Q lcl|NC_019719. 58 LQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN 137 (424) Q Consensus 58 ~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~ 137 (424) +.++....+|+..+.-+-+-|+.+.- .+.. ....+..++. | ........+..+.+.+|.+|.++-.+ T Consensus 98 i~~n~~k~Ivd~~~~yl~G~p~~~~~--~d~~-----~~~~l~~~~~---n---~~~~~~~~~~~~a~~~G~a~~~v~~d 164 (492) T protein:vir:94 98 MITNFHANLVDQKVSYIVGKPIAFKH--TDDE-----VVKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLD 164 (492) T ss_pred cccchHHHHHHHHHhhhcccCceecc--CchH-----HHHHHHHHHh---c---cHHHHHHHHHHHHhhCCeEEEEEEec Confidence 23566777888888887777766521 1111 1122333332 2 23455667888999999999999998 Q ss_pred CCCceeeEEeecCceEEEEEcCCc---eE--E-EEEecC--ceEEecHhHeeEecc----------------------CC Q lcl|NC_019719. 138 SAGDVISLLPLQSANMDVKLVGKK---VV--Y-RYQRDS--EYADFSQKEIFHLKG----------------------FG 187 (424) Q Consensus 138 ~~G~~~~l~~l~~~~v~~~~~~~~---~~--~-~~~~~~--~~~~~~~~evih~r~----------------------~~ 187 (424) .+|.+ .+..++|..+.+..++.. .. . .|.... ....+.+..|.++.. 
++ T Consensus 165 ~dg~~-~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (492) T protein:vir:94 165 EEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGS 243 (492) T ss_pred CCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeeccccccccccccccccC Confidence 88876 577789988888765321 11 1 111111 111122222222210 00 Q ss_pred ---------CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCc Q lcl|NC_019719. 188 ---------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKR 258 (424) Q Consensus 188 ---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 258 (424) .+...|.|.+..+...++....+..-..+.+...+.|-.+++--.+....+... .. ...+ T Consensus 244 ~g~vPvv~~~nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~----~~-------~~~~ 312 (492) T protein:vir:94 244 WGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKR----LL-------RYYG 312 (492) T ss_pred CCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhHH----HH-------hhcc Confidence 022357788877777777777666666666666677776665322221111111 11 1234 Q ss_pred ceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHH----------HHHHHHHHH Q lcl|NC_019719. 259 LWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTLQ 328 (424) Q Consensus 259 ~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~----------~~~~~~tl~ 328 (424) ++.++.+.+.+.+........+....+...+.|+..-++|..-.+.. +++.|+...+-.. ...+...+. 
T Consensus 313 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~ 391 (492) T protein:vir:94 313 AIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKF-GSAPSGVALEFLYTNLNLKADKLARKAKVAIQ 391 (492) T ss_pred ceecCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCcccc-ccCchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55566666655555444445556677788888888888885322211 1222221111111 111222222 Q ss_pred HHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeeee--- Q lcl|NC_019719. 329 PYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAMR--- 403 (424) Q Consensus 329 P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~~~--- 403 (424) -.++.+...++. ..+. ..+.+.++.-+..|..+.++.+.++. |+++..-++++++.-+.+ +.++.-. T Consensus 392 ~~~~li~~~~~~----~~~~--~~i~v~f~~~~p~~~~e~~~~~~kl~--giiS~et~~~~l~~v~d~~~E~eri~~E~~ 463 (492) T protein:vir:94 392 ELLWFVFEHFDI----KGEH--KDVDISFNYNKVANTELQVQTAQQSM--GIVSHETVLENHPFVEDLQAELERIEQEQM 463 (492) T ss_pred HHHHHHHHHhcC----Cccc--ceeeEEecCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHHHHHH Confidence 222222222211 1122 23344445556788888999988885 889988888888763321 1111100 Q ss_pred --cccccchhhccccCCCcccCC Q lcl|NC_019719. 404 --QSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 404 --~~n~~~~~~~~~~~~~~~~ga 424 (424) ..............++.++++ T Consensus 464 ~~~~~~~~~~~~~~~~~~~~~~~ 486 (492) T protein:vir:94 464 EYNKQLPNLDDGGADSAQQQERS 486 (492) T ss_pred HHHhhccccccccCCCCccccCC Confidence 011111111111111111111 No 192 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=97.58 E-value=4.2e-05 Score=44.59 Aligned_cols=373 Identities=12% Similarity=0.034 Sum_probs=159.7 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCccccccccccccccccc--Cccccc---HHHHhhhHHHHHHHHHHHHhhc Q lcl|NC_019719. 
1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHL--GDSSIN---DERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---~~~~~~~~~v~~~i~~ia~~ia 75 (424) |...- .=++++|...+.... ++.......+.+.... .+.... +..-..+.+...+|+..++.+- T Consensus 1 ~~~~~--------~~~i~~l~~~~~~~~---~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~ 69 (441) T protein:vir:80 1 MNSDE--------LALIEGMYDRIQRLS---SWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLD 69 (441) T ss_pred CCccH--------HHHHHHHHHHHHHHH---HHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhc Confidence 21110 022333333322111 0000000111111100 001110 1111123444556665555442 Q ss_pred cCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEE Q lcl|NC_019719. 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDV 155 (424) Q Consensus 76 ~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~ 155 (424) -..| +.. + ...+..++. . | +.......+..+++.+|.||+.+-++.+|.+ .+..++|..+.+ T Consensus 70 ~~g~---~~~-d--------~~~l~~i~~-~-n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~ 131 (441) T protein:vir:80 70 WLGW---TNG-D--------GYGLDGVYA-A-N---RLATASCDVHLDALIFGLSFVAIIPHGDGTV-SVRPQSPKNCTG 131 (441) T ss_pred cccc---cCC-C--------hHHHHHHHH-h-c---CHHHHHHHHHHHHhhcCeeEEEEEeCCCCce-EEEEEccceEEE Confidence 1111 111 1 122444443 2 2 3677778888999999999999999999987 578889999887 Q ss_pred EEcCCce------EEEEEecCc---eEEecHhH--------------------------eeEeccC-CCCccccCchHHH Q lcl|NC_019719. 156 KLVGKKV------VYRYQRDSE---YADFSQKE--------------------------IFHLKGF-GFTGLVGLSPIAF 199 (424) Q Consensus 156 ~~~~~~~------~~~~~~~~~---~~~~~~~e--------------------------vih~r~~-~~~~~~G~s~~~~ 199 (424) ..+.... .+.+..... ...+.++. |+|+.+. 
....++|.|.+.- T Consensus 132 i~d~~~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~ 211 (441) T protein:vir:80 132 KFSADGSRLDAGLVVQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITR 211 (441) T ss_pred EEeCCCCceeEEEEEEEEecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCcccchh Confidence 7654221 011111000 00111111 3444332 2344567775432 Q ss_pred -HHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCC-----ceeeeccc Q lcl|NC_019719. 200 -ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAG-----FSTSAIGV 273 (424) Q Consensus 200 -~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g-----~~~~~l~~ 273 (424) +...++.......-........+.|..++. +... ++..... ++. ..++++.++.+ .++.++.. T Consensus 212 ~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~-G~~~-~~~~~~~----~~~-----~~~~i~~~~~~~~~~~~~~~~~~~ 280 (441) T protein:vir:80 212 SIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT-GVSA-DEFSQPG----WVL-----SMASVWAVDKDDDGDTPNVGSFPV 280 (441) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhcCceeeee-cCCc-cccccch----hhh-----cccccccCCCCCCCCcceeEecCc Confidence 223333322222222222333344555553 1111 1111111 111 12334444432 34444443 Q ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHH----------HHHHHHHHHHHHHHHHhhcc Q lcl|NC_019719. 274 TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL----------QYTLQPYISRWENSIQRWLI 343 (424) Q Consensus 274 ~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~----------~~tl~P~~~~ie~~l~~~l~ 343 (424) ...+ .+++..+.....++..-++|+..+|....+..|+.........+. 
...|.-.++.+...++...- T Consensus 281 ~~~~-~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~ 359 (441) T protein:vir:80 281 NSPT-PYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVD 359 (441) T ss_pred cchH-HHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Confidence 3222 367778888999999999999999875543323222222211111 11111111111111111110 Q ss_pred CccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCC--HHHHHHHhCCCCCCCCCeeeecccccchhhccccC-CCc Q lcl|NC_019719. 344 PAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRT--INEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNK-EPR 420 (424) Q Consensus 344 ~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T--~NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~~~-~~~ 420 (424) ... ....+++.+...+..+..+.++.+.+++.+|+.. ...+++.+|+.+.+- .+... ......+ -.. T Consensus 360 ~~~--~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~e~-~~~~~-------e~~e~~~~~~~ 429 (441) T protein:vir:80 360 EAD--FFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEMLGLDDVQV-EAVMR-------HRAESSDPLAV 429 (441) T ss_pred ccc--cceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHHHH-HHHHH-------HHHHHHHHHHH Confidence 001 1134455556667788899999999999999764 345677777764321 00000 0000000 000 Q ss_pred ccCC Q lcl|NC_019719. 421 NNGA 424 (424) Q Consensus 421 ~~ga 424 (424) ..|. T Consensus 430 ~~~~ 433 (441) T protein:vir:80 430 LAGA 433 (441) T ss_pred Hhhh Confidence 0111 No 193 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=97.58 E-value=4.2e-05 Score=44.58 Aligned_cols=384 Identities=13% Similarity=0.079 Sum_probs=178.9 Q ss_pred CchHHHHHhhccCcccCc---cc-------ccccccc--c----ccccccCccc--c---------cHHHHhhhHHHHHH Q lcl|NC_019719. 
14 NGWWARLQSWFVGGRLVT---PN-------QGSQTGP--V----SAHGHLGDSS--I---------NDERILQISTVWRC 66 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~---~~-------~~~~~~~--~----~~~~~~~~~~--~---------~~~~~~~~~~v~~~ 66 (424) =|+|++++++|++..... +. ......+ . .+...+.|.. + ..+..........+ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i 80 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLA 80 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchHHHH Confidence 689999999887642211 00 0000000 0 0000011100 0 00111222333445 Q ss_pred HHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEE Q lcl|NC_019719. 67 VSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLL 146 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~ 146 (424) ++..|+-+..=|..+.-. +. .....+.+++.. | ....-.+..+.+.+..|.+++.+..+. |. +.+. T Consensus 81 ~~~~A~ll~~e~~~i~~~---d~----~~~e~l~~i~~~--n---~f~~~~~~~~e~a~a~G~~~~k~~~D~-~~-~~i~ 146 (505) T protein:vir:79 81 SAKLASLIFNEQCQVTVS---DE----TANDFLDDVFQQ--N---DFYTTFEEKLEEWIALGSGCVRPYVDS-GK-IKLA 146 (505) T ss_pred HHHHHhhhcCCCceeecC---Ch----HHHHHHHHHHHh--c---cHHHHHHHHHHHHhhcCCeEEEEEEeC-Cc-eEEE Confidence 555555554434333111 10 011122233321 1 234555667777888888888776663 33 3455 Q ss_pred eecCceEEEEE-cCCc------------------eEEE-----------EEecC----------ceEEecHh-------- Q lcl|NC_019719. 147 PLQSANMDVKL-VGKK------------------VVYR-----------YQRDS----------EYADFSQK-------- 178 (424) Q Consensus 147 ~l~~~~v~~~~-~~~~------------------~~~~-----------~~~~~----------~~~~~~~~-------- 178 (424) .++|..+.+.. +.+. .+|. |.+.. ....++.. 
T Consensus 147 ~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l 226 (505) T protein:vir:79 147 WATADQVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGL 226 (505) T ss_pred EEcCCeeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhccccccc Confidence 56666666532 2111 0110 00000 00001101 Q ss_pred ------------HeeEeccCCC-----CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeE-----EcCCCCCC Q lcl|NC_019719. 179 ------------EIFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQIL-----STGEKVLT 236 (424) Q Consensus 179 ------------evih~r~~~~-----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl-----~~~~~~~~ 236 (424) -+.|++.+.. ..+.|+|.+..+...++.....-......|+.|.. ..++ ........ T Consensus 227 ~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~-~i~v~~~~l~~~~~~~~ 305 (505) T protein:vir:79 227 EPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQR-RLIVPAEWLKTGSSYGG 305 (505) T ss_pred CcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhccc-ceeechHHhcccCCCCc Confidence 1234543211 34679999999998888777766666666766543 3222 21111111 Q ss_pred H--HH-HHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccch Q lcl|NC_019719. 237 E--QQ-RSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGS 313 (424) Q Consensus 237 ~--~~-~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~ 313 (424) + .. ..........+.+ +..-+++..++.++....+.++.+..+...+.|+...|+++..++....+..+.. T Consensus 306 ~~~~~~~~~fd~~~~~y~~------~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAt 379 (505) T protein:vir:79 306 QASETHPPMFDPDETVYQA------MYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTAT 379 (505) T ss_pred ccccccccCCCccceeeee------ccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHH Confidence 0 00 0000000000111 0011223457778877788888999999999999999999999987655433211 Q ss_pred hHH----------HHHHHHHHHHHHHHHHHHHHHHHhhccCcc-------ccccceeeecchhhhccCHHHHHHHHHHHH Q lcl|NC_019719. 
314 GIE----------QQNLGFLQYTLQPYISRWENSIQRWLIPAK-------DVGRIHAEHNLDGLLRGDSASRAAFMKAMG 376 (424) Q Consensus 314 n~e----------~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~-------~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~ 376 (424) -+. ......++.+|..++..|........+... ....+.+.++++.-+..|.++..+...+++ T Consensus 380 ei~s~~~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v 459 (505) T protein:vir:79 380 EVVTNNSQTYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAV 459 (505) T ss_pred HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHH Confidence 111 111222344444444444333222111111 112345667777777899999999999999 Q ss_pred hCCCCCHHHHHHHh-CCCCCCCCCeeeecccccchhh--ccccCCCcccCC Q lcl|NC_019719. 377 EAGLRTINEMRRTD-NLPPLPGGDVAMRQSQYVPITD--LGTNKEPRNNGA 424 (424) Q Consensus 377 ~~g~~T~NE~R~~~-G~~p~~~gd~~~~~~n~~~~~~--~~~~~~~~~~ga 424 (424) .+|+|++-+++... |++. +..++.+ ..+.. .....+..+-|+ T Consensus 460 ~~Gi~s~e~~l~~~~~~~e-eea~~el-----~ri~~E~~~~~p~~~~~gg 504 (505) T protein:vir:79 460 QAQVMPKKQFLMRNYGLDE-EEADEWL-----AQIDAENSTAEPEFNQFGG 504 (505) T ss_pred HcCCCCHHHHHHhcCCCCh-HHHHHHH-----HHHHHhccccCCCchhccC Confidence 99999998887653 4432 1111111 11111 001111122222 No 194 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=97.57 E-value=4.4e-05 Score=44.49 Aligned_cols=394 Identities=9% Similarity=0.021 Sum_probs=175.1 Q ss_pred CCCCcccccCCCCCc----hHHHHHhhccCcccCccccccccccccccc-ccCc--cccc-HHHHhhhHHHHHHHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNG----WWARLQSWFVGGRLVTPNQGSQTGPVSAHG-HLGD--SSIN-DERILQISTVWRCVSLIST 72 (424) Q Consensus 1 ~~~~~~~~~~~~~~G----~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~--~~~~-~~~~~~~~~v~~~i~~ia~ 72 (424) --|+.=+|.+-++.. ++.++........ ++.......+.+.. .... ..-. +..=+..+....+|+..+. 
T Consensus 2 ~~~~~~~~~~p~d~~~~~~~l~~~i~~~~~~~---~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~ 78 (453) T protein:vir:39 2 KYKPPKLMTFPKDEPITNEVVTKFMEKHRLEV---ARYEYLKNMYRGIMAIDAEPTKDLWKPDNRLTVNFTKYIVDTFTG 78 (453) T ss_pred eecCCcceEcCCCCCCCHHHHHHHHHHHHHHH---HHHHHHHHHhhccCchhcCCCccccCccceeecchHHHHHHHHhh Confidence 224444566666654 3444433322110 11000000111100 0000 0000 0011224566778888888 Q ss_pred hhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCce Q lcl|NC_019719. 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSAN 152 (424) Q Consensus 73 ~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~ 152 (424) -+-+-|+.+--. +. .....+.+++.. | ........+..+.+.+|.||+.+.++.+|.+ .+..++|.. T Consensus 79 ~l~g~~~~~~~~--d~-----~~~~~l~~i~~~--N---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~ 145 (453) T protein:vir:39 79 YFNGIPVKKSHS--DK-----ETLSKLQEFDNL--N---DMEDEESELAKMACIYGRAFELLYQNEETQT-NVIYNTPEN 145 (453) T ss_pred hhcccCceeccC--Ch-----HHHHHHHHHHHh--c---ChhHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccc Confidence 777777665211 11 111234444442 2 3455677788999999999999999988876 466678888 Q ss_pred EEEEEcCCc---e--EEEEEecCce----EEecHhHeeEeccCC---------------------CCccccCchHHHHHH Q lcl|NC_019719. 153 MDVKLVGKK---V--VYRYQRDSEY----ADFSQKEIFHLKGFG---------------------FTGLVGLSPIAFACK 202 (424) Q Consensus 153 v~~~~~~~~---~--~~~~~~~~~~----~~~~~~evih~r~~~---------------------~~~~~G~s~~~~~~~ 202 (424) +.+..++.. . .+++...... ..+.++.+.++...+ .+...|.|.+..+.. 
T Consensus 146 ~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~yt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~ 225 (453) T protein:vir:39 146 MFMVYDDTIKQEPLFAVRYGYDDDYKLYGEVYTKETTYALNGTMGFYNMTEQAPNPFDDLPVVEFYFNEERMSIFESVIS 225 (453) T ss_pred eEEEecCCCCCeEEEEEEEEEeCCeEEEEEEEeCCeEEEEEecCCceeeecccccCCCceeEEEecCCCCCCcchhhhHH Confidence 887765422 1 1111111111 112333333322110 012357777776666 Q ss_pred HHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHH Q lcl|NC_019719. 203 SAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMA 282 (424) Q Consensus 203 ~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e 282 (424) .++....+..-..+.....+.|..++.- .+. +++....++.. ...... +....+.+.++..+........+.+ T Consensus 226 liDa~~~~~s~~~~~~~~~~~p~~~~~g-~~~-~~~~~~~~~~~--~~~~~~---~~~~~~~~~~~~~lt~~~~~~~~~~ 298 (453) T protein:vir:39 226 LVNAFNKAISEKANDVDYFSDQYLTFLG-AAV-EEEDLKNIRSN--RVINYY---GESSEAKNVDVKFLEKPDSDSQTEN 298 (453) T ss_pred HHHHHHHHHHHHHHHHHHhhCceeeeec-CCC-Cchhhhhhhhc--ceeeec---CCCCCCCCCceeEEeecCCHHHHHH Confidence 6655555544444445555667666542 223 22222222110 011100 0011123334444444334455566 Q ss_pred HHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHH----------HHHHHHHHHHHHHHHHHHHHhhccCccccccce Q lcl|NC_019719. 283 SRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTLQPYISRWENSIQRWLIPAKDVGRIH 352 (424) Q Consensus 283 ~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~----------~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~ 352 (424) ..+...+.|+..-++|..-.... ++.++...+... ...+...+...++.+...++..- ...+. .. 
T Consensus 299 ~~~~l~~~I~~~s~~p~~~~~~~--gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~-~~~~~--~~ 373 (453) T protein:vir:39 299 LLDRLTKLIFQTTMVANISDESF--GSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVS-NKEAW--KD 373 (453) T ss_pred HHHHHHHHHHHHhCCcccccccc--cCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-Ccccc--cc Confidence 77788888888888884322211 222322222111 12233333443333333222111 11111 22 Q ss_pred eeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeeee------cccccchhhc-cccCCCcccC Q lcl|NC_019719. 353 AEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAMR------QSQYVPITDL-GTNKEPRNNG 423 (424) Q Consensus 353 ~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~--gd~~~~------~~n~~~~~~~-~~~~~~~~~g 423 (424) +.+.+..-+..|..+.++.+.++ .|+++..-+.++++.-+.+. .+.... ..+....... +.+.+..+++ T Consensus 374 i~v~f~~~~p~~~~~~a~~~~kl--~g~is~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (453) T protein:vir:39 374 IEYTFTRNEPKDIKEQAETANIL--MGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVPETN 451 (453) T ss_pred ceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCcC Confidence 34444556678888899998888 47899988888887633221 111000 0000000000 0000000000 Q ss_pred C Q lcl|NC_019719. 424 A 424 (424) Q Consensus 424 a 424 (424) - T Consensus 452 ~ 452 (453) T protein:vir:39 452 E 452 (453) T ss_pred C Confidence 0 No 195 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=97.55 E-value=4.6e-05 Score=44.34 Aligned_cols=393 Identities=11% Similarity=0.060 Sum_probs=168.7 Q ss_pred CCCCc----ccc----cCCCCC----------------chHHHHHhhccCcccCcccccccccccccccccCcccccHHH Q lcl|NC_019719. 1 MEEPK----YTI----DLRTNN----------------GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDER 56 (424) Q Consensus 1 ~~~~~----~~~----~~~~~~----------------G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 56 (424) -=.++ |.+ +.++.+ --++++..+..+.-.. .... 
...... .-+.. T Consensus 20 ~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i-~~~~---------~~~~~~-~~~~~ 88 (511) T protein:vir:99 20 LFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN-LVEL---------TRRKEE-YMADN 88 (511) T ss_pred hhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcc-cccc---------Cccccc-ccCcc Confidence 00001 110 011111 1122222222221000 0000 000000 00000 Q ss_pred HhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_019719. 57 ILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) Q Consensus 57 ~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r 136 (424) =+..+.....|+..+.-+-+-|+.+--. +.. ....+..++. . | ........+..+++.+|.+|.++.+ T Consensus 89 ki~~n~~k~Iv~~~~~yl~g~p~~~~~~--d~~-----~~~~l~~~~~-~-n---~~~~~~~~~~~~~~i~G~a~~~vy~ 156 (511) T protein:vir:99 89 RVAHDYASYISDFINGYFLGNPIQYQDD--DKD-----VLEAIEAFND-L-N---DVESHNRSLGLDLSIYGKAYELMIR 156 (511) T ss_pred eeecchHHHHHHHHHhhhcccCceeecC--chH-----HHHHHHHHHh-h-c---CHhHHHHHHHHHHHhcCeeEEEEEe Confidence 1123445567777777777777765211 111 1123333333 2 2 3556667788899999999999999 Q ss_pred CCCCceeeEEeecCceEEEEEcCCc---eE--EE-EEe-----cC-c----eEEecHhHeeEeccCC------------- Q lcl|NC_019719. 137 NSAGDVISLLPLQSANMDVKLVGKK---VV--YR-YQR-----DS-E----YADFSQKEIFHLKGFG------------- 187 (424) Q Consensus 137 ~~~G~~~~l~~l~~~~v~~~~~~~~---~~--~~-~~~-----~~-~----~~~~~~~evih~r~~~------------- 187 (424) +.+|.+ .+..++|..+.+..++.. .. +. |.. .. . ...+.++.+.+++... 
T Consensus 157 ded~~~-~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~ 235 (511) T protein:vir:99 157 NQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGF 235 (511) T ss_pred CCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCcccccccccccc Confidence 988875 677788988888766432 11 11 110 00 0 1134445554442110 Q ss_pred -------------CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHH---HHHHHh Q lcl|NC_019719. 188 -------------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE---NFKEIA 251 (424) Q Consensus 188 -------------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~---~~~~~~ 251 (424) .+...|.|.+..+...++....+..-..+.+...+.|-.++.-.... ++......++ .+.... T Consensus 236 ~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~-~~~~~~~~~~~~~~~~~~~ 314 (511) T protein:vir:99 236 ESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL-DPVEVRKQKEANVLFLEPT 314 (511) T ss_pred ccCCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCccc-Cchhhcccccccceecccc Confidence 01235777777766666655555544444444445555555432222 2222111111 000000 Q ss_pred CCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHH---------- Q lcl|NC_019719. 252 GGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG---------- 321 (424) Q Consensus 252 ~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~---------- 321 (424) ... .+.....++|.+++.+.....+..+....+...+.|+..-++|..-.+... ++.|+....-.... 
T Consensus 315 ~~~-~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-gn~Sg~Alk~~~~~l~~ka~~k~~ 392 (511) T protein:vir:99 315 VYA-DSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEG 392 (511) T ss_pred ccc-ccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHH Confidence 000 111122345566666665555555667788889999999999975443322 22332222211111 Q ss_pred HHHHHHHHHHHHHHHHHHhhcc--CccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC-- Q lcl|NC_019719. 322 FLQYTLQPYISRWENSIQRWLI--PAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG-- 397 (424) Q Consensus 322 ~~~~tl~P~~~~ie~~l~~~l~--~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~-- 397 (424) .+...+.-.++.|...+...-- ...+.. .+++.+..-+..|..+.++.+.++. |+++..-+.++++.-+.+. T Consensus 393 ~~~~~l~~~~~li~~~~~~~~~~~~~~~~~--~i~i~f~~~~p~n~~e~~~~~~kl~--GiiS~et~l~~l~~v~D~~~E 468 (511) T protein:vir:99 393 LFTKGLRRRAKLLETILKNTRSIDVSKDFN--TVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDPELE 468 (511) T ss_pred HHHHHHHHHHHHHHHHHHhcCCcccccccc--cceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhCCCCCCHHHH Confidence 1112222222222222221110 011111 2344445556678888999888885 8899888888875532211 Q ss_pred CCe--------e---eecccccchhhccccCCCcccCC Q lcl|NC_019719. 398 GDV--------A---MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 gd~--------~---~~~~n~~~~~~~~~~~~~~~~ga 424 (424) .++ . ..+....+-....+++++++... 
T Consensus 469 ~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (511) T protein:vir:99 469 VKKIEEDEKESIKKAQKNMYQDPRNINDDEQDDSTKDS 506 (511) T ss_pred HHHHHHHHHHHHHHHhhcccccCCCCCCCCCCCCCcCc Confidence 000 0 00000000000011111111111 No 196 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=97.50 E-value=5.4e-05 Score=43.96 Aligned_cols=386 Identities=10% Similarity=-0.014 Sum_probs=169.0 Q ss_pred CCCCcc---------ccc--CCCCCchHHHHHhhccCcccCccccccccccccccc---------ccCcc--cccHHHHh Q lcl|NC_019719. 1 MEEPKY---------TID--LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHG---------HLGDS--SINDERIL 58 (424) Q Consensus 1 ~~~~~~---------~~~--~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~--~~~~~~~~ 58 (424) .+-|-+ .++ .+...=++.++...+.... ++.......+.+-. ...+. ...+..=+ T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~---~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki 81 (474) T protein:vir:97 5 IRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQL---DKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRI 81 (474) T ss_pred ccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHH---HHHHHHHHHhccccchhcccchhccccccccccCccee Confidence 011100 011 1111123333322221110 00000000000000 00000 00000112 Q ss_pred hhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC Q lcl|NC_019719. 59 QISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS 138 (424) Q Consensus 59 ~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~ 138 (424) ..+....+|+..+.-+-.-|+.+--. +. .... ..+.++. | ........+..+.+.+|.+|+.+..+. 
T Consensus 82 ~~n~~k~Ivd~~~~~l~g~p~~~~~~--d~-----~~~~-~l~~~~~--n---~~~~~~~e~~~~~~~~G~~~~~~~~d~ 148 (474) T protein:vir:97 82 TTNFHQNLVDQKVSYVASKPVTYSCE--DE-----NVLK-VIHDVLD--T---RWDNKLIDILTATSNKGIDWLQVYINE 148 (474) T ss_pred ecchHHHHHHHHHhhhhcCCceeccC--cH-----HHHH-HHHHHHh--c---cHHHHHHHHHHHHhhcCceEEEEEecC Confidence 24556678888888887778765211 11 0111 2222321 2 244555667888999999999999998 Q ss_pred CCceeeEEeecCceEEEEEcCCc---e---EEEEEecCc--eEEecHhHeeEecc------------------------- Q lcl|NC_019719. 139 AGDVISLLPLQSANMDVKLVGKK---V---VYRYQRDSE--YADFSQKEIFHLKG------------------------- 185 (424) Q Consensus 139 ~G~~~~l~~l~~~~v~~~~~~~~---~---~~~~~~~~~--~~~~~~~evih~r~------------------------- 185 (424) +|.+ .+..++|..+.+..++.. . ...|...+. ...+.++.+.+.+. T Consensus 149 ~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (474) T protein:vir:97 149 NGEM-KLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNW 227 (474) T ss_pred CCee-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCC Confidence 8875 577788888888776431 1 111111111 11122222222110 Q ss_pred --CC----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcc Q lcl|NC_019719. 186 --FG----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRL 259 (424) Q Consensus 186 --~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 259 (424) .+ .+...|.|.+..+...++....+.....+.+...+.|..++.-......++... . ...+++ T Consensus 228 g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-------~----~~~~~~ 296 (474) T protein:vir:97 228 GRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMR-------G----LKYYKA 296 (474) T ss_pred CccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhh-------h----hhccce Confidence 00 023467887777777777766665555555666666766665322211111111 1 123456 Q ss_pred eecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH----------HHHHHHHHHHHHH Q lcl|NC_019719. 
260 WILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQP 329 (424) Q Consensus 260 ~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~~~~~tl~P 329 (424) +.+++|.+.+.+........+.+..+...+.|...-++|..-.... .++.++...+ +.....+...+.- T Consensus 297 i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~ 375 (474) T protein:vir:97 297 INVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKF-GSAPSGIALKFLYGNLDLKANKLKNKATVAIQE 375 (474) T ss_pred eeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7777777766666554555556677888888888888884222111 1222221111 1111233333334 Q ss_pred HHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeee----- Q lcl|NC_019719. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAM----- 402 (424) Q Consensus 330 ~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~~----- 402 (424) ++..|...+.. ..+.....+.|+ .-...|... .++.++..|+++..-++++++.-+.+ ..+..- T Consensus 376 ~~~li~~~~~~----~~d~~~i~v~f~--~~~p~~~~e---~a~~~~~~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~ 446 (474) T protein:vir:97 376 LISFIIDFNNL----KTDVKDIEISFN--FNRMMNDAE---QSQIIAQSQYLSRETLVKSSPLVDDYKAELERIEQEQME 446 (474) T ss_pred HHHHHHHHhCC----CcccceeeEEec--cCcccCHHH---HHHHHHHcCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHH Confidence 43333332221 112222334443 333344444 44556667999998888888763221 110000 Q ss_pred ecccccchhhccccC--CCcccCC Q lcl|NC_019719. 403 RQSQYVPITDLGTNK--EPRNNGA 424 (424) Q Consensus 403 ~~~n~~~~~~~~~~~--~~~~~ga 424 (424) ......+....+.+. +.++.+. 
T Consensus 447 ~~~~~~~~~~~~~~~~~~~~~~~~ 470 (474) T protein:vir:97 447 YNKQLPNLDDGGADGAQQQEGSNN 470 (474) T ss_pred HHhhccccCCCCCCCcccCCCCcc Confidence 001111122211111 1111111 No 197 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=97.50 E-value=5.4e-05 Score=43.96 Aligned_cols=386 Identities=10% Similarity=-0.014 Sum_probs=169.0 Q ss_pred CCCCcc---------ccc--CCCCCchHHHHHhhccCcccCccccccccccccccc---------ccCcc--cccHHHHh Q lcl|NC_019719. 1 MEEPKY---------TID--LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHG---------HLGDS--SINDERIL 58 (424) Q Consensus 1 ~~~~~~---------~~~--~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~--~~~~~~~~ 58 (424) .+-|-+ .++ .+...=++.++...+.... ++.......+.+-. ...+. ...+..=+ T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~---~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki 81 (474) T protein:vir:94 5 IRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQL---DKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRI 81 (474) T ss_pred ccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHH---HHHHHHHHHhccccchhcccchhccccccccccCccee Confidence 011100 011 1111123333322221110 00000000000000 00000 00000112 Q ss_pred hhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC Q lcl|NC_019719. 59 QISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS 138 (424) Q Consensus 59 ~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~ 138 (424) ..+....+|+..+.-+-.-|+.+--. +. .... ..+.++. | ........+..+.+.+|.+|+.+..+. 
T Consensus 82 ~~n~~k~Ivd~~~~~l~g~p~~~~~~--d~-----~~~~-~l~~~~~--n---~~~~~~~e~~~~~~~~G~~~~~~~~d~ 148 (474) T protein:vir:94 82 TTNFHQNLVDQKVSYVASKPVTYSCE--DE-----NVLK-VIHDVLD--T---RWDNKLIDILTATSNKGIDWLQVYINE 148 (474) T ss_pred ecchHHHHHHHHHhhhhcCCceeccC--cH-----HHHH-HHHHHHh--c---cHHHHHHHHHHHHhhcCceEEEEEecC Confidence 24556678888888887778765211 11 0111 2222321 2 244555667888999999999999998 Q ss_pred CCceeeEEeecCceEEEEEcCCc---e---EEEEEecCc--eEEecHhHeeEecc------------------------- Q lcl|NC_019719. 139 AGDVISLLPLQSANMDVKLVGKK---V---VYRYQRDSE--YADFSQKEIFHLKG------------------------- 185 (424) Q Consensus 139 ~G~~~~l~~l~~~~v~~~~~~~~---~---~~~~~~~~~--~~~~~~~evih~r~------------------------- 185 (424) +|.+ .+..++|..+.+..++.. . ...|...+. ...+.++.+.+.+. T Consensus 149 ~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (474) T protein:vir:94 149 NGEM-KLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNW 227 (474) T ss_pred CCee-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCC Confidence 8875 577788888888776431 1 111111111 11122222222110 Q ss_pred --CC----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcc Q lcl|NC_019719. 186 --FG----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRL 259 (424) Q Consensus 186 --~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 259 (424) .+ .+...|.|.+..+...++....+.....+.+...+.|..++.-......++... . ...+++ T Consensus 228 g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-------~----~~~~~~ 296 (474) T protein:vir:94 228 GRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMR-------G----LKYYKA 296 (474) T ss_pred CccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhh-------h----hhccce Confidence 00 023467887777777777766665555555666666766665322211111111 1 123456 Q ss_pred eecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH----------HHHHHHHHHHHHH Q lcl|NC_019719. 
260 WILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQP 329 (424) Q Consensus 260 ~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~~~~~tl~P 329 (424) +.+++|.+.+.+........+.+..+...+.|...-++|..-.... .++.++...+ +.....+...+.- T Consensus 297 i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~ 375 (474) T protein:vir:94 297 INVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKF-GSAPSGIALKFLYGNLDLKANKLKNKATVAIQE 375 (474) T ss_pred eeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7777777766666554555556677888888888888884222111 1222221111 1111233333334 Q ss_pred HHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeee----- Q lcl|NC_019719. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAM----- 402 (424) Q Consensus 330 ~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~~----- 402 (424) ++..|...+.. ..+.....+.|+ .-...|... .++.++..|+++..-++++++.-+.+ ..+..- T Consensus 376 ~~~li~~~~~~----~~d~~~i~v~f~--~~~p~~~~e---~a~~~~~~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~ 446 (474) T protein:vir:94 376 LISFIIDFNNL----KTDVKDIEISFN--FNRMMNDAE---QSQIIAQSQYLSRETLVKSSPLVDDYKAELERIEQEQME 446 (474) T ss_pred HHHHHHHHhCC----CcccceeeEEec--cCcccCHHH---HHHHHHHcCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHH Confidence 43333332221 112222334443 333344444 44556667999998888888763221 110000 Q ss_pred ecccccchhhccccC--CCcccCC Q lcl|NC_019719. 403 RQSQYVPITDLGTNK--EPRNNGA 424 (424) Q Consensus 403 ~~~n~~~~~~~~~~~--~~~~~ga 424 (424) ......+....+.+. +.++.+. 
T Consensus 447 ~~~~~~~~~~~~~~~~~~~~~~~~ 470 (474) T protein:vir:94 447 YNKQLPNLDDGGADGAQQQEGSNN 470 (474) T ss_pred HHhhccccCCCCCCCcccCCCCcc Confidence 001111122211111 1111111 No 198 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=97.50 E-value=5.6e-05 Score=43.91 Aligned_cols=387 Identities=7% Similarity=-0.024 Sum_probs=174.4 Q ss_pred CC----CCcccccCCCCCchHHHHHhhccCcccCcccccccccccccccc---------cCc--ccccHHHHhhhHHHHH Q lcl|NC_019719. 1 ME----EPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGH---------LGD--SSINDERILQISTVWR 65 (424) Q Consensus 1 ~~----~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~--~~~~~~~~~~~~~v~~ 65 (424) -+ -.+++-+......++.++...+.... ++.......+.+... ..+ ...-+..=+.++.... T Consensus 29 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~---~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~ 105 (492) T protein:vir:97 29 TEIFDAIVRTNNKPETLEEMIVRYIKQHLEKL---PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHAN 105 (492) T ss_pred hhHhhhcccCCCchhhHHHHHHHHHHHHHHHH---HHHHHHHHHhcccCccccccccccccccccccccccccccchHHH Confidence 00 01111111222233444433222110 000000001100000 000 0000111123466677 Q ss_pred HHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeE Q lcl|NC_019719. 66 CVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISL 145 (424) Q Consensus 66 ~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l 145 (424) +|+..+.-+-+-|+.+.- .+.. ....+..++. 
| ........+..+++.+|.||.++..+.+|.+ .+ T Consensus 106 Ivd~~~~yl~g~p~~~~~--~d~~-----~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~-~~ 171 (492) T protein:vir:97 106 LVDQKVSYIVGKPIAFKH--TDDE-----VVKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KL 171 (492) T ss_pred HHHHHhhhhcccCceecc--CchH-----HHHHHHHHHh---c---cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EE Confidence 888888877777776521 1110 1122333322 2 2334556678889999999999999988875 57 Q ss_pred EeecCceEEEEEcCCc---eE---EEEEecC--ceEEecHhHeeEecc----------------------CCC------- Q lcl|NC_019719. 146 LPLQSANMDVKLVGKK---VV---YRYQRDS--EYADFSQKEIFHLKG----------------------FGF------- 188 (424) Q Consensus 146 ~~l~~~~v~~~~~~~~---~~---~~~~~~~--~~~~~~~~evih~r~----------------------~~~------- 188 (424) ..++|..+.+..++.. .. ..|.... ....+.+..+.++.. +++ T Consensus 172 ~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 251 (492) T protein:vir:97 172 FRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIP 251 (492) T ss_pred EEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEE Confidence 7789988888765321 11 1111111 111122222222210 000 Q ss_pred --CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCc Q lcl|NC_019719. 189 --TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGF 266 (424) Q Consensus 189 --~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~ 266 (424) +...|.|.+..+...++....+..-..+.+...+.|-.++.-.......+ .... . ...+++.++.+. 
T Consensus 252 ~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~----~~~~----~---~~~~~~~~~~~~ 320 (492) T protein:vir:97 252 FKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPE----FKRL----L---RYYGAIKVSDNG 320 (492) T ss_pred ecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchh----HHHH----H---hhccceecCCCC Confidence 12357788887777777666665555666666666766664322221111 1111 1 122355566666 Q ss_pred eeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH----------HHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISRWEN 336 (424) Q Consensus 267 ~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ie~ 336 (424) +.+.+.....+..+....+...+.|+..-++|..-..... ++.|+...+.. ....+...+...++.|.. T Consensus 321 ~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~ 399 (492) T protein:vir:97 321 GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE 399 (492) T ss_pred cceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccc-cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6666655545566677888888999999999853332211 22222222111 111222223333333322 Q ss_pred HHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeeee-----cccccc Q lcl|NC_019719. 337 SIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAMR-----QSQYVP 409 (424) Q Consensus 337 ~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~~~-----~~n~~~ 409 (424) .++. ..+. ..+.+.++.-+..|..+.++.+.++ .|+++..-+.++++.-+.+ +.+++-. ..+... T Consensus 400 ~~~~----~~~~--~~i~v~f~~~~p~~~~e~a~~~~kl--~G~iS~et~l~~l~~v~d~~~Eleri~~E~~~~~~~~~~ 471 (492) T protein:vir:97 400 HFDI----KGEH--KDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQTEYNKQLPN 471 (492) T ss_pred HhcC----Cccc--ceeeEEecCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhc Confidence 2211 1122 2334444555667888899998888 4889988888888763321 1111000 011111 Q ss_pred hhhccccCCCcccCC Q lcl|NC_019719. 
410 ITDLGTNKEPRNNGA 424 (424) Q Consensus 410 ~~~~~~~~~~~~~ga 424 (424) +...+...+..+.+. T Consensus 472 ~~~~~~~~~~~~~~~ 486 (492) T protein:vir:97 472 LDDGGADSAQQQERS 486 (492) T ss_pred cccCCCCCCcccccc Confidence 111111111111111 No 199 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=97.45 E-value=6.4e-05 Score=43.58 Aligned_cols=386 Identities=10% Similarity=-0.008 Sum_probs=172.3 Q ss_pred CCCCc-----c--cccCCCCCchHHHHHhhccCcccCccccccccccccccccc---------Cccc--ccHHHHhhhHH Q lcl|NC_019719. 1 MEEPK-----Y--TIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHL---------GDSS--INDERILQIST 62 (424) Q Consensus 1 ~~~~~-----~--~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~--~~~~~~~~~~~ 62 (424) ..||. - ..+.++..-++.++........ +........+.+.... .+.. .-+..=+.++. T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~---~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~ 85 (474) T protein:vir:95 9 WDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKL---KDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNF 85 (474) T ss_pred CCCCCCcchhhhccccccchHHHHHHHHHHHHHHH---HHHHHHHHHhcccCccccccchhhhcccccccccccccccch Confidence 11110 0 1223344445555554433211 1000000001000000 0000 00011122455 Q ss_pred HHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce Q lcl|NC_019719. 63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV 142 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~ 142 (424) ....|+..+.-+-+-|+.+--. ++. ..+.+...+. 
| ........+..+++.+|.+|..+-++.+|.+ T Consensus 86 ~k~Iv~~~~~yl~g~p~~~~~~--~~~-----~~~~l~~~~~---n---~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~ 152 (474) T protein:vir:95 86 HQNLVDQKVSYVAGKPVTYAHD--DDK-----VLDVIHQVLD---T---RWDNKLIDILTAASNKGIDWLQVYINEDGEL 152 (474) T ss_pred HHHHHHhhhhhhcccCceeccC--ChH-----HHHHHHHHHh---c---cHHHHHHHHHHHHhhCCeEEEEeeeCCCCce Confidence 6667888888887778775311 111 1123333332 2 3455666778899999999999989888875 Q ss_pred eeEEeecCceEEEEEcCCc--e----EEEEEecCc--eEEecHhHeeEeccC----------------------C----- Q lcl|NC_019719. 143 ISLLPLQSANMDVKLVGKK--V----VYRYQRDSE--YADFSQKEIFHLKGF----------------------G----- 187 (424) Q Consensus 143 ~~l~~l~~~~v~~~~~~~~--~----~~~~~~~~~--~~~~~~~evih~r~~----------------------~----- 187 (424) .+..++|..+.+..++.. . .+.|...+. ...+.++.+.++..- . T Consensus 153 -~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP 231 (474) T protein:vir:95 153 -KLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVP 231 (474) T ss_pred -EEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccc Confidence 577788888887765421 1 111111111 111223333332210 0 Q ss_pred ----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC Q lcl|NC_019719. 188 ----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE 263 (424) Q Consensus 188 ----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~ 263 (424) .+...|.|.+......++....+..-..+.+...+.|-.++.--......+ .... 
....+++.++ T Consensus 232 vv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~-------~~~~----~~~~~~i~~~ 300 (474) T protein:vir:95 232 FIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSE-------FMEG----LKYYKAINVS 300 (474) T ss_pred eEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccc-------hhhh----hhccceeecc Confidence 022357777776666666655555444555555555665554221110011 1111 1233566677 Q ss_pred CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH----------HHHHHHHHHHHHHHH Q lcl|NC_019719. 264 AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISR 333 (424) Q Consensus 264 ~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ 333 (424) ++.+.+.+.....+..+....+...+.|...-++|..-.... .++.++...+.. ....+...+.-+++. T Consensus 301 ~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~ 379 (474) T protein:vir:95 301 SDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKF-GSATSGIALKFLYTNLNLKANKLKNKANVALQELMQF 379 (474) T ss_pred CCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 776666666555556667788888899999999985432221 122222112111 112222233333332 Q ss_pred HHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeee-----eccc Q lcl|NC_019719. 334 WENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAM-----RQSQ 406 (424) Q Consensus 334 ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~~-----~~~n 406 (424) |...+. ...+.....+.| +.-+..|..+.++. +.+.|+++...++++++.-..+ +.+++- ...+ T Consensus 380 i~~~~g----~~~d~~~i~i~f--~~~~p~~~~e~a~~---~~~~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~ 450 (474) T protein:vir:95 380 ILDFNK----IKLDAKEIEITF--NFNVMVNDLEQSQI---GAQSQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQ 450 (474) T ss_pred HHHHhC----CCcccceeeEEe--cCCCccCHHHHHHH---HHHcCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhh Confidence 222111 111222223344 44444555555554 4557999998888888764322 111100 0001 Q ss_pred ccchhhccccC--C-CcccCC Q lcl|NC_019719. 
407 YVPITDLGTNK--E-PRNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~~--~-~~~~ga 424 (424) ...+.+..... + .+.++- T Consensus 451 ~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:95 451 LPNLDDGGADGAQQQQQSENN 471 (474) T ss_pred ccccccccCCCCCCcCCCCcc Confidence 11111111111 1 111111 No 200 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=97.45 E-value=6.4e-05 Score=43.58 Aligned_cols=386 Identities=10% Similarity=-0.008 Sum_probs=172.3 Q ss_pred CCCCc-----c--cccCCCCCchHHHHHhhccCcccCccccccccccccccccc---------Cccc--ccHHHHhhhHH Q lcl|NC_019719. 1 MEEPK-----Y--TIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHL---------GDSS--INDERILQIST 62 (424) Q Consensus 1 ~~~~~-----~--~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~--~~~~~~~~~~~ 62 (424) ..||. - ..+.++..-++.++........ +........+.+.... .+.. .-+..=+.++. T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~---~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~ 85 (474) T protein:vir:96 9 WDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKL---KDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNF 85 (474) T ss_pred CCCCCCcchhhhccccccchHHHHHHHHHHHHHHH---HHHHHHHHHhcccCccccccchhhhcccccccccccccccch Confidence 11110 0 1223344445555554433211 1000000001000000 0000 00011122455 Q ss_pred HHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce Q lcl|NC_019719. 63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV 142 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~ 142 (424) ....|+..+.-+-+-|+.+--. ++. ..+.+...+. 
| ........+..+++.+|.+|..+-++.+|.+ T Consensus 86 ~k~Iv~~~~~yl~g~p~~~~~~--~~~-----~~~~l~~~~~---n---~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~ 152 (474) T protein:vir:96 86 HQNLVDQKVSYVAGKPVTYAHD--DDK-----VLDVIHQVLD---T---RWDNKLIDILTAASNKGIDWLQVYINEDGEL 152 (474) T ss_pred HHHHHHhhhhhhcccCceeccC--ChH-----HHHHHHHHHh---c---cHHHHHHHHHHHHhhCCeEEEEeeeCCCCce Confidence 6667888888887778775311 111 1123333332 2 3455666778899999999999989888875 Q ss_pred eeEEeecCceEEEEEcCCc--e----EEEEEecCc--eEEecHhHeeEeccC----------------------C----- Q lcl|NC_019719. 143 ISLLPLQSANMDVKLVGKK--V----VYRYQRDSE--YADFSQKEIFHLKGF----------------------G----- 187 (424) Q Consensus 143 ~~l~~l~~~~v~~~~~~~~--~----~~~~~~~~~--~~~~~~~evih~r~~----------------------~----- 187 (424) .+..++|..+.+..++.. . .+.|...+. ...+.++.+.++..- . T Consensus 153 -~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP 231 (474) T protein:vir:96 153 -KLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVP 231 (474) T ss_pred -EEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccc Confidence 577788888887765421 1 111111111 111223333332210 0 Q ss_pred ----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC Q lcl|NC_019719. 188 ----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE 263 (424) Q Consensus 188 ----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~ 263 (424) .+...|.|.+......++....+..-..+.+...+.|-.++.--......+ .... 
....+++.++ T Consensus 232 vv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~-------~~~~----~~~~~~i~~~ 300 (474) T protein:vir:96 232 FIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSE-------FMEG----LKYYKAINVS 300 (474) T ss_pred eEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccc-------hhhh----hhccceeecc Confidence 022357777776666666655555444555555555665554221110011 1111 1233566677 Q ss_pred CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH----------HHHHHHHHHHHHHHH Q lcl|NC_019719. 264 AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISR 333 (424) Q Consensus 264 ~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ 333 (424) ++.+.+.+.....+..+....+...+.|...-++|..-.... .++.++...+.. ....+...+.-+++. T Consensus 301 ~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~ 379 (474) T protein:vir:96 301 SDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKF-GSATSGIALKFLYTNLNLKANKLKNKANVALQELMQF 379 (474) T ss_pred CCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 776666666555556667788888899999999985432221 122222112111 112222233333332 Q ss_pred HHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeee-----eccc Q lcl|NC_019719. 334 WENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAM-----RQSQ 406 (424) Q Consensus 334 ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~~-----~~~n 406 (424) |...+. ...+.....+.| +.-+..|..+.++. +.+.|+++...++++++.-..+ +.+++- ...+ T Consensus 380 i~~~~g----~~~d~~~i~i~f--~~~~p~~~~e~a~~---~~~~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~ 450 (474) T protein:vir:96 380 ILDFNK----IKLDAKEIEITF--NFNVMVNDLEQSQI---GAQSQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQ 450 (474) T ss_pred HHHHhC----CCcccceeeEEe--cCCCccCHHHHHHH---HHHcCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhh Confidence 222111 111222223344 44444555555554 4557999998888888764322 111100 0001 Q ss_pred ccchhhccccC--C-CcccCC Q lcl|NC_019719. 
407 YVPITDLGTNK--E-PRNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~~--~-~~~~ga 424 (424) ...+.+..... + .+.++- T Consensus 451 ~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:96 451 LPNLDDGGADGAQQQQQSENN 471 (474) T ss_pred ccccccccCCCCCCcCCCCcc Confidence 11111111111 1 111111 No 201 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=97.43 E-value=6.8e-05 Score=43.43 Aligned_cols=391 Identities=10% Similarity=0.025 Sum_probs=173.4 Q ss_pred CCCCcccccCCCCC----chHHHHHhhccCcccCccccccccccccccccc-Cccc-cc--HHHHhhhHHHHHHHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNN----GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHL-GDSS-IN--DERILQISTVWRCVSLIST 72 (424) Q Consensus 1 ~~~~~~~~~~~~~~----G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~--~~~~~~~~~v~~~i~~ia~ 72 (424) |+.+|+-. +-.+. ..+.++.+..... .++.......+.+.... .... .. +..=+..+.....|+..+. T Consensus 3 ~~~~~~~~-~~~~~~~~~~~i~~~i~~~~~~---~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~ 78 (453) T protein:vir:73 3 LKPIKLMT-YSRDEEITDKVVNDFMKKHQEE---VERYEYLGNMYKGIMEISSQKAKDSWKPDNRLTNNFAKYIVDTFVG 78 (453) T ss_pred cccceeee-ccccccCCHHHHHHHHHHHHHH---HHHHHHHHHHhccccchhcCCCCCccCccceeecchHHHHHHHhhh Confidence 55554321 11112 2233332222111 01100000011000000 0000 00 0011224556667777777 Q ss_pred hhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCce Q lcl|NC_019719. 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSAN 152 (424) Q Consensus 73 ~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~ 152 (424) -+-.-|+.+-- + +. ...+.+...+.. | ........+..+.+.+|.+|+.+-++.+|.+ .+..++|.. 
T Consensus 79 ~l~g~~~~~~~-~-d~-----~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~ 145 (453) T protein:vir:73 79 YFNGIPIKKTH-D-DK-----SVLEAMQLFDNL--N---DMEDEESELAKIACVYGRAYELMYQNESTES-EVIYCSPLN 145 (453) T ss_pred hhcccCceeec-C-Ch-----HHHHHHHHHHHh--c---ChhHHHHHHHHHHHhcCeEEEEEEeCCCCce-EEEEEcccc Confidence 66666665421 1 11 011223333321 2 3455667788899999999999999988876 466778888 Q ss_pred EEEEEcCCc-e------EEEEEecCce--EEecHhHeeEeccC-----------------C----CCccccCchHHHHHH Q lcl|NC_019719. 153 MDVKLVGKK-V------VYRYQRDSEY--ADFSQKEIFHLKGF-----------------G----FTGLVGLSPIAFACK 202 (424) Q Consensus 153 v~~~~~~~~-~------~~~~~~~~~~--~~~~~~evih~r~~-----------------~----~~~~~G~s~~~~~~~ 202 (424) +.+..++.. . .|.+...+.. ..+.++.+++++.. + .+...|.|.+..+.. T Consensus 146 ~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~s~~~~v~~ 225 (453) T protein:vir:73 146 VFMVYDDSIKQKPLFAVYYGFDEEGNLSGTVYTLLETISITGKAGEVKFGESTYNVYSDLPIVEYNFNEERQSIFEPVHS 225 (453) T ss_pred eEEEEeCCCCceeEEEEEEEEecCceEEEEEEeCCeEEEEEecCCceEEccceeccCCceeEEEecCCCCCCcchhhHHH Confidence 877665432 1 1111111111 11233344333210 0 022357777777766 Q ss_pred HHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHH Q lcl|NC_019719. 203 SAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMA 282 (424) Q Consensus 203 ~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e 282 (424) .++....+.....+.....+.|..++.-. .. +++....++..-.........+.....+.+.+++.+.....+..+.. 
T Consensus 226 liDa~~~~~S~~~~~~~~~~~~~l~~~g~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~ 303 (453) T protein:vir:73 226 LINSYNKVTSEKANDVEYFSDQYLVFLGA-EV-DEEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTEN 303 (453) T ss_pred HHHHHHHHHHHHHHHHHHhccceeeeecC-CC-CchhhhcccccccccccccccccccccccCceeEEeeecCCHHHHHH Confidence 66665555544555555556676666422 22 22222222221111111112223333445555555555444555667 Q ss_pred HHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHH----------HHHHHHHHHHHHHHHHHHHhhccCccccccce Q lcl|NC_019719. 283 SRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL----------GFLQYTLQPYISRWENSIQRWLIPAKDVGRIH 352 (424) Q Consensus 283 ~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~----------~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~ 352 (424) ..+...+.|+..-++|.. +....++.++...+.... ..+...+.-.++.+..-++.. -...+. .. T Consensus 304 ~~~~l~~~I~~~s~~p~~--~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~-~~~~~~--~~ 378 (453) T protein:vir:73 304 LLNRLERSIFQFTMAANI--SDENFGNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNA-SNKDAW--KD 378 (453) T ss_pred HHHHHHHHHHHHhCCccc--CcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-CCcccc--cc Confidence 778888889888888842 222223333322222111 222223333333322222211 111111 23 Q ss_pred eeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchhhccc-----------cCCCcc Q lcl|NC_019719. 353 AEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGT-----------NKEPRN 421 (424) Q Consensus 353 ~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~-----------~~~~~~ 421 (424) +++.++.-+..|..+.++.+.+++ |+++..-+.++++.-+.+..+ ...+.+-.+ .+...+ T Consensus 379 i~v~f~~~~p~~~~~~a~~~~k~~--giis~et~~~~~~~~~d~~~E-------~~ri~~E~~~~~~~~~~~~~~~~~~~ 449 (453) T protein:vir:73 379 IEYTFTRNEPKDIKEQAETANILK--GITSEETALSVISVIPDVQAE-------MEKIKKKKLLQLSLTRTSNLVRMKQM 449 (453) T ss_pred ceEEeCCCCCCCHHHHHHHHHHHh--ccCcHHHHHHhCCCCCCHHHH-------HHHHHHHHHHHHHHHHhccCCcchhh Confidence 344445666788999999999886 789887777777663321111 000100000 000000 Q ss_pred cCC Q lcl|NC_019719. 
422 NGA 424 (424) Q Consensus 422 ~ga 424 (424) .|- T Consensus 450 ~~~ 452 (453) T protein:vir:73 450 RGN 452 (453) T ss_pred hcC Confidence 001 No 202 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=97.41 E-value=7.2e-05 Score=43.29 Aligned_cols=382 Identities=9% Similarity=-0.015 Sum_probs=177.9 Q ss_pred CCCC---------ccccc--------CCCC-CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHH Q lcl|NC_019719. 1 MEEP---------KYTID--------LRTN-NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQIST 62 (424) Q Consensus 1 ~~~~---------~~~~~--------~~~~-~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 62 (424) .|.+ ..|.+ .+++ ..-++++..+..+.-. .... ...... +..=+.++. T Consensus 11 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~-i~~~----------~~~~~~---~~~ki~~n~ 76 (470) T protein:vir:99 11 VTGNSSFIFPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHK-ILTA----------PEKETG---ADNRIVVNS 76 (470) T ss_pred ccCCceEEeCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccc-cccC----------cccccC---Ccceeecch Confidence 1111 01100 0010 1222333333322110 0000 000000 111123455 Q ss_pred HHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce Q lcl|NC_019719. 63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV 142 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~ 142 (424) ...+|+..+.-+-.-|+.+.-.+++ . ....+.+++. . 
.........+..+.+.+|.+|+.+..+.+|.+ T Consensus 77 ~~~Ivd~~~~~l~g~p~~~~~~~d~-~-----~~~~l~~~~~-~----n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~ 145 (470) T protein:vir:99 77 AKYVVDVYNGYFCGIEPKLALLNDS-S-----KIDEIARWNR-Q----ENFFDTINEISKQCDIFGRSIASIYQGEDARP 145 (470) T ss_pred HHHHHHHHhhhhccCCeeEeeCCch-h-----HHHHHHHHHH-h----cCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeE Confidence 6677887777777777765322111 0 1122444333 1 24567778889999999999999999888876 Q ss_pred eeEEeecCceEEEEEcCCce---E--E-EEE-ecCc-----eEEecHhHeeEecc-------------------CC---- Q lcl|NC_019719. 143 ISLLPLQSANMDVKLVGKKV---V--Y-RYQ-RDSE-----YADFSQKEIFHLKG-------------------FG---- 187 (424) Q Consensus 143 ~~l~~l~~~~v~~~~~~~~~---~--~-~~~-~~~~-----~~~~~~~evih~r~-------------------~~---- 187 (424) .+..++|..+.+..++... . + .+. .++. ...+.++.+++++. .+ T Consensus 146 -~i~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 224 (470) T protein:vir:99 146 -HLMYSSPNHAFIIYDDTVQRQPLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAINPYGLVPAVEF 224 (470) T ss_pred -EEEEEccceeEEEEcCCCCcceEEEEEEEEEecCCeeEEEEEEEecCeEEEEEecccccccccccccccCCCccceEee Confidence 5777899998887765321 1 1 111 1110 01122333332211 00 Q ss_pred CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec----- Q lcl|NC_019719. 188 FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL----- 262 (424) Q Consensus 188 ~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l----- 262 (424) .+...|.|.+..+...++....+.....+.+...+.|-.++.-.... .++.-+.++.. . 
..+++.+ T Consensus 225 ~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~-~~~~g~~~~~~----~----~~~~~~~~~~~~ 295 (470) T protein:vir:99 225 FENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLP-EDDEGNPKFDF----K----NNRVLYVSQLDP 295 (470) T ss_pred cCCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcc-cccccchhhhh----h----hcceeeecCCCC Confidence 12335777777777777766666555555566667777666532221 21111111111 1 1122322 Q ss_pred CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHH----------HHHHHHHHHHHHH Q lcl|NC_019719. 263 EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTLQPYIS 332 (424) Q Consensus 263 ~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~----------~~~~~~tl~P~~~ 332 (424) +.+.++..+........+....+...+.|+..-++|....+... ++.|+...+... ...+..+|.-.++ T Consensus 296 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~ 374 (470) T protein:vir:99 296 DTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFA-GNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYR 374 (470) T ss_pred CCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23445555655444445566788889999999999965443322 222322222111 1222333333333 Q ss_pred HHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCC-CCCCCeeee-------- Q lcl|NC_019719. 333 RWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPP-LPGGDVAMR-------- 403 (424) Q Consensus 333 ~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p-~~~gd~~~~-------- 403 (424) .+...+...--...+. ..+.+.+..-+..|..+.++.+.++. |+++...+.++++.-. ..+.+++.. T Consensus 375 li~~~~~~~~~~~~~~--~~i~v~f~~~~p~~~~e~a~~~~kl~--giis~et~l~~l~~vd~~~E~eri~~E~~~~~~~ 450 (470) T protein:vir:99 375 IVLATLFNNKQDQELW--SELDFKFTRNLPEDMASAIDNAKNAE--GIVSKKTQLGMIPDIEPDAEMKQIAKEKADAIKQ 450 (470) T ss_pred HHHHHHhccCCccccc--ccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhCCCCCHHHHHHHHHHHHHHHHHH Confidence 3333332221111122 23444446666778888999999885 7899888888876531 111111100 Q ss_pred -cccccchhhccccCCCccc Q lcl|NC_019719. 
404 -QSQYVPITDLGTNKEPRNN 422 (424) Q Consensus 404 -~~n~~~~~~~~~~~~~~~~ 422 (424) .....+.+......+++++ T Consensus 451 ~~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 451 TQQLSMPIDILKRDNNAEEE 470 (470) T ss_pred HHhhcCCCCcCCCCCCccCC Confidence 0011111222222222222 No 203 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=97.41 E-value=7.4e-05 Score=43.22 Aligned_cols=392 Identities=10% Similarity=0.036 Sum_probs=174.6 Q ss_pred CchHHHHH----hhccCcccC-----ccc-ccccccc------c---c-cccccCcccccHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019719. 14 NGWWARLQ----SWFVGGRLV-----TPN-QGSQTGP------V---S-AHGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 14 ~G~~~~l~----~~~~~~~~~-----~~~-~~~~~~~------~---~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 73 (424) -|||.+++ +|+++.... .+. ....... . . .++......+. ..-+..+.-..+++.+|+- T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~-~~~~~~~l~~~i~~~~A~l 79 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVH-DKLMNSGTGNEIVVVAAEY 79 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccc-cccccCChHHHHHHHHHHh Confidence 67777665 455443211 000 0000000 0 0 01111111111 1122334455677777777 Q ss_pred hccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceE Q lcl|NC_019719. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) +..=+..+.-.+.+....+. ....+.++|.. 
| ....-+...+.+.+..|.+++.+..+ +|++ .+..+++..+ T Consensus 80 l~~e~~~i~v~~~~~~d~e~-~~~~l~~il~~--n---~f~~~~~~~~e~a~a~G~~~~k~~~d-~~~~-~i~~v~ad~~ 151 (518) T protein:vir:78 80 ISGKPLSIDVTGVNGSKDEN-LTKQLKEALRI--D---NFDSKSVKIVELAGGSGVSAVKINIL-NGRP-SISVHSSSQF 151 (518) T ss_pred hcCCCceEEecCccccCcHH-HHHHHHHHHHh--c---cHHHHHHHHHHHhhccCceEEEEEEE-CCee-EEEEEcCCee Confidence 76544433211111111111 11123333321 1 23344455566777778777766554 3543 5666777776 Q ss_pred EEEEcCC----------------ceEEE-E-------------------------Ee-cCceEEe-------------cH Q lcl|NC_019719. 154 DVKLVGK----------------KVVYR-Y-------------------------QR-DSEYADF-------------SQ 177 (424) Q Consensus 154 ~~~~~~~----------------~~~~~-~-------------------------~~-~~~~~~~-------------~~ 177 (424) .+...++ ..+|. . .. .+..... .. T Consensus 152 ~P~~~~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~ 231 (518) T protein:vir:78 152 WIDFKNNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHT 231 (518) T ss_pred EEEeecCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCccccccccccccccccccccc Confidence 6643221 11111 0 00 0000000 00 Q ss_pred hH---------------eeEeccCC-----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE-----cCC Q lcl|NC_019719. 178 KE---------------IFHLKGFG-----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS-----TGE 232 (424) Q Consensus 178 ~e---------------vih~r~~~-----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~-----~~~ 232 (424) +. +.|++... .+.+.|+|.+..+...+......-......|+. +.+..++. ... T Consensus 232 ~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~ 310 (518) T protein:vir:78 232 NDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKV 310 (518) T ss_pred ccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCC Confidence 00 12333321 124579999999999888877777666666766 44454441 111 Q ss_pred CCCCHHHHHHHHHHHHHHhCCcccCcceecCCCc----eeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCC Q lcl|NC_019719. 
233 KVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGF----STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS 308 (424) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~----~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~ 308 (424) ........-.+..-.+.+..- + ...++|. .++.++....+.++.+..+...+.|....|++|..++...+ T Consensus 311 ~~~~~~~~~~fd~~~~~y~~i-~----~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~- 384 (518) T protein:vir:78 311 NKSTDKEEWSMNVDEDYFMQF-K----GTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNR- 384 (518) T ss_pred CCCCCccccccCCCCceEEEe-c----CcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccc- Confidence 100000000000000000000 0 0011222 36777777778889999999999999999999999975322 Q ss_pred CccchhHH----------HHHHHHHHHHHHHHHHHHHHHHHhhccCcc--c-cccceeeecchhhhccCHHHHHHHHHHH Q lcl|NC_019719. 309 TSWGSGIE----------QQNLGFLQYTLQPYISRWENSIQRWLIPAK--D-VGRIHAEHNLDGLLRGDSASRAAFMKAM 375 (424) Q Consensus 309 ~~~~~n~e----------~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~--~-~~~~~~~fd~~~l~~~d~~~~~~~~~~~ 375 (424) ..+..-+. ......++.+|.-++..+...+........ . ...+.+.++++.-+..|.++.++...++ T Consensus 385 ~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~ 464 (518) T protein:vir:78 385 EVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNM 464 (518) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHH Confidence 22211111 111223333333333333322221111101 1 1224567777888889999999999999 Q ss_pred HhCCCCCHHHHHHHh--CCCCCCCCCeeeec-----ccc-cchhhccccCCCcccC Q lcl|NC_019719. 376 GEAGLRTINEMRRTD--NLPPLPGGDVAMRQ-----SQY-VPITDLGTNKEPRNNG 423 (424) Q Consensus 376 ~~~g~~T~NE~R~~~--G~~p~~~gd~~~~~-----~n~-~~~~~~~~~~~~~~~g 423 (424) +.+|+|++.++-+++ |+.. ++.++.+.. +.. 
.+....-...++ ++| T Consensus 465 v~aGimS~e~~i~~~~~~~~d-eea~~e~~ri~~E~~~~~~~~p~~~~g~~~-~~g 518 (518) T protein:vir:78 465 NSALAMSVEEKVKLIHPKWED-EEIQAEVKRIYLENAIGEVPDPEAIGGMET-KGG 518 (518) T ss_pred HhcCCCCHHHHHHHhCCCCCH-HHHHHHHHHHHHHhcccCCCCCccccCCCC-CCC Confidence 999999998865554 3321 111111100 000 000000000111 122 No 204 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=97.40 E-value=7.5e-05 Score=43.19 Aligned_cols=394 Identities=10% Similarity=0.010 Sum_probs=161.6 Q ss_pred CCCCcccccCCCCCchH-HHHHhhccCcccCcccccccccccccccccC---ccc-ccHHHHh----hhHHHHHHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWW-ARLQSWFVGGRLVTPNQGSQTGPVSAHGHLG---DSS-INDERIL----QISTVWRCVSLIS 71 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~-~~~~~~~----~~~~v~~~i~~ia 71 (424) .+-|.=.|+-..-..++ .+|...+.... ++.......+.+..... ... -.....+ .+.+...+|+..+ T Consensus 2 ~~~p~~~l~~~~~~~~~~~~l~~~~~~~~---~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~ 78 (479) T protein:vir:99 2 IDLPDEDLSSEGLAKYLETKVFPKMNTEC---ERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFA 78 (479) T ss_pred ccCCcccCChhHHHHHHHHHHHHHHHHHh---HHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHH Confidence 33333222211100111 11111111110 00000001111111000 000 0111111 2345566777776 Q ss_pred HhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee-----CCCCceeeEE Q lcl|NC_019719. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR-----NSAGDVISLL 146 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r-----~~~G~~~~l~ 146 (424) +.+---.|. .. ++. ....+.+++. . | ........+..+++.+|.||+++-+ +..|.+ .+. 
T Consensus 79 ~~l~~~gf~---~~-d~~-----~~~~~~~i~~-~-N---~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~-~i~ 143 (479) T protein:vir:99 79 QQLIVDGYR---KT-GTN-----ENAKGWDTWR-L-N---QMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVA-RIK 143 (479) T ss_pred hhccccccc---CC-Cch-----hhHHHHHHHH-h-c---ChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCce-EEE Confidence 654322222 11 111 1233555554 2 3 2335567788889999999998764 333443 466 Q ss_pred eecCceEEEEEcCCce----EEEEE--ecCceEEec-----------------------HhH--eeEeccCCCCccccCc Q lcl|NC_019719. 147 PLQSANMDVKLVGKKV----VYRYQ--RDSEYADFS-----------------------QKE--IFHLKGFGFTGLVGLS 195 (424) Q Consensus 147 ~l~~~~v~~~~~~~~~----~~~~~--~~~~~~~~~-----------------------~~e--vih~r~~~~~~~~G~s 195 (424) .++|..+.+..++... .|.+. ..+....+. -.. |+++++.......|.| T Consensus 144 ~~~p~~~~~iydd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~g~s 223 (479) T protein:vir:99 144 CIDPRDAFAIWEDPYWDEWPKYLLERQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDLRGVCYG 223 (479) T ss_pred EechhheEEEecCCcccceeeEEEeecCceeEEEEecceEEEEEecCCceeeccccccCCCCcceEEeecCCCcCcCCcc Confidence 7788888776543211 11111 010000000 011 3444433222236888 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-CCCceeeecccC Q lcl|NC_019719. 196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-EAGFSTSAIGVT 274 (424) Q Consensus 196 ~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l-~~g~~~~~l~~~ 274 (424) .+......++.......-......-.+.|..++.-.. ..+....+ ...+.. ..++++.+ +++.++.++... 
T Consensus 224 d~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~-~~~~~~~~--~~~~~~-----~~~~i~~~~~~~~~~~q~~~~ 295 (479) T protein:vir:99 224 DVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLM-LPEGANAD--QEKMRF-----AQESMLISQNEKASFGAIPAA 295 (479) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCC-cccccccc--hhcccc-----ccccceeecCCCceEEEeccc Confidence 7777666666655554444444444455655553211 11111000 011111 11234433 455677666532 Q ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHH----HHHHHHHh--hccCcccc Q lcl|NC_019719. 275 PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYIS----RWENSIQR--WLIPAKDV 348 (424) Q Consensus 275 ~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~----~ie~~l~~--~l~~~~~~ 348 (424) .. -.+.+..+.....|+..=++|++.+|...+ .|+.........+... ..=... .|++.+.. .+....+. T Consensus 296 ~~-~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n--~Sg~Al~~~~~~l~~k-a~~~~~~f~~al~~~~~l~~~~~~~~~~ 371 (479) T protein:vir:99 296 PL-DGLLNAYKESLLEFLALAQLPPHIAGQIVN--VAADALAAGTRQTMQK-LFEKQATWKASHNQTMRLVNKIEGRTEE 371 (479) T ss_pred ch-HHHHHHHHHHHHHHhccCCCCHHHcccccc--hHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHcCCCcc Confidence 22 246677888888999999999999975432 2221222111111111 111111 12221111 01111111 Q ss_pred -ccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCC--CCC----------Ceeeeccc--ccchh- Q lcl|NC_019719. 349 -GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTD-NLPPL--PGG----------DVAMRQSQ--YVPIT- 411 (424) Q Consensus 349 -~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~-G~~p~--~~g----------d~~~~~~n--~~~~~- 411 (424) ..+.+.+.+......+..+.++.+.+++++|+++...+.+++ |+.+. +.. +.+..... ..+.. T Consensus 372 ~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (479) T protein:vir:99 372 ATDLDFTITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQ 451 (479) T ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccc Confidence 112344444455566788899999999999998887777766 77542 100 00000000 00000 Q ss_pred -----hccccCCCcc-cCC Q lcl|NC_019719. 
412 -----DLGTNKEPRN-NGA 424 (424) Q Consensus 412 -----~~~~~~~~~~-~ga 424 (424) ...+.++..+ .|. T Consensus 452 ~~~~~~~~~~~~~~~~~~~ 470 (479) T protein:vir:99 452 RGGPNGATNMQQANNKTGE 470 (479) T ss_pred cCCCCCCCCCCCCCCCCcc Confidence 0000000000 000 No 205 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=97.36 E-value=8.6e-05 Score=42.89 Aligned_cols=391 Identities=7% Similarity=-0.040 Sum_probs=179.4 Q ss_pred CCCCcccccCCCCCchH--HHHHhhccCcccCccccccccccccccc--------c-cCccccc----HHHHhhhHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWW--ARLQSWFVGGRLVTPNQGSQTGPVSAHG--------H-LGDSSIN----DERILQISTVWR 65 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~--~~l~~~~~~~~~~~~~~~~~~~~~~~~~--------~-~~~~~~~----~~~~~~~~~v~~ 65 (424) |+.--+.+.+......+ +.+.....+.. .+........+.+.. . ..+.... +..=+.++.... T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~--~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~ 84 (479) T protein:vir:79 7 SETDLIKVQLKKESTINLVKVIEHYILKHR--PEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKL 84 (479) T ss_pred cccceEeeccccCChhHHHHHHHHHHhhhh--HHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHH Confidence 44444444444444322 11111111110 000000000000000 0 0000000 000122455666 Q ss_pred HHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeE Q lcl|NC_019719. 66 CVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISL 145 (424) Q Consensus 66 ~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l 145 (424) +|+..+.-+-+-|+.+-- .+.. ...+.+.+.. 
| ........+..+.+.+|.+|..+..+.+|.+ .+ T Consensus 85 Ivd~~~~~l~g~p~~~~~--~~~~------~~~~~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i 150 (479) T protein:vir:79 85 LVDQKVGYSVGNPIVFNA--DDDN------LTKLLNDLLG--E---EFDDTITELYLNASNKGVEWLHPYINRKGEF-KY 150 (479) T ss_pred HHHHHHhhhhcCCceecc--CCHH------HHHHHHHHHh--c---CHHHHHHHHHHHHHhcCeEEEEEEeCCCCce-EE Confidence 788887777777766521 1111 1123333332 2 3455567778899999999999988888876 47 Q ss_pred EeecCceEEEEEcCCc---e-----EEEEE-ecCce----EEecHhHeeEeccC-------------------------- Q lcl|NC_019719. 146 LPLQSANMDVKLVGKK---V-----VYRYQ-RDSEY----ADFSQKEIFHLKGF-------------------------- 186 (424) Q Consensus 146 ~~l~~~~v~~~~~~~~---~-----~~~~~-~~~~~----~~~~~~evih~r~~-------------------------- 186 (424) ..++|..+.+..++.. . +|... ..+.. ..+.++.+.|++.- T Consensus 151 ~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (479) T protein:vir:79 151 VIIPAEEAIPIWDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRI 230 (479) T ss_pred EEEccceeEEEEeCCCCCceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccc Confidence 7788888887765321 1 11111 11111 11223333333210 Q ss_pred ----------C----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhC Q lcl|NC_019719. 187 ----------G----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAG 252 (424) Q Consensus 187 ----------~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~ 252 (424) + .+...|.|.+..+...++....+.....+.+...+.|-.++.--.+...++... 
T Consensus 231 ~~~~~~~~~vPvv~~~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~----------- 299 (479) T protein:vir:79 231 NNKEQGWGKVPFIPFKNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFID----------- 299 (479) T ss_pred cccccCCCcccEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchh----------- Confidence 0 022357777777777776666555555555666666776665322221121111 Q ss_pred CcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH----------HHHHHH Q lcl|NC_019719. 253 GPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGF 322 (424) Q Consensus 253 ~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~~ 322 (424) ....++++.++++.+++.+.....+..+....+...+.|...-++|..-.+.. ++.++.... ...... T Consensus 300 ~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--gn~Sg~Ai~~~~~~l~~k~~~~~~~ 377 (479) T protein:vir:79 300 NIRYYKSIKVDGGGGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQNT--GDKSGVALKFLYSLLDLKCSKTEKK 377 (479) T ss_pred hhhhccceecCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCccccccccc--cchhHHHHHHHHHHHHHHHHHHHHH Confidence 11234566677766666665554555566778888888988888886433322 222222221 112223 Q ss_pred HHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCe Q lcl|NC_019719. 323 LQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDV 400 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~--gd~ 400 (424) +...+.-.++.+...++..-....+ ...+.+.+..-+..|.++.++.+.++. |+++...+.++++.-+.+. .+. T Consensus 378 ~~~~l~~~~~li~~~~~~~~~~~~~--~~~i~i~f~~~~p~~~~~~a~~~~kl~--g~iS~et~l~~l~~v~d~~~E~~r 453 (479) T protein:vir:79 378 FKKAIRELLWFVCEYLKISGNKSYD--YKTVQITFNHSMIINEAEKIDMAAKST--GIVSDETIVSNHPWVEDVNDELER 453 (479) T ss_pred HHHHHHHHHHHHHHHHhccCCCccc--cccceEEeCCCCCcCHHHHHHHHHHHh--ccCcHHHHHHhCCCCCCHHHHHHH Confidence 3334444444443333322111111 223444445556678888999988874 8899888888876532111 110 Q ss_pred eeeccc--ccchhhccccCCCcccCC Q lcl|NC_019719. 
401 AMRQSQ--YVPITDLGTNKEPRNNGA 424 (424) Q Consensus 401 ~~~~~n--~~~~~~~~~~~~~~~~ga 424 (424) +-.-.. ...........++..+=+ T Consensus 454 i~~E~~~~~~~~~~~~~~~~~~~~e~ 479 (479) T protein:vir:79 454 LKKQEDTQKEYDDLIPNNQDGVIDET 479 (479) T ss_pred HHHHHHHHHHHHhccCcccCCCcCcC Confidence 000000 000000000000000001 No 206 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=97.35 E-value=8.8e-05 Score=42.82 Aligned_cols=399 Identities=10% Similarity=0.045 Sum_probs=172.0 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccccccccccc--cC------cccccHHHHhhhHHHHHHHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGH--LG------DSSINDERILQISTVWRCVSLIST 72 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~--~~------~~~~~~~~~~~~~~v~~~i~~ia~ 72 (424) ..+|+..+.+.. ..+.++...+.... .++.......+.+... .. .....+..-+.++....+|+..+. T Consensus 21 ~~~~~~~~~~~~--~~i~~~i~~~~~~~--~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~ 96 (481) T protein:vir:10 21 FVVSDLAELLKE--ENLRNFISRHQTEQ--VPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNYAKYVSRFIVG 96 (481) T ss_pred eeeecchhhcCH--HHHHHHHHHHHHHH--HHHHHHHHHHhcCCCcccccCccccccccccccceeecchHHHHHHHHHh Confidence 112222222221 11222222111000 0000000000000000 00 000000111235666778888888 Q ss_pred hhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCce Q lcl|NC_019719. 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSAN 152 (424) Q Consensus 73 ~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~ 152 (424) -+..-|+.+--. +. .....+..++. + | ....+...+..+.+.+|.+|+.+.++.+|.+ .+..++|.. 
T Consensus 97 ~l~g~~~~~~~~--d~-----~~~~~l~~~~~-~-n---~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~-~i~~~~p~~ 163 (481) T protein:vir:10 97 YLTGNPITITHQ--DN-----QTNDKIIELND-L-N---DADEVNSDLALNLSIYGRAYEIVYRDFEDRD-TFKVLDPKS 163 (481) T ss_pred hhccCCceEecC--Ch-----hHHHHHHHHHH-h-c---ChhHHHHHHHHHHHhcCeEEEEEEeCCCCeE-EEEEEcccc Confidence 777777654321 11 11234555554 2 2 3567888899999999999999999888876 577889988 Q ss_pred EEEEEcCCc---e-----EEEEEecC-c----eEEecHhHeeEeccCC---------------------CCccccCchHH Q lcl|NC_019719. 153 MDVKLVGKK---V-----VYRYQRDS-E----YADFSQKEIFHLKGFG---------------------FTGLVGLSPIA 198 (424) Q Consensus 153 v~~~~~~~~---~-----~~~~~~~~-~----~~~~~~~evih~r~~~---------------------~~~~~G~s~~~ 198 (424) +.+..++.. . +|...... . ...+.++.+.+++... .+...|.|.+. T Consensus 164 ~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~ 243 (481) T protein:vir:10 164 TFVVYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEVEHYYNDVPIIEYLNDQFKQGDFE 243 (481) T ss_pred eEEEEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeecccccccCCceeEEEeecCCCCCCchh Confidence 887766432 1 11111111 1 1123444444432110 01235677666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHH Q lcl|NC_019719. 199 FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDA 278 (424) Q Consensus 199 ~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~ 278 (424) .+...++.......-..+.+...+.|..++.-.... +++....++..-. ....... .....+.+.+++-+....... 
T Consensus 244 ~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~-~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~l~~~~~~~ 320 (481) T protein:vir:10 244 NVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDL-DSEDAKAFRDANM-IHLEPGT-NANGSEGKAEVKYVYKQYDVA 320 (481) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCC-Cccchhhhhhccc-eeccccc-cccCCCCCcceeEEeecCCHH Confidence 655555544444333333344445566555432222 2222222211100 0111000 011122333444444444445 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHh------hccCcc---ccc Q lcl|NC_019719. 279 EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR------WLIPAK---DVG 349 (424) Q Consensus 279 ~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~------~l~~~~---~~~ 349 (424) .+.+..+...+.|...-++|....+... ++.++...+.....+.. .+.-....++..|.+ +++... ... T Consensus 321 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~-k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~ 398 (481) T protein:vir:10 321 GVEAYKKRLQNDIHKYTNTPDLNDEQFS-GVQSGESMKYKLFGLEQ-VRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHN 398 (481) T ss_pred HHHHHHHHHHHHHHHHhCCccccccccc-cccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCccc Confidence 5677888889999999999976554332 22222222211111111 111111222222221 011111 111 Q ss_pred cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC--CCCCee-------eecccccchhhccccC--C Q lcl|NC_019719. 350 RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPL--PGGDVA-------MRQSQYVPITDLGTNK--E 418 (424) Q Consensus 350 ~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~--~~gd~~-------~~~~n~~~~~~~~~~~--~ 418 (424) ...+.+.+..-...|..+.++.+.++. |+++...+.+++++-.. ++.+.+ ....+.....+...+. . T Consensus 399 ~~~i~v~f~~~~~~~~~~~a~~~~kl~--g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 476 (481) T protein:vir:10 399 YAELTITFTPNLPKSMMESINAFNALS--GGVSESTRLSLLDFIDNPKEELEKMQEEEAQREKQADKRGYGEAFENHLNV 476 (481) T ss_pred cceeeEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhhccCCccCCCCCCC Confidence 123445455666788889999998885 78988778888776321 111100 0000011111111111 1 Q ss_pred CcccC Q lcl|NC_019719. 
419 PRNNG 423 (424) Q Consensus 419 ~~~~g 423 (424) ..++| T Consensus 477 dd~~g 481 (481) T protein:vir:10 477 DDSNG 481 (481) T ss_pred CCCCC Confidence 12222 No 207 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=97.31 E-value=9.9e-05 Score=42.53 Aligned_cols=387 Identities=10% Similarity=0.008 Sum_probs=168.9 Q ss_pred CCCCcccccCCC--------------CCchHHHHHhhccCcccCcccccccccccccccc---------cCccc--ccHH Q lcl|NC_019719. 1 MEEPKYTIDLRT--------------NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGH---------LGDSS--INDE 55 (424) Q Consensus 1 ~~~~~~~~~~~~--------------~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~--~~~~ 55 (424) |-+-.|-++.-. ..-++.++....... .++.......+.+... ..... ..+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~---~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~ 77 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKEN---IDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPD 77 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCChHHHHHHHHHHHHHH---HHHHHHHHHHhcccccccccchhhhccccccccccc Confidence 433322222211 112333332221110 0000000000000000 00000 0001 Q ss_pred HHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe Q lcl|NC_019719. 56 RILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD 135 (424) Q Consensus 56 ~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~ 135 (424) .=+.++....+|+..+.-+-+-|+.+.- .+.. ....+...+. | ........+..+.+.+|.+|+.+. 
T Consensus 78 ~ki~~n~~k~ivd~~~~yl~g~p~~~~~--~~~~-----~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~ 144 (478) T protein:vir:10 78 WRMYTNYHQNLVDQKVAYAVANPVTFGV--DNDK-----ALKQIQHTLN---H---KWDDKLVDILTAASNKGIEWVQPY 144 (478) T ss_pred ceeccchHHHHHHHHhhhhcccCceeec--CChH-----HHHHHHHHHh---c---cHHHHHHHHHHHHhhCCeEEEEEE Confidence 1122456677888888888777876521 1111 1122333332 2 355666777889999999999998 Q ss_pred eCCCCceeeEEeecCceEEEEEcCC---ce---EEEEEecCc--eEEecHhHeeEeccC--------------------- Q lcl|NC_019719. 136 RNSAGDVISLLPLQSANMDVKLVGK---KV---VYRYQRDSE--YADFSQKEIFHLKGF--------------------- 186 (424) Q Consensus 136 r~~~G~~~~l~~l~~~~v~~~~~~~---~~---~~~~~~~~~--~~~~~~~evih~r~~--------------------- 186 (424) .+.+|.+ .+..++|..+.+..++. .. .+.|...+. ...+.++.|.+.+.. T Consensus 145 ~d~~~~~-~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (478) T protein:vir:10 145 VDEEGEF-KTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQ 223 (478) T ss_pred ecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeeeCceEEEEEeCCcEEEEEecCCeeeccccccccccccceec Confidence 8888875 57778888888776532 11 111111111 112233443332210 Q ss_pred -----C---------CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhC Q lcl|NC_019719. 187 -----G---------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAG 252 (424) Q Consensus 187 -----~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~ 252 (424) . .+...|.|.+..+...++....+.....+.+...+.|-.+++-.......+....+ . T Consensus 224 ~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~-------~- 295 (478) T protein:vir:10 224 GNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNL-------K- 295 (478) T ss_pred ccccccCCcceEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccchhhhh-------h- Confidence 0 01235777777766666666655544444455555565555422221111111111 1 Q ss_pred CcccCcceec--CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHH---------- Q lcl|NC_019719. 
253 GPVKKRLWIL--EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL---------- 320 (424) Q Consensus 253 ~~~~g~~~~l--~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~---------- 320 (424) ..+++.+ +.|.+++.+........+.+..+...+.|...-++|..-.... .++.++...+-... T Consensus 296 ---~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Ai~~~~~~l~~k~~~~~ 371 (478) T protein:vir:10 296 ---YYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKF-GNSPSGIALKFMYSNLDLKANKLK 371 (478) T ss_pred ---hCceeEecCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCcccc-ccchHHHHHHHHHHHHHHHHHHHH Confidence 1123333 2334444444444455567778888899999999985322211 12222212211111 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CC Q lcl|NC_019719. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GG 398 (424) Q Consensus 321 ~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~g 398 (424) ..+...+.-.++.|...+. ...+.....+.| +.-+..|..+.++.+.++ +|+++...+.++++.-..+ +. T Consensus 372 ~~~~~~l~~~~~li~~~~~----~~~d~~~i~i~f--~~~~p~~~~e~~~~~~~~--~g~iS~et~i~~~~~v~d~~~E~ 443 (478) T protein:vir:10 372 NKTLTALQELLQYIIDFYR----LDVRVQDIEITF--NFNVMVNELENSQIAMNS--TGLLSKETILGNHSWVQDPVAEM 443 (478) T ss_pred HHHHHHHHHHHHHHHHHhC----CCcccccceEEe--CCCCCCCHHHHHHHHHHH--hCCCChHHHHHhCCCCCCHHHHH Confidence 1122222222222221111 111222233344 455567788888888776 5889887777777652211 10 Q ss_pred Cee-----eecccccchhhc---cccCCCcccCC Q lcl|NC_019719. 399 DVA-----MRQSQYVPITDL---GTNKEPRNNGA 424 (424) Q Consensus 399 d~~-----~~~~n~~~~~~~---~~~~~~~~~ga 424 (424) +.. -.........+. 
.+..++.++++ T Consensus 444 ~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~d~~~ 477 (478) T protein:vir:10 444 ERIEQENIELNQQLPDIEEGLNDEQQRQSEDNQS 477 (478) T ss_pred HHHHHHHHHHHHhccccCCCCcccccccCcCCCC Confidence 000 000011111111 12233344444 No 208 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=97.23 E-value=0.00012 Score=42.01 Aligned_cols=395 Identities=11% Similarity=0.049 Sum_probs=182.6 Q ss_pred CCCCccccc-CCC-CCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCc Q lcl|NC_019719. 1 MEEPKYTID-LRT-NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~~~~~~~-~~~-~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) .++-+--|+ .+. +..=++++..+..+............ .. .. +..-+.++....+|+..+.-+-.-| T Consensus 41 ~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~-~~----~~------~~~ki~~n~~k~Ivd~~~~yl~g~p 109 (501) T protein:vir:27 41 WELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRRK-DR----EM------ADKRAVHNYGRMISKFKTGYLAGNP 109 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCccC-cc----cc------ccceeccchHHHHHHHHhhhhcccC Confidence 111111111 111 11224556666655321111110000 00 00 0011234566778888888887778 Q ss_pred eEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEc Q lcl|NC_019719. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~ 158 (424) +.+.-.+.... ..+...|.. 
-...-........+..+++.+|.+|+++-++.+|.+ .+..++|..+.+..+ T Consensus 110 ~~~~~~d~~~~-------~~~~~~l~~-~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~-~i~~~~p~~~~~v~d 180 (501) T protein:vir:27 110 IRVEYDDNDNN-------SQNDDTIKR-IGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDET-RIKRLNPLETFVIYD 180 (501) T ss_pred eeEecCCccch-------HHHHHHHHH-HHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCce-EEEEEccceeEEEec Confidence 76643222111 122222211 111124567778889999999999999999988875 467788888887766 Q ss_pred CCc---eE--E-EEEe--c-Cc---eEEecHhHeeEecc----------------CC----CCccccCchHHHHHHHHHH Q lcl|NC_019719. 159 GKK---VV--Y-RYQR--D-SE---YADFSQKEIFHLKG----------------FG----FTGLVGLSPIAFACKSAGV 206 (424) Q Consensus 159 ~~~---~~--~-~~~~--~-~~---~~~~~~~evih~r~----------------~~----~~~~~G~s~~~~~~~~i~~ 206 (424) +.. .. + .|.. . +. ...+.++.+.++.. .+ .+...|.|.+..+...++. T Consensus 181 ~~~~~~~~~~ir~~~~~~~~~~~~~~~vyt~~~v~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa 260 (501) T protein:vir:27 181 NSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNEHIYTLDASDDFNEISVTTHAFGTVPITEFLNNVDGIGDYETELYLIDL 260 (501) T ss_pred CCCCCceEEEEEEEEeeecCCcEEEEEEEeCCeEEEEEeCCceeeccccccCCCcccEEEecCCCCCCCchhhhHHHHHH Confidence 421 11 1 1110 0 00 01122333222210 00 1234678888877777777 Q ss_pred HHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHH Q lcl|NC_019719. 207 AVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKF 286 (424) Q Consensus 207 ~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~ 286 (424) ...+.....+.+...+.|-.++.-......++....++.. ........+.....+.+.+++.+.....+..+....+. 
T Consensus 261 ~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 338 (501) T protein:vir:27 261 YDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRT--RLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTR 338 (501) T ss_pred HHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhhc--CceeecccccccCCCCCcceeeeeccCCHHHHHHHHHH Confidence 7666666665566666666665533222222222222211 11111111122223445566666555555556677888 Q ss_pred HHHHHHHHhCCCHHHhcCCCCCCccchhHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcc-Cccccccceeee Q lcl|NC_019719. 287 QVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISRWENSIQRWLI-PAKDVGRIHAEH 355 (424) Q Consensus 287 ~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ie~~l~~~l~-~~~~~~~~~~~f 355 (424) ..+.|+..-++|..-.+... ++.++...+-. ....+...+.-++..+...++..-- ...+. ..+.+ T Consensus 339 l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~--~~i~v 415 (501) T protein:vir:27 339 LNRDIHIFTNIPDMSDTNFS-GNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDE--SLLKI 415 (501) T ss_pred HHHHHHHHhCCcccCccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc--ccceE Confidence 88999999999964443322 22222222211 1122233333333333333322110 01111 22444 Q ss_pred cchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CC----------Ceeeeccccc-chhhccccCCCccc Q lcl|NC_019719. 356 NLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GG----------DVAMRQSQYV-PITDLGTNKEPRNN 422 (424) Q Consensus 356 d~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~g----------d~~~~~~n~~-~~~~~~~~~~~~~~ 422 (424) .+...+..+..+.++.+.++ .|+++..-+.++++.-..| +. +.-.+...+. +.+ ...++.+++ T Consensus 416 ~f~~~~p~n~~e~ad~~~kl--~g~iS~et~l~~l~~v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~~--~~~d~~~~~ 491 (501) T protein:vir:27 416 TFTPNLPKSLNEQVSILTGL--GGQVSQETALSLSGLVESPNEELDKINKEVSEIDFKGYSNDFNEHVG--KYTDEVKET 491 (501) T ss_pred EeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhHhhhcCccccccc--cccCCCCCC Confidence 44666678888899988887 4789888888877653211 11 1001111111 111 111122222 Q ss_pred CC Q lcl|NC_019719. 
423 GA 424 (424) Q Consensus 423 ga 424 (424) ++ T Consensus 492 ~~ 493 (501) T protein:vir:27 492 HT 493 (501) T ss_pred cc Confidence 22 No 209 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=97.21 E-value=0.00013 Score=41.88 Aligned_cols=386 Identities=10% Similarity=0.062 Sum_probs=169.1 Q ss_pred CCCCcccccCCCCC----chHHHHHhhccCcccCccccccccccccccc----ccCcccccHHHHhhhHHHHHHHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNN----GWWARLQSWFVGGRLVTPNQGSQTGPVSAHG----HLGDSSINDERILQISTVWRCVSLIST 72 (424) Q Consensus 1 ~~~~~~~~~~~~~~----G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~v~~~i~~ia~ 72 (424) .+.|++ +.+-++. -.+.++....... .++.......+.+.. .......-+..=+..+.....|+..+. T Consensus 3 ~~~~~~-~~~~~~~~~~~~~i~~~i~~~~~~---~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~ 78 (452) T protein:vir:36 3 YKPPKL-MTFSKDEPITVEVVTKFMEKHKLE---VARYEYLKNMYLGIMAIDDEPAKDSWKPDNRLAVNFTKYIVDTFTG 78 (452) T ss_pred ccCcee-EEcCCccCCCHHHHHHHHHHHHHH---HHHHHHHHHHhccccccccCccccccCccceeecchHHHHHHHHhh Confidence 233332 2222222 2233333222111 011000000010000 000000000111224566677777777 Q ss_pred hhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCce Q lcl|NC_019719. 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSAN 152 (424) Q Consensus 73 ~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~ 152 (424) -+-.-|+.+--. +.. ....+.+++.. | ........+..+.+.+|.+|..+..+.+|.+ .+..++|.. 
T Consensus 79 ~l~g~~~~~~~~--d~~-----~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~ 145 (452) T protein:vir:36 79 YFNGIPVKKSHS--DKE-----ILTKLQEFDNL--N---DMEDEESELAKMACIYGRAFEFLYQDEDTQT-NVVYNSPEN 145 (452) T ss_pred hhcccCceeecC--Chh-----HHHHHHHHHhh--c---ChhHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcccc Confidence 776777664311 111 11233444431 2 3555667788899999999999999988876 567788888 Q ss_pred EEEEEcCCc---eE--EEEE--ecCc--eEEecHhHeeEeccC-----------------C----CCccccCchHHHHHH Q lcl|NC_019719. 153 MDVKLVGKK---VV--YRYQ--RDSE--YADFSQKEIFHLKGF-----------------G----FTGLVGLSPIAFACK 202 (424) Q Consensus 153 v~~~~~~~~---~~--~~~~--~~~~--~~~~~~~evih~r~~-----------------~----~~~~~G~s~~~~~~~ 202 (424) +.+..++.. .. ..+. .... ...+.++.+++++.. + .+...|.|.+..... T Consensus 146 ~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~v~~ 225 (452) T protein:vir:36 146 MFMVYDDTVKQEPLFAVRYGVDEDKKLQGEVYTLLETIKISGENDEISFGEGTYNPYPDLPVVEFYFNEERMSIFESVIS 225 (452) T ss_pred eEEEEcCCCCCceEEEEEEEEecCceEEEEEEecCeEEEEEEcCCceEEecceeccCCcccEEEecCCCCCCcchHHHHH Confidence 887765422 11 1111 1111 011233333222110 0 112357777766666 Q ss_pred HHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC-----CceeeecccChhH Q lcl|NC_019719. 203 SAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-----GFSTSAIGVTPQD 277 (424) Q Consensus 203 ~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~-----g~~~~~l~~~~~d 277 (424) .++....+.....+.+...+.|-.++. +... +++....++ .++++.++. 
+.+++.+.....+ T Consensus 226 liDa~d~~~s~~~~~~~~~~~p~~~~~-g~~~-~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~l~~~~~~ 292 (452) T protein:vir:36 226 LVNAFNKAISEKANDVDYFSDQYLTFL-GAAV-EEEDLKNIR-----------SNRVINYYADGEGKNVDVKFLEKPDSD 292 (452) T ss_pred HHHHHHHHHHHHHHHHHHhcCceeEee-cCCc-Cchhhhhhh-----------hcceEEecCCCCccCCcceeEeecCCH Confidence 666655555555555556666766654 2222 222211111 112333322 2233334433344 Q ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH----------HHHHHHHHHHHHHHHHHHHHHhhccCccc Q lcl|NC_019719. 278 AEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISRWENSIQRWLIPAKD 347 (424) Q Consensus 278 ~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ie~~l~~~l~~~~~ 347 (424) ..+....+...+.|+..-++|.. +....++.++...+.. ....+...+...++.|..-+... -...+ T Consensus 293 ~~~~~~~~~l~~~I~~~s~~p~~--~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-~~~~~ 369 (452) T protein:vir:36 293 SQTENLLDRLTKLIFQTTMVANI--SDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNV-SNKDS 369 (452) T ss_pred HHHHHHHHHHHHHHHHHhCcccc--CcccccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-CCccc Confidence 55567778888899999999853 2222223332222211 11223333333333333322211 11111 Q ss_pred cccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeeee--------cccccchhhccccC Q lcl|NC_019719. 348 VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAMR--------QSQYVPITDLGTNK 417 (424) Q Consensus 348 ~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~--gd~~~~--------~~n~~~~~~~~~~~ 417 (424) .. .+.+.+..-+..|..+.++.+.++ .|+++..-+.++++.-+.+. .++.-. ..+..+ +..+.++ T Consensus 370 ~~--~i~i~f~~~~p~d~~~~a~~~~k~--~g~iS~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~-~~~~~~~ 444 (452) T protein:vir:36 370 WK--DIEYTFTRNEPKDIKEQAETANIL--MGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQP-SEKGTDT 444 (452) T ss_pred cc--cceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccC-CCCcccc Confidence 12 233444555677888899988887 47899888888887643211 110000 000000 0000011 Q ss_pred CCcccCC Q lcl|NC_019719. 
418 EPRNNGA 424 (424) Q Consensus 418 ~~~~~ga 424 (424) +..++.- T Consensus 445 ~~~~~~~ 451 (452) T protein:vir:36 445 VVSETNE 451 (452) T ss_pred cCccccC Confidence 1111111 No 210 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=97.16 E-value=0.00015 Score=41.57 Aligned_cols=387 Identities=11% Similarity=0.016 Sum_probs=167.8 Q ss_pred CCCCccccc------------C--CCCCchHHHHHhhccCcccCccccccccccccccc---------ccCccccc--HH Q lcl|NC_019719. 1 MEEPKYTID------------L--RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHG---------HLGDSSIN--DE 55 (424) Q Consensus 1 ~~~~~~~~~------------~--~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~--~~ 55 (424) |-+-.|-++ . .+..=++.++........ +........+.+-. ...+.... +. T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~---~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~ 77 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENI---DNITMGERYYNHHPDILDAPPKRDVNGDYDETKPD 77 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHH---HHHHHHHHHhcCCCchhcccccccccccccccccc Confidence 333222111 1 111224444433221110 00000000000000 00000000 00 Q ss_pred HHhhhHHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe Q lcl|NC_019719. 56 RILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD 135 (424) Q Consensus 56 ~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~ 135 (424) .-+.++....+|+..+.-+-+-|+.+. . .+.. ....+..++. | ........+..+.+.+|.+|+.+. 
T Consensus 78 ~ki~~n~~~~ivd~~~~~l~g~~~~~~-~-~~d~-----~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~~~ 144 (478) T protein:vir:10 78 WRMYTNYHQNLVDQKVAYAVANPVTFG-V-DNDK-----ALKQIQHTLN---H---KWDDKLVDILTAASNKGIEWVQPY 144 (478) T ss_pred ceeccchHHHHHHHHHhhhccCCeeee-c-CChH-----HHHHHHHHHh---c---CHHHHHHHHHHHHHhcCeEEEEEE Confidence 012245566688887777777777652 1 1111 1123344332 2 345666777889999999999998 Q ss_pred eCCCCceeeEEeecCceEEEEEcCCc---e---EEEEEecCc--eEEecHhHeeEecc---------------------- Q lcl|NC_019719. 136 RNSAGDVISLLPLQSANMDVKLVGKK---V---VYRYQRDSE--YADFSQKEIFHLKG---------------------- 185 (424) Q Consensus 136 r~~~G~~~~l~~l~~~~v~~~~~~~~---~---~~~~~~~~~--~~~~~~~evih~r~---------------------- 185 (424) .+.+|.+ .+..++|..+.+..++.. . .+.|...+. ...+.++++.+.+. T Consensus 145 ~d~~g~~-~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (478) T protein:vir:10 145 VDEEGEF-KTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQ 223 (478) T ss_pred ecCCCee-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCeeeccccccccccccceec Confidence 8888876 577788888887765321 1 111111111 11122222222211 Q ss_pred ----C-----C----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhC Q lcl|NC_019719. 186 ----F-----G----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAG 252 (424) Q Consensus 186 ----~-----~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~ 252 (424) + + .+...|.|.+..+...++....+.....+.+...+.|-.++.-.......+....+ T Consensus 224 ~~~~~~~~~vPvv~~~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~--------- 294 (478) T protein:vir:10 224 GNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNL--------- 294 (478) T ss_pred ccccccCCccceEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhhhhh--------- Confidence 0 0 02345777777666666666655555555555556676555432221111111111 Q ss_pred CcccCcceec--CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH----------HH Q lcl|NC_019719. 
253 GPVKKRLWIL--EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NL 320 (424) Q Consensus 253 ~~~~g~~~~l--~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~ 320 (424) ..++++.+ +.|.+.+.+........+.+..+...+.|...-++|..-..... ++.++...+.. .. T Consensus 295 --~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~ 371 (478) T protein:vir:10 295 --KYYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFG-NSPSGIALKFMYSNLDLKANKLK 371 (478) T ss_pred --hhcceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccccc-cccHHHHHHHHHHHHHHHHHHHH Confidence 11223333 23344444444334455667788888888888888853332211 22222111111 11 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CC Q lcl|NC_019719. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GG 398 (424) Q Consensus 321 ~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~g 398 (424) ..+..++.-+++.|...+. ...+.....+.| +.-+..|..+.++.+.++ +|+++...+.+++++-..+ +. T Consensus 372 ~~~~~~l~~~~~li~~~~g----~~~~~~~i~i~f--~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~ 443 (478) T protein:vir:10 372 NKTLTALQELLQYIIDFYR----LDVKVQDIEITF--NFNVMVNELENSQIAMNS--TGLLSKETILSNHAWVEDPVAEM 443 (478) T ss_pred HHHHHHHHHHHHHHHHHhC----CCcccccceEEe--cCCCCCCHHHHHHHHHHH--hCCCChHHHHHhCCCCCCHHHHH Confidence 1222222222222221111 111222233344 445567888889888887 6899998888888763321 11 Q ss_pred Ceeee-----cccccch-hh-ccccC-CCcccCC Q lcl|NC_019719. 399 DVAMR-----QSQYVPI-TD-LGTNK-EPRNNGA 424 (424) Q Consensus 399 d~~~~-----~~n~~~~-~~-~~~~~-~~~~~ga 424 (424) +.+-. ....... .. .++.+ +.+++.. 
T Consensus 444 ~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (478) T protein:vir:10 444 ERIEQENIELNQQLPDIEEGLNGEQQRQSENNQP 477 (478) T ss_pred HHHHHHHHHHHhhccccccccCCCCCCCCCCCCC Confidence 10000 0000111 11 11111 2222222 No 211 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=96.87 E-value=0.00029 Score=39.98 Aligned_cols=380 Identities=7% Similarity=-0.028 Sum_probs=159.1 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCccccccccccccccc---------c--cCcccccHHHHhhhHHHHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHG---------H--LGDSSINDERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---------~--~~~~~~~~~~~~~~~~v~~~i~~ 69 (424) .|++++...+..+ ++.++.+...... +........+.+.. . .......+..=+.++....+++. T Consensus 17 ~~~~~~~~~~~~~--~i~~~i~~~~~~~---~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~ 91 (468) T protein:vir:96 17 VEQIKPQYETQEE--MILRLITKHKENV---EDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRMYTNYHQNLVDQ 91 (468) T ss_pred eecccccccCcHH--HHHHHHHHHHHHH---HHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHH Confidence 3444433322222 3333332221110 00000000000000 0 00000001111234555667777 Q ss_pred HHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeec Q lcl|NC_019719. 70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQ 149 (424) Q Consensus 70 ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~ 149 (424) .+.-+-+-|+.+-- .+. .....+...+. 
| +.......+..+.+.+|.+|..+..+.+|.+ .+..++ T Consensus 92 ~~~~l~g~p~~~~~--~d~-----~~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~ 157 (468) T protein:vir:96 92 KVAYAVANPVTYGT--EDE-----KSLKTIQEVLN---H---KWDDKLVDILTAASNKGVEWIQPYVDEQGEF-KTFRVP 157 (468) T ss_pred HHhhhccCCceecc--CCh-----HHHHHHHHHHh---c---CHHHHHHHHHHHHhhcCeEEEEEEEcCCCce-EEEEEc Confidence 77776666766421 111 11223444442 2 3455566788999999999999888888865 577788 Q ss_pred CceEEEEEcCC---ce---EEEEEecC--ceEEecHhHeeEeccC-------------------------------C--- Q lcl|NC_019719. 150 SANMDVKLVGK---KV---VYRYQRDS--EYADFSQKEIFHLKGF-------------------------------G--- 187 (424) Q Consensus 150 ~~~v~~~~~~~---~~---~~~~~~~~--~~~~~~~~evih~r~~-------------------------------~--- 187 (424) |..+.+..++. .. .+.|...+ ....+.++.+.+.+.. + T Consensus 158 p~~~~~v~~~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~ 237 (468) T protein:vir:96 158 AEQAIPIWTNKERDELKAFIRLYELDGGERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIP 237 (468) T ss_pred ccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCceeecccccccccccceeeccccccCCcccEEE Confidence 88887765432 11 11111111 1111222222222110 0 Q ss_pred -CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC--C Q lcl|NC_019719. 188 -FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE--A 264 (424) Q Consensus 188 -~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~--~ 264 (424) .+...|.|.+..+...++....+.....+.++..+.|-.+++-.... +.+ ....... 
.++++.++ + T Consensus 238 ~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~-~~~------~~~~~~~----~~~~i~~~~d~ 306 (468) T protein:vir:96 238 FKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGE-DLE------EFMYNLK----YYKAINVDGDG 306 (468) T ss_pred ecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcc-ccc------hhhhhhh----cCceEEecCCC Confidence 02345777777666666666555555555556666676665432211 111 1111111 12344443 3 Q ss_pred CceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH----------HHHHHHHHHHHHHHHH Q lcl|NC_019719. 265 GFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISRW 334 (424) Q Consensus 265 g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~i 334 (424) +.+.+.+........+....+...+.|...-++|..-... ..++.++...+.. ....+...++-+++.| T Consensus 307 ~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~-~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li 385 (468) T protein:vir:96 307 SGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDK-FGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYI 385 (468) T ss_pred CCcceEEeecCChHHHHHHHHHHHHHHHHHhCcccccccc-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3344444444344455667788888899998988532211 1122222222111 1112222222222222 Q ss_pred HHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeeeec-ccccchh Q lcl|NC_019719. 335 ENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAMRQ-SQYVPIT 411 (424) Q Consensus 335 e~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~~~~-~n~~~~~ 411 (424) ...+. ...+.....+.| +.-+..|..+.++ .+...|+++.-.+.++++.-..| +.++.-.- ....... T Consensus 386 ~~~~g----~~~d~~~i~i~f--~~~~p~d~~e~a~---~~~~~g~iS~et~i~~l~~v~D~~~E~~ri~~E~~~~~~~~ 456 (468) T protein:vir:96 386 IDFYK----LSIKVQDVEITF--NFNVMVNELEQSQ---IGVNSQYLSKETVVTNHPWVDDPVAEMERIDQEELALPSIE 456 (468) T ss_pred HHHhC----CCcccceeeEEe--cCCCCcCHHHHHH---HHHhcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHh Confidence 21111 111222233344 3334455554444 45667999988888877552211 11100000 0000000 Q ss_pred ---hccccCCCc Q lcl|NC_019719. 
412 ---DLGTNKEPR 420 (424) Q Consensus 412 ---~~~~~~~~~ 420 (424) ...++.+|. T Consensus 457 ~~~~~~~~~~~~ 468 (468) T protein:vir:96 457 EGLNGKENNEPT 468 (468) T ss_pred hccCCCCCCCCC Confidence 001111111 No 212 >protein:vir:97376 Length: 320 # NCBI annotation: putative portal protein # Family: family:all:11744 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762589;genbank:gi:115304290;genbank:GeneID:5130579 Probab=96.64 E-value=7e-05 Score=43.36 Aligned_cols=306 Identities=12% Similarity=0.110 Sum_probs=146.9 Q ss_pred CchHHHHHhhccCcccCcccccc--ccccc-ccccccCcccccHHHHhhhHHHHHHHHHHHHhhccCceEEEEecccCcc Q lcl|NC_019719. 14 NGWWARLQSWFVGGRLVTPNQGS--QTGPV-SAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 14 ~G~~~~l~~~~~~~~~~~~~~~~--~~~~~-~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~ 90 (424) -|+|+ |.++...+|.... ..... --.+...|.. -|...+-+|.+||.- +..|+...+ T Consensus 1 ~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~-~~~~~~~~~--- 60 (320) T protein:vir:97 1 MGIFN-----FKKRETLTPELKESIIRQVTIEDESPFTGTT-----------DFNVRNEVAESIATY-LGAYKTSAK--- 60 (320) T ss_pred CCccc-----cccccccChhHHhhhhheeeeccCCCccccc-----------ccchhhHHHHHHHHH-hhhhccccc--- Confidence 45554 2333333332111 00000 0001111111 111223333444321 112222111 Q ss_pred ccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEEcCCceEEEEEecC Q lcl|NC_019719. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDS 170 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~ 170 (424) ...||. .| ..|++.++.|.+..-..|++.-+. .|.++ -++-++.-.+. ...+...+.. 
T Consensus 61 --------~~~~~~--~~-----~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~----~~~~~~~~~~~--~~~~~~~D~F 118 (320) T protein:vir:97 61 --------RLSLLT--NN-----PSFLRRLVKHALHNKTTYVYKSPT-YGWLI----TDSMTIEGLRA--RLTFTLPDPF 118 (320) T ss_pred --------eeeeee--CC-----HHHHHHHHHHhhcccceEEeeCCc-cceee----ecceeeeeeee--eEEEecCccc Confidence 112232 22 368999999999999999887543 23221 11111110000 0000000000 Q ss_pred ---ceEEecHhHeeEeccCCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHH Q lcl|NC_019719. 171 ---EYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENF 247 (424) Q Consensus 171 ---~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~ 247 (424) .+..++-.|+=.+ .++++|+-+-++.. +...+....-+-+.|.+....+++++.+..-++.+++....+ T Consensus 119 N~~V~mtvpfyD~~IL----dnpl~gv~tqe~gk----M~g~a~~~v~kkL~~~~~IKafi~Tdid~GLee~kD~~~~kI 190 (320) T protein:vir:97 119 NSAVTMTVPFYDVGII----DSPLVEVDTEEANK----MLEAAYSAVMKKLHNTGAIKAFISSDIDVGLEKMKEESDSKI 190 (320) T ss_pred ceeEEEEeeeechhhh----hhhhcccChHHhhH----HHHHHhhhhhhhccccceeEEEEecccchhHHHHHHHHHHHH Confidence 0111111111111 24567877753322 222233334455667777888888877665567777777777 Q ss_pred HHHhCCcc-cCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHH Q lcl|NC_019719. 248 KEIAGGPV-KKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 248 ~~~~~~~~-~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) +++..-.+ -.++-+++.|-+++++.....-.. ..-....++..+.-|+||..+|-+. ..+.+..+|+... 
T Consensus 191 k~mq~~A~~~nG~T~i~~~dDI~Qi~pDYS~sn-~~D~~l~~t~alS~y~m~~~IL~Gs--------Ate~~~Iaf~~~~ 261 (320) T protein:vir:97 191 KAMLATAELLSGYTYIQRGDDVTQMMPDYTTSN-VTDFAAMRTFAASQLSVSDKILDGS--------ATDGEKVAVMFRF 261 (320) T ss_pred HHHHHHHHHhcCcccccCCcceeeecccccccc-hhHHHHHHHHHHhhcCCchhhcccc--------CCcceeeehhhHh Confidence 66655433 457888999999999987655432 3335566778889999999998543 2256778999999 Q ss_pred HHHHHHHH---HHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC----CCC Q lcl|NC_019719. 327 LQPYISRW---ENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP----GGD 399 (424) Q Consensus 327 l~P~~~~i---e~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~----~gd 399 (424) +.|+++++ |-+|.+++-.+ .++-|-.. .|-+..|-+- -.|.+.-| ||| T Consensus 262 V~PLL~Q~~~~Ek~Lvy~m~~E-----~FVs~mtT-------------------GG~l~S~~~~-~~~~~~~~~~~~~~~ 316 (320) T protein:vir:97 262 VEPILEQFREYEPSLIYAMRDE-----FFVSFMTT-------------------GGMLNSNRVD-GWGKEKAPNESKGGD 316 (320) T ss_pred HHHHHHHhhhcCcceeeeeccc-----eeeeeeec-------------------Cceeeccccc-ccccccCCccccCCc Confidence 99999997 55665544211 11111100 3333333211 12333222 333 Q ss_pred eeee Q lcl|NC_019719. 400 VAMR 403 (424) Q Consensus 400 ~~~~ 403 (424) +--+ T Consensus 317 ~~~~ 320 (320) T protein:vir:97 317 VGDV 320 (320) T ss_pred ccCC Confidence 2222 No 213 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=96.59 E-value=0.00048 Score=38.76 Aligned_cols=382 Identities=7% Similarity=0.023 Sum_probs=169.8 Q ss_pred CCC--------Cc------ccccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHH Q lcl|NC_019719. 1 MEE--------PK------YTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRC 66 (424) Q Consensus 1 ~~~--------~~------~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ 66 (424) ||+ .. +-=+++++..-++++..+..+.-.. ..+. ..... 
-+..-+..+....+ T Consensus 5 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i-~~~~-----------~~~~~-~~~~ki~~n~~~~I 71 (499) T protein:vir:10 5 IDKDLLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEI-EKHE-----------FDNAT-VEAANVMVNHAKYI 71 (499) T ss_pred hhhhHHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhccccch-hcCC-----------cCcCC-CCcceeecchHHHH Confidence 111 10 0002233334444444444332110 0000 00000 00111123455667 Q ss_pred HHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce---- Q lcl|NC_019719. 67 VSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV---- 142 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~---- 142 (424) |+..+.-+-+-|+.+--. +. ...+.+.+++.. | ....+...+..+.+.+|.+|.++-.+.+|.+ T Consensus 72 v~~~~~~l~g~p~~~~~~--~~-----~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~ 139 (499) T protein:vir:10 72 TDMNVGFMTGNPVKYVAE--KG-----KNIDDILEVFNQ--I---DIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRD 139 (499) T ss_pred HHHHhhhhcccCceeecC--Ch-----hHHHHHHHHHhh--c---CHhHHHHHHHHHHHhcCceEEEEEecccccccccc Confidence 777777776777664321 11 112234454432 2 3455677888999999999999888887743 Q ss_pred ------------eeEEeecCceEEEEEcCCce-------EEEEEe---cCce----EEecHhHeeEeccC---------- Q lcl|NC_019719. 143 ------------ISLLPLQSANMDVKLVGKKV-------VYRYQR---DSEY----ADFSQKEIFHLKGF---------- 186 (424) Q Consensus 143 ------------~~l~~l~~~~v~~~~~~~~~-------~~~~~~---~~~~----~~~~~~evih~r~~---------- 186 (424) ..+..++|..+.+..++... +|.+.. +... ..+.++.|.+++.. T Consensus 140 ~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~ 219 (499) T protein:vir:10 140 ELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDP 219 (499) T ss_pred cccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcce Confidence 34667788777766553221 111111 1111 12344444443210 Q ss_pred ------C----------CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019719. 
187 ------G----------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 187 ------~----------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) + .+...|.|.+..+...++....+.....+.+...+.|-.+++-............+ T Consensus 220 ~~~~~~~~~g~vPvv~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~------- 292 (499) T protein:vir:10 220 IVYDGENLFGAVPIIEFRNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDIQRL------- 292 (499) T ss_pred ecccccCCCCccceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccchhhhh------- Confidence 0 01234667677666666666655555555556666677666532111111111111 Q ss_pred hCCcccCccee--cCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH----------HH Q lcl|NC_019719. 251 AGGPVKKRLWI--LEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQ 318 (424) Q Consensus 251 ~~~~~~g~~~~--l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~ 318 (424) ..+++.. .++|.+++.+........+....+...+.|...-++|..-.... .++.|+...+ +. T Consensus 293 ----~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~gn~Sg~Al~~~~~~l~~k~~~ 367 (499) T protein:vir:10 293 ----KRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKF-MGNVSGEAMKFKLFGLENLLSI 367 (499) T ss_pred ----hhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhh-cccchHHHHHHHHHHHHHHHHH Confidence 1122222 24555666665544445556677777888888888874211111 1222221211 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC-- Q lcl|NC_019719. 319 NLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP-- 396 (424) Q Consensus 319 ~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~-- 396 (424) ....+...+.-.++.+...++.. 
........+.+.+..-+..|..+.++.+.++ .|+++..-++++++.-..+ T Consensus 368 k~~~~~~~l~~~~~li~~~~~~~---~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~ 442 (499) T protein:vir:10 368 KQRYFFDGLRRRLKLIQTIVNIK---GANDDASGCKISLVANIPSNLSDVVNNVKNA--DGIIPRKYTYSWLPDVDNPQD 442 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHhcc---CCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHH Confidence 12223333333333333332211 1111112234444555567889999999988 5889888777777653211 Q ss_pred CCC--------------eeeecccccchhhccc---cC-CCcccCC Q lcl|NC_019719. 397 GGD--------------VAMRQSQYVPITDLGT---NK-EPRNNGA 424 (424) Q Consensus 397 ~gd--------------~~~~~~n~~~~~~~~~---~~-~~~~~ga 424 (424) ..+ ..+...+..+...... .+ ...++++ T Consensus 443 E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (499) T protein:vir:10 443 VIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQDDSSENDKEAGS 488 (499) T ss_pred HHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCcccCCCCCCCcc Confidence 100 1111111111111111 11 1111111 No 214 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=96.59 E-value=0.00049 Score=38.75 Aligned_cols=377 Identities=9% Similarity=0.033 Sum_probs=171.1 Q ss_pred ccCCCCCchHHHHH--------------hhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019719. 8 IDLRTNNGWWARLQ--------------SWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 8 ~~~~~~~G~~~~l~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 73 (424) |++.+-.-+.+++. ....+.- ....+................. 
.+..=+.++.....|+..+.- T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~-~I~~~~~~~~~~~~~~~~~~~~-~~~~ki~~n~~k~Iv~~~~~y 78 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKT-DITTRNNGKAKLNKEGKKDPLR-SADNRIPSNFYQLLVDQEAGY 78 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-chhccccchhcccccccccccc-cCCcccccchHHHHHHhhhhh Confidence 55544444444333 2222210 0000000000000000000000 001112244455667777777 Q ss_pred hccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceE Q lcl|NC_019719. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) +-+-|+.+--.+ .. ..+.+.+.++. +..+-...+..+++.+|.+|.++-.+.+|.+ .+..++|..+ T Consensus 79 l~G~p~~~~~~d--~~-----~~~~l~~~~~~------~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~-~~~~~~p~~~ 144 (470) T protein:vir:10 79 VASVFPDIDVGK--DA-----DNKKIIDVLGD------DRALTLNGLLVDSSNAGRAWLHYWIDEDGNF-RYGIIQPDQI 144 (470) T ss_pred eeccceeeecCc--hH-----HHHHHHHHHhh------hHHHHHHHHHHHHhhcCeeEEEEEecCCCce-EEEEEcccce Confidence 777777653221 11 11234444531 2344455677888999999999999988875 5777888888 Q ss_pred EEEEcCCc---e-----EEEEEe-cC-ce----EEecHhHeeEeccC--------------------------------- Q lcl|NC_019719. 154 DVKLVGKK---V-----VYRYQR-DS-EY----ADFSQKEIFHLKGF--------------------------------- 186 (424) Q Consensus 154 ~~~~~~~~---~-----~~~~~~-~~-~~----~~~~~~evih~r~~--------------------------------- 186 (424) .+..++.. . +|.... .+ .. ..+.++.+.|++.. T Consensus 145 ~~v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (470) T protein:vir:10 145 TPIYATTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHN 224 (470) T ss_pred EEEEcCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccC Confidence 88776431 1 111111 11 00 11223333332210 Q ss_pred ----C----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCc Q lcl|NC_019719. 
187 ----G----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKR 258 (424) Q Consensus 187 ----~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 258 (424) + .+...|.|.+......++....+.....+.+...+.|-.++........++.... ++ ..+ T Consensus 225 ~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~----~~-------~~~ 293 (470) T protein:vir:10 225 FGRVPFIEFSKNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMND----LR-------KYK 293 (470) T ss_pred CCeeeEEEeecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhh----hh-------hcC Confidence 0 0123577777777777777666666566666666667766653322222222111 11 112 Q ss_pred ceecC-----CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHH----------HHHHH Q lcl|NC_019719. 259 LWILE-----AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFL 323 (424) Q Consensus 259 ~~~l~-----~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~~~ 323 (424) .+.++ .+.+++-+........+....+...+.|...-++|.. .....++.|+....-. ....+ T Consensus 294 ~i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~--~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~ 371 (470) T protein:vir:10 294 SIKINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDP--ANFESSNASGVAIKMLYSHLELKAAKTQTYF 371 (470) T ss_pred eEeccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCC--CccccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 23332 1233333333323334456677778888888888842 2222233333222211 12223 Q ss_pred HHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCee Q lcl|NC_019719. 
324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~--~gd~~ 401 (424) ..+++-.++.|...++ ..+.....+.+.++.-+..|..+.++.+.++ +|+++..-+.++++.-..+ ..+++ T Consensus 372 ~~~l~~~~~~i~~~l~-----~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~--~g~iS~et~l~~~p~v~D~~~E~eri 444 (470) T protein:vir:10 372 EHAINELVRAIMRYLN-----FSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDDWQQELKDL 444 (470) T ss_pred HHHHHHHHHHHHHHhc-----ccCcccceeeEEeccCCCCCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHH Confidence 3333333333333222 1122223455556677778899999998887 5899988888887653221 11100 Q ss_pred ------eecccccchhhcc-ccCCCcc Q lcl|NC_019719. 402 ------MRQSQYVPITDLG-TNKEPRN 421 (424) Q Consensus 402 ------~~~~n~~~~~~~~-~~~~~~~ 421 (424) -.+.+. ...+.. ++.+.++ T Consensus 445 ~~E~~e~~~~~~-~~~~~~~~~~dde~ 470 (470) T protein:vir:10 445 AKDKEENDPYSN-QADELNGKGVNDEQ 470 (470) T ss_pred HHHHHHHHHhhc-cccccCCCCCCCCC Confidence 000000 001100 0000111 No 215 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=96.52 E-value=0.00055 Score=38.47 Aligned_cols=374 Identities=10% Similarity=0.040 Sum_probs=168.2 Q ss_pred CCCC---cccccCCCCCchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHHHHHHHHHHHhhccC Q lcl|NC_019719. 1 MEEP---KYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACL 77 (424) Q Consensus 1 ~~~~---~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~ 77 (424) |... ++.=+++++.--+.++..+..+.-..-..... .... 
...-+.++....+|+..+.-+-.- T Consensus 1 l~~~~l~~~i~~~~~~~~r~~~l~~yy~g~~~il~~~~~----------~~~~---~~~ki~~n~~~~ivd~~~~~l~g~ 67 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFNLSYSAYKQLYEGDHAILQQKQK----------EQYK---PDNRLVVNFAKYIVDTFNGYFIGV 67 (429) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc----------ccCC---CcceeecchHHHHHHHHhhhhccc Confidence 1110 00001122333334444444332100000000 0000 111123566777888888887777 Q ss_pred ceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEEEE Q lcl|NC_019719. 78 PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKL 157 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~ 157 (424) |+.+.-. +.. ....+..++. . | ........+..+.+.+|.+|+.+.++.+|.+ .+..++|..+.+.. T Consensus 68 ~~~~~~~--~~~-----~~~~l~~~~~-~-n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~~~~~~p~~~~~v~ 134 (429) T protein:vir:98 68 PVQTSHE--NKQ-----VSNYLELLDG-Y-N---DQDDNNAELSKICSIYGHGYELVFNDENAEA-GITYLTPLEAFIVY 134 (429) T ss_pred CceeecC--ChH-----HHHHHHHHHh-h-c---CHhHHHHHHHHHHhhcCeEEEEEEecCCCcE-EEEEEcccceEEEE Confidence 7765311 110 1122333333 2 2 3456667888999999999999999999876 56678888887765 Q ss_pred cCCc---eEE--EE-E-ecCc-eEEecHhH--------------------------eeEeccCCCCccccCchHHHHHHH Q lcl|NC_019719. 158 VGKK---VVY--RY-Q-RDSE-YADFSQKE--------------------------IFHLKGFGFTGLVGLSPIAFACKS 203 (424) Q Consensus 158 ~~~~---~~~--~~-~-~~~~-~~~~~~~e--------------------------vih~r~~~~~~~~G~s~~~~~~~~ 203 (424) ++.. ..+ .+ . .+.. ...+...+ |++++ +...|.|.+..+... 
T Consensus 135 dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~----n~~~g~sd~e~v~~l 210 (429) T protein:vir:98 135 DDSIRQKPLFAVRYFYNKGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMIEYV----ENEERQSLLASVVTL 210 (429) T ss_pred eCCCCCceEEEEEEEEecCceEEEEEEeCceEEEEEecCCceEecccccccCCccceEEec----CCCCCCCcHHHHHHH Confidence 5321 110 11 1 1110 00111111 12221 234677877777777 Q ss_pred HHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCC----ceeeecccChhHHH Q lcl|NC_019719. 204 AGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAG----FSTSAIGVTPQDAE 279 (424) Q Consensus 204 i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g----~~~~~l~~~~~d~~ 279 (424) ++....+.....+.....+.|-.+++-. .. +++....+ . .++++.++.+ .+...+........ T Consensus 211 iD~~d~~~s~~~~~~~~~~~p~~~i~g~-~~-~~~~~~~~-------~----~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 277 (429) T protein:vir:98 211 INAFNKAISEKANDVEYFADAYLKILGA-EL-DDETLKSL-------R----DTRIINLKDTDAQQLTVEFLQKPDADAT 277 (429) T ss_pred HHHHHHHHHHHHHHHHHhcCceeeeecC-CC-CcchhhhH-------h----hCceeeccCCCCCCcceeEEeecCCHHH Confidence 7766666666665566667777666522 22 22211111 1 1234444322 23444443333344 Q ss_pred HHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHH----------HHHHHHHHHHHHHHHHHHhhccCccccc Q lcl|NC_019719. 280 MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG----------FLQYTLQPYISRWENSIQRWLIPAKDVG 349 (424) Q Consensus 280 ~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~----------~~~~tl~P~~~~ie~~l~~~l~~~~~~~ 349 (424) +....+...+.|+..-++|..-.. ..++.|+...+..... .+...+.-.++.+...++.. ..... 
T Consensus 278 ~~~~~~~l~~~i~~~s~~p~~~~~--~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~---~~~~d 352 (429) T protein:vir:98 278 QEHLLDRLENLIFRTAMVANISDE--SFGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSK---IGPKD 352 (429) T ss_pred HHHHHHHHHHHHHHHhCccccCcc--ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC---CCccc Confidence 556678889999999999853222 1122232222211111 11122222222222211110 00111 Q ss_pred cceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccccc----chhhccccCCCcccCC Q lcl|NC_019719. 350 RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYV----PITDLGTNKEPRNNGA 424 (424) Q Consensus 350 ~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~----~~~~~~~~~~~~~~ga 424 (424) ...+.+.+...+..|..+.++.+.++ .|+++..-+.++++.-+.|..+--.+...-. ........++.+++.= T Consensus 353 ~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~is~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 429 (429) T protein:vir:98 353 WIGIKYKFTRNLPANLLEESQIAGNL--AGIVSEETQVGVLSIVENPQKEIERKNSDKSTLISRQAGGLNGQNTTTILE 429 (429) T ss_pred cccceEEeCCCCCcCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhcCCCCCCCCC Confidence 12234445566778888899998887 5789987788888764322111000000000 0000000000000000 No 216 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=96.45 E-value=0.00061 Score=38.19 Aligned_cols=376 Identities=6% Similarity=-0.011 Sum_probs=164.8 Q ss_pred ccCCCCCchHHH--------------HHhhccCcccCcccccccccccc---cccccCcccccHHHHhhhHHHHHHHHHH Q lcl|NC_019719. 8 IDLRTNNGWWAR--------------LQSWFVGGRLVTPNQGSQTGPVS---AHGHLGDSSINDERILQISTVWRCVSLI 70 (424) Q Consensus 8 ~~~~~~~G~~~~--------------l~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~v~~~i~~i 70 (424) |++..-.=++.+ +..+..+.-. ............ ...........+..-+.++....+|+.. 
T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hd-i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~ 79 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNEND-IKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQK 79 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-cccccchhhhhcccccccccccccccccceeccchhHHHHHhh Confidence 555443333333 3333332210 000000000000 0000000000011112344566677777 Q ss_pred HHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC-CCceeeEEeec Q lcl|NC_019719. 71 STLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS-AGDVISLLPLQ 149 (424) Q Consensus 71 a~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~-~G~~~~l~~l~ 149 (424) +.-+-+-|+.+-- .+.. ....+...+ . | ........+...++.+|.+|.++.++. +|. ..+..++ T Consensus 80 ~~yl~G~p~~~~~--~~~~-----~~~~l~~~~--~-n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~-~~~~~~~ 145 (471) T protein:vir:10 80 KAYALTYPPTFDV--DDKK-----VNDMIVDVL--G-D---DYERISKQLCVNAGNAGIAWLHVWKDASDNS-FRYACVD 145 (471) T ss_pred hhhhcccCceecc--CChH-----HHHHHHHHH--h-c---CHHHHHHHHHHHHhhCCeEEEEEEeeCCCCe-eEEEEEc Confidence 7777777776521 1111 111122222 1 2 234456667888999999999998875 465 4677788 Q ss_pred CceEEEEEcCCc---e-----EEEEE--ecCce----EEecHhHeeEeccC----------------------------- Q lcl|NC_019719. 150 SANMDVKLVGKK---V-----VYRYQ--RDSEY----ADFSQKEIFHLKGF----------------------------- 186 (424) Q Consensus 150 ~~~v~~~~~~~~---~-----~~~~~--~~~~~----~~~~~~evih~r~~----------------------------- 186 (424) |..+.+..++.. . +|... .++.. ..+..+.+.|++.. T Consensus 146 p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (471) T protein:vir:10 146 SKEVIPIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNS 225 (471) T ss_pred ccceEEEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCccccccccccccccccccccccccccc Confidence 988887766432 1 11111 01111 11233444443210 Q ss_pred ---CC---------CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCc Q lcl|NC_019719. 
187 ---GF---------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGP 254 (424) Q Consensus 187 ---~~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (424) .+ +...|.|.+......++....+.....+.+...+.|-.+++-......++.... .. T Consensus 226 ~~~~~g~iPvv~~~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~-------~~--- 295 (471) T protein:vir:10 226 FKHDFGLVPFIPFKNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLED-------LK--- 295 (471) T ss_pred ccCCCCceeEEEeccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHHH-------hh--- Confidence 00 122466777766666666665555555555555667666553322222221111 11 Q ss_pred ccCcceecC-----CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHH-------- Q lcl|NC_019719. 255 VKKRLWILE-----AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG-------- 321 (424) Q Consensus 255 ~~g~~~~l~-----~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~-------- 321 (424) .++.+.++ .+.+++-+........+....+...+.|...-++|..-... .++.++...+..... T Consensus 296 -~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~--~gn~Sg~Alk~~~~~l~~k~~~~ 372 (471) T protein:vir:10 296 -RYKMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDK--LGNSSGVALKFLYSLLELKAGNM 372 (471) T ss_pred -cCCeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCccc--ccCccHHHHHHHHHHHHHHHHHH Confidence 11222221 22233333333333445667778888888888888532222 233333223222111 Q ss_pred --HHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC-- Q lcl|NC_019719. 322 --FLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG-- 397 (424) Q Consensus 322 --~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~-- 397 (424) .+...+.-.++.|... +...+.. .+.+.+...+..|..+.++.+.++ .|+++..-+.++++.-..+. 
T Consensus 373 ~~~~~~~l~~~~~li~~~-----~~~~d~~--~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~~p~v~D~~~E 443 (471) T protein:vir:10 373 ETQFRSGYATLVKMILKH-----LGLSDKL--KIKQTWTRNSINNDTEMAQVVSTL--ATITSRENVAKSNPIVEDWQDE 443 (471) T ss_pred HHHHHHHHHHHHHHHHHH-----hccCCCc--eeEEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHH Confidence 1222222222222221 2222222 344555666778899999999887 57899888888875532111 Q ss_pred CCeee-----ecccccchhhccccCCCc Q lcl|NC_019719. 398 GDVAM-----RQSQYVPITDLGTNKEPR 420 (424) Q Consensus 398 gd~~~-----~~~n~~~~~~~~~~~~~~ 420 (424) .+.+- .......+....++.+-+ T Consensus 444 ~eri~~E~~~~~~~~~~~~~~~~~~e~~ 471 (471) T protein:vir:10 444 LRLQKAEQEGRSEKLYDMEEVEHESEVE 471 (471) T ss_pred HHHHHHHHHHHHhcccccCCCCCccccC Confidence 00000 000011111111111111 No 217 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=96.43 E-value=0.00063 Score=38.12 Aligned_cols=397 Identities=10% Similarity=0.004 Sum_probs=177.6 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCccccccccccccccc--------ccC------cccccHHHHhhhHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHG--------HLG------DSSINDERILQISTVWRC 66 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~------~~~~~~~~~~~~~~v~~~ 66 (424) |-.|--+|++-.-..+|..-...+...... +........+.+-. ... .....+.+=+.+....-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~-~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~I 79 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHI-KWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTEL 79 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHH Confidence 777777777755556665432221110000 00000000000000 000 000000011223344556 Q ss_pred HHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEE Q lcl|NC_019719. 
67 VSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLL 146 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~ 146 (424) |+..+.-+-+-|+.+--.+ ++ ...+...|+..-+ .........+..++..+|.||.++-.+.+|.+ .+. T Consensus 80 vd~~~~yl~G~Pv~~~~~d-~~-------~~e~~~~l~~~~~--~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~-~~~ 148 (537) T protein:vir:78 80 VDQLAQYLLSNGVEVKVKD-ED-------NTQLDEILQEYFD--EDFQATIDTLVTNASKKGFEGIFARTTSEGKL-KFQ 148 (537) T ss_pred HHHHhhhhcccCceeecCc-ch-------hHHHHHHHHHHhh--ccHHHHHHHHHHHHhhcCeeEEEeeecCCCce-EEE Confidence 7777777777787753211 11 1123344432211 23344566778889999999999988988865 467 Q ss_pred eecCceEEEEEcCCce------EEEE-EecC---------ceEEecHhHeeEeccC------------------------ Q lcl|NC_019719. 147 PLQSANMDVKLVGKKV------VYRY-QRDS---------EYADFSQKEIFHLKGF------------------------ 186 (424) Q Consensus 147 ~l~~~~v~~~~~~~~~------~~~~-~~~~---------~~~~~~~~evih~r~~------------------------ 186 (424) .++|..+.+..++... +|.. .... ....+.++.|.+.+.. T Consensus 149 ~i~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~ 228 (537) T protein:vir:78 149 TVDGLTLIPVFDDYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLA 228 (537) T ss_pred EEccceeEEEEcCCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeee Confidence 7888888776654321 1110 0000 0112334444433210 Q ss_pred --------------------CC---------CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH Q lcl|NC_019719. 
187 --------------------GF---------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 187 --------------------~~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~ 237 (424) ++ +...|.|.+......++....+....++.+...+.|-.++.-......+ T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~ 308 (537) T protein:vir:78 229 IEESTDADFEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTD 308 (537) T ss_pred ccccccccccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccch Confidence 00 1234777777777777777776666666666666666555432222222 Q ss_pred HHHHHHHHHHHHHhCCcccCcceecC-CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH Q lcl|NC_019719. 238 QQRSQVEENFKEIAGGPVKKRLWILE-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~g~~~~l~-~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) +....+ +. .+++.++ .+.+++-+.....+.......+...+.|...-.+|. ......|+.|+.... T Consensus 309 ~~~~~l----~~-------~~~i~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~--~~~~~~gn~SGvAlk 375 (537) T protein:vir:78 309 KLRQNI----KA-------KKMIGVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFN--STAVGDGNVTNVVIK 375 (537) T ss_pred hHHHHH----hh-------cCceeecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCC--CccccccCCcHHHHH Confidence 222211 11 1233332 333333333332222233445555555655444442 122222333322221 Q ss_pred ----------HHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHH Q lcl|NC_019719. 317 ----------QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEM 386 (424) Q Consensus 317 ----------~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~ 386 (424) .....++...|+-.++.|...++.+-....+. 
..+.+.+..-+..|..+.++.+.++++.|+++..-+ T Consensus 376 ~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~~d~--~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~ 453 (537) T protein:vir:78 376 SRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGEYDS--NDICFEIEPHVLANELDIATTRKTEAETEALKIGNI 453 (537) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccc--ceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHH Confidence 12223344444444444544444322112222 334555556677889999999999999999998877 Q ss_pred HHHhCCCCCCC--------------------CCeeeeccccc----chhhc-c---cc--CCCcccCC Q lcl|NC_019719. 387 RRTDNLPPLPG--------------------GDVAMRQSQYV----PITDL-G---TN--KEPRNNGA 424 (424) Q Consensus 387 R~~~G~~p~~~--------------------gd~~~~~~n~~----~~~~~-~---~~--~~~~~~ga 424 (424) .+.+++-..+. .+.-....... +.... . ++ -++.+..| T Consensus 454 l~~~p~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 521 (537) T protein:vir:78 454 MTVAPRIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVA 521 (537) T ss_pred HHhCCCCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCC Confidence 76654421110 00000000000 00000 0 00 01111111 No 218 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=96.33 E-value=0.00074 Score=37.76 Aligned_cols=396 Identities=11% Similarity=0.055 Sum_probs=164.4 Q ss_pred CCCCcccccCCCC--------------------CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhh Q lcl|NC_019719. 1 MEEPKYTIDLRTN--------------------NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQI 60 (424) Q Consensus 1 ~~~~~~~~~~~~~--------------------~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 60 (424) -..-.-++.++++ .--++++..+..+.......+.. ... ...-+..-+.. 
T Consensus 6 ~~~~~~~~~~~~~~~~l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~---------~~~-~~~~~~~ki~~ 75 (506) T protein:vir:94 6 TEHKQANLIYQESLENLTPNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQS---------RRH-EDGKADHRATH 75 (506) T ss_pred hhhhcceeecccchhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc---------ccc-cccCCcceeec Confidence 0000001111111 11123333333222110000000 000 00001111234 Q ss_pred HHHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019719. 61 STVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) +.....|+..+.-+-.-|+.+--. ++. ....+.+++. . | ........+..+++.+|.||+.+..+.+| T Consensus 76 n~~~~Iv~~~~~~l~G~p~~~~~~--d~~-----~~~~l~~~~~-~-N---~~~~~~~~~~~~~~~~G~a~~~v~~ded~ 143 (506) T protein:vir:94 76 SFAKYIADFQTSYSVGNPINVKLP--DDG-----SNSGFDTFNK-A-N---DVDAENYDLFLDMSRYGRAYEYVYRGEDN 143 (506) T ss_pred chHHHHHHHhhhhhcccCceeecC--cch-----HHHHHHHHHh-c-c---CHhHHHHHHHHHHHhcCeEEEEEEecCCC Confidence 566777888777777777665311 111 1123444443 2 2 34556677888899999999999998888 Q ss_pred ceeeEEeecCceEEEEEcCCc---eE---EEEE---ecCc--------eEEecHhHeeEecc-----------------C Q lcl|NC_019719. 141 DVISLLPLQSANMDVKLVGKK---VV---YRYQ---RDSE--------YADFSQKEIFHLKG-----------------F 186 (424) Q Consensus 141 ~~~~l~~l~~~~v~~~~~~~~---~~---~~~~---~~~~--------~~~~~~~evih~r~-----------------~ 186 (424) .+ .+..++|..+.+..++.. .. +.|. ..+. ...+.+..+.+... . T Consensus 144 ~~-~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~v 222 (506) T protein:vir:94 144 EE-HLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMGKMQVDTTKPITTF 222 (506) T ss_pred ee-EEEEEcccceEEEecCCCCCceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCccceeccccccCCcc Confidence 65 566788888888765422 11 0010 0000 00112222211110 0 Q ss_pred C----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCC---------------------CCHHHHH Q lcl|NC_019719. 
187 G----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV---------------------LTEQQRS 241 (424) Q Consensus 187 ~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~---------------------~~~~~~~ 241 (424) + .+.-.|.|.+......++....+.....+.....+.|-.+++-.... ......+ T Consensus 223 Pvv~~~n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (506) T protein:vir:94 223 PVVEFKNSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLE 302 (506) T ss_pred ceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhH Confidence 0 01123556566555555444443333333222222233333211000 0111111 Q ss_pred HHHHHHHH-HhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH---- Q lcl|NC_019719. 242 QVEENFKE-IAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE---- 316 (424) Q Consensus 242 ~~~~~~~~-~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e---- 316 (424) .++...+. .......+.+...+.+.+++-+........+....+.....|...-++|..-.... .++.++.... T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Aik~~~~ 381 (506) T protein:vir:94 303 LIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENF-ASNSSGVAMQYKVL 381 (506) T ss_pred HHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccc-cccchHHHHHHHHH Confidence 11111111 11111112222233344555565555556667788888999999999996322211 1222322222 Q ss_pred ------HHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh Q lcl|NC_019719. 
317 ------QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTD 390 (424) Q Consensus 317 ------~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~ 390 (424) ......+...+...+..|...++..- .........+++.++.-+..|..+.++.+.++ .|+++...+++++ T Consensus 382 ~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~-~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l 458 (506) T protein:vir:94 382 GTVELASTKRRMFERGLYARYQIISDIENSIH-GDWTFDPQELTFTFRDNLPADNISQIKALVQA--GATLPQKYLYQQL 458 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC Confidence 11223334444444444444333210 00011112234444566678888899998888 4899999999888 Q ss_pred CCCCCCC--CCeeee-----cccccchhhccccCCCcccCC Q lcl|NC_019719. 391 NLPPLPG--GDVAMR-----QSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 391 G~~p~~~--gd~~~~-----~~n~~~~~~~~~~~~~~~~ga 424 (424) +.-..|. .+++-. ...+......++.++ .+..+ T Consensus 459 p~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~-~~~~~ 498 (506) T protein:vir:94 459 PGVTNPQDIVDMMKEQSANGDYSFDQNGVISNDGQ-TNTTA 498 (506) T ss_pred CCCCCHHHHHHHHHHHHHHHhhcchhhcCCCcccC-ccccc Confidence 6533211 000000 000000000011111 11111 No 219 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=95.77 E-value=0.0015 Score=36.08 Aligned_cols=408 Identities=13% Similarity=0.115 Sum_probs=175.1 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccccccccc------cccCccc------c-cHHHHhhhHHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAH------GHLGDSS------I-NDERILQISTVWRCV 67 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~------~-~~~~~~~~~~v~~~i 67 (424) |..--|-..||...- .+ .+...+.|.....+..+... ....+.. 
+ .-+..+.+|.|..|| T Consensus 1 ~~~~lfg~~i~~~~~-~~------~~~s~~~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av 73 (537) T protein:vir:10 1 MAQQLFGFSLQRAKK-VP------KGPSFVQKDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVLNPECDSAV 73 (537) T ss_pred Cccccccceeecccc-cc------cCCcccCCCcccccceeecccccccccccccccchHHHHHHHHHHHhhccchhhHH Confidence 444333334433211 11 11111122111111111111 1111111 1 125566789999999 Q ss_pred HHHHHhhccC-----ceEEEEeccc-CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-- Q lcl|NC_019719. 68 SLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-- 139 (424) Q Consensus 68 ~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~-- 139 (424) +.|.+.+.-. |+.+--...+ ++..+........++|+ --+-...++ .++..|...|..|..++-+.. T Consensus 74 ~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~-ll~F~~~~~----e~fR~WYVDgRi~fhKiid~k~p 148 (537) T protein:vir:10 74 DDVVNETICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILR-LLDFDNRAY----EIFRRWYVDGRLFFHKVIDPKKP 148 (537) T ss_pred HHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhheeeeEEEEEEEEeCCCc Confidence 9999987643 3332211111 11111112223333343 122223344 445666778998888766533 Q ss_pred -CceeeEEeecCceEEEEEc-----CCce---------------EEEEEe------cCceEEecHhHeeEec--cCCCCc Q lcl|NC_019719. 140 -GDVISLLPLQSANMDVKLV-----GKKV---------------VYRYQR------DSEYADFSQKEIFHLK--GFGFTG 190 (424) Q Consensus 140 -G~~~~l~~l~~~~v~~~~~-----~~~~---------------~~~~~~------~~~~~~~~~~evih~r--~~~~~~ 190 (424) .-+.+|..|+|.++...+. .+.. +|.|.. .+....++.+-|.+.. 
-...++ T Consensus 149 k~GI~ELr~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI~~dAI~y~hSGl~d~n~ 228 (537) T protein:vir:10 149 RQGLVELRYVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKIAPDSIAYCHSGIQDLNK 228 (537) T ss_pred cccceeeeeeCCccceeeEeecccCCccceEEecceeeeecccceeeeccccccccCCCceeccHhheeeecccceeCCC Confidence 3588999999999975443 1111 122221 2233456664444333 123455 Q ss_pred cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCC----cccC------cc Q lcl|NC_019719. 191 LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGG----PVKK------RL 259 (424) Q Consensus 191 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~----~~~g------~~ 259 (424) -+.+|-+..+.+.+.....++....-+----+.-+-|+..+-+.. +..+.+-++....++..- ...| +. T Consensus 229 ~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~ 308 (537) T protein:vir:10 229 NMVLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKF 308 (537) T ss_pred CeeeeeehhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchh Confidence 677888888888888777777666544333344445566555543 344455556655554421 0011 11 Q ss_pred ee-c----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH----H-HHHH Q lcl|NC_019719. 260 WI-L----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----Q-QNLG 321 (424) Q Consensus 260 ~~-l----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----~-~~~~ 321 (424) +. + ..|.+++.|.. +..+++- ..+..+.+.++++||.+-|...++.+.. 
...| + ...- T Consensus 309 msMlEDyWLPRReGgrgTEItTLpGgqnlgem~D---V~YF~kKLy~aLnVP~SRl~~e~~f~~G-r~~EItRDEiKF~K 384 (537) T protein:vir:10 309 MSMLEDFWLPRREGGRGTEISTLPGGQNLGELED---VKYFQKKLYKALNVPSSRLETETTFNIG-RAAEITRDEVKFQK 384 (537) T ss_pred hhhhhhhcccccCCCcccceeeccccCCcChHHH---HHHHHHHHHHHhCCCccccCCCCccccc-ccchhhHHHHHHHH Confidence 11 1 13455665543 3344443 4455888999999999999754432221 1111 1 1112 Q ss_pred HHHHHHHHHHHHHHHHHHhhcc-----Ccccccc--ceeeecc--hhh----hccC-HHHHHHHHHHHHh--CCCCCHH- Q lcl|NC_019719. 322 FLQYTLQPYISRWENSIQRWLI-----PAKDVGR--IHAEHNL--DGL----LRGD-SASRAAFMKAMGE--AGLRTIN- 384 (424) Q Consensus 322 ~~~~tl~P~~~~ie~~l~~~l~-----~~~~~~~--~~~~fd~--~~l----~~~d-~~~~~~~~~~~~~--~g~~T~N- 384 (424) |+..-=.-+...|.+.|...|+ ++.++.. ..++|++ +.. .... ...|...++.+-. +-+++.+ T Consensus 385 FI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dy 464 (537) T protein:vir:10 385 FIARLRKRFSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANY 464 (537) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHH Confidence 2222222344444455555443 4455432 3333333 222 1111 1123333332210 1122333 Q ss_pred -----------HHHHH---------hCCCCCCC----C------Ceeeecccccchhhcc--ccCCCcccCC Q lcl|NC_019719. 385 -----------EMRRT---------DNLPPLPG----G------DVAMRQSQYVPITDLG--TNKEPRNNGA 424 (424) Q Consensus 385 -----------E~R~~---------~G~~p~~~----g------d~~~~~~n~~~~~~~~--~~~~~~~~ga 424 (424) |+-++ .|+=+-|. + +..+.|++..|..+.. 
+..+..++|- T Consensus 465 i~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 536 (537) T protein:vir:10 465 IRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTDPNSAVSPADQKRGE 536 (537) T ss_pred HHHHHhccCHHHHHHHHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCcccCCccCCCCCCccCCC Confidence 33211 12211121 1 1222222322221111 1111111111 No 220 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=95.63 E-value=0.0017 Score=35.74 Aligned_cols=399 Identities=13% Similarity=0.131 Sum_probs=167.3 Q ss_pred HHHHHhhccCc--------ccCccccccccccccccc------ccCccc------c-cHHHHhhhHHHHHHHHHHHHhhc Q lcl|NC_019719. 17 WARLQSWFVGG--------RLVTPNQGSQTGPVSAHG------HLGDSS------I-NDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 17 ~~~l~~~~~~~--------~~~~~~~~~~~~~~~~~~------~~~~~~------~-~~~~~~~~~~v~~~i~~ia~~ia 75 (424) +..|+++--.. +.++|.......++...+ ...+.. + .-+..+.+|.|..||+.|.+.+. T Consensus 1 m~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai 80 (533) T protein:vir:10 1 MSQLFGFSLERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRYREMVLQPECDSAVDDIVNETI 80 (533) T ss_pred CccccccccccccccccCCCCCCCCcccccceeecccccceeeecccccchHHHHHHHHHHHhhccchhhHHHHhhccee Confidence 33343331111 111111111111111110 111111 1 12455678999999999999876 Q ss_pred cC-----ceEEEEec-ccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC---CCceeeEE Q lcl|NC_019719. 76 CL-----PLDVFETD-QNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS---AGDVISLL 146 (424) Q Consensus 76 ~~-----~~~v~~~~-~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~---~G~~~~l~ 146 (424) -. |+.+--.. +-++..+........++|+ --+-...++ .++..|...|..|..++-+. ..-+.+|. 
T Consensus 81 v~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~-ll~F~~~~~----e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr 155 (533) T protein:vir:10 81 CGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILR-LLDFENRSY----EIFRRWYVDGRLFYHKVIDPDNPQGGLIELR 155 (533) T ss_pred eecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhhhhcceEEEEEEecCCCccccceeee Confidence 43 22221111 0011111111223333333 122223344 44556677899888876553 34689999 Q ss_pred eecCceEEEEEc-----CCc---------------eEEEEEec------CceEEecHhHeeEecc--CCCCccccCchHH Q lcl|NC_019719. 147 PLQSANMDVKLV-----GKK---------------VVYRYQRD------SEYADFSQKEIFHLKG--FGFTGLVGLSPIA 198 (424) Q Consensus 147 ~l~~~~v~~~~~-----~~~---------------~~~~~~~~------~~~~~~~~~evih~r~--~~~~~~~G~s~~~ 198 (424) .|+|.+|+..+. .++ -+|.|... +....++.+-|.+... ...++-.-+|-+. T Consensus 156 ~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~~~~~~vkI~~dAI~y~hSGl~d~~~~~i~syLh 235 (533) T protein:vir:10 156 YIDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKNSTTQGLKIAPDSICYVHSGIMDLNKNMTLSHLH 235 (533) T ss_pred eccccceeeeeeeeccCCCccceeecchhhhccceeeeeeccccccccCCCceecchhheeeeeccceeCCCCceeccch Confidence 999999987432 111 01223222 2334566644444331 1223334467777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCC----cccC------ccee-c---- Q lcl|NC_019719. 199 FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGG----PVKK------RLWI-L---- 262 (424) Q Consensus 199 ~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~----~~~g------~~~~-l---- 262 (424) .+.+.+.....++....-+----+.-+-|+..+-+.. +..+.+-++....++..- ...| +.+. 
+ T Consensus 236 kAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyW 315 (533) T protein:vir:10 236 KAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFW 315 (533) T ss_pred HhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhc Confidence 7777777666666655444333343445555554443 344455555555554421 0011 1111 1 Q ss_pred ------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH----H-HHHHHHHHHHHH Q lcl|NC_019719. 263 ------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----Q-QNLGFLQYTLQP 329 (424) Q Consensus 263 ------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----~-~~~~~~~~tl~P 329 (424) ..|.+++.|.. +..+++- ..+..+.+.++++||.+-|...++-+.. ...| + ...-|+..-=.- T Consensus 316 LPRReGgrgTEItTLpGgqnLgem~D---V~YF~kKLY~aLnVP~SRl~~e~~f~~G-r~~EItRDEiKF~KFI~RLR~r 391 (533) T protein:vir:10 316 LPRREGGRGTEITTLPGGQNLGELED---VKYFQKKLYKSLNVPGSRLETETTFNVG-RAAEITRDEVKFQKFVARLRKR 391 (533) T ss_pred ccccCCCCccceeeccccCCcChHHH---HHHHHHHHHHHhCCCccccCCCCccccc-ccchhhHHHHHHHHHHHHHHHH Confidence 13455665543 3344443 4455888999999999999754332221 1111 1 111222222223 Q ss_pred HHHHHHHHHHhhcc-----Ccccccc--ceeeec--chhh----hccC-HHHHHHHHHHHH--hCCCCCHHHHHHH-hCC Q lcl|NC_019719. 330 YISRWENSIQRWLI-----PAKDVGR--IHAEHN--LDGL----LRGD-SASRAAFMKAMG--EAGLRTINEMRRT-DNL 392 (424) Q Consensus 330 ~~~~ie~~l~~~l~-----~~~~~~~--~~~~fd--~~~l----~~~d-~~~~~~~~~~~~--~~g~~T~NE~R~~-~G~ 392 (424) +...|.+.|...|+ ++.++.. ..++|+ .+.. .... ...|...+..+- -+-+++.+-+|+. 
|.+ T Consensus 392 Fs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~ 471 (533) T protein:vir:10 392 FSELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQ 471 (533) T ss_pred HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcc Confidence 44444455555543 4455432 333333 3222 1111 122333333321 0113344444331 222 Q ss_pred CCCC----------CCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 393 PPLP----------GGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 393 ~p~~----------~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) .-.+ +.+....+--..+++......+|+.+|+ T Consensus 472 tDeei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~ 513 (533) T protein:vir:10 472 TDVEMKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAGGA 513 (533) T ss_pred CHHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCc Confidence 1100 0000000000001111111112222222 No 221 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=93.37 E-value=0.0078 Score=32.15 Aligned_cols=399 Identities=11% Similarity=0.095 Sum_probs=164.3 Q ss_pred HHHHHhhccC---------cccCccccccccccccccc------ccCcccc-------cHHHHhhhHHHHHHHHHHHHhh Q lcl|NC_019719. 17 WARLQSWFVG---------GRLVTPNQGSQTGPVSAHG------HLGDSSI-------NDERILQISTVWRCVSLISTLT 74 (424) Q Consensus 17 ~~~l~~~~~~---------~~~~~~~~~~~~~~~~~~~------~~~~~~~-------~~~~~~~~~~v~~~i~~ia~~i 74 (424) +..|+++.-. 
...+.|........+...+ ...+..- .-+..+.+|.|..||+.|.+.+ T Consensus 1 m~~lfgf~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVnea 80 (558) T protein:vir:10 1 MAKLFGFSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEADGAIEDVVNEA 80 (558) T ss_pred CcchhcchhhhhhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHhhccchhhHHHHhhcce Confidence 2223332211 1111222211111111111 1111111 1245567899999999999987 Q ss_pred ccC-----ceEEEEecccC-ccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC---CceeeE Q lcl|NC_019719. 75 ACL-----PLDVFETDQND-NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA---GDVISL 145 (424) Q Consensus 75 a~~-----~~~v~~~~~~~-~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~---G~~~~l 145 (424) .-. |+.+--.+.+. ............++|+ --|-...++ .++..|...|..|..++-+.. .-+.+| T Consensus 81 iv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~-ll~F~~~~~----e~fR~WYVDgRiyfHKiid~k~pk~GI~EL 155 (558) T protein:vir:10 81 IVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKE-MMDFDKKSH----EIFRNWYVDGRVFYLKVIDTKNPQEGIQDL 155 (558) T ss_pred eEecCCCceEEEEecccCcchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhheeeeEEEEEEEEeCCCccccceee Confidence 643 33222111111 1111112223333333 122223344 445666788998888766433 358899 Q ss_pred EeecCceEEEEEcC----------------Cc--------eEEEEEecC-------------ceEEecHhHeeEecc--C Q lcl|NC_019719. 146 LPLQSANMDVKLVG----------------KK--------VVYRYQRDS-------------EYADFSQKEIFHLKG--F 186 (424) Q Consensus 146 ~~l~~~~v~~~~~~----------------~~--------~~~~~~~~~-------------~~~~~~~~evih~r~--~ 186 (424) ..|+|.+++..+.- .+ .+|.|...+ ....++.+=|.+... . 
T Consensus 156 r~lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~~dAI~y~hSGL~ 235 (558) T protein:vir:10 156 RYIDPLKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIAKDSITMCTSGLV 235 (558) T ss_pred eeeCcccceeeeeeccccccccceeeeecccceeeccceeEeeeecCCcccccccceeecCCCceeechhheeeecccce Confidence 99999999764431 01 123232221 112233333332221 1 Q ss_pred CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCC----cccC---- Q lcl|NC_019719. 187 GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGG----PVKK---- 257 (424) Q Consensus 187 ~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~----~~~g---- 257 (424) +.++-.-+|-+..+.+.+.....++....-+----+.-+-|+..+-+.. +..+.+-++....++..- ...| T Consensus 236 d~~~~~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~d 315 (558) T protein:vir:10 236 DRNKNRVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRD 315 (558) T ss_pred ecCCCeeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecc Confidence 1233344566777777776666666555444333333345555554443 334455555555554421 0011 Q ss_pred --ccee-c----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH-----H Q lcl|NC_019719. 258 --RLWI-L----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE-----Q 317 (424) Q Consensus 258 --~~~~-l----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e-----~ 317 (424) +.+. + ..|.+++.|.. +..+++- ..+..+.+.++++||.+-|...++-+.. 
...| = T Consensus 316 drk~msMlEDyWLpRReGgrgTEItTLpGgqnLgem~D---V~YF~kKLy~aLnVP~SRl~~e~~f~~G-r~~EItRDEi 391 (558) T protein:vir:10 316 DRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGELSD---VDYFQKKLYRALGVPESRIAAEGGFNLG-RSSEILRDEL 391 (558) T ss_pred cchhhhhHhhhcccccCCCCccceeeccccCCcchHHH---HHHHHHHHHHHhCCCccccCCCCccccc-ccchhhHHHH Confidence 1111 1 13455655543 4444444 4455888999999999999754332221 1111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcc-----Ccccccc--ceeeec--chhh----hccC-HHHHHHHHHHHHh--CCCC Q lcl|NC_019719. 318 QNLGFLQYTLQPYISRWENSIQRWLI-----PAKDVGR--IHAEHN--LDGL----LRGD-SASRAAFMKAMGE--AGLR 381 (424) Q Consensus 318 ~~~~~~~~tl~P~~~~ie~~l~~~l~-----~~~~~~~--~~~~fd--~~~l----~~~d-~~~~~~~~~~~~~--~g~~ 381 (424) ...-|+..-=.-+...|.+.|...|+ ++.++.. ..++|+ .+.. .... ...|...+..+-. +-++ T Consensus 392 KF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~ 471 (558) T protein:vir:10 392 KFAKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYY 471 (558) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 11222222222344444555555543 3455432 333333 2222 1111 1123333332211 1133 Q ss_pred CHHHHHHH-h--------------------CCCCCCCCCeeee-----cccccchhhcccc-CCCcccCC Q lcl|NC_019719. 382 TINEMRRT-D--------------------NLPPLPGGDVAMR-----QSQYVPITDLGTN-KEPRNNGA 424 (424) Q Consensus 382 T~NE~R~~-~--------------------G~~p~~~gd~~~~-----~~n~~~~~~~~~~-~~~~~~ga 424 (424) +.+=+|+. | |+=+-|...+++. 
+.+-..+...+.+ .+++-.++ T Consensus 472 S~dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 541 (558) T protein:vir:10 472 STEYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDPAMEGMGEQPVDPDLEAQ 541 (558) T ss_pred chHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccChhhccccCccCCchhccCCCCCcccccccc Confidence 33333321 1 1111122111111 1111111111111 12222222 No 222 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=91.81 E-value=0.014 Score=30.73 Aligned_cols=394 Identities=11% Similarity=0.089 Sum_probs=160.2 Q ss_pred CCCCcccccCCCC-----------------CchHHHHHhhccCcccCcccccccccccccccccCcccccHHHHhhhHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTN-----------------NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTV 63 (424) Q Consensus 1 ~~~~~~~~~~~~~-----------------~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 63 (424) .++--+-+++... ..-++++..+..+.-... .. +.......... + +..+.. T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~-~~-----~~~~~~~~~~~----k--i~~n~~ 69 (489) T protein:vir:99 2 LQEDFEAIDYESKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIK-YR-----PAKTDKYAADN----R--IASDFA 69 (489) T ss_pred CccceeeeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccc-cc-----cccccccCCcc----e--eecchH Confidence 2222222222221 223333333333221000 00 00000000000 0 224556 Q ss_pred HHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee----CCC Q lcl|NC_019719. 64 WRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR----NSA 139 (424) Q Consensus 64 ~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r----~~~ 139 (424) ..+|+..+.-+-.-|+.+--. +.. ....+.+++. . | ....+...+..+++.+|.+|..+.. +.. 
T Consensus 70 ~~iv~~~~~~l~g~~~~~~~~--d~~-----~~~~l~~~~~-~-n---~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~ 137 (489) T protein:vir:99 70 KYITVFEQGYMLGVPVEYKNE--NKD-----LQAAIDLMSV-R-N---NEDYHNVKIKTDLSIYGRAYELLTVEKIDDKK 137 (489) T ss_pred HHHHHHHhhhhccCCceeecC--Chh-----HHHHHHHHHh-h-c---ChhHHHHHHHHHHhhCCeEEEEEeeccCcCCC Confidence 677887777776667664211 111 1122333333 2 2 3445678888999999999977653 333 Q ss_pred CceeeEEeecCceEEEEEcCCc---eE-----EEEEec-Cc----eEEecHhHeeEeccCC------------------- Q lcl|NC_019719. 140 GDVISLLPLQSANMDVKLVGKK---VV-----YRYQRD-SE----YADFSQKEIFHLKGFG------------------- 187 (424) Q Consensus 140 G~~~~l~~l~~~~v~~~~~~~~---~~-----~~~~~~-~~----~~~~~~~evih~r~~~------------------- 187 (424) |. ..+..++|..+.+..++.. .. |..... +. ...+.++.+.+++... T Consensus 138 ~~-~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~v 216 (489) T protein:vir:99 138 TE-VKLYQLPAEQTFVIYDDTYQRNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGV 216 (489) T ss_pred cc-eEEEEEcccceEEEEcCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCce Confidence 33 4677788888877765321 11 111111 00 1122333443332100 Q ss_pred -----CCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHh-------CCcc Q lcl|NC_019719. 188 -----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIA-------GGPV 255 (424) Q Consensus 188 -----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~-------~~~~ 255 (424) .+...|.|.+..+...++....+.....+.....+.|-.+++-. .....+. ......+.... .... 
T Consensus 217 Pvv~~~n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~-~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 294 (489) T protein:vir:99 217 PVNEYANNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGN-AYTGADE-NDYLDDGRLNPNGRLAISIGFK 294 (489) T ss_pred eEEEeecCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccC-Ccccccc-hhhhhhcccccccccccccccc Confidence 01123556565555555544444333333333334444444321 1111111 11111111100 1112 Q ss_pred cCcceecCCCc-------eeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHH----------HH Q lcl|NC_019719. 256 KKRLWILEAGF-------STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQ 318 (424) Q Consensus 256 ~g~~~~l~~g~-------~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~ 318 (424) .++++.++.+. +.+.+.....+..+....+...+.|...-++|..-.... .++.++...+ +. T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~ 373 (489) T protein:vir:99 295 KAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKF-SGVQSGESMKYKLMASDNYREK 373 (489) T ss_pred cceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccccc-cccchHHHHHHHHHHHHHHHHH Confidence 23344443332 333343333344445667778888999999985322111 1222222221 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCcccc-ccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCC--CC Q lcl|NC_019719. 319 NLGFLQYTLQPYISRWENSIQRWLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLP--PL 395 (424) Q Consensus 319 ~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~--p~ 395 (424) ....+...+.-+++.+...+...-...... ....+.+.++.-+..|..+.++.+.++. |+++...+.++++.= +. 
T Consensus 374 k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~--giis~et~~~~l~~v~~~d 451 (489) T protein:vir:99 374 QERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY--GIVSDQTIFEILNTVTGVD 451 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhcCCCCchh Confidence 112333334444443333332211110000 0112344445556678888899888885 789988888876441 11 Q ss_pred --CCCCee-------eecccccchhhccccCCCcccCC Q lcl|NC_019719. 396 --PGGDVA-------MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 396 --~~gd~~-------~~~~n~~~~~~~~~~~~~~~~ga 424 (424) .+.+++ ....+....++..+++++.++.- T Consensus 452 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 452 AEAELKRLKEEADKKQSLPEPRLVGDASGQEEPTAEKP 489 (489) T ss_pred HHHHHHHHHHHHHHHhccccccccCCCCCCcCCCCCCC Confidence 111100 00011111111111121111111 No 223 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=90.97 E-value=0.018 Score=30.13 Aligned_cols=385 Identities=8% Similarity=0.031 Sum_probs=166.5 Q ss_pred ccCCCCCchHHHHHhhcc-------C---cccCccccccccc-----------cccccc-c-cCcc---c------c-cH Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFV-------G---GRLVTPNQGSQTG-----------PVSAHG-H-LGDS---S------I-ND 54 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~-------~---~~~~~~~~~~~~~-----------~~~~~~-~-~~~~---~------~-~~ 54 (424) |.| +..|+|+.+...-. + .+.+.|.....+. ++.+.. . .++. . 
+ .- T Consensus 1 m~~-~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~Y 79 (524) T protein:vir:72 1 MKF-NVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTY 79 (524) T ss_pred CCC-chhhHhhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHH Confidence 777 45566554322100 0 0111121111000 111000 0 1110 0 1 12 Q ss_pred HHHhhhHHHHHHHHHHHHhhccC-----ceEEEEeccc-CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcC Q lcl|NC_019719. 55 ERILQISTVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYG 128 (424) Q Consensus 55 ~~~~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G 128 (424) +..+.+|.|..||+.|.+.+.-. |+.+--.+.+ +...+........++|+ --+-...++ .++..|...| T Consensus 80 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~~----~~fR~WYVDg 154 (524) T protein:vir:72 80 RNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDVLN-HLSFQRKGS----DHFRRWYVDS 154 (524) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhheeee Confidence 45567899999999999887643 3332211111 01101111223333333 122222333 4456667789 Q ss_pred CeEEEEeeCCC---CceeeEEeecCceEEEEEc-----CCc--------eEEEEEecC-------------ceEEecHhH Q lcl|NC_019719. 129 NAYALVDRNSA---GDVISLLPLQSANMDVKLV-----GKK--------VVYRYQRDS-------------EYADFSQKE 179 (424) Q Consensus 129 ~a~~~~~r~~~---G~~~~l~~l~~~~v~~~~~-----~~~--------~~~~~~~~~-------------~~~~~~~~e 179 (424) ..|..++-+.. .-+.+|..|+|.+++..+. +.+ -+|.|..+. 
....++.+- T Consensus 155 Ri~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dA 234 (524) T protein:vir:72 155 RIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAA 234 (524) T ss_pred EEEEEEEEeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhh Confidence 98887765533 3588999999999976432 111 123333222 223344444 Q ss_pred eeEecc--CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCC--- Q lcl|NC_019719. 180 IFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGG--- 253 (424) Q Consensus 180 vih~r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~--- 253 (424) |.|... .+.++-.-+|-+..+.+.+.....++....-+----+.-+-|+..+-+.. +..+.+-++....++..- T Consensus 235 I~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvY 314 (524) T protein:vir:72 235 VVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVY 314 (524) T ss_pred eeeeeccceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEE Confidence 444431 12233344677777777776666666555444333343445555554443 334455555555554421 Q ss_pred -cccC------cceec-----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccch Q lcl|NC_019719. 254 -PVKK------RLWIL-----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGS 313 (424) Q Consensus 254 -~~~g------~~~~l-----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~ 313 (424) .+.| +.+.+ ..|.+++.|.. 
+..+++- ..+..+.+.++++||.+-|.....+..+.+ T Consensus 315 Da~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kkLy~aLnVP~sRl~~d~~~~f~~g 391 (524) T protein:vir:72 315 DASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMED---IRWFRQALYMALRVPLSRIPQDQQGGVMFD 391 (524) T ss_pred eCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHH---HHHHHHHHHHHhCCchhhcCCCCCcccccc Confidence 0111 11111 13455665543 3444444 445588899999999999943211111111 Q ss_pred hHHHH------HHHHHHHHHHHHHHHHHHHHHhhcc-----Ccccccc--ceeeecc--hhh----hccC-HHHHHHHHH Q lcl|NC_019719. 314 GIEQQ------NLGFLQYTLQPYISRWENSIQRWLI-----PAKDVGR--IHAEHNL--DGL----LRGD-SASRAAFMK 373 (424) Q Consensus 314 n~e~~------~~~~~~~tl~P~~~~ie~~l~~~l~-----~~~~~~~--~~~~fd~--~~l----~~~d-~~~~~~~~~ 373 (424) ...+. ..-|+..-=.-+...|.+.|...|+ ++.++.. ..++|++ +.. .... ...|...++ T Consensus 392 r~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~ 471 (524) T protein:vir:72 392 SGTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLT 471 (524) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHH Confidence 11111 1122222222344444555555543 4455432 3333333 222 1111 112333333 Q ss_pred HHHh--CCCCCHHHHHH-HhCCC----------------------CCCCCCee Q lcl|NC_019719. 374 AMGE--AGLRTINEMRR-TDNLP----------------------PLPGGDVA 401 (424) Q Consensus 374 ~~~~--~g~~T~NE~R~-~~G~~----------------------p~~~gd~~ 401 (424) .+-. +-.++.+=+|+ .|.+. 
|.+..+.+ T Consensus 472 ~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:72 472 MAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred HhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 2221 11334444433 22221 11112222 No 224 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=90.67 E-value=0.02 Score=29.94 Aligned_cols=385 Identities=9% Similarity=0.031 Sum_probs=166.5 Q ss_pred ccCCCCCchHHHHHhhcc-------C---cccCccccccccc-----------cccccc-c-cCcc---c------c-cH Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFV-------G---GRLVTPNQGSQTG-----------PVSAHG-H-LGDS---S------I-ND 54 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~-------~---~~~~~~~~~~~~~-----------~~~~~~-~-~~~~---~------~-~~ 54 (424) |.| +..|+|+.+...-. + .+.+.|.....+. ++.+.. . .++. . + .- T Consensus 1 m~~-~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~Y 79 (524) T protein:vir:10 1 MKF-NVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTY 79 (524) T ss_pred CCC-chhhHhhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHH Confidence 777 45566554322100 0 0111121111000 111000 0 1110 0 1 12 Q ss_pred HHHhhhHHHHHHHHHHHHhhccC-----ceEEEEeccc-CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcC Q lcl|NC_019719. 55 ERILQISTVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYG 128 (424) Q Consensus 55 ~~~~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G 128 (424) +..+.+|.|..||+.|.+.+.-. 
|+.+--.+.+ +...+........++|+ --+-...++ .++..|...| T Consensus 80 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~~----~~fR~WYVDg 154 (524) T protein:vir:10 80 RNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDVLN-HLSFQRKGS----DHFRRWYVDS 154 (524) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhheeee Confidence 45567899999999999887643 3332211111 01101111222333333 122222333 4456667789 Q ss_pred CeEEEEeeCCC---CceeeEEeecCceEEEEEc-----CCc--------eEEEEEecC-------------ceEEecHhH Q lcl|NC_019719. 129 NAYALVDRNSA---GDVISLLPLQSANMDVKLV-----GKK--------VVYRYQRDS-------------EYADFSQKE 179 (424) Q Consensus 129 ~a~~~~~r~~~---G~~~~l~~l~~~~v~~~~~-----~~~--------~~~~~~~~~-------------~~~~~~~~e 179 (424) ..|..++-+.. .-+.+|..|+|.+++..+. +.+ -+|.|..+. ....++.+- T Consensus 155 Ri~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dA 234 (524) T protein:vir:10 155 RIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAA 234 (524) T ss_pred EEEEEEEeeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhh Confidence 98888766533 3588999999999976432 111 123333222 223344444 Q ss_pred eeEecc--CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCC--- Q lcl|NC_019719. 180 IFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGG--- 253 (424) Q Consensus 180 vih~r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~--- 253 (424) |.|... .+.++-.-+|-+..+.+.+.....++....-+----+.-+-|+..+-+.. 
+..+.+-++....++..- T Consensus 235 I~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvY 314 (524) T protein:vir:10 235 IVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVY 314 (524) T ss_pred eeeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEE Confidence 444431 12233344677777777776666666555444333343445555554443 334455555555554421 Q ss_pred -cccC------cceec-----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccch Q lcl|NC_019719. 254 -PVKK------RLWIL-----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGS 313 (424) Q Consensus 254 -~~~g------~~~~l-----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~ 313 (424) .+.| +.+.+ ..|.+++.|.. +..+++- ..+..+.+.++++||.+-|.....+..+.+ T Consensus 315 Da~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kkLy~aLnVP~sRl~~d~~~~f~~g 391 (524) T protein:vir:10 315 DASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMED---VRWFRQALYMALRVPLSRIPQDQQGGVMFD 391 (524) T ss_pred eCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHH---HHHHHHHHHHHhCCchhhcCCCCCcccccc Confidence 0111 11111 13455665543 3444444 445588899999999999943211111111 Q ss_pred hHHHH------HHHHHHHHHHHHHHHHHHHHHhhcc-----Ccccccc--ceeeecc--hhh----hccC-HHHHHHHHH Q lcl|NC_019719. 314 GIEQQ------NLGFLQYTLQPYISRWENSIQRWLI-----PAKDVGR--IHAEHNL--DGL----LRGD-SASRAAFMK 373 (424) Q Consensus 314 n~e~~------~~~~~~~tl~P~~~~ie~~l~~~l~-----~~~~~~~--~~~~fd~--~~l----~~~d-~~~~~~~~~ 373 (424) ...+. ..-|+..-=.-+...|.+.|...|+ ++.++.. ..++|++ +.. .... 
...|...++ T Consensus 392 r~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~ 471 (524) T protein:vir:10 392 SGTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLT 471 (524) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHH Confidence 11111 1122222222344444555555543 4455432 3333333 222 1111 112333333 Q ss_pred HHHh--CCCCCHHHHHH-HhCCC----------------------CCCCCCee Q lcl|NC_019719. 374 AMGE--AGLRTINEMRR-TDNLP----------------------PLPGGDVA 401 (424) Q Consensus 374 ~~~~--~g~~T~NE~R~-~~G~~----------------------p~~~gd~~ 401 (424) .+-. +-.++.+=+|+ .|.+. |.+..+.+ T Consensus 472 ~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 472 MAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred HhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 2221 11334444433 22221 11112222 No 225 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=90.57 E-value=0.02 Score=29.88 Aligned_cols=406 Identities=15% Similarity=0.143 Sum_probs=169.7 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccccc-----cccccccccCcc--c------c-cHHHHhhhHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQT-----GPVSAHGHLGDS--S------I-NDERILQISTVWRC 66 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~--~------~-~~~~~~~~~~v~~~ 66 (424) |.+ -|-..||...| ..+.+.++|...... +.++......|. . 
+ .-+..+.+|.|..| T Consensus 1 m~~-lfgf~i~~~~~--------~~~~S~vpp~~~~~~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~A 71 (564) T protein:vir:10 1 MSQ-LFGFLINEKEG--------QKGQSPVPPNDEASVSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEVDSA 71 (564) T ss_pred Ccc-hhcceeeeecc--------CCCCCcccCCcCCChhhhhccccceeeecccccchhhHHHHHHHHHHHhhccchhhH Confidence 211 11222222221 111222222211111 111111111221 1 1 12455678999999 Q ss_pred HHHHHHhhccC-----ceEEEEeccc-CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC- Q lcl|NC_019719. 67 VSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA- 139 (424) Q Consensus 67 i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~- 139 (424) |+.|.+.+.-. |+.|--...+ +...+........++|+ --|-...++ .++..|.+.|..|..++-+.+ T Consensus 72 v~eIVneaIv~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~-ll~F~~~~~----e~fR~WYVDgRi~fHkiid~~~ 146 (564) T protein:vir:10 72 IDEIVNEFVVNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILR-MMNFNVNAH----EIIRNWYVDGRSHYHKVIDLDN 146 (564) T ss_pred HHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhhhhcceEEEEEEeeCCC Confidence 99999985532 3322111110 00000011122333332 122223344 445566778998887765422 Q ss_pred --CceeeEEeecCceEEEEEc------CCc-----------------eEEEEEec-----------------CceEEecH Q lcl|NC_019719. 140 --GDVISLLPLQSANMDVKLV------GKK-----------------VVYRYQRD-----------------SEYADFSQ 177 (424) Q Consensus 140 --G~~~~l~~l~~~~v~~~~~------~~~-----------------~~~~~~~~-----------------~~~~~~~~ 177 (424) .-+.+|..|+|..++..+. ... .+|.|... +....++. T Consensus 147 pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~ 226 (564) T protein:vir:10 147 PKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGSMDWSNQEGIKIAS 226 (564) T ss_pred hhhhhhhhhhhcccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCcccccccccccccccceeech Confidence 2388999999998876542 111 12233211 11345666 Q ss_pred hHeeEecc--CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCC- Q lcl|NC_019719. 
178 KEIFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGG- 253 (424) Q Consensus 178 ~evih~r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~- 253 (424) +-|.|.+. .+.++-.-+|-+..+.+.+.....++....-+----+.-+-|+..+-+.. +..+.+-++....++..- T Consensus 227 daI~y~hSGL~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNkl 306 (564) T protein:vir:10 227 DAIAQSTSGLMDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKL 306 (564) T ss_pred hhcceecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceE Confidence 66766653 23344445667777777777666666555444333343345555554443 334455555555554421 Q ss_pred ---cccC------ccee-c----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCC-CCc Q lcl|NC_019719. 254 ---PVKK------RLWI-L----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-STS 310 (424) Q Consensus 254 ---~~~g------~~~~-l----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~-~~~ 310 (424) ...| +.+. + ..|.+++.|.. +..+++- ..+..+.+.++++||.+-|..... -+. T Consensus 307 VYDa~TGevrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~D---V~YF~kKLY~aLnVP~SRl~~e~~~f~~ 383 (564) T protein:vir:10 307 VYDGQTGEIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKD---VEYFKKKLYNSLNLPPSRLTDDNKAFNL 383 (564) T ss_pred EEeccCceecccchhhhhHhhhcccccCCCcccceeeccccCCcchHHH---HHHHHHHHHHHhCCCcccccCCCceeec Confidence 0011 1111 1 13455655543 4444444 445588899999999999975422 111 Q ss_pred cchhHH----H-HHHHHHHHHHHHHHHHHHHHHHhhcc-----Ccccccc--ceeeec--chhh----hccC-HHHHHHH Q lcl|NC_019719. 311 WGSGIE----Q-QNLGFLQYTLQPYISRWENSIQRWLI-----PAKDVGR--IHAEHN--LDGL----LRGD-SASRAAF 371 (424) Q Consensus 311 ~~~n~e----~-~~~~~~~~tl~P~~~~ie~~l~~~l~-----~~~~~~~--~~~~fd--~~~l----~~~d-~~~~~~~ 371 (424) . ...| + ...-|+..-=.-+...|.+.|...|+ ++.++.. ..++|+ .|.. .... ...|... 
T Consensus 384 G-r~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~ 462 (564) T protein:vir:10 384 G-KSTEILRDELKFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNL 462 (564) T ss_pred c-cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHH Confidence 1 1111 1 11222222222344444555555543 4455432 333333 3222 1111 1123333 Q ss_pred HHHHHh--CCCCCH------------HHHHHH---------hCC--CCC--CCCCeeee-cccccchhhcc------ccC Q lcl|NC_019719. 372 MKAMGE--AGLRTI------------NEMRRT---------DNL--PPL--PGGDVAMR-QSQYVPITDLG------TNK 417 (424) Q Consensus 372 ~~~~~~--~g~~T~------------NE~R~~---------~G~--~p~--~~gd~~~~-~~n~~~~~~~~------~~~ 417 (424) +..+-. +-+++. +|+-++ .|+ +|. ..||..-+ +..+.|..... ..+ T Consensus 463 l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~ 542 (564) T protein:vir:10 463 ATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQNQAFAPELQAAQDDLAAERE 542 (564) T ss_pred HHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCCCcCCcchhhhccccccccC Confidence 332210 112233 333211 122 332 12432222 22223332211 111 Q ss_pred CCcccCC Q lcl|NC_019719. 418 EPRNNGA 424 (424) Q Consensus 418 ~~~~~ga 424 (424) ....++| T Consensus 543 ~~~~~~a 549 (564) T protein:vir:10 543 IKKLNSA 549 (564) T ss_pred hhhhccC Confidence 1111222 No 226 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=89.52 E-value=0.026 Score=29.28 Aligned_cols=371 Identities=14% Similarity=0.100 Sum_probs=147.9 Q ss_pred CCCCcccccCCCCCch--------HHHHHhhccCcccCc-ccccccccccc--cccccCcccccHHHHhhhHHHHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGW--------WARLQSWFVGGRLVT-PNQGSQTGPVS--AHGHLGDSSINDERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~--------~~~l~~~~~~~~~~~-~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~v~~~i~~ 69 (424) |+| +.|+ |++|++.-. .+... ..-..+..|.. 
..+..+... ..... .++-..|++. T Consensus 1 ~~~---------~~~~~~~~~~~r~~~l~~~R~-~~e~~w~e~~~y~lP~~~~~~~~~~~~~--~~~~~-dst~~~a~~~ 67 (522) T protein:vir:94 1 MAE---------REGFAAEGAKAVYDRLKNGRQ-PYETRAQNCAAVTIPSLFPKESDNSSTE--YTTPW-QAVGARCLNN 67 (522) T ss_pred Ccc---------cchhhHHHHHHHHHHHHHHhh-HHHHHHHHHHHHhcccccCCCCCccccc--ccccc-cccHHHHHHH Confidence 554 4555 444433210 00000 00000111110 000001100 01112 3344456666 Q ss_pred HHHhhccC-----ceEEEEecccCcc---ccccccch-----------hhhhhccCCCCCCCHHHHHHHHHHHHHHcCCe Q lcl|NC_019719. 70 ISTLTACL-----PLDVFETDQNDNR---KKVDLSNP-----------LARLLRYSPNQYMTAQEFREAMTMQLCFYGNA 130 (424) Q Consensus 70 ia~~ia~~-----~~~v~~~~~~~~~---~~~~~~~~-----------l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a 130 (424) +|+.+-+. ||.=....+.... .......+ +...|. +- +.+.-+..+..++..+||+ T Consensus 68 Las~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~-~s----nf~~~~~~~~~~L~~~G~a 142 (522) T protein:vir:94 68 LAAKLMLALFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYME-TN----SFRVPLFEALKQLIVSGNC 142 (522) T ss_pred HHHHHHhhcCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHH-hc----CcHHHHHHHHHHHHhhCcE Confidence 66665442 4422211110000 00000011 111221 22 3455566778889999999 Q ss_pred EEEEeeCCCCcee--eEEeecCceEEEEEcCCceE-----------------------------------EE--EEecCc Q lcl|NC_019719. 131 YALVDRNSAGDVI--SLLPLQSANMDVKLVGKKVV-----------------------------------YR--YQRDSE 171 (424) Q Consensus 131 ~~~~~r~~~G~~~--~l~~l~~~~v~~~~~~~~~~-----------------------------------~~--~~~~~~ 171 (424) .+++..+..|.+. ..||+.. +.+..|..+.+ |. +...+. T Consensus 143 ~l~~~~~~~~~~~~~~~~pl~~--y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~ 220 (522) T protein:vir:94 143 LLYIPEPEQGTYSPMRMYRLVS--YVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDDE 220 (522) T ss_pred eEeeeccCCCceeeEEEEEcce--EEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEeeCCc Confidence 9998877766554 4566643 33333322211 00 001111 Q ss_pred eE---EecHh--------------HeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC Q lcl|NC_019719. 
172 YA---DFSQK--------------EIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK 233 (424) Q Consensus 172 ~~---~~~~~--------------evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~ 233 (424) .. .+... =.+..|+...++ .||.||...+...+.....+.+.......-...|..++. +++ T Consensus 221 ~~~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~-~~g 299 (522) T protein:vir:94 221 YLRYEEVEGIEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVN-PNG 299 (522) T ss_pred eeEEeeccCceecccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec-ccc Confidence 00 00000 122334433344 899999999999999999999999998888888886554 333 Q ss_pred CCCHHHHHHHHHHHHHHhCCcccCccee--cCCCceeeecccChhHHHH-HHHHHHHHHHHHHHhCCCHHHhcCCCCCCc Q lcl|NC_019719. 234 VLTEQQRSQVEENFKEIAGGPVKKRLWI--LEAGFSTSAIGVTPQDAEM-MASRKFQVSELARFFGVPPHLVGDVEKSTS 310 (424) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~--l~~g~~~~~l~~~~~d~~~-~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~ 310 (424) ....... ..+.+ | .++ -++++...++... .+.+. .+..+.....|..+|-+.. +...+.... T Consensus 300 ~~~~~~~----------~~~~~-g-~~v~g~~~~v~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~~r~ 364 (522) T protein:vir:94 300 ITQPRRL----------NKAAT-G-EFVAGRVEDINFLQLTKG-QDFTIAKSVADAIEQRLGWAFLLNS--AVQRNAERV 364 (522) T ss_pred cccchhe----------eccCC-c-eeecCCcccceeeecccc-cchhHHHHHHHHHHHHHHHHHhhhh--hccCCCccc Confidence 3333211 11111 1 111 1233444454432 23332 3455666777888886652 222221211 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHhh-------------ccCccccccceeeecchhhhccCHHHHHHHHHHHHh Q lcl|NC_019719. 311 WGSGIEQQNLGFLQYTLQPYISRWENSIQRW-------------LIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGE 377 (424) Q Consensus 311 ~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~-------------l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~ 377 (424) + +.--..+..-....|.|....++++|-.- ++++..... ++.++.+.+. 
...|..-+.++.+ T Consensus 365 T-AtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~--v~v~~~s~La--~~qr~~~~~~l~~ 439 (522) T protein:vir:94 365 T-AEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDLPKEA--VEPTVSTGLE--ALGRGQDLEKLTQ 439 (522) T ss_pred c-HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccc--EEeeEecHHH--HHHHHHHHHHHHH Confidence 1 11123334555566667666666665422 222222111 2222222211 1122222222222 Q ss_pred CCCCCHHHHHHHhCCCCCCCCCeeeecccccchhh-ccccCCCcccCC Q lcl|NC_019719. 378 AGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITD-LGTNKEPRNNGA 424 (424) Q Consensus 378 ~g~~T~NE~R~~~G~~p~~~gd~~~~~~n~~~~~~-~~~~~~~~~~ga 424 (424) ..+.+ -.+.|.. .+. ..|+-.+-+ ..+ .-+-+... T Consensus 440 ----~~~~i---a~l~P~~-~~~---~id~d~~~~~~a~-~~Gv~~~~ 475 (522) T protein:vir:94 440 ----AVNMM---TGLQPLS-QDP---DINLPTLKLRLLN-ALGIDTAG 475 (522) T ss_pred ----HHHHH---Hhccchh-hhh---cCCHHHHHHHHHH-HcCCChhh Confidence 01211 1222310 110 112111101 111 00111111 No 227 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=87.98 E-value=0.035 Score=28.55 Aligned_cols=402 Identities=8% Similarity=0.032 Sum_probs=176.4 Q ss_pred ccCCCCCchHHHHHhh--------cc--CcccCccccccccc--------ccccc---ccc---Cccc------c-cHHH Q lcl|NC_019719. 8 IDLRTNNGWWARLQSW--------FV--GGRLVTPNQGSQTG--------PVSAH---GHL---GDSS------I-NDER 56 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~--------~~--~~~~~~~~~~~~~~--------~~~~~---~~~---~~~~------~-~~~~ 56 (424) |.| .-..+|...... +. ....+.|.....+. ++.+. +.+ .+.. + .-+. 
T Consensus 1 m~~-~~l~lf~f~~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~ 79 (521) T protein:vir:10 1 MNP-IFLKLLQPWMKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRS 79 (521) T ss_pred CCc-chhHHhhhhhhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHH Confidence 555 222222222110 00 11112222111110 00000 000 0000 1 1245 Q ss_pred HhhhHHHHHHHHHHHHhhccC-----ceEEEEecccC-ccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCe Q lcl|NC_019719. 57 ILQISTVWRCVSLISTLTACL-----PLDVFETDQND-NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNA 130 (424) Q Consensus 57 ~~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~~~-~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a 130 (424) .+.+|.|..||+.|.+.+.-. |+.+--...+. ...+........++|+ --|-...++ .++..|...|.. T Consensus 80 ma~~pEvd~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~~----~~fR~WYVDgRi 154 (521) T protein:vir:10 80 LSKYHEVDNAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILK-LLKFEREGK----RHFRRWYVDSRI 154 (521) T ss_pred HhhccchhhHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhheeeeeE Confidence 567899999999999987643 23332111111 1111112223333343 122223344 445666778998 Q ss_pred EEEEeeCC---CCceeeEEeecCceEEEEEc-----CCc--------eEEEEEe--------c---CceEEecHhHeeEe Q lcl|NC_019719. 131 YALVDRNS---AGDVISLLPLQSANMDVKLV-----GKK--------VVYRYQR--------D---SEYADFSQKEIFHL 183 (424) Q Consensus 131 ~~~~~r~~---~G~~~~l~~l~~~~v~~~~~-----~~~--------~~~~~~~--------~---~~~~~~~~~evih~ 183 (424) |..++-+. ..-+.+|..|+|.+++..+. .++ .+|.|.. + +....++.+-|.|. 
T Consensus 155 ~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~~g~~~~~vkI~~daI~y~ 234 (521) T protein:vir:10 155 YFHKMIDPARPKDGIKELRLLDPRNVEYYRVNLKSNENGNDVYKGVKEFFTYGATEDNRYNISGNSNNLVQIPIDAIVYS 234 (521) T ss_pred EEEEEeeCCCccccceeeeeeCCcceeeeeeecCCCCCcchhhccceeeeeeccCCCceecCCCCCCcceeechhheeee Confidence 98876543 23589999999999976542 111 1233321 1 12245777666665 Q ss_pred cc--CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCC------- Q lcl|NC_019719. 184 KG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGG------- 253 (424) Q Consensus 184 r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~------- 253 (424) .. .+.++-+.+|-+..|.+.+.....++....-+----+.-+-|+..+-+.. +..+.+-++..+.++..- T Consensus 235 hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~T 314 (521) T protein:vir:10 235 HSGKVDIDGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSST 314 (521) T ss_pred cccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccC Confidence 52 34566778888999888888777777666544333344445555555443 334455555555544321 Q ss_pred ---cccCcceec-----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCC-CCccchhHH Q lcl|NC_019719. 254 ---PVKKRLWIL-----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-STSWGSGIE 316 (424) Q Consensus 254 ---~~~g~~~~l-----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~-~~~~~~n~e 316 (424) .+..+.+.+ ..|.+++.|.. +..+++- ..+..+.+.++++||.+-|..... -+... 
.+| T Consensus 315 Gev~ddrk~msMlEDyWLpRReGgrgTEI~TLpggqnlgem~D---V~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr-~~E 390 (521) T protein:vir:10 315 GKVKNSSNNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGEMDD---VRWFNRKLYESMKIPLSRLPQEGAGVTFGA-GND 390 (521) T ss_pred ceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHH---HHHHHHHHHHHhCCCccccCCCCCceeccc-ccc Confidence 011111111 13455665543 3344443 445588899999999999865422 11111 111 Q ss_pred ----H-HHHHHHHHHHHHHHHHHHHHHHhhcc-----Ccccccc--ceeeecc--hhh----hccC-HHHHHHHHHHHH- Q lcl|NC_019719. 317 ----Q-QNLGFLQYTLQPYISRWENSIQRWLI-----PAKDVGR--IHAEHNL--DGL----LRGD-SASRAAFMKAMG- 376 (424) Q Consensus 317 ----~-~~~~~~~~tl~P~~~~ie~~l~~~l~-----~~~~~~~--~~~~fd~--~~l----~~~d-~~~~~~~~~~~~- 376 (424) + ...-|+..-=.-+...|.+.|...|+ ++.++.. ..+.|++ +.. .... ...|...++.+- T Consensus 391 ItRDEikF~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp 470 (521) T protein:vir:10 391 ITRDELQFTKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLAS 470 (521) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcC Confidence 1 11122222222344444455555543 4455432 3333332 222 1111 123444444442 Q ss_pred ---hCCCCCHHHHHH-HhCCCCCC--CCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 377 ---EAGLRTINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 377 ---~~g~~T~NE~R~-~~G~~p~~--~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) -+-+++.+=+|+ .|.+.-.+ .-++.... . .+.+--+++++.-. 
T Consensus 471 ~~yvGky~s~dyi~k~ILr~tDeeik~~~k~I~~----E-~~~~~~~~p~~e~~ 519 (521) T protein:vir:10 471 AEVTGKYLSHEYVMKNILRMSDEDIKTEREKIDG----E-LKDSVYKNPEDPME 519 (521) T ss_pred ccccccccchHHHHHHHhcCCHhHHHHHHHHHHH----h-hhCCCCCCCcchhh Confidence 122566666654 34443110 00000000 0 00000001111100 No 228 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=87.23 E-value=0.04 Score=28.24 Aligned_cols=374 Identities=11% Similarity=0.025 Sum_probs=162.2 Q ss_pred ccCCCCCchHHHH-------HhhccCcccCcccccccccccccccccCccccc-HHHHhh----hHHHHHHHHHHHHhhc Q lcl|NC_019719. 8 IDLRTNNGWWARL-------QSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSIN-DERILQ----ISTVWRCVSLISTLTA 75 (424) Q Consensus 8 ~~~~~~~G~~~~l-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~----~~~v~~~i~~ia~~ia 75 (424) |-+.++.--+... +....|......... .......++.-. -+.+++ .+++...++.++..+. T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~------~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf 74 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLGQREVKKKGV------RFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVL 74 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCc------ccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhh Confidence 7777776444443 333332211000000 001111122111 122233 3455566666666665 Q ss_pred cCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeeEEeecCceEEE Q lcl|NC_019719. 
76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDV 155 (424) Q Consensus 76 ~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~~~~v~~ 155 (424) +-|..+- ....+.++..+ ....+-.+|.+.++...+.+|-+++++.....|.-..+..++|..|.= T Consensus 75 ~k~p~~~------------~p~~l~~~~~D--~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii~ 140 (452) T protein:vir:94 75 DQPPVIT------------HPDAMSKYFED--QSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENILN 140 (452) T ss_pred cCCceec------------ccHHHHHHHhc--ccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhcC Confidence 5555431 11223333222 446789999999999999999999999887776433444444433321 Q ss_pred EE-------------------cC-Cc----e--EEEE---EecCceEE-ec---------HhHeeEec------------ Q lcl|NC_019719. 156 KL-------------------VG-KK----V--VYRY---QRDSEYAD-FS---------QKEIFHLK------------ 184 (424) Q Consensus 156 ~~-------------------~~-~~----~--~~~~---~~~~~~~~-~~---------~~evih~r------------ 184 (424) .. ++ +. . .|+. ..+...+. +. ..+..+-. T Consensus 141 W~~~~~g~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP~v~ 220 (452) T protein:vir:94 141 WEEDEDGRLLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQNVGVTMDYIPFFC 220 (452) T ss_pred ccccccCCeeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecCCCcccceeEEEE Confidence 11 11 00 0 0110 01100000 00 01111111 Q ss_pred --cCCCCccccCchHHHHHHH-HHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCccee Q lcl|NC_019719. 185 --GFGFTGLVGLSPIAFACKS-AGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWI 261 (424) Q Consensus 185 --~~~~~~~~G~s~~~~~~~~-i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 261 (424) ..+.+...|.+|+..++.. +........+.. .+...+.|-.++.-.... + .-..|. +.++. 
T Consensus 221 ~~~~~~~~~~~~pPLl~LA~ln~~hy~~~sd~~~-~l~~~~~P~l~~~g~~~~-~-----------~i~iG~---~~~~~ 284 (452) T protein:vir:94 221 ITPSGLSMTPAKPPMIDIVDINYSHYRTSADLEH-GRHFTGLPTPWITGAESQ-S-----------TMHIGS---TKAWV 284 (452) T ss_pred EcCCCCCCCCCccchHHHHHHHHHHhcchhHHHH-HHHHcccceeEeecCcCC-C-----------ceEecc---ccccc Confidence 0112334678887765433 333333334333 344556776666532211 1 113332 24567 Q ss_pred cCC-Cc--eeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019719. 262 LEA-GF--STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSI 338 (424) Q Consensus 262 l~~-g~--~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l 338 (424) ++. |. .|.+.+.+.-.+. .+..+...+++ ...|- .++-....++.+. .......+-.+..|.-++.++|+.+ T Consensus 285 lpe~~~~~~yie~~g~~i~~~-~~~l~~le~~m-~~~Ga--~ll~~~~~~~~s~-ea~~~~~~~~~s~L~~~a~~~e~al 359 (452) T protein:vir:94 285 IPEVAAKVGFLEFTGQGLQSL-EKALSEKQAQL-ASLSA--RLIDNSTRGSEAT-ETVKLRYMSETASLKSVTRAVEALL 359 (452) T ss_pred CCCCCCcceEEccCchhHHHH-HHHHHHHHHHH-HHHHH--HhhccCCCcchHH-HHHHHHHHHhhHHHHHHHHHHHHHH Confidence 774 64 4556555443322 12222222222 11121 2232212111111 1112223334577777888888887 Q ss_pred HhhccC--cc--ccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh---CCCCCCCCCeeeecccccchh Q lcl|NC_019719. 339 QRWLIP--AK--DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTD---NLPPLPGGDVAMRQSQYVPIT 411 (424) Q Consensus 339 ~~~l~~--~~--~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~---G~~p~~~gd~~~~~~n~~~~~ 411 (424) ++-|-- .. ......++.+.+=+.........+.+.+++..|.++....++.+ |....+.-++... .-.+.. 
T Consensus 360 ~~~l~~~a~w~g~~~~~~v~~n~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~~e~~~i~--~E~~~~ 437 (452) T protein:vir:94 360 NKAYSCIMDMESMGGTLNIKLNSAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLPPPGESMGVI--PDPPAP 437 (452) T ss_pred HHHHHHHHHHcCCCCceEEEeccccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCccCHHHHH--HHhhcc Confidence 654321 10 11123343333322333234456666778999999998888877 5543221111100 011111 Q ss_pred hccccCCCcccCC Q lcl|NC_019719. 412 DLGTNKEPRNNGA 424 (424) Q Consensus 412 ~~~~~~~~~~~ga 424 (424) ......++-++|+ T Consensus 438 ~~~~~~~~~~~~~ 450 (452) T protein:vir:94 438 EPSPSNTPPNPSS 450 (452) T ss_pred CcccCCCCCCCcc Confidence 2222334444444 No 229 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=82.98 E-value=0.072 Score=26.84 Aligned_cols=385 Identities=11% Similarity=0.040 Sum_probs=172.0 Q ss_pred ccCCCCCchHHHHHhhcc-----------C--cccCcccccccccc---------ccccc-c-cC---ccc------c-c Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFV-----------G--GRLVTPNQGSQTGP---------VSAHG-H-LG---DSS------I-N 53 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~-----------~--~~~~~~~~~~~~~~---------~~~~~-~-~~---~~~------~-~ 53 (424) |.+-.-..+++-++.|-. . ...+.|.....+.. .++.. . +. +.. + . T Consensus 1 ~~~~~~~~~l~~~~~~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~~~~~eLI~~ 80 (524) T protein:vir:98 1 MNFLGFGNVLSFFKNFAREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAIQNKEQLINT 80 (524) T ss_pred CCCcchhhHHHHhhhhhhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccccchHHHHHHH Confidence 443333333332222211 0 11112221111100 00000 0 11 100 1 1 Q ss_pred HHHHhhhHHHHHHHHHHHHhhccC-----ceEEEEeccc-CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHc Q lcl|NC_019719. 
54 DERILQISTVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFY 127 (424) Q Consensus 54 ~~~~~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~ 127 (424) -+..+.+|.|..||+.|.+.+.-. |+.+--.+.+ +...+........++|+ --+-...++ .++..|... T Consensus 81 YR~ma~~pEvd~Av~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~~----~~fR~WYVD 155 (524) T protein:vir:98 81 YRGIMSYPEVENAVSEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLN-IYDFDNMGA----RLFRDWYVD 155 (524) T ss_pred HHHHhhccchhhHHHhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhhhhc Confidence 244567899999999999987532 3332111111 01101111222333332 122223343 445666788 Q ss_pred CCeEEEEeeCCCCc--eeeEEeecCceEEEEE-------cCCce-------EEEEEe-------------cCceEEecHh Q lcl|NC_019719. 128 GNAYALVDRNSAGD--VISLLPLQSANMDVKL-------VGKKV-------VYRYQR-------------DSEYADFSQK 178 (424) Q Consensus 128 G~a~~~~~r~~~G~--~~~l~~l~~~~v~~~~-------~~~~~-------~~~~~~-------------~~~~~~~~~~ 178 (424) |..|..++-+.+.. +.+|..|+|.+++..+ +++.. +|.|.. .+....++.+ T Consensus 156 gRi~fhkiid~~~~kGI~ELr~lDPr~i~~vr~~~~~~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~d 235 (524) T protein:vir:98 156 SRIYFHKIMHKDESKGIRELRQLDPRCMELIRESITETLDGGVKVFRGYREFFVYSAPKAGYTYNGQIYQANQKIKIPRS 235 (524) T ss_pred ceeEEEEEEcCCCCcceeeeeeeCCccceeeeeccccccccchhhccceeeeeeeccCCCccccccceecCCCceeechh Confidence 99999888665443 8999999999997654 22211 233321 1233567788 Q ss_pred HeeEeccC--CCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhC--- Q lcl|NC_019719. 179 EIFHLKGF--GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAG--- 252 (424) Q Consensus 179 evih~r~~--~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~--- 252 (424) -|.|...- +.++- =+|-+..+.+.+.....++....-+----+.-+-|+..+-+.. +..+.+-++....++.. 
T Consensus 236 AIvy~hSGL~d~~~~-iisyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklv 314 (524) T protein:vir:98 236 AIVYAHSGLEDCSNN-IIGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVV 314 (524) T ss_pred heeeeccCcccCCCC-eeeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeE Confidence 88887531 22221 1577777777777666666655444333343445666555543 44555666666666551 Q ss_pred -----C--cccCccee-c----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCC-CCcc Q lcl|NC_019719. 253 -----G--PVKKRLWI-L----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-STSW 311 (424) Q Consensus 253 -----~--~~~g~~~~-l----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~-~~~~ 311 (424) | .+..+.+. + ..|.+++.|.. +..+++- ..+..+.+.++++||.+-|...++ -+.. T Consensus 315 YDa~TGevrddrk~msMlEDyWLpRReGgrgTEItTLpggqnlgem~D---V~YF~kkLy~aLnVP~sRl~~~~~~f~~G 391 (524) T protein:vir:98 315 YDARTGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGGQNFSDMDD---IKWFNRKLYEALRVPLSRMPRDDGGMQIG 391 (524) T ss_pred eeccCceeeccccccchhhhhcccccCCCCccceeeccccCCcChHHH---HHHHHHHHHHHhCCCceeccCCCCccccc Confidence 1 11112222 1 13556666543 3444444 445588899999999998864322 1111 Q ss_pred ch----hHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----Cccccc--cceeeecc--hhh----hccC-HHHHHHHHH Q lcl|NC_019719. 312 GS----GIEQQNLGFLQYTLQPYISRWENSIQRWLI-----PAKDVG--RIHAEHNL--DGL----LRGD-SASRAAFMK 373 (424) Q Consensus 312 ~~----n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~-----~~~~~~--~~~~~fd~--~~l----~~~d-~~~~~~~~~ 373 (424) .+ .-|=...-|+..-=.-+...|.+.|...|+ ++.++. ...+.|++ +.. .... 
...|...++ T Consensus 392 r~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~ 471 (524) T protein:vir:98 392 GGGEITRDELKFSKFIRTLQIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMS 471 (524) T ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHH Confidence 11 111111222222222344444555555543 445443 22333333 222 1111 112333333 Q ss_pred HHHh--CCCCCHHHHHH-HhCCC----------------------CCCCCCee Q lcl|NC_019719. 374 AMGE--AGLRTINEMRR-TDNLP----------------------PLPGGDVA 401 (424) Q Consensus 374 ~~~~--~g~~T~NE~R~-~~G~~----------------------p~~~gd~~ 401 (424) .+-. +-+++.+=+|+ .|.+. |-++.+.+ T Consensus 472 ~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~p~~e~~~f 524 (524) T protein:vir:98 472 QVEGVVGKYVSHKYIMKEILRMSDEDIDEQAKLIEEESKEERFKNPEAEEENF 524 (524) T ss_pred HhccccccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCcCCccccccC Confidence 2221 12455554443 22221 11112222 No 230 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=82.07 E-value=0.08 Score=26.60 Aligned_cols=383 Identities=10% Similarity=0.056 Sum_probs=169.4 Q ss_pred CCCCCchHHHHHhhccCc--------------ccCccccccccccc---------cccc-----ccCccc------c-cH Q lcl|NC_019719. 10 LRTNNGWWARLQSWFVGG--------------RLVTPNQGSQTGPV---------SAHG-----HLGDSS------I-ND 54 (424) Q Consensus 10 ~~~~~G~~~~l~~~~~~~--------------~~~~~~~~~~~~~~---------~~~~-----~~~~~~------~-~~ 54 (424) .-|=+.+++ |++++.+. +.+.|.....+..+ ++.. ...+.. 
+ .- T Consensus 1 ~~~~~~~~~-lf~f~~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~Y 79 (524) T protein:vir:10 1 MANFNTILS-FLKPWANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTY 79 (524) T ss_pred CCchhhHHH-HhhhhhcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHH Confidence 222223333 33333221 11222211111000 0000 001100 1 12 Q ss_pred HHHhhhHHHHHHHHHHHHhhccC-----ceEEEEeccc-CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcC Q lcl|NC_019719. 55 ERILQISTVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYG 128 (424) Q Consensus 55 ~~~~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G 128 (424) +..+.+|.|..||+.|.+.+.-. |+.+--.+.+ +...+........++|+ --+-...++ .++..|...| T Consensus 80 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~-ll~F~~~~~----~~fR~WYVDg 154 (524) T protein:vir:10 80 RNLMNNYEVDNAVQEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLN-LLNFQRKGT----DHFQRWYVDS 154 (524) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhheeec Confidence 44567899999999999887643 3333211111 01101111222333332 122223343 4456667789 Q ss_pred CeEEEEeeCC---CCceeeEEeecCceEEEEEc-----CCce--------EEEEE-------------ecCceEEecHhH Q lcl|NC_019719. 129 NAYALVDRNS---AGDVISLLPLQSANMDVKLV-----GKKV--------VYRYQ-------------RDSEYADFSQKE 179 (424) Q Consensus 129 ~a~~~~~r~~---~G~~~~l~~l~~~~v~~~~~-----~~~~--------~~~~~-------------~~~~~~~~~~~e 179 (424) ..|..++-+. ..-+.+|..|+|.+++..+. .++. +|.|. ..+....++.+- T Consensus 155 Ri~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~~dA 234 (524) T protein:vir:10 155 RIFFHKIINPKKMKDGVQELRRLDPRQVQYIREIVTRMEDGVKIVDGYREFFVYDTGHESYCADGRIYSAGTKVKIPRAA 234 (524) T ss_pred eEEEEEEeeCCCccccceeeeeeCCccceeeeeecccCcccchhhcchhhheeecCCCcccccCcceecCCcceecchhh Confidence 9898876553 33589999999999976432 1111 22232 122345678888 Q ss_pred eeEecc--CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhC---- Q lcl|NC_019719. 
180 IFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAG---- 252 (424) Q Consensus 180 vih~r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~---- 252 (424) |.|... .+.++-.-+|-+..+.+.+.....++....-+----+.-+-|+..+-+.. +..+.+-++....++.. T Consensus 235 Ivy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvY 314 (524) T protein:vir:10 235 VVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVY 314 (524) T ss_pred eeeeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEE Confidence 888753 23333344677777777777666666655444333343445555554443 33444555555544432 Q ss_pred ----C--cccCcceec-----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccch Q lcl|NC_019719. 253 ----G--PVKKRLWIL-----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGS 313 (424) Q Consensus 253 ----~--~~~g~~~~l-----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~ 313 (424) | .+..+.+.+ ..|.+++.|.. +..+++- ..+..+.+.++++||.+-|...+++..+.. T Consensus 315 Da~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kkLy~aLnVP~sRl~~e~~~~f~~g 391 (524) T protein:vir:10 315 DASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGMSDMDD---VLYFRTALYRALRIPESRIPSESNSGVMFD 391 (524) T ss_pred eccCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHH---HHHHHHHHHHHhCCCchhccCCCCcccccc Confidence 1 011111111 13455655543 3444443 445588899999999999954332222221 Q ss_pred hHHHHHH------HHHHHHHHHHHHHHHHHHHhhcc-----Ccccccc--ceeeecc--hhh----hccC-HHHHHHHHH Q lcl|NC_019719. 314 GIEQQNL------GFLQYTLQPYISRWENSIQRWLI-----PAKDVGR--IHAEHNL--DGL----LRGD-SASRAAFMK 373 (424) Q Consensus 314 n~e~~~~------~~~~~tl~P~~~~ie~~l~~~l~-----~~~~~~~--~~~~fd~--~~l----~~~d-~~~~~~~~~ 373 (424) ...+..+ -|+..-=.-+...|.+.|...|+ ++.++.. ..+.|++ +.. .... 
...|...++ T Consensus 392 r~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~ 471 (524) T protein:vir:10 392 AGTAITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLT 471 (524) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHH Confidence 2222211 22222222344444555555543 4455432 3333333 222 1111 112333333 Q ss_pred HHHh--CCCCCHHHHHH-HhCCC----------------------CCCCCCee Q lcl|NC_019719. 374 AMGE--AGLRTINEMRR-TDNLP----------------------PLPGGDVA 401 (424) Q Consensus 374 ~~~~--~g~~T~NE~R~-~~G~~----------------------p~~~gd~~ 401 (424) .+-. +-.++.+=+|+ .|.+. |.+..+.+ T Consensus 472 ~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 472 MAEPFIGKYISHQTAMKDFLQMTDEEINQEAKQIEEESKEARFQNPDEEEEDF 524 (524) T ss_pred HhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCChhhhcC Confidence 2221 11334444443 22221 11122222 No 231 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=79.22 E-value=0.11 Score=25.91 Aligned_cols=402 Identities=10% Similarity=0.085 Sum_probs=173.1 Q ss_pred ccCCCCCchHHHHHhh-------ccCcccCccccccccccc---------ccc----cccCccc------c-cHHHHhhh Q lcl|NC_019719. 8 IDLRTNNGWWARLQSW-------FVGGRLVTPNQGSQTGPV---------SAH----GHLGDSS------I-NDERILQI 60 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~-------~~~~~~~~~~~~~~~~~~---------~~~----~~~~~~~------~-~~~~~~~~ 60 (424) |.+-.=-|+|.+.... -+....+.|.....+..+ ++. -...+.. 
+ .-+..+.+ T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 3333223443332221 011122223222111100 100 0001111 1 12455678 Q ss_pred HHHHHHHHHHHHhhccC-----ceEEEEeccc-CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEE Q lcl|NC_019719. 61 STVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV 134 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~ 134 (424) |.|..||+.|.+.+.-. |+.+--...+ +...+........++|+ --|-...++ .++..|...|..|..+ T Consensus 81 pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~-ll~F~~~~~----~~fR~WYVDgRi~fhK 155 (516) T protein:vir:10 81 PEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCR-LLDASRKLD----TLFRRWYVDSRIFFHK 155 (516) T ss_pred cchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhhhhcceEEEEE Confidence 99999999999987643 3332211111 11111111222333333 122223344 4455666788888775 Q ss_pred ee-CCCCceeeEEeecCceEEEEEcC-----Cc--------eEEEEEecC-------------ceEEecHhHeeEecc-- Q lcl|NC_019719. 135 DR-NSAGDVISLLPLQSANMDVKLVG-----KK--------VVYRYQRDS-------------EYADFSQKEIFHLKG-- 185 (424) Q Consensus 135 ~r-~~~G~~~~l~~l~~~~v~~~~~~-----~~--------~~~~~~~~~-------------~~~~~~~~evih~r~-- 185 (424) +. +...-+.+|..|+|.+++..+.- ++ .+|.|..+. ....++.+-|.|... 
T Consensus 156 iid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL 235 (516) T protein:vir:10 156 IMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGL 235 (516) T ss_pred EecCccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccc Confidence 44 44456899999999999876531 11 123333211 223444444444431 Q ss_pred CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCC----cccC--- Q lcl|NC_019719. 186 FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGG----PVKK--- 257 (424) Q Consensus 186 ~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~----~~~g--- 257 (424) .+.++-.-+|-+..|.+.+.....++....-+----+.-+-|+..+-+.. +..+.+-++....++..- .+.| T Consensus 236 ~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ 315 (516) T protein:vir:10 236 MDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVK 315 (516) T ss_pred eeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeec Confidence 22233333777888877777766666655444333343445555554443 334455555555554421 0111 Q ss_pred ---cceec-----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHH- Q lcl|NC_019719. 258 ---RLWIL-----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL- 320 (424) Q Consensus 258 ---~~~~l-----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~- 320 (424) +.+.+ ..|.+++.|.. 
+..+++- ..+..+.+.++++||.+-|...++.+...+...+..+ T Consensus 316 ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRD 392 (516) T protein:vir:10 316 NQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDD---VRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRD 392 (516) T ss_pred cchhhhhhHhhhcccccCCCCccceeeccccCCcChHHH---HHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHH Confidence 11111 13455655543 3444444 4455888999999999999754443321111112222 Q ss_pred -----HHHHHHHHHHHHHHHHHHHhhcc-----Ccccccc--ceeeecc--hhh----hccC-HHHHHHHHHHHH--hCC Q lcl|NC_019719. 321 -----GFLQYTLQPYISRWENSIQRWLI-----PAKDVGR--IHAEHNL--DGL----LRGD-SASRAAFMKAMG--EAG 379 (424) Q Consensus 321 -----~~~~~tl~P~~~~ie~~l~~~l~-----~~~~~~~--~~~~fd~--~~l----~~~d-~~~~~~~~~~~~--~~g 379 (424) -|+..-=.-+...|.+.|...|+ ++.++.. ..+.|++ +.. .... ...|...++.+- -+. T Consensus 393 EiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGk 472 (516) T protein:vir:10 393 ELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGK 472 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 22222222344444455555543 4455432 3333333 222 1111 123444443332 234 Q ss_pred CCCHHHHHH-HhCCCCCC--CCCeeeecccccchhhcccc--CCCcccCC Q lcl|NC_019719. 380 LRTINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 380 ~~T~NE~R~-~~G~~p~~--~gd~~~~~~n~~~~~~~~~~--~~~~~~ga 424 (424) +++.+=+|+ .|.+.-.+ ..++... ....+. 
+.|++..- T Consensus 473 y~s~~yi~k~ILr~tDeei~~e~k~I~-------~E~~~~~~~~p~~~~~ 515 (516) T protein:vir:10 473 YVSHDYVMKNILQMTEEQIAQEEKQIE-------QEAGIKRFQNPENEDD 515 (516) T ss_pred ccchHHHHHHHhcCCHhhHHHHHHHHH-------HhhhCCCCCCCCcccc Confidence 667776665 34443210 0000000 000000 11111100 No 232 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=79.22 E-value=0.11 Score=25.91 Aligned_cols=402 Identities=10% Similarity=0.085 Sum_probs=173.1 Q ss_pred ccCCCCCchHHHHHhh-------ccCcccCccccccccccc---------ccc----cccCccc------c-cHHHHhhh Q lcl|NC_019719. 8 IDLRTNNGWWARLQSW-------FVGGRLVTPNQGSQTGPV---------SAH----GHLGDSS------I-NDERILQI 60 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~-------~~~~~~~~~~~~~~~~~~---------~~~----~~~~~~~------~-~~~~~~~~ 60 (424) |.+-.=-|+|.+.... -+....+.|.....+..+ ++. -...+.. + .-+..+.+ T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 3333223443332221 011122223222111100 100 0001111 1 12455678 Q ss_pred HHHHHHHHHHHHhhccC-----ceEEEEeccc-CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEE Q lcl|NC_019719. 61 STVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV 134 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~ 134 (424) |.|..||+.|.+.+.-. 
|+.+--...+ +...+........++|+ --|-...++ .++..|...|..|..+ T Consensus 81 pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~-ll~F~~~~~----~~fR~WYVDgRi~fhK 155 (516) T protein:vir:10 81 PEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCR-LLDASRKLD----TLFRRWYVDSRIFFHK 155 (516) T ss_pred cchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhhhhcceEEEEE Confidence 99999999999987643 3332211111 11111111222333333 122223344 4455666788888775 Q ss_pred ee-CCCCceeeEEeecCceEEEEEcC-----Cc--------eEEEEEecC-------------ceEEecHhHeeEecc-- Q lcl|NC_019719. 135 DR-NSAGDVISLLPLQSANMDVKLVG-----KK--------VVYRYQRDS-------------EYADFSQKEIFHLKG-- 185 (424) Q Consensus 135 ~r-~~~G~~~~l~~l~~~~v~~~~~~-----~~--------~~~~~~~~~-------------~~~~~~~~evih~r~-- 185 (424) +. +...-+.+|..|+|.+++..+.- ++ .+|.|..+. ....++.+-|.|... T Consensus 156 iid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL 235 (516) T protein:vir:10 156 IMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGL 235 (516) T ss_pred EecCccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccc Confidence 44 44456899999999999876531 11 123333211 223444444444431 Q ss_pred CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCC----cccC--- Q lcl|NC_019719. 186 FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGG----PVKK--- 257 (424) Q Consensus 186 ~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~----~~~g--- 257 (424) .+.++-.-+|-+..|.+.+.....++....-+----+.-+-|+..+-+.. 
+..+.+-++....++..- .+.| T Consensus 236 ~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ 315 (516) T protein:vir:10 236 MDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVK 315 (516) T ss_pred eeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeec Confidence 22233333777888877777766666655444333343445555554443 334455555555554421 0111 Q ss_pred ---cceec-----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHH- Q lcl|NC_019719. 258 ---RLWIL-----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL- 320 (424) Q Consensus 258 ---~~~~l-----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~- 320 (424) +.+.+ ..|.+++.|.. +..+++- ..+..+.+.++++||.+-|...++.+...+...+..+ T Consensus 316 ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRD 392 (516) T protein:vir:10 316 NQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDD---VRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRD 392 (516) T ss_pred cchhhhhhHhhhcccccCCCCccceeeccccCCcChHHH---HHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHH Confidence 11111 13455655543 3444444 4455888999999999999754443321111112222 Q ss_pred -----HHHHHHHHHHHHHHHHHHHhhcc-----Ccccccc--ceeeecc--hhh----hccC-HHHHHHHHHHHH--hCC Q lcl|NC_019719. 321 -----GFLQYTLQPYISRWENSIQRWLI-----PAKDVGR--IHAEHNL--DGL----LRGD-SASRAAFMKAMG--EAG 379 (424) Q Consensus 321 -----~~~~~tl~P~~~~ie~~l~~~l~-----~~~~~~~--~~~~fd~--~~l----~~~d-~~~~~~~~~~~~--~~g 379 (424) -|+..-=.-+...|.+.|...|+ ++.++.. ..+.|++ +.. .... ...|...++.+- -+. 
T Consensus 393 EiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGk 472 (516) T protein:vir:10 393 ELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGK 472 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 22222222344444455555543 4455432 3333333 222 1111 123444443332 234 Q ss_pred CCCHHHHHH-HhCCCCCC--CCCeeeecccccchhhcccc--CCCcccCC Q lcl|NC_019719. 380 LRTINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 380 ~~T~NE~R~-~~G~~p~~--~gd~~~~~~n~~~~~~~~~~--~~~~~~ga 424 (424) +++.+=+|+ .|.+.-.+ ..++... ....+. +.|++..- T Consensus 473 y~s~~yi~k~ILr~tDeei~~e~k~I~-------~E~~~~~~~~p~~~~~ 515 (516) T protein:vir:10 473 YVSHDYVMKNILQMTEEQIAQEEKQIE-------QEAGIKRFQNPENEDD 515 (516) T ss_pred ccchHHHHHHHhcCCHhhHHHHHHHHH-------HhhhCCCCCCCCcccc Confidence 667776665 34443210 0000000 000000 11111100 No 233 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=78.95 E-value=0.11 Score=25.85 Aligned_cols=402 Identities=10% Similarity=0.048 Sum_probs=173.8 Q ss_pred ccCCCCCchHHHHHhhccCc--ccCcccccccc-----c--cccccc-------ccCcc-----cc-cHHHHhhhHHHHH Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGG--RLVTPNQGSQT-----G--PVSAHG-------HLGDS-----SI-NDERILQISTVWR 65 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~--~~~~~~~~~~~-----~--~~~~~~-------~~~~~-----~~-~~~~~~~~~~v~~ 65 (424) +++-.+.-=. +........ +.+.|.....+ + .....+ ...+. -+ .-+..+.+|.|.. 
T Consensus 1 ~~~w~~~de~-~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~ 79 (511) T protein:vir:56 1 MKFWTKEEEQ-DIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVKELIKSYRALAEYHEVDD 79 (511) T ss_pred CCCccchhhh-hhhhhccCCcccccCCCCCCCceEEecccccceecceeccccccccCccchHHHHHHHHHHhhccchhh Confidence 1111110000 000001111 11122111110 0 000000 01111 11 1255667899999 Q ss_pred HHHHHHHhhccC-----ceEEEEeccc-CccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC Q lcl|NC_019719. 66 CVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA 139 (424) Q Consensus 66 ~i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~ 139 (424) ||+.|.+.+.-. |+.+--.+.+ +...+........++|+ --+-...++ .++..|...|..|..++-+.+ T Consensus 80 Av~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~~----~~fR~WYVDgRi~fHkiid~k 154 (511) T protein:vir:56 80 AIQEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVS-LLQMRKHGY----KWFRKWYVDSRIYFHKILDKD 154 (511) T ss_pred HHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhhhhcceEEEEEEeccc Confidence 999999987643 3333211111 01111111222333333 122223344 445666778999988877777 Q ss_pred CceeeEEeecCceEEEEEc-------C------CceEEEEEec--------------CceEEecHhHeeEeccC----CC Q lcl|NC_019719. 140 GDVISLLPLQSANMDVKLV-------G------KKVVYRYQRD--------------SEYADFSQKEIFHLKGF----GF 188 (424) Q Consensus 140 G~~~~l~~l~~~~v~~~~~-------~------~~~~~~~~~~--------------~~~~~~~~~evih~r~~----~~ 188 (424) --+.+|..|+|.+++..+. + -..+|.|... .....++.+.|.|...- +. 
T Consensus 155 ~GI~eLr~lDPr~i~~vr~i~~~~~~~~~v~~~~~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~daI~y~hSGL~d~~~ 234 (511) T protein:vir:56 155 NNIIELRPLNPMKMELVREIQKETIDGVEVVKGTLEYYVYKQSDYKMPSWMSATNRAQTSFRIPKDAIVFAHSGLMRGCA 234 (511) T ss_pred cceeehhhcCcccchhhhhhhcccccccccccceeeeeEecCCCcccCcccccccccccceeechhheeeecccceeccC Confidence 6799999999999887542 1 1113334321 13367888899776532 24 Q ss_pred CccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCC----------cccC Q lcl|NC_019719. 189 TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGG----------PVKK 257 (424) Q Consensus 189 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~----------~~~g 257 (424) +..+.+|-+..+.+.+.....++....-+----+.-+-|+..+-+.. +..+.+-++..+.++..- .+.. T Consensus 235 ~~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ddr 314 (511) T protein:vir:56 235 DDPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTT 314 (511) T ss_pred CCCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccch Confidence 55567888888888888777777666544333344445555555443 344455555555554321 0111 Q ss_pred cceec-----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCC-CCccchhHHHHH---- Q lcl|NC_019719. 258 RLWIL-----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-STSWGSGIEQQN---- 319 (424) Q Consensus 258 ~~~~l-----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~-~~~~~~n~e~~~---- 319 (424) +.+.+ ..|.+++.|.. +..+++- ..+..+.+.++++||.+-|...+. +..+.....+.. 
T Consensus 315 k~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEi 391 (511) T protein:vir:56 315 NAMSMLEDYYLPRREGSKGTEVSTLPGGQSLGDIED---VLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDEL 391 (511) T ss_pred hhhhhHhhhcccccCCCCccceeeccccCCcChHHH---HHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHH Confidence 11111 13455665543 3444444 445588899999999999974322 122211111111 Q ss_pred --HHHHHHHHHHHHHHHHHHHHhhcc-----Ccccccc--ceeeecc--hhh----hccC-HHHHHHHHHHHHh--CCCC Q lcl|NC_019719. 320 --LGFLQYTLQPYISRWENSIQRWLI-----PAKDVGR--IHAEHNL--DGL----LRGD-SASRAAFMKAMGE--AGLR 381 (424) Q Consensus 320 --~~~~~~tl~P~~~~ie~~l~~~l~-----~~~~~~~--~~~~fd~--~~l----~~~d-~~~~~~~~~~~~~--~g~~ 381 (424) .-|+..-=.-+...|.+.|...|+ ++.++.. ..+.|++ +.. .... ...|...++.+-. +-.+ T Consensus 392 KF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~ 471 (511) T protein:vir:56 392 KFTKFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYY 471 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhcccc Confidence 122222222344444555555543 4455432 3333333 222 1111 1123333222211 1234 Q ss_pred CHHHHHH-HhCCCCCC--CCCeeeecccccchhhccccCCCcccC Q lcl|NC_019719. 382 TINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 382 T~NE~R~-~~G~~p~~--~gd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) +.+=+|+ .|.+.-.+ ..++.... 
-.+.+-.+.++++= T Consensus 472 S~~yi~k~ILr~tDeei~~~~k~I~~-----E~k~~~~~~~e~~f 511 (511) T protein:vir:56 472 SHKYIQKNILRLSDDQITAMQSEIDE-----EETNPRFQQDDQGF 511 (511) T ss_pred chHHHHHHHhccCHHHHHHHHHHHHH-----hhcCCCCCCcccCC Confidence 5555543 23332100 00000000 00000001111111 No 234 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=78.23 E-value=0.12 Score=25.70 Aligned_cols=404 Identities=9% Similarity=0.076 Sum_probs=170.2 Q ss_pred ccCCCCCchHHHHHhh-----cc--CcccCccccccccccc---------ccc----cccCccc------c-cHHHHhhh Q lcl|NC_019719. 8 IDLRTNNGWWARLQSW-----FV--GGRLVTPNQGSQTGPV---------SAH----GHLGDSS------I-NDERILQI 60 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~-----~~--~~~~~~~~~~~~~~~~---------~~~----~~~~~~~------~-~~~~~~~~ 60 (424) |.+-.=-|+|.+.-.. .. ....+.|.....+..+ ++. -...+.. + +-+..+.+ T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYRQLTNN 80 (516) T ss_pred CCchHhcccccchhhHHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHHHHHHHhhhc Confidence 4433334554332211 11 1122222221111000 000 0001111 1 12556678 Q ss_pred HHHHHHHHHHHHhhccC-----ceEEEEecc-cCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEE Q lcl|NC_019719. 61 STVWRCVSLISTLTACL-----PLDVFETDQ-NDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV 134 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~-----~~~v~~~~~-~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~ 134 (424) |.|..||+.|.+.+.-. |+.+--.+. 
-....+........++|+ --+-...+++ ++..|...|..|..+ T Consensus 81 pEvd~Av~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~-ll~F~~~~~~----~fR~WYVDgRi~fhK 155 (516) T protein:vir:10 81 PEVERAVANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICR-LLDASRKLDT----LFRRWYIDSRIFFHK 155 (516) T ss_pred cchhHHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHH-HhccchhhhH----HHHhhhhcceEEEEE Confidence 99999999999987643 333311110 000000111122333333 1222233444 455666788888775 Q ss_pred ee-CCCCceeeEEeecCceEEEEEc-----CCc--------eEEEEEecC-------------ceEEecHhHeeEecc-- Q lcl|NC_019719. 135 DR-NSAGDVISLLPLQSANMDVKLV-----GKK--------VVYRYQRDS-------------EYADFSQKEIFHLKG-- 185 (424) Q Consensus 135 ~r-~~~G~~~~l~~l~~~~v~~~~~-----~~~--------~~~~~~~~~-------------~~~~~~~~evih~r~-- 185 (424) +. +...-+.+|..|+|.+++..+. .++ .+|.|..+. ....++.+-|.+... T Consensus 156 iid~~k~GI~elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~daI~y~hSGl 235 (516) T protein:vir:10 156 IMPNPKEGIVELRRLDPRHVEYYREIVTSDVGGTSVVKGYREFFVYTTGNEGYAYNGRLFEPNTRIKIPRSAIVYAHSGL 235 (516) T ss_pred EecCcccceeeeeeeCCcceeeEEeeecccCcchhhhhceeeeeeeecCccceeccccccCCCCceecchhheeeeecCc Confidence 44 4455689999999999987553 111 123332211 123333333333321 Q ss_pred CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCC----cccC--- Q lcl|NC_019719. 186 FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGG----PVKK--- 257 (424) Q Consensus 186 ~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~----~~~g--- 257 (424) ...++-.=+|-+..+.+.+.....++....-+----+.-+-|+..+-+.. 
+..+.+-++....++..- .+.| T Consensus 236 ~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev~ 315 (516) T protein:vir:10 236 QDCSDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTVK 315 (516) T ss_pred ccCCCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeec Confidence 11222122566777777776666666555444333343345555554443 334455555555554421 0111 Q ss_pred ---cceec-----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcc-chhHHH--- Q lcl|NC_019719. 258 ---RLWIL-----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSW-GSGIEQ--- 317 (424) Q Consensus 258 ---~~~~l-----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~-~~n~e~--- 317 (424) +.+.+ ..|.+++.|.. +..+++- ..+..+.+.++++||.+-|...++.+.. +.+.|= T Consensus 316 ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRD 392 (516) T protein:vir:10 316 NQKRNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGEMDD---VRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRD 392 (516) T ss_pred cchhhhhhHhhhcccccCCCcccceeeccccCCcChHHH---HHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHH Confidence 11111 13455665543 3444443 4455888999999999999754443321 112221 Q ss_pred --HHHHHHHHHHHHHHHHHHHHHHhhc-----cCcccccc--ceeeecc--hhh----hccC-HHHHHHHHHHHH--hCC Q lcl|NC_019719. 318 --QNLGFLQYTLQPYISRWENSIQRWL-----IPAKDVGR--IHAEHNL--DGL----LRGD-SASRAAFMKAMG--EAG 379 (424) Q Consensus 318 --~~~~~~~~tl~P~~~~ie~~l~~~l-----~~~~~~~~--~~~~fd~--~~l----~~~d-~~~~~~~~~~~~--~~g 379 (424) ...-|+..-=.-+...|.+.|...| +++.++.. ..+.|++ +.. .... ...|...++.+- -+. 
T Consensus 393 EiKF~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGk 472 (516) T protein:vir:10 393 ELDFRKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGK 472 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 1112222222234445556666644 35555532 2333333 222 1111 123444433332 234 Q ss_pred CCCHHHHHH-HhCCCCCC--CCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 380 LRTINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 380 ~~T~NE~R~-~~G~~p~~--~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) +++.+=+|+ .|.+.-.+ ..++.... . .+.+--++|++..- T Consensus 473 y~s~~yi~k~ILr~tDeei~~~~k~I~~----E-~~~~~~~~p~~e~~ 515 (516) T protein:vir:10 473 YVSHDYVMKNILQMTDEQIAQEEKQIEK----E-ANVKRFQNPENEDD 515 (516) T ss_pred ccchHHHHHHHhcCCHhHHHHHHHHHHH----h-hhCCCCCCCCcccc Confidence 667766665 34443210 00000000 0 00000011111111 No 235 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=77.77 E-value=0.12 Score=25.61 Aligned_cols=394 Identities=10% Similarity=-0.031 Sum_probs=153.1 Q ss_pred CCCCcccccCCCCC-------chHHHHHhhccCcccCcccccccccccccccccCcccc-cHHHHhhhHHHHHHHHHHHH Q lcl|NC_019719. 1 MEEPKYTIDLRTNN-------GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSI-NDERILQISTVWRCVSLIST 72 (424) Q Consensus 1 ~~~~~~~~~~~~~~-------G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~~i~~ia~ 72 (424) |. ++.++. .-|..++....|..........+...+.... 
.+.+.- .-+.+++.+..+.....+.+ T Consensus 1 m~------~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~-~~~e~~~~Y~~rl~rA~~~n~~~~t~~ 73 (501) T protein:vir:95 1 MP------NVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAED-QSKENKARYEAYLKRAVFYNVARRTLF 73 (501) T ss_pred CC------CCCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCC-CcccchHHHHHHhhccccCchHHHHHH Confidence 32 144443 4455555555443211111111111111110 111100 01223333333333333333 Q ss_pred hhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCc----------- Q lcl|NC_019719. 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGD----------- 141 (424) Q Consensus 73 ~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~----------- 141 (424) .+.++.|.-- . .......+..++..---...+-.+|.+.++...+.+|-+++++.....+. T Consensus 74 ~l~G~vf~k~---p-----~~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~ 145 (501) T protein:vir:95 74 GLVGQVFMRD---P-----VVKVPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEA 145 (501) T ss_pred HHhhhhhcCC---c-----ceeCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHh Confidence 3333333210 0 00112334455544444456899999999999999999999997643321 Q ss_pred ---eeeEEeecCceEEEE----------------------EcCC-----------------c-eEEEEE-ecCce----E Q lcl|NC_019719. 142 ---VISLLPLQSANMDVK----------------------LVGK-----------------K-VVYRYQ-RDSEY----A 173 (424) Q Consensus 142 ---~~~l~~l~~~~v~~~----------------------~~~~-----------------~-~~~~~~-~~~~~----~ 173 (424) -..+..+.|..|.=. .++. + ..+++. ..... . T Consensus 146 ~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~ 225 (501) T protein:vir:95 146 GRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGS 225 (501) T ss_pred ccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcc Confidence 023444444322110 1110 0 001111 00000 0 Q ss_pred EecHh------------------He---eEeccCCCCccccCchHHHHHHH-HHHHHHHHHHHHHHHhccCCCceeEEcC Q lcl|NC_019719. 
174 DFSQK------------------EI---FHLKGFGFTGLVGLSPIAFACKS-AGVAVAMEDQQRDFFANGAKSPQILSTG 231 (424) Q Consensus 174 ~~~~~------------------ev---ih~r~~~~~~~~G~s~~~~~~~~-i~~~~~~~~~~~~~~~n~~~p~~vl~~~ 231 (424) .+... .. +.+-..+.+...|.+|+..++.. +........+.. .+...+.|-.+++-. T Consensus 226 ~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~-~l~~~~~P~l~i~G~ 304 (501) T protein:vir:95 226 KIPKGNYQQYVVYKPTDAQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASLNMAHYRNSADYEE-SCYIVGQPTPVLIGL 304 (501) T ss_pred eecCCcccccceeeeeccCCCcCCeeeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhHHHH-HHHHcccceeeeeCC Confidence 00000 00 11111111223567777766432 222333333333 344556677666522 Q ss_pred CCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcc Q lcl|NC_019719. 232 EKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSW 311 (424) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~ 311 (424) .. +..+.... -.-..| .+ ..+.+|.|.++.-+..++.-+. .+.++...+++.. .| ..++..... +.+ T Consensus 305 ~~----~~~~~~~~-~~i~~G-~~--~~~~lP~~~~~~~ie~~~~~i~-~~~l~~l~~~m~~-~G--a~ll~~~~~-~~T 371 (501) T protein:vir:95 305 TE----EWVTNVLK-GSVNFG-SR--GGIPLPVGADAKLLQASENTML-KEAMDTKERQMVA-LG--AKLVEQKEV-QRT 371 (501) T ss_pred cc----cccccCCC-Cceeec-cc--ccccCCCCCceeEEecChhhHH-HHHHHHHHHHHHH-HH--HhhccCCcc-chh Confidence 21 11000000 001122 22 3567776665554444443332 2333333333322 23 233332211 111 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccc-----cccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHH Q lcl|NC_019719. 312 GSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKD-----VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEM 386 (424) Q Consensus 312 ~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~-----~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~ 386 (424) .........-.+..|.-++.++|+.|++-|---.. .....++.+.+=..........+.+.+++..|.++..+. 
T Consensus 372 -a~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~w~g~~~~~~~v~i~~df~~~~~~~~~~~al~~~~~~G~is~~t~ 450 (501) T protein:vir:95 372 -ATEAELEAASEGSTLSSATKNVSAAFEWALKWAARWVGQADSGVKFELNTDFDIARMTPDERRSLVEEWQKGAITFEEM 450 (501) T ss_pred -HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccccccCCHHHHHHHHHHHhCCCCcHHHH Confidence 12223333445667778888888888764421101 011234444332222223444566778899999999999 Q ss_pred HHHh---CCCCCC--CCCeee--ecccccchhhccccCCCcccCC Q lcl|NC_019719. 387 RRTD---NLPPLP--GGDVAM--RQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 387 R~~~---G~~p~~--~gd~~~--~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++.+ |+.+.+ +-++.. ..-+..+.+........+.+|. T Consensus 451 ~~~L~~~~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~ 495 (501) T protein:vir:95 451 RTGLRKAGVATEDDSKAKEKIAKDTAEAMALATPANVPGDGSGGD 495 (501) T ss_pred HHHHHhCCCCChhHHHHHHHHHhhhcCcccccccCCCCCCCcccc Confidence 8765 444321 000000 0001111111111111111111 No 236 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=77.75 E-value=0.12 Score=25.60 Aligned_cols=381 Identities=9% Similarity=0.041 Sum_probs=164.9 Q ss_pred ccCCCCCchHHHHHhhccCc--------------ccCcccccccc-----cccccccc--------cC---ccc------ Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGG--------------RLVTPNQGSQT-----GPVSAHGH--------LG---DSS------ 51 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~--------------~~~~~~~~~~~-----~~~~~~~~--------~~---~~~------ 51 (424) |.| + +++ |++++.+. +.+.|.....+ ....+..+ +. +.. 
T Consensus 1 m~f-~---~~~-lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eL 75 (523) T protein:vir:68 1 MKF-N---ILS-LFAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTREL 75 (523) T ss_pred CCC-c---hhh-hhhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHH Confidence 777 2 222 22222221 11222221110 00001000 11 111 Q ss_pred c-cHHHHhhhHHHHHHHHHHHHhhccC-----ceEEEEecc-cCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHH Q lcl|NC_019719. 52 I-NDERILQISTVWRCVSLISTLTACL-----PLDVFETDQ-NDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQL 124 (424) Q Consensus 52 ~-~~~~~~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~-~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~ 124 (424) + .-+..+.+|.|..||+.|.+.+.-. |+.+--... -+...+........++|+ --+-...++ .++..| T Consensus 76 I~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~~----~~fR~W 150 (523) T protein:vir:68 76 IDTYRNLMTNYEVDNAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLN-HLSFQRKGS----DHFRRW 150 (523) T ss_pred HHHHHHHhhccchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhhh----HHHHhh Confidence 1 1245567899999999999987643 222211111 011111111223333343 122223344 445666 Q ss_pred HHcCCeEEEEeeCCC---CceeeEEeecCceEEEEEc-----CCc--------eEEEEEecC-------------ceEEe Q lcl|NC_019719. 125 CFYGNAYALVDRNSA---GDVISLLPLQSANMDVKLV-----GKK--------VVYRYQRDS-------------EYADF 175 (424) Q Consensus 125 l~~G~a~~~~~r~~~---G~~~~l~~l~~~~v~~~~~-----~~~--------~~~~~~~~~-------------~~~~~ 175 (424) ...|..|..++-+.. .-+.+|..|+|.+|+..+. ..+ .+|.|.... ....+ T Consensus 151 YVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~g~~vi~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI 230 (523) T protein:vir:68 151 YVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREVITTTEAGVKIVKGYKEYFIYDTSHESYACDGRIYEAGTKIKI 230 (523) T ss_pred eeeeEEEEEEEeeCCCccccceeeeeeCCcceeEEEeecCCCCcchhhhhhhhhheeeccccccccccccccCCCcceec Confidence 778998888766533 3588999999999976431 111 123333211 23344 Q ss_pred cHhHeeEecc--CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhC Q lcl|NC_019719. 
176 SQKEIFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAG 252 (424) Q Consensus 176 ~~~evih~r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~ 252 (424) +.+-|.|... .+.++-.-+|-+..+.+.+.....++....-+----+.-+-|+..+.+.. +..+.+-++..+.++.. T Consensus 231 ~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kN 310 (523) T protein:vir:68 231 PKAAIVYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKN 310 (523) T ss_pred chhheeeeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcc Confidence 4555544431 12233344677777777777666666555444333343445555555443 33445555555555432 Q ss_pred C----------cccCcceec-----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCC-C Q lcl|NC_019719. 253 G----------PVKKRLWIL-----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-S 308 (424) Q Consensus 253 ~----------~~~g~~~~l-----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~-~ 308 (424) - .+..+.+.+ ..|.+++.|.. +..+++- ..+..+.+.++++||.+-|....+ - T Consensus 311 KlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kkLy~aLnVP~sRl~~~~~~f 387 (523) T protein:vir:68 311 RIAYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMED---VRWFRNALYMALRIPITRIPSDQGGI 387 (523) T ss_pred eeEEeccCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHH---HHHHHHHHHHHhCCcceeecCCCcce Confidence 1 011111111 13455665543 3444444 445588899999999998854322 1 Q ss_pred Cccchh----HHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----Ccccccc--ceeeecc--hhh----hccC-HHHHHH Q lcl|NC_019719. 309 TSWGSG----IEQQNLGFLQYTLQPYISRWENSIQRWLI-----PAKDVGR--IHAEHNL--DGL----LRGD-SASRAA 370 (424) Q Consensus 309 ~~~~~n----~e~~~~~~~~~tl~P~~~~ie~~l~~~l~-----~~~~~~~--~~~~fd~--~~l----~~~d-~~~~~~ 370 (424) +...++ -|=...-|+..-=.-+...|.+.|...|+ ++.++.. ..+.|++ +.. .... ...|.. 
T Consensus 388 ~~Gr~~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~ 467 (523) T protein:vir:68 388 QFDAGTSITRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRIN 467 (523) T ss_pred ecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHH Confidence 211111 11111122222222344444455555543 4455432 3333333 222 1111 112333 Q ss_pred HHHHHHh--CCCCCHHHHHH-HhCCC----------------------CCCCCCee Q lcl|NC_019719. 371 FMKAMGE--AGLRTINEMRR-TDNLP----------------------PLPGGDVA 401 (424) Q Consensus 371 ~~~~~~~--~g~~T~NE~R~-~~G~~----------------------p~~~gd~~ 401 (424) .++.+-. +-.++.+=+|+ .|.+. |.+..+.+ T Consensus 468 ~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~e~~~f 523 (523) T protein:vir:68 468 MLQMAEPFIGKYISHRTAMKDILQMSDEEIEQEAKQIEEESKEARFQDPDQEQEDF 523 (523) T ss_pred HHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 3332221 11334444433 22221 11112222 No 237 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=74.76 E-value=0.15 Score=25.03 Aligned_cols=352 Identities=14% Similarity=0.070 Sum_probs=146.4 Q ss_pred ccCCCCCchHHHHHhhccCcccCccccc---cccccc----ccccccCcccccH--HHHhhhHHHHHHHHHHHHhhcc-- Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQG---SQTGPV----SAHGHLGDSSIND--ERILQISTVWRCVSLISTLTAC-- 76 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~---~~~~~~----~~~~~~~~~~~~~--~~~~~~~~v~~~i~~ia~~ia~-- 76 (424) |+.++=..-|++|++. +..-..... .+..|. .......+. 
.++ ..-+=.++-..|++.+|+.+-+ T Consensus 1 ~~~~~l~~r~~~l~~~---R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~-~~~~~~~~i~dst~~~a~~~Las~L~~~l 76 (547) T protein:vir:10 1 MENSKIVKRLDFLKTD---RKNVEQIWDCIRKYIMPMRSDFFSDLRSEGS-INWNQNREVFDSTAGDGLETLSSSLHGSL 76 (547) T ss_pred CCHHHHHHHHHHHHHH---hhHHHHHHHHHHHHhcccccccccCCCCCcc-cccccccccccchHHHHHHHHHHHHHHhh Confidence 5554433333333221 100000000 000010 000000000 000 0001124445566666666543 Q ss_pred ----CceEEEEecccCcccccc-------ccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-Cceee Q lcl|NC_019719. 77 ----LPLDVFETDQNDNRKKVD-------LSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-GDVIS 144 (424) Q Consensus 77 ----~~~~v~~~~~~~~~~~~~-------~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~-G~~~~ 144 (424) -||.=....+....+... ..+.+...|. +-| .+.-+..++.+++.+|++.+++..+.+ ..... T Consensus 77 tPp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~-~sn----f~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r 151 (547) T protein:vir:10 77 TSPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQ-DSN----FNLEANETYIDLCGYGNAIMVEEEDEDEEGSVV 151 (547) T ss_pred cCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHH-hcC----cHHHHHHHHHHHHhHCcEeEEeccCCCCCCcee Confidence 244222111111000000 0111222332 333 444466778999999999999876542 22333 Q ss_pred EEeecCceEEEEEcCCceE---EE-------------------------------------------EEec-C------- Q lcl|NC_019719. 145 LLPLQSANMDVKLVGKKVV---YR-------------------------------------------YQRD-S------- 170 (424) Q Consensus 145 l~~l~~~~v~~~~~~~~~~---~~-------------------------------------------~~~~-~------- 170 (424) +..++...+.+..|..+.. |+ +... . T Consensus 152 ~~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~ 231 (547) T protein:vir:10 152 FQSSPIQDSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAG 231 (547) T ss_pred EEEeecceEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCcccc Confidence 4333334433333332211 00 0000 0 Q ss_pred ----------ceEEecHh---H-----------eeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCc Q lcl|NC_019719. 
171 ----------EYADFSQK---E-----------IFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSP 225 (424) Q Consensus 171 ----------~~~~~~~~---e-----------vih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~ 225 (424) ..+.+..+ . .+..|+...++ .||.||...+...+.....+.+.......-...|. T Consensus 232 ~~~~~~~~p~~s~~~e~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp 311 (547) T protein:vir:10 232 TVLAPTERPFGKKWILKEGAVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPA 311 (547) T ss_pred ceeeccccceeEEEEEecCceeeeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Confidence 00011111 1 22333333344 89999999999999999999998888888877787 Q ss_pred eeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHH-HHHHHHHHHHHHHHhCCCHHHhcC Q lcl|NC_019719. 226 QILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEM-MASRKFQVSELARFFGVPPHLVGD 304 (424) Q Consensus 226 ~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~~-~e~~~~~~~~Ia~~fgVP~~~l~~ 304 (424) .++. +.+...+ + + .+ .|++.+....-.++++...+ +.+. .+..+.....|-.+|=+....+-. T Consensus 312 ~~v~-~~g~~~~-----~----~--~~---pgg~~~~~~~~~v~pl~~~~-~~~~~~~~i~~~~~rI~~af~~d~~~~~~ 375 (547) T protein:vir:10 312 IMVT-ERGLISD-----I----D--LG---ASGLTVVRDMESMKPFESRA-RFDVSSIQLTDLRSAVRRIYYVDQLQMKD 375 (547) T ss_pred eecc-ccccccc-----c----e--ec---CCeeeecCCcccceeeeccc-chHHHHHHHHHHHHHHHHHhhhhhhhcCC Confidence 6543 2222221 1 1 11 24555656555677776553 4433 356667788888888776533322 Q ss_pred CCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHH Q lcl|NC_019719. 305 VEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTIN 384 (424) Q Consensus 305 ~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~N 384 (424) +..-++.--..+..-....|.|....++++|-.-|+.. 
.+..+.+.|.+ T Consensus 376 ---~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r-------------------------~~~il~r~g~l--- 424 (547) T protein:vir:10 376 ---SPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQR-------------------------TFNIRFRAGKL--- 424 (547) T ss_pred ---CccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH-------------------------HHHHHHhcCCC--- Confidence 21121122234446666777788888877764333210 01112222332 Q ss_pred HHHHHhCCCCCCC------CCeeeecccccchhhccc-------------------------cCCCcccCC Q lcl|NC_019719. 385 EMRRTDNLPPLPG------GDVAMRQSQYVPITDLGT-------------------------NKEPRNNGA 424 (424) Q Consensus 385 E~R~~~G~~p~~~------gd~~~~~~n~~~~~~~~~-------------------------~~~~~~~ga 424 (424) ||+|. +..+-+. -..++....+ ..-+-+... T Consensus 425 --------P~~p~~l~~~~~~~~~v~-~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~ 486 (547) T protein:vir:10 425 --------GELPSKLLESGKAAMDIV-YTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMV 486 (547) T ss_pred --------CCCchhhhccCcceEEEE-eccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHH Confidence 11110 1000000 0001100000 000000000 No 238 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=71.71 E-value=0.19 Score=24.51 Aligned_cols=402 Identities=10% Similarity=0.043 Sum_probs=167.2 Q ss_pred CCCCCchHHHHHhhcc----------CcccCcccccccc--------cccccccccCcccc--------------cHHHH Q lcl|NC_019719. 10 LRTNNGWWARLQSWFV----------GGRLVTPNQGSQT--------GPVSAHGHLGDSSI--------------NDERI 57 (424) Q Consensus 10 ~~~~~G~~~~l~~~~~----------~~~~~~~~~~~~~--------~~~~~~~~~~~~~~--------------~~~~~ 57 (424) .-...-+|..+.++-- ....+.|.....+ .+....+...+..+ .-+.. 
T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~m 80 (521) T protein:vir:81 1 MFSRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGL 80 (521) T ss_pred CcchhhhhHhhcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHHHH Confidence 1111112222211100 0011122211111 01111111111111 12445 Q ss_pred hhhHHHHHHHHHHHHhhccC-----ceEEEEecc-cCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeE Q lcl|NC_019719. 58 LQISTVWRCVSLISTLTACL-----PLDVFETDQ-NDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAY 131 (424) Q Consensus 58 ~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~-~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~ 131 (424) +.+|.|..||+.|.+.+.-. |+.+--.+. -+...+........++|+ --|-...++ .++..|...|..| T Consensus 81 a~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~~----~~fR~WYVDgRi~ 155 (521) T protein:vir:81 81 MNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLN-TIQFDRRGQ----DMFRRWYVDSRIF 155 (521) T ss_pred hhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhhhhcceEE Confidence 67899999999999987643 333221111 011111111222333333 122222333 4456677889999 Q ss_pred EEEeeCCC--CceeeEEeecCceEEEEEcC------Cc-------eEEEEEecC-------------ceEEecHhHeeEe Q lcl|NC_019719. 132 ALVDRNSA--GDVISLLPLQSANMDVKLVG------KK-------VVYRYQRDS-------------EYADFSQKEIFHL 183 (424) Q Consensus 132 ~~~~r~~~--G~~~~l~~l~~~~v~~~~~~------~~-------~~~~~~~~~-------------~~~~~~~~evih~ 183 (424) ..++-+.+ .-+.+|..|+|.+++..+.. +. .+|.|..+. ....++.+=|.+. T Consensus 156 fhkiid~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~ 235 (521) T protein:vir:81 156 FHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYA 235 (521) T ss_pred EEEEEcCCccccceeeeeeCCcceeeeeeecccccCccceecceeeeeeeecCCccccccceeecCCcceeechhheeee Confidence 98885433 46899999999999876531 11 123333221 1233333333333 Q ss_pred cc--CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhC-------- Q lcl|NC_019719. 
184 KG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAG-------- 252 (424) Q Consensus 184 r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~-------- 252 (424) .. ...++-.-+|-+..|.+.+.....++....-+----+.-+-|+..+-+.. +..+.+-++....++.. T Consensus 236 hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~T 315 (521) T protein:vir:81 236 HSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDAST 315 (521) T ss_pred eccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccc Confidence 21 12233334677777777777766666665544333344445666555543 44555566666666553 Q ss_pred Cc--ccCccee-c----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHH Q lcl|NC_019719. 253 GP--VKKRLWI-L----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ 317 (424) Q Consensus 253 ~~--~~g~~~~-l----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~ 317 (424) |. +..+.+. + ..|.+++.|.. +..+++- ..+..+.+.++++||.+-|+..+.+..+.....+ T Consensus 316 Gev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~E 392 (521) T protein:vir:81 316 GKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDD---IRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSE 392 (521) T ss_pred ccccccccccchhhhhcccccCCCcccceeecccCCCCChHHH---HHHHHHHHHHHhCCccccccCCCCcceeccccch Confidence 11 1111222 2 13556666543 4445544 4455888999999999999543332221111111 Q ss_pred H------HHHHHHHHHHHHHHHHHHHHHhhcc-----Cccccc--cceeeec--chhh----hccC-HHHHHHHHHHHHh Q lcl|NC_019719. 318 Q------NLGFLQYTLQPYISRWENSIQRWLI-----PAKDVG--RIHAEHN--LDGL----LRGD-SASRAAFMKAMGE 377 (424) Q Consensus 318 ~------~~~~~~~tl~P~~~~ie~~l~~~l~-----~~~~~~--~~~~~fd--~~~l----~~~d-~~~~~~~~~~~~~ 377 (424) . ..-|+..-=.-+...|.+.|...|+ ++.++. ...+.|+ .+.. .... ...|...++.+-. 
T Consensus 393 ItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dp 472 (521) T protein:vir:81 393 ITRDELEFSKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITP 472 (521) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhh Confidence 1 1222222222344444555555543 445443 2233333 3222 1111 1223333333221 Q ss_pred --CCCCCHHHHHH-HhCCCCCC--CCCeeeecccccchhhccccCCCc-ccCC Q lcl|NC_019719. 378 --AGLRTINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTNKEPR-NNGA 424 (424) Q Consensus 378 --~g~~T~NE~R~-~~G~~p~~--~gd~~~~~~n~~~~~~~~~~~~~~-~~ga 424 (424) +-.++.+=+|+ .|.+.-.+ ..++.... .. +.+-.++++ +-+. T Consensus 473 yvGky~s~dyi~k~ILr~tDeei~~~~k~I~~----E~-~~~~~~~p~~~~~~ 520 (521) T protein:vir:81 473 YIGKYFSNQTVMRDILKYTDDQMDTEKKQIEE----EA-NDPRFKQTPDEIED 520 (521) T ss_pred hhccccchHHHHHHHhccCHHHHHHHHHHHHH----Hh-hCCCCCCCcccccC Confidence 11345544443 23332100 00000000 00 000000000 0000 No 239 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=66.65 E-value=0.26 Score=23.75 Aligned_cols=363 Identities=8% Similarity=-0.036 Sum_probs=152.3 Q ss_pred ccCcccCc---ccccccccccccccccCccc-----------------ccHHHHhhhHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_019719. 24 FVGGRLVT---PNQGSQTGPVSAHGHLGDSS-----------------INDERILQISTVWRCVSLISTLTACLPLDVFE 83 (424) Q Consensus 24 ~~~~~~~~---~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~ 83 (424) +....... ...............+.|.. ..+..=+.++...-+|+..+.-+-+-|+.+. 
T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~- 79 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLFD- 79 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheecccceee- Confidence 11100000 00000000000000111110 0001112244555677777777766676542 Q ss_pred ecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC-------ceeeEEeecCceEEEE Q lcl|NC_019719. 84 TDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG-------DVISLLPLQSANMDVK 156 (424) Q Consensus 84 ~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G-------~~~~l~~l~~~~v~~~ 156 (424) ...+.. . ..+.+-+. . | ........+..+.+.+|.||.++-++.+. ....+..++|..+.+. T Consensus 80 ~~~~~~-----~-~~~~~~~~-~-n---~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~v 148 (451) T protein:vir:10 80 IDNNKE-----L-NEKVTDVL-G-N---EFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPI 148 (451) T ss_pred cCCcHH-----H-HHHHHHHh-c-c---CHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEE Confidence 111110 0 01122121 1 2 34566677888899999999988777642 1234667788888776 Q ss_pred EcCCc---e-----EEEEEec--C--------ceEEecHhHeeEeccCC-------------------------CCcccc Q lcl|NC_019719. 157 LVGKK---V-----VYRYQRD--S--------EYADFSQKEIFHLKGFG-------------------------FTGLVG 193 (424) Q Consensus 157 ~~~~~---~-----~~~~~~~--~--------~~~~~~~~evih~r~~~-------------------------~~~~~G 193 (424) .++.. . +|....+ + ....+.++.+.+.+... .+...| T Consensus 149 ydd~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~ 228 (451) T protein:vir:10 149 YRNGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVEFSNNIKK 228 (451) T ss_pred EcCCCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEEeccCCCC Confidence 65321 1 1110000 0 00122344444332100 012246 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-----CCcee Q lcl|NC_019719. 
194 LSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-----AGFST 268 (424) Q Consensus 194 ~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~-----~g~~~ 268 (424) .|.+......++....+..-..+.+...+.|-.+++--.+...++....+ . ..+++.++ .|.+. T Consensus 229 ~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~-------~----~~~~i~~~~~~~~~~~~~ 297 (451) T protein:vir:10 229 QSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKEL-------K----RYKTIKTETDSEGDSGGL 297 (451) T ss_pred CCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHHH-------h----hCCeEEecCcCCccCCcc Confidence 66666666666666555544555555555566555432222122211111 1 11233332 12233 Q ss_pred eecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHH----------HHHHHHHHHHHHHHHHH Q lcl|NC_019719. 269 SAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG----------FLQYTLQPYISRWENSI 338 (424) Q Consensus 269 ~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~----------~~~~tl~P~~~~ie~~l 338 (424) +-+........+.+..+...+.|...-++|. +....-++.|+....-.... .+...+.-.++.+...+ T Consensus 298 ~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~ 375 (451) T protein:vir:10 298 KTMQIEIPTEARKIILEILKKQIYESGQGLQ--QDTENFGNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFL 375 (451) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCccc--ccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 3333333334456678888889999999984 22222223332222211111 11222222222222221 Q ss_pred HhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccc---ccchhhccc Q lcl|NC_019719. 339 QRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQ---YVPITDLGT 415 (424) Q Consensus 339 ~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~G~~p~~~gd~~~~~~n---~~~~~~~~~ 415 (424) ...+... +.+.++.-+..|..+.++.+.++. |+++..-+.++++.-..+. +...-.. 
-.......+ T Consensus 376 -----~~~d~~~--i~i~f~~~~p~n~~e~~~~~~kl~--g~iS~et~~~~~p~v~d~~--~e~~~~~ee~~~~~~~~~~ 444 (451) T protein:vir:10 376 -----GVTDYKK--IQQTYTRNMMSNDLEDADIATKSV--GIIPTKIILRHHPWVDDVE--EAEKLYLEEKKIQASKVSD 444 (451) T ss_pred -----CCCCccc--eeEEecCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHH--HHHHHHHHHHHHHHHHHHh Confidence 2222222 333445556788889999999985 7898888888776533211 0000000 000000000 Q ss_pred cCCCccc Q lcl|NC_019719. 416 NKEPRNN 422 (424) Q Consensus 416 ~~~~~~~ 422 (424) .-.+-++ T Consensus 445 ~~~~~~~ 451 (451) T protein:vir:10 445 DYNNFTE 451 (451) T ss_pred hcCCCCC Confidence 0011111 No 240 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=65.35 E-value=0.28 Score=23.57 Aligned_cols=393 Identities=14% Similarity=0.102 Sum_probs=145.1 Q ss_pred CCCCcc------------------------cc-cCCCCC-------chHHHHHhhccCcccCcccccccccccccccc-- Q lcl|NC_019719. 1 MEEPKY------------------------TI-DLRTNN-------GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGH-- 46 (424) Q Consensus 1 ~~~~~~------------------------~~-~~~~~~-------G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-- 46 (424) |....- .| ++.++. .-|..++....|..........+...+..... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~ 80 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLIPPQAPPTSGLGPSLPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDE 80 (535) T ss_pred CCcchhhhhhhhhhhcccCCCCcCCCCCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCc Confidence 111111 11 132322 33444444443321100000001000000000 Q ss_pred cCcccccHHHHhhhH----HHHHHHHHHHHhhccCceEEEEecccCccccccccchhhhhhccCCCCCCCHHHHHHHHHH Q lcl|NC_019719. 47 LGDSSINDERILQIS----TVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTM 122 (424) Q Consensus 47 ~~~~~~~~~~~~~~~----~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~ 122 (424) -+... 
-+.+++.+ ++...++.++..+-+-|..+ .....+..++..---...+-.+|.+.++. T Consensus 81 E~~~~--Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~------------~~p~~l~~l~~d~D~~G~~L~~f~~~~~~ 146 (535) T protein:vir:80 81 EQRRR--YETYLQRAIFYNVTARTLDGMMGQVFSRDPIR------------QLPPALEAIVEDIDGEGVSLDQQAKKALG 146 (535) T ss_pred CCHHH--HHHHHhhccCCChhHHHHHHHhchhhcCCcce------------eccHHHHHHHhccCCCCCCHHHHHHHHHH Confidence 00001 12223333 33344444444433333221 11233455554444445689999999999 Q ss_pred HHHHcCCeEEEEeeCCCCce------------eeEEeecCceEEE----------------------EEcCC---ceE-- Q lcl|NC_019719. 123 QLCFYGNAYALVDRNSAGDV------------ISLLPLQSANMDV----------------------KLVGK---KVV-- 163 (424) Q Consensus 123 ~~l~~G~a~~~~~r~~~G~~------------~~l~~l~~~~v~~----------------------~~~~~---~~~-- 163 (424) ..+.+|-+++++.....|.. ..+..+.|..|.= ..+++ ... T Consensus 147 ~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q 226 (535) T protein:vir:80 147 YTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQ 226 (535) T ss_pred HHHhcCeEEEEEeecCCCCcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecCCCcccceeEE Confidence 99999999999986555432 2233333222211 11111 000 Q ss_pred EEE--------------EecCc-------eEEecHh------Hee---EeccCCCCccccCchHHHHHHH-HHHHHHHHH Q lcl|NC_019719. 164 YRY--------------QRDSE-------YADFSQK------EIF---HLKGFGFTGLVGLSPIAFACKS-AGVAVAMED 212 (424) Q Consensus 164 ~~~--------------~~~~~-------~~~~~~~------evi---h~r~~~~~~~~G~s~~~~~~~~-i~~~~~~~~ 212 (424) |+. ...+. ...++.+ ..| .+-..+.+...|.+|+..++.. +........ 
T Consensus 227 ~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd 306 (535) T protein:vir:80 227 WRVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNGNPFKEIPFQFIGPLDNNADIDHPPLLDLCEVNIGHYRNSAD 306 (535) T ss_pred EEEEEecCCceEEEEEEEeecCCccccccceeecccCCCcccCeeEEEEeecCCCCCCCCccchHHHHHHHHHHhhchhH Confidence 110 00000 0011111 111 1111112334677777655433 333333333 Q ss_pred HHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCcee--eecccChhHHHHHHHHHHHHHH Q lcl|NC_019719. 213 QQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFST--SAIGVTPQDAEMMASRKFQVSE 290 (424) Q Consensus 213 ~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~--~~l~~~~~d~~~~e~~~~~~~~ 290 (424) +. ..+...+.|-.+++-....+.+.. .+...-..|+ ...+.+|.+.++ .+++.+.-..+.++.++..... T Consensus 307 ~~-~il~~~~~P~l~i~G~~~~~~~~~----~~~~~i~iG~---~~~~~lP~~~~~~~~e~~~~~~a~~~l~~~e~qM~~ 378 (535) T protein:vir:80 307 YE-EMAFVAGQPTAFFTGLTKDWVEDV----FKDFKVHLGS---RAIIPLPQGATAGILQITPNSVPFEAMTHKESQMIA 378 (535) T ss_pred HH-HHHHHhcCceeeeecCchhhhhcC----CCCcceEecC---cccccCCCCCCcceeeeccchhHHHHHHHHHHHHHH Confidence 33 333455667777653222211100 0000012232 235667766544 4444333333333333222222 Q ss_pred HHHHhCCCHHHhcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc-------ccceeeecchhh-hc Q lcl|NC_019719. 291 LARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV-------GRIHAEHNLDGL-LR 362 (424) Q Consensus 291 Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~-------~~~~~~fd~~~l-~~ 362 (424) +...+ +.....+..+ ........=.+..|.-++..+|+.|++-|---..+ ....+..+.+=. .. 
T Consensus 379 lGa~l------l~~~~~~~Ta--~~a~~~~~~~~S~L~~~a~~le~al~~aL~~~A~w~G~~~~~~~~~i~~n~dF~~~~ 450 (535) T protein:vir:80 379 MGANL------LVKSGGNRTF--GEAQQEEASEQSILSACTKNVSMAFRKALRWANQFQTGIVNDETVEYNLNTDFPAAR 450 (535) T ss_pred HHHHh------hccCcccccH--HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCceEEEecccccccc Confidence 22222 2111111111 11111222234566677788888777543211111 112233322211 22 Q ss_pred cCHHHHHHHHHHHHhCCCCCHHHHHHHh---CCC-C-CCCC-----------CeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 363 GDSASRAAFMKAMGEAGLRTINEMRRTD---NLP-P-LPGG-----------DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 363 ~d~~~~~~~~~~~~~~g~~T~NE~R~~~---G~~-p-~~~g-----------d~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) .|.+ ....+.++++.|.++....++.+ |.- | +++- +.....+-.......+++..+.+||- T Consensus 451 ld~~-~~~all~~~~~G~Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~~~~ 527 (535) T protein:vir:80 451 LTPN-ERAELILEWQQGAITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAKLNNGN 527 (535) T ss_pred CCHH-HHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCcccCCc Confidence 2444 45556677888888877777655 331 1 1111 11111111111111222222222222 No 241 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=60.19 E-value=0.38 Score=22.90 Aligned_cols=402 Identities=10% Similarity=0.048 Sum_probs=167.7 Q ss_pred CCCCCchHHHHHhh--------ccC--cccCccccccccccc--------ccccccCcccc--------------cHHHH Q lcl|NC_019719. 10 LRTNNGWWARLQSW--------FVG--GRLVTPNQGSQTGPV--------SAHGHLGDSSI--------------NDERI 57 (424) Q Consensus 10 ~~~~~G~~~~l~~~--------~~~--~~~~~~~~~~~~~~~--------~~~~~~~~~~~--------------~~~~~ 57 (424) .-.+.-+|+++.+. ... ...+.|.....+..+ ...+...+..+ .-+.. 
T Consensus 1 ~~~~l~~~~~~~~~d~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~m 80 (521) T protein:vir:65 1 MFSRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGL 80 (521) T ss_pred CccchhhhhhccCchhhHHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHH Confidence 11111111111110 000 011122211111111 11111111111 12445 Q ss_pred hhhHHHHHHHHHHHHhhccC-----ceEEEEecc-cCccccccccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeE Q lcl|NC_019719. 58 LQISTVWRCVSLISTLTACL-----PLDVFETDQ-NDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAY 131 (424) Q Consensus 58 ~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~-~~~~~~~~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~ 131 (424) +.+|.|..||+.|.+.+.-. |+.+--.+. -+...+........++|+ --|-...++ .++..|...|..| T Consensus 81 a~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~~----~~fR~WYVDgRi~ 155 (521) T protein:vir:65 81 MNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLN-TIQFDRRGQ----DMFRRWYVDSRIF 155 (521) T ss_pred hhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhhhhcceeE Confidence 67899999999999987643 333211111 011111111222333333 122223343 4456667889999 Q ss_pred EEEeeCC--CCceeeEEeecCceEEEEEcC------Cc-------eEEEEEecC-------------ceEEecHhHeeEe Q lcl|NC_019719. 132 ALVDRNS--AGDVISLLPLQSANMDVKLVG------KK-------VVYRYQRDS-------------EYADFSQKEIFHL 183 (424) Q Consensus 132 ~~~~r~~--~G~~~~l~~l~~~~v~~~~~~------~~-------~~~~~~~~~-------------~~~~~~~~evih~ 183 (424) ..++-+. ..-+.+|..|+|.+++..+.. +. .+|.|..+. ....++.+=|.+. T Consensus 156 fhkiid~~pk~GI~ELr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~ 235 (521) T protein:vir:65 156 FHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYA 235 (521) T ss_pred EEEEEcCCccccceeeeeeCCcceeeeeeecccccCCcceecceeeeeeeecCCcceeccceeecCCcceeechhheeee Confidence 9888543 346899999999999876531 11 123333221 1233333333333 Q ss_pred cc--CCCCccccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhC-------- Q lcl|NC_019719. 
184 KG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAG-------- 252 (424) Q Consensus 184 r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~-------- 252 (424) .. ...++-.-+|-+..|.+.+.....++....-+----+.-+-|+..+-+.. +..+.+-++....++.. T Consensus 236 hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~T 315 (521) T protein:vir:65 236 HSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDAST 315 (521) T ss_pred eccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccc Confidence 21 12233344677788777777776666665544333344445666555543 44555566666666553 Q ss_pred Cc--ccCccee-c----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHH Q lcl|NC_019719. 253 GP--VKKRLWI-L----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ 317 (424) Q Consensus 253 ~~--~~g~~~~-l----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~ 317 (424) |. +..+.+. + ..|.+++.|.. +..+++- ..+..+.+.++++||.+-++..+.+..+.....+ T Consensus 316 Gev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~E 392 (521) T protein:vir:65 316 GKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDD---IRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSE 392 (521) T ss_pred ccccccccccchhhhhcccccCCCCccceeecccCCCcChHHH---HHHHHHHHHHHhCCCceeccCCCCcceeccccch Confidence 11 1111222 2 13556666543 4445544 4455888999999999988544332222111111 Q ss_pred H------HHHHHHHHHHHHHHHHHHHHHhhcc-----Cccccc--cceeeecc--hhh----hccC-HHHHHHHHHHHHh Q lcl|NC_019719. 318 Q------NLGFLQYTLQPYISRWENSIQRWLI-----PAKDVG--RIHAEHNL--DGL----LRGD-SASRAAFMKAMGE 377 (424) Q Consensus 318 ~------~~~~~~~tl~P~~~~ie~~l~~~l~-----~~~~~~--~~~~~fd~--~~l----~~~d-~~~~~~~~~~~~~ 377 (424) . ..-|+..-=.-+...|.+.|...|+ ++.++. ...+.|++ +.. .... ...|...++.+-. 
T Consensus 393 ItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dp 472 (521) T protein:vir:65 393 ITRDELEFSKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITP 472 (521) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhh Confidence 1 1122222222344444555555543 445443 22333333 222 1111 1223333333221 Q ss_pred --CCCCCHHHHHH-HhCCCCCC--CCCeeeecccccchhhccccCCCc-ccCC Q lcl|NC_019719. 378 --AGLRTINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTNKEPR-NNGA 424 (424) Q Consensus 378 --~g~~T~NE~R~-~~G~~p~~--~gd~~~~~~n~~~~~~~~~~~~~~-~~ga 424 (424) +-.++.+=+|+ .|.+.-.+ ..++.... .. +.+-.++++ +-+. T Consensus 473 yvGky~S~dyi~k~ILr~tDeei~~~~k~I~~----E~-~~~~~~~p~~~~~~ 520 (521) T protein:vir:65 473 YIGKYFSNQTVMRDILKYTDDQMDTEKKQIEE----EA-NDPRFKQTPDEIED 520 (521) T ss_pred hhccccchHHHHHHHhccCHHHHHHHHHHHHH----hh-hCCCCCCCcccccC Confidence 12445555543 23332100 00000000 00 000000000 0000 No 242 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=50.73 E-value=0.6 Score=21.80 Aligned_cols=356 Identities=12% Similarity=0.087 Sum_probs=138.3 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccccc---cccccc----cccccCcccccHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGS---QTGPVS----AHGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~~---~~~~~~----~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 73 (424) |.+ .+...+-+++......+..-.+.... +..|.. ......+..... .. -.++-..|++.+|+. 
T Consensus 1 m~~-------~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~-~~-~dst~~~a~~~Las~ 71 (559) T protein:vir:95 1 MAE-------TTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNT-RI-IDSTGTMAARTLASG 71 (559) T ss_pred CCh-------hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCccccccc-cc-ccchHHHHHHHHHHH Confidence 443 33333333222221111110000000 011110 000000000000 01 123344566666666 Q ss_pred hcc------CceEEEEecccCccccccc-------cchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019719. 74 TAC------LPLDVFETDQNDNRKKVDL-------SNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 74 ia~------~~~~v~~~~~~~~~~~~~~-------~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) +-+ -||.=....+....+.... .+.+...|. +-| .+.-+..++.+++.+||+.+++..+.. T Consensus 72 l~~~ltpp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~L~~~Gta~l~~~~d~~- 145 (559) T protein:vir:95 72 MMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFN-KSN----LYQSLPQLYGSLGTYSTGAMAVLDDDE- 145 (559) T ss_pred HHHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHH-hcC----cHHHHHHHHHHHHhhCceeeEeecCCC- Confidence 543 2442221111111110000 011222332 333 444466778899999999998876543 Q ss_pred ceeeEEeecCceEEEEEcCCceEEEEE----------------------------ec--Cce------------------ Q lcl|NC_019719. 141 DVISLLPLQSANMDVKLVGKKVVYRYQ----------------------------RD--SEY------------------ 172 (424) Q Consensus 141 ~~~~l~~l~~~~v~~~~~~~~~~~~~~----------------------------~~--~~~------------------ 172 (424) ....+.+++...+.+..|..+..-.+. .+ ... T Consensus 146 ~~~r~~~~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~ 225 (559) T protein:vir:95 146 DIIRTMPFPIGSYYLANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKL 225 (559) T ss_pred ceeEEEEeecCeEEEeeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEecccccccccc Confidence 344555555555555554433210000 00 000 Q ss_pred ---------EEec--Hh--Hee-----------EeccCCCC-ccccCc-hHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_019719. 
173 ---------ADFS--QK--EIF-----------HLKGFGFT-GLVGLS-PIAFACKSAGVAVAMEDQQRDFFANGAKSPQ 226 (424) Q Consensus 173 ---------~~~~--~~--evi-----------h~r~~~~~-~~~G~s-~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~ 226 (424) +.+. .+ .++ -.|+...+ ..||.| |...+...+.....+.+...........|.. T Consensus 226 ~~~~~pf~s~~~e~~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~ 305 (559) T protein:vir:95 226 DSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPM 305 (559) T ss_pred ccccceEEEEEEEecCCCceeeecCCcccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCce Confidence 0000 01 111 11222223 379999 8999999999999999888888888888866 Q ss_pred eEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC--C-ceeeeccc-ChhHHHH-HHHHHHHHHHHHHHhCCCHHH Q lcl|NC_019719. 227 ILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA--G-FSTSAIGV-TPQDAEM-MASRKFQVSELARFFGVPPHL 301 (424) Q Consensus 227 vl~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~--g-~~~~~l~~-~~~d~~~-~e~~~~~~~~Ia~~fgVP~~~ 301 (424) ++ +.+..... .+ . ..|++.+++. | -.++++-. ++ +.++ .+..+.....|-.+|-..+.. T Consensus 306 ~v--~~~~~~~~--------~~--l---~pgg~~~~~~~~~~~~i~p~~~~~~-~~~~~~~~i~~~~~rI~~af~~d~~~ 369 (559) T protein:vir:95 306 VA--PTSLKNQR--------AS--L---LPGDITYIDQITGQDGFRPAYLVNP-STADLVADIQDTRQIINSAYFVDLFM 369 (559) T ss_pred ec--cccccccc--------ee--e---eccceeeeCCCCCcccceeeccccc-chHHHHHHHHHHHHHHHHHhhhhhHH Confidence 54 22221110 00 1 1122322221 1 22444422 22 2333 233456788899999886532 Q ss_pred -hcCCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCC Q lcl|NC_019719. 302 -VGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGL 380 (424) Q Consensus 302 -l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~ 380 (424) +...... .-++.--..+..-....|.|....++++|-.-|+.. .+..+.+.|. 
T Consensus 370 ~l~~r~~~-rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r-------------------------~~~il~r~g~ 423 (559) T protein:vir:95 370 MLQNINTR-SMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDR-------------------------SFSMMVRKNM 423 (559) T ss_pred HhhcCCCC-CCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH-------------------------HHHHHHhcCC Confidence 2222222 111111123334555667777777777754222110 0111111222 Q ss_pred CCHHHHHHHhCCCCCCCC-Ceeeecccc-cchhhccc-------------------------cCCCcccCC Q lcl|NC_019719. 381 RTINEMRRTDNLPPLPGG-DVAMRQSQY-VPITDLGT-------------------------NKEPRNNGA 424 (424) Q Consensus 381 ~T~NE~R~~~G~~p~~~g-d~~~~~~n~-~~~~~~~~-------------------------~~~~~~~ga 424 (424) + ||.|.. +..-+...+ .++..+.+ ..-+-+... T Consensus 424 l-----------P~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~~laq~~Pevld~id~d~~~ 483 (559) T protein:vir:95 424 L-----------PPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQVKPEALDKLNVDQAI 483 (559) T ss_pred C-----------CCCcccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhhhcCCHHHHH Confidence 1 112110 000000000 01110000 000000000 No 243 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=50.51 E-value=0.61 Score=21.77 Aligned_cols=364 Identities=12% Similarity=0.060 Sum_probs=139.0 Q ss_pred ccCCCCCchHHHHHhhccCcccCcccccccccccccc----cccCcccccHHHHhhhHHHHHHHHHHHHhhcc------C Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAH----GHLGDSSINDERILQISTVWRCVSLISTLTAC------L 77 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~------~ 77 (424) |+++.+---+..-++.|...+.. -..+..|.... ....+... .... 
.++--.|++.+|+.+.+ - T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e---~~~~tlP~~~~~~~~~~~~~~~~--~~~~-dstg~~a~~~LAa~l~~~ltpp~~ 74 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVE---CSELTLPYLIDDDISSRPNHKSL--TVPW-QSVGAKCCVTLAAKLMLAVLPPQT 74 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHH---HHHHhhhcccCCCCCCCcccccc--cccc-cchHHHHHHHHHHHHHHhhcCCCC Confidence 88776432222222222111100 00011111000 00011110 1111 23344566666666543 3 Q ss_pred ceEEEEecccCcccccc-------------ccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceee Q lcl|NC_019719. 78 PLDVFETDQNDNRKKVD-------------LSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVIS 144 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~-------------~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~ 144 (424) ||.=....+..-.+... ..+.+...|. +. +.+.-+..++.++..+||+.+++..+. .. T Consensus 75 ~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~-~s----nf~~~~~~~~~~L~~~G~a~ly~~~~~----~~ 145 (522) T protein:vir:10 75 SFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIA-AS----NDRVAVHQALKHLIVGGNALIFMGKDG----LK 145 (522) T ss_pred ccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHH-hc----CcHHHHHHHHHHHHhHCceeEEEcCCC----ce Confidence 44222111110000000 0111222232 33 355566777889999999998865542 24 Q ss_pred EEeecCceEEEEEcCCce-------------------------------------EEE--EEe-c-CceEEe--cHhHee Q lcl|NC_019719. 145 LLPLQSANMDVKLVGKKV-------------------------------------VYR--YQR-D-SEYADF--SQKEIF 181 (424) Q Consensus 145 l~~l~~~~v~~~~~~~~~-------------------------------------~~~--~~~-~-~~~~~~--~~~evi 181 (424) .||+....|.....++.. +|. +.. + +....+ ..+.++ T Consensus 146 ~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~~ 225 (522) T protein:vir:10 146 TFPLTRYVINRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSSGRWVWHQEAFDKII 225 (522) T ss_pred EEEcceEEEeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccCCceEEEEccCCccc Confidence 455543333222111110 010 001 0 000000 111122 Q ss_pred ---------------EeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019719. 
182 ---------------HLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 182 ---------------h~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~~~~~~~ 245 (424) -.|+...++ .||.||...+...+.....+.+.......-...|..++... +...... T Consensus 226 ~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~-~~~~~~~------ 298 (522) T protein:vir:10 226 PDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPS-STTKPAT------ 298 (522) T ss_pred cccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeccc-ccccccc------ Confidence 223332234 89999999999999999999999988888888887665422 3222211 Q ss_pred HHHHHhCCcccCccee--cCCCceeeecccChhHHHH-HHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHHHHHHH Q lcl|NC_019719. 246 NFKEIAGGPVKKRLWI--LEAGFSTSAIGVTPQDAEM-MASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF 322 (424) Q Consensus 246 ~~~~~~~~~~~g~~~~--l~~g~~~~~l~~~~~d~~~-~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~~ 322 (424) ...+.+ +.++ -++++...+++. ..|++. .+..+.....|..+|= +....++..-++.--..+..- T Consensus 299 ----l~~~~~--~~~v~g~~~~v~~~~~~~-~~d~~~~~~~i~~~~~ri~~aFl-----~~~~~d~~rvTAtEV~~r~~E 366 (522) T protein:vir:10 299 ----IAKAGN--GAIVQGRPEDVAVIQVGK-TADFSTAANMATAIEKRLLEAFL-----VMNVRNAERVTAEEVRLTQLE 366 (522) T ss_pred ----ccCCCC--cceecCCCccceeecccc-cccchHHHHHHHHHHHHHHHHHh-----hccCCCCCCCCHHHHHHHHHH Confidence 111111 1122 123333444332 234432 3445566667777773 222222222211212333345 Q ss_pred HHHHHHHHHHHHHHHHHhh-------------ccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHHHHHHH Q lcl|NC_019719. 323 LQYTLQPYISRWENSIQRW-------------LIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRT 389 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~-------------l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~ 389 (424) ....|.|....+.++|-.- +|++...... ....+...+.-.|+...+++... ...+-.. 
T Consensus 367 ~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~----~~~~v~~is~Laraq~~~~l~~~----~~~i~~~ 438 (522) T protein:vir:10 367 LEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQIPKLPKDIV----RPTIVAGVNALGRGQDRESLTAF----VGTIAQT 438 (522) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcccc----ccccccchhHHHHHHHHHHHHHH----HHHHHHh Confidence 5556666666665555322 2222211100 00111112222333332222210 1111111 Q ss_pred hCCCCCCCCCeeeecccccch-hhccccCCCcccCC Q lcl|NC_019719. 390 DNLPPLPGGDVAMRQSQYVPI-TDLGTNKEPRNNGA 424 (424) Q Consensus 390 ~G~~p~~~gd~~~~~~n~~~~-~~~~~~~~~~~~ga 424 (424) +| | | ......|+-.+ +...+ ..|. T Consensus 439 ~~--p-~---~~~~~id~d~~~~~~a~-----~~Gv 463 (522) T protein:vir:10 439 LG--P-E---ALMQYLNPLEAIKRLAA-----AQGI 463 (522) T ss_pred hC--c-h---hhhhcCCHHHHHHHHHH-----HhCC Confidence 11 1 1 01111121111 11111 1121 No 244 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=38.23 E-value=1.1 Score=20.41 Aligned_cols=357 Identities=14% Similarity=0.081 Sum_probs=141.1 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCccccc---cccccccc----ccccCcccccHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQG---SQTGPVSA----HGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~---~~~~~~~~----~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 73 (424) |++++=.= -+.+|+......+..-.+... .+..|... .....+.... .. .=.++...|++.+|+. T Consensus 1 M~~~~~~~------~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~-~~-~~dst~~~a~~~LAa~ 72 (555) T protein:vir:10 1 MAEQTERK------LLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRH-NN-ILDNTGTRALRVLAAG 72 (555) T ss_pred CCCcccHH------HHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcc-cc-cccccHHHHHHHHHHH Confidence 76665211 122222211111110000000 01111100 0000111100 00 1134445677777766 Q ss_pred hcc------CceEEEEecccCcccccc-------ccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019719. 
74 TAC------LPLDVFETDQNDNRKKVD-------LSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 74 ia~------~~~~v~~~~~~~~~~~~~-------~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) +-+ -||.=....+....+... ..+.+...|. +- +.+.-+..+..+++.+||+.+++..+..+ T Consensus 73 L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~~Lv~~G~a~l~~~~d~~~ 147 (555) T protein:vir:10 73 MMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-KS----NTYRALHSMYEELGAFGTASSIVLPDFDA 147 (555) T ss_pred HHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hc----CcHHHHHHHHHHHHhhCceEEEEecCCCc Confidence 553 244222111111111000 0111222232 22 34555667788999999999998776543 Q ss_pred ceeeEEeecCceEEEEEcCCceE---EE-----------------------------------------EE-ecC----- Q lcl|NC_019719. 141 DVISLLPLQSANMDVKLVGKKVV---YR-----------------------------------------YQ-RDS----- 170 (424) Q Consensus 141 ~~~~l~~l~~~~v~~~~~~~~~~---~~-----------------------------------------~~-~~~----- 170 (424) .+.+.+++...+.+..|..+.. |+ |. ... T Consensus 148 -~~rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~ 226 (555) T protein:vir:10 148 -VVYHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKR 226 (555) T ss_pred -eEEEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCC Confidence 3444444444444444433321 00 00 000 Q ss_pred --c-----eEEec----HhHe-----------eEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCcee Q lcl|NC_019719. 171 --E-----YADFS----QKEI-----------FHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQI 227 (424) Q Consensus 171 --~-----~~~~~----~~ev-----------ih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~v 227 (424) . .+.+. ...| +.+|+...++ .||.||...+...+.....+.+.......-...|... 
T Consensus 227 ~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (555) T protein:vir:10 227 DDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQ 306 (555) T ss_pred CccccceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 0 01110 0011 2233333334 7999999999999999999988888777777777655 Q ss_pred EEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC---CceeeecccChhHHH-HHHHHHHHHHHHHHHhCCCHHHhc Q lcl|NC_019719. 228 LSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA---GFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVG 303 (424) Q Consensus 228 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~---g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVP~~~l~ 303 (424) +.... .... ....- |++..+.. +-...++-....|.+ ..+..+.....|..+|=.+..... T Consensus 307 v~~~~-~~~~---------~~~~p-----gg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l 371 (555) T protein:vir:10 307 LPVSA-KNQD---------ISTVP-----GGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLML 371 (555) T ss_pred ecccc-cccc---------ceecc-----ccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhc Confidence 43221 1111 11111 12211111 112233222222333 334567778889999966532222 Q ss_pred CCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCH Q lcl|NC_019719. 304 DVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTI 383 (424) Q Consensus 304 ~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~ 383 (424) ...++..-++.--..+..-....|.|....+.++|-.-|+.. .+..+.+.|.+ T Consensus 372 ~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r-------------------------~~~il~r~g~l-- 424 (555) T protein:vir:10 372 ANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIEL-------------------------TFQRMVEANIL-- 424 (555) T ss_pred cCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH-------------------------HHHHHHhcCCC-- Confidence 122222221222234456667777788888777664322210 11112222222 Q ss_pred HHHHHHhCCCCCCC---CCeeeecccc-cchhhccccC------------------CC-------cc---------cCC Q lcl|NC_019719. 
384 NEMRRTDNLPPLPG---GDVAMRQSQY-VPITDLGTNK------------------EP-------RN---------NGA 424 (424) Q Consensus 384 NE~R~~~G~~p~~~---gd~~~~~~n~-~~~~~~~~~~------------------~~-------~~---------~ga 424 (424) ||.|. +..+ ...+ .++..+.+.. +| -+ .|. T Consensus 425 ---------P~~P~~l~~~~i--~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gv 492 (555) T protein:vir:10 425 ---------PPPPQEMQGVDL--NVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGI 492 (555) T ss_pred ---------CCCchhhcCcee--EEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCC Confidence 11111 1100 0000 0111100000 00 00 000 No 245 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=38.23 E-value=1.1 Score=20.41 Aligned_cols=357 Identities=14% Similarity=0.081 Sum_probs=141.1 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCccccc---cccccccc----ccccCcccccHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQG---SQTGPVSA----HGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~---~~~~~~~~----~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 73 (424) |++++=.= -+.+|+......+..-.+... .+..|... .....+.... .. .=.++...|++.+|+. T Consensus 1 M~~~~~~~------~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~-~~-~~dst~~~a~~~LAa~ 72 (555) T protein:vir:10 1 MAEQTERK------LLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRH-NN-ILDNTGTRALRVLAAG 72 (555) T ss_pred CCCcccHH------HHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcc-cc-cccccHHHHHHHHHHH Confidence 76665211 122222211111110000000 01111100 0000111100 00 1134445677777766 Q ss_pred hcc------CceEEEEecccCcccccc-------ccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019719. 
74 TAC------LPLDVFETDQNDNRKKVD-------LSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 74 ia~------~~~~v~~~~~~~~~~~~~-------~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) +-+ -||.=....+....+... ..+.+...|. +- +.+.-+..+..+++.+||+.+++..+..+ T Consensus 73 L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~~Lv~~G~a~l~~~~d~~~ 147 (555) T protein:vir:10 73 MMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-KS----NTYRALHSMYEELGAFGTASSIVLPDFDA 147 (555) T ss_pred HHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hc----CcHHHHHHHHHHHHhhCceEEEEecCCCc Confidence 553 244222111111111000 0111222232 22 34555667788999999999998776543 Q ss_pred ceeeEEeecCceEEEEEcCCceE---EE-----------------------------------------EE-ecC----- Q lcl|NC_019719. 141 DVISLLPLQSANMDVKLVGKKVV---YR-----------------------------------------YQ-RDS----- 170 (424) Q Consensus 141 ~~~~l~~l~~~~v~~~~~~~~~~---~~-----------------------------------------~~-~~~----- 170 (424) .+.+.+++...+.+..|..+.. |+ |. ... T Consensus 148 -~~rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~ 226 (555) T protein:vir:10 148 -VVYHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKR 226 (555) T ss_pred -eEEEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCC Confidence 3444444444444444433321 00 00 000 Q ss_pred --c-----eEEec----HhHe-----------eEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCcee Q lcl|NC_019719. 171 --E-----YADFS----QKEI-----------FHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQI 227 (424) Q Consensus 171 --~-----~~~~~----~~ev-----------ih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~v 227 (424) . .+.+. ...| +.+|+...++ .||.||...+...+.....+.+.......-...|... 
T Consensus 227 ~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (555) T protein:vir:10 227 DDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQ 306 (555) T ss_pred CccccceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 0 01110 0011 2233333334 7999999999999999999988888777777777655 Q ss_pred EEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC---CceeeecccChhHHH-HHHHHHHHHHHHHHHhCCCHHHhc Q lcl|NC_019719. 228 LSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA---GFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVG 303 (424) Q Consensus 228 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~---g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVP~~~l~ 303 (424) +.... .... ....- |++..+.. +-...++-....|.+ ..+..+.....|..+|=.+..... T Consensus 307 v~~~~-~~~~---------~~~~p-----gg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l 371 (555) T protein:vir:10 307 LPVSA-KNQD---------ISTVP-----GGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLML 371 (555) T ss_pred ecccc-cccc---------ceecc-----ccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhc Confidence 43221 1111 11111 12211111 112233222222333 334567778889999966532222 Q ss_pred CCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCH Q lcl|NC_019719. 304 DVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTI 383 (424) Q Consensus 304 ~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~ 383 (424) ...++..-++.--..+..-....|.|....+.++|-.-|+.. .+..+.+.|.+ T Consensus 372 ~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r-------------------------~~~il~r~g~l-- 424 (555) T protein:vir:10 372 ANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIEL-------------------------TFQRMVEANIL-- 424 (555) T ss_pred cCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH-------------------------HHHHHHhcCCC-- Confidence 122222221222234456667777788888777664322210 11112222222 Q ss_pred HHHHHHhCCCCCCC---CCeeeecccc-cchhhccccC------------------CC-------cc---------cCC Q lcl|NC_019719. 
384 NEMRRTDNLPPLPG---GDVAMRQSQY-VPITDLGTNK------------------EP-------RN---------NGA 424 (424) Q Consensus 384 NE~R~~~G~~p~~~---gd~~~~~~n~-~~~~~~~~~~------------------~~-------~~---------~ga 424 (424) ||.|. +..+ ...+ .++..+.+.. +| -+ .|. T Consensus 425 ---------P~~P~~l~~~~i--~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gv 492 (555) T protein:vir:10 425 ---------PPPPQEMQGVDL--NVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGI 492 (555) T ss_pred ---------CCCchhhcCcee--EEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCC Confidence 11111 1100 0000 0111100000 00 00 000 No 246 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=38.23 E-value=1.1 Score=20.41 Aligned_cols=357 Identities=14% Similarity=0.081 Sum_probs=141.1 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCccccc---cccccccc----ccccCcccccHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQG---SQTGPVSA----HGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~~---~~~~~~~~----~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 73 (424) |++++=.= -+.+|+......+..-.+... .+..|... .....+.... .. .=.++...|++.+|+. T Consensus 1 M~~~~~~~------~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~-~~-~~dst~~~a~~~LAa~ 72 (555) T protein:vir:98 1 MAEQTERK------LLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRH-NN-ILDNTGTRALRVLAAG 72 (555) T ss_pred CCCcccHH------HHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcc-cc-cccccHHHHHHHHHHH Confidence 76665211 122222211111110000000 01111100 0000111100 00 1134445677777766 Q ss_pred hcc------CceEEEEecccCcccccc-------ccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019719. 
74 TAC------LPLDVFETDQNDNRKKVD-------LSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 74 ia~------~~~~v~~~~~~~~~~~~~-------~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) +-+ -||.=....+....+... ..+.+...|. +- +.+.-+..+..+++.+||+.+++..+..+ T Consensus 73 L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~~Lv~~G~a~l~~~~d~~~ 147 (555) T protein:vir:98 73 MMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFA-KS----NTYRALHSMYEELGAFGTASSIVLPDFDA 147 (555) T ss_pred HHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHH-hc----CcHHHHHHHHHHHHhhCceEEEEecCCCc Confidence 553 244222111111111000 0111222232 22 34555667788999999999998776543 Q ss_pred ceeeEEeecCceEEEEEcCCceE---EE-----------------------------------------EE-ecC----- Q lcl|NC_019719. 141 DVISLLPLQSANMDVKLVGKKVV---YR-----------------------------------------YQ-RDS----- 170 (424) Q Consensus 141 ~~~~l~~l~~~~v~~~~~~~~~~---~~-----------------------------------------~~-~~~----- 170 (424) .+.+.+++...+.+..|..+.. |+ |. ... T Consensus 148 -~~rf~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~ 226 (555) T protein:vir:98 148 -VVYHHSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKR 226 (555) T ss_pred -eEEEEEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCC Confidence 3444444444444444433321 00 00 000 Q ss_pred --c-----eEEec----HhHe-----------eEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCcee Q lcl|NC_019719. 171 --E-----YADFS----QKEI-----------FHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQI 227 (424) Q Consensus 171 --~-----~~~~~----~~ev-----------ih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~v 227 (424) . .+.+. ...| +.+|+...++ .||.||...+...+.....+.+.......-...|... 
T Consensus 227 ~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (555) T protein:vir:98 227 DDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQ 306 (555) T ss_pred CccccceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 0 01110 0011 2233333334 7999999999999999999988888777777777655 Q ss_pred EEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC---CceeeecccChhHHH-HHHHHHHHHHHHHHHhCCCHHHhc Q lcl|NC_019719. 228 LSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA---GFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVG 303 (424) Q Consensus 228 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~l~~---g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVP~~~l~ 303 (424) +.... .... ....- |++..+.. +-...++-....|.+ ..+..+.....|..+|=.+..... T Consensus 307 v~~~~-~~~~---------~~~~p-----gg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l 371 (555) T protein:vir:98 307 LPVSA-KNQD---------ISTVP-----GGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLML 371 (555) T ss_pred ecccc-cccc---------ceecc-----ccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhc Confidence 43221 1111 11111 12211111 112233222222333 334567778889999966532222 Q ss_pred CCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCH Q lcl|NC_019719. 304 DVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTI 383 (424) Q Consensus 304 ~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~ 383 (424) ...++..-++.--..+..-....|.|....+.++|-.-|+.. .+..+.+.|.+ T Consensus 372 ~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r-------------------------~~~il~r~g~l-- 424 (555) T protein:vir:98 372 ANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIEL-------------------------TFQRMVEANIL-- 424 (555) T ss_pred cCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH-------------------------HHHHHHhcCCC-- Confidence 122222221222234456667777788888777664322210 11112222222 Q ss_pred HHHHHHhCCCCCCC---CCeeeecccc-cchhhccccC------------------CC-------cc---------cCC Q lcl|NC_019719. 
384 NEMRRTDNLPPLPG---GDVAMRQSQY-VPITDLGTNK------------------EP-------RN---------NGA 424 (424) Q Consensus 384 NE~R~~~G~~p~~~---gd~~~~~~n~-~~~~~~~~~~------------------~~-------~~---------~ga 424 (424) ||.|. +..+ ...+ .++..+.+.. +| -+ .|. T Consensus 425 ---------P~~P~~l~~~~i--~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gv 492 (555) T protein:vir:98 425 ---------PPPPQEMQGVDL--NVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGI 492 (555) T ss_pred ---------CCCchhhcCcee--EEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCC Confidence 11111 1100 0000 0111100000 00 00 000 No 247 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=37.14 E-value=1.1 Score=20.28 Aligned_cols=393 Identities=12% Similarity=0.056 Sum_probs=140.5 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccc---cccccccc--cccccCcccccHHHHhhhHHHHHHHHHHHHhhc Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQ---GSQTGPVS--AHGHLGDSSINDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~---~~~~~~~~--~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia 75 (424) |.|-+--+.-++=..-|++|++. +..-.+.. ..+..|.. ..+..+... ..... .++...|++.+|+.+- T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~---R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~--~~~~~-dst~~~a~~~Laa~l~ 74 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKND---RAPYETRAQNCAQYTIPSLFPKDSDNASTD--YQTPW-QAVGARGLNNLASKLM 74 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHH---hhHHHHHHHHHHHHhcccccCCCCCccccc--ccccc-cccHHHHHHHHHHHHH Confidence 55532111111111223333221 00000000 00011110 000001100 01111 2344456666665544 Q ss_pred cC-----ceEEEEecccCccccc--------------cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_019719. 
76 CL-----PLDVFETDQNDNRKKV--------------DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) Q Consensus 76 ~~-----~~~v~~~~~~~~~~~~--------------~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r 136 (424) +. ||.=....+.+-.... ...+.+...|. +.| .+.-+..+..+++.+||+.+++.. T Consensus 75 ~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~L~~~G~a~ly~~e 149 (536) T protein:vir:21 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNS----YRVTLFEALKQLVVAGNVLLYLPE 149 (536) T ss_pred HhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHH-hcC----cHHHHHHHHHHHHhHCcEeEEEee Confidence 31 3322111111100000 00112223332 333 445556778888899999999876 Q ss_pred CCCCce--eeEEeecCceEEEEEcCCce-----------------------------------EEE--EEe-cC-ceE-- Q lcl|NC_019719. 137 NSAGDV--ISLLPLQSANMDVKLVGKKV-----------------------------------VYR--YQR-DS-EYA-- 173 (424) Q Consensus 137 ~~~G~~--~~l~~l~~~~v~~~~~~~~~-----------------------------------~~~--~~~-~~-~~~-- 173 (424) +..+.+ ...|||....|....+++.. +|. +.. ++ ... T Consensus 150 ~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~ 229 (536) T protein:vir:21 150 PEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRY 229 (536) T ss_pred CCCCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCCcEEEE Confidence 654433 35566644333332222110 000 000 10 000 Q ss_pred ------EecHh---------HeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH Q lcl|NC_019719. 174 ------DFSQK---------EIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 174 ------~~~~~---------evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~ 237 (424) .+..+ =.+.+|+...++ .||.||...+...+.....+.+.......-...|...+. +.+.... 
T Consensus 230 ~e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~-p~g~~~~ 308 (536) T protein:vir:21 230 EEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQP 308 (536) T ss_pred eccCCeeeccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccC-cccccch Confidence 01000 113334433344 899999999999999999888888776666566655443 3333232 Q ss_pred HHHHHHHHHHHHHhCCcccCcce-ecCCCceeeecccChhHHH-HHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhH Q lcl|NC_019719. 238 QQRSQVEENFKEIAGGPVKKRLW-ILEAGFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGI 315 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~g~~~-~l~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~ 315 (424) .. ...+.+ |.++ ..++.....++... .+.+ ..+..+.....|..+|-+.. +... ++..-++.- T Consensus 309 ~~----------~~~~~~-g~~v~g~~~~v~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~--l~~~-~~~r~TAtE 373 (536) T protein:vir:21 309 RR----------LTKAQT-GDFVTGRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS--AVQR-TGERVTAEE 373 (536) T ss_pred hh----------hccCCC-cceecCCcccceeeecccc-ccchHHHHHHHHHHHHHHHHHhhhh--cccC-CCCCccHHH Confidence 21 111111 1111 12233444454433 2332 23455666777888885431 2111 121111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh-------------hccCccccccceeee--cchhhhccC-HHHHHHHHHHHHhCC Q lcl|NC_019719. 316 EQQNLGFLQYTLQPYISRWENSIQR-------------WLIPAKDVGRIHAEH--NLDGLLRGD-SASRAAFMKAMGEAG 379 (424) Q Consensus 316 e~~~~~~~~~tl~P~~~~ie~~l~~-------------~l~~~~~~~~~~~~f--d~~~l~~~d-~~~~~~~~~~~~~~g 379 (424) -..+..-....|.|....++++|-. 
.+|++.........+ -+..+.+.- .+....++..+.+.+ T Consensus 374 V~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~ 453 (536) T protein:vir:21 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALA 453 (536) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhc Confidence 1222333444444544444444322 244332221122222 122222211 111111212111111 Q ss_pred ------CCCHHHH----HHHhCCCCCCCCCeeeeccc-ccchh--------------hccc------cCCCcc------c Q lcl|NC_019719. 380 ------LRTINEM----RRTDNLPPLPGGDVAMRQSQ-YVPIT--------------DLGT------NKEPRN------N 422 (424) Q Consensus 380 ------~~T~NE~----R~~~G~~p~~~gd~~~~~~n-~~~~~--------------~~~~------~~~~~~------~ 422 (424) .+..+++ .+.+|.+|.. ++.+.. ...+. .++. ...++. + T Consensus 454 Pe~ld~~id~d~~~~~~a~~~Gv~p~~----~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 529 (536) T protein:vir:21 454 PMRDDPDINLAMIKLRIANAIGIDTSG----ILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADS 529 (536) T ss_pred hhhhcccCCHHHHHHHHHHHcCCChhh----hcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHhhhhc Confidence 1222222 2234553310 111000 00000 0000 001111 1 Q ss_pred CC Q lcl|NC_019719. 423 GA 424 (424) Q Consensus 423 ga 424 (424) ++ T Consensus 530 ~g 531 (536) T protein:vir:21 530 VG 531 (536) T ss_pred cc Confidence 11 No 248 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=33.05 E-value=1.4 Score=19.81 Aligned_cols=393 Identities=11% Similarity=0.055 Sum_probs=141.1 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCcccc---cccccccc--cccccCcccccHHHHhhhHHHHHHHHHHHHhhc Q lcl|NC_019719. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQ---GSQTGPVS--AHGHLGDSSINDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~l~~~~~~~~~~~~~~---~~~~~~~~--~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia 75 (424) |.|-+--+.-++=..-|++|++. +..-.+.. ..+..|.. 
..+..+... ..... .++...|++.+|+.+- T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~---R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~--~~~~~-dst~~~a~~~Laa~l~ 74 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKND---RAPYETRAQNCAQYTIPSLFPKDSDNASTD--YQTPW-QAVGARGLNNLASKLM 74 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHH---hhHHHHHHHHHHHHhcccccCCCCCccccc--ccccc-cccHHHHHHHHHHHHH Confidence 55532111111111223333221 00000000 00111110 000001100 01111 2334456666665554 Q ss_pred cC-----ceEEEEecccCccccc--------------cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_019719. 76 CL-----PLDVFETDQNDNRKKV--------------DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) Q Consensus 76 ~~-----~~~v~~~~~~~~~~~~--------------~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r 136 (424) +. ||.=....+.+-.... ...+.+...|. +.| .+.-+..+..+++.+||+.+++.. T Consensus 75 ~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~L~~~G~a~ly~~e 149 (536) T protein:vir:10 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNS----YRVTLFEALKQLVVAGNVLLYLPE 149 (536) T ss_pred hhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHH-hcC----cHHHHHHHHHHHHhHCcEeEEEee Confidence 31 3422111111100000 00112223332 333 445556778888899999999876 Q ss_pred CCCCce--eeEEeecCceEEEEEcCCce-----------------------------------EE--EEEe--cCce--- Q lcl|NC_019719. 137 NSAGDV--ISLLPLQSANMDVKLVGKKV-----------------------------------VY--RYQR--DSEY--- 172 (424) Q Consensus 137 ~~~G~~--~~l~~l~~~~v~~~~~~~~~-----------------------------------~~--~~~~--~~~~--- 172 (424) +..+.+ ...|||....|....+++.. +| .+.. ++.. T Consensus 150 ~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~~ 229 (536) T protein:vir:10 150 PEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASGEYLRY 229 (536) T ss_pred CCCCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCCcEEEE Confidence 654433 35566644333332222110 00 0111 0100 Q ss_pred EEecHhH--------------eeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH Q lcl|NC_019719. 
173 ADFSQKE--------------IFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 173 ~~~~~~e--------------vih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~ 237 (424) ..+.... .+.+|+...++ .||.||...+...+.....+.+.......-...|..++. +.+.... T Consensus 230 ~e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~-p~g~~~~ 308 (536) T protein:vir:10 230 EEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQP 308 (536) T ss_pred EeecCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccC-cccccch Confidence 0111111 13334433344 899999999999999999888888776666566655443 3333232 Q ss_pred HHHHHHHHHHHHHhCCcccCcce-ecCCCceeeecccChhHHH-HHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhH Q lcl|NC_019719. 238 QQRSQVEENFKEIAGGPVKKRLW-ILEAGFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGI 315 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~g~~~-~l~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~ 315 (424) .. ...+.+ |.++ ..++.....++... .+.+ ..+..+.....|..+|-+.. +... ++..-++.- T Consensus 309 ~~----------~~~~~~-g~~v~g~~~~v~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~--l~~~-~~~r~TAtE 373 (536) T protein:vir:10 309 RR----------LTKAQT-GDFVTGRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS--AVQR-TGERVTAEE 373 (536) T ss_pred hh----------hccCCC-cceecCCcccceeeecccc-ccchHHHHHHHHHHHHHHHHHhhhh--cccC-CCCCccHHH Confidence 21 111111 1111 12233444554433 2332 23455666777888885432 2111 121111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh-------------hccCccccccceeee--cchhhhccC-HHHHHHHHHHHHhCC Q lcl|NC_019719. 316 EQQNLGFLQYTLQPYISRWENSIQR-------------WLIPAKDVGRIHAEH--NLDGLLRGD-SASRAAFMKAMGEAG 379 (424) Q Consensus 316 e~~~~~~~~~tl~P~~~~ie~~l~~-------------~l~~~~~~~~~~~~f--d~~~l~~~d-~~~~~~~~~~~~~~g 379 (424) -..+..-....|.|....++++|-. 
.+|++.........+ -+..+.+.- .+....++..+.+.+ T Consensus 374 V~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~ 453 (536) T protein:vir:10 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALA 453 (536) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhc Confidence 1222333444444544444444322 244332221122222 122222211 111111122111111 Q ss_pred ------CCCHHHHH----HHhCCCCCCCCCeeeeccc-ccchh--------------hccc------cCCCcc------c Q lcl|NC_019719. 380 ------LRTINEMR----RTDNLPPLPGGDVAMRQSQ-YVPIT--------------DLGT------NKEPRN------N 422 (424) Q Consensus 380 ------~~T~NE~R----~~~G~~p~~~gd~~~~~~n-~~~~~--------------~~~~------~~~~~~------~ 422 (424) .+..+++- +.+|.+|.. ++.+.. ...+. .++. ...++. + T Consensus 454 P~~ld~~id~d~~~~~~a~~~Gv~p~~----~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 529 (536) T protein:vir:10 454 PMRDDPDINLAMIKLRIANAIGIDTSG----ILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADS 529 (536) T ss_pred hhhhcccCCHHHHHHHHHHHcCCCchh----hcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhHHhhhhc Confidence 12233322 234553310 111100 00000 0000 001111 1 Q ss_pred CC Q lcl|NC_019719. 423 GA 424 (424) Q Consensus 423 ga 424 (424) ++ T Consensus 530 ~g 531 (536) T protein:vir:10 530 VG 531 (536) T ss_pred cc Confidence 11 No 249 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=23.33 E-value=2.3 Score=18.58 Aligned_cols=373 Identities=11% Similarity=0.034 Sum_probs=135.3 Q ss_pred ccCCCCCchHHHHHhhc---cCcccCccccccccccccc--ccccCcccccHHHHhhhHHHHHHHHHHHHhhcc------ Q lcl|NC_019719. 8 IDLRTNNGWWARLQSWF---VGGRLVTPNQGSQTGPVSA--HGHLGDSSINDERILQISTVWRCVSLISTLTAC------ 76 (424) Q Consensus 8 ~~~~~~~G~~~~l~~~~---~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~------ 76 (424) |+-.- ..-|++|++.- ...+. .-..+..|... .+...+.. 
..... .++-..|++.+|+.+-+ T Consensus 1 mk~~a-~~r~~~l~~~R~~~e~~w~---e~~~y~lP~~~~~~~~~~~~~--~~~~~-dstg~~a~~~Laa~l~~~ltpp~ 73 (542) T protein:vir:78 1 MKGLA-QARYSAMRADREDFLDMAR---RCAALTLPYLLTEDGHASGGR--LQQPY-QSLGSKGVNALSSKLMLSLFPIQ 73 (542) T ss_pred ChhHH-HHHHHHHHHHhhHHHHHHH---HHHHHhccccCCCCCCccccc--ccccc-cchHHHHHHHHHHHHHHhhcCCC Confidence 43110 12344443211 00000 00000111100 00000110 01111 23344566666666543 Q ss_pred CceEEEEeccc--------Ccc-c-cc-c----ccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCc Q lcl|NC_019719. 77 LPLDVFETDQN--------DNR-K-KV-D----LSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGD 141 (424) Q Consensus 77 ~~~~v~~~~~~--------~~~-~-~~-~----~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~ 141 (424) -||.=....+. +.. . +. . ....+...|. +. +.+.-+..++.++..+||+.+++..+ T Consensus 74 ~~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~~L~~~G~a~l~~~~~---- 144 (542) T protein:vir:78 74 TSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIA-ES----SDRVQLTAAMKHLIVTGNVLVFAGKK---- 144 (542) T ss_pred CccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHH-hc----CcHHHHHHHHHHHHhhCeEEEEecCC---- Confidence 34432211110 000 0 00 0 0111222332 23 34555667788899999999887543 Q ss_pred eeeEEeecCceEEEEEcCCceE-E-------------------------------------------------------- Q lcl|NC_019719. 142 VISLLPLQSANMDVKLVGKKVV-Y-------------------------------------------------------- 164 (424) Q Consensus 142 ~~~l~~l~~~~v~~~~~~~~~~-~-------------------------------------------------------- 164 (424) +...|||....|....+++... | T Consensus 145 ~~~~~pl~~y~v~~d~~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~~~ 224 (542) T protein:vir:78 145 TLKVYPLDRYVIERDGDGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKLV 224 (542) T ss_pred CceEEecceeEEeeCCCCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccccC Confidence 2345565443333222211100 0 Q ss_pred ------EEEecCceE-------EecHhHeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEc Q lcl|NC_019719. 
165 ------RYQRDSEYA-------DFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILST 230 (424) Q Consensus 165 ------~~~~~~~~~-------~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~ 230 (424) .....+... .|...=.+-.|+...++ .||.||...+...+.....+.+.......-...|..++.. T Consensus 225 ~~~~s~~~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~ 304 (542) T protein:vir:78 225 DGQHRWHQECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSP 304 (542) T ss_pred CCeEEEEEEeccccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecc Confidence 000000000 00000112223333333 8999999999999999999999998888888888765532 Q ss_pred CCCCCCHHHHHHHHHHHHHHhCCcccCccee--cCCCceeeecccChhHHH-HHHHHHHHHHHHHHHhCCCHHHhcCCCC Q lcl|NC_019719. 231 GEKVLTEQQRSQVEENFKEIAGGPVKKRLWI--LEAGFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVGDVEK 307 (424) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~--l~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVP~~~l~~~~~ 307 (424) . +...... ...+.+ | .++ .++++...++... .+.+ ..+..+.....|..+|-+- ...+ T Consensus 305 ~-g~~~~~~----------~~~~~~-g-~iv~g~~~~v~~~~~~~~-~~~~~~~~~i~~~~~rI~~aFl~~-----~~~d 365 (542) T protein:vir:78 305 S-ATTKPQS----------LARAGT-G-AIIQGRAEDVSVVQANKG-ADFRTVQEMIRDLSQRISDAFLIL-----NVRQ 365 (542) T ss_pred c-cccchhh----------cccCCC-c-eeecCCccceeeeecccc-cchhHHHHHHHHHHHHHHHHhccc-----ccCC Confidence 2 2223221 111111 1 112 1233333343332 3333 2345566677777777432 2222 Q ss_pred CCccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcc----ccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCH Q lcl|NC_019719. 308 STSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK----DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTI 383 (424) Q Consensus 308 ~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~----~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~ 383 (424) +..-++.--..+..-....|.|....++++|-.-|+... .+.+...... .++...+...-.+.+.++.. 
+-.+ T Consensus 366 ~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p-~~lv~~~~~s~La~~~r~~~--~~~l 442 (542) T protein:vir:78 366 SERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLP-KGLVMPTVVAGLGGVGRGED--RAAL 442 (542) T ss_pred cccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCc-hhceeeeeechHHHHHHHHH--HHHH Confidence 222222222344456667777888888776643222100 0000000000 00000000001111111110 0111 Q ss_pred HHHHHHhCCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 384 NEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 384 NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ...-..++.- + +-..+....|+-.+-+. -....|- T Consensus 443 ~~~~~~i~~~-~-~p~~l~~~id~d~~~~~----~a~~~Gv 477 (542) T protein:vir:78 443 IEFMQTVGQA-M-GPEALQQFIDPTEFLKR----LAAASGI 477 (542) T ss_pred HHHHHHHHHh-c-CChhHHhcCCHHHHHHH----HHHHcCC Confidence 1111111110 0 00000000011000000 0001111 No 250 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=20.79 E-value=2.7 Score=18.21 Aligned_cols=382 Identities=14% Similarity=0.062 Sum_probs=145.6 Q ss_pred CCCCcccccCCCCC--chHHHHHhhc---cCcccCcccccccccc--cccccccCcccccHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019719. 1 MEEPKYTIDLRTNN--GWWARLQSWF---VGGRLVTPNQGSQTGP--VSAHGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~--G~~~~l~~~~---~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 73 (424) |-+-|-+. |..+. ..|++|++.- ...+. .-..+..| +...+...+... .... .++-..|++.+|+. 
T Consensus 1 m~~~~~~~-~~~~~~k~r~~~l~~~R~~~e~~w~---e~~~~~lP~~~~~~~~~~~~~~--~~~~-dst~~~a~~~Laa~ 73 (535) T protein:vir:15 1 MADSKRTG-LGEDGAKATYDRLTNDRRAYETRAE---NCAQYTIPSLFPKESDNESTDY--TTPW-QAVGARGLNNLASK 73 (535) T ss_pred CCccchhc-cchHHHHHHHHHHHHHhhHHHHHHH---HHHHHhcccccCCCCCcccccc--cccc-cccHHHHHHHHHHH Confidence 66655221 11111 2344444321 00000 00001111 100011111110 1111 33444566666666 Q ss_pred hccC-----ceEEEEecccC------ccc---cc-----cccchhhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEE Q lcl|NC_019719. 74 TACL-----PLDVFETDQND------NRK---KV-----DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV 134 (424) Q Consensus 74 ia~~-----~~~v~~~~~~~------~~~---~~-----~~~~~l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~ 134 (424) +-+. ||.=....+.. ... +. ...+.+...|. +- +.+.-+..+..+++.+||+.+++ T Consensus 74 l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~~L~~~G~a~l~~ 148 (535) T protein:vir:15 74 LMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIE-SN----SYRVTLFECLKQLIVAGNALLYL 148 (535) T ss_pred HHHhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHH-hc----CcHHHHHHHHHHHHhhCceeEEe Confidence 5431 33221111100 000 00 00111222232 33 35555667788999999999888 Q ss_pred eeCCCC-ceeeEEeecCceEEEEEcCCce-----------------------------------EEE--EEe-c-CceEE Q lcl|NC_019719. 135 DRNSAG-DVISLLPLQSANMDVKLVGKKV-----------------------------------VYR--YQR-D-SEYAD 174 (424) Q Consensus 135 ~r~~~G-~~~~l~~l~~~~v~~~~~~~~~-----------------------------------~~~--~~~-~-~~~~~ 174 (424) ..+..+ .....||+....|.....++.. +|. +.. . +.... T Consensus 149 ~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~ 228 (535) T protein:vir:15 149 PEPEGSYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYLK 228 (535) T ss_pred ecCCCCceeeEEEEcCeeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecCCCcEEE Confidence 665433 3445566643333322111100 011 001 0 00000 Q ss_pred ---ecHhH--------------eeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC Q lcl|NC_019719. 
175 ---FSQKE--------------IFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT 236 (424) Q Consensus 175 ---~~~~e--------------vih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~ 236 (424) +...+ .+..|+...++ .||.||...+...+.....+.+.......-...|..++.. .+... T Consensus 229 ~~e~~g~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~-~g~~~ 307 (535) T protein:vir:15 229 YEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNP-AGITQ 307 (535) T ss_pred EEEeeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecc-ccccc Confidence 00001 23334433344 8999999999999999999999998888888888866542 23222 Q ss_pred HHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHH-HHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhH Q lcl|NC_019719. 237 EQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGI 315 (424) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~g~~~~l~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~ 315 (424) ... ...+.+..-+.-.++++...++...+ +.+ ..+..+.....|..+|=+. .+...+ +..-++.- T Consensus 308 ~~~----------l~~~~~g~~v~g~~~~v~~~~~~~~~-~~~~~~~~i~~~~~~I~~af~~~--~~~~~~-~~r~TAtE 373 (535) T protein:vir:15 308 PRR----------LTKAQTGDFVPGRREDIDFLQLEKQA-DFTVAKAVSDQIEARLSYAFMLN--SAVQRT-GERVTAEE 373 (535) T ss_pred chh----------cccCCceeeecCCcccceeeeccccc-chhHHHHHHHHHHHHHHHHHhhh--hcccCC-CccccHHH Confidence 221 11111100011123444555554432 232 2345556677788888443 122111 21111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh-------------ccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCC Q lcl|NC_019719. 316 EQQNLGFLQYTLQPYISRWENSIQRW-------------LIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRT 382 (424) Q Consensus 316 e~~~~~~~~~tl~P~~~~ie~~l~~~-------------l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T 382 (424) -..+..-....|.|....++++|-.- ++++.......+++ ++.|- ...|...++++.+. 
T Consensus 374 V~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~y-is~La---~aqr~~~~~~l~~~---- 445 (535) T protein:vir:15 374 IRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTI-STGLE---AIGRGQDLDKLERC---- 445 (535) T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEE-ecHHH---HHHHHHHHHHHHHH---- Confidence 12333445555666666655554322 33322222222222 11221 11122222222220 Q ss_pred HHHHHHHhCCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 383 INEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 383 ~NE~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ... .-.+.| +-.|..+ |+..+-+.-..--+-+... T Consensus 446 ~~~---la~~~P-~~ld~~i---d~d~~~~~~a~~~Gvp~~~ 480 (535) T protein:vir:15 446 ISA---WAALAP-MQGDPDI---NLAVIKLRIANAIGIDTSG 480 (535) T ss_pred HHH---HHhcCh-hhhhccC---CHHHHHHHHHHHcCCChhh Confidence 111 112222 1122111 2211111000000111011 No 251 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=20.16 E-value=2.8 Score=18.11 Aligned_cols=380 Identities=13% Similarity=0.055 Sum_probs=137.5 Q ss_pred CCCCcccccCCCCC--chHHHHHhhccCcccCc-ccccccccccc--cccccCcccccHHHHhhhHHHHHHHHHHHHhhc Q lcl|NC_019719. 1 MEEPKYTIDLRTNN--GWWARLQSWFVGGRLVT-PNQGSQTGPVS--AHGHLGDSSINDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~~~~~~~~~~~~--G~~~~l~~~~~~~~~~~-~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia 75 (424) |..++-+=.|..+. ..|++|++.-. .+... ..-..+..|.. ..+..+... .... 
-.++...|++.+|+.+- T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~-~~e~~w~e~~~y~lP~~~~~~~~~~~~~--~~~~-~dst~~~a~~~Laa~l~ 76 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRN-SYETRAENCAKYTIPSLFPKDSDNASTD--YTTP-WQAVGARGLNNLASKLM 76 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhh-HHHHHHHHHHHHhccccCCCCCCccccc--cCCc-ccccHHHHHHHHHHHHH Confidence 33332211111111 22333332110 00000 00000011110 000001100 0111 12334456666666554 Q ss_pred cC-----ceEEEEecccCccc------c-ccccch-------hhhhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_019719. 76 CL-----PLDVFETDQNDNRK------K-VDLSNP-------LARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) Q Consensus 76 ~~-----~~~v~~~~~~~~~~------~-~~~~~~-------l~~lL~~~pN~~~s~~~f~~~~~~~~l~~G~a~~~~~r 136 (424) +. ||.=....+..-.. + ...... +...| .+. +.+.-+..++.++..+||+.+++.. T Consensus 77 ~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~-~~s----nf~~~~~~~~~~L~~~G~a~l~~~~ 151 (535) T protein:vir:94 77 LALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYI-ESN----SYRVTLFETLKQLVVAGNALLYIPE 151 (535) T ss_pred hhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHH-Hhc----CcHHHHHHHHHHHHhhCcEeEeecc Confidence 31 44211111100000 0 000001 11112 123 3445556668888999999999866 Q ss_pred CC-CCceeeEEeecCceEEEEEcCCce----------------------------------EEE--EEe-cC-ceE---E Q lcl|NC_019719. 137 NS-AGDVISLLPLQSANMDVKLVGKKV----------------------------------VYR--YQR-DS-EYA---D 174 (424) Q Consensus 137 ~~-~G~~~~l~~l~~~~v~~~~~~~~~----------------------------------~~~--~~~-~~-~~~---~ 174 (424) +. .+.....||+....|.....+... +|. +.. .+ ... . T Consensus 152 ~~~~~~~f~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e 231 (535) T protein:vir:94 152 PEGTYNPMKLYRLSSYVVQRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHIYLDEESGEYLKYEE 231 (535) T ss_pred CcCcccceEEEEcCeEEEeeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEEEEeeCCCCcEEEEEE Confidence 53 233445666644333322222110 010 001 00 000 0 Q ss_pred ecH--------------hHeeEeccCCCCc-cccCchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHH Q lcl|NC_019719. 
175 FSQ--------------KEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ 239 (424) Q Consensus 175 ~~~--------------~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n~~~p~~vl~~~~~~~~~~~ 239 (424) +.. -=.+..|+...++ .||.||...+...+.....+.+.......-...|..++. +.+...... T Consensus 232 ~~g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~-p~g~~~~~~ 310 (535) T protein:vir:94 232 IDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVN-PAGITQVRR 310 (535) T ss_pred ecCeeeccccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccc-cccccchhh Confidence 000 0122333333334 899999999999999999888887777666666665443 333323221 Q ss_pred HHHHHHHHHHHhCCcccCc-ceecCCCceeeecccChhHHHH-HHHHHHHHHHHHHHhCCCHHHhcCCCCCCccchhHHH Q lcl|NC_019719. 240 RSQVEENFKEIAGGPVKKR-LWILEAGFSTSAIGVTPQDAEM-MASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ 317 (424) Q Consensus 240 ~~~~~~~~~~~~~~~~~g~-~~~l~~g~~~~~l~~~~~d~~~-~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~ 317 (424) ...+.+ |. +...++++...++... .+.+. .+..+.....|..+|=+. .+... ++..-++.=-. T Consensus 311 ----------~~~~~~-g~~v~g~~~~v~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~--~~~~~-d~~rvTAtEV~ 375 (535) T protein:vir:94 311 ----------LTKAQT-GDFVSGRPEDISFLQLEKA-ADFSVARAVSEQIEGRLSYAFMLN--SAVQR-TGERVTAEEIR 375 (535) T ss_pred ----------cccCCC-ceeecCCcccceeeecccc-cchhHHHHHHHHHHHHHHHHHhHh--hhccC-CCCCccHHHHH Confidence 111111 11 1112344445555543 23322 345556677788888322 12111 22111111123 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhh-------------ccCccccccceeeecchhhhccCHHHHHHHHHHHHhCCCCCHH Q lcl|NC_019719. 318 QNLGFLQYTLQPYISRWENSIQRW-------------LIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTIN 384 (424) Q Consensus 318 ~~~~~~~~tl~P~~~~ie~~l~~~-------------l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~N 384 (424) .+..-....|.|....++++|-.- +|++.........+ +..|- ...|...+.++.+. 
++ T Consensus 376 ~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP~~p~~~v~~~~-vs~la---~l~r~~~~~~l~~~----~~ 447 (535) T protein:vir:94 376 YVASELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQIPELPKEAVEPTI-STGME---ALGRGQDLDKLERC----IA 447 (535) T ss_pred HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceE-eehHH---HHHHHHHHHHHHHH----HH Confidence 334555566667666666665422 22221111111111 11110 11122222222210 11 Q ss_pred HHHHHhCCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019719. 385 EMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 385 E~R~~~G~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) -+ -.+.| +-.|.++ |+-.+-+. -....|. T Consensus 448 ~l---aq~~P-~~ld~~i---d~d~~~~~----~a~~~Gv 476 (535) T protein:vir:94 448 AW---SALAP-MQGDPDI---NIATIKLR----IANAIGI 476 (535) T ss_pred HH---HhhCh-HHhhhcC---CHHHHHHH----HHHHhCC Confidence 11 11112 1112111 11111000 0011121 Done!