Query lcl|NC_019705.1_cdsid_YP_007111439.1 [gene=F855_gp03] [protein=portal protein] [protein_id=YP_007111439.1] [location=2007..3281] Match_columns 424 No_of_seqs 140 out of 1035 Neff 9.4 Searched_HMMs 1612 Date Thu Nov 7 16:49:03 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_3 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_3_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:1884 Length: 424 # 100.0 2E-125 1E-128 704.0 46.5 424 1-424 1-424 (424) 2 protein:vir:189 Length: 424 # 100.0 3E-124 2E-127 697.8 46.6 424 1-424 1-424 (424) 3 protein:vir:4337 Length: 434 # 100.0 6E-102 4E-105 575.4 43.4 419 1-424 1-421 (434) 4 protein:vir:81072 Length: 432 100.0 2E-101 1E-104 572.5 43.8 410 8-424 1-421 (432) 5 protein:vir:10362 Length: 432 100.0 1E-100 7E-104 568.6 44.2 410 8-424 1-421 (432) 6 protein:vir:4509 Length: 424 # 100.0 3E-100 2E-103 566.4 45.2 415 1-424 1-421 (424) 7 protein:vir:102855 Length: 432 100.0 3E-100 2E-103 566.6 45.0 406 14-424 1-423 (432) 8 protein:vir:107605 Length: 432 100.0 3E-100 2E-103 566.6 45.0 406 14-424 1-423 (432) 9 protein:vir:105002 Length: 432 100.0 3E-100 2E-103 566.6 45.0 406 14-424 1-423 (432) 10 protein:vir:5737 Length: 419 # 100.0 2E-100 1E-103 567.0 44.2 404 14-424 1-408 (419) 11 protein:vir:97060 Length: 432 100.0 2E-100 1E-103 566.9 44.3 410 8-424 1-421 (432) 12 protein:vir:81152 Length: 411 100.0 5E-100 3E-103 564.9 43.9 398 14-422 1-411 (411) 13 protein:vir:105064 Length: 421 100.0 4E-100 2E-103 565.7 43.0 402 14-424 1-412 (421) 14 protein:vir:100150 Length: 437 100.0 8E-100 5E-103 563.8 43.7 414 1-424 1-424 (437) 15 protein:vir:4454 Length: 414 # 100.0 8E-100 5E-103 563.9 43.4 402 14-424 1-409 (414) 16 protein:vir:102080 Length: 429 100.0 1.5E-99 9E-103 562.4 44.4 405 14-424 1-420 (429) 17 protein:vir:1431 Length: 419 # 100.0 1.1E-99 7E-103 563.0 42.6 401 15-424 1-407 (419) 18 protein:vir:1380 Length: 422 # 100.0 6.6E-99 4E-102 558.8 43.8 402 14-424 1-421 (422) 19 protein:vir:483 Length: 413 # 100.0 6.6E-99 4E-102 558.9 43.0 401 16-424 1-406 (413) 20 protein:vir:100249 Length: 431 100.0 9.6E-99 6E-102 557.9 43.2 400 14-420 1-431 (431) 21 protein:vir:1266 Length: 416 # 100.0 2.8E-98 2E-101 555.4 42.4 399 15-424 1-406 (416) 22 protein:vir:80333 Length: 419 100.0 3E-98 2E-101 555.3 42.3 401 15-424 1-407 (419) 23 protein:vir:6240 Length: 457 # 100.0 6.5E-98 4E-101 553.4 43.4 406 14-424 1-434 (457) 24 protein:vir:93610 Length: 454 100.0 6E-98 4E-101 553.6 42.1 401 16-424 1-415 (454) 25 protein:vir:102118 Length: 409 100.0 1.1E-97 7E-101 552.2 42.7 398 16-424 1-408 (409) 26 protein:vir:1326 Length: 457 # 100.0 1.2E-97 8E-101 551.9 42.5 406 14-424 1-425 (457) 27 protein:vir:8418 Length: 409 # 100.0 1.6E-95 1E-98 540.3 43.6 397 14-424 1-404 (409) 28 protein:vir:98396 Length: 441 100.0 4.5E-95 2.8E-98 537.8 42.9 411 1-424 4-437 (441) 29 protein:vir:9408 Length: 441 # 100.0 6.6E-95 4.1E-98 536.9 43.2 411 1-424 4-437 (441) 30 protein:vir:79984 Length: 441 100.0 6.6E-95 4.1E-98 536.9 43.2 411 1-424 4-437 (441) 31 protein:vir:2683 Length: 412 # 100.0 1.7E-94 1.1E-97 534.6 42.0 401 8-424 1-406 (412) 32 protein:vir:101648 Length: 518 100.0 2.2E-94 1.4E-97 534.1 42.3 396 8-424 1-423 (518) 33 protein:vir:7853 Length: 518 # 100.0 4E-94 2.5E-97 532.6 42.2 396 8-424 1-423 (518) 34 protein:vir:81095 Length: 416 100.0 3.7E-94 2.3E-97 532.8 41.5 394 14-424 1-412 (416) 35 protein:vir:4598 Length: 416 # 100.0 3.7E-94 2.3E-97 532.8 41.5 394 14-424 1-412 (416) 36 protein:vir:93943 Length: 409 100.0 9.4E-94 5.8E-97 530.6 42.4 398 11-424 1-403 (409) 37 protein:vir:96980 Length: 409 100.0 1.5E-93 9.5E-97 529.4 42.6 398 11-424 1-403 (409) 38 protein:vir:81218 Length: 423 100.0 2.6E-93 1.6E-96 528.2 42.9 403 14-422 1-423 (423) 39 protein:vir:94426 Length: 409 100.0 3.5E-93 2.2E-96 527.5 42.1 398 11-424 1-403 (409) 40 protein:vir:3868 Length: 417 # 100.0 2.7E-93 1.7E-96 528.1 40.8 390 14-424 1-404 (417) 41 protein:vir:101647 Length: 460 100.0 1E-91 6.2E-95 519.5 42.6 404 16-424 1-459 (460) 42 protein:vir:9702 Length: 406 # 100.0 1.4E-91 8.8E-95 518.7 41.0 388 21-424 1-401 (406) 43 protein:vir:8317 Length: 409 # 100.0 1.3E-90 8.2E-94 513.3 39.3 373 14-406 1-409 (409) 44 protein:vir:94666 Length: 723 100.0 1.4E-89 8.6E-93 507.7 40.3 380 31-424 1-408 (723) 45 protein:vir:80134 Length: 403 100.0 7E-88 4.4E-91 498.4 41.1 387 14-424 1-398 (403) 46 protein:vir:95378 Length: 406 100.0 2.2E-87 1.4E-90 495.7 42.8 391 14-424 1-401 (406) 47 protein:vir:960 Length: 413 # 100.0 1.4E-87 8.5E-91 496.8 40.6 402 1-424 4-413 (413) 48 protein:vir:8100 Length: 466 # 100.0 3.3E-87 2E-90 494.7 42.0 401 14-424 1-466 (466) 49 protein:vir:102727 Length: 945 100.0 1.6E-86 1E-89 490.9 42.2 419 1-424 55-524 (945) 50 protein:vir:6210 Length: 394 # 100.0 1.6E-85 1E-88 485.4 38.5 384 14-424 1-390 (394) 51 protein:vir:104259 Length: 403 100.0 4.6E-85 2.9E-88 482.9 39.7 386 14-424 1-399 (403) 52 protein:vir:100187 Length: 385 100.0 1.1E-84 7E-88 480.8 38.6 376 14-422 1-385 (385) 53 protein:vir:3843 Length: 397 # 100.0 3E-84 1.9E-87 478.5 40.4 380 14-424 1-387 (397) 54 protein:vir:9359 Length: 348 # 100.0 4.2E-84 2.6E-87 477.7 38.1 337 74-424 1-342 (348) 55 protein:vir:100882 Length: 383 100.0 1.4E-83 8.9E-87 474.8 38.8 376 14-417 1-383 (383) 56 protein:vir:1082 Length: 359 # 100.0 2.4E-83 1.5E-86 473.6 39.0 352 14-396 1-359 (359) 57 protein:vir:80796 Length: 574 100.0 7.4E-82 4.6E-85 465.4 38.8 420 1-424 20-493 (574) 58 protein:vir:7407 Length: 392 # 100.0 3.7E-81 2.3E-84 461.5 39.4 363 16-409 1-392 (392) 59 protein:vir:4854 Length: 386 # 100.0 5.5E-81 3.4E-84 460.6 39.2 373 14-424 1-385 (386) 60 protein:vir:4995 Length: 384 # 100.0 1.8E-81 1.1E-84 463.3 36.3 367 14-402 1-384 (384) 61 protein:vir:100650 Length: 395 100.0 5E-81 3.1E-84 460.8 37.8 372 14-424 1-384 (395) 62 protein:vir:101289 Length: 395 100.0 5E-81 3.1E-84 460.8 37.8 372 14-424 1-384 (395) 63 protein:vir:9507 Length: 395 # 100.0 5E-81 3.1E-84 460.8 37.8 372 14-424 1-384 (395) 64 protein:vir:80644 Length: 551 100.0 2.2E-80 1.3E-83 457.3 40.5 410 10-424 1-516 (551) 65 protein:vir:95965 Length: 385 100.0 7.3E-81 4.5E-84 459.9 36.9 373 14-421 1-385 (385) 66 protein:vir:100691 Length: 535 100.0 7.4E-80 4.6E-83 454.4 38.4 416 1-424 13-490 (535) 67 protein:vir:94002 Length: 378 100.0 2.3E-80 1.4E-83 457.2 35.2 355 14-424 1-370 (378) 68 protein:vir:63755 Length: 547 100.0 1.5E-79 9.1E-83 452.8 39.5 406 14-424 1-512 (547) 69 protein:vir:93867 Length: 378 100.0 3.3E-80 2E-83 456.3 34.5 355 14-424 1-370 (378) 70 protein:vir:3989 Length: 392 # 100.0 3.5E-79 2.2E-82 450.7 39.5 367 12-409 1-392 (392) 71 protein:vir:1023 Length: 392 # 100.0 3.5E-79 2.2E-82 450.7 39.5 367 12-409 1-392 (392) 72 protein:vir:1661 Length: 378 # 100.0 8.5E-80 5.3E-83 454.1 36.1 355 14-424 1-370 (378) 73 protein:vir:78310 Length: 376 100.0 5.4E-79 3.3E-82 449.7 37.0 366 14-423 1-376 (376) 74 protein:vir:4089 Length: 395 # 100.0 1.4E-78 8.7E-82 447.4 38.0 379 14-424 1-391 (395) 75 protein:vir:96579 Length: 576 100.0 2.1E-77 1.3E-80 441.0 40.3 409 1-424 38-495 (576) 76 protein:vir:4952 Length: 386 # 100.0 3.8E-77 2.4E-80 439.5 39.1 378 14-421 1-386 (386) 77 protein:vir:99312 Length: 563 100.0 8.2E-77 5.1E-80 437.7 40.9 415 1-424 14-523 (563) 78 protein:vir:95599 Length: 563 100.0 8.2E-77 5.1E-80 437.7 40.9 415 1-424 14-523 (563) 79 protein:vir:4828 Length: 382 # 100.0 2.8E-77 1.7E-80 440.3 37.3 369 14-421 1-382 (382) 80 protein:vir:98643 Length: 395 100.0 2.5E-77 1.6E-80 440.5 36.8 380 14-423 1-395 (395) 81 protein:vir:94869 Length: 378 100.0 2.7E-77 1.6E-80 440.4 36.4 355 14-424 1-370 (378) 82 protein:vir:4156 Length: 542 # 100.0 9.8E-77 6.1E-80 437.3 38.0 393 3-424 1-443 (542) 83 protein:vir:9641 Length: 395 # 100.0 3.2E-77 2E-80 440.0 34.5 376 14-423 1-395 (395) 84 protein:vir:858 Length: 378 # 100.0 9E-77 5.6E-80 437.5 36.5 354 14-424 1-370 (378) 85 protein:vir:4194 Length: 540 # 100.0 1.3E-76 7.8E-80 436.7 36.9 393 1-424 1-441 (540) 86 protein:vir:3153 Length: 467 # 100.0 7.3E-76 4.5E-79 432.5 38.9 371 53-424 1-442 (467) 87 protein:vir:99452 Length: 651 100.0 3.2E-71 2E-74 407.1 32.7 413 1-424 1-535 (651) 88 protein:vir:79772 Length: 648 100.0 1.9E-69 1.2E-72 397.4 38.6 408 1-424 1-489 (648) 89 protein:vir:78641 Length: 278 100.0 8.5E-63 5.3E-66 360.9 32.5 273 74-360 1-278 (278) 90 protein:vir:79150 Length: 368 100.0 1.9E-60 1.2E-63 347.9 28.1 347 8-376 1-368 (368) 91 protein:vir:103971 Length: 376 100.0 4.1E-58 2.5E-61 335.2 33.6 329 1-367 26-376 (376) 92 protein:vir:100328 Length: 346 100.0 2.3E-58 1.5E-61 336.5 31.4 323 21-365 1-346 (346) 93 protein:vir:267 Length: 348 # 100.0 2.5E-58 1.6E-61 336.4 30.2 332 1-371 1-348 (348) 94 protein:vir:79207 Length: 351 100.0 1.8E-57 1.1E-60 331.7 32.4 329 1-367 1-351 (351) 95 protein:vir:98567 Length: 340 100.0 3E-57 1.9E-60 330.4 32.3 330 1-364 1-340 (340) 96 protein:vir:78191 Length: 351 100.0 3.6E-57 2.2E-60 330.0 32.6 329 1-367 1-351 (351) 97 protein:vir:78749 Length: 337 100.0 4.7E-57 2.9E-60 329.4 29.3 323 1-361 1-337 (337) 98 protein:vir:1150 Length: 350 # 100.0 1.9E-56 1.2E-59 326.0 32.2 328 1-360 1-350 (350) 99 protein:vir:6058 Length: 344 # 100.0 5.2E-56 3.3E-59 323.7 32.3 331 8-365 1-344 (344) 100 protein:vir:5691 Length: 344 # 100.0 5.7E-56 3.5E-59 323.5 31.2 333 1-365 1-344 (344) 101 protein:vir:2013 Length: 344 # 100.0 1.1E-55 6.8E-59 321.9 30.5 328 1-365 1-344 (344) 102 protein:vir:3743 Length: 345 # 100.0 8.5E-55 5.3E-58 317.0 31.1 331 1-362 1-345 (345) 103 protein:vir:3780 Length: 345 # 100.0 8.3E-55 5.1E-58 317.1 30.2 326 8-362 1-345 (345) 104 protein:vir:4698 Length: 251 # 100.0 1.5E-52 9.1E-56 304.8 26.3 242 14-268 1-251 (251) 105 protein:vir:98853 Length: 219 100.0 5.2E-47 3.2E-50 274.3 21.9 208 153-364 1-219 (219) 106 protein:vir:5249 Length: 437 # 99.9 1.4E-27 8.6E-31 167.8 30.2 388 14-424 1-435 (437) 107 protein:vir:107742 Length: 537 99.9 1E-22 6.5E-26 141.1 30.3 398 1-424 48-533 (537) 108 protein:vir:94049 Length: 532 99.9 4.2E-22 2.6E-25 137.8 27.1 404 1-424 1-508 (532) 109 protein:vir:99853 Length: 488 99.9 2.2E-21 1.4E-24 133.8 30.0 395 1-424 1-409 (488) 110 protein:vir:99563 Length: 862 99.8 2.3E-20 1.4E-23 128.3 28.8 401 1-424 50-553 (862) 111 protein:vir:103860 Length: 528 99.8 2.1E-19 1.3E-22 123.0 32.7 393 17-424 1-437 (528) 112 protein:vir:96068 Length: 765 99.8 2.4E-20 1.5E-23 128.1 27.4 397 1-424 30-525 (765) 113 protein:vir:99232 Length: 526 99.8 5.1E-19 3.2E-22 120.9 32.6 406 1-424 12-448 (526) 114 protein:vir:108215 Length: 469 99.8 7E-19 4.3E-22 120.1 33.2 398 1-424 1-455 (469) 115 protein:vir:107662 Length: 427 99.8 2.9E-19 1.8E-22 122.2 27.0 374 8-423 1-427 (427) 116 protein:vir:79233 Length: 526 99.8 2.5E-18 1.6E-21 117.1 32.0 406 1-424 12-441 (526) 117 protein:vir:79647 Length: 435 99.8 2.8E-19 1.8E-22 122.3 26.1 378 1-424 5-433 (435) 118 protein:vir:80040 Length: 461 99.8 4.5E-19 2.8E-22 121.2 27.1 398 1-423 1-461 (461) 119 protein:vir:104338 Length: 422 99.8 1.5E-18 9.4E-22 118.3 27.0 373 10-422 1-422 (422) 120 protein:vir:79063 Length: 491 99.8 5.5E-17 3.4E-20 109.7 34.3 388 1-424 15-423 (491) 121 protein:vir:1986 Length: 512 # 99.7 3.6E-17 2.2E-20 110.7 31.5 388 17-424 1-440 (512) 122 protein:vir:107880 Length: 491 99.7 5.7E-17 3.5E-20 109.7 32.4 388 1-424 15-423 (491) 123 protein:vir:389 Length: 530 # 99.7 8.4E-17 5.2E-20 108.7 28.9 413 8-424 1-530 (530) 124 protein:vir:79538 Length: 502 99.7 2.6E-17 1.6E-20 111.5 24.9 405 14-424 1-498 (502) 125 protein:vir:96738 Length: 505 99.7 2.2E-16 1.4E-19 106.4 26.8 416 1-424 1-502 (505) 126 protein:vir:79511 Length: 448 99.7 1.2E-15 7.7E-19 102.3 29.4 400 1-424 5-438 (448) 127 protein:vir:77981 Length: 448 99.6 1.5E-15 9.2E-19 101.9 29.2 398 1-424 5-437 (448) 128 protein:vir:95542 Length: 548 99.6 9E-16 5.6E-19 103.1 26.2 404 14-424 1-539 (548) 129 protein:vir:3420 Length: 533 # 99.6 7.7E-15 4.8E-18 98.0 29.0 419 1-424 1-531 (533) 130 protein:vir:10321 Length: 495 99.6 3.8E-15 2.4E-18 99.6 24.7 406 8-424 1-491 (495) 131 protein:vir:95254 Length: 488 99.6 3.9E-14 2.4E-17 94.1 30.1 409 1-424 1-471 (488) 132 protein:vir:98816 Length: 446 99.6 4.1E-14 2.6E-17 94.0 28.3 373 6-400 1-446 (446) 133 protein:vir:105782 Length: 449 99.5 1.9E-14 1.2E-17 95.9 22.7 386 1-424 1-449 (449) 134 protein:vir:6382 Length: 553 # 99.5 1.5E-13 9.2E-17 90.9 27.6 418 1-424 1-549 (553) 135 protein:vir:3648 Length: 695 # 99.5 7.4E-14 4.6E-17 92.6 24.3 404 1-424 60-546 (695) 136 protein:vir:78589 Length: 695 99.5 9.5E-14 5.9E-17 92.0 24.7 404 1-424 60-548 (695) 137 protein:vir:101541 Length: 694 99.5 8.2E-14 5.1E-17 92.3 23.0 404 1-424 59-545 (694) 138 protein:vir:106716 Length: 698 99.4 6.8E-14 4.2E-17 92.8 22.0 407 1-424 60-549 (698) 139 protein:vir:78161 Length: 355 99.4 6.8E-13 4.2E-16 87.3 25.3 287 131-424 1-321 (355) 140 protein:vir:106491 Length: 646 99.3 5.7E-13 3.5E-16 87.7 20.3 396 21-424 1-487 (646) 141 protein:vir:102426 Length: 631 99.2 2.8E-12 1.7E-15 83.9 18.0 406 8-424 1-516 (631) 142 protein:vir:8654 Length: 629 # 99.1 1.9E-11 1.2E-14 79.3 19.5 406 1-424 9-508 (629) 143 protein:vir:99088 Length: 629 99.1 2E-11 1.2E-14 79.3 19.2 406 1-424 9-508 (629) 144 protein:vir:107517 Length: 639 99.0 3.9E-11 2.4E-14 77.7 16.1 404 8-424 1-514 (639) 145 protein:vir:97900 Length: 639 99.0 3.9E-11 2.4E-14 77.7 16.1 404 8-424 1-514 (639) 146 protein:vir:105819 Length: 456 98.9 2.6E-09 1.6E-12 67.6 21.8 391 8-424 1-455 (456) 147 protein:vir:102602 Length: 456 98.9 2.6E-09 1.6E-12 67.6 21.8 391 8-424 1-455 (456) 148 protein:vir:106027 Length: 629 98.9 2.2E-09 1.4E-12 68.0 20.7 401 8-424 1-515 (629) 149 protein:vir:98444 Length: 434 98.9 2.9E-09 1.8E-12 67.4 21.0 358 46-424 1-431 (434) 150 protein:vir:94742 Length: 409 98.8 2.1E-08 1.3E-11 62.6 24.3 347 8-396 1-409 (409) 151 protein:vir:7987 Length: 456 # 98.8 5.4E-09 3.3E-12 65.9 20.9 392 8-424 1-455 (456) 152 protein:vir:104082 Length: 485 98.7 4.3E-08 2.7E-11 60.9 22.9 394 1-424 1-485 (485) 153 protein:vir:5839 Length: 533 # 98.7 8.5E-08 5.3E-11 59.4 22.6 407 1-424 1-506 (533) 154 protein:vir:1634 Length: 409 # 98.6 1.5E-07 9.2E-11 58.0 23.6 347 8-396 1-409 (409) 155 protein:vir:2427 Length: 485 # 98.6 2.2E-07 1.4E-10 57.1 22.4 383 1-424 8-482 (485) 156 protein:vir:7768 Length: 484 # 98.5 3.2E-07 2E-10 56.2 24.7 397 1-424 1-479 (484) 157 protein:vir:38 Length: 496 # N 98.5 2.7E-07 1.7E-10 56.6 22.0 388 12-421 1-496 (496) 158 protein:vir:9751 Length: 422 # 98.5 3.1E-07 1.9E-10 56.3 22.2 361 8-423 1-422 (422) 159 protein:vir:4073 Length: 279 # 98.5 1.2E-09 7.7E-13 69.4 8.8 260 58-399 1-279 (279) 160 protein:vir:98883 Length: 517 98.5 2.2E-07 1.4E-10 57.1 20.5 395 14-424 1-512 (517) 161 protein:vir:9568 Length: 410 # 98.4 1.1E-06 6.6E-10 53.3 25.2 350 8-418 1-410 (410) 162 protein:vir:1587 Length: 508 # 98.3 1.5E-06 9.4E-10 52.5 22.3 387 14-424 1-506 (508) 163 protein:vir:2500 Length: 501 # 98.3 1.7E-06 1E-09 52.3 23.2 384 1-424 16-501 (501) 164 protein:vir:8184 Length: 474 # 98.3 1.7E-06 1E-09 52.2 25.5 402 1-422 2-474 (474) 165 protein:vir:99916 Length: 504 98.3 1.8E-06 1.1E-09 52.1 26.7 403 1-424 1-496 (504) 166 protein:vir:3028 Length: 500 # 98.2 2.1E-06 1.3E-09 51.8 22.4 387 14-424 1-496 (500) 167 protein:vir:9815 Length: 500 # 98.2 2.1E-06 1.3E-09 51.8 22.4 387 14-424 1-496 (500) 168 protein:vir:5961 Length: 503 # 98.2 2.1E-06 1.3E-09 51.7 29.9 394 1-424 1-499 (503) 169 protein:vir:2341 Length: 488 # 98.2 2.3E-06 1.4E-09 51.5 22.9 398 1-424 1-488 (488) 170 protein:vir:103219 Length: 201 98.2 4E-08 2.5E-11 61.1 11.1 182 227-422 1-201 (201) 171 protein:vir:4223 Length: 486 # 98.2 3.2E-06 2E-09 50.7 26.6 389 1-424 1-484 (486) 172 protein:vir:78227 Length: 480 98.1 3.8E-06 2.4E-09 50.3 24.4 389 10-424 1-472 (480) 173 protein:vir:80959 Length: 499 98.1 4.9E-06 3.1E-09 49.7 25.2 388 12-421 1-499 (499) 174 protein:vir:78537 Length: 480 98.0 6.6E-06 4.1E-09 49.0 23.9 382 10-424 1-472 (480) 175 protein:vir:4782 Length: 522 # 98.0 8.2E-06 5.1E-09 48.5 24.2 384 14-423 1-522 (522) 176 protein:vir:4898 Length: 502 # 98.0 9E-06 5.6E-09 48.3 25.5 398 1-424 42-498 (502) 177 protein:vir:99072 Length: 479 97.9 1.4E-05 8.8E-09 47.2 21.5 380 1-424 2-470 (479) 178 protein:vir:95113 Length: 474 97.9 1.5E-05 9.2E-09 47.1 28.4 383 1-424 20-470 (474) 179 protein:vir:95806 Length: 440 97.8 1.6E-05 9.9E-09 46.9 23.4 389 6-424 1-435 (440) 180 protein:vir:96839 Length: 474 97.8 2E-05 1.2E-08 46.4 26.7 379 1-422 13-474 (474) 181 protein:vir:79703 Length: 505 97.7 2.7E-05 1.7E-08 45.7 25.5 388 14-424 1-504 (505) 182 protein:vir:96494 Length: 501 97.7 2.7E-05 1.7E-08 45.6 24.5 406 1-424 1-493 (501) 183 protein:vir:2732 Length: 501 # 97.6 3.5E-05 2.2E-08 45.0 25.9 396 1-424 41-493 (501) 184 protein:vir:93747 Length: 472 97.6 3.6E-05 2.2E-08 44.9 27.6 387 1-424 11-468 (472) 185 protein:vir:80680 Length: 441 97.6 3.9E-05 2.4E-08 44.7 26.3 370 1-424 1-433 (441) 186 protein:vir:78907 Length: 518 97.5 5.1E-05 3.2E-08 44.1 23.9 389 14-424 1-512 (518) 187 protein:vir:105889 Length: 474 97.4 6.5E-05 4.1E-08 43.5 25.0 398 1-424 12-473 (474) 188 protein:vir:94101 Length: 474 97.4 6.5E-05 4.1E-08 43.5 25.0 398 1-424 12-473 (474) 189 protein:vir:9306 Length: 511 # 97.4 6.5E-05 4.1E-08 43.5 25.0 396 1-424 26-507 (511) 190 protein:vir:96240 Length: 511 97.4 6.7E-05 4.1E-08 43.5 25.3 393 1-424 24-498 (511) 191 protein:vir:1236 Length: 483 # 97.4 8.6E-05 5.3E-08 42.9 26.5 387 1-424 20-480 (483) 192 protein:vir:94805 Length: 492 97.3 8.9E-05 5.5E-08 42.8 26.3 384 1-424 29-486 (492) 193 protein:vir:79043 Length: 479 97.3 9.2E-05 5.7E-08 42.7 29.8 391 1-423 7-479 (479) 194 protein:vir:103951 Length: 511 97.3 0.00011 6.6E-08 42.4 25.3 393 1-424 20-498 (511) 195 protein:vir:97376 Length: 320 97.3 6.1E-06 3.8E-09 49.2 9.5 307 14-403 1-320 (320) 196 protein:vir:3964 Length: 453 # 97.2 0.00012 7.2E-08 42.2 25.6 394 1-424 2-452 (453) 197 protein:vir:106639 Length: 481 97.2 0.00014 8.9E-08 41.6 25.0 395 1-424 21-480 (481) 198 protein:vir:97336 Length: 492 97.1 0.00015 9.5E-08 41.5 25.8 385 1-424 35-486 (492) 199 protein:vir:94498 Length: 474 97.1 0.00016 9.8E-08 41.4 28.0 383 1-424 20-468 (474) 200 protein:vir:97447 Length: 474 97.1 0.00016 9.8E-08 41.4 28.0 383 1-424 20-468 (474) 201 protein:vir:97171 Length: 512 97.1 0.00016 9.8E-08 41.4 24.5 396 1-424 24-508 (512) 202 protein:vir:96366 Length: 511 97.1 0.00017 1E-07 41.3 24.7 396 1-424 39-507 (511) 203 protein:vir:78805 Length: 511 97.1 0.00017 1E-07 41.3 24.7 396 1-424 39-507 (511) 204 protein:vir:96266 Length: 474 97.1 0.00019 1.2E-07 41.0 25.8 386 1-424 9-468 (474) 205 protein:vir:95899 Length: 474 97.1 0.00019 1.2E-07 41.0 25.8 386 1-424 9-468 (474) 206 protein:vir:99522 Length: 470 97.0 0.00021 1.3E-07 40.8 25.0 382 1-422 22-470 (470) 207 protein:vir:733 Length: 453 # 96.9 0.00026 1.6E-07 40.2 25.3 382 1-424 3-452 (453) 208 protein:vir:99781 Length: 511 96.6 0.00048 3E-07 38.8 26.2 395 1-424 20-506 (511) 209 protein:vir:107112 Length: 478 96.5 0.00053 3.3E-07 38.6 29.5 384 1-424 1-477 (478) 210 protein:vir:105292 Length: 478 96.4 0.00066 4.1E-07 38.0 27.1 384 1-424 1-477 (478) 211 protein:vir:106571 Length: 499 96.2 0.00092 5.7E-07 37.2 27.7 381 1-424 5-488 (499) 212 protein:vir:9871 Length: 429 # 96.0 0.0012 7.3E-07 36.7 24.8 371 1-424 1-429 (429) 213 protein:vir:96179 Length: 468 95.4 0.0021 1.3E-06 35.3 27.6 379 1-420 17-468 (468) 214 protein:vir:105461 Length: 470 95.3 0.0024 1.5E-06 35.0 24.6 371 8-424 1-469 (470) 215 protein:vir:94546 Length: 506 95.2 0.0025 1.5E-06 34.9 24.3 396 1-424 6-498 (506) 216 protein:vir:3609 Length: 452 # 95.2 0.0026 1.6E-06 34.7 25.1 383 1-424 3-451 (452) 217 protein:vir:78083 Length: 537 94.2 0.005 3.1E-06 33.2 31.7 396 1-424 1-521 (537) 218 protein:vir:103177 Length: 533 94.1 0.0053 3.3E-06 33.1 21.1 402 17-424 1-513 (533) 219 protein:vir:102950 Length: 471 92.8 0.0098 6.1E-06 31.6 24.6 369 8-424 1-464 (471) 220 protein:vir:94709 Length: 522 89.0 0.029 1.8E-05 29.0 21.7 376 1-424 1-475 (522) 221 protein:vir:104500 Length: 537 87.0 0.042 2.6E-05 28.2 22.6 403 16-424 1-536 (537) 222 protein:vir:106282 Length: 521 81.5 0.085 5.3E-05 26.4 24.4 399 8-423 1-521 (521) 223 protein:vir:94956 Length: 452 78.5 0.11 7E-05 25.8 26.6 370 8-424 1-450 (452) 224 protein:vir:108049 Length: 524 76.9 0.13 8.1E-05 25.4 21.3 397 10-423 1-524 (524) 225 protein:vir:104892 Length: 558 75.2 0.15 9.3E-05 25.1 23.7 399 17-424 1-541 (558) 226 protein:vir:7208 Length: 524 # 73.4 0.17 0.00011 24.8 20.3 399 8-423 1-524 (524) 227 protein:vir:101189 Length: 516 72.1 0.19 0.00012 24.6 22.6 406 8-424 1-515 (516) 228 protein:vir:101806 Length: 516 72.1 0.19 0.00012 24.6 22.6 406 8-424 1-515 (516) 229 protein:vir:103458 Length: 524 71.4 0.2 0.00012 24.5 20.3 399 8-423 1-524 (524) 230 protein:vir:9922 Length: 489 # 69.1 0.23 0.00014 24.1 27.2 394 1-424 2-489 (489) 231 protein:vir:106999 Length: 564 68.9 0.23 0.00014 24.1 20.6 400 17-424 1-549 (564) 232 protein:vir:5665 Length: 511 # 64.5 0.3 0.00019 23.5 22.4 398 14-423 1-511 (511) 233 protein:vir:80453 Length: 535 57.9 0.42 0.00026 22.6 23.4 388 9-424 1-527 (535) 234 protein:vir:6896 Length: 523 # 57.0 0.44 0.00028 22.5 20.7 397 8-423 1-523 (523) 235 protein:vir:102330 Length: 451 53.9 0.52 0.00032 22.2 24.2 378 1-422 1-451 (451) 236 protein:vir:100598 Length: 516 49.4 0.64 0.0004 21.6 24.0 404 8-424 1-515 (516) 237 protein:vir:6596 Length: 521 # 43.6 0.84 0.00052 21.0 26.3 402 10-424 1-520 (521) 238 protein:vir:81017 Length: 521 41.6 0.92 0.00057 20.8 25.2 402 10-424 1-520 (521) 239 protein:vir:102668 Length: 547 39.3 1 0.00064 20.5 18.2 352 1-424 1-486 (547) 240 protein:vir:98265 Length: 524 38.7 1.1 0.00066 20.5 26.0 402 1-423 1-524 (524) 241 protein:vir:2198 Length: 536 # 32.0 1.5 0.0009 19.7 18.5 392 1-424 1-531 (536) 242 protein:vir:95149 Length: 501 31.0 1.5 0.00095 19.6 23.9 388 1-424 1-495 (501) 243 protein:vir:100039 Length: 522 30.9 1.5 0.00095 19.6 20.3 367 8-424 1-467 (522) 244 protein:vir:8883 Length: 543 # 26.3 2 0.0012 19.0 17.8 378 1-424 1-482 (543) No 1 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=100.00 E-value=2.2e-125 Score=703.99 Aligned_cols=424 Identities=100% Similarity=1.486 Sum_probs=411.4 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceE Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) ||||||||+||+|+|||+++++||.++....|...+..++++..++.++..|+.+.|+++++||+||++||++||++||+ T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~ 80 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) T ss_pred CCCCcceEeecCCCchHHHHHhhhcccccccccccccccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCceE Confidence 99999999999999999999999998888888888888888888888999999999999999999999999999999999 Q ss_pred EEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc Q lcl|NC_019705. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +|+.++++..+++..+|++.++|+.+||++||+++||+.++.+++++||||++++|+..|.+++|||++|++|++..+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~V~v~~~~~ 160 (424) T protein:vir:18 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) T ss_pred EEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcCC Confidence 99999988887777889999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEEeCCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHH Q lcl|NC_019705. 161 KVVYRYQRDSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 161 ~~~~~~~~~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~ 240 (424) ...|.|..++..+.|+++||||+|++++|+++|+||+..++.+++++.++++++.++|+||++|+|||+++.+.++++++ T Consensus 161 ~~~y~~~~~g~~~~~~~~eIih~r~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~ 240 (424) T protein:vir:18 161 KVVYRYQRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) T ss_pred eEEEEEEeCCeEEEeccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999888899999 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHH Q lcl|NC_019705. 241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 241 ~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~ 320 (424) +++++.|++.+++.|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++++++|+|++.+ T Consensus 241 ~~~~~~~~~~~~g~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~ 320 (424) T protein:vir:18 241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) T ss_pred HHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999888889999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCe Q lcl|NC_019705. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDV 400 (424) Q Consensus 321 ~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~ 400 (424) .|+++||.|+++.||++|+++|+++.++..++++||+++++++|.+++++++.+++++|+||+||+|+++|+||+||||+ T Consensus 321 ~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGD~ 400 (424) T protein:vir:18 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDV 400 (424) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Confidence 99999999999999999999999999887889999999999999999999999999999999999999999999999999 Q ss_pred eeecccccchhhccccCCCcccCC Q lcl|NC_019705. 401 AMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 401 ~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) +++++|++|++.++++++++++|| T Consensus 401 ~~~~~n~~~l~~~~~~~~p~~~ga 424 (424) T protein:vir:18 401 AMRQSQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred eeeccCccchHhhhccCCCccCCC Confidence 999999999999999999999999 No 2 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=100.00 E-value=3e-124 Score=697.75 Aligned_cols=424 Identities=98% Similarity=1.474 Sum_probs=410.4 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceE Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) ||||||||+++||+|||+++++||.++....+.......++.+.++.++..|+++.|+++++||+||++||++||++||+ T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~ 80 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) T ss_pred CCCCccccccCCCCchHHHHHhhccccccccccchhhccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCceE Confidence 99999999999999999999999998888888777777888888888899999999999999999999999999999999 Q ss_pred EEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc Q lcl|NC_019705. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +|+..++|..+++..+|++.++|+.+||++||+++||+.++.+++++||||++++|+..|.+++|||++|.+|++..+++ T Consensus 81 vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~ 160 (424) T protein:vir:18 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) T ss_pred EEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcCC Confidence 99999988887777889999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEEeCCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHH Q lcl|NC_019705. 161 KVVYRYQRDSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 161 ~~~~~~~~~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~ 240 (424) ...|.|..++....|+++||||+|+++.|+++|+||+..++.++.++.++++++.++|+||++|+|+|+++...++++++ T Consensus 161 ~~~y~~~~~g~~~~~~~~eVihir~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~ 240 (424) T protein:vir:18 161 KVVYRYQRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) T ss_pred eEEEEEEeCCeEEEeccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999887899999 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHH Q lcl|NC_019705. 241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 241 ~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~ 320 (424) +++++.|++.+++.|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++++++|+|++.+ T Consensus 241 ~~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~ 320 (424) T protein:vir:18 241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) T ss_pred HHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999988899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCe Q lcl|NC_019705. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDV 400 (424) Q Consensus 321 ~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~ 400 (424) +|+++||.|++++||++|+++|+++.++..++++||+++|+++|.+++++++.+++++|+||+||+|+++|+||+||||+ T Consensus 321 ~f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~ 400 (424) T protein:vir:18 321 GFLQYTLQPYISRWENSIQRWLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGGDV 400 (424) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Confidence 99999999999999999999999999887889999999999999999999999999999999999999999999999999 Q ss_pred eeecccccchhhccccCCCcccCC Q lcl|NC_019705. 401 AMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 401 ~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) +++++|++|++.++++++++++|| T Consensus 401 ~~~~~n~~~l~~~~~~~~~~~n~a 424 (424) T protein:vir:18 401 AMRQAQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred eeeccCccchhhhhccCCccccCC Confidence 999999999999999999999999 No 3 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=100.00 E-value=6.2e-102 Score=575.43 Aligned_cols=419 Identities=31% Similarity=0.528 Sum_probs=355.5 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceE Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |-+--..+--+...+.-..+++|.++ . ........+..+.+....+++.|+.+.|+++++||+||++||++||++||+ T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~g~~~~~g~~v~~~~al~~~~V~~~i~~ia~~ia~lp~~ 78 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSSLFGWGGK-T-IRLTDGAFWSQFLGRESSSGKKVTVDKAMKLSAVWACVRLISTSVAGLPLG 78 (434) T ss_pred Cccchhhhhhhcccccchhhhccccc-c-cccCchHHHHHHhcCCccCCceechhhhhccHHHHHHHHHHHHhhhhCceE Confidence 22211111111111111111111100 0 011111222234444566788999999999999999999999999999999 Q ss_pred EEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc Q lcl|NC_019705. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +|+++.+|...+ ..+|++.+||+.+||++||+++||+.++.+++++||+|+++.++ .|++++|+||+|++|++..+.+ T Consensus 79 ~~~~~~~g~~~~-~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-~G~~~~L~~l~p~~v~~~~~~~ 156 (434) T protein:vir:43 79 VYERKADGSRVD-ARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-AGRPAALDFLLPSRVDLECDEN 156 (434) T ss_pred EEEEcCCCcccc-ccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEcCC Confidence 999988776544 56799999999999999999999999999999999999999877 6999999999999999888654 Q ss_pred -eEEEE-EEeCCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH Q lcl|NC_019705. 161 -KVVYR-YQRDSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 161 -~~~~~-~~~~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~ 238 (424) ...|. +..++..+.|+++||||+|++++|+++|+||+..++.++....++++++.++|+||++|+|+|+++.. ++++ T Consensus 157 g~~~y~~~~~~g~~~~~~~~eVih~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-l~~e 235 (434) T protein:vir:43 157 GRLKYFYTTKKGARREIERTNMLHIPAFTLDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRI-LQPA 235 (434) T ss_pred CeEEEEEEecCceEEEEccccEEEecCcCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCC-CCHH Confidence 34444 44556778999999999999999999999999999999999999999999999999999999999865 6788 Q ss_pred HHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHH Q lcl|NC_019705. 239 QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ 318 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~ 318 (424) +.++++++|+++.++.|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++.+++|+|++ T Consensus 236 ~~~~~r~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~ 315 (434) T protein:vir:43 236 QREEFREYVKSVSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQ 315 (434) T ss_pred HHHHHHHHHHHhcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHH Confidence 89999999999999999999999999999999999999999999999999999999999999999998888888899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019705. 319 NLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 319 ~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg 398 (424) .++|+++||.|++.+||++|+++|+++.++..++++||++.|+++|.+++++++.+++++|+||+||+|+++|+||+||| T Consensus 316 ~~~f~~~~L~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~~gg 395 (434) T protein:vir:43 316 MLAFLTFSISSITNQIQQCVNKRLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPELPGG 395 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 99999999999999999999999999988777889999999999999999999999999999999999999999999999 Q ss_pred CeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 399 DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |++++|+|++|++.+++++.+++--+ T Consensus 396 D~~~~~~n~~~~~~~~~~~~~~~~~~ 421 (434) T protein:vir:43 396 DILTVQSNLVPIDQLGQSNKSQAVRA 421 (434) T ss_pred CeEeeccCccchhhhhccCCCcchhh Confidence 99999999999998876554433211 No 4 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=100.00 E-value=2.1e-101 Score=572.50 Aligned_cols=410 Identities=30% Similarity=0.507 Sum_probs=360.5 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccchh-------hccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceE Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQT-------GPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |-=++.+|||+|++++|.+........+... ..++.....++..|+.+.|+++|+||+||++||++||++||+ T Consensus 1 ~~~~~~mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~ 80 (432) T protein:vir:81 1 MPDEKKLGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLT 80 (432) T ss_pred CCchhhcchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHhhhhCcee Confidence 7788899999999999977554322111111 122334455788899999999999999999999999999999 Q ss_pred EEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc Q lcl|NC_019705. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +|++.++|.. ...+|++.++|+.+||++||+++||+.++.+++++||||+++.|+ +|++++||||+|+.|++..+.+ T Consensus 81 ~y~~~~~g~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l~~~~v~v~~~~~ 157 (432) T protein:vir:81 81 MYMRTPDGRK--EAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDPK 157 (432) T ss_pred eEEecCCcce--ecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEECCC Confidence 9998877643 346799999999999999999999999999999999999999986 5999999999999999988644 Q ss_pred -eEEEEEE-eCCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH Q lcl|NC_019705. 161 -KVVYRYQ-RDSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 161 -~~~~~~~-~~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~ 238 (424) ...|.|+ .++....|+++||+|+|+++.|+++|+||+..++.++....++++++.++|+||++|+|+++++.. ++++ T Consensus 158 g~~~y~~~~~~g~~~~~~~~~iih~r~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-l~~e 236 (432) T protein:vir:81 158 GNTAYRYRRTDGQMIDIPKQQIWKIMGYSLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRF-LTDD 236 (432) T ss_pred CcEEEEEEecCceEEEEccccEEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCC-CCHH Confidence 4556654 466778899999999999999999999999999999999999999999999999999999999865 6778 Q ss_pred HHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcc-cchhHHH Q lcl|NC_019705. 239 QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS-WGSGIEQ 317 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~-~~~n~e~ 317 (424) +++++++.++ ++.|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..+.+++ +++|+|+ T Consensus 237 ~~~~~~~~~~---~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq 313 (432) T protein:vir:81 237 QYDSFAKKVS---GSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES 313 (432) T ss_pred HHHHHHHHHh---hhhcCCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHH Confidence 7777776654 55688999999999999999999999999999999999999999999999999887665 4578999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Q lcl|NC_019705. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 318 ~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g 397 (424) +.+.|+++||.||++.||++|+++|+++.++..++++||+++|+++|.+++++++.+++++|+||+||+|+++|+||+|| T Consensus 314 ~~~~f~~~tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g 393 (432) T protein:vir:81 314 QQLGFLTMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG 393 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 99999999999999999999999999998877789999999999999999999999999999999999999999999998 Q ss_pred CC-eeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 398 GD-VAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 gd-~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |+ .+++++|++|++..+.+..+++.++ T Consensus 394 ~~~~~~~~~~~~pl~~~~~~~~~~~~~~ 421 (432) T protein:vir:81 394 NAAVLTVQSAMVPLDSIGLQASPEPASG 421 (432) T ss_pred CcceEeecCcccchhhhccCCCCCCCCC Confidence 75 5558999999998887766655554 No 5 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=100.00 E-value=1.1e-100 Score=568.61 Aligned_cols=410 Identities=29% Similarity=0.501 Sum_probs=356.8 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccchh-------hccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceE Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQT-------GPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |-=--.+|||+|+++.|.+++.......... ..++...+..+..|+.+.|+++++||+||++||++||++||+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~ 80 (432) T protein:vir:10 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLT 80 (432) T ss_pred CCCCcccchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHHHHhhhhCcee Confidence 2222346999999999987654322111111 112333455788899999999999999999999999999999 Q ss_pred EEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecC- Q lcl|NC_019705. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG- 159 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~- 159 (424) +|+++.+|.. ...+|++.++|+.+||++||+++||+.++.+++++||||++++|+ +|.+.+||||+|++|++..+. T Consensus 81 ~y~~~~~g~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~~~~v~v~~~~~ 157 (432) T protein:vir:10 81 MYMRTPDGRK--EAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTK 157 (432) T ss_pred EEEecCCCcc--cccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEEcCC Confidence 9999887743 346799999999999999999999999999999999999999997 599999999999999998864 Q ss_pred ceEEEEEE-eCCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH Q lcl|NC_019705. 160 KKVVYRYQ-RDSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 160 ~~~~~~~~-~~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~ 238 (424) +...|.|. .++..+.|+++||+|+|+++.|+++|+||+..++.++.+..++++++.++|+||++|++|++++.. ++++ T Consensus 158 g~~~y~~~~~~g~~~~~~~~~iih~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-l~~e 236 (432) T protein:vir:10 158 GNTAYRYRRTDGQMIDIPKQQIWKIMGYSLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRF-LTDD 236 (432) T ss_pred CcEEEEEEecCceEEEEcCccEEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCC-CCHH Confidence 45556554 466778899999999999999999999999999999999999999999999999999999999865 6778 Q ss_pred HHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcc-cchhHHH Q lcl|NC_019705. 239 QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS-WGSGIEQ 317 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~-~~~n~e~ 317 (424) +++++++.|. +..|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..+.+++ +++|+|+ T Consensus 237 ~~~~~~~~~~---~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~ 313 (432) T protein:vir:10 237 QYDSFAKKVS---GSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES 313 (432) T ss_pred HHHHHHHHHh---hhhhCCCceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHH Confidence 8777776664 45688999999999999999999999999999999999999999999999999887655 5578999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Q lcl|NC_019705. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 318 ~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g 397 (424) +.+.|+++||.||++.||++|+++|+++.++..++++||+++++++|.+++++++.+++++|+||+||+|+++|+||++| T Consensus 314 ~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~g 393 (432) T protein:vir:10 314 QQLGFLSMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG 393 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 99999999999999999999999999998877789999999999999999999999999999999999999999999998 Q ss_pred CC-eeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 398 GD-VAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 gd-~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |+ .+++++|++|++.++.+.++++.++ T Consensus 394 ~~~~~~~~~~~~pl~~~~~~~~~~~~~~ 421 (432) T protein:vir:10 394 NAAVLTVQSAMVPLDSIGLQASPEPASG 421 (432) T ss_pred CcceEeecCcccchhhhcccCCCCCCCC Confidence 75 4558999999998877766655555 No 6 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=100.00 E-value=2.7e-100 Score=566.44 Aligned_cols=415 Identities=23% Similarity=0.383 Sum_probs=352.7 Q ss_pred CCCCcccccCCCCC--chHHHHHhhccCcccCCccccchhhcccc-ccccCcccccHHHHhccHHHHHHHHHHHHhhccC Q lcl|NC_019705. 1 MEEPKYTIDLRTNN--GWWARLQSWFVGGRLVTPNQGSQTGPVSA-HGHLGDSSINDERILQISTVWRCVSLISTLTACL 77 (424) Q Consensus 1 ~~~~~~~~~~~~~~--G~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~ 77 (424) |----.|-.+.=+. .+|++| |+++...+|..+.....+.. ....++..|+.+.|+++++||+||++||++||++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~l---f~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~al~~~~v~~cv~~Ia~~iA~l 77 (424) T protein:vir:45 1 MLYCWWAHWLWPEGGRVLLDAL---FRSKSLENPSTPITGDAVDTDGLFRADVYVSPETAMKLAAVYSCIYVLSSSLAQM 77 (424) T ss_pred CeeEeeeceecCcchhHHHHhh---ccccCCCCCccccchhhhhhhccccCCceechHHhhccHHHHHHHHHHHHHHhhC Confidence 22111121222111 233333 44444444444333222222 2234577899999999999999999999999999 Q ss_pred ceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEee Q lcl|NC_019705. 78 PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKL 157 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~ 157 (424) ||++|++++++ . +...+|++.++|+.+||++||+++||+.++.+++++||+|+++.|+..|.+++|+|++|+.|++.. T Consensus 78 p~~v~~~~~~~-~-~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~~ 155 (424) T protein:vir:45 78 PLHVMRRHKGK-V-EPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMN 155 (424) T ss_pred ceEEEEecCCc-e-eecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEE Confidence 99999876433 3 345679999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceEEEEEEeCCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCH Q lcl|NC_019705. 158 VGKKVVYRYQRDSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~ 237 (424) +++...|.+..++....|+++||||||++++|+++|+||+..++.+++...++++++.++|+||++|+|+|+++.. +++ T Consensus 156 ~~~~~~y~~~~~~~~~~~~~~eVih~r~~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~ 234 (424) T protein:vir:45 156 TGGRYTYGLYNEYGAFAISPDDMIHIRALGNNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSG-LNK 234 (424) T ss_pred cCCeEEEEEEecCceEEECcccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC-CCH Confidence 9999999998888888999999999999999999999999999999999999999999999999999999999875 688 Q ss_pred HHHHHHHHHHHHHhCC--cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhH Q lcl|NC_019705. 238 QQRSQVEENFKEIAGG--PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGI 315 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~--~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~ 315 (424) ++.+++++.|++.++| +|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||+.++++ ++|+ T Consensus 235 e~~~~~~~~~~~~~~g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~ 312 (424) T protein:vir:45 235 ESWGWLKDQWQKASQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKAT--FSNI 312 (424) T ss_pred HHHHHHHHHHHHHhccccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccH Confidence 9999999999887665 58999999999999999999999999999999999999999999999999987765 4689 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCC Q lcl|NC_019705. 316 EQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPP 394 (424) Q Consensus 316 e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p 394 (424) |++.++|+++||.||+++||++|+++|+++.++. +++++||+++++++|.+++++.+.+++++|+||+||+|+++|+|| T Consensus 313 eq~~~~f~~~tL~P~~~~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~p 392 (424) T protein:vir:45 313 SAQAIQFVRYTMMPWVTNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNP 392 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Confidence 9999999999999999999999999999988765 488999999999999999999999999999999999999999999 Q ss_pred CCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 395 LPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 395 ~~ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) +||||++++|+|+.+...... +.+.+.|- T Consensus 393 i~ggD~~~~~~n~~~~~~~~~-~~~~~~~~ 421 (424) T protein:vir:45 393 VEGLDEMLVSVNAANPAGDFK-PPKNDEGK 421 (424) T ss_pred CCCcceeeecccccccccccC-CCCCCCCC Confidence 999999999999987543211 11111111 No 7 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=100.00 E-value=2.5e-100 Score=566.61 Aligned_cols=406 Identities=23% Similarity=0.398 Sum_probs=354.0 Q ss_pred CchHHHHHhhccCcccCCccccc----hhhccccc-cccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCC Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGS----QTGPVSAH-GHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~-~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~ 88 (424) +|||+|++++|+..+...+.... ......+. ....+..|+.+.|+++++||+||++||++||++||+++++++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 99999999988633322211111 11111111 23457788999999999999999999999999999999997766 Q ss_pred ccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc-------e Q lcl|NC_019705. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK-------K 161 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~-------~ 161 (424) .. ...+|++.+||+.+||++||+++||+.++.+++++||||++++|+..|.+++||||+|++|++..++. . T Consensus 81 ~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~ 158 (432) T protein:vir:10 81 IQ--RGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTK 158 (432) T ss_pred ee--eccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccce Confidence 43 45689999999999999999999999999999999999999999999999999999999999887642 3 Q ss_pred EEEEEEeCCceEEecHhHEEEeecC-CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHH Q lcl|NC_019705. 162 VVYRYQRDSEYAEFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 162 ~~~~~~~~~~~~~~~~~eiih~r~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~ 240 (424) .+|.+..++....|+++||||+|++ +.++++|+||+..+..++....++++++.++|+||++|+++|+++.. +++++. T Consensus 159 ~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~-l~~e~~ 237 (432) T protein:vir:10 159 MWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGD-LNEDAK 237 (432) T ss_pred EEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC-CCHHHH Confidence 4567777888889999999999975 57899999999999999999999999999999999999999999865 678888 Q ss_pred HHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHH Q lcl|NC_019705. 241 SQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN 319 (424) Q Consensus 241 ~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~ 319 (424) +++++.|++.+++ .|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++. T Consensus 238 ~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~s~~e~~~ 315 (432) T protein:vir:10 238 KVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQ 315 (432) T ss_pred HHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHH Confidence 9999999987666 68999999999999999999999999999999999999999999999999877765 55899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019705. 320 LGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 320 ~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg 398 (424) ++|+++||+|++++||++|+++|+++.++. +++++||+++|+++|.+++++++++++++|++|+||+|+++|+||+||| T Consensus 316 ~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~gg 395 (432) T protein:vir:10 316 QQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGG 395 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 999999999999999999999999998764 5889999999999999999999999999999999999999999999999 Q ss_pred Ceeeecccccchhhcccc--CCCcccCC Q lcl|NC_019705. 399 DVAMRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~--~~~~~~ga 424 (424) |++++|+|++|++.+++. +++.+++- T Consensus 396 D~~~~~~n~~~~~~~~~~~~k~~~~~~~ 423 (432) T protein:vir:10 396 DRLLVNGNMLPIDMAGQAYLKGGDTNGE 423 (432) T ss_pred CeEeecccccchhhccccccCCCCCCCC Confidence 999999999999876542 11111111 No 8 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=100.00 E-value=2.5e-100 Score=566.61 Aligned_cols=406 Identities=23% Similarity=0.398 Sum_probs=354.0 Q ss_pred CchHHHHHhhccCcccCCccccc----hhhccccc-cccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCC Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGS----QTGPVSAH-GHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~-~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~ 88 (424) +|||+|++++|+..+...+.... ......+. ....+..|+.+.|+++++||+||++||++||++||+++++++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 99999999988633322211111 11111111 23457788999999999999999999999999999999997766 Q ss_pred ccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc-------e Q lcl|NC_019705. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK-------K 161 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~-------~ 161 (424) .. ...+|++.+||+.+||++||+++||+.++.+++++||||++++|+..|.+++||||+|++|++..++. . T Consensus 81 ~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~ 158 (432) T protein:vir:10 81 IQ--RGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTK 158 (432) T ss_pred ee--eccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccce Confidence 43 45689999999999999999999999999999999999999999999999999999999999887642 3 Q ss_pred EEEEEEeCCceEEecHhHEEEeecC-CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHH Q lcl|NC_019705. 162 VVYRYQRDSEYAEFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 162 ~~~~~~~~~~~~~~~~~eiih~r~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~ 240 (424) .+|.+..++....|+++||||+|++ +.++++|+||+..+..++....++++++.++|+||++|+++|+++.. +++++. T Consensus 159 ~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~-l~~e~~ 237 (432) T protein:vir:10 159 MWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGD-LNEDAK 237 (432) T ss_pred EEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC-CCHHHH Confidence 4567777888889999999999975 57899999999999999999999999999999999999999999865 678888 Q ss_pred HHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHH Q lcl|NC_019705. 241 SQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN 319 (424) Q Consensus 241 ~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~ 319 (424) +++++.|++.+++ .|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++. T Consensus 238 ~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~s~~e~~~ 315 (432) T protein:vir:10 238 KVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQ 315 (432) T ss_pred HHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHH Confidence 9999999987666 68999999999999999999999999999999999999999999999999877765 55899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019705. 320 LGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 320 ~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg 398 (424) ++|+++||+|++++||++|+++|+++.++. +++++||+++|+++|.+++++++++++++|++|+||+|+++|+||+||| T Consensus 316 ~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~gg 395 (432) T protein:vir:10 316 QQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGG 395 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 999999999999999999999999998764 5889999999999999999999999999999999999999999999999 Q ss_pred Ceeeecccccchhhcccc--CCCcccCC Q lcl|NC_019705. 399 DVAMRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~--~~~~~~ga 424 (424) |++++|+|++|++.+++. +++.+++- T Consensus 396 D~~~~~~n~~~~~~~~~~~~k~~~~~~~ 423 (432) T protein:vir:10 396 DRLLVNGNMLPIDMAGQAYLKGGDTNGE 423 (432) T ss_pred CeEeecccccchhhccccccCCCCCCCC Confidence 999999999999876542 11111111 No 9 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=100.00 E-value=2.5e-100 Score=566.61 Aligned_cols=406 Identities=23% Similarity=0.398 Sum_probs=354.0 Q ss_pred CchHHHHHhhccCcccCCccccc----hhhccccc-cccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCC Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGS----QTGPVSAH-GHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~-~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~ 88 (424) +|||+|++++|+..+...+.... ......+. ....+..|+.+.|+++++||+||++||++||++||+++++++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 99999999988633322211111 11111111 23457788999999999999999999999999999999997766 Q ss_pred ccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc-------e Q lcl|NC_019705. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK-------K 161 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~-------~ 161 (424) .. ...+|++.+||+.+||++||+++||+.++.+++++||||++++|+..|.+++||||+|++|++..++. . T Consensus 81 ~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~ 158 (432) T protein:vir:10 81 IQ--RGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTK 158 (432) T ss_pred ee--eccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccce Confidence 43 45689999999999999999999999999999999999999999999999999999999999887642 3 Q ss_pred EEEEEEeCCceEEecHhHEEEeecC-CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHH Q lcl|NC_019705. 162 VVYRYQRDSEYAEFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 162 ~~~~~~~~~~~~~~~~~eiih~r~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~ 240 (424) .+|.+..++....|+++||||+|++ +.++++|+||+..+..++....++++++.++|+||++|+++|+++.. +++++. T Consensus 159 ~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~-l~~e~~ 237 (432) T protein:vir:10 159 MWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGD-LNEDAK 237 (432) T ss_pred EEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC-CCHHHH Confidence 4567777888889999999999975 57899999999999999999999999999999999999999999865 678888 Q ss_pred HHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHH Q lcl|NC_019705. 241 SQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN 319 (424) Q Consensus 241 ~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~ 319 (424) +++++.|++.+++ .|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++. T Consensus 238 ~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~s~~e~~~ 315 (432) T protein:vir:10 238 KVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQ 315 (432) T ss_pred HHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHH Confidence 9999999987666 68999999999999999999999999999999999999999999999999877765 55899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019705. 320 LGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 320 ~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg 398 (424) ++|+++||+|++++||++|+++|+++.++. +++++||+++|+++|.+++++++++++++|++|+||+|+++|+||+||| T Consensus 316 ~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~gg 395 (432) T protein:vir:10 316 QQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGG 395 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 999999999999999999999999998764 5889999999999999999999999999999999999999999999999 Q ss_pred Ceeeecccccchhhcccc--CCCcccCC Q lcl|NC_019705. 399 DVAMRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~--~~~~~~ga 424 (424) |++++|+|++|++.+++. +++.+++- T Consensus 396 D~~~~~~n~~~~~~~~~~~~k~~~~~~~ 423 (432) T protein:vir:10 396 DRLLVNGNMLPIDMAGQAYLKGGDTNGE 423 (432) T ss_pred CeEeecccccchhhccccccCCCCCCCC Confidence 999999999999876542 11111111 No 10 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=100.00 E-value=2.2e-100 Score=566.99 Aligned_cols=404 Identities=26% Similarity=0.425 Sum_probs=351.5 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +||++.+++. ..............+....+.+|..|+.+.|+++++||+||++||++||++||++|+++++|.. +. T Consensus 1 m~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~-~~ 76 (419) T protein:vir:57 1 MFIPQFWKGR---PSENRVNWQVVPGGMRSSSSQAGVIITPETALALSAVRACVTLLAESVAQLPCVLYRRTENGGR-EI 76 (419) T ss_pred CcchhhhccC---CccccccccccccccccccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCce-ec Confidence 6776654322 1211111222223344556677889999999999999999999999999999999999888754 44 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEEEEEEeCCceE Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ..+|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|.+++||||+|++|++..+++...| |...+.+. T Consensus 77 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~g~~~-y~~~~~~~ 155 (419) T protein:vir:57 77 AFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPDGMPY-YDIPSIGE 155 (419) T ss_pred cccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCCceEE-EEEcCCce Confidence 56899999999999999999999999999999999999999999999999999999999999887654432 33344456 Q ss_pred EecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC---CCCHHHHHHHHHHHHHH Q lcl|NC_019705. 174 EFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK---VLTEQQRSQVEENFKEI 250 (424) Q Consensus 174 ~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~---~~~~~~~~~~~~~~~~~ 250 (424) .+++++|+|+|+++.|+++|+||+..++.++....++++++.++|+||++|+|+|+.+.. ..++++.+++++.|++. T Consensus 156 ~~~~~~vih~r~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~ 235 (419) T protein:vir:57 156 ILPMRMVHHIKSFSLDGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTER 235 (419) T ss_pred EEchhhEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHH Confidence 799999999999999999999999999999999999999999999999999999998643 46788899999999887 Q ss_pred hCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHH Q lcl|NC_019705. 251 AGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 251 ~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P 329 (424) +++ .|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.++|+++||+| T Consensus 236 ~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e~~~~~f~~~~l~P 313 (419) T protein:vir:57 236 YGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKST--NNNIEHQGLQYVIYTMLA 313 (419) T ss_pred hccccccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCc--cccHHHHHHHHHHHHHHH Confidence 666 68999999999999999999999999999999999999999999999999877654 568999999999999999 Q ss_pred HHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccc Q lcl|NC_019705. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 330 ~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~ 409 (424) ++++||++|+++|+++.++.+++++||++.|+++|.+++++++++++++|+||+||+|+++|+||+||||++++|+|+++ T Consensus 314 ~~~~ie~~l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~ 393 (419) T protein:vir:57 314 ILKRHESAMMRDLLLPSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPIPGGDKYLTPLNMVD 393 (419) T ss_pred HHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccc Confidence 99999999999999998887889999999999999999999999999999999999999999999999999999999998 Q ss_pred hhhccccCCCcccCC Q lcl|NC_019705. 410 ITDLGTNKEPRNNGA 424 (424) Q Consensus 410 ~~~~~~~~~~~~~ga 424 (424) ++.+.+.+++.++.- T Consensus 394 ~~~~~~~~~~~~~~~ 408 (419) T protein:vir:57 394 SKALTGIGKATPQQL 408 (419) T ss_pred ccccccccCCCcccC Confidence 876655333322222 No 11 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=100.00 E-value=2.2e-100 Score=566.93 Aligned_cols=410 Identities=29% Similarity=0.503 Sum_probs=356.1 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccc-------hhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceE Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGS-------QTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |-=-=.+|||+++++.|.+++........ ....++...+..|..|+++.|+++++||+||++||++||++||+ T Consensus 1 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~~~v~~Ia~~ia~lp~~ 80 (432) T protein:vir:97 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAVAAMPLM 80 (432) T ss_pred CCCcccCchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHhhhcchHHHHHHHHHHHhhccCceE Confidence 22222469999999999876543211111 11122334456788899999999999999999999999999999 Q ss_pred EEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecC- Q lcl|NC_019705. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG- 159 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~- 159 (424) +|+++.+|..+ ..+|++.++|+.+||++||+++||+.++.+++++||||++++|+ +|++.+||||+|++|++..+. T Consensus 81 ~y~~~~~g~~~--~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~p~~v~v~~~~~ 157 (432) T protein:vir:97 81 MYMRTPDGRKE--AVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTK 157 (432) T ss_pred EEEecCCCccc--ccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEEcCC Confidence 99998877433 46799999999999999999999999999999999999999997 599999999999999998864 Q ss_pred ceEEEEEE-eCCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH Q lcl|NC_019705. 160 KKVVYRYQ-RDSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 160 ~~~~~~~~-~~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~ 238 (424) +...|.|. .++....|+++||+|+|+++.|+++|+||+..++.++.+..+++++..++|+||++|+|+|+++.. ++++ T Consensus 158 g~~~y~~~~~~g~~~~~~~~~iih~r~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~-l~~e 236 (432) T protein:vir:97 158 GNTAYRYRRTDGQMIDIPRQQIWKIMGYSLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRF-LTDD 236 (432) T ss_pred CcEEEEEEecCceEEEEccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCC-CCHH Confidence 45566654 456678899999999999999999999999999999999999999999999999999999999865 5777 Q ss_pred HHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcc-cchhHHH Q lcl|NC_019705. 239 QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS-WGSGIEQ 317 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~-~~~n~e~ 317 (424) +++++++.+ .+..|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..+.+++ +++|+|+ T Consensus 237 ~~~~~~~~~---~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~ 313 (432) T protein:vir:97 237 QYDSFSKKV---SGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES 313 (432) T ss_pred HHHHHHHHH---hhhhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHH Confidence 777766655 355688999999999999999999999999999999999999999999999999887665 4578999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Q lcl|NC_019705. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 318 ~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g 397 (424) +.+.|+++||.||++.||++|+++|+++.++..++++||++.|+++|.+++++++.+++++|+||+||+|+++|+||++| T Consensus 314 ~~~~f~~~tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~g 393 (432) T protein:vir:97 314 QQLGFLTMTLSPWLRRIEQSIALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG 393 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 99999999999999999999999999998877789999999999999999999999999999999999999999999998 Q ss_pred CCee-eecccccchhhccccCCCcccCC Q lcl|NC_019705. 398 GDVA-MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 gd~~-~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ||.+ +++.|++|++.++.+..+++.++ T Consensus 394 ~~~~~~~~~~~~pl~~~~~~~~~~~~~~ 421 (432) T protein:vir:97 394 NAAVLTVQSAMVPLDSIGLQASPEPASG 421 (432) T ss_pred CcceEeecccccchhhhcccCCCCCCCC Confidence 7654 58999999998877666655544 No 12 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=100.00 E-value=5.2e-100 Score=564.87 Aligned_cols=398 Identities=23% Similarity=0.398 Sum_probs=351.3 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +|||+|++++|.+........ .+. ...+.++..++.+.++++++||+||++||++||++||++|++++++.. . T Consensus 1 MG~~~~~~~~~~~~~~~~~~~----~~~-~~~~~g~~~~~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~--~ 73 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNETVDMT----NPL-LLQWLGVDPDTPRNQLSEATYFACLKILSESLGKLPLKMYQKTERGIV--K 73 (411) T ss_pred CchHHHHHhhccCcccccccc----hHH-HHHHhcCcccChhhhhccHHHHHHHHHHHHhHhhCceeEEEecCCcee--e Confidence 999999998887543322111 111 123445667889999999999999999999999999999998877643 3 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCce-------EEEEE Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-------VVYRY 166 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~-------~~~~~ 166 (424) ..+|++.++|+.+||++||+++||+.++.+++++||||++++|+ .|.+.+|||++|++|++..++.. .+|.| T Consensus 74 ~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 152 (411) T protein:vir:81 74 SDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS-GPQLQALWILPSQYVTIVVDDRGLLGEKNAIWYRY 152 (411) T ss_pred ecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCceEEEEEECCceEEEEEcCcccccccceEEEEE Confidence 45799999999999999999999999999999999999999998 58999999999999999887542 34455 Q ss_pred E--eCCceEEecHhHEEEeec-CCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019705. 167 Q--RDSEYAEFSQKEIFHLKG-FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 167 ~--~~~~~~~~~~~eiih~r~-~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~ 243 (424) . .++..+.|+++||||+|+ ++.++++|+||+..++.++.+..++++++.++|+||++|+|+|+++.. +++++++++ T Consensus 153 ~~~~~g~~~~~~~~eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~e~~~~~ 231 (411) T protein:vir:81 153 NDPYDGKMYVFRNDEILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGD-LNQEARDRL 231 (411) T ss_pred EecCCceEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC-CCHHHHHHH Confidence 4 356677899999999996 467999999999999999999999999999999999999999999865 688889999 Q ss_pred HHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHH Q lcl|NC_019705. 244 EENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF 322 (424) Q Consensus 244 ~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~ 322 (424) ++.|++.+++ +|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.++| T Consensus 232 ~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~~n~e~~~~~f 309 (411) T protein:vir:81 232 VKGFEQFANGSKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSS--YASAEAQNLAF 309 (411) T ss_pred HHHHHHHhcCccccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--chhHHHHHHHH Confidence 9999987665 68999999999999999999999999999999999999999999999999887765 46899999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCee Q lcl|NC_019705. 323 LQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVA 401 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~ 401 (424) +++||.|++++||++|+++|+++.++. +++++||+++++++|.+++++++++++++|+||+||+|+++|+||+||||++ T Consensus 310 ~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~ggD~~ 389 (411) T protein:vir:81 310 YVDTLLYVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDYGNNL 389 (411) T ss_pred HHHHHHHHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCee Confidence 999999999999999999999998864 5889999999999999999999999999999999999999999999999999 Q ss_pred eecccccchhhccccCC-Cccc Q lcl|NC_019705. 402 MRQSQYVPITDLGTNKE-PRNN 422 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~-~~~~ 422 (424) ++++|++|++.++++.+ +.|+ T Consensus 390 ~~~~n~~pl~~~~~~~~kgGd~ 411 (411) T protein:vir:81 390 MANGNYIPLSMLGANYGKGGDS 411 (411) T ss_pred eeccCccchhhhhhhhccCCCC Confidence 99999999998877543 2222 No 13 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=100.00 E-value=3.7e-100 Score=565.70 Aligned_cols=402 Identities=26% Similarity=0.410 Sum_probs=346.5 Q ss_pred CchHHHHHhhccCcccCC--ccccchh-hccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcc Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVT--PNQGSQT-GPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~--~~~~~~~-~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~ 90 (424) +++. .||++..... +..+... ..+......+|+.|+++.|+++++||+||++||++||++||++|+++++|+. T Consensus 1 m~~~----~~~~~~~~~~s~~~~w~~~~~~~~~~~~~~g~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~g~~ 76 (421) T protein:vir:10 1 MFIP----QMFEGKKRSVSGGGFWEAMLGGVRSSHSKAGVMITPETALALSAVRACVTLLAESVAQLPVELYRRDKNGGR 76 (421) T ss_pred CCCc----chhcccccccCcchhhHHHhhhhccCcccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCce Confidence 3332 3333332221 1111111 1122334456788999999999999999999999999999999999887764 Q ss_pred ceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCce-EEEEEEeC Q lcl|NC_019705. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-VVYRYQRD 169 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~-~~~~~~~~ 169 (424) + ...+|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|.|++||||+|++|++..+++. .+|.+.. T Consensus 77 ~-~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~g~~~y~~~~- 154 (421) T protein:vir:10 77 Q-RATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGPDGMPYYEIPE- 154 (421) T ss_pred e-ecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECCCceEEEEEcC- Confidence 4 456799999999999999999999999999999999999999999999999999999999999887654 4444433 Q ss_pred CceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCC---CCHHHHHHHHHH Q lcl|NC_019705. 170 SEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV---LTEQQRSQVEEN 246 (424) Q Consensus 170 ~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~---~~~~~~~~~~~~ 246 (424) .+..++++||+|+|+++.|+++|+||+..++.++....++++++.++|+||++|+|+|+.+... .++++.+++++. T Consensus 155 -~g~~~~~~eiih~~~~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~ 233 (421) T protein:vir:10 155 -IGETLPMRMMHHVKVFSLDGYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAK 233 (421) T ss_pred -CCcEEchhhEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHH Confidence 3457999999999999999999999999999999999999999999999999999999987643 488899999999 Q ss_pred HHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHH Q lcl|NC_019705. 247 FKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 247 ~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~ 325 (424) |++.++| .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.+.|+++ T Consensus 234 ~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e~~~~~f~~~ 311 (421) T protein:vir:10 234 WTDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKAT--NNNIEHQGLQFVMY 311 (421) T ss_pred HHHHhcCccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCc--cccHHHHHHHHHHH Confidence 9987765 68899999999999999999999999999999999999999999999999987765 45899999999999 Q ss_pred HHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecc Q lcl|NC_019705. 326 TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQS 405 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~ 405 (424) ||.|++.+||++|+++|+++.++.+++++||++.++++|.+++++++.+++++|+||+||+|+++|+||+||||++++|+ T Consensus 312 tl~P~~~~ie~~ln~kL~~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~ 391 (421) T protein:vir:10 312 TLLAWLKRHEGALQRDLLLPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPIAGGDKYLTPL 391 (421) T ss_pred HHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecc Confidence 99999999999999999999887778899999999999999999999999999999999999999999999999999999 Q ss_pred cccchhhccccC--CCcccCC Q lcl|NC_019705. 406 QYVPITDLGTNK--EPRNNGA 424 (424) Q Consensus 406 n~~~~~~~~~~~--~~~~~ga 424 (424) |+++++.....+ +..+.+| T Consensus 392 n~~~~~~~~~~~~~~~~~~~~ 412 (421) T protein:vir:10 392 NMVDSAQIIPGDKKPTAQQMA 412 (421) T ss_pred ccccccccccCCCCcccccCc Confidence 999887664322 2222333 No 14 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=100.00 E-value=8.2e-100 Score=563.80 Aligned_cols=414 Identities=33% Similarity=0.594 Sum_probs=357.4 Q ss_pred CCCCcccccCCCCCchH-HHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCce Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWW-ARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPL 79 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~ 79 (424) |.++| +-..+-+ ..+++||+.+. .+.....+..+.+.....+..|+.+.|+++|+||+||++||++||++|| T Consensus 1 ~~~~~-----~~~~~~~~~~~~~~~g~~~--s~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~lp~ 73 (437) T protein:vir:10 1 MKQGK-----QRALGRIKSSFLKWLGVPI--SLTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATLPL 73 (437) T ss_pred CCcch-----hhhhhhhHHhhhhhcCCcc--cCCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhhCce Confidence 33332 2222322 23466665432 2222333445566667788899999999999999999999999999999 Q ss_pred EEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecC Q lcl|NC_019705. 80 DVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG 159 (424) Q Consensus 80 ~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~ 159 (424) ++|+++++|... ...+|++.++|+.+||++||+++||+.++.+++++||||++++|+. |.+++||||+|++|++..+. T Consensus 74 ~~~~~~~~g~~~-~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-g~~~~L~~l~p~~v~i~~~~ 151 (437) T protein:vir:10 74 NLYQTKPDGTRV-LAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSA-GVLIGLELMLPQRTTVKRLT 151 (437) T ss_pred eEEEEcCCCcee-eccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEEEEEEEcCcceEEEECC Confidence 999998877544 4568999999999999999999999999999999999999999984 99999999999999998764 Q ss_pred -ceEEEEEE-eCCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCH Q lcl|NC_019705. 160 -KKVVYRYQ-RDSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 160 -~~~~~~~~-~~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~ 237 (424) +...|.|. .++....|+++||+|||++++|+++|+||+..++.++.+..++++++.++|+||++|+|+|+.+.. +++ T Consensus 152 ~g~~~y~~~~~~g~~~~~~~~dIih~r~~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~ 230 (437) T protein:vir:10 152 SGALQYTYRNVDGTVSTLAEDDVFHVRGFSLDGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQI-LQK 230 (437) T ss_pred CCeEEEEEEecCceEEEEccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC-CCH Confidence 44556554 466678899999999999999999999999999999999999999999999999999999999865 688 Q ss_pred HHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHH Q lcl|NC_019705. 238 QQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e 316 (424) ++.+++++.|++.+++ .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++++++|+| T Consensus 231 e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e 310 (437) T protein:vir:10 231 EKRAEIRTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIE 310 (437) T ss_pred HHHHHHHHHHHHHhcCccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHH Confidence 8889999999876655 689999999999999999999999999999999999999999999999999999888889999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC Q lcl|NC_019705. 317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP 396 (424) Q Consensus 317 ~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ 396 (424) ++.+.|+++||.||+..||++|+++||++.++..++++||+++++++|.+++++++.+++++|+||+||+|+++|+||+| T Consensus 311 ~~~~~f~~~tl~P~~~~ie~~l~~kll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ 390 (437) T protein:vir:10 311 QQTLGFLTFTLRPWLTRIEQAARRSLLRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMG 390 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 99999999999999999999999999999888778899999999999999999999999999999999999999999999 Q ss_pred CCCee-eecccccchhhccccCCCccc-----CC Q lcl|NC_019705. 397 GGDVA-MRQSQYVPITDLGTNKEPRNN-----GA 424 (424) Q Consensus 397 ggd~~-~~~~n~~~~~~~~~~~~~~~~-----ga 424 (424) |||++ ++++|++|++..+++..+..+ |+ T Consensus 391 gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (437) T protein:vir:10 391 GNAAVLTVQSALLPIDKLGEHTTATAAQDALKAW 424 (437) T ss_pred CCcceEeecCcccchhhccCcCCCcchhcccccc Confidence 88765 479999999876654322111 00 No 15 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=100.00 E-value=7.8e-100 Score=563.92 Aligned_cols=402 Identities=29% Similarity=0.485 Sum_probs=348.4 Q ss_pred CchHHHHHhhccCcccCCccccchhh--ccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccc Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTG--PVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~ 91 (424) +|||++++.. ++........... ........++..|+.+.++++++||+||++||++||++||++|+.++++ + T Consensus 1 Mg~f~~lf~r---~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~--~ 75 (414) T protein:vir:44 1 MVFFSGLFQR---KSDAPVTTPAELADAIGLSYDTYTGKQISSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSL--K 75 (414) T ss_pred Cchhhhhhcc---CccCcccchhhHhHhhccCccccCCceechhhhhccHHHHHHHHHHHHHhccCceEEEEecCCc--e Confidence 8999887443 3222222222221 1223345678889999999999999999999999999999999987665 3 Q ss_pred eeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecC-ceEEEEEE-eC Q lcl|NC_019705. 92 KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG-KKVVYRYQ-RD 169 (424) Q Consensus 92 ~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~-~~~~~~~~-~~ 169 (424) +...+|++.++|+.+||++||+++||+.++.+++++||||++++|+ .|.+++||||+|.+|++..++ +...|.+. .+ T Consensus 76 ~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~~~y~~~~~~ 154 (414) T protein:vir:44 76 QRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNSSWEPVYQVTFPD 154 (414) T ss_pred eecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECCCCcEEEEEEecC Confidence 4466899999999999999999999999999999999999999887 599999999999999988764 34455544 45 Q ss_pred CceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHH Q lcl|NC_019705. 170 SEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKE 249 (424) Q Consensus 170 ~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~ 249 (424) +....|+++||||||+++.|+++|+||+..+..++++..++++++.++|+||++|+|+|+++.. +++++.+++++.|++ T Consensus 155 g~~~~~~~~evih~~~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~e~~~~~~~~~~~ 233 (414) T protein:vir:44 155 GSTDVLSQEDIWHVRTLTLDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQT-LSDQAYERLKKDFEE 233 (414) T ss_pred ceEEEEccccEEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC-CCHHHHHHHHHHHHH Confidence 5678899999999999999999999999999999999999999999999999999999999865 688889999999988 Q ss_pred HhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHH Q lcl|NC_019705. 250 IAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ 328 (424) Q Consensus 250 ~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~ 328 (424) .+++ +|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.+.|+++||+ T Consensus 234 ~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t--~~n~e~~~~~~~~~~l~ 311 (414) T protein:vir:44 234 RHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRAT--FNNIEELGLGFINYSLV 311 (414) T ss_pred HhcCccccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHHHHHHHHHHH Confidence 7665 68999999999999999999999999999999999999999999999999877654 56999999999999999 Q ss_pred HHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccccc Q lcl|NC_019705. 329 PYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYV 408 (424) Q Consensus 329 P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~ 408 (424) |++++||++|+++|+++.++..++++||+++|+++|.+++++++++++++|+||+||+|+++|+||+||||++++|+|+. T Consensus 312 P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~ 391 (414) T protein:vir:44 312 PYLTRIEQRINTGLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMT 391 (414) T ss_pred HHHHHHHHHHHhhcCCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeccccccc Confidence 99999999999999999887778899999999999999999999999999999999999999999999999999999998 Q ss_pred chhhccccC--CCcccCC Q lcl|NC_019705. 409 PITDLGTNK--EPRNNGA 424 (424) Q Consensus 409 ~~~~~~~~~--~~~~~ga 424 (424) +....+... ++.++.+ T Consensus 392 ~~~~~~~~~~~~~~~~~~ 409 (414) T protein:vir:44 392 TKPSDGSKAGKQKDNANA 409 (414) T ss_pred ccCCccccCCCCCCCCCC Confidence 665433322 2222222 No 16 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=100.00 E-value=1.5e-99 Score=562.41 Aligned_cols=405 Identities=23% Similarity=0.388 Sum_probs=353.3 Q ss_pred CchHHHHHhhccCcccCC-ccccchh--hccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcc Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVT-PNQGSQT--GPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~ 90 (424) +|||++++++++..+... ....... ..+.+ ...++..|+.+.|+++++||+||++||++||++||++|++.+++. T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g-~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~l~~~~~~~~~~~~- 78 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLG-ISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYGI- 78 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHhc-CCCCcceechhhhhccHHHHHHHHHHHHhhccCceEEEEecCCce- Confidence 999999998775432211 1111111 11222 234567789999999999999999999999999999999876663 Q ss_pred ceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc-------eEE Q lcl|NC_019705. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK-------KVV 163 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~-------~~~ 163 (424) +...+|++.++|+.+||++||+++||+.++.+++++||||+++.|+..|.+++|||++|++|++..++. ..+ T Consensus 79 -~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~~~~~~ 157 (429) T protein:vir:10 79 -QRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMW 157 (429) T ss_pred -eeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEE Confidence 346679999999999999999999999999999999999999999999999999999999999987753 235 Q ss_pred EEEEeCCceEEecHhHEEEeecC-CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHH Q lcl|NC_019705. 164 YRYQRDSEYAEFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQ 242 (424) Q Consensus 164 ~~~~~~~~~~~~~~~eiih~r~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~ 242 (424) |.+..++..+.|+++||||||+. +.++++|+||+..+..+++...++++++.++|+||++|+++|+++.. +++++.++ T Consensus 158 ~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~-l~~e~~~~ 236 (429) T protein:vir:10 158 YVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGD-LNEDAKKV 236 (429) T ss_pred EEEccCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC-CCHHHHHH Confidence 66777777889999999999975 57899999999999999999999999999999999999999999865 68888999 Q ss_pred HHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHH Q lcl|NC_019705. 243 VEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG 321 (424) Q Consensus 243 ~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~ 321 (424) +++.|++.+++ .|+|+++++++|++|++++.+++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.++ T Consensus 237 ~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~sn~e~~~~~ 314 (429) T protein:vir:10 237 FRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQQQ 314 (429) T ss_pred HHHHHHHHhccccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHHHH Confidence 99999987665 68999999999999999999999999999999999999999999999999887765 4589999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCe Q lcl|NC_019705. 322 FLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDV 400 (424) Q Consensus 322 ~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~ 400 (424) |++.||.|++++||++|+++|+++.++. +++++||++.|+++|.+++++.+++++++|+||+||+|+++|+||+||||+ T Consensus 315 f~~~~l~P~~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~ 394 (429) T protein:vir:10 315 FYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDR 394 (429) T ss_pred HHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Confidence 9999999999999999999999998865 588999999999999999999999999999999999999999999999999 Q ss_pred eeecccccchhhcccc--CCCcccCC Q lcl|NC_019705. 401 AMRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 401 ~~~~~n~~~~~~~~~~--~~~~~~ga 424 (424) +++|+|++|++.+++. +++.+++. T Consensus 395 ~~~~~n~~~~d~~~~~~~k~g~~~~~ 420 (429) T protein:vir:10 395 LLVNGNMLPIDMAGQAYLKGGDTNGE 420 (429) T ss_pred eeecccccchhhccccccCCCCCCCC Confidence 9999999999876432 22222222 No 17 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=100.00 E-value=1.1e-99 Score=563.02 Aligned_cols=401 Identities=25% Similarity=0.457 Sum_probs=345.9 Q ss_pred chHHHHHhhccCcccCCccccchh-hccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 15 GWWARLQSWFVGGRLVTPNQGSQT-GPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 15 G~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =||+|.+....+ ...++..... ..++.....+++.||++.|+++++||+||++||++||++||++|+++.++. +. T Consensus 1 ~~~~r~~~~~~~--~~~~~~~~~~~~~~g~~~s~~~~~vt~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~--~~ 76 (419) T protein:vir:14 1 MFFSRQLLSNLG--QTQMSAGGWVSALLGSSRSDSGQVVTPASALALTVLQNCVTLLAESIAQLPIELYERSGEDR--KP 76 (419) T ss_pred Cccccccccccc--ccccCcchhhHHhhcCCCccCCcccchHHhhccHHHHHHHHHHHHhhccCceEEEEecCCcc--cc Confidence 234433222111 1122222222 233445566788999999999999999999999999999999999876653 44 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCce-EEEEEEeCCce Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-VVYRYQRDSEY 172 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~-~~~~~~~~~~~ 172 (424) ..+|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|.+++||||+|++|++..+++. ..|.+... T Consensus 77 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~~~~~y~~~~~--- 153 (419) T protein:vir:14 77 ATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSDLKPVYRVRGS--- 153 (419) T ss_pred ccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCceEEEEEccC--- Confidence 67899999999999999999999999999999999999999999999999999999999999887654 34444322 Q ss_pred EEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCC---CCHHHHHHHHHHHHH Q lcl|NC_019705. 173 AEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV---LTEQQRSQVEENFKE 249 (424) Q Consensus 173 ~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~---~~~~~~~~~~~~~~~ 249 (424) ..++.++|+|+|+++.|+++|+||+..+..++....++++++.++|+||++|+|+|+++... .++++.+++++.|++ T Consensus 154 ~~~~~~~i~h~~~~~~dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~ 233 (419) T protein:vir:14 154 DPMPQRLVHHVRWMSINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNA 233 (419) T ss_pred cccchhheeEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHH Confidence 24789999999999999999999999999999999999999999999999999999987654 368889999999998 Q ss_pred HhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHH Q lcl|NC_019705. 250 IAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ 328 (424) Q Consensus 250 ~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~ 328 (424) .+++ +|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.+.|+++||. T Consensus 234 ~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t--~s~~E~~~~~f~~~~L~ 311 (419) T protein:vir:14 234 KFGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERAT--FSNIEHQSLQFVIYTLL 311 (419) T ss_pred HhcCccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCC--cccHHHHHHHHHHHHHH Confidence 7766 58899999999999999999999999999999999999999999999999877665 55899999999999999 Q ss_pred HHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccccc Q lcl|NC_019705. 329 PYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYV 408 (424) Q Consensus 329 P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~ 408 (424) |++++||++|+++|+++.++..++++||+++|+++|.+++++++++++++|++|+||+|+++|+||+||||++++|+|++ T Consensus 312 P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD~~~~~~n~~ 391 (419) T protein:vir:14 312 PWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGDIYLSPMNMV 391 (419) T ss_pred HHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccccc Confidence 99999999999999999888788999999999999999999999999999999999999999999999999999999999 Q ss_pred chhhccccCCCcccCC Q lcl|NC_019705. 409 PITDLGTNKEPRNNGA 424 (424) Q Consensus 409 ~~~~~~~~~~~~~~ga 424 (424) +++...+.+.++.+.+ T Consensus 392 ~~~~~~~~~~~~~~~~ 407 (419) T protein:vir:14 392 DASKPQQLPVGKSEPT 407 (419) T ss_pred cccccccccCCCCCCc Confidence 9887665333322211 No 18 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=100.00 E-value=6.6e-99 Score=558.83 Aligned_cols=402 Identities=25% Similarity=0.406 Sum_probs=352.2 Q ss_pred CchHHHHHhhccCcccCCcc-------ccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEecc Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPN-------QGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQ 86 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~ 86 (424) +|||+++++...+.....+. .......+...+...+..|+.+.|+++++|++||++||++||++|+++++..+ T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~lp~~~~~~~~ 80 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGKLSLKIYKDKE 80 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEecCc Confidence 99999998776654322211 11122233344445566789999999999999999999999999999998643 Q ss_pred CCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCce----- Q lcl|NC_019705. 87 NDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK----- 161 (424) Q Consensus 87 ~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~----- 161 (424) . ..+|++.++|+.+||++||+++||+.++.+++++||||+++.|+..|++++|+||+|++|++..+++. T Consensus 81 ~------~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~~~~~ 154 (422) T protein:vir:13 81 E------YKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNFLSSL 154 (422) T ss_pred c------cccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcceecc Confidence 2 34689999999999999999999999999999999999999999999999999999999999887543 Q ss_pred --EEEEEE-eCCceEEecHhHEEEeecC-CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCH Q lcl|NC_019705. 162 --VVYRYQ-RDSEYAEFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 162 --~~~~~~-~~~~~~~~~~~eiih~r~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~ 237 (424) .+|.+. .++....++++||||++++ +.++++|+||+..+..++.+..++++++.++|+||++|+|+|+++.. +++ T Consensus 155 ~~~~y~~~~~~g~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~ 233 (422) T protein:vir:13 155 SKVWYVVTDKNGKEHKLLPDEMLHFIGDITLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVGD-LDE 233 (422) T ss_pred ceEEEEEEeCCCeEEEEcccceEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC-CCH Confidence 345554 4566788999999999965 67899999999999999999999999999999999999999999864 688 Q ss_pred HHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHH Q lcl|NC_019705. 238 QQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e 316 (424) ++.+++++.|++.+++ +|+++++++++|++|++++.+++|+||+|.+++++++||++|||||.+||..++++ ++|+| T Consensus 234 e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~--~sn~e 311 (422) T protein:vir:13 234 KAKKIFKKEFESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERAT--FNNLT 311 (422) T ss_pred HHHHHHHHHHHHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHH Confidence 8899999999987666 68899999999999999999999999999999999999999999999999887765 55899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC Q lcl|NC_019705. 317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPL 395 (424) Q Consensus 317 ~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~ 395 (424) ++.++|+++||.|++++||++|+++|+++.++. +++++||+++|+++|.+++++++++++++|+||+||+|+++|+||+ T Consensus 312 ~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~ 391 (422) T protein:vir:13 312 EQQKDFYVTTLQSSLTVYEQEIQDKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPV 391 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 999999999999999999999999999998875 5889999999999999999999999999999999999999999999 Q ss_pred CCCCeeeecccccchhhcccc-CCCcccCC Q lcl|NC_019705. 396 PGGDVAMRQSQYVPITDLGTN-KEPRNNGA 424 (424) Q Consensus 396 ~ggd~~~~~~n~~~~~~~~~~-~~~~~~ga 424 (424) ||||++++|+|++|++.++++ +++.+.|+ T Consensus 392 ~ggD~~~~~~n~~~l~~~~~~~~~~g~~~g 421 (422) T protein:vir:13 392 EGGDRLLVNGNMIPIEMAGEQYKKGGEKGG 421 (422) T ss_pred CCcCeeeeccCccchhhcccccccCCCcCC Confidence 999999999999999988764 34444444 No 19 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=100.00 E-value=6.6e-99 Score=558.85 Aligned_cols=401 Identities=28% Similarity=0.514 Sum_probs=350.8 Q ss_pred hHHHHHhhccCcccCCccccchhh-cc-ccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 16 WWARLQSWFVGGRLVTPNQGSQTG-PV-SAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) || |.++|++++........... .+ ......++..|+.+.|+++|+||+||++||++||++|+++++.++++. +. T Consensus 1 ~~--f~~~f~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~--~~ 76 (413) T protein:vir:48 1 MF--FSGLFQRKSDAPVTTPAELAEAIGLSYDTYTGKRISSQRAMRLTAVYSCVRVLAESVGMLPCSLYKISGTLK--TR 76 (413) T ss_pred Cc--cchhhccCccCCccchHHHHHhhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcc--ee Confidence 22 23445444333332222222 22 223356788899999999999999999999999999999999876553 34 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc-eEEEEEE-eCCc Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK-KVVYRYQ-RDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~-~~~~~~~-~~~~ 171 (424) ..+|++.++|+.+||++||+++||+.++.+++++||||++++|+ .|+|++||||+|++|++..+.+ ...|.+. .++. T Consensus 77 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~~~y~~~~~~g~ 155 (413) T protein:vir:48 77 VVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPIDPGCVEPKLNSQWQPVYQVTFPDGS 155 (413) T ss_pred ecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEEcCceEEEEEcCCceEEEEEEecCce Confidence 56799999999999999999999999999999999999999987 5899999999999999888754 4455544 4566 Q ss_pred eEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_019705. 172 YAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIA 251 (424) Q Consensus 172 ~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~ 251 (424) ...|+++||||+|++++|+++|+||+..+..++....++++++.++|+||++|+|+|+++.. .++++.+++++.|++.+ T Consensus 156 ~~~~~~~evih~~~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~-~~~e~~~~~~~~~~~~~ 234 (413) T protein:vir:48 156 VDVLTQDEIWHVRTLTLDGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQK-LTPDAYERLKKDFEERH 234 (413) T ss_pred EEEEccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC-CCHHHHHHHHHHHHHHh Confidence 67899999999999999999999999999999999999999999999999999999999865 58888999999999877 Q ss_pred CC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHH Q lcl|NC_019705. 252 GG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPY 330 (424) Q Consensus 252 ~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~ 330 (424) ++ +|+|+++++++|++|++++.+++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.+.|+++||.|+ T Consensus 235 ~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~~n~e~~~~~f~~~~i~P~ 312 (413) T protein:vir:48 235 TGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRAT--FNNIEELGLGFINYSLVPY 312 (413) T ss_pred cCccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCC--cccHHHHHHHHHHHHHHHH Confidence 66 68999999999999999999999999999999999999999999999999876654 5699999999999999999 Q ss_pred HHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccch Q lcl|NC_019705. 331 ISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPI 410 (424) Q Consensus 331 ~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~ 410 (424) +++||++|+++|+++.++.+++++||+++|+++|.+++++++++++++|++|+||+|+++|+||+||||++++|+|++++ T Consensus 313 ~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~ 392 (413) T protein:vir:48 313 LTRIEQRINTGLVRESKQGKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMTTS 392 (413) T ss_pred HHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeecccccccc Confidence 99999999999999988777899999999999999999999999999999999999999999999999999999999999 Q ss_pred hhccccCCCcccCC Q lcl|NC_019705. 411 TDLGTNKEPRNNGA 424 (424) Q Consensus 411 ~~~~~~~~~~~~ga 424 (424) +.++++.++..+++ T Consensus 393 ~~~~~~~~~~~~~~ 406 (413) T protein:vir:48 393 PSAGDDNGKKKESG 406 (413) T ss_pred ccccccCCCCCCCC Confidence 98887765555555 No 20 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=100.00 E-value=9.6e-99 Score=557.94 Aligned_cols=400 Identities=25% Similarity=0.411 Sum_probs=342.4 Q ss_pred CchHHHHHhhccCcccCCcc----c----cc--h------------hhccccccccCcccccHHHHhccHHHHHHHHHHH Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPN----Q----GS--Q------------TGPVSAHGHLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~----~----~~--~------------~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia 71 (424) +|+|++|++.........+. . .. . ...+......++..|+.+.|+++++||+||++|| T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~ci~~Ia 80 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETRALRNMAVLRCVTLIS 80 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhhhhccHHHHHHHHHHH Confidence 89999877543321111110 0 00 0 0111233455677899999999999999999999 Q ss_pred HhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCc Q lcl|NC_019705. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSA 151 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~ 151 (424) ++||++|+++|+++++ .+...+|++.++|+.+||++||+++||+.++.+++++||||++++|+. |.+++|+|++|. T Consensus 81 ~~iA~lp~~v~~~~~~---~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~L~pl~~~ 156 (431) T protein:vir:10 81 GTIGMLPMNLISSDDS---KQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSG-NRPIRLIPMDRG 156 (431) T ss_pred HhhccCceEEEEecCc---eeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CceEEEEEEcCc Confidence 9999999999987432 345677999999999999999999999999999999999999999985 899999999999 Q ss_pred eeEEeec-CceEEEEEE-eCCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEE Q lcl|NC_019705. 152 NMDVKLV-GKKVVYRYQ-RDSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) Q Consensus 152 ~v~~~~~-~~~~~~~~~-~~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~ 229 (424) +|++..+ ++...|.|. .++....|+++||+|||++++|+++|+||+..+..++.+..+++++..++|+||++|+|+|+ T Consensus 157 ~v~~~~~~~~~~~y~~~~~~g~~~~~~~~dViHir~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 236 (431) T protein:vir:10 157 SAKGRLTSTWQIVYDYTTPTGDKIELPAREVFHLRDLSIDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIE 236 (431) T ss_pred eeEEEEcCCCeEEEEEEeCCceEEEEchhhEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEe Confidence 9998765 445556554 45667889999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC Q lcl|NC_019705. 230 TGEKVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS 308 (424) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~ 308 (424) ++.. +++++.+++++.|++.++| +|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..+++ T Consensus 237 ~~~~-ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~ 315 (431) T protein:vir:10 237 VPKE-LSDNAYGRMKASVQENHTGSENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTS 315 (431) T ss_pred cCCC-CCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCC Confidence 9864 6888899999999887665 6999999999999999999999999999999999999999999999999987654 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCC----CCCHH Q lcl|NC_019705. 309 TSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAG----LRTIN 384 (424) Q Consensus 309 ~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g----~~t~N 384 (424) +++|+|++.++|+++||.||+++||++|+++|+++.++.+++++||+++|+++|.+++++.++++++.| +||+| T Consensus 316 --t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~N 393 (431) T protein:vir:10 316 --WGSGIEQLAIFFIQYGLSHWFVSWEQAAARAFLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQN 393 (431) T ss_pred --ccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHH Confidence 456999999999999999999999999999999988877889999999999999999999999998655 59999 Q ss_pred HHHHHhCCCCCCC--CCeeeecccccchhhccccCCCc Q lcl|NC_019705. 385 EMRRTDNLPPLPG--GDVAMRQSQYVPITDLGTNKEPR 420 (424) Q Consensus 385 E~R~~~g~~p~~g--gd~~~~~~n~~~~~~~~~~~~~~ 420 (424) |+|+++|+||++| ||++++|.|..+.++..+.+..+ T Consensus 394 E~R~~~gl~p~~~~~gD~~~~p~n~~~~~~~~~~p~~~ 431 (431) T protein:vir:10 394 EVREMLDLPRADDPVADQLRNPMTQKQKGSGDEPPATT 431 (431) T ss_pred HHHHHhCCCCCCCccccceecccccccCCCCCCCCCCC Confidence 9999999999955 99999999998865432221111 No 21 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=100.00 E-value=2.8e-98 Score=555.41 Aligned_cols=399 Identities=23% Similarity=0.380 Sum_probs=350.1 Q ss_pred chHHHHHhhccCcccCC-ccc---cchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcc Q lcl|NC_019705. 15 GWWARLQSWFVGGRLVT-PNQ---GSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 15 G~~~~~~~~~~~~~~~~-~~~---~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~ 90 (424) =||+++ |++++... ... ......+++....++..|+.+.++++|+||+||++||++||++||++|++++++. T Consensus 1 m~~~~~---f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l~~~~~~~~~~~~- 76 (416) T protein:vir:12 1 MLLERM---FEKRSGSSDHEDGFNNILLNMFGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAKLPIHTYKRTDGGI- 76 (416) T ss_pred Cccchh---cccccCccccCccchhHHHHhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcc- Confidence 234444 33322211 111 1223345555567788899999999999999999999999999999999876553 Q ss_pred ceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeec--CceEEEEEEe Q lcl|NC_019705. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV--GKKVVYRYQR 168 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~--~~~~~~~~~~ 168 (424) +...+|++.++|+.+||++||+++||+.++.+++++||||+++.|+..|.+.+||||+|++|++..+ ++..+|.+.. T Consensus 77 -~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~ 155 (416) T protein:vir:12 77 -ERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVHPTTGMLWYQTVL 155 (416) T ss_pred -ccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEeCCCcEEEEEEec Confidence 3346789999999999999999999999999999999999999999999999999999999997764 4567788888 Q ss_pred CCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHH Q lcl|NC_019705. 169 DSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK 248 (424) Q Consensus 169 ~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~ 248 (424) ++..+.++++||+|+|++++++++|+||+.++..++.+..++++++.++|+||+.|+++|+++.. .++++.+++++.|+ T Consensus 156 ~g~~~~~~~~eiih~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~e~~~~~~~~~~ 234 (416) T protein:vir:12 156 NGKAIELYDYEVLHFKGLSTDGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAF-LDEKPKENVRKEWK 234 (416) T ss_pred CCeEEEecCccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCC-CCHHHHHHHHHHHH Confidence 88889999999999999999999999999999999999999999999999999999999999864 68899999999998 Q ss_pred HHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHH Q lcl|NC_019705. 249 EIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ 328 (424) Q Consensus 249 ~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~ 328 (424) ... ++++++++++|++|++++++++|+||+|.+++++++||++|||||.+||...+++ ++|+|++.++|+++||. T Consensus 235 ~~~---~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e~~~~~f~~~~l~ 309 (416) T protein:vir:12 235 RVN---KVENIAIIDYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKAT--FSNIEHQSIEYVRNTLQ 309 (416) T ss_pred HHh---cCCCeeecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCC--cccHHHHHHHHHHHHHH Confidence 765 4678999999999999999999999999999999999999999999999877665 55999999999999999 Q ss_pred HHHHHHHHHHHhhccChhhhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccc Q lcl|NC_019705. 329 PYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQY 407 (424) Q Consensus 329 P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~ 407 (424) |++++||++|+++|+++.++. +++++||+++++++|.+++++++.+++++|+||+||+|+++|+||+||||++++|+|+ T Consensus 310 P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~n~ 389 (416) T protein:vir:12 310 PWIVNFEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGDKYISSLNY 389 (416) T ss_pred HHHHHHHHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccc Confidence 999999999999999988764 5889999999999999999999999999999999999999999999999999999999 Q ss_pred cchhhccccCCCcccCC Q lcl|NC_019705. 408 VPITDLGTNKEPRNNGA 424 (424) Q Consensus 408 ~~~~~~~~~~~~~~~ga 424 (424) ++++.+.+++..+.+++ T Consensus 390 ~~~~~~~~~~~~~~~~~ 406 (416) T protein:vir:12 390 VFLDFLEEYQRLKAGGA 406 (416) T ss_pred ccccccchhhccccccc Confidence 99987765544333222 No 22 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=100.00 E-value=3e-98 Score=555.27 Aligned_cols=401 Identities=25% Similarity=0.456 Sum_probs=344.4 Q ss_pred chHHHHHhhccCcccCCccccch-hhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 15 GWWARLQSWFVGGRLVTPNQGSQ-TGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 15 G~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) =||++.. .+......+....+ ...+++..+.++..||++.|+++++||+||++||++||++||++|++++++. +. T Consensus 1 m~~~~~~--~~~~~~~~~~~~~~~~~~~g~~~s~~~~~v~~~~al~~~~v~~cv~~ia~~ia~lp~~~~~~~~~~~--~~ 76 (419) T protein:vir:80 1 MFFSRQL--LSNLGQTQPGSGGWVSALLGSARSEAGQVVTPASALSLTVLQNCVTLLAESIAQLPVELYERSGDDR--KP 76 (419) T ss_pred CCccccc--ccccCcCCCCcchhhHHhhcccccccCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEecCCCc--cc Confidence 1122211 00011112211222 2233455566788999999999999999999999999999999999887763 44 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCce-EEEEEEeCCce Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-VVYRYQRDSEY 172 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~-~~~~~~~~~~~ 172 (424) ..+|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|.+++||||+|++|++..+++. ..|++. + . T Consensus 77 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~~~~~~~~y~~~--~-~ 153 (419) T protein:vir:80 77 ATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMKGPDLKPMYRVA--G-A 153 (419) T ss_pred ccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCceEEEEEc--C-c Confidence 56899999999999999999999999999999999999999999999999999999999999887654 344332 2 2 Q ss_pred EEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC---CCCHHHHHHHHHHHHH Q lcl|NC_019705. 173 AEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK---VLTEQQRSQVEENFKE 249 (424) Q Consensus 173 ~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~---~~~~~~~~~~~~~~~~ 249 (424) ..+++++|+|+|+++.|+++|+||+..++.++....++++++.++|+||++|+|+|+++.. ..++++.+++++.|++ T Consensus 154 ~~~~~~~i~h~~~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~ 233 (419) T protein:vir:80 154 DPLPQRLVHHVRWMSINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNA 233 (419) T ss_pred cccchhheEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHH Confidence 3588999999999999999999999999999999999999999999999999999998754 3478888999999998 Q ss_pred HhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHH Q lcl|NC_019705. 250 IAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ 328 (424) Q Consensus 250 ~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~ 328 (424) .+++ +|+|++++|++|++|++++.+++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.+.|+++||. T Consensus 234 ~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t--~~n~e~~~~~f~~~~l~ 311 (419) T protein:vir:80 234 KFGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERAT--FSNIEHQSLQFVIYTLL 311 (419) T ss_pred HhcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCC--cccHHHHHHHHHHHHHH Confidence 7766 58899999999999999999999999999999999999999999999999877665 56899999999999999 Q ss_pred HHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccccc Q lcl|NC_019705. 329 PYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYV 408 (424) Q Consensus 329 P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~ 408 (424) |++.+||++|+++|+++.++..++++||+++|+++|.+++++.+++++++|++|+||+|+++|+||+||||++++|+|++ T Consensus 312 P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gGD~~~~~~n~~ 391 (419) T protein:vir:80 312 PWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGDIYLSPMNMV 391 (419) T ss_pred HHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccc Confidence 99999999999999999888788999999999999999999999999999999999999999999999999999999999 Q ss_pred chhhccccCCCcccCC Q lcl|NC_019705. 409 PITDLGTNKEPRNNGA 424 (424) Q Consensus 409 ~~~~~~~~~~~~~~ga 424 (424) +++...+.+.++.+-+ T Consensus 392 ~~~~~~~~~~~~~~~~ 407 (419) T protein:vir:80 392 DASKPQPIPMGKTEPT 407 (419) T ss_pred cccccccccCCCCCch Confidence 8877655433332222 No 23 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=100.00 E-value=6.5e-98 Score=553.40 Aligned_cols=406 Identities=26% Similarity=0.427 Sum_probs=341.6 Q ss_pred CchHHHHHhhccCcccCCccccch-----hhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCC Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQ-----TGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~ 88 (424) +|||++++++..++.......... .....+....+++.|+.+.|+++++||+||++||++||++||++|++..++ T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGT 80 (457) T ss_pred CchhhhhhccccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHhhCceEEEEecCCc Confidence 999999876554432211111111 111223344578889999999999999999999999999999999876433 Q ss_pred ccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc-----eEE Q lcl|NC_019705. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK-----KVV 163 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~-----~~~ 163 (424) . +... ++....|+.+||++||+++||+.++.+++++||||+++.++ .|.+.+||||+|.+|++..+.. ..+ T Consensus 81 -~-~~~~-~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~ 156 (457) T protein:vir:62 81 -R-KEID-TPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVMVDGLRRKVF 156 (457) T ss_pred -c-cccc-chHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEeccCCccceeE Confidence 2 2333 44444445689999999999999999999999999998665 6899999999999999876521 222 Q ss_pred EEE--EeCCc---eEEecHhHEEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCH Q lcl|NC_019705. 164 YRY--QRDSE---YAEFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 164 ~~~--~~~~~---~~~~~~~eiih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~ 237 (424) +.| ..++. ...|+++||||||++++++ ++|+||+..++.++.+..++++++.++|+||++|+|+|+++.. +++ T Consensus 157 ~~y~~~~~g~~~~~~~~~~~eiih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-ls~ 235 (457) T protein:vir:62 157 EAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGT-MSE 235 (457) T ss_pred EEEEEccCCceeEEEeeCccceEEecCCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCC-CCH Confidence 333 33332 2468999999999998877 8999999999999999999999999999999999999999864 688 Q ss_pred HHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHH Q lcl|NC_019705. 238 QQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e 316 (424) ++++++++.|++.++| +|+|++++|++|++|++++++++|+||+|++++++++||++|||||.+||..++++++++|+| T Consensus 236 e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e 315 (457) T protein:vir:62 236 EGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLA 315 (457) T ss_pred HHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHH Confidence 9999999999987666 689999999999999999999999999999999999999999999999999999998889999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC Q lcl|NC_019705. 317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP 396 (424) Q Consensus 317 ~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ 396 (424) ++.++|+++||.|++++||++|+++|+++.++..++++||+++|+++|.+++++++.+++++|+||+||+|+++|+||+| T Consensus 316 q~~~~f~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ 395 (457) T protein:vir:62 316 EQNIAFTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLP 395 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 99999999999999999999999999999887778899999999999999999999999999999999999999999999 Q ss_pred CC--CeeeecccccchhhccccCCCccc---------CC Q lcl|NC_019705. 397 GG--DVAMRQSQYVPITDLGTNKEPRNN---------GA 424 (424) Q Consensus 397 gg--d~~~~~~n~~~~~~~~~~~~~~~~---------ga 424 (424) || |++++|+|+++++...+.+..... ++ T Consensus 396 ~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (457) T protein:vir:62 396 DGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPA 434 (457) T ss_pred CCCcceeeeccccccccccccccccCCCccCCCCccCCC Confidence 87 999999999988765443211111 11 No 24 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=100.00 E-value=6e-98 Score=553.57 Aligned_cols=401 Identities=23% Similarity=0.284 Sum_probs=341.0 Q ss_pred hHHHHHhhccCcccCCccccchhhc-c------ccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCC Q lcl|NC_019705. 16 WWARLQSWFVGGRLVTPNQGSQTGP-V------SAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~-~------~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~ 88 (424) ||+.++..........+.....+.. + .+..+.++..|+.+.|+++++||+||++||++||++||++|+++.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~g 80 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADPEAVLSFHAVFACISLISQDIAKMRLRLMQTDAQG 80 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEeccCC Confidence 4444322111111112222111111 1 12235567889999999999999999999999999999999998877 Q ss_pred ccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecC-ceEEEEEE Q lcl|NC_019705. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG-KKVVYRYQ 167 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~-~~~~~~~~ 167 (424) ..++ ..+| ...+|+.+||++||+++||+.++.+++++||||++++|+..|.+++||||+|++|++..++ +...|.|. T Consensus 81 ~~~~-~~~~-~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~y~~~ 158 (454) T protein:vir:93 81 IRRE-TRRG-DIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADDGEVFYRIT 158 (454) T ss_pred ccch-hhhH-HHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCCCcEEEEEE Confidence 6554 3444 4556667999999999999999999999999999999999999999999999999988764 45667776 Q ss_pred eCC-----ceEEecHhHEEEeec-CCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHH Q lcl|NC_019705. 168 RDS-----EYAEFSQKEIFHLKG-FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRS 241 (424) Q Consensus 168 ~~~-----~~~~~~~~eiih~r~-~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~ 241 (424) ... ....++++||||+|+ ++.++++|+||+..+..++.+..++++++.++|+||++|+|+|+++.. +++++.+ T Consensus 159 ~~~~~~~~~~~~~~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~e~~~ 237 (454) T protein:vir:93 159 PDRNCGITEAVTVPAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGS-ITEENAK 237 (454) T ss_pred eccccccceeEEecCcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCC-CCHHHHH Confidence 543 356799999999996 567899999999999999999999999999999999999999999864 6889999 Q ss_pred HHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHH Q lcl|NC_019705. 242 QVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG 321 (424) Q Consensus 242 ~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~ 321 (424) ++++.|++.++|.|+|++++|++|++|++++.+++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.++ T Consensus 238 ~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e~~~~~ 315 (454) T protein:vir:93 238 KLKSNWDSGYTGENAGKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPS--SDNVEALEQQ 315 (454) T ss_pred HHHHHHHHHhcccccCCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCc--chhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999887664 5689999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCee Q lcl|NC_019705. 322 FLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVA 401 (424) Q Consensus 322 ~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~ 401 (424) |+++||.|++..||++|+++|+++.+ ++++||+++|+++|.+++++++.+++++|+||+||+|+++|+||+||||++ T Consensus 316 f~~~~l~P~~~~ie~~ln~~L~~~~~---~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~ 392 (454) T protein:vir:93 316 YYSQCLQTLIESIELLLDEALETGEN---ESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDAL 392 (454) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCC---cEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCee Confidence 99999999999999999999997654 468999999999999999999999999999999999999999999999999 Q ss_pred eecccccchhhccccCCCcccCC Q lcl|NC_019705. 402 MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++++|.++++.+++.+..++... T Consensus 393 ~~~~~~~~~~~~~~~~~~~~~~~ 415 (454) T protein:vir:93 393 YLQQQNYSLEALSRRDAREDPFA 415 (454) T ss_pred eeccCccchHhhhccCcccCCCC Confidence 99999999987765443322211 No 25 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=100.00 E-value=1.1e-97 Score=552.20 Aligned_cols=398 Identities=25% Similarity=0.418 Sum_probs=340.8 Q ss_pred hHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccceecc Q lcl|NC_019705. 16 WWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDL 95 (424) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~ 95 (424) || ++..|..++............+.+ ...++..|+.++|+++++||+||++||++||++||+||++.+ +. +... T Consensus 1 m~--f~~~~~~~~~~~~~~~~~~~~~~g-~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~-~~--~~~~ 74 (409) T protein:vir:10 1 ML--FRKGFKNQSQEISIDDKKILEWLG-INPSETYVNGKSCLKQATVFGCIRILSDNISKLPIKIYQKKD-GI--KRVP 74 (409) T ss_pred Cc--ccccccCcCCCCCCChHHHHHHhc-CCcCcceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecC-Ce--eecc Confidence 22 111222222211111111111222 244577889999999999999999999999999999998643 32 2345 Q ss_pred chHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc-------eEEEEEE- Q lcl|NC_019705. 96 SNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK-------KVVYRYQ- 167 (424) Q Consensus 96 ~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~-------~~~~~~~- 167 (424) +|++.++|+.+||++||+++||+.++.+++++||||++++|+..|.+++||||+|++|++..++. ...|.+. T Consensus 75 ~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~~~y~~~~ 154 (409) T protein:vir:10 75 DHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLNSENNVWYLYTD 154 (409) T ss_pred CchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCccccccceEEEEEEe Confidence 79999999999999999999999999999999999999999999999999999999999887643 2345554 Q ss_pred eCCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHH Q lcl|NC_019705. 168 RDSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENF 247 (424) Q Consensus 168 ~~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~ 247 (424) ..+....|+++||||+|++++|+++|+||+..+..+++...++++++.++|+||++|+|+|+++.. +++++.+++++.| T Consensus 155 ~~g~~~~~~~~evih~r~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-l~~e~~~~~~~~~ 233 (409) T protein:vir:10 155 DLGQRHKFMSDEILHFKGLTADGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGD-LNPEAEEVFKENF 233 (409) T ss_pred CCceeEEeccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCC-CCHHHHHHHHHHH Confidence 345678899999999999999999999999999999999999999999999999999999999864 6888999999999 Q ss_pred HHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHH Q lcl|NC_019705. 248 KEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 248 ~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) ++.+++ .|+|+++++++|++|++++.+++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.++|+++| T Consensus 234 ~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~--~~~~e~~~~~f~~~~ 311 (409) T protein:vir:10 234 ERMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRAT--HSNITEQNREFYIDT 311 (409) T ss_pred HHHhccccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCc--cccHHHHHHHHHHHH Confidence 987766 67999999999999999999999999999999999999999999999999887664 568999999999999 Q ss_pred HHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecc Q lcl|NC_019705. 327 LQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQS 405 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~ 405 (424) |.|++++||++|+++|+++.++. +++++||+++|+++|.+++++++.+++++|++|+||+|+++|+||+||||++++|+ T Consensus 312 l~P~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD~~~~~~ 391 (409) T protein:vir:10 312 LQSILNMYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGDVLLING 391 (409) T ss_pred HHHHHHHHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecc Confidence 99999999999999999988764 58899999999999999999999999999999999999999999999999999999 Q ss_pred cccchhhccccCCCcccCC Q lcl|NC_019705. 406 QYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 406 n~~~~~~~~~~~~~~~~ga 424 (424) |++|++.++++. .++|- T Consensus 392 n~~~~~~~~~~~--~kgGe 408 (409) T protein:vir:10 392 NMIPVKMAGEQY--SKGGE 408 (409) T ss_pred Cccchhhccccc--cccCC Confidence 999998876522 12222 No 26 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=100.00 E-value=1.2e-97 Score=551.92 Aligned_cols=406 Identities=26% Similarity=0.449 Sum_probs=346.4 Q ss_pred CchHHHHHhhccCcccCCccccc----hhhc-cccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCC Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGS----QTGP-VSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~----~~~~-~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~ 88 (424) +|||++|++++.+.......... .... ..+....+++.|+.+.|+++++||+||++||++||++||++|++..++ T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGS 80 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 99999998877654322211111 1111 123345578899999999999999999999999999999999976544 Q ss_pred ccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc-----eEE Q lcl|NC_019705. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK-----KVV 163 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~-----~~~ 163 (424) .+....|++..+|+.+|| +||+++||+.++.+++++||||+++.++ .|.+++||||+|++|++..+.. ..+ T Consensus 81 --~~~~~~~~l~~~ln~~~n-~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~ 156 (457) T protein:vir:13 81 --RKEIVTPEWLDYPNAEPG-GMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVDGLRRKVF 156 (457) T ss_pred --ccccccchHHHhccccCC-CCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCCCccceeE Confidence 334567889999986666 7999999999999999999999999776 5999999999999999876532 122 Q ss_pred --EEEEeCCc---eEEecHhHEEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCH Q lcl|NC_019705. 164 --YRYQRDSE---YAEFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 164 --~~~~~~~~---~~~~~~~eiih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~ 237 (424) |.+..++. ...|+++||||+|++++++ ++|+||+..+..+|.+..++++++.++|+||++|+++|+++.. +++ T Consensus 157 ~~y~~~~~~~~~~~~~~~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-ls~ 235 (457) T protein:vir:13 157 EAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGT-MSE 235 (457) T ss_pred EEEEEecCCceeeEEeeCccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCC-CCH Confidence 33333332 2468999999999999876 8999999999999999999999999999999999999999864 688 Q ss_pred HHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHH Q lcl|NC_019705. 238 QQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e 316 (424) ++++++++.|++.++| +|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++++++|+| T Consensus 236 e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e 315 (457) T protein:vir:13 236 EGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLA 315 (457) T ss_pred HHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHH Confidence 9999999999987765 689999999999999999999999999999999999999999999999999999988888999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC Q lcl|NC_019705. 317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP 396 (424) Q Consensus 317 ~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ 396 (424) ++.++|+++||.||++.||++|+++|+++.++..++++||+++|+++|.+++++++.+++++|+||+||+|+++|+||+| T Consensus 316 q~~~~f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ 395 (457) T protein:vir:13 316 EQNIAFTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLP 395 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCccccCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 99999999999999999999999999999887778899999999999999999999999999999999999999999999 Q ss_pred CC--CeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 397 GG--DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 397 gg--d~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) || |++++|+|+++++...+.+......| T Consensus 396 ~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~ 425 (457) T protein:vir:13 396 DGLGEKYRVPLNLGEVGEEPEPEPAPAPPA 425 (457) T ss_pred CCcccceeeccccccccccccccccCCCCC Confidence 87 99999999998876444322111111 No 27 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=100.00 E-value=1.6e-95 Score=540.25 Aligned_cols=397 Identities=26% Similarity=0.460 Sum_probs=338.4 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +|+|+|++.......... ...............+..++.+.|+++++||+||++||++||++||++|+.++++. T Consensus 1 Mgl~~~~f~~~~~~~~~~--~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~---- 74 (409) T protein:vir:84 1 MSLFTRIFSGPSEERTLT--KISGIPSPAEDWAMHGDRPGANSAMTLGAFYACVTLLADTVASLSIDAYRKKDNVR---- 74 (409) T ss_pred CchhhhhhcCCCcccccc--cccccccccchhhccCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcc---- Confidence 899998854432211111 11111111222344677789999999999999999999999999999999765542 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCCCCceeEEEEecCceeEEeec--CceEEEEEEeCC Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNSAGDVISLLPLQSANMDVKLV--GKKVVYRYQRDS 170 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~-r~~~G~~~~l~~l~~~~v~~~~~--~~~~~~~~~~~~ 170 (424) ...|++.++|+.+||++||+++||+.++.+++++||+|+++. ++..|.+++||||+|++|++... .....+.+.... T Consensus 75 ~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~~~~~~~~~~ 154 (409) T protein:vir:84 75 IPVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDGDWIEPVYRI 154 (409) T ss_pred cccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcceEEEEEecC Confidence 346899999999999999999999999999999999999986 78889999999999999998753 333444443444 Q ss_pred ceEEecHhHEEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHH Q lcl|NC_019705. 171 EYAEFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKE 249 (424) Q Consensus 171 ~~~~~~~~eiih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~ 249 (424) .+..|+++||||+|+++.++ ++|+||+..+..++....++++++.++|+||++|+|+|+.+.. +++++.+++++.|.+ T Consensus 155 ~g~~~~~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~e~~~~~~~~~~~ 233 (409) T protein:vir:84 155 DGKVVPNHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDAD-LTPDQVKQTQKQWIQ 233 (409) T ss_pred CceEEchhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCC-CCHHHHHHHHHHHHH Confidence 45679999999999988776 7899999999999999999999999999999999999998864 688888888888877 Q ss_pred HhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHH Q lcl|NC_019705. 250 IAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 250 ~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P 329 (424) .. .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++++++|+|++.++|+++||.| T Consensus 234 ~~--~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P 311 (409) T protein:vir:84 234 SH--HNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLP 311 (409) T ss_pred Hh--ccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHH Confidence 65 4688999999999999999999999999999999999999999999999999988888889999999999999999 Q ss_pred HHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccc Q lcl|NC_019705. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 330 ~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~ 409 (424) +++.||++|+++|.. +++++||++.|+++|.+++++++.+++++|+||+||+|+++|+||+||||++++|+|++| T Consensus 312 ~~~~ie~~l~~~L~~-----g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ggD~~~~~~n~~~ 386 (409) T protein:vir:84 312 WLRCIEQALDTFLPR-----GQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPEGDIHLQPMNFVP 386 (409) T ss_pred HHHHHHHHHHHhccC-----CCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccc Confidence 999999999998832 467899999999999999999999999999999999999999999999999999999999 Q ss_pred hhhcccc---CCCcccCC Q lcl|NC_019705. 410 ITDLGTN---KEPRNNGA 424 (424) Q Consensus 410 ~~~~~~~---~~~~~~ga 424 (424) ++.+... ++++.+++ T Consensus 387 ~~~~~~~~~~~~~~~~~~ 404 (409) T protein:vir:84 387 LGYVPPEEPAQEPQPNSA 404 (409) T ss_pred cccCCccccCcCCCCCCc Confidence 9876443 23333333 No 28 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=100.00 E-value=4.5e-95 Score=537.84 Aligned_cols=411 Identities=18% Similarity=0.287 Sum_probs=335.3 Q ss_pred CCCCcccccCCCCCchHH--HHHhhccCcccC---Cccccchh--hccccccccCcccccHHHHhccHHHHHHHHHHHHh Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWA--RLQSWFVGGRLV---TPNQGSQT--GPVSAHGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~--~~~~~~~~~~~~---~~~~~~~~--~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ 73 (424) ....-|-.++.++.-=-. .+++.|.+.+.. .+..+... ....+....++..++...|+++++||+||++||++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~acv~~Ia~~ 83 (441) T protein:vir:98 4 YNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASD 83 (441) T ss_pred ecCccceeccccccchhhhhhccccccccccccccCCCcchHHHHHHhhcccccCccccchhhhhccHHHHHHHHHHHHh Confidence 223345566666551111 123334332221 12222111 11222333456678999999999999999999999 Q ss_pred hccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCcee Q lcl|NC_019705. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) ||++|+++++.+. ....|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|.|++||||+|++| T Consensus 84 iA~lpl~~~~~~~------~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v 157 (441) T protein:vir:98 84 LARMPIRVTVNGQ------INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEI 157 (441) T ss_pred hccCceEEecCCc------ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcee Confidence 9999999986432 3456899999999999999999999999999999999999999999999999999999999 Q ss_pred EEeecCc-eEEEEEEe-----CCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCcee Q lcl|NC_019705. 154 DVKLVGK-KVVYRYQR-----DSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQI 227 (424) Q Consensus 154 ~~~~~~~-~~~~~~~~-----~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~v 227 (424) ++..+++ ..+|.+.. .+....++++||||||+++.|+++|+||+..+..++.+..++++++.++|+||++|+|+ T Consensus 158 ~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~dviHir~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gi 237 (441) T protein:vir:98 158 ELKLDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGI 237 (441) T ss_pred EEEECCCCcEEEEEEEeccCcceeeEEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEE Confidence 9988654 34444332 22346799999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCCCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC Q lcl|NC_019705. 228 LSTGEKVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 228 l~~~~~~~~~~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~ 306 (424) |+++....++++++++++.|++.++| +|+|++++|++|++|++++.+++|+||+|.+++++++||++|||||.+||... T Consensus 238 l~~~~~~~~~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~ 317 (441) T protein:vir:98 238 LKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET 317 (441) T ss_pred EEeCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC Confidence 99998877889899999999887665 78999999999999999999999999999999999999999999999998643 Q ss_pred CCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHH Q lcl|NC_019705. 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEM 386 (424) Q Consensus 307 ~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~ 386 (424) . + .+.+++...|. +||.|++++||++|+++|+++.+ +++++||+++|++.|.+++++++++++++|+||+||+ T Consensus 318 ~-~---~s~~q~~~~y~-~tl~P~~~~ie~~ln~~L~~~~~--~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~ 390 (441) T protein:vir:98 318 A-N---MSITDANLDYL-STLKPYITCVCAELNFKFNDEYV--NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEI 390 (441) T ss_pred C-C---ccHHHHHHHHH-HHHHHHHHHHHHHHHhhcccccc--CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHH Confidence 2 2 24566666655 69999999999999999987654 4578999999999999999999999999999999999 Q ss_pred HHHhCCCCCCCCC--eeeecccccchhhccccCCCcccC-------C Q lcl|NC_019705. 387 RRTDNLPPLPGGD--VAMRQSQYVPITDLGTNKEPRNNG-------A 424 (424) Q Consensus 387 R~~~g~~p~~ggd--~~~~~~n~~~~~~~~~~~~~~~~g-------a 424 (424) |+++|+||+|||| .+++|+|++|++.+++.+..+.++ + T Consensus 391 R~~~gl~pi~gGd~~~~~~~~n~~~~~~~~~~q~~~~~~~~~~~kgG 437 (441) T protein:vir:98 391 RQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGG 437 (441) T ss_pred HHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCC Confidence 9999999999988 578999999999876533222221 1 No 29 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=100.00 E-value=6.6e-95 Score=536.91 Aligned_cols=411 Identities=18% Similarity=0.278 Sum_probs=333.2 Q ss_pred CCCCcccccCCCCCchHH--HHHhhccCccc---CCccccchh--hccccccccCcccccHHHHhccHHHHHHHHHHHHh Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWA--RLQSWFVGGRL---VTPNQGSQT--GPVSAHGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~--~~~~~~~~~~~---~~~~~~~~~--~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ 73 (424) ....-|-|++.++.-=-+ .+++.|.+.+. ..+..+... ..+.+....++..|+...|+++++||+||++||++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~Ia~~ 83 (441) T protein:vir:94 4 YNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASD 83 (441) T ss_pred ccCccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHHHHh Confidence 222345555555441111 12233332221 122222111 11222333456678899999999999999999999 Q ss_pred hccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCcee Q lcl|NC_019705. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) ||++||++++.+ +....|++.++|+.+||++||+++||+.++.+++++||||++++|+..|+|++|+||+|++| T Consensus 84 iA~lp~~~~~~~------~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v 157 (441) T protein:vir:94 84 LARMPIRVTVNG------QINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEI 157 (441) T ss_pred hccCceeeecCc------cccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcee Confidence 999999998643 23456899999999999999999999999999999999999999999999999999999999 Q ss_pred EEeecCc-eEEEEEEe-----CCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCcee Q lcl|NC_019705. 154 DVKLVGK-KVVYRYQR-----DSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQI 227 (424) Q Consensus 154 ~~~~~~~-~~~~~~~~-----~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~v 227 (424) ++..+++ ..+|.+.. .+..+.++++||||||+++.|+++|+||+..+..++++..++++++.++|+||++|+|+ T Consensus 158 ~v~~d~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~~~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi 237 (441) T protein:vir:94 158 ELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGI 237 (441) T ss_pred EEEECCCccEEEEEEEeccCCceeEEEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEE Confidence 9988754 33444432 22346799999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCCCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC Q lcl|NC_019705. 228 LSTGEKVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 228 l~~~~~~~~~~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~ 306 (424) |+++....++++++++++.|++.++| .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||... T Consensus 238 l~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~ 317 (441) T protein:vir:94 238 LKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET 317 (441) T ss_pred EEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC Confidence 99998877889999999999887666 68999999999999999999999999999999999999999999999998643 Q ss_pred CCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHH Q lcl|NC_019705. 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEM 386 (424) Q Consensus 307 ~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~ 386 (424) . + .+.+++... +.+||.|++++||++|+++|+++.. .++++||++.|++.|.+++++++++++++|+||+||+ T Consensus 318 ~-~---~s~~q~~~~-~~~tl~P~~~~ie~eln~kl~~~~~--~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~ 390 (441) T protein:vir:94 318 A-N---MSITDANLD-YLSTLKPYITCVCAELNFKFNDEYV--NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEI 390 (441) T ss_pred C-C---ccHHHHHHH-HHHHHHHHHHHHHHHHhhhcccccc--CceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHH Confidence 2 2 234566555 4579999999999999999987643 4678999999999999999999999999999999999 Q ss_pred HHHhCCCCCCCCC--eeeecccccchhhccccCCCccc-------CC Q lcl|NC_019705. 387 RRTDNLPPLPGGD--VAMRQSQYVPITDLGTNKEPRNN-------GA 424 (424) Q Consensus 387 R~~~g~~p~~ggd--~~~~~~n~~~~~~~~~~~~~~~~-------ga 424 (424) |+++|+||+|||| .+++++|++|++.+++.+..+.+ |+ T Consensus 391 R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgG 437 (441) T protein:vir:94 391 RQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGG 437 (441) T ss_pred HHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCC Confidence 9999999999988 58899999999987543222211 11 No 30 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=100.00 E-value=6.6e-95 Score=536.91 Aligned_cols=411 Identities=18% Similarity=0.278 Sum_probs=333.2 Q ss_pred CCCCcccccCCCCCchHH--HHHhhccCccc---CCccccchh--hccccccccCcccccHHHHhccHHHHHHHHHHHHh Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWA--RLQSWFVGGRL---VTPNQGSQT--GPVSAHGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~--~~~~~~~~~~~---~~~~~~~~~--~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ 73 (424) ....-|-|++.++.-=-+ .+++.|.+.+. ..+..+... ..+.+....++..|+...|+++++||+||++||++ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~Ia~~ 83 (441) T protein:vir:79 4 YNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASD 83 (441) T ss_pred ccCccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHHHHh Confidence 222345555555441111 12233332221 122222111 11222333456678899999999999999999999 Q ss_pred hccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCcee Q lcl|NC_019705. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) ||++||++++.+ +....|++.++|+.+||++||+++||+.++.+++++||||++++|+..|+|++|+||+|++| T Consensus 84 iA~lp~~~~~~~------~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v 157 (441) T protein:vir:79 84 LARMPIRVTVNG------QINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEI 157 (441) T ss_pred hccCceeeecCc------cccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcee Confidence 999999998643 23456899999999999999999999999999999999999999999999999999999999 Q ss_pred EEeecCc-eEEEEEEe-----CCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCcee Q lcl|NC_019705. 154 DVKLVGK-KVVYRYQR-----DSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQI 227 (424) Q Consensus 154 ~~~~~~~-~~~~~~~~-----~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~v 227 (424) ++..+++ ..+|.+.. .+..+.++++||||||+++.|+++|+||+..+..++++..++++++.++|+||++|+|+ T Consensus 158 ~v~~d~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~~~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi 237 (441) T protein:vir:79 158 ELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGI 237 (441) T ss_pred EEEECCCccEEEEEEEeccCCceeEEEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEE Confidence 9988754 33444432 22346799999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCCCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC Q lcl|NC_019705. 228 LSTGEKVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 228 l~~~~~~~~~~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~ 306 (424) |+++....++++++++++.|++.++| .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||... T Consensus 238 l~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~ 317 (441) T protein:vir:79 238 LKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET 317 (441) T ss_pred EEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC Confidence 99998877889999999999887666 68999999999999999999999999999999999999999999999998643 Q ss_pred CCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHH Q lcl|NC_019705. 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEM 386 (424) Q Consensus 307 ~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~ 386 (424) . + .+.+++... +.+||.|++++||++|+++|+++.. .++++||++.|++.|.+++++++++++++|+||+||+ T Consensus 318 ~-~---~s~~q~~~~-~~~tl~P~~~~ie~eln~kl~~~~~--~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~ 390 (441) T protein:vir:79 318 A-N---MSITDANLD-YLSTLKPYITCVCAELNFKFNDEYV--NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEI 390 (441) T ss_pred C-C---ccHHHHHHH-HHHHHHHHHHHHHHHHhhhcccccc--CceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHH Confidence 2 2 234566555 4579999999999999999987643 4678999999999999999999999999999999999 Q ss_pred HHHhCCCCCCCCC--eeeecccccchhhccccCCCccc-------CC Q lcl|NC_019705. 387 RRTDNLPPLPGGD--VAMRQSQYVPITDLGTNKEPRNN-------GA 424 (424) Q Consensus 387 R~~~g~~p~~ggd--~~~~~~n~~~~~~~~~~~~~~~~-------ga 424 (424) |+++|+||+|||| .+++++|++|++.+++.+..+.+ |+ T Consensus 391 R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgG 437 (441) T protein:vir:79 391 RQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGG 437 (441) T ss_pred HHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCC Confidence 9999999999988 58899999999987543222211 11 No 31 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=100.00 E-value=1.7e-94 Score=534.63 Aligned_cols=401 Identities=19% Similarity=0.291 Sum_probs=341.3 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccC Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~ 87 (424) |.|-..-|++.|++..+..+....+..... .+..+...++..++.+.|+++|+|++||++||++||++||++|++++. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~~ 78 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWIDQSTSKLY--DFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKV 78 (412) T ss_pred CccchhhhhhhhhhhhHhhhhhcccccccc--cccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEeecccc Confidence 999888899998877665443322222111 122333446667889999999999999999999999999999986542 Q ss_pred CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc--eEEEE Q lcl|NC_019705. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYR 165 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~--~~~~~ 165 (424) .+|++.++|+.+||++||+++||+.++.+++++||||++++|+..|.+++|+||+|++|++..+.+ ..+|. T Consensus 79 -------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~ 151 (412) T protein:vir:26 79 -------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYS 151 (412) T ss_pred -------ccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEE Confidence 358899999999999999999999999999999999999999999999999999999999887653 45555 Q ss_pred EEe-CCceEEecHhHEEEeecC-CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019705. 166 YQR-DSEYAEFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 166 ~~~-~~~~~~~~~~eiih~r~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~ 243 (424) +.. ++....|+++||+|||++ +.++++|+||+..++.++.+..+++++. ++.++..++++++.+. .+++++.+++ T Consensus 152 ~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~~~~~--~~~~~~~~~~i~~~~~-~l~~e~~~~~ 228 (412) T protein:vir:26 152 IHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGS-NVGKEKRQQV 228 (412) T ss_pred EEcCCceEEEEccccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEecCC-CCCHHHHHHH Confidence 543 355678999999999987 4688999999999999999999999885 4455555566666664 4688889999 Q ss_pred HHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHH Q lcl|NC_019705. 244 EENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 244 ~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~ 323 (424) ++.|++.++ ++|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..+++ +++|+|++.++|+ T Consensus 229 ~~~~~~~~~--~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~--~~sn~e~~~~~f~ 304 (412) T protein:vir:26 229 LEDFKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT--NFAKNEELNRFYL 304 (412) T ss_pred HHHHHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC--CcccHHHHHHHHH Confidence 999998765 578899999999999999999999999999999999999999999999976544 5669999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeee Q lcl|NC_019705. 324 QYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAM 402 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~ 402 (424) ++||.|++.+||++|+++|+++.++. +++++||++++++.|.+++++++++++++|++|+||+|+++|+||+||||+++ T Consensus 305 ~~~l~P~~~~ie~~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~ 384 (412) T protein:vir:26 305 QHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPL 384 (412) T ss_pred HHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeee Confidence 99999999999999999999998864 57899999999999999999999999999999999999999999999999999 Q ss_pred ecccccchhhccccCCCcccCC Q lcl|NC_019705. 403 RQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 403 ~~~n~~~~~~~~~~~~~~~~ga 424 (424) +++|++|++...+++....+|- T Consensus 385 ~~~n~~~~~~~~~~~~~~~gG~ 406 (412) T protein:vir:26 385 ISGDLYPIDTPLELRKSLKGGD 406 (412) T ss_pred ecccccccccchhhcccccCCC Confidence 9999999987655433222222 No 32 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=100.00 E-value=2.2e-94 Score=534.07 Aligned_cols=396 Identities=18% Similarity=0.279 Sum_probs=329.9 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccchh----hccccccccCcc------cccHHHHhccHHHHHHHHHHHHhhccC Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQT----GPVSAHGHLGDS------SINDERILQISTVWRCVSLISTLTACL 77 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~------~vs~~~~~~~~~v~~~i~~ia~~ia~~ 77 (424) |-| -+|...++|+..... ..+++ .+..+. .+....|+++++||+||++||++||++ T Consensus 1 ~~~-------------~~~~~~~~p~~~e~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~l 66 (518) T protein:vir:10 1 MLL-------------ANGQTLSAPAMAELSPQMQDSYYY-APAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARL 66 (518) T ss_pred Ccc-------------cCceeecCchhhhhhhhhhccccc-ccccceecccccchhhHHHhhhHHHHHHHHHHHHhhccC Confidence 111 123333444322111 11111 112222 233456889999999999999999999 Q ss_pred ceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEee Q lcl|NC_019705. 78 PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKL 157 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~ 157 (424) ||++|+++.++..+ ..+|++ .+|+.+||++||+++||+.++.+++++||||++++|+.+|.+++||||+|++|++.. T Consensus 67 pl~l~~~~~~~~~~--~~~~~~-~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~ 143 (518) T protein:vir:10 67 PVKCMFTSGDTETE--ESDTGY-AKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKR 143 (518) T ss_pred ceEEEEEcCCCcee--ccchHH-HHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEE Confidence 99999988776543 344555 556679999999999999999999999999999999999999999999999999988 Q ss_pred cC--ceEEEEEEeC----CceEEecHhHEEEeecCCCCCc-ccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEc Q lcl|NC_019705. 158 VG--KKVVYRYQRD----SEYAEFSQKEIFHLKGFGFTGL-VGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILST 230 (424) Q Consensus 158 ~~--~~~~~~~~~~----~~~~~~~~~eiih~r~~~~~~~-~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~ 230 (424) +. +...|.|..+ +..+.|+++||||||++++++. +|+||+..+..++....++++++.++|+||++|+|+|+. T Consensus 144 ~~~~~~~~y~~~~~~~~~~~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~ 223 (518) T protein:vir:10 144 NSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRH 223 (518) T ss_pred cCCCCEEEEEEEecCCccceEEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEec Confidence 64 3455666543 3456899999999999999884 899999999999999999999999999999999999999 Q ss_pred CCCCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCc Q lcl|NC_019705. 231 GEKVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST 309 (424) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~ 309 (424) +.. +++++++++++.|++.++| .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ T Consensus 224 ~~~-ls~e~~~~~k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t 302 (518) T protein:vir:10 224 EKR-LSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRAT 302 (518) T ss_pred CCC-CCHHHHHHHHHHHHHHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCC Confidence 865 6888899999999987766 68999999999999999999999999999999999999999999999999887765 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHH Q lcl|NC_019705. 310 SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRT 389 (424) Q Consensus 310 ~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~ 389 (424) ++|+|++.+.|+++||.||+.+||++|+++|++..++ .++++||++.|+++|.+++++++.+++++|++|+||+|++ T Consensus 303 --~sn~eq~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~-~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~ 379 (518) T protein:vir:10 303 --FSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREI 379 (518) T ss_pred --chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccC-CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 5689999999999999999999999999999988764 4679999999999999999999999999999999999999 Q ss_pred hCCCCCC--CCCeeeecccccchhhccccC-----CC--cccCC Q lcl|NC_019705. 390 DNLPPLP--GGDVAMRQSQYVPITDLGTNK-----EP--RNNGA 424 (424) Q Consensus 390 ~g~~p~~--ggd~~~~~~n~~~~~~~~~~~-----~~--~~~ga 424 (424) +|+||++ |||++++++|++|++...++. .+ .+.++ T Consensus 380 ~Gl~pie~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~ 423 (518) T protein:vir:10 380 MGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPAS 423 (518) T ss_pred hCCCCCCCCCCCeeeecccceecccccccccCCCCCCCCCCCCc Confidence 9999995 899999999999987543321 00 00011 No 33 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=100.00 E-value=4e-94 Score=532.61 Aligned_cols=396 Identities=17% Similarity=0.268 Sum_probs=329.5 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccc----hhhccccccccCcc------cccHHHHhccHHHHHHHHHHHHhhccC Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGS----QTGPVSAHGHLGDS------SINDERILQISTVWRCVSLISTLTACL 77 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~------~vs~~~~~~~~~v~~~i~~ia~~ia~~ 77 (424) |-| -+|...+.|+... ....+++ .+..+. .+....|+++|+||+||++||++||++ T Consensus 1 ~~~-------------~~~~~~~~p~~~~~~~~~~~~~~~-~~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~l 66 (518) T protein:vir:78 1 MLL-------------ANGQTLSAPAMAELSPQMQDSYYY-APAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARL 66 (518) T ss_pred Ccc-------------cCceeeccchhhhhhhhhhhcccc-cceeceecccccchhhHHhhhhHHHHHHHHHHHHhhccC Confidence 111 1233333443221 1111221 222333 233456889999999999999999999 Q ss_pred ceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEee Q lcl|NC_019705. 78 PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKL 157 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~ 157 (424) ||++|+++.++..+ .. ++...+|+.+||++||+++||+.++.+++++||+|+++.|+..|.+++||||+|++|++.. T Consensus 67 p~~l~~~~~~~~~~--~~-~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~ 143 (518) T protein:vir:78 67 PVKCMFTSGDTETE--EH-DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKR 143 (518) T ss_pred ceEEEEEcCCcccc--cc-chHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEE Confidence 99999987766443 23 4445566679999999999999999999999999999999999999999999999999987 Q ss_pred cC--ceEEEEEEeC----CceEEecHhHEEEeecCCCCCc-ccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEc Q lcl|NC_019705. 158 VG--KKVVYRYQRD----SEYAEFSQKEIFHLKGFGFTGL-VGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILST 230 (424) Q Consensus 158 ~~--~~~~~~~~~~----~~~~~~~~~eiih~r~~~~~~~-~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~ 230 (424) +. +...|.|... +..+.|+++||||||++++++. +|+||+..+..++....++++++.++|+||++|+|+|++ T Consensus 144 ~~~~~~~~y~~~~~~~~~~~~~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~ 223 (518) T protein:vir:78 144 NSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRH 223 (518) T ss_pred cCCCCEEEEEEEecCCccceeEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEec Confidence 64 4455655533 3456799999999999998885 799999999999999999999999999999999999999 Q ss_pred CCCCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCc Q lcl|NC_019705. 231 GEKVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST 309 (424) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~ 309 (424) +.. +++++.+++++.|++.++| .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ T Consensus 224 ~~~-ls~e~~~~~k~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st 302 (518) T protein:vir:78 224 EKR-LSPEAQQRLREQFDRAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRAT 302 (518) T ss_pred CCC-CCHHHHHHHHHHHHHHhcCcccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCC Confidence 855 6888899999999987666 68999999999999999999999999999999999999999999999999887764 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHH Q lcl|NC_019705. 310 SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRT 389 (424) Q Consensus 310 ~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~ 389 (424) ++|+|++.+.|+++||.|++.+||++|+++|++..++ .++++||++.|+++|.+++++++.+++++|+||+||+|++ T Consensus 303 --~sn~e~~~~~f~~~tL~P~~~~ie~eln~~L~~~~~~-~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~ 379 (518) T protein:vir:78 303 --FSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREI 379 (518) T ss_pred --chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccC-cceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 5699999999999999999999999999999987765 4679999999999999999999999999999999999999 Q ss_pred hCCCCCC--CCCeeeecccccchhhccccC-CCcc------cCC Q lcl|NC_019705. 390 DNLPPLP--GGDVAMRQSQYVPITDLGTNK-EPRN------NGA 424 (424) Q Consensus 390 ~g~~p~~--ggd~~~~~~n~~~~~~~~~~~-~~~~------~ga 424 (424) +|+||++ |||++++++|++|++...++. ++.+ .++ T Consensus 380 ~gl~pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~ 423 (518) T protein:vir:78 380 MGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPAS 423 (518) T ss_pred hCCCCCCCCCCceeeecccceecccccccccCCCCCCCCCCCCc Confidence 9999996 799999999999987643321 0000 011 No 34 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=100.00 E-value=3.7e-94 Score=532.79 Aligned_cols=394 Identities=19% Similarity=0.278 Sum_probs=329.8 Q ss_pred CchHHHHHhhccCcccCCccccchh--hccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccc Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQT--GPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~ 91 (424) +|||++. .++....+...... ....+.....+..|+...|+++++||+||++||+++|++||++++.++ T Consensus 1 Mg~f~~~----~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~----- 71 (416) T protein:vir:81 1 MGIFYKN----EKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQ----- 71 (416) T ss_pred CCccccc----ccccccCCCcchhHHHHHhccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCcc----- Confidence 6776542 22222222222111 112233344677889999999999999999999999999999986432 Q ss_pred eeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc-eEEEEEEe-- Q lcl|NC_019705. 92 KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK-KVVYRYQR-- 168 (424) Q Consensus 92 ~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~-~~~~~~~~-- 168 (424) ....|++.++|+.+||++||+++||+.++.+++++||||++++|+..|.+++||||+|++|++..++. ...|.+.. T Consensus 72 -~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~ 150 (416) T protein:vir:81 72 -INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRID 150 (416) T ss_pred -ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEec Confidence 33568999999999999999999999999999999999999999999999999999999999988754 33444332 Q ss_pred ---CCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019705. 169 ---DSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 169 ---~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~ 245 (424) ....+.++++||||||+++.|+++|+||+..+..++++..++++++.++|+||++|+|+|+++....++++++++++ T Consensus 151 ~~~~~~~~~~~~~evihir~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~ 230 (416) T protein:vir:81 151 SNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRARE 230 (416) T ss_pred CCCceeEEEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHH Confidence 12346799999999999999999999999999999999999999999999999999999999988778889999999 Q ss_pred HHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHH Q lcl|NC_019705. 246 NFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ 324 (424) Q Consensus 246 ~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~ 324 (424) .|++.++| .|+|++++|++|++|++++.+++|+||+|.+++++++||++|||||.+||.... + .+.+++... +. T Consensus 231 ~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~---~~~~~~~~~-~~ 305 (416) T protein:vir:81 231 EFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-N---MSITDANLD-YL 305 (416) T ss_pred HHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-C---ccHHHHHHH-HH Confidence 99887766 689999999999999999999999999999999999999999999999986432 2 234555554 56 Q ss_pred HHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC--eee Q lcl|NC_019705. 325 YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD--VAM 402 (424) Q Consensus 325 ~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd--~~~ 402 (424) +||.|++++||++|+++|++..+ .++++||++.|++.|.+++++++++++++|+||+||+|+++|+||+|||| .++ T Consensus 306 ~~l~P~~~~ie~~ln~~l~~~~~--~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~ 383 (416) T protein:vir:81 306 STLKPYITCVCAELNFKFNDEYV--NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHR 383 (416) T ss_pred HHHHHHHHHHHHHHhhhcccccc--CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEe Confidence 79999999999999999987653 46789999999999999999999999999999999999999999999987 688 Q ss_pred ecccccchhhccccCCCcccC-------C Q lcl|NC_019705. 403 RQSQYVPITDLGTNKEPRNNG-------A 424 (424) Q Consensus 403 ~~~n~~~~~~~~~~~~~~~~g-------a 424 (424) +++|++|++.+.+.+..+.+. + T Consensus 384 ~~~n~~~~~~~~~~~~~~~~~~~~~~kgG 412 (416) T protein:vir:81 384 VDLNHVNIELVDEYQMNKSRATDKKLKGG 412 (416) T ss_pred ecccccccccccccCcccccccccccCCC Confidence 999999999765433222221 1 No 35 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=100.00 E-value=3.7e-94 Score=532.79 Aligned_cols=394 Identities=19% Similarity=0.278 Sum_probs=329.8 Q ss_pred CchHHHHHhhccCcccCCccccchh--hccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccc Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQT--GPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~ 91 (424) +|||++. .++....+...... ....+.....+..|+...|+++++||+||++||+++|++||++++.++ T Consensus 1 Mg~f~~~----~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~----- 71 (416) T protein:vir:45 1 MGIFYKN----EKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQ----- 71 (416) T ss_pred CCccccc----ccccccCCCcchhHHHHHhccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCcc----- Confidence 6776542 22222222222111 112233344677889999999999999999999999999999986432 Q ss_pred eeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc-eEEEEEEe-- Q lcl|NC_019705. 92 KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK-KVVYRYQR-- 168 (424) Q Consensus 92 ~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~-~~~~~~~~-- 168 (424) ....|++.++|+.+||++||+++||+.++.+++++||||++++|+..|.+++||||+|++|++..++. ...|.+.. T Consensus 72 -~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~ 150 (416) T protein:vir:45 72 -INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRID 150 (416) T ss_pred -ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEec Confidence 33568999999999999999999999999999999999999999999999999999999999988754 33444332 Q ss_pred ---CCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019705. 169 ---DSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 169 ---~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~ 245 (424) ....+.++++||||||+++.|+++|+||+..+..++++..++++++.++|+||++|+|+|+++....++++++++++ T Consensus 151 ~~~~~~~~~~~~~evihir~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~ 230 (416) T protein:vir:45 151 SNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRARE 230 (416) T ss_pred CCCceeEEEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHH Confidence 12346799999999999999999999999999999999999999999999999999999999988778889999999 Q ss_pred HHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHH Q lcl|NC_019705. 246 NFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ 324 (424) Q Consensus 246 ~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~ 324 (424) .|++.++| .|+|++++|++|++|++++.+++|+||+|.+++++++||++|||||.+||.... + .+.+++... +. T Consensus 231 ~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~---~~~~~~~~~-~~ 305 (416) T protein:vir:45 231 EFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-N---MSITDANLD-YL 305 (416) T ss_pred HHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-C---ccHHHHHHH-HH Confidence 99887766 689999999999999999999999999999999999999999999999986432 2 234555554 56 Q ss_pred HHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC--eee Q lcl|NC_019705. 325 YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD--VAM 402 (424) Q Consensus 325 ~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd--~~~ 402 (424) +||.|++++||++|+++|++..+ .++++||++.|++.|.+++++++++++++|+||+||+|+++|+||+|||| .++ T Consensus 306 ~~l~P~~~~ie~~ln~~l~~~~~--~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~ 383 (416) T protein:vir:45 306 STLKPYITCVCAELNFKFNDEYV--NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHR 383 (416) T ss_pred HHHHHHHHHHHHHHhhhcccccc--CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEe Confidence 79999999999999999987653 46789999999999999999999999999999999999999999999987 688 Q ss_pred ecccccchhhccccCCCcccC-------C Q lcl|NC_019705. 403 RQSQYVPITDLGTNKEPRNNG-------A 424 (424) Q Consensus 403 ~~~n~~~~~~~~~~~~~~~~g-------a 424 (424) +++|++|++.+.+.+..+.+. + T Consensus 384 ~~~n~~~~~~~~~~~~~~~~~~~~~~kgG 412 (416) T protein:vir:45 384 VDLNHVNIELVDEYQMNKSRATDKKLKGG 412 (416) T ss_pred ecccccccccccccCcccccccccccCCC Confidence 999999999765433222221 1 No 36 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=100.00 E-value=9.4e-94 Score=530.59 Aligned_cols=398 Identities=20% Similarity=0.287 Sum_probs=335.4 Q ss_pred CCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcc Q lcl|NC_019705. 11 RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 11 ~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~ 90 (424) -.-.+++.|+++.+.......+.... ..+ ..+...++..|+.+.|+++++|++||++||++||++||+++++++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~~~~~--- 75 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWIDQSTSKL-YDF-SPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKV--- 75 (409) T ss_pred CCccchhhhhhhhhhhhhhccccccc-ccc-ccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeecccc--- Confidence 23458888888876543322222211 111 1222234556788899999999999999999999999999987543 Q ss_pred ceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecC--ceEEEEEE- Q lcl|NC_019705. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG--KKVVYRYQ- 167 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~--~~~~~~~~- 167 (424) .+|++.++|+.+||++||+++||+.++.+++++||||+++.|+..|.+++||||+|++|++..++ +..+|.+. T Consensus 76 ----~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~ 151 (409) T protein:vir:93 76 ----VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHA 151 (409) T ss_pred ----ccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEc Confidence 35889999999999999999999999999999999999999999999999999999999987754 34555554 Q ss_pred eCCceEEecHhHEEEeecC-CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019705. 168 RDSEYAEFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 168 ~~~~~~~~~~~eiih~r~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~ 246 (424) .++....|+++||||+|++ +.++++|+||+.++..++.+..+++++. ++.++..++++++.+. .+++++++++++. T Consensus 152 ~~g~~~~~~~~eVih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~-~l~~e~~~~~~~~ 228 (409) T protein:vir:93 152 ATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGS-NVGKEKRQQVLED 228 (409) T ss_pred CCceEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEecCC-CCCHHHHHHHHHH Confidence 3456678999999999986 5788999999999999999999998885 4555555566666664 5788999999999 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHH Q lcl|NC_019705. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) |++.++ ++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..+++ +++|+|++.+.|++.| T Consensus 229 ~~~~~~--~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~--~~sn~e~~~~~f~~~~ 304 (409) T protein:vir:93 229 FKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT--NFAKNEELNRFYLQHT 304 (409) T ss_pred HHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC--CcccHHHHHHHHHHHH Confidence 998764 578999999999999999999999999999999999999999999999986554 5568999999999999 Q ss_pred HHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecc Q lcl|NC_019705. 327 LQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQS 405 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~ 405 (424) |.|++++||++|+++|+++.++. +++++||++++++.|.+++++++++++++|++|+||+|+++|+||+||||++++++ T Consensus 305 l~P~~~~ie~~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~ 384 (409) T protein:vir:93 305 LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISG 384 (409) T ss_pred HHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecc Confidence 99999999999999999998875 47899999999999999999999999999999999999999999999999999999 Q ss_pred cccchhhccccCCCcccCC Q lcl|NC_019705. 406 QYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 406 n~~~~~~~~~~~~~~~~ga 424 (424) |++|++...+++....+|- T Consensus 385 n~~~~~~~~~~~~~~~gG~ 403 (409) T protein:vir:93 385 DLYPIDTPLELRKSLKGGD 403 (409) T ss_pred cccccccchhhcccccCCC Confidence 9999987655443322222 No 37 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=100.00 E-value=1.5e-93 Score=529.44 Aligned_cols=398 Identities=20% Similarity=0.289 Sum_probs=332.9 Q ss_pred CCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcc Q lcl|NC_019705. 11 RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 11 ~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~ 90 (424) -...+++.|+++.+.+.....+.... .. +..+...++..|+.+.|+++++|++||++||++||++||++|++.+. T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~lp~~~~~~~~~--- 75 (409) T protein:vir:96 1 MAKENIVTRIKKKLIDNWIDQSASKL-YD-FSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKV--- 75 (409) T ss_pred CccccchhhhhhHHhhhhhccccccc-cc-cccccCccccccchhhHhhhHHHHHHHHHHHHhhhhCceEEeecccc--- Confidence 23457888888776543332222111 11 11122234456788899999999999999999999999999986542 Q ss_pred ceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc--eEEEEEE- Q lcl|NC_019705. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQ- 167 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~--~~~~~~~- 167 (424) .+|++.++|+.+||++||+++||+.++.+++++||||++++|+..|.+++||||+|++|++..+++ ..+|.+. T Consensus 76 ----~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~ 151 (409) T protein:vir:96 76 ----VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHA 151 (409) T ss_pred ----cchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEc Confidence 358999999999999999999999999999999999999999999999999999999999887643 3455554 Q ss_pred eCCceEEecHhHEEEeecC-CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019705. 168 RDSEYAEFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 168 ~~~~~~~~~~~eiih~r~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~ 246 (424) .++....|+++||||||++ +.++++|+||+..+..++++..+++++. ++.++..++++++.+ ..+++++++++++. T Consensus 152 ~~g~~~~~~~~evih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~-~~l~~e~~~~~~~~ 228 (409) T protein:vir:96 152 ATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYG-SNVSTEKRQQVLED 228 (409) T ss_pred CCceEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCceeEEecC-CCCCHHHHHHHHHH Confidence 3456778999999999976 5788999999999999999999998874 333444444555554 55788999999999 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHH Q lcl|NC_019705. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) |++.++ ++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..+++ +++|+|++.+.|+++| T Consensus 229 ~~~~~~--n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~--~~s~~e~~~~~f~~~~ 304 (409) T protein:vir:96 229 FKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNT--NFAKNEELNRFYLQHT 304 (409) T ss_pred HHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC--CcccHHHHHHHHHHHH Confidence 998774 678999999999999999999999999999999999999999999999987655 4568999999999999 Q ss_pred HHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecc Q lcl|NC_019705. 327 LQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQS 405 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~ 405 (424) |.|++++||++|+++|+++.++. +++++||++++++.|.+++++++++++++|++|+||+|+++|+||+||||++++++ T Consensus 305 l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD~~~~~~ 384 (409) T protein:vir:96 305 LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISG 384 (409) T ss_pred HHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcceeeecc Confidence 99999999999999999998864 58899999999999999999999999999999999999999999999999999999 Q ss_pred cccchhhccccCCCcccCC Q lcl|NC_019705. 406 QYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 406 n~~~~~~~~~~~~~~~~ga 424 (424) |++|++...+.+...++|- T Consensus 385 n~~~~~~~~~~~~~~~gG~ 403 (409) T protein:vir:96 385 DLYPIDTPLELRKSLKGGD 403 (409) T ss_pred cccccccchhhcccccCCC Confidence 9999986654332222222 No 38 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=100.00 E-value=2.6e-93 Score=528.18 Aligned_cols=403 Identities=18% Similarity=0.276 Sum_probs=332.3 Q ss_pred CchHHHHHhhccCcccCCccccchhhccc-cccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccce Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVS-AHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~ 92 (424) +|||+++.... ...+.+......+.+. .....+...+....++++|+|++||++||++||++|+++|++..+|..++ T Consensus 1 Mg~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~~~~ 78 (423) T protein:vir:81 1 MGFLQKLGLAP--SVVATPEPIELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDGGRER 78 (423) T ss_pred CchhHhhcccc--ccccCccccccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCCceee Confidence 89999874222 2222222221222222 22222333334555678999999999999999999999999887776544 Q ss_pred eccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC--CceeEEEEecCceeEEeecC---ceEEEEEE Q lcl|NC_019705. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA--GDVISLLPLQSANMDVKLVG---KKVVYRYQ 167 (424) Q Consensus 93 ~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~--G~~~~l~~l~~~~v~~~~~~---~~~~~~~~ 167 (424) ..+|++.+||. +||++||+++||+.++.+++++||||+++.|+.. +.+..|+|+++..|++.... +...|.+. T Consensus 79 -~~~~~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~~~~~~~~~~Y~~~ 156 (423) T protein:vir:81 79 -VREGHLARVCK-LANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRAYKDGWGSLDYIII 156 (423) T ss_pred -eccchHHHHhh-cCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeeeccCCCcceEEEEE Confidence 46688988886 8999999999999999999999999999998763 46778888888888765532 23445543 Q ss_pred ----eCCceEEecHhHEEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC----CCCHH Q lcl|NC_019705. 168 ----RDSEYAEFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK----VLTEQ 238 (424) Q Consensus 168 ----~~~~~~~~~~~eiih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~----~~~~~ 238 (424) .++....++++||||+|++++++ .+|+||+..++.+++...++++++.++|+||++|+|+|+++.. .++++ T Consensus 157 ~~~~~~g~~~~~~~~evih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e 236 (423) T protein:vir:81 157 ESGDNDGRSVKVPGERVIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAE 236 (423) T ss_pred EecCCCceEEEEcccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHH Confidence 24556789999999999988776 5799999999999999999999999999999999999987643 36889 Q ss_pred HHHHHHHHHHHHh--CCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHH Q lcl|NC_019705. 239 QRSQVEENFKEIA--GGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 239 ~~~~~~~~~~~~~--~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e 316 (424) +++++++.|++.+ +.+|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+| T Consensus 237 ~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t--~sn~e 314 (423) T protein:vir:81 237 SRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNAN--YSNVR 314 (423) T ss_pred HHHHHHHHHHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCC--cccHH Confidence 9999999999875 3468899999999999999999999999999999999999999999999999887764 56899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccChhhh--cccchhhhhhhhhccCHHHHHHHHHHHHh-CCCCCHHHHHHHhCCC Q lcl|NC_019705. 317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDV--GRIHAEHNLDGLLRGDSASRAAFMKAMGE-AGLRTINEMRRTDNLP 393 (424) Q Consensus 317 ~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~-~g~~t~NE~R~~~g~~ 393 (424) ++.++|+++||.|++.+||++|+++|+++.+. .+++++||.++|+++|.+++++++.+++. +||||+||+|+++|+| T Consensus 315 ~~~~~f~~~~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~ 394 (423) T protein:vir:81 315 EFRKALYGDNLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMDNLP 394 (423) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCC Confidence 99999999999999999999999999998764 45889999999999999999999999885 6999999999999999 Q ss_pred CCCCCCeeeecccccchhhccccCCCccc Q lcl|NC_019705. 394 PLPGGDVAMRQSQYVPITDLGTNKEPRNN 422 (424) Q Consensus 394 p~~ggd~~~~~~n~~~~~~~~~~~~~~~~ 422 (424) |+||||++++|.|+.+.+......+..+- T Consensus 395 p~~gGD~~~~p~n~~~~~~~~~~~~~~~t 423 (423) T protein:vir:81 395 SIDGGDDLARPLNTEFGDSEDAPGEEVET 423 (423) T ss_pred CCCCcceeecccccccCccCCCCCCCCCC Confidence 99999999999999986643322222222 No 39 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=100.00 E-value=3.5e-93 Score=527.46 Aligned_cols=398 Identities=20% Similarity=0.289 Sum_probs=333.7 Q ss_pred CCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcc Q lcl|NC_019705. 11 RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 11 ~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~ 90 (424) -+...++.|+++.+.+.....+..... .+..+...++..|+.+.|+++++|++||++||++||++||+++++++. T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~~~~~--~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~~--- 75 (409) T protein:vir:94 1 MAKENIVTRIKKKLIDNWIDQSASKLY--DFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKV--- 75 (409) T ss_pred CcccccchhhhhHHhhhhhcCCccccc--ccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeecccc--- Confidence 234466777777654433322222111 111222334556788999999999999999999999999999986543 Q ss_pred ceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecC--ceEEEEEE- Q lcl|NC_019705. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG--KKVVYRYQ- 167 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~--~~~~~~~~- 167 (424) .+|++.++|+.+||++||+++||+.++.+++++||||++++|+..|.+++||||+|++|++..+. +..+|.+. T Consensus 76 ----~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~ 151 (409) T protein:vir:94 76 ----VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHA 151 (409) T ss_pred ----cchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEc Confidence 35889999999999999999999999999999999999999999999999999999999987764 34555554 Q ss_pred eCCceEEecHhHEEEeecC-CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019705. 168 RDSEYAEFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 168 ~~~~~~~~~~~eiih~r~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~ 246 (424) .++....|+++||+|+|++ +.++++|+||+..+..++++..+++++. ++.++..++++++.+. .+++++.+++++. T Consensus 152 ~~g~~~~~~~~dvih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~-~l~~e~~~~~~~~ 228 (409) T protein:vir:94 152 ATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGS-NVGKEKRQQVLED 228 (409) T ss_pred CCceEEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCeeEEecCC-CCCHHHHHHHHHH Confidence 3456778999999999986 4688999999999999999999998885 4445555566666654 5688999999999 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHH Q lcl|NC_019705. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) |++.++ ++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..+++ +++|+|++.+.|+++| T Consensus 229 ~~~~~~--~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~--~~sn~e~~~~~f~~~~ 304 (409) T protein:vir:94 229 FKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT--NFAKNEELNRFYLQHT 304 (409) T ss_pred HHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC--CcccHHHHHHHHHHHH Confidence 998775 678999999999999999999999999999999999999999999999986554 5568999999999999 Q ss_pred HHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecc Q lcl|NC_019705. 327 LQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQS 405 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~ 405 (424) |.|+++.||++|+++|+++.++. +++++||+++++++|.+++++++++++++|+||+||+|+++|+||+||||++++++ T Consensus 305 l~P~~~~ie~~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~ 384 (409) T protein:vir:94 305 LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISG 384 (409) T ss_pred HHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeEeecc Confidence 99999999999999999998874 48899999999999999999999999999999999999999999999999999999 Q ss_pred cccchhhccccCCCcccCC Q lcl|NC_019705. 406 QYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 406 n~~~~~~~~~~~~~~~~ga 424 (424) |++|++...+.+....+|- T Consensus 385 n~~~~~~~~~~~~~~kGG~ 403 (409) T protein:vir:94 385 DLYPIDTPLELRKSLKGGD 403 (409) T ss_pred cccccccchhhcccccCCC Confidence 9999987654332222222 No 40 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=100.00 E-value=2.7e-93 Score=528.08 Aligned_cols=390 Identities=16% Similarity=0.197 Sum_probs=321.9 Q ss_pred CchHHHHHhhccCcccCCccccchh-hccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccce Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQT-GPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~ 92 (424) +++|+ +........|... ...+......+.. +...|+++++||+||++||++||++|+++|+++.++. T Consensus 1 m~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~~~Al~~~~V~~cv~~ia~~iA~lp~~~~~~~~~~~--- 69 (417) T protein:vir:38 1 MKLFR-------GLATEVDPHWADHLLDSGVIPSFRGGY-LGISALRNSDVLTAVSIVSGDVSRFPLVITDSSTDEV--- 69 (417) T ss_pred Ccccc-------ccccCCCccchhhhcccccccccCCce-echhhcccHHHHHHHHHHHHhhccCeeEEEEcCCcce--- Confidence 33332 1111111111111 1112222223333 3356899999999999999999999999998876543 Q ss_pred eccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-CceeEEEEecCceeEEeec-CceEEEEEEeC- Q lcl|NC_019705. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-GDVISLLPLQSANMDVKLV-GKKVVYRYQRD- 169 (424) Q Consensus 93 ~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~-G~~~~l~~l~~~~v~~~~~-~~~~~~~~~~~- 169 (424) ...|++.++|+.+||++||+++||+.++.+++++||||++++|+.. |.|..|+|++|++|++..+ .+...|.|... T Consensus 70 -~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~~~~y~~~~~~ 148 (417) T protein:vir:38 70 -IDLANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPDNIIYRFTPYN 148 (417) T ss_pred -eccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCCeEEEEEEEcC Confidence 3468899999999999999999999999999999999999999864 6799999999999998765 45566666543 Q ss_pred -CceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHH Q lcl|NC_019705. 170 -SEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK 248 (424) Q Consensus 170 -~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~ 248 (424) +....++++||||||+++.|+++|+||+.++..++.+..++++++.++|+||++|+++++.+.. +++++.+++++.|+ T Consensus 149 ~~~~~~~~~~dviH~r~~~~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~-l~~e~~~~~~~~~~ 227 (417) T protein:vir:38 149 SSMQKVCGFEDVIHWKFFSYDTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKESR-LSAEARQKIREDFE 227 (417) T ss_pred CcEEEEecCcceEEecCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCC-CCHHHHHHHHHHHH Confidence 3345689999999999999999999999999999999999999999999999999999998865 68888999999999 Q ss_pred HHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHH Q lcl|NC_019705. 249 EIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ 328 (424) Q Consensus 249 ~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~ 328 (424) +.+++.|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||. +.+++|+|++.++|+++||. T Consensus 228 ~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~----~~~~s~~e~~~~~~~~~tl~ 303 (417) T protein:vir:38 228 RAQAGADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRLAQ----NSPNQSVKQLADDYIRNDLP 303 (417) T ss_pred HHhcccccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCC----CCcchhHHHHHHHHHHHHHH Confidence 99998899999999999999999999999999999999999999999999999984 23566899999999999999 Q ss_pred HHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC--Ceeeeccc Q lcl|NC_019705. 329 PYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG--DVAMRQSQ 406 (424) Q Consensus 329 P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg--d~~~~~~n 406 (424) |++++||++|+++|+++.++..++++||.+.+.+.+ .+.+++++++|+||+||+|+++|+||+||| |++++|+| T Consensus 304 P~~~~ie~~l~~~Ll~~~~~~~~~~~fd~~~l~~~~----~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n 379 (417) T protein:vir:38 304 FYFEPITSEFELKLLDDAQRHQYCIGFDTKSVNGLP----IADVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLN 379 (417) T ss_pred HHHHHHHHHHHhhhcChhhcccceEEechhhhhHHH----HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeeccc Confidence 999999999999999998877788999988875433 344678899999999999999999999886 88999999 Q ss_pred ccchhhccccCCC-------cccCC Q lcl|NC_019705. 407 YVPITDLGTNKEP-------RNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~~~~-------~~~ga 424 (424) +++++.....+.+ .++.+ T Consensus 380 ~~~~d~~~~~~~~~~~~~kgg~~~~ 404 (417) T protein:vir:38 380 TVFLDQKEAYQAEHAAELKGGDTNA 404 (417) T ss_pred ccccccccccccccccccCCCCCCC Confidence 9999865442211 11111 No 41 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=100.00 E-value=1e-91 Score=519.48 Aligned_cols=404 Identities=16% Similarity=0.154 Sum_probs=334.2 Q ss_pred hHHHHHhhccCcccCCccccchhh-ccc---cccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccc Q lcl|NC_019705. 16 WWARLQSWFVGGRLVTPNQGSQTG-PVS---AHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~---~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~ 91 (424) |++.+...+++...........+. .++ .....++..++.+.|+++|+||+||++||++||++||++|+...+|... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~~g~~~ 80 (460) T protein:vir:10 1 MANRIIRALRELTGLDNKFNDAFIKYIGQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIKVVKDTKAYQ 80 (460) T ss_pred CchhHHHHHhhhhccCCCchHHHHHhhccccCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEEeccCCccch Confidence 666655555433222222222221 111 1223356778899999999999999999999999999999998877543 Q ss_pred e-------------------------eccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC----CCce Q lcl|NC_019705. 92 K-------------------------VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS----AGDV 142 (424) Q Consensus 92 ~-------------------------~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~----~G~~ 142 (424) + ....+++..+|+.+||++||+++||+.++.+++++||||++++|+. .|.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~~~~G~~ 160 (460) T protein:vir:10 81 QLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDGINAGVP 160 (460) T ss_pred hhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCccCcee Confidence 2 2233456667888999999999999999999999999999999964 4789 Q ss_pred eEEEEecCceeEEeecCce---------EEEEEEeCCceEEecHhHEEEeecCCCC------CcccCchHHHHHHHHHHH Q lcl|NC_019705. 143 ISLLPLQSANMDVKLVGKK---------VVYRYQRDSEYAEFSQKEIFHLKGFGFT------GLVGLSPIAFACKSAGVA 207 (424) Q Consensus 143 ~~l~~l~~~~v~~~~~~~~---------~~~~~~~~~~~~~~~~~eiih~r~~~~~------~~~G~s~i~~~~~~i~~~ 207 (424) ++||||+|++|++..+++. ..|.+..++....|+++||||||+++++ +++|+||+..++.++.+. T Consensus 161 ~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~ 240 (460) T protein:vir:10 161 SQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQFIEFNEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQ 240 (460) T ss_pred EEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCceeEEecccceEEEecCCCCcccccCccccccHHHHHHHHHHHH Confidence 9999999999999887543 2344556777889999999999987643 589999999999999999 Q ss_pred HHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHH Q lcl|NC_019705. 208 VAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKF 286 (424) Q Consensus 208 ~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~ 286 (424) .++++++.++|+||+.|+++++.+. .+++++.+++++.|++.++| +|+|++++|++|++|++++++++|+||+|.+++ T Consensus 241 ~~~~~~~~~~f~ng~~~~~i~~~~~-~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~ 319 (460) T protein:vir:10 241 NSTIDNNVKTMQNGGVFGFIHGGST-GLTQPQADSLKQRLTEMDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKY 319 (460) T ss_pred HHHHHHHHHHHhcCCCcceeeecCC-CCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHH Confidence 9999999999999999999888765 46888999999999987665 689999999999999999999999999999999 Q ss_pred HHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhh--hcc Q lcl|NC_019705. 287 QVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGL--LRG 363 (424) Q Consensus 287 ~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l--~~~ 363 (424) ++++||++|||||.+||..++++.+++|+|++.+.|+++||.|++..||++|+++|+++.++. .++++||++.+ ++. T Consensus 320 ~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~~ 399 (460) T protein:vir:10 320 DQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEMQT 399 (460) T ss_pred HHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhHHH Confidence 999999999999999999988888899999999999999999999999999999999988764 47789998887 333 Q ss_pred CHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC--CCCCeeeecccccchhhccccC-CCcccCC Q lcl|NC_019705. 364 DSASRAAFMKAMGEAGLRTINEMRRTDNLPPL--PGGDVAMRQSQYVPITDLGTNK-EPRNNGA 424 (424) Q Consensus 364 d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~--~ggd~~~~~~n~~~~~~~~~~~-~~~~~ga 424 (424) |. +....++++|++|+||+|+++|+||+ ||||++++|+|++|+++.+++. +..+|.. T Consensus 400 d~----~~~~~~~~~g~~T~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~~~~~~~~~~nq~ 459 (460) T protein:vir:10 400 DM----VAMASWLNTIPVTPNEIRIAMKYETLNQDGMDIVFMPSNKVRIDDVSNNLIDSAFNQN 459 (460) T ss_pred HH----HHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCCeeeecccccchhhcccccCCCcccCC Confidence 33 34445788999999999999999998 5799999999999999876543 2222222 No 42 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=100.00 E-value=1.4e-91 Score=518.65 Aligned_cols=388 Identities=14% Similarity=0.183 Sum_probs=323.6 Q ss_pred HhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHH Q lcl|NC_019705. 21 QSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLA 100 (424) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~ 100 (424) |+||+++.............+. .......++...|+++++||+||++||++||++||++++.+. +...+|++. T Consensus 1 m~~f~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~Al~~~~V~~~i~~Ia~~iA~lp~~~~~~~g-----~~~~~~~~~ 73 (406) T protein:vir:97 1 MSFFQPLGTSKVSYDDYISSVL--AGDVSQKYLGVSALKNSDILTATSIIAGDIARFPLVKKDVNG-----DIIHDEDIN 73 (406) T ss_pred CccccccCCCCCCcchHHHHHh--cCCCCcccccchhhccHHHHHHHHHHHHhhhhCeeEEEecCc-----cccccchHH Confidence 5555443322211111111111 112223445567999999999999999999999998876543 234568999 Q ss_pred HHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC-CCceeEEEEecCceeEEeecC-ceEEEEEE--eCCceEEec Q lcl|NC_019705. 101 RLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS-AGDVISLLPLQSANMDVKLVG-KKVVYRYQ--RDSEYAEFS 176 (424) Q Consensus 101 ~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~-~G~~~~l~~l~~~~v~~~~~~-~~~~~~~~--~~~~~~~~~ 176 (424) +||+.+||++||+++||+.++.+++++||||+++.|+. .|.+.+|+|++|++|++..++ +...|.|. .++....++ T Consensus 74 ~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~~~~~y~~~~~~~~~~~~~~ 153 (406) T protein:vir:97 74 YLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDNHEIVYTFTDMLTAKQVKCF 153 (406) T ss_pred HHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCCceEEEEEEecCCceEEEEc Confidence 99999999999999999999999999999999999985 689999999999999988764 45666665 456678899 Q ss_pred HhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCccc Q lcl|NC_019705. 177 QKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVK 256 (424) Q Consensus 177 ~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~a 256 (424) ++||||||+++.|+++|+||+.++..++.++.++++++.++|+||+.|++++..+ ..+++++.+++++.|++.+++.|+ T Consensus 154 ~~evih~r~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~-~~l~~e~~~~~~~~~~~~~~g~n~ 232 (406) T protein:vir:97 154 AHDVIHWKFFSHDTILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKG-AQLSGDARQRARQEFEKMREGSVG 232 (406) T ss_pred cccEEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecC-CCCCHHHHHHHHHHHHHHhccccc Confidence 9999999999999999999999999999999999999999999999887776655 557899999999999999999999 Q ss_pred CcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 257 KRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWEN 336 (424) Q Consensus 257 g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~ 336 (424) |++++|++|++|++++++++|+||+|.+++++++||++|||||.+||... +++|+|++.+.|+..||.|++++||+ T Consensus 233 g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~----~~~~~e~~~~~f~~~~l~P~~~~ie~ 308 (406) T protein:vir:97 233 GSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNS----PNQSVAQLMEDYVTNDLPFYFDAITS 308 (406) T ss_pred CceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCC----CcchHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999998532 34588999999999999999999999 Q ss_pred HHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeeeecccccchhhcc Q lcl|NC_019705. 337 SIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAMRQSQYVPITDLG 414 (424) Q Consensus 337 ~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--gd~~~~~~n~~~~~~~~ 414 (424) +|+++|+++.++..++++||++++ .+.+++.+.+++++|+||+||+|+++|+||+++ ||++++|+|++|++... T Consensus 309 ~l~~kll~~~~~~~~~i~fd~~~~----~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~ 384 (406) T protein:vir:97 309 ELGLKTLNDKDRRLYHIEFDTRSV----TGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDRYQSSLNYVFLDKKE 384 (406) T ss_pred HHhhhhcChhhccceeEEEecCcc----chhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeEeeccCccchhccc Confidence 999999999887778889997654 456677888999999999999999999999965 99999999999998764 Q ss_pred ccC-------CCcccCC Q lcl|NC_019705. 415 TNK-------EPRNNGA 424 (424) Q Consensus 415 ~~~-------~~~~~ga 424 (424) +.+ ++.++++ T Consensus 385 ~~~~~~~~~~~gg~~~~ 401 (406) T protein:vir:97 385 EYQDKVGIKGKGGEVNA 401 (406) T ss_pred ccccccccccCCCCCCC Confidence 311 1222222 No 43 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=100.00 E-value=1.3e-90 Score=513.32 Aligned_cols=373 Identities=22% Similarity=0.281 Sum_probs=315.8 Q ss_pred CchHHHHHhh------------------------ccCcccCCccccc-hhh--ccc----cccccCcccccHHHHhccHH Q lcl|NC_019705. 14 NGWWARLQSW------------------------FVGGRLVTPNQGS-QTG--PVS----AHGHLGDSSINDERILQIST 62 (424) Q Consensus 14 ~G~~~~~~~~------------------------~~~~~~~~~~~~~-~~~--~~~----~~~~~~~~~vs~~~~~~~~~ 62 (424) +|||+++++. |+++......... ... .+. .+....+..++.+.++++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~ 80 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLIDV 80 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccchhhHhhhHH Confidence 9999999986 3322211111111 001 111 12344677899999999999 Q ss_pred HHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEE-eeCCCCc Q lcl|NC_019705. 63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV-DRNSAGD 141 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~-~r~~~G~ 141 (424) ||+||++||++||++||++|+.++. .+.+..+|+.+||++||+++||+.++.++++ ||+|+++ .|+.+|. T Consensus 81 v~acV~~Ia~~iA~lpl~~~~~~~~--------~~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay~~~i~r~~~G~ 151 (409) T protein:vir:83 81 AWACIDLNASVLSSMPIYRMRNGRI--------IDSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEAFVLPMAHGSDGY 151 (409) T ss_pred HHHHHHHHHHhhccCceEEeeCCcc--------ccchhhhcccCCCCCCCHHHHHHHHHHHHhh-CCcEEEEEEECCCCc Confidence 9999999999999999999975432 2345668999999999999999999999988 9999975 5899999 Q ss_pred eeEEEEecCceeEEeecCce-EEEEEEeCCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019705. 142 VISLLPLQSANMDVKLVGKK-VVYRYQRDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFA 219 (424) Q Consensus 142 ~~~l~~l~~~~v~~~~~~~~-~~~~~~~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ 219 (424) |++|+||+|++|++..+.+. ..|++ ++ ...++||||||+++ .++++|+||++.++.++.+..++++++.++|+ T Consensus 152 ~~~L~pl~p~~v~v~~~~~g~~~y~~--~~---~~~~~eiiHir~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~ 226 (409) T protein:vir:83 152 PIRFRVVPPWLVNVELKKGARREYRI--GG---LNVTDEILHIRYQGNTADAHGHGPLESAAPRQVVIGLLQKYVQNLAE 226 (409) T ss_pred EEEEEEECCcceEEEEcCCceEEEEE--cc---ccCccceEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999887554 34444 32 23468999999876 57799999999999999999999999999999 Q ss_pred cCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCcee-eecccChhHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019705. 220 NGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFST-SAIGVTPQDAEMMASRKFQVSELARFFGVP 298 (424) Q Consensus 220 ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~-~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP 298 (424) ||++|+|+|+++.. +++++.+++++.|+..+++ |+|+++++++|+++ ++++++++|+||+|.+++++++||++|||| T Consensus 227 nga~p~gil~~~~~-ls~e~~~~~~~~~~~~~~~-nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVP 304 (409) T protein:vir:83 227 TGGVPLYWLGVERR-LSETEAVDLMDRWIESRSK-YAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVP 304 (409) T ss_pred cCCCcceEeecCCC-CCHHHHHHHHHHHHHhhCC-ccCccceecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCC Confidence 99999999999865 6888899999999887654 78999999999997 568999999999999999999999999999 Q ss_pred HHHhCCCCCCcc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHh Q lcl|NC_019705. 299 PHLVGDVEKSTS-WGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGE 377 (424) Q Consensus 299 p~~l~~~~~~~~-~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~ 377 (424) |.+||..+++++ +|+|+|++.++|+++||.||+++||++|+++|+++.+ +++||++.|+++|.++|+++++++++ T Consensus 305 p~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~~----~~~f~~~~llr~d~~~r~~~~~~~~~ 380 (409) T protein:vir:83 305 PFLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALDRWALPSPQ----HLELNRDDYTRPSLVERATAYKIMIE 380 (409) T ss_pred HHHccCCCCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCc----EEEeehhhhhccCHHHHHHHHHHHHh Confidence 999998776543 6789999999999999999999999999999997643 68999999999999999999999999 Q ss_pred CCCCCHHHHHHHhCCCCCCCCCeeeeccc Q lcl|NC_019705. 378 AGLRTINEMRRTDNLPPLPGGDVAMRQSQ 406 (424) Q Consensus 378 ~g~~t~NE~R~~~g~~p~~ggd~~~~~~n 406 (424) +|+||+||+|+++||||++|||++.-.+- T Consensus 381 ~G~lT~NE~R~~~glpp~~ggd~l~~~gv 409 (409) T protein:vir:83 381 AGVMEPNEARAMERLHSEAAAVRLSGGGV 409 (409) T ss_pred CCCcCHHHHHHHhCCCCCCCCcccCCCCC Confidence 99999999999999999999999843322 No 44 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=100.00 E-value=1.4e-89 Score=507.75 Aligned_cols=380 Identities=19% Similarity=0.230 Sum_probs=312.3 Q ss_pred CccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCC Q lcl|NC_019705. 31 TPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQY 110 (424) Q Consensus 31 ~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~ 110 (424) .++.++..+.+..++......++.+.++++++||+||++||++||++||++|+++ +. ....|++.++|+.+||++ T Consensus 1 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~--~~---~~~~~~l~~lL~~~PN~~ 75 (723) T protein:vir:94 1 MTTFPSGAGGWNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPD--GE---LDELHPLSQLWNVMPNRA 75 (723) T ss_pred CcccccCCCccccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCC--Cc---cchhhHHHHHHhhCCCCC Confidence 1111122222222333445567788999999999999999999999999998643 22 234689999999999999 Q ss_pred CCHHHHHHHHHHHHHHcCCeEEEEeeCC---CCceeEEEEecCceeEEeecCc--------eEEEEEE-eCCceEEecHh Q lcl|NC_019705. 111 MTAQEFREAMTMQLCFYGNAYALVDRNS---AGDVISLLPLQSANMDVKLVGK--------KVVYRYQ-RDSEYAEFSQK 178 (424) Q Consensus 111 ~s~~~f~~~~~~~~ll~G~a~~~~~r~~---~G~~~~l~~l~~~~v~~~~~~~--------~~~~~~~-~~~~~~~~~~~ 178 (424) ||+++||+.++.+|+++||+|++++|+. .|.|.+|+|+++..+.+....+ ...|.++ .++...+++++ T Consensus 76 ~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y~~~~~~G~~~~~~~~ 155 (723) T protein:vir:94 76 MPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGYVIERTDGVRVPVLAD 155 (723) T ss_pred CCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEEEEEecCceeEEeccc Confidence 9999999999999999999999999654 4899999999998877654332 2234443 45666789999 Q ss_pred HEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCC-ccc Q lcl|NC_019705. 179 EIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGG-PVK 256 (424) Q Consensus 179 eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~-~~a 256 (424) ||||||+++ .|+++|+||+..++.+|....++++++.++|+||++|+|||+.+ + +++++.+++++.|++.++| .|+ T Consensus 156 dIiHir~~~~~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~-~-l~~e~~~~~~~~~~~~~~G~~Na 233 (723) T protein:vir:94 156 EMLWLRFSDPYDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVNLG-D-MDEQTFTKTVAAFRSQVEGVQNA 233 (723) T ss_pred ceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC-C-CCHHHHHHHHHHHHHHhhchhhc Confidence 999999886 79999999999999999999999999999999999999999976 3 6889999999999886655 799 Q ss_pred CcceecC----------CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHH Q lcl|NC_019705. 257 KRLWILE----------AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 257 g~~~~l~----------~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) |++++|+ .|++|++++++++|+||+|++++++++||++|||||.+|++. .+++|++++.+.|+.+| T Consensus 234 gk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~----st~sN~e~~~~~f~~~t 309 (723) T protein:vir:94 234 GRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLGG----STYENQAEAKAAVWTET 309 (723) T ss_pred CcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCC----CCcccHHHHHHHHHHHH Confidence 9999986 589999999999999999999999999999999999999642 34568999999999999 Q ss_pred HHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC--eeeec Q lcl|NC_019705. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD--VAMRQ 404 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd--~~~~~ 404 (424) |+||++.||++|+++|++..+. ..+++||...++++|.+++++++..++++|+||+||+|+++|+||+|||| .++.| T Consensus 310 L~P~~~~ie~~ln~~Ll~~~g~-~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~~~~~p 388 (723) T protein:vir:94 310 LIPQMEVMASITDLQLLPDIGW-TVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQMTLTP 388 (723) T ss_pred HHHHHHHHHHHHhHhhcccccC-ceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceecc Confidence 9999999999999999976543 35678888899999999999999999999999999999999999999987 34455 Q ss_pred c--cccchhhccccCCCcccCC Q lcl|NC_019705. 405 S--QYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 405 ~--n~~~~~~~~~~~~~~~~ga 424 (424) . ++.|.+... +..++++| T Consensus 389 ~~~~~a~~~~~~--p~~~e~~~ 408 (723) T protein:vir:94 389 YRAQFAPAPAPA--PAVEEGAA 408 (723) T ss_pred ccccccCCCCCC--ccchhhhH Confidence 3 444433221 11122222 No 45 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=100.00 E-value=7e-88 Score=498.39 Aligned_cols=387 Identities=18% Similarity=0.234 Sum_probs=314.0 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccH-HHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccce Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSIND-ERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~-~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~ 92 (424) +|||+. |+++....+...... +..........++. ..+..+|+||+||++||++||++|+++|++.++|.. T Consensus 1 Mg~~~~----f~~k~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~~-- 72 (403) T protein:vir:80 1 MGLFNF----FRRKTRSEPTNAISW--FLTQEAYDTLAIPGYTRLSDNPEVRMAVHKIAELISSMTIHLMQNTDNGDI-- 72 (403) T ss_pred Cccccc----ccccccccccchhhh--hcccccccccccchhhhhhhhHHHHHHHHHHHHhhhhCceEEEEecCCcee-- Confidence 888863 433333322221111 11111111112222 234568999999999999999999999998776643 Q ss_pred eccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHc--CCeEEEEeeCCCCceeEEEEecCceeEEeecCceEEEEEEeCC Q lcl|NC_019705. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFY--GNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDS 170 (424) Q Consensus 93 ~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~--G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~ 170 (424) ...|++.++|+.+||++||+++||+.++.++++. ||||+++.|+..|.+.+||||+|++|++..+++...+.|. T Consensus 73 -~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~~~~~~g~~~~y~--- 148 (403) T protein:vir:80 73 -RIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSFVDTDTGYQIWYQ--- 148 (403) T ss_pred -ecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEEEEcCCceEEEEe--- Confidence 3468999999999999999999999999999994 7899999999999999999999999999888776555543 Q ss_pred ceEEecHhHEEEeec--CCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHH- Q lcl|NC_019705. 171 EYAEFSQKEIFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENF- 247 (424) Q Consensus 171 ~~~~~~~~eiih~r~--~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~- 247 (424) ...++++||+||+. .+.++++|+||+..++.++....++++++.++|+||++|++||+.+... ++++.+++++.| T Consensus 149 -~~~~~~~eiih~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~-~~~~~~~~~~~~~ 226 (403) T protein:vir:80 149 -GKAYNYDEVLHFIVNPDPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAAT-AELSSEEGRNAVF 226 (403) T ss_pred -ecccchhhEEEEeccCCCcCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCC-ChHHHHHHHHHHH Confidence 35689999999994 3467889999999999999999999999999999999999999998764 555555566555 Q ss_pred HHHhCCcccCcceecCCCc-eeeecc-cChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHH Q lcl|NC_019705. 248 KEIAGGPVKKRLWILEAGF-STSAIG-VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 248 ~~~~~~~~ag~~~~l~~g~-~~~~l~-~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~ 325 (424) +++.++.++|++++++.+. ++.++. ++++|+||+|.+++++.+||++|||||.+||..+ +.+++..+|+++ T Consensus 227 ~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-------~~~~~~~~f~~~ 299 (403) T protein:vir:80 227 KKYLEASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLLGVGK-------YDKDEYNNFINS 299 (403) T ss_pred HHHhhhhhcCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC-------ccHHHHHHHHHH Confidence 5566777899999987664 455554 5789999999999999999999999999997532 224566789999 Q ss_pred HHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecc Q lcl|NC_019705. 326 TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQS 405 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~ 405 (424) ||.|++++||++|+++|+++.+ ++++||++.|+++|.+++++++.+++++|+||+||+|+++|+||+||||++++++ T Consensus 300 ~l~P~~~~ie~~l~~kll~~~~---~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd~~~~~~ 376 (403) T protein:vir:80 300 TILPIAKGIEQELTRKLLISPD---LYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLSELVILE 376 (403) T ss_pred HHHHHHHHHHHHHHHhccCCCC---cEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecc Confidence 9999999999999999998765 5689999999999999999999999999999999999999999999999999999 Q ss_pred cccchhhcccc---CCCcccCC Q lcl|NC_019705. 406 QYVPITDLGTN---KEPRNNGA 424 (424) Q Consensus 406 n~~~~~~~~~~---~~~~~~ga 424 (424) |++|++.++++ ++++++|+ T Consensus 377 n~~pl~~~~~~~~~k~ge~~~~ 398 (403) T protein:vir:80 377 NYIPLDKIGDQNKLKGGEKGGA 398 (403) T ss_pred cccchhhccchhhccCCCCCCC Confidence 99999876653 33344444 No 46 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=100.00 E-value=2.2e-87 Score=495.67 Aligned_cols=391 Identities=17% Similarity=0.191 Sum_probs=327.0 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +|||++++....+.. ..........+.......+..++...++++++|++||++||++||++||++|+.++++.. T Consensus 1 Mg~f~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~--- 75 (406) T protein:vir:95 1 MGLFDRWRRTKRKSK--IRADTGYVGLFMSGEDVSFLVPGYVRLSDNPEVRMAVHKIADLISSMTIYLMQNTEDGDI--- 75 (406) T ss_pred Ccchhhhcccccccc--ccccchhhhhhccCcccCccccCHHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce--- Confidence 899988754432221 111222222333334455666788899999999999999999999999999998876632 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCe--EEEEeeCCCCceeEEEEecCceeEEeecCceEEEEEEeCCc Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNA--YALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a--~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~ 171 (424) ...|++.++|+.+||++||+++||+.++.+++++|++ |+++.|+..|.+++||||+|++|++..+.+... |..+ T Consensus 76 ~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i~~~~v~~~~~~~~~~--~~~~-- 151 (406) T protein:vir:95 76 RIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPLTPSKVNFLDTPDGYQ--VLYG-- 151 (406) T ss_pred eecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEEcCceeEEEEcCCeEE--EEec-- Confidence 2468899999999999999999999999999999765 556779999999999999999999999887644 3334 Q ss_pred eEEecHhHEEEeecC--CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHH Q lcl|NC_019705. 172 YAEFSQKEIFHLKGF--GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKE 249 (424) Q Consensus 172 ~~~~~~~eiih~r~~--~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~ 249 (424) ...|+++||||+|+. +.++++|+||+..+..++.+..++++++.++|+||++|+++++.+.. +++++.+++++.|.+ T Consensus 152 ~~~~~~~evih~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~-l~~e~~~~~~~~~~~ 230 (406) T protein:vir:95 152 GQTFNYDEVLHFIYNPDPERPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDAA-TAELSSEEGRNAVFK 230 (406) T ss_pred cEEEchhHEEEeeccCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC-CCHHHHHHHHHHHHH Confidence 357999999999953 46789999999999999999999999999999999999999999866 577778888888877 Q ss_pred HhC-CcccCcceecCC-Cceeeecc-cChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHH Q lcl|NC_019705. 250 IAG-GPVKKRLWILEA-GFSTSAIG-VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 250 ~~~-~~~ag~~~~l~~-g~~~~~l~-~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) .++ ..|+|++++++. |.+++++. ++++|+||+|.+++++++||++|||||.+||..+ +.+++..+|+++| T Consensus 231 ~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~-------~~~~~~~~~~~~~ 303 (406) T protein:vir:95 231 KYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIGE-------FNRDEYNNFINST 303 (406) T ss_pred HhccccccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-------chHHHHHHHHHHH Confidence 654 568899988865 45677764 6899999999999999999999999999997532 4578899999999 Q ss_pred HHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccc Q lcl|NC_019705. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQ 406 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n 406 (424) |.|++++||++|+++|+++.+ ++++||+++|++.|.+++++.+.+++++|++|+||+|+++|+||+||||++++|+| T Consensus 304 l~P~~~~ie~~l~~~l~~~~~---~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~~gd~~~~~~n 380 (406) T protein:vir:95 304 ILPIAKGIEQELTRKLLISPD---LYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKEGLSELVILEN 380 (406) T ss_pred HHHHHHHHHHHHHHhcCCCCC---cEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccC Confidence 999999999999999998754 46899999999999999999999999999999999999999999999999999999 Q ss_pred ccchhhcccc---CCCcccCC Q lcl|NC_019705. 407 YVPITDLGTN---KEPRNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~---~~~~~~ga 424 (424) ++|++.+++. +++.++|. T Consensus 381 ~~~~~~~~~~~~~k~g~~~~~ 401 (406) T protein:vir:95 381 YIPLDKIGDQSKLKGGDNSGA 401 (406) T ss_pred ccchhhcccccccCCCCCCCC Confidence 9999876542 33333333 No 47 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=100.00 E-value=1.4e-87 Score=496.81 Aligned_cols=402 Identities=16% Similarity=0.231 Sum_probs=322.4 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhccc-cccccCccccc-HHHHhccHHHHHHHHHHHHhhccCc Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVS-AHGHLGDSSIN-DERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~vs-~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) |++-+ -..+++||++-.+.....+... .......+.. ...+......+ ...++++++|++||++||++||++| T Consensus 4 ~~~~~----~~~~m~~F~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cI~~ia~~ia~~~ 78 (413) T protein:vir:96 4 VSEIR----KDKNLKFFNNKRSPTEESKAKD-EIPKAPQVVMTLPNFFKELISDGYTKLSDSPEVRMAVDCIADLVSNMT 78 (413) T ss_pred cchhh----hhhcCCccccCCCcchhhhhhc-cccccccccccchhhHhhhccchhHHHhhchHHHHHHHHHHHhhccCc Confidence 33222 1123345544221111100000 0011111110 00111111111 2236789999999999999999999 Q ss_pred eEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC-ceeEEEEecCceeEEee Q lcl|NC_019705. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG-DVISLLPLQSANMDVKL 157 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G-~~~~l~~l~~~~v~~~~ 157 (424) |++|+++.++... ..|++.++|+.+||++||+++||+.++.+++++||||++++|+..| .+.+|||++|++|++.. T Consensus 79 ~~~~~~~~~~~~~---~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~~l~~~~v~~~~ 155 (413) T protein:vir:96 79 IQLMQNGETGDKR---IKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLTPISPYKVTFNV 155 (413) T ss_pred eEEEEecCCCccc---cccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEEEecCceeEEEE Confidence 9999988766432 4689999999999999999999999999999999999999999887 57899999999999999 Q ss_pred cCceEEEEEEeCCceEEecHhHEEEeec-CCC-CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCC Q lcl|NC_019705. 158 VGKKVVYRYQRDSEYAEFSQKEIFHLKG-FGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL 235 (424) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~eiih~r~-~~~-~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~ 235 (424) +++...|.+..++ ..++++||||||+ ++. ++++|+||+.++..++.+..++++++.++|+||++|+|+|+++.. + T Consensus 156 ~~~~~~y~~~~~~--~~~~~~evih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~-l 232 (413) T protein:vir:96 156 SDDDLDYSITFDN--KEYDPSTLLHFVLNPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSD-S 232 (413) T ss_pred cCCeEEEEEeecC--cEEchhhEEEEeccCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC-C Confidence 9888888887665 4689999999995 444 578999999999999999999999999999999999999999865 6 Q ss_pred CHHHHHHHHHHHHHHhCC-cccCcceecCCCc-eeeec-ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccc Q lcl|NC_019705. 236 TEQQRSQVEENFKEIAGG-PVKKRLWILEAGF-STSAI-GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWG 312 (424) Q Consensus 236 ~~~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~-~~~~l-~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~ 312 (424) ++++.+++++.|++.+++ .|+|+++++++|. ++.++ .++++|+||+|.+++++++||++|||||.+||..+ T Consensus 233 ~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~------ 306 (413) T protein:vir:96 233 DELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVGT------ 306 (413) T ss_pred CHHHHHHHHHHHHHHhcCccccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCc------ Confidence 788899999999887665 6899999997665 45565 46899999999999999999999999999997521 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCC Q lcl|NC_019705. 313 SGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNL 392 (424) Q Consensus 313 ~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~ 392 (424) +.+++..+|+++||.|+++.||++|+++|+++ +.+++||++++++.|.+++++++.+++++|++|+||+|+++|+ T Consensus 307 -~~~~~~~~~~~~~l~P~~~~ie~~ln~~ll~~----~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~ 381 (413) T protein:vir:96 307 -YNKDEFNNFINTKIMSIAQVIQQTYNKLIVEE----DMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWVGM 381 (413) T ss_pred -chHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC----CcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 35788999999999999999999999999875 3578999999999999999999999999999999999999999 Q ss_pred CCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 393 PPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 393 ~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ||+||||++++++|++|++++++++..+.+-- T Consensus 382 ~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~dt 413 (413) T protein:vir:96 382 PPDAEMDDLLVLENYLQQKDLVNQKKLIQDET 413 (413) T ss_pred CCCCCcceeeecccccchhhcccccCCCCCCC Confidence 99999999999999999998776543222222 No 48 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=100.00 E-value=3.3e-87 Score=494.71 Aligned_cols=401 Identities=16% Similarity=0.150 Sum_probs=325.1 Q ss_pred CchHHHHHhhccCcccCCc-cccc--------------------hhhcc----ccccccCcccccHHHHhccHHHHHHHH Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTP-NQGS--------------------QTGPV----SAHGHLGDSSINDERILQISTVWRCVS 68 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~-~~~~--------------------~~~~~----~~~~~~~~~~vs~~~~~~~~~v~~~i~ 68 (424) +|||+|+++.+.++..... .... ..... ...+...+..|+.+.|+++|+||+||+ T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~~~a~~~~~v~~~i~ 80 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLATQAYQANGPVFACML 80 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccchhhhhccHHHHHHHH Confidence 9999999998865422111 0000 00000 011223577799999999999999999 Q ss_pred HHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC--------C Q lcl|NC_019705. 69 LISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA--------G 140 (424) Q Consensus 69 ~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~--------G 140 (424) +||++||++||++|++++++ ++...+|++..|+ .+||++||+++||+.++.+++++||||++++|+.. | T Consensus 81 ~Ia~~ia~lp~~~~~~~~~~--~~~~~~~~~~~L~-~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~g 157 (466) T protein:vir:81 81 VRQLVFSSVRFRWQRLRDGK--PSDTFGSRDLQIL-ETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWVD 157 (466) T ss_pred HHHHhhccCceEEEEecCCc--eeeccccHHHHHh-hCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccCc Confidence 99999999999999876433 2334566766655 59999999999999999999999999999999765 4 Q ss_pred ceeEEEEecCceeEEeecCc---eEEEEEEeCC-----ceEEecHhHEEEeecC--CCCCcccCchHHHHHHHHHHHHHH Q lcl|NC_019705. 141 DVISLLPLQSANMDVKLVGK---KVVYRYQRDS-----EYAEFSQKEIFHLKGF--GFTGLVGLSPIAFACKSAGVAVAM 210 (424) Q Consensus 141 ~~~~l~~l~~~~v~~~~~~~---~~~~~~~~~~-----~~~~~~~~eiih~r~~--~~~~~~G~s~i~~~~~~i~~~~~~ 210 (424) .+++|+||+|++|++..+.+ ...|.|+.++ ....++++||||||++ +.++++|+||+..+++++.+..++ T Consensus 158 ~~~~l~~l~~~~v~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~dviHir~~~~~~d~~~G~s~i~~~~~~i~~~~a~ 237 (466) T protein:vir:81 158 VVVEERMVRGGRGELGGGQLGWRKVGYLYTEGGRQSGNESVGFLAEDVVHFAPIPDPLASYRGMSWLTPILREIRADQAM 237 (466) T ss_pred ceeEEEEecCcceEEEEcCCCceEEEEEEEecCcccccceeeeccccEEEEcCCCCcccccccccHHHHHHHHHHHHHHH Confidence 58999999999999887654 3345665543 4567999999999975 368899999999999999999999 Q ss_pred HHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHH Q lcl|NC_019705. 211 EDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVS 289 (424) Q Consensus 211 ~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~ 289 (424) ++++.++|+||++|+|||+++.. +++++++++++.|++.++| +|+|++++|++|++|++++++++|+||+|.++++++ T Consensus 238 ~~~~~~~f~ng~~p~gil~~~~~-l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~ 316 (466) T protein:vir:81 238 SKHQAKFFDNGATVNLVIKHNPM-ADPAAVKKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGET 316 (466) T ss_pred HHHHHHHHhcCCCcceEEecCCC-CCHHHHHHHHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHH Confidence 99999999999999999998865 6888999999999877655 689999999999999999999999999999999999 Q ss_pred HHHHHhCCCHHHhCCCCC-CcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHH Q lcl|NC_019705. 290 ELARFFGVPPHLVGDVEK-STSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASR 368 (424) Q Consensus 290 ~Ia~~fgVPp~~l~~~~~-~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~ 368 (424) +||++|||||.+||+.++ ++.+++|+|++.++|+++||.|++++||++|+++|+++.++..++++||.++++++|.+++ T Consensus 317 ~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~llr~d~~~r 396 (466) T protein:vir:81 317 RIAAAAGVPPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNLSGCIGHVMPDMGPDVRLWYDADDVPFLREDEKDA 396 (466) T ss_pred HHHHHhCCCHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCcceEEEecchhhhccCHHHH Confidence 999999999999998765 4567889999999999999999999999999999999887777889999999999999998 Q ss_pred HHH-------HHHHHhCCCCCHHHHHHHhCCCCCCCCCeeee-cccccchhhcc------------ccCCCcccCC Q lcl|NC_019705. 369 AAF-------MKAMGEAGLRTINEMRRTDNLPPLPGGDVAMR-QSQYVPITDLG------------TNKEPRNNGA 424 (424) Q Consensus 369 ~~~-------~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~-~~n~~~~~~~~------------~~~~~~~~ga 424 (424) +++ +..++++|+ |+||+|+ ++++||.++. +.++.+++... +.+++.+||- T Consensus 397 ~~~~~~~~~~~~~~~~~g~-t~nE~r~-----~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 397 ADIQKVRAETINTLITAGY-EPESVVA-----AVNSGDLRLLKHTGLTSVQLLPPGVSASASSDTPTSGGADDNGN 466 (466) T ss_pred HHHHHHHHHHHHHHHHcCC-Chhhccc-----cccCCccccccCCCcchhhhcccccccccCCCCcccCCCCcCCC Confidence 876 667888996 9999995 4567776544 34444443321 1122222222 No 49 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=100.00 E-value=1.6e-86 Score=490.90 Aligned_cols=419 Identities=15% Similarity=0.199 Sum_probs=320.1 Q ss_pred CCCCcccc-cCCCCCchHHHHHhhccCcccCC---------ccccchhhccccccccCcccccHHHHhccHHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTI-DLRTNNGWWARLQSWFVGGRLVT---------PNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLI 70 (424) Q Consensus 1 ~~~~~~~~-~~~~~~G~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~i 70 (424) -..-.|.+ -|++++=+-+.=..++.+.+..+ +...............++..++.+.++++++|++||++| T Consensus 55 ~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f~~s~es~s~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~I 134 (945) T protein:vir:10 55 NSTVVYSIIIFRKNQVLKKEKIIVPYNHQEPPFKFNLFEYSPESLMYLPSISDPDAFFLINLFRKYRFNNDSKLIKVSEI 134 (945) T ss_pred cceeeeeeeeehhhhHHHhhcccccccccccchhhhhhhccCccceecccccCccceeeehhhhhhhhccHHHHHHHHHH Confidence 01111221 12222221111111111111110 000000001111112234567888999999999999999 Q ss_pred HHhhccCceEEEEeccCCcc----ceeccchHHHHHhhcCCCCCCCHHH----HHHHHHHHHHHcCCeEEEEeeCCCCce Q lcl|NC_019705. 71 STLTACLPLDVFETDQNDNR----KKVDLSNPLARLLRYSPNQYMTAQE----FREAMTMQLCFYGNAYALVDRNSAGDV 142 (424) Q Consensus 71 a~~ia~~~~~v~~~~~~~~~----~~~~~~~~l~~lL~~~pn~~~s~~~----f~~~~~~~~ll~G~a~~~~~r~~~G~~ 142 (424) |++||++|+++|++.++|.. ++....|++..||+ +||++||+++ |++.++.+++++||+|++++|+.+|.+ T Consensus 135 A~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~-rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~i 213 (945) T protein:vir:10 135 PKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLE-RPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNL 213 (945) T ss_pred HhhhccCceEEEEecccCcccccccccccchHHHHHHh-CCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcE Confidence 99999999999998877653 34557789999997 9999999998 556788999999999999999999999 Q ss_pred eEEEEecCceeEEeecCce-EE--EEEEeCCc-eEEecHhHEE-EeecCCCCCc---ccCchHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 143 ISLLPLQSANMDVKLVGKK-VV--YRYQRDSE-YAEFSQKEIF-HLKGFGFTGL---VGLSPIAFACKSAGVAVAMEDQQ 214 (424) Q Consensus 143 ~~l~~l~~~~v~~~~~~~~-~~--~~~~~~~~-~~~~~~~eii-h~r~~~~~~~---~G~s~i~~~~~~i~~~~~~~~~~ 214 (424) ++|+|++|++|++..+++. .. |.+..++. ...++++|+| |+|++++++. +|+||+.+++.++....++++++ T Consensus 214 i~L~pLdPs~Vti~~ddDG~~~y~Yv~~idG~~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~a 293 (945) T protein:vir:10 214 VAITPVDGTTIKPILSEDTGIVVGYVQEVDGAIVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGN 293 (945) T ss_pred EEEEEECCcceEEEEcCCCcEEEEEEEecCCceEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHH Confidence 9999999999998876432 22 33344443 4568888865 6677877764 59999999999999999999999 Q ss_pred HHHHh-cCCCCceeEEcCC---------CCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHH Q lcl|NC_019705. 215 RDFFA-NGAKSPQILSTGE---------KVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASR 284 (424) Q Consensus 215 ~~~~~-ng~~~~~vl~~~~---------~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~ 284 (424) .++|. ||++|+|+|+++. +.+++++.+++++.|++.++|.++|+++++++|++|++++++++|+||+|++ T Consensus 294 ar~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG~NnG~piVLdeGmef~pLs~s~~DaQfLEsr 373 (945) T protein:vir:10 294 LDYYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMGDYTQVPILSGGKFTWIDFKGKRRDMQFKELA 373 (945) T ss_pred HHHHHhCCCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCCcccccceecCCCceEEEccCChhHHHHHHHH Confidence 99996 6889999998653 3468999999999999999988889999999999999999999999999999 Q ss_pred HHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccC Q lcl|NC_019705. 285 KFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGD 364 (424) Q Consensus 285 ~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d 364 (424) ++++++||++|||||.+||..++++ ++|+|++.+.|+++||.|++.+||++||++|+...+...+++.|+ .+...| T Consensus 374 kfs~eeIArAFGVPP~lLG~~e~st--~SNiEqq~~~Fv~~tL~Pil~~IEqeLNrkLl~~~eg~~i~fdFd--~ldl~D 449 (945) T protein:vir:10 374 EFVARKICAVYQVSPQDVGILEGSN--KATAEVMASLTKAKGLEPLMATISKGFDEVVSEFRNEKDIKLWFK--EDDLEK 449 (945) T ss_pred HHHHHHHHHHhCCCHHHcccCCCCC--cchHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccCceeEEEec--chhccC Confidence 9999999999999999999877654 458999999999999999999999999999986655444445554 555678 Q ss_pred HHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecc-cccchhhccccC--------------CCcccCC Q lcl|NC_019705. 365 SASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQS-QYVPITDLGTNK--------------EPRNNGA 424 (424) Q Consensus 365 ~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~-n~~~~~~~~~~~--------------~~~~~ga 424 (424) .+++++++.+++++|+||+||+|+++|+||+||||+++++. |+.|.+...+.+ ++...|+ T Consensus 450 ~ksraEal~kli~sGiLTiNEvRe~lGLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGG 524 (945) T protein:vir:10 450 ERDWWNIIQGQLNTGFRSINEARMEKGLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGG 524 (945) T ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccccccccCCCCcccccCCCCCCCCCCC Confidence 89999999999999999999999999999999999999987 455654332110 1111111 No 50 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=100.00 E-value=1.6e-85 Score=485.45 Aligned_cols=384 Identities=14% Similarity=0.172 Sum_probs=316.9 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +|+|+++++++.+.... .. .....+++....++..|+.+.++++++|++||++||++||++||++++++. + . T Consensus 1 MGl~~~~~~~~~~~~~~--~~-~~~~~~~~~~~~~~~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g--~---~ 72 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAEK--RG-YLDNVLGKSIRYSGVYVTDSNILQSSDVYELLQDISNQMVLADIVVEDEFG--N---E 72 (394) T ss_pred CchhhhhhhhccCCCCc--hh-hhhhhhhcccccCccccChhhhhccHHHHHHHHHHHHhhcccceEEEcCCC--c---c Confidence 99999998776433221 11 233345555667788899999999999999999999999999999997543 2 2 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEEEEEEeCCceE Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ..+|++..|| .+||++||+++||+.++.+++++||+|+++.++..+.+ ..|.+..++... +.|..+ ++ T Consensus 73 ~~~~~~~~Ll-~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~~~--------~~~~~~~~~~~~-~~~~~~--~~ 140 (394) T protein:vir:62 73 IKDDIALQIL-RNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIHLA--------SNVFTELDDNLV-EHFNIG--GH 140 (394) T ss_pred cchhhHHHHh-ccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceeecc--------ccceEEECCceE-EEEeeC--CE Confidence 3467776665 59999999999999999999999999999976543322 345555555433 334333 46 Q ss_pred EecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhC Q lcl|NC_019705. 174 EFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAG 252 (424) Q Consensus 174 ~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~-~~~~~~~~~~~~~~~~~ 252 (424) +|+++||+|+|+++.|+++|+||+..+..++....++++++.++|+||++|+++|+++.... ++++++++++.|++.++ T Consensus 141 ~~~~~eiih~r~~~~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (394) T protein:vir:62 141 EIPPCMIRHVKNIGADHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLE 220 (394) T ss_pred EechhheEEecCcCCCCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhc Confidence 79999999999999999999999999999999999999999999999999999999987653 45667888888888766 Q ss_pred C-cccCcceecCCCc--eeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHH Q lcl|NC_019705. 253 G-PVKKRLWILEAGF--STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 253 ~-~~ag~~~~l~~g~--~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P 329 (424) + .|+|++++++.|. ++++++.+++|+||+|.+++++++||++|||||.+||... ++|+|++.++|+++||.| T Consensus 221 g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-----~sn~e~~~~~~~~~~l~P 295 (394) T protein:vir:62 221 SIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELI-----KEDIEKAMMYIHNKAVRP 295 (394) T ss_pred cccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-----CcCHHHHHHHHHHHHHHH Confidence 5 6889999998776 5568888999999999999999999999999999998643 357899999999999999 Q ss_pred HHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC--CCCCeeeecccc Q lcl|NC_019705. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPL--PGGDVAMRQSQY 407 (424) Q Consensus 330 ~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~--~ggd~~~~~~n~ 407 (424) ++++||++|+++|+++.++..++++||...++.. .++++++.+++++|+||+||+|+++|+||+ |+||++++++|+ T Consensus 296 ~~~~ie~~l~~kll~~~~~~~~~~~fd~~~~~~~--~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~gd~~~~~~n~ 373 (394) T protein:vir:62 296 IMKNFEDHLSLLFYAQNSGKRIKFKINILDFVTY--SNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKESQAIYISNDV 373 (394) T ss_pred HHHHHHHHHhhhhcCccccCceEEEechhhhcCH--HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeccccc Confidence 9999999999999998887777888888888655 467888999999999999999999999999 679999999999 Q ss_pred cchhhccccCCCcccCC Q lcl|NC_019705. 408 VPITDLGTNKEPRNNGA 424 (424) Q Consensus 408 ~~~~~~~~~~~~~~~ga 424 (424) .|++.....+++..+|- T Consensus 374 ~~~~~~~~~~~~~kgge 390 (394) T protein:vir:62 374 TEIGKKEATDGSLGGGE 390 (394) T ss_pred ccccccccccccCCCCC Confidence 99986544333333333 No 51 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=100.00 E-value=4.6e-85 Score=482.93 Aligned_cols=386 Identities=20% Similarity=0.232 Sum_probs=318.8 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +||++++..++..+....... .+...... .....+.++++++++|++||++||++||++||+++++......... T Consensus 1 mg~~~~~~~~~~~~~~~~~~~----~~~~~~~~-~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~~~~~~~ 75 (403) T protein:vir:10 1 MGFKSWITEKLNPGQRIIRDM----EPVSHRTN-RKPFTTGQAYSKIEILNRTANMVIDSAAECSYTVGDKYNIVTYANG 75 (403) T ss_pred Ccchhhhhhccchhhhhhhcc----cccccccC-CcccccHHHHHHHHHHHHHHHHHHHHHhhCceeEeecccccccccc Confidence 888888877775432221101 11111111 1122356889999999999999999999999999987665555555 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEEEEEEeCCceE Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ...|++.++|+.+||++||+++||+.++.+++++||||+++.+ ..|++++++.|++..+.+...+.|..+ .+. T Consensus 76 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~~------~~l~~l~~~~~~v~~~~~~~~~~~~~~-~~~ 148 (403) T protein:vir:10 76 VKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWDG------TSLYHVPAALMQVEADANKFIKKFIFN-NQI 148 (403) T ss_pred cccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEeC------ceeEeecCcceEEEEcCCceEEEEEec-Cce Confidence 5678999999999999999999999999999999999998753 258999999999988877666655443 346 Q ss_pred EecHhHEEEeecCCC-----CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHH Q lcl|NC_019705. 174 EFSQKEIFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK 248 (424) Q Consensus 174 ~~~~~eiih~r~~~~-----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~ 248 (424) .+.++||+|||.++. ++++|+||+.++..+++...++++++.++|+||++|+++|+.+.. +++++++++++.|+ T Consensus 149 ~~~~~eiih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-l~~e~~~~~~~~~~ 227 (403) T protein:vir:10 149 NYRVDEIIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDEI-LNKKLRERKQEELQ 227 (403) T ss_pred eecccceEEecccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC-CCHHHHHHHHHHHH Confidence 788999999996543 789999999999999999999999999999999999999998855 68888999999999 Q ss_pred HHhCC-cccCcceecCCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHH Q lcl|NC_019705. 249 EIAGG-PVKKRLWILEAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 249 ~~~~~-~~ag~~~~l~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~ 325 (424) +.++| +|+|++++|++|++|+++++ ++.|+||+|.+++++++||++|||||.+||.. +++|+|++.+.|+++ T Consensus 228 ~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~-----~~sn~e~~~~~f~~~ 302 (403) T protein:vir:10 228 LDYNPSTGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGG-----NNANIRPNIELFYYM 302 (403) T ss_pred HHhCCcccCcceeecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCC-----CCcCHHHHHHHHHHH Confidence 87655 68999999999999999985 57899999999999999999999999999753 345889999999999 Q ss_pred HHHHHHHHHHHHHHhhccChhhhcccchhhhhhhh--hccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCee Q lcl|NC_019705. 326 TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGL--LRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVA 401 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l--~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~--ggd~~ 401 (424) ||.||++.||++|+++|. ++++||++.+ ++.|.+++++++.+++++|++|+||+|+++|+||+| +||++ T Consensus 303 tl~P~~~~ie~~l~~~L~-------~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~~~~d~~ 375 (403) T protein:vir:10 303 TIIPMLNKLTSSLTFFFG-------YKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDDEQMNKI 375 (403) T ss_pred HHHHHHHHHHHHHHHhcC-------ceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCccccccc Confidence 999999999999999883 2456777655 899999999999999999999999999999999995 69999 Q ss_pred eecccccchhhc-cccCCCcccCC Q lcl|NC_019705. 402 MRQSQYVPITDL-GTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~-~~~~~~~~~ga 424 (424) ++|+|+...... +.++++..+++ T Consensus 376 ~~p~n~~~~~~~~~~~e~~~~~~~ 399 (403) T protein:vir:10 376 RIPANVAGSATGVSGQEGGRPKGS 399 (403) T ss_pred ccccccccccccCCCCcCCCCCCC Confidence 999999765442 33333334444 No 52 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=100.00 E-value=1.1e-84 Score=480.81 Aligned_cols=376 Identities=16% Similarity=0.207 Sum_probs=314.0 Q ss_pred CchHHHHHhhccCcccCCccccchhh-ccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccce Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTG-PVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~ 92 (424) +|||+++. +. +++........... .....+...+..|+.+.|+++++|++||++||++||++||++++. T Consensus 1 Mg~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~v~~~-------- 70 (385) T protein:vir:10 1 MGLLTPRN-FN-KRKAKNMVYPSNPAFFTTTVGGMQLSYVSALSALQNTNVYSVINRIASDVASAHFKTENT-------- 70 (385) T ss_pred Cccccchh-cc-cccccccccccchhhhhhhccccCccccCHHHhhccHHHHHHHHHHHHHHhhCceeeecc-------- Confidence 88887642 22 22111111111111 112223344667899999999999999999999999999998743 Q ss_pred eccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEEEEEEe--CC Q lcl|NC_019705. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQR--DS 170 (424) Q Consensus 93 ~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~--~~ 170 (424) +...+|+ +||++||+++||+.++.+++++||||++++|+ +.+++|+++.+|++..+++...|.+.. ++ T Consensus 71 -----~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~----~~~~~p~~~~~v~~~~~~~~~~~~~~~~~~~ 140 (385) T protein:vir:10 71 -----ATLNRLE-SPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMGIVYTVLESNDR 140 (385) T ss_pred -----chhhhhh-cCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCCceEEEEEcCCceEEEEEEcCCc Confidence 3444554 99999999999999999999999999999876 468999999999999888877777654 34 Q ss_pred ceEEecHhHEEEeecCC---CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHH Q lcl|NC_019705. 171 EYAEFSQKEIFHLKGFG---FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENF 247 (424) Q Consensus 171 ~~~~~~~~eiih~r~~~---~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~ 247 (424) ..++|+++||||||+++ .++++|+||+..++.++....++++++.++|+||++|+|+|+++....++++.+++++.| T Consensus 141 ~~~~~~~~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~ 220 (385) T protein:vir:10 141 PQMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEF 220 (385) T ss_pred eEEEEccccEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHH Confidence 56789999999999865 356899999999999999999999999999999999999999998878889999999999 Q ss_pred HHHhCCcccCcceecCCCceeeecccChhHHHHH-HHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHH Q lcl|NC_019705. 248 KEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMM-ASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 248 ~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~-e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) ++.+++.|+|+++++++|++|++++.++.|+|++ |.+++++++||++|||||.+||+.+.++.+++|+|++.. ++..| T Consensus 221 ~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~-~~~~~ 299 (385) T protein:vir:10 221 EKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKA-TYLAN 299 (385) T ss_pred HHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHH-HHHHH Confidence 9999999999999999999999999999999975 999999999999999999999998777778889886654 55679 Q ss_pred HHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeeeec Q lcl|NC_019705. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAMRQ 404 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--gd~~~~~ 404 (424) |.|+++.||++|+++|+++ .++||+++++++|.+++++.+++++++|++|+||+|+++|++|+|+ ||++..+ T Consensus 300 l~P~~~~ie~~l~~~l~~~------~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~~~~~~~~ 373 (385) T protein:vir:10 300 LNSYVNPIVDELRLKMNAP------DLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFKPL 373 (385) T ss_pred HHHHHHHHHHHHHHhhCCc------eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCCCccccCc Confidence 9999999999999999865 3789999999999999999999999999999999999999999964 5677766 Q ss_pred ccccchhhccccCCCccc Q lcl|NC_019705. 405 SQYVPITDLGTNKEPRNN 422 (424) Q Consensus 405 ~n~~~~~~~~~~~~~~~~ 422 (424) .+.+..++ ..+| T Consensus 374 ~~~~~~g~------~~dn 385 (385) T protein:vir:10 374 TTQVKGGD------EGDN 385 (385) T ss_pred ccccCCCC------CCCC Confidence 66543222 1222 No 53 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=100.00 E-value=3e-84 Score=478.47 Aligned_cols=380 Identities=14% Similarity=0.126 Sum_probs=314.9 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +|||++.+..- ...+.....+.. ...+...+..|+.+.|+++++|++||++||++||++||++. T Consensus 1 M~~f~~~~~~~----~~~~~~~~~~~~-~~~~~~~~~~v~~~~al~~~~V~~~v~~ia~~ia~~p~~~~----------- 64 (397) T protein:vir:38 1 MPLLKLNKSHS----QGFSLNDPDWVN-FLTGGEAQKYVSADTALKNSDIFSLIMQLSGDLAMVRYTSE----------- 64 (397) T ss_pred Ccchhhhhccc----CcccCCchhhhh-hhcCCcCCceechHHhhccHHHHHHHHHHHHHHhhCccccc----------- Confidence 78887653221 111111111111 12233467789999999999999999999999999999642 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeec--CceEEEEEEe--- Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV--GKKVVYRYQR--- 168 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~--~~~~~~~~~~--- 168 (424) |+..++|+.+||++||+++||+.++.+++++||||++++|+..|.+++|+||+|++|++..+ ++...|++.. T Consensus 65 ---~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~~~y~~~~~~~ 141 (397) T protein:vir:38 65 ---SDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDGSGLIYNINFDEP 141 (397) T ss_pred ---ccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEeccc Confidence 44567788899999999999999999999999999999999999999999999999998765 3456666654 Q ss_pred -CCceEEecHhHEEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019705. 169 -DSEYAEFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 169 -~~~~~~~~~~eiih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~ 246 (424) ++..+.|+++||||+|+++.++ ++|+||+..+..++....++++++.++|+||++|+++|+.+.. .++++.+++++. T Consensus 142 ~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~-~~~e~~~~~~~~ 220 (397) T protein:vir:38 142 AIGYMENVPAADVIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKG-GLLDAETRIARS 220 (397) T ss_pred cccceeEecCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC-CCHHHHHHHHHH Confidence 3345789999999999998766 7999999999999999999999999999999999999999876 567788899999 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHH Q lcl|NC_019705. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) |+...++.|+|+++++++|++|++++.++.|+||+|.+++.+++||++|||||.+||+..+++ +|.| +...|+.+| T Consensus 221 ~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~---~~~e-~~~~~~~~~ 296 (397) T protein:vir:38 221 KEISKQIHNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQ---SSIT-QISGQYAKS 296 (397) T ss_pred HHHHhcccccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc---cHHH-HHHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999999865433 4565 456788999 Q ss_pred HHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccc Q lcl|NC_019705. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQ 406 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n 406 (424) |+|++..||++|+++|++..+ |++..+++.|.+++++.+++++++|++|+||+|+++|++|+++||.+..... T Consensus 297 l~P~~~~ie~~ln~~l~~~~~-------~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~ 369 (397) T protein:vir:38 297 LNRYVQAIVGELNDKLHANIS-------ANIRFAIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKDLPDPEKE 369 (397) T ss_pred HHHHHHHHHHHHHHhccChhc-------ccccccccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccccccc Confidence 999999999999999998754 4555667889999999999999999999999999999999999997766555 Q ss_pred ccchhhccccCCCcccCC Q lcl|NC_019705. 407 YVPITDLGTNKEPRNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~~~~~~~ga 424 (424) ..+.......+++.+++. T Consensus 370 ~~~~~~~~~~~~g~~~~~ 387 (397) T protein:vir:38 370 PQQAIQLIQQEGGENDGN 387 (397) T ss_pred ccccccccccccCCCCCC Confidence 554443333333332222 No 54 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=100.00 E-value=4.2e-84 Score=477.67 Aligned_cols=337 Identities=22% Similarity=0.325 Sum_probs=296.4 Q ss_pred hccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCcee Q lcl|NC_019705. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) ||++||+++++++. .+|++.++|+.+||++||+++||+.++.+++++||||++++|+..|.|++||||+|++| T Consensus 1 ia~lp~~~~~~~~~-------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~l~~~~v 73 (348) T protein:vir:93 1 MASLPLKMYEDYKV-------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVV 73 (348) T ss_pred CcccceEeEecCcC-------cccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCCce Confidence 99999999986542 35899999999999999999999999999999999999999999999999999999999 Q ss_pred EEeecCc--eEEEEEE-eCCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEE Q lcl|NC_019705. 154 DVKLVGK--KVVYRYQ-RDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) Q Consensus 154 ~~~~~~~--~~~~~~~-~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~ 229 (424) ++..+++ ...|.+. .++..+.|+++||||||+++ .++++|+||+..+..++.+..++++++.. .++..+.++++ T Consensus 74 ~~~~~~~~~~~~y~~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~--~~~~~~~~i~~ 151 (348) T protein:vir:93 74 EMLIENQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLT--EMQKPDSFMLK 151 (348) T ss_pred EEEEeCCCcEEEEEEEcCCCeEEEEccccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHH--hcCCCceeEEe Confidence 9877643 4455554 34567789999999999875 58899999999999999999999988633 33333455555 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCc Q lcl|NC_019705. 230 TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST 309 (424) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~ 309 (424) .+ ...++++.+++++.|++.++ |+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..+++ T Consensus 152 ~~-~~l~~e~~~~~~~~~~~~~~--n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~- 227 (348) T protein:vir:93 152 YG-SNVSTEKRQQVLEDFKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNT- 227 (348) T ss_pred cC-CCCCHHHHHHHHHHHHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC- Confidence 55 45788999999999998874 678999999999999999999999999999999999999999999999976655 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHH Q lcl|NC_019705. 310 SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRR 388 (424) Q Consensus 310 ~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~ 388 (424) +++|+|++.++|+++||.|+++.||++|+++|+++.++. +++++||.+++++.|.+++++++.+++++|++|+||+|+ T Consensus 228 -~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~ 306 (348) T protein:vir:93 228 -NFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIRE 306 (348) T ss_pred -CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHH Confidence 456999999999999999999999999999999998864 588999999999999999999999999999999999999 Q ss_pred HhCCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 389 TDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 389 ~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++|+||+||||++++++|++|++..+++++...+|- T Consensus 307 ~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~gg~ 342 (348) T protein:vir:93 307 WEDLPPVEGGDKPLISGDLYPIDTPLELRKSLKGGD 342 (348) T ss_pred HhCCCCCCCcCeEeecccccccccchhhcccccCCC Confidence 999999999999999999999988766554333333 No 55 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=100.00 E-value=1.4e-83 Score=474.77 Aligned_cols=376 Identities=15% Similarity=0.213 Sum_probs=317.4 Q ss_pred CchHHHHHhhccCcccCCccccchhhcc-ccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccce Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPV-SAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~ 92 (424) +|||+++. |.+.+............+ ...+...+..|+.++|+++++|++||++||+++|++||++++ T Consensus 1 Mg~~~~~~--~~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~--------- 69 (383) T protein:vir:10 1 MGLLTPKN--FSKRNAKNMVYPSNPAFFTTTVGGMQLSYVSALSALQNTNVYSVINRIASDVSSAHFKTEN--------- 69 (383) T ss_pred CCcccccc--cccccccccccccchhhhhhhccCccccccchhHhhcchHHHHHHHHHHHhhccCceeecc--------- Confidence 78887642 223222222222221111 222344566789999999999999999999999999999864 Q ss_pred eccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEEEEEEe--CC Q lcl|NC_019705. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQR--DS 170 (424) Q Consensus 93 ~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~--~~ 170 (424) |+...+|+ +||++||+++||+.++.+++++||||++++|+ +.+++|+++.+|++..+.+...|.+.. ++ T Consensus 70 ----~~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~----~~~~~p~~~~~v~~~~~~~~~~~~~~~~~~~ 140 (383) T protein:vir:10 70 ----TATLNRLE-SPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMGIVYTVLESNDR 140 (383) T ss_pred ----cchhhhhh-CCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCcceEEEEEcCCceEEEEEEcCCc Confidence 23445565 99999999999999999999999999999875 467999999999988887776666543 45 Q ss_pred ceEEecHhHEEEeecCCC---CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHH Q lcl|NC_019705. 171 EYAEFSQKEIFHLKGFGF---TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENF 247 (424) Q Consensus 171 ~~~~~~~~eiih~r~~~~---~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~ 247 (424) ..+.|+++||+|||+++. ++++|+||+.++..++....++++++.++|+||++|+|+|+++....++++.+++++.| T Consensus 141 ~~~~~~~~evih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~ 220 (383) T protein:vir:10 141 PKMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEF 220 (383) T ss_pred eEEEEcccceEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHH Confidence 678899999999998764 45789999999999999999999999999999999999999998888899999999999 Q ss_pred HHHhCCcccCcceecCCCceeeecccChhHHHHH-HHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHH Q lcl|NC_019705. 248 KEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMM-ASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 248 ~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~-e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) ++.+++.|+|+++++++|++|++++.+++|+|++ |.+++++++||++|||||.+||..+.++.+++|+|++...| ..| T Consensus 221 ~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~~~~-~~~ 299 (383) T protein:vir:10 221 EKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATY-LAN 299 (383) T ss_pred HHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHHHHH-HHH Confidence 9999989999999999999999999999999975 89999999999999999999998777777888888876655 579 Q ss_pred HHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccc Q lcl|NC_019705. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQ 406 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n 406 (424) |.|+++.||++|+++|+.+ +++||++.+++.|.+++++.+.+++++|++|+||+|+++|++|+|+||.+....+ T Consensus 300 l~P~~~~ie~~l~~~l~~~------~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~ 373 (383) T protein:vir:10 300 LNSYVNPIVDELRLKMNAP------DLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFKPL 373 (383) T ss_pred HHHHHHHHHHHHHHhhCCc------eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCcccccCCC Confidence 9999999999999999764 4789999999999999999999999999999999999999999999998877666 Q ss_pred ccchhhccccC Q lcl|NC_019705. 407 YVPITDLGTNK 417 (424) Q Consensus 407 ~~~~~~~~~~~ 417 (424) ..++.. ++.| T Consensus 374 ~~~~~g-Gd~e 383 (383) T protein:vir:10 374 TNETKG-GDDK 383 (383) T ss_pred cccCCC-CCCC Confidence 655442 2222 No 56 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=100.00 E-value=2.4e-83 Score=473.56 Aligned_cols=352 Identities=16% Similarity=0.213 Sum_probs=303.4 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +|||+++ .+++...+..+. ...........+..|+.+.|+++++||+||++||++||++|+. T Consensus 1 M~~~~~f----~~r~~~~~~~~~-~~~~~~~~~~~~~~v~~~~al~~~av~~cv~~ia~~ia~~p~~------------- 62 (359) T protein:vir:10 1 MSILNPF----ERRSSITPNNYY-PFMVQNGSIVPNSLVDATEALKNSDLYAVTSLISSDIAGTRFI------------- 62 (359) T ss_pred Ccccchh----hccccCCCCcch-hhhhccccccCCcccCHHHhhcchHHHHHHHHHHHhhhcCccc------------- Confidence 8888754 233322222221 1222333456777899999999999999999999999999983 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEEEEEEe--CCc Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQR--DSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~--~~~ 171 (424) .+++.++|+.+||++||+++||+.++.+++++||||++++|+.+|.+.+||||+|++|++..+++...|.+.. ++. T Consensus 63 --~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~y~~~~~~~~~ 140 (359) T protein:vir:10 63 --GNQVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDDTLTYEVNQFDDYP 140 (359) T ss_pred --cchHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCCeEEEEEEecCCce Confidence 2567778888999999999999999999999999999999999999999999999999999888888777753 456 Q ss_pred eEEecHhHEEEeecCCC-----CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019705. 172 YAEFSQKEIFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 172 ~~~~~~~eiih~r~~~~-----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~ 246 (424) ...++++||+|||+++. ++++|+||+.+++.++....+++++..++|+||++|+|+|+++.+.+++++++++++. T Consensus 141 ~~~~~~~evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~ 220 (359) T protein:vir:10 141 SAKYNASEMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKE 220 (359) T ss_pred EEEEcccceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHH Confidence 78899999999997653 7889999999999999999999999999999999999999998878899999999999 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHH Q lcl|NC_019705. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) |+++++++|+|++++|++|++|++++++++|+||+|.+++++++||++|||||++||+.++.+.+++++|++...|+..+ T Consensus 221 ~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~ 300 (359) T protein:vir:10 221 FEKANGGNNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRF 300 (359) T ss_pred HHHHhCccccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999988777778889999888999999 Q ss_pred HHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC Q lcl|NC_019705. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP 396 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ 396 (424) |.|++..|++.|+.++. ++...+.+.|...+...+.+++++|++|+||+|+++|++|+= T Consensus 301 l~p~~~~l~~~l~~~~~-----------~~~~~~~~~d~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 301 IEPLISELRIKCDSSIG-----------VDMSPITDYSNSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred HHHHHHHHHHHhhhhhc-----------ccchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 99988888888876653 333344444455556667889999999999999999999985 No 57 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=100.00 E-value=7.4e-82 Score=465.36 Aligned_cols=420 Identities=12% Similarity=0.128 Sum_probs=314.5 Q ss_pred CCCCcccccCCCCC-chHHH-------HHhhccCcccCCccccchhhccc-----------------cccccC-cccccH Q lcl|NC_019705. 1 MEEPKYTIDLRTNN-GWWAR-------LQSWFVGGRLVTPNQGSQTGPVS-----------------AHGHLG-DSSIND 54 (424) Q Consensus 1 ~~~~~~~~~~~~~~-G~~~~-------~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~-~~~vs~ 54 (424) -+.-.|.|.+++.. -+.+. +--...+...+..........+. ...... ...+.. T Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~iv~~ 99 (574) T protein:vir:80 20 RNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKKFGNNIILNA 99 (574) T ss_pred HhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCCcccHHHHHHhhccChhHHH Confidence 22234666666544 11110 10001000011100000000000 000001 111222 Q ss_pred HHHhccHHHHHHHHHHHHhhccCceEEEEeccCCc--cceeccchHHHHHhhc---CCCCCC-CHHHHHHHHHHHHHHcC Q lcl|NC_019705. 55 ERILQISTVWRCVSLISTLTACLPLDVFETDQNDN--RKKVDLSNPLARLLRY---SPNQYM-TAQEFREAMTMQLCFYG 128 (424) Q Consensus 55 ~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~--~~~~~~~~~l~~lL~~---~pn~~~-s~~~f~~~~~~~~ll~G 128 (424) ...+....|++|+.+|+.++|++||+|++++.++. .++....|++..+|.. .|||++ |+.+|++.++.+++++| T Consensus 100 ~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~G 179 (574) T protein:vir:80 100 IINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYD 179 (574) T ss_pred HHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcC Confidence 33345567888888888899999999998765432 3445667899988864 356665 78899999999999999 Q ss_pred CeEEEEeeCCCCceeEEEEecCceeEEeecCc-------eEEEEEEeCCceEEecHhHEEEeecCCC----CCcccCchH Q lcl|NC_019705. 129 NAYALVDRNSAGDVISLLPLQSANMDVKLVGK-------KVVYRYQRDSEYAEFSQKEIFHLKGFGF----TGLVGLSPI 197 (424) Q Consensus 129 ~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~-------~~~~~~~~~~~~~~~~~~eiih~r~~~~----~~~~G~s~i 197 (424) |+|++++|+..|.|++||||+|.+|++..+.. ..+|++..++....|+++||||+|+.+. ++.+|+||+ T Consensus 180 nayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~~~~~~~~y~~~~~g~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi 259 (574) T protein:vir:80 180 QVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGKLIKNGERFVQVIDNRIVAKFNERELAFAVRNPRADIEVGQYGYPEL 259 (574) T ss_pred CeEEEEEECCCCcEEEEEEEcCceeEEEEcCccccccCceEEEEEeCCceEEEEccccEEEEeccCCCCcccccccccHH Confidence 99999999999999999999999999987643 3455666677778899999999997543 367899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhCC-cccCcce-ecCCCceeeecccC Q lcl|NC_019705. 198 AFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAGG-PVKKRLW-ILEAGFSTSAIGVT 274 (424) Q Consensus 198 ~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~-~~~~~~~~~~~~~~~~~~~~-~~ag~~~-~l~~g~~~~~l~~~ 274 (424) .+++.++....++++++.++|+||++|+|||+++.+ .+++++.+++++.|++.++| .|+|+++ ++++|++|++++++ T Consensus 260 ~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s 339 (574) T protein:vir:80 260 EIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPS 339 (574) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCC Confidence 999999999999999999999999999999998654 47999999999999987665 6899875 55789999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCc--------ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChh Q lcl|NC_019705. 275 PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST--------SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK 346 (424) Q Consensus 275 ~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~--------~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~ 346 (424) ++|+||+|++++++++||++|||||.+||..++++ .+++|+|++.+.|+++||.|++..||++|+++|++.. T Consensus 340 ~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~ 419 (574) T protein:vir:80 340 ANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAEF 419 (574) T ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhc Confidence 99999999999999999999999999999987654 3567899999999999999999999999999999876 Q ss_pred hhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 347 DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 347 ~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) +. .++++|+..+++..+. .. .+..++.+|+||+||+|+++|+||+||||++++|+|+++++.....+......+ T Consensus 420 ~~-~~~~~f~~~d~~~~~~--~~-~~~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~ 493 (574) T protein:vir:80 420 GE-KYQFQFRGGDLSAQLD--KL-KIIEQEGKVFRTVNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQRS 493 (574) T ss_pred CC-ceEEEecccchhhHHH--HH-HHHHHHhCCccCHHHHHHHhCCCCCCCCCEeeeccceeecccccccccCCccch Confidence 64 3677888766644322 22 234578899999999999999999999999999999999876533221111111 No 58 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=100.00 E-value=3.7e-81 Score=461.55 Aligned_cols=363 Identities=13% Similarity=0.148 Sum_probs=297.5 Q ss_pred hHHHHHhhccCcccC----Cccccc----hhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccC Q lcl|NC_019705. 16 WWARLQSWFVGGRLV----TPNQGS----QTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 16 ~~~~~~~~~~~~~~~----~~~~~~----~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~ 87 (424) |+-.++++|++.... .+..+. ....+.......+..|+.+.|+++++|++||++||++||++|++++++.. T Consensus 1 m~m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~~- 79 (392) T protein:vir:74 1 MILPILNFINQTNDPPEAGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN- 79 (392) T ss_pred CcchhhhhhhcccCcccccccccccccCchhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccchh- Confidence 444455555543221 111111 11123344455678899999999999999999999999999999987542 Q ss_pred CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeec--CceEEEE Q lcl|NC_019705. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV--GKKVVYR 165 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~--~~~~~~~ 165 (424) ..|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|+||+|++|++..+ ++...|. T Consensus 80 -------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~y~ 146 (392) T protein:vir:74 80 -------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYN 146 (392) T ss_pred -------------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEE Confidence 2366799999999999999999999999999999999999999999999999998875 4556677 Q ss_pred EEeCC----ceEEecHhHEEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHH Q lcl|NC_019705. 166 YQRDS----EYAEFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 166 ~~~~~----~~~~~~~~eiih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~ 240 (424) +...+ ....++++||||||++++++ ++|+||+.++..++.+..++++++.++|+||++|+|+|+++.+...++ T Consensus 147 ~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~-- 224 (392) T protein:vir:74 147 ITFDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSD-- 224 (392) T ss_pred EEecCCccceeEEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchH-- Confidence 66543 35679999999999999887 789999999999999999999999999999999999999987654332 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHH Q lcl|NC_019705. 241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 241 ~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~ 320 (424) +..+++.+.+.+..|+|++++|++|++|++++++++|+||+|.+++.+++||++|||||.+||+..+++ +.+++.+ T Consensus 225 ~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~----~~~e~~~ 300 (392) T protein:vir:74 225 KDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ----SSIQQIS 300 (392) T ss_pred HHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc----cHHHHHH Confidence 222344445667778999999999999999999999999999999999999999999999999765433 3467789 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh---------- Q lcl|NC_019705. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTD---------- 390 (424) Q Consensus 321 ~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~---------- 390 (424) +|+++||.|+++.||++|+++|++. ++||...+++.|.+.+++.+..++++|++|+||+|+++ T Consensus 301 ~~~~~~l~p~~~~ie~~l~~~l~~~-------~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~ 373 (392) T protein:vir:74 301 GMYASALNRYLRPAISELEYKLSDH-------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDL 373 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHhccch-------hcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCcccc Confidence 9999999999999999999999865 46888899999999999999999999999999999987 Q ss_pred ----CCCCCCCCCeeeecccccc Q lcl|NC_019705. 391 ----NLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 391 ----g~~p~~ggd~~~~~~n~~~ 409 (424) |+||++|||+ .+.+| T Consensus 374 r~~enl~~~~~Gd~----~~p~p 392 (392) T protein:vir:74 374 PAPENTNKKTTGQS----NEPVP 392 (392) T ss_pred chhcCCCCCCCCCC----CCCCC Confidence 5555555543 11222 No 59 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=100.00 E-value=5.5e-81 Score=460.59 Aligned_cols=373 Identities=13% Similarity=0.168 Sum_probs=307.8 Q ss_pred CchHHHHHhhccCcccCCcccc----chhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCc Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQG----SQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDN 89 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~ 89 (424) +|||++++.. +.+.+... ....+.......+++.|+.+.++++|+|++||++||++||++|+++++.. T Consensus 1 M~~f~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~~~---- 72 (386) T protein:vir:48 1 MPIFNITNLA----TESPPISQGGFFDITDPDFLSTLNGSEWVSAESALRNSDLFSIINQLSNDLATVKLTASRKQ---- 72 (386) T ss_pred Cccccccccc----ccccccccccccccccchhcccccCCceechhhhhcchHHHHHHHHHHHhhccCceeeccch---- Confidence 7888764322 22121111 11122223345578889999999999999999999999999999998532 Q ss_pred cceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecC--ceEEEEEE Q lcl|NC_019705. 90 RKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG--KKVVYRYQ 167 (424) Q Consensus 90 ~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~--~~~~~~~~ 167 (424) .+.|+.+||++||+++||+.++.+++++||||++++|+..|.+++|+||+|++|++..+. +..+|.+. T Consensus 73 ----------~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~ 142 (386) T protein:vir:48 73 ----------LQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGIYYNIT 142 (386) T ss_pred ----------hHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceEEEEEE Confidence 335677999999999999999999999999999999999999999999999999988764 45566665 Q ss_pred eCC----ceEEecHhHEEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHH Q lcl|NC_019705. 168 RDS----EYAEFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQ 242 (424) Q Consensus 168 ~~~----~~~~~~~~eiih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~ 242 (424) .++ ..+.|+++||||+|++++++ ++|+||+..+..++.+..++++++.++|+||++|+++|+.+.. .+++++++ T Consensus 143 ~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~-~~~e~~~~ 221 (386) T protein:vir:48 143 FDDPRIPPKQHVPQGDVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGG-GLLDFKTK 221 (386) T ss_pred ecCccccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC-CCHHHHHH Confidence 443 34679999999999998876 8999999999999999999999999999999999999999876 46666677 Q ss_pred HHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHH Q lcl|NC_019705. 243 VEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF 322 (424) Q Consensus 243 ~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~ 322 (424) +++.|... ..++|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||... +++|++++.++| T Consensus 222 ~~~~~~~~--~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~----~~~~~e~~~~~~ 295 (386) T protein:vir:48 222 LSRSRQAM--KQMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQG----DQQSSLEMSLDL 295 (386) T ss_pred HHHHHHHh--hcCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC----CcccHHHHHHHH Confidence 77777653 357899999999999999999999999999999999999999999999998532 345889999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeee Q lcl|NC_019705. 323 LQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAM 402 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~ 402 (424) ++.||.|+++.||++|+++|++.. ++|...++..|...+...+.+++++|++|+||+|+++|++|+++||... T Consensus 296 ~~~~l~P~~~~ie~~l~~~l~~~~-------~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~~~~~ 368 (386) T protein:vir:48 296 YNKAVSRYLRPFLSELSQKLSCDV-------DADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEILPKELPE 368 (386) T ss_pred HHHHHHHHHHHHHHHHHHhhcchh-------hcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCCccchh Confidence 999999999999999999998754 4666667778888899999999999999999999999999998877543 Q ss_pred -ecccccchhhccccCCCcccCC Q lcl|NC_019705. 403 -RQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 403 -~~~n~~~~~~~~~~~~~~~~ga 424 (424) ...|..|++. +.++|. T Consensus 369 ~~~~~~~~~~g------Gd~~~~ 385 (386) T protein:vir:48 369 GENPNKTTLKG------GEINGE 385 (386) T ss_pred hcCCCCCccCC------CCCCCC Confidence 3345555432 222222 No 60 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=100.00 E-value=1.8e-81 Score=463.25 Aligned_cols=367 Identities=14% Similarity=0.192 Sum_probs=310.2 Q ss_pred CchHHHHHhhccCcccCCccccc----hhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCc Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGS----QTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDN 89 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~ 89 (424) +|||+++. .+ ....+.... ...+.....+.++..|+.+.|+++++|++||++||++||++||+++++.. T Consensus 1 Mglf~~~~---~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~V~~~i~~Ia~~ia~l~~~~~~~~~--- 73 (384) T protein:vir:49 1 MPIFNITN---LA-TESPPSNQDSFFDITDPEFLDALNGSEWVSAETALKNSDLFSIISQLSNDLATAKITTSRKQL--- 73 (384) T ss_pred Cccccccc---cC-cccccccchhhccccchhhcccccCCceechhhhhccHHHHHHHHHHHHHHhhCceeeecchh--- Confidence 78887642 11 111211111 11111222345677899999999999999999999999999999986542 Q ss_pred cceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeec--CceEEEEEE Q lcl|NC_019705. 90 RKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV--GKKVVYRYQ 167 (424) Q Consensus 90 ~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~--~~~~~~~~~ 167 (424) ..|+.+||++||+++|++.++.+++++||||++++|+..|.+++|+||+|++|++..+ ++...|.|. T Consensus 74 -----------~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~ 142 (384) T protein:vir:49 74 -----------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQNGLYYNIT 142 (384) T ss_pred -----------hhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEE Confidence 2367799999999999999999999999999999999999999999999999998765 355667766 Q ss_pred eC----CceEEecHhHEEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHH Q lcl|NC_019705. 168 RD----SEYAEFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQ 242 (424) Q Consensus 168 ~~----~~~~~~~~~eiih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~ 242 (424) .+ +..+.|+++||||+|++++++ ++|+||+.++..++.+..++++++.++|+||++|+++|+++....+++.. T Consensus 143 ~~~~~~~~~~~~~~~eVih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~-- 220 (384) T protein:vir:49 143 FDDPRIPPKQHVPQGDILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKT-- 220 (384) T ss_pred ecCccccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHH-- Confidence 43 345789999999999998776 89999999999999999999999999999999999999998776554433 Q ss_pred HHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHH Q lcl|NC_019705. 243 VEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF 322 (424) Q Consensus 243 ~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~ 322 (424) +++.+.+.+++|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||...++++++++++++...| T Consensus 221 -~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~ 299 (384) T protein:vir:49 221 -KQSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKA 299 (384) T ss_pred -HHHHHHHhcccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHH Confidence 4455566777899999999999999999999999999999999999999999999999999877777888999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccC---hh-hhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019705. 323 LQYTLQPYISRWENSIQRWLIP---AK-DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~l~~---~~-~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg 398 (424) ++.++.|++..|+++|+++|.. .. .....+++|+++.+++.+..++.++...+.+.|+++ ||+|+.+|++|+||| T Consensus 300 i~~~l~pi~~~i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~-ne~r~~~~~~p~~gG 378 (384) T protein:vir:49 300 VSRFLRPFVSELSKKLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILP-KDLPEGETDSTLKGG 378 (384) T ss_pred HHHHHHHHHHHHHHHhchhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCC-hhHHHHcCCCCCCCC Confidence 9999999999999999998843 22 233477899999999999999999999999999986 999999999999987 Q ss_pred C--eee Q lcl|NC_019705. 399 D--VAM 402 (424) Q Consensus 399 d--~~~ 402 (424) | +.+ T Consensus 379 d~~~~~ 384 (384) T protein:vir:49 379 ETNEQY 384 (384) T ss_pred CCCCCC Confidence 4 444 No 61 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=100.00 E-value=5e-81 Score=460.84 Aligned_cols=372 Identities=14% Similarity=0.152 Sum_probs=302.9 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +|||+++++. +. .+.. +.....+..|+.+.|+++++|++||++||++||++||++|+++. T Consensus 1 Mg~f~~lf~~---~~--~~~~--------~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~------- 60 (395) T protein:vir:10 1 MSILEKIFKT---RK--DITY--------MLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR------- 60 (395) T ss_pred Cchhhhhhcc---Cc--cccc--------cccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCc------- Confidence 8889887432 21 1111 11223456678889999999999999999999999999997532 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEE--EEEEeCCc Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVV--YRYQRDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~--~~~~~~~~ 171 (424) ...|++.++|+.+||++||+++||+.++.++++.|++|+++.++. .++++++..+++........ +.+...+. T Consensus 61 ~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (395) T protein:vir:10 61 IQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFKDVTVKDYTY 135 (395) T ss_pred cccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCcceeEEEEcCcee Confidence 346899999999999999999999999999999999998876543 36777766666544433332 33333334 Q ss_pred eEEecHhHEEEeecCCCC-CcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019705. 172 YAEFSQKEIFHLKGFGFT-GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 172 ~~~~~~~eiih~r~~~~~-~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) ...++++||||+|+++.+ ..+|.||+..+..++.... +.|.+|+.++|+|+.+...+++++++++++.|+++ T Consensus 136 ~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~ 208 (395) T protein:vir:10 136 QRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKL 208 (395) T ss_pred eeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHH Confidence 578999999999987754 5789999999988876543 35678888999999998888999999999999999 Q ss_pred hCCcccCc--ceecCCCceeeecccChhHH-----HHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHH Q lcl|NC_019705. 251 AGGPVKKR--LWILEAGFSTSAIGVTPQDA-----EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 251 ~~~~~ag~--~~~l~~g~~~~~l~~~~~d~-----~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~ 323 (424) +++.++++ ++++++|++|+++++++.++ ||+|.+++++++||++|||||.+|++ +++|+|++.++|+ T Consensus 209 ~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~------~~sn~e~~~~~~~ 282 (395) T protein:vir:10 209 FNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG------ETADLEKNTLVFE 282 (395) T ss_pred hccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------cccCHHHHHHHHH Confidence 88876655 55679999999999887765 99999999999999999999999973 3458899999999 Q ss_pred HHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC--Cee Q lcl|NC_019705. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG--DVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg--d~~ 401 (424) ++||.|++.+||++|+++|+++.++.. +++||++.+++.|.+++++++.+++++|+||+||+|+++|+||+||| |++ T Consensus 283 ~~~l~P~~~~ie~~l~~kL~~~~~~~~-~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~ 361 (395) T protein:vir:10 283 KFCLTPLLKKIQNELNAKLITQSMYLK-DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEY 361 (395) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhhcc-cceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 999999999999999999999876543 45899999999999999999999999999999999999999999876 999 Q ss_pred eecccccchhhccccCCCcccCC Q lcl|NC_019705. 402 MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++|+|+++++...+++.+.+..+ T Consensus 362 ~~~~n~~~~~~~~~~~~~~~~~~ 384 (395) T protein:vir:10 362 LITKNYEKANSGENDEKEKDENT 384 (395) T ss_pred eeccccccccccccccCcccccc Confidence 99999999886544332222211 No 62 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=100.00 E-value=5e-81 Score=460.84 Aligned_cols=372 Identities=14% Similarity=0.152 Sum_probs=302.9 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +|||+++++. +. .+.. +.....+..|+.+.|+++++|++||++||++||++||++|+++. T Consensus 1 Mg~f~~lf~~---~~--~~~~--------~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~------- 60 (395) T protein:vir:10 1 MSILEKIFKT---RK--DITY--------MLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR------- 60 (395) T ss_pred Cchhhhhhcc---Cc--cccc--------cccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCc------- Confidence 8889887432 21 1111 11223456678889999999999999999999999999997532 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEE--EEEEeCCc Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVV--YRYQRDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~--~~~~~~~~ 171 (424) ...|++.++|+.+||++||+++||+.++.++++.|++|+++.++. .++++++..+++........ +.+...+. T Consensus 61 ~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (395) T protein:vir:10 61 IQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFKDVTVKDYTY 135 (395) T ss_pred cccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCcceeEEEEcCcee Confidence 346899999999999999999999999999999999998876543 36777766666544433332 33333334 Q ss_pred eEEecHhHEEEeecCCCC-CcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019705. 172 YAEFSQKEIFHLKGFGFT-GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 172 ~~~~~~~eiih~r~~~~~-~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) ...++++||||+|+++.+ ..+|.||+..+..++.... +.|.+|+.++|+|+.+...+++++++++++.|+++ T Consensus 136 ~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~ 208 (395) T protein:vir:10 136 QRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKL 208 (395) T ss_pred eeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHH Confidence 578999999999987754 5789999999988876543 35678888999999998888999999999999999 Q ss_pred hCCcccCc--ceecCCCceeeecccChhHH-----HHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHH Q lcl|NC_019705. 251 AGGPVKKR--LWILEAGFSTSAIGVTPQDA-----EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 251 ~~~~~ag~--~~~l~~g~~~~~l~~~~~d~-----~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~ 323 (424) +++.++++ ++++++|++|+++++++.++ ||+|.+++++++||++|||||.+|++ +++|+|++.++|+ T Consensus 209 ~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~------~~sn~e~~~~~~~ 282 (395) T protein:vir:10 209 FNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG------ETADLEKNTLVFE 282 (395) T ss_pred hccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------cccCHHHHHHHHH Confidence 88876655 55679999999999887765 99999999999999999999999973 3458899999999 Q ss_pred HHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC--Cee Q lcl|NC_019705. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG--DVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg--d~~ 401 (424) ++||.|++.+||++|+++|+++.++.. +++||++.+++.|.+++++++.+++++|+||+||+|+++|+||+||| |++ T Consensus 283 ~~~l~P~~~~ie~~l~~kL~~~~~~~~-~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~ 361 (395) T protein:vir:10 283 KFCLTPLLKKIQNELNAKLITQSMYLK-DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEY 361 (395) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhhcc-cceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 999999999999999999999876543 45899999999999999999999999999999999999999999876 999 Q ss_pred eecccccchhhccccCCCcccCC Q lcl|NC_019705. 402 MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++|+|+++++...+++.+.+..+ T Consensus 362 ~~~~n~~~~~~~~~~~~~~~~~~ 384 (395) T protein:vir:10 362 LITKNYEKANSGENDEKEKDENT 384 (395) T ss_pred eeccccccccccccccCcccccc Confidence 99999999886544332222211 No 63 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=100.00 E-value=5e-81 Score=460.84 Aligned_cols=372 Identities=14% Similarity=0.152 Sum_probs=302.9 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +|||+++++. +. .+.. +.....+..|+.+.|+++++|++||++||++||++||++|+++. T Consensus 1 Mg~f~~lf~~---~~--~~~~--------~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~------- 60 (395) T protein:vir:95 1 MSILEKIFKT---RK--DITY--------MLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR------- 60 (395) T ss_pred Cchhhhhhcc---Cc--cccc--------cccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCc------- Confidence 8889887432 21 1111 11223456678889999999999999999999999999997532 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEE--EEEEeCCc Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVV--YRYQRDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~--~~~~~~~~ 171 (424) ...|++.++|+.+||++||+++||+.++.++++.|++|+++.++. .++++++..+++........ +.+...+. T Consensus 61 ~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (395) T protein:vir:95 61 IQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFKDVTVKDYTY 135 (395) T ss_pred cccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCcceeEEEEcCcee Confidence 346899999999999999999999999999999999998876543 36777766666544433332 33333334 Q ss_pred eEEecHhHEEEeecCCCC-CcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019705. 172 YAEFSQKEIFHLKGFGFT-GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 172 ~~~~~~~eiih~r~~~~~-~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) ...++++||||+|+++.+ ..+|.||+..+..++.... +.|.+|+.++|+|+.+...+++++++++++.|+++ T Consensus 136 ~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~ 208 (395) T protein:vir:95 136 QRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKL 208 (395) T ss_pred eeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHH Confidence 578999999999987754 5789999999988876543 35678888999999998888999999999999999 Q ss_pred hCCcccCc--ceecCCCceeeecccChhHH-----HHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHH Q lcl|NC_019705. 251 AGGPVKKR--LWILEAGFSTSAIGVTPQDA-----EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 251 ~~~~~ag~--~~~l~~g~~~~~l~~~~~d~-----~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~ 323 (424) +++.++++ ++++++|++|+++++++.++ ||+|.+++++++||++|||||.+|++ +++|+|++.++|+ T Consensus 209 ~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~------~~sn~e~~~~~~~ 282 (395) T protein:vir:95 209 FNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG------ETADLEKNTLVFE 282 (395) T ss_pred hccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------cccCHHHHHHHHH Confidence 88876655 55679999999999887765 99999999999999999999999973 3458899999999 Q ss_pred HHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC--Cee Q lcl|NC_019705. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG--DVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg--d~~ 401 (424) ++||.|++.+||++|+++|+++.++.. +++||++.+++.|.+++++++.+++++|+||+||+|+++|+||+||| |++ T Consensus 283 ~~~l~P~~~~ie~~l~~kL~~~~~~~~-~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~ 361 (395) T protein:vir:95 283 KFCLTPLLKKIQNELNAKLITQSMYLK-DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEY 361 (395) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhhcc-cceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 999999999999999999999876543 45899999999999999999999999999999999999999999876 999 Q ss_pred eecccccchhhccccCCCcccCC Q lcl|NC_019705. 402 MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++|+|+++++...+++.+.+..+ T Consensus 362 ~~~~n~~~~~~~~~~~~~~~~~~ 384 (395) T protein:vir:95 362 LITKNYEKANSGENDEKEKDENT 384 (395) T ss_pred eeccccccccccccccCcccccc Confidence 99999999886544332222211 No 64 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=100.00 E-value=2.2e-80 Score=457.32 Aligned_cols=410 Identities=12% Similarity=0.117 Sum_probs=304.6 Q ss_pred CCCCCchHHHHH-------hhccC-------------------------cccC--CccccchhhccccccccCcccc--- Q lcl|NC_019705. 10 LRTNNGWWARLQ-------SWFVG-------------------------GRLV--TPNQGSQTGPVSAHGHLGDSSI--- 52 (424) Q Consensus 10 ~~~~~G~~~~~~-------~~~~~-------------------------~~~~--~~~~~~~~~~~~~~~~~~~~~v--- 52 (424) +..-+|+|++++ .++.- ...+ .+..+.....-+........+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~~~~~~l 80 (551) T protein:vir:80 1 MKNKLGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDL 80 (551) T ss_pred CchhhhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccceecCcccccCccccChhHH Confidence 566778888887 11110 0000 0000000000000000011110 Q ss_pred --cHHHHhccHHHHHHHHHHHHhhccC-----------ceEEEEeccCC--ccceeccchHHHHHhhcCCCCCC-----C Q lcl|NC_019705. 53 --NDERILQISTVWRCVSLISTLTACL-----------PLDVFETDQND--NRKKVDLSNPLARLLRYSPNQYM-----T 112 (424) Q Consensus 53 --s~~~~~~~~~v~~~i~~ia~~ia~~-----------~~~v~~~~~~~--~~~~~~~~~~l~~lL~~~pn~~~-----s 112 (424) -.+.+..+|+|++||++||++||++ +|.+.-++.+. ..+.....+.+..+|. +||+++ | T Consensus 81 ~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~-~pn~~~~p~~~s 159 (551) T protein:vir:80 81 HGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIE-KTGVDNDINRDS 159 (551) T ss_pred HHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHH-hcCCCCCCccch Confidence 1245667899999999999999974 45443222211 1111122234555665 899874 7 Q ss_pred HHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCce-------EEEEEEeCCceEEecHhHEEEeec Q lcl|NC_019705. 113 AQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-------VVYRYQRDSEYAEFSQKEIFHLKG 185 (424) Q Consensus 113 ~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~-------~~~~~~~~~~~~~~~~~eiih~r~ 185 (424) +.+|++.++.+++++||||++++|+..|.|++||||+|.+|++..+.++ .++++..++....|+++||||+|+ T Consensus 160 ~~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~eiiH~~~ 239 (551) T protein:vir:80 160 FSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAVR 239 (551) T ss_pred HHHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCccccccCceEEEEEeCCcEEEEEcccceEEecc Confidence 8899999999999999999999999999999999999999998876543 233334455567899999999997 Q ss_pred CCC----CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhCC-cccCcc Q lcl|NC_019705. 186 FGF----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAGG-PVKKRL 259 (424) Q Consensus 186 ~~~----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~-~~~~~~~~~~~~~~~~~~~~-~~ag~~ 259 (424) .+. ++.+|+||+.+++.++....++++++.++|+||++|+|+|+++.. .+++++.+++++.|++.++| +|+|++ T Consensus 240 n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~ 319 (551) T protein:vir:80 240 NPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQI 319 (551) T ss_pred cCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcc Confidence 543 357899999999999999999999999999999999999998643 47899999999999987665 689998 Q ss_pred eec-CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC--------cccchhHHHHHHHHHHHHHHHH Q lcl|NC_019705. 260 WIL-EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS--------TSWGSGIEQQNLGFLQYTLQPY 330 (424) Q Consensus 260 ~~l-~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~--------~~~~~n~e~~~~~~~~~tl~P~ 330 (424) +++ ++|++|++++++++|+||+|++++++++||++|||||.+||...++ +.+++|+|++...|+++||.|+ T Consensus 320 ~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~ 399 (551) T protein:vir:80 320 PVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPL 399 (551) T ss_pred ccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHH Confidence 666 6899999999999999999999999999999999999999986653 3467899999999999999999 Q ss_pred HHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCC-CCCCCeeeecccccc Q lcl|NC_019705. 331 ISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPP-LPGGDVAMRQSQYVP 409 (424) Q Consensus 331 ~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p-~~ggd~~~~~~n~~~ 409 (424) +..||++|+++|++..+. . +.|+++.+...+..+++++++ ++.+|+||+||+|+++|+|| +||||+++.|.++.+ T Consensus 400 ~~~ie~~ln~~L~~~~~~-~--~~f~f~~~~~~~~~~~~~~~~-~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~ 475 (551) T protein:vir:80 400 LGFIEDFINKHIVAEFGD-K--YTFQFVGGDIKSELESVKILA-EKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQR 475 (551) T ss_pred HHHHHHHHHhhhccccCC-c--eEEEeeccChhhHHHHHHHHH-HHhcCCcCHHHHHHHhCCCCCCCCCceeeccccccc Confidence 999999999999987653 2 345555666777777777654 66789999999999999998 799999999998887 Q ss_pred hhhccccCCCc--c------------------------cCC Q lcl|NC_019705. 410 ITDLGTNKEPR--N------------------------NGA 424 (424) Q Consensus 410 ~~~~~~~~~~~--~------------------------~ga 424 (424) +......+.+. . +++ T Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 516 (551) T protein:vir:80 476 IGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGK 516 (551) T ss_pred ccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCcc Confidence 65332211100 0 000 No 65 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=100.00 E-value=7.3e-81 Score=459.91 Aligned_cols=373 Identities=13% Similarity=0.105 Sum_probs=300.9 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +|||++++.. .. .+.... .......++.+.|+++++|++||++||++||++||++++++.. T Consensus 1 Mg~f~~~f~~----~~-~~~~~~--------~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~------ 61 (385) T protein:vir:95 1 MGLFDSVFKR----HS-ELSWMY--------DLEFLQDKSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNTK------ 61 (385) T ss_pred Cchhhhhhcc----Cc-cccccc--------chhhhhccchhhhhhhHHHHHHHHHHHHHHcccceeeeecCcc------ Confidence 8899887432 11 111111 1112334677889999999999999999999999999986532 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEEEEEEeCCceE Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ..|++.++|+.+||++||+++||+.++.+++++||||+++.++. +.+..++++.+..+...... .....+....... T Consensus 62 -~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 138 (385) T protein:vir:95 62 -EKGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDEG-HFFVADDFEKEDELGLYSHR-FTNVLVNDFEFKR 138 (385) T ss_pred -ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEecCC-Ceeecccccccccccccccc-ceeeeecccceee Confidence 34899999999999999999999999999999999999987764 44555666666655433221 1111122223446 Q ss_pred EecHhHEEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCC-CCCCHHHHHHHHHHHHHHh Q lcl|NC_019705. 174 EFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE-KVLTEQQRSQVEENFKEIA 251 (424) Q Consensus 174 ~~~~~eiih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~-~~~~~~~~~~~~~~~~~~~ 251 (424) .++++||||+|+++.++ .+|.||+..+..++....++.. +++.++|+++++. ..+++++.+++++.|++.+ T Consensus 139 ~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~-------~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~ 211 (385) T protein:vir:95 139 VFTMDDVIYLKYNNQKLDAFSLGLFEDYGEIFGRMIDLQM-------LNNQIRGILKVDATKFYNKEKQKELQAYIDTLF 211 (385) T ss_pred eeccccEEEecCCCCCcccccchHHHHHHHHHHHHHHHHH-------hcCCCceEEEeCCccCCCHHHHHHHHHHHHHHh Confidence 79999999999998775 7899999999998876655432 2345788898864 4578999999999999887 Q ss_pred CCc--ccCcceecCCCceeeeccc------ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHH Q lcl|NC_019705. 252 GGP--VKKRLWILEAGFSTSAIGV------TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 252 ~~~--~ag~~~~l~~g~~~~~l~~------~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~ 323 (424) ++. +.++++++++|++|+++++ +++|+||+|.+++++++||++|||||.+|++ +++|.|++.++|+ T Consensus 212 ~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~------~~sn~e~~~~~~~ 285 (385) T protein:vir:95 212 DAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLG------EMADLEKTIESYL 285 (385) T ss_pred hhhhhcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcC------CCcCHHHHHHHHH Confidence 663 4556899999999999975 6679999999999999999999999999963 3568999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC--CCCCee Q lcl|NC_019705. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPL--PGGDVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~--~ggd~~ 401 (424) ++||.|++.+||++|+++|+++.++...+++||++++++.|.+++++++.+++++|+||+||+|+++|+||+ ||||++ T Consensus 286 ~~~l~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~gd~~ 365 (385) T protein:vir:95 286 QFCINPLLRKIEAELNSKFFYQDEYLNDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPELDKF 365 (385) T ss_pred HHHHHHHHHHHHHHHHhhcCChhhcccceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 999999999999999999999988877789999999999999999999999999999999999999999998 689999 Q ss_pred eecccccchhhccccCCCcc Q lcl|NC_019705. 402 MRQSQYVPITDLGTNKEPRN 421 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~ 421 (424) ++|+|+++++.....+...+ T Consensus 366 ~~~~n~~~~~~~kgge~~~e 385 (385) T protein:vir:95 366 IITKNLQSADAFKGGESNEE 385 (385) T ss_pred eecccceecccccCCCCCCC Confidence 99999999987533333333 No 66 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=100.00 E-value=7.4e-80 Score=454.42 Aligned_cols=416 Identities=13% Similarity=0.142 Sum_probs=309.9 Q ss_pred CCCCcccccC--------------C--------CCCch--HH-HHHhhccCcccCCccccchhhccccc--cccCccccc Q lcl|NC_019705. 1 MEEPKYTIDL--------------R--------TNNGW--WA-RLQSWFVGGRLVTPNQGSQTGPVSAH--GHLGDSSIN 53 (424) Q Consensus 1 ~~~~~~~~~~--------------~--------~~~G~--~~-~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~vs 53 (424) |-.-|-+-.+ . |-.|+ .+ ...++..++...+. .......... ...-+. . T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~--~~~~~l~~~~~~~~~~~~--~ 88 (535) T protein:vir:10 13 LSNKKSTSYIELGDYDKDIVNKAIRPGRASARDTVDGIDIADGNVAGQYSVASISDV--LSTKKLLKAYADNDIVQA--I 88 (535) T ss_pred hhhhhhhhhHHHhhhhHHHHHhhhhhhhhhhhccccccccccCCcccccccCccccc--cCHHHHHHHhccChhHHH--H Confidence 1111111110 0 00010 00 00000000000000 0000000000 000011 1 Q ss_pred HHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHH----HHHHHHHHHHHc-C Q lcl|NC_019705. 54 DERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQE----FREAMTMQLCFY-G 128 (424) Q Consensus 54 ~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~----f~~~~~~~~ll~-G 128 (424) ...+....++|+|+.++++.++++|+++++.+..+..++....|++.++|+.+||++|++++ |++.++.+++++ | T Consensus 89 i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~g 168 (535) T protein:vir:10 89 IRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQDQ 168 (535) T ss_pred HHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhhCC Confidence 23455678899999999999999999999998888777788889999999999999999875 556667776665 5 Q ss_pred CeEEEEeeCCCCceeEEEEecCceeEEeecC-----ceEEEEEEeCCceEEecHhHEEEeecCCC----CCcccCchHHH Q lcl|NC_019705. 129 NAYALVDRNSAGDVISLLPLQSANMDVKLVG-----KKVVYRYQRDSEYAEFSQKEIFHLKGFGF----TGLVGLSPIAF 199 (424) Q Consensus 129 ~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~-----~~~~~~~~~~~~~~~~~~~eiih~r~~~~----~~~~G~s~i~~ 199 (424) ++|++++|+..|+|++||||+|.+|++..+. ...+|.+..++....|+++||||||+++. ++.+|+||+.+ T Consensus 169 ~ay~~i~r~~~G~~~~L~~l~p~~V~v~~d~~~~~~~~~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~ 248 (535) T protein:vir:10 169 INIERIFKNDSNELDHFNAVDASKVVISYSPRSKDQPRKFEQFVSETKSVKFSERNLTFINYWNLSDTDRRGYGYSPVEA 248 (535) T ss_pred ceEEEEEECCCCcEEEEEEeCCceeEEEEcCccccCceEEEEEecCceeEEECcccEEEEeccCCCCcccccccccHHHH Confidence 7899999999999999999999999988763 34566677777788999999999997653 35689999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC---CCCHHHHHHHHHHHHHHhCC-cccCcceecC-CCceeeecccC Q lcl|NC_019705. 200 ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK---VLTEQQRSQVEENFKEIAGG-PVKKRLWILE-AGFSTSAIGVT 274 (424) Q Consensus 200 ~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~---~~~~~~~~~~~~~~~~~~~~-~~ag~~~~l~-~g~~~~~l~~~ 274 (424) +..++....++++++.++|+||++|+|||+++.. .+++++.+++++.|++.++| +|+|+++++. +|++|++++++ T Consensus 249 ~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~ 328 (535) T protein:vir:10 249 SIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPILAAKDAKFVNMTQN 328 (535) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCcccccccccccCCCceEEecCCC Confidence 9999999999999999999999999999998753 47889999999999987665 6889987775 69999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccc----------chhHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019705. 275 PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSW----------GSGIEQQNLGFLQYTLQPYISRWENSIQRWLIP 344 (424) Q Consensus 275 ~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~----------~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~ 344 (424) ++|+||+|++++++++||++|||||.+||..++++++ .+++|++...|++.||.||+..||++|+++|++ T Consensus 329 ~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~ 408 (535) T protein:vir:10 329 SRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVINDKIMR 408 (535) T ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Confidence 9999999999999999999999999999998887653 346788999999999999999999999999998 Q ss_pred hhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecc---cccchhhccccC--CC Q lcl|NC_019705. 345 AKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQS---QYVPITDLGTNK--EP 419 (424) Q Consensus 345 ~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~---n~~~~~~~~~~~--~~ 419 (424) ..+. .++|+++.+++.|.+++.++++.+. .|+||+||+|+++|+||+||||+++... +++.....++.. .+ T Consensus 409 ~~~~---~~~f~f~~l~~~d~~~r~~~~~~~~-~g~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p~~ 484 (535) T protein:vir:10 409 YVDT---DYRFSFTLGDAQDKLQEEQVWKLKL-ANGYFINEYRKDHGLKTVDGLDVPGFIGSAENFINATGFGQPNVPDS 484 (535) T ss_pred ccCC---eEEEEeccccccCHHHHHHHHHHHH-cCCCCHHHHHHHhCCCCCCCccccccccchhhcccccccccccCCCC Confidence 7654 3567778999999999999887665 5779999999999999999999876543 332221111110 00 Q ss_pred c-ccCC Q lcl|NC_019705. 420 R-NNGA 424 (424) Q Consensus 420 ~-~~ga 424 (424) . +.|+ T Consensus 485 ~~~~~~ 490 (535) T protein:vir:10 485 SDDSGS 490 (535) T ss_pred CCCccc Confidence 0 0011 No 67 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=100.00 E-value=2.3e-80 Score=457.21 Aligned_cols=355 Identities=14% Similarity=0.111 Sum_probs=285.8 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccc-- Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK-- 91 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~-- 91 (424) +|||+++.++.++....... .+.. + .+...+++.++|++||++||++||++|++|++..+++... T Consensus 1 Mg~f~~~~~~~~~~~~~~~~------~~~~--~-----~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQ------RVTA--W-----QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CCccccchhcccccccCCcc------eeee--e-----ccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCccccc Confidence 99999988755433322211 1110 1 1122356788999999999999999999998877665432 Q ss_pred -eeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeC-CCCceeEEEEecCceeEEeecCceEEEEEEeC Q lcl|NC_019705. 92 -KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN-SAGDVISLLPLQSANMDVKLVGKKVVYRYQRD 169 (424) Q Consensus 92 -~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~-~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 169 (424) ....+|++.+||+.+||++||+++||+.++.+++++||||++++++ ..|.++.++|.. T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~l~p~~-------------------- 127 (378) T protein:vir:94 68 LISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLDLLFAD-------------------- 127 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEEEEecC-------------------- Confidence 3456799999999999999999999999999999999999987654 457777666532 Q ss_pred CceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHH----HHHHHH Q lcl|NC_019705. 170 SEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ----RSQVEE 245 (424) Q Consensus 170 ~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~----~~~~~~ 245 (424) ...+|+++||||+|++ .++..|+||++.+..++.. ++.+ +.++|+|+++... ++++ ++++++ T Consensus 128 -~~~~~~~~diiH~~~~-~~~~~g~s~l~~~~~~i~~----------~~~~-~~~~gil~~~~~l-~~~~~~~~~~~~~~ 193 (378) T protein:vir:94 128 -DKKEYKPEELVRLTSP-FYINEDTSILDNALASIQT----------KLEQ-GKLRGLLKINAFL-DIDNTQEYREKALT 193 (378) T ss_pred -CeeEeeeeeeEEecCc-CCccchhHHHHHHHHHHHH----------HHhc-ccccceeeeCCcC-CHHHHHHHHHHHHH Confidence 2346789999999975 5777899999988877643 2334 4689999998654 4443 445555 Q ss_pred HHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHH Q lcl|NC_019705. 246 NFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 246 ~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~ 325 (424) .++...+++++|++++|++|++|++++++++++|+ +.+++++++||++|||||.+|++ +++|++..+|+++ T Consensus 194 ~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVP~~~l~~--------~~se~~~~~f~~~ 264 (378) T protein:vir:94 194 TIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG--------TASQEQQIYFYNS 264 (378) T ss_pred HHHHhhcccccccceecCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC--------ChHHHHHHHHHHH Confidence 66666778889999999999999999999999996 67799999999999999999953 1358899999999 Q ss_pred HHHHHHHHHHHHHHhhccChhhhccc-------chhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019705. 326 TLQPYISRWENSIQRWLIPAKDVGRI-------HAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~~~-------~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg 398 (424) ||.||+.+||++|+++||++.++... .++||++.++++|.+++++++.+++++|+||+||+|+++|+||+||| T Consensus 265 tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gG 344 (378) T protein:vir:94 265 TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGG 344 (378) T ss_pred HHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 99999999999999999998776432 36799999999999999999999999999999999999999999999 Q ss_pred CeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 399 DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |++++|+|++|++.+++++..++++- T Consensus 345 D~~~~~~n~~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:94 345 DVYIANLNAVAVKNLSDLQGSRKDVT 370 (378) T ss_pred CeeeecccccccccchhhcCCcCCCC Confidence 99999999999998876554332221 No 68 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=100.00 E-value=1.5e-79 Score=452.78 Aligned_cols=406 Identities=12% Similarity=0.112 Sum_probs=300.8 Q ss_pred CchHHHHHhhccCcccC--------------------------Cccccchhhc------cc-ccc-ccCccc---c--cH Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLV--------------------------TPNQGSQTGP------VS-AHG-HLGDSS---I--ND 54 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~--------------------------~~~~~~~~~~------~~-~~~-~~~~~~---v--s~ 54 (424) +|||.+++-.+.....- +........+ +. ++. ...... + .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~ 80 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVL 80 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHH Confidence 55555554433211000 0000000111 10 000 001111 0 12 Q ss_pred HHHhccHHHHHHHHHHHHhhccC-------------ceEEEEeccCCccceeccchHHHHHhhcCCCCCC-----CHHHH Q lcl|NC_019705. 55 ERILQISTVWRCVSLISTLTACL-------------PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYM-----TAQEF 116 (424) Q Consensus 55 ~~~~~~~~v~~~i~~ia~~ia~~-------------~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~-----s~~~f 116 (424) +.+..+|+|++||++||+.||++ .++++.+.+....+.....+.+..+|. +||+++ |+.+| T Consensus 81 ~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~-~pn~~~~p~~~s~~~f 159 (547) T protein:vir:63 81 KKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIE-KTGVDNDINRDSFSSF 159 (547) T ss_pred HHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHH-hhCCCCCCccchHHHH Confidence 45667899999999999999974 233332222222222233345666665 788874 78899 Q ss_pred HHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCce-------EEEEEEeCCceEEecHhHEEEeecCCCC Q lcl|NC_019705. 117 REAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-------VVYRYQRDSEYAEFSQKEIFHLKGFGFT 189 (424) Q Consensus 117 ~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~-------~~~~~~~~~~~~~~~~~eiih~r~~~~~ 189 (424) ++.++.+++++||+|++++|+.+|.+++||||+|.+|++..+.+. .++++..+.....|+++||||+|+.+.+ T Consensus 160 ~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~eiih~r~n~~~ 239 (547) T protein:vir:63 160 VKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAVRNPRS 239 (547) T ss_pred HHHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCccccccCceEEEEEcCCcEEEEeccccEEEecccCCC Confidence 999999999999999999999999999999999999998765432 2334444555678999999999976532 Q ss_pred ----CcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhCC-cccCcceec- Q lcl|NC_019705. 190 ----GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAGG-PVKKRLWIL- 262 (424) Q Consensus 190 ----~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~-~~~~~~~~~~~~~~~~~~~~-~~ag~~~~l- 262 (424) +.+|+||+.++..++....++++++.++|+||++|+|+|+++.. .+++++.+++++.|++.++| +|+|+++++ T Consensus 240 ~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl~ 319 (547) T protein:vir:63 240 DIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVS 319 (547) T ss_pred CcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCccccccccccc Confidence 56899999999999999999999999999999999999998643 47899999999999987655 689998666 Q ss_pred CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC--------cccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 263 EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS--------TSWGSGIEQQNLGFLQYTLQPYISRW 334 (424) Q Consensus 263 ~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~--------~~~~~n~e~~~~~~~~~tl~P~~~~i 334 (424) ++|++|++++++++|+||+|++++++++||++|||||.+||...++ +.+++|+|++.+.|+++||.|++..| T Consensus 320 ~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~i 399 (547) T protein:vir:63 320 AEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFI 399 (547) T ss_pred CCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHH Confidence 6889999999999999999999999999999999999999986553 34678999999999999999999999 Q ss_pred HHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCC-CCCCCeeeecccccchhhc Q lcl|NC_019705. 335 ENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPP-LPGGDVAMRQSQYVPITDL 413 (424) Q Consensus 335 e~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p-~~ggd~~~~~~n~~~~~~~ 413 (424) |++||++|++..+. .+ .|+++.+...+..+++++. +++.+|+||+||+|+++|+|| +||||+++.|.+..+++.. T Consensus 400 e~~ln~~L~~~~~~-~~--~~~f~~~~~~~~~~~~~~~-~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~ 475 (547) T protein:vir:63 400 EDFINKHIVAEFGD-KY--TFQFVGGDIKSELESVKIL-AEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQL 475 (547) T ss_pred HHHHHhhcccccCC-ce--EEEeeccccccHHHHHHHH-HHHhCCCcCHHHHHHHhCCCCCCCCCceeeccccccccccc Confidence 99999999976643 23 4444566667777776655 577789999999999999998 6999999999888776532 Q ss_pred cccCCCcc-----------------c---------CC Q lcl|NC_019705. 414 GTNKEPRN-----------------N---------GA 424 (424) Q Consensus 414 ~~~~~~~~-----------------~---------ga 424 (424) ...+.+.+ + ++ T Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (547) T protein:vir:63 476 MQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGK 512 (547) T ss_pred ccccCCccccchhhccccccccCCCCCCCCCCCCCCc Confidence 21111100 0 00 No 69 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=100.00 E-value=3.3e-80 Score=456.35 Aligned_cols=355 Identities=14% Similarity=0.116 Sum_probs=285.0 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccce- Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK- 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~- 92 (424) +|||+++.++.+......... ... .. +...+++.++|++||++||++||++||+|+++.+++.... T Consensus 1 Mg~f~~~~~f~~~~~~~~~~~------~~~---~~----~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:93 1 MNLFGKVVSFSRGKLNNDTQR------VTA---WQ----NEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CccchhhhhhhccccCCCcce------eee---cc----cchhHHHHHHHHHHHHHHHhhhhhCceeeEEEccccccccc Confidence 999999987433322211111 100 01 1223567889999999999999999999998876654332 Q ss_pred --eccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-CceeEEEEecCceeEEeecCceEEEEEEeC Q lcl|NC_019705. 93 --VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-GDVISLLPLQSANMDVKLVGKKVVYRYQRD 169 (424) Q Consensus 93 --~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~-G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 169 (424) ...+|++.+||+.+||++||+++||+.++.+++++||||++++++.. |.+..++|. T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~l~~~--------------------- 126 (378) T protein:vir:93 68 LISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFA--------------------- 126 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEec--------------------- Confidence 34679999999999999999999999999999999999999887643 666665542 Q ss_pred CceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHH----HHHHHH Q lcl|NC_019705. 170 SEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ----RSQVEE 245 (424) Q Consensus 170 ~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~----~~~~~~ 245 (424) +.+.+|+++||||+|++ .++..|.|++..+...+. .++.+| .++|+|+++... ++++ ++++++ T Consensus 127 ~~~~~~~~~diih~r~~-~~~~~~~s~l~~~~~~i~----------~~~~~~-~~~g~l~~~~~l-~~~~~~~~~~~~~~ 193 (378) T protein:vir:93 127 DDKKEYKTEELVRLTSP-FYINEDTSILDNALASIQ----------TKLEQG-KLRGLLKINAFL-DIDNTQEYREKALT 193 (378) T ss_pred CCeeEeccceeEEecCc-cccchhhHHHHHHHHHHH----------HHHhcC-cccceeeeCCcC-CHHHHHHHHHHHHH Confidence 23456889999999964 567789999887776553 345555 589999988654 4443 445555 Q ss_pred HHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHH Q lcl|NC_019705. 246 NFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 246 ~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~ 325 (424) .+++..+++++|++++|++|++|++++.+++++|+ +.+++++++||++|||||.+|++ +++|++..+|+.+ T Consensus 194 ~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~g--------~~~e~~~~~f~~~ 264 (378) T protein:vir:93 194 TIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG--------TATQEQQIYFYNS 264 (378) T ss_pred HHHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC--------CcHHHHHHHHHHH Confidence 66666778889999999999999999999999996 77889999999999999999953 2458999999999 Q ss_pred HHHHHHHHHHHHHHhhccChhhhcc-------cchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019705. 326 TLQPYISRWENSIQRWLIPAKDVGR-------IHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~~-------~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg 398 (424) ||.|++++||++|+++|+++.++.. ..++||++.++++|.+++++++.+++++|++|+||+|+++|+||+||| T Consensus 265 tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~gg 344 (378) T protein:vir:93 265 TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGG 344 (378) T ss_pred HHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 9999999999999999999887643 237899999999999999999999999999999999999999999999 Q ss_pred CeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 399 DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |++++|+|++|++.+++++..+.+.- T Consensus 345 D~~~~~~n~~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:93 345 DVYIANLNAVAVKNLSDLQGSRKDVT 370 (378) T ss_pred CeeeeccccccccchhhhcCccCCCC Confidence 99999999999998876554333222 No 70 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=100.00 E-value=3.5e-79 Score=450.73 Aligned_cols=367 Identities=13% Similarity=0.133 Sum_probs=294.2 Q ss_pred CCCchHHHHHhhccCcccCCccccc----hhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccC Q lcl|NC_019705. 12 TNNGWWARLQSWFVGGRLVTPNQGS----QTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 12 ~~~G~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~ 87 (424) ==+|||++++..-...+...+..+. ....+.......+..|+.+.|+++|+|++||++||++||++|++++++.. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~- 79 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN- 79 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchh- Confidence 1135554432111111111111111 11122334455677899999999999999999999999999999986542 Q ss_pred CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeec--CceEEEE Q lcl|NC_019705. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV--GKKVVYR 165 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~--~~~~~~~ 165 (424) ..|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|+||+|++|++..+ ++...|+ T Consensus 80 -------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~ 146 (392) T protein:vir:39 80 -------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYN 146 (392) T ss_pred -------------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEE Confidence 2366799999999999999999999999999999999999999999999999998875 4556677 Q ss_pred EEeCC----ceEEecHhHEEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHH Q lcl|NC_019705. 166 YQRDS----EYAEFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 166 ~~~~~----~~~~~~~~eiih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~ 240 (424) +...+ ....++++||||+|++++++ ++|+||+.++..++.+..++++++.++|+||++|+|+|+++.+...++. T Consensus 147 ~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~- 225 (392) T protein:vir:39 147 ITFDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDK- 225 (392) T ss_pred EEecCcccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHH- Confidence 66543 34679999999999999887 7999999999999999999999999999999999999999876533322 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHH Q lcl|NC_019705. 241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 241 ~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~ 320 (424) ..+++.+++.+..++|++++|++|++|++++.+++|+||+|.+++++++||++|||||.+||+..+++ +.+++.+ T Consensus 226 -~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~----~~~~~~~ 300 (392) T protein:vir:39 226 -DKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ----SSIQQIS 300 (392) T ss_pred -HHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc----cHHHHHH Confidence 22334445666778999999999999999999999999999999999999999999999999754432 3467789 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh---------- Q lcl|NC_019705. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTD---------- 390 (424) Q Consensus 321 ~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~---------- 390 (424) +|+++||.|+++.||++|+++|++. ++||...+++.|.+.+.+.+.+++++|++|+||+|+++ T Consensus 301 ~f~~~~l~P~~~~ie~~l~~~L~~~-------~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~ 373 (392) T protein:vir:39 301 GMYASALNRYLRPAISELEYKLSDH-------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDL 373 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccc-------ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcccc Confidence 9999999999999999999999865 45888888999999999999999999999999999987 Q ss_pred ----CCCCCCCCCeeeecccccc Q lcl|NC_019705. 391 ----NLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 391 ----g~~p~~ggd~~~~~~n~~~ 409 (424) |+||++|||. .+.+| T Consensus 374 r~~e~l~~~~~Gd~----~~p~p 392 (392) T protein:vir:39 374 PAPENTNKKTTGQS----NEPVP 392 (392) T ss_pred chhcCCCCCCCCCC----CCCCC Confidence 5555555442 11112 No 71 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=100.00 E-value=3.5e-79 Score=450.73 Aligned_cols=367 Identities=13% Similarity=0.133 Sum_probs=294.2 Q ss_pred CCCchHHHHHhhccCcccCCccccc----hhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccC Q lcl|NC_019705. 12 TNNGWWARLQSWFVGGRLVTPNQGS----QTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 12 ~~~G~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~ 87 (424) ==+|||++++..-...+...+..+. ....+.......+..|+.+.|+++|+|++||++||++||++|++++++.. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~- 79 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN- 79 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchh- Confidence 1135554432111111111111111 11122334455677899999999999999999999999999999986542 Q ss_pred CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeec--CceEEEE Q lcl|NC_019705. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV--GKKVVYR 165 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~--~~~~~~~ 165 (424) ..|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|+||+|++|++..+ ++...|+ T Consensus 80 -------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~ 146 (392) T protein:vir:10 80 -------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYN 146 (392) T ss_pred -------------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEE Confidence 2366799999999999999999999999999999999999999999999999998875 4556677 Q ss_pred EEeCC----ceEEecHhHEEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHH Q lcl|NC_019705. 166 YQRDS----EYAEFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 166 ~~~~~----~~~~~~~~eiih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~ 240 (424) +...+ ....++++||||+|++++++ ++|+||+.++..++.+..++++++.++|+||++|+|+|+++.+...++. T Consensus 147 ~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~- 225 (392) T protein:vir:10 147 ITFDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDK- 225 (392) T ss_pred EEecCcccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHH- Confidence 66543 34679999999999999887 7999999999999999999999999999999999999999876533322 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHH Q lcl|NC_019705. 241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 241 ~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~ 320 (424) ..+++.+++.+..++|++++|++|++|++++.+++|+||+|.+++++++||++|||||.+||+..+++ +.+++.+ T Consensus 226 -~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~----~~~~~~~ 300 (392) T protein:vir:10 226 -DKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ----SSIQQIS 300 (392) T ss_pred -HHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc----cHHHHHH Confidence 22334445666778999999999999999999999999999999999999999999999999754432 3467789 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh---------- Q lcl|NC_019705. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTD---------- 390 (424) Q Consensus 321 ~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~---------- 390 (424) +|+++||.|+++.||++|+++|++. ++||...+++.|.+.+.+.+.+++++|++|+||+|+++ T Consensus 301 ~f~~~~l~P~~~~ie~~l~~~L~~~-------~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~ 373 (392) T protein:vir:10 301 GMYASALNRYLRPAISELEYKLSDH-------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDL 373 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccc-------ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcccc Confidence 9999999999999999999999865 45888888999999999999999999999999999987 Q ss_pred ----CCCCCCCCCeeeecccccc Q lcl|NC_019705. 391 ----NLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 391 ----g~~p~~ggd~~~~~~n~~~ 409 (424) |+||++|||. .+.+| T Consensus 374 r~~e~l~~~~~Gd~----~~p~p 392 (392) T protein:vir:10 374 PAPENTNKKTTGQS----NEPVP 392 (392) T ss_pred chhcCCCCCCCCCC----CCCCC Confidence 5555555442 11112 No 72 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=100.00 E-value=8.5e-80 Score=454.08 Aligned_cols=355 Identities=14% Similarity=0.110 Sum_probs=285.5 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccce- Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK- 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~- 92 (424) +|||+++.++.++........ +.. . .....+++.++|++||++||++||++||+++++.+++.... T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~------~~~---~----~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~ 67 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLNNDTQR------VTA---W----QNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CccchhhhhhhcccccCCcce------eee---c----ccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEccccccccc Confidence 999999987544332222111 110 1 11223567889999999999999999999998877654332 Q ss_pred --eccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-CceeEEEEecCceeEEeecCceEEEEEEeC Q lcl|NC_019705. 93 --VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-GDVISLLPLQSANMDVKLVGKKVVYRYQRD 169 (424) Q Consensus 93 --~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~-G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 169 (424) ...+|++.++|+.+||++||+++||+.++.+++++||||++++|+.. |.+..++|.. T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~~l~~~~-------------------- 127 (378) T protein:vir:16 68 LISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFAD-------------------- 127 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEecC-------------------- Confidence 34579999999999999999999999999999999999999988754 5665555432 Q ss_pred CceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH----HHHHHHH Q lcl|NC_019705. 170 SEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ----QRSQVEE 245 (424) Q Consensus 170 ~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~----~~~~~~~ 245 (424) ....|+++||||+|++ .++..|.|++..+...+. .++. ++.++|+|+.+... +++ .++++++ T Consensus 128 -~~~~~~~~diih~r~~-~~~~~~~s~l~~~~~~i~----------~~~~-~~~~~g~l~~~~~l-~~~~~~~~~~~~~~ 193 (378) T protein:vir:16 128 -DKKEYKPEELVRLTSP-FYINEDTSILDNALASIQ----------TKLE-QGKLRGLLKINAFL-DIDNTQEYREKALT 193 (378) T ss_pred -CeeEecccceEEecCc-cCccchhHHHHHHHHHHH----------HHHh-cCccceeeEeCCcC-CHHHHHHHHHHHHH Confidence 2456789999999964 566789999888876653 2333 45689999988654 444 3455566 Q ss_pred HHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHH Q lcl|NC_019705. 246 NFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 246 ~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~ 325 (424) .++...+++++|++++|++|++|++++++++++|+ +.+++++++||++|||||.+|++ +++|++.++|+.+ T Consensus 194 ~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~g--------~~~e~~~~~f~~~ 264 (378) T protein:vir:16 194 TIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG--------TASQEQQIYFYNS 264 (378) T ss_pred HHHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC--------CchHHHHHHHHHH Confidence 66666788899999999999999999999999997 55689999999999999999953 2458999999999 Q ss_pred HHHHHHHHHHHHHHhhccChhhhccc-------chhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019705. 326 TLQPYISRWENSIQRWLIPAKDVGRI-------HAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~~~-------~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg 398 (424) ||.||+++||++|+++|+++.++... .++||++.+++.|.+++++++.+++++|+||+||+|+++|+||+||| T Consensus 265 tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gg 344 (378) T protein:vir:16 265 TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGG 344 (378) T ss_pred HHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 99999999999999999998876432 36899999999999999999999999999999999999999999999 Q ss_pred CeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 399 DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |++++|+|++|++.+++++.+++++- T Consensus 345 D~~~~~~n~~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:16 345 DVYIANLNAVAVKNLSDLQGSRKDVT 370 (378) T ss_pred CeEeeccccccccchhhhcCccCCCC Confidence 99999999999998776554433332 No 73 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=100.00 E-value=5.4e-79 Score=449.69 Aligned_cols=366 Identities=14% Similarity=0.121 Sum_probs=286.1 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +|||++++..... +... .....+..++.+.|+++++|++||++||+++|++||++|++. . T Consensus 1 Mg~f~~l~~~~~~-----~~~~--------~~~~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~-------~ 60 (376) T protein:vir:78 1 MGFFSELFKRNKE-----IEWM--------WDLDFLEDKTTKVYLKKMALNTCVKHIARTIAKSDFRLKNGE-------T 60 (376) T ss_pred CchhhhhhccCCc-----cccc--------cchhhccccchhhhhhhHHHHHHHHHHHHhhcccceeecccc-------c Confidence 8999887543211 1010 111234457788999999999999999999999999998643 2 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEEEEEEeCCceE Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ..+|++.++|+.+||++||+++||+.++.+++++||||+++.|+..|.+.+++|+.+..+............| .... T Consensus 61 ~~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 137 (376) T protein:vir:78 61 SVRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAFFPDVFEGVTVKDY---RYNR 137 (376) T ss_pred cccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccceeeeeeeeeeeecc---eeee Confidence 3468999999999999999999999999999999999999999999999999999988765443222221111 1235 Q ss_pred EecHhHEEEeecCCCCCcccCchH-HHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhC Q lcl|NC_019705. 174 EFSQKEIFHLKGFGFTGLVGLSPI-AFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAG 252 (424) Q Consensus 174 ~~~~~eiih~r~~~~~~~~G~s~i-~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~ 252 (424) .++++||+|+|+.+.++....+++ ..+...+ ......++.+++.+++++......+++++.+++++.|++.++ T Consensus 138 ~~~~~evih~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 211 (376) T protein:vir:78 138 NFSMDDVIFLEYGNERLSAFTDGMFEDYGELF------GKMIRAQMRNFQIRGAVNFKMAGVADKDKQTKLQEYIDKVYA 211 (376) T ss_pred eeccccEEEeccCCCCchhhhhHHHHHHHHHH------HHHHHHHHhcCCCceeEEEccCCCCCHHHHHHHHHHHHHHhc Confidence 699999999997665443222222 2222111 122223333444333333333456788999999999998887 Q ss_pred Cc--ccCcceecCCCceeeecccChhH-----HHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHH Q lcl|NC_019705. 253 GP--VKKRLWILEAGFSTSAIGVTPQD-----AEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 253 ~~--~ag~~~~l~~g~~~~~l~~~~~d-----~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~ 325 (424) +. +.++++++++|++|+++++++.| +||+|.+++++++||++|||||.+||+ +++|+|++.+.|+++ T Consensus 212 g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~------~~s~~e~~~~~f~~~ 285 (376) T protein:vir:78 212 SFNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHG------DMADLSNNMKAYMEY 285 (376) T ss_pred cccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCC------CCCCHHHHHHHHHHH Confidence 74 44568889999999999988865 499999999999999999999999973 345889999999999 Q ss_pred HHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC--Ceeee Q lcl|NC_019705. 326 TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG--DVAMR 403 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg--d~~~~ 403 (424) ||.|++.+||++|+++|+++.+ ++++|+++.+++.|.+++++++.+++++|++|+||+|+++|+||+||| |++++ T Consensus 286 ~l~P~~~~ie~~l~~kll~~~~---~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~~d~~~~ 362 (376) T protein:vir:78 286 CIDPLTKKLEDELNAKLFTFSE---FLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPELDKYLI 362 (376) T ss_pred HHHHHHHHHHHHHHhhhCCccc---ceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeee Confidence 9999999999999999999865 356788899999999999999999999999999999999999999876 99999 Q ss_pred cccccchhhccccCCCcccC Q lcl|NC_019705. 404 QSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 404 ~~n~~~~~~~~~~~~~~~~g 423 (424) |+|++|++..++ +| T Consensus 363 ~~n~~~~~~~~e------~g 376 (376) T protein:vir:78 363 TKNYQSADEGGE------DG 376 (376) T ss_pred ccCceehhcccc------CC Confidence 999999886433 33 No 74 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=100.00 E-value=1.4e-78 Score=447.40 Aligned_cols=379 Identities=15% Similarity=0.106 Sum_probs=288.2 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +||++++++||++........ ....+.....++.+.|+++++|++||++||+++|++||+++++++ T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~------- 66 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQRTLNLT-------DTVWCSIPSEKLKELSIKKWAIDSCANKIANTLSCAEVLTYEKGE------- 66 (395) T ss_pred CchHHHHHhhhcccccccccc-------cchhhccccccchhhhhhhHHHHHHHHHHHHHHhhCceeeccCCc------- Confidence 999999999997654332211 111222334467788999999999999999999999999987542 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEEEEEEeCC--c Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDS--E 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~--~ 171 (424) ...|++.++|+.+||++||+++||+.++.+++++||||+++.++. +++.++..+.........++.+...+ . T Consensus 67 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~------~~~~~~~~~~~~~~~~~~~~~v~~~~~~~ 140 (395) T protein:vir:40 67 EVRKKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEY------IYVADSFTKNDKSLYENTYTEVTLKDLTL 140 (395) T ss_pred cccchHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCc------eeecCCccccccccccceeeeeeecCcee Confidence 234789999999999999999999999999999999999998764 33333332221111111222222222 2 Q ss_pred eEEecHhHEEEeecCCCCCcccCc-hHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019705. 172 YAEFSQKEIFHLKGFGFTGLVGLS-PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 172 ~~~~~~~eiih~r~~~~~~~~G~s-~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) .+.|+++||+|+|+.+..+....+ .+......+... ....++.++.++.++++.+. ..++++.+++++.|++. T Consensus 141 ~~~~~~~evih~r~~~~~~~~~~~~l~~~~~~~~~~~-----~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~~~~~~~~~ 214 (395) T protein:vir:40 141 KKEFKESEVLHLTLNNESIKSIIDGFYLLYGDLLTAA-----VNKYKKLNSRKIIVKLKAMF-GQTPEAEEKLRLMLSER 214 (395) T ss_pred eeeeccccEEEeecCCCCccccchhHHHHHHHHHHHH-----HHHHHhcCCCCceEEEeccc-CCCHHHHHHHHHHHHHH Confidence 356899999999976543322222 223333222221 22233445555555554443 46888899999999987 Q ss_pred hCC--cccCcceecCCCceeeecccChhHHHHHHHHHHHH---HHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHH Q lcl|NC_019705. 251 AGG--PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQV---SELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 251 ~~~--~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~---~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~ 325 (424) +++ .++++++++++|++|++++++++|+||+|.+++.. ++||++|||||.+|++ +++|+|++.+.|+++ T Consensus 215 ~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~~------~~sn~e~~~~~f~~~ 288 (395) T protein:vir:40 215 MKKFLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAKG------DTVGLSEQVNSFLMF 288 (395) T ss_pred HHHhhccCCceeecCCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------CCcCHHHHHHHHHHH Confidence 665 47788999999999999999999999999999875 7999999999999963 345889999999999 Q ss_pred HHHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeee Q lcl|NC_019705. 326 TLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAM 402 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--gd~~~ 402 (424) ||.|++++||++|+++||++.++. +++++||+++++++|.+++++++.+++++|+||+||+|+++|+||+++ ||+++ T Consensus 289 ~L~P~~~~ie~~l~~kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD~~~ 368 (395) T protein:vir:40 289 SINPIAEMFTDEGNRKFYGRDSVLERTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQERF 368 (395) T ss_pred HHHHHHHHHHHHHHHhcCChhhhcCCceEEEechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCceee Confidence 999999999999999999998865 488999999999999999999999999999999999999999999954 99999 Q ss_pred ecccccchhhccccC-CCcccCC Q lcl|NC_019705. 403 RQSQYVPITDLGTNK-EPRNNGA 424 (424) Q Consensus 403 ~~~n~~~~~~~~~~~-~~~~~ga 424 (424) +|+|++|++...+.. ++++++. T Consensus 369 ~~~n~~~~~~~~~~~kgge~~~~ 391 (395) T protein:vir:40 369 VTKNYAPLGENEEDLKGGDINEN 391 (395) T ss_pred eccccccccccccccCCCCCCCC Confidence 999999998665432 2222222 No 75 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=100.00 E-value=2.1e-77 Score=440.95 Aligned_cols=409 Identities=14% Similarity=0.137 Sum_probs=294.2 Q ss_pred CCCCcccccCCCCC-c----hHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhc Q lcl|NC_019705. 1 MEEPKYTIDLRTNN-G----WWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~~~~~~~~~~~~-G----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia 75 (424) +++ |...+..-. | .-.++.+-.-+ .+.. ...|..... .....-.-+.+..+|+|++||++||++|| T Consensus 38 ~~~--~~~~~~~~~~~~~~a~~~p~~~~~~~----~~~~--~~~p~~~~~-~~~~~~~l~~~~~npiv~~~I~~ia~~vA 108 (576) T protein:vir:96 38 IEE--KSKELNKSLYGKQQAYAEPFLEVMDT----NPEF--RTKRSYMKN-SDNLHDVLKQFGNNPILNAIILTRSNQVA 108 (576) T ss_pred hhh--hhhhhccccCCccchhhcceeeeeec----CCCc--cccCcchhh-hhhhHHHHHHhhcCHHHHHHHHHHHHHHH Confidence 332 122221111 0 01111000000 0000 000100000 00001112445678999999999999999 Q ss_pred cC-------------ceEEEEeccCCccceeccchHHHHHhh---cCCCCC-CCHHHHHHHHHHHHHHcCCeEEEEee-- Q lcl|NC_019705. 76 CL-------------PLDVFETDQNDNRKKVDLSNPLARLLR---YSPNQY-MTAQEFREAMTMQLCFYGNAYALVDR-- 136 (424) Q Consensus 76 ~~-------------~~~v~~~~~~~~~~~~~~~~~l~~lL~---~~pn~~-~s~~~f~~~~~~~~ll~G~a~~~~~r-- 136 (424) ++ ++.++..+.....++....+++...|. ..|||+ +|+.+|++.++.+++++||+|+++++ T Consensus 109 ~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~r 188 (576) T protein:vir:96 109 MYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNK 188 (576) T ss_pred hhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEEec Confidence 63 334443333222233333333333332 245555 58999999999999999999999885 Q ss_pred CCCCceeEEEEecCceeEEeecCceEE-------EEEEeCCceEEecHhHEEEee-cCCCC---CcccCchHHHHHHHHH Q lcl|NC_019705. 137 NSAGDVISLLPLQSANMDVKLVGKKVV-------YRYQRDSEYAEFSQKEIFHLK-GFGFT---GLVGLSPIAFACKSAG 205 (424) Q Consensus 137 ~~~G~~~~l~~l~~~~v~~~~~~~~~~-------~~~~~~~~~~~~~~~eiih~r-~~~~~---~~~G~s~i~~~~~~i~ 205 (424) +..|.+++||||+|.+|++..+.+... +.+..+.....++++||||++ +++.+ +.+|+||+.++..++. T Consensus 189 d~~g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~dii~~~~~~~~d~~~~~~G~Spi~~a~~~i~ 268 (576) T protein:vir:96 189 KNATTMDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVINKKVVASFTSREMAMGIRNPRTELSSSGYGLSEVEIAMKQFI 268 (576) T ss_pred CCCCceEEEEEeCCceeEEEECCCCceeeeeeEEEEecCCceEEEecccceEEEeecCCCCcccCcccccHHHHHHHHHH Confidence 456789999999999999988765432 223344556789999998765 45444 6789999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhCC-cccCcc-eecCCCceeeecccChhHHHHHH Q lcl|NC_019705. 206 VAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAGG-PVKKRL-WILEAGFSTSAIGVTPQDAEMMA 282 (424) Q Consensus 206 ~~~~~~~~~~~~~~ng~~~~~vl~~~~~-~~~~~~~~~~~~~~~~~~~~-~~ag~~-~~l~~g~~~~~l~~~~~d~~~~e 282 (424) +..++++++.++|+||++|+|||+.+.+ .+++++.+++++.|++.++| .|+|++ +++++|++|++++++++|+||+| T Consensus 269 ~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d~qfle 348 (576) T protein:vir:96 269 AYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNMTPTANDMQFEK 348 (576) T ss_pred HHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEeccCChhhHHHHH Confidence 9999999999999999999999998764 46899999999999987665 688885 88999999999999999999999 Q ss_pred HHHHHHHHHHHHhCCCHHHhCCCCCCc---------ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccch Q lcl|NC_019705. 283 SRKFQVSELARFFGVPPHLVGDVEKST---------SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHA 353 (424) Q Consensus 283 ~~~~~~~~Ia~~fgVPp~~l~~~~~~~---------~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~ 353 (424) ++++++++||++|||||.+||..+.++ .+++|+|++.+.|+++||.|++..||++|+++|++..+.. +++ T Consensus 349 ~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~-~~~ 427 (576) T protein:vir:96 349 WLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHIISEYSDK-YVF 427 (576) T ss_pred HHHHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhccCc-eEE Confidence 999999999999999999999877543 3678999999999999999999999999999999875432 223 Q ss_pred hhhhhhhhccCHHHHHHHHHHH--HhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 354 EHNLDGLLRGDSASRAAFMKAM--GEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 354 ~fd~~~l~~~d~~~~~~~~~~~--~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) + +++.|.+++.+.++.+ +..|+||+||+|+++|+||+||||+++.|.++.+++............+ T Consensus 428 ~-----f~r~d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~ 495 (576) T protein:vir:96 428 Q-----FVGGDTKSELDKIKILQEEVKTYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQ 495 (576) T ss_pred E-----eccCCHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCCCCccc Confidence 3 4567888888877654 5579999999999999999999999999999887764332111111111 No 76 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=100.00 E-value=3.8e-77 Score=439.53 Aligned_cols=378 Identities=11% Similarity=0.142 Sum_probs=300.3 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +|||+++.....++....+.......+.....+.++..|+.+.++++|+|++||++||++||++|++++++.. T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~~~~~~~------- 73 (386) T protein:vir:49 1 MPIFNITNLATESPPINQESFFDIADSDFLASLNSSEWVSAENALKNSDLFSIISQLSNDLATAKITTSRKQL------- 73 (386) T ss_pred CchhhhhccCCCCcccchhhhhhhhhccccccccCCceechhhhhccHHHHHHHHHHHHHhhhCceeeccchh------- Confidence 8888876433222111111111111222333455678899999999999999999999999999999987542 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecC--ceEEEEEEe--- Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG--KKVVYRYQR--- 168 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~--~~~~~~~~~--- 168 (424) +.|+.+||++||+++||+.++.+++++||||++++|+.+|++++||||+|++|++..++ +...|.|.. T Consensus 74 -------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~~ 146 (386) T protein:vir:49 74 -------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNITFDDP 146 (386) T ss_pred -------hhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceEEEEEEEcCc Confidence 23667999999999999999999999999999999999999999999999999988764 445666653 Q ss_pred -CCceEEecHhHEEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019705. 169 -DSEYAEFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 169 -~~~~~~~~~~eiih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~ 246 (424) ++..+.|+++||||||++++++ ++|+||+.++..++.+..++++++.++|+||+.|+++|+++... ++++.+++++. T Consensus 147 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~-~~~~~~~~~~~ 225 (386) T protein:vir:49 147 HIAPKQHVPQNDILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGG-LLDFKTKVSRS 225 (386) T ss_pred cccceeEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCC-ChHHHHHHHHH Confidence 2456789999999999998876 89999999999999999999999999999999999999998765 45555666666 Q ss_pred HHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHH Q lcl|NC_019705. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~t 326 (424) |+. +..|+|+++++++|++|++++.+++|+||+|.+++++++||++|||||.+||....+++ +. ++.+.|+..+ T Consensus 226 ~~~--~~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~---~~-~~~~~~~~~~ 299 (386) T protein:vir:49 226 RQA--MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQS---SL-EMIYNIYFKS 299 (386) T ss_pred HHH--hccCCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccc---hH-HHHHHHHHHH Confidence 654 34688999999999999999999999999999999999999999999999987544332 33 4557889999 Q ss_pred HHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeec-c Q lcl|NC_019705. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQ-S 405 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~-~ 405 (424) |.|+++.|+++|+++|++. ++||.+.+++.|...+...+.+++++|++|+||+|++++..++...+.+... . T Consensus 300 i~~~l~~i~~~~~~~l~~~-------~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~~~~~~~~ 372 (386) T protein:vir:49 300 VSRYLRPFVSEMSKKLSCE-------VDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKELPDGKNP 372 (386) T ss_pred HHHHHHHHHHHHHHHhcch-------hcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCcCcchhcc Confidence 9999999999999999753 5789999999999999999999999999999999999987665332221111 1 Q ss_pred cccchhhccccCCCcc Q lcl|NC_019705. 406 QYVPITDLGTNKEPRN 421 (424) Q Consensus 406 n~~~~~~~~~~~~~~~ 421 (424) +..+++. ++ ...+| T Consensus 373 ~~~~~~g-Gd-~~~~~ 386 (386) T protein:vir:49 373 NRTSLKG-GE-INEQD 386 (386) T ss_pred CCCCCCC-CC-CCCCC Confidence 1122221 11 11122 No 77 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=100.00 E-value=8.2e-77 Score=437.71 Aligned_cols=415 Identities=13% Similarity=0.148 Sum_probs=302.4 Q ss_pred CCCC--cccccCCCCCchHH------------HHHhh-ccCcccCCccccchhh-cc--ccccccCcc------cccHHH Q lcl|NC_019705. 1 MEEP--KYTIDLRTNNGWWA------------RLQSW-FVGGRLVTPNQGSQTG-PV--SAHGHLGDS------SINDER 56 (424) Q Consensus 1 ~~~~--~~~~~~~~~~G~~~------------~~~~~-~~~~~~~~~~~~~~~~-~~--~~~~~~~~~------~vs~~~ 56 (424) .+.- --++.+ +-|+=- ..... ..+...+.. .+.... .. +.....+.. ...-+. T Consensus 14 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~-~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~ 90 (563) T protein:vir:99 14 YGNNSTIAQVPI--DEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYA-EPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKK 90 (563) T ss_pred cccccccceeec--cCChhhhHhhhhccchhHHHHHhhhccCCCcch-hhhHhhhcccccccccccCCCCcccHHHHHHH Confidence 1110 011222 111111 11111 111111111 111100 00 000000000 011234 Q ss_pred HhccHHHHHHHHHHHHhhcc-------------CceEEEEeccCCccceeccchHHHHHhh-c--CCCCC-CCHHHHHHH Q lcl|NC_019705. 57 ILQISTVWRCVSLISTLTAC-------------LPLDVFETDQNDNRKKVDLSNPLARLLR-Y--SPNQY-MTAQEFREA 119 (424) Q Consensus 57 ~~~~~~v~~~i~~ia~~ia~-------------~~~~v~~~~~~~~~~~~~~~~~l~~lL~-~--~pn~~-~s~~~f~~~ 119 (424) +..+++|.+||+++++.||. +++++++++..+..++....+++..+|. . .|||+ +|+++|++. T Consensus 91 ~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~ 170 (563) T protein:vir:99 91 FGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKK 170 (563) T ss_pred hhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHH Confidence 45578899999999998885 6888887777666666666677766553 1 23333 588999999 Q ss_pred HHHHHHHcCCeEEEEe--eCCCCceeEEEEecCceeEEeecCceEE------EEE-EeCCceEEecHhHEEEe-ecCCCC Q lcl|NC_019705. 120 MTMQLCFYGNAYALVD--RNSAGDVISLLPLQSANMDVKLVGKKVV------YRY-QRDSEYAEFSQKEIFHL-KGFGFT 189 (424) Q Consensus 120 ~~~~~ll~G~a~~~~~--r~~~G~~~~l~~l~~~~v~~~~~~~~~~------~~~-~~~~~~~~~~~~eiih~-r~~~~~ 189 (424) ++.+++++||+|++++ |+..|++++||||+|++|++..+.+... |.+ ..+.....|+++||||+ ++++.+ T Consensus 171 lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d 250 (563) T protein:vir:99 171 IVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTE 250 (563) T ss_pred HHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCC Confidence 9999999999999865 7888999999999999999988765432 233 33445668999997755 566555 Q ss_pred ---CcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhCC-cccCcc-eecC Q lcl|NC_019705. 190 ---GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAGG-PVKKRL-WILE 263 (424) Q Consensus 190 ---~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~-~~~~~~~~~~~~~~~~~~~~-~~ag~~-~~l~ 263 (424) +.+|+||+.++..++.+..++++++.++|+||++|+|+|+++.+ .+++++++++++.|++.++| .|+|++ ++++ T Consensus 251 ~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~ 330 (563) T protein:vir:99 251 LSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMA 330 (563) T ss_pred cccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcC Confidence 68899999999999999999999999999999999999998765 47899999999999987665 688886 7899 Q ss_pred CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcc---------cchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 264 AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS---------WGSGIEQQNLGFLQYTLQPYISRW 334 (424) Q Consensus 264 ~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~---------~~~n~e~~~~~~~~~tl~P~~~~i 334 (424) +|++|++++++++|+||+|++++++++||++|||||.+||..+++++ +++|+|++.+.|+++||.||+..| T Consensus 331 ~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~i 410 (563) T protein:vir:99 331 DDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFI 410 (563) T ss_pred CCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999998766533 567899999999999999999999 Q ss_pred HHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHH--HHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchhh Q lcl|NC_019705. 335 ENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMK--AMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITD 412 (424) Q Consensus 335 e~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~--~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~ 412 (424) |++|+++|++..+.. +.|+ ++++|.+++.+.+. .++++|+||+||+|+++|+||+||||+++.|.++.+++. T Consensus 411 e~~ln~~L~~~~~~~---~~~~---f~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~ 484 (563) T protein:vir:99 411 EDLVNRHIISEYGDK---YTFQ---FVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQ 484 (563) T ss_pred HHHHHhhhchhcccc---cEEE---eccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccc Confidence 999999999875532 2332 46788888888765 468899999999999999999999999999998887654 Q ss_pred ccccCCC---------------------------cccCC Q lcl|NC_019705. 413 LGTNKEP---------------------------RNNGA 424 (424) Q Consensus 413 ~~~~~~~---------------------------~~~ga 424 (424) ....... ..+++ T Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (563) T protein:vir:99 485 LQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSS 523 (563) T ss_pred cccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCC Confidence 3211100 00000 No 78 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=100.00 E-value=8.2e-77 Score=437.71 Aligned_cols=415 Identities=13% Similarity=0.148 Sum_probs=302.4 Q ss_pred CCCC--cccccCCCCCchHH------------HHHhh-ccCcccCCccccchhh-cc--ccccccCcc------cccHHH Q lcl|NC_019705. 1 MEEP--KYTIDLRTNNGWWA------------RLQSW-FVGGRLVTPNQGSQTG-PV--SAHGHLGDS------SINDER 56 (424) Q Consensus 1 ~~~~--~~~~~~~~~~G~~~------------~~~~~-~~~~~~~~~~~~~~~~-~~--~~~~~~~~~------~vs~~~ 56 (424) .+.- --++.+ +-|+=- ..... ..+...+.. .+.... .. +.....+.. ...-+. T Consensus 14 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~-~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~ 90 (563) T protein:vir:95 14 YGNNSTIAQVPI--DEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYA-EPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKK 90 (563) T ss_pred cccccccceeec--cCChhhhHhhhhccchhHHHHHhhhccCCCcch-hhhHhhhcccccccccccCCCCcccHHHHHHH Confidence 1110 011222 111111 11111 111111111 111100 00 000000000 011234 Q ss_pred HhccHHHHHHHHHHHHhhcc-------------CceEEEEeccCCccceeccchHHHHHhh-c--CCCCC-CCHHHHHHH Q lcl|NC_019705. 57 ILQISTVWRCVSLISTLTAC-------------LPLDVFETDQNDNRKKVDLSNPLARLLR-Y--SPNQY-MTAQEFREA 119 (424) Q Consensus 57 ~~~~~~v~~~i~~ia~~ia~-------------~~~~v~~~~~~~~~~~~~~~~~l~~lL~-~--~pn~~-~s~~~f~~~ 119 (424) +..+++|.+||+++++.||. +++++++++..+..++....+++..+|. . .|||+ +|+++|++. T Consensus 91 ~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~ 170 (563) T protein:vir:95 91 FGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKK 170 (563) T ss_pred hhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHH Confidence 45578899999999998885 6888887777666666666677766553 1 23333 588999999 Q ss_pred HHHHHHHcCCeEEEEe--eCCCCceeEEEEecCceeEEeecCceEE------EEE-EeCCceEEecHhHEEEe-ecCCCC Q lcl|NC_019705. 120 MTMQLCFYGNAYALVD--RNSAGDVISLLPLQSANMDVKLVGKKVV------YRY-QRDSEYAEFSQKEIFHL-KGFGFT 189 (424) Q Consensus 120 ~~~~~ll~G~a~~~~~--r~~~G~~~~l~~l~~~~v~~~~~~~~~~------~~~-~~~~~~~~~~~~eiih~-r~~~~~ 189 (424) ++.+++++||+|++++ |+..|++++||||+|++|++..+.+... |.+ ..+.....|+++||||+ ++++.+ T Consensus 171 lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d 250 (563) T protein:vir:95 171 IVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTE 250 (563) T ss_pred HHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCC Confidence 9999999999999865 7888999999999999999988765432 233 33445668999997755 566555 Q ss_pred ---CcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhCC-cccCcc-eecC Q lcl|NC_019705. 190 ---GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAGG-PVKKRL-WILE 263 (424) Q Consensus 190 ---~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~-~~~~~~~~~~~~~~~~~~~~-~~ag~~-~~l~ 263 (424) +.+|+||+.++..++.+..++++++.++|+||++|+|+|+++.+ .+++++++++++.|++.++| .|+|++ ++++ T Consensus 251 ~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~ 330 (563) T protein:vir:95 251 LSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMA 330 (563) T ss_pred cccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcC Confidence 68899999999999999999999999999999999999998765 47899999999999987665 688886 7899 Q ss_pred CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcc---------cchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 264 AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS---------WGSGIEQQNLGFLQYTLQPYISRW 334 (424) Q Consensus 264 ~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~---------~~~n~e~~~~~~~~~tl~P~~~~i 334 (424) +|++|++++++++|+||+|++++++++||++|||||.+||..+++++ +++|+|++.+.|+++||.||+..| T Consensus 331 ~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~i 410 (563) T protein:vir:95 331 DDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFI 410 (563) T ss_pred CCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999998766533 567899999999999999999999 Q ss_pred HHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHH--HHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchhh Q lcl|NC_019705. 335 ENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMK--AMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITD 412 (424) Q Consensus 335 e~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~--~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~ 412 (424) |++|+++|++..+.. +.|+ ++++|.+++.+.+. .++++|+||+||+|+++|+||+||||+++.|.++.+++. T Consensus 411 e~~ln~~L~~~~~~~---~~~~---f~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~ 484 (563) T protein:vir:95 411 EDLVNRHIISEYGDK---YTFQ---FVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQ 484 (563) T ss_pred HHHHHhhhchhcccc---cEEE---eccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccc Confidence 999999999875532 2332 46788888888765 468899999999999999999999999999998887654 Q ss_pred ccccCCC---------------------------cccCC Q lcl|NC_019705. 413 LGTNKEP---------------------------RNNGA 424 (424) Q Consensus 413 ~~~~~~~---------------------------~~~ga 424 (424) ....... ..+++ T Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (563) T protein:vir:95 485 LQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSS 523 (563) T ss_pred cccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCC Confidence 3211100 00000 No 79 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=100.00 E-value=2.8e-77 Score=440.29 Aligned_cols=369 Identities=12% Similarity=0.162 Sum_probs=287.4 Q ss_pred CchHHHHHhhccCcccCCccc-cchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccce Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQ-GSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~ 92 (424) +|||+++... +..+... ...........+.++..++.+.++++|+||+||++||++||++||++++... T Consensus 1 Mg~f~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~------ 70 (382) T protein:vir:48 1 MPIFNLATES----PPDNQGGFFDVVDSDFLASLKGNEWVSAETALRNSDLFSIINQLSNDLATVKLITSRKKL------ 70 (382) T ss_pred CccccccccC----CcccccccccchhhhccccccCCcccchHhhhccHHHHHHHHHHHHhhccCceeeecchh------ Confidence 8888776321 1111111 1111122233455677899999999999999999999999999999986542 Q ss_pred eccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecC--ceEEEEEEeCC Q lcl|NC_019705. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG--KKVVYRYQRDS 170 (424) Q Consensus 93 ~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~--~~~~~~~~~~~ 170 (424) ..|+.+||++||+++|++.++.+++++||||++++|+..|.+++||||+|++|++..+. +...|.+..++ T Consensus 71 --------~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~ 142 (382) T protein:vir:48 71 --------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGIYYNITFDD 142 (382) T ss_pred --------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCCeEEEEEEecC Confidence 24778999999999999999999999999999999999999999999999999988754 45567765543 Q ss_pred ----ceEEecHhHEEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019705. 171 ----EYAEFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 171 ----~~~~~~~~eiih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~ 245 (424) ..+.|+++||||+|++++++ ++|+||+.++..++....++++++.++|+||+.|+++|+++... ++++.+++++ T Consensus 143 ~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~-~~e~~~~~~~ 221 (382) T protein:vir:48 143 PRIPPKQHVPQNDVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGG-LLDFKTKLSR 221 (382) T ss_pred ccccceeEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC-ChHHHHHHHH Confidence 45689999999999998876 89999999999999999999999999999999999999998764 5555566666 Q ss_pred HHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHH Q lcl|NC_019705. 246 NFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 246 ~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~ 325 (424) .+.. +..|+|+++++++|++|++++.+++|+||+|.+++.+++||++|||||.+||...++ .+++++.+.|++. T Consensus 222 ~~~~--~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~----~~~~~~~~~~~~~ 295 (382) T protein:vir:48 222 SRQA--MKQMQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQ----QSSLEMSSDLYSK 295 (382) T ss_pred HHHh--hccCCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc----ccHHHHHHHHHHH Confidence 6654 345789999999999999999999999999999999999999999999999975543 2568889999999 Q ss_pred HHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCC-----CCCCCe Q lcl|NC_019705. 326 TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPP-----LPGGDV 400 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p-----~~ggd~ 400 (424) ||.|+++.|+++|+++|+++.+.. ....+..+.......+..++++|++|+||+|+.++..+ +++|+. T Consensus 296 ~l~p~~~~i~~~l~~~l~~~~~~~-------~~~~~~~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~~ 368 (382) T protein:vir:48 296 AVSRYLRPFLSELSQKLSCDVDAD-------IFPAVDPTGSNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPNGEN 368 (382) T ss_pred HHHHHHHHHHHHHHHHhcChhhhh-------hhhhhccchhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhhc Confidence 999999999999999999876532 11222233344555566778888888888888764322 334443 Q ss_pred eeecccccchhhccccCCCcc Q lcl|NC_019705. 401 AMRQSQYVPITDLGTNKEPRN 421 (424) Q Consensus 401 ~~~~~n~~~~~~~~~~~~~~~ 421 (424) +.. +++ +..+.+++ T Consensus 369 ~~~-----~~~--GGd~~~~~ 382 (382) T protein:vir:48 369 PNS-----TLK--GGEEDGQD 382 (382) T ss_pred CCC-----CCC--CCCCCCCC Confidence 321 121 11111111 No 80 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=100.00 E-value=2.5e-77 Score=440.54 Aligned_cols=380 Identities=13% Similarity=0.081 Sum_probs=291.8 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +|+|+++.. +..... ... .. ......++.+.++++++||+||++||++||++||++++.++.+ T Consensus 1 MGlf~~~~~----~~~~~~-~~~------~~-~~~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~----- 63 (395) T protein:vir:98 1 MGILDFFSF----KKSGTL-SDD------DS-GSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKLT----- 63 (395) T ss_pred CcchhhhcC----CCcccc-ccc------cc-chhhhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCcc----- Confidence 888887631 111110 000 00 1112235677889999999999999999999999999864322 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEEEEEEeCC--c Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDS--E 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~--~ 171 (424) ...|++.++|+.+||++||+++||+.++.+++++||||++++++..+ ++ ++..+.........++.+...+ . T Consensus 64 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~-----~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 137 (395) T protein:vir:98 64 ENQKDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGI-----YV-ADSFTQDKKISGSQFKVSRVQGQTY 137 (395) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCce-----ec-CCcccccccccCcccceeeecCcee Confidence 34689999999999999999999999999999999999999987643 22 2222222221122222222222 2 Q ss_pred eEEecHhHEEEeecCCCCCc-ccCchHHHHHHHHHHH--HHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHH Q lcl|NC_019705. 172 YAEFSQKEIFHLKGFGFTGL-VGLSPIAFACKSAGVA--VAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK 248 (424) Q Consensus 172 ~~~~~~~eiih~r~~~~~~~-~G~s~i~~~~~~i~~~--~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~ 248 (424) ...++++||+|+|+.+.++. ++.+++......+... ........+++.++..+.+++.......++++.+..+++++ T Consensus 138 ~~~~~~~evih~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (395) T protein:vir:98 138 EKTFTFDQVIYLKNDNSDLMSKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQENSDGGRQSKSDKDFFK 217 (395) T ss_pred eeEecCccEEEecCCCCCccccccchhhhHHHHHHHHHHHHHHHHHHHHhhccccccccccccccCCcHHHHHHHHHHHH Confidence 46789999999998876653 3444445544444433 33445567788899998888887777778888888899999 Q ss_pred HHhCCc--ccCcceecCCCceeeeccc------ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHH Q lcl|NC_019705. 249 EIAGGP--VKKRLWILEAGFSTSAIGV------TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 249 ~~~~~~--~ag~~~~l~~g~~~~~l~~------~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~ 320 (424) +..++. +.++++++++|++|+++++ +++++||.+.+++++++||++|||||.+|++ +++|.|++.+ T Consensus 218 ~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~~------~~sn~e~~~~ 291 (395) T protein:vir:98 218 RTVEKIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHG------DIADNQKNYE 291 (395) T ss_pred HHHhhhhcCCcceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------CcccHHHHHH Confidence 887764 4456888999999999985 4678899999999999999999999999963 3558999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--C Q lcl|NC_019705. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--G 398 (424) Q Consensus 321 ~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--g 398 (424) +|+++||.|++++||++|+++|+++.++.. ..+||+++++++|.+++++++.+++++|++|+||+|+++|+||+|| | T Consensus 292 ~f~~~tl~P~~~~ie~~l~~kll~~~~~~~-g~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~~g 370 (395) T protein:vir:98 292 LLLEGPIESLITNIVDGLEYAIFDKSETLQ-GSFIKVTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPDGLG 370 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCChhhhcC-cceeeehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC Confidence 999999999999999999999999887543 2457888999999999999999999999999999999999999976 9 Q ss_pred CeeeecccccchhhccccCCCcccC Q lcl|NC_019705. 399 DVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) |++++++|++|++..+.....+++. T Consensus 371 D~~~~~~n~~~~~~~gge~~~~~~~ 395 (395) T protein:vir:98 371 KVLYMTKNYESVLERGGEVDEEVET 395 (395) T ss_pred ceeeecccceecccccCCCCCCCCC Confidence 9999999999998654443333333 No 81 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=100.00 E-value=2.7e-77 Score=440.40 Aligned_cols=355 Identities=14% Similarity=0.091 Sum_probs=281.5 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccc-- Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK-- 91 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~-- 91 (424) +|+|+++++++.......... ....+....+++.++|++||++||++||++|+++|++.+.+... T Consensus 1 M~if~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQR-------------VTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CchhHHhHhhhhcccccCcce-------------eeeeecchhhhhhHHHHHHHHHHHHhHhhCceeeeeeccccccccc Confidence 999999998875433222110 11112233457788999999999999999999999886654332 Q ss_pred -eeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCCCCceeEEEEecCceeEEeecCceEEEEEEeC Q lcl|NC_019705. 92 -KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRD 169 (424) Q Consensus 92 -~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~-r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 169 (424) +....|++.+||+.+||++||+++||+.++.+++++||||++++ ++..|.+..+++.. T Consensus 68 ~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~~-------------------- 127 (378) T protein:vir:94 68 LISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDLLFAN-------------------- 127 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEEEEec-------------------- Confidence 34567999999999999999999999999999999999999854 55566665544321 Q ss_pred CceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH----HHHHHHH Q lcl|NC_019705. 170 SEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ----QRSQVEE 245 (424) Q Consensus 170 ~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~----~~~~~~~ 245 (424) .+.+|+++||+|+|++...+ .+.+++..+...+. ..+.+ +.++|+|+.+... +++ +++++++ T Consensus 128 -~~~~~~~~dvih~~~~~~~~-~~~~~~~~~~~~~~----------~~~~~-~~~~g~l~~~~~l-~~~~~~~~~e~~~~ 193 (378) T protein:vir:94 128 -DKKEYKPEELVRLTSPFYIN-EDTSILDNALASIQ----------TKLEQ-GKLRGLLKINAFL-DIDNTQEYREKALA 193 (378) T ss_pred -CcEEechhceeeecCcCCcc-cchhHHHHHHHHHH----------HHHhh-CCcccceeeCCcC-CHHHHHHHHHHHHH Confidence 23568899999999654221 24455665554332 23333 4678999988654 443 4556677 Q ss_pred HHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHH Q lcl|NC_019705. 246 NFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 246 ~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~ 325 (424) .++++.++.++|++++|++|++|++++++++++|+ +.+++++++||++|||||.+|++ + .+|++.++|+++ T Consensus 194 ~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgvPp~~l~g----~----~~e~~~~~f~~~ 264 (378) T protein:vir:94 194 TIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG----T----ATQEQQIYFYNS 264 (378) T ss_pred HHHHhhcccccccceeccCCceEEEccCChHHhhH-HHHHHHHHHHHHHhCCCHHHhcC----C----chHHHHHHHHHH Confidence 77777888899999999999999999999999996 77899999999999999999953 1 347899999999 Q ss_pred HHHHHHHHHHHHHHhhccChhhhcc-------cchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Q lcl|NC_019705. 326 TLQPYISRWENSIQRWLIPAKDVGR-------IHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~~-------~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg 398 (424) ||.||+++||++|+++|+++.++.. ..++||++.++++|.+++++++.+++++|+||+||+|+++|+||+||| T Consensus 265 tl~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~~gg 344 (378) T protein:vir:94 265 TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGG 344 (378) T ss_pred HHHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 9999999999999999999877643 236799999999999999999999999999999999999999999999 Q ss_pred CeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 399 DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |++++|+|++|++..++++..++++. T Consensus 345 d~~~~~~n~~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:94 345 DVYIANLNAVAVKNLSDLQGNRKDVT 370 (378) T ss_pred CeeeecccccchhcchhcccccCCCC Confidence 99999999999998887665555433 No 82 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=100.00 E-value=9.8e-77 Score=437.29 Aligned_cols=393 Identities=12% Similarity=0.071 Sum_probs=292.9 Q ss_pred CCcccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccH----HHHhccHHHHHHHHHHHHhhccCc Q lcl|NC_019705. 3 EPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSIND----ERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 3 ~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~----~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) --.|.|.++| .....+..................+...+++. +.+..+++|++||++||++||++| T Consensus 1 ~~~~~~~i~s----------~~~~~~i~~~~~~s~~~~~~~~~~~~~pp~~~~~la~l~~~n~~v~scI~~ia~~IA~l~ 70 (542) T protein:vir:41 1 MFNYHLSIRS----------LEKYKAIKREEVESQALGETRFEEYVEPKVNPLVLLSLLQVNPYHASACSIKANDIIRTG 70 (542) T ss_pred Cccccccccc----------cccchhhhhccccccccccccCCccccCCCCHHHHHHHHhhcHHHHHHHHHHHHHHhhCc Confidence 1122233333 11111100000000000111111111222343 334568999999999999999999 Q ss_pred eEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeec Q lcl|NC_019705. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~ 158 (424) |+++.... ..+++..||++||+++|++.++.+++++||||++++|+..|.+.+|+||+|++|++..+ T Consensus 71 ~~~~~~~~-------------~~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d 137 (542) T protein:vir:41 71 YILEGDDE-------------GVVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKD 137 (542) T ss_pred eeeecccc-------------hhhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEc Confidence 99864321 12344569999999999999999999999999999999999999999999999999887 Q ss_pred CceEEEE-----------EE--------eCCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 159 GKKVVYR-----------YQ--------RDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFF 218 (424) Q Consensus 159 ~~~~~~~-----------~~--------~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~ 218 (424) ++..... |. .+.....++++||||||+++ .++++|+||+..+..++.+..++++++.++| T Consensus 138 ~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir~~~~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f 217 (542) T protein:vir:41 138 GSRYRQTWDGVNITHFKDYRYEGEINPETGEDQDSVGANELVFIHIPSPVCSYYGVPRYVSAAPAILAMQKIDEYNYAFF 217 (542) T ss_pred CCeeEeeecCCcceeEEeecccccccccccccccccCcccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 6543211 11 11123458899999999887 6889999999999999999999999999999 Q ss_pred hcCCCCceeEEcCCC---------CCCHHHHHHHHHHHHHHhCC--cccCcceecC------CCceeeecccChhHHHHH Q lcl|NC_019705. 219 ANGAKSPQILSTGEK---------VLTEQQRSQVEENFKEIAGG--PVKKRLWILE------AGFSTSAIGVTPQDAEMM 281 (424) Q Consensus 219 ~ng~~~~~vl~~~~~---------~~~~~~~~~~~~~~~~~~~~--~~ag~~~~l~------~g~~~~~l~~~~~d~~~~ 281 (424) +||++|+++|+++.. ..++++.+.+++.|++.+.+ .|+|++++|+ +|++|++++++++|++|+ T Consensus 218 ~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfl 297 (542) T protein:vir:41 218 DNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFR 297 (542) T ss_pred hccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEEcCCChhHHHHH Confidence 999999999988643 35778899999999876544 4788899984 799999999999999999 Q ss_pred HHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhh Q lcl|NC_019705. 282 ASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLL 361 (424) Q Consensus 282 e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~ 361 (424) |.+++++++||++|||||.+||..++++++++|+|++.+.|+++||.|+++.||++|+++|+++.++ .++++|+.+.++ T Consensus 298 e~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~-~~~~~f~~~~ll 376 (542) T protein:vir:41 298 EYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQNIISSILTDFFQVKFNP-KTRFKFNDETLL 376 (542) T ss_pred HHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC-ceEEEecchhhc Confidence 9999999999999999999999998888888899999999999999999999999999999888765 467899999998 Q ss_pred ccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC-eeeecccccchhhccccC--CCc------ccCC Q lcl|NC_019705. 362 RGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD-VAMRQSQYVPITDLGTNK--EPR------NNGA 424 (424) Q Consensus 362 ~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd-~~~~~~n~~~~~~~~~~~--~~~------~~ga 424 (424) +.|. .+.+..++++|++|+||+|+.+ +++|+|| .++.|.|.........++ +.. ..-+ T Consensus 377 ~~d~---~~~~~~~v~~GilT~NE~Re~L--~g~~pgdd~~l~p~~~~~~~~~~~~~n~~~~~~~~~~k~~~ 443 (542) T protein:vir:41 377 ESDS---VRNCALLVQSGVLTPAEARERL--FGLDGGPDIFMVPSKGAAKSVKRQERNYEKNQIREIRKIYA 443 (542) T ss_pred chHH---HHHHHHHHhCCCCCHHHHHHhh--CCCCCCCccccccccccccccccCCcCCCCCchhhhhhccc Confidence 8764 4456779999999999999753 3444454 455565554322111100 000 0000 No 83 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=100.00 E-value=3.2e-77 Score=439.95 Aligned_cols=376 Identities=14% Similarity=0.096 Sum_probs=279.0 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +|+|++++....+ ..+.. .... ....++.+.|+++++|++||++||++||++||+++++++. . T Consensus 1 Mgl~d~~~~~~~~---~~~~~--------~~~~-~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~-----~ 63 (395) T protein:vir:96 1 MGILDFFSFKKSG---TLSDD--------DSGS-TTSEKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKL-----T 63 (395) T ss_pred CcchhhhcCCCCc---ccccc--------cccc-chhhhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCcc-----c Confidence 8999876432211 11100 0011 1123567889999999999999999999999999976432 3 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEEEEEEe--CCc Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQR--DSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~--~~~ 171 (424) ..+|++.+||+.+||++||+++||+.++.+++++||||+++.|+..+.+...++ +.....+. .++.+.. ... T Consensus 64 ~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~~~~~~~~-----~~~~~~~~-~~~~v~~~~~~~ 137 (395) T protein:vir:96 64 ENQKDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGIYVADAFT-----QDKKLSGN-KFKVSRVQGQTY 137 (395) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCceecCCccc-----cccccccc-eeeeeeecccee Confidence 356899999999999999999999999999999999999999876433222222 21111111 1222222 222 Q ss_pred eEEecHhHEEEeecCCCCC-cccCchHHHHHHHH------HHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHH Q lcl|NC_019705. 172 YAEFSQKEIFHLKGFGFTG-LVGLSPIAFACKSA------GVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVE 244 (424) Q Consensus 172 ~~~~~~~eiih~r~~~~~~-~~G~s~i~~~~~~i------~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~ 244 (424) ...++++||+|||+.+.+. .++.+++......+ .....+.++..++|.+++.+.+++...... ..+..+ T Consensus 138 ~~~~~~~dvih~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~ 213 (395) T protein:vir:96 138 EKIFTFDQVIYLKNDNSDLMLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQENSDGGR----QPKSDK 213 (395) T ss_pred eeEeccCceEEecccCCccccccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccccceeeccCchh----hHHHHH Confidence 4679999999999876543 33333333332222 223334567889999999999998766543 334455 Q ss_pred HHHHHHhCCc--ccCcceecCCCceeeecccChhHHHHHHHHHHH------HHHHHHHhCCCHHHhCCCCCCcccchhHH Q lcl|NC_019705. 245 ENFKEIAGGP--VKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQ------VSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 245 ~~~~~~~~~~--~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~------~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e 316 (424) ++++++.++. +.++++++++|++|++++.++.|+|++|.+++. .++||++|||||.+|++ +++|+| T Consensus 214 ~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~~------~~sn~e 287 (395) T protein:vir:96 214 DFFKRTIEKIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHG------DIADNQ 287 (395) T ss_pred HHHHHHHHHhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------CCccHH Confidence 5555555443 345688899999999999999999999988776 58999999999999963 345899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC Q lcl|NC_019705. 317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP 396 (424) Q Consensus 317 ~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ 396 (424) ++.++|+++||.||+.+||++|+++|+++.++.. ..+|+++.++++|.+++++++++++++|++|+||+|+++|+||+| T Consensus 288 ~~~~~f~~~~L~P~~~~ie~~l~~~Ll~~~e~~~-~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~ 366 (395) T protein:vir:96 288 KNYELLLEGPIESLITNIVDGLEYAIFDKSETLE-GSFIKVTGLKNYDLFSISSQADKLISSGFVFIDEVREEIGLPELP 366 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhcC-ceeEeecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 9999999999999999999999999999877543 245788999999999999999999999999999999999999997 Q ss_pred C--CCeeeecccccchhhccccCCCcccC Q lcl|NC_019705. 397 G--GDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 397 g--gd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) | ||++++|+|++|++..+...+.+++. T Consensus 367 ~~~gD~~~~~~N~~~~~~~gge~~~~~~~ 395 (395) T protein:vir:96 367 DGLGKVLYMTKNYESVLERGGEVDEEVET 395 (395) T ss_pred CCCCceeeecccceechhccCCCCCCCCC Confidence 6 99999999999998744332222222 No 84 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=100.00 E-value=9e-77 Score=437.49 Aligned_cols=354 Identities=13% Similarity=0.095 Sum_probs=277.1 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccc-- Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK-- 91 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~-- 91 (424) +|||+++.++++......+.. ....++...+++.++|++||++||++||++||+++++++++... T Consensus 1 M~~f~k~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:85 1 MNLFGKVVSFSRGKLNNDTQR-------------VTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CchhhhhhhhhhcccccCCcc-------------eeeeeccchhhhhHHHHHHHHHHHHhHhhCceeEEEEecccccccc Confidence 999999987775433221110 01112334467889999999999999999999999987765433 Q ss_pred -eeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCCCCceeEEEEecCceeEEeecCceEEEEEEeC Q lcl|NC_019705. 92 -KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRD 169 (424) Q Consensus 92 -~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~-r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 169 (424) +...+|++.+||+.+||++||+++||+.++.+++++||||++++ ++..|.+..+++. T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~--------------------- 126 (378) T protein:vir:85 68 LISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLDLLFA--------------------- 126 (378) T ss_pred ccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEEEEec--------------------- Confidence 34567999999999999999999999999999999999999864 4555655443322 Q ss_pred CceEEecHhHEEEeecC-CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHH----H Q lcl|NC_019705. 170 SEYAEFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV----E 244 (424) Q Consensus 170 ~~~~~~~~~eiih~r~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~----~ 244 (424) .....+.++||+|++.+ +.++ +.+.+..+...+ ..++. ++.++|+|+.+.. +++++.+++ + T Consensus 127 ~~~~~~~~~dvih~~~~~~~~~--~~~~~~~a~~~~----------~~~~~-~~~~~g~l~~~~~-l~~~~~~~~~~~~~ 192 (378) T protein:vir:85 127 NDKKEYKPEELVRLVSPFYINE--DTSILDNALASI----------QTKLE-QGKLRGLLKINAF-LDIDNTQEYREKAL 192 (378) T ss_pred CCCEEEcccceEEEecCcCccc--hhhHHHHHHHHH----------HHHHh-cCCcceEEEeCCc-CCHHHHHHHHHHHH Confidence 12445788999999854 3333 333444333322 23344 4578999998865 455554444 4 Q ss_pred HHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHH Q lcl|NC_019705. 245 ENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ 324 (424) Q Consensus 245 ~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~ 324 (424) +.++.+.++.++|++++|++|++|++++++++++++ +.++++.++||++|||||.+|+. +++|++..+|+. T Consensus 193 ~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~~--------s~~e~~~~~f~~ 263 (378) T protein:vir:85 193 ATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIELIKSELLTGYFMNENILLG--------TATQEQQIYFYN 263 (378) T ss_pred HHHHHhhcccccccceecCCCceEEeccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC--------CchHHHHHHHHH Confidence 445566777889999999999999999999999996 77899999999999999999952 245889999999 Q ss_pred HHHHHHHHHHHHHHHhhccChhhhccc-------chhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Q lcl|NC_019705. 325 YTLQPYISRWENSIQRWLIPAKDVGRI-------HAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 325 ~tl~P~~~~ie~~l~~~l~~~~~~~~~-------~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g 397 (424) +||.||+.+||++|+++|+++.++... .+.||.+.++++|.+++++++.+++++|+||+||+|+++|+||+|| T Consensus 264 ~tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~g 343 (378) T protein:vir:85 264 STIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEG 343 (378) T ss_pred HHHHHHHHHHHHHHHhhcCChhhhhhhhhccccceeeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 999999999999999999998776431 3679999999999999999999999999999999999999999999 Q ss_pred CCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 398 GDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ||++++|+|++|++..++++.++++.. T Consensus 344 GD~~~~~~N~~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:85 344 GDIYIANLNAVAVKNLSDLQGSRKDVA 370 (378) T ss_pred CCeEeecccccccccchhhcCccCCCC Confidence 999999999999998877654433322 No 85 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=100.00 E-value=1.3e-76 Score=436.71 Aligned_cols=393 Identities=11% Similarity=0.066 Sum_probs=293.2 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhcc-cccc-ccCcccccHHHHhccHHHHHHHHHHHHhhccCc Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPV-SAHG-HLGDSSINDERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) |- .|.|.++|-. =+.++++. ..........+ .+.. ...... -.+.+..+++|++||++||++||++| T Consensus 1 ~~--~~~~~~~~~~-~~~~~~~~-------~~~~~~~~~~~~~~~~pp~~~~~-La~~~~~n~~v~scI~~ia~~ia~~~ 69 (540) T protein:vir:41 1 MF--NYHLSIKSLE-KYRAIKGD-------TDSQALKEDRFEEYVEPKVHPLV-LLSLLQVNPYHASACSIKANDILRTG 69 (540) T ss_pred CC--CcccChhhcc-chhhhhcc-------ccccccccCCCCccccCCCCHHH-HHHHHHhcHHHHHHHHHHHHHHhcCC Confidence 22 2334444321 12222221 11111111111 1111 111111 12455678999999999999999999 Q ss_pred eEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeec Q lcl|NC_019705. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~ 158 (424) ++++.+.. .+.. ..||++||+.+|++.++.+++++||||++++|+..|.+++|+||+|.+|++..+ T Consensus 70 ~~i~~~~~-----------~~~~---~lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~ 135 (540) T protein:vir:41 70 YLIDGDDG-----------GVEE---LLRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRD 135 (540) T ss_pred ceEecCcc-----------chhh---hccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEc Confidence 99864321 2222 249999999999999999999999999999999999999999999999998876 Q ss_pred CceEE---------EE--E--------EeCCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 159 GKKVV---------YR--Y--------QRDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFF 218 (424) Q Consensus 159 ~~~~~---------~~--~--------~~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~ 218 (424) +...+ |. | ..+.....++++||||+|.++ .++++|+||+.++..++....++++++.++| T Consensus 136 ~~~~~~~~d~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f 215 (540) T protein:vir:41 136 GSRYMQTWDGIHVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPSILAMQKIDEYNYAFF 215 (540) T ss_pred CceeEeeecCceeeeeecccccceeeccccccceeecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 54322 11 0 112234578999999999886 6789999999999999999999999999999 Q ss_pred hcCCCCceeEEcCCCCCCHH---------HHHHHHHHHHHHhCC--cccCcceecC------CCceeeecccChhHHHHH Q lcl|NC_019705. 219 ANGAKSPQILSTGEKVLTEQ---------QRSQVEENFKEIAGG--PVKKRLWILE------AGFSTSAIGVTPQDAEMM 281 (424) Q Consensus 219 ~ng~~~~~vl~~~~~~~~~~---------~~~~~~~~~~~~~~~--~~ag~~~~l~------~g~~~~~l~~~~~d~~~~ 281 (424) +||++|+|+|+++....+++ .++.+++.|+..+.+ .|+|++++|+ +|++|++++++++|+||+ T Consensus 216 ~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfl 295 (540) T protein:vir:41 216 DNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFR 295 (540) T ss_pred hccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcccceeEEecccchhHHHHH Confidence 99999999999886554432 245566666665444 5789999984 799999999999999999 Q ss_pred HHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhh Q lcl|NC_019705. 282 ASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLL 361 (424) Q Consensus 282 e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~ 361 (424) |.+++++++||++|||||.+||..+.++++++|+|++.+.|+++||.|+++.||++|+++|++..+. +++++||.+.++ T Consensus 296 e~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~-~~~i~f~~~~ll 374 (540) T protein:vir:41 296 EYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQEIVSSVLTDFIQLKLDP-GARFVFNEEILM 374 (540) T ss_pred HHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC-ceEEEecchhhc Confidence 9999999999999999999999998888888999999999999999999999999999999876554 578899999999 Q ss_pred ccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC-CCCeeeecccccchhhccccCC---Cccc-----CC Q lcl|NC_019705. 362 RGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP-GGDVAMRQSQYVPITDLGTNKE---PRNN-----GA 424 (424) Q Consensus 362 ~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~-ggd~~~~~~n~~~~~~~~~~~~---~~~~-----ga 424 (424) +.|.+++ +.+++++|++|+||+|+.+ +++| ++|.++.|.|+...+..+..+. ++.+ ++ T Consensus 375 ~~D~~~~---~~~lv~~G~lT~NE~Re~L--~g~e~gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~ 441 (540) T protein:vir:41 375 ESEFVHN---YALLVQCGVLTPSEVREKL--FGLDGGPDMFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYA 441 (540) T ss_pred chHHHHH---HHHHHhCCCCCHHHHHHHh--CcCcCCCcccccccccccccccccccccCCCCccccccccc Confidence 9875544 5678999999999999854 3444 4466777888765443222110 0000 11 No 86 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=100.00 E-value=7.3e-76 Score=432.52 Aligned_cols=371 Identities=17% Similarity=0.148 Sum_probs=294.4 Q ss_pred cHHHHhccHHHHHHHHHHHHhhccCceEEEEeccC-Cc-cceeccchHHHHHhhcCCCCCC--------CHHHHHHHHHH Q lcl|NC_019705. 53 NDERILQISTVWRCVSLISTLTACLPLDVFETDQN-DN-RKKVDLSNPLARLLRYSPNQYM--------TAQEFREAMTM 122 (424) Q Consensus 53 s~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~-~~-~~~~~~~~~l~~lL~~~pn~~~--------s~~~f~~~~~~ 122 (424) =..-+..+|+|++||++||++||++||+++.+... +. .......+....++..+||+.| |+.+||+.++. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 12233447999999999999999999999865422 11 1122223334456777888866 56689999999 Q ss_pred HHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceE---------EEEEE----------------------eCCc Q lcl|NC_019705. 123 QLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV---------VYRYQ----------------------RDSE 171 (424) Q Consensus 123 ~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~---------~~~~~----------------------~~~~ 171 (424) +++++||||++++|+..|.|++|+||+|++|++..+.... ++.+. ..+. T Consensus 81 ~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDPVFVDADDGSTGT 160 (467) T ss_pred HHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccceeeeeeeeccccccc Confidence 9999999999999999999999999999999987765322 11110 1234 Q ss_pred eEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019705. 172 YAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 172 ~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) ...++++||||+|.++ .++++|+||+.++..++....++++++.++|+||++|+|+|+++....++++.+++++.|+.. T Consensus 161 ~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~ 240 (467) T protein:vir:31 161 SVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGAELTEKGREEMRNLIEDN 240 (467) T ss_pred eeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHhh Confidence 5679999999999876 578999999999999999999999999999999999999999877778999999999999876 Q ss_pred hC------------CcccCcceecCCCceeeecc--------cChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcc Q lcl|NC_019705. 251 AG------------GPVKKRLWILEAGFSTSAIG--------VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS 310 (424) Q Consensus 251 ~~------------~~~ag~~~~l~~g~~~~~l~--------~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~ 310 (424) ++ ..+++++++++.|+++++++ .+++|+||+|++++++++||++|||||.+||..+++++ T Consensus 241 ~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~ 320 (467) T protein:vir:31 241 NEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGAF 320 (467) T ss_pred hcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCc Confidence 53 45788899998887666654 36789999999999999999999999999998776654 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhh-cccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHH Q lcl|NC_019705. 311 WGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRT 389 (424) Q Consensus 311 ~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~ 389 (424) ++|+|++.+.|+++||.|+++.||++||.+|++..+. ..++++||++.+++.|.++++++++.++++|++|+||+|++ T Consensus 321 -~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~ 399 (467) T protein:vir:31 321 -STDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDE 399 (467) T ss_pred -ccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 3689999999999999999999999999999987665 45789999999999999999999999999999999999999 Q ss_pred hCCCCCCCCCeee-------ecccccchhhccccCCCc-ccCC Q lcl|NC_019705. 390 DNLPPLPGGDVAM-------RQSQYVPITDLGTNKEPR-NNGA 424 (424) Q Consensus 390 ~g~~p~~ggd~~~-------~~~n~~~~~~~~~~~~~~-~~ga 424 (424) +|+||+++++.+- +.++..|.+..+++.++. ++-+ T Consensus 400 ~Gl~pi~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (467) T protein:vir:31 400 FGFEPFPEEHVYGGETLVAEVTGGSGPGGGIGDQIEQLVEDRA 442 (467) T ss_pred hCCCCCCcccccCCcccccccccccCCCCcccCcCCCCCCCcc Confidence 9999996543221 111222222222221111 1111 No 87 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=100.00 E-value=3.2e-71 Score=407.05 Aligned_cols=413 Identities=15% Similarity=0.152 Sum_probs=302.9 Q ss_pred CCCCcccccCC----CCCchHHHHHhhccCcccCCccccchhhccccccccCcccccH---HHHhc-cHHHHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLR----TNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSIND---ERILQ-ISTVWRCVSLIST 72 (424) Q Consensus 1 ~~~~~~~~~~~----~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~---~~~~~-~~~v~~~i~~ia~ 72 (424) |..-|-+.+-+ ..-|.-... .+ ...+.......+.....+-.-++++ ..... ++++++||+++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~------~~-~~~~~~~~~~~~~~~~~~~~p~~~~~~L~~~~e~~~~~~~~i~~~~~ 73 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADL------AK-SPNSTQIPDHRIQSHNVGVNPPYNPDRLAAFLELNETLATGIRKKSR 73 (651) T ss_pred CCCccceeeeeEEEeecccccccc------cc-cccccccchhhhcccCCCCCCCCCHHHHHHHHhcChHHHHHHHHHhh Confidence 33332111100 000000000 00 0000111111111111111111222 33344 8999999999999 Q ss_pred hhccCceEEEEecc-CCccce---e-------ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCc Q lcl|NC_019705. 73 LTACLPLDVFETDQ-NDNRKK---V-------DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGD 141 (424) Q Consensus 73 ~ia~~~~~v~~~~~-~~~~~~---~-------~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~ 141 (424) .||+++|.+..... ++.... . ...++....+...+|+.+|+.++++.++.|++.+|++|+.++|+..|. T Consensus 74 ~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~g~ 153 (651) T protein:vir:99 74 YEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIEGR 153 (651) T ss_pred hhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCccc Confidence 99999998864322 111100 0 012333344455679999999999999999999999999999999999 Q ss_pred eeEEEEecCceeEEeecCce----------------------------------EE------------------------ Q lcl|NC_019705. 142 VISLLPLQSANMDVKLVGKK----------------------------------VV------------------------ 163 (424) Q Consensus 142 ~~~l~~l~~~~v~~~~~~~~----------------------------------~~------------------------ 163 (424) |+.|+++++..+++..++.. .. T Consensus 154 pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~ 233 (651) T protein:vir:99 154 PVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIR 233 (651) T ss_pred hhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEE Confidence 99999999987765432110 00 Q ss_pred --------------------EEEEeCCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_019705. 164 --------------------YRYQRDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGA 222 (424) Q Consensus 164 --------------------~~~~~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~ 222 (424) |.+...+....++++||||||+++ .++++|+||+..+..++.++.++++++.++|+||+ T Consensus 234 ~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG~ 313 (651) T protein:vir:99 234 YREDEESEREPIFVDRETGDVTTGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRDFFDNDT 313 (651) T ss_pred eccCcceeeeeecccceeeeEEEcCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 000111223457899999999887 58999999999999999999999999999999999 Q ss_pred CCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC-----------CceeeecccCh-hHHHHHHHHHHHHHH Q lcl|NC_019705. 223 KSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-----------GFSTSAIGVTP-QDAEMMASRKFQVSE 290 (424) Q Consensus 223 ~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~-----------g~~~~~l~~~~-~d~~~~e~~~~~~~~ 290 (424) +|+|+|+.+.+.+++++.+++++.|++..+ |+|++++|+. |++|+++++++ +|+||+|++++++++ T Consensus 314 ~p~gil~~~~~~ls~e~~~~lr~~~~~~~~--nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~e 391 (651) T protein:vir:99 314 IPRMVIKVTGGELSEESKRDLRQMLNGLRE--ESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHE 391 (651) T ss_pred CCceEEEecCCCCCHHHHHHHHHHHHHHhc--cCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHHHHH Confidence 999999998777899999999999998765 6789999865 99999999876 599999999999999 Q ss_pred HHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhc---ccchhhhhhhhhccCHHH Q lcl|NC_019705. 291 LARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG---RIHAEHNLDGLLRGDSAS 367 (424) Q Consensus 291 Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~---~~~~~fd~~~l~~~d~~~ 367 (424) ||++|||||.+||..++++ ++|+|++.+.|+++||+|++..||++||++|+++.++. .++++|+.+.+++.|.++ T Consensus 392 Ia~afgVPp~~lG~~~~~~--~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef~~~~llr~D~~~ 469 (651) T protein:vir:99 392 IAKVLEVPPVKIGVTDSAN--RSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALGVTDWTIEYELRGADQPKQEAQL 469 (651) T ss_pred HHHHhCCCHHHhccCCCCC--cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCceEEEEeccchhhhccHHH Confidence 9999999999999887654 56899999999999999999999999999999987653 256788999999999999 Q ss_pred HHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeeeecccccchhhccccCC-------CcccCC Q lcl|NC_019705. 368 RAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAMRQSQYVPITDLGTNKE-------PRNNGA 424 (424) Q Consensus 368 ~~~~~~~~~~~g~~t~NE~R~~~g~~p~~--ggd~~~~~~n~~~~~~~~~~~~-------~~~~ga 424 (424) +++++..++++|+||+||+|+++|+||++ +||..+.+.+...+....+..+ +.++.. T Consensus 470 ~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~gge~~~~~~~~~~~~~ 535 (651) T protein:vir:99 470 AEQRVRAMRLAGVGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAGGGETEAVHEPPEENKI 535 (651) T ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccccccccccCCCCcccccCcccccc Confidence 99999999999999999999999999995 4899888877765543222111 111111 No 88 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=100.00 E-value=1.9e-69 Score=397.36 Aligned_cols=408 Identities=12% Similarity=0.064 Sum_probs=280.5 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCC--------------c-----------------cccchhh--cccccccc Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVT--------------P-----------------NQGSQTG--PVSAHGHL 47 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~--------------~-----------------~~~~~~~--~~~~~~~~ 47 (424) |.... =.+|||.|+...|+++.-.. | ....... .....+.. T Consensus 1 ~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~ 74 (648) T protein:vir:79 1 MARKV------WGRGFWSRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGG 74 (648) T ss_pred Cccch------hcchhhhhhhhhccCccccccccccccccccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcC Confidence 32221 26799999999998432111 0 0000000 00000000 Q ss_pred Cc-----cccc----HHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHH Q lcl|NC_019705. 48 GD-----SSIN----DERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFRE 118 (424) Q Consensus 48 ~~-----~~vs----~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~ 118 (424) ++ ..++ .+.+..+|.|++||++||++||++||.++.++.+. ...++. .++..+||++||+++|++ T Consensus 75 g~~~~~epp~d~~~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~~-----~~~~~~-~~ll~rPn~~~t~~~f~~ 148 (648) T protein:vir:79 75 GGRDFEEPEFDFNEITSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPNA-----VEYIRM-RFTLMAEATQIPTNQLFI 148 (648) T ss_pred CccccccCCcCHHHHHHHHhcChHHHHHHHHHHHHHhhCcceEEecCCcc-----chhhHH-HHHhhccCCCCCHHHHHH Confidence 11 1122 24445699999999999999999999986654321 112233 344569999999999999 Q ss_pred HHHHHHHHcCCeEEEEeeCCCCc---------------eeEEEEecCceeEEeecCc--eEEEEEEeCC--ceEEecHhH Q lcl|NC_019705. 119 AMTMQLCFYGNAYALVDRNSAGD---------------VISLLPLQSANMDVKLVGK--KVVYRYQRDS--EYAEFSQKE 179 (424) Q Consensus 119 ~~~~~~ll~G~a~~~~~r~~~G~---------------~~~l~~l~~~~v~~~~~~~--~~~~~~~~~~--~~~~~~~~e 179 (424) .++.+++++||||++++|+.+|. +.++|||+|.+|++..+.. ...|.|...+ ....|+++| T Consensus 149 ~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~~~Y~y~~~g~~~~~~~~~~d 228 (648) T protein:vir:79 149 EIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMIKGWQQEQEGQDKPQKFKPED 228 (648) T ss_pred HHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcCCCceeeeEEEecCCceeEEecCcc Confidence 99999999999999999998883 4789999999999887654 3445565433 446789999 Q ss_pred EEEeecC-CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCc Q lcl|NC_019705. 180 IFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKR 258 (424) Q Consensus 180 iih~r~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~ 258 (424) |||||+. +.++++|+||+..++.+|.+..+++++..++|.||++|+++|+++.+....+..++.++.+...+.+.+.++ T Consensus 229 IIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~~~~~e~~k~~~e~~~~~~~~~~i~g 308 (648) T protein:vir:79 229 IVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLEQEGFGAEEGEVDLVRGEVENMDVEG 308 (648) T ss_pred EEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCccchHHHHHHHHHHHHhcccccccc Confidence 9999965 578999999999999999999999999999999999999999986544444444445555554443322222 Q ss_pred ceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 259 LWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSI 338 (424) Q Consensus 259 ~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l 338 (424) ..+....+.+.+. .+++|+||++.+++++++||++|||||.+||..++++++ +.+++ ..++..++.|++..++..+ T Consensus 309 g~v~~~~~~i~~~-~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~s--tae~~-~~~~~~~i~~l~~~i~~~l 384 (648) T protein:vir:79 309 GMVTTERVNISSI-ASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTASRS--TGDNL-SSDFKDRIKALQKVMATFI 384 (648) T ss_pred cccccceeecccc-CCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCccch--HHHHH-HHHHHHHHHHHHHHHHHHH Confidence 2222222222222 256899999999999999999999999999987655443 55544 4456777888777665555 Q ss_pred Hhhc----cChhhh-----cccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCe-eeeccccc Q lcl|NC_019705. 339 QRWL----IPAKDV-----GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDV-AMRQSQYV 408 (424) Q Consensus 339 ~~~l----~~~~~~-----~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~-~~~~~n~~ 408 (424) +.++ +.+... ..++++|+++++++.|.+.+++.+.+++++||||+||+|+++|+||+|+|+. .++..+.. T Consensus 385 e~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~~~ 464 (648) T protein:vir:79 385 NEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKLENQAVFLYEHNAISEDEMRELIGRDPVDDGEGRAKMHLQMV 464 (648) T ss_pred HHHHHHHHhhhhhccccccccceEEEeecccchhhHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCccccccccc Confidence 4433 222211 1245789999999999999999999999999999999999999999998753 34555555 Q ss_pred chhhcccc----CCCc-----ccCC Q lcl|NC_019705. 409 PITDLGTN----KEPR-----NNGA 424 (424) Q Consensus 409 ~~~~~~~~----~~~~-----~~ga 424 (424) +....... ..+. +.++ T Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~a~~ 489 (648) T protein:vir:79 465 TIAQATALAALAPTPAGGSSASASG 489 (648) T ss_pred cchhccccccCCCCCCCCCCCCccc Confidence 43322111 0010 0000 No 89 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=100.00 E-value=8.5e-63 Score=360.87 Aligned_cols=273 Identities=20% Similarity=0.288 Sum_probs=243.9 Q ss_pred hccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCcee Q lcl|NC_019705. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) ||++||++|++++. .+|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|.+++||||+|++| T Consensus 1 ia~l~~~~~~~~~~-------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v 73 (278) T protein:vir:78 1 MASLPLKMYEDYKV-------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVV 73 (278) T ss_pred CccceeEEEecCcc-------cccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCcee Confidence 99999999986643 25899999999999999999999999999999999999999999999999999999999 Q ss_pred EEeecC--ceEEEEEEe-CCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEE Q lcl|NC_019705. 154 DVKLVG--KKVVYRYQR-DSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) Q Consensus 154 ~~~~~~--~~~~~~~~~-~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~ 229 (424) ++..+. +..+|.+.. ++....|+++||+|+|+++ .++++|+||+..+..++....++++++...+.++ |+++++ T Consensus 74 ~v~~~~~~~~~~y~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~~~--~~~i~~ 151 (278) T protein:vir:78 74 EMLIENQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKP--DSFMLK 151 (278) T ss_pred EEEEcCCCceEEEEEEcCCceEEEEccccEEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHhcCC--CcEEEE Confidence 998765 345555543 4466889999999999875 6789999999999999999999999876665554 688888 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCc Q lcl|NC_019705. 230 TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST 309 (424) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~ 309 (424) .+.. .++++.+++++.|++..+ ++|+++++++|+++++++++++|++|+|.+++++++||++|||||.+||..++++ T Consensus 152 ~~~~-l~~e~~~~~~~~~~~~~~--~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~ 228 (278) T protein:vir:78 152 YGSN-VGKEKRQQVLEDFKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTN 228 (278) T ss_pred eCCC-CCHHHHHHHHHHHHHHhc--cCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC Confidence 7754 688889999999998764 6789999999999999999999999999999999999999999999999887654 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhc-ccchhhhhhhh Q lcl|NC_019705. 310 SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGL 360 (424) Q Consensus 310 ~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~-~~~~~fd~~~l 360 (424) ++|++++.++|++.||+|+++.||++||++|+++.++. +++++||+++| T Consensus 229 --~sn~~~~~~~~~~~~l~P~~~~i~~~ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 229 --FAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred --cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhcCCceEEEecccC Confidence 56899999999999999999999999999999998864 58999999999 No 90 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=100.00 E-value=1.9e-60 Score=347.94 Aligned_cols=347 Identities=14% Similarity=0.160 Sum_probs=258.1 Q ss_pred ccCCCCCchHHHHHhhccCcccCC-cc-ccchhhccccccccCccccc-------------HHHHhccHHHHHHHHHHHH Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVT-PN-QGSQTGPVSAHGHLGDSSIN-------------DERILQISTVWRCVSLIST 72 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~vs-------------~~~~~~~~~v~~~i~~ia~ 72 (424) |.=|..+..-+.-.+......... +. ..........++....++|. ...++..|+-+.|+..+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~~~~ 80 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPTEHHTDRAAQAEVFSFGDPVEVLDRRELLDYVECMRMGQWYEPPMPWDGLARSFR 80 (368) T ss_pred CCccccccchhccCcccccccccCcchhhccccCceEEEEcCCceeecchhhHHHHHHHHhccchhccCcCHHHHHHHHh Confidence 444433333222111111110000 00 00001111111110111111 1112333333444433322 Q ss_pred hhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCce Q lcl|NC_019705. 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSAN 152 (424) Q Consensus 73 ~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~ 152 (424) .-+ ... ..+...|++..++ .+||++||+++|++ ++.+++++||||++++|+..|++++|+||++.+ T Consensus 81 ~~~-----------~h~-~~~~~~~n~l~l~-~~Pn~~~t~~~f~~-l~~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~ 146 (368) T protein:vir:79 81 AAA-----------HHS-SAVYVKRNILVST-FIPHPLLSRATFER-LVLDWQVFGNAYLERRENVLGGTIRLDTPLAKY 146 (368) T ss_pred hcc-----------ccc-hhhhhhcchhhhh-cCCCcCCCHHHHHH-HHHHHhhcCCeEEEEEEcCCCCEEEEEEeCccc Confidence 221 111 1223346666554 59999999999975 678999999999999999999999999999999 Q ss_pred eEEeecCceEEEEEEeCCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcC Q lcl|NC_019705. 153 MDVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG 231 (424) Q Consensus 153 v~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~ 231 (424) |++..+++..+ .+..++..+.|+++||+|+|.++ .++++|+||+..++.++.+..+++.+..++|+||++|+++|+.+ T Consensus 147 v~~~~~~~~~~-~~~~~~~~~~~~~~dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~ 225 (368) T protein:vir:79 147 VRRGLDLNTYF-FVQNWQQPYTFAAGSVFHLQEPDINQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMT 225 (368) T ss_pred ceeeccCCEEE-EEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC Confidence 99888776544 44566777899999999999887 57899999999999999999999999999999999999999988 Q ss_pred CCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC Q lcl|NC_019705. 232 EKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~ 306 (424) ...+++++.+++++.|++..|..|+|+++++ ++|++|++++.+++|+||+|.+++++++||++|||||.+||..+ T Consensus 226 ~~~l~~e~~~~lk~~~~~~~G~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~ 305 (368) T protein:vir:79 226 DAAQKQEDVDTLREAMKSAKGPGNFRNLFMYAPNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIP 305 (368) T ss_pred CCCCCHHHHHHHHHHHHHhcCCcccCceeEecCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHccccC Confidence 7778999999999999998888999999998 67999999999999999999999999999999999999999988 Q ss_pred CCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHH Q lcl|NC_019705. 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMG 376 (424) Q Consensus 307 ~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~ 376 (424) +++.+++|+|++.+.|+++||.|+++.|| +++.+|..+ +++|+...|++.|.+.++.....-. T Consensus 306 ~~t~~~sn~e~~~~~f~~~~l~Pl~~~ie-~ln~~l~~e------~~rF~~~~l~~~D~~a~a~~~~rsa 368 (368) T protein:vir:79 306 NNTGGFGDVEKAAMVFARNEVKPLQDRLL-AINDWIGDE------VVRFAPYALGGHDQPAAAPGGQRSA 368 (368) T ss_pred CCCCccccHHHHHHHHHHHHHHHHHHHHH-HHHhccCcc------eeeechhHhhcccccccCCcccccC Confidence 88888899999999999999999999998 688777432 4789999999999888876322211 No 91 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=100.00 E-value=4.1e-58 Score=335.21 Aligned_cols=329 Identities=16% Similarity=0.210 Sum_probs=251.0 Q ss_pred CCCCcccccCCCCCchHHHHH---------hh-ccCcccCCcccc--chhhccccccccCcccccHH----HHhccHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQ---------SW-FVGGRLVTPNQG--SQTGPVSAHGHLGDSSINDE----RILQISTVW 64 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~---------~~-~~~~~~~~~~~~--~~~~~~~~~~~~~~~~vs~~----~~~~~~~v~ 64 (424) |...|++-. ++..---..-. .+ |+.+..+-.... ...+-+. .+.+.-.+|+.. ..-.++... T Consensus 26 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~f~fg~p~~v~~~~~~~~~~~~~~-~~~~~~pp~~~~~La~~~~~~~~h~ 103 (376) T protein:vir:10 26 MSKRRSRAP-RTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWS-NGEWFEPPVSFAGLAKSFRASTHHS 103 (376) T ss_pred chhccCCCc-ccchhhhhHhhhccCcceeEEEEcCCceeccCcchhhhhhhhhh-cCceecCCCCHHHHHHHHhhhHHhh Confidence 666666521 11111111100 00 111111100000 0000000 001111123332 222244455 Q ss_pred HHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeE Q lcl|NC_019705. 65 RCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVIS 144 (424) Q Consensus 65 ~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~ 144 (424) +||...++.+++ ..+||++||+.+|++. +.+++++||||++++|+..|.+++ T Consensus 104 s~l~~k~n~l~~---------------------------~~~Pnp~lT~~~f~~~-v~d~ll~Gnay~~~~rn~~G~~~~ 155 (376) T protein:vir:10 104 SALFFKANVLAS---------------------------TFRPHRWLSRHAFERW-ALDFLTFGNGYLERRRNMVGGTLR 155 (376) T ss_pred hhHHHHhHHHHh---------------------------ccCCCCCCCHHHHHHH-HHHHHhcCCeEEEEEECCCCCEEE Confidence 555554443322 2479999999999855 568999999999999999999999 Q ss_pred EEEecCceeEEeecCceEEEEEEeCCceEEecHhHEEEeecCCC-CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_019705. 145 LLPLQSANMDVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAK 223 (424) Q Consensus 145 l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~~-~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~ 223 (424) |+||+|.+|++..+++..+| +..++....|+++||||||.+++ ++++|+||+.+++.++.+..++++|+.++|+||++ T Consensus 156 L~pl~~~~vr~~~d~~~~~~-~~~~~~~~~~~~~eViHir~~~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa~ 234 (376) T protein:vir:10 156 LEPALAKYVRRKADFNGFVY-VNGWQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSH 234 (376) T ss_pred EEEeCCcceEEEeeCCeEEE-EEcCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 99999999999988776544 44566678899999999999875 68999999999999999999999999999999999 Q ss_pred CceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019705. 224 SPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVP 298 (424) Q Consensus 224 ~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP 298 (424) |++||+.+...+++++.+.+++.|++..|..|+++++++ ++|+++++++.+++|+||+|.+++++++||++|||| T Consensus 235 pggIl~~~d~~l~~e~~~~lr~~~~~~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VP 314 (376) T protein:vir:10 235 AGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVP 314 (376) T ss_pred CceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCC Confidence 999999877778999999999999998888899998888 578999999999999999999999999999999999 Q ss_pred HHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHH Q lcl|NC_019705. 299 PHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSAS 367 (424) Q Consensus 299 p~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~ 367 (424) |.++|..++++.+++|+|++.+.|+++||.|+++.|| +++.+|..+ .++|+...|+++|.+. T Consensus 315 p~llGi~~~~t~~~sn~eq~~~~f~~~~L~Pl~~~ie-eln~~L~~~------~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 315 PQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFA-ELNDWLGEE------VVRFDDYEIPPAPVAA 376 (376) T ss_pred HHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHH-HHHhhcccc------ccccChhHhhcccccC Confidence 9999999888888899999999999999999999998 588777332 4789999999999887 No 92 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=100.00 E-value=2.3e-58 Score=336.53 Aligned_cols=323 Identities=16% Similarity=0.224 Sum_probs=245.8 Q ss_pred HhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHH----------------HHHHhhccCceEEEEe Q lcl|NC_019705. 21 QSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVS----------------LISTLTACLPLDVFET 84 (424) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~----------------~ia~~ia~~~~~v~~~ 84 (424) |+...++................++....++| +...++..|+. -+|+.+-..+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~-----~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~~~~~h---- 71 (346) T protein:vir:10 1 MKKQLRKNLTQNDRLQPQAQTEIFSFGDPIPV-----LDRADILNYLECSAMYEKWYNPPMSFDGLAKSLRSSTHH---- 71 (346) T ss_pred CCcccCCCCCcccccccccCeEEEecCCccee-----cCchhHHHHHHHhhcCCceEecCCCHHHHHHHHHhhhhc---- Confidence 22221111111000000001111111111111 11111111111 122222222211 Q ss_pred ccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEEE Q lcl|NC_019705. 85 DQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVY 164 (424) Q Consensus 85 ~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~ 164 (424) ...-....|.+..++. +||++||+.+|++ ++.+++++||||++++|+..|++++|+|++|.+|++..+++...| T Consensus 72 ----~~~i~~k~n~l~~l~~-~Pn~~~t~~~f~~-~~~d~ll~Gnay~~i~r~~~G~~~~L~pl~~~~v~~~~~~~~~~~ 145 (346) T protein:vir:10 72 ----ESAIITKANILLSTCE-VDSRYLSRRDLSS-FVKDYLVFGNAYFEVVRNRLGQVQRIESPLAKYVRKGLEAGQFYY 145 (346) T ss_pred ----chhhhhhhhhHHHHHh-CCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEecCCceEEEEcCCeEEE Confidence 0011123466777665 8999999999987 567899999999999999999999999999999999887766554 Q ss_pred E-EEeCCceEEecHhHEEEeecCCC-CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHH Q lcl|NC_019705. 165 R-YQRDSEYAEFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQ 242 (424) Q Consensus 165 ~-~~~~~~~~~~~~~eiih~r~~~~-~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~ 242 (424) . +..++....|+++||||+|.+++ ++++|+||+..+..++.+..+++++..++|+||++|++||+.+...+++++.++ T Consensus 146 ~~~~~~g~~~~~~~~dIih~r~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~~~ 225 (346) T protein:vir:10 146 VPQRFDHQEHEFAKGSIYHLLEPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDVEN 225 (346) T ss_pred EEEccCCeEEEEecccEEEecCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHH Confidence 4 45677889999999999999886 689999999999999999999999999999999999999998777789999999 Q ss_pred HHHHHHHHhCCcccCcceecCC-----CceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHH Q lcl|NC_019705. 243 VEENFKEIAGGPVKKRLWILEA-----GFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ 317 (424) Q Consensus 243 ~~~~~~~~~~~~~ag~~~~l~~-----g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~ 317 (424) +++.|++..|+.|+++++++.+ |+++++++.+++|+||+|.+++++++||++|||||.+||..++++.+++|+|+ T Consensus 226 i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~ 305 (346) T protein:vir:10 226 IRQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVAD 305 (346) T ss_pred HHHHHHHhcCccccCceeEecCCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHH Confidence 9999999998889999998854 78999999999999999999999999999999999999999888888899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCH Q lcl|NC_019705. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDS 365 (424) Q Consensus 318 ~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~ 365 (424) +.+.|++++|.|+++.||+ ++.+|..+ .++|+..+|++.|. T Consensus 306 ~~~~f~~~~l~P~~~~iee-~n~~L~~e------~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 306 AAEVFFITEIEPLQERLKE-FNQWLGQE------VIKFKPSKLLQRTQ 346 (346) T ss_pred HHHHHHHHHHHHHHHHHHH-HHhhcccc------eeeechhhhcccCC Confidence 9999999999999999985 77677432 47899999999987 No 93 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=100.00 E-value=2.5e-58 Score=336.35 Aligned_cols=332 Identities=15% Similarity=0.139 Sum_probs=245.3 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccC--- Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACL--- 77 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~--- 77 (424) |.+....+.=.+... ....++. ++. ++-.+....++.|++++.+..... T Consensus 1 ~~~~~~~~~~~~~~~------------------------~~~~~~~-~~~---p~~~~~~~~~~~~~~~~~~~~~~~~ep 52 (348) T protein:vir:26 1 MTEQLIHSHTTDGTE------------------------SKSVYSF-DPN---PEPVDTNSWMTRYCELFYNDFDDYWEP 52 (348) T ss_pred CCccccchhhccccC------------------------CceEEEe-cCC---CeeecCcchHHHHHHHHhcCCCccccC Confidence 333322211111100 0000000 000 111233445555555554443321 Q ss_pred ceEE------EEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCc Q lcl|NC_019705. 78 PLDV------FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSA 151 (424) Q Consensus 78 ~~~v------~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~ 151 (424) |+.. ++.+.--...-....+-+. -..+||++||+.+|++. +.+++++||||++++|+..|++++|+||++. T Consensus 53 p~~~~~La~l~~~n~~h~~~i~~k~N~l~--~~~~Pn~~~t~~~f~~~-~~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~ 129 (348) T protein:vir:26 53 PISLKGLAEIANANGYHGSLLKARANYVA--GRFMNGGGLPMYKMNSA-CWDYFGLGMSAFVKIRSYLKNVIALEPLPMV 129 (348) T ss_pred CCCHHHHHHHHhhhhhhhhhHhhhhhHHh--hcccCCCCCCHHHHHHH-HHHHHhcCCeEEEEEEcCCCcEEEEEEecCc Confidence 2211 1100000000000001111 12379999999999765 5699999999999999999999999999999 Q ss_pred eeEEeecCceEEEEEEeCCceEEecHhHEEEeecCCC-CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEc Q lcl|NC_019705. 152 NMDVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILST 230 (424) Q Consensus 152 ~v~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~~-~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~ 230 (424) +|++..++. +|.+..++..+.|+++||+|||.+++ ++++|+||+..+++++.+..+++.+..++|+||++|++||+. T Consensus 130 ~v~~~~d~~--~~~~~~~g~~~~f~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~ 207 (348) T protein:vir:26 130 HMRKRKNGD--FVQLLRNNEQKVFKAKDVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYA 207 (348) T ss_pred eeEeeecCc--EEEEEecCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEe Confidence 999987765 45666778888999999999999885 689999999999999999999999999999999999999988 Q ss_pred CCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC Q lcl|NC_019705. 231 GEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDV 305 (424) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~ 305 (424) +....++++++++++.|++..|+.|.++++++ ++|+++++++.+++|+||+|.+++++++||++|||||.++|.. T Consensus 208 ~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~ 287 (348) T protein:vir:26 208 TDPNLSEADEKALKEKIASSKGIGNFRSMFVNIPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGML 287 (348) T ss_pred cCCCCCHHHHHHHHHHHHHhcCcccccceeEEcCCCCccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHcccc Confidence 77778999999999999998888899999988 7899999999999999999999999999999999999999998 Q ss_pred CCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhh-hccCHHHHHHH Q lcl|NC_019705. 306 EKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGL-LRGDSASRAAF 371 (424) Q Consensus 306 ~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l-~~~d~~~~~~~ 371 (424) .+++.+++|+|++.+.|+++||.|+++.||++||++|..+.+ .+++||++.. .+. ++.+. T Consensus 288 ~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~---~~~~fdl~~~~e~~---~~~a~ 348 (348) T protein:vir:26 288 PQQGANVPDPLKVSQVYDFYEVIPVCKRFMDAVNNDPEIPDN---LKLKFNLNPGVESA---NGSAV 348 (348) T ss_pred CCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhhCCCCc---cEEEEecCcccccc---hhhcC Confidence 877778889999999999999999999999999999875544 3456666532 222 22222 No 94 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=100.00 E-value=1.8e-57 Score=331.66 Aligned_cols=329 Identities=15% Similarity=0.198 Sum_probs=247.8 Q ss_pred CCCCcccccCCCCCchHHH---------HHhh-ccCcccCCccccc--hhhccccccccCcccccHHH----HhccHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWAR---------LQSW-FVGGRLVTPNQGS--QTGPVSAHGHLGDSSINDER----ILQISTVW 64 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~---------~~~~-~~~~~~~~~~~~~--~~~~~~~~~~~~~~~vs~~~----~~~~~~v~ 64 (424) |...|++=. ++...-=+. ...+ |+.+.++-..... ..+-+ ..+.+.--+|+... +-.++... T Consensus 1 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~-~~~~~~~pp~~~~~la~~~~~~~~h~ 78 (351) T protein:vir:79 1 MSKRRSRAP-RTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECW-SNGEWFEPPVSFAGLAKSFRASTHHS 78 (351) T ss_pred CCCCCCCCC-CCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhh-hcCceecCCCCHHHHHHHHhhhHhhh Confidence 777666511 111110000 0000 1111111000000 00000 00001111223221 11233333 Q ss_pred HHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeE Q lcl|NC_019705. 65 RCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVIS 144 (424) Q Consensus 65 ~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~ 144 (424) +||...++.+++ ..+||++||..+|++ ++.+++++||||++++|+..|.+++ T Consensus 79 ~~l~~k~n~l~~---------------------------~~~Pnp~~t~~~f~~-~v~d~ll~Gnay~~~~r~~~G~~~~ 130 (351) T protein:vir:79 79 SALFFKANVLAS---------------------------TFRPHRWLSRHAFER-WALDFLTFGNGYLERRRNMVGGTLR 130 (351) T ss_pred hhhhhhhhHHhh---------------------------cccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEE Confidence 444332222211 247999999999964 6689999999999999999999999 Q ss_pred EEEecCceeEEeecCceEEEEEEeCCceEEecHhHEEEeecCCC-CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_019705. 145 LLPLQSANMDVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAK 223 (424) Q Consensus 145 l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~~-~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~ 223 (424) |+||+|.+|++..+++..+ .+..++....|+++||||+|.+++ ++++|+||+.+++.++.+..+++.+..++|+||++ T Consensus 131 L~~l~~~~v~~~~~~~~~~-~~~~~g~~~~~~~~eIihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~ 209 (351) T protein:vir:79 131 LEPALAKYVRRKADFSGFV-YVNGWQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSH 209 (351) T ss_pred EEEeCCcceeeeecCCeEE-EEecCceEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 9999999999988877643 445566778899999999999885 68999999999999999999999999999999999 Q ss_pred CceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019705. 224 SPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVP 298 (424) Q Consensus 224 ~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP 298 (424) |+++|+.+...+++++.+.+++.|++..|..|+++++++ ++|+++++++.+++|+||+|.+++++++||++|||| T Consensus 210 pg~il~~~~~~ls~e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VP 289 (351) T protein:vir:79 210 AGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVP 289 (351) T ss_pred CceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCC Confidence 999999887778999999999999998888899998888 578999999999999999999999999999999999 Q ss_pred HHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHH Q lcl|NC_019705. 299 PHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSAS 367 (424) Q Consensus 299 p~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~ 367 (424) |.++|..++++.+++|+|++.+.|+++||.|+++.||+ ++.+|.. ..++||..+|+++|.+. T Consensus 290 p~llGi~~~~t~~~~n~e~~~~~f~~~~l~Pl~~~ie~-ln~~lg~------~~~~F~~~~llr~d~~a 351 (351) T protein:vir:79 290 PQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLGD------EVVTFDDYEIPPAPVAA 351 (351) T ss_pred HHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcCc------ceeeeChhhhccccccC Confidence 99999998888888999999999999999999999985 7766632 24789999999999887 No 95 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=100.00 E-value=3e-57 Score=330.44 Aligned_cols=330 Identities=17% Similarity=0.236 Sum_probs=245.2 Q ss_pred CCCCcccccCCCCCchHHHHHhh-ccCcccCCccccchhhccccccccC-cccccHHHHhccHHHHHHHHHHHHhhc--c Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSW-FVGGRLVTPNQGSQTGPVSAHGHLG-DSSINDERILQISTVWRCVSLISTLTA--C 76 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~-~~~vs~~~~~~~~~v~~~i~~ia~~ia--~ 76 (424) |..-|..=.-++...==.+...+ |+.+.+..... ..+. .+......++..|+-+.++..+.++-+ + T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~----------~~~~~~~~~~~~~~~~pp~~~~~la~l~~a~~~h~ 70 (340) T protein:vir:98 1 MSKRKPRKAVAMTASAPQKMEAFTFGEPVPVLDKR----------DILDYVECISNGKWYEPPVSFSGLAKSLRSAVHHS 70 (340) T ss_pred CCCCCCCccccccccCccceeEEEcCCceeecCcc----------hhhhhhhhhhcCceecCCCCHHHHHHHHHhccccc Confidence 43222110000000000000000 11111100000 0000 001111223444444444444332222 1 Q ss_pred CceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEe Q lcl|NC_019705. 77 LPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVK 156 (424) Q Consensus 77 ~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~ 156 (424) -++..++ +.+.. ..+||++||..+|++ ++.+++++||||++++|+..|.+++|+|+++.+|++. T Consensus 71 s~i~~k~-------------n~l~~--~~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~ 134 (340) T protein:vir:98 71 SPIYVKR-------------NVLAS--TYIPHPLLSRQDFSR-FALDYLVFGNAFLEQRHSVTGQLIKLLTSPAKYTRRG 134 (340) T ss_pred hhhhhhh-------------hHHhh--ccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceEEEc Confidence 1222111 11221 248999999999965 5579999999999999999999999999999999988 Q ss_pred ecCceEEEEEEeCCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCC Q lcl|NC_019705. 157 LVGKKVVYRYQRDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL 235 (424) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~ 235 (424) .+++ .+|++..++....|+++||+|+|.++ .++++|+|++..+++++.+..+++.++.++|+||++|++||..+...+ T Consensus 135 ~~~~-~~~~~~~~~~~~~~~~~eViHir~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~il~~~~~~l 213 (340) T protein:vir:98 135 VDDS-VFWFVENFTQPHEFAPDTVFHLLEPDINQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMYVTDPAQ 213 (340) T ss_pred ccCc-EEEEEecCCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCC Confidence 7765 45667778888899999999999887 478999999999999999999999999999999999999999887778 Q ss_pred CHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcc Q lcl|NC_019705. 236 TEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS 310 (424) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~ 310 (424) ++++.+++++.|++..|..|+++++++ ++|+++++++.+++|+||+|.+++++++||++|||||.++|..++++. T Consensus 214 s~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~ 293 (340) T protein:vir:98 214 SATDVESLRDAMRNSKGLGNFKNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIG 293 (340) T ss_pred CHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCC Confidence 999999999999998888888999888 579999999999999999999999999999999999999999888888 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccC Q lcl|NC_019705. 311 WGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGD 364 (424) Q Consensus 311 ~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d 364 (424) +++|+|++.+.|+++||.|+++.||+ +|.+|..+ .++|+...|++.| T Consensus 294 ~~sn~e~~~~~f~~~~l~Pl~~~iee-~n~~L~~e------~~rF~~~~l~~~d 340 (340) T protein:vir:98 294 SLGDVEKVAKVFVRNELSPLQDRFRE-VNDWLGME------VIRFKEYTLDNPE 340 (340) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHH-HHhccccc------ccccCccccccCC Confidence 88999999999999999999999985 88887543 2679999999988 No 96 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=100.00 E-value=3.6e-57 Score=330.04 Aligned_cols=329 Identities=16% Similarity=0.211 Sum_probs=247.7 Q ss_pred CCCCcccccCCCCCchHHH---------HHhh-ccCcccCCccccc--hhhccccccccCcccccHHH----HhccHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWAR---------LQSW-FVGGRLVTPNQGS--QTGPVSAHGHLGDSSINDER----ILQISTVW 64 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~---------~~~~-~~~~~~~~~~~~~--~~~~~~~~~~~~~~~vs~~~----~~~~~~v~ 64 (424) |...|++=. ++...-=+. ...+ |+.+.++-..... ..+-+ ..+.+.--+|+... +-.++... T Consensus 1 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~-~~~~~~~pp~~~~~la~~~~~~~~h~ 78 (351) T protein:vir:78 1 MSKRRSRAP-RTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECW-SNGEWFEPPVSFAGLAKSFRASTHHS 78 (351) T ss_pred CCCCCCCCC-CCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhh-ccCceecCCCCHHHHHHHHhhhHhhh Confidence 777666511 111110000 0000 1111111000000 00000 00001111223221 11233333 Q ss_pred HHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeE Q lcl|NC_019705. 65 RCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVIS 144 (424) Q Consensus 65 ~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~ 144 (424) +||...++.+++ ..+||++||.++|++ ++.+++++||||++++|+..|.+++ T Consensus 79 ~~l~~k~n~l~~---------------------------~~~Pn~~~t~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~ 130 (351) T protein:vir:78 79 SALFFKANVLAS---------------------------TFRPHRWLSRHAFER-WALDFLTFGNGYLERRRNMVGGTLR 130 (351) T ss_pred hhhhhhhhHHhh---------------------------cccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEE Confidence 444333222222 247999999999975 5578999999999999999999999 Q ss_pred EEEecCceeEEeecCceEEEEEEeCCceEEecHhHEEEeecCCC-CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_019705. 145 LLPLQSANMDVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAK 223 (424) Q Consensus 145 l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~~-~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~ 223 (424) |+||++.+|++..+++..+|. ..++....|+++||||+|.+++ ++++|+|++..+++++.+..++..++.++|+||++ T Consensus 131 L~pl~~~~v~~~~~~~~~~~~-~~~~~~~~~~~~eVihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~ 209 (351) T protein:vir:78 131 LEPALAKYVRRKADFSGFVYV-NGWQERHEFAPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSH 209 (351) T ss_pred EEEecCcceEEeeeCCeEEEE-ecCCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 999999999999887765543 3456678899999999999874 78999999999999999999999999999999999 Q ss_pred CceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019705. 224 SPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVP 298 (424) Q Consensus 224 ~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP 298 (424) |++||+.+...+++++.+.+++.|++..|.+|+++++++ ++|+++++++.+++|+||+|.+++++++||++|||| T Consensus 210 pggIl~~~~~~ls~e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VP 289 (351) T protein:vir:78 210 AGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVP 289 (351) T ss_pred CceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCC Confidence 999999877778999999999999998888899999988 578999999999999999999999999999999999 Q ss_pred HHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHH Q lcl|NC_019705. 299 PHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSAS 367 (424) Q Consensus 299 p~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~ 367 (424) |.++|..++++.+++|+|++.+.|+++||.|+++.||+ ++.+|..+ +++||..+|+++|.+. T Consensus 290 p~llGi~~~~t~~~sn~e~~~~~f~~~~l~P~~~~iee-~n~~l~~~------~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 290 PQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLGDE------VVRFDDYEIPPAPVAA 351 (351) T ss_pred HHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcCcc------ceecChhhhccccccC Confidence 99999998888888999999999999999999999985 67666322 4789999999999887 No 97 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=100.00 E-value=4.7e-57 Score=329.40 Aligned_cols=323 Identities=15% Similarity=0.144 Sum_probs=246.2 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhcc---C Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTAC---L 77 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~---~ 77 (424) |..+|. ..... ....+...++....++| +...++..|+....+..+. - T Consensus 1 m~~~~~-----------------------~~~~~-~~~~~~~~~~~~~p~~~-----~~~~~~~~~~~~~~~~~~~~~~p 51 (337) T protein:vir:78 1 MTKRQQ-----------------------QPAQA-AASSPRPSVVFSMPEAI-----DPTAWMTDYTGVFYNPYGEYYQP 51 (337) T ss_pred CCCccc-----------------------Ccccc-cccCceeEEEecCcccc-----cCcchhHhhhhhhhccCcceecC Confidence 222111 11000 01111111111111222 3344566666665554443 2 Q ss_pred ceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHH----HHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCcee Q lcl|NC_019705. 78 PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQ----EFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~----~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) |+....- .+-.........+|..+||+.++++ ++++.++.|++++||||++++|+..|++++|+||++.+| T Consensus 52 P~~~~~L-----a~l~~~~~~h~~~L~~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~~~~L~pl~~~~v 126 (337) T protein:vir:78 52 PIDRKGL-----AKVARANAHHGAILMARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQVVGLHPLSSVYL 126 (337) T ss_pred CCCHHHH-----HHHhhcchhhhhHHHhhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCcEEEEEEeCCcee Confidence 3322100 0000000112446788999877654 689999999999999999999999999999999999999 Q ss_pred EEeecCceEEEEEEeCCceEEecHhHEEEeecCCC-CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCC Q lcl|NC_019705. 154 DVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE 232 (424) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~~-~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~ 232 (424) ++..++... .+..++....|+++||+|+|.+++ ++++|+||+..++.++.+..+++++..++|+||++|++||+.+. T Consensus 127 ~~~~d~~~~--~~~~~~~~~~~~~~eIiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~ 204 (337) T protein:vir:78 127 RRREDGCFV--YLQQGKPNLIYRPDDVIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATD 204 (337) T ss_pred EeeeCCeEE--EEEcCCceEEECCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC Confidence 988776543 334466778899999999999885 78999999999999999999999999999999999999999887 Q ss_pred CCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC Q lcl|NC_019705. 233 KVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK 307 (424) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~ 307 (424) ..+++++.+++++.|++..|+.|.++++++ ++|+++++++.+++|+||+|.+++++++||++|||||.++|...+ T Consensus 205 ~~l~~e~~~~lk~~~~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~~ 284 (337) T protein:vir:78 205 PNMDDDTEEEMKEMIANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPALAGIIPT 284 (337) T ss_pred CCCCHHHHHHHHHHHHHhcCcccccceEEEcCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcccccC Confidence 778999999999999998888888888887 678999999999999999999999999999999999999998776 Q ss_pred C-cccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhh Q lcl|NC_019705. 308 S-TSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLL 361 (424) Q Consensus 308 ~-~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~ 361 (424) + +.+++|+|++.+.|+++||.|+++.||++++++|++.... ..++++..+++ T Consensus 285 ~~~~~~~n~e~~~~~f~~~~L~P~~~~ie~~~n~~ll~~~~~--~~f~~~~~~~~ 337 (337) T protein:vir:78 285 NGGGGLGDPEKYDATYARNEVLPLCELVQDAINSAGLPRALW--VTFRETIGAAV 337 (337) T ss_pred CCcCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhcCChhhc--eeccccccccC Confidence 5 4566789999999999999999999999999988876543 34677777777 No 98 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=100.00 E-value=1.9e-56 Score=326.02 Aligned_cols=328 Identities=16% Similarity=0.193 Sum_probs=237.3 Q ss_pred CCCCcccccCCC---CCchH-HH-------HHhh-ccCcccCCccccc--hhhccccccccCcccccHHHHhccHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRT---NNGWW-AR-------LQSW-FVGGRLVTPNQGS--QTGPVSAHGHLGDSSINDERILQISTVWRC 66 (424) Q Consensus 1 ~~~~~~~~~~~~---~~G~~-~~-------~~~~-~~~~~~~~~~~~~--~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~ 66 (424) |..-|..=.-++ ...-- +. +..+ |+.+.++-..... ..+-+ ..+.+...+|+.. . T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~-~~~~~~~pp~~~~----------~ 69 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECW-PNGRWYEPPLSME----------G 69 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHHHh-hcCccccCCCCHH----------H Confidence 433221100000 00000 00 0000 1111110000000 00000 0001111112221 1 Q ss_pred HHHH--HHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeE Q lcl|NC_019705. 67 VSLI--STLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVIS 144 (424) Q Consensus 67 i~~i--a~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~ 144 (424) +-.+ ++...+-++..++ +-+. ...+||++||+++|++ ++.+++++||||++++|+..|++++ T Consensus 70 la~~~~~~~~h~~~l~~k~-------------n~l~--~~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~~~rn~~G~~~~ 133 (350) T protein:vir:11 70 LAKSVGSSVYLQSGLKFKR-------------NMLA--KTFIPHRLLSRATFEQ-FSLDWLTFGSAYLEQPRSRLGTRMP 133 (350) T ss_pred HHHHHhhhhhhccchhhhh-------------hhhh--hcccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCCEEE Confidence 1111 1111111222110 1121 1348999999999986 5678999999999999999999999 Q ss_pred EEEecCceeEEeecCceEEEEEEeCCceEEecHhHEEEeecCCC-CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_019705. 145 LLPLQSANMDVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAK 223 (424) Q Consensus 145 l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~~-~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~ 223 (424) |+||+|.+|++..+++. +|++..++....|+++||||+|++++ ++++|+||+.+++.++.+..++..+..++|+||++ T Consensus 134 L~~l~~~~vr~~~~~~~-~~~~~~~~~~~~~~~~eVihir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~ 212 (350) T protein:vir:11 134 LQAPLAKYMRRGTDLET-FYQVRSWKDEHEFEKGSVIQLREADINQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSH 212 (350) T ss_pred EEEeCCceeEeeecCCe-EEEEeeCCeEEEECcccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 99999999999887765 46677788889999999999998875 57999999999999999999999999999999999 Q ss_pred CceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019705. 224 SPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVP 298 (424) Q Consensus 224 ~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP 298 (424) |++||+.+...+++++++++++.|++..|+.|+++++++ ++|+++++++.+++|+||+|.+++++++||++|||| T Consensus 213 ~~gil~~~~~~ls~e~~~~l~~~~~~~~G~~N~~~~~v~~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VP 292 (350) T protein:vir:11 213 AGFILYMTDAAQNEEDIDALRTALKTAKGPGNFRNLFVYAPNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVY 292 (350) T ss_pred CceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeeecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCC Confidence 999999887778999999999999998888899999888 468999999999999999999999999999999999 Q ss_pred HHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhh Q lcl|NC_019705. 299 PHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGL 360 (424) Q Consensus 299 p~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l 360 (424) |.++|..++++.+++|+|++.+.|+++||.|+++.||+ ++.+|..+.. .+.+|++++| T Consensus 293 p~llGi~~~~t~~~sn~e~~~~~f~~~~L~P~~~~ie~-ln~~l~~~~~---~F~~~~~~~l 350 (350) T protein:vir:11 293 PQLMGVVPQNAGGFGSISDAAAVWASLELAPMQTRLQQ-VNEMIGEEVV---RFAQFDAPGL 350 (350) T ss_pred HHHhcccCCCCCCcCCHHHHHHHHHHHHHHHHHHHHHH-HHhhcCcccc---ccCcccccCC Confidence 99999998888888999999999999999999999985 8888764322 3457888887 No 99 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=100.00 E-value=5.2e-56 Score=323.65 Aligned_cols=331 Identities=17% Similarity=0.226 Sum_probs=237.6 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccchhhccccccc---cCccc-ccHHHHhccHHHHHHHHHH--HHhhccCceEE Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGH---LGDSS-INDERILQISTVWRCVSLI--STLTACLPLDV 81 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~-vs~~~~~~~~~v~~~i~~i--a~~ia~~~~~v 81 (424) |.=|-.+.....-...-...... ..-+...|...... +.... .....++.-|.-+.++-.+ ++...+-+++. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~--~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~~~i~~ 78 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKMTASAPKM--EAFTFGEPVPVLDRRDILDYVECISNGRWYEPPISFTGLAKSLRAAVHHSSPIYV 78 (344) T ss_pred CCcccCCCCCchHHhhcCCcCcE--EEEEcCCceeecCCcchhHHHHhhhcCccccCCCCHHHHHHHHHhhhhhccchhh Confidence 33332222211111000000000 00000011000000 00000 0011111111222222222 11112223322 Q ss_pred EEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCce Q lcl|NC_019705. 82 FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK 161 (424) Q Consensus 82 ~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~ 161 (424) ++ +.+.. ..+||++||+.+| +.++.+++++||||++++|+..|++++|+||++.+|++..+++. T Consensus 79 k~-------------n~l~~--~~~Pn~~~t~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~L~~l~~~~vr~~~~~~~ 142 (344) T protein:vir:60 79 KR-------------NILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEEDV 142 (344) T ss_pred hh-------------hHHHh--hccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCcceEEEeecCCe Confidence 11 22222 3489999999999 57889999999999999999999999999999999999888765 Q ss_pred EEEEEEeCCceEEecHhHEEEeecCCC-CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHH Q lcl|NC_019705. 162 VVYRYQRDSEYAEFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 162 ~~~~~~~~~~~~~~~~~eiih~r~~~~-~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~ 240 (424) +|++..++....|+++||+|+|.+++ ++++|+||+..++.++.+..+++.+..++|+||++|++||+.+...+++++. T Consensus 143 -~~~v~~~~~~~~~~~~eIiHir~~~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~~~ls~e~~ 221 (344) T protein:vir:60 143 -YWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDI 221 (344) T ss_pred -EEEEccCCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHH Confidence 45566777788999999999998874 7899999999999999999999999999999999999999987777899999 Q ss_pred HHHHHHHHHHhCCcccCcceec------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchh Q lcl|NC_019705. 241 SQVEENFKEIAGGPVKKRLWIL------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSG 314 (424) Q Consensus 241 ~~~~~~~~~~~~~~~ag~~~~l------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n 314 (424) +++++.|++..++ ++++.+++ ++|+++++++.+++|+||+|.+++++++||++|||||.++|..++++.+++| T Consensus 222 ~~ik~~~~~~~g~-~~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n 300 (344) T protein:vir:60 222 EMLRENMVKSKGR-NNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGD 300 (344) T ss_pred HHHHHHHHHhcCC-CCCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCcccc Confidence 9999999987754 67888877 4799999999999999999999999999999999999999998888888899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCH Q lcl|NC_019705. 315 IEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDS 365 (424) Q Consensus 315 ~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~ 365 (424) +|++.+.|+++||.|+++.|| +++.+|..+ .++|+...|...|. T Consensus 301 ~e~~~~~f~~~~L~Pl~~~~e-~ln~~lg~~------~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 301 IEKVAKVFVRNELIPLQDRIR-EINGWLGQE------VIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHhcCCc------ccccCccccCCCCC Confidence 999999999999999999998 588887432 23566556655555 No 100 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=100.00 E-value=5.7e-56 Score=323.47 Aligned_cols=333 Identities=17% Similarity=0.214 Sum_probs=235.1 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhcccccc-ccCcc-cccHHHHhccHHHHHHHHHHH--Hhhcc Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHG-HLGDS-SINDERILQISTVWRCVSLIS--TLTAC 76 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~vs~~~~~~~~~v~~~i~~ia--~~ia~ 76 (424) |..-|- +-=-......-........-......++.... .+... ......++.=|.-+.++-.+. +..-+ T Consensus 1 ~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~ 73 (344) T protein:vir:56 1 MSKKKG-------KTPQPAAKTMTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHS 73 (344) T ss_pred CCCCCC-------CCCchhhHHhhcCCCceEEEEcCCceeecCcchhhhHHHhhhcCccccCCCCHHHHHHHHhhhhhhC Confidence 332221 10000000000000000000000000000000 00000 000011111122233322221 11222 Q ss_pred CceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEe Q lcl|NC_019705. 77 LPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVK 156 (424) Q Consensus 77 ~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~ 156 (424) -++..++ +-+.. ..+||++||+.+| +.++.+++++||||++++|+..|++++|+|+++.+|++. T Consensus 74 s~i~~k~-------------n~l~~--~~~Pnp~~t~~~f-~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~ 137 (344) T protein:vir:56 74 SPIYVKR-------------NILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRG 137 (344) T ss_pred ccceehh-------------hhHHh--hcCCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEEe Confidence 2333321 12222 3489999999999 677889999999999999999999999999999999998 Q ss_pred ecCceEEEEEEeCCceEEecHhHEEEeecCCC-CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCC Q lcl|NC_019705. 157 LVGKKVVYRYQRDSEYAEFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL 235 (424) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~~-~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~ 235 (424) .+++. +|++..++....|+++||+|+|.+++ ++++|+||+..++.++.+..+++++..++|+||++|++||+.+...+ T Consensus 138 ~~~~~-~~~~~~~g~~~~~~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d~~l 216 (344) T protein:vir:56 138 VEEDV-YWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQ 216 (344) T ss_pred ecCCE-EEEEecCCeEEEEcCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCC Confidence 87765 45566777788999999999998874 78999999999999999999999999999999999999999877678 Q ss_pred CHHHHHHHHHHHHHHhCCcccCcceec------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCc Q lcl|NC_019705. 236 TEQQRSQVEENFKEIAGGPVKKRLWIL------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST 309 (424) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~ag~~~~l------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~ 309 (424) ++++++++++.|++..| .+++++++| ++|+++++++.+++|+||+|.+++++++||++|||||.++|..++++ T Consensus 217 s~e~~~~lk~~~~~~~g-~~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t 295 (344) T protein:vir:56 217 DRNDIEMLRENMVKSKG-RNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENV 295 (344) T ss_pred CHHHHHHHHHHHHHhcC-CCCccceEEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCC Confidence 99999999999998765 477899988 47999999999999999999999999999999999999999988888 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCH Q lcl|NC_019705. 310 SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDS 365 (424) Q Consensus 310 ~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~ 365 (424) .+++|+|++.+.|+++||.|+++.||+ ++.+|..+. ++|+.-.|...|- T Consensus 296 ~~~~n~eq~~~~f~~~tL~Pl~~~ie~-~n~~l~~~~------~~F~~y~l~~~~~ 344 (344) T protein:vir:56 296 GSLGDIEKVAKVFVRNELIPLQDRIRE-INGWIGQEV------IRFKNYSLDTDNG 344 (344) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHH-HHhhhcccc------ccCCCccccccCC Confidence 888999999999999999999999985 777886432 3444434433332 No 101 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=100.00 E-value=1.1e-55 Score=321.89 Aligned_cols=328 Identities=18% Similarity=0.229 Sum_probs=237.2 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCcccc---chhhcccccccc---Cc-ccccHHHHhccHHHHHHHHHH--H Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQG---SQTGPVSAHGHL---GD-SSINDERILQISTVWRCVSLI--S 71 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~---~~-~~vs~~~~~~~~~v~~~i~~i--a 71 (424) |..-|.. | .. .... .....+... +...|....+.. .. ..+....++.=|.-+.++-.+ | T Consensus 1 ~~~~~~~---~-~~---~~~~-----~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a 68 (344) T protein:vir:20 1 MSKKKGK---T-PQ---PAAK-----TMTASGPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRA 68 (344) T ss_pred CCcccCC---C-Cc---chhh-----hhhccCCceEEEEcCCceEecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhh Confidence 3332221 0 00 0000 000000000 000000000000 00 000011111112222222222 2 Q ss_pred HhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCc Q lcl|NC_019705. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSA 151 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~ 151 (424) +...+-++..++ +-+.. ..+||++||+.+| +.++.+++++||||++++|+..|++++|+|+++. T Consensus 69 ~~~h~~~i~~k~-------------n~l~~--~~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~ 132 (344) T protein:vir:20 69 AVHHSSPIYVKR-------------NILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAK 132 (344) T ss_pred hhhhCccceehh-------------hhHHH--hccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCCc Confidence 222233343321 12222 2489999999999 5788999999999999999999999999999999 Q ss_pred eeEEeecCceEEEEEEeCCceEEecHhHEEEeecCCC-CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEc Q lcl|NC_019705. 152 NMDVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILST 230 (424) Q Consensus 152 ~v~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~~-~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~ 230 (424) +|++..+++. +|.+..++....|+++||+|+|.+++ ++++|+||+..++.++.+..+++.+..++|+||++|++||+. T Consensus 133 ~vr~~~~~~~-~~~~~~~~~~~~~~~~eIiHir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~ 211 (344) T protein:vir:20 133 YTRRGVEEDV-YWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYV 211 (344) T ss_pred eeEeeecCCE-EEEEccCCeEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEe Confidence 9999887765 44566777788999999999999885 789999999999999999999999999999999999999998 Q ss_pred CCCCCCHHHHHHHHHHHHHHhCCcccCcceec------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCC Q lcl|NC_019705. 231 GEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGD 304 (424) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~ 304 (424) +...+++++.+.+++.|++..++ ++++.++| ++|+++++++.+++|+||+|.+++++++||++|||||.++|. T Consensus 212 ~d~~l~~e~~~~ik~~~~~~~g~-~n~r~l~l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~llGi 290 (344) T protein:vir:20 212 TDAVQDRNDIEMLRENMVKSKGR-NNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGG 290 (344) T ss_pred cCcCCCHHHHHHHHHHHHHhcCC-CCccceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcc Confidence 77778999999999999987654 67888877 469999999999999999999999999999999999999999 Q ss_pred CCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCH Q lcl|NC_019705. 305 VEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDS 365 (424) Q Consensus 305 ~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~ 365 (424) .++++.+++|+|++.+.|+++||.|+++.|| +++.+|..+ .++|+...|...|. T Consensus 291 ~~~~t~~~~n~e~~~~~f~~~~l~P~~~~~e-~in~~lg~~------~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 291 KPENVGSLGDIEKVAKVFVRNELIPLQDRIR-EINGWLGQE------VIRFKNYSLDTDND 344 (344) T ss_pred CCCCCCccccHHHHHHHHHHHHHHHHHHHHH-HHHHhcCCc------ccccCccccccCCC Confidence 8888888889999999999999999999998 578777433 24566666655555 No 102 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=100.00 E-value=8.5e-55 Score=317.02 Aligned_cols=331 Identities=16% Similarity=0.135 Sum_probs=236.9 Q ss_pred CCCCccccc--CCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHH--HHhhcc Q lcl|NC_019705. 1 MEEPKYTID--LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLI--STLTAC 76 (424) Q Consensus 1 ~~~~~~~~~--~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~i--a~~ia~ 76 (424) |..-+-+-. -.|..+. +. ..|..+. |... ......... .+....++.-|.-+..+-.+ ++..-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~--~~-~~~~~~~---~~~~---~~~~y~~~~---~~~~~~~~epp~~~~~la~~~~~~~~h~ 68 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPI--ND-RTFSLSE---ITAS---PALDYVGIG---FDENYNCYLPPVNRHALAKLPHQNAQHG 68 (345) T ss_pred CCccccccchhhhcCCCc--eE-EEeecCC---cccc---hhhccccee---eecCCccccCCCCHHHHHHHhhcchhhc Confidence 221111100 0000010 00 0011010 0000 000000000 00011122222222222211 111111 Q ss_pred CceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEe Q lcl|NC_019705. 77 LPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVK 156 (424) Q Consensus 77 ~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~ 156 (424) -++.++ .+-+ +...+||++||+.+|++ ++.+++++||||++++|+..|.+++|+|++|.+|++. T Consensus 69 ~~i~~k-------------~n~l--~~~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr~~ 132 (345) T protein:vir:37 69 GILHSR-------------ANMV--SATYEGGKALSKMEMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVH 132 (345) T ss_pred chhhhh-------------hhHH--hhccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEecCceeEEe Confidence 122211 0112 12448999999999975 5578999999999999999999999999999999998 Q ss_pred ecCceEEEE----EEeCCceEEecHhHEEEeecCCC-CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcC Q lcl|NC_019705. 157 LVGKKVVYR----YQRDSEYAEFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG 231 (424) Q Consensus 157 ~~~~~~~~~----~~~~~~~~~~~~~eiih~r~~~~-~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~ 231 (424) .+++..++. +...+....|+++||+|||.+++ ++++|+||+..++.++.+..++++++.++|+||++|++||+.+ T Consensus 133 ~d~~~~~~~~~~~~~~~g~~~~~~~~eViHir~~~~~~~~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t 212 (345) T protein:vir:37 133 KDGGYSYLMKKSLYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYST 212 (345) T ss_pred ecCCeeEEEeeeeeccCceEEEEccccEEEEcCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC Confidence 888765432 23345667899999999998875 6799999999999999999999999999999999999999887 Q ss_pred CCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC Q lcl|NC_019705. 232 EKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~ 306 (424) ...+++++.+++++.|++..++.|.++++++ ++|+++++++.+++|+||++.+++++++||++|||||.++|..+ T Consensus 213 ~~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~liGi~~ 292 (345) T protein:vir:37 213 DPDLTEEMEEEIARKISESKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIP 292 (345) T ss_pred CCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccc Confidence 7778999999999999998887776655555 56899999999999999999999999999999999999999988 Q ss_pred CCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhc Q lcl|NC_019705. 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLR 362 (424) Q Consensus 307 ~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~ 362 (424) +++.+++|+|++.+.|+++||.|++++||+++|+.+-. ....+++||..+|++ T Consensus 293 ~~t~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~~e~---~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 293 TNTGGLGDPLKYREVYHYDEVMPLQEIIAETINQDPEI---KNLLKIKFREQNFAK 345 (345) T ss_pred CCCCCcccHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc---CCcceEEECchhhcC Confidence 88888899999999999999999999999999974311 234678999999988 No 103 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=100.00 E-value=8.3e-55 Score=317.09 Aligned_cols=326 Identities=16% Similarity=0.180 Sum_probs=238.0 Q ss_pred ccCCCCCchHHHHHhhccCccc-CCcccc---chhhccccccccCcccc---cHHHHhccHHHHHHHHHHHH--hhccCc Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRL-VTPNQG---SQTGPVSAHGHLGDSSI---NDERILQISTVWRCVSLIST--LTACLP 78 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~-~~~~~~---~~~~~~~~~~~~~~~~v---s~~~~~~~~~v~~~i~~ia~--~ia~~~ 78 (424) |+-...+ ...... ..+... +...|... ..+.+..+ ....++.-|.-+.++-.+.+ .-.+-. T Consensus 1 ~~~~~~~---------~~~~~~~~~~~~~~~f~~~~~~~~-~~~~y~~~~~~~~~~~~epp~~~~~la~l~~~~~~h~~~ 70 (345) T protein:vir:37 1 MKTNVKT---------DNKKGIVIAPINDRTFSLNEISAS-PALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGI 70 (345) T ss_pred CCCCccc---------cchhhcccCcceeEEeecCCcccc-cchhhhhhhhcCCccccCCCCCHHHHHHHhhcccccccc Confidence 2211100 000000 000000 00000000 00000000 01112233333333333221 112222 Q ss_pred eEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeec Q lcl|NC_019705. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~ 158 (424) +..+ .+-+. ...+||++||+++|++ ++.+++++||||++++|+..|.+++|+||++.+|++..+ T Consensus 71 i~~k-------------~n~l~--~~~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d 134 (345) T protein:vir:37 71 LHSR-------------ANMVS--SLYEGGKALSRMDMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVRKD 134 (345) T ss_pred eeee-------------chHHH--hhccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEEe Confidence 2221 12222 2348999999999985 567899999999999999999999999999999999988 Q ss_pred CceEEEE----EEeCCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC Q lcl|NC_019705. 159 GKKVVYR----YQRDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK 233 (424) Q Consensus 159 ~~~~~~~----~~~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~ 233 (424) ++..++. +..++....|+++||||+|.++ .++++|+||+..++.++.+..++++++.++|+||++|++||..+.. T Consensus 135 ~~~~~~~~~~~~~~~g~~~~~~~~dVihir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d~ 214 (345) T protein:vir:37 135 GGYSYLMKKSLYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDP 214 (345) T ss_pred CCeeEEEEEeEecCCceEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEecCC Confidence 7664422 2234566789999999999887 4679999999999999999999999999999999999999998777 Q ss_pred CCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC Q lcl|NC_019705. 234 VLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS 308 (424) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~ 308 (424) .+++++.+++++.|++..|..|.++++++ ++|+++++++.+++|+||+|.+++++++||++|||||.++|..+++ T Consensus 215 ~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~p~g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~ 294 (345) T protein:vir:37 215 DLTEEMEEEIARKISESKGVGNFRSMFVNIANGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTN 294 (345) T ss_pred CCCHHHHHHHHHHHHHhcCcccccceEEEcCCCcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCC Confidence 78999999999999998888888888877 6799999999999999999999999999999999999999998888 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhc Q lcl|NC_019705. 309 TSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLR 362 (424) Q Consensus 309 ~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~ 362 (424) +.+++|+|++.+.|+++||.|+++.||+++++.+..+ ....++|+..+|.+ T Consensus 295 ~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~~~~~~---~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 295 TGGLGDPLKYREVYHYDEVMPLQEIIAETINQDPEIK---NLLKIKFREQNFAK 345 (345) T ss_pred CCCcccHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC---CcceEEecchhhcC Confidence 8888999999999999999999999999999643222 23457888777766 No 104 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=100.00 E-value=1.5e-52 Score=304.75 Aligned_cols=242 Identities=16% Similarity=0.233 Sum_probs=197.2 Q ss_pred CchHHHHHhhccCcccCCccccchh--hccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccc Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQT--GPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~ 91 (424) +|||++. .+++...+...... ..+.......+..|+.+.|+++|+||+||++||++||++||+++++.+ T Consensus 1 MglF~~~----~~r~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~----- 71 (251) T protein:vir:46 1 MGIFYKN----EKRDLQYNEDDLQMMVQTLPSFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQ----- 71 (251) T ss_pred CCccccc----cccccCCCccchhhhhhhhccccCcCcceechhhhhccHHHHHHHHHHHHhHhhCceEEeeCcc----- Confidence 6776543 12222222221111 112233344566789999999999999999999999999999997543 Q ss_pred eeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc-eEEEEEE--- Q lcl|NC_019705. 92 KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK-KVVYRYQ--- 167 (424) Q Consensus 92 ~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~-~~~~~~~--- 167 (424) ....|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|++++|+||+|++|++..+++ ...|.|. T Consensus 72 -~~~~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~ 150 (251) T protein:vir:46 72 -INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRID 150 (251) T ss_pred -ccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCceEEEEECCCCcEEEEEEEec Confidence 23468999999999999999999999999999999999999999999999999999999999988754 3444443 Q ss_pred --eCCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019705. 168 --RDSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 168 --~~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~ 245 (424) .++....|+++||||||+++.|+++|+||+.+++.++.+..++++++.++|+||++|+|+|+++....++++++++++ T Consensus 151 ~~~~g~~~~~~~~diiH~r~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~~e~~~~~~~ 230 (251) T protein:vir:46 151 SNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRARE 230 (251) T ss_pred cCCcceeEEECCccEEEecCcCCCCeeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCCHHHHHHHHH Confidence 234567899999999999999999999999999999999999999999999999999999999988778888899999 Q ss_pred HHHHHhCC-cccCcceecCCCcee Q lcl|NC_019705. 246 NFKEIAGG-PVKKRLWILEAGFST 268 (424) Q Consensus 246 ~~~~~~~~-~~ag~~~~l~~g~~~ 268 (424) .|++.++| +|+|++++..+ = T Consensus 231 ~~~~~~~g~~n~g~~~~gm~---~ 251 (251) T protein:vir:46 231 EFPKVLVELNKLGKLSYSMN---Q 251 (251) T ss_pred HHHHHhcCcccccccccccC---C Confidence 99988776 68887665332 2 No 105 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=100.00 E-value=5.2e-47 Score=274.34 Aligned_cols=208 Identities=17% Similarity=0.195 Sum_probs=177.9 Q ss_pred eEEeecCceEEEEEE-----eCCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCce Q lcl|NC_019705. 153 MDVKLVGKKVVYRYQ-----RDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQ 226 (424) Q Consensus 153 v~~~~~~~~~~~~~~-----~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ 226 (424) |++..++.. +|.+. .++....++++||+|||+++ .++++|+||+..++.++....++++|+.++|+||++|+| T Consensus 1 ~r~~~dg~~-~y~~~~~~~~~~g~~~~~~~~eilH~r~~~~~~~~~Glspi~~a~~~i~~~~aa~~~~~~~f~Ng~~p~g 79 (219) T protein:vir:98 1 MRVCKDGNY-KYLMKKSLYDTKSEIYEYNKNDVIFIKLYDPMQQVYGSPDYVGGITSALLNSDATIFRRRYYSNGAHMGF 79 (219) T ss_pred CceeecCeE-EEEEecceecCCceeEEeccccEEEecCCCCCCCcceecHHHHHHHHHHHHHHHHHHHHHHHhcCCCCce Confidence 565666543 33332 23567889999999999887 688999999999999999999999999999999999999 Q ss_pred eEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_019705. 227 ILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHL 301 (424) Q Consensus 227 vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~ 301 (424) ||+++...+++++++++++.|++..|+.|+++++++ ++|++|++++++++|+||+|++++++++||++|||||.+ T Consensus 80 il~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fgVPp~~ 159 (219) T protein:vir:98 80 ILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEFANIKNISAQDVLTSHRFPPGL 159 (219) T ss_pred EEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHH Confidence 998887678999999999999998887787666665 568999999999999999999999999999999999999 Q ss_pred hCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccC Q lcl|NC_019705. 302 VGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGD 364 (424) Q Consensus 302 l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d 364 (424) ||..++++.+++|+|++.+.|+++||.||++.||++||.+++.+.+. ++.|+.+.+.-.+ T Consensus 160 lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~~~~~~~---~~~F~~~~~~d~~ 219 (219) T protein:vir:98 160 SGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDYEIKSAL---KVNFKQPEKRDKN 219 (219) T ss_pred cccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCcc---EEeecCcccccCC Confidence 99988888888999999999999999999999999999986655443 3566655443333 No 106 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.94 E-value=1.4e-27 Score=167.82 Aligned_cols=388 Identities=10% Similarity=0.007 Sum_probs=235.5 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +-+.+.+.+...+.- ++....... +.......+..+ ...|.+++.++++|+++|+.+.+.++.+.-. +...+.. T Consensus 1 ~~~~D~~~~~~~~~g--~~~~~~~~~-~~~~~~~~~~~l-~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~--d~~~~~~ 74 (437) T protein:vir:52 1 MKFFDGIKSLALKLG--SKQEQTYYS-PSLSLTDDLVQL-EALWRDNWIANKVCIKRPEDMVRNWREIYSN--DLNSKQL 74 (437) T ss_pred CchhhhhHhHHhcCC--Cccccceee-cCccccccHHHH-HHHHHhCchhhHHhhcchHHhhcCCceEecC--CCCHHHH Confidence 223333333322111 111111111 111111122222 2346788999999999999999999998531 1111111 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC---------CceeEEEEecCceeEEe--ec---- Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA---------GDVISLLPLQSANMDVK--LV---- 158 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~---------G~~~~l~~l~~~~v~~~--~~---- 158 (424) ..+...+. +-+ -.+-+..++.+.-++|.|++++.++.. |.+..+.+++++.|++. .+ T Consensus 75 ---~~~~~~~~-~l~----~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~ 146 (437) T protein:vir:52 75 ---DLFTKFER-SLK----LRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVL 146 (437) T ss_pred ---HHHHHHHH-hhc----HHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhcccccccccccc Confidence 12222332 222 244555556666789999999988763 67889999999988732 11 Q ss_pred ----CceEEEEEEeCCceEEecHhHEEEeecCC----CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEc Q lcl|NC_019705. 159 ----GKKVVYRYQRDSEYAEFSQKEIFHLKGFG----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILST 230 (424) Q Consensus 159 ----~~~~~~~~~~~~~~~~~~~~eiih~r~~~----~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~ 230 (424) +....|.+..+.....+.++.|+||.+.. .+.+.|+|.++.+...+.....+......++.+.... ++++ T Consensus 147 s~~fg~p~~y~v~~~~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~--v~k~ 224 (437) T protein:vir:52 147 SPNFGRYSEYSILGGSQSITVHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKID--IFKI 224 (437) T ss_pred ccccCcceEEEEecCCcceeEccceeEEecCccCCCccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCC--ceec Confidence 12345666666666789999999997532 3557899999999999999999999988887765443 3444 Q ss_pred CC--CCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC Q lcl|NC_019705. 231 GE--KVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS 308 (424) Q Consensus 231 ~~--~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~ 308 (424) +. +.......+.+.++++......+.+++++++.+.+|++++.+..++. +...+...+||++++||..+|.+...+ T Consensus 225 ~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~e~~~~~~sgl~--~~l~~~~~~iaaa~~iP~t~L~G~s~~ 302 (437) T protein:vir:52 225 AGLSDKIAAGMENEVASVISAVQEIKSATNSLLLDAENEYDRKELTFTGLK--DLLTEFRNAVAGAADMPVTILFGQSVS 302 (437) T ss_pred chHHHHhcCCcHHHHHHHHHHHHHhcCCCceEEEcCCcceEEEecCcCCHH--HHHHHHHHHHHHHhcCchhhhcCcCcc Confidence 32 12222223445555555544555678999999999999988877654 788889999999999999999776655 Q ss_pred cccchhHHHHHHHHHH-------HHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHH-------HHHHHHH Q lcl|NC_019705. 309 TSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSAS-------RAAFMKA 374 (424) Q Consensus 309 ~~~~~n~e~~~~~~~~-------~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~-------~~~~~~~ 374 (424) ..+ +.++..++||. .-+.|.++.+-+.+-+..+.+.. ..+ .|.+++|...+.++ +++++.+ T Consensus 303 Gla--sge~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~~-~~~--~~~f~pL~~~s~kekae~~~~~a~a~~~ 377 (437) T protein:vir:52 303 GLA--SGDEDIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGGLP-ADW--WFEFVPLTTVKQEQQINMLNTFATAANT 377 (437) T ss_pred ccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-Ccc--eEEeCCcCCcCHHHHHHHHHHHHHHHHH Confidence 543 45667777776 56778888877777666554332 123 34445776666554 4556888 Q ss_pred HHhCCCCCHHHHHHHhC----CCCCCCCCeeeecccccchhhc---cc-cCCCcccCC Q lcl|NC_019705. 375 MGEAGLRTINEMRRTDN----LPPLPGGDVAMRQSQYVPITDL---GT-NKEPRNNGA 424 (424) Q Consensus 375 ~~~~g~~t~NE~R~~~g----~~p~~ggd~~~~~~n~~~~~~~---~~-~~~~~~~ga 424 (424) ++++|+++++|+|+.|. ++.++..|..-...+-.+.++. .+ .+.+...++ T Consensus 378 ~~~~g~i~~~e~r~~L~~~g~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (437) T protein:vir:52 378 LIQNGVLNEYQIANELRESGLFANISAEHIEELKNADEFAGNFEEPEKMEGAQVQNSE 435 (437) T ss_pred HHhcCCCCHHHHHHHHHhcCCCCCCCccccccccCCCCCCCccCCCCCCCCCCCCCCC Confidence 89999999999999873 3444433322111111000000 00 000111111 No 107 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.88 E-value=1e-22 Score=141.08 Aligned_cols=398 Identities=11% Similarity=0.037 Sum_probs=217.6 Q ss_pred CCCCcccccCCCC-------------CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTN-------------NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCV 67 (424) Q Consensus 1 ~~~~~~~~~~~~~-------------~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i 67 (424) |.-+.-.+.+.-+ .+-...+.++.+ .. .......+.......+-.+ ...|.+++.++.+| T Consensus 48 ~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~-----~~-~~~~~~~~~~~~~~~~~~l-~a~Y~~~~l~r~iV 120 (537) T protein:vir:10 48 MAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYAN-----PN-LSEGLVLWYAQQAFIGHQM-CALIATHWLVNKAC 120 (537) T ss_pred CCCCCccCcccccccccccchhccccccchhhhhhhcc-----cc-ccchhhhhccccCCccHHH-HHHHHhCchhhhhh Confidence 1111111111100 011111100000 00 0000001111111111111 23466789999999 Q ss_pred HHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee-CCC------- Q lcl|NC_019705. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR-NSA------- 139 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r-~~~------- 139 (424) +++|+.+.+-++++.-.+.+.. + ......|...-+.+..+..|.+ ++....++|.+++++.- ..+ T Consensus 121 d~~A~d~~r~~~~i~~~~~~~~--~----~~~~~~l~~~~~~l~~~~~l~~-a~~~~rlyG~~~i~i~v~~~D~~~~~~P 193 (537) T protein:vir:10 121 SQMPRDAMRKGYKIISDDGNEL--D----PKDAKFIDRYDRAFNIKKHAIQ-FVRKGRIFGIRIALFKVDSPDPYYYEKP 193 (537) T ss_pred hhhhHHhhcCCceeecCCcccc--c----HHHHHHHHHHHHHhhHHHHHHH-HHHhcccccceEEEEeecCcCCcccccc Confidence 9999999999988853322111 1 1122223222222333344444 44444567988887642 122 Q ss_pred --------CceeEEEEecCceeEEeec----Cc--eE-E---EEEEeCCceEEecHhHEEEeecCCC-------CCcccC Q lcl|NC_019705. 140 --------GDVISLLPLQSANMDVKLV----GK--KV-V---YRYQRDSEYAEFSQKEIFHLKGFGF-------TGLVGL 194 (424) Q Consensus 140 --------G~~~~l~~l~~~~v~~~~~----~~--~~-~---~~~~~~~~~~~~~~~eiih~r~~~~-------~~~~G~ 194 (424) |....|.+++|..+.+... .+ .. + -.|... +..|.++.|+||.+... .++.|. T Consensus 194 l~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v~--g~~iH~SRli~f~g~~~p~~~~~~~~~~G~ 271 (537) T protein:vir:10 194 FNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLIN--GKKYHRSHLAIYINDEVVDFLKPSYIYGGV 271 (537) T ss_pred cccccccccceeEEEEechhhcccccchhhhccCCccccCCceeeeec--CeEecceeEEEecCCCCchhhhcccCcccc Confidence 2346788888877764321 11 10 0 122333 35688999999975432 345799 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC-Cceeeeccc Q lcl|NC_019705. 195 SPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-GFSTSAIGV 273 (424) Q Consensus 195 s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~-g~~~~~l~~ 273 (424) |.++.+...+.....+.......+......-.-+.......++++ +.+.++......+..++++++. +.+++++.. T Consensus 272 Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~---~~~r~~~~~~~r~n~g~~~id~e~e~~e~~~~ 348 (537) T protein:vir:10 272 PLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQ---FDETMSWWTATRDNYQVRVVDKDNEDVVQIDT 348 (537) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHH---HHHHHHHHHhhcCCcceeEecCCCceeEEEec Confidence 999999999999988888888888776654322222222334443 4444444444444456788876 588998887 Q ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHH------HHHHHHHHHHHHHHhhccChhh Q lcl|NC_019705. 274 TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY------TLQPYISRWENSIQRWLIPAKD 347 (424) Q Consensus 274 ~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~------tl~P~~~~ie~~l~~~l~~~~~ 347 (424) +...+ .+........||.+.|||..+|.+...+..+ ++.+...++|+.. .|.|.++.+.+.+-+..+.+. T Consensus 349 ~lsgl--~~~l~~~~~~iAa~~~IP~t~L~G~sp~Gln-atGe~D~~~yyd~I~~~Qe~l~p~l~~l~~ll~~~~~~~~- 424 (537) T protein:vir:10 349 TLNDL--DKVIMNQYQLVCAIARTPAPKMLGTVPTGFN-STGDYEEASYHEECESTQDDMRPLIDRHHQLVCRSHLRKR- 424 (537) T ss_pred cCCCH--HHHHHHHHHHHHhhhCCCceeeccCCccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC- Confidence 77765 4788888999999999999977554332221 2234455555533 478988888887776665542 Q ss_pred hcccchhhhhhhhhccCHHHHHH-------HHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecc--------------- Q lcl|NC_019705. 348 VGRIHAEHNLDGLLRGDSASRAA-------FMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQS--------------- 405 (424) Q Consensus 348 ~~~~~~~fd~~~l~~~d~~~~~~-------~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~--------------- 405 (424) ..+.|.+++|...|.+++++ ++++++++|++++||+|+.|+.+|..|-+.+.... T Consensus 425 ---~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~~~ 501 (537) T protein:vir:10 425 ---IRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDVDDEGK 501 (537) T ss_pred ---cceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccccCCCChhhhhcccCCccCC Confidence 23455567888888777665 48899999999999999998876532211111000 Q ss_pred --c----------ccchhhccc-cCCCcccCC Q lcl|NC_019705. 406 --Q----------YVPITDLGT-NKEPRNNGA 424 (424) Q Consensus 406 --n----------~~~~~~~~~-~~~~~~~ga 424 (424) . ..+.+..++ .++++++|| T Consensus 502 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 533 (537) T protein:vir:10 502 PVRIIEDQPAPSEMFGATSSGESANDPRDSGA 533 (537) T ss_pred cCCCCCCCCCccccCCCCccccccCCCccCcc Confidence 0 001111111 123334444 No 108 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.86 E-value=4.2e-22 Score=137.80 Aligned_cols=404 Identities=10% Similarity=0.047 Sum_probs=222.7 Q ss_pred CC----CCcccccCCCCCchHHHHHhhccCcc--------cC-----Cccc-cchhh------------------ccccc Q lcl|NC_019705. 1 ME----EPKYTIDLRTNNGWWARLQSWFVGGR--------LV-----TPNQ-GSQTG------------------PVSAH 44 (424) Q Consensus 1 ~~----~~~~~~~~~~~~G~~~~~~~~~~~~~--------~~-----~~~~-~~~~~------------------~~~~~ 44 (424) |. .|.-.+... ..|--.|.++.-..+. .. .|.. .+... .+... T Consensus 1 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~ 79 (532) T protein:vir:94 1 MADTDPTPRPEITYA-TLQQAQRVDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEA 79 (532) T ss_pred CCCCCCCCCcceehh-hhhhHhhhhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCcccccccccccccc Confidence 11 111111111 1122222211111000 00 0000 00000 00000 Q ss_pred cccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHH Q lcl|NC_019705. 45 GHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQL 124 (424) Q Consensus 45 ~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ 124 (424) ....+-.+ ...|.+++.++.+|+.+|+.+.+-.+++.-.+.+... ......|...-..+ .-.+-+..++... T Consensus 80 ~~~~~~~l-~a~Y~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~~~------~~~~~~i~~~~~~l-~v~~~l~~a~~~~ 151 (532) T protein:vir:94 80 TSWPGFPT-LALLAQLPEYRTMHETPADECVRAWGKITCSSKDELA------ADKATRITQKLEQY-NVRTLVRTVVIHD 151 (532) T ss_pred cccchHHH-HHHHHcCchhhhhhccchHHHhhCCceEeeCCccccc------hHHHHHHHHHHHhh-hHHHHHHHHHHhh Confidence 00111111 2345678999999999999999999988543322211 11222222111111 2234555566666 Q ss_pred HHcCCeEEEEeeCC-------------------CCceeEEEEecCceeEEeecC--c--------eEEEEEEeCCceEEe Q lcl|NC_019705. 125 CFYGNAYALVDRNS-------------------AGDVISLLPLQSANMDVKLVG--K--------KVVYRYQRDSEYAEF 175 (424) Q Consensus 125 ll~G~a~~~~~r~~-------------------~G~~~~l~~l~~~~v~~~~~~--~--------~~~~~~~~~~~~~~~ 175 (424) .++|.+++++.... .|.+..|.+++|..|++.... + ...|... . +..+ T Consensus 152 rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v~-~--g~~i 228 (532) T protein:vir:94 152 QAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFYKPDSWIAT-S--GKKI 228 (532) T ss_pred hcccceEEEEEeccCCccccccccccccccccccceeeEEEeechheecccccccccccccccCCceeEEEc-c--Ceee Confidence 78899988875432 233468889999888754211 1 1122221 2 3468 Q ss_pred cHhHEEEeecCCC-------CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcC-CCCCCHHHHHHHHHHH Q lcl|NC_019705. 176 SQKEIFHLKGFGF-------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG-EKVLTEQQRSQVEENF 247 (424) Q Consensus 176 ~~~eiih~r~~~~-------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~-~~~~~~~~~~~~~~~~ 247 (424) .++.|+||.+... .+..|.|.++.+...+.....+......+...... .. +++. ...++.+..+.+.+++ T Consensus 229 H~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~-~v-~k~~~a~~ls~~~~~~~~~r~ 306 (532) T protein:vir:94 229 HSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSM-TN-LATDMAQLLAPGGAQSLDARL 306 (532) T ss_pred ccceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCC-ce-eeechHHhhcchhHHHHHHHH Confidence 8999999975432 24579999999999999988888888876655333 33 3332 2234445566777777 Q ss_pred HHHhCCcccCcceecCC-CceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHH-- Q lcl|NC_019705. 248 KEIAGGPVKKRLWILEA-GFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ-- 324 (424) Q Consensus 248 ~~~~~~~~ag~~~~l~~-g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~-- 324 (424) +......+..++++++. +.+|++++.+..++ .+.......+||++.|||..+|.+...+..+ ++.+....+|+. T Consensus 307 ~~~~~~~~n~g~~~id~~~e~~e~~~~~lsgl--~~~l~~~~~~iAaa~~IP~t~LfG~sp~Gln-stGe~D~~~yyd~I 383 (532) T protein:vir:94 307 QLFNLYRDNRNIGALDKGTEEIQQTNTPLSGL--DSLQAQSQEQMAAVSHIPLVKLLGITPNGLN-ASSDGEIRVWYDFI 383 (532) T ss_pred HHHHhhcCCccceEEcCCCceeEEEecccCCH--HHHHHHHHHHHHhHhCCCeeeeecCCccccc-ccchHHHHHHHHHH Confidence 76554444556788875 57899888777764 5788888999999999999987554433332 223445555555 Q ss_pred -----HHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHH-------HHHHHHHHhCCCCCHHHHHHHhCC Q lcl|NC_019705. 325 -----YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASR-------AAFMKAMGEAGLRTINEMRRTDNL 392 (424) Q Consensus 325 -----~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~-------~~~~~~~~~~g~~t~NE~R~~~g~ 392 (424) .-+.|.++.+.+.|.+..+.... ..+ .|.+++|...+.+++ ++++++++++|++++||+|+.++. T Consensus 384 ~s~Qe~~l~p~le~l~~~l~~s~~g~~~-~d~--~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~ 460 (532) T protein:vir:94 384 AGYQATNLTPLMEWIIDLIQLSEYGQID-PGL--AWEWSPLMELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAA 460 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCC-CCc--eEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhc Confidence 44788888888888766654332 123 344456777776665 455678999999999999999999 Q ss_pred CCCCCCCeeeeccccc----------------chhhccccCCCcccCC Q lcl|NC_019705. 393 PPLPGGDVAMRQSQYV----------------PITDLGTNKEPRNNGA 424 (424) Q Consensus 393 ~p~~ggd~~~~~~n~~----------------~~~~~~~~~~~~~~ga 424 (424) .|..+.+.....-+.. +.+.....+.+..+++ T Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (532) T protein:vir:94 461 DPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSE 508 (532) T ss_pred CCccccccccccccccccccchhhhhcccccCCCCCCCCCCCCCCCCC Confidence 8875543322211110 0000000000111111 No 109 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=99.86 E-value=2.2e-21 Score=133.83 Aligned_cols=395 Identities=10% Similarity=-0.020 Sum_probs=250.4 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceE Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) ++.|--++++.|+.|+.+.+.+++.+....+ ......- ..+..-.-+..++++.|.+|++.+...|.+++|. T Consensus 1 v~~~~l~~e~at~~~~~d~~~~~~~~l~~~~--~~il~~a------~~g~~~~y~~l~~D~~i~s~l~~rk~av~~~~w~ 72 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDITRPFISGLQVPN--DSILQRR------GGNDLRVYEEILSDAQVKTVWGQRQLAVVSREWK 72 (488) T ss_pred CCccchhHHHHHHHhhhhhhccccCCCCCCC--hHHHHhh------ccCCHHHHHHHhhChHHHHHHHHHHHHHhcCCce Confidence 8888889999999988887777765443222 1111100 0111112244577999999999999999999999 Q ss_pred EEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-C--ceeEEEEecCceeEEee Q lcl|NC_019705. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-G--DVISLLPLQSANMDVKL 157 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~-G--~~~~l~~l~~~~v~~~~ 157 (424) |...+. ..+.......+..+|. + ..+.++++.+. +.+++|.+..+++.... | .|..+.+.|+.++.+.. T Consensus 73 i~p~~~--~~~~~~~ae~v~~~l~-~----~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~ 144 (488) T protein:vir:99 73 VEAGGD--RPIDQAAAEHLEQQLQ-R----VGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRFRYDQ 144 (488) T ss_pred EEcCCC--ChHHHHHHHHHHHHHh-C----CCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccceeecC Confidence 964332 2221112223444443 3 35667777776 46889999999986543 3 46789999999888877 Q ss_pred cCceEEEEEEeCCceEEecHh-H-EEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCC Q lcl|NC_019705. 158 VGKKVVYRYQRDSEYAEFSQK-E-IFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL 235 (424) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~-e-iih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~ 235 (424) +++.....-.....+.+++.. . |+|....+...++|.+.+..+.-.........++...|...-|.|-.+.+++.... T Consensus 145 ~~~l~~~~~~~~~~g~~lp~~~~~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a 224 (488) T protein:vir:99 145 DGGLRLLTPNNMFEGEPCPAPYFWHFSTGADNDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTA 224 (488) T ss_pred CCceEEeccCCCCCccccccCceEEEEeecCCCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCC Confidence 665443322223345666543 3 33333344556899999999999999999999999999999999988888775555 Q ss_pred CHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccC-hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchh Q lcl|NC_019705. 236 TEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSG 314 (424) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n 314 (424) +++.++.+.+.+.+... + ..++++.|++++-+..+ .....|.+..++..++|+.+.-=. .+-. +.++.+++. T Consensus 225 ~~~ek~~l~~av~~~~~--~--~~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLGq-tlts--~~~~Gs~a~ 297 (488) T protein:vir:99 225 TPEDKAKLLAALHAIQT--D--SAIIMPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVGLGQ-VAST--QGTPGRLGN 297 (488) T ss_pred CHHHHHHHHHHHHHHhc--C--cEEEecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHHhhh-hhcc--cccccchhh Confidence 67777777766666543 2 35666777666554421 222347888888889998874111 1111 112223333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhh------cccchhhhhhhhhccCHHHHHHHHHHHHhC-CC-CCHHHH Q lcl|NC_019705. 315 IEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV------GRIHAEHNLDGLLRGDSASRAAFMKAMGEA-GL-RTINEM 386 (424) Q Consensus 315 ~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~------~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~-g~-~t~NE~ 386 (424) .+.........+.-.++.|+..||+.|+.+.-. ...++.|+. ....|.+.+++.+.++++. |+ ++..++ T Consensus 298 -~~vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~N~~~~~~p~~~~~~--~e~edl~~~a~~~~~l~~~~G~~i~~~~i 374 (488) T protein:vir:99 298 -DDLQADVRLDLVKADADLICESFNLGPARWLTEWNFPGAQPPRVYRVI--EEPEDITAKAERDEKVFRMSGFRPTRGYV 374 (488) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCCcCCceeEecC--CCcccHHHHHHHHHHHHhhcCCCCCHHHH Confidence 334445667888889999999999887754321 112344443 3457788899999999985 64 788899 Q ss_pred HHHhCCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 387 RRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 387 R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |+.+|+|+-..++....+..... .++...+.+... T Consensus 375 ~e~~Gip~~~~~~~~~~~~~~~~---~~~~~~~~~~~~ 409 (488) T protein:vir:99 375 QETYGVEVESTQAEATAPTPSTE---FAEGDQPSDPAA 409 (488) T ss_pred HHHcCCCCcccccccccCCCccc---CCCCCCCCCchH Confidence 99999998655555443321111 111111111111 No 110 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.83 E-value=2.3e-20 Score=128.28 Aligned_cols=401 Identities=11% Similarity=0.023 Sum_probs=212.8 Q ss_pred CCCCc---------------------------ccccCCC------CCch-HHHHHhhcc---CcccCCccccch-hhccc Q lcl|NC_019705. 1 MEEPK---------------------------YTIDLRT------NNGW-WARLQSWFV---GGRLVTPNQGSQ-TGPVS 42 (424) Q Consensus 1 ~~~~~---------------------------~~~~~~~------~~G~-~~~~~~~~~---~~~~~~~~~~~~-~~~~~ 42 (424) -|+|. .+|+-.. ..|+ .+.+.+... .....++..... ...+. T Consensus 50 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~ 129 (862) T protein:vir:99 50 KEKPNPIIRSVKDFPFVEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWY 129 (862) T ss_pred cccCCCCCCcccccccccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccccccchhccccc Confidence 11111 0111000 0011 011111110 000000000000 00000 Q ss_pred cccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHH Q lcl|NC_019705. 43 AHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTM 122 (424) Q Consensus 43 ~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~ 122 (424) ......+-.+ ...|.+++.++++|+++++.+.+-.+++.-.+++.... ......+...+. +- .-.+-+..++. T Consensus 130 ~~~~f~gyql-~alY~~~~larkiVd~pAeDatR~g~~I~~~~d~~e~~-~e~~~~ie~~~~-rL----~v~~~l~eair 202 (862) T protein:vir:99 130 LSQGFIGHQA-CALIAQHWLVDKACSLAGEDAIRNGWHLKSLGEGEEID-EESLEKFKAIDV-EF----KVKENLIEFNR 202 (862) T ss_pred cccCcccHHH-HHHHHhCchhhhhhhhhhHHHhhCCceEeecCcccccC-HHHHHHHHHHHH-Hh----hHHHHHHHHHH Confidence 0011111111 23567899999999999999999999886433221111 111112222222 11 12333444455 Q ss_pred HHHHcCCeEEEEee-CCC---------------CceeEEEEecCceeEEee------cCceE-E---EEEEeCCceEEec Q lcl|NC_019705. 123 QLCFYGNAYALVDR-NSA---------------GDVISLLPLQSANMDVKL------VGKKV-V---YRYQRDSEYAEFS 176 (424) Q Consensus 123 ~~ll~G~a~~~~~r-~~~---------------G~~~~l~~l~~~~v~~~~------~~~~~-~---~~~~~~~~~~~~~ 176 (424) ..-++|.+++++.. ..+ |.+..|..|+|..+.+.. |.... + -.|...+ ..|. T Consensus 203 ~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGkP~~y~I~g--~~IH 280 (862) T protein:vir:99 203 FKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYEPEFWIISG--QKYH 280 (862) T ss_pred hcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEechhhhcccccccccccccccccCCceeeeecC--eeec Confidence 55677877766532 112 345678888887765421 11110 0 1122332 4577 Q ss_pred HhHEEEeecCCC-------CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC--CCCHHHHHHHHHHH Q lcl|NC_019705. 177 QKEIFHLKGFGF-------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK--VLTEQQRSQVEENF 247 (424) Q Consensus 177 ~~eiih~r~~~~-------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~--~~~~~~~~~~~~~~ 247 (424) ++.|+||.+... +.+.|+|.++.+...|.....+......++.+.... +++++.. ..++ +.+.+.+ T Consensus 281 ~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~--v~ktd~l~~l~~e---d~l~~r~ 355 (862) T protein:vir:99 281 RSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTT--AIHTDTAKAIANE---DKFIQRL 355 (862) T ss_pred cceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccc--eeechhHhhhccH---HHHHHHH Confidence 899999875432 235699999999999999999999998888775543 2333211 1222 2344444 Q ss_pred HHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-CCcccchhHHHHHHHHHH-- Q lcl|NC_019705. 248 KEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE-KSTSWGSGIEQQNLGFLQ-- 324 (424) Q Consensus 248 ~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~-~~~~~~~n~e~~~~~~~~-- 324 (424) +......+..++++++.+.+|++++.+..++ .+.......+||++.+||..+|.+.. .|.. ++.+...++||. T Consensus 356 ~~~~~~rdN~Gi~liD~eEe~e~ls~slSGL--~dll~~~~q~IAaas~IP~tiLfGqspaGln--ATGE~D~~nYyD~I 431 (862) T protein:vir:99 356 MFWVRYRDNHAVKVLGTDETMEQFDTSLADF--DAVIMGQYQLVASIAKTPATKLLGTAPKGFN--STGEFETISYHEEL 431 (862) T ss_pred HHHHhccCcceeEEecCCCceeEEecccCCh--HHHHHHHHHHHHhhhCCCceeecccCccccc--CchHHHHHHHHHHH Confidence 4443333445689999999999998887765 47788888899999999999776544 3322 234555666666 Q ss_pred -----HHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHH-------HHHHHhCCCCCHHHHHHHh-- Q lcl|NC_019705. 325 -----YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAF-------MKAMGEAGLRTINEMRRTD-- 390 (424) Q Consensus 325 -----~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~-------~~~~~~~g~~t~NE~R~~~-- 390 (424) .-|.|.++.+...+..++..+. . +.|.++.|...+.+++++. +++++++|+++++|+|+.| T Consensus 432 ~s~QE~~L~P~LerL~~li~~~lg~~~---d--~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~ 506 (862) T protein:vir:99 432 ESIQEHVYMPFLQRHYLISRLSLGIQH---E--IDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRD 506 (862) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCC---c--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHh Confidence 4577999988877765553222 2 3344467877877776644 6789999999999999976 Q ss_pred ----CCCCCCCCCee----eecccccchhhccc---c--CCCcccCC Q lcl|NC_019705. 391 ----NLPPLPGGDVA----MRQSQYVPITDLGT---N--KEPRNNGA 424 (424) Q Consensus 391 ----g~~p~~ggd~~----~~~~n~~~~~~~~~---~--~~~~~~ga 424 (424) |++.++..|.. ..+.+...+...++ + .+....|+ T Consensus 507 ~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~~~ap~de~~aga 553 (862) T protein:vir:99 507 DKRSGYNRLTKEDAEETPGASPENLAAYQKAGAAQETASAKETQAGA 553 (862) T ss_pred cCCcCCCCCCcccccccCCCCcccccccccCCccccccccccccccc Confidence 44444332221 11222221111111 0 00011111 No 111 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=99.82 E-value=2.1e-19 Score=122.95 Aligned_cols=393 Identities=9% Similarity=-0.046 Sum_probs=230.6 Q ss_pred HHHHHhhccCcccCCccccch----h---hccc-----ccc----------ccCccccc----HHHH-hccHHHHHHHHH Q lcl|NC_019705. 17 WARLQSWFVGGRLVTPNQGSQ----T---GPVS-----AHG----------HLGDSSIN----DERI-LQISTVWRCVSL 69 (424) Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~----~---~~~~-----~~~----------~~~~~~vs----~~~~-~~~~~v~~~i~~ 69 (424) +.+|..+++++.......... . ..+. +.+ ...|.... .+.+ .+++.|.+|++. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~D~~i~s~l~~ 80 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEERDAHLFAEMSK 80 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 223333332221111000000 0 0000 000 00000000 1112 268999999999 Q ss_pred HHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC---CceeEEE Q lcl|NC_019705. 70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA---GDVISLL 146 (424) Q Consensus 70 ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~---G~~~~l~ 146 (424) +...|.+++|.|.....+.. .+.....-+...|..-|+ +.+++.. +.+.+++|.+..+++.... ..|..+. T Consensus 81 Rk~av~~~~w~I~p~~~~~~-~~~~~a~~v~~~l~~~~~----f~~~i~~-~lda~~~G~s~~Ei~w~~~~g~~~~~~~~ 154 (528) T protein:vir:10 81 RKRAVLGLDWTIEPPRNASA-AEKADAEYLHELLLDLEG----IEDLMLD-CMDGVGHGYSAIELDWSLQGREWLPQAFD 154 (528) T ss_pred HHHHHhcCCceEecCCCCCH-HHHHHHHHHHHHHhCCcc----HHHHHHH-HHhhhhhcceeEEEEEeecCCceeEEEee Confidence 99999999999865433221 111112234444543221 3333333 3446779999999875443 3577899 Q ss_pred EecCceeEEeecCceEEEEEEeCCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCc Q lcl|NC_019705. 147 PLQSANMDVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSP 225 (424) Q Consensus 147 ~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~ 225 (424) ++++.++.+..+++.....-.....+..+++...++.++.. ...++|.+.+..+.-.........++...|...-|.|- T Consensus 155 ~r~~~~f~~~~~~~~~l~~~~~~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~ 234 (528) T protein:vir:10 155 HRPQSWFQLNPDDQDELRLRDNSIAGEVLQPFGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPI 234 (528) T ss_pred eecccceeeccCCCcEEeccCCCCCceeecCCCeEEEeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCe Confidence 99999888877765443222223356788888866666554 44589999999999999999999999999999999998 Q ss_pred eeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccC-hhHHHHHHHHHHHHHHHHHHhCCCHHHhCC Q lcl|NC_019705. 226 QILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGD 304 (424) Q Consensus 226 ~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~ 304 (424) -+.+++.+. +++.++.+.+.+.+..+ ++ .++++.|++++-+..+ ..-..|.+..++..++|+.+.-= ..+-.. T Consensus 235 ~igky~~~a-~~~ek~~L~~al~~i~~--~~--~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLG-qtlTs~ 308 (528) T protein:vir:10 235 RLGKYPPGT-PDEEKVTLLRAVTGLGH--AA--AGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAILG-GTLTSQ 308 (528) T ss_pred EEEecCCCC-CHHHHHHHHHHHHHHhh--Cc--EEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHHhh-hhhhcc Confidence 888888664 56666666666666543 23 5666677665554432 22234778888888988886511 122111 Q ss_pred -CCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhc----------ccchhhhhhhhhccCHHHHHHHHH Q lcl|NC_019705. 305 -VEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG----------RIHAEHNLDGLLRGDSASRAAFMK 373 (424) Q Consensus 305 -~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~----------~~~~~fd~~~l~~~d~~~~~~~~~ 373 (424) .++++++++- .+.........+.-.++.|+..||+.|+.+.-.. ..++.|+. -...|.+.+++.+. T Consensus 309 ~~~g~~gS~Al-g~vh~~v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~--~e~eDl~~~a~~~~ 385 (528) T protein:vir:10 309 TSESGGGAYAL-GQVHNEVRHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDL--KDRADLAAMATSLP 385 (528) T ss_pred ccccccchhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecC--CCcccHHHHHHHHH Confidence 1111222222 2234455677778888999999998875443111 12344444 34577788999999 Q ss_pred HHHhCCC-CCHHHHHHHhCCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 374 AMGEAGL-RTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 374 ~~~~~g~-~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) +++..|+ ++..++|+.+|+|.-..++.+..+....+.........++.... T Consensus 386 ~L~~~G~~i~~~~i~e~~gip~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (528) T protein:vir:10 386 PLVKLGVQVPVNWVQEQLGIPLPANGEAVLGDQAGAGIAQLSRRPGPRIAAL 437 (528) T ss_pred HHHhCCCCCCHHHHHHHhCCCCCCCCcccccCCCcccccccCcccccccccc Confidence 9999998 99999999999987666776665433332221111111111100 No 112 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.82 E-value=2.4e-20 Score=128.11 Aligned_cols=397 Identities=11% Similarity=0.048 Sum_probs=211.6 Q ss_pred CCCC----------------------------------cccccCCCCCchHHHHHhhccCcc-cCCccccchhhcccccc Q lcl|NC_019705. 1 MEEP----------------------------------KYTIDLRTNNGWWARLQSWFVGGR-LVTPNQGSQTGPVSAHG 45 (424) Q Consensus 1 ~~~~----------------------------------~~~~~~~~~~G~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 45 (424) -.+| +..|+=.-..|....+.+...+.. ...+. ....+.... T Consensus 30 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~ 106 (765) T protein:vir:96 30 QHDPLDPMIKLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPT---MLQDWYNSQ 106 (765) T ss_pred CCCCcccchhHHHHhhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchhh---HHHhhhccc Confidence 1111 112221111122222221111100 00000 000000001 Q ss_pred ccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHH Q lcl|NC_019705. 46 HLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLC 125 (424) Q Consensus 46 ~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~l 125 (424) ...+..+ ...|.+++.++.+|+++++.+.+-++++.-.+.+..... ...+...+. + ..-.+-+..++.+.- T Consensus 107 ~f~gyql-~alY~~~~l~rkiVd~pAeDa~R~g~~I~~~~~e~~~~~---~~~l~~~~~-r----l~v~~~l~ea~~~~R 177 (765) T protein:vir:96 107 GFIGYQA-CAIISQHWLVDKACSMSGEDAARNGWELKSDGRKLSDEQ---SALIARRDM-E----FRVKDNLVELNRFKN 177 (765) T ss_pred CCccHHH-HHHHHhCchhhhhhhcchHHhhcCCceeecCccccCHHH---HHHHHHHHH-H----hhHHHHHHHHHHHhh Confidence 1111111 234678899999999999999998888743221111100 011222221 1 123455566677778 Q ss_pred HcCCeEEEEeeC-CC---------------CceeEEEEecCceeEEee----cCc--e-EEE---EEEeCCceEEecHhH Q lcl|NC_019705. 126 FYGNAYALVDRN-SA---------------GDVISLLPLQSANMDVKL----VGK--K-VVY---RYQRDSEYAEFSQKE 179 (424) Q Consensus 126 l~G~a~~~~~r~-~~---------------G~~~~l~~l~~~~v~~~~----~~~--~-~~~---~~~~~~~~~~~~~~e 179 (424) ++|.+|+++.-+ .+ |....|..++|..+.... ..+ . .++ .|...+ ..|.++. T Consensus 178 lyGga~i~i~i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~i~g--~~IH~SR 255 (765) T protein:vir:96 178 VFGVRIALFVVESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQLTAESTADPSAEHFYEPDFWIISG--KKYHRSH 255 (765) T ss_pred hceeeEEEEEecccCcchhhccccccccccceeeEEEEechhhcccccchhccccccccccCcceeeeecC--ceeccce Confidence 889888876432 11 234567777776655321 111 0 011 122332 3577889 Q ss_pred EEEeecCCC-------CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC--CCCHHHHHHHHHHHHHH Q lcl|NC_019705. 180 IFHLKGFGF-------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK--VLTEQQRSQVEENFKEI 250 (424) Q Consensus 180 iih~r~~~~-------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~--~~~~~~~~~~~~~~~~~ 250 (424) ||||....+ .+.+|.|.++.+...|.....+......++.+.... +++++.. ..++ +.+.+.++.. T Consensus 256 li~~~g~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~--v~k~~~~~~l~~~---~~l~~r~~~~ 330 (765) T protein:vir:96 256 LVVVRGPQPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTS--TIHVDVEKAIANE---DAFNARLAFW 330 (765) T ss_pred EEEecCCCchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccc--eeeechHhhhccH---HHHHHHHHHH Confidence 999975432 345799999999999999999998888888776543 3333211 1222 2344445444 Q ss_pred hCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHH------ Q lcl|NC_019705. 251 AGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ------ 324 (424) Q Consensus 251 ~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~------ 324 (424) ....+..++++++.+.+|++++.+..++ .+.......+||.+.+||..+|.+...+..+ ++.|...++||. T Consensus 331 ~~~r~n~g~~~id~ee~~e~~s~~lsgl--~d~l~~~~~~iAaas~IP~t~LfGqsp~Gln-ATGe~D~~nYyD~I~s~Q 407 (765) T protein:vir:96 331 IANRDNHGVKVIGIDETMEQFDTNLSDF--DSVIMNQYQLVAAIAKTPATKLLGTSPKGFN-ATGEHETISYHEELESIQ 407 (765) T ss_pred HHhcCCceeEEecCCcceeEEecccCCH--HHHHHHHHHHHHhhhCCCeeeeccCCccccc-CcchHHHHHHHHHHHHHH Confidence 4444445689999999999999887765 5788888999999999999888665422221 233555666666 Q ss_pred -HHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHH-------HHHHHHhCCCCCHHHHHHHhCCCCC- Q lcl|NC_019705. 325 -YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAA-------FMKAMGEAGLRTINEMRRTDNLPPL- 395 (424) Q Consensus 325 -~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~-------~~~~~~~~g~~t~NE~R~~~g~~p~- 395 (424) .-+.|.++.+-+.|-+.- ... ..+.+.++.|...+.+++++ ++++++++|++++||+|+.++.++. T Consensus 408 e~~l~p~le~L~~li~~s~----~i~-~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~ 482 (765) T protein:vir:96 408 EHIFDPLLERHYLLLAKSE----SID-VQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRS 482 (765) T ss_pred HHHHHHHHHHHHHHHHHhc----CCC-CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccC Confidence 456677666655554331 111 12445556887777776654 5788999999999999999866543 Q ss_pred -----CCCCe----eeecccccchhhcccc-----CCCcccCC Q lcl|NC_019705. 396 -----PGGDV----AMRQSQYVPITDLGTN-----KEPRNNGA 424 (424) Q Consensus 396 -----~ggd~----~~~~~n~~~~~~~~~~-----~~~~~~ga 424 (424) +..+. ...|.+...++..+.+ .++..+.+ T Consensus 483 g~~~l~d~~~e~~~~~~pe~~~~~~~~~~~~~~~~~e~~~~~a 525 (765) T protein:vir:96 483 GYNRLTDDQAETEPGMSPENLAELEKAGAQSAKAKGEAERAEA 525 (765) T ss_pred CCCCCCccccccccCCCccccccccCCCcccccccCccccccC Confidence 22111 1111121111111110 00000000 No 113 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=99.81 E-value=5.1e-19 Score=120.88 Aligned_cols=406 Identities=8% Similarity=-0.030 Sum_probs=233.2 Q ss_pred CCCCcccccCCC-CC-chHHHHHhhccCcccCCccccchhhcccccccc-CcccccHHHHhccHHHHHHHHHHHHhhccC Q lcl|NC_019705. 1 MEEPKYTIDLRT-NN-GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHL-GDSSINDERILQISTVWRCVSLISTLTACL 77 (424) Q Consensus 1 ~~~~~~~~~~~~-~~-G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~ 77 (424) ...+.- ++.+| +. |+.+.+......+ .+|.+....-.-...+.. ....+-.+-..+++.|.+|++.+...|.++ T Consensus 12 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~g--ltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s~l~~Rk~av~~~ 88 (526) T protein:vir:99 12 IRTQQL-REPQTSRLAGLAKEFAQHPAKG--LTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSKRKRAILGL 88 (526) T ss_pred cccccc-cchhhhhhhhhhhhhcccCcCC--CCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhCC Confidence 000000 01111 11 2222211111100 011110000000000000 000011111125899999999999999999 Q ss_pred ceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-C--ceeEEEEecCceeE Q lcl|NC_019705. 78 PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-G--DVISLLPLQSANMD 154 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~-G--~~~~l~~l~~~~v~ 154 (424) +|.|.....+.. ........+...|+..| ...+++..+. +.+++|-+..+++.... | .|..+.+.++.++. T Consensus 89 ~w~I~p~~~~~~-~~~~~a~~v~~~l~~~~----~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~f~ 162 (526) T protein:vir:99 89 DWAVEPPRNASA-AEKADADYLHELLLDLE----GLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAFHHRPQSWFQ 162 (526) T ss_pred CceEecCCCCCH-HHHHHHHHHHHHHhccc----CHHHHHHHHH-HhhhhcceeEEEEEeecCCceeEEEeeeeccccee Confidence 999865433221 11122233455554433 2456666665 47789999999975443 3 57789999999888 Q ss_pred EeecCceEEEEEEeCCceEEecHhHEEEeecC-CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC Q lcl|NC_019705. 155 VKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK 233 (424) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~ 233 (424) +..+++...........+.++++...+..++. ....++|.+.+..+.-.........++...|...-|.|--+.+++.. T Consensus 163 ~~~~~~~~l~~~~~~~~g~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~ 242 (526) T protein:vir:99 163 LNPEDQNELRLRDNSPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPG 242 (526) T ss_pred eccCCCcEEEecCCCCCceeecCCCeEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCC Confidence 87777654433333455677888876554544 45568999999999999999999999999999999999888888766 Q ss_pred CCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccC-hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCCccc Q lcl|NC_019705. 234 VLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGDV-EKSTSW 311 (424) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~-~~~~~~ 311 (424) . ++++++.+.+.+.+..+ + ..++++.|++++-+..+ .....|.+..++..++|+.++ +-..+-... ++++.+ T Consensus 243 a-~~~ek~~L~~av~~i~~--d--~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LGqtlTs~~~~g~~gS 316 (526) T protein:vir:99 243 T-ADEEKATLLRAVTGLGH--A--AAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-LGGTLTSTTSQSGGGA 316 (526) T ss_pred C-CHHHHHHHHHHHHHHhh--C--cEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-hhhhhccccccCcchh Confidence 4 56666666666666543 2 35677777666555432 222347788888899998874 111111111 111223 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhc----------ccchhhhhhhhhccCHHHHHHHHHHHHhCCC- Q lcl|NC_019705. 312 GSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG----------RIHAEHNLDGLLRGDSASRAAFMKAMGEAGL- 380 (424) Q Consensus 312 ~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~----------~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~- 380 (424) ++.. +........-+.-.++.|+..||+.|+.+.-.- ..++.|+. -...|.+.+++.+.++++.|+ T Consensus 317 ~a~g-~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~--~e~eDl~~~a~~~~~L~~~G~~ 393 (526) T protein:vir:99 317 FALG-QVHNEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDL--REQADITSMAQSIPALVNVGLE 393 (526) T ss_pred hhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCC--CCcccHHHHHHHHHHHHhCCCc Confidence 2222 233455667777888899999988876443111 12334443 345778889999999999997 Q ss_pred CCHHHHHHHhCCCCCCCCCeeeecccccchhhccccCCC----cccC-------C Q lcl|NC_019705. 381 RTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEP----RNNG-------A 424 (424) Q Consensus 381 ~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~----~~~g-------a 424 (424) ++..++|+.+|+|.-..++..+.+....+.......... ...+ + T Consensus 394 i~~~~i~e~~Gip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (526) T protein:vir:99 394 IPSAWVYDKLGIPQPAKNEPVLRSAAQPAILSRQHGQRVAALATIVGPRYGDQQA 448 (526) T ss_pred cCHHHHHHHhCCCCCCCcccccCCCCCCcccccccccccccccccccccCcchhh Confidence 899999999999876666665544322111110000000 0000 0 No 114 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=99.81 E-value=7e-19 Score=120.11 Aligned_cols=398 Identities=13% Similarity=0.015 Sum_probs=236.0 Q ss_pred CCC----Ccc--cccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHh-ccHHHHHHHHHHHHh Q lcl|NC_019705. 1 MEE----PKY--TIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERIL-QISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~----~~~--~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~-~~~~v~~~i~~ia~~ 73 (424) |-| |.+ .++---.+|.-. ..+.+.. .......-+.....+- +..+ +++.|.+|++.+... T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~~-~~~~~~~-----------~e~~~~lr~~~~~~ly-~~m~e~D~~i~s~l~~rk~a 67 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVVD-GWTVWDP-----------FEQTPELQWPQSVAVY-SRMDNEDSRVTSLLEAISLP 67 (469) T ss_pred CCCcccCCCCccchhhhhhccccc-chhhccc-----------cccccccccccchHHH-HHHHhhChHHHHHHHHHHHH Confidence 211 111 111100011100 0000000 0000000000111122 2333 589999999999999 Q ss_pred hccCceEEEEeccCCccceeccchHHHHHhhcCC------------CCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC--- Q lcl|NC_019705. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSP------------NQYMTAQEFREAMTMQLCFYGNAYALVDRNS--- 138 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~p------------n~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~--- 138 (424) |.+++|+|...+.+.. . ...+...|.... +-..++.+++..++.+.+.+|-+..+++... T Consensus 68 v~~~~w~v~p~~~~~e---~--~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~ 142 (469) T protein:vir:10 68 IRSTPWRIRANGASDE---V--TEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQ 142 (469) T ss_pred HhcCCceEecCCCCHH---H--HHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccc Confidence 9999999965433221 1 122233332111 1123688888888888899999999998643 Q ss_pred --CC--ceeEEEEecCceeE---EeecCceEEEEE------------EeCCceEEecHhHEEEeecCC-CCCcccCchHH Q lcl|NC_019705. 139 --AG--DVISLLPLQSANMD---VKLVGKKVVYRY------------QRDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIA 198 (424) Q Consensus 139 --~G--~~~~l~~l~~~~v~---~~~~~~~~~~~~------------~~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~ 198 (424) +| .+..|.+.|+.++. +..+++...+.. ..+....++++...++.++.. ...++|.+.+. T Consensus 143 ~~dG~~~~~~l~~rp~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~~~g~p~g~gLlr 222 (469) T protein:vir:10 143 SPDGRFWLRKLAPRPQWTISKFNVAPDGGLESIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNKRPGQWQGKSILR 222 (469) T ss_pred cCCCceeeeeeeecCcccceeeeeccCCceeeeeecCcccccccccccCCCCccccccCcEEEEEecCCCCCcccchhHH Confidence 24 36677778876552 333444333321 123345678888877776654 45589999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHH Q lcl|NC_019705. 199 FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDA 278 (424) Q Consensus 199 ~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~ 278 (424) .++-.........++...|...-+.|--+.+++.+ .++++++.+.+...+...+.++ .++++.|++++-+..+.... T Consensus 223 ~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~-a~~~ek~~l~~a~~~~~~g~~a--~~iip~~~~ie~~ea~g~~~ 299 (469) T protein:vir:10 223 SAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSA-TDEDEVRKMAALARSVRGGINA--GVGLAQGQILELLGVSGNLP 299 (469) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCC-CCHHHHHHHHHHHHHHhcCCce--EEEccCCceEEEeecCCCch Confidence 99999999999999999999999988888888765 4666777788888777655554 46678888887777665556 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhh-----cccch Q lcl|NC_019705. 279 EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV-----GRIHA 353 (424) Q Consensus 279 ~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~-----~~~~~ 353 (424) .|.+..++..++|+.+.--. .+-....++ ++ ...+.........+.-.++.|+..||+.|+.+.-. ...+. T Consensus 300 ~~~~li~~~d~~Isk~iLG~-tlTs~~~gG--S~-a~~~vh~ev~~d~~~sDa~~i~~tln~~li~~l~~lN~g~~~~~P 375 (469) T protein:vir:10 300 DIRRAIEGHDRSIALSGLAH-FLNLDGKGG--SY-ALASVLEDPFTQAVHAYATSICRIANQHIIEDLVDINFGVDTPAP 375 (469) T ss_pred HHHHHHHHHHHHHHHHHhcc-cccccCccc--hh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcc Confidence 78889999999998876332 111111122 22 23445566777788889999999999888764211 11112 Q ss_pred hhhhhhhhccCHHHHHHHHHHHHhCCCC-----CHHHHHHHhCCCCCCCCCeeeecc--cccchhhc-cccCCCcccC-- Q lcl|NC_019705. 354 EHNLDGLLRGDSASRAAFMKAMGEAGLR-----TINEMRRTDNLPPLPGGDVAMRQS--QYVPITDL-GTNKEPRNNG-- 423 (424) Q Consensus 354 ~fd~~~l~~~d~~~~~~~~~~~~~~g~~-----t~NE~R~~~g~~p~~ggd~~~~~~--n~~~~~~~-~~~~~~~~~g-- 423 (424) +|.++... .+.+..++.++++++.|++ +.+.+|+.+|+|+-+.++....+. +..|.... +....+.+.+ T Consensus 376 ~~~~~~~e-~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (469) T protein:vir:10 376 VLTFDPIG-SRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNLPSELNDTPSAEPEEPAAVPNQSAAPARTRSSGNADA 454 (469) T ss_pred EEEecCCC-CcHHHHHHHHHHHHhcCCccCccccHHHHHHHhCCCCCCCCcccccchhcccCCCCCccccccCCCCCccc Confidence 33333333 4567788999999999984 567899999999766665544332 11111110 0100000000 Q ss_pred C Q lcl|NC_019705. 424 A 424 (424) Q Consensus 424 a 424 (424) + T Consensus 455 ~ 455 (469) T protein:vir:10 455 R 455 (469) T ss_pred c Confidence 0 No 115 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.79 E-value=2.9e-19 Score=122.20 Aligned_cols=374 Identities=9% Similarity=0.040 Sum_probs=213.9 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccC Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~ 87 (424) |+..+.-|.-+ +++. ........ ......+-.+ ...|.+++.++++|+++|+.+.+..+++.-. ++ T Consensus 1 ~~~~~~d~~~~-~~~~---~~~~~~~~--------~~~~~~~~~l-~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~-~~ 66 (427) T protein:vir:10 1 MKIVKHDGYND-IFNG---GADGSPKP--------FFMSDASYHV-GSFYNDNATAKRIVDVIPEEMVTAGFKMSGV-KD 66 (427) T ss_pred CCccccchHHH-Hhhc---CCCCcccC--------ccccCchHHH-HHHHHcCchhhhhhccchHHhhcCCccccCc-cH Confidence 88888888865 3322 11111100 0011111111 2446778999999999999999999887421 11 Q ss_pred CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eC---------CCCceeEEEEecCceeEEee Q lcl|NC_019705. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RN---------SAGDVISLLPLQSANMDVKL 157 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~-r~---------~~G~~~~l~~l~~~~v~~~~ 157 (424) + ..+...+. +- .-.+-+..++....++|.+++++. ++ ..|.+..|.+++++.|++.. T Consensus 67 ----~----~~~~~~~~-~l----~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~ 133 (427) T protein:vir:10 67 ----E----KEFKSLWD-SY----KLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEK 133 (427) T ss_pred ----H----HHHHHHHH-Hh----hHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccc Confidence 1 11222221 11 123455666777778899998874 32 34678899999998886532 Q ss_pred c---------CceEEEEEEeCC--ceEEecHhHEEEeecCC-------CCCcccCchHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 158 V---------GKKVVYRYQRDS--EYAEFSQKEIFHLKGFG-------FTGLVGLSPIA-FACKSAGVAVAMEDQQRDFF 218 (424) Q Consensus 158 ~---------~~~~~~~~~~~~--~~~~~~~~eiih~r~~~-------~~~~~G~s~i~-~~~~~i~~~~~~~~~~~~~~ 218 (424) . +....|.+..+. ....+.++.|+||.+.. .++.+|.|++. .+...+.....+.......+ T Consensus 134 ~~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~ 213 (427) T protein:vir:10 134 RVTNARSPRYGEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDYDYCESLATQIL 213 (427) T ss_pred cccCccccccCcceEEEEecCCCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHH Confidence 2 123345554332 23678999999997543 24568999986 56788888888888877777 Q ss_pred hcCCCCceeEEcCC---CCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 219 ANGAKSPQILSTGE---KVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARF 294 (424) Q Consensus 219 ~ng~~~~~vl~~~~---~~~~~~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~ 294 (424) ...... +++++. ...+......+.++++..... .+.+.+++...+.++++++.+...+ .+.......+||++ T Consensus 214 ~k~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl--~~~~~~~~~~iaaa 289 (427) T protein:vir:10 214 RRKQQA--VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNSDISGV--PEFLSSKMDRIVSL 289 (427) T ss_pred HHhccc--cccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCCceeEEecccCCh--HHHHHHHHHHHHhh Confidence 664432 233321 111222222333334332222 2334456666678899988877765 57888999999999 Q ss_pred hCCCHHHhCCCCCCcccchhHHHHHHHHHH-------HHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHH Q lcl|NC_019705. 295 FGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSAS 367 (424) Q Consensus 295 fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~-------~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~ 367 (424) .+||..+|.+...+..+ ++.+....+|+. .-+.|.++.+-+.+-. . ..+.++| ++|...+.++ T Consensus 290 ~~IP~t~L~G~sp~Gln-stgd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~~----s---~~~~~~f--~pL~~~s~kE 359 (427) T protein:vir:10 290 SGIHEIIIKNKNVGGVS-ASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVD----E---EEWSIEF--EPLSVPSKKE 359 (427) T ss_pred hCCCeeeeccCCccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----C---CCcEEEe--CCCCCCCHHH Confidence 99999988665554443 233455556665 3456666555443321 1 1233444 4666666555 Q ss_pred H-------HHHHHHHHhCCCCCHHHHHHHh----CCCCCCCCCeeee--cccccchhhccccCCCcccC Q lcl|NC_019705. 368 R-------AAFMKAMGEAGLRTINEMRRTD----NLPPLPGGDVAMR--QSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 368 ~-------~~~~~~~~~~g~~t~NE~R~~~----g~~p~~ggd~~~~--~~n~~~~~~~~~~~~~~~~g 423 (424) + ++++++++++|+++++|+|+.| +...+.+++..-. +......+ .+..++..+.+ T Consensus 360 kaei~~~~a~a~~~~~~~gvi~~~e~r~~L~~~~~~~~~~~~~~~~~e~~~~~~e~~-p~~~e~~~d~~ 427 (427) T protein:vir:10 360 ESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPE-PGLGEKLEDEN 427 (427) T ss_pred HHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhhhccccCCCCccccccccchhcCCC-CCCCCCCCCCC Confidence 4 5677789999999999999876 3444443332211 00110000 11112212222 No 116 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=99.78 E-value=2.5e-18 Score=117.07 Aligned_cols=406 Identities=8% Similarity=-0.029 Sum_probs=234.9 Q ss_pred CCCCcccccCCCCC--chHHHHHhhccCcccCCccccchhhccccccccC-cccccHHHHhccHHHHHHHHHHHHhhccC Q lcl|NC_019705. 1 MEEPKYTIDLRTNN--GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLG-DSSINDERILQISTVWRCVSLISTLTACL 77 (424) Q Consensus 1 ~~~~~~~~~~~~~~--G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~vs~~~~~~~~~v~~~i~~ia~~ia~~ 77 (424) +. +.--++.+|-+ |+.+++......+ .+|.+....-.-...+... ...+-.+-..+++.|.+|++.+...|.++ T Consensus 12 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~g--ltp~~l~~il~~a~~gd~~~~~~L~edm~e~D~~i~s~l~~Rk~av~~~ 88 (526) T protein:vir:79 12 IR-PQQLREPQTSRLAGLAKEFAQHPAKG--LTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSKRKRAILGL 88 (526) T ss_pred cC-ccccchhhhhhhhhhhhhcccCCCCC--cCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhCC Confidence 10 00012223211 2222221111110 0111110000000000000 00111111126899999999999999999 Q ss_pred ceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-C--ceeEEEEecCceeE Q lcl|NC_019705. 78 PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-G--DVISLLPLQSANMD 154 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~-G--~~~~l~~l~~~~v~ 154 (424) +|.|.....+.. .+......+..+|...| ...+++..+.. .+++|-+..+++.... | .+..+.+.++.++. T Consensus 89 ~w~I~p~~~~~~-~~~~~a~~v~~~l~~~~----~~~~~i~~~ld-A~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~F~ 162 (526) T protein:vir:79 89 DWAVEPPRNASA-AEKADADYLHELLLDLE----GLEDLLLDALD-GIGHGYSCIELEWALQGREWMPLAFHHRPQSWFQ 162 (526) T ss_pred CceEecCCCCCh-HHHHHHHHHHHHHhccc----CHHHHHHHHHh-hhhhcceeEEEEEeecCCceeEEEeeeecccceE Confidence 999865433221 11112223455554333 24555655555 6789999999976543 3 47789999999888 Q ss_pred EeecCceEEEEEEeCCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC Q lcl|NC_019705. 155 VKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK 233 (424) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~ 233 (424) +..+++...........+.++++...++.++.. ...++|.+.+..+.-.........++...|...-|.|--+.+++.. T Consensus 163 ~~~~~~~~l~~~~~~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~~ 242 (526) T protein:vir:79 163 LNPEDQNELRLRDNSPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPG 242 (526) T ss_pred eccCCCcEEEecCCCCCceeecCCceEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCC Confidence 877766544322334556788888766556554 4558999999999999999999999999999999999888888766 Q ss_pred CCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccc-ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCCccc Q lcl|NC_019705. 234 VLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGV-TPQDAEMMASRKFQVSELARFFGVPPHLVGDV-EKSTSW 311 (424) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~-~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~-~~~~~~ 311 (424) . ++++++.+.+.+.+..+ + ..++++.|++++-+.. +.....|.+..++..++|+.+. +-..+-... ++++.+ T Consensus 243 a-~~~ek~~L~~av~~i~~--d--a~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LGqtlTs~~~~g~~gS 316 (526) T protein:vir:79 243 T-ADEEKATLLRAVTGLGH--A--AAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-LGGTLTSTTSQSGGGA 316 (526) T ss_pred C-CHHHHHHHHHHHHHHhc--C--cEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-hhhhhccccccCcchh Confidence 4 55666666666666643 2 3577777776655543 2233347888889999998874 111111111 111223 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhc----------ccchhhhhhhhhccCHHHHHHHHHHHHhCCC- Q lcl|NC_019705. 312 GSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG----------RIHAEHNLDGLLRGDSASRAAFMKAMGEAGL- 380 (424) Q Consensus 312 ~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~----------~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~- 380 (424) ++.. +........-+.-.++.|+..||+.|+.+.-.. ..++.|+. -...|.+.+++.+.++++.|+ T Consensus 317 ~a~g-~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~--~e~eDl~~~a~~~~~L~~~G~~ 393 (526) T protein:vir:79 317 FALG-QVHNEVRHDILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDL--REQADITSMAQSIPALVNVGLE 393 (526) T ss_pred hhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCC--CCcccHHHHHHHHHHHHhCCCc Confidence 3222 334455677778899999999998887543211 12344443 356778889999999999997 Q ss_pred CCHHHHHHHhCCCCCCCCCeeeecccccchhhccccC----CCcccCC Q lcl|NC_019705. 381 RTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNK----EPRNNGA 424 (424) Q Consensus 381 ~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~----~~~~~ga 424 (424) ++..++|+.+|+|....++..+.|............. .....++ T Consensus 394 i~~~~i~e~~gip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (526) T protein:vir:79 394 IPSAWVYDKLGIPQPAKNEPVLRPAAQPAILSRQHGQRVAALATIVGP 441 (526) T ss_pred CCHHHHHHHhCCCCCCCchhhccccCCccccccccccccccccccccc Confidence 8999999999997655555554432211100000000 0000000 No 117 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.78 E-value=2.8e-19 Score=122.28 Aligned_cols=378 Identities=9% Similarity=0.033 Sum_probs=203.0 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceE Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |.+-| +..++.-|+-+. |.+. .+.....+.......+.. -...|.+++.++++|+++|+.+.+..++ T Consensus 5 m~~~~--~~~~~~D~~~~~----~~~~------~g~~~~~~~~~~~~~~~~-l~~~Y~~~~l~~~~Vd~~aed~~r~g~~ 71 (435) T protein:vir:79 5 MSDKV--KAITKEDGYNEI----FGSK------DGTFRPNAFYMQRAAFKA-LSQFYEEDGMARRIVDVIPEEMVTPGFK 71 (435) T ss_pred ccccc--ccchhhcchhhh----hccc------ccccccCcccCCcCCHHH-HHHHHhcCchhhhhhccchHHhhcCCce Confidence 22211 111122222111 1000 000000000111111111 1234567899999999999999999988 Q ss_pred EEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCC---------CCceeEEEEecC Q lcl|NC_019705. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNS---------AGDVISLLPLQS 150 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~-r~~---------~G~~~~l~~l~~ 150 (424) +.- .++ + ..+...+. +- ...+-+..++....++|.|++++. ++. .|.+..+.++++ T Consensus 72 i~g-~~~---~-----~~~~~~~~-~l----~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~ 137 (435) T protein:vir:79 72 VDG-VKN---E-----KSFKSRWD-EL----RLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDR 137 (435) T ss_pred ecC-CCh---H-----HHHHHHHH-Hh----hHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeech Confidence 632 111 1 11222221 11 123455566666678899888775 332 244568888888 Q ss_pred ceeEEee---c------CceEEEEEEeCC--ceEEecHhHEEEeecCC-------CCCcccCchH-HHHHHHHHHHHHHH Q lcl|NC_019705. 151 ANMDVKL---V------GKKVVYRYQRDS--EYAEFSQKEIFHLKGFG-------FTGLVGLSPI-AFACKSAGVAVAME 211 (424) Q Consensus 151 ~~v~~~~---~------~~~~~~~~~~~~--~~~~~~~~eiih~r~~~-------~~~~~G~s~i-~~~~~~i~~~~~~~ 211 (424) ..|++.. | +....|.+...+ .+..+.++.|+||.+.. .+...|.|++ +.+...+.....+. T Consensus 138 ~~i~~~~~~~dp~sp~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~ 217 (435) T protein:vir:79 138 YQITIHERETNARSVRYGEPKLYKISPGGDIPEFFVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQ 217 (435) T ss_pred hhccchhhccCCcccccCcceEEEEecCCCCCceEEcceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHH Confidence 8876432 1 122345554333 35679999999997532 2356799998 57889899888888 Q ss_pred HHHHHHHhcCCCCceeEEcCC---CCCCHHHHHHHHHHHHHHhCCcc-cCcceecCCCceeeecccChhHHHHHHHHHHH Q lcl|NC_019705. 212 DQQRDFFANGAKSPQILSTGE---KVLTEQQRSQVEENFKEIAGGPV-KKRLWILEAGFSTSAIGVTPQDAEMMASRKFQ 287 (424) Q Consensus 212 ~~~~~~~~ng~~~~~vl~~~~---~~~~~~~~~~~~~~~~~~~~~~~-ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~ 287 (424) ......+...... +++++. ...++.....+..++.......+ .+.+++..++.++++++.+..++ .+..... T Consensus 218 ~~~~~l~~~~~~~--v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~e~~e~~~~~lsgl--~~~~~~~ 293 (435) T protein:vir:79 218 ELATQLLRRKQQA--VWKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDATDEEYEVLNSDVSGV--PEFLQEK 293 (435) T ss_pred HHHHHHHHHhcCc--cccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEecCCcceEEEecccCCH--HHHHHHH Confidence 8888876554432 233321 11223333344444443333222 33455555556899888877765 5888999 Q ss_pred HHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHH-------HHHHHHHHHHHHHHhhccChhhhcccchhhhhhhh Q lcl|NC_019705. 288 VSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY-------TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGL 360 (424) Q Consensus 288 ~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~-------tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l 360 (424) ..+||++.|||..+|.+...+..+ ++.+....+|+.. -+.|.+..+-+.+ ... ..+ .|.+++| T Consensus 294 ~~~iaaa~~IP~t~L~G~s~~gln-stgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~li----~~s---~d~--~~~f~pL 363 (435) T protein:vir:79 294 IDRIVALTGIHEIIIKNKNTGGVS-ASQNTALETFYKLIDRKRVEDYKPILEFLLPFM----ISE---TEW--SIEFEPL 363 (435) T ss_pred HHHHHhhhCCCeeeeccCCccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hcC---CCC--eEEeCCC Confidence 999999999999888665554433 2334455555553 2445444433322 111 123 3444567 Q ss_pred hccCHHH-------HHHHHHHHHhCCCCCHHHHHHHh-CCCC---CCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 361 LRGDSAS-------RAAFMKAMGEAGLRTINEMRRTD-NLPP---LPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 361 ~~~d~~~-------~~~~~~~~~~~g~~t~NE~R~~~-g~~p---~~ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ...+.++ .++++++++++|+++++|+|+.+ ...+ +.+.+..-.+ ..++ .+.+++.++|- T Consensus 364 ~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~----~~~d-~~~~~~~e~g~ 433 (435) T protein:vir:79 364 SVPSDKDKAEIMAKNVESVVKLKAEQAINLKETRDTLRSICPDLKIMDNDNIELP----EPED-LDPEPGQEGGL 433 (435) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccccCCCCcccccCC----cccc-CCCCCCCCCCC Confidence 7777655 45567788999999999999977 2222 2222111111 1111 11122333333 No 118 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.78 E-value=4.5e-19 Score=121.19 Aligned_cols=398 Identities=11% Similarity=0.066 Sum_probs=213.8 Q ss_pred CCCC--cccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCc Q lcl|NC_019705. 1 MEEP--KYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~~--~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) |-.- .-..++++...=..+++...+.+...++ .....+.....+.... -...|.+++.++.+|+.+++.+.+-+ T Consensus 1 ~~~~~~a~~~~~~~~a~~~~~~~~~~g~~~~~d~---~~~~~~~~~~~~~~~~-l~~lY~~~~l~r~iVd~~a~d~~r~g 76 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVNRNDFMVGHGKANSRDK---LTRQTPGNGQKLDLKA-CENLYASNSIAMNIVDIISEDMVRAG 76 (461) T ss_pred CccchhhhhhhhhhhhhhhhHHHhhcCCcchhhh---hhccccCcccccCHHH-HHHHHHhCCccchhhccchHHhhcCC Confidence 2110 0011122222112222222221111111 1111111111111111 12445678889999999999999988 Q ss_pred eEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee-CCCC---------------ce Q lcl|NC_019705. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR-NSAG---------------DV 142 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r-~~~G---------------~~ 142 (424) +++.- ++.+.. ..+...+. +- .-.+-+..++.+..++|.|++++.- +.+. .+ T Consensus 77 ~~i~~--~~~~~~-----~~~~~~~~-~l----~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~~~~ 144 (461) T protein:vir:80 77 WSLKT--DNKEMK-----KNIESKWR-KL----KTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTIKSI 144 (461) T ss_pred eeeec--CCHHHH-----HHHHHHHH-Hh----hHHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCcccccccce Confidence 87632 111110 11222222 11 1234455666667899999998843 2211 12 Q ss_pred eEEEEecCceeEE---eec------CceEEEEEEe-------------CCceEEecHhHEEEeecCCC-CCcccCchHHH Q lcl|NC_019705. 143 ISLLPLQSANMDV---KLV------GKKVVYRYQR-------------DSEYAEFSQKEIFHLKGFGF-TGLVGLSPIAF 199 (424) Q Consensus 143 ~~l~~l~~~~v~~---~~~------~~~~~~~~~~-------------~~~~~~~~~~eiih~r~~~~-~~~~G~s~i~~ 199 (424) ..|.|+.+..+.+ ..| +....|.+.. +...+.+.++.|+||.+... +..+|.|.++. T Consensus 145 ~~l~~~~~~~i~~~~~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~~~~~~~G~S~le~ 224 (461) T protein:vir:80 145 PYINTFNTQKVTQLYLNQDMFSEHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLRFEGETKGRSIFES 224 (461) T ss_pred eEEEeccccccchhhhcccCcCcccccceEEEEeccccccccccccccCccceEEccccEEEecCCCCCccccCcchHHH Confidence 2333333333221 111 1122344432 22346799999999987764 56789999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCC-CCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHH Q lcl|NC_019705. 200 ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE-KVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDA 278 (424) Q Consensus 200 ~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~-~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~ 278 (424) +...+.....+......+..+...+ +++++. .....+....+.+.++...+ ..++++++.+.++++++.+..++ T Consensus 225 ~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~~~~~~---~~g~~~~d~~e~~e~~~~~lsgl 299 (461) T protein:vir:80 225 LYDIITVMDTSLWSVGQILYDFAFK--VYKTDDIDALNKDDKANLTAMLDFMFR---TEALAIIKGDEQLTKESTNVSGM 299 (461) T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCC--ceecchHHhhhchHHHHHHHHHHHhcC---CceEEEEcCCcceEEEecCcCCH Confidence 9999999999888888877665443 344432 11222333445555555443 34588899999999999887765 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHH-------HHHHHHHHHHHHHHHhhccChhh---h Q lcl|NC_019705. 279 EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRWLIPAKD---V 348 (424) Q Consensus 279 ~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~-------~tl~P~~~~ie~~l~~~l~~~~~---~ 348 (424) .+..+.....||.+-+||..+|.+...+.. ++.++..++|+. .-+.|+++.+.+.+-+..+.... . T Consensus 300 --~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~--asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p 375 (461) T protein:vir:80 300 --KDLLDYGWDYLAGAVRMPKTVLKGQEAGTL--TGAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDP 375 (461) T ss_pred --HHHHHHHHHHHhhhhcCCeeeeecccCCcc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCc Confidence 588899999999999999988765444333 234555555554 34667777777766554443221 1 Q ss_pred cccchhhhhhhhhccCHHHHH-------HHHHHHHhCCCCCHHHHHHHh-CCCCC-CCCCeeeecccccchhhccc--cC Q lcl|NC_019705. 349 GRIHAEHNLDGLLRGDSASRA-------AFMKAMGEAGLRTINEMRRTD-NLPPL-PGGDVAMRQSQYVPITDLGT--NK 417 (424) Q Consensus 349 ~~~~~~fd~~~l~~~d~~~~~-------~~~~~~~~~g~~t~NE~R~~~-g~~p~-~ggd~~~~~~n~~~~~~~~~--~~ 417 (424) ..+.+.+.+++|...+.++++ +++++++++|++|++|+|+.+ +.-.+ |.+...-......++..... .+ T Consensus 376 ~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (461) T protein:vir:80 376 DSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYDAYA 455 (461) T ss_pred cccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhcccccc Confidence 113345666788777777765 457889999999999999855 32211 11111100001111111111 11 Q ss_pred CCcccC Q lcl|NC_019705. 418 EPRNNG 423 (424) Q Consensus 418 ~~~~~g 423 (424) +.+.+| T Consensus 456 ~e~~~g 461 (461) T protein:vir:80 456 KKNADG 461 (461) T ss_pred ccCCCC Confidence 122222 No 119 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.76 E-value=1.5e-18 Score=118.27 Aligned_cols=373 Identities=11% Similarity=0.065 Sum_probs=207.0 Q ss_pred CCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCc Q lcl|NC_019705. 10 LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDN 89 (424) Q Consensus 10 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~ 89 (424) +-.--|.-+ .|.+...... .+.......+..+ ...|.+++.++++|+++|+.+.+-.|++... +. T Consensus 1 ~~~~D~~~n----~~~gg~~~~~-------~~~~~~~~~~~~l-~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~~--~~- 65 (422) T protein:vir:10 1 MVKTDSYAN----IFLGGSDGSE-------IYGSLQNQAPTIL-ASLYADNALVRRIIDTIPETALAAGFHIDGI--DD- 65 (422) T ss_pred CccchhhHH----HHcCCCCCcc-------ccCcccccCHHHH-HHHHHhChhhHHHHhhhhHHHhcCCccccCC--CH- Confidence 111112222 2222111000 0111111111111 2346778999999999999999998887321 11 Q ss_pred cceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee-C---------CCCceeEEEEecCceeEEee-- Q lcl|NC_019705. 90 RKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR-N---------SAGDVISLLPLQSANMDVKL-- 157 (424) Q Consensus 90 ~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r-~---------~~G~~~~l~~l~~~~v~~~~-- 157 (424) +......+ ..| .-.+-+..++....++|.|++++.. + ..|.+..+.++++..|++.. T Consensus 66 --~~~~~~~~-~~l--------~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~ 134 (422) T protein:vir:10 66 --EPAFWSRW-DDL--------EMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTRE 134 (422) T ss_pred --HHHHHHHH-HHh--------hHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcc Confidence 11111111 112 2245556666667788999988753 3 24567789999998887542 Q ss_pred -c------CceEEEEEEeCC--ceEEecHhHEEEeecCC-------CCCcccCchHHH-HHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019705. 158 -V------GKKVVYRYQRDS--EYAEFSQKEIFHLKGFG-------FTGLVGLSPIAF-ACKSAGVAVAMEDQQRDFFAN 220 (424) Q Consensus 158 -~------~~~~~~~~~~~~--~~~~~~~~eiih~r~~~-------~~~~~G~s~i~~-~~~~i~~~~~~~~~~~~~~~n 220 (424) | +....|.+..++ ....+.++.|+||.+.. .++.+|.|++.. +...+.....+.......+.. T Consensus 135 ~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~ 214 (422) T protein:vir:10 135 ENPRNARFGEPLTYRITTNESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKR 214 (422) T ss_pred cCccccccCcceEEEEecCCCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 122345554433 23678899999996532 234689999986 678888888888888887766 Q ss_pred CCCCceeEEcCC---CCCCHHHHHHHHHHHHHHhCCc-ccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019705. 221 GAKSPQILSTGE---KVLTEQQRSQVEENFKEIAGGP-VKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFG 296 (424) Q Consensus 221 g~~~~~vl~~~~---~~~~~~~~~~~~~~~~~~~~~~-~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fg 296 (424) .... +++++. ...+......+.++++...... +.+.+++..++.++++++.+..++ .+.......+||++.| T Consensus 215 ~~~~--v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl--~~~~~~~~~~iaaa~~ 290 (422) T protein:vir:10 215 KQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGI--DAFLDKKFDRIVALSG 290 (422) T ss_pred hccc--cccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEecccCCh--HHHHHHHHHHHHhhhC Confidence 5433 233331 1122333334444444433332 334455555678999998887764 5889999999999999 Q ss_pred CCHHHhCCCCCCcccchhHHHHHHHHHH-------HHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHH-- Q lcl|NC_019705. 297 VPPHLVGDVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSAS-- 367 (424) Q Consensus 297 VPp~~l~~~~~~~~~~~n~e~~~~~~~~-------~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~-- 367 (424) ||..+|.+...+..+ ++.++..++|+. .-+.|.+..+-+.+- .. ..+.++| +.|...+.++ T Consensus 291 IP~t~L~G~s~~Gln-atgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~----~s---~~~~~~f--~pL~~~sekeka 360 (422) T protein:vir:10 291 IHEIILKNKNVGGVS-SSQNTALETFHKLVDRKRNAELLPILEFLIPFIV----NA---EEWSVEF--NPLAQESSKDKA 360 (422) T ss_pred CCeeeeccCCccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----cc---CCcEEEe--CCCCCCCHHHHH Confidence 999988665554443 234455556665 345555554433332 11 1233444 4666666654 Q ss_pred -----HHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC-Ceeeecccccchhhcc-ccCCCccc Q lcl|NC_019705. 368 -----RAAFMKAMGEAGLRTINEMRRTDNLPPLPGG-DVAMRQSQYVPITDLG-TNKEPRNN 422 (424) Q Consensus 368 -----~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg-d~~~~~~n~~~~~~~~-~~~~~~~~ 422 (424) .++++++++++|+++++|+|+.|--.....| ..-..+......+... ..++|.++ T Consensus 361 ei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 361 EILEKNVNSIAALIAAGAMDIDEARDTLRTIAPEVKINDGSVETEVTISETSNDPLEVPTDD 422 (422) T ss_pred HHHHHHHHHHHHHHhcCCCCHHHHHHHhhhhcccccCCCCCCccccchhhcCCCCCCCCCCC Confidence 4566778999999999999998843221111 0001111111111111 11233333 No 120 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=99.75 E-value=5.5e-17 Score=109.74 Aligned_cols=388 Identities=11% Similarity=0.047 Sum_probs=225.6 Q ss_pred CCCC--cccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCc Q lcl|NC_019705. 1 MEEP--KYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~~--~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) ...| .-+.++.+++--++ ...... ..|... ++ ....++..-.-+..++++.|.+|++.+...|.+++ T Consensus 15 ~~~~~~~~~~~ia~~~~~~~---~~~~~~--~~p~~~----~i--l~~~~~~~~~y~~m~~D~~i~s~l~~Rk~av~~~~ 83 (491) T protein:vir:79 15 FGEPDKSLSSQIATRARSID---FFALGM--YLPNPD----PV--LKALGKDIRVYRELRADAHVGGCVRRRKAAVKALE 83 (491) T ss_pred ccccchhHHHHHhhhccccc---cccccc--cCcchh----HH--HhhccCCHHHHHHHhhChHHHHHHHHHHHHHhCCC Confidence 1010 01111221110000 000000 000000 00 00001111112345679999999999999999999 Q ss_pred eEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-C--ceeEEEEecCceeEE Q lcl|NC_019705. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-G--DVISLLPLQSANMDV 155 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~-G--~~~~l~~l~~~~v~~ 155 (424) |.|...+.+. + ....+..+|. ++ ...+++..+. +.+++|.+..+++.... | .|..+.++|+.++.+ T Consensus 84 w~i~~~~~~~--~---~a~~i~e~l~-~~----~~~~~i~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~ 152 (491) T protein:vir:79 84 WGLDRGKAKS--R---VAKSIADVFA-DL----DLSRIATEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVY 152 (491) T ss_pred cEEecCCCCH--H---HHHHHHHHHh-cC----CHHHHHHHHH-HhhhhcceeEEEEEeecCCeeeEEeeeeecccceee Confidence 9996544321 1 1234555554 33 4556666664 57789999999976443 3 467899999999888 Q ss_pred eecCceEEEEEEeCCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCC Q lcl|NC_019705. 156 KLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV 234 (424) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~ 234 (424) ..++............+.++++...++.++.. ...++|.+.+..+.-.........++...|...-|.|-.+.+++.+. T Consensus 153 d~~~~l~l~~~~~~~~g~~lp~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~a 232 (491) T protein:vir:79 153 DPENQLRFRSKEHWVQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSA 232 (491) T ss_pred ccCCceEEeecCCCCCceeecCCCeEEEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCC Confidence 77665444333333456788888888777664 44589999999999999999999999999999999998888887654 Q ss_pred CCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccc---ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC--CCCc Q lcl|NC_019705. 235 LTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGV---TPQDAEMMASRKFQVSELARFFGVPPHLVGDV--EKST 309 (424) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~---~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~--~~~~ 309 (424) +++.++.+.+.+.+..+ ++ .++++.|++++-+.. +..-..|.+..++-.++|+.+. ||.+ .+++ T Consensus 233 -~~~ek~~l~~al~~~~~--~a--~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i------LGqtlTt~~~ 301 (491) T protein:vir:79 233 -SDAETNLLLDRLEDMVQ--DA--VAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIAL------LGQNQTTEAT 301 (491) T ss_pred -CHHHHHHHHHHHHHHhc--Ce--EEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHH------hhhhhccCcc Confidence 55556666666665532 23 566777766655432 2222336777778788888754 3332 1223 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhh----cccchhhhhhhhhccCHHHHHHHHHHHHhCCC-CCHH Q lcl|NC_019705. 310 SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV----GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGL-RTIN 384 (424) Q Consensus 310 ~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~----~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~-~t~N 384 (424) .+++..+ ........-+.-.+..|+..||. |+.+.-. ......|.+.... .+.+.+++..+++++.|+ ++.+ T Consensus 302 gs~a~~~-vh~~v~~~i~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~e~e-e~~~~~a~~~~~L~~~G~~i~~~ 378 (491) T protein:vir:79 302 STRASAQ-AGLEVTDDIRDGDKAIVVEAMNM-LIRWICDLNFDGAARPVFDMWEQE-QVDEIQAGRDEKLTRAGARFTPA 378 (491) T ss_pred cchhhHH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEeecCcC-chhHHHHHHHHHHHhCCCccCHH Confidence 3333332 33445566677778888888874 5543211 1112344443332 223557889999999886 8999 Q ss_pred HHHHHhCCCCCCCCCeeeecccccchhhc--cc---cCCCcccCC Q lcl|NC_019705. 385 EMRRTDNLPPLPGGDVAMRQSQYVPITDL--GT---NKEPRNNGA 424 (424) Q Consensus 385 E~R~~~g~~p~~ggd~~~~~~n~~~~~~~--~~---~~~~~~~ga 424 (424) ++|+.+|+|+-+.++....+....+.... .. .+....+.+ T Consensus 379 ~~~e~~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 423 (491) T protein:vir:79 379 YFKRAYNLQDGDLDERPLPVSAVDAVGAASFAEFEAPDQDALDAA 423 (491) T ss_pred HHHHHhCCCCCCCCccccCcCcccccccccccccCCCCCcchHHH Confidence 99999999876656554432211111100 00 001111111 No 121 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=99.74 E-value=3.6e-17 Score=110.75 Aligned_cols=388 Identities=9% Similarity=-0.017 Sum_probs=230.6 Q ss_pred HHHHHhhccCcccCCccccchh-------hcc-----ccccc---------c-Ccc-----cccHHHHhccHHHHHHHHH Q lcl|NC_019705. 17 WARLQSWFVGGRLVTPNQGSQT-------GPV-----SAHGH---------L-GDS-----SINDERILQISTVWRCVSL 69 (424) Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~-------~~~-----~~~~~---------~-~~~-----~vs~~~~~~~~~v~~~i~~ 69 (424) +.+|....+++-...+...... ..+ .+.+. . .|. .+-.+...+++-|.+|++. T Consensus 1 m~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L~~dm~~~D~hi~s~l~~ 80 (512) T protein:vir:19 1 MGRILDISGQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADLAFDMEEKDTHLFSELSK 80 (512) T ss_pred CcceeCCCCCccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 2233322222211110000000 000 00000 0 000 0111222468999999999 Q ss_pred HHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeC---CCCceeEEE Q lcl|NC_019705. 70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN---SAGDVISLL 146 (424) Q Consensus 70 ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~---~~G~~~~l~ 146 (424) +...|.+++|.|....+. ...+.....-+...|...| ...+++..+. +.+++|-+.++++.. ....|..+. T Consensus 81 Rk~av~~~~w~I~p~~~~-~~~~~~~a~~v~~~l~~~~----~f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~~~~~~ 154 (512) T protein:vir:19 81 RRLAIQALEWRIAPARDA-SAQEKKDADMLNEYLHDAA----WFEDALFDAG-DAILKGYSMQEIEWGWLGKMRVPVALH 154 (512) T ss_pred HHHHHhCCCceEecCCCC-CHHHHHHHHHHHHHHhcCC----CHHHHHHHHH-hhhhhcceeeeeEeeeeCCceeeeeee Confidence 999999999998654322 1111111223444554434 2456666655 467899999998763 334678999 Q ss_pred EecCceeEEeecCceEEEEEEeCCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCc Q lcl|NC_019705. 147 PLQSANMDVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSP 225 (424) Q Consensus 147 ~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~ 225 (424) +.++..+.+..++............+.++++...++.++.. ...++|.+.+..+.-.........++...|...-|.|- T Consensus 155 ~r~~~~f~~~~~~~~~lr~~~~~~~G~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~ 234 (512) T protein:vir:19 155 HRDPALFCANPDNLNELRLRDASYHGLELQPFGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPM 234 (512) T ss_pred eeccccceeccCCCcEEEecCCCCCceeecCCceEEEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCe Confidence 99999888877665444333334456778888876666554 45689999999999999999999999999999999988 Q ss_pred eeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccC-hhHHHHHHHHHHHHHHHHHHhCCCHHHhCC Q lcl|NC_019705. 226 QILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGD 304 (424) Q Consensus 226 ~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~ 304 (424) -+-+++.+. ++++++.+.+.+.+..+ + ..++++.|++++-+..+ .....|.+..++..++|+.+. ||. T Consensus 235 ~igky~~~a-~~~ek~~L~~al~~~~~--~--a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~i------LGq 303 (512) T protein:vir:19 235 RVGKYPTGS-TNREKATLMQAVMDIGR--R--AGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAI------LGG 303 (512) T ss_pred eEEecCCCC-CHHHHHHHHHHHHHHhh--C--cEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHH------hhh Confidence 888887654 55566666666666533 2 35677777766554432 233447888888899999872 333 Q ss_pred CC----CCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhc----------ccchhhhhhhhhccCHHHHHH Q lcl|NC_019705. 305 VE----KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG----------RIHAEHNLDGLLRGDSASRAA 370 (424) Q Consensus 305 ~~----~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~----------~~~~~fd~~~l~~~d~~~~~~ 370 (424) +- +++.+++ ..+........-+.-.++.|+..||+.|+.+.-.. ..+++|+.. ...|.+..++ T Consensus 304 tlTs~~g~~Gs~a-~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~~--e~eDl~~~a~ 380 (512) T protein:vir:19 304 TLTTEAGDKGARS-LGEVHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFDTS--EAGDITALSD 380 (512) T ss_pred hhcccccccchhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCC--ChhhHHHHHH Confidence 21 1222233 23445566778888999999999999888653111 123455443 3467777888 Q ss_pred HHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchh-hcc-----ccCCCcccCC Q lcl|NC_019705. 371 FMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPIT-DLG-----TNKEPRNNGA 424 (424) Q Consensus 371 ~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~-~~~-----~~~~~~~~ga 424 (424) .+.++..+--++..++|+.+|+|.-..++....+....+-. ... .+..+.++.. T Consensus 381 ~~~~l~~G~~i~~~~i~e~~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (512) T protein:vir:19 381 AIPKLAAGMRIPVSWIQEKLHIPQPVGDEAVFTIQPVVPDNGSQKEAALSAEDIPQEDDI 440 (512) T ss_pred HHHHHhcCCCCCHHHHHHHhCCCCCCCccccccCCCccccccccccccccccCCCchhhH Confidence 88877655578999999999997654444443321111100 000 0000000000 No 122 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=99.74 E-value=5.7e-17 Score=109.65 Aligned_cols=388 Identities=10% Similarity=0.028 Sum_probs=228.6 Q ss_pred CCC--CcccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCc Q lcl|NC_019705. 1 MEE--PKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~--~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) +.. +.-+-.++|++.+.+.. ... . +|.... ++. ....+..-.-+..++++.|.+|++.+...|.+++ T Consensus 15 ~~~~~~~~~~~ia~~~~~~~~~----~~~-~-~~~~~~---~iL--r~~~~~~~~y~~m~~D~~i~s~l~~Rk~av~~~~ 83 (491) T protein:vir:10 15 FGEPDKSLSSQIATRARSIDFF----ALG-M-YLPNPD---PVL--KALGKDIRVYRELRADAHVGGCVRRRKAAVKALE 83 (491) T ss_pred cccCChHHHHHHHhhhcccccc----ccc-C-CccchH---HHH--HhcCCCHHHHHHHhhChHHHHHHHHHHHHHhCCC Confidence 111 12234445544332211 111 0 111100 000 0000111112445689999999999999999999 Q ss_pred eEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-C--ceeEEEEecCceeEE Q lcl|NC_019705. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-G--DVISLLPLQSANMDV 155 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~-G--~~~~l~~l~~~~v~~ 155 (424) |.|...+.+.. ....+..+|. ++ ...+++..+. +.+++|.+..+++.... | .|..+.++|+.++.+ T Consensus 84 w~i~~~~~~~~-----~~e~v~e~l~-~~----~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~ 152 (491) T protein:vir:10 84 WGLDRGKAKSR-----VAKSIADVFA-DL----DLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVY 152 (491) T ss_pred cEEecCCCCHH-----HHHHHHHHHh-cC----CHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeeecccceee Confidence 99965433221 1233555554 33 4667777776 57889999999986544 3 467899999999888 Q ss_pred eecCceEEEEEEeCCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCC Q lcl|NC_019705. 156 KLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV 234 (424) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~ 234 (424) ..++...+..-.....+.++++...++.++.. ...++|.+.+..+.-.........++...|...-|.|--+.+++.+. T Consensus 153 d~~~~l~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a 232 (491) T protein:vir:10 153 DPENQLRFRSKDHWMQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSA 232 (491) T ss_pred ccCCceEEecCCCCCCcceecCCCEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCCCC Confidence 77655433222223456788888887777654 45589999999999999999999999999999999998888887654 Q ss_pred CCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccc--Chh-HHHHHHHHHHHHHHHHHHhCCCHHHhCCC--CCCc Q lcl|NC_019705. 235 LTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGV--TPQ-DAEMMASRKFQVSELARFFGVPPHLVGDV--EKST 309 (424) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~--~~~-d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~--~~~~ 309 (424) ++++++.+.+.+.+..+ + ..++++.|++++-+.. +.. -..|.+..++-.++|+.+. ||.+ .+++ T Consensus 233 -~~~ek~~l~~al~~~~~--~--a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i------LGqtlTt~~~ 301 (491) T protein:vir:10 233 -SDGEKNLLLDCLEDMVQ--D--AVAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIAL------LGQNQTTEAT 301 (491) T ss_pred -CHHHHHHHHHHHHHHhc--C--cEEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHH------hhhhcccCcc Confidence 55666666666666533 2 3567777776655532 222 2237777778888887763 3332 1223 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhh----cccchhhhhhhhhccCHHHHHHHHHHHHhCCC-CCHH Q lcl|NC_019705. 310 SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV----GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGL-RTIN 384 (424) Q Consensus 310 ~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~----~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~-~t~N 384 (424) .+++.. +........-+.-.+..++..+|. |+.+.=. ...+.+|.+.... .+.+.+++...++++.|+ ++.. T Consensus 302 gs~a~~-~vh~~v~~di~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~~~~-e~~~~~a~~~~~L~~~G~~i~~~ 378 (491) T protein:vir:10 302 STRASA-QAGLEVTDDIRDGDKAVVSEAMNM-LIRWICDLNFDGADRPVFDMWEQE-QVDEIQAGRDQKLTQAGARFTPA 378 (491) T ss_pred cchhHH-HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEEecCcC-chhHHHHHHHHHHHhCCCcCCHH Confidence 333333 234445566667777888887874 5543210 0111233333322 334678999999999986 8999 Q ss_pred HHHHHhCCCCCCCCCeeeeccccc--chhhccccCCC---cccCC Q lcl|NC_019705. 385 EMRRTDNLPPLPGGDVAMRQSQYV--PITDLGTNKEP---RNNGA 424 (424) Q Consensus 385 E~R~~~g~~p~~ggd~~~~~~n~~--~~~~~~~~~~~---~~~ga 424 (424) ++|+.+|+|+-+.++......... +....+....+ ..+.+ T Consensus 379 ~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 423 (491) T protein:vir:10 379 YFKRAYNLQDGDLDERPLPVSAVDTVGAASFAEFEAPDQDALDAA 423 (491) T ss_pred HHHHHhCCCCCCcCccccccCCCCCcccccccccCCCCCCchHHH Confidence 999999998755454433211111 11100000000 00000 No 123 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.71 E-value=8.4e-17 Score=108.72 Aligned_cols=413 Identities=11% Similarity=-0.029 Sum_probs=228.3 Q ss_pred ccCCCCCch-----HHHHHhhccCcccCCccccchhhccccc-------cccCcccccHHHHhccHHHHHHHHHHHHhhc Q lcl|NC_019705. 8 IDLRTNNGW-----WARLQSWFVGGRLVTPNQGSQTGPVSAH-------GHLGDSSINDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 8 ~~~~~~~G~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia 75 (424) |+.++-.|. ...+.+.-.+........ ..+.+.... ....-..-+...+..++.+.+||+.+.+.|- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~-~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvV 79 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGGGFGGQL-RGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHIV 79 (530) T ss_pred CccceeecCccccchHHHhhhhcccCCCCCcc-cccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhh Confidence 554444442 233333322211100000 000010000 0001112234556678999999999999888 Q ss_pred cCceEEEEecc------CCccceeccchHHHHHhh---cCCC------CCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC- Q lcl|NC_019705. 76 CLPLDVFETDQ------NDNRKKVDLSNPLARLLR---YSPN------QYMTAQEFREAMTMQLCFYGNAYALVDRNSA- 139 (424) Q Consensus 76 ~~~~~v~~~~~------~~~~~~~~~~~~l~~lL~---~~pn------~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~- 139 (424) +-.|++.-+.+ +++.. ......+..++. ..|+ ..+|.+++.+.++..++..|++|+.+.+... T Consensus 80 G~Gi~~~~~p~~~~l~~~~~~~-~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~ 158 (530) T protein:vir:38 80 GSFFRLSYRPSWRYLGINEEDS-RAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDSDS 158 (530) T ss_pred CCCceeeeccchhhcCCCHhHH-HHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeeccCC Confidence 77887764421 11111 111122333332 3343 3468889999999999999999999876543 Q ss_pred C--ceeEEEEecCceeEEe----------------ecCceEEEEEE-e--CC----------ceEEecHhHEEEeecCC- Q lcl|NC_019705. 140 G--DVISLLPLQSANMDVK----------------LVGKKVVYRYQ-R--DS----------EYAEFSQKEIFHLKGFG- 187 (424) Q Consensus 140 G--~~~~l~~l~~~~v~~~----------------~~~~~~~~~~~-~--~~----------~~~~~~~~eiih~r~~~- 187 (424) | .+..|..|+|+++.-. ..+....|.+. . .+ ....++.++|+|+.... T Consensus 159 g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~~r 238 (530) T protein:vir:38 159 TRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEPME 238 (530) T ss_pred CCccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccChhHeEeeccccC Confidence 3 3678889988776421 11222233332 1 11 12346677999998765 Q ss_pred CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCC----------HHHHHHHHHHHHHH---hCC- Q lcl|NC_019705. 188 FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT----------EQQRSQVEENFKEI---AGG- 253 (424) Q Consensus 188 ~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~----------~~~~~~~~~~~~~~---~~~- 253 (424) +...+|+|.+..+...+.-.....+......+-.+...++|+.+.+... +.....+....... ... T Consensus 239 ~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (530) T protein:vir:38 239 DGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAA 318 (530) T ss_pred CCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchhhhhccccc Confidence 5679999999999888887766666666666666777788875543211 01111111111111 110 Q ss_pred ---cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCcccchhHHH-----------H Q lcl|NC_019705. 254 ---PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLV-GDVEKSTSWGSGIEQ-----------Q 318 (424) Q Consensus 254 ---~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l-~~~~~~~~~~~n~e~-----------~ 318 (424) =..|.+..|..|.+++.+..+-...+|.+..+.....||+.+|||.+.| +...+.| |+++.. . T Consensus 319 ~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~n--YSS~R~~~~e~~r~~~~~ 396 (530) T protein:vir:38 319 PVRLGGARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMS--YSTARASANESWAYFMGR 396 (530) T ss_pred ceeccCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccccc--HHHHHHHHHHHHHHHHHH Confidence 1245688899999999888776667789999999999999999999888 4344443 444333 3 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHhhccChhh---------hc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHH Q lcl|NC_019705. 319 NLGFLQYTLQPYISR-WENSIQRWLIPAKD---------VG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMR 387 (424) Q Consensus 319 ~~~~~~~tl~P~~~~-ie~~l~~~l~~~~~---------~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R 387 (424) +..+...-+.|.... ++.++....++-.. +. ...+.+-.-.....|+...+++...++++|+.|.-|+- T Consensus 397 q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~ 476 (530) T protein:vir:38 397 RKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKEC 476 (530) T ss_pred HHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHH Confidence 333444444554444 34444443332110 00 11223333445567999999999999999999999998 Q ss_pred HHhCCCCCCC----------CCeeeec--cccc--ch-hhccccC--CCcccCC Q lcl|NC_019705. 388 RTDNLPPLPG----------GDVAMRQ--SQYV--PI-TDLGTNK--EPRNNGA 424 (424) Q Consensus 388 ~~~g~~p~~g----------gd~~~~~--~n~~--~~-~~~~~~~--~~~~~ga 424 (424) +..|..+-+- .+++=++ .... +. .....++ +...+|| T Consensus 477 a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~d~~~~a 530 (530) T protein:vir:38 477 AKRGDDYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAGVKKSNEEEQDGARAA 530 (530) T ss_pred HHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCCCCCCCCCCCCCCCCC Confidence 8888876321 1111111 1111 10 1111122 2223333 No 124 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.70 E-value=2.6e-17 Score=111.51 Aligned_cols=405 Identities=13% Similarity=0.036 Sum_probs=228.2 Q ss_pred CchHHHHHhhccCcccC---Cccc-----cc-h-hhccccccccCcc-------------cccHHHHhccHHHHHHHHHH Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLV---TPNQ-----GS-Q-TGPVSAHGHLGDS-------------SINDERILQISTVWRCVSLI 70 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~---~~~~-----~~-~-~~~~~~~~~~~~~-------------~vs~~~~~~~~~v~~~i~~i 70 (424) ++|++|+.++|...... .+.. .. . .....+.. ..+. .-+...+..++.+.+||+.+ T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~-~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~ 79 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARR-ENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKL 79 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCCCC-CCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 89999998888531110 0000 00 0 00011111 0111 01234455689999999988 Q ss_pred HHhhccC-ceEEEEec--cCCccceeccchHHHHHhh-----cCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC---- Q lcl|NC_019705. 71 STLTACL-PLDVFETD--QNDNRKKVDLSNPLARLLR-----YSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS---- 138 (424) Q Consensus 71 a~~ia~~-~~~v~~~~--~~~~~~~~~~~~~l~~lL~-----~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~---- 138 (424) .+.+-+- .+.+.-+. .+....+ .....+..++. ...+.-++.+.+.+.++..++..|++|+.+.+.. T Consensus 80 ~~nvVG~ggi~~~~~~~~~~~~~~~-~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~ 158 (502) T protein:vir:79 80 EERVVGKNGIIVEPHPVLRNGAIAR-DLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSL 158 (502) T ss_pred HHhhccCCceeeeeccCCCChhHHH-HHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCcc Confidence 8876654 44432221 1111111 11112222221 2233456888999999999999999999986543 Q ss_pred ---CCceeEEEEecCceeEE------------e--ecCceEEEEEEe-------CCceEEecHhHEEEeecCC-CCCccc Q lcl|NC_019705. 139 ---AGDVISLLPLQSANMDV------------K--LVGKKVVYRYQR-------DSEYAEFSQKEIFHLKGFG-FTGLVG 193 (424) Q Consensus 139 ---~G~~~~l~~l~~~~v~~------------~--~~~~~~~~~~~~-------~~~~~~~~~~eiih~r~~~-~~~~~G 193 (424) .+.+..|..|+|+++.. . ..+....|.+.. ......+++++|+|+..+. +...+| T Consensus 159 ~~g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~~rvpA~~vlH~f~~~r~gQ~RG 238 (502) T protein:vir:79 159 TPSAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAERMLHLKFVRRLHQMRG 238 (502) T ss_pred CCCcccceEEEEecchhcCCCCCCCCeeEeeeEECCCCceEEEEEeecCCCCCcccceeEechhheEEeecccCCccccC Confidence 23467899999987742 1 122233343321 1234679999999998765 567999 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcce-ecCCCceeeecc Q lcl|NC_019705. 194 LSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLW-ILEAGFSTSAIG 272 (424) Q Consensus 194 ~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~-~l~~g~~~~~l~ 272 (424) +|.+..+...+..............+-.+...++|+.+.+....... .-...-+... .-..|.++ .|..|.+++.+. T Consensus 239 is~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~-~~~~~~~~~~-~l~pG~i~~~L~pGe~i~~~~ 316 (502) T protein:vir:79 239 TSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDG-NGSKENEREL-TIQPGIIYDDLKPGEEIGMVK 316 (502) T ss_pred CchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccccc-CCCCCccccc-cccCCccccccCCCceeeeeC Confidence 99999998888877776666666666677778888865432111000 0000000000 01235454 589999999888 Q ss_pred cChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHH-----------HHHHHHHHHHHHH-HHHHHh Q lcl|NC_019705. 273 VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG-----------FLQYTLQPYISRW-ENSIQR 340 (424) Q Consensus 273 ~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~-----------~~~~tl~P~~~~i-e~~l~~ 340 (424) .+.....|.+..+.....||+.+|||.+.|..--.+ +|+++-..... |...-++|+...+ +.++-. T Consensus 317 p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~--nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~ 394 (502) T protein:vir:79 317 SDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYNG--TYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVAS 394 (502) T ss_pred CCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 765667789999999999999999999888654333 55564443333 3334444533332 333333 Q ss_pred hccChh---hhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC----------CCeeeeccc Q lcl|NC_019705. 341 WLIPAK---DVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG----------GDVAMRQSQ 406 (424) Q Consensus 341 ~l~~~~---~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g----------gd~~~~~~n 406 (424) ..++-. ++. ...+.+-.-.....|+....++...++++|+.|.-|+-+..|.++-+- .+++=++.. T Consensus 395 G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~ 474 (502) T protein:vir:79 395 GVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGRNPDDVKRRRKAEIDENRKLDLVFD 474 (502) T ss_pred CCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCC Confidence 332211 111 122333344556679999999999999999999999988888887421 111111111 Q ss_pred ccc------hhhccccCCCcccCC Q lcl|NC_019705. 407 YVP------ITDLGTNKEPRNNGA 424 (424) Q Consensus 407 ~~~------~~~~~~~~~~~~~ga 424 (424) ..| .+...+.+++.+.++ T Consensus 475 ~~~~~~~~~~~~~~~~~e~~~~~~ 498 (502) T protein:vir:79 475 TDPASDKGGSSAATKRQEPQHTDD 498 (502) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCC Confidence 111 111122222333333 No 125 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.67 E-value=2.2e-16 Score=106.44 Aligned_cols=416 Identities=11% Similarity=-0.014 Sum_probs=231.2 Q ss_pred CCCCcccccCCCCC-chHHH-HHhhccC---cccCCccccchhhcccccccc-C-----------cccccHHHHhccHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNN-GWWAR-LQSWFVG---GRLVTPNQGSQTGPVSAHGHL-G-----------DSSINDERILQISTV 63 (424) Q Consensus 1 ~~~~~~~~~~~~~~-G~~~~-~~~~~~~---~~~~~~~~~~~~~~~~~~~~~-~-----------~~~vs~~~~~~~~~v 63 (424) |....+.|-+.-+. ++..+ -+...+. ........... ..+...... + -..-+...+..++.+ T Consensus 1 ~~r~~~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~-~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a 79 (505) T protein:vir:96 1 MKRAEKKPSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLG-KAWLRRASRLSADEEIYADLASLVQRAREQSINNPYA 79 (505) T ss_pred CCCCccccchhhcccchhhhhhHHHHHHhhhhcccccCCCcc-ccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHH Confidence 77777776665554 21110 0000000 00000000000 000000000 0 011234445668999 Q ss_pred HHHHHHHHHhhcc-CceEEEEecc--CCccceeccchHH---HHHhhcCCC----CCCCHHHHHHHHHHHHHHcCCeEEE Q lcl|NC_019705. 64 WRCVSLISTLTAC-LPLDVFETDQ--NDNRKKVDLSNPL---ARLLRYSPN----QYMTAQEFREAMTMQLCFYGNAYAL 133 (424) Q Consensus 64 ~~~i~~ia~~ia~-~~~~v~~~~~--~~~~~~~~~~~~l---~~lL~~~pn----~~~s~~~f~~~~~~~~ll~G~a~~~ 133 (424) ..+|+.+.+.+-+ ..+...-+.. ++...+ .....+ .......+| .-++.+++.+.++..++..|+||+. T Consensus 80 ~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~-~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~ 158 (505) T protein:vir:96 80 KRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDD-RANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVR 158 (505) T ss_pred HHHHHHHHHHhcCCCcceeeecCCcccccccH-HHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEE Confidence 9999988887665 5666544322 111111 111112 222233444 3367888899999999999999998 Q ss_pred EeeCCC-CceeEEEEecCceeEEee-----c-------------CceEEEEEEe---C----------CceEEecHhHEE Q lcl|NC_019705. 134 VDRNSA-GDVISLLPLQSANMDVKL-----V-------------GKKVVYRYQR---D----------SEYAEFSQKEIF 181 (424) Q Consensus 134 ~~r~~~-G~~~~l~~l~~~~v~~~~-----~-------------~~~~~~~~~~---~----------~~~~~~~~~eii 181 (424) +.+... ..+..|..|+|+++.... + +....|.+.. + .....+++++|+ T Consensus 159 ~~~~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~~~~rvpa~~vl 238 (505) T protein:vir:96 159 EHRGYPNKWGYALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYAGQTYERVPADEII 238 (505) T ss_pred EeecCCCCcceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeecCCCccccccccccccccccCHhHhh Confidence 865433 346788999988874221 1 1122333321 1 123458999999 Q ss_pred EeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcce Q lcl|NC_019705. 182 HLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLW 260 (424) Q Consensus 182 h~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~ 260 (424) |+..+. +...+|+|.+..+...+.-.....+......+=.+...++|+.+.+...+...+.-.. ....-..|.+. T Consensus 239 H~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~----~~~~l~pG~i~ 314 (505) T protein:vir:96 239 HTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPEDDQGE----IVEEVEAGTYQ 314 (505) T ss_pred hhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCccccccCc----cccccCCceee Confidence 998764 5679999999999888887776666666666666777888886544332221111000 00111356788 Q ss_pred ecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhC-CCCCCcccchhHHHH-----------HHHHHHHHHH Q lcl|NC_019705. 261 ILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVG-DVEKSTSWGSGIEQQ-----------NLGFLQYTLQ 328 (424) Q Consensus 261 ~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~-~~~~~~~~~~n~e~~-----------~~~~~~~tl~ 328 (424) .|..|.+++.+..+-....|.+..+...+.||+.+|||.+.|. ...+.| |+++-+. +..|+..-++ T Consensus 315 ~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~n--YSS~R~~~~e~~r~~~~~q~~~~~~~~~ 392 (505) T protein:vir:96 315 LLPYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVN--FSSLRSGELDERDLYKLLQFFVVTELLE 392 (505) T ss_pred ecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999987766778899999999999999999998884 333443 4443332 2334445566 Q ss_pred HHHHH-HHHHHHhhccChhh--hc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC------- Q lcl|NC_019705. 329 PYISR-WENSIQRWLIPAKD--VG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG------- 397 (424) Q Consensus 329 P~~~~-ie~~l~~~l~~~~~--~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g------- 397 (424) |+... ++.++....++-.. .. ...+.+-.-.....|+....++...++++|+.|.-|+-+..|.++-+- T Consensus 393 pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e 472 (505) T protein:vir:96 393 RVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDDPEDVFDEIAWE 472 (505) T ss_pred HHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHH Confidence 64444 33343333332111 11 122344444556679999999999999999999999988888887421 Q ss_pred ---CCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 398 ---GDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 ---gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) .++.=++.+..+........+..++++ T Consensus 473 ~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~ 502 (505) T protein:vir:96 473 EQLMRDKGVNPTPPEQESKDATTDEEDDSA 502 (505) T ss_pred HHHHHHcCCCCCCCCCCCCCCCCCCCCCCC Confidence 111111111111111111111111112 No 126 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=99.65 E-value=1.2e-15 Score=102.29 Aligned_cols=400 Identities=11% Similarity=0.021 Sum_probs=224.6 Q ss_pred CCCCcccccCCC---------CCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRT---------NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 1 ~~~~~~~~~~~~---------~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia 71 (424) +..|+=+..... -.|....+.+....+-. |.. ..++. -...+..+ .+..+.++.|.+|++.+. T Consensus 5 ~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~--~~~---~~~iL--r~~~~~~l-y~~m~~D~hi~s~l~~Rk 76 (448) T protein:vir:79 5 GRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVV--DRE---FDELL--QGKDGLLV-YHKMLSDGTVKNALNYIF 76 (448) T ss_pred CCCCccccCcccccccccchhhhhhhhhhccccccccc--ccc---hhHhh--ccccchHH-HHHHhhChHHHHHHHHHH Confidence 222221111100 00222222111110000 000 00000 00011112 234567899999999999 Q ss_pred HhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCC---CCHHHHHHHHHHHHHHcCCeEEEEeeC--CCCc--eeE Q lcl|NC_019705. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQY---MTAQEFREAMTMQLCFYGNAYALVDRN--SAGD--VIS 144 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~---~s~~~f~~~~~~~~ll~G~a~~~~~r~--~~G~--~~~ 144 (424) ..|.+++|.|...+++. +.......+...|. .++.. .++.+++..+ .+.+++|-++++++.. .+|. +.. T Consensus 77 ~av~~~~w~v~p~~~~~--~~~~~ae~v~~~l~-~~~~~~~~~~f~~~~~~~-lda~~~G~s~~Eivw~~~~~g~~~~~~ 152 (448) T protein:vir:79 77 GRIRSAKWYVEPASTDP--EDIAIAAFIHAQLG-IDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKLILDK 152 (448) T ss_pred HHHhcCCceEecCCCCH--HHHHHHHHHHHHhh-hhhhhhccCCHHHHHHHH-HHhhhhcceeEEEEeeecCCCceeccc Confidence 99999999995433322 11111122333332 33322 2344545444 4467899999998863 3554 456 Q ss_pred EEEecCc---eeEEeecCceEEEEEEe-------CCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 145 LLPLQSA---NMDVKLVGKKVVYRYQR-------DSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQ 214 (424) Q Consensus 145 l~~l~~~---~v~~~~~~~~~~~~~~~-------~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~ 214 (424) |.+.++. +..+..+++........ +.....+|..-++|..+.....++|.+.+..+.-.........++. T Consensus 153 l~~r~~~~~~~f~~~~d~~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w 232 (448) T protein:vir:79 153 IVPIHPFNIDEVLYDEEGGPKALKLSGEVKGGSQFVSGLEIPIWKTVVFLHNDDGSFTGQSALRAAVPHWLAKRALILLI 232 (448) T ss_pred ccccCCccccceeeecCCceEEeecCCcccccccCCCccccccceEEEEecCccCCcccchhHHHHHHHHHHHHHHHHHH Confidence 7777776 34455555443332211 1133456788888887654445899999999999999999999999 Q ss_pred HHHHhcCCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 215 RDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELAR 293 (424) Q Consensus 215 ~~~~~ng~~~~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~ 293 (424) ..|...-+.|--+.+++.+.. +++.++.+.+...+...+.++ .++++.|++++-+.......++.+..++..++|+. T Consensus 233 ~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a--~~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~~Isk 310 (448) T protein:vir:79 233 NHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRH--GIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIAR 310 (448) T ss_pred HHHHHHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHhcCCce--EEEecCCceEEEEecCCCcccHHHHHHHHHHHHHH Confidence 999999998888888876543 356667777777777666555 36688888776665544444566778888888887 Q ss_pred HhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChh-----hh--cccchhhhhhhhhccCHH Q lcl|NC_019705. 294 FFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-----DV--GRIHAEHNLDGLLRGDSA 366 (424) Q Consensus 294 ~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~-----~~--~~~~~~fd~~~l~~~d~~ 366 (424) +.-=. . +.. +.++.+++...........+.++-.++.|+..||+.|+.+. +. ...++.|+.. ...|.+ T Consensus 311 ~iLGq-t-lTs-~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~lNfg~~~~~P~~~f~~~--e~~Dl~ 385 (448) T protein:vir:79 311 ALGID-F-NTV-QLNMGVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPSATRFPRLTFEME--ERNDFS 385 (448) T ss_pred HHhhh-h-hcc-ccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCCCcEEEecCC--ChHHHH Confidence 65321 1 111 11112222332333445567778889999999998887643 11 1124455433 446777 Q ss_pred HHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 367 SRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 367 ~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ..++.+.+++..+-...+-+|+.+|+|.-..++....+...- ..++.+...+. T Consensus 386 ~~a~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~~~a~~~~~-----~~~~~~~~~~~ 438 (448) T protein:vir:79 386 AAANLMGMLINAVKDSEDIPTELKALIDALPSKMRRALGVVD-----EVREAVRQPAD 438 (448) T ss_pred HHHHHhhhhhccchhhHHHHHHhhcCCCCCCCccccccCCCC-----cccccccCCcc Confidence 788888899887765556678889998432233322211000 11111111111 No 127 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=99.65 E-value=1.5e-15 Score=101.88 Aligned_cols=398 Identities=12% Similarity=0.056 Sum_probs=224.8 Q ss_pred CCCCc---------ccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHH Q lcl|NC_019705. 1 MEEPK---------YTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 1 ~~~~~---------~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia 71 (424) -..|| -.+...|-.|....+.+....+-. |... .++. -...+..+ .+..+.++.|.+|++.+. T Consensus 5 ~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~--~~~~---~~iL--r~~~~~~l-y~~m~~D~hi~s~l~~Rk 76 (448) T protein:vir:77 5 GRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVV--DREF---DELL--QGKDGLLV-YHKMLSDGTVKNALNYIF 76 (448) T ss_pred CCCCcccCCcccccchhhhhhhccchhhhccccccccc--ccch---hHhh--ccccchHH-HHHHhhChHHHHHHHHHH Confidence 01111 112222222333333222211110 0000 0000 00111122 244567899999999999 Q ss_pred HhhccCceEEEEeccCCccceeccchHHHHHhhcCCC---CCCCHHHHHHHHHHHHHHcCCeEEEEeeC--CCCc--eeE Q lcl|NC_019705. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPN---QYMTAQEFREAMTMQLCFYGNAYALVDRN--SAGD--VIS 144 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn---~~~s~~~f~~~~~~~~ll~G~a~~~~~r~--~~G~--~~~ 144 (424) ..|.+++|.|...+++.. ......-+...|. .+. ...++.+++..+ .+.+++|-+.++++.. .+|. +.. T Consensus 77 ~av~~~~w~v~p~~~~~~--d~~~ae~v~~~l~-~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~~~dg~~~~~~ 152 (448) T protein:vir:77 77 GRIRSAKWYVEPASTDPE--DIAIAAFIHAQLG-IDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKLILDK 152 (448) T ss_pred HHHhcCCceEecCCCCHH--HHHHHHHHHHHhh-chhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEeecCCCceeecc Confidence 999999999864333221 1111122333332 222 123566777776 5789999999998763 3554 456 Q ss_pred EEEecCcee---EEeecCceEEEEEEe-------CCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 145 LLPLQSANM---DVKLVGKKVVYRYQR-------DSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQ 214 (424) Q Consensus 145 l~~l~~~~v---~~~~~~~~~~~~~~~-------~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~ 214 (424) |.+.++..+ .+..+++........ ......+|...++|.++.....++|.+.+..+.-.........++. T Consensus 153 l~~r~~~~~~~f~~~~~~~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w 232 (448) T protein:vir:77 153 IVPIHPFNIDEVLYDEEGGPKALKLSGEVKGGSQFVNGLEIPIWKTVVFLHNDDGSFTGQSALRAAVPHWLAKRALILLI 232 (448) T ss_pred ccccCCCccceeeeecCCceEEEecCCcccccccCCCccccccceEEEEecCCcCCcccchHHHHHHHHHHHHHhhHHHH Confidence 777777543 344444433322111 1123456788888887654445899999999999999999999999 Q ss_pred HHHHhcCCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 215 RDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELAR 293 (424) Q Consensus 215 ~~~~~ng~~~~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~ 293 (424) ..|...-+.|--+.+++.+.. +++.++.+.+...+...+.++ .++++.|++++-+..+.....+.+..++..++|+. T Consensus 233 ~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a--~~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~~Isk 310 (448) T protein:vir:77 233 NHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRH--GIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIAR 310 (448) T ss_pred HHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcCCce--EEEecCCceEEEEecCCCccCHHHHHHHHHHHHHH Confidence 999999998888888876643 456677777777776655555 36678888766665544444566778888888888 Q ss_pred HhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChh-----hh--cccchhhhhhhhhccCHH Q lcl|NC_019705. 294 FFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-----DV--GRIHAEHNLDGLLRGDSA 366 (424) Q Consensus 294 ~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~-----~~--~~~~~~fd~~~l~~~d~~ 366 (424) +..-. . +.. +.++.+++...+.........+.-.++.|++.||+.|+.+. +. ...++.|+.. ...|.+ T Consensus 311 ~iLGq-t-lTs-~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lNfg~~~~~P~~~f~~~--e~eDl~ 385 (448) T protein:vir:77 311 ALGID-F-NTV-QLNMGVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPGATRFPRLTFEME--ERNDFS 385 (448) T ss_pred HHhcc-c-ccc-ccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCC--ChhhHH Confidence 76443 1 111 11222223333333356677778899999999998887643 11 1124556544 346777 Q ss_pred HHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchhhcccc-CCCcccCC Q lcl|NC_019705. 367 SRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTN-KEPRNNGA 424 (424) Q Consensus 367 ~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~-~~~~~~ga 424 (424) ..++.+.+++ +-+|+.+|+|.-.++.....|....+...+.+. .+.....| T Consensus 386 ~~a~~~~~l~-------~~~~~~~~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (448) T protein:vir:77 386 AAANLMGMLI-------NAVKDSEDIPTELKALIDALPSKMRRALGVVDEVREAVRQPA 437 (448) T ss_pred HHHHHhHHHH-------HHHHHHhcCCccCCcCCCCCchhcccccCCCCCCCchhhcch Confidence 7788877775 468999999753222222222111111111110 11111111 No 128 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.63 E-value=9e-16 Score=103.08 Aligned_cols=404 Identities=12% Similarity=-0.010 Sum_probs=220.0 Q ss_pred CchHHHHHhhccCcccCC---ccc------c-chhhccccccccCcc-------------cccHHHHhccHHHHHHHHHH Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVT---PNQ------G-SQTGPVSAHGHLGDS-------------SINDERILQISTVWRCVSLI 70 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~---~~~------~-~~~~~~~~~~~~~~~-------------~vs~~~~~~~~~v~~~i~~i 70 (424) ++|++++.++|....... ... . .......++.. ... .-+...+..++.+..||+.+ T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~-~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~ 79 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQ-PLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRL 79 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCC-CCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 999999998885321100 000 0 00000001110 110 11233445579999999998 Q ss_pred HHhhcc-CceEEEE--eccCCccceeccchHH---HHHhhcC--CCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC--- Q lcl|NC_019705. 71 STLTAC-LPLDVFE--TDQNDNRKKVDLSNPL---ARLLRYS--PNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA--- 139 (424) Q Consensus 71 a~~ia~-~~~~v~~--~~~~~~~~~~~~~~~l---~~lL~~~--pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~--- 139 (424) .+.|-. ..+.+.- ...++...+. ....+ ....-.+ ....++.+++.+.++..++..|++|+.+.+... T Consensus 80 ~~nvVG~~G~~i~p~~l~~d~~~a~~-l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~ 158 (548) T protein:vir:95 80 EERVVGGSGIGVEPLPLRLDGSVHAE-LAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNY 158 (548) T ss_pred HHhccCccccceeeeecCCCHHHHHH-HHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccc Confidence 776554 2333322 2222111110 11111 1112222 233477889999999999999999998875432 Q ss_pred ----CceeEEEEecCceeEEee-------------c--CceEEEEEEe-----------CCceEEecHhHEEEeecCC-C Q lcl|NC_019705. 140 ----GDVISLLPLQSANMDVKL-------------V--GKKVVYRYQR-----------DSEYAEFSQKEIFHLKGFG-F 188 (424) Q Consensus 140 ----G~~~~l~~l~~~~v~~~~-------------~--~~~~~~~~~~-----------~~~~~~~~~~eiih~r~~~-~ 188 (424) ..+..|..|+|+++..-. | +....|.+.. ......+++++|+|+..+. . T Consensus 159 ~~g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~~~~~~rvpA~~VlHif~~~r~ 238 (548) T protein:vir:95 159 TFATSVPFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHLLKDHPGNLQTLGGSLAVKRVEAERIIHIAYRKRI 238 (548) T ss_pred cCCcccceEEEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEEeecCCCcccccccccceeeechhHheecccccCC Confidence 246788999998874211 1 1222333321 1124569999999998765 5 Q ss_pred CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcce-ecCCCce Q lcl|NC_019705. 189 TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLW-ILEAGFS 267 (424) Q Consensus 189 ~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~-~l~~g~~ 267 (424) ...+|+|.+..+...+......+.......+=.+...++|+.+.+..... +.....-..... -..|.++ .|..|.+ T Consensus 239 gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~--~~~~~~~~~~~~-~~pG~iv~~L~pGe~ 315 (548) T protein:vir:95 239 GQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTV--EPGKDRKNRTIP-IAPGMVFDDLEPGED 315 (548) T ss_pred ccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccC--CCCccccccccc-ccCCccccccCCCce Confidence 67999999999988888777666666666666677788887654321110 000000000000 1134443 5888999 Q ss_pred eeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHH-----------HHHHHHHHHHHHHH-H Q lcl|NC_019705. 268 TSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL-----------GFLQYTLQPYISRW-E 335 (424) Q Consensus 268 ~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~-----------~~~~~tl~P~~~~i-e 335 (424) ++.+..+.....|.+..+.....||+.+|||.+.|..-.. .+|+++-.... .|+..-++|+...+ + T Consensus 316 i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s--~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle 393 (548) T protein:vir:95 316 VGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYD--GTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQ 393 (548) T ss_pred eeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8888876556778999999999999999999988855333 35555443333 33344444533332 3 Q ss_pred HHHHhhccC-hh--hhc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC----------CCee Q lcl|NC_019705. 336 NSIQRWLIP-AK--DVG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG----------GDVA 401 (424) Q Consensus 336 ~~l~~~l~~-~~--~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g----------gd~~ 401 (424) .++-.-.++ +. ++. ...+.+-.-.....|+...+++...++++|+.|.-|+-++.|.++-+- -+++ T Consensus 394 ~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~ 473 (548) T protein:vir:95 394 MYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARARGRDPRELKKSRETEIKANRAA 473 (548) T ss_pred HHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHc Confidence 333332222 11 111 122333334455679999999999999999999999888888876320 0111 Q ss_pred eecccccch----hhccccCCC-----c----------------------------------ccCC Q lcl|NC_019705. 402 MRQSQYVPI----TDLGTNKEP-----R----------------------------------NNGA 424 (424) Q Consensus 402 ~~~~n~~~~----~~~~~~~~~-----~----------------------------------~~ga 424 (424) =++....+. ....+..++ . .+|| T Consensus 474 GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 539 (548) T protein:vir:95 474 GLVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLPVPGPDFPNESNNGGA 539 (548) T ss_pred CCCCCCcccccccccccCCCCchhhhccccccccccchhHHhhccCCCCCcCCCCCCCcccccCCC Confidence 111111110 000111000 0 0000 No 129 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.61 E-value=7.7e-15 Score=97.96 Aligned_cols=419 Identities=10% Similarity=0.006 Sum_probs=227.5 Q ss_pred CCCCccc--ccCCCCCchHHHHHhhccCcccCCccccchhhccccc-------cccCcccccHHHHhccHHHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYT--IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAH-------GHLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 1 ~~~~~~~--~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~vs~~~~~~~~~v~~~i~~ia 71 (424) |.-|-.- .++.--+ -...+.+.-.+...... ....+.+.... ....-..-+...+..++.+..||+.+. T Consensus 1 ~~~p~~~~~~~~~~~~-~~~~~~~y~~~a~~~~~-~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 78 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMT-SLREYAGYHGGGSGFGG-QLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQ 78 (533) T ss_pred CCCchhhhhhcccccc-hHHHHHhhhhccCCCCC-cccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 4333211 1111000 00111111111110000 00000010000 000011123445566899999999999 Q ss_pred HhhccCceEEEEecc------CCccceeccchHHHHH---hhcCCC------CCCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_019705. 72 TLTACLPLDVFETDQ------NDNRKKVDLSNPLARL---LRYSPN------QYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) Q Consensus 72 ~~ia~~~~~v~~~~~------~~~~~~~~~~~~l~~l---L~~~pn------~~~s~~~f~~~~~~~~ll~G~a~~~~~r 136 (424) +.+-+-.|++.-+.+ +++.. ......+..+ .-..++ ..++.+++...++..++..|++|+.+.+ T Consensus 79 ~nvVG~Gi~~~~~p~~~~lg~~~~~~-~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~ 157 (533) T protein:vir:34 79 DHIVGSFFRLSHRPSWRYLGIGEEEA-RAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATW 157 (533) T ss_pred HHhhCCCceeeeccchhhcCCChhHH-HHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeee Confidence 888776787654321 11111 0111122222 222333 3457889999999999999999998865 Q ss_pred CCC-C--ceeEEEEecCceeEEee----------------cCceEEEEEEe---CC----------ceEEecHhHEEEee Q lcl|NC_019705. 137 NSA-G--DVISLLPLQSANMDVKL----------------VGKKVVYRYQR---DS----------EYAEFSQKEIFHLK 184 (424) Q Consensus 137 ~~~-G--~~~~l~~l~~~~v~~~~----------------~~~~~~~~~~~---~~----------~~~~~~~~eiih~r 184 (424) ... | .+..|..|+|+++.... .+....|.+.. ++ ....++.++|+|+. T Consensus 158 ~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f 237 (533) T protein:vir:34 158 DTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVF 237 (533) T ss_pred ccCCCCccceEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeeccChhHeeeec Confidence 443 2 36788889887774211 11222343321 11 12346788999998 Q ss_pred cCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCC----------HHHHHHHHHHH---HHH Q lcl|NC_019705. 185 GFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT----------EQQRSQVEENF---KEI 250 (424) Q Consensus 185 ~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~----------~~~~~~~~~~~---~~~ 250 (424) .+. +...+|+|.+..+...+.-.....+......+-.+...++|+.+.+... +...+.+.... ... T Consensus 238 ~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (533) T protein:vir:34 238 EPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAY 317 (533) T ss_pred cccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhc Confidence 765 5679999999999888887777666666666667777888876543211 11111111111 111 Q ss_pred hCCc----ccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCC-CCCCcccchhH---------- Q lcl|NC_019705. 251 AGGP----VKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGD-VEKSTSWGSGI---------- 315 (424) Q Consensus 251 ~~~~----~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~-~~~~~~~~~n~---------- 315 (424) .++. ..|.+..|..|.+++.+..+-....|.+..+.....||+.+|||.+.|.. ..+.| |+++ T Consensus 318 ~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~n--YSS~R~~~~e~~r~ 395 (533) T protein:vir:34 318 YAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMS--YSTARASANESWAY 395 (533) T ss_pred cCcceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhccccc--HHHHHHHHHHHHHH Confidence 1111 24668889999999988877667788999999999999999999988843 33444 4443 Q ss_pred -HHHHHHHHHHHHHHHHHHH-HHHHHhhccC-hhh--------hc-ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCH Q lcl|NC_019705. 316 -EQQNLGFLQYTLQPYISRW-ENSIQRWLIP-AKD--------VG-RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTI 383 (424) Q Consensus 316 -e~~~~~~~~~tl~P~~~~i-e~~l~~~l~~-~~~--------~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~ 383 (424) +..+..|...-++|+...+ +.++....++ +.. +. ...+.+-.-.....|+....++...++++|+.|. T Consensus 396 ~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~ 475 (533) T protein:vir:34 396 FMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTY 475 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCH Confidence 3333345555556655543 3334333332 110 00 1223444455567799999999999999999999 Q ss_pred HHHHHHhCCCCCCC----------CCeeeecccccc--hhh---ccccCCCcccCC Q lcl|NC_019705. 384 NEMRRTDNLPPLPG----------GDVAMRQSQYVP--ITD---LGTNKEPRNNGA 424 (424) Q Consensus 384 NE~R~~~g~~p~~g----------gd~~~~~~n~~~--~~~---~~~~~~~~~~ga 424 (424) -|+-+..|.++-+- .+++=++....+ ... ..+.++++++++ T Consensus 476 ~~~~a~~G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~~~~~~~~~~~ 531 (533) T protein:vir:34 476 EKECAKRGDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSDSR 531 (533) T ss_pred HHHHHHcCCCHHHHHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCCCCCCCcccCC Confidence 99988888887421 122111111111 111 111223333333 No 130 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.58 E-value=3.8e-15 Score=99.63 Aligned_cols=406 Identities=11% Similarity=0.034 Sum_probs=210.3 Q ss_pred ccCCCCCchH-------HHHH-hhccCcccCCccccchhhccccccc--------cCcccccHHHHhccHHHHHHHHHHH Q lcl|NC_019705. 8 IDLRTNNGWW-------ARLQ-SWFVGGRLVTPNQGSQTGPVSAHGH--------LGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 8 ~~~~~~~G~~-------~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~vs~~~~~~~~~v~~~i~~ia 71 (424) |.+ ++.|+. .+.. +.+.+..... .+..+...+. ..-..-+...+..++.+..||+.+. T Consensus 1 m~~-~~~~~~a~~~~~~~~~~~~~y~aa~~~~-----~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 74 (495) T protein:vir:10 1 MNM-TPSGYQSLASGLLVPVGASAYEGASGGH-----RWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWV 74 (495) T ss_pred CCc-ccccccccchhhhhHHHhhhhhccccCc-----ccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 322 122322 1111 1111111000 0000100000 0001123344566899999999999 Q ss_pred HhhccCceEEEEeccCCccceeccchHHHHHhhcCC--CCCCCHHHHHHHHHHHHHHcCCeEEEEeeC--CCC--ceeEE Q lcl|NC_019705. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSP--NQYMTAQEFREAMTMQLCFYGNAYALVDRN--SAG--DVISL 145 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~p--n~~~s~~~f~~~~~~~~ll~G~a~~~~~r~--~~G--~~~~l 145 (424) +.+-+-.|+..-+..+....+... .+......++ ..-++.+.+.+.++..++..|++|+.+... ..| .+..| T Consensus 75 ~~vVG~Gi~p~~~~~~~~~~~~ie--~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~l 152 (495) T protein:vir:10 75 AAAVGNGLTPRWRMKEQELRQELQ--ELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQL 152 (495) T ss_pred HhhcCCCcccccCCchHHHHHHHH--HHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEE Confidence 988666776544333222111111 1111222222 334688889999999999999999987643 333 46789 Q ss_pred EEecCceeEEee------c-------------CceEEEEEEe---C--------CceEEecHhHEEEeecCCCCCcccCc Q lcl|NC_019705. 146 LPLQSANMDVKL------V-------------GKKVVYRYQR---D--------SEYAEFSQKEIFHLKGFGFTGLVGLS 195 (424) Q Consensus 146 ~~l~~~~v~~~~------~-------------~~~~~~~~~~---~--------~~~~~~~~~eiih~r~~~~~~~~G~s 195 (424) ..|+|+++.... + +....|.+.. + .....+++++|+|+.........|+| T Consensus 153 qliepd~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~~r~gQ~RGis 232 (495) T protein:vir:10 153 QIIEPDMLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAEHVLHVTVLTVRSDAGAP 232 (495) T ss_pred EEechhhcCCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechhheEeccccCCCcccCcc Confidence 999998875211 1 1122333321 1 12456999999999654466689998 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHH-HH-HHHHHHHHHhCCcccCcceecCCCceeeeccc Q lcl|NC_019705. 196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ-RS-QVEENFKEIAGGPVKKRLWILEAGFSTSAIGV 273 (424) Q Consensus 196 ~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~-~~-~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~ 273 (424) .+..+. .+.-....++......+-.+...++|+.+.+...... .. .....-......-+.|.+..|..|.+++.+.. T Consensus 233 ~la~i~-~l~~l~~y~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p 311 (495) T protein:vir:10 233 WFQLLL-RLNELDQYEDAELVRKKTAALFAAFIQEATADSTGGPTIGQPKRSKGGKRITGLNPGTLQYLQPGQEVKFSNP 311 (495) T ss_pred hhHHHH-HHHHhhHHHHHHHHHHHHhhhheeeeecCCCccccccccCccccccCcccceecCCceeeecCCCCeeeeeCC Confidence 665433 3443333333333333445566777775433111000 00 00000000001113467889999999999887 Q ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCcccchhHHHHHHHHH------------HHHHHHHHHH-HHHHHH Q lcl|NC_019705. 274 TPQDAEMMASRKFQVSELARFFGVPPHLV-GDVEKSTSWGSGIEQQNLGFL------------QYTLQPYISR-WENSIQ 339 (424) Q Consensus 274 ~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l-~~~~~~~~~~~n~e~~~~~~~------------~~tl~P~~~~-ie~~l~ 339 (424) +..-..|.+..+.....||+.+|||.+.| |...+.| |+++-+....|. ..-++|+... ++.++- T Consensus 312 ~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~n--YSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l 389 (495) T protein:vir:10 312 ADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVN--YSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVA 389 (495) T ss_pred CCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 65666788999999999999999999988 4443433 445433333332 2223343332 232332 Q ss_pred hhccC-hh--hhc--ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC----------CCeeeec Q lcl|NC_019705. 340 RWLIP-AK--DVG--RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG----------GDVAMRQ 404 (424) Q Consensus 340 ~~l~~-~~--~~~--~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g----------gd~~~~~ 404 (424) ...++ +. +.. ...+.+-.-.....|+....++...++++|+.|.-|+-+..|.++-+- .+++=++ T Consensus 390 ~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~ 469 (495) T protein:vir:10 390 SGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGYDMEELFDMISDANQLIDEYDLR 469 (495) T ss_pred cCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCC Confidence 22221 11 101 122334444556679999999999999999999999988888887421 1111111 Q ss_pred ccc--cchhhccccCCCcccCC Q lcl|NC_019705. 405 SQY--VPITDLGTNKEPRNNGA 424 (424) Q Consensus 405 ~n~--~~~~~~~~~~~~~~~ga 424 (424) ... .++...+..+++.++.+ T Consensus 470 ~~~~p~~~~~~~~~~~~~~~~~ 491 (495) T protein:vir:10 470 LDSDPRYVNGSGAEQKSVMEAA 491 (495) T ss_pred CCCCCCcCCCccCCCCCCCCCC Confidence 111 11112222222222222 No 131 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=99.57 E-value=3.9e-14 Score=94.12 Aligned_cols=409 Identities=11% Similarity=0.066 Sum_probs=213.3 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceE Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |-+-. +..+..-= .|+.+.+.-.....+.. ....+....-+.....+ .+..++++.|.+|++.+...|.+++|+ T Consensus 1 ~~~~~---~~~~gl~p-~rl~~i~~~~~~~~~~~-~~~~~~~~Lr~~~~~~l-y~~m~~D~hi~s~l~~Rk~av~~~~w~ 74 (488) T protein:vir:95 1 MADIT---ETQESLPP-FRMGEVGSLGLKVKNGR-IYEEPRQALRFPESIKT-FQLMMRDPAVAASVNIIKMFVRKVNWR 74 (488) T ss_pred CCCcc---ccCCCCCH-HHHHHHHHHhhccccch-hhccchhhhcccchHHH-HHHHhhChHHHHHHHHHHHHHhcCCce Confidence 33211 11111111 22333332111100000 00000000001111112 244567999999999999999999999 Q ss_pred EEEeccCCccce-eccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC-------------CC--ceeE Q lcl|NC_019705. 81 VFETDQNDNRKK-VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS-------------AG--DVIS 144 (424) Q Consensus 81 v~~~~~~~~~~~-~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~-------------~G--~~~~ 144 (424) |...+....... .....-+...|.. -..++.+++..+. +.+++|-+.++++... +| .+.. T Consensus 75 v~p~~~~~~d~~~~~~a~~v~~~l~~---~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~ 150 (488) T protein:vir:95 75 FVPPKGKEQDPKMLERADFFNSLMDD---MEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAK 150 (488) T ss_pred EecCCCCchhHHHHHHHHHHHHHHhc---cCccHHHHHHHHH-HhhcccceeeeeeeeccccccccccccccCCeeeeee Confidence 964332211110 0011123333332 1234567777775 5788999999987643 23 2556 Q ss_pred EEEecCc---eeEEeecCceEEEEEE---------------eCCceEEecHhHEEEeecCC-CCCcccCchHHHHHHHHH Q lcl|NC_019705. 145 LLPLQSA---NMDVKLVGKKVVYRYQ---------------RDSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAG 205 (424) Q Consensus 145 l~~l~~~---~v~~~~~~~~~~~~~~---------------~~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~~~i~ 205 (424) |.+.++. +..+..+++....... .......+++...++.++.. ...++|.+.+..+.-... T Consensus 151 i~~Rpq~~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~~~g~p~g~gLlr~~~w~~~ 230 (488) T protein:vir:95 151 LPIRNQSTLDKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDDEYGNPEGRSPLLNAYVPWK 230 (488) T ss_pred eeecCcccccceeeccCCCceeecccccccccccccccccccccccccccccceEEEeecCCCCccchhhHHHHHHHHHH Confidence 6666664 3344444433221100 01234557888876666554 455899999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCCceeEEcCCC----CCCHHHHHHHHHH---HHHHhCCcccCcceecCCCceee--------- Q lcl|NC_019705. 206 VAVAMEDQQRDFFANGAKSPQILSTGEK----VLTEQQRSQVEEN---FKEIAGGPVKKRLWILEAGFSTS--------- 269 (424) Q Consensus 206 ~~~~~~~~~~~~~~ng~~~~~vl~~~~~----~~~~~~~~~~~~~---~~~~~~~~~ag~~~~l~~g~~~~--------- 269 (424) ......++...|...-+.+--+...+.+ ..+++....+++. ..+...+..+| ++++.|++.. T Consensus 231 fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag--~iiP~g~~~~~k~~~~e~~ 308 (488) T protein:vir:95 231 YKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAG--LIWPRYIDPDTKEDIFEFS 308 (488) T ss_pred HHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhh--eeeccccccccchhhhhhh Confidence 9999999999999875444434444322 2233332222222 22333333343 5566655322 Q ss_pred ecccC-hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChh-- Q lcl|NC_019705. 270 AIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-- 346 (424) Q Consensus 270 ~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~-- 346 (424) .++.. ..-..|.+..++..++|+.+.--.- |....+++.+++ ..+.........+.-.++.|++.||++|+.+. T Consensus 309 l~~~~~~~~~~~~~li~~~d~~Isk~iLGqt--LT~~~~~~Gs~A-l~~vh~ev~~~i~~aDa~~i~~tln~~li~~l~~ 385 (488) T protein:vir:95 309 LVSRQGAKAYDTGSIIDRYSKQIMMAFMSDV--LAMGQSKYGSFS-LADSKTSLLAMSVDILLKQIKNVINRDLVAQTYA 385 (488) T ss_pred ccccccCCchhHHHHHHHHHHHHHHHHhccc--cccccCcchhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22221 2223466677777788877653221 111112222322 23445566777888899999999999887653 Q ss_pred ---hhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCH-----HHHHHHhCCCCCCCCCeeeecccccchhhccccCC Q lcl|NC_019705. 347 ---DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTI-----NEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKE 418 (424) Q Consensus 347 ---~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~-----NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~ 418 (424) .....+.+|-++.....|.+..++.+.++++.|+.-+ +.+|+.+|+|+-++++....+....+-..+++... T Consensus 386 ~Nfg~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~~~e~~~~~~~~~~~~~~~~~~~ 465 (488) T protein:vir:95 386 LNMWDDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPADESQPVSEKLSPNSQSRSGDGYK 465 (488) T ss_pred hcCCCCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCccccccCCCCCCCCCCcccC Confidence 1111122333344456777889999999999998765 57999999997666655554432222111111111 Q ss_pred CcccCC Q lcl|NC_019705. 419 PRNNGA 424 (424) Q Consensus 419 ~~~~ga 424 (424) ...+++ T Consensus 466 ~~~~~~ 471 (488) T protein:vir:95 466 TAGEGT 471 (488) T ss_pred CCcccC Confidence 111111 No 132 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=99.55 E-value=4.1e-14 Score=93.97 Aligned_cols=373 Identities=12% Similarity=0.024 Sum_probs=213.2 Q ss_pred ccccCCCCC--chHHHHHhhccCcccCCccccchhhccccccccCcc-----cccHHHHhccHHHHHHHHHHHHhhccCc Q lcl|NC_019705. 6 YTIDLRTNN--GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDS-----SINDERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 6 ~~~~~~~~~--G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~vs~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) ..|++|+.- -+-+.+.....+...... .. .+-......++. .+..+...+++.|++|++.+...|.+++ T Consensus 1 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~g--~~--~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~~ 76 (446) T protein:vir:98 1 MNMEVRNAPTPAIRRRTIYAMEHLGLATS--YL--SEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNKV 76 (446) T ss_pred CcccccCCCchhhhhhhhhccccchhhcc--cC--CcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcCC Confidence 456666522 111111110000000000 00 000000011111 1222323358999999999999999999 Q ss_pred eEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-C--ce-------eEEEEe Q lcl|NC_019705. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-G--DV-------ISLLPL 148 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~-G--~~-------~~l~~l 148 (424) |.|.-.+ +++ ..-+...|.... .++....+.+.+.+|-++.+++.... | .| +++.|+ T Consensus 77 w~V~p~~-----~~~--a~~v~~~l~~~~------~~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~~~~ 143 (446) T protein:vir:98 77 GPYQHGD-----KRI--KKFIDDQLRNRA------KTWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNYHPL 143 (446) T ss_pred ceecCcc-----HHH--HHHHHHHHhhcC------chhHHHHHHHHHhhCceeeeEEEeecccccccchhhccccccccc Confidence 9995321 111 123555565322 34555557899999999999876422 1 12 223333 Q ss_pred cCceeEEeecCce-----E--------------EE-------EEEeCCceEEecHhHEEEeecCCC-CCcccCchHHHHH Q lcl|NC_019705. 149 QSANMDVKLVGKK-----V--------------VY-------RYQRDSEYAEFSQKEIFHLKGFGF-TGLVGLSPIAFAC 201 (424) Q Consensus 149 ~~~~v~~~~~~~~-----~--------------~~-------~~~~~~~~~~~~~~eiih~r~~~~-~~~~G~s~i~~~~ 201 (424) ++. .....++.. . .+ .....+....+|....+++++... ..++|.+.+..+. T Consensus 144 ~~r-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi~~~~~~~~~~p~G~gLlr~~~ 222 (446) T protein:vir:98 144 QVM-LIANDNGRIVDGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRLFINYNTKGNNPWGTSCLTSVL 222 (446) T ss_pred cce-eeeccCCccccccccchhhcccccccCcccchhhhhhhhcccCcccccccccceEEEEecCCCCCccccchHHHHH Confidence 332 111111100 0 00 001122345688889888887654 4589999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHH--------HHHH-HHHHHHHhCCc-ccCcce---ecCCCcee Q lcl|NC_019705. 202 KSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ--------RSQV-EENFKEIAGGP-VKKRLW---ILEAGFST 268 (424) Q Consensus 202 ~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~--------~~~~-~~~~~~~~~~~-~ag~~~---~l~~g~~~ 268 (424) -.........++...|...-+.|--+.+++.+.++++. .+.. ++..++..... +++.++ .+++|+++ T Consensus 223 w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g~ei 302 (446) T protein:vir:98 223 DYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQPVQV 302 (446) T ss_pred HHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCCceE Confidence 99999999999999999999999888888765442211 1112 23344433322 233232 34888888 Q ss_pred eecccChh-HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhh Q lcl|NC_019705. 269 SAIGVTPQ-DAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKD 347 (424) Q Consensus 269 ~~l~~~~~-d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~ 347 (424) +-++.... ...|.+..++..++|+.+.....-.++...+.+.+++-. +.......+.++-.+++|++.+|+.|+.+.= T Consensus 303 e~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala-~vh~~V~~d~~~aDa~~i~~tln~~Li~~l~ 381 (446) T protein:vir:98 303 GALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRAS-EIQLELFDGKINSIFDTVIHAFTEQVIGNLI 381 (446) T ss_pred EeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 76654322 234788889999999998877644444433322333222 3344556677888999999999998865431 Q ss_pred hc------------ccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCH---HHHHHHhCCCCCCCCCe Q lcl|NC_019705. 348 VG------------RIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTI---NEMRRTDNLPPLPGGDV 400 (424) Q Consensus 348 ~~------------~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~---NE~R~~~g~~p~~ggd~ 400 (424) .- ..+++|+.. ...|.+..++...++++.|..++ +.+|+.+|+|+-+. |+ T Consensus 382 ~lNf~~~~~~~~~~~~~~~~~~~--e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~-~~ 446 (446) T protein:vir:98 382 RLNFDPALYPLASNTGYITRLPG--RATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAIS-ST 446 (446) T ss_pred HhCCCccccccccccccceeccC--ChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCC-CC Confidence 10 012234332 24677888999999999998765 45999999976422 22 No 133 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=99.50 E-value=1.9e-14 Score=95.85 Aligned_cols=386 Identities=13% Similarity=0.090 Sum_probs=188.8 Q ss_pred CCCCcccc---------cC-CCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTI---------DL-RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLI 70 (424) Q Consensus 1 ~~~~~~~~---------~~-~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~i 70 (424) |.+- .++ .+ |+|.|+.+-+.+.-.++ ...+..++....+....+ ...|-.+..+..||+.+ T Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~rd~l~~~~~glg~~r-------~~~~~~~g~~~~~~~~~l-~~~Yr~~~ia~~iVd~~ 71 (449) T protein:vir:10 1 MTDK-LTLAVNHALNDARMARARMGLMVPTMGLDNKR-------HSAWCEYGFPELVTYENL-YSLYRRGGIAHGAVEKL 71 (449) T ss_pred Cchh-hHHHHhhhcchhHHHHHHHHHHHHHhcCCccc-------chhhhhcCCcccCCHHHH-HHHHhcCchhHHHHHhh Confidence 4332 221 00 23334444322221111 111111222112221111 22344578889999999 Q ss_pred HHhhccCceEEEEeccCCcc-ceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEE-eeCCC--------- Q lcl|NC_019705. 71 STLTACLPLDVFETDQNDNR-KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV-DRNSA--------- 139 (424) Q Consensus 71 a~~ia~~~~~v~~~~~~~~~-~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~-~r~~~--------- 139 (424) ++.+-.--..+....+.... .+......+.+++.. .-+..+....-+ -.++|-|++++ +++.. T Consensus 72 ~d~~~~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~~-----~~~~~l~ea~~~-~rl~Gga~i~i~v~d~~~l~~Pl~~~ 145 (449) T protein:vir:10 72 VGKCWQTNPEIIEGDDADDSEDETSWEKKSKQVFTN-----RLWRSFAEADRR-RLVGRYAGILLHIRDEKDWNLPATKG 145 (449) T ss_pred hhhhhhcCcccccCccccchhhhHHHHHHHHHHHHH-----HHHHHHHHHHHh-hhccCcEEEEEEecCCCCCCcccccC Confidence 98763222222222211111 111111112222221 012223333333 34577666665 44432 Q ss_pred CceeEEEEecCceeEEe---ec------CceEEEEEEe---C--CceEEecHhHEEEeecCCCCCcccCchHHHHHHHHH Q lcl|NC_019705. 140 GDVISLLPLQSANMDVK---LV------GKKVVYRYQR---D--SEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAG 205 (424) Q Consensus 140 G~~~~l~~l~~~~v~~~---~~------~~~~~~~~~~---~--~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~ 205 (424) +.+..|.|+....+++. .| +....|++.. + ..+..|.++.|+||-... .-|.|.++.++..+. T Consensus 146 ~~i~~i~v~~~~~i~~~~~~~dp~sp~yg~P~~y~v~~~~~g~~~~~~~iH~SRl~~~~~~~---~~g~~~L~~~yn~l~ 222 (449) T protein:vir:10 146 RGLQKVSVSWAGSLKVAEWDTGINSKTYGQPKLWKYTERLPNGSSRRVDIHPDRVFILGDYS---EDAIGFLEPAYNAFV 222 (449) T ss_pred cceeeEEeeccccCChhhhhcCCCCCCCCCceEEEEeeeccCCCccceeeccceeEeecCCC---CCChhHHHHHHHHhh Confidence 24566777765555432 11 1233444432 1 234568888888885433 336777777766543 Q ss_pred HHHHHH-HHHHHHHhcCC-----------CCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccc Q lcl|NC_019705. 206 VAVAME-DQQRDFFANGA-----------KSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGV 273 (424) Q Consensus 206 ~~~~~~-~~~~~~~~ng~-----------~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~ 273 (424) ....+. .+...+++|-. ...++.... +...++..+.+.+..+....+.+ .+.++.+.++++++. T Consensus 223 ~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~-~~~~e~~~~~~~~~~~~~~~~~~---~~~i~~~~d~~~~~~ 298 (449) T protein:vir:10 223 SLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLY-GVSIDELQDKFNEVAGEINRGND---VLMTTQGATVTPLVT 298 (449) T ss_pred hHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHh-hCCchHHHHHHHHHHHHHhccch---heeecCCcceEEEec Confidence 222221 22222222211 111111111 11233334445545544443332 456677778999988 Q ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHH------HHHHHHHHHHHHHHhhccChhh Q lcl|NC_019705. 274 TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY------TLQPYISRWENSIQRWLIPAKD 347 (424) Q Consensus 274 ~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~------tl~P~~~~ie~~l~~~l~~~~~ 347 (424) ++.+.. +.......+||++-|||..+|-+...+..+. + ++ .++|+.. -|.|.++.+-+.|-+.-+.... T Consensus 299 ~~sgl~--d~l~~~~q~iaaa~~IP~t~L~Gqsp~glns-t-~D-~~nyyd~i~~~Q~~l~p~le~l~~~l~~s~~g~~~ 373 (449) T protein:vir:10 299 SVADPT--ATYNVNLQTAAAGVDIPTRILIGNQQAERSS-T-ED-QKYFNARCQSRRVDLSFEIEDFCDKLIELKIIDAV 373 (449) T ss_pred ccCChh--HHHHHHHHHHHHHhCCCeeeeeccCcccccc-c-hh-HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCC Confidence 877664 6677788889999999999887766655542 2 33 3344432 3567777776666544433221 Q ss_pred hcccchhhhhhhhhccCHHHHHH-------HHHHHHhCC---CCCHHHHHHHhCCCCCCCCCeeeecccccchhhccccC Q lcl|NC_019705. 348 VGRIHAEHNLDGLLRGDSASRAA-------FMKAMGEAG---LRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNK 417 (424) Q Consensus 348 ~~~~~~~fd~~~l~~~d~~~~~~-------~~~~~~~~g---~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~ 417 (424) ..+.|.++.|...+.+++++ ++++++++| +++++|+|+.+|++|.. ++.+ +-++..+.+ T Consensus 374 ---~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~~~~~EiR~~~~~~~~~-~~~~-------~~e~~de~~ 442 (449) T protein:vir:10 374 ---AKKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPAFSREEIRTAAGYDNDD-EEPL-------GEEDGDEED 442 (449) T ss_pred ---CceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCCcCHHHHHHHhcccCCC-CCCC-------CCCCCcccc Confidence 12456667888888888755 444666666 99999999999999853 2211 122222333 Q ss_pred CCcccCC Q lcl|NC_019705. 418 EPRNNGA 424 (424) Q Consensus 418 ~~~~~ga 424 (424) +..+.+| T Consensus 443 ~~~d~~a 449 (449) T protein:vir:10 443 KATDSAA 449 (449) T ss_pred ccCCcCC Confidence 4444455 No 134 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.50 E-value=1.5e-13 Score=90.92 Aligned_cols=418 Identities=11% Similarity=-0.006 Sum_probs=217.7 Q ss_pred CCCCccc-cc-CCCCCchHHHHHhhccCcccCCccccc--hhhccccc-------cccCcccccHHHHhccHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYT-ID-LRTNNGWWARLQSWFVGGRLVTPNQGS--QTGPVSAH-------GHLGDSSINDERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~~~~~~-~~-~~~~~G~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~-------~~~~~~~vs~~~~~~~~~v~~~i~~ 69 (424) |-...-. +. ++..++. .+.+.....-.-+...... .+.+.... ....-..-+...+..++.+..+|+. T Consensus 1 m~~~~~r~~~~~a~~~~~-~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~ 79 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPE-QSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGY 79 (553) T ss_pred Ccchhhhhhcccccccch-hhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 1100000 00 0000000 0000000000000000000 00000000 0000011233445668999999999 Q ss_pred HHHhhccCceEEEEecc-------CCccceeccchHHH---HHhhcCCC------CCCCHHHHHHHHHHHHHHcCCeEEE Q lcl|NC_019705. 70 ISTLTACLPLDVFETDQ-------NDNRKKVDLSNPLA---RLLRYSPN------QYMTAQEFREAMTMQLCFYGNAYAL 133 (424) Q Consensus 70 ia~~ia~~~~~v~~~~~-------~~~~~~~~~~~~l~---~lL~~~pn------~~~s~~~f~~~~~~~~ll~G~a~~~ 133 (424) +.+.+-+-.|++.-+.+ ++... ......+. ...-..++ ..++.+++...++..++..|++|+. T Consensus 80 ~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~-~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~ 158 (553) T protein:vir:63 80 QRDSIVGAQYRLNSMPDINVIPGATEEWA-EEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLAT 158 (553) T ss_pred HHHhhccCCceeeeccchhhhcCCCHHHH-HHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEE Confidence 98887777887754321 11111 01111222 22223333 3457888999999999999999998 Q ss_pred EeeCCC-C--ceeEEEEecCceeEEee----------------cCceEEEEEEe---CC---------------ceEEec Q lcl|NC_019705. 134 VDRNSA-G--DVISLLPLQSANMDVKL----------------VGKKVVYRYQR---DS---------------EYAEFS 176 (424) Q Consensus 134 ~~r~~~-G--~~~~l~~l~~~~v~~~~----------------~~~~~~~~~~~---~~---------------~~~~~~ 176 (424) +..... | .+..|..|+|+++.... .+....|.+.. +. ....++ T Consensus 159 ~~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~r~~~~~~v~ 238 (553) T protein:vir:63 159 AEWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKWKFVQQSKPWG 238 (553) T ss_pred eeeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCccccccccccceeeeccccccC Confidence 765432 2 35678889988874222 11222333321 10 123578 Q ss_pred HhHEEEeecCC-CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHH-------- Q lcl|NC_019705. 177 QKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENF-------- 247 (424) Q Consensus 177 ~~eiih~r~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~-------- 247 (424) +.+|+|+..+. +...+|+|.+..+...+.-............+-.+...++|+.+.+ ++...+.+.... T Consensus 239 a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~--~~~~~~~~~~~~~~~~~~~~ 316 (553) T protein:vir:63 239 RRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELP--PEFIHSQMSGGSPNADMVGI 316 (553) T ss_pred hhHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCC--hhhhhhhccccccccccccc Confidence 99999998764 5679999999999888887777666666666666777888886532 222211111110 Q ss_pred --------HHHhCC-----cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCcccch Q lcl|NC_019705. 248 --------KEIAGG-----PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLV-GDVEKSTSWGS 313 (424) Q Consensus 248 --------~~~~~~-----~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l-~~~~~~~~~~~ 313 (424) .....+ -..|.+..|..|.+++-+..+-...+|.+..+.....||+.+|||.+.| +...+.| |+ T Consensus 317 ~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~n--YS 394 (553) T protein:vir:63 317 FGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKAN--YS 394 (553) T ss_pred ccccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhccccc--HH Confidence 001111 1246788899999999888775667789999999999999999999877 4333444 44 Q ss_pred hHH-----------HHHHHHHHHHHHHHHHHH-HHHHHhhccC-hhh-hc-----------ccchhhhhhhhhccCHHHH Q lcl|NC_019705. 314 GIE-----------QQNLGFLQYTLQPYISRW-ENSIQRWLIP-AKD-VG-----------RIHAEHNLDGLLRGDSASR 368 (424) Q Consensus 314 n~e-----------~~~~~~~~~tl~P~~~~i-e~~l~~~l~~-~~~-~~-----------~~~~~fd~~~l~~~d~~~~ 368 (424) ++- ..+..|...-++|+...+ +.++-...++ +.. .. ...+.+-.-.....|+... T Consensus 395 S~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke 474 (553) T protein:vir:63 395 SIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKE 474 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHH Confidence 432 223334455555654443 3333322221 111 00 0112333334456799999 Q ss_pred HHHHHHHHhCCCCCHHHHHHHhCCCCCCC----------CCeeeecccccchhhcc----ccCCCc-----ccCC Q lcl|NC_019705. 369 AAFMKAMGEAGLRTINEMRRTDNLPPLPG----------GDVAMRQSQYVPITDLG----TNKEPR-----NNGA 424 (424) Q Consensus 369 ~~~~~~~~~~g~~t~NE~R~~~g~~p~~g----------gd~~~~~~n~~~~~~~~----~~~~~~-----~~ga 424 (424) .++...++++|+.|.-|+-+..|..+-+- .+++=++....+-...+ ...++. ++++ T Consensus 475 ~~A~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 549 (553) T protein:vir:63 475 TQAAVMRIDAGLSTYEREIARLGGDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGRDAATGIAEDPAAAQTS 549 (553) T ss_pred HHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCcccCCCCCCCCCCCCcc Confidence 99999999999999999988888876321 11111111111100000 000011 1111 No 135 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=99.48 E-value=7.4e-14 Score=92.58 Aligned_cols=404 Identities=11% Similarity=0.024 Sum_probs=205.7 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCcc-ccchh--------hccccccccCcccccHHHHhccHHHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPN-QGSQT--------GPVSAHGHLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~-~~~~~--------~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia 71 (424) .-||.-...|.+.-- +.--...+....+. ..... .++.+.+..++.. -....+.|.+++|+.+|+ T Consensus 60 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~--la~laQ~~eyr~~~~~ia 133 (695) T protein:vir:36 60 VVEPSPSLRLARQFE----VDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPT--LVLLAQLPEYRAMHEVLA 133 (695) T ss_pred ccCCCcccccceece----ecccccCccccchhhhhhcccccccccchhhhccCcchHHH--HHHHhhccchhhHHHHHH Confidence 445544433332110 00000000000000 00000 0111111111111 234567899999999999 Q ss_pred HhhccCceEEEEeccC----------CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC--- Q lcl|NC_019705. 72 TLTACLPLDVFETDQN----------DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS--- 138 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~----------~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~--- 138 (424) +.+..-=..+...... +..... .+...-..|...-..+.-+..| ...+.+--++|-+.+++.-++ T Consensus 134 ~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~-~d~dqik~L~~e~erL~V~~~l-~eaik~aRlfGGa~~~i~i~gdd~ 211 (695) T protein:vir:36 134 DECIRTWGEAIGGTKEKADTSGLAAGGNAAST-SDGDQLKQINDEIERLRIRDAV-RTTVIHDQAFGRAHPYFKIKGDDQ 211 (695) T ss_pred HHhhcccceecccchhhhhhcccccccccccc-CchHHHHHHHHHHHHHHHHHHH-HHHHHhhccccceEEEEEeccCcc Confidence 9887652222111000 000000 0001122222221112222333 344444556666665553322 Q ss_pred --------------CCceeEEEEecCceeEEeecC----------ceEEEEEEeCCceEEecHhHEEEeecCCC-C---- Q lcl|NC_019705. 139 --------------AGDVISLLPLQSANMDVKLVG----------KKVVYRYQRDSEYAEFSQKEIFHLKGFGF-T---- 189 (424) Q Consensus 139 --------------~G~~~~l~~l~~~~v~~~~~~----------~~~~~~~~~~~~~~~~~~~eiih~r~~~~-~---- 189 (424) .|....|.+|+|+.|++...+ ...+|. ..+ ..|-.+.++.|..... + T Consensus 212 ~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~--V~G--~kIH~SRL~~f~g~plPd~LKp 287 (695) T protein:vir:36 212 IMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWW--MIG--TEVHATRLHTIVSRPVGDMLKP 287 (695) T ss_pred ccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEE--Eec--eEEeeeeEEEecCCCchhhhhc Confidence 244566888999888764321 111222 222 3455666665554322 2 Q ss_pred --CcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcC-CCCCCHHHHHHHHHHHHHHhCCcccCcceecC-CC Q lcl|NC_019705. 190 --GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG-EKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-AG 265 (424) Q Consensus 190 --~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~-~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-~g 265 (424) .+.|+|..+.+...+............+... ....++.+-- ........ ..+..+.+-........++++++ .. T Consensus 288 ~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~-~~v~~lk~dla~aL~~g~~-~~l~~R~eli~~~Rsn~G~~llDk~~ 365 (695) T protein:vir:36 288 TYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ-FSVSGILMDLAQALMPGAN-VDLSMRAELINRYRDNRNILFLDKAT 365 (695) T ss_pred ccccCcccHHHHHHHHHHHHHHHHhHHHHHHHh-hhHHHHHHHHHHhhcChhH-HHHHHHHHHHHHhcCccceEEEecCC Confidence 3469999999999988887777777766643 2223321100 11112222 22222222211112234578888 47 Q ss_pred ceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHH-------HHHHHHHHHHHHHH Q lcl|NC_019705. 266 FSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSI 338 (424) Q Consensus 266 ~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~-------~tl~P~~~~ie~~l 338 (424) .+|.+.+.+...++ +........||.+-+||...|-+......+ ++.|...++||+ .-|+|.++.+-+.+ T Consensus 366 Eefeq~stslSGLd--dVi~qf~q~VAgaa~IPltkLfGqSPkGlN-ATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii 442 (695) T protein:vir:36 366 EEFFQFNTPLSGLD--ALQAQAQEQMSAVSHIPLIKLLGITPTGLN-ASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMI 442 (695) T ss_pred cceEEEecccCCHH--HHHHHHHHHHHhhhcCchhhhhccCccccc-ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 89999987777664 777778889999999999988776654442 233444455554 46889999988888 Q ss_pred HhhccChhhhcccchhhhhhhhhccCHHHHHHH-------HHHHHhCCCCCHHHHHHHhCCCCC-------CCCCeeeec Q lcl|NC_019705. 339 QRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAF-------MKAMGEAGLRTINEMRRTDNLPPL-------PGGDVAMRQ 404 (424) Q Consensus 339 ~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~-------~~~~~~~g~~t~NE~R~~~g~~p~-------~ggd~~~~~ 404 (424) -+..|...+. .+.|.++.|..++.++++++ ...+++.|+++++|+|+.+.-+|- +-.|++-.| T Consensus 443 ~rS~~G~idp---di~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~ 519 (695) T protein:vir:36 443 QLSLFGAVDP---SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVP 519 (695) T ss_pred HHHhcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcC Confidence 8777766543 35566677877777766554 557788999999999999887652 223444444 Q ss_pred ccc-cchh-----hccccCCC-cccCC Q lcl|NC_019705. 405 SQY-VPIT-----DLGTNKEP-RNNGA 424 (424) Q Consensus 405 ~n~-~~~~-----~~~~~~~~-~~~ga 424 (424) ... ++.. ..++.++. ...|| T Consensus 520 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 546 (695) T protein:vir:36 520 ADDDIDGVLTYVQRLAEGGDTGAPGGA 546 (695) T ss_pred ccchhhhhHhhhcCcccccccCCCCcc Confidence 322 1111 11111111 11111 No 136 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=99.48 E-value=9.5e-14 Score=91.98 Aligned_cols=404 Identities=11% Similarity=0.031 Sum_probs=207.8 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccc-cchh--------hccccccccCcccccHHHHhccHHHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQ-GSQT--------GPVSAHGHLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~-~~~~--------~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia 71 (424) ..||.-...|.+.-- +.--...+....+.+ .... .++.+.+..++.. -....+.|.+++|+.+|+ T Consensus 60 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~--la~laQ~~eyr~~~~~ia 133 (695) T protein:vir:78 60 VAEPSPSLRLARQFE----VDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPT--LVLLAQLPEYRAMHEVLA 133 (695) T ss_pred ccCCCcccccceece----eccccCCccccchhhhhhcccccccccchhhhccCcchHHH--HHHHhhccchhhHHHHHH Confidence 556655444332111 000000000000000 0000 0111111111111 234567899999999999 Q ss_pred HhhccCceEEEEeccC----------CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC--- Q lcl|NC_019705. 72 TLTACLPLDVFETDQN----------DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS--- 138 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~----------~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~--- 138 (424) +.+..-=..+...... +..... .+...-..|...-..+.-+..|.+. +.+--++|-+.+++.-++ T Consensus 134 ~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~-~d~dqi~~L~~e~erL~V~~~l~ea-ik~aRlfGGa~~~i~i~gdd~ 211 (695) T protein:vir:78 134 DECIRTWGEAIGGTKEKADTSGLAAGGNAAST-SDGDQLKQINDEIERLRIRDAVRTT-VIHDQAFGRAHPYFKIKGDDQ 211 (695) T ss_pred HHhhcccceeccccchhhhhhccccccccccc-ccHHHHHHHHHHHHHHHHHHHHHHH-HHhhccccceEEEEEeccCcc Confidence 9887652222111000 000000 0111222333222222223334444 444455666665553322 Q ss_pred --------------CCceeEEEEecCceeEEeecC----------ceEEEEEEeCCceEEecHhHEEEeecCCC-C---- Q lcl|NC_019705. 139 --------------AGDVISLLPLQSANMDVKLVG----------KKVVYRYQRDSEYAEFSQKEIFHLKGFGF-T---- 189 (424) Q Consensus 139 --------------~G~~~~l~~l~~~~v~~~~~~----------~~~~~~~~~~~~~~~~~~~eiih~r~~~~-~---- 189 (424) .|....|.+|+|+.|++...+ ...+|. ..+ ..|-.+.++.|..... + T Consensus 212 ~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~--V~G--~kIH~SRL~~f~g~plPd~LKp 287 (695) T protein:vir:78 212 IMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWW--MIG--TEVHATRLHTIVSRPVGDMLKP 287 (695) T ss_pred ccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEE--Eec--eEEeeeeEEEecCCCchhhhhc Confidence 244566888999888764321 111222 222 3455666665554322 2 Q ss_pred --CcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcC-CCCCCHHHHHHHHHHHHHHhCCcccCcceecC-CC Q lcl|NC_019705. 190 --GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG-EKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-AG 265 (424) Q Consensus 190 --~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~-~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-~g 265 (424) .+.|+|..+.+...+............+... ....++.+-- ....+... ..+..+.+-........++++++ .. T Consensus 288 ~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~-~~v~~lk~dla~~L~~g~~-~~l~~R~eli~~~Rsn~G~~llDk~~ 365 (695) T protein:vir:78 288 TYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ-FSVSGILMDLAQALMPGAN-VDLSMRAELINRYRDNRNILFLDKAT 365 (695) T ss_pred ccccCcccHHHHHHHHHHHHHHHHhHHHHHHHh-hhhHHHHHHHHHhhcChhH-HHHHHHHHHHHHhcCccceEEEecCC Confidence 3469999999999988888777777776644 3333331110 11112222 22322233211112234578888 47 Q ss_pred ceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHH-------HHHHHHHHHHHHHH Q lcl|NC_019705. 266 FSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSI 338 (424) Q Consensus 266 ~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~-------~tl~P~~~~ie~~l 338 (424) .+|.+.+.+...++ +........||.+-+||...|-+......+ ++.|...++||+ .-|+|.++.+-+.+ T Consensus 366 Eefeq~stslSGLd--dVi~qf~q~VAgaa~IPltkLfGqSPkGlN-ATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii 442 (695) T protein:vir:78 366 EEFFQFNTPLSGLD--ALQAQAQEQMSAVSHIPLIKLLGITPTGLN-ASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMI 442 (695) T ss_pred cceEEEecccCCHH--HHHHHHHHHHHhhhcCchhhhhccCCcccc-ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 89999987777664 777778889999999999988776654442 233444555554 46889999988888 Q ss_pred HhhccChhhhcccchhhhhhhhhccCHHHHHHH-------HHHHHhCCCCCHHHHHHHhCCCCC-------CCCCeeeec Q lcl|NC_019705. 339 QRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAF-------MKAMGEAGLRTINEMRRTDNLPPL-------PGGDVAMRQ 404 (424) Q Consensus 339 ~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~-------~~~~~~~g~~t~NE~R~~~g~~p~-------~ggd~~~~~ 404 (424) -+..|...+. .+.|.++.|..++.++++++ ...+++.|+++++|+|+.+.-+|- +-.|++-.| T Consensus 443 ~rS~~G~idp---di~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~ 519 (695) T protein:vir:78 443 QLSLFGAVDP---SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVP 519 (695) T ss_pred HHHhcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcC Confidence 8777766543 35566677877777766554 557788999999999999887652 223444444 Q ss_pred ccc-cchh-----hccccCC---CcccCC Q lcl|NC_019705. 405 SQY-VPIT-----DLGTNKE---PRNNGA 424 (424) Q Consensus 405 ~n~-~~~~-----~~~~~~~---~~~~ga 424 (424) ... ++.. ..++.++ +.+.+| T Consensus 520 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 548 (695) T protein:vir:78 520 ADDDIDGVLTYVQRLAEGGDTGAPGGARA 548 (695) T ss_pred ccchhhhhHhhhcCcccccccCCCCCCCC Confidence 322 1111 1111111 111111 No 137 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=99.46 E-value=8.2e-14 Score=92.34 Aligned_cols=404 Identities=11% Similarity=0.030 Sum_probs=205.5 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCc-cccchh--------hccccccccCcccccHHHHhccHHHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTP-NQGSQT--------GPVSAHGHLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~-~~~~~~--------~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia 71 (424) .-||.-+..|....- +..-...+....+ ...... .++.+.+..++.. -....+.|.+++|+.+|+ T Consensus 59 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~--la~laQ~~eyr~~~~~ia 132 (694) T protein:vir:10 59 VAEPSPSLRLARQFE----VDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPT--LVLLAQLPEYRAMHEVLA 132 (694) T ss_pred cCCCCcchhhhhhcc----ccccCCCccccchhhhhhccCcccccchhhhhccCcchHHH--HHHHhhccchhhHHHHHH Confidence 444543222211100 0000000000000 000000 0011111111111 234567899999999999 Q ss_pred HhhccCceEEEEeccC----------CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC--- Q lcl|NC_019705. 72 TLTACLPLDVFETDQN----------DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS--- 138 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~----------~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~--- 138 (424) +.+..-=..+...... +..... .+...-..|...-..+.-+..|.+. +.+--++|-+.+++.-++ T Consensus 133 ~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~-~d~dqi~~L~~e~erl~V~~~l~ea-ik~aRlfGGa~~~i~I~gdd~ 210 (694) T protein:vir:10 133 DECIRTWGEAIGGTKEKADTSGLAAGGNAAST-SDGDQLKQINDEIERLRIRDAVRTT-VIHDQAFGRAHPYFKIKGDDQ 210 (694) T ss_pred HHhhcccceeccccchhhhhhccccccccccc-ccHHHHHHHHHHHHHHHHHHHHHHH-HHhhccccceEEEEEeecCcc Confidence 9887652222111000 000000 0111222333222222223334444 444455666665553222 Q ss_pred --------------CCceeEEEEecCceeEEeecC----------ceEEEEEEeCCceEEecHhHEEEeecCCC-C---- Q lcl|NC_019705. 139 --------------AGDVISLLPLQSANMDVKLVG----------KKVVYRYQRDSEYAEFSQKEIFHLKGFGF-T---- 189 (424) Q Consensus 139 --------------~G~~~~l~~l~~~~v~~~~~~----------~~~~~~~~~~~~~~~~~~~eiih~r~~~~-~---- 189 (424) .|....|.+|+|+.|++...+ ...+|. ..+ ..|-.+.++.|..... + T Consensus 211 ~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~--V~G--~~IH~SRL~~f~g~plPd~LKp 286 (694) T protein:vir:10 211 IMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWW--MIG--TEVHATRLHTIVSRPVGDMLKP 286 (694) T ss_pred ccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEE--Eec--eEEeeeeEEEecCCCchhhhhc Confidence 244566888999888764321 111222 222 3455666665554322 2 Q ss_pred --CcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcC-CCCCCHHHHHHHHHHHHHHhCCcccCcceecC-CC Q lcl|NC_019705. 190 --GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG-EKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-AG 265 (424) Q Consensus 190 --~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~-~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-~g 265 (424) .+.|+|..+.+...+............+... ....++.+-- ....+... ..+..+.+-........++++++ .. T Consensus 287 ~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~-~~v~~lk~dla~~L~~g~~-~~l~~R~eli~~~Rsn~G~~llDk~~ 364 (694) T protein:vir:10 287 TYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ-FSVSGILMDLAQALMPGAN-VDLSMRAELINRYRDNRNILFLDKAT 364 (694) T ss_pred ccccCcccHHHHHHHHHHHHHHHHhHHHHHHHh-hhhHHHHHHHHHhhcChhH-HHHHHHHHHHHHhcCccceEEEecCC Confidence 3469999999999988888777777776644 3333331100 11112222 22322233211112234578888 47 Q ss_pred ceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHH-------HHHHHHHHHHHHHH Q lcl|NC_019705. 266 FSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSI 338 (424) Q Consensus 266 ~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~-------~tl~P~~~~ie~~l 338 (424) .+|.+.+.+...++ +........||.+-+||...|-+......+ ++.|...++||+ .-|+|.++.+-+.+ T Consensus 365 Eefeq~stslSGLd--dVi~qf~q~VAgaa~IPltkLfGqSPkGlN-ATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii 441 (694) T protein:vir:10 365 EEFFQFNTPLSGLD--ALQAQAQEQMSAVSHIPLIKLLGITPTGLN-ASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMI 441 (694) T ss_pred cceEEEecccCCHH--HHHHHHHHHHHhhhcCchhhhhccCccccc-ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 89999987777664 777778889999999999988776654442 233444455554 46889999988888 Q ss_pred HhhccChhhhcccchhhhhhhhhccCHHHHHHH-------HHHHHhCCCCCHHHHHHHhCCCCC-------CCCCeeeec Q lcl|NC_019705. 339 QRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAF-------MKAMGEAGLRTINEMRRTDNLPPL-------PGGDVAMRQ 404 (424) Q Consensus 339 ~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~-------~~~~~~~g~~t~NE~R~~~g~~p~-------~ggd~~~~~ 404 (424) -+..|...+. .+.|.++.|..++.++++++ ...+++.|+++++|+|+.+.-+|- +-.|++-.| T Consensus 442 ~rS~~G~idp---~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~ 518 (694) T protein:vir:10 442 QLSLFGAVDP---SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVP 518 (694) T ss_pred HHHhcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcC Confidence 8777766543 35566677877777666544 557788999999999999887652 223444444 Q ss_pred ccc-cchh-----hccccCCC-cccCC Q lcl|NC_019705. 405 SQY-VPIT-----DLGTNKEP-RNNGA 424 (424) Q Consensus 405 ~n~-~~~~-----~~~~~~~~-~~~ga 424 (424) ... ++.. ..++.++. ...|| T Consensus 519 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (694) T protein:vir:10 519 ADDDIDGVLTYVQRLAEGGDTGAPGGA 545 (694) T ss_pred ccchhhhhHhhhcCcccccccCCCCcc Confidence 322 1111 11111111 11111 No 138 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=99.45 E-value=6.8e-14 Score=92.77 Aligned_cols=407 Identities=12% Similarity=0.035 Sum_probs=207.4 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccc-cchh--------hccccccccCcccccHHHHhccHHHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQ-GSQT--------GPVSAHGHLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~-~~~~--------~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia 71 (424) ..||.-...|.+.-- +.--...+....+.+ .... .++.+.+..++. .-....+.|.+++|+.+|+ T Consensus 60 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~--~la~laQ~~eyr~~~~~ia 133 (698) T protein:vir:10 60 VAEPSPSLRLARQFE----VDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFP--TLVLLAQLPEYRAMHEVLA 133 (698) T ss_pred ccCCCccccccccce----eccccCCccccchhhhhhcccccccccchhhhccCcchHH--HHHHHhhccchhhHHHHHH Confidence 556655444332110 000000000000000 0000 011111111111 1234567899999999999 Q ss_pred HhhccCceEEEEeccC----------CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC--- Q lcl|NC_019705. 72 TLTACLPLDVFETDQN----------DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS--- 138 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~----------~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~--- 138 (424) +.+..-=..+...... +..... .+...-..|...-..+.-+..+.+.+.+ --++|-+.+++.-++ T Consensus 134 ~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~-~d~dqi~~L~~e~erl~V~~~l~eai~~-aRlfGGa~~~i~I~gdd~ 211 (698) T protein:vir:10 134 DECIRTWGEAIGGTKEKADTSGLAAGGNAAST-SDGDQLKQINDEIERLRIRDAVRTTVIH-DQAFGRAHPYFKIKGDDQ 211 (698) T ss_pred HHhhcccceeccccchhhhhhccccccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHh-cccccceEEEEEeecCcc Confidence 9887652222111000 000000 0111222333222222223344444444 455566555443211 Q ss_pred --------------CCceeEEEEecCceeEEeecC--ce---EEE---EEEeCCceEEecHhHEEEeecCC-CC------ Q lcl|NC_019705. 139 --------------AGDVISLLPLQSANMDVKLVG--KK---VVY---RYQRDSEYAEFSQKEIFHLKGFG-FT------ 189 (424) Q Consensus 139 --------------~G~~~~l~~l~~~~v~~~~~~--~~---~~~---~~~~~~~~~~~~~~eiih~r~~~-~~------ 189 (424) .|....|.+|+|+.|++...+ +. .+| .|...+. .+..+.++.|.... ++ T Consensus 212 ~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G~--~IH~SRL~~~vg~pvpd~LKp~y 289 (698) T protein:vir:10 212 IMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIGS--EVHATRLHTIVSRPVGDMLKPTY 289 (698) T ss_pred ccccccccccccccCccceeeeeecccccccchhhhccchhhccCCCceEEEecc--eecceeEEEecCCCchhhhcchh Confidence 244566888999888764321 00 011 1222222 45666666555432 22 Q ss_pred CcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-CCcee Q lcl|NC_019705. 190 GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-AGFST 268 (424) Q Consensus 190 ~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-~g~~~ 268 (424) .+.|+|..+.+...+............+... ....++.+--...++......+..+++-........++++++ ...+| T Consensus 290 ~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~-~~~~~l~~dla~aL~~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eef 368 (698) T protein:vir:10 290 SFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ-FSVSGILMDLAQALTPGANVDLSMRAELINRYRDNRNILFLDKATEEF 368 (698) T ss_pred ccCCccHHHHHHHHHHHHHHHhhhHHHHHHH-hhHHHHHHHHHHhcCChhhHHHHHHHHHHHHhcCccceEEEecCCcce Confidence 3469999999999998887777777766643 222332111111112222222333333211112234578888 57899 Q ss_pred eecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHH-------HHHHHHHHHHHHHHHhh Q lcl|NC_019705. 269 SAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRW 341 (424) Q Consensus 269 ~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~-------~tl~P~~~~ie~~l~~~ 341 (424) .+++.+...++ +........||.+-+||...|-+......+ ++.|...++||+ .-|+|.++.+-+.+-+. T Consensus 369 eq~st~lSGLd--dVi~qf~q~VAgaa~IPltkLfGqSPkGlN-ATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS 445 (698) T protein:vir:10 369 FQFNTPLSGLD--ALQAQAQEQMSAVSHIPLIKLLGITPTGLN-ASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLS 445 (698) T ss_pred EEEecCcCCHH--HHHHHHHHHHHhhhcCchhhhhccCCcccC-ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99988777665 777788899999999999988776654442 233444555554 46889999998888877 Q ss_pred ccChhhhcccchhhhhhhhhccCHHHHHHH-------HHHHHhCCCCCHHHHHHHhCCCCC-------CCCCeeeecc-c Q lcl|NC_019705. 342 LIPAKDVGRIHAEHNLDGLLRGDSASRAAF-------MKAMGEAGLRTINEMRRTDNLPPL-------PGGDVAMRQS-Q 406 (424) Q Consensus 342 l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~-------~~~~~~~g~~t~NE~R~~~g~~p~-------~ggd~~~~~~-n 406 (424) .|..... .+.|.++.|..++.++++++ ...+++.|+++++|+|+.|.-+|- +--|++..|. | T Consensus 446 ~~G~idp---~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~ 522 (698) T protein:vir:10 446 LFGAVDP---SIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGAPADD 522 (698) T ss_pred hcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccccCCcccCCCCCCC Confidence 7766543 35566678877877776554 456778999999999999877652 1123433332 2 Q ss_pred ccchh-----hc---cccCCCc-ccCC Q lcl|NC_019705. 407 YVPIT-----DL---GTNKEPR-NNGA 424 (424) Q Consensus 407 ~~~~~-----~~---~~~~~~~-~~ga 424 (424) .++.. .. ++..+++ .+|| T Consensus 523 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 549 (698) T protein:vir:10 523 DIDGVLTYVQRMAEGGDTGAPTAPGGA 549 (698) T ss_pred cchHHHhhhcCCcCCCCcccccccccc Confidence 22221 11 1111211 1222 No 139 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=99.42 E-value=6.8e-13 Score=87.29 Aligned_cols=287 Identities=14% Similarity=0.044 Sum_probs=171.3 Q ss_pred EEEEeeCC-CC--ceeEEEEecCceeE---EeecCceEEEEEE-e-CCceEEecHhHEEEeecCC-CCCcccCchHHHHH Q lcl|NC_019705. 131 YALVDRNS-AG--DVISLLPLQSANMD---VKLVGKKVVYRYQ-R-DSEYAEFSQKEIFHLKGFG-FTGLVGLSPIAFAC 201 (424) Q Consensus 131 ~~~~~r~~-~G--~~~~l~~l~~~~v~---~~~~~~~~~~~~~-~-~~~~~~~~~~eiih~r~~~-~~~~~G~s~i~~~~ 201 (424) +.+++... .| .|..|.+.++.++. +..+++....... . +.....+++...++.++.. ...++|.+.+..++ T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~~g~~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~ 80 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPDGGLVAIEQWGVFGKATVRIPVDRLVVFVNEREGANWLGQSLLRQAY 80 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccCCceeEEEecCCCCCCcceeccCCEEEEEeCCCCCCccchhhHHHHH Confidence 66666543 34 36778888887554 3444443333322 2 2345678888887777654 45589999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC--CCC----------HHHHHHHHHHHHHHhCCcccCcceecCCCceee Q lcl|NC_019705. 202 KSAGVAVAMEDQQRDFFANGAKSPQILSTGEK--VLT----------EQQRSQVEENFKEIAGGPVKKRLWILEAGFSTS 269 (424) Q Consensus 202 ~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~--~~~----------~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~ 269 (424) -.........++...|...-+.+--+.+.+.+ ..+ .+.++.+.........+..+ .++++.|++++ T Consensus 81 w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a--~~iip~g~~ie 158 (355) T protein:vir:78 81 KNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAA--GGYIPHGANFT 158 (355) T ss_pred HHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcce--eEeecCCceEE Confidence 99999999999999999986443333333322 211 12233344444444444443 57788888877 Q ss_pred ecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhh-- Q lcl|NC_019705. 270 AIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKD-- 347 (424) Q Consensus 270 ~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~-- 347 (424) -+.......++.+..++..++|+.++--. .+-....+++.+++ ..+.........+.-.+..|++.||+.|+.+.- T Consensus 159 ~~ea~g~~~~~~~~i~~~d~~Isk~iLGq-tlTs~~~~~gGS~A-lg~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l 236 (355) T protein:vir:78 159 LTGVQGKLPEMDGPIRYHDEQIARAVLAH-FLTLGGDKSTGSYA-LGDTFASFFTGSLNAVMKHIADVTQQHVVEDLVDQ 236 (355) T ss_pred EeecCCCcccHHHHHHHHHHHHHHHHhhh-hhccccCCccchhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 77655555667788888899998887544 22211111112222 234455677788888889999999988876431 Q ss_pred ---h--cccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHH-----HHHHHhCCCCCCCCCeeeeccccc-chhhcccc Q lcl|NC_019705. 348 ---V--GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTIN-----EMRRTDNLPPLPGGDVAMRQSQYV-PITDLGTN 416 (424) Q Consensus 348 ---~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~N-----E~R~~~g~~p~~ggd~~~~~~n~~-~~~~~~~~ 416 (424) . ...++.|+ ... .+.+..++.+..++..|+.-++ .+|+.+|+|.-+.+++...+.... +....... T Consensus 237 N~~~~~~~P~~~~~--~~~-~~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~p~~~~~~~~~~~~~~~~~~~~~~ 313 (355) T protein:vir:78 237 NWGPEEPAPRLVPA--QLG-KEQPVTAEAIRALVECGAFTADPELEKDLRARYGLPAPAERDDGADAAAAKAAGRRRAKR 313 (355) T ss_pred cCCCCCCCCEEEec--CcC-hhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCCcccCCccccccccccccc Confidence 1 11233443 222 3445689999999999987764 479999998655555554432211 11110110 Q ss_pred CCCcccCC Q lcl|NC_019705. 417 KEPRNNGA 424 (424) Q Consensus 417 ~~~~~~ga 424 (424) ..+...++ T Consensus 314 ~~~~~~~~ 321 (355) T protein:vir:78 314 LPGQRQGA 321 (355) T ss_pred cCCccccc Confidence 01111111 No 140 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=99.34 E-value=5.7e-13 Score=87.71 Aligned_cols=396 Identities=11% Similarity=0.047 Sum_probs=220.6 Q ss_pred HhhccCccc-CCcc-------cc--chhhccccc------cccCccc----ccHHHHhccHHHHHHHHHHHHhhccCceE Q lcl|NC_019705. 21 QSWFVGGRL-VTPN-------QG--SQTGPVSAH------GHLGDSS----INDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 21 ~~~~~~~~~-~~~~-------~~--~~~~~~~~~------~~~~~~~----vs~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) +..++.+.. ..|. .. ..+.+.... +..++.. -..+-+-..|.++..+..|+++++++.+. T Consensus 1 ~~~~rPk~~p~~p~~~~~arrr~LtaAsa~l~~~~~~~~kt~~~~~~~WQ~eAW~~~d~vpELry~vgW~~~a~SR~rL~ 80 (646) T protein:vir:10 1 MALLKPKSAPPEPFGAEVARRIALAGATAQVDLGASSSWKTWKFGNKDWQTEGWRLYDIIPEHHFLAGRIGDSVAQARLY 80 (646) T ss_pred CcccCCCCCCCCcccccccchhhhhhccccccCCCcceeecCCCcchhhhHHHHHHHhhhhhHhhHhhhhhhhhceeeee Confidence 222221111 1110 00 000000000 0111100 01122233577888899999999999999 Q ss_pred EEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe---eCCCCceeEEEEecCceeEEee Q lcl|NC_019705. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD---RNSAGDVISLLPLQSANMDVKL 157 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~---r~~~G~~~~l~~l~~~~v~~~~ 157 (424) .-+-++.|...-...++.+..+....--.-.-..++++.+..++-+-|++|++.. ....+.--..+++-.+.|.. T Consensus 81 aseiddtG~~tg~v~~~~v~~iv~~~~Gg~~gQ~qlLkr~~~~ltV~GE~wiv~~~~~~~~~~~~~~W~vvt~~Ev~~-- 158 (646) T protein:vir:10 81 VTEVDDTGEETGEVQDERIKRLAAVPLGTGSQRDDNLRLAGLDLAVGGECWIVGEGAATSPEAAEGSWFVVTGSAISR-- 158 (646) T ss_pred eeeecCCCCCcCccchHHHHHHhhhhccchhhHHHHHHHHHhheecccceEEeeccccCCCCCCccceeeecHHHhcc-- Confidence 8887776665555556666666554333334467899999999999999999741 11112222355555555522 Q ss_pred cCceEEEEEEe---CCceEEecHhHEEEeecCCC---CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcC Q lcl|NC_019705. 158 VGKKVVYRYQR---DSEYAEFSQKEIFHLKGFGF---TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG 231 (424) Q Consensus 158 ~~~~~~~~~~~---~~~~~~~~~~eiih~r~~~~---~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~ 231 (424) .++.....-.. ++...-+++.+++ ||-.++ ....--||+.+++..+.-.....+...+..+.-..-.|||-++ T Consensus 159 tg~~~~i~~p~~~~g~~~v~~~~~d~l-vRiW~P~Prr~~epDSpvra~l~~l~Ei~~lt~~I~aaakSRL~GnGvLfvP 237 (646) T protein:vir:10 159 TGDEIAVRRPQQRGGSKLVLVDGQDIL-IRCWRPHPNDTDQADSFTRSAIVPLREIELLTKREFAELDSRLTGAGIMFLP 237 (646) T ss_pred CCCeeeeecCccCCCCCcceecCCceE-EEEecCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHHHHhcCceeeec Confidence 23322222111 2333445666764 464333 2357889999999988877777777776666666666777655 Q ss_pred CCC------CCHHHHHHHHHHHHH----HhCCcc-cC--cceecCC-Cc------eeeecccC-hhHHHHHHHHHHHHHH Q lcl|NC_019705. 232 EKV------LTEQQRSQVEENFKE----IAGGPV-KK--RLWILEA-GF------STSAIGVT-PQDAEMMASRKFQVSE 290 (424) Q Consensus 232 ~~~------~~~~~~~~~~~~~~~----~~~~~~-ag--~~~~l~~-g~------~~~~l~~~-~~d~~~~e~~~~~~~~ 290 (424) ... .++.....+...+-+ .+...+ +. =++++.. |. +++.+... .-+.--+.+++..+.. T Consensus 238 ~e~s~p~~~~~~a~~~~l~~~l~qaa~tAi~De~S~aA~vPiia~~P~E~i~~~~~ik~l~f~~eite~aiktR~daI~R 317 (646) T protein:vir:10 238 EGVDFPRGEEDPAGLAGFMAYLQRAAAASMADQSRASAMVPIMATIPNEMMEHLDKIKPLTFWSELSAEITPMKDKAIAR 317 (646) T ss_pred cccccCCCCCCCcchhHHHHHHHHHHHhhhcCCCCccceeeeEEeeChHHHhhhhcceeeccCchhhHHHhhhHHHHHHH Confidence 432 222223334333332 222211 11 1223221 11 23322232 2223347899999999 Q ss_pred HHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChh-------hhcccchhhhhhhhhcc Q lcl|NC_019705. 291 LARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-------DVGRIHAEHNLDGLLRG 363 (424) Q Consensus 291 Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~-------~~~~~~~~fd~~~l~~~ 363 (424) +|...-|||+.|-+..++|-++. -+-...-++ -|.|.+..|++.|++.+|.+- +...|-+.||.+.|... T Consensus 318 lA~glDIppE~LLGlgd~NHWtA--WqI~de~vr-HI~P~l~~ic~AlT~~~Lrp~Le~eGi~dp~kyvvW~DaS~Lt~~ 394 (646) T protein:vir:10 318 LASSAEIPGEVLTGIGDANHWTA--WLISDEGIR-WIRGYLGLIADALTRGFLRRALESMGVTNPERYAFAFDTSTLASK 394 (646) T ss_pred HHhccCCchhheeeccccceeee--eeeccccch-hhhhHHHHHHHHHHhhHHHHHHHHcCCCChhHeEEeecCcccccC Confidence 99999999999877777665532 333344455 699999999999999988653 11246779999888433 Q ss_pred CHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCe-----------e-------eecc-------------cccc--h Q lcl|NC_019705. 364 DSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDV-----------A-------MRQS-------------QYVP--I 410 (424) Q Consensus 364 d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~-----------~-------~~~~-------------n~~~--~ 410 (424) -++.+-+..+.+.|.+|-...|+.+|+.--++-+. . +.|. .+.| + T Consensus 395 --pd~~deA~qa~drGAIt~eAlrk~~Gf~~dd~pt~~E~~~~~~~~~v~~~P~Lil~P~~qa~~~~P~~~~~~lpp~~~ 472 (646) T protein:vir:10 395 --PNRLDEAIQLHERNLIKDEEVVKAGAFSVDQMPTVQERAVQILLGLVKTQPDLILDPAIQAALGLPAVQSVGLPPTAA 472 (646) T ss_pred --CCCcHHHHHHHHcCCccHHHHHHHhcccccccCChHHHHHHHHHHHhcCCccccccchhhccccCCCcCccccCCccc Confidence 34666677788999999999999999864221111 0 0010 0001 1 Q ss_pred hhc-cccCCCcccCC Q lcl|NC_019705. 411 TDL-GTNKEPRNNGA 424 (424) Q Consensus 411 ~~~-~~~~~~~~~ga 424 (424) +.. +++++++++|+ T Consensus 473 ~~~dg~~~~~e~~g~ 487 (646) T protein:vir:10 473 QRTDGDLDDDESEGA 487 (646) T ss_pred ccccCCCCChhhcCC Confidence 111 11233444444 No 141 >protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741 Probab=99.22 E-value=2.8e-12 Score=83.90 Aligned_cols=406 Identities=12% Similarity=0.091 Sum_probs=224.0 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccc-cchhhcc-------ccccccCccc----ccHHHHhccHHHHHHHHHHHHhhc Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQ-GSQTGPV-------SAHGHLGDSS----INDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~-~~~~~~~-------~~~~~~~~~~----vs~~~~~~~~~v~~~i~~ia~~ia 75 (424) |--.|..-+.+|- ++...+.... ...+.++ ....+.++.. -..+-+-..+.++-.+..|+++++ T Consensus 1 ~~a~~~lr~~rrp----kg~~~a~~r~L~aAs~~~~dpg~~~~~~~g~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~s 76 (631) T protein:vir:10 1 MAATQSLRLVRRP----KGGRPAPSRALTAASQPLPDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCS 76 (631) T ss_pred CCcccceeeeecC----CCCCccchhhhhhhhccccchhhhhhhhcCCcccchhhHHHHHHHHhhhhHHHHhhhhhhhhc Confidence 4444443333332 2211111000 0001111 1111111110 012223335788889999999999 Q ss_pred cCceEEEEeccC-----Cccceeccc-hHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCCCCc------- Q lcl|NC_019705. 76 CLPLDVFETDQN-----DNRKKVDLS-NPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNSAGD------- 141 (424) Q Consensus 76 ~~~~~v~~~~~~-----~~~~~~~~~-~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~-r~~~G~------- 141 (424) ++.+..-+-+.+ |..++...+ ..+..+...=+..-+...++++.+..++-+-|++|+.+. |..+|- T Consensus 77 r~rL~as~idpDtg~ptg~iee~~~~~~~v~~~~~~i~gG~lgQ~~llkrl~~~ltV~GE~wiv~l~~p~~~~~~~pd~~ 156 (631) T protein:vir:10 77 RCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGS 156 (631) T ss_pred eeeeEeeeeccCCCCCccccccCCchhHHHHHHHHhcCCCcchHHHHHHHHHhheecccceEEEEEeccCcCCCCCcccc Confidence 999988777655 222221111 234444444456667889999999999999999999874 333221 Q ss_pred ---eeEEEEecCceeEEeecCceEEEEEEeCCceEEecHhHEEEeecC--CC-CCcccCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 142 ---VISLLPLQSANMDVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGF--GF-TGLVGLSPIAFACKSAGVAVAMEDQQR 215 (424) Q Consensus 142 ---~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~--~~-~~~~G~s~i~~~~~~i~~~~~~~~~~~ 215 (424) ..+++++....|+....+....+....+..-.-....| +.||-. ++ ...+--||+.+++..+.-.....+... T Consensus 157 ~r~~~~W~~vt~~ei~~~~~g~g~~v~lp~g~~h~~~~~~D-~l~RiW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~ 235 (631) T protein:vir:10 157 VRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTD-IIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIA 235 (631) T ss_pred cccccceeeccHHHHhcccCcccceeecCCCCccceecCCc-eEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHH Confidence 33566666666665544544444444333222223334 445533 33 235788999999988887777777776 Q ss_pred HHHhcCCCCceeEEcCCCCCC--------------------HHHHHHHHH-HHHHH---hCCc-ccC--cceecC----- Q lcl|NC_019705. 216 DFFANGAKSPQILSTGEKVLT--------------------EQQRSQVEE-NFKEI---AGGP-VKK--RLWILE----- 263 (424) Q Consensus 216 ~~~~ng~~~~~vl~~~~~~~~--------------------~~~~~~~~~-~~~~~---~~~~-~ag--~~~~l~----- 263 (424) +..+.-..-.|+|-++...+= .-....+.. .++.. +... .+. =++++. T Consensus 236 aaakSRl~gnGvlflP~els~P~~~~~~~~~~g~~v~~~~g~pa~~~l~~~l~q~a~tai~De~S~aA~vPii~~~p~E~ 315 (631) T protein:vir:10 236 NASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQ 315 (631) T ss_pred HHHHHHHhhCceeEeccccccCCCCCCCCCcCCccCCccccchhHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHH Confidence 666665666667655543211 112333333 33322 2111 111 122221 Q ss_pred -CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019705. 264 -AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDV-EKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRW 341 (424) Q Consensus 264 -~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~-~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~ 341 (424) ++++...+.....+. -+..++..+..+|...-|||+.|-+. .++|-+ +.-+-...-++--|.|.+..|++.|++. T Consensus 316 i~~i~hlkf~~ei~e~-aiktR~daI~RlA~glDi~pE~LLGlGsd~NHW--sAWqI~dedVrlHI~P~l~lic~AlT~q 392 (631) T protein:vir:10 316 IKDVKHIRFDNEITEV-AIKTRNDAIARLAMGLDVSPERLLGLGSQTNHW--SAWQISDEDVQLHIAPVMEIFCQALTDQ 392 (631) T ss_pred hcCeeEEeecCchhHH-HHhhHHHHHHHHHhccCCchhhheeccCCccce--EEEEecccceeeecchHHHHHHHHHHhh Confidence 233444444444444 47899999999999999999888665 355543 2233334556667999999999999999 Q ss_pred ccChh------hhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC---------------- Q lcl|NC_019705. 342 LIPAK------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD---------------- 399 (424) Q Consensus 342 l~~~~------~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd---------------- 399 (424) +|.+- +...|-+.||.+.|... -++.+-+..+.+.|.+|-...|+.+|+.--.+-| T Consensus 393 ~Lrp~Le~eGvDp~kYvvW~DaS~Lt~d--Pdr~deA~qa~drGAIt~eAlrk~lGf~eDd~yd~~t~e~~~~~a~~av~ 470 (631) T protein:vir:10 393 ILRVTLAREGIDPSKYVVWYDPSQLTID--PDKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVS 470 (631) T ss_pred HHHHHHHHhCCCHHHhEeeecCcccccC--CCCcHHHHHHHHcCCcCHHHHHHHhcCchhcccCcCchHHHHHHHHHHhh Confidence 88653 12247779999888432 3466667778899999999999999997533322 Q ss_pred --eeeecccccch----------------hhcccc----CCCcccCC Q lcl|NC_019705. 400 --VAMRQSQYVPI----------------TDLGTN----KEPRNNGA 424 (424) Q Consensus 400 --~~~~~~n~~~~----------------~~~~~~----~~~~~~ga 424 (424) .-+.| ++.|+ ...++. +++.++|. T Consensus 471 ~dpaLip-~lApl~~~~~~~v~~P~~~a~~~~g~ed~~~~~~~~~g~ 516 (631) T protein:vir:10 471 KDPTLIP-MLAPLIAGVLKQIEFPQQQAIDSGGNEDTSDADDLDDGE 516 (631) T ss_pred cccCcch-hhHHHHHHHhhhccCCCCCCCCCCCCCccccccccccCC Confidence 11111 11110 000000 01111111 No 142 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=99.15 E-value=1.9e-11 Score=79.31 Aligned_cols=406 Identities=11% Similarity=0.061 Sum_probs=210.8 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCccc-CCccccchhhccccccccC-cccccHHHHhccHHHHHHHHHHHHhhccCc Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRL-VTPNQGSQTGPVSAHGHLG-DSSINDERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) ...||-. --+|++ .+.-..... ..|....... .+.+..+ .-.-..+-+-..+.++..+..|+++++++. T Consensus 9 ~rrpk~~-p~~~r~------~al~aas~~i~~p~~~~~ks--~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~r 79 (629) T protein:vir:86 9 VRRPKSE-PVSTRQ------RALVAASQPVENPGKAFRKA--MGSSTRTDWQEDAWKAYDAVGELRYYVGWRSSSASRVR 79 (629) T ss_pred eecCCCC-Chhhhh------hhhhhhhhccccccchhhhh--cCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhceee Confidence 2233321 001111 011110000 0111111000 0000000 000011222336788899999999999999 Q ss_pred eEEEEeccCCccceeccch------HHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC------ce-eEE Q lcl|NC_019705. 79 LDVFETDQNDNRKKVDLSN------PLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG------DV-ISL 145 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~------~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G------~~-~~l 145 (424) +..-+-+.++...-...++ .+.++...--..-+-..++++.+..++-+-|++|+.+.-...| .+ .+. T Consensus 80 L~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~~eW 159 (629) T protein:vir:86 80 LIASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPVPEW 159 (629) T ss_pred eEeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcchhhh Confidence 9988777555433222222 2333333222233456799999999999999999998632323 22 233 Q ss_pred EEecCceeEEeecCceEEEEEEeCCceEEecHhHEEEeecCCC---CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_019705. 146 LPLQSANMDVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFGF---TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGA 222 (424) Q Consensus 146 ~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~~---~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~ 222 (424) +.+-++.|+ ...+........+....-.+..+++ ||-.++ ...+--||+.+++..+.-.....+...+..+.-. T Consensus 160 ~~vt~~ei~--~~~~~~~i~lP~g~~~e~~~~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaakSRL 236 (629) T protein:vir:86 160 LALTPEEVR--ASEKKTIIELPTGDKHEFRDGLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANASKSRL 236 (629) T ss_pred eeechHHhh--hccCceeeEcCCCCcceeeCCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHH Confidence 444444433 1222222222223223333344554 664333 2357889999998888877776666655555544 Q ss_pred CCceeEEcCCCCC----------------------CHHHHHHHHHHHH----HHhCCcc-cC--cceecC------CCce Q lcl|NC_019705. 223 KSPQILSTGEKVL----------------------TEQQRSQVEENFK----EIAGGPV-KK--RLWILE------AGFS 267 (424) Q Consensus 223 ~~~~vl~~~~~~~----------------------~~~~~~~~~~~~~----~~~~~~~-ag--~~~~l~------~g~~ 267 (424) .-.|||-++.... .+ ..+.+.+.+. ..+...+ +. =++++. ++++ T Consensus 237 ~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~p-a~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~i~~i~ 315 (629) T protein:vir:86 237 IGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTP-AVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGELIKNVT 315 (629) T ss_pred hhCceeeeccCcccCccCCCCCCCCCCcccccccccc-hHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCee Confidence 5555543332211 11 2233333333 2222211 11 122221 2334 Q ss_pred eeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChh Q lcl|NC_019705. 268 TSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDV-EKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK 346 (424) Q Consensus 268 ~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~-~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~ 346 (424) ...+.....+. -+..++..+..+|...-|||+.|-+. .++|-+ +.-+-...-++--|.|.+..|++.|++.+|.+- T Consensus 316 hlkf~~ei~e~-aiktR~daI~RlA~glDippE~LLGlGsd~NHW--sAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp~ 392 (629) T protein:vir:86 316 HLKFDNQVTEV-AIKTRNDAIARLAMGLDVSPERLLGLGSNSNHW--SAWQIGDEDVRLHILPPVEMLCEAITNQVLRTV 392 (629) T ss_pred EEeecCchhHH-HHhhHHHHHHHHHhccCCchhhheeccCCccce--EEEEecccceeeecchHHHHHHHHHHhhHHHHH Confidence 44444444444 47899999999999999999888665 355544 223334455666799999999999999988653 Q ss_pred ------hhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC-------------eeeecccc Q lcl|NC_019705. 347 ------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD-------------VAMRQSQY 407 (424) Q Consensus 347 ------~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd-------------~~~~~~n~ 407 (424) +...|-+.||.+.|... -++.+-+..+.+.|.+|-...|+.+|+.-..+-| ......++ T Consensus 393 Le~eGiDp~kYvvW~DaS~Lt~d--Pd~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~L 470 (629) T protein:vir:86 393 LMREGIDPNAYVVWHDASQLTVD--PDKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPNL 470 (629) T ss_pred HHHhCCCHHHhEeeecCcccccC--CCCcHHHHHHHHcCCcCHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcch Confidence 12247779999888432 3466667778899999999999999996522221 11111111 Q ss_pred c---------------chhhcccc-----C-CCcccCC Q lcl|NC_019705. 408 V---------------PITDLGTN-----K-EPRNNGA 424 (424) Q Consensus 408 ~---------------~~~~~~~~-----~-~~~~~ga 424 (424) + |+...+-. + ..+.+|+ T Consensus 471 i~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~dE~sga 508 (629) T protein:vir:86 471 LPTLAVLIPELADVEFPTPTVALPPAEEQDGDEEASGA 508 (629) T ss_pred hhhhhhhhhhhcccccCccCCCCCccccCCCcccccCC Confidence 1 11110000 0 0011111 No 143 >protein:vir:99088 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655692;genbank:gi:109521770;genbank:GeneID:4157810 Probab=99.14 E-value=2e-11 Score=79.28 Aligned_cols=406 Identities=11% Similarity=0.063 Sum_probs=210.9 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCccc-CCccccchhhccccccccC-cccccHHHHhccHHHHHHHHHHHHhhccCc Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRL-VTPNQGSQTGPVSAHGHLG-DSSINDERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) ...||-. --+|++ .+.-..... ..|....... .+.+..+ .-.-..+-+-..+.++..+..|+++++++. T Consensus 9 ~rrpk~~-p~~~r~------~al~aas~~i~~p~~~~~ks--~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~r 79 (629) T protein:vir:99 9 VRRPKSE-PVSTRQ------RALVAASQPVENPGKAFRKA--MGSSTRTDWQDDAWKAYDAVGELRYYVGWRSSSASRVR 79 (629) T ss_pred eecCCCC-Chhhhh------hhhhhhhhcccccchhhhhh--cCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhceee Confidence 2233320 001111 010100000 0111111000 0000000 000011222336788899999999999999 Q ss_pred eEEEEeccCCccceeccch------HHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC------ce-eEE Q lcl|NC_019705. 79 LDVFETDQNDNRKKVDLSN------PLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG------DV-ISL 145 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~------~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G------~~-~~l 145 (424) +..-+-+.++...-...++ .+.++...--..-+-..++++.+..++-+-|++|+.+.-...| .+ .+. T Consensus 80 L~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~~eW 159 (629) T protein:vir:99 80 LIASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPVPEW 159 (629) T ss_pred eEeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcchhhh Confidence 9988777555433222222 2333333222233456799999999999999999998632322 22 233 Q ss_pred EEecCceeEEeecCceEEEEEEeCCceEEecHhHEEEeecCCC---CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_019705. 146 LPLQSANMDVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFGF---TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGA 222 (424) Q Consensus 146 ~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~~---~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~ 222 (424) +.+-++.|+ ...+........+....-.+..+++ ||-.++ ...+--||+.+++..+.-.....+...+..+.-. T Consensus 160 ~~vt~~ei~--~~~~~~~i~lP~g~~~e~~~~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaakSRL 236 (629) T protein:vir:99 160 LALTPEEVR--ASEKKTIIELPTGDKHEFRDGLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANASKSRL 236 (629) T ss_pred eeechHHhh--hccCceeEEcCCCCccceeCCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHH Confidence 444444433 1122222222223333333444544 664333 2357889999998888877776666665555545 Q ss_pred CCceeEEcCCCCC----------------------CHHHHHHHHHHHH----HHhCCcc-cC--cceecC------CCce Q lcl|NC_019705. 223 KSPQILSTGEKVL----------------------TEQQRSQVEENFK----EIAGGPV-KK--RLWILE------AGFS 267 (424) Q Consensus 223 ~~~~vl~~~~~~~----------------------~~~~~~~~~~~~~----~~~~~~~-ag--~~~~l~------~g~~ 267 (424) .-.|||-++.... .+ ..+.+.+.+. ..+...+ +. =++++. ++++ T Consensus 237 ~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~p-a~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~i~~i~ 315 (629) T protein:vir:99 237 IGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTP-AVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGELIKNVT 315 (629) T ss_pred hhCceeEeccCcccCccCCCCCCCCCCcccccccccc-hHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCee Confidence 5555543332211 11 2233333333 2222211 11 122221 2334 Q ss_pred eeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChh Q lcl|NC_019705. 268 TSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDV-EKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK 346 (424) Q Consensus 268 ~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~-~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~ 346 (424) ...+.....+. -+..++..+..+|...-|||+.|-+. .++|-+ +.-+-...-++--|.|.+..|++.|++.+|.+- T Consensus 316 hlkf~~ei~e~-aiktR~daI~RlA~glDippE~LLGlGsd~NHW--sAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp~ 392 (629) T protein:vir:99 316 HLKFDNQVTEV-AIKTRNDAIARLAMGLDVSPERLLGLGSNSNHW--SAWQIGDEDVRLHILPPVEMLCEAITNQVLRTV 392 (629) T ss_pred EEeecCchhHH-HHhhHHHHHHHHHhccCCchhhheeccCCccce--EEEEecccceeeecchhHHHHHHHHHhhHHHHH Confidence 44444444444 47899999999999999999888665 355544 223333455666799999999999999988653 Q ss_pred ------hhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC-------------Ceeeecccc Q lcl|NC_019705. 347 ------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG-------------DVAMRQSQY 407 (424) Q Consensus 347 ------~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg-------------d~~~~~~n~ 407 (424) +...|-+.||.+.|... -++.+-+..+.+.|.+|-...|+.+|+.-..+- |......++ T Consensus 393 Le~eGiDp~kYvvW~DaS~Lt~d--Pd~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~L 470 (629) T protein:vir:99 393 LMREGIDPNAYVVWHDASQLTVD--PDKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPNL 470 (629) T ss_pred HHHhCCCHHHhEeeecCcccccC--CCCcHHHHHHHHcCCccHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcch Confidence 12247779999888432 346666777889999999999999999652222 111111111 Q ss_pred c---------------chhhccc-----cC-CCcccCC Q lcl|NC_019705. 408 V---------------PITDLGT-----NK-EPRNNGA 424 (424) Q Consensus 408 ~---------------~~~~~~~-----~~-~~~~~ga 424 (424) + |+...+- ++ ..+.+|+ T Consensus 471 i~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~dE~sga 508 (629) T protein:vir:99 471 LPTLAVLIPELADVEFPTPTVALPPAEEQDGDEEASGA 508 (629) T ss_pred hhhhhhhhhhhcccccCccCCCCCccccCCCcccccCC Confidence 1 1110000 00 0011111 No 144 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=99.02 E-value=3.9e-11 Score=77.66 Aligned_cols=404 Identities=12% Similarity=0.053 Sum_probs=211.2 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccch---hhcc----c-cccccCccc------ccHHHHhccHHHHHHHHHHHHh Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQ---TGPV----S-AHGHLGDSS------INDERILQISTVWRCVSLISTL 73 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~---~~~~----~-~~~~~~~~~------vs~~~~~~~~~v~~~i~~ia~~ 73 (424) |-- |..-+.+| .++...+ +.+... ..+. . ...+..+.. -..+.+-..+.++-.+..|+++ T Consensus 1 ma~-~~lr~~rr----pk~~p~~-~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s 74 (639) T protein:vir:10 1 MAA-TSLRVVRR----PKGSAPA-ARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYVSWRANS 74 (639) T ss_pred CCc-cceeeeec----CCCCCcc-hhhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHHhhhhhhh Confidence 211 11111111 1111100 011000 0011 0 000111110 0122233457888899999999 Q ss_pred hccCceEEEEeccCCcc-------ceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCCCCc---- Q lcl|NC_019705. 74 TACLPLDVFETDQNDNR-------KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNSAGD---- 141 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~-------~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~-r~~~G~---- 141 (424) ++++.+..-+-+.+... ++....+.+.+..+.=-..-+-..++++.+..++-+-|++|+.+. |..++. T Consensus 75 ~sr~rL~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~ 154 (639) T protein:vir:10 75 CSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGL 154 (639) T ss_pred hceeeeEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCcc Confidence 99999988776655431 122222334443332233345677999999999999999998754 444332 Q ss_pred --eeEEEE-ecCceeEEeecCceEEEEEEeCCceEEecHhHEEEeecCCC---CCcccCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 142 --VISLLP-LQSANMDVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFGF---TGLVGLSPIAFACKSAGVAVAMEDQQR 215 (424) Q Consensus 142 --~~~l~~-l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~~---~~~~G~s~i~~~~~~i~~~~~~~~~~~ 215 (424) +.+-|. +....|. ...++..... .+++...+|..+.=+.||-.++ ....--||+.+++..+.-.....+... T Consensus 155 ~~~~~~W~vvs~~Ei~-~~~~~~~~i~-lPdG~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~ 232 (639) T protein:vir:10 155 AAPRARWYAVTREEIK-SKAGETAEIS-LPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (639) T ss_pred cccccceeeeeHHHhc-ccCCCeeEee-cCCCCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHHH Confidence 233333 3333333 1222222222 2234444444443344664332 235788999999888887777766666 Q ss_pred HHHhcCCCCceeEEcCCCCC------------------------CHHHHHHHHHHHH----HHhCCcc-cC--cceecCC Q lcl|NC_019705. 216 DFFANGAKSPQILSTGEKVL------------------------TEQQRSQVEENFK----EIAGGPV-KK--RLWILEA 264 (424) Q Consensus 216 ~~~~ng~~~~~vl~~~~~~~------------------------~~~~~~~~~~~~~----~~~~~~~-ag--~~~~l~~ 264 (424) +..+.-..-.|+|-++...+ .....+.+...+- ..+...+ +. =++++.. T Consensus 233 aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~~ 312 (639) T protein:vir:10 233 NAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV 312 (639) T ss_pred HHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEee Confidence 55555555555554432211 0011223333332 2222221 11 1222211 Q ss_pred ----Cceeeecc--cChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 265 ----GFSTSAIG--VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSI 338 (424) Q Consensus 265 ----g~~~~~l~--~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l 338 (424) .-+++.|. ....+. -+..++..+..+|....|||+.|-+..++|-+ +.-+-...-++--|.|.+..|++.| T Consensus 313 p~E~l~~ikhl~f~~ei~e~-aiktR~daI~RlA~glDi~pE~LLGl~d~NHW--sAWqI~dedvrlHI~P~l~~icdAl 389 (639) T protein:vir:10 313 AAEHLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSKGNHW--SAWAIGDEDVQLHIKPVMDLICQAI 389 (639) T ss_pred chHHhcCeeeeeecCchhHH-HHhhHHHHHHHHHhccCCchhheeecccccce--EEEEecccceeeecchhHHHHHHHH Confidence 12344444 333344 47899999999999999999988766665544 2234444556677999999999999 Q ss_pred HhhccChh------hhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC------------- Q lcl|NC_019705. 339 QRWLIPAK------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD------------- 399 (424) Q Consensus 339 ~~~l~~~~------~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd------------- 399 (424) ++.+|.+- +...|-+.||.+.|... -++.+-+..+.+.|.+|-.-.|+.+|+.--++=| T Consensus 390 T~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~d--Pd~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~~ 467 (639) T protein:vir:10 390 YNDILTPLLAREGIDPTKYILWYDASGLTSD--PDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAAD 467 (639) T ss_pred HhhHHHHHHHHhCCCHHHhEeeecCcccccC--CCCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHHHH Confidence 99988653 12247779999888433 3466667778899999999999999986432212 Q ss_pred eeeecccccc---------------------hhh-ccccCCCcccCC Q lcl|NC_019705. 400 VAMRQSQYVP---------------------ITD-LGTNKEPRNNGA 424 (424) Q Consensus 400 ~~~~~~n~~~---------------------~~~-~~~~~~~~~~ga 424 (424) ....+..+++ +.. -.+..+++++|| T Consensus 468 ~V~~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga 514 (639) T protein:vir:10 468 VVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSGA 514 (639) T ss_pred HhcCCcchhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccCC Confidence 0000111110 000 011122233333 No 145 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=99.02 E-value=3.9e-11 Score=77.66 Aligned_cols=404 Identities=12% Similarity=0.053 Sum_probs=211.2 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccch---hhcc----c-cccccCccc------ccHHHHhccHHHHHHHHHHHHh Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQ---TGPV----S-AHGHLGDSS------INDERILQISTVWRCVSLISTL 73 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~---~~~~----~-~~~~~~~~~------vs~~~~~~~~~v~~~i~~ia~~ 73 (424) |-- |..-+.+| .++...+ +.+... ..+. . ...+..+.. -..+.+-..+.++-.+..|+++ T Consensus 1 ma~-~~lr~~rr----pk~~p~~-~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s 74 (639) T protein:vir:97 1 MAA-TSLRVVRR----PKGSAPA-ARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYYVSWRANS 74 (639) T ss_pred CCc-cceeeeec----CCCCCcc-hhhHHHhhhhhccCCcccchhhhccccchhhhhhhhhhhhhhhhhHHHHhhhhhhh Confidence 211 11111111 1111100 011000 0011 0 000111110 0122233457888899999999 Q ss_pred hccCceEEEEeccCCcc-------ceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCCCCc---- Q lcl|NC_019705. 74 TACLPLDVFETDQNDNR-------KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNSAGD---- 141 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~-------~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~-r~~~G~---- 141 (424) ++++.+..-+-+.+... ++....+.+.+..+.=-..-+-..++++.+..++-+-|++|+.+. |..++. T Consensus 75 ~sr~rL~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~ 154 (639) T protein:vir:97 75 CSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGL 154 (639) T ss_pred hceeeeEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCcc Confidence 99999988776655431 122222334443332233345677999999999999999998754 444332 Q ss_pred --eeEEEE-ecCceeEEeecCceEEEEEEeCCceEEecHhHEEEeecCCC---CCcccCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 142 --VISLLP-LQSANMDVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFGF---TGLVGLSPIAFACKSAGVAVAMEDQQR 215 (424) Q Consensus 142 --~~~l~~-l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~~---~~~~G~s~i~~~~~~i~~~~~~~~~~~ 215 (424) +.+-|. +....|. ...++..... .+++...+|..+.=+.||-.++ ....--||+.+++..+.-.....+... T Consensus 155 ~~~~~~W~vvs~~Ei~-~~~~~~~~i~-lPdG~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~ 232 (639) T protein:vir:97 155 AAPRARWYAVTREEIK-SKAGETAEIS-LPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIK 232 (639) T ss_pred cccccceeeeeHHHhc-ccCCCeeEee-cCCCCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHHH Confidence 233333 3333333 1222222222 2234444444443344664332 235788999999888887777766666 Q ss_pred HHHhcCCCCceeEEcCCCCC------------------------CHHHHHHHHHHHH----HHhCCcc-cC--cceecCC Q lcl|NC_019705. 216 DFFANGAKSPQILSTGEKVL------------------------TEQQRSQVEENFK----EIAGGPV-KK--RLWILEA 264 (424) Q Consensus 216 ~~~~ng~~~~~vl~~~~~~~------------------------~~~~~~~~~~~~~----~~~~~~~-ag--~~~~l~~ 264 (424) +..+.-..-.|+|-++...+ .....+.+...+- ..+...+ +. =++++.. T Consensus 233 aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~~ 312 (639) T protein:vir:97 233 NAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASV 312 (639) T ss_pred HHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEee Confidence 55555555555554432211 0011223333332 2222221 11 1222211 Q ss_pred ----Cceeeecc--cChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 265 ----GFSTSAIG--VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSI 338 (424) Q Consensus 265 ----g~~~~~l~--~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l 338 (424) .-+++.|. ....+. -+..++..+..+|....|||+.|-+..++|-+ +.-+-...-++--|.|.+..|++.| T Consensus 313 p~E~l~~ikhl~f~~ei~e~-aiktR~daI~RlA~glDi~pE~LLGl~d~NHW--sAWqI~dedvrlHI~P~l~~icdAl 389 (639) T protein:vir:97 313 AAEHLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSKGNHW--SAWAIGDEDVQLHIKPVMDLICQAI 389 (639) T ss_pred chHHhcCeeeeeecCchhHH-HHhhHHHHHHHHHhccCCchhheeecccccce--EEEEecccceeeecchhHHHHHHHH Confidence 12344444 333344 47899999999999999999988766665544 2234444556677999999999999 Q ss_pred HhhccChh------hhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC------------- Q lcl|NC_019705. 339 QRWLIPAK------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD------------- 399 (424) Q Consensus 339 ~~~l~~~~------~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd------------- 399 (424) ++.+|.+- +...|-+.||.+.|... -++.+-+..+.+.|.+|-.-.|+.+|+.--++=| T Consensus 390 T~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~d--Pd~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~~ 467 (639) T protein:vir:97 390 YNDILTPLLAREGIDPTKYILWYDASGLTSD--PDLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGCREFAAD 467 (639) T ss_pred HhhHHHHHHHHhCCCHHHhEeeecCcccccC--CCCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHHHHHHHH Confidence 99988653 12247779999888433 3466667778899999999999999986432212 Q ss_pred eeeecccccc---------------------hhh-ccccCCCcccCC Q lcl|NC_019705. 400 VAMRQSQYVP---------------------ITD-LGTNKEPRNNGA 424 (424) Q Consensus 400 ~~~~~~n~~~---------------------~~~-~~~~~~~~~~ga 424 (424) ....+..+++ +.. -.+..+++++|| T Consensus 468 ~V~~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga 514 (639) T protein:vir:97 468 VVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSGA 514 (639) T ss_pred HhcCCcchhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccCC Confidence 0000111110 000 011122233333 No 146 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.90 E-value=2.6e-09 Score=67.65 Aligned_cols=391 Identities=13% Similarity=0.064 Sum_probs=188.0 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCc--ccccH-----HHHhccHHHHHHHHHHHHhhccCceE Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGD--SSIND-----ERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~vs~-----~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |.-.|..-|++++...+...... ......-+.+...... ..... ..-..+....-+|+..++.+-.-|+. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r---~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~ 77 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSR---VRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGIT 77 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCee Confidence 88888888888886665432111 1111111111111100 00000 11123456677888888888888887 Q ss_pred EEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc Q lcl|NC_019705. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +-.. .+.+ ....+.+++. + |. -..+...+..+++.+|.||+.+.++.+|.+ .+..++|..+.+..|+. T Consensus 78 ~~~~-~d~~-----~~~~~~~i~~-~-N~---~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~-~i~~~~p~~~~~i~d~~ 145 (456) T protein:vir:10 78 VGGS-ADSD-----LALRARRIWR-D-NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPL 145 (456) T ss_pred cCCC-CCcc-----hHHHHHHHHH-h-cC---hhhHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEEccceeEEEEcCC Confidence 5221 1111 1122444443 2 32 235556788999999999999988888876 36778888887776642 Q ss_pred e-------EEEEEEeCCce----------------------------EEecHh------HEEEeecC----CCCCcccCc Q lcl|NC_019705. 161 K-------VVYRYQRDSEY----------------------------AEFSQK------EIFHLKGF----GFTGLVGLS 195 (424) Q Consensus 161 ~-------~~~~~~~~~~~----------------------------~~~~~~------eiih~r~~----~~~~~~G~s 195 (424) . ..|....++.. .+.... +.-|.-.. ..+...|+| T Consensus 146 ~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~g 225 (456) T protein:vir:10 146 QPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMG 225 (456) T ss_pred CCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCCCCc Confidence 1 00111001000 000000 00011000 012235777 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC--CCCHHHHHH--HHHHHHHHhCCcccCcceecCCCceeeec Q lcl|NC_019705. 196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK--VLTEQQRSQ--VEENFKEIAGGPVKKRLWILEAGFSTSAI 271 (424) Q Consensus 196 ~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~--~~~~~~~~~--~~~~~~~~~~~~~ag~~~~l~~g~~~~~l 271 (424) .++.....++....+...........+.|..++.-... ...++.-.. ....++.. .+.++.++++.++.++ T Consensus 226 d~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~-----~~~~~~~~~~~~~~q~ 300 (456) T protein:vir:10 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAA-----PGALWELPPGVDIWES 300 (456) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhh-----ccccccCCCCcceEEe Confidence 76666555554443332222222222223323321100 000111011 11112221 2356778888898887 Q ss_pred ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhh----ccChhh Q lcl|NC_019705. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRW----LIPAKD 347 (424) Q Consensus 272 ~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~----l~~~~~ 347 (424) ....- -.|.+..+..+.+|++.-++|+..++.... |.|+..++.....+...+- -..+.|...|.+- +.-... T Consensus 301 ~~~~~-~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N~Sg~Ai~~~~~~l~~k~~-~~~~~f~~~l~~~~rl~~~~~g~ 377 (456) T protein:vir:10 301 QANDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEGAHNIEKGFLFKCE-DRLSIAKIGLEAILVKALQIEGE 377 (456) T ss_pred cccCh-hHHHHHHHHHHHHHHhccCCChHHhccccc-ChHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcCC Confidence 65322 237899999999999999999999986432 2232222222222211111 1111111111111 000111 Q ss_pred hcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC----CCCeeeecccccchhhccccCCCcccC Q lcl|NC_019705. 348 VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP----GGDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 348 ~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~----ggd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) .....++..+......+..+.++++.++.+.|+.+..-+++++|+.+-+ ..+...... ......-.+.|.++| T Consensus 378 ~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~---~~~~~~~~~~~~~~~ 454 (456) T protein:vir:10 378 SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQI---TLFAGNPVQRPQEDG 454 (456) T ss_pred CcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHH---HHHhhhhhhcCCCCC Confidence 1112233334455667788899999999999999998888889986531 111111000 000111134566777 Q ss_pred C Q lcl|NC_019705. 424 A 424 (424) Q Consensus 424 a 424 (424) + T Consensus 455 ~ 455 (456) T protein:vir:10 455 S 455 (456) T ss_pred C Confidence 7 No 147 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.90 E-value=2.6e-09 Score=67.65 Aligned_cols=391 Identities=13% Similarity=0.064 Sum_probs=188.0 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCc--ccccH-----HHHhccHHHHHHHHHHHHhhccCceE Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGD--SSIND-----ERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~vs~-----~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |.-.|..-|++++...+...... ......-+.+...... ..... ..-..+....-+|+..++.+-.-|+. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r---~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~ 77 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSR---VRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGIT 77 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCee Confidence 88888888888886665432111 1111111111111100 00000 11123456677888888888888887 Q ss_pred EEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc Q lcl|NC_019705. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +-.. .+.+ ....+.+++. + |. -..+...+..+++.+|.||+.+.++.+|.+ .+..++|..+.+..|+. T Consensus 78 ~~~~-~d~~-----~~~~~~~i~~-~-N~---~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~-~i~~~~p~~~~~i~d~~ 145 (456) T protein:vir:10 78 VGGS-ADSD-----LALRARRIWR-D-NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPL 145 (456) T ss_pred cCCC-CCcc-----hHHHHHHHHH-h-cC---hhhHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEEccceeEEEEcCC Confidence 5221 1111 1122444443 2 32 235556788999999999999988888876 36778888887776642 Q ss_pred e-------EEEEEEeCCce----------------------------EEecHh------HEEEeecC----CCCCcccCc Q lcl|NC_019705. 161 K-------VVYRYQRDSEY----------------------------AEFSQK------EIFHLKGF----GFTGLVGLS 195 (424) Q Consensus 161 ~-------~~~~~~~~~~~----------------------------~~~~~~------eiih~r~~----~~~~~~G~s 195 (424) . ..|....++.. .+.... +.-|.-.. ..+...|+| T Consensus 146 ~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~g 225 (456) T protein:vir:10 146 QPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMG 225 (456) T ss_pred CCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCCCCc Confidence 1 00111001000 000000 00011000 012235777 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC--CCCHHHHHH--HHHHHHHHhCCcccCcceecCCCceeeec Q lcl|NC_019705. 196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK--VLTEQQRSQ--VEENFKEIAGGPVKKRLWILEAGFSTSAI 271 (424) Q Consensus 196 ~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~--~~~~~~~~~--~~~~~~~~~~~~~ag~~~~l~~g~~~~~l 271 (424) .++.....++....+...........+.|..++.-... ...++.-.. ....++.. .+.++.++++.++.++ T Consensus 226 d~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~-----~~~~~~~~~~~~~~q~ 300 (456) T protein:vir:10 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAA-----PGALWELPPGVDIWES 300 (456) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhh-----ccccccCCCCcceEEe Confidence 76666555554443332222222222223323321100 000111011 11112221 2356778888898887 Q ss_pred ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhh----ccChhh Q lcl|NC_019705. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRW----LIPAKD 347 (424) Q Consensus 272 ~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~----l~~~~~ 347 (424) ....- -.|.+..+..+.+|++.-++|+..++.... |.|+..++.....+...+- -..+.|...|.+- +.-... T Consensus 301 ~~~~~-~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N~Sg~Ai~~~~~~l~~k~~-~~~~~f~~~l~~~~rl~~~~~g~ 377 (456) T protein:vir:10 301 QANDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEGAHNIEKGFLFKCE-DRLSIAKIGLEAILVKALQIEGE 377 (456) T ss_pred cccCh-hHHHHHHHHHHHHHHhccCCChHHhccccc-ChHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcCC Confidence 65322 237899999999999999999999986432 2232222222222211111 1111111111111 000111 Q ss_pred hcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC----CCCeeeecccccchhhccccCCCcccC Q lcl|NC_019705. 348 VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP----GGDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 348 ~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~----ggd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) .....++..+......+..+.++++.++.+.|+.+..-+++++|+.+-+ ..+...... ......-.+.|.++| T Consensus 378 ~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~---~~~~~~~~~~~~~~~ 454 (456) T protein:vir:10 378 SVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQI---TLFAGNPVQRPQEDG 454 (456) T ss_pred CcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHH---HHHhhhhhhcCCCCC Confidence 1112233334455667788899999999999999998888889986531 111111000 000111134566777 Q ss_pred C Q lcl|NC_019705. 424 A 424 (424) Q Consensus 424 a 424 (424) + T Consensus 455 ~ 455 (456) T protein:vir:10 455 S 455 (456) T ss_pred C Confidence 7 No 148 >protein:vir:106027 Length: 629 # NCBI annotation: gp9 # Family: family:all:2798 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654906;genbank:gi:109392362;genbank:GeneID:4157055 Probab=98.88 E-value=2.2e-09 Score=68.05 Aligned_cols=401 Identities=13% Similarity=0.096 Sum_probs=206.6 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccchh---hccc----cccccCcccc-------cHHHHhccHHHHHHHHHHHHh Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQT---GPVS----AHGHLGDSSI-------NDERILQISTVWRCVSLISTL 73 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~---~~~~----~~~~~~~~~v-------s~~~~~~~~~v~~~i~~ia~~ 73 (424) |--.|=| +-+| ....|.+.+.. .+.. ......|... ..+-+-..+.++-.+..++++ T Consensus 1 ma~~~lr-v~rr--------pk~~p~~r~l~aasqp~~P~~~~~~~~~g~~~~~~WQ~eAW~~~d~VgElryyvgW~~ss 71 (629) T protein:vir:10 1 MAASTLR-VSRR--------PKGSPARRSLTAASQPMEPGRTPSRQVAGTVVRTSWQNEAWECMDLVGELRYYVGWRASS 71 (629) T ss_pred CCcccee-EEec--------CCCccceeeeccccCCCCcchhhchhhhhhhhhhhhhHHHHHHHHhhhhHHHHhhhhhhh Confidence 2221111 0011 01111111111 1100 0001111110 012222346777788889999 Q ss_pred hccCceEEEEeccCCccceec--cchHHHH----HhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC----cee Q lcl|NC_019705. 74 TACLPLDVFETDQNDNRKKVD--LSNPLAR----LLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG----DVI 143 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~--~~~~l~~----lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G----~~~ 143 (424) ++++.+..-+-+.|+...-.. .++|-.. ....=-..-+-..++++.+..++-+-|+.|+++.--..+ .+. T Consensus 72 ~Sr~rL~as~idpDtg~ptg~i~ed~p~~~~v~~~v~~iagG~lGqaqLlkr~~~~ltV~GE~~i~il~~~~~~pd~~~r 151 (629) T protein:vir:10 72 CSRVELIASELDPDTGKPTGGIRDDDPDGLRFLEIVKTMAGGPLGQAQLQKRAAECLTVPGEHRICLLDQGDKNPDGSVR 151 (629) T ss_pred heeeeEEEeeecCCCCCCccccccCchhHHHHHHHHHHhcCccchHHHHHHHHHhheeccCceEEEEeecCCCCCCcccc Confidence 999999887766554432211 2333222 222222233567789999999999999999997643333 334 Q ss_pred -EEEEecCceeEEeecCceEEEEEEeCCceEEecHhHEEEeecCCC---CCcccCchHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019705. 144 -SLLPLQSANMDVKLVGKKVVYRYQRDSEYAEFSQKEIFHLKGFGF---TGLVGLSPIAFACKSAGVAVAMEDQQRDFFA 219 (424) Q Consensus 144 -~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~eiih~r~~~~---~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ 219 (424) ..+.|....|. ..+....---..++...+|..+.=+.||-.++ ....--||+.+++..+.-.....+...+..+ T Consensus 152 ~~W~vVt~~Ei~--~kg~g~~~i~lpdg~~he~~~~~D~l~RvW~P~Prr~~e~DSpvra~l~~lrEi~r~tk~i~~aak 229 (629) T protein:vir:10 152 HNWYVVTNDEVK--NKGAGKTDIELPDGTIHEYSKGRDVMFRVWNPRPRRAKEPDSPVRACLDSLREIIRTTKKIRNASK 229 (629) T ss_pred cceeeecHHHhc--cccCceeEEEcCCCceeeeeCCCeeEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHH Confidence 33333444333 22212111222334444554444344554332 2356789999998888877766666665555 Q ss_pred cCCCCceeEEcCCCCC-----------C----------HHHHHHHHHHHHH----HhCCcc-cCc--ceecC-CC---ce Q lcl|NC_019705. 220 NGAKSPQILSTGEKVL-----------T----------EQQRSQVEENFKE----IAGGPV-KKR--LWILE-AG---FS 267 (424) Q Consensus 220 ng~~~~~vl~~~~~~~-----------~----------~~~~~~~~~~~~~----~~~~~~-ag~--~~~l~-~g---~~ 267 (424) .-..-.|||-++.... + ....+.+...+.+ .+...+ +.- ++++. .| -+ T Consensus 230 SRL~gnGvlflP~e~slp~~~ap~~~~~Pg~~~p~~~g~aa~d~l~~~l~q~a~aAi~De~S~aA~vPiia~vP~E~l~~ 309 (629) T protein:vir:10 230 SRLIGNGVVFLPQELSLPRATAPVADNQPGAPVPIVDGVAAADELSNLLFQTAAAAVDDEDSQAALIPLLATVPGEHLQK 309 (629) T ss_pred hHHhhCceeEeccCcccccccCCCCCCCCcccccccCCCcchHHHHHHHHHHHHhhhcCCCCccceeeeEEeechHHhcC Confidence 5455555543332211 0 0122333333332 222221 111 22221 11 23 Q ss_pred eeec--ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019705. 268 TSAI--GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDV-EKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIP 344 (424) Q Consensus 268 ~~~l--~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~-~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~ 344 (424) ++.| .....+. -+..++..+..+|....|||+.|-+. .++|-+ +.-+-...-++--|.|.+..|++.|++.+|. T Consensus 310 ikhLkf~~eite~-~iktR~daI~RlAmglDispErLLGlGsd~NHW--sAWqI~dedvrlHI~P~l~~ic~Ait~~~Lr 386 (629) T protein:vir:10 310 IFHLKIGNEITEV-EIKTRNDAIARLAMGLDVSPERLLGLGSNSNHW--SAWQIGDEDVQLHIKPVMEVLCAAIYREVLV 386 (629) T ss_pred eeeeeecCchhHH-HHhhHHHHHHHHHhccCCChhheeeccCCccce--eeEEecccceeeecchHHHHHHHHHHhHHHH Confidence 3444 3444444 47899999999999999999888665 355543 2233444556667999999999999999886 Q ss_pred hh------hhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC-------------Ceeeecc Q lcl|NC_019705. 345 AK------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGG-------------DVAMRQS 405 (424) Q Consensus 345 ~~------~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~gg-------------d~~~~~~ 405 (424) +- +...|-+.||.+.|.. | -++.+-+..+...|.+|-...|+.+|+.--++= |....+. T Consensus 387 p~L~~eGiDp~~Yvvw~DaS~Lt~-d-Pd~~deA~~a~drGaIt~eAlRr~lG~~~dd~y~~~t~~~~q~~A~~~v~~~P 464 (629) T protein:vir:10 387 ATLRAEGIDPDRYVLWYDASGLTV-D-PDKTDEATAAKEQGAITHEAYRRYLGLADEDGYDLETLEGAQAWARDAIVADP 464 (629) T ss_pred HHHHHhCCCHHHhEeeecCccccc-C-CCCcHHHHHHHHcCCccHHHHHHHhccccccCCCcCCcHHHHHHHHHHhcCCC Confidence 53 1224677899988743 2 245666677889999999999999999643221 1111111 Q ss_pred cccch---------------------hhccccCCC-----------cccCC Q lcl|NC_019705. 406 QYVPI---------------------TDLGTNKEP-----------RNNGA 424 (424) Q Consensus 406 n~~~~---------------------~~~~~~~~~-----------~~~ga 424 (424) .++++ ...++.+++ .+++| T Consensus 465 ~Li~~~apll~~~l~~i~~P~p~~a~~~~~~~~~~~E~~~~~~e~~~e~dA 515 (629) T protein:vir:10 465 SLIKVLAPLLTDELAEIDWPEPPAALPPGEDDQADEEQDTTGSEPSTEDDA 515 (629) T ss_pred chhhhhhhhcCCccccccccCCCCcCCCCCcccCccccCCCCCCcCCCcch Confidence 11111 001111111 11111 No 149 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.87 E-value=2.9e-09 Score=67.37 Aligned_cols=358 Identities=9% Similarity=-0.010 Sum_probs=167.0 Q ss_pred ccCcccc-cHHHH---hccHHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHH Q lcl|NC_019705. 46 HLGDSSI-NDERI---LQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMT 121 (424) Q Consensus 46 ~~~~~~v-s~~~~---~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~ 121 (424) .+....- ..+.. ....+..-||+.+++.+---.|.+ .++. ....+.+++. + |. .......+. T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~~----~d~~-----~~~~~~~i~~-~-N~---~d~~~~~~~ 66 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFCGLIANASVHRLLALGVTG----PDGE-----PDTRASRWWQ-A-NR---LDSRQKLVW 66 (434) T ss_pred CCCCCccHHHHHhhhhhhccchHHHHHHHHhhhccCceec----CCCc-----hHHHHHHHHH-h-cC---hhHHHHHHH Confidence 0000000 00000 112344567777776554333432 1211 1223444443 2 32 235667788 Q ss_pred HHHHHcCCeEEEEeeCCCCce------eEEEEecCceeEEeecCce------EEEEE-EeCCce----------EE---- Q lcl|NC_019705. 122 MQLCFYGNAYALVDRNSAGDV------ISLLPLQSANMDVKLVGKK------VVYRY-QRDSEY----------AE---- 174 (424) Q Consensus 122 ~~~ll~G~a~~~~~r~~~G~~------~~l~~l~~~~v~~~~~~~~------~~~~~-~~~~~~----------~~---- 174 (424) .+++.+|.||+.+.++.++.. ..+.+++|.++.+..|... ..|.+ ..++.. .. T Consensus 67 ~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~ 146 (434) T protein:vir:98 67 RMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDIDGFGYARVFFDDTSFPYRTR 146 (434) T ss_pred HHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccCCceEEEEEEeCcEEEEEEe Confidence 999999999999987665432 2367789988887776421 11100 000000 00 Q ss_pred ---------e----------------cHhH--EEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCcee Q lcl|NC_019705. 175 ---------F----------------SQKE--IFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQI 227 (424) Q Consensus 175 ---------~----------------~~~e--iih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~v 227 (424) . +-.. |+||++....+-.|.|-++.....++....+.........-.+.|..+ T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~ 226 (434) T protein:vir:98 147 ERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKW 226 (434) T ss_pred eccccccccccccceecccccccccCCCCccceEEeccCCCcCcCCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhh Confidence 0 0011 344443321122588877777777766665555544444444555544 Q ss_pred EEcCCC-CCCHHHHHHHHHHHHHHhCCcccCcceecC-CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC Q lcl|NC_019705. 228 LSTGEK-VLTEQQRSQVEENFKEIAGGPVKKRLWILE-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDV 305 (424) Q Consensus 228 l~~~~~-~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~ 305 (424) +.-... ...++. ......++..... .++++.++ ++.++.++.....+ .+++.++..+.+|+..=++|+..++.. T Consensus 227 i~G~~~~~~~~~~-~~~~~~~~~~~~~--~~~i~~~~~~~~~~~q~~~~~~~-~~~~~l~~~i~~~~~~~~~p~~~~~~~ 302 (434) T protein:vir:98 227 IKGHKFAKRTDPA-TGMTVVDQPFVPS--PSAVWASEGENTQFGQLDATDLS-GFLKEHASDVRDMLTISQTPTYLYATD 302 (434) T ss_pred hcCCCcccccccc-cccchhhhhhhcc--ccccccCCCCCceEEEecCcchH-HHHHHHHHHHHHHhcccCCCHHHhccc Confidence 431100 001111 1111222222221 23466665 35677777654333 378888999999999999999999853 Q ss_pred CCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhh--ccChh-h--hcccchhhhhhhhhccCHHHHHHHHHHHHhCCC Q lcl|NC_019705. 306 EKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRW--LIPAK-D--VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGL 380 (424) Q Consensus 306 ~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~--l~~~~-~--~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~ 380 (424) . ++.|+...+.+...+...+-. ..+.|.+.|.+- |.... + .....+++.+......+..+.++++.++.+.|+ T Consensus 303 ~-~n~Sg~Al~~~~~~l~~k~~~-k~~~f~~~l~~~~rl~~~~~g~~~~~~~~~v~w~~~~~~s~~~~ada~~kl~~~g~ 380 (434) T protein:vir:98 303 L-VNISADTIGALDILHVAKVRE-HIASFSEGLESVLALAAAQAGVPEDYTEAEVRWANPAHVTMAVKADAATKLKSIGY 380 (434) T ss_pred c-CChHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhcCCChhheeeeEEecCCCCCCHHHHHHHHHHHHhcCC Confidence 2 233322222222222221111 122222222211 10000 0 011223344455667888999999999998886 Q ss_pred CCHHHHHHHhCCCCCC------C--CCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 381 RTINEMRRTDNLPPLP------G--GDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 381 ~t~NE~R~~~g~~p~~------g--gd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) +..-+++.+|+++-+ . .+...................+.+++| T Consensus 381 -~~e~~~~~lg~~~~e~~r~~~e~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 431 (434) T protein:vir:98 381 -PLDVIAEELDESPARVRRIVAGAASQALLAASLLPAPGAPSAGNVPDSGGA 431 (434) T ss_pred -cHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCcccCC Confidence 777788888887631 0 000000000000011112223333333 No 150 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=98.82 E-value=2.1e-08 Score=62.63 Aligned_cols=347 Identities=8% Similarity=0.015 Sum_probs=166.7 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCc--ccccHH---HH-hccHHHHHHHHHHHHhhccCceEE Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGD--SSINDE---RI-LQISTVWRCVSLISTLTACLPLDV 81 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~vs~~---~~-~~~~~v~~~i~~ia~~ia~~~~~v 81 (424) |+.+ .+++|...+...... .......+.+-..+.. ..+.++ .+ +-..+..-+|+.+++.+.=-.|. T Consensus 1 ~~~~----~i~~L~~~~~~~~~r---~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~- 72 (409) T protein:vir:94 1 MTEK----GIGYLRFKLSVHKRR---AEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFE- 72 (409) T ss_pred CCHH----HHHHHHHHHHHHhHH---HHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcccCccc- Confidence 4332 233333322221111 1011111111111111 111111 00 11234555666666643322222 Q ss_pred EEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCce Q lcl|NC_019705. 82 FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK 161 (424) Q Consensus 82 ~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~ 161 (424) . .+ ..+.+++. + |.. ......+..+.+.+|.||+.+..+.+|.| .+.+++|..+.+..|... T Consensus 73 --~-~d---------~~l~~i~~-~-N~l---d~~~~~~~~~aliyG~sf~~v~~~~dg~~-~i~~~sp~~~~~i~D~~~ 134 (409) T protein:vir:94 73 --N-DD---------FTVNEIFE-E-NNP---DIFFDSAVLSSLIASCSFTYISKGENDAV-RLQVIEAVNATGIIDPIT 134 (409) T ss_pred --C-Cc---------hHHHHHHH-h-cCh---hHHHHHHHHHHHHhcceeEEEecCCCCce-EEEEeccceEEEEEecCC Confidence 1 11 12444443 2 322 34556788899999999999999888876 677889988887766421 Q ss_pred --E--EEEEEeC---Cce---EEecHhH----------------------EEEeecC-CCCCcccCch----HHHHHHHH Q lcl|NC_019705. 162 --V--VYRYQRD---SEY---AEFSQKE----------------------IFHLKGF-GFTGLVGLSP----IAFACKSA 204 (424) Q Consensus 162 --~--~~~~~~~---~~~---~~~~~~e----------------------iih~r~~-~~~~~~G~s~----i~~~~~~i 204 (424) . .|.+... +.. ..+.+++ |++|.+. ..+..+|.|. +..+.+.+ T Consensus 135 ~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~ 214 (409) T protein:vir:94 135 GLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNA 214 (409) T ss_pred CceeeeEEEEEecCCCceEEEEEEecCcEEEEEecCceeEeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHH Confidence 1 1111110 000 1112222 3444332 2345677774 45556666 Q ss_pred HHHHHHHHHHHHHHhcCCCCceeEE-cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-----CCceeeecccChhHH Q lcl|NC_019705. 205 GVAVAMEDQQRDFFANGAKSPQILS-TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-----AGFSTSAIGVTPQDA 278 (424) Q Consensus 205 ~~~~~~~~~~~~~~~ng~~~~~vl~-~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-----~g~~~~~l~~~~~d~ 278 (424) .....-......||.+ |.-++. .+.+ ....+.++.... +++.++ .+.++.++....-+ T Consensus 215 ~r~~~~~~~~~e~~a~---pqr~i~G~d~d---~~~~~~~~~~~~---------~i~~~~~d~dg~~~~v~q~~~~~l~- 278 (409) T protein:vir:94 215 KRTLERADVTAEFYSF---PQKYVTGLSDD---AEPMETWKATVS---------SMLQFTKDEDGDKPTLGQFTQPSMS- 278 (409) T ss_pred HHHHHHHHHHHHHhcC---hhheeEecCCC---CcccchhhhhHH---------HhhcCCCCCCCCCceEEecCCCChh- Confidence 6655555566666655 443443 2211 112222332222 244443 23455555443222 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHH---HHHHHHHHHHHh--hccChhh--hc-c Q lcl|NC_019705. 279 EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ---PYISRWENSIQR--WLIPAKD--VG-R 350 (424) Q Consensus 279 ~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~---P~~~~ie~~l~~--~l~~~~~--~~-~ 350 (424) .|++.++..+.++|+.-++|++.+|....+.+|...++.+...+...+=. -+-..+++.+-. .+..... .. . T Consensus 279 ~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~ 358 (409) T protein:vir:94 279 PFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQF 358 (409) T ss_pred HHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccccc Confidence 48999999999999999999999998665333333333333222222211 122222222211 1111111 00 1 Q ss_pred cchhhhhhhhhccCH---HHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCC Q lcl|NC_019705. 351 IHAEHNLDGLLRGDS---ASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP 396 (424) Q Consensus 351 ~~~~fd~~~l~~~d~---~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~ 396 (424) ..+++.+..+...+. .+.++++.+++++| +...+-+++++|+..-+ T Consensus 359 ~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 359 RKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred ccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 122333333333343 45678889999998 66779999999998755 No 151 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.82 E-value=5.4e-09 Score=65.92 Aligned_cols=392 Identities=13% Similarity=0.053 Sum_probs=179.4 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccchhhccccccccC--cccc-----cHHHHhccHHHHHHHHHHHHhhccCceE Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLG--DSSI-----NDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v-----s~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |.--|-.-|++++.......... ......-+.+..... +... ....-..+.....+|+..++.+-.-|+. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r---~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~ 77 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSR---VRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGIT 77 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHH---HHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCee Confidence 44445445555544433221110 000000111111000 0000 0011122346677889888888888887 Q ss_pred EEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc Q lcl|NC_019705. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~ 160 (424) +. ...+.. ....+.+++.. |. ...+...+..+++.+|.||+.+.++.+|.+ .+..++|..+.+..++. T Consensus 78 ~~-~~~d~~-----~~~~~~~~~~~--n~---~d~~~~~~~~~a~~~G~a~~~~~~~edg~~-~i~~~~p~~~~~i~d~~ 145 (456) T protein:vir:79 78 VG-GSADSD-----LALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPL 145 (456) T ss_pred cC-CCCCcc-----HHHHHHHHHHh--cC---hhHHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEeccceeEEEEcCC Confidence 52 211111 11234445442 22 235667889999999999999988888887 57888898887776642 Q ss_pred eE-------EEEEEeCCce---EEec-------------------------------HhHEEEeecC----CCCCcccCc Q lcl|NC_019705. 161 KV-------VYRYQRDSEY---AEFS-------------------------------QKEIFHLKGF----GFTGLVGLS 195 (424) Q Consensus 161 ~~-------~~~~~~~~~~---~~~~-------------------------------~~eiih~r~~----~~~~~~G~s 195 (424) .. .|....++.. ..+. ..++-|.-.. ..+...|+| T Consensus 146 ~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~~~g 225 (456) T protein:vir:79 146 QPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMG 225 (456) T ss_pred CCCceEEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEecCCCCCc Confidence 10 0110000000 0000 0111111000 011234566 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCC--CCCCHHHHHH--HHHHHHHHhCCcccCcceecCCCceeeec Q lcl|NC_019705. 196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE--KVLTEQQRSQ--VEENFKEIAGGPVKKRLWILEAGFSTSAI 271 (424) Q Consensus 196 ~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~--~~~~~~~~~~--~~~~~~~~~~~~~ag~~~~l~~g~~~~~l 271 (424) -++.....++....+..-........+.|..++.-.. ....++.-+. ....+... .+.++.++++.++.++ T Consensus 226 d~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~-----~~~~~~~~~~~~~~q~ 300 (456) T protein:vir:79 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAA-----PGALWELPPGVDIWES 300 (456) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhh-----ccccccCCCCcceeee Confidence 5555544444333222111111122222222221100 0000110000 11112211 2356777888888877 Q ss_pred ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHH---HHHHHHHHHHHhhccChhhh Q lcl|NC_019705. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ---PYISRWENSIQRWLIPAKDV 348 (424) Q Consensus 272 ~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~---P~~~~ie~~l~~~l~~~~~~ 348 (424) ..+.-+ .+.+..+..+.+|+..-++|+..++.... |.|+...+.....+...+-. -+-..|++.+..-+-..... T Consensus 301 ~~~~~~-~~~~~l~~~i~~i~~~t~~p~~~~~~~~~-N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~~ 378 (456) T protein:vir:79 301 QTNDFT-PMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES 378 (456) T ss_pred cccChH-HHHHHHHHHHHHHHhhcCCChhHhccccc-CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 654332 37889999999999999999999986432 23333333222222221111 11111222111111001111 Q ss_pred cccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC--C--CCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 349 GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPL--P--GGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 349 ~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~--~--ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ....++..+......+..+.++++.++++.|+.+..-+++.+|+.+- + ..+......+.. ... --+.+..+|| T Consensus 379 ~~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~~i~~~e~~r~~~e~~~~-~~~--~~~~~~~~~~ 455 (456) T protein:vir:79 379 VEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLF-AGN--PVQRPQEDGS 455 (456) T ss_pred ccccceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHHHHHHHHHHHHHHHHHHH-hhh--HhhcCCCCCC Confidence 11223333344556777889999999999999999888888998663 1 111111111100 111 1234666677 No 152 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=98.72 E-value=4.3e-08 Score=60.95 Aligned_cols=394 Identities=11% Similarity=0.082 Sum_probs=171.0 Q ss_pred CCCCcccccCCCCCc-hHHHHHhhccCcccCCccccchhhccccccccC--cccccH---HHHhccHHHHHHHHHHHHhh Q lcl|NC_019705. 1 MEEPKYTIDLRTNNG-WWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLG--DSSIND---ERILQISTVWRCVSLISTLT 74 (424) Q Consensus 1 ~~~~~~~~~~~~~~G-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~vs~---~~~~~~~~v~~~i~~ia~~i 74 (424) |.-|-=-|+.-.+.. +++.|...+...... .......+.+-.... +..+.. ..-........+|+..++.+ T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~r---~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l 77 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDSTQN---LKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQ 77 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhh Confidence 433333333222222 333444433322110 000000111111000 000000 11111234456777766655 Q ss_pred ccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce-------eEEEE Q lcl|NC_019705. 75 ACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV-------ISLLP 147 (424) Q Consensus 75 a~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~-------~~l~~ 147 (424) --.+|. ..++. .....+.+++. + | ....+...+..+++.+|.||+.+.++..+.. ..+.+ T Consensus 78 ~~~g~~---~~~~~-----~~~~~~~~i~~-~-N---~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~ 144 (485) T protein:vir:10 78 AVEGFR---FGDAD-----EADEELWQWWQ-A-N---NLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRV 144 (485) T ss_pred ccccee---cCCCc-----hhHHHHHHHHH-h-c---CHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEE Confidence 333333 22111 11223444443 2 2 2346778899999999999999988765432 24777 Q ss_pred ecCceeEEeecCce--E----EEEEEeCCce----EEecHhH-------------------------EEEeecC-CCCCc Q lcl|NC_019705. 148 LQSANMDVKLVGKK--V----VYRYQRDSEY----AEFSQKE-------------------------IFHLKGF-GFTGL 191 (424) Q Consensus 148 l~~~~v~~~~~~~~--~----~~~~~~~~~~----~~~~~~e-------------------------iih~r~~-~~~~~ 191 (424) ++|..+.+..|+.. . .+.+...+.. ..+.++. |++|.+. ...+. T Consensus 145 ~~p~~~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~ 224 (485) T protein:vir:10 145 EPPTRMYAEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDL 224 (485) T ss_pred EccceeEEEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCcccEEEeccccccCCC Confidence 88888877665321 1 1111111100 0122222 3444432 23445 Q ss_pred ccCchHH----HHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHH--HHHHHHHHHHhCCcccCcceecC-C Q lcl|NC_019705. 192 VGLSPIA----FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR--SQVEENFKEIAGGPVKKRLWILE-A 264 (424) Q Consensus 192 ~G~s~i~----~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~--~~~~~~~~~~~~~~~ag~~~~l~-~ 264 (424) +|.|-+. .+.+.+.....-......+|. .|..++.-. . .++... ..-...++.. .++++.++ + T Consensus 225 ~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a---~p~~~i~G~-~-~~~~~~~~~~~~~~~~~~-----~~~i~~~~~~ 294 (485) T protein:vir:10 225 YGTSEITPELRSMTDAAARILMLMQATAELMG---VPQRLIFGI-K-PEEIGVDPETGQTLFDAY-----LARILAFEDA 294 (485) T ss_pred CCccchhHHHHHHHHHHHHHHHHHHHHHHhhc---chHHHHhcC-C-cccccccccccchhhhhc-----ccceeccCCC Confidence 7777543 333444333333333334443 343333211 0 010000 0011111111 23456655 4 Q ss_pred CceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHh---- Q lcl|NC_019705. 265 GFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR---- 340 (424) Q Consensus 265 g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~---- 340 (424) +.++.++....-+ .+++..+..+.+|+.+=++|+..+|....+..|+.........+...+- -....+...|.+ T Consensus 295 d~k~~q~~~~~~~-~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~-~k~~~f~~~l~~~~~l 372 (485) T protein:vir:10 295 EGKIQQFSAAELA-NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVE-RKNSIFGGAWEEAMRL 372 (485) T ss_pred CceEEeecccchH-HHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Confidence 5677776654322 3788888999999999999999998654433332222222222221111 111111111111 Q ss_pred --hccChhhh--cccchhhhhhhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCC--CCCe------------ Q lcl|NC_019705. 341 --WLIPAKDV--GRIHAEHNLDGLLRGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP--GGDV------------ 400 (424) Q Consensus 341 --~l~~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~--ggd~------------ 400 (424) .+....+. ....+++.+......+..+.++++.+++++| +++..-+++.+|+.+-+ .... T Consensus 373 ~~~~~~~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~ 452 (485) T protein:vir:10 373 AYRMMKGGDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGL 452 (485) T ss_pred HHHHhCCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHH Confidence 11111111 1133445555566788888999999999866 88888889999986532 1110 Q ss_pred ---eeecccccchhhccc------c--CCCcccCC Q lcl|NC_019705. 401 ---AMRQSQYVPITDLGT------N--KEPRNNGA 424 (424) Q Consensus 401 ---~~~~~n~~~~~~~~~------~--~~~~~~ga 424 (424) +..+.... ++..+ + +....+|| T Consensus 453 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 453 IGTMVDPNPTV--PGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred HHHhhccCCCC--CCCCCccccccCcCCCCCCCCC Confidence 11111110 00000 0 01111222 No 153 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=98.65 E-value=8.5e-08 Score=59.35 Aligned_cols=407 Identities=11% Similarity=0.075 Sum_probs=194.4 Q ss_pred CCCCcccccCCCCC---chHHHHHhhccCccc---CCccccchhhccc----cccccCccccc--------HHHHhccHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNN---GWWARLQSWFVGGRL---VTPNQGSQTGPVS----AHGHLGDSSIN--------DERILQIST 62 (424) Q Consensus 1 ~~~~~~~~~~~~~~---G~~~~~~~~~~~~~~---~~~~~~~~~~~~~----~~~~~~~~~vs--------~~~~~~~~~ 62 (424) |..-|+-+++.... -+++.+.++ ..+.. +.+.......++. .....++..-+ ++.++++|. T Consensus 1 ~~~~~~w~~~de~~~~~~~~~~~~~~-~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma~~~pE 79 (533) T protein:vir:58 1 MPSLEKYKKLNEAVNFTNFLSPMYGM-GAPHGAGGSSMIPINMYHPFATAGYASRFYGGIEFNRFFLYDMYDRMDYTDPL 79 (533) T ss_pred CCCcchhhhhhHHHHHHHhhchhhcc-cCccCCCCCccccCCCCcchhhhhhhhhhhccccccHHHHHHHHHHhhccCcc Confidence 43333333332221 222333222 11111 0111111111110 01111222112 334467899 Q ss_pred HHHHHHHHHHhhccC-----ceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee- Q lcl|NC_019705. 63 VWRCVSLISTLTACL-----PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR- 136 (424) Q Consensus 63 v~~~i~~ia~~ia~~-----~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r- 136 (424) |..||+.|.+.+.-. |+.+--.+. ...+... ..+..+|+ ...--+.++..|...|..|..++- T Consensus 80 Vd~AideIvneaiv~d~~~~pV~v~l~~~--e~s~~iK-~kI~~lld--------f~~~~~~~fR~WYVDGriy~Hkiik 148 (533) T protein:vir:58 80 ISTVLDIIADECTIPNENGNIVDVVTKDI--ELAKAIL-SYLDYVIN--------IEKNAYPIIRNMIKYGDMFLHILEK 148 (533) T ss_pred hhhHHHhhhceeeEecCCCceeEeecccc--cccHHHH-HHHHHHhc--------chhhhhHHHHhhhhcceeEEEeccC Confidence 999999999986643 333321111 1111111 12222222 222334567788899999988753 Q ss_pred CCCCceeEEEEecCceeEEeecC--ceEEEEEE-------eCCceEEecHhHEEEeec--CCCCCcccCchHHHHHHHHH Q lcl|NC_019705. 137 NSAGDVISLLPLQSANMDVKLVG--KKVVYRYQ-------RDSEYAEFSQKEIFHLKG--FGFTGLVGLSPIAFACKSAG 205 (424) Q Consensus 137 ~~~G~~~~l~~l~~~~v~~~~~~--~~~~~~~~-------~~~~~~~~~~~eiih~r~--~~~~~~~G~s~i~~~~~~i~ 205 (424) +..+-+.+|..|+|.+|+.+++- +..+|.|. .+.....++.+.|+|+.. ...++.+++|-+..+.+.+. T Consensus 149 ~~k~GI~elr~lDPr~i~~vr~~~t~~eyyvy~~~~~~~~s~~~~~kI~~daI~y~~SGl~d~~~~~iisyLhkAiKp~N 228 (533) T protein:vir:58 149 GSDGTIEKFQVVSPYIFSKRYNPETDTWYYVITDVYRNVVSGYFNEDIPEEDVIHFSHKIDTNFFPYGRSYLESARAIWN 228 (533) T ss_pred CcccchhhheecCCeeeEEEEeeccceEEEeecccccccccCccccccchhheeeeeeccccCCCCceehhhhHHHHHHH Confidence 35567889999999999876653 34445454 233456789999999975 34567788999999988888 Q ss_pred HHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH-HHHHHHHHHHHHhC---Cc-ccCcc------e----ec-------- Q lcl|NC_019705. 206 VAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKEIAG---GP-VKKRL------W----IL-------- 262 (424) Q Consensus 206 ~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~-~~~~~~~~~~~~~~---~~-~ag~~------~----~l-------- 262 (424) ....++....-+--.-+.-+-|+.++.+++... +.+-++....++.. -+ +.|.+ + .| T Consensus 229 QLkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRR 308 (533) T protein:vir:58 229 QLRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRR 308 (533) T ss_pred HHHHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhccccc Confidence 777777766655545454455777776665432 23334444433321 11 22222 1 22 Q ss_pred --CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019705. 263 --EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR 340 (424) Q Consensus 263 --~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~ 340 (424) ..|.++..|.-. .+.-++-.++..+.+.++++||.+-|+..++...+. .+ ....-=+...|.-+-..|.+.|.. T Consensus 309 eGgrgTEI~TLpGg--~lgemeDV~YF~kkLy~ALnVP~sRl~~e~~fgr~~-eI-tRDEiKF~KFI~rLR~rF~~ll~~ 384 (533) T protein:vir:58 309 GDRRAVEIDILQGS--KVDLAEDVEYMLNRLISALKVPKAFIGYEGDVNAKN-TL-ATQDIKFNNTIKRIQGFFVEELER 384 (533) T ss_pred CCCccceeeecCCC--CCCcHHHHHHHHHHHHHHhCCCeeecCCCCCCccch-hh-hHHHHHHHHHHHHHHHHHHHHHhc Confidence 135677776542 255667788889999999999999887654422211 11 111122445556666677788887 Q ss_pred hccChhhhc--ccchhhhhhhhh----ccC-HHHHHHHHHHH---HhC-----C--CCCHHHHH------HHhCCCCC-- Q lcl|NC_019705. 341 WLIPAKDVG--RIHAEHNLDGLL----RGD-SASRAAFMKAM---GEA-----G--LRTINEMR------RTDNLPPL-- 395 (424) Q Consensus 341 ~l~~~~~~~--~~~~~fd~~~l~----~~d-~~~~~~~~~~~---~~~-----g--~~t~NE~R------~~~g~~p~-- 395 (424) +|....... .+.+.|..|... ... ...|..++..+ ++. . -+| +|+. +..+..++ T Consensus 385 qLilk~iit~eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~t-dei~~q~e~ie~E~~~~~~~ 463 (533) T protein:vir:58 385 MVRMNKEFADQDFRLVMNRSNSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIP-YDLKPQEEVAEAAGGGGLFD 463 (533) T ss_pred ccccccCcchhheeeeeeccchHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCC-hhhhHHHHHHHHhhcCCCCC Confidence 776544321 233333332221 110 11222222221 110 0 122 2222 22222221 Q ss_pred -CCCCeeeecc----ccc-chhh--------ccccCCCcccCC Q lcl|NC_019705. 396 -PGGDVAMRQS----QYV-PITD--------LGTNKEPRNNGA 424 (424) Q Consensus 396 -~ggd~~~~~~----n~~-~~~~--------~~~~~~~~~~ga 424 (424) |+-++-+.|. ... |++. .++..++..+|+ T Consensus 464 ~~~~~~e~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~ 506 (533) T protein:vir:58 464 TGGFGEETTPADFLGERGSPIESPRGRTEFDFGTEGGEELGGE 506 (533) T ss_pred CCCcccccCCcccCccccCcccCCCChhhHhcccCCccccccc Confidence 1111111111 111 2221 111111111111 No 154 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=98.64 E-value=1.5e-07 Score=58.03 Aligned_cols=347 Identities=8% Similarity=0.011 Sum_probs=165.1 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCc--ccccHHH---H-hccHHHHHHHHHHHHhhccCceEE Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGD--SSINDER---I-LQISTVWRCVSLISTLTACLPLDV 81 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~vs~~~---~-~~~~~v~~~i~~ia~~ia~~~~~v 81 (424) |+.+ .+++|...+...... .......+.+-..+.. ..+.++. + +...+..-+|+.+++.+.=-.|. T Consensus 1 ~~~~----~i~~L~~~~~~~~~r---~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~- 72 (409) T protein:vir:16 1 MTEK----GIGYLRFKLSVHKRR---AEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFE- 72 (409) T ss_pred CCHH----HHHHHHHHHHHHhHH---HHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhccccccc- Confidence 4332 223332222211110 0011111111111111 1111100 0 11234455666666544322332 Q ss_pred EEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCce Q lcl|NC_019705. 82 FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK 161 (424) Q Consensus 82 ~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~ 161 (424) . .+ ..+.+++. + |. .......+..+.+.+|.||+.+..+.+|.| .+.+++|.++....|... T Consensus 73 --~-~d---------~~l~~i~~-~-N~---ld~~~~~~~~~al~yG~sf~~v~~~~dg~~-~i~~~sP~~~~~i~D~~~ 134 (409) T protein:vir:16 73 --N-DD---------FTVNEIFE-E-NN---PDIFFDSTVLSALIASCSFTYISKGENDAV-RLQVIEATNATGIIDPIT 134 (409) T ss_pred --C-cc---------hHHHHHHH-h-cC---hhHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEeeccc Confidence 1 11 12444443 2 32 234556788899999999999999888875 677888888877665421 Q ss_pred ----EEEEEEe-C--Cce---EEecHhH----------------------EEEeecC-CCCCcccCc----hHHHHHHHH Q lcl|NC_019705. 162 ----VVYRYQR-D--SEY---AEFSQKE----------------------IFHLKGF-GFTGLVGLS----PIAFACKSA 204 (424) Q Consensus 162 ----~~~~~~~-~--~~~---~~~~~~e----------------------iih~r~~-~~~~~~G~s----~i~~~~~~i 204 (424) ..+.+.. + +.. ..+.+++ |++|.+. ..+..+|.| |+..+.+.+ T Consensus 135 ~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~ 214 (409) T protein:vir:16 135 GLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNA 214 (409) T ss_pred ccceeeeEEEEecCCCceEEEEEEecCcEEEEEecCccccceecCCCCcceEEecccccccccCCccccchhHHHHHHHH Confidence 1111110 0 000 0111222 3444432 234567776 455666666 Q ss_pred HHHHHHHHHHHHHHhcCCCCceeEE-cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-----CCceeeecccChhHH Q lcl|NC_019705. 205 GVAVAMEDQQRDFFANGAKSPQILS-TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-----AGFSTSAIGVTPQDA 278 (424) Q Consensus 205 ~~~~~~~~~~~~~~~ng~~~~~vl~-~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-----~g~~~~~l~~~~~d~ 278 (424) .....-......||.+ |.-++. .+.+. ...+. |+.. .++++.++ .+.++.++....-+ T Consensus 215 ~r~~~~~~~~~e~~a~---pqr~i~G~d~d~---~~~~~----~~~~-----~~~i~~~~~d~~g~~~~v~q~~~~~l~- 278 (409) T protein:vir:16 215 KRTLERADVTAEFYSF---PQKYVTGLSDDA---EPMET----WKAT-----VSSMLQFTKDEDGDKPTLGQFTQPSMS- 278 (409) T ss_pred HHHHHHHHHHHHHhcC---hhheeEecCCCC---Cccch----hhhh-----hhHhhccCCCCCCCCceEEecCCCChh- Confidence 6666666666777755 444443 22111 11122 2221 12355553 23456566543322 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHH---HHHHHHHHHHHhhcc--Chhh--hc-c Q lcl|NC_019705. 279 EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ---PYISRWENSIQRWLI--PAKD--VG-R 350 (424) Q Consensus 279 ~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~---P~~~~ie~~l~~~l~--~~~~--~~-~ 350 (424) .|++..+..+.++|++=++|++.+|....+-+|...++.+...+...+-. -+-..+++.+-.-+. ...+ .. . T Consensus 279 ~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~ 358 (409) T protein:vir:16 279 PFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLREQF 358 (409) T ss_pred HHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhh Confidence 48999999999999999999999997654323322333333222222211 122222222111111 1110 00 0 Q ss_pred cchhhhhhhhhc---cCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCC Q lcl|NC_019705. 351 IHAEHNLDGLLR---GDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP 396 (424) Q Consensus 351 ~~~~fd~~~l~~---~d~~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~ 396 (424) ..+++.+..... .+..+.++++.+++++| +...+-+++++|+..-+ T Consensus 359 ~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 359 SKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred ccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 122333333322 23567888899999986 44457789999997755 No 155 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=98.56 E-value=2.2e-07 Score=57.10 Aligned_cols=383 Identities=12% Similarity=0.067 Sum_probs=161.4 Q ss_pred CCCCccc---cc-C----CCCCchHHHHHhhccCcccCCccccchhhccccccccCccccc---HHHHhccHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYT---ID-L----RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSIN---DERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~~~~~~---~~-~----~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs---~~~~~~~~~v~~~i~~ 69 (424) .+++--+ ++ | ..++.=+..+..+..+.... + . .+..+. ......+....-+|+. T Consensus 8 ~~~~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i-~----------~----~~~~~~~~~~~~~~~~n~~~~ivd~ 72 (485) T protein:vir:24 8 QEEIADPAIARDEMVSAFEDQNQNLRSNTSYYEAERRP-E----------A----IGVTVPVQMQSLLAHVGYPRLYVDS 72 (485) T ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCch-h----------h----cCcccchhhhhhhhccchHHHHHHH Confidence 1111110 00 0 00111111122222211100 0 0 000000 0111122344556666 Q ss_pred HHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCcee------ Q lcl|NC_019705. 70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVI------ 143 (424) Q Consensus 70 ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~------ 143 (424) .++.+.-.+|.+ ..+. .....+.+++.. | ........+..+++.+|.||+++.++.++.+. T Consensus 73 ~~~~l~~~g~~~---~~~~-----~~~~~l~~i~~~--N---~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~ 139 (485) T protein:vir:24 73 IAERQAVEGFRL---GDAD-----EADEELWQWWQA--N---NLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNV 139 (485) T ss_pred HhhhhccCceec---CCCc-----hhHHHHHHHHHh--c---ChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCc Confidence 666654445542 1111 111234555542 2 23466788999999999999999887765432 Q ss_pred -EEEEecCceeEEeecCce------EEEEEEeCCce----EEecHhH-------------------------EEEeecCC Q lcl|NC_019705. 144 -SLLPLQSANMDVKLVGKK------VVYRYQRDSEY----AEFSQKE-------------------------IFHLKGFG 187 (424) Q Consensus 144 -~l~~l~~~~v~~~~~~~~------~~~~~~~~~~~----~~~~~~e-------------------------iih~r~~~ 187 (424) .+.+++|..+.+..|... ..+.+...+.. ..+.++. |++|++.. T Consensus 140 ~~i~~~~p~~~~~i~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~ 219 (485) T protein:vir:24 140 PLIRVEPPTRMYAEIDPRIGRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRT 219 (485) T ss_pred ceEEEeccceeEEEeeCCcCceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCcccEEEeccCc Confidence 577888888877665421 00111110000 0111111 34554332 Q ss_pred -CCCcccCchHHH-HHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHH--HHHHHHHHHHHhCCcccCcceecC Q lcl|NC_019705. 188 -FTGLVGLSPIAF-ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ--RSQVEENFKEIAGGPVKKRLWILE 263 (424) Q Consensus 188 -~~~~~G~s~i~~-~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~--~~~~~~~~~~~~~~~~ag~~~~l~ 263 (424) ..+.+|.|-+.- +...++....+...........+.|..++.- .. .++.. .+.-...++. ..+.++.++ T Consensus 220 ~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G-~~-~~~~~~~~~~~~~~~~~-----~~~~i~~~~ 292 (485) T protein:vir:24 220 RLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFG-IK-PEEIGVDPETGQTLFDA-----YLARILAFE 292 (485) T ss_pred ccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhcc-CC-ccccccccccccchhhh-----cccceeccC Confidence 345678876542 2333333222222222222223334444431 11 01000 0001111211 123455554 Q ss_pred -CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHH----------HHHHHHHHHH Q lcl|NC_019705. 264 -AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF----------LQYTLQPYIS 332 (424) Q Consensus 264 -~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~----------~~~tl~P~~~ 332 (424) ++.++.++....-+ .+++.++..+.+++..=++|+..+|....++.|+.........+ +...++-.++ T Consensus 293 ~~~~~~~q~~~~~~e-~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~ 371 (485) T protein:vir:24 293 DAEGKIQQFSAAELA-NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEAMR 371 (485) T ss_pred CCCceEEeecccchH-HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 46677776654333 37888888899999999999999986544323322222111111 1112222222 Q ss_pred HHHHHHHhhccChhh-h-cccchhhhhhhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCC--CCCeeee--- Q lcl|NC_019705. 333 RWENSIQRWLIPAKD-V-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP--GGDVAMR--- 403 (424) Q Consensus 333 ~ie~~l~~~l~~~~~-~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~--ggd~~~~--- 403 (424) .+.. +....+ . ....+++.+......+..+.++.+.+++++| +++..-+++.+|+.+-+ ......- T Consensus 372 l~~~-----~~~~~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~ 446 (485) T protein:vir:24 372 LAYR-----LMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEA 446 (485) T ss_pred HHHH-----HhcCCCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHh Confidence 1111 111111 0 1123333344445577788888888888765 78877778888885432 1100000 Q ss_pred cccccchhhcc---------------ccCCCcccCC Q lcl|NC_019705. 404 QSQYVPITDLG---------------TNKEPRNNGA 424 (424) Q Consensus 404 ~~n~~~~~~~~---------------~~~~~~~~ga 424 (424) ......++... ..+++..+|+ T Consensus 447 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~ 482 (485) T protein:vir:24 447 AMGLGLLGTMVDADPTVPGSPNPTPAPKPQPAIEGG 482 (485) T ss_pred hhhhhHHHhhcccCCCCCCCCCCCCCCCCccCCCCC Confidence 00000000000 0111112222 No 156 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=98.54 E-value=3.2e-07 Score=56.17 Aligned_cols=397 Identities=10% Similarity=0.075 Sum_probs=166.9 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccC--cccccH---HHHhccHHHHHHHHHHHHhhc Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLG--DSSIND---ERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~vs~---~~~~~~~~v~~~i~~ia~~ia 75 (424) |.-|-..-+=-+.-=+..++...+....... .....-+.+...+. +..+.. .....+.+..-+|+..+..+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl---~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~ 77 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDL---GDNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQE 77 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHH---HHHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHHHHHhhhc Confidence 2222111111110011222222221110000 00000011110000 001111 111223445557777776554 Q ss_pred cCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCcee-------EEEEe Q lcl|NC_019705. 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVI-------SLLPL 148 (424) Q Consensus 76 ~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~-------~l~~l 148 (424) -.+|.+ ..+. .....+.+++. + |. .......+..+++.+|.||+.+.++.+|.+. .+.++ T Consensus 78 ~~g~~~---~~~~-----~~~~~l~~i~~-~-N~---~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~ 144 (484) T protein:vir:77 78 LEGFRL---GGAD-----KADEQLWDWWQ-A-ND---LDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVE 144 (484) T ss_pred cCceec---CCcc-----hhHHHHHHHHH-h-cC---HhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEe Confidence 444442 1111 11223444443 2 22 2456788899999999999999888877542 47778 Q ss_pred cCceeEEeecCc--eE----EEEEEeCCc-e---EEecHhH-------------------------EEEeecCC-CCCcc Q lcl|NC_019705. 149 QSANMDVKLVGK--KV----VYRYQRDSE-Y---AEFSQKE-------------------------IFHLKGFG-FTGLV 192 (424) Q Consensus 149 ~~~~v~~~~~~~--~~----~~~~~~~~~-~---~~~~~~e-------------------------iih~r~~~-~~~~~ 192 (424) +|..+.+..|+. .. .+.+...+. . ..|.+++ |++|.+.. ..+.. T Consensus 145 ~p~~~~~~~D~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~ 224 (484) T protein:vir:77 145 PPTNLYAQIDPRTRQVMRAIRAIEDEEGNEVIGATLYLPNNTVIWNREDGQWVQVANVAHNLEMVPVIPIPNRTRLSDLY 224 (484) T ss_pred ccceeEEEecCCCCceEEEEEEEEeecCCcEEEEEEEecCeEEEEEecCCceEeeccccCCCCCcceEEeccccccCccC Confidence 888887666532 10 000000000 0 0111111 35555432 34457 Q ss_pred cCchHH----HHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHH--HHHHHHHHHhCCcccCcceecC-CC Q lcl|NC_019705. 193 GLSPIA----FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRS--QVEENFKEIAGGPVKKRLWILE-AG 265 (424) Q Consensus 193 G~s~i~----~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~--~~~~~~~~~~~~~~ag~~~~l~-~g 265 (424) |.|-+. .+.+.+.....-......++ +.|..++.-- . .++...+ .-...++.. .++++.++ ++ T Consensus 225 G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~---a~p~~~i~G~-~-~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~ 294 (484) T protein:vir:77 225 GTTEITPELRSVTDAAARTLMLMQATAELM---GVPQRLLFGV-K-GEELGVDPETGQTLFDAY-----LARILAFEDHE 294 (484) T ss_pred CcccchHHHHHHHHHHHHHHHHHHHHHHhh---hhhHHHHhCC-C-cchhcccccccchhhhhh-----hhhhcccCCCC Confidence 777554 33334333333333333333 3344343211 0 1111000 011112111 23456665 45 Q ss_pred ceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHH---HHHHHHHHHHHHhh- Q lcl|NC_019705. 266 FSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTL---QPYISRWENSIQRW- 341 (424) Q Consensus 266 ~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl---~P~~~~ie~~l~~~- 341 (424) .++.++....-+ -|++..+..+.+|+.+-++|+..+|....+..++.........+...+- .-+-..+++.+..- T Consensus 295 ~~~~q~~~~~~e-~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~ 373 (484) T protein:vir:77 295 SKAQQFSAAELR-NFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAY 373 (484) T ss_pred ceeEeecCCChH-HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 777777654433 3788899999999999999999998654332332222222211111110 01111122221111 Q ss_pred -ccChhhh--cccchhhhhhhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCC--CCCeee------eccccc Q lcl|NC_019705. 342 -LIPAKDV--GRIHAEHNLDGLLRGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP--GGDVAM------RQSQYV 408 (424) Q Consensus 342 -l~~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~--ggd~~~------~~~n~~ 408 (424) +....+. ....+++.+......+....++.+.+++++| +++..-+++++|+-+-+ ...... ....+. T Consensus 374 ~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~ 453 (484) T protein:vir:77 374 KVMNGGDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLMG 453 (484) T ss_pred HHhCCCCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHh Confidence 1111110 1123444455556677888999999998876 88888888888885432 110000 000000 Q ss_pred chhhc----------cccCCCcccCC Q lcl|NC_019705. 409 PITDL----------GTNKEPRNNGA 424 (424) Q Consensus 409 ~~~~~----------~~~~~~~~~ga 424 (424) ++... .+..++..+.+ T Consensus 454 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (484) T protein:vir:77 454 TMFGTDPSGGGNPDNPETPEPQPNPA 479 (484) T ss_pred hhccccccCCCCCCCCCcccccCCCc Confidence 11000 00011111111 No 157 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.52 E-value=2.7e-07 Score=56.58 Aligned_cols=388 Identities=13% Similarity=0.028 Sum_probs=171.7 Q ss_pred CCCchHHHHHhhccCcccCC--------cc---ccchhhcc-ccccccCcc--------------cccHHHHhccHHHHH Q lcl|NC_019705. 12 TNNGWWARLQSWFVGGRLVT--------PN---QGSQTGPV-SAHGHLGDS--------------SINDERILQISTVWR 65 (424) Q Consensus 12 ~~~G~~~~~~~~~~~~~~~~--------~~---~~~~~~~~-~~~~~~~~~--------------~vs~~~~~~~~~v~~ 65 (424) ==.+++.++++|+++-.... +. .......+ .+...+.|. ... ...+....... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~-~~~~~~n~~k~ 79 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVN-RRQLSMNLPKV 79 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccc-cceeecchHHH Confidence 00133333333333211100 00 00000000 000001110 000 11122345566 Q ss_pred HHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEE Q lcl|NC_019705. 66 CVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISL 145 (424) Q Consensus 66 ~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l 145 (424) +++..|+-+.+-|..+--.++ .....+.+++. -| ....-...++.+.+.+|.+|+.+..|.+|.+ .+ T Consensus 80 i~~~~a~~l~~~p~~i~~~d~-------~~~e~l~~~~~--~n---~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~-~i 146 (496) T protein:vir:38 80 TAKYMSKLLFNEKVKINIDDK-------AAEEFVLNVLK--TN---GFTKNMERYIEYGEAMGGFVIKVYHDGNKNV-KV 146 (496) T ss_pred HHHHHhhhhhCCcceEeeCCh-------HHHHHHHHHHh--cc---CHHHHHHHHHHHHhhhCcEEEEEEEcCCCcE-EE Confidence 777888877776665422110 01112333333 11 2345567788899999999999988887765 35 Q ss_pred EEecCceeEEeecC-ceE--------------EEE------------------EEeCC---ceEEecHh----------- Q lcl|NC_019705. 146 LPLQSANMDVKLVG-KKV--------------VYR------------------YQRDS---EYAEFSQK----------- 178 (424) Q Consensus 146 ~~l~~~~v~~~~~~-~~~--------------~~~------------------~~~~~---~~~~~~~~----------- 178 (424) -.++|..+-+...+ +.. .|. |.... .+..++-. T Consensus 147 ~~v~~~~~~P~~~~~~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~ 226 (496) T protein:vir:38 147 SFATADCMYPLSNDSENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVP 226 (496) T ss_pred EEEcccceEEEEecCCcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCcccccccccccccccee Confidence 55666665543221 110 000 00000 00011100 Q ss_pred ----H---EEEeecCC-----CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCC------CHHHH Q lcl|NC_019705. 179 ----E---IFHLKGFG-----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL------TEQQR 240 (424) Q Consensus 179 ----e---iih~r~~~-----~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~------~~~~~ 240 (424) + +.+++.+- .+...|+|.+..+...++....+.....+-|.. +.+..++ +.... +.+.. T Consensus 227 ~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~-~~~~i~v--~~~~l~~~~~~~g~~~ 303 (496) T protein:vir:38 227 LPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLV--PSSFVKTAVNLDGSTT 303 (496) T ss_pred ecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhh-cccceec--chHHhhccCCCCCccc Confidence 1 22333321 123579999998888887665555555555555 3344333 11100 00000 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHH- Q lcl|NC_019705. 241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN- 319 (424) Q Consensus 241 ~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~- 319 (424) .......+.+. .....-.+++..++.+......-++.+..+....+|+...|+||..+|....+..+...+.... T Consensus 304 ~~~~~~~~~~~----~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~ 379 (496) T protein:vir:38 304 QYFDSTDEAFF----LYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKS 379 (496) T ss_pred cCCCCccceEE----EeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHH Confidence 00000000000 0001112233346666666566678888999999999999999999987654433222221111 Q ss_pred ---------HHHHHHHHHHHHHHHHHHHHhhcc-ChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHH Q lcl|NC_019705. 320 ---------LGFLQYTLQPYISRWENSIQRWLI-PAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRT 389 (424) Q Consensus 320 ---------~~~~~~tl~P~~~~ie~~l~~~l~-~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~ 389 (424) ...++.+|..++..+.+..+.... .........+.+.++.-+..|.++.++.+.+++.+|+++.-.++.. T Consensus 380 ~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~ 459 (496) T protein:vir:38 380 ETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQR 459 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHh Confidence 122233344444433322221111 1111112234455556667888899999999999999998888764 Q ss_pred h-CCCCCCCCCeeeecc-----cccchhhccccCCCcc Q lcl|NC_019705. 390 D-NLPPLPGGDVAMRQS-----QYVPITDLGTNKEPRN 421 (424) Q Consensus 390 ~-g~~p~~ggd~~~~~~-----n~~~~~~~~~~~~~~~ 421 (424) + |.+. +..++.+... ...|-++.+...+..+ T Consensus 460 ~~~~~d-~ea~~el~ri~~E~~~~~~~~d~~~~~~~~e 496 (496) T protein:vir:38 460 AWNITE-AEADEWAEMLAKEKQAEMPNNDMNGIFGEEE 496 (496) T ss_pred cCCCCh-HHHHHHHHHHHHhhhccCccccccCCCCCCC Confidence 3 4422 1221111100 0011111111111111 No 158 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=98.51 E-value=3.1e-07 Score=56.27 Aligned_cols=361 Identities=10% Similarity=0.056 Sum_probs=165.2 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcc--cccHH--HHhc--cHHHHHHHHHHHHhhccCceEE Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDS--SINDE--RILQ--ISTVWRCVSLISTLTACLPLDV 81 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~vs~~--~~~~--~~~v~~~i~~ia~~ia~~~~~v 81 (424) |+.++-..+++.+.....+- ......+.+....... .+.++ ...+ ..+..-+|+.+++.+.=-.|. T Consensus 1 m~~~~i~~L~~~~~~~~~r~-------~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~Gf~- 72 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTGV-------DKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRIIFREFT- 72 (422) T ss_pred CChHHHHHHHHHHHHHHHHH-------HHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhccccceee- Confidence 66665555554443322110 0001111111111111 11111 1111 133445566555532222222 Q ss_pred EEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC-CCceeEEEEecCceeEEeecCc Q lcl|NC_019705. 82 FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS-AGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 82 ~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~-~G~~~~l~~l~~~~v~~~~~~~ 160 (424) . .+ ..+.+.+. + |.. ......+..+.+.+|.||+.+.++. +|.| .+.+++|.++....|.. T Consensus 73 --~-~d---------~~l~~~w~-~-N~l---d~~~~~~~~~al~~G~sf~~v~~~~~~~~p-~i~~~sp~~~~~i~D~~ 134 (422) T protein:vir:97 73 --N-DD---------FNAWEIFK-A-NNP---DIFFDTAIQSALIASCCFVYIMPGAEDGLP-KMQVIEASKATGILDPT 134 (422) T ss_pred --C-Cc---------hhHHHHHH-h-cCh---HHHHHHHHHHHHHhcceeEEEeeCCCCCee-EEEEechhhEEEEEeCC Confidence 1 11 12444443 2 332 3445577889999999999998875 5665 57888999988777643 Q ss_pred eE----EEE-E--EeCCce--E-EecHhH---------------------EEEeecC-CCCCcccCch----HHHHHHHH Q lcl|NC_019705. 161 KV----VYR-Y--QRDSEY--A-EFSQKE---------------------IFHLKGF-GFTGLVGLSP----IAFACKSA 204 (424) Q Consensus 161 ~~----~~~-~--~~~~~~--~-~~~~~e---------------------iih~r~~-~~~~~~G~s~----i~~~~~~i 204 (424) .. .+. + ..++.. . .++... |++|.+. ....++|.|. +..+.+.+ T Consensus 135 ~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~ 214 (422) T protein:vir:97 135 TFLLTEGYAILESDSNGNPTLEAYFTDKDIWYYPKKGKPYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAA 214 (422) T ss_pred CCcceeeEEEEEecCCCcEEEEEEEcCceEEEEcCCCccccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHH Confidence 11 011 1 111110 0 011111 4455432 3455678774 44455555 Q ss_pred HHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC-----CceeeecccChhHHH Q lcl|NC_019705. 205 GVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-----GFSTSAIGVTPQDAE 279 (424) Q Consensus 205 ~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~-----g~~~~~l~~~~~d~~ 279 (424) .....-......|+.. |.-++.- -+ .+....+.++.. .++++.++. +.++.++..+.-+ . T Consensus 215 ~r~~~~~~~~~e~~a~---pqr~i~G-~d-~d~~~~~~~~~~---------~~~i~~~~~de~~~~~~v~q~~~~~l~-~ 279 (422) T protein:vir:97 215 KRTLERAEVTAEFYSF---PQKYVLG-MD-PDAKPMEKWRAT---------VSTLLEISKDEDGDKPTVGQFTTASMA-P 279 (422) T ss_pred HHHHHHHHHHHHHhcc---hhhhhcc-cC-cccccCchhhhh---------hhhhhccCCCCCCCcceeeecCCCChh-H Confidence 5554444555555544 3333321 11 011111112221 224555542 2456555543322 4 Q ss_pred HHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHH---HHHHHHHHHHHHhh--ccChhh--hc-cc Q lcl|NC_019705. 280 MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTL---QPYISRWENSIQRW--LIPAKD--VG-RI 351 (424) Q Consensus 280 ~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl---~P~~~~ie~~l~~~--l~~~~~--~~-~~ 351 (424) |++.++..+.+++++=++|++.+|....+.+|...++.+...+...+- .-+-..+++.+-.- +..... .. .. T Consensus 280 ~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~ 359 (422) T protein:vir:97 280 FMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFM 359 (422) T ss_pred HHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhhc Confidence 899999999999999999999999866533333333333322222211 11222222222111 111111 00 11 Q ss_pred chhhhhhhhhccC---HHHHHHHHHHHHhC--CCCCHHHHHHHhCCCCCCCCCeeeecccccchhhccccCCCcccC Q lcl|NC_019705. 352 HAEHNLDGLLRGD---SASRAAFMKAMGEA--GLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 352 ~~~fd~~~l~~~d---~~~~~~~~~~~~~~--g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) .+.+.+......+ ..+.++++.+++++ |++..+-+++++|+... |.-.... ++.+-+| T Consensus 360 ~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~---~~~~~~~-----------~~~~~d~ 422 (422) T protein:vir:97 360 DTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGVKGA---DKPIPAI-----------TEVTTDG 422 (422) T ss_pred cceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCCCch---hHHHHHH-----------HhhhccC Confidence 1223223333344 44566777788887 78889999999999542 1111000 0111111 No 159 >protein:vir:4073 Length: 279 # NCBI annotation: minor structural protein # Family: family:all:11744 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043552;genbank:gi:9628686;genbank:GeneID:1261159 Probab=98.50 E-value=1.2e-09 Score=69.40 Aligned_cols=260 Identities=13% Similarity=0.120 Sum_probs=134.6 Q ss_pred hccHHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeC Q lcl|NC_019705. 58 LQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN 137 (424) Q Consensus 58 ~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~ 137 (424) +.. -.+...|.+++=-.|.|.....+ .. -..+.-|...--|...+-..-++.+.. T Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~~~~~~-----~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------- 55 (279) T protein:vir:40 1 MSL----FNLSRRAEDVSFSTFTVQDPTTD-----LL-LGKLLGLVSYFDNVDYSEASKLEDLFY--------------- 55 (279) T ss_pred Ccc----cccchhhcccceeeeeecCcchh-----HH-HHHHHHHHHHhhcccchhhhhhhhhhh--------------- Confidence 000 01122233333223332111000 00 001111112223333333332332222 Q ss_pred CCCceeEEEEecCceeEEeecCceEEE------------EEEeCCceEEecHhHEEEeecCCCCCcccCchHHHHHHHHH Q lcl|NC_019705. 138 SAGDVISLLPLQSANMDVKLVGKKVVY------------RYQRDSEYAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAG 205 (424) Q Consensus 138 ~~G~~~~l~~l~~~~v~~~~~~~~~~~------------~~~~~~~~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~ 205 (424) |.+....|-..+-++..+| ...++...+++|-.|+..|-+ +++|.-+-+.. .-++ T Consensus 56 --------~~~~~~~~~~~~~~~~~~~~~~~~~d~fn~~vr~~~~~~vtVP~~Dv~IieN----Plv~v~~ee~~-kM~~ 122 (279) T protein:vir:40 56 --------WALQGKEVYRVWYGGFKYYAQRVNADQFNIVVREPNRREVTIRTNDYEMLLN----PFYGANPQRFG-VMFG 122 (279) T ss_pred --------hhhccceeehhhhhhHHHHHhhcCcchhhhheecCCcceeEeecchhhhhhc----chheeccchhh-HHHH Confidence 2222222222222222221 112233345566666666643 34555543221 1122 Q ss_pred HHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcc-cCcceecCCCceeeecccChhHHHHHHHH Q lcl|NC_019705. 206 VAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPV-KKRLWILEAGFSTSAIGVTPQDAEMMASR 284 (424) Q Consensus 206 ~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~-ag~~~~l~~g~~~~~l~~~~~d~~~~e~~ 284 (424) +. .+....++ .+.+..+++++++.+...++..++.+..++++....+ -+++.+++.|.+++++..+..-. ..+-. T Consensus 123 la--~nai~~KL-D~~~qIk~fIKTd~d~glee~kekaR~rIk~mlalAk~~nGityid~~ddItQL~kDYSts-lk~di 198 (279) T protein:vir:40 123 MA--SNGIGRRL-DSQAQIKIYWKTKVSSGLKEVWDRIRERLTQQQQLAREFNGVSVIGSDDDIKQIQPDYSGS-LQNDA 198 (279) T ss_pred HH--Hhhhhhhh-cccceeeeEEecCcchhHHHHHHHHHHHHHHHHHHHHhcCCeeeecCCceeEeeccccccc-cHHHH Confidence 22 22223333 7888899999999887778888888888888776655 47899999999999999876655 36778 Q ss_pred HHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHH------hhccChhhhcccchhhhhh Q lcl|NC_019705. 285 KFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQ------RWLIPAKDVGRIHAEHNLD 358 (424) Q Consensus 285 ~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~------~~l~~~~~~~~~~~~fd~~ 358 (424) ++.+.+.+..+|||..+|-+. ..|++..+|+..+|.|++++++.+|. .+.++...+ T Consensus 199 e~lkS~l~Sq~GinekIL~Gs--------AtE~q~iAyy~rtVePILkQyek~liY~~E~fv~y~ttta~---------- 260 (279) T protein:vir:40 199 NLAIEIALSEYGMPRELLYGQ--------SNEVTIIAFAIQKVLPLLKQHDKNIIFNQENFVAYISTTAK---------- 260 (279) T ss_pred HHHHHHHHhhcCCchhhcccc--------CchhhhhhHHHhhHHHHHHHhcccccchhhhhhhhheeccc---------- Confidence 889999999999999998532 34899999999999999999776443 233322111 Q ss_pred hhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC Q lcl|NC_019705. 359 GLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD 399 (424) Q Consensus 359 ~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd 399 (424) +|.+ |-.-.+..-+|+ |.| T Consensus 261 -------------------gg~~--~s~~~~~~~~~~-~~~ 279 (279) T protein:vir:40 261 -------------------GGAI--ESKSSKRDSEPV-GND 279 (279) T ss_pred -------------------Cccc--ccccccccCCCC-CCC Confidence 1111 000111112222 111 No 160 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=98.48 E-value=2.2e-07 Score=57.10 Aligned_cols=395 Identities=11% Similarity=0.046 Sum_probs=172.8 Q ss_pred CchHHHHHhhccCcccC------C-----ccccch---hhcc-ccccccCccc--c-----c----HHHHhccHHHHHHH Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLV------T-----PNQGSQ---TGPV-SAHGHLGDSS--I-----N----DERILQISTVWRCV 67 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~------~-----~~~~~~---~~~~-~~~~~~~~~~--v-----s----~~~~~~~~~v~~~i 67 (424) +|+|.++++||++.-.. . +..... ...+ .+..++.|.. + . .+..........+. T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~ 80 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSA 80 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHH Confidence 88888888888542110 0 001000 0000 0000111110 0 0 00111222333444 Q ss_pred HHHHHhhccCceEEEEeccCCccce----eccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCcee Q lcl|NC_019705. 68 SLISTLTACLPLDVFETDQNDNRKK----VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVI 143 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~----~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~ 143 (424) +.+|+-+..=+..+.-.+.+..... ......+..++. -| ......+..+.+.+..|.+++-+..+. |.+ T Consensus 81 ~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~--~n---~f~~~~~~~~e~a~a~G~~a~k~~~d~-~~~- 153 (517) T protein:vir:98 81 DVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQ--HN---KFIKNLSDYLEPTFALGGLTVRPYVDN-GEI- 153 (517) T ss_pred HHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHH--hc---cHHHHHHHHHHHHhhhCCEEEEEEEeC-Cee- Confidence 4444444332322221221111111 111122333333 11 123444556666777788887766554 322 Q ss_pred EEEEecCceeEEee-c------------------CceEEEE----------------EEe-------CC---ceEEec-- Q lcl|NC_019705. 144 SLLPLQSANMDVKL-V------------------GKKVVYR----------------YQR-------DS---EYAEFS-- 176 (424) Q Consensus 144 ~l~~l~~~~v~~~~-~------------------~~~~~~~----------------~~~-------~~---~~~~~~-- 176 (424) .+..++++++-+.. + ++..+|. |++ +. -+.+++ T Consensus 154 ~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~ 233 (517) T protein:vir:98 154 EFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLE 233 (517) T ss_pred EEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCcccccccccc Confidence 24445555443211 1 1111110 000 00 011111 Q ss_pred ------HhH----------EEEeecCCC-----CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCC Q lcl|NC_019705. 177 ------QKE----------IFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL 235 (424) Q Consensus 177 ------~~e----------iih~r~~~~-----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~ 235 (424) +++ +.|++.+-. +...|+|....+...+......-.....-|..|.+ ..++ +.... T Consensus 234 ~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~-~i~v--p~~~l 310 (517) T protein:vir:98 234 ELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQR-TVFV--SDVML 310 (517) T ss_pred ccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCc-ceec--Chhhh Confidence 111 224544322 23579999998888887666655555566666444 3322 22211 Q ss_pred CH--HH-HHHHHHHHHHHhCCcccCcceec-CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccc Q lcl|NC_019705. 236 TE--QQ-RSQVEENFKEIAGGPVKKRLWIL-EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSW 311 (424) Q Consensus 236 ~~--~~-~~~~~~~~~~~~~~~~ag~~~~l-~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~ 311 (424) .. +. .......|+. .+.....+-. +++-.++.++....+-++.+..+...++|+...|+++..++...++..+ T Consensus 311 ~~~~~~~g~~~~~~~d~---~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kT 387 (517) T protein:vir:98 311 RTVPDESGMPPPQVFDP---DVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKT 387 (517) T ss_pred ccccCCCCcccCCCCCc---ccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCccccccccccccc Confidence 00 00 0000000000 0000000111 1223466666677778899999999999999999999999987654432 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHh------------hccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCC Q lcl|NC_019705. 312 GSGIEQQNLGFLQYTLQPYISRWENSIQR------------WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAG 379 (424) Q Consensus 312 ~~n~e~~~~~~~~~tl~P~~~~ie~~l~~------------~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g 379 (424) +.--.....-.-.|+.-+...++..|.. .++...-.....+.+++++-+..|.++..+...+++.+| T Consensus 388 -ATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG 466 (517) T protein:vir:98 388 -ATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFYGQAKTFG 466 (517) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHHHHHHhcC Confidence 1111111122223443344444333322 122222112344567777878899999999999999999 Q ss_pred CCCHHHHHHH-hCCCCCCCCCeeeec--ccccchhhccccCCCcccCC Q lcl|NC_019705. 380 LRTINEMRRT-DNLPPLPGGDVAMRQ--SQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 380 ~~t~NE~R~~-~g~~p~~ggd~~~~~--~n~~~~~~~~~~~~~~~~ga 424 (424) +|++-+++.+ .|+.. +..++.+.. ......+.... .++.+++. T Consensus 467 ~ms~~~~i~~~~g~~e-eeA~~e~~~i~~E~~~~~~~~~-~~~~~~~~ 512 (517) T protein:vir:98 467 FIPTVEAIQRIFKVPK-KTAEQWLEEIRKDQIELDPVTI-SQRAQKRM 512 (517) T ss_pred CCCHHHHHHHhCCCCh-HHHHHHHHHHHHhccccCCCCc-cccccCCC Confidence 9999998765 47743 222111110 00001111011 01111111 No 161 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=98.36 E-value=1.1e-06 Score=53.32 Aligned_cols=350 Identities=11% Similarity=0.054 Sum_probs=164.2 Q ss_pred ccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccc--cHHH---H-hccHHHHHHHHHHHHhhccCceEE Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSI--NDER---I-LQISTVWRCVSLISTLTACLPLDV 81 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v--s~~~---~-~~~~~v~~~i~~ia~~ia~~~~~v 81 (424) |.+- ..=..++..+..+.. .+....+ .++. . +-..+..-+|+.+++.+.=-.|. T Consensus 1 l~~~--~~r~~~~~~yY~g~~-----------------~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~- 60 (410) T protein:vir:95 1 MNLY--QSRVNLRYKHYAMQH-----------------YEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLIFRAFA- 60 (410) T ss_pred CCcc--hhhHHHHHHHhcCCC-----------------CccccchhccHHHHhHHHhhcchhHHHHHHhHhhhcccccc- Confidence 1111 111222222222211 1110000 0000 0 11234455666666543322232 Q ss_pred EEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCc- Q lcl|NC_019705. 82 FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK- 160 (424) Q Consensus 82 ~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~- 160 (424) . .+ ..+.+++. + |. .......+..+.+.+|.||+.+..+.+|.| .+.+++|.++....|.. T Consensus 61 --~-~d---------~~l~~i~~-~-N~---ld~~~~~~~~~al~~G~sf~~v~~~~d~~~-~i~~~sP~~~~~i~Dp~~ 122 (410) T protein:vir:95 61 --N-DD---------FNVTEIFD-R-NN---PDIFFDSAILSALIGSCSFVYISKGEDDEV-RLQVIESSNATGVIDPIT 122 (410) T ss_pred --C-CC---------chHHHHHh-h-cC---hHHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEEeCCC Confidence 1 11 12444443 2 32 234556778899999999999998888876 57889999888776642 Q ss_pred -eEE--EEEEe--CC-c---eEEecHhH---------------------EEEeecC-CCCCcccCc----hHHHHHHHHH Q lcl|NC_019705. 161 -KVV--YRYQR--DS-E---YAEFSQKE---------------------IFHLKGF-GFTGLVGLS----PIAFACKSAG 205 (424) Q Consensus 161 -~~~--~~~~~--~~-~---~~~~~~~e---------------------iih~r~~-~~~~~~G~s----~i~~~~~~i~ 205 (424) ... +.+.. .+ . ...+.++. |++|.+. ..+..+|.| ++..+.+.+. T Consensus 123 ~~~~~al~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~ 202 (410) T protein:vir:95 123 GLLVEGYAVLARDDYNRPTLEAYFEPNATHFIPKDGEPYSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAK 202 (410) T ss_pred CceEEEEEEEEecCCCeEEEEEEEeCCcEEEEeeCCccccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHH Confidence 111 11111 10 0 01122222 2344332 234456766 4556666666 Q ss_pred HHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC-----CceeeecccChhHHHH Q lcl|NC_019705. 206 VAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-----GFSTSAIGVTPQDAEM 280 (424) Q Consensus 206 ~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~-----g~~~~~l~~~~~d~~~ 280 (424) ....-......||.+ |.-++.-- + .+....+ .|+.. .++++.++. +.++.++....-+ .| T Consensus 203 r~~~~~~~~~e~~a~---pqr~i~G~-d-~d~~~~~----~~~~~-----~~~i~~~~~~~~~~~~~v~q~~~~~l~-~~ 267 (410) T protein:vir:95 203 RTLERADITAEFYSW---PQKYILGL-D-PDAEPME----KWKAT-----VSSLLTISSSDKGVKPSVGQFTTASMS-PF 267 (410) T ss_pred HHHHHHHHHHHHhcc---hhheeecc-C-CCCCcCc----hhhhh-----hhhheeccCCCCCCcceEEecCCCChH-HH Confidence 666666666777655 44343211 1 0111111 12221 234566643 2456556543222 48 Q ss_pred HHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHH---HHHHHHHHHHHHHHhh--ccChhh--h-cccc Q lcl|NC_019705. 281 MASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY---TLQPYISRWENSIQRW--LIPAKD--V-GRIH 352 (424) Q Consensus 281 ~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~---tl~P~~~~ie~~l~~~--l~~~~~--~-~~~~ 352 (424) ++.++....+|+.+=++|+..+|....+.+|...++.+...+... .-.-+-..+++.+-.- +..... . .... T Consensus 268 ~~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~ 347 (410) T protein:vir:95 268 TEQLRTAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVR 347 (410) T ss_pred HHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccce Confidence 999999999999999999999997654323322222222222221 1112222333322111 111111 0 1111 Q ss_pred hhhhhh---hhhccCHHHHHHHHHHHHhC--CCCCHHHHHHHhCCCCCCCCCeeeecccccchhhccccCC Q lcl|NC_019705. 353 AEHNLD---GLLRGDSASRAAFMKAMGEA--GLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKE 418 (424) Q Consensus 353 ~~fd~~---~l~~~d~~~~~~~~~~~~~~--g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~ 418 (424) +.+.+. +....+..+.++++.+++++ |+.+..-+++.+|+.+-+ +.. .-++....+.+ T Consensus 348 ~~v~W~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~~---~~~-----~~~~e~~~~g~ 410 (410) T protein:vir:95 348 TAVKWEPLFEADANTMTMIGDGVVKLNQALPGYINAETIRDLTGIAGDM---SAK-----PVVSEGGSNGE 410 (410) T ss_pred eeEEeeecCCcchhhHHHHHHHHHHHHHhccCCccHHHHHHhcCCChHH---HHH-----HHHHHHHhCCC Confidence 222222 22223457788888889887 788888899999997531 110 00000001111 No 162 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.30 E-value=1.5e-06 Score=52.49 Aligned_cols=387 Identities=10% Similarity=0.025 Sum_probs=178.7 Q ss_pred CchHHHHHhhccCcccC----C--------ccccch----------hhcccccc-ccCcccc----cHHHHhccHHHHHH Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLV----T--------PNQGSQ----------TGPVSAHG-HLGDSSI----NDERILQISTVWRC 66 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~----~--------~~~~~~----------~~~~~~~~-~~~~~~v----s~~~~~~~~~v~~~ 66 (424) +|||.+++++|++.... . +..... ...+.+.. +...... ..+..........+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i 80 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTA 80 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHHH Confidence 99999999998652110 0 000000 00111100 0111000 01111222344556 Q ss_pred HHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEE Q lcl|NC_019705. 67 VSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLL 146 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~ 146 (424) ++..|+-+..=|..+.-.+. . ..+..+..+|. -|. ...-.+..+.+.+..|.+++.+..+.++ ..+. T Consensus 81 ~~~~A~lv~~e~~~i~v~~~-~-----~~~e~l~~il~--~n~---f~~~~~~~~e~a~a~G~~~~k~~~d~~~--~~i~ 147 (508) T protein:vir:15 81 ARRIASVVFNEKAEIHVKDN-N-----EADKFLNDVLE--DND---FKNKFEEALEKGVALGGFAMRPYIDGNH--IKIA 147 (508) T ss_pred HHHHHhhhhCCCceEEeCCc-h-----HHHHHHHHHHH--hcc---HHHHHHHHHHHHhhcCceEEEEEEeCCe--eEEE Confidence 66666666554444321111 0 01112344443 122 2344456677788889888877665432 3455 Q ss_pred EecCceeEEe-ecCc------------------eEEEE-------------------EEeCC---ceEEecHh------- Q lcl|NC_019705. 147 PLQSANMDVK-LVGK------------------KVVYR-------------------YQRDS---EYAEFSQK------- 178 (424) Q Consensus 147 ~l~~~~v~~~-~~~~------------------~~~~~-------------------~~~~~---~~~~~~~~------- 178 (424) .++|..+-+. .+.+ ..+|. |.... -+.+++-. T Consensus 148 ~v~ad~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~ 227 (508) T protein:vir:15 148 WVRADQFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKE 227 (508) T ss_pred EEcCCeeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccC Confidence 5556554432 1111 11110 00000 00111111 Q ss_pred ---H----------EEEeecCCC-----CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCC--CHH Q lcl|NC_019705. 179 ---E----------IFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL--TEQ 238 (424) Q Consensus 179 ---e----------iih~r~~~~-----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~--~~~ 238 (424) + ..||+.+-. +...|+|.+..+...+......-....+-|+ .+.+..++. .... +++ T Consensus 228 l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~-~~~~~i~v~--~~~l~~d~~ 304 (508) T protein:vir:15 228 LAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIR-LGQKHIAVQ--PGMLRFDDE 304 (508) T ss_pred CCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHH-hcccceeec--hHHhcCCCC Confidence 1 124443222 2357999999999888877666666666665 444554442 1111 111 Q ss_pred HHHHHHHHHHHHhCCcccCcceec--CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHH Q lcl|NC_019705. 239 QRSQVEENFKEIAGGPVKKRLWIL--EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~l--~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e 316 (424) ....+. ...+....+-. ++|..++.++....+-++.+..+...+.|....|++|..++...++..+..-+. T Consensus 305 ~~~~~~-------~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~ 377 (508) T protein:vir:15 305 HKPTFD-------TEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVV 377 (508) T ss_pred CccccC-------CCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHH Confidence 000000 11111111111 234457777777677788999999999999999999999987655433211111 Q ss_pred ----------HHHHHHHHHHHHHHHHHHHHHHHhhccChhh---------hcccchhhhhhhhhccCHHHHHHHHHHHHh Q lcl|NC_019705. 317 ----------QQNLGFLQYTLQPYISRWENSIQRWLIPAKD---------VGRIHAEHNLDGLLRGDSASRAAFMKAMGE 377 (424) Q Consensus 317 ----------~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~---------~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~ 377 (424) ......++.+|..++..|..-....-+...+ .....+.+++++-+..|.++..+...+++. T Consensus 378 s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~ 457 (508) T protein:vir:15 378 SNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLA 457 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHh Confidence 0112223344444444433322211111111 112334566677778999999999999999 Q ss_pred CCCCCHHHHHHHh-CCCCCCCCCeeeec--ccccchhhccccCCCcccCC Q lcl|NC_019705. 378 AGLRTINEMRRTD-NLPPLPGGDVAMRQ--SQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 378 ~g~~t~NE~R~~~-g~~p~~ggd~~~~~--~n~~~~~~~~~~~~~~~~ga 424 (424) +|+|+.-+++... |+.. +..++.+.. .........+...++.+++- T Consensus 458 aGi~s~e~~i~~~~g~~d-eea~~el~ri~~E~~~~~~~~~~~~~~~g~~ 506 (508) T protein:vir:15 458 IGALSKQTFLQRNYGMTD-EQAAEELAKIQSEAPTDTFEGGRSAILNGGD 506 (508) T ss_pred cCCCCHHHHHHhcCCCCh-HHHHHHHHHHHHhccccCccccccccCCCCC Confidence 9999999987653 5533 112111110 00000000111111111111 No 163 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=98.29 E-value=1.7e-06 Score=52.28 Aligned_cols=384 Identities=10% Similarity=0.043 Sum_probs=158.4 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccC--cccccH--HH---HhccHHHHHHHHHHHHh Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLG--DSSIND--ER---ILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~vs~--~~---~~~~~~v~~~i~~ia~~ 73 (424) ++.|.-+|+-.-..=+.+.+...+.... +.-......+.+..... +..... +. -..+.+..-+|+..++. T Consensus 16 ~~~p~~~~~~~~~~~l~~~l~~~~~~~~---~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~ 92 (501) T protein:vir:25 16 VEFPEDSMSREQLGALVADMWRLHISER---QWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQN 92 (501) T ss_pred ccCCcccCChHHHHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhh Confidence 5566555543322222222222221110 10001011111111000 000011 00 01123455567766664 Q ss_pred hccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCcee Q lcl|NC_019705. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) +--..|. .. ++.. ...+..++. + |.. ...-..+..+++.+|.||+.+.++..|. .+..++|..+ T Consensus 93 l~~~gf~---~~-d~~~-----~~~l~~i~~-~-N~~---d~~~~~~~~~a~i~G~ay~~v~~de~~~--~i~~~sp~~~ 156 (501) T protein:vir:25 93 LSVVGYR---NA-LAKE-----NDPAWEMWQ-R-NRM---DARQAEVHRPALTYGASYVTVTPTDEGP--VFRTRSPRQI 156 (501) T ss_pred hccccee---cC-Cccc-----hHHHHHHHH-h-cCh---hHHHHHHHHHHhhcCceEEEEecCCCCC--eEEEeccccE Confidence 4323332 22 1111 123444332 2 322 3455678889999999999998888874 3556788888 Q ss_pred EEeecC-c---eEE--EEEE---eC-Cc---eEEecHhH----------------------------------------- Q lcl|NC_019705. 154 DVKLVG-K---KVV--YRYQ---RD-SE---YAEFSQKE----------------------------------------- 179 (424) Q Consensus 154 ~~~~~~-~---~~~--~~~~---~~-~~---~~~~~~~e----------------------------------------- 179 (424) .+..++ . ... ..|. .. +. ...+.+.. T Consensus 157 ~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (501) T protein:vir:25 157 LAVYADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEG 236 (501) T ss_pred EEEEecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCC Confidence 755422 1 111 1110 00 00 00011111 Q ss_pred -----EEEeecCCCCCcccCchHHHHH---HHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_019705. 180 -----IFHLKGFGFTGLVGLSPIAFAC---KSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIA 251 (424) Q Consensus 180 -----iih~r~~~~~~~~G~s~i~~~~---~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~ 251 (424) |+||.+...-...|.|-++... ..+.....-......++. .|..++. +.. .++ .+. |+. T Consensus 237 ~~~vPiv~f~N~~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a---~p~~~i~-G~~-~~~--~~~----~~~-- 303 (501) T protein:vir:25 237 KPVCPVVRFVNGRDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGA---NPQRVIS-GWT-GSK--AEV----LKA-- 303 (501) T ss_pred ccceeeEeccCccccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhc---cHHHHHh-CCC-CCc--cch----hhh-- Confidence 2233221111224677554444 333333333333333333 3433332 111 111 111 111 Q ss_pred CCcccCcceecC-CCceeeecccChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHH Q lcl|NC_019705. 252 GGPVKKRLWILE-AGFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 252 ~~~~ag~~~~l~-~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P 329 (424) ..++++.++ ++.++.++.. .+++ |++.++..+.+|+..-++|+..++....+ .|...+......+...+ .- T Consensus 304 ---~~~~i~~~~~~~~~~~q~~~--~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N-~Sg~Al~~~~~~l~~ka-~~ 376 (501) T protein:vir:25 304 ---SALRVWTFEDPEVKAQAFPP--ASVEPYNLILEEMLQHVAMVAQISPAQVTGKMIN-VSAEALAAAEANQQRKL-AA 376 (501) T ss_pred ---cccceeccCCCCceEEEecc--cChHHHHHHHHHHHHHHHhhcCCChhhhccccCC-hHHHHHHHHHHHHHHHH-HH Confidence 124567765 3566666543 2333 88999999999999999999999865432 23222222222222111 11 Q ss_pred HHHHHHHHHHh------hccChhhh-cccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHH-HhCCCCCC----- Q lcl|NC_019705. 330 YISRWENSIQR------WLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRR-TDNLPPLP----- 396 (424) Q Consensus 330 ~~~~ie~~l~~------~l~~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~-~~g~~p~~----- 396 (424) ....+...|.+ .+....+. ....+.+.+......+..+.++++.++++.|+ +.-.+.. +.|+.+-+ T Consensus 377 k~~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gi-s~et~~~~~~g~~~~~ie~~~ 455 (501) T protein:vir:25 377 KRESFGESWEQLLRLAAEMDDDPDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGI-PIEHLLSMVPGMTQQTIQAIK 455 (501) T ss_pred HHHHHHHHHHHHHHHHHHHhCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCC-CHHHHHHHcCCCCHHHHHHHH Confidence 22222222221 11111111 11334455556667888999999999998875 3333333 34665411 Q ss_pred ------CCCeee---ecccccchhhccc-------cCCC--cccCC Q lcl|NC_019705. 397 ------GGDVAM---RQSQYVPITDLGT-------NKEP--RNNGA 424 (424) Q Consensus 397 ------ggd~~~---~~~n~~~~~~~~~-------~~~~--~~~ga 424 (424) ..+..+ .+....+...... +++. .++|| T Consensus 456 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 456 DSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGGVNGNGGA 501 (501) T ss_pred HHHHHHhHHHHHHHhhccCcCCCCCCCCCCCccccccccCCCCCCC Confidence 011000 0111111111110 1111 11222 No 164 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=98.28 E-value=1.7e-06 Score=52.24 Aligned_cols=402 Identities=11% Similarity=-0.012 Sum_probs=175.1 Q ss_pred CCCCccccc-C-CCCCchHHHHHhhccCcccCCccccchhhccccccccCcccc--cH--HH-HhccHHHHHHHHHHHHh Q lcl|NC_019705. 1 MEEPKYTID-L-RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSI--ND--ER-ILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~-~-~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v--s~--~~-~~~~~~v~~~i~~ia~~ 73 (424) ..--.+++. + -+.+-++.+|...+........ .....+.+-..+....+ .+ +. .....+..-||+.+++. T Consensus 2 ~~~~~~~~~gl~~~~~~~~~~L~~~~~~~~~~~~---~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~r 78 (474) T protein:vir:81 2 IQQQTVRIPSLSNDENALINGLLAQIENLRWKNL---LRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALARR 78 (474) T ss_pred cCCCcCcCCCCChhHHHHHHHHHHHHHHHhhHHH---HHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHhh Confidence 000000000 0 0112344444444433221110 11111111111111111 11 11 11234556677777775 Q ss_pred hccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce-eEEEEecCce Q lcl|NC_019705. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV-ISLLPLQSAN 152 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~-~~l~~l~~~~ 152 (424) +.--.|.+ . ++.. .+..+..++. +.+- ......+..+.+.+|.||+.+..+.+|.+ ..+.+++|.+ T Consensus 79 l~~~Gf~~---~-d~~~----~~~~l~~iw~-~N~l----d~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~ 145 (474) T protein:vir:81 79 CNLEGFVW---P-DGDL----DSLGGTEVVD-DNHL----LSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASE 145 (474) T ss_pred hcccceEC---C-CCCc----cchHHHHHHH-hcCh----hHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccce Confidence 55444432 1 1111 1123444442 2222 24556778899999999999988777764 4577889998 Q ss_pred eEEeecCceE-------EEEEEeCCce---EEecHhH-------------------------EEEeecC-CCCCcccCch Q lcl|NC_019705. 153 MDVKLVGKKV-------VYRYQRDSEY---AEFSQKE-------------------------IFHLKGF-GFTGLVGLSP 196 (424) Q Consensus 153 v~~~~~~~~~-------~~~~~~~~~~---~~~~~~e-------------------------iih~r~~-~~~~~~G~s~ 196 (424) +....|.... .+....++.. ..|.+++ |++|.+. ..++.+|.|. T Consensus 146 ~~~~~D~~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvPvV~~~n~~~~~~~~G~s~ 225 (474) T protein:vir:81 146 ATGEWNRRRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVPAQVLPYKPAPKRPFGQSR 225 (474) T ss_pred EEEEEeCCCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcceEEecccccccCcCCccc Confidence 8876653211 0001111110 0111122 3444332 2445567764 Q ss_pred ----HHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEE-cCCCCC---CHHHHHHHHHHHHHHhCCc-ccCcceecCCCce Q lcl|NC_019705. 197 ----IAFACKSAGVAVAMEDQQRDFFANGAKSPQILS-TGEKVL---TEQQRSQVEENFKEIAGGP-VKKRLWILEAGFS 267 (424) Q Consensus 197 ----i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~-~~~~~~---~~~~~~~~~~~~~~~~~~~-~ag~~~~l~~g~~ 267 (424) +..+.+.+.....-......|+.. |.-++. ...... +......++....+...-. +..+-.....+.+ T Consensus 226 i~e~v~~l~da~~r~~~~~~~~~e~~a~---pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~ 302 (474) T protein:vir:81 226 ITKPMMGLQDAGVRELARREGHMDVFSY---PEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARAD 302 (474) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHhcc---hhheeecCChhhcccccccccchhhhhHHHHhcCCCccccccccccccc Confidence 445555555555555555666654 333332 111111 1111223333333322111 1111111223456 Q ss_pred eeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCc-ccchhHHHHHHHHHHH---HHHHHHHHHHHHHHhhcc Q lcl|NC_019705. 268 TSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST-SWGSGIEQQNLGFLQY---TLQPYISRWENSIQRWLI 343 (424) Q Consensus 268 ~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~-~~~~n~e~~~~~~~~~---tl~P~~~~ie~~l~~~l~ 343 (424) +-++.... -..|++.++..+..+|..-++|+..||.....| .|...+..+...+... .-.-+=..+++.+-.-+. T Consensus 303 ~~q~~~a~-l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~ 381 (474) T protein:vir:81 303 VKQFPAAS-PDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALA 381 (474) T ss_pred ccccCCCC-hhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66665432 223899999999999999999999998654222 2222222222222221 111222223322221111 Q ss_pred Chhh-------hcccchhhhhhhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCC---CCCeeeecccccchh Q lcl|NC_019705. 344 PAKD-------VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP---GGDVAMRQSQYVPIT 411 (424) Q Consensus 344 ~~~~-------~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~---ggd~~~~~~n~~~~~ 411 (424) -..+ .....+++.+.+....+..+.++++.+++++| +.+..=+++++|+.+-+ .-++........+++ T Consensus 382 i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~~ 461 (474) T protein:vir:81 382 MKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTLQ 461 (474) T ss_pred HhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHHH Confidence 1110 01123333444556677889999999999987 34345568889998642 111111111122333 Q ss_pred hcc--ccCCCccc Q lcl|NC_019705. 412 DLG--TNKEPRNN 422 (424) Q Consensus 412 ~~~--~~~~~~~~ 422 (424) ... .++.+... T Consensus 462 ~l~~~~~~~~~aq 474 (474) T protein:vir:81 462 ALIDRSNNGATAQ 474 (474) T ss_pred HHHhcCCCCCCCC Confidence 221 11111111 No 165 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.27 E-value=1.8e-06 Score=52.10 Aligned_cols=403 Identities=11% Similarity=0.016 Sum_probs=168.2 Q ss_pred CC-----CCcccccCCC----CCchHHHHHhhccCcccCCccccchhhccccccccCccc--ccH---HHHhccHHHHHH Q lcl|NC_019705. 1 ME-----EPKYTIDLRT----NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSS--IND---ERILQISTVWRC 66 (424) Q Consensus 1 ~~-----~~~~~~~~~~----~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--vs~---~~~~~~~~v~~~ 66 (424) |. --+||-.+.. .+-++++|...+.... +........+.+-....... +.. .......+..-| T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~~~~---~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~i 77 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQLVDRT---PRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKA 77 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHHHHHh---HHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHH Confidence 21 2233322211 1134555544443221 11111111111111111111 111 111122344456 Q ss_pred HHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCcee-EE Q lcl|NC_019705. 67 VSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVI-SL 145 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~-~l 145 (424) |+.+++.+.--.|. ..+ +. .....+.++.. . |.. ......+..+.+.+|.||+.+..+.+|.+. .+ T Consensus 78 Vd~~a~rl~~~Gf~---~~d-~~----~~~~~l~~i~~-~-N~l---d~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I 144 (504) T protein:vir:99 78 VDTLARRCNLESFV---WPD-GD----YGSIGGPDVWD-E-NFF---ATKANNAMVSSLIHGPAFLINTEGGAGEPDSLI 144 (504) T ss_pred HHHHHhhhccceee---CCC-CC----hhhHHHHHHHH-h-cCh---hhHHHHHHHHHHhhCceeEEEecCCCCCceeEE Confidence 77776654333332 221 11 11123444432 2 332 345678888999999999999988888764 56 Q ss_pred EEecCceeEEeecCceE----EEE---EEeCCce---EEecHhH------------------------EEEeecC-CCCC Q lcl|NC_019705. 146 LPLQSANMDVKLVGKKV----VYR---YQRDSEY---AEFSQKE------------------------IFHLKGF-GFTG 190 (424) Q Consensus 146 ~~l~~~~v~~~~~~~~~----~~~---~~~~~~~---~~~~~~e------------------------iih~r~~-~~~~ 190 (424) .+++|..+.+..|+... .+. ...++.. ..+.++. |++|.+. ..+. T Consensus 145 ~~~sP~~~~~iyD~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~gvPvV~~~n~~~~~~ 224 (504) T protein:vir:99 145 HVKSAMQATGEWNSRRNAMDSLLSITSRDAEGHPTGIALYEDGVTVTADMDDDGDWHADVRTHKLGVPVEVLPYKPREDR 224 (504) T ss_pred EEeccceeEEEEeCCCCceeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCceeeeccccCCCCcceEEecccccCcc Confidence 78899988877664210 011 1111110 1122222 4444432 2344 Q ss_pred cccCchH----HHHHHHHHHHHHHHHHHHHHHhcCCCCceeEE-cCCCCC---CHHHHHHHHHHHHHHhCCcccCcceec Q lcl|NC_019705. 191 LVGLSPI----AFACKSAGVAVAMEDQQRDFFANGAKSPQILS-TGEKVL---TEQQRSQVEENFKEIAGGPVKKRLWIL 262 (424) Q Consensus 191 ~~G~s~i----~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~-~~~~~~---~~~~~~~~~~~~~~~~~~~~ag~~~~l 262 (424) .+|.|.+ ..+.+.+.....-......||.. |.-++. ...... +......++....+...-+......+. T Consensus 225 ~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~---p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~ 301 (504) T protein:vir:99 225 PLGSSRITRPVMSLQQRALKGCIRMDGHADVYSF---PQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDA 301 (504) T ss_pred ccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcc---hhhhhccCCccccccccccccchhhhhhhhhhcCCCccccccc Confidence 5676643 34444444444444444555544 222221 110000 111112233332222221111111111 Q ss_pred -CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-CcccchhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019705. 263 -EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-STSWGSGIEQQNLGFLQYTLQPYISRWENSIQR 340 (424) Q Consensus 263 -~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~-~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~ 340 (424) ....++.++....-+ .|++.++..+.+|+.+=++|++.||.... ++.|...++.+...+... +.-..+.|.+.+.+ T Consensus 302 ~~~~~~~~q~~~~~l~-~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~k-a~~k~~~f~~~l~~ 379 (504) T protein:vir:99 302 ARARADVKQFPASSPQ-PHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAE-AEGATDDWSPAFRR 379 (504) T ss_pred cCccceeeecCCCChH-HHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH Confidence 123455555543222 38899999999999999999999986544 333332232222222221 12222222222221 Q ss_pred ------hccChhh--h-cccchhhhhhhhhccCHHHHHHHHHHHHhCCCCC--H-HHHHHHhCCCCCC-----------C Q lcl|NC_019705. 341 ------WLIPAKD--V-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRT--I-NEMRRTDNLPPLP-----------G 397 (424) Q Consensus 341 ------~l~~~~~--~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t--~-NE~R~~~g~~p~~-----------g 397 (424) .+....+ . ....+++.+......+..+.++++.++++.|... . .-+.+++|+.+-+ . T Consensus 380 ~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~ 459 (504) T protein:vir:99 380 SMIRALAIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRALAERRRAS 459 (504) T ss_pred HHHHHHHHhcCCCccccccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHHHHHHHHh Confidence 1111111 0 1123344445566678888999999999988532 2 3344566775421 0 Q ss_pred C----Ceeeecccccc------hhhccccCCCcccCC Q lcl|NC_019705. 398 G----DVAMRQSQYVP------ITDLGTNKEPRNNGA 424 (424) Q Consensus 398 g----d~~~~~~n~~~------~~~~~~~~~~~~~ga 424 (424) + +.+....+... -+..++...++.++| T Consensus 460 ~~~~~~~l~~~~~~~~~~~~~~~~~~~e~a~~~~~~~ 496 (504) T protein:vir:99 460 SVSIIEALNRRQQEAATAGEDQDQGAGEPPANEPPAA 496 (504) T ss_pred hHHHHHHHhcccCCCCCCCCCCCcCCCCCCCCCCCcc Confidence 0 00000000000 000000001111111 No 166 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.25 E-value=2.1e-06 Score=51.76 Aligned_cols=387 Identities=11% Similarity=0.024 Sum_probs=177.6 Q ss_pred CchHHHHHhhccCcccC-----------Cccccchh----------hccccc-cccCccc----ccHHHHhccHHHHHHH Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLV-----------TPNQGSQT----------GPVSAH-GHLGDSS----INDERILQISTVWRCV 67 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~-----------~~~~~~~~----------~~~~~~-~~~~~~~----vs~~~~~~~~~v~~~i 67 (424) +|||+++++||++.... .+...... ..+.+. ....... ...+.......-..++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 99999999998652110 01011000 011110 0000000 0011222334445566 Q ss_pred HHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEE Q lcl|NC_019705. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..|+-+..=|..+.-. +. .....+.++|. -|. .....+..+...+..|.+++.+..+. |.+ .+.. T Consensus 81 ~~~A~lv~~e~~~i~~~---d~----~~~~~l~~il~--~n~---f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~-~I~~ 146 (500) T protein:vir:30 81 KKIASLVFNEQAEIKVD---DD----AANEFISETLK--NDR---FNKNFERYLESCLALGGLAMRPYVDG-DKV-RVAF 146 (500) T ss_pred HHHhhhhcCCcceEecC---Ch----HHHHHHHHHHh--hcc---HHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEE Confidence 66666665544333111 10 11122333333 222 23445666777777888887776654 333 3555 Q ss_pred ecCceeEEee-cC------------------ceEEEE----E-EeCCce-----------------EEec--------Hh Q lcl|NC_019705. 148 LQSANMDVKL-VG------------------KKVVYR----Y-QRDSEY-----------------AEFS--------QK 178 (424) Q Consensus 148 l~~~~v~~~~-~~------------------~~~~~~----~-~~~~~~-----------------~~~~--------~~ 178 (424) +++..+.+.. +. +..+|. + ..++.. ..++ +. T Consensus 147 v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~ 226 (500) T protein:vir:30 147 VQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKD 226 (500) T ss_pred EcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCc Confidence 6666655421 11 111110 0 001100 0010 01 Q ss_pred H----------EEEeecCCC-----CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeE-----EcCCCCCCHH Q lcl|NC_019705. 179 E----------IFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQIL-----STGEKVLTEQ 238 (424) Q Consensus 179 e----------iih~r~~~~-----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl-----~~~~~~~~~~ 238 (424) + ..||+.+.. +.+.|+|....+...+......-.....-|+.|. ...++ .......+.+ T Consensus 227 ~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~-~~i~v~~~~l~~~~~~~~g~ 305 (500) T protein:vir:30 227 EAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQ-RRVAVPESLTALTVRTTDGD 305 (500) T ss_pred ceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCc-ceeeechHHhcccCCCCCcc Confidence 1 234443322 2357999999999888866666666666666544 33333 1111100000 Q ss_pred HHHHHHHHHHHHhCCcccCccee--cCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHH Q lcl|NC_019705. 239 QRSQVEENFKEIAGGPVKKRLWI--LEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~--l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e 316 (424) .. ....+. ..++....+- ..++..++.++....+-++.+..+....+|+...|+++..++...++..+..-+. T Consensus 306 ~~--~~~~~d---~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~ 380 (500) T protein:vir:30 306 VV--PRPRFE---SDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIV 380 (500) T ss_pred cc--CCcccC---CCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHH Confidence 00 000000 0000000000 1223356677766777889999999999999999999999987655433211110 Q ss_pred ----------HHHHHHHHHHHHHHHHHHHHHHHh-hccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHH Q lcl|NC_019705. 317 ----------QQNLGFLQYTLQPYISRWENSIQR-WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINE 385 (424) Q Consensus 317 ----------~~~~~~~~~tl~P~~~~ie~~l~~-~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE 385 (424) ......++.+|.-++..|.+.... .++...-.....+.+++++-+..|.++..+...+++.+|+|+.-+ T Consensus 381 s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~ 460 (500) T protein:vir:30 381 SENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREM 460 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHH Confidence 112233344444444444332211 122111112234556666667788999999999999999999999 Q ss_pred HHHH-hCCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 386 MRRT-DNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 386 ~R~~-~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++.+ .|++. +..++.+..... +.........+.++ T Consensus 461 ~i~~~~g~~e-eea~~~l~~i~~---E~~~~~~~~~~~~~ 496 (500) T protein:vir:30 461 AIQKVLNVTE-EKAQEIAAEINT---GIVDEINQQRTDTH 496 (500) T ss_pred HHHhcCCCCH-HHHHHHHHHHHH---hccccCCCCCcccc Confidence 8855 46532 112111111000 00111111111111 No 167 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.25 E-value=2.1e-06 Score=51.76 Aligned_cols=387 Identities=11% Similarity=0.024 Sum_probs=177.6 Q ss_pred CchHHHHHhhccCcccC-----------Cccccchh----------hccccc-cccCccc----ccHHHHhccHHHHHHH Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLV-----------TPNQGSQT----------GPVSAH-GHLGDSS----INDERILQISTVWRCV 67 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~-----------~~~~~~~~----------~~~~~~-~~~~~~~----vs~~~~~~~~~v~~~i 67 (424) +|||+++++||++.... .+...... ..+.+. ....... ...+.......-..++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 99999999998652110 01011000 011110 0000000 0011222334445566 Q ss_pred HHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEE Q lcl|NC_019705. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..|+-+..=|..+.-. +. .....+.++|. -|. .....+..+...+..|.+++.+..+. |.+ .+.. T Consensus 81 ~~~A~lv~~e~~~i~~~---d~----~~~~~l~~il~--~n~---f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~-~I~~ 146 (500) T protein:vir:98 81 KKIASLVFNEQAEIKVD---DD----AANEFISETLK--NDR---FNKNFERYLESCLALGGLAMRPYVDG-DKV-RVAF 146 (500) T ss_pred HHHhhhhcCCcceEecC---Ch----HHHHHHHHHHh--hcc---HHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEE Confidence 66666665544333111 10 11122333333 222 23445666777777888887776654 333 3555 Q ss_pred ecCceeEEee-cC------------------ceEEEE----E-EeCCce-----------------EEec--------Hh Q lcl|NC_019705. 148 LQSANMDVKL-VG------------------KKVVYR----Y-QRDSEY-----------------AEFS--------QK 178 (424) Q Consensus 148 l~~~~v~~~~-~~------------------~~~~~~----~-~~~~~~-----------------~~~~--------~~ 178 (424) +++..+.+.. +. +..+|. + ..++.. ..++ +. T Consensus 147 v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~ 226 (500) T protein:vir:98 147 VQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKD 226 (500) T ss_pred EcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCc Confidence 6666655421 11 111110 0 001100 0010 01 Q ss_pred H----------EEEeecCCC-----CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeE-----EcCCCCCCHH Q lcl|NC_019705. 179 E----------IFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQIL-----STGEKVLTEQ 238 (424) Q Consensus 179 e----------iih~r~~~~-----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl-----~~~~~~~~~~ 238 (424) + ..||+.+.. +.+.|+|....+...+......-.....-|+.|. ...++ .......+.+ T Consensus 227 ~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~-~~i~v~~~~l~~~~~~~~g~ 305 (500) T protein:vir:98 227 EAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQ-RRVAVPESLTALTVRTTDGD 305 (500) T ss_pred ceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCc-ceeeechHHhcccCCCCCcc Confidence 1 234443322 2357999999999888866666666666666544 33333 1111100000 Q ss_pred HHHHHHHHHHHHhCCcccCccee--cCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHH Q lcl|NC_019705. 239 QRSQVEENFKEIAGGPVKKRLWI--LEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~--l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e 316 (424) .. ....+. ..++....+- ..++..++.++....+-++.+..+....+|+...|+++..++...++..+..-+. T Consensus 306 ~~--~~~~~d---~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~ 380 (500) T protein:vir:98 306 VV--PRPRFE---SDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIV 380 (500) T ss_pred cc--CCcccC---CCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHH Confidence 00 000000 0000000000 1223356677766777889999999999999999999999987655433211110 Q ss_pred ----------HHHHHHHHHHHHHHHHHHHHHHHh-hccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHH Q lcl|NC_019705. 317 ----------QQNLGFLQYTLQPYISRWENSIQR-WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINE 385 (424) Q Consensus 317 ----------~~~~~~~~~tl~P~~~~ie~~l~~-~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE 385 (424) ......++.+|.-++..|.+.... .++...-.....+.+++++-+..|.++..+...+++.+|+|+.-+ T Consensus 381 s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~ 460 (500) T protein:vir:98 381 SENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREM 460 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHH Confidence 112233344444444444332211 122111112234556666667788999999999999999999999 Q ss_pred HHHH-hCCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 386 MRRT-DNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 386 ~R~~-~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++.+ .|++. +..++.+..... +.........+.++ T Consensus 461 ~i~~~~g~~e-eea~~~l~~i~~---E~~~~~~~~~~~~~ 496 (500) T protein:vir:98 461 AIQKVLNVTE-EKAQEIAAEINT---GIVDEINQQRTDTH 496 (500) T ss_pred HHHhcCCCCH-HHHHHHHHHHHH---hccccCCCCCcccc Confidence 8855 46532 112111111000 00111111111111 No 168 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=98.25 E-value=2.1e-06 Score=51.72 Aligned_cols=394 Identities=9% Similarity=0.006 Sum_probs=172.2 Q ss_pred CC-----CCcccccC------------------------CCCCchHHHHHhhccCcccCCccccchhhccccccccCccc Q lcl|NC_019705. 1 ME-----EPKYTIDL------------------------RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSS 51 (424) Q Consensus 1 ~~-----~~~~~~~~------------------------~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 51 (424) |. .+-|+|.+ ..+.--+.++.....+........................+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~ 80 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKT 80 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccccccccccc Confidence 00 00111110 00111233333333222110000000000000000000000 Q ss_pred ccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeE Q lcl|NC_019705. 52 INDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAY 131 (424) Q Consensus 52 vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~ 131 (424) ..=+.++....+|+..++-+.+-|+.+- . .+. + ....+. .+.. | ........+..+++.+|.+| T Consensus 81 ---~~ri~~n~~~~ivd~~~~yl~g~~~~~~-~-~d~---~--~~~~l~-~~~~--n---~~~~~~~~~~~~~~~~G~~~ 144 (503) T protein:vir:59 81 ---NNRTSHAWHKLFVDQKTQYLVGEPVTFT-S-DNK---T--LLEYVN-ELAD--D---DFDDILNETVKNMSNKGIEY 144 (503) T ss_pred ---cceeecchHHHHHHHHHhhhhcCCeeec-c-CcH---H--HHHHHH-HHHh--c---CHHHHHHHHHHHHhhCCeEE Confidence 0012355677789999998888887752 1 111 1 111222 2322 2 23456677888999999999 Q ss_pred EEEeeCCCCceeEEEEecCceeEEeecCce---E-----EEEEEe-CCce----EEecHhHEEEeecC------------ Q lcl|NC_019705. 132 ALVDRNSAGDVISLLPLQSANMDVKLVGKK---V-----VYRYQR-DSEY----AEFSQKEIFHLKGF------------ 186 (424) Q Consensus 132 ~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~---~-----~~~~~~-~~~~----~~~~~~eiih~r~~------------ 186 (424) +.+..+.+|++ .+..++|..+.+..++.. . +|.... .+.. ..+.++.|.+++.. T Consensus 145 ~~v~~d~dg~~-~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~ 223 (503) T protein:vir:59 145 WHPFVDEEGEF-DYVIFPAEEMIVVYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGE 223 (503) T ss_pred EEEeecCCCce-EEEEEccceeEEEEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccc Confidence 99999888876 478888888887665421 1 111111 1110 12333333332210 Q ss_pred -------------------C----CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019705. 187 -------------------G----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 187 -------------------~----~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~ 243 (424) + .+...|.|-+..+...++....+.......+...+.|-.+++--.....++....+ T Consensus 224 ~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~ 303 (503) T protein:vir:59 224 NNPRPHMTKGGQAIGWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANL 303 (503) T ss_pred cccccceeecceeccCCccceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhh Confidence 0 12235777677666666665555555555556666676665532221111111111 Q ss_pred HHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHH------- Q lcl|NC_019705. 244 EENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE------- 316 (424) Q Consensus 244 ~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e------- 316 (424) ..++++.++++.+++.+........+....+...+.|...-++|..-... ..++.++...+ T Consensus 304 -----------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~~~Sg~Ai~~~~~~l~ 371 (503) T protein:vir:59 304 -----------RYHSVIKVSGDGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPET-IGGGATGPALENLYALLD 371 (503) T ss_pred -----------hcccceeccCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCccc-ccccccHHHHHHHHHHHH Confidence 12235556555555444433333444555666666666666665321111 11222322221 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCC Q lcl|NC_019705. 317 ---QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLP 393 (424) Q Consensus 317 ---~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~ 393 (424) +.....+...|+-.+..|...++........ ....+.+.+..-+..|..+.++.+.+++.+|+++...+.++++.- T Consensus 372 ~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~-~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~v 450 (503) T protein:vir:59 372 LKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFN-PDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARNPFV 450 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccc-cccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhCCCC Confidence 1122223333333333333333321111111 111234444566778899999999999999999998888887663 Q ss_pred CCCCCC--eee--------ecccc---cchhhccccCCCc-----ccCC Q lcl|NC_019705. 394 PLPGGD--VAM--------RQSQY---VPITDLGTNKEPR-----NNGA 424 (424) Q Consensus 394 p~~ggd--~~~--------~~~n~---~~~~~~~~~~~~~-----~~ga 424 (424) +-|..+ ..- ...+. .+.....+++++. .+|+ T Consensus 451 ~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (503) T protein:vir:59 451 QDPEEELARIEEEMNQYAEMQGNLLDDEGGDDDLEEDDPNAGAAESGGA 499 (503) T ss_pred CCHHHHHHHHHHHHHHHHhhhccccCccCCCCCCCcCCCCCCcccCCCC Confidence 321110 000 00000 0000001111111 1111 No 169 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.23 E-value=2.3e-06 Score=51.54 Aligned_cols=398 Identities=12% Similarity=0.113 Sum_probs=163.6 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccC--cccccH---HHHhccHHHHHHHHHHHHhhc Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLG--DSSIND---ERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~vs~---~~~~~~~~v~~~i~~ia~~ia 75 (424) |-+ -..++ ..-|++++...+....... .....-+.+-..+. +..+.. ..-+.......+|+.+++.+- T Consensus 1 ~~~-~~~~d---~~~~i~~L~~~~~~~~~r~---~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~ 73 (488) T protein:vir:23 1 MAE-TESID---PEKLRDQLLDAFENKQNEL---KSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYVDAIAERQE 73 (488) T ss_pred CCc-ccCCC---HHHHHHHHHHHHHHHHHHH---HHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHHHHHHHhhh Confidence 111 11111 1134444444333221100 00000011111000 011111 111223455567777766544 Q ss_pred cCceEEEEecc----CCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC--------CCcee Q lcl|NC_019705. 76 CLPLDVFETDQ----NDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS--------AGDVI 143 (424) Q Consensus 76 ~~~~~v~~~~~----~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~--------~G~~~ 143 (424) --+|.+-.... .+...+ ....+.+++. -| ........+..+++.+|.||+.+.++. .|.+ T Consensus 74 ~~Gf~~~~~~~~~~~~~~d~~--~~~~l~~i~~--~N---~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~- 145 (488) T protein:vir:23 74 LEGFRIPSANGEEPESGGEND--PASELWDWWQ--AN---NLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVP- 145 (488) T ss_pred ccceeccCCcccccccccchh--HHHHHHHHHH--hc---ChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcc- Confidence 33343321110 011111 1123444443 22 235667778899999999999886643 2222 Q ss_pred EEEEecCceeEEeecCc--eE----EEEEEeCCc-e---EEecHhH-------------------------EEEeecCC- Q lcl|NC_019705. 144 SLLPLQSANMDVKLVGK--KV----VYRYQRDSE-Y---AEFSQKE-------------------------IFHLKGFG- 187 (424) Q Consensus 144 ~l~~l~~~~v~~~~~~~--~~----~~~~~~~~~-~---~~~~~~e-------------------------iih~r~~~- 187 (424) .+.+++|..+.+..|+. .. .|.+...+. . ..+.++. |++|++.. T Consensus 146 ~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~ 225 (488) T protein:vir:23 146 LIRVEPPTALYAEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPVIPISNRTR 225 (488) T ss_pred eEEEeccceeEEEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcceEEeccccc Confidence 36677888877666532 11 111111110 0 0112222 34554322 Q ss_pred CCCcccCchHH----HHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHH--HHHHHHHHHHhCCcccCccee Q lcl|NC_019705. 188 FTGLVGLSPIA----FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR--SQVEENFKEIAGGPVKKRLWI 261 (424) Q Consensus 188 ~~~~~G~s~i~----~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~--~~~~~~~~~~~~~~~ag~~~~ 261 (424) ..+.+|.|-+. .+.+.+....+-..-...++.. |..++.- .. .++... ..-...++.. .++++. T Consensus 226 ~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~---p~~~i~G-~~-~~~~~~~~~~~~~~~~~~-----~~~v~~ 295 (488) T protein:vir:23 226 LSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAI---PQRLIFG-AK-PEELGINAETGQRMFDAY-----MARILA 295 (488) T ss_pred cCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhh---HHHHHhC-CC-cccccccccccchhhhhh-----hhhhcc Confidence 34557777553 2333333333222222333332 3322221 00 011000 0011112221 234666 Q ss_pred cCCC--ceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHH---HHHHHHHH Q lcl|NC_019705. 262 LEAG--FSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ---PYISRWEN 336 (424) Q Consensus 262 l~~g--~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~---P~~~~ie~ 336 (424) +++| .++.++.....+ .+++..+..+.+|+..=++|+..+|....+..|..........+...+-. -+-..+.+ T Consensus 296 ~~~g~~~~~~q~~~~~~~-~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~ 374 (488) T protein:vir:23 296 FEGGEGAHAEQFSAAELR-NFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQ 374 (488) T ss_pred CCCCCCceeEecCCCChH-HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7666 456666543333 37888999999999999999999986544333322222222222111110 11111111 Q ss_pred HHHhh--ccChhhh--cccchhhhhhhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCC--CCC----eeeec Q lcl|NC_019705. 337 SIQRW--LIPAKDV--GRIHAEHNLDGLLRGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP--GGD----VAMRQ 404 (424) Q Consensus 337 ~l~~~--l~~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~--ggd----~~~~~ 404 (424) .+..- +....+. ....+++.+......+....++.+.+++++| +++..-+++++|+-+-+ ..+ +-... T Consensus 375 ~~~l~~~~~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~~ 454 (488) T protein:vir:23 375 AMRLAYKMVKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQKQ 454 (488) T ss_pred HHHHHHHHhcCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHHHH Confidence 11110 1111110 1122333334445567788888889988865 78898888999875432 111 10000 Q ss_pred --ccccchh------------hccccCCCcccCC Q lcl|NC_019705. 405 --SQYVPIT------------DLGTNKEPRNNGA 424 (424) Q Consensus 405 --~n~~~~~------------~~~~~~~~~~~ga 424 (424) ..+..+. ..++...++.+.| T Consensus 455 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a 488 (488) T protein:vir:23 455 GLGLIGSLYGASTPEGKPGEAPVGEPPAPEPDAA 488 (488) T ss_pred HHHHHHHHhccCCCcccCCCCCCCCCCCCCCCCC Confidence 0000000 0111223333333 No 170 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=98.23 E-value=4e-08 Score=61.12 Aligned_cols=182 Identities=10% Similarity=0.046 Sum_probs=97.2 Q ss_pred eEEcCC--CCCCHHHHHHHHHHHHHHhCCc-ccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhC Q lcl|NC_019705. 227 ILSTGE--KVLTEQQRSQVEENFKEIAGGP-VKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVG 303 (424) Q Consensus 227 vl~~~~--~~~~~~~~~~~~~~~~~~~~~~-~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~ 303 (424) |++.+. ...+.. ...++++++...... +.+.+.+...+.+|..++.+...+ .+........||++-|||...|- T Consensus 1 V~k~~~l~~~~~~~-~~~~~~r~~~~~~~~~~~~~~~ld~~~e~~e~~~~~lsGl--~d~l~~~~~~iaa~s~iP~t~Lf 77 (201) T protein:vir:10 1 MWKAKGLADLCDDS-DGAARLRLAQVDNNSGVGQAIGIDADSEEYNVLNSDIGGI--DTFLSQKFDRIVALSGIHEIILK 77 (201) T ss_pred CccchHHHHHhcCC-hHHHHHHHHHHHHhhhhhhhheeecCCcceeeeecCcCCh--HHHHHHHHHHHHhHhcCchhhhc Confidence 333221 011111 122334444322221 223344555567888888877755 47788889999999999999997 Q ss_pred CCCCCcccchhHHHHHHHHHH-------HHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHH-------H Q lcl|NC_019705. 304 DVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASR-------A 369 (424) Q Consensus 304 ~~~~~~~~~~n~e~~~~~~~~-------~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~-------~ 369 (424) +...+..+. +.+...++||. .-+.|.++.+-+-+ ..+. .+.|.++.|...+.+++ + T Consensus 78 G~sp~Glna-tge~d~~nyyd~i~~~Qe~~l~p~le~l~~~~----~~~~-----~~~~~f~pL~~~s~kekAei~~~~a 147 (201) T protein:vir:10 78 GKNVGGVSA-SQNTALETFYGYVDRKRKAELLPLLEFLLPFI----VTEQ-----EWSVEFNPLSQVSDKDKSEILEKNV 147 (201) T ss_pred CCCCccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHhh----cCCC-----CceEeeCCCCCCCHHHHHHHHHHHH Confidence 776655532 22334445554 44556665544322 1121 23455567777776664 5 Q ss_pred HHHHHHHhCCCCCHHHHHHHhCCCCCCCC--CeeeecccccchhhccccCCCccc Q lcl|NC_019705. 370 AFMKAMGEAGLRTINEMRRTDNLPPLPGG--DVAMRQSQYVPITDLGTNKEPRNN 422 (424) Q Consensus 370 ~~~~~~~~~g~~t~NE~R~~~g~~p~~gg--d~~~~~~n~~~~~~~~~~~~~~~~ 422 (424) +++++++++|+++++|+|+.|--.+.-++ +... .......+.....+.+.++ T Consensus 148 ~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~-~~~~~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 148 NSVAALIAAGIIDADEARDTLRAISTEVKIGEGSI-QTEVVINESEDPLDVSANN 201 (201) T ss_pred HHHHHHHHcCCCCHHHHHHHHHhcCCcCCCCCCCC-CccccccccCCCCCCCCCC Confidence 56678889999999999998755443221 1111 1011111111122233444 No 171 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=98.17 E-value=3.2e-06 Score=50.71 Aligned_cols=389 Identities=11% Similarity=0.087 Sum_probs=162.9 Q ss_pred CCCCcccccCCCCC----chHHHHHhhccCcccCCccccchhhccccccccC--cccccHH--H-HhccHHHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNN----GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLG--DSSINDE--R-ILQISTVWRCVSLIS 71 (424) Q Consensus 1 ~~~~~~~~~~~~~~----G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~vs~~--~-~~~~~~v~~~i~~ia 71 (424) |. +-+++.++. =|...+...+.... +.......-+.+-.... +..+..+ . -.......-+|+..+ T Consensus 1 ~~---~~~~~~~e~~~~~~~~~~l~~~~~~~~---~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~ 74 (486) T protein:vir:42 1 MT---APLPGMEEIEDPAVVREEMISAFEDAS---KDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVA 74 (486) T ss_pred CC---CCCCCCCCcccHHHHHHHHHHHHHHHH---HHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHH Confidence 11 111111111 12333333332211 00000001111111110 0011110 0 011334556777777 Q ss_pred HhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce-------eE Q lcl|NC_019705. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV-------IS 144 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~-------~~ 144 (424) +.+--.+|.+ ..+. .....+.+++. + |.. ......+..+++.+|.||+.+.++..|.. .. T Consensus 75 ~~l~~~g~~~---~~~~-----~~~~~~~~i~~-~-N~~---d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~ 141 (486) T protein:vir:42 75 ERQAVEGFRL---GDAD-----EADEELWQWWQ-A-NNL---DIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPI 141 (486) T ss_pred hhhcccceec---CCCc-----hhHHHHHHHHH-h-cCh---hHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeE Confidence 6654444432 1111 11123444443 2 322 35567788999999999999988765432 35 Q ss_pred EEEecCceeEEeecCce--E--E--EEEEeCCce----EEecHhH-------------------------EEEeecC-CC Q lcl|NC_019705. 145 LLPLQSANMDVKLVGKK--V--V--YRYQRDSEY----AEFSQKE-------------------------IFHLKGF-GF 188 (424) Q Consensus 145 l~~l~~~~v~~~~~~~~--~--~--~~~~~~~~~----~~~~~~e-------------------------iih~r~~-~~ 188 (424) +.+++|..+.+..|... . . +.+...+.. ..+.++. |++|++. .. T Consensus 142 i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~ 221 (486) T protein:vir:42 142 IRVEPPTRMHAEIDPRINRVSKAIRVAYDKEGNEIQAATLYTPMETIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRTRL 221 (486) T ss_pred EEEecccceEEEEeCCCCCeEEEEEEEEecCCCeEEEEEEEcCCcEEEEEecCCcEEeecceecCCCCceEEEecccccc Confidence 67788888877665321 1 0 111111100 0112222 2333332 23 Q ss_pred CCcccCchHH----HHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHH--HHHHHHHHHHHhCCcccCcceec Q lcl|NC_019705. 189 TGLVGLSPIA----FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ--RSQVEENFKEIAGGPVKKRLWIL 262 (424) Q Consensus 189 ~~~~G~s~i~----~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~--~~~~~~~~~~~~~~~~ag~~~~l 262 (424) .+.+|.|-+. .+.+.+....+-......++ +.|..++.-. . .++.. ...-+..|+. ..++++.+ T Consensus 222 ~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~---a~p~~~i~G~-~-~~~~~~~~~~~~~~~~~-----~~~~~~~~ 291 (486) T protein:vir:42 222 SDLYGTSEITPELRSMTDAAARILMLMQATAELM---GVPQRLIFGI-K-PEEIGVDSETGQTLFDA-----YLARILAF 291 (486) T ss_pred CCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhh---cchHHHhhcC-C-ccccccccccccchhhh-----hhchhccc Confidence 4456777544 33344443333222223333 2344333310 0 00000 0001111211 12345555 Q ss_pred C-CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHH----------HHHHHHHHH Q lcl|NC_019705. 263 E-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF----------LQYTLQPYI 331 (424) Q Consensus 263 ~-~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~----------~~~tl~P~~ 331 (424) + ++.++.++.....+ .+++.++..+.+++..=++|+..+|....+..|+.........+ +...+.-.+ T Consensus 292 ~~~~~~~~q~~~~~~e-~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~ 370 (486) T protein:vir:42 292 EDAEGKIQQFSAAELA-NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEEAM 370 (486) T ss_pred CCCCceEEeecccCHH-HHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4 45677666544333 37888999999999999999999986544323322222222211 122222222 Q ss_pred HHHHHHHHhhccChhhh--cccchhhhhhhhhccCHHHHHHHHHHHHhC--CCCCHHHHHHHhCCCCCC--CCCee---- Q lcl|NC_019705. 332 SRWENSIQRWLIPAKDV--GRIHAEHNLDGLLRGDSASRAAFMKAMGEA--GLRTINEMRRTDNLPPLP--GGDVA---- 401 (424) Q Consensus 332 ~~ie~~l~~~l~~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~--g~~t~NE~R~~~g~~p~~--ggd~~---- 401 (424) +.+.. +....+. ....+++.+......+....++.+.+++++ |+++..-+++.+|+-+-+ ..... T Consensus 371 ~l~~~-----~~~~~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~ 445 (486) T protein:vir:42 371 RIAYR-----IMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEE 445 (486) T ss_pred HHHHH-----HhcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHH Confidence 21111 1111110 112333444455567788889999999886 688888888888885432 11100 Q ss_pred ----ee-------cccccchhhcc-----ccCCCcccCC Q lcl|NC_019705. 402 ----MR-------QSQYVPITDLG-----TNKEPRNNGA 424 (424) Q Consensus 402 ----~~-------~~n~~~~~~~~-----~~~~~~~~ga 424 (424) .. ..+-.+-.... +++.....|+ T Consensus 446 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (486) T protein:vir:42 446 AAMGLGLLGTMVDADPTVPGSPSPTAPPKPQPAIESSGG 484 (486) T ss_pred HHHHHHHHHHhhcCCCCCCCCCCCCCCCCCCcccCCCCC Confidence 00 00000000000 0011011111 No 172 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=98.14 E-value=3.8e-06 Score=50.29 Aligned_cols=389 Identities=13% Similarity=0.037 Sum_probs=162.7 Q ss_pred CCCCCchHHHHHhhccCcccCCccccchhhcccccccc--CcccccHHH---HhccHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_019705. 10 LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHL--GDSSINDER---ILQISTVWRCVSLISTLTACLPLDVFET 84 (424) Q Consensus 10 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~vs~~~---~~~~~~v~~~i~~ia~~ia~~~~~v~~~ 84 (424) +-|-+=|+++|..++...... ......-+.+-..+ .+..+..+. -..+.....+|+..++.+--.+|. . T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r---~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~---~ 74 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPN---LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---I 74 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHH---HHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhccCcee---c Confidence 445555666665555321110 00001111111111 011111110 012334556677766655433443 2 Q ss_pred ccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeC------CCCceeEEEEecCceeEEeec Q lcl|NC_019705. 85 DQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN------SAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 85 ~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~------~~G~~~~l~~l~~~~v~~~~~ 158 (424) ..+. .....+..++.. |. .......+..+++.+|.||..+.++ ..|.+ .+.+++|..+.+..| T Consensus 75 ~~d~-----~~~~~l~~i~~~--N~---~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~-~i~~~~p~~~~~~~D 143 (480) T protein:vir:78 75 SEDS-----EGLEELWNWWQA--ND---LDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIP-LIRVESPLYMYAELD 143 (480) T ss_pred CCCc-----hhHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCceEEEEecCccccCCCCCee-EEEEEcccceEEEEc Confidence 2111 112345555542 22 2466778889999999999988763 34444 467788888887776 Q ss_pred Cc---eE----EEEEEeCC-ce----EEecHhH-----------------------------EEEeecCC-CCCcccCch Q lcl|NC_019705. 159 GK---KV----VYRYQRDS-EY----AEFSQKE-----------------------------IFHLKGFG-FTGLVGLSP 196 (424) Q Consensus 159 ~~---~~----~~~~~~~~-~~----~~~~~~e-----------------------------iih~r~~~-~~~~~G~s~ 196 (424) .. .. .|.+.... .. ..+.+++ |++|++.. ..+.+|.|- T Consensus 144 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~ 223 (480) T protein:vir:78 144 PRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE 223 (480) T ss_pred CCCccceEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeecccccCCccCccc Confidence 42 11 11000000 00 0111111 34444332 344677775 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-CCceeeecccC Q lcl|NC_019705. 197 IAF-ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-AGFSTSAIGVT 274 (424) Q Consensus 197 i~~-~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-~g~~~~~l~~~ 274 (424) +.- +...++..............-.+.|..++. +.. .++...+.-...+.... +.++.++ ++.++.++... T Consensus 224 i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~-~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~ 296 (480) T protein:vir:78 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVT-TDELTNDGENTTLDIYY-----GRILTLASEAAKISEFKAA 296 (480) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhh-cCC-ccccccccccchhhhhh-----hhhccCCCCCceEEecCcc Confidence 542 333333222222222222222233444442 111 11111111111122211 2344443 34667666653 Q ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHh------hccChhhh Q lcl|NC_019705. 275 PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR------WLIPAKDV 348 (424) Q Consensus 275 ~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~------~l~~~~~~ 348 (424) .-+. +++..+..+.+|+..=++|+..+|....+..|+.........+...+ .-....+...|.+ .+...... T Consensus 297 ~~~~-~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka-~~~~~~f~~~l~~~~~l~~~~~g~~~~ 374 (480) T protein:vir:78 297 ELRN-FAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMA-ERKGRIFGGAWERAMRIAMQIMGREVT 374 (480) T ss_pred CHHH-HHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHcCCCcc Confidence 3222 67888899999999999999999865433233222222222222111 1111111111111 11111111 Q ss_pred -cccchhhhhhhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCCC--CCe--------eeecccc-------- Q lcl|NC_019705. 349 -GRIHAEHNLDGLLRGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLPG--GDV--------AMRQSQY-------- 407 (424) Q Consensus 349 -~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~g--gd~--------~~~~~n~-------- 407 (424) ....+++.+......+..+.++.+.+++++| +++..-+++.+|+.+-+- -++ .+-.... T Consensus 375 ~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~ 454 (480) T protein:vir:78 375 EEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADA 454 (480) T ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccccCCC Confidence 1122333333444566777888888888765 677777778888765321 100 0000000 Q ss_pred cchhhccccCCCc-ccCC Q lcl|NC_019705. 408 VPITDLGTNKEPR-NNGA 424 (424) Q Consensus 408 ~~~~~~~~~~~~~-~~ga 424 (424) .+-+..++..++. +.++ T Consensus 455 ~~~~~~~~~~~~~~~~~~ 472 (480) T protein:vir:78 455 TPKPTVTETKTETQTSPS 472 (480) T ss_pred CCCCCCCCCCCccccccC Confidence 0000111111111 1111 No 173 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.09 E-value=4.9e-06 Score=49.68 Aligned_cols=388 Identities=13% Similarity=0.018 Sum_probs=173.6 Q ss_pred CCCchHHHHHhhccCcccCCc------ccc--ch---hhcc-ccccccCcc--------------cccHHHHhccHHHHH Q lcl|NC_019705. 12 TNNGWWARLQSWFVGGRLVTP------NQG--SQ---TGPV-SAHGHLGDS--------------SINDERILQISTVWR 65 (424) Q Consensus 12 ~~~G~~~~~~~~~~~~~~~~~------~~~--~~---~~~~-~~~~~~~~~--------------~vs~~~~~~~~~v~~ 65 (424) ==.|+++++++++++-..... ... .. ...+ .+...+.|. ... +.-+....... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~-~~~~s~n~~~~ 79 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVN-RRQLSMNLPKV 79 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccc-cceeecchHHH Confidence 011444455555543111000 000 00 0000 000001110 000 11223344556 Q ss_pred HHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEE Q lcl|NC_019705. 66 CVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISL 145 (424) Q Consensus 66 ~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l 145 (424) +++..|+-+.+=|..+--.++ .....+..++. -|. ...-...++...+..|.+|+.+..|.+|.+ .+ T Consensus 80 iv~~~a~~l~~ep~~i~~~d~-------~~~e~l~~~~~--~n~---f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~-~i 146 (499) T protein:vir:80 80 TAKYMSKLLFNEKVKINIDDE-------TAEEFVLNVLK--TNG---FTKNMERYIEYGEAMGGFVIKVYHDGNKNV-KV 146 (499) T ss_pred HHHHHHHhhhCCcceEeeCCH-------HHHHHHHHHHh--hcc---HHHHHHHHHHHHhhcCcEEEEEEECCCCcE-EE Confidence 777777777666655422110 11122333332 122 345556677788889999999988887765 35 Q ss_pred EEecCceeEEe-ecCceE--------------EE--------------EEEeC--------C--ceEEecHhH------- Q lcl|NC_019705. 146 LPLQSANMDVK-LVGKKV--------------VY--------------RYQRD--------S--EYAEFSQKE------- 179 (424) Q Consensus 146 ~~l~~~~v~~~-~~~~~~--------------~~--------------~~~~~--------~--~~~~~~~~e------- 179 (424) -.++|..+-+. .+.+.. .| .|.+. . .+..++..+ T Consensus 147 ~~v~a~~~~Pi~~d~~~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~ 226 (499) T protein:vir:80 147 SFATADCMYPLSNDSENVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEP 226 (499) T ss_pred EEEcCCceEEEEecCCCeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCC Confidence 66666666543 222110 00 01000 0 011111111 Q ss_pred -----------EEEeecCCC-----CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCC------CH Q lcl|NC_019705. 180 -----------IFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL------TE 237 (424) Q Consensus 180 -----------iih~r~~~~-----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~------~~ 237 (424) +.+||.+-. +...|+|.+.-+...++.....-....+-|..+ ....++ +.... +. T Consensus 227 ~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~-~~~i~v--~~~~l~~~~~~~g 303 (499) T protein:vir:80 227 VVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLG-KKKVLV--PSSFVKTAVNLDG 303 (499) T ss_pred ceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhc-ccceec--chhhhhccCCCCC Confidence 334554321 235699999988888886666655555666654 333333 11110 00 Q ss_pred HHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHH Q lcl|NC_019705. 238 QQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ 317 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~ 317 (424) +....+.... +.+. +.....-+++-.++.++....+-++.+..+...++|....|+++..+|....+..+...+.. T Consensus 304 ~~~~~~~~~~-~~~~---~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s 379 (499) T protein:vir:80 304 STTQYFDSTD-EAFF---LYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVS 379 (499) T ss_pred CcccCCCccc-ceee---EeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHH Confidence 0000000000 0000 00001112233466666666666788889999999999999999999876554332111111 Q ss_pred HH----------HHHHHHHHHHHHHHHHHHHHhhccChh-hhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHH Q lcl|NC_019705. 318 QN----------LGFLQYTLQPYISRWENSIQRWLIPAK-DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEM 386 (424) Q Consensus 318 ~~----------~~~~~~tl~P~~~~ie~~l~~~l~~~~-~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~ 386 (424) .. ...++.+|..++..|-...+....... ......+.++++.-+..|.++.++...+++.+|+|+.-.+ T Consensus 380 ~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~ 459 (499) T protein:vir:80 380 EKSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIA 459 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHH Confidence 11 112222233333333222111111111 1122345556666677889999999999999999999888 Q ss_pred HHHh-CCCCCCCCCeeeeccc-----ccchhhccccCCCcc Q lcl|NC_019705. 387 RRTD-NLPPLPGGDVAMRQSQ-----YVPITDLGTNKEPRN 421 (424) Q Consensus 387 R~~~-g~~p~~ggd~~~~~~n-----~~~~~~~~~~~~~~~ 421 (424) +... |.+- +..++.+.... ..|-.+.+...+..+ T Consensus 460 l~~~~~~~d-~ea~~el~~i~~E~~~~~~~~d~~g~~ge~e 499 (499) T protein:vir:80 460 LQRAWNITE-AEADEWAEMLAKEKQAEIPNNDMTGIFGEEE 499 (499) T ss_pred HhhcCCCCh-HHHHHHHHHHHHHhhcCCCCCCccccCCCCC Confidence 7653 5432 12221111100 001111111000011 No 174 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=98.03 E-value=6.6e-06 Score=48.98 Aligned_cols=382 Identities=14% Similarity=0.062 Sum_probs=162.9 Q ss_pred CCCCCchHHHHHhhccCcccCCccccchhhccccccccC--cccccHH---HHhccHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_019705. 10 LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLG--DSSINDE---RILQISTVWRCVSLISTLTACLPLDVFET 84 (424) Q Consensus 10 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~vs~~---~~~~~~~v~~~i~~ia~~ia~~~~~v~~~ 84 (424) +-|-+=|+.++...+...... ......-+.+-..+. +..+..+ .-.......-+|+..++.+--.+|.+ T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r---~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~--- 74 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPN---LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI--- 74 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHH---HHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhhccCceec--- Confidence 334444555554444221110 000000011111000 0011110 01123344556776666554334432 Q ss_pred ccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeC------CCCceeEEEEecCceeEEeec Q lcl|NC_019705. 85 DQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN------SAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 85 ~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~------~~G~~~~l~~l~~~~v~~~~~ 158 (424) ..+. .....+..++. + |. .......+..+++.+|.||+.+.++ .+|.+ .+.+++|..+.+..| T Consensus 75 ~~d~-----~~~~~l~~i~~-~-N~---~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~i~D 143 (480) T protein:vir:78 75 SEDS-----EGLEELWNWWQ-A-ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELD 143 (480) T ss_pred CCCc-----hhHHHHHHHHH-h-cC---HHHHHHHHHHHHhhcCceEEEeecCccccCCCCCee-EEEEEcccceEEEEc Confidence 2111 11234555554 2 22 3456678899999999999988753 34544 477888888887776 Q ss_pred Cce---E----EEEEEeCC-c----eEEecHhH-----------------------------EEEeecCC-CCCcccCch Q lcl|NC_019705. 159 GKK---V----VYRYQRDS-E----YAEFSQKE-----------------------------IFHLKGFG-FTGLVGLSP 196 (424) Q Consensus 159 ~~~---~----~~~~~~~~-~----~~~~~~~e-----------------------------iih~r~~~-~~~~~G~s~ 196 (424) ... . .|.+.... . ...+.++. |+||.+.. ..+.+|.|- T Consensus 144 ~~~~~~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sd 223 (480) T protein:vir:78 144 PRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE 223 (480) T ss_pred CCCccceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCccc Confidence 421 1 11111100 0 01111222 34444332 344567775 Q ss_pred HH----HHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-CCceeeec Q lcl|NC_019705. 197 IA----FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-AGFSTSAI 271 (424) Q Consensus 197 i~----~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-~g~~~~~l 271 (424) +. .+.+.+.....-......+| +.|..++. +.. .++...+.-...+... .++++.++ ++.++.++ T Consensus 224 i~~~i~~l~Da~~~~~s~~~~~~~~~---a~p~~~i~-G~~-~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~ 293 (480) T protein:vir:78 224 ISPELRKVTDAASRTLMNLQSASQIL---GTPLRVIS-GVT-TDELTNDGENTTLDIY-----YGRILTLASEAAKISEF 293 (480) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhh---cchhhhhh-CCC-ccccccccccchhhhh-----hhhhccCCCCCceEEec Confidence 54 23333333333333333333 33443442 111 1111111111112111 12345544 34567666 Q ss_pred ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHH----HHHh------h Q lcl|NC_019705. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWEN----SIQR------W 341 (424) Q Consensus 272 ~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~----~l~~------~ 341 (424) ....-+ .+++..+..+.+++..-++|+..+|....+..|+.... +....|.-.+...+. .|.+ . T Consensus 294 ~~~~~~-~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~-----~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~ 367 (480) T protein:vir:78 294 KAAELR-NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAII-----ATDSRIVKMAERKGRIFGGAWERAMRIAMQ 367 (480) T ss_pred CccCHH-HHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 654333 37888899999999999999999986443222221221 111222222222111 1111 1 Q ss_pred ccChhhh-cccchhhhhhhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCC--CCCeeeecccccchh----- Q lcl|NC_019705. 342 LIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP--GGDVAMRQSQYVPIT----- 411 (424) Q Consensus 342 l~~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~--ggd~~~~~~n~~~~~----- 411 (424) +...... ....+++.+......+....++.+.+++.+| +++..-+++++|+.+-+ .-++........+++ T Consensus 368 ~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~ 447 (480) T protein:vir:78 368 IMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYST 447 (480) T ss_pred HcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhcc Confidence 1111111 1123344444445567777888888888765 67777778888886532 111100000000000 Q ss_pred -----------hccccC-CCcccCC Q lcl|NC_019705. 412 -----------DLGTNK-EPRNNGA 424 (424) Q Consensus 412 -----------~~~~~~-~~~~~ga 424 (424) ..++.. +.++.++ T Consensus 448 ~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (480) T protein:vir:78 448 TKAQADATPKPTVTETKTETQTSPS 472 (480) T ss_pred ccCCCccccCCCCCCCCCccCCCcc Confidence 111100 1111111 No 175 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=97.98 E-value=8.2e-06 Score=48.47 Aligned_cols=384 Identities=10% Similarity=0.035 Sum_probs=178.5 Q ss_pred CchHHHHHhhccCcccC---Ccc--------c---cchhhcc-ccccccCcc-----------cccHHHHhccHHHHHHH Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLV---TPN--------Q---GSQTGPV-SAHGHLGDS-----------SINDERILQISTVWRCV 67 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~---~~~--------~---~~~~~~~-~~~~~~~~~-----------~vs~~~~~~~~~v~~~i 67 (424) +|||.++++||++.... .+. . ......+ .+..++.+. ....+.......-..++ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTAS 80 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHH Confidence 89999999998743211 000 0 0000000 000010010 00111122234445566 Q ss_pred HHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEE Q lcl|NC_019705. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..|+-+..=|..+.-.+ . .....+..+|. -|. ....++..+...+..|.+++.+..+. |. +.+-. T Consensus 81 ~~~A~lv~~e~~~i~v~d---~----~~~~~l~~~l~--~n~---f~~~~~~~~e~a~a~G~~a~k~~~d~-~~-~~i~~ 146 (522) T protein:vir:47 81 KKIASLVYNEQATITTKN---E----ILQKFLDDMLT--NDR---FNKNFERYLESCLALGGLAMRPYIDG-DK-VRVAF 146 (522) T ss_pred HHHhhhhcCCcceeecCC---h----HHHHHHHHHHh--hcc---hHHHHHHHHHHhhccCCEEEEEEEcC-Cc-eEEEE Confidence 666666555343321111 0 11123444443 222 23445666777777787777665553 32 23333 Q ss_pred ecCceeEEe-ec------------------CceEEEE-----------------------EEe------CC----ceEEe Q lcl|NC_019705. 148 LQSANMDVK-LV------------------GKKVVYR-----------------------YQR------DS----EYAEF 175 (424) Q Consensus 148 l~~~~v~~~-~~------------------~~~~~~~-----------------------~~~------~~----~~~~~ 175 (424) +++..+-+. .+ ....+|. |.+ +. -+.++ T Consensus 147 v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v 226 (522) T protein:vir:47 147 IQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRV 226 (522) T ss_pred EcCCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccc Confidence 444433322 11 1111111 100 00 00000 Q ss_pred c----------HhH----------EEEeecCCC-----CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeE-- Q lcl|NC_019705. 176 S----------QKE----------IFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQIL-- 228 (424) Q Consensus 176 ~----------~~e----------iih~r~~~~-----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl-- 228 (424) + +++ ..|||.+.. +...|+|....+...+......-.....-|+-|-. ..++ T Consensus 227 ~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~-~i~v~~ 305 (522) T protein:vir:47 227 NLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQR-RVIVPE 305 (522) T ss_pred cccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccc-eeecch Confidence 0 011 235554322 23679999998888887666665555666665543 2222 Q ss_pred ---EcCCCCCCHHHHHHHHHHH---HHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_019705. 229 ---STGEKVLTEQQRSQVEENF---KEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLV 302 (424) Q Consensus 229 ---~~~~~~~~~~~~~~~~~~~---~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l 302 (424) ............ ....+ +..+.+-+ .-..++-+++.++....+-++.+..+...+.|+...|+++..+ T Consensus 306 ~~l~~~~~~~~g~~~--~~~~fd~~~~~f~~~~----~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf 379 (522) T protein:vir:47 306 HLTQRQYQRPDGTID--FRPRFDVEQNVYMQIG----GSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMF 379 (522) T ss_pred HHhccCCCCCCcccc--cccccCcccceEeecC----CCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCcccc Confidence 211111110000 00000 11111100 0012333566777777888899999999999999999999999 Q ss_pred CCCCCCcccchhHHHH-------------HHHHHHHHHHHHHHHHHHHHHh-hccChhhhcccchhhhhhhhhccCHHHH Q lcl|NC_019705. 303 GDVEKSTSWGSGIEQQ-------------NLGFLQYTLQPYISRWENSIQR-WLIPAKDVGRIHAEHNLDGLLRGDSASR 368 (424) Q Consensus 303 ~~~~~~~~~~~n~e~~-------------~~~~~~~tl~P~~~~ie~~l~~-~l~~~~~~~~~~~~fd~~~l~~~d~~~~ 368 (424) +...++.. ++.+. .+..++.+|..++..|.+..+. .++...-.....+.+++++-+..|.++. T Consensus 380 ~~~~~~~k---TAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~ 456 (522) T protein:vir:47 380 TFDGQGMK---TATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAE 456 (522) T ss_pred Cccccccc---cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHH Confidence 87655432 22222 3344555555555555433322 1222211223445667777778999999 Q ss_pred HHHHHHHHhCCCCCHHHHHHH-hCCCCCCCCCeee----------ec--ccccchhhccccCCCcccC Q lcl|NC_019705. 369 AAFMKAMGEAGLRTINEMRRT-DNLPPLPGGDVAM----------RQ--SQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 369 ~~~~~~~~~~g~~t~NE~R~~-~g~~p~~ggd~~~----------~~--~n~~~~~~~~~~~~~~~~g 423 (424) .+...+++.+|+|++-+++.+ .|+..- ..++.+ .| ....+.+.. +++.+.+.| T Consensus 457 ~~~~~~~v~aG~~s~e~~i~~~~g~~ee-ea~~el~ri~~E~~~~~~~~~~~~~~~~~-~~~~~d~~~ 522 (522) T protein:vir:47 457 LDYWAKMVAAGFSTKKRAIGKTLNISGV-EAEKELNAINSELLPMNDAELAIYGMHDQ-NEEKADDKG 522 (522) T ss_pred HHHHHHHHhcCCCCHHHHHHhcCCCChH-HHHHHHHHHHHhhccCCCCCCCCCCCCCc-ccccCCCCC Confidence 999999999999999998765 365431 110000 00 011111110 111122222 No 176 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=97.96 E-value=9e-06 Score=48.26 Aligned_cols=398 Identities=11% Similarity=0.022 Sum_probs=178.6 Q ss_pred CCCCcccccC-CC-CCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCc Q lcl|NC_019705. 1 MEEPKYTIDL-RT-NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~~~~~~~~-~~-~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) .+.-+--|.- ++ ..-=++++..+..+............ ..+... .-..++....+|+..++-+-+-| T Consensus 42 ~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~~--------~~~~~~---~ki~~n~~k~Ivd~~~~yl~g~p 110 (502) T protein:vir:48 42 WELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRK--------DNEMAD---KRAVHNYGRMISKFKTGYLAGNP 110 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccc--------cccccc---ceeecchHHHHHHHHhhhhcccC Confidence 0000000000 00 01223445556554311110000000 000000 01234566778888888888888 Q ss_pred eEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeec Q lcl|NC_019705. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~ 158 (424) +.+.-.+... ...+...|. +....-........+..+++.+|.||+.+.++.+|.+ .+..++|..+.+..+ T Consensus 111 ~~~~~~d~~~-------~~~~~~~l~-~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~-~i~~~~p~~~~~vyd 181 (502) T protein:vir:48 111 IRVEYDDNED-------NSQNDDAIK-RIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDET-RIKRLSPLETFVIYD 181 (502) T ss_pred eeEecCCccc-------hhHHHHHHH-HHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCce-EEEEEcccceEEEEc Confidence 8764322111 122333332 1111123456778899999999999999999988875 467788888877665 Q ss_pred Cc---eEE--EE-EE--eC-Cc---eEEecHhHEEEeecC----------C----------CCCcccCchHHHHHHHHHH Q lcl|NC_019705. 159 GK---KVV--YR-YQ--RD-SE---YAEFSQKEIFHLKGF----------G----------FTGLVGLSPIAFACKSAGV 206 (424) Q Consensus 159 ~~---~~~--~~-~~--~~-~~---~~~~~~~eiih~r~~----------~----------~~~~~G~s~i~~~~~~i~~ 206 (424) +. ... ++ |. .. +. ...+.++.++++..- + .+...|.|-+..+...++. T Consensus 182 d~~~~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa 261 (502) T protein:vir:48 182 NSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNQHIYTLDASDSFNEISVTPHAFGTVPITEFLNNADGIGDYETELYLIDL 261 (502) T ss_pred CCCCCceEEEEEEEEEeecCCcEEEEEEEeCCeEEEEEeCCceeeccceecCCCccceEEecCCCCCCCchhhhHHHHHH Confidence 32 111 11 11 11 11 012233333333210 0 1223577778777777776 Q ss_pred HHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHH Q lcl|NC_019705. 207 AVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKF 286 (424) Q Consensus 207 ~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~ 286 (424) ...+.....+.+...+.|-.++.-......++....++.... ......+..-....+.+++-+.....+..+....+. T Consensus 262 ~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~ 339 (502) T protein:vir:48 262 YDSAESDTANHMSDMADAILAIYGDLALPQGMQASDMKRTRL--MQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTR 339 (502) T ss_pred HHHHHHHHHHHHHHhcCceeeeecCcccccccchhhhhhcce--eeccccccccccccCcceeEeeecCCHHHHHHHHHH Confidence 666666666666666667666553322222222222211110 000000001112234455555444444445567888 Q ss_pred HHHHHHHHhCCCHHHhCCCCCCcccchhHHHH----------HHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhh Q lcl|NC_019705. 287 QVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHN 356 (424) Q Consensus 287 ~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd 356 (424) ....|+..-++|+...+... ++.++...+-. ....+...+.-.++.+...+....-. .......+.+. T Consensus 340 L~~~I~~~s~~p~~~~~~~~-~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~-~~~d~~~i~i~ 417 (502) T protein:vir:48 340 LNKDIHVFTNTPDMSDNHFS-GNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEF-KDFDESRLKIT 417 (502) T ss_pred HHHHHHHHhCCCCcCccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-cccccccceEE Confidence 89999999999975443322 22222222111 11223333333333333333221110 11111223344 Q ss_pred hhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCee----------eecccccchhhcc-c--cCCCcc Q lcl|NC_019705. 357 LDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVA----------MRQSQYVPITDLG-T--NKEPRN 421 (424) Q Consensus 357 ~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--gd~~----------~~~~n~~~~~~~~-~--~~~~~~ 421 (424) +...+..|..+.++.+.++ .|+++..-+.+++++-.-|. .+.. ..+.........+ + .+++.+ T Consensus 418 f~~~~p~d~~e~a~~~~kl--~g~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~~~~ 495 (502) T protein:vir:48 418 FTPNLPKSLYEQVSILNDL--GGQVSQETALSLSGLVENPTEELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKETHTD 495 (502) T ss_pred eCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhhhcccccccccccccCCCccCCCCc Confidence 4566677888889988888 47899888888877632111 1000 0000000000000 0 011111 Q ss_pred cCC Q lcl|NC_019705. 422 NGA 424 (424) Q Consensus 422 ~ga 424 (424) +.. T Consensus 496 ~~~ 498 (502) T protein:vir:48 496 DFE 498 (502) T ss_pred CcC Confidence 111 No 177 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=97.86 E-value=1.4e-05 Score=47.15 Aligned_cols=380 Identities=11% Similarity=0.026 Sum_probs=161.7 Q ss_pred CCCCccccc---------------CCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHh----ccH Q lcl|NC_019705. 1 MEEPKYTID---------------LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERIL----QIS 61 (424) Q Consensus 1 ~~~~~~~~~---------------~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~----~~~ 61 (424) .+-|.=.|. +..+..-++++..+..+...... ... ...-.....+ .+. T Consensus 2 ~~~p~~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~-----------~~~--~~~~~~~~~~~~~~~~n 68 (479) T protein:vir:99 2 IDLPDEDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPD-----------LAT--RHKNKEREVLQQLSRKP 68 (479) T ss_pred ccCCcccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccc-----------ccc--ccCChhHHHHHHHhhcC Confidence 333332221 11112222333333332211000 000 0000011111 234 Q ss_pred HHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee----- Q lcl|NC_019705. 62 TVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR----- 136 (424) Q Consensus 62 ~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r----- 136 (424) ....+|+..++.+--.+|.+ . ++. ....+.+++.. |. .......+..+++.+|.||+++.. T Consensus 69 ~~~~iVd~~~~~l~~~gf~~---~-d~~-----~~~~~~~i~~~--N~---~d~~~~~~~~~a~~~G~af~~v~~~~~~~ 134 (479) T protein:vir:99 69 WMGLMVNSFAQQLIVDGYRK---T-GTN-----ENAKGWDTWRL--NQ---MDKQQFWLNRAVLTFGYAFIKVTSGISPL 134 (479) T ss_pred cHHHHHHHHHhhcccccccC---C-Cch-----hhHHHHHHHHh--cC---hhHHHHHHHHHHhhcCceEEEEecCCCCc Confidence 55567777776553333322 1 111 12234555542 32 235567788899999999998864 Q ss_pred CCCCceeEEEEecCceeEEeecCce----EEEEEE-------------------eCCceEEe-c-----HhH--EEEeec Q lcl|NC_019705. 137 NSAGDVISLLPLQSANMDVKLVGKK----VVYRYQ-------------------RDSEYAEF-S-----QKE--IFHLKG 185 (424) Q Consensus 137 ~~~G~~~~l~~l~~~~v~~~~~~~~----~~~~~~-------------------~~~~~~~~-~-----~~e--iih~r~ 185 (424) +..|.+ .+..++|..+.+..++.. ..|.+. .+.....+ . -.. |++|++ T Consensus 135 d~~g~~-~i~~~~p~~~~~iydd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n 213 (479) T protein:vir:99 135 DGTTVA-RIKCIDPRDAFAIWEDPYWDEWPKYLLERQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPFVRYVN 213 (479) T ss_pred CCCCce-EEEEechhheEEEecCCcccceeeEEEeecCceeEEEEecceEEEEEecCCceeeccccccCCCCcceEEeec Confidence 334444 466778888776654321 111111 01100011 0 011 455554 Q ss_pred CCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-CC Q lcl|NC_019705. 186 FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-EA 264 (424) Q Consensus 186 ~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-~~ 264 (424) ......+|.|-+......++............+.-.+.|..++.-- ...+....+ ...+.. ..++++.+ ++ T Consensus 214 ~~~~~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~-~~~~~~~~~--~~~~~~-----~~~~i~~~~~~ 285 (479) T protein:vir:99 214 VMDLRGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGL-MLPEGANAD--QEKMRF-----AQESMLISQNE 285 (479) T ss_pred CCCcCcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCC-Ccccccccc--hhcccc-----ccccceeecCC Confidence 3211236888777666666655555444444444445555444321 111111000 001111 11234433 45 Q ss_pred CceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHH--------HHH Q lcl|NC_019705. 265 GFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISR--------WEN 336 (424) Q Consensus 265 g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~--------ie~ 336 (424) +.++.++.... --.+++.++..+.+|+.+=++|++.+|...+ .|+..+. +....+.-.+.. |++ T Consensus 286 ~~~~~q~~~~~-~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n--~Sg~Al~-----~~~~~l~~ka~~~~~~f~~al~~ 357 (479) T protein:vir:99 286 KASFGAIPAAP-LDGLLNAYKESLLEFLALAQLPPHIAGQIVN--VAADALA-----AGTRQTMQKLFEKQATWKASHNQ 357 (479) T ss_pred CceEEEecccc-hHHHHHHHHHHHHHHhccCCCCHHHcccccc--hHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHH Confidence 56776665322 2336788888899999999999999985432 2221211 222222221211 111 Q ss_pred HHHh--hccChhhh-cccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCC--CCC----------Ce Q lcl|NC_019705. 337 SIQR--WLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTD-NLPPL--PGG----------DV 400 (424) Q Consensus 337 ~l~~--~l~~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~-g~~p~--~gg----------d~ 400 (424) .+.. .+....+. ....+.+.+......+..+.++.+.+++++|+++...+.+++ |+.+- +.- +. T Consensus 358 ~~~l~~~~~~~~~~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~ 437 (479) T protein:vir:99 358 TMRLVNKIEGRTEEATDLDFTITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGK 437 (479) T ss_pred HHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHH Confidence 1111 11111111 112233333344456778899999999999998887777776 66542 100 00 Q ss_pred eeecc--cccchhh------ccccCCC-cccCC Q lcl|NC_019705. 401 AMRQS--QYVPITD------LGTNKEP-RNNGA 424 (424) Q Consensus 401 ~~~~~--n~~~~~~------~~~~~~~-~~~ga 424 (424) ..... ...+.+. ..+.+++ .+.|. T Consensus 438 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (479) T protein:vir:99 438 YMRKLQNGPDPAEQRGGPNGATNMQQANNKTGE 470 (479) T ss_pred HHHHHhcccCcccccCCCCCCCCCCCCCCCCcc Confidence 00000 0000000 0000000 00011 No 178 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=97.85 E-value=1.5e-05 Score=47.07 Aligned_cols=383 Identities=10% Similarity=-0.009 Sum_probs=165.9 Q ss_pred CCCCcccc----------cCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTI----------DLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLI 70 (424) Q Consensus 1 ~~~~~~~~----------~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~i 70 (424) ..++.+.+ +.+.+.--+.++..+..+........ .. .........+. +..-+.++....+|+.. T Consensus 20 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~-~~--~~~~~~~~~~~---~~~ki~~n~~~~Ivd~~ 93 (474) T protein:vir:95 20 QLKPQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQM-KK--VDVYGNIDYDK---PDWRITTNFHQNLVDQK 93 (474) T ss_pred hhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhccc-cc--ccccccccccc---ccceeccchHHHHHHHH Confidence 11111111 01112222333333333211000000 00 00000000000 00112245566688888 Q ss_pred HHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecC Q lcl|NC_019705. 71 STLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQS 150 (424) Q Consensus 71 a~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~ 150 (424) ++-+-+-|+.+- . .+.. ....+..++. |. .......+....+.+|.||+.+.++.+|.+ .+..++| T Consensus 94 ~~~l~g~p~~~~-~-~d~~-----~~~~l~~~~~---n~---~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p 159 (474) T protein:vir:95 94 VSYVASKPVTYS-C-EDES-----VLKIIHDVLD---TR---WDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPA 159 (474) T ss_pred HhhhccCCceec-c-CchH-----HHHHHHHHHh---cc---HHHHHHHHHHHHhhcCcEEEEEEecCCCce-EEEEEcc Confidence 887878887652 1 1111 1123333332 22 345566778899999999999988888875 4677888 Q ss_pred ceeEEeecCce------EEEEEEeCCc--eEEecHhHEEEeecC----------------------C---------CCCc Q lcl|NC_019705. 151 ANMDVKLVGKK------VVYRYQRDSE--YAEFSQKEIFHLKGF----------------------G---------FTGL 191 (424) Q Consensus 151 ~~v~~~~~~~~------~~~~~~~~~~--~~~~~~~eiih~r~~----------------------~---------~~~~ 191 (424) ..+.+..++.. ..+.|...+. ...+.++.+.+++.. . .+.. T Consensus 160 ~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~ 239 (474) T protein:vir:95 160 EQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKNNP 239 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeecCCC Confidence 88876665421 1111111111 112223333222100 0 0223 Q ss_pred ccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeec Q lcl|NC_019705. 192 VGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAI 271 (424) Q Consensus 192 ~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l 271 (424) .|.|-+..+...++....+.....+.+...+.|-.++.-- ...+.+. ... ....++++.++++.+.+.+ T Consensus 240 ~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~-~~~~~~~---~~~-------~~~~~~~i~~~~~~~~~~l 308 (474) T protein:vir:95 240 EEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGY-EGQDLEE---FMR-------GLKYYKAINVDGDGGVETI 308 (474) T ss_pred CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC-Ccccchh---hhh-------hhhccceeeccCCCceeEE Confidence 5777777766777665555555555556666666555422 1111111 111 1123456777777666666 Q ss_pred ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHH----------HHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019705. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTLQPYISRWENSIQRW 341 (424) Q Consensus 272 ~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~----------~~~~~~tl~P~~~~ie~~l~~~ 341 (424) ..+.....+....+...+.|+..-++|..-.+. ..++.++...+... ...+...+.-.++.|.+.+.. T Consensus 309 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~- 386 (474) T protein:vir:95 309 QVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDK-FGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNL- 386 (474) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC- Confidence 555555556778888899999999999522211 11222221111111 122223333333333222211 Q ss_pred ccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeee-----ecccccchhhc- Q lcl|NC_019705. 342 LIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAM-----RQSQYVPITDL- 413 (424) Q Consensus 342 l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--gd~~~-----~~~n~~~~~~~- 413 (424) ..+.....+.|+ .-...|..+ .+..+.+.|+++...+.+.+++-.-+. -+..- ........... T Consensus 387 ---~~d~~~i~v~f~--~~~p~d~~e---~a~~~~~~g~iS~et~i~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~ 458 (474) T protein:vir:95 387 ---KMDVKDIEISFN--FNRMMNDAE---QSQIIAQSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGG 458 (474) T ss_pred ---CcccceeeEEec--cCCCcCHHH---HHHHHHhcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhccccccccc Confidence 111122233443 333344444 444566679999888888876532111 00000 00000000000 Q ss_pred -cccCCCcccCC Q lcl|NC_019705. 414 -GTNKEPRNNGA 424 (424) Q Consensus 414 -~~~~~~~~~ga 424 (424) ...++.++.+- T Consensus 459 ~d~~~~~~~~~~ 470 (474) T protein:vir:95 459 ADGAQQQERSND 470 (474) T ss_pred CCCCcCCCCCcc Confidence 00011111111 No 179 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=97.83 E-value=1.6e-05 Score=46.88 Aligned_cols=389 Identities=8% Similarity=0.005 Sum_probs=169.4 Q ss_pred ccccCCCC-CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_019705. 6 YTIDLRTN-NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFET 84 (424) Q Consensus 6 ~~~~~~~~-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~ 84 (424) +-.++.+. +=.++++..+..+............ ..+.. ..-+.++....+|+..++-+-+-|+.+.-. T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~--------~~~~~---~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~ 69 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGHRRL--------DDEKA---DYRVRHKWGGYISSFATGYVIGNPVSIGVM 69 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCcccccccccc--------cccCC---cceeecchHHHHHHhhhhheeccCceEeeC Confidence 11111111 1234444444433211000000000 00000 001234556667787777777777765321 Q ss_pred ccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCce--- Q lcl|NC_019705. 85 DQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK--- 161 (424) Q Consensus 85 ~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~--- 161 (424) +.+. .+ ....+.+.+.. | .-......+..+.+.+|.||+.+..+.+|.+ .+..++|..+.+..++.. T Consensus 70 -~~~~-~~--~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~ 139 (440) T protein:vir:95 70 -EGGS-AD--QLSTIKDIEWQ--N---DINALNSDLAFDASVYGRAYEYHFRDKDKVD-RVVLISPLEMFVIRDLTVEQN 139 (440) T ss_pred -CCcc-HH--HHHHHHHHHHh--c---CHhHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCc Confidence 1111 11 11223333331 1 2345556788899999999999998888876 467788988887776421 Q ss_pred EE--EE-EEeCCc--eEEecHhHEEEeecC--------------C----------CCCcccCchHHHHHHHHHHHHHHHH Q lcl|NC_019705. 162 VV--YR-YQRDSE--YAEFSQKEIFHLKGF--------------G----------FTGLVGLSPIAFACKSAGVAVAMED 212 (424) Q Consensus 162 ~~--~~-~~~~~~--~~~~~~~eiih~r~~--------------~----------~~~~~G~s~i~~~~~~i~~~~~~~~ 212 (424) .. +. +..... ...+.++.+++++.. + .+...|.|-++.+...++....+.. T Consensus 140 ~~~~i~~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~lida~~~~~s 219 (440) T protein:vir:95 140 IIAAVHLPIYADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWWNNRFRMGDYESEISLIDAYDAGQS 219 (440) T ss_pred eEEEEEEEEecCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEeeCCCCCCCchhhhHHHHHHHHHHHH Confidence 11 11 111111 112233333322100 0 0123467767766666665555555 Q ss_pred HHHHHHhcCCCCceeEEcCC--CCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHH Q lcl|NC_019705. 213 QQRDFFANGAKSPQILSTGE--KVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSE 290 (424) Q Consensus 213 ~~~~~~~ng~~~~~vl~~~~--~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~ 290 (424) ...+.....+.|-.+++-.. ...+++....++....-+. .........+.+.+++.+........+....+...+. T Consensus 220 ~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~ 297 (440) T protein:vir:95 220 DTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFL--KTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLAND 297 (440) T ss_pred HHHHHHHHhhcceeeeecccccCCCCccchhhhhhccceec--ccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHH Confidence 55555555566666665321 1123333332222111111 1111112223333444443333344466788888999 Q ss_pred HHHHhCCCHHHhCCCCCCcccchhHHHHHH----------HHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhh Q lcl|NC_019705. 291 LARFFGVPPHLVGDVEKSTSWGSGIEQQNL----------GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGL 360 (424) Q Consensus 291 Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~----------~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l 360 (424) |+..-++|..-.+... ++.++...+.... ..+...+.-.++.|...+...- ........+.+.+..- T Consensus 298 i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~--~~~~~~~~v~i~f~~~ 374 (440) T protein:vir:95 298 IHRFSRIPNLDDDRFN-STSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAIN--GPVIEANKLTFTFHPN 374 (440) T ss_pred HHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC--CcccccccceEEeCCC Confidence 9999999964443222 2222222211111 2222223333333322222111 1111122334444566 Q ss_pred hccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeee-cccccchhhccccCCCcccCC Q lcl|NC_019705. 361 LRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMR-QSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 361 ~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~-~~n~~~~~~~~~~~~~~~~ga 424 (424) +..|..+.++.+.++ .|+++.--+.++++.-..+ ++... -..-........+..+..+++ T Consensus 375 ~p~~~~~~ad~~~kl--~g~iS~et~~~~l~~~d~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~ 435 (440) T protein:vir:95 375 IPQDVWTEIKAYIEA--GGEISQETLMENASFTDYK--TEHSRILKQGGSSDLEIGQIVGDADVG 435 (440) T ss_pred CCCCHHHHHHHHHHH--hccCcHHHHHHhCCCCCcH--HHHHHHHHHHHHhhhhHHhhccCCCCC Confidence 678888899988887 5789887777777652211 00000 000000000000111111111 No 180 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=97.78 E-value=2e-05 Score=46.40 Aligned_cols=379 Identities=8% Similarity=-0.015 Sum_probs=166.0 Q ss_pred CCCC------cccc----------cCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHH Q lcl|NC_019705. 1 MEEP------KYTI----------DLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVW 64 (424) Q Consensus 1 ~~~~------~~~~----------~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~ 64 (424) -|++ ++.+ +.+++..-+.++..+..+........... .......... +..=+.++... T Consensus 13 ~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~---~~~~~~~~~~---~~~ki~~n~~~ 86 (474) T protein:vir:96 13 HERVVEQIKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKL---DNKGEIDPLK---PDWRMFTNYHQ 86 (474) T ss_pred hhhHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchh---cccccccccc---cchhcccchHH Confidence 1222 2211 11222222333333333221100000000 0000000000 00012345666 Q ss_pred HHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeE Q lcl|NC_019705. 65 RCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVIS 144 (424) Q Consensus 65 ~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~ 144 (424) .+|+..++-+-+-|+.+--. +. .....+..++. | ........+...+..+|.||..+..+.+|.+. T Consensus 87 ~Ivd~~~~~l~g~p~~~~~~--d~-----~~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~- 152 (474) T protein:vir:96 87 NLVDQKVAYAVANPVTFSSD--DD-----KSLKTIQEVLN---H---KWDDKLVDILTAASNKGIEWLQPYIDENGEFK- 152 (474) T ss_pred HHHHhhhhhhcccCceeecC--ch-----HHHHHHHHHHh---c---CHHHHHHHHHHHHHhcCeeEEEEEecCCCceE- Confidence 78888888887788775221 11 11233444443 2 22344566778899999999998888888764 Q ss_pred EEEecCceeEEeecCc---eE---EEEEEeCCc--eEEecHhHEEEeecC-------------------------C---- Q lcl|NC_019705. 145 LLPLQSANMDVKLVGK---KV---VYRYQRDSE--YAEFSQKEIFHLKGF-------------------------G---- 187 (424) Q Consensus 145 l~~l~~~~v~~~~~~~---~~---~~~~~~~~~--~~~~~~~eiih~r~~-------------------------~---- 187 (424) +..++|..+.+..+++ .. ...|...+. ...+..+.+.+++.. + T Consensus 153 i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 232 (474) T protein:vir:96 153 TFRVPAEQAIPIWTNKERDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGR 232 (474) T ss_pred EEEEcccceEEEEcCCCCCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeeccccccccccccccccccccCCCc Confidence 7788998888776542 11 111111111 111222222222110 0 Q ss_pred ------CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCccee Q lcl|NC_019705. 188 ------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWI 261 (424) Q Consensus 188 ------~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~ 261 (424) .+...|.|-+......++....+.....+.+...+.|-.+++--......+ ..... ..++++. T Consensus 233 iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~-------~~~~~----~~~~~i~ 301 (474) T protein:vir:96 233 VPFIPFKNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDE-------FMRNL----KYYKAIN 301 (474) T ss_pred eeEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccc-------hhhhh----hcCceEE Confidence 012357777777777777666666556666666676766654321111111 11111 1235666 Q ss_pred cC-CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHH----------HHHHHHHHH Q lcl|NC_019705. 262 LE-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG----------FLQYTLQPY 330 (424) Q Consensus 262 l~-~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~----------~~~~tl~P~ 330 (424) ++ .|.+++.+........+.+..+...+.|+..-++|..-.... +++.++...+-.... .+...+.-. T Consensus 302 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~ 380 (474) T protein:vir:96 302 VDGDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKF-GNSPSGIALKFMYSNLDLKANKLKNKTLTALQEL 380 (474) T ss_pred ecCCCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCcccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 65 455666665544445567788888999999999996432211 222222222111111 111222222 Q ss_pred HHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeec------ Q lcl|NC_019705. 331 ISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQ------ 404 (424) Q Consensus 331 ~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~------ 404 (424) ++.|..-+. ...+.....+.|+ .-+..|..+ .+..+.++|+++...+.++++.-.-+ +.-+.. T Consensus 381 ~~~i~~~~~----~~~~~~~i~i~f~--~~~p~~~~e---~~~~~~~ag~iS~et~~~~~~~v~d~--~~E~~ri~~E~~ 449 (474) T protein:vir:96 381 LQYIIDFYK----LNIKVQDVEITFN--FNVMVNELE---QSQIGVQSQYLSKETVVTNHPWVDDP--VAELERIEQDNI 449 (474) T ss_pred HHHHHHHhC----CCcccceeeEEec--cCCCcCHHH---HHHHHHhcCCCchHHHHHhCCCCCCH--HHHHHHHHHHHH Confidence 222211110 0111111223343 333344444 44456678999999999887653211 110000 Q ss_pred ---ccccchh----hccccCCCccc Q lcl|NC_019705. 405 ---SQYVPIT----DLGTNKEPRNN 422 (424) Q Consensus 405 ---~n~~~~~----~~~~~~~~~~~ 422 (424) ....+.. .....++.+.| T Consensus 450 e~~~~~~~~~~~~~~~~~d~~~e~~ 474 (474) T protein:vir:96 450 DFNKQLPPLEGDANGRAQDNESETN 474 (474) T ss_pred HHHhcccccccccccccCCCcccCC Confidence 0000100 00011111111 No 181 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=97.70 E-value=2.7e-05 Score=45.65 Aligned_cols=388 Identities=13% Similarity=0.073 Sum_probs=179.6 Q ss_pred CchHHHHHhhccCcccCC------------ccccchh----------hccccc-cccCccccc----HHHHhccHHHHHH Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVT------------PNQGSQT----------GPVSAH-GHLGDSSIN----DERILQISTVWRC 66 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~------------~~~~~~~----------~~~~~~-~~~~~~~vs----~~~~~~~~~v~~~ 66 (424) +|+|.+++++|++..... +...... ..+.+. .++....+. .+..........+ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i 80 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLA 80 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchHHHH Confidence 899999999887632110 0011000 001100 001000000 0111222344556 Q ss_pred HHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEE Q lcl|NC_019705. 67 VSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLL 146 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~ 146 (424) ++..|+-+..=|..+--.+ . .....+..+|.. |. .....+..+.+.+..|.+++.+..+. |. ..+. T Consensus 81 ~~~~A~ll~~e~~~i~~~d---~----~~~e~l~~i~~~--n~---f~~~~~~~~e~a~a~G~~~~k~~~D~-~~-~~i~ 146 (505) T protein:vir:79 81 SAKLASLIFNEQCQVTVSD---E----TANDFLDDVFQQ--ND---FYTTFEEKLEEWIALGSGCVRPYVDS-GK-IKLA 146 (505) T ss_pred HHHHHhhhcCCCceeecCC---h----HHHHHHHHHHHh--cc---HHHHHHHHHHHHhhcCCeEEEEEEeC-Cc-eEEE Confidence 6666666655444332111 0 111223333431 11 24455667788888888888776653 33 2345 Q ss_pred EecCceeEEee-cCc------------------eEEE-----------EEEeCC----------ceEEecHh-------- Q lcl|NC_019705. 147 PLQSANMDVKL-VGK------------------KVVY-----------RYQRDS----------EYAEFSQK-------- 178 (424) Q Consensus 147 ~l~~~~v~~~~-~~~------------------~~~~-----------~~~~~~----------~~~~~~~~-------- 178 (424) .++|..+.+.. +.+ ..+| .|.+.. -+..++-. T Consensus 147 ~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l 226 (505) T protein:vir:79 147 WATADQVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGL 226 (505) T ss_pred EEcCCeeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhccccccc Confidence 55665554421 111 0011 011000 00111101 Q ss_pred ------------HEEEeecCCC-----CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeE-----EcCCCCCC Q lcl|NC_019705. 179 ------------EIFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQIL-----STGEKVLT 236 (424) Q Consensus 179 ------------eiih~r~~~~-----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl-----~~~~~~~~ 236 (424) -+.||+.+.. ....|+|.+..+...+......-....+-|..|.. ..++ ........ T Consensus 227 ~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~-~i~v~~~~l~~~~~~~~ 305 (505) T protein:vir:79 227 EPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQR-RLIVPAEWLKTGSSYGG 305 (505) T ss_pred CcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhccc-ceeechHHhcccCCCCc Confidence 1335554322 23579999999988888766666666666665543 3232 11111000 Q ss_pred HHHHHHHHHHHHHHhCCcccCcceec-CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhH Q lcl|NC_019705. 237 EQQRSQVEENFKEIAGGPVKKRLWIL-EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGI 315 (424) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~ag~~~~l-~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~ 315 (424) + ........+.. .......+.. +++..++.++....+-++.+..+...++|+...|+++..++....+..+..-+ T Consensus 306 ~-~~~~~~~~fd~---~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei 381 (505) T protein:vir:79 306 Q-ASETHPPMFDP---DETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEV 381 (505) T ss_pred c-cccccccCCCc---cceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHH Confidence 0 00000000000 0000001111 22345777777777888899999999999999999999998765543321111 Q ss_pred H----------HHHHHHHHHHHHHHHHHHHHHHHhhccChh-------hhcccchhhhhhhhhccCHHHHHHHHHHHHhC Q lcl|NC_019705. 316 E----------QQNLGFLQYTLQPYISRWENSIQRWLIPAK-------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGEA 378 (424) Q Consensus 316 e----------~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~-------~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~ 378 (424) . ......++.+|..++..|........+... ......+.+++++-+..|.++..+...+++.+ T Consensus 382 ~s~~~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~ 461 (505) T protein:vir:79 382 VTNNSQTYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQA 461 (505) T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHc Confidence 0 111222344444444444332221111111 11123456677777789999999999999999 Q ss_pred CCCCHHHHHHHh-CCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 379 GLRTINEMRRTD-NLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 379 g~~t~NE~R~~~-g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) |+|+.-+++... |++. +..++.+.. +..+.....++..+-|+ T Consensus 462 Gi~s~e~~l~~~~~~~e-eea~~el~r---i~~E~~~~~p~~~~~gg 504 (505) T protein:vir:79 462 QVMPKKQFLMRNYGLDE-EEADEWLAQ---IDAENSTAEPEFNQFGG 504 (505) T ss_pred CCCCHHHHHHhcCCCCh-HHHHHHHHH---HHHhccccCCCchhccC Confidence 999998887653 4432 112111110 01111111222223333 No 182 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=97.70 E-value=2.7e-05 Score=45.62 Aligned_cols=406 Identities=10% Similarity=0.035 Sum_probs=174.5 Q ss_pred CCCCcccc-cCCCCC-c-------------------------hHHHHHhhccCcccCCccccchhhccccccc--cC-c- Q lcl|NC_019705. 1 MEEPKYTI-DLRTNN-G-------------------------WWARLQSWFVGGRLVTPNQGSQTGPVSAHGH--LG-D- 49 (424) Q Consensus 1 ~~~~~~~~-~~~~~~-G-------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~- 49 (424) ||.--|+- .++.++ + ++.++...+.... .+........+.+..+ +. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~--~~r~~~~~~yY~g~~~~i~~~~~ 78 (501) T protein:vir:96 1 MEQTLFTDSTGQERVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQ--APRIQELLDYARGENHDVLKSGR 78 (501) T ss_pred CceeeeeecccceeccccccchhHHhhhcccccccccCChHHHHHHHHHHHHHHH--HHHHHHHHHHhcCCCCcccCccc Confidence 33333321 111111 0 1111111110000 0000000000001000 00 0 Q ss_pred -c-cccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHc Q lcl|NC_019705. 50 -S-SINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFY 127 (424) Q Consensus 50 -~-~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~ 127 (424) . ..-+..-+.++....+|+..++-+-+-|+++.-.+... ...+...|. +....-........+..+++.+ T Consensus 79 ~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~~~~-------~~~~~~~l~-~~~~~n~~~~~~~~~~~~~~~~ 150 (501) T protein:vir:96 79 RKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNDD-------NSQNDDAIK-RIGRINDLDSLNRTLIRDLSQT 150 (501) T ss_pred cCccccccceeecchHHHHHHHHhhhhcccCeeEeeCCccc-------hhHHHHHHH-HHHHhcCHHHHHHHHHHHHhhc Confidence 0 00001113456677788888888888887763322211 122233332 1111223456778899999999 Q ss_pred CCeEEEEeeCCCCceeEEEEecCceeEEeecCc---eE----EEEEEeC--Cce---EEecHhHEEEeec---------- Q lcl|NC_019705. 128 GNAYALVDRNSAGDVISLLPLQSANMDVKLVGK---KV----VYRYQRD--SEY---AEFSQKEIFHLKG---------- 185 (424) Q Consensus 128 G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~---~~----~~~~~~~--~~~---~~~~~~eiih~r~---------- 185 (424) |.||+.+.++.+|.+ .+..++|..+.+..++. .. .|.+... +.. ..+.++.+.++.. T Consensus 151 G~a~~~v~~dedg~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~ 229 (501) T protein:vir:96 151 GRAYEVIYRSEYDET-RIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQSAKDVVEIYTDEHIYTLDASDDFNEISVT 229 (501) T ss_pred CeEEEEEEEcCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeecCCCcEEEEEEEcCCcEEEEeeCCCceecccc Confidence 999999999988876 46778898888777642 11 1111101 110 1122333332211 Q ss_pred CC----------CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcc Q lcl|NC_019705. 186 FG----------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPV 255 (424) Q Consensus 186 ~~----------~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (424) ++ .+...|.|.+..+...++....+.....+.+...+.|-.++.-......++....++... ...... T Consensus 230 ~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~~~--~~~~~~ 307 (501) T protein:vir:96 230 THAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTR--LMQLKP 307 (501) T ss_pred ccCCCccceEEecCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhhhcC--eeeecc Confidence 00 122357787777777777666555555555566666666654322211222211111100 011111 Q ss_pred cCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHH----------HHHHH Q lcl|NC_019705. 256 KKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL----------GFLQY 325 (424) Q Consensus 256 ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~----------~~~~~ 325 (424) .+.......+.++.-+.....+..+....+...+.|+..-++|..-.+... ++.++...+.... ..+.. T Consensus 308 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~~~~ 386 (501) T protein:vir:96 308 PKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFS-GNTSGEALKYKLFGLDQDRVDTQSQFTK 386 (501) T ss_pred cccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111122233444444444444445667778888999999999965544332 2223222211111 22222 Q ss_pred HHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CC----- Q lcl|NC_019705. 326 TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GG----- 398 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~--gg----- 398 (424) .++-.++.+...+...--.. ......+.+.+...+..|..+.++.+.++. |+++..-+.+++++-.-| .. T Consensus 387 ~l~~~~~li~~~~~~~~~~~-~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~--g~iS~et~~~~l~~v~D~~~E~~ri~~ 463 (501) T protein:vir:96 387 GLKRRYRLAARIGSLVNEFK-DFDESLLKITFTPNLPKSLNEQVSILTGLG--GQVSQETALSLSGLVESPNEELDKINK 463 (501) T ss_pred HHHHHHHHHHHHHHhccccc-ccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHHH Confidence 22232222222222111000 011112334445566778888888888885 788887777777652211 00 Q ss_pred --Ce---eeecccccchhhccccCCCcccCC Q lcl|NC_019705. 399 --DV---AMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 399 --d~---~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++ -..+....+..... ..+.++..+ T Consensus 464 E~~~~~~~~~~~~~~~~~~~~-~~~~~e~~~ 493 (501) T protein:vir:96 464 EMSEIDFKGYSNDFNEHVGKY-TDEVKETHT 493 (501) T ss_pred HHHHhhccccccchhhccccc-CCcCCCCCC Confidence 00 01111111111111 111111111 No 183 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=97.63 E-value=3.5e-05 Score=45.02 Aligned_cols=396 Identities=11% Similarity=0.061 Sum_probs=176.8 Q ss_pred CCCCc-ccccCCCC-CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCc Q lcl|NC_019705. 1 MEEPK-YTIDLRTN-NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~~~-~~~~~~~~-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) .++-+ +--.-++. ..=++++..+..+.............. .. ...-+.++....+|+..++-+-+-| T Consensus 41 ~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~--------~~---~~~ki~~n~~k~Ivd~~~~yl~g~p 109 (501) T protein:vir:27 41 WELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRRKDR--------EM---ADKRAVHNYGRMISKFKTGYLAGNP 109 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCccCcc--------cc---ccceeccchHHHHHHHHhhhhcccC Confidence 11111 10000111 111445555554421111000000000 00 0011235567778888888888888 Q ss_pred eEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeec Q lcl|NC_019705. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 79 ~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~ 158 (424) +.+.-.+... .+. ....+.+++. -| .-......+..+++.+|.||+++.++.+|.+ .+..++|..+.+..+ T Consensus 110 ~~~~~~d~~~--~~~-~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~-~i~~~~p~~~~~v~d 180 (501) T protein:vir:27 110 IRVEYDDNDN--NSQ-NDDTIKRIGR--IN---DIDSHNRTLIRDLSQTGRAYEVIYRNEYDET-RIKRLNPLETFVIYD 180 (501) T ss_pred eeEecCCccc--hHH-HHHHHHHHHH--hc---ChhHHHHHHHHHHhhCCeEEEEEEeCCCCce-EEEEEccceeEEEec Confidence 7764332211 110 0111222322 12 3346778889999999999999999888875 466788888877665 Q ss_pred Cc---eE--EEE-EE--eC-Cc---eEEecHhHEEEeec----------CC----------CCCcccCchHHHHHHHHHH Q lcl|NC_019705. 159 GK---KV--VYR-YQ--RD-SE---YAEFSQKEIFHLKG----------FG----------FTGLVGLSPIAFACKSAGV 206 (424) Q Consensus 159 ~~---~~--~~~-~~--~~-~~---~~~~~~~eiih~r~----------~~----------~~~~~G~s~i~~~~~~i~~ 206 (424) +. .. ..+ |. .. +. ...+.++.++.+.. ++ .+...|.|-+..+...++. T Consensus 181 ~~~~~~~~~~ir~~~~~~~~~~~~~~~vyt~~~v~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa 260 (501) T protein:vir:27 181 NSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNEHIYTLDASDDFNEISVTTHAFGTVPITEFLNNVDGIGDYETELYLIDL 260 (501) T ss_pred CCCCCceEEEEEEEEeeecCCcEEEEEEEeCCeEEEEEeCCceeeccccccCCCcccEEEecCCCCCCCchhhhHHHHHH Confidence 42 11 111 11 00 00 01122222222210 00 1233577777777777776 Q ss_pred HHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHH Q lcl|NC_019705. 207 AVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKF 286 (424) Q Consensus 207 ~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~ 286 (424) ...+.....+.+...+.|-.++.-......++....++.. ........+.......+.++.-+.....+..+....+. T Consensus 261 ~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 338 (501) T protein:vir:27 261 YDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRT--RLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTR 338 (501) T ss_pred HHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhhc--CceeecccccccCCCCCcceeeeeccCCHHHHHHHHHH Confidence 6666655565556556666665532222222222222211 11111111112233445555555554445556677888 Q ss_pred HHHHHHHHhCCCHHHhCCCCCCcccchhHHHHH----------HHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhh Q lcl|NC_019705. 287 QVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHN 356 (424) Q Consensus 287 ~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~----------~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd 356 (424) ..+.|+..-++|..-.+... ++.++...+-.. ...+...+.-.+..+...++..--. .......+.+. T Consensus 339 l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~-~~~d~~~i~v~ 416 (501) T protein:vir:27 339 LNRDIHIFTNIPDMSDTNFS-GNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEF-KDFDESLLKIT 416 (501) T ss_pred HHHHHHHHhCCcccCccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-cccccccceEE Confidence 88999999999864433221 222322221111 1222222333333222222211100 01111223444 Q ss_pred hhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CC----------Ceeeeccccc-chhhccccCCCcccC Q lcl|NC_019705. 357 LDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GG----------DVAMRQSQYV-PITDLGTNKEPRNNG 423 (424) Q Consensus 357 ~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~--gg----------d~~~~~~n~~-~~~~~~~~~~~~~~g 423 (424) +...+..+..+.++.+.++ .|+++..-+.+++++-.-| .- +.-.....+. +... ..+++++.+ T Consensus 417 f~~~~p~n~~e~ad~~~kl--~g~iS~et~l~~l~~v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~--~~d~~~~~~ 492 (501) T protein:vir:27 417 FTPNLPKSLNEQVSILTGL--GGQVSQETALSLSGLVESPNEELDKINKEVSEIDFKGYSNDFNEHVGK--YTDEVKETH 492 (501) T ss_pred eCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhHhhhcCcccccccc--ccCCCCCCc Confidence 4566677888888888887 4789887777777553211 10 1001111110 0011 111222222 Q ss_pred C Q lcl|NC_019705. 424 A 424 (424) Q Consensus 424 a 424 (424) + T Consensus 493 ~ 493 (501) T protein:vir:27 493 T 493 (501) T ss_pred c Confidence 2 No 184 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=97.62 E-value=3.6e-05 Score=44.94 Aligned_cols=387 Identities=7% Similarity=-0.022 Sum_probs=168.3 Q ss_pred CCCCcccccCCC--CCchHHHHHhhccCcccCCccccchhhccccccc---------cCcc--cccHHHHhccHHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRT--NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGH---------LGDS--SINDERILQISTVWRCV 67 (424) Q Consensus 1 ~~~~~~~~~~~~--~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~--~vs~~~~~~~~~v~~~i 67 (424) |-.--+.|.... ..-++.++...+... .+.......-+.+... ..+. ..-...=+.++....+| T Consensus 11 ~~~~~~~~~~~~~~~~~~i~~~i~~~~~~---~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv 87 (472) T protein:vir:93 11 IFDAIVRTNNKPETLEEMIVRYIKQHLEK---LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLV 87 (472) T ss_pred hhhceeeecCchhhHHHHHHHHHHHHHHH---HHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHH Confidence 111111111111 112222222211110 0000000000000000 0000 00000012246677788 Q ss_pred HHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEE Q lcl|NC_019705. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..++-+-+-|+.+-- .+.. ....+..++. |. -......+..+++.+|.||+++..+.+|.+ .+-. T Consensus 88 d~~~~~l~g~~~~~~~--~d~~-----~~~~l~~~~~---n~---~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~-~i~~ 153 (472) T protein:vir:93 88 DQKVSYIVGKPIAFKH--TDDE-----VVKRIDEVLG---NR---FDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFR 153 (472) T ss_pred HHHhhhhcccCeeecc--CChH-----HHHHHHHHHh---cc---HHHHHHHHHHHHhhcCeEEEEEEECCCCce-EEEE Confidence 8888888777776521 1111 1122333332 22 235556678899999999999998888876 4677 Q ss_pred ecCceeEEeecCce---EE--E-EEEeCC--ceEEecHhHEEEeecC---------------------------C----C Q lcl|NC_019705. 148 LQSANMDVKLVGKK---VV--Y-RYQRDS--EYAEFSQKEIFHLKGF---------------------------G----F 188 (424) Q Consensus 148 l~~~~v~~~~~~~~---~~--~-~~~~~~--~~~~~~~~eiih~r~~---------------------------~----~ 188 (424) ++|..+.+..++.. .. . .|.... ....+.+..+.+++.. + . T Consensus 154 ~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~ 233 (472) T protein:vir:93 154 VPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK 233 (472) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEeecceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEec Confidence 88888887765321 10 1 111110 0111122222221100 0 0 Q ss_pred CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCcee Q lcl|NC_019705. 189 TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFST 268 (424) Q Consensus 189 ~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~ 268 (424) +...|.|-+..+...++....+.....+.+...+.|..+++-.......+ ..... ...+++.++++.+. T Consensus 234 nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~----~~~~~-------~~~~~~~~~~~~~~ 302 (472) T protein:vir:93 234 NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPE----FKRLL-------RYYGAIKVSDNGGV 302 (472) T ss_pred CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchh----hHHHH-------hhccccccCCCCcc Confidence 23357787777777776666555555555566676766665322111111 11111 12235556665565 Q ss_pred eecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHH----------HHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 269 SAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL----------GFLQYTLQPYISRWENSI 338 (424) Q Consensus 269 ~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~----------~~~~~tl~P~~~~ie~~l 338 (424) +.+........+....+...+.|+..-++|..-.+... ++.++...+.... ..+...+.-.++.+...+ T Consensus 303 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~ 381 (472) T protein:vir:93 303 DTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF 381 (472) T ss_pred eeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 55554455666778888889999999999854333221 2222222211111 111222222222222221 Q ss_pred HhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCee-----eecccccchh Q lcl|NC_019705. 339 QRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVA-----MRQSQYVPIT 411 (424) Q Consensus 339 ~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~--ggd~~-----~~~~n~~~~~ 411 (424) +. +.+. ..+.+.+..-+..|..+.++.+.++ .|+++..-+.+++++-.-+ ..+.. -.......+. T Consensus 382 ~~----~~~~--~~i~v~f~~~~p~~~~~~~~~~~k~--~giis~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~ 453 (472) T protein:vir:93 382 DI----KGEH--KDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLD 453 (472) T ss_pred CC----Cccc--ceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccCcC Confidence 11 1111 2233333455667788888888887 4788887777777653211 00000 0000111111 Q ss_pred hcccc--CCCcccCC Q lcl|NC_019705. 412 DLGTN--KEPRNNGA 424 (424) Q Consensus 412 ~~~~~--~~~~~~ga 424 (424) ..+.. ++..+.+- T Consensus 454 ~~~~d~~~~~~~~~~ 468 (472) T protein:vir:93 454 DGGADGAQQQERSNN 468 (472) T ss_pred cccCCCCCCCCCCCc Confidence 11110 01111111 No 185 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=97.60 E-value=3.9e-05 Score=44.75 Aligned_cols=370 Identities=12% Similarity=0.071 Sum_probs=156.9 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhcccccccc--CcccccH---HHHhccHHHHHHHHHHHHhhc Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHL--GDSSIND---ERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~vs~---~~~~~~~~v~~~i~~ia~~ia 75 (424) |... .+=|++++...+.... +.......-+.+.... .+..... ..-..+....-+|+..++.+- T Consensus 1 ~~~~--------~~~~i~~l~~~~~~~~---~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~ 69 (441) T protein:vir:80 1 MNSD--------ELALIEGMYDRIQRLS---SWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLD 69 (441) T ss_pred CCcc--------HHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhc Confidence 2211 1122333332222110 0000000000000000 0000000 111122344456665555442 Q ss_pred cCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEE Q lcl|NC_019705. 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDV 155 (424) Q Consensus 76 ~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~ 155 (424) -..| ...+ ...+..++. . | +.......+..+++.+|.||+.+.++.+|.+ .+.+++|.++.+ T Consensus 70 ~~g~---~~~d---------~~~l~~i~~-~-n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~ 131 (441) T protein:vir:80 70 WLGW---TNGD---------GYGLDGVYA-A-N---RLATASCDVHLDALIFGLSFVAIIPHGDGTV-SVRPQSPKNCTG 131 (441) T ss_pred cccc---cCCC---------hHHHHHHHH-h-c---CHHHHHHHHHHHHhhcCeeEEEEEeCCCCce-EEEEEccceEEE Confidence 1122 1111 123444443 2 3 2457778889999999999999999999987 578889999887 Q ss_pred eecCce--E----EEEEEeCCce---EEecHhH--------------------------EEEeecCC-CCCcccCchHH- Q lcl|NC_019705. 156 KLVGKK--V----VYRYQRDSEY---AEFSQKE--------------------------IFHLKGFG-FTGLVGLSPIA- 198 (424) Q Consensus 156 ~~~~~~--~----~~~~~~~~~~---~~~~~~e--------------------------iih~r~~~-~~~~~G~s~i~- 198 (424) ..|... . .+.+...+.. ..+.++. |+||.+.. ....+|.|-+. T Consensus 132 i~d~~~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~ 211 (441) T protein:vir:80 132 KFSADGSRLDAGLVVQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITR 211 (441) T ss_pred EEeCCCCceeEEEEEEEEecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCcccchh Confidence 665321 0 0001000000 0011111 45554332 34567777543 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCC-----ceeee Q lcl|NC_019705. 199 ---FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAG-----FSTSA 270 (424) Q Consensus 199 ---~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g-----~~~~~ 270 (424) .+.+.+.....-......++ +.|..++. +.. .++...+. ++. ..++++.++.+ .++.+ T Consensus 212 ~v~~liDa~~~~~s~~~~~~~~~---~~~~~~i~-G~~-~~~~~~~~----~~~-----~~~~i~~~~~~~~~~~~~~~~ 277 (441) T protein:vir:80 212 SIRAYTDEAVRTLLGQSVNRDFY---AYPQRWVT-GVS-ADEFSQPG----WVL-----SMASVWAVDKDDDGDTPNVGS 277 (441) T ss_pred hHHHHHHHHHHHHHHHHHHHHhh---cCceeeee-cCC-ccccccch----hhh-----cccccccCCCCCCCCcceeEe Confidence 33333333333222333333 34554543 211 22221111 111 12334444432 34444 Q ss_pred cccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHH----------HHHHHHHHHHHHHHHHh Q lcl|NC_019705. 271 IGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL----------QYTLQPYISRWENSIQR 340 (424) Q Consensus 271 l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~----------~~tl~P~~~~ie~~l~~ 340 (424) +.....+ .+++..+..+..|+..-++|+..+|....+..|+.........+. ...|.-.++.+...++. T Consensus 278 ~~~~~~~-~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~ 356 (441) T protein:vir:80 278 FPVNSPT-PYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDS 356 (441) T ss_pred cCccchH-HHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 4433222 367888889999999999999999875543223222221111111 11111111111111111 Q ss_pred hccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCC--HHHHHHHhCCCCCCCCCeeeecccccchhhccccC- Q lcl|NC_019705. 341 WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRT--INEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNK- 417 (424) Q Consensus 341 ~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t--~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~- 417 (424) ..-... ....+++.+...+..+..+.++.+.+++++|+.. ..-+++.+|+.+-+- .+... +....++ T Consensus 357 ~~~~~~--~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~e~-~~~~~-------e~~e~~~~ 426 (441) T protein:vir:80 357 RVDEAD--FFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEMLGLDDVQV-EAVMR-------HRAESSDP 426 (441) T ss_pred CCcccc--cceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHHHH-HHHHH-------HHHHHHHH Confidence 000000 0123344445566788888999999999999753 345677777754321 00000 0000000 Q ss_pred CCcccCC Q lcl|NC_019705. 418 EPRNNGA 424 (424) Q Consensus 418 ~~~~~ga 424 (424) -.+-+|. T Consensus 427 ~~~~~~~ 433 (441) T protein:vir:80 427 LAVLAGA 433 (441) T ss_pred HHHHhhh Confidence 0000111 No 186 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=97.52 E-value=5.1e-05 Score=44.10 Aligned_cols=389 Identities=11% Similarity=0.068 Sum_probs=173.8 Q ss_pred CchHHHH----HhhccCcccCC-----ccc-cchhhc------cc----cccccCcccccHHHHhccHHHHHHHHHHHHh Q lcl|NC_019705. 14 NGWWARL----QSWFVGGRLVT-----PNQ-GSQTGP------VS----AHGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 14 ~G~~~~~----~~~~~~~~~~~-----~~~-~~~~~~------~~----~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ 73 (424) +|+|.++ ++|+++..... +.. ...... .. .++......+. ..-+..+.-..+++.+|+- T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~-~~~~~~~l~~~i~~~~A~l 79 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVH-DKLMNSGTGNEIVVVAAEY 79 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccc-cccccCChHHHHHHHHHHh Confidence 6776654 56676543211 000 000000 00 00000011111 1123344456678888887 Q ss_pred hccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCcee Q lcl|NC_019705. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) +..=+..+.-.+.+....+ .....+.++|.. | ....-+...+.+.+..|.+++.+..+ +|++ .+..+++..+ T Consensus 80 l~~e~~~i~v~~~~~~d~e-~~~~~l~~il~~--n---~f~~~~~~~~e~a~a~G~~~~k~~~d-~~~~-~i~~v~ad~~ 151 (518) T protein:vir:78 80 ISGKPLSIDVTGVNGSKDE-NLTKQLKEALRI--D---NFDSKSVKIVELAGGSGVSAVKINIL-NGRP-SISVHSSSQF 151 (518) T ss_pred hcCCCceEEecCccccCcH-HHHHHHHHHHHh--c---cHHHHHHHHHHHhhccCceEEEEEEE-CCee-EEEEEcCCee Confidence 7665544322221111111 111223343432 1 12334445566677778877665444 3443 4555666666 Q ss_pred EEeecCc----------------eEEEE--------------------------EEeC-CceEE-------------ecH Q lcl|NC_019705. 154 DVKLVGK----------------KVVYR--------------------------YQRD-SEYAE-------------FSQ 177 (424) Q Consensus 154 ~~~~~~~----------------~~~~~--------------------------~~~~-~~~~~-------------~~~ 177 (424) .+...++ ..+|. |..+ +.... ... T Consensus 152 ~P~~~~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~ 231 (518) T protein:vir:78 152 WIDFKNNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHT 231 (518) T ss_pred EEEeecCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCccccccccccccccccccccc Confidence 5433211 11111 0000 00000 000 Q ss_pred hH---------------EEEeecCCC-----CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEc-----CC Q lcl|NC_019705. 178 KE---------------IFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILST-----GE 232 (424) Q Consensus 178 ~e---------------iih~r~~~~-----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~-----~~ 232 (424) +. +.|+++... +...|+|.+..+...+......-.....-|+. +.+..++.. .. T Consensus 232 ~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~ 310 (518) T protein:vir:78 232 NDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKV 310 (518) T ss_pred ccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCC Confidence 00 233343221 23569999999999888777766666666765 445544421 10 Q ss_pred CCCCHHHHHHHHHHHHHHhCCcccCcce--ecCCCc----eeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC Q lcl|NC_019705. 233 KVLTEQQRSQVEENFKEIAGGPVKKRLW--ILEAGF----STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~ag~~~--~l~~g~----~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~ 306 (424) ........-.+. ...+....+ -.++|. .++.++....+.++.+..+...++|....|++|..++... T Consensus 311 ~~~~~~~~~~fd-------~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~ 383 (518) T protein:vir:78 311 NKSTDKEEWSMN-------VDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGN 383 (518) T ss_pred CCCCCccccccC-------CCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCccc Confidence 000000000000 001111000 112222 2666776777888999999999999999999999997643 Q ss_pred CCcccchhHH--H--------HHHHHHHHHHHHHHHHHHHHHHhhccCh-hh---hcccchhhhhhhhhccCHHHHHHHH Q lcl|NC_019705. 307 KSTSWGSGIE--Q--------QNLGFLQYTLQPYISRWENSIQRWLIPA-KD---VGRIHAEHNLDGLLRGDSASRAAFM 372 (424) Q Consensus 307 ~~~~~~~n~e--~--------~~~~~~~~tl~P~~~~ie~~l~~~l~~~-~~---~~~~~~~fd~~~l~~~d~~~~~~~~ 372 (424) + ..+..-+. . .....++.+|.-++..+...+.. +... .. .....+.+++++-+..|.+++++.. T Consensus 384 ~-~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~-~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~ 461 (518) T protein:vir:78 384 R-EVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTG-GTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTL 461 (518) T ss_pred c-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hcCccccccCCCceeEEEEeCCCCCCCHHHHHHHH Confidence 2 22211111 1 11233333333333333322221 1111 10 1123466777778889999999999 Q ss_pred HHHHhCCCCCHHHHHHHh--CCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 373 KAMGEAGLRTINEMRRTD--NLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 373 ~~~~~~g~~t~NE~R~~~--g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) .+++.+|+|++.++-+++ |+.. ++.++.+....-. ......++|.+-++ T Consensus 462 ~~~v~aGimS~e~~i~~~~~~~~d-eea~~e~~ri~~E--~~~~~~~~p~~~~g 512 (518) T protein:vir:78 462 NNMNSALAMSVEEKVKLIHPKWED-EEIQAEVKRIYLE--NAIGEVPDPEAIGG 512 (518) T ss_pred HHHHhcCCCCHHHHHHHhCCCCCH-HHHHHHHHHHHHH--hcccCCCCCccccC Confidence 999999999998855554 3221 1122111100000 00011111111111 No 187 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=97.45 E-value=6.5e-05 Score=43.53 Aligned_cols=398 Identities=9% Similarity=0.017 Sum_probs=172.9 Q ss_pred CC--CCcc---ccc-CCCCCchHHHHHhhccCcccC---Cccccchh-hccccc---cccCcccccHHHHhccHHHHHHH Q lcl|NC_019705. 1 ME--EPKY---TID-LRTNNGWWARLQSWFVGGRLV---TPNQGSQT-GPVSAH---GHLGDSSINDERILQISTVWRCV 67 (424) Q Consensus 1 ~~--~~~~---~~~-~~~~~G~~~~~~~~~~~~~~~---~~~~~~~~-~~~~~~---~~~~~~~vs~~~~~~~~~v~~~i 67 (424) -+ .|+. -|+ ....+=-+.+...+..+.... ........ ...... -...+.+ ..+ +.++....+| T Consensus 12 ~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~k--i~~n~~~~iv 88 (474) T protein:vir:10 12 AQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSV-NNK--LNNSFDSEIV 88 (474) T ss_pred ccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCc-ccc--cccchHHHHH Confidence 00 0000 000 011111122222222211000 00000000 000000 0000000 000 2245566678 Q ss_pred HHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEE Q lcl|NC_019705. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..++-+-+-|+.+--.. +....+ .....+.+++.. ..-......+...++.+|.||.++..+.+|.+ .+.. T Consensus 89 d~~~~yl~g~pv~~~~~~-~~~~~e-~~~~~l~~~~~~-----n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~~~~ 160 (474) T protein:vir:10 89 DTRVGYLHGVPVTYDLDE-NAEKNE-KLKKFITNFAIR-----NSVDDEDSEIGKMAAICGYGARLAYIDTNGDI-RIKN 160 (474) T ss_pred HhHhhheeccceeEeeCC-CCcchH-HHHHHHHHHHhh-----cCHhHHHHHHHHHHhhcCeEEEEEEeCCCCee-EEEE Confidence 888888878888753222 111111 111123333321 23356677888999999999999988888875 5677 Q ss_pred ecCceeEEeecCceE------EEEEEeCC--ce----EEecHhHEEEeecC------------C----------CCCccc Q lcl|NC_019705. 148 LQSANMDVKLVGKKV------VYRYQRDS--EY----AEFSQKEIFHLKGF------------G----------FTGLVG 193 (424) Q Consensus 148 l~~~~v~~~~~~~~~------~~~~~~~~--~~----~~~~~~eiih~r~~------------~----------~~~~~G 193 (424) ++|..+.+..++... +|...... .. ..+.+..+++++.. + .+...| T Consensus 161 i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g 240 (474) T protein:vir:10 161 IDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPNNKEM 240 (474) T ss_pred EcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEecCCCCC Confidence 788777665543210 11111000 00 01122222222110 0 022357 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccc Q lcl|NC_019705. 194 LSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGV 273 (424) Q Consensus 194 ~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~ 273 (424) .|-+......++....+.....+.+...+.|-.+++ +.. .+++....++ ..+-+.+.+++.+++-+.. T Consensus 241 ~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~-g~~-~~~~~~~~~~----------~~~~i~~~~~~~~~~~l~~ 308 (474) T protein:vir:10 241 IGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR-GMG-MSEEMIQETQ----------KSGAFELFDKDMDVKYLTK 308 (474) T ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc-cCC-CCchhhhhhh----------hcceeEecCCCCceeEEec Confidence 776666666666555554444444454555655553 222 2332222111 1233445566666666655 Q ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_019705. 274 TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISRWENSIQRWLI 343 (424) Q Consensus 274 ~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ie~~l~~~l~ 343 (424) ...+..+....+...+.|...-++|....+... ++.++....-. ....+...+.-.++.|...+..+-. T Consensus 309 ~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~ 387 (474) T protein:vir:10 309 DVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFN-GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGY 387 (474) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 544555677888889999999999864433222 23332222111 1123333344444444333333211 Q ss_pred ChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeee-----ecccccchhhcccc Q lcl|NC_019705. 344 PAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAM-----RQSQYVPITDLGTN 416 (424) Q Consensus 344 ~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--gd~~~-----~~~n~~~~~~~~~~ 416 (424) .........+.+.+..-+..|....++.+.++. |+++..-+.+++++-+-+. .++.- .............+ T Consensus 388 ~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~~~ 465 (474) T protein:vir:10 388 NLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK--GQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEGDAN 465 (474) T ss_pred CCCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCCCcC Confidence 111111122344445556788888999988874 8999988888887633111 00000 00011111111111 Q ss_pred CCCcccCC Q lcl|NC_019705. 417 KEPRNNGA 424 (424) Q Consensus 417 ~~~~~~ga 424 (424) +++.++-+ T Consensus 466 ~~~~~~~s 473 (474) T protein:vir:10 466 DKSQNNQS 473 (474) T ss_pred CCCccccC Confidence 11111111 No 188 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=97.45 E-value=6.5e-05 Score=43.53 Aligned_cols=398 Identities=9% Similarity=0.017 Sum_probs=172.9 Q ss_pred CC--CCcc---ccc-CCCCCchHHHHHhhccCcccC---Cccccchh-hccccc---cccCcccccHHHHhccHHHHHHH Q lcl|NC_019705. 1 ME--EPKY---TID-LRTNNGWWARLQSWFVGGRLV---TPNQGSQT-GPVSAH---GHLGDSSINDERILQISTVWRCV 67 (424) Q Consensus 1 ~~--~~~~---~~~-~~~~~G~~~~~~~~~~~~~~~---~~~~~~~~-~~~~~~---~~~~~~~vs~~~~~~~~~v~~~i 67 (424) -+ .|+. -|+ ....+=-+.+...+..+.... ........ ...... -...+.+ ..+ +.++....+| T Consensus 12 ~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~k--i~~n~~~~iv 88 (474) T protein:vir:94 12 AQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSV-NNK--LNNSFDSEIV 88 (474) T ss_pred ccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCc-ccc--cccchHHHHH Confidence 00 0000 000 011111122222222211000 00000000 000000 0000000 000 2245566678 Q ss_pred HHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEE Q lcl|NC_019705. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..++-+-+-|+.+--.. +....+ .....+.+++.. ..-......+...++.+|.||.++..+.+|.+ .+.. T Consensus 89 d~~~~yl~g~pv~~~~~~-~~~~~e-~~~~~l~~~~~~-----n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~~~~ 160 (474) T protein:vir:94 89 DTRVGYLHGVPVTYDLDE-NAEKNE-KLKKFITNFAIR-----NSVDDEDSEIGKMAAICGYGARLAYIDTNGDI-RIKN 160 (474) T ss_pred HhHhhheeccceeEeeCC-CCcchH-HHHHHHHHHHhh-----cCHhHHHHHHHHHHhhcCeEEEEEEeCCCCee-EEEE Confidence 888888878888753222 111111 111123333321 23356677888999999999999988888875 5677 Q ss_pred ecCceeEEeecCceE------EEEEEeCC--ce----EEecHhHEEEeecC------------C----------CCCccc Q lcl|NC_019705. 148 LQSANMDVKLVGKKV------VYRYQRDS--EY----AEFSQKEIFHLKGF------------G----------FTGLVG 193 (424) Q Consensus 148 l~~~~v~~~~~~~~~------~~~~~~~~--~~----~~~~~~eiih~r~~------------~----------~~~~~G 193 (424) ++|..+.+..++... +|...... .. ..+.+..+++++.. + .+...| T Consensus 161 i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g 240 (474) T protein:vir:94 161 IDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPNNKEM 240 (474) T ss_pred EcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEecCCCCC Confidence 788777665543210 11111000 00 01122222222110 0 022357 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccc Q lcl|NC_019705. 194 LSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGV 273 (424) Q Consensus 194 ~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~ 273 (424) .|-+......++....+.....+.+...+.|-.+++ +.. .+++....++ ..+-+.+.+++.+++-+.. T Consensus 241 ~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~-g~~-~~~~~~~~~~----------~~~~i~~~~~~~~~~~l~~ 308 (474) T protein:vir:94 241 IGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR-GMG-MSEEMIQETQ----------KSGAFELFDKDMDVKYLTK 308 (474) T ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc-cCC-CCchhhhhhh----------hcceeEecCCCCceeEEec Confidence 776666666666555554444444454555655553 222 2332222111 1233445566666666655 Q ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_019705. 274 TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISRWENSIQRWLI 343 (424) Q Consensus 274 ~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ie~~l~~~l~ 343 (424) ...+..+....+...+.|...-++|....+... ++.++....-. ....+...+.-.++.|...+..+-. T Consensus 309 ~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~ 387 (474) T protein:vir:94 309 DVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFN-GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGY 387 (474) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 544555677888889999999999864433222 23332222111 1123333344444444333333211 Q ss_pred ChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeee-----ecccccchhhcccc Q lcl|NC_019705. 344 PAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAM-----RQSQYVPITDLGTN 416 (424) Q Consensus 344 ~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--gd~~~-----~~~n~~~~~~~~~~ 416 (424) .........+.+.+..-+..|....++.+.++. |+++..-+.+++++-+-+. .++.- .............+ T Consensus 388 ~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~~~ 465 (474) T protein:vir:94 388 NLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK--GQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEGDAN 465 (474) T ss_pred CCCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCCCcC Confidence 111111122344445556788888999988874 8999988888887633111 00000 00011111111111 Q ss_pred CCCcccCC Q lcl|NC_019705. 417 KEPRNNGA 424 (424) Q Consensus 417 ~~~~~~ga 424 (424) +++.++-+ T Consensus 466 ~~~~~~~s 473 (474) T protein:vir:94 466 DKSQNNQS 473 (474) T ss_pred CCCccccC Confidence 11111111 No 189 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=97.45 E-value=6.5e-05 Score=43.52 Aligned_cols=396 Identities=10% Similarity=0.034 Sum_probs=173.6 Q ss_pred CCCCcccc--cCCCC----------------CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHH Q lcl|NC_019705. 1 MEEPKYTI--DLRTN----------------NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQIST 62 (424) Q Consensus 1 ~~~~~~~~--~~~~~----------------~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~ 62 (424) -....|.. +..+. +--++++..+..+............. .... . .=+.++. T Consensus 26 n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~--------~~~~-~--~ki~~n~ 94 (511) T protein:vir:93 26 NVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKE--------EYMA-D--NRVAHDY 94 (511) T ss_pred CCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcc--------cccC-c--ceeecch Confidence 00001100 01111 11122233333221110000000000 0000 0 0122455 Q ss_pred HHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce Q lcl|NC_019705. 63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV 142 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~ 142 (424) ...+|+..++-+-+-|+.+--. +.. ....+..++.. | ........+..+++.+|.||.++.++.+|.+ T Consensus 95 ~k~Iv~~~~~yl~g~p~~~~~~--d~~-----~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~ 162 (511) T protein:vir:93 95 ASYISDFINGYFLGNPIQYQDD--DKD-----VLEVIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET 162 (511) T ss_pred HHHHHHHHhhhhcccCeeeccC--ChH-----HHHHHHHHHhh--c---CHhHHHHHHHHHHHhcCeeEEEEEeCCCCce Confidence 6677888888787888775211 111 11233334332 2 2346667888999999999999999888876 Q ss_pred eEEEEecCceeEEeecCc---eEE--EEE-E-e--C-C--c----eEEecHhHEEEeecCC------------------- Q lcl|NC_019705. 143 ISLLPLQSANMDVKLVGK---KVV--YRY-Q-R--D-S--E----YAEFSQKEIFHLKGFG------------------- 187 (424) Q Consensus 143 ~~l~~l~~~~v~~~~~~~---~~~--~~~-~-~--~-~--~----~~~~~~~eiih~r~~~------------------- 187 (424) .+..++|..+.+..++. ... .+| . . . . . ...+.++.+.+++... T Consensus 163 -~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g 241 (511) T protein:vir:93 163 -RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFE 241 (511) T ss_pred -EEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCCC Confidence 47778998888776642 111 111 1 0 0 0 0 0123444454442111 Q ss_pred -------CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHH-HHHHHhCC-cccCc Q lcl|NC_019705. 188 -------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE-NFKEIAGG-PVKKR 258 (424) Q Consensus 188 -------~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~-~~~~~~~~-~~ag~ 258 (424) .+...|.|-++.+...++....+.....+.+...+.|-.++.-... .+++.....+. ..-..... .-.+. T Consensus 242 ~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (511) T protein:vir:93 242 RMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN-LDPVEVRKQKEANVLFLEPTVYADSE 320 (511) T ss_pred ccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcc-cCchhhcccccccceecccccccccc Confidence 0123577777777777776665555555555656666655543222 22222111111 00000000 00011 Q ss_pred ceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHH----------HHHHHHHHH Q lcl|NC_019705. 259 LWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTLQ 328 (424) Q Consensus 259 ~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~----------~~~~~~tl~ 328 (424) ..-.+++.+++-+........+....+...+.|+..-++|..-.+... ++.|+....-.. ...+...|. T Consensus 321 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~ 399 (511) T protein:vir:93 321 GRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLR 399 (511) T ss_pred cccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122344555555554444555567788889999999999964433222 233322221111 122233333 Q ss_pred HHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCee----- Q lcl|NC_019705. 329 PYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVA----- 401 (424) Q Consensus 329 P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--gd~~----- 401 (424) -.++.|...+..+.-.........+++.+..-+..|..+.++.+.++ .|+++..-+.++++.-+-|. .+.. T Consensus 400 ~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~ 477 (511) T protein:vir:93 400 RRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDEK 477 (511) T ss_pred HHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHH Confidence 33333333222211111111112234444556677888888888887 48899888888876533211 0000 Q ss_pred -eecccccchh---hcc---ccCCCcccCC Q lcl|NC_019705. 402 -MRQSQYVPIT---DLG---TNKEPRNNGA 424 (424) Q Consensus 402 -~~~~n~~~~~---~~~---~~~~~~~~ga 424 (424) .......... ... ++++..++.+ T Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (511) T protein:vir:93 478 ESIKKAQKGIYKDPRDINDDEQDDDTKDTV 507 (511) T ss_pred HHHHHHhhhcccCCCCCCCCCCCCcccccc Confidence 0000000000 000 0111111111 No 190 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=97.44 E-value=6.7e-05 Score=43.48 Aligned_cols=393 Identities=10% Similarity=0.046 Sum_probs=173.6 Q ss_pred CCCCcccc---------------cCCC-----CCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhcc Q lcl|NC_019705. 1 MEEPKYTI---------------DLRT-----NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQI 60 (424) Q Consensus 1 ~~~~~~~~---------------~~~~-----~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~ 60 (424) -..-.|.| +|.+ ++--++++..+..+............ ....+ ..=+.. T Consensus 24 ~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~--------~~~~~---~~ki~~ 92 (511) T protein:vir:96 24 EANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRK--------EEYMA---DNRVAH 92 (511) T ss_pred hhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCc--------ccccC---cceeec Confidence 00000000 0100 01122333333332111000000000 00000 001223 Q ss_pred HHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019705. 61 STVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G 140 (424) .....+++..+.-+-+-|+.+.- .++. ....+..++.. | ........+..+++.+|.||.++.++.+| T Consensus 93 n~~k~Iv~~~~~yl~g~p~~~~~--~~~~-----~~~~l~~~~~~--n---~~~~~~~~~~~~~~i~G~a~~~vy~ded~ 160 (511) T protein:vir:96 93 DYASYISDFINGYFLGNPIQYQD--DDKD-----VLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (511) T ss_pred chHHHHHHHHHhhhccCCceeec--CchH-----HHHHHHHHHhh--c---CHHHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 55666788888888788887631 1111 11234444432 2 23456677888999999999999998888 Q ss_pred ceeEEEEecCceeEEeecCce---EE--EE-EEe---C-C--ce----EEecHhHEEEeecCC----------------- Q lcl|NC_019705. 141 DVISLLPLQSANMDVKLVGKK---VV--YR-YQR---D-S--EY----AEFSQKEIFHLKGFG----------------- 187 (424) Q Consensus 141 ~~~~l~~l~~~~v~~~~~~~~---~~--~~-~~~---~-~--~~----~~~~~~eiih~r~~~----------------- 187 (424) .+ .+..++|..+.+..++.. .. ++ |.. . . .. ..+.++.+.+++... T Consensus 161 ~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 239 (511) T protein:vir:96 161 ET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHS 239 (511) T ss_pred ce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccc Confidence 65 567788888887765421 11 11 110 0 0 00 123444444432100 Q ss_pred ---------CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHH-HHHHHhCC-ccc Q lcl|NC_019705. 188 ---------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE-NFKEIAGG-PVK 256 (424) Q Consensus 188 ---------~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~-~~~~~~~~-~~a 256 (424) .+...|.|-++.+...++....+.....+.+...+.|-.+++-... .+.+.....+. ..-..... .-. T Consensus 240 ~~~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 318 (511) T protein:vir:96 240 FERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN-LDPVEVRKQKEANVLFLEPTVYAD 318 (511) T ss_pred CCceeeEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCcc-CCchhhcccccccceecccccccc Confidence 0123577777777777776666555555556666666665553222 22222111111 10000000 001 Q ss_pred CcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHH----------HHHHHH Q lcl|NC_019705. 257 KRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL----------GFLQYT 326 (424) Q Consensus 257 g~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~----------~~~~~t 326 (424) +.....+++.++.-+........+....+...+.|...-++|..-.+... ++.++....-... ..+... T Consensus 319 ~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~ 397 (511) T protein:vir:96 319 SEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKG 397 (511) T ss_pred cccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11122334555555554444555677888889999999999964433222 2333222211111 222223 Q ss_pred HHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCee--- Q lcl|NC_019705. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVA--- 401 (424) Q Consensus 327 l~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--gd~~--- 401 (424) +.-.++.|...+..+.-.........+++.+..-+..|..+.++.+.++ .|+++.-.+.+++++-.-|. -+.. T Consensus 398 l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~G~iS~et~l~~l~~v~D~~~E~~ri~~E 475 (511) T protein:vir:96 398 LRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEED 475 (511) T ss_pred HHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHH Confidence 3333333333222211111111112234444555667888888888887 58999988888887633211 0000 Q ss_pred ---eecccccchhhccccCCCcccCC Q lcl|NC_019705. 402 ---MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ---~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ....... ......++.+++. T Consensus 476 ~~~~~~~~~~---~~~~~~~~~~~~~ 498 (511) T protein:vir:96 476 EKESIKKAQK---GIYKDPRDINDDE 498 (511) T ss_pred HHHHHHHHhh---ccccCCCCCCCCC Confidence 0000000 0000111111111 No 191 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=97.36 E-value=8.6e-05 Score=42.88 Aligned_cols=387 Identities=8% Similarity=-0.014 Sum_probs=168.6 Q ss_pred CCCCcccccCCCCC----chHHHHHhhccCcccCCccccchhhcccccc---------ccCc--ccccHHHHhccHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNN----GWWARLQSWFVGGRLVTPNQGSQTGPVSAHG---------HLGD--SSINDERILQISTVWR 65 (424) Q Consensus 1 ~~~~~~~~~~~~~~----G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~--~~vs~~~~~~~~~v~~ 65 (424) -+.-.++....++. -++.++...+... .+.......-+.+.. ...+ ...-...=+.++.... T Consensus 20 ~~~~~~~~~~~~~~e~~~~~i~~~i~~~~~~---~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ 96 (483) T protein:vir:12 20 TEIFDAIVRTNNKPETLEEMIVRYIKQHLEK---LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHAN 96 (483) T ss_pred hhhhhcccccCCchhhHHHHHHHHHHHHHHH---HHHHHHHHHHhccccccccccccccccccccccccccccccchHHH Confidence 11111112122221 1222222111110 000000000000000 0000 0000001123566677 Q ss_pred HHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEE Q lcl|NC_019705. 66 CVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISL 145 (424) Q Consensus 66 ~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l 145 (424) +|+..++-+-+-|+.+-- .+.. ....+..++. |. -......+..+++.+|.||+.+..+.+|.+ .+ T Consensus 97 Ivd~~~~~l~G~p~~~~~--~d~~-----~~~~l~~~~~---n~---~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~-~i 162 (483) T protein:vir:12 97 LVDQKVSYIVGKPIAFKH--TDDE-----VVKRIDEVLG---NR---FDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KL 162 (483) T ss_pred HHHHHhhhhcccCceecc--CChH-----HHHHHHHHHh---cc---HHHHHHHHHHHHhhCCeEEEEEEEcCCCce-EE Confidence 888888888777877521 1111 1122333332 22 234456678889999999999999888876 47 Q ss_pred EEecCceeEEeecCce---E---EEEEEeCC--ceEEecHhHEEEeec----------------------CC-------- Q lcl|NC_019705. 146 LPLQSANMDVKLVGKK---V---VYRYQRDS--EYAEFSQKEIFHLKG----------------------FG-------- 187 (424) Q Consensus 146 ~~l~~~~v~~~~~~~~---~---~~~~~~~~--~~~~~~~~eiih~r~----------------------~~-------- 187 (424) ..++|..+.+..++.. . .+.|.... ....+.+..+.++.. +. T Consensus 163 ~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 242 (483) T protein:vir:12 163 FRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIP 242 (483) T ss_pred EEEcccceEEEEcCCCCCceEEEEEEEEeecceEEEEEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEE Confidence 7789988887765321 1 11111111 111222233322210 00 Q ss_pred -CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCc Q lcl|NC_019705. 188 -FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGF 266 (424) Q Consensus 188 -~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~ 266 (424) .+...|.|-+......++....+.....+.+...+.|..+++-.......+ ..... ...+++.++++. T Consensus 243 ~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~----~~~~~-------~~~~~~~~~~~~ 311 (483) T protein:vir:12 243 FKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPE----FKRLL-------RYYGAIKVSDNG 311 (483) T ss_pred ecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchh----HHHhh-------hhccccccCCCC Confidence 012357777776666666555555555555555566766655322111111 11111 122355555555 Q ss_pred eeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHH----------HHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ----------QNLGFLQYTLQPYISRWEN 336 (424) Q Consensus 267 ~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~----------~~~~~~~~tl~P~~~~ie~ 336 (424) +..-+..+..+..+....+...+.|+..-++|..-.+... ++.++...+. .....+...++-.++.+.. T Consensus 312 ~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~ 390 (483) T protein:vir:12 312 GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE 390 (483) T ss_pred cceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccc-cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5555554445555677788888899999999853332211 2222211111 1112222233333333322 Q ss_pred HHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeee-----ecccccc Q lcl|NC_019705. 337 SIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAM-----RQSQYVP 409 (424) Q Consensus 337 ~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~--ggd~~~-----~~~n~~~ 409 (424) .+.. +.+. ..+.+.+..-+..|..+.++.+.++ .|+++..-+.++++.-.-+ ..+..- ....... T Consensus 391 ~~~~----~~~~--~~i~v~f~~~~p~~~~~~a~~~~kl--~GiiS~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~ 462 (483) T protein:vir:12 391 HFDI----KGEH--KDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPN 462 (483) T ss_pred HhcC----CCcc--ceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccc Confidence 2221 1111 2233334555667888888888887 4899998888888763211 000000 0000111 Q ss_pred hhhccccC---CCcccCC Q lcl|NC_019705. 410 ITDLGTNK---EPRNNGA 424 (424) Q Consensus 410 ~~~~~~~~---~~~~~ga 424 (424) +...+... ..+.+-+ T Consensus 463 ~~~~~~d~~~~~~~~~~~ 480 (483) T protein:vir:12 463 LDDGGADGAQQQERSNNK 480 (483) T ss_pred ccccccCCcccCCCCCcc Confidence 11111100 0000111 No 192 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=97.35 E-value=8.9e-05 Score=42.80 Aligned_cols=384 Identities=7% Similarity=-0.014 Sum_probs=168.9 Q ss_pred CCCCcccccCCCCC----chHHHH--------------HhhccCcccCCccccchhhccccccccCcccccHHHHhccHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNN----GWWARL--------------QSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQIST 62 (424) Q Consensus 1 ~~~~~~~~~~~~~~----G~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~ 62 (424) -++-+=...+..+. -++.++ ..+..+.-...... ............-...=+.++. T Consensus 29 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~------~~~~~~~~~~~~~~~~ri~~n~ 102 (492) T protein:vir:94 29 TEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEP------KPVDATGAVDPLKPDDRMITNF 102 (492) T ss_pred hhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc------ccccccccccccccccccccch Confidence 00000001111111 122222 22222110000000 0000000000000001123566 Q ss_pred HHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce Q lcl|NC_019705. 63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV 142 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~ 142 (424) ...+|+..++-+-+-|+.+.- .+.. ....+..++. |. -......+..+++.+|.||+++..+.+|.+ T Consensus 103 ~k~Ivd~~~~yl~G~p~~~~~--~d~~-----~~~~l~~~~~---n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~ 169 (492) T protein:vir:94 103 HANLVDQKVSYIVGKPIAFKH--TDDE-----VVKRIDEVLG---NR---FDDKLHSVLTGASNKGIEWLHPYLDEEGEF 169 (492) T ss_pred HHHHHHHHHhhhcccCceecc--CchH-----HHHHHHHHHh---cc---HHHHHHHHHHHHhhCCeEEEEEEecCCCce Confidence 777888888888788876521 1111 1122333332 22 235566788999999999999988888876 Q ss_pred eEEEEecCceeEEeecCce---EE--E-EEEeCC--ceEEecHhHEEEeec----------------------CCC---- Q lcl|NC_019705. 143 ISLLPLQSANMDVKLVGKK---VV--Y-RYQRDS--EYAEFSQKEIFHLKG----------------------FGF---- 188 (424) Q Consensus 143 ~~l~~l~~~~v~~~~~~~~---~~--~-~~~~~~--~~~~~~~~eiih~r~----------------------~~~---- 188 (424) .+..++|..+.+..++.. .. . .|.... ....+.+..|.++.. .++ T Consensus 170 -~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 248 (492) T protein:vir:94 170 -KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP 248 (492) T ss_pred -EEEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeeccccccccccccccccCCCccc Confidence 467788988887765321 11 1 111111 111122222222210 000 Q ss_pred -----CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC Q lcl|NC_019705. 189 -----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE 263 (424) Q Consensus 189 -----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~ 263 (424) +...|.|-+......++....+.....+.+...+.|..+++--......+ ..... ...+++.++ T Consensus 249 vv~~~nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~----~~~~~-------~~~~~~~~~ 317 (492) T protein:vir:94 249 FIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPE----FKRLL-------RYYGAIKVS 317 (492) T ss_pred eEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchh----hHHHH-------hhccceecC Confidence 12357777777777777666666555666666666766665322211111 11111 123455566 Q ss_pred CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHH----------HHHHHHHHHHHHHH Q lcl|NC_019705. 264 AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTLQPYISR 333 (424) Q Consensus 264 ~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~----------~~~~~~tl~P~~~~ 333 (424) .+.+...+........+....+...+.|+..-++|..-.+.. +++.++...+-.. ...+...+.-.++. T Consensus 318 ~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l 396 (492) T protein:vir:94 318 DNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKF-GSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWF 396 (492) T ss_pred CCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCcccc-ccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 555555444433444456677788888888888884322211 1222221111111 11122222222222 Q ss_pred HHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCee-----eeccc Q lcl|NC_019705. 334 WENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVA-----MRQSQ 406 (424) Q Consensus 334 ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~--ggd~~-----~~~~n 406 (424) +...+.. ..+. ..+.+.+..-+..|..+.++.+.++. |+++..-+.++++.-+-+ ..++. -.... T Consensus 397 i~~~~~~----~~~~--~~i~v~f~~~~p~~~~e~~~~~~kl~--giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~ 468 (492) T protein:vir:94 397 VFEHFDI----KGEH--KDVDISFNYNKVANTELQVQTAQQSM--GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQ 468 (492) T ss_pred HHHHhcC----Cccc--ceeeEEecCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhh Confidence 2222211 1111 22333345556678888888888874 889988888888763321 11110 00011 Q ss_pred ccchhhccccCCCcccCC Q lcl|NC_019705. 407 YVPITDLGTNKEPRNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~~~~~~~ga 424 (424) ...........++.++++ T Consensus 469 ~~~~~~~~~~~~~~~~~~ 486 (492) T protein:vir:94 469 LPNLDDGGADSAQQQERS 486 (492) T ss_pred ccccccccCCCCccccCC Confidence 111111111112222222 No 193 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=97.33 E-value=9.2e-05 Score=42.71 Aligned_cols=391 Identities=7% Similarity=-0.042 Sum_probs=176.7 Q ss_pred CCCCcccccCCCCCchH-HHHHhhccCcccCCccccchhhcccccc--------cc-Cccccc----HHHHhccHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWW-ARLQSWFVGGRLVTPNQGSQTGPVSAHG--------HL-GDSSIN----DERILQISTVWRC 66 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~-~~~~vs----~~~~~~~~~v~~~ 66 (424) |+.--+.+.+..+...+ ..+..-+..... .+........+.+.. .. .+.... ...=+.++....+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~-~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~I 85 (479) T protein:vir:79 7 SETDLIKVQLKKESTINLVKVIEHYILKHR-PEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLL 85 (479) T ss_pred cccceEeeccccCChhHHHHHHHHHHhhhh-HHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHHH Confidence 54444555555555332 111111111100 000000000000000 00 000000 0001235556678 Q ss_pred HHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEE Q lcl|NC_019705. 67 VSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLL 146 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~ 146 (424) |+..++-+.+-|+.+-- .+.. . ..+...+.. | .-......++...+.+|.+|..+..+.+|.+. +. T Consensus 86 vd~~~~~l~g~p~~~~~--~~~~-----~-~~~~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~ 151 (479) T protein:vir:79 86 VDQKVGYSVGNPIVFNA--DDDN-----L-TKLLNDLLG--E---EFDDTITELYLNASNKGVEWLHPYINRKGEFK-YV 151 (479) T ss_pred HHHHHhhhhcCCceecc--CCHH-----H-HHHHHHHHh--c---CHHHHHHHHHHHHHhcCeEEEEEEeCCCCceE-EE Confidence 88888888788876521 1111 1 122233332 3 23455677788999999999999888888764 67 Q ss_pred EecCceeEEeecCce---E-----EEEEE-eCCce----EEecHhHEEEeecCC-------------------------- Q lcl|NC_019705. 147 PLQSANMDVKLVGKK---V-----VYRYQ-RDSEY----AEFSQKEIFHLKGFG-------------------------- 187 (424) Q Consensus 147 ~l~~~~v~~~~~~~~---~-----~~~~~-~~~~~----~~~~~~eiih~r~~~-------------------------- 187 (424) .++|..+.+..++.. . +|... ..+.. ..+.++.+.+++.-. T Consensus 152 ~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (479) T protein:vir:79 152 IIPAEEAIPIWDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRIN 231 (479) T ss_pred EEccceeEEEEeCCCCCceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCccccccccccccccccccccccccc Confidence 788888877665321 1 11111 11111 112333333332100 Q ss_pred --------------CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCC Q lcl|NC_019705. 188 --------------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGG 253 (424) Q Consensus 188 --------------~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~ 253 (424) .+...|.|-+..+...++....+.....+.+...+.|-.+++--.....++... . T Consensus 232 ~~~~~~~~vPvv~~~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~-----------~ 300 (479) T protein:vir:79 232 NKEQGWGKVPFIPFKNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFID-----------N 300 (479) T ss_pred ccccCCCcccEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchh-----------h Confidence 022357777777777776666555555555666666766655322211121111 1 Q ss_pred cccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHH----------HHHHHH Q lcl|NC_019705. 254 PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ----------QNLGFL 323 (424) Q Consensus 254 ~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~----------~~~~~~ 323 (424) ...++++.++++.+++-+..+..+..+....+...+.|+..-++|..-.+. .++.++..... .....+ T Consensus 301 ~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~gn~Sg~Ai~~~~~~l~~k~~~~~~~~ 378 (479) T protein:vir:79 301 IRYYKSIKVDGGGGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQN--TGDKSGVALKFLYSLLDLKCSKTEKKF 378 (479) T ss_pred hhhccceecCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccccccc--ccchhHHHHHHHHHHHHHHHHHHHHHH Confidence 123456667766665555544445556677888888899988988643322 12322222111 112222 Q ss_pred HHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeee Q lcl|NC_019705. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMR 403 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~ 403 (424) ...++-.++.+...+...-.. ......+.+.+..-+..|.++.++.+.++ .|+++...+.++++.-.-+..+.-.+ T Consensus 379 ~~~l~~~~~li~~~~~~~~~~--~~~~~~i~i~f~~~~p~~~~~~a~~~~kl--~g~iS~et~l~~l~~v~d~~~E~~ri 454 (479) T protein:vir:79 379 KKAIRELLWFVCEYLKISGNK--SYDYKTVQITFNHSMIINEAEKIDMAAKS--TGIVSDETIVSNHPWVEDVNDELERL 454 (479) T ss_pred HHHHHHHHHHHHHHHhccCCC--ccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHH Confidence 333333333333332221111 11112334444455567888888888887 48999888888876522111000000 Q ss_pred cccc----cchhhcc-ccCCCcccC Q lcl|NC_019705. 404 QSQY----VPITDLG-TNKEPRNNG 423 (424) Q Consensus 404 ~~n~----~~~~~~~-~~~~~~~~g 423 (424) ...- ...+... ..++..++. T Consensus 455 ~~E~~~~~~~~~~~~~~~~~~~~e~ 479 (479) T protein:vir:79 455 KKQEDTQKEYDDLIPNNQDGVIDET 479 (479) T ss_pred HHHHHHHHHHHhccCcccCCCcCcC Confidence 0000 0000000 011111111 No 194 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=97.28 E-value=0.00011 Score=42.37 Aligned_cols=393 Identities=9% Similarity=0.033 Sum_probs=173.4 Q ss_pred CCCCccccc--------CCCC----------------CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHH Q lcl|NC_019705. 1 MEEPKYTID--------LRTN----------------NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDER 56 (424) Q Consensus 1 ~~~~~~~~~--------~~~~----------------~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~ 56 (424) -=.+++.+. ..+. +--++++..+..+............. ...+ .. T Consensus 20 ~~~~~~n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~--------~~~~---~~ 88 (511) T protein:vir:10 20 LFNDEANVVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKE--------EYMA---DN 88 (511) T ss_pred hhhhhhcCCccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccc--------cccC---cc Confidence 000011000 0000 11222333333322110000000000 0000 00 Q ss_pred HhccHHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_019705. 57 ILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) Q Consensus 57 ~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r 136 (424) =+.++....+|+..+.-+-+-|+.+--. ++. ....+..++.. | ........+..+++.+|.||.++.+ T Consensus 89 ki~~n~~k~Iv~~~~~yl~g~p~~~~~~--d~~-----~~~~l~~~~~~--n---~~~~~~~~~~~~~~i~G~ay~~vy~ 156 (511) T protein:vir:10 89 RVAHDYASYISDFINGYFLGNPIQYQDD--DKD-----VLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYEIMIR 156 (511) T ss_pred eeecchHHHHHHHHhhhhcccCceeecC--chH-----HHHHHHHHHhh--c---CHHHHHHHHHHHHHhcCeeEEEEEe Confidence 1224556667888888777888775321 111 11234444432 2 2335667788899999999999999 Q ss_pred CCCCceeEEEEecCceeEEeecCce---EE--EE-EEe---C-C--c-e---EEecHhHEEEeecCC------------- Q lcl|NC_019705. 137 NSAGDVISLLPLQSANMDVKLVGKK---VV--YR-YQR---D-S--E-Y---AEFSQKEIFHLKGFG------------- 187 (424) Q Consensus 137 ~~~G~~~~l~~l~~~~v~~~~~~~~---~~--~~-~~~---~-~--~-~---~~~~~~eiih~r~~~------------- 187 (424) +.+|.+ .+..++|..+.+..++.. .. .+ |.. . . . . ..+.++.+.++.... T Consensus 157 dedg~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~ 235 (511) T protein:vir:10 157 NQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGF 235 (511) T ss_pred CCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCccccccccccc Confidence 888875 567788888887765421 11 11 110 0 0 0 0 123444444432100 Q ss_pred -------------CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHh-CC Q lcl|NC_019705. 188 -------------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIA-GG 253 (424) Q Consensus 188 -------------~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~-~~ 253 (424) .+...|.|-++.+...++....+.....+.+...+.|-.++.-... .+++.....++...-.. .. T Consensus 236 ~~~~~~~vPvv~f~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~-~~~~~~~~~~~~~~~~~~~~ 314 (511) T protein:vir:10 236 ESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN-LDPVEVRKQKEANVLFLEPT 314 (511) T ss_pred ccccCcceeEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecccc-CCchhhccchhccceecccc Confidence 0123577777777777776655555555555666666665543222 22222211111100000 00 Q ss_pred c-ccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHH----------HHHHH Q lcl|NC_019705. 254 P-VKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ----------QNLGF 322 (424) Q Consensus 254 ~-~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~----------~~~~~ 322 (424) . -.+.....++|.+++-+.....+..+....+...+.|+..-++|..-.+... ++.++..... ..... T Consensus 315 ~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~ 393 (511) T protein:vir:10 315 VYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGL 393 (511) T ss_pred cccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHH Confidence 0 0111222344556665655545555667888888999999999864332221 2323222211 11222 Q ss_pred HHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCe Q lcl|NC_019705. 323 LQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDV 400 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--gd~ 400 (424) +...|.-.++.|...+....-.........+++.+..-+..|..+.++.+.++. |+++.--+.+++++-+-|. -+. T Consensus 394 f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~--G~iS~et~~~~l~~v~d~~~E~~r 471 (511) T protein:vir:10 394 FTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKK 471 (511) T ss_pred HHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHHh--ccCcHHHHHHhCCCCCCHHHHHHH Confidence 223333333333332222111111111123455556667788888999988884 7899877888876532110 000 Q ss_pred e------eecccccchhhccccCCCcccCC Q lcl|NC_019705. 401 A------MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 401 ~------~~~~n~~~~~~~~~~~~~~~~ga 424 (424) . ....... .....+++.+++. T Consensus 472 i~~E~~~~~~~~~~---~~~~~~~~~~~~~ 498 (511) T protein:vir:10 472 IEEDEKESIKKAQK---GIYKDPRDINDDE 498 (511) T ss_pred HHHHHHHHHHHHhh---hcccCCCCCCCCC Confidence 0 0000000 0011111111111 No 195 >protein:vir:97376 Length: 320 # NCBI annotation: putative portal protein # Family: family:all:11744 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762589;genbank:gi:115304290;genbank:GeneID:5130579 Probab=97.27 E-value=6.1e-06 Score=49.16 Aligned_cols=307 Identities=13% Similarity=0.140 Sum_probs=150.2 Q ss_pred CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCcccee Q lcl|NC_019705. 14 NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~ 93 (424) +|+|+ |.+++..+|+-...- + ..+.+..+..+.-..-|...+-+|..||.- +..|+... T Consensus 1 ~~~~~-----~~~~~~~~~~~~~~~--~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~------- 59 (320) T protein:vir:97 1 MGIFN-----FKKRETLTPELKESI--I------RQVTIEDESPFTGTTDFNVRNEVAESIATY-LGAYKTSA------- 59 (320) T ss_pred CCccc-----cccccccChhHHhhh--h------heeeeccCCCcccccccchhhHHHHHHHHH-hhhhcccc------- Confidence 66665 334443333221110 0 000000111111111222334444454432 11222221 Q ss_pred ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEeecCceEEEEEEeCCc-- Q lcl|NC_019705. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSE-- 171 (424) Q Consensus 94 ~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~-- 171 (424) + ...||.. || .|++.++.|.+..-..|++.... -|.. + .+.+++.-- .....++-... T Consensus 60 ---~-~~~~~~~--~~-----~~~~~~~~~~~~~~~~~~~~~~~-~~~~-----~-~~~~~~~~~--~~~~~~~~~D~FN 119 (320) T protein:vir:97 60 ---K-RLSLLTN--NP-----SFLRRLVKHALHNKTTYVYKSPT-YGWL-----I-TDSMTIEGL--RARLTFTLPDPFN 119 (320) T ss_pred ---c-eeeeeeC--CH-----HHHHHHHHHhhcccceEEeeCCc-ccee-----e-ecceeeeee--eeeEEEecCcccc Confidence 1 1223432 22 79999999999999999986432 2322 1 122222111 01111111000 Q ss_pred ---eEEecHhHEEEeecCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHH Q lcl|NC_019705. 172 ---YAEFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK 248 (424) Q Consensus 172 ---~~~~~~~eiih~r~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~ 248 (424) +..++-.|+-.+ .++++|..|-++... ...+....-+-+.|.+....+++++.+..-++..++.+..++ T Consensus 120 ~~V~mtvpfyD~~IL----dnpl~gv~tqe~gkM----~g~a~~~v~kkL~~~~~IKafi~Tdid~GLee~kD~~~~kIk 191 (320) T protein:vir:97 120 SAVTMTVPFYDVGII----DSPLVEVDTEEANKM----LEAAYSAVMKKLHNTGAIKAFISSDIDVGLEKMKEESDSKIK 191 (320) T ss_pred eeEEEEeeeechhhh----hhhhcccChHHhhHH----HHHHhhhhhhhccccceeEEEEecccchhHHHHHHHHHHHHH Confidence 111111111111 246788877644332 222333345556788888999988866544666666666665 Q ss_pred HHhCCcc-cCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHH Q lcl|NC_019705. 249 EIAGGPV-KKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTL 327 (424) Q Consensus 249 ~~~~~~~-ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl 327 (424) ++..-.+ -.++-+++.|.+++++.....-.. ..-....++..+.-|+||..+|-++ ..+.+..+|+...+ T Consensus 192 ~mq~~A~~~nG~T~i~~~dDI~Qi~pDYS~sn-~~D~~l~~t~alS~y~m~~~IL~Gs--------Ate~~~Iaf~~~~V 262 (320) T protein:vir:97 192 AMLATAELLSGYTYIQRGDDVTQMMPDYTTSN-VTDFAAMRTFAASQLSVSDKILDGS--------ATDGEKVAVMFRFV 262 (320) T ss_pred HHHHHHHHhcCcccccCCcceeeecccccccc-hhHHHHHHHHHHhhcCCchhhcccc--------CCcceeeehhhHhH Confidence 5443332 346889999999999987654443 3345566778889999999998543 23677889999999 Q ss_pred HHHHHHH---HHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC----CCCe Q lcl|NC_019705. 328 QPYISRW---ENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP----GGDV 400 (424) Q Consensus 328 ~P~~~~i---e~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~----ggd~ 400 (424) .|+++++ |.+|..++-. +|.+ ++... +|-+..|.+- -.|.+.-| |||+ T Consensus 263 ~PLL~Q~~~~Ek~Lvy~m~~---------E~FV-s~mtT--------------GG~l~S~~~~-~~~~~~~~~~~~~~~~ 317 (320) T protein:vir:97 263 EPILEQFREYEPSLIYAMRD---------EFFV-SFMTT--------------GGMLNSNRVD-GWGKEKAPNESKGGDV 317 (320) T ss_pred HHHHHHhhhcCcceeeeecc---------ceee-eeeec--------------Cceeeccccc-ccccccCCccccCCcc Confidence 9999997 5555544411 1211 11100 3444444321 12222221 3332 Q ss_pred eee Q lcl|NC_019705. 401 AMR 403 (424) Q Consensus 401 ~~~ 403 (424) --+ T Consensus 318 ~~~ 320 (320) T protein:vir:97 318 GDV 320 (320) T ss_pred cCC Confidence 221 No 196 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=97.25 E-value=0.00012 Score=42.15 Aligned_cols=394 Identities=9% Similarity=0.039 Sum_probs=172.9 Q ss_pred CCCCcccccCCCCCch----HHHHHhhccCcccCCccccchhhcccccc-ccCccc--cc-HHHHhccHHHHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGW----WARLQSWFVGGRLVTPNQGSQTGPVSAHG-HLGDSS--IN-DERILQISTVWRCVSLIST 72 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~--vs-~~~~~~~~~v~~~i~~ia~ 72 (424) --|+.-+|.+-++..+ +.++....... .+........+.+.. .+.... -. ...=+..+....+|+..++ T Consensus 2 ~~~~~~~~~~p~d~~~~~~~l~~~i~~~~~~---~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~ 78 (453) T protein:vir:39 2 KYKPPKLMTFPKDEPITNEVVTKFMEKHRLE---VARYEYLKNMYRGIMAIDAEPTKDLWKPDNRLTVNFTKYIVDTFTG 78 (453) T ss_pred eecCCcceEcCCCCCCCHHHHHHHHHHHHHH---HHHHHHHHHHhhccCchhcCCCccccCccceeecchHHHHHHHHhh Confidence 2345556777776653 33333322211 000000000000000 000000 00 0001234567778888888 Q ss_pred hhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCce Q lcl|NC_019705. 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSAN 152 (424) Q Consensus 73 ~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~ 152 (424) -+-+-|+.+--.+ . .....+.+++.. |. .......+..+.+.+|.||+.+..+.+|.+ .+-.++|.. T Consensus 79 ~l~g~~~~~~~~d--~-----~~~~~l~~i~~~--N~---~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~ 145 (453) T protein:vir:39 79 YFNGIPVKKSHSD--K-----ETLSKLQEFDNL--ND---MEDEESELAKMACIYGRAFELLYQNEETQT-NVIYNTPEN 145 (453) T ss_pred hhcccCceeccCC--h-----HHHHHHHHHHHh--cC---hhHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccc Confidence 8777777652111 1 111235555543 32 235677888999999999999999888876 456678888 Q ss_pred eEEeecCce---E--EEEEEeCCce----EEecHhHEEEeecCC---------------------CCCcccCchHHHHHH Q lcl|NC_019705. 153 MDVKLVGKK---V--VYRYQRDSEY----AEFSQKEIFHLKGFG---------------------FTGLVGLSPIAFACK 202 (424) Q Consensus 153 v~~~~~~~~---~--~~~~~~~~~~----~~~~~~eiih~r~~~---------------------~~~~~G~s~i~~~~~ 202 (424) +.+..++.. . ..++...... ..+.++.+.++...+ .+...|.|-+..+.. T Consensus 146 ~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~yt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~ 225 (453) T protein:vir:39 146 MFMVYDDTIKQEPLFAVRYGYDDDYKLYGEVYTKETTYALNGTMGFYNMTEQAPNPFDDLPVVEFYFNEERMSIFESVIS 225 (453) T ss_pred eEEEecCCCCCeEEEEEEEEEeCCeEEEEEEEeCCeEEEEEecCCceeeecccccCCCceeEEEecCCCCCCcchhhhHH Confidence 877765421 1 1112111111 112233333222100 012357777766666 Q ss_pred HHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHH Q lcl|NC_019705. 203 SAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMA 282 (424) Q Consensus 203 ~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e 282 (424) .++....+.....+.+...+.|..++.- .+ .+++....++.. ......+ ....+.+.++..+........+.+ T Consensus 226 liDa~~~~~s~~~~~~~~~~~p~~~~~g-~~-~~~~~~~~~~~~--~~~~~~~---~~~~~~~~~~~~lt~~~~~~~~~~ 298 (453) T protein:vir:39 226 LVNAFNKAISEKANDVDYFSDQYLTFLG-AA-VEEEDLKNIRSN--RVINYYG---ESSEAKNVDVKFLEKPDSDSQTEN 298 (453) T ss_pred HHHHHHHHHHHHHHHHHHhhCceeeeec-CC-CCchhhhhhhhc--ceeeecC---CCCCCCCCceeEEeecCCHHHHHH Confidence 6655555444444445555566666542 22 233332222211 0110000 011122333333433333445566 Q ss_pred HHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHH----------HHHHHHHHHHHHHHHHHHHhhccChhhhcccc Q lcl|NC_019705. 283 SRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL----------GFLQYTLQPYISRWENSIQRWLIPAKDVGRIH 352 (424) Q Consensus 283 ~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~----------~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~ 352 (424) ..+...+.|+..-++|..-.+.. ++.++...+.... ..+...++..++.+...++..- ...+ ... T Consensus 299 ~~~~l~~~I~~~s~~p~~~~~~~--gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~-~~~~--~~~ 373 (453) T protein:vir:39 299 LLDRLTKLIFQTTMVANISDESF--GSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVS-NKEA--WKD 373 (453) T ss_pred HHHHHHHHHHHHhCCcccccccc--cCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-Cccc--ccc Confidence 77888888888888884222111 2222222221111 2223333333333322222110 0111 122 Q ss_pred hhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchhhc---------cccCCCcccC Q lcl|NC_019705. 353 AEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDL---------GTNKEPRNNG 423 (424) Q Consensus 353 ~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~---------~~~~~~~~~g 423 (424) +.+.+..-+..|..+.++.+.++ .|+++..-+.++++.-+-+..+.-.+...-...... +.+.+..+++ T Consensus 374 i~v~f~~~~p~~~~~~a~~~~kl--~g~is~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (453) T protein:vir:39 374 IEYTFTRNEPKDIKEQAETANIL--MGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVPETN 451 (453) T ss_pred ceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCcC Confidence 33444556667888888888887 578999888888876332111000000000000000 0000000001 Q ss_pred C Q lcl|NC_019705. 424 A 424 (424) Q Consensus 424 a 424 (424) - T Consensus 452 ~ 452 (453) T protein:vir:39 452 E 452 (453) T ss_pred C Confidence 1 No 197 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=97.17 E-value=0.00014 Score=41.65 Aligned_cols=395 Identities=10% Similarity=0.042 Sum_probs=168.3 Q ss_pred CCCCcccccCCCCC-------------chHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNN-------------GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCV 67 (424) Q Consensus 1 ~~~~~~~~~~~~~~-------------G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i 67 (424) ..+|+..+.|-.+. .=++++..+..+.......... . ...... -...-+.++....+| T Consensus 21 ~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~--~------~~~~~~-~~~~ki~~n~~~~iv 91 (481) T protein:vir:10 21 FVVSDLAELLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGER--R------LQKYGD-KADHRAVHNYAKYVS 91 (481) T ss_pred eeeecchhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCcc--c------cccccc-cccceeecchHHHHH Confidence 11122222222110 1112222222111100000000 0 000000 000112356667788 Q ss_pred HHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEE Q lcl|NC_019705. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..++-+.+-|+.+.-. +. .....+..++.. | ....+...+..+.+.+|.||+.+..+.+|.+ .+.. T Consensus 92 d~~~~~l~g~~~~~~~~--d~-----~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~-~i~~ 158 (481) T protein:vir:10 92 RFIVGYLTGNPITITHQ--DN-----QTNDKIIELNDL--N---DADEVNSDLALNLSIYGRAYEIVYRDFEDRD-TFKV 158 (481) T ss_pred HHHHhhhccCCceEecC--Ch-----hHHHHHHHHHHh--c---ChhHHHHHHHHHHHhcCeEEEEEEeCCCCeE-EEEE Confidence 88888777777764321 11 112345555542 2 2446788899999999999999988888876 4777 Q ss_pred ecCceeEEeecCce---E-----EEEEEeCC-ce----EEecHhHEEEeecCC---------------------CCCccc Q lcl|NC_019705. 148 LQSANMDVKLVGKK---V-----VYRYQRDS-EY----AEFSQKEIFHLKGFG---------------------FTGLVG 193 (424) Q Consensus 148 l~~~~v~~~~~~~~---~-----~~~~~~~~-~~----~~~~~~eiih~r~~~---------------------~~~~~G 193 (424) ++|..+.+..++.. . +|...... .. ..+.++.|++++.-. .+...| T Consensus 159 ~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g 238 (481) T protein:vir:10 159 LDPKSTFVVYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEVEHYYNDVPIIEYLNDQFK 238 (481) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeecccccccCCceeEEEeecCCCC Confidence 88888877765421 1 11111111 10 123344444432110 012346 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccc Q lcl|NC_019705. 194 LSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGV 273 (424) Q Consensus 194 ~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~ 273 (424) .|-+..+...++............+...+.|..++.-.. ..+++....++..-. ....... .....+.+.++.-+.. T Consensus 239 ~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~-~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~l~~ 315 (481) T protein:vir:10 239 QGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNV-DLDSEDAKAFRDANM-IHLEPGT-NANGSEGKAEVKYVYK 315 (481) T ss_pred CCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCc-CCCccchhhhhhccc-eeccccc-cccCCCCCcceeEEee Confidence 676665555555444333333333444455655554221 123222222222100 0100000 0111122334433333 Q ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHh------hccChhh Q lcl|NC_019705. 274 TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR------WLIPAKD 347 (424) Q Consensus 274 ~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~------~l~~~~~ 347 (424) ......+.+..+...+.|+..-++|....+... ++.++...+.....+... +.-....+...+.+ +++...+ T Consensus 316 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k-~~~~~~~~~~~l~~~~~li~~~~~~~~ 393 (481) T protein:vir:10 316 QYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFS-GVQSGESMKYKLFGLEQV-RAIKERLFKKGLMKRYKLLLNNVNLTG 393 (481) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCccccccccc-cccHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 334455677888889999999999975554332 222322222111111111 11111111111111 1111111 Q ss_pred ---hcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC--CCCCee------eec-ccccchhhccc Q lcl|NC_019705. 348 ---VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPL--PGGDVA------MRQ-SQYVPITDLGT 415 (424) Q Consensus 348 ---~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~--~ggd~~------~~~-~n~~~~~~~~~ 415 (424) .....+.+.+..-...|..+.++.+.++. |+++.-.+.+++++-.- +..+.. ..+ .......+..+ T Consensus 394 ~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl~--g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 471 (481) T protein:vir:10 394 LKQHNYAELTITFTPNLPKSMMESINAFNALS--GGVSESTRLSLLDFIDNPKEELEKMQEEEAQREKQADKRGYGEAFE 471 (481) T ss_pred CCccccceeeEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhhccCCccCC Confidence 11122344445556678888888888874 78988777777766221 111000 000 01111112222 Q ss_pred cCCCcccCC Q lcl|NC_019705. 416 NKEPRNNGA 424 (424) Q Consensus 416 ~~~~~~~ga 424 (424) +.++.++|- T Consensus 472 ~~~~~dd~~ 480 (481) T protein:vir:10 472 NHLNVDDSN 480 (481) T ss_pred CCCCCCCCC Confidence 222222222 No 198 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=97.14 E-value=0.00015 Score=41.49 Aligned_cols=385 Identities=7% Similarity=-0.019 Sum_probs=169.3 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccchhhcccccc---------ccCcccc--cHHHHhccHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHG---------HLGDSSI--NDERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~v--s~~~~~~~~~v~~~i~~ 69 (424) |.-...-++ ....++.++...+.... +.......-+.+.. ...+... -...=+.++...-+|+. T Consensus 35 ~~~~~~~~~--~~~~~i~~~i~~~~~~~---~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~ 109 (492) T protein:vir:97 35 IVRTNNKPE--TLEEMIVRYIKQHLEKL---PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQ 109 (492) T ss_pred cccCCCchh--hHHHHHHHHHHHHHHHH---HHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHH Confidence 111111111 11122333322221100 00000000000000 0000000 00001234566778888 Q ss_pred HHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEec Q lcl|NC_019705. 70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQ 149 (424) Q Consensus 70 ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~ 149 (424) .++-+-+-|+.+.- .+.. ....+..++. |. -......+..+++.+|.||.++..+.+|.+ .+..++ T Consensus 110 ~~~yl~g~p~~~~~--~d~~-----~~~~l~~~~~---n~---~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~-~~~~~~ 175 (492) T protein:vir:97 110 KVSYIVGKPIAFKH--TDDE-----VVKRIDEVLG---NR---FDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVP 175 (492) T ss_pred HhhhhcccCceecc--CchH-----HHHHHHHHHh---cc---HHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEc Confidence 88888778876521 1111 1122333332 32 234556678899999999999999888875 467788 Q ss_pred CceeEEeecCce---E---EEEEEeCC--ceEEecHhHEEEeec----------------------CCC---------CC Q lcl|NC_019705. 150 SANMDVKLVGKK---V---VYRYQRDS--EYAEFSQKEIFHLKG----------------------FGF---------TG 190 (424) Q Consensus 150 ~~~v~~~~~~~~---~---~~~~~~~~--~~~~~~~~eiih~r~----------------------~~~---------~~ 190 (424) |..+.+..++.. . ...|.... ....+.+..+.++.. +.+ +. T Consensus 176 p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn 255 (492) T protein:vir:97 176 AEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNN 255 (492) T ss_pred ccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCC Confidence 988887765321 1 11111111 111122222322210 000 12 Q ss_pred cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeee Q lcl|NC_019705. 191 LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSA 270 (424) Q Consensus 191 ~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~ 270 (424) ..|.|-+......++....+.....+.+...+.|-.+++-....... ...... ...+++.++.+.+.+. T Consensus 256 ~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~----~~~~~~-------~~~~~~~~~~~~~~~~ 324 (492) T protein:vir:97 256 DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELP----EFKRLL-------RYYGAIKVSDNGGVDT 324 (492) T ss_pred CCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccch----hHHHHH-------hhccceecCCCCccee Confidence 35777777777777766665555566666666676665432211111 111111 1223555666655555 Q ss_pred cccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHH----------HHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019705. 271 IGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTLQPYISRWENSIQR 340 (424) Q Consensus 271 l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~----------~~~~~~tl~P~~~~ie~~l~~ 340 (424) +.....+..+....+...+.|+..-++|..-.... +++.++...+... ...+...+...++.|...++. T Consensus 325 l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~ 403 (492) T protein:vir:97 325 IQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKF-GSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI 403 (492) T ss_pred EeccCCHHHHHHHHHHHHHHHHHHhCCCCCCcccc-ccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 55444555667788888899999999885333221 1222222111111 111222222222222222211 Q ss_pred hccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCCeee-----ecccccchhhc Q lcl|NC_019705. 341 WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDVAM-----RQSQYVPITDL 413 (424) Q Consensus 341 ~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~--ggd~~~-----~~~n~~~~~~~ 413 (424) ..+. ..+.+.+..-+..|..+.++.+.++ .|+++..-+.++++.-.-+ ..++.- ...+...+... T Consensus 404 ----~~~~--~~i~v~f~~~~p~~~~e~a~~~~kl--~G~iS~et~l~~l~~v~d~~~Eleri~~E~~~~~~~~~~~~~~ 475 (492) T protein:vir:97 404 ----KGEH--KDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQTEYNKQLPNLDDG 475 (492) T ss_pred ----Cccc--ceeeEEecCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccC Confidence 1111 2233333455567788888888887 4889988888887763211 110000 00111111111 Q ss_pred cccCCCcccCC Q lcl|NC_019705. 414 GTNKEPRNNGA 424 (424) Q Consensus 414 ~~~~~~~~~ga 424 (424) +...+..+.+. T Consensus 476 ~~~~~~~~~~~ 486 (492) T protein:vir:97 476 GADSAQQQERS 486 (492) T ss_pred CCCCCcccccc Confidence 11111111111 No 199 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=97.13 E-value=0.00016 Score=41.44 Aligned_cols=383 Identities=11% Similarity=0.016 Sum_probs=163.9 Q ss_pred CCCCcccc--c--------CCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTI--D--------LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLI 70 (424) Q Consensus 1 ~~~~~~~~--~--------~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~i 70 (424) -..|+-++ + .+++.--+.++..+..+...... ..... ...+. ...+.+ ..=+.++....+|+.. T Consensus 20 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~-~~~~~-~~~~~-~~~~~~---~~ki~~n~~k~Ivd~~ 93 (474) T protein:vir:94 20 QLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVK-QMKKV-DVHGN-IDYDKP---DWRITTNFHQNLVDQK 93 (474) T ss_pred hhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhc-ccchh-ccccc-cccccC---cceeecchHHHHHHHH Confidence 01111111 0 01111112222233222110000 00000 00000 000000 0012245566788888 Q ss_pred HHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecC Q lcl|NC_019705. 71 STLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQS 150 (424) Q Consensus 71 a~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~ 150 (424) ++-+-+-|+.+-- .+.. ....+.. ++. |. .......+..+.+.+|.||+.+..+.+|.+ .+..++| T Consensus 94 ~~~l~g~p~~~~~--~d~~-----~~~~l~~-~~~--n~---~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p 159 (474) T protein:vir:94 94 VSYVASKPVTYSC--EDEN-----VLKVIHD-VLD--TR---WDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPA 159 (474) T ss_pred HhhhhcCCceecc--CcHH-----HHHHHHH-HHh--cc---HHHHHHHHHHHHhhcCceEEEEEecCCCee-EEEEEcc Confidence 8888888877521 1110 1122222 222 22 345556678899999999999989888875 4667888 Q ss_pred ceeEEeecCce---E---EEEEEeCCc--eEEecHhHEEEeec---------------------------CC----CCCc Q lcl|NC_019705. 151 ANMDVKLVGKK---V---VYRYQRDSE--YAEFSQKEIFHLKG---------------------------FG----FTGL 191 (424) Q Consensus 151 ~~v~~~~~~~~---~---~~~~~~~~~--~~~~~~~eiih~r~---------------------------~~----~~~~ 191 (424) ..+.+..++.. . ...|...+. ...+.++.+.+.+. .+ .+.. T Consensus 160 ~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~ 239 (474) T protein:vir:94 160 EQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKNNP 239 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecCCc Confidence 88887766421 1 111111111 01122222222110 00 0223 Q ss_pred ccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeec Q lcl|NC_019705. 192 VGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAI 271 (424) Q Consensus 192 ~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l 271 (424) .|.|-+..+...++....+.....+.+...+.|..+++--......+ .... ....+++.+++|.+.+.+ T Consensus 240 ~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~----~~~~-------~~~~~~i~~~~~~~~~~l 308 (474) T protein:vir:94 240 EEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEE----FMRG-------LKYYKAINVDGDGGVETI 308 (474) T ss_pred CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchh----hhhh-------hhccceeeccCCCceeEE Confidence 67777777777777665555555555555566666654322111111 1111 123456777776666665 Q ss_pred ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHH----------HHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019705. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ----------QNLGFLQYTLQPYISRWENSIQRW 341 (424) Q Consensus 272 ~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~----------~~~~~~~~tl~P~~~~ie~~l~~~ 341 (424) ........+.+..+...+.|...-++|..-.... .++.++...+. .....+...++..++.|..-+.. T Consensus 309 ~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~- 386 (474) T protein:vir:94 309 QVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKF-GSAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNL- 386 (474) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCccccCcccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC- Confidence 5544445556677888888888888884222111 12222211111 11122333333333333222211 Q ss_pred ccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CC-----Ceeeecccccchhhcc Q lcl|NC_019705. 342 LIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GG-----DVAMRQSQYVPITDLG 414 (424) Q Consensus 342 l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~--gg-----d~~~~~~n~~~~~~~~ 414 (424) ..+.....+.|+ .-+..+. .+.+..+.+.|+++..-+.++++.-+-+ .- ++--......+....+ T Consensus 387 ---~~d~~~i~v~f~--~~~p~~~---~e~a~~~~~~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~ 458 (474) T protein:vir:94 387 ---KTDVKDIEISFN--FNRMMND---AEQSQIIAQSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGG 458 (474) T ss_pred ---CcccceeeEEec--cCcccCH---HHHHHHHHHcCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCC Confidence 111122333443 2223344 4444556667999998888888652211 00 0000001111122111 Q ss_pred ccCCCcccCC Q lcl|NC_019705. 415 TNKEPRNNGA 424 (424) Q Consensus 415 ~~~~~~~~ga 424 (424) ...++.+.+. T Consensus 459 ~~~~~~~~~~ 468 (474) T protein:vir:94 459 ADGAQQQEGS 468 (474) T ss_pred CCCcccCCCC Confidence 1111111111 No 200 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=97.13 E-value=0.00016 Score=41.44 Aligned_cols=383 Identities=11% Similarity=0.016 Sum_probs=163.9 Q ss_pred CCCCcccc--c--------CCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTI--D--------LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLI 70 (424) Q Consensus 1 ~~~~~~~~--~--------~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~i 70 (424) -..|+-++ + .+++.--+.++..+..+...... ..... ...+. ...+.+ ..=+.++....+|+.. T Consensus 20 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~-~~~~~-~~~~~-~~~~~~---~~ki~~n~~k~Ivd~~ 93 (474) T protein:vir:97 20 QLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVK-QMKKV-DVHGN-IDYDKP---DWRITTNFHQNLVDQK 93 (474) T ss_pred hhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhc-ccchh-ccccc-cccccC---cceeecchHHHHHHHH Confidence 01111111 0 01111112222233222110000 00000 00000 000000 0012245566788888 Q ss_pred HHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecC Q lcl|NC_019705. 71 STLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQS 150 (424) Q Consensus 71 a~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~ 150 (424) ++-+-+-|+.+-- .+.. ....+.. ++. |. .......+..+.+.+|.||+.+..+.+|.+ .+..++| T Consensus 94 ~~~l~g~p~~~~~--~d~~-----~~~~l~~-~~~--n~---~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p 159 (474) T protein:vir:97 94 VSYVASKPVTYSC--EDEN-----VLKVIHD-VLD--TR---WDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPA 159 (474) T ss_pred HhhhhcCCceecc--CcHH-----HHHHHHH-HHh--cc---HHHHHHHHHHHHhhcCceEEEEEecCCCee-EEEEEcc Confidence 8888888877521 1110 1122222 222 22 345556678899999999999989888875 4667888 Q ss_pred ceeEEeecCce---E---EEEEEeCCc--eEEecHhHEEEeec---------------------------CC----CCCc Q lcl|NC_019705. 151 ANMDVKLVGKK---V---VYRYQRDSE--YAEFSQKEIFHLKG---------------------------FG----FTGL 191 (424) Q Consensus 151 ~~v~~~~~~~~---~---~~~~~~~~~--~~~~~~~eiih~r~---------------------------~~----~~~~ 191 (424) ..+.+..++.. . ...|...+. ...+.++.+.+.+. .+ .+.. T Consensus 160 ~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~ 239 (474) T protein:vir:97 160 EQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKNNP 239 (474) T ss_pred cceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecCCc Confidence 88887766421 1 111111111 01122222222110 00 0223 Q ss_pred ccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeec Q lcl|NC_019705. 192 VGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAI 271 (424) Q Consensus 192 ~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l 271 (424) .|.|-+..+...++....+.....+.+...+.|..+++--......+ .... ....+++.+++|.+.+.+ T Consensus 240 ~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~----~~~~-------~~~~~~i~~~~~~~~~~l 308 (474) T protein:vir:97 240 EEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEE----FMRG-------LKYYKAINVDGDGGVETI 308 (474) T ss_pred CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchh----hhhh-------hhccceeeccCCCceeEE Confidence 67777777777777665555555555555566666654322111111 1111 123456777776666665 Q ss_pred ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHH----------HHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019705. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ----------QNLGFLQYTLQPYISRWENSIQRW 341 (424) Q Consensus 272 ~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~----------~~~~~~~~tl~P~~~~ie~~l~~~ 341 (424) ........+.+..+...+.|...-++|..-.... .++.++...+. .....+...++..++.|..-+.. T Consensus 309 ~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~- 386 (474) T protein:vir:97 309 QVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKF-GSAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNL- 386 (474) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCccccCcccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC- Confidence 5544445556677888888888888884222111 12222211111 11122333333333333222211 Q ss_pred ccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CC-----Ceeeecccccchhhcc Q lcl|NC_019705. 342 LIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GG-----DVAMRQSQYVPITDLG 414 (424) Q Consensus 342 l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~--gg-----d~~~~~~n~~~~~~~~ 414 (424) ..+.....+.|+ .-+..+. .+.+..+.+.|+++..-+.++++.-+-+ .- ++--......+....+ T Consensus 387 ---~~d~~~i~v~f~--~~~p~~~---~e~a~~~~~~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~ 458 (474) T protein:vir:97 387 ---KTDVKDIEISFN--FNRMMND---AEQSQIIAQSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGG 458 (474) T ss_pred ---CcccceeeEEec--cCcccCH---HHHHHHHHHcCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCC Confidence 111122333443 2223344 4444556667999998888888652211 00 0000001111122111 Q ss_pred ccCCCcccCC Q lcl|NC_019705. 415 TNKEPRNNGA 424 (424) Q Consensus 415 ~~~~~~~~ga 424 (424) ...++.+.+. T Consensus 459 ~~~~~~~~~~ 468 (474) T protein:vir:97 459 ADGAQQQEGS 468 (474) T ss_pred CCCcccCCCC Confidence 1111111111 No 201 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=97.13 E-value=0.00016 Score=41.42 Aligned_cols=396 Identities=11% Similarity=0.051 Sum_probs=172.8 Q ss_pred CCCCccccc-----CCCCCch---------------HHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhcc Q lcl|NC_019705. 1 MEEPKYTID-----LRTNNGW---------------WARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQI 60 (424) Q Consensus 1 ~~~~~~~~~-----~~~~~G~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~ 60 (424) ...-.|.|. +-.+.-. ++++..+..+............. ... ...=+.. T Consensus 24 ~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~--------~~~---~~~ki~~ 92 (512) T protein:vir:97 24 EANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKE--------EYM---ADNRVAH 92 (512) T ss_pred ccccccccCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccc--------ccc---Ccceeec Confidence 222222221 0111111 22222222221100000000000 000 0000223 Q ss_pred HHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019705. 61 STVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G 140 (424) +...-+|+..++-+.+-|+.+--. ++. ....+..++.. | ........+..+++.+|.||.++.++.+| T Consensus 93 n~~k~Ivd~~~~yl~g~p~~~~~~--d~~-----~~~~l~~~~~~--n---~~~~~~~~~~~~~~i~G~ay~~vy~ded~ 160 (512) T protein:vir:97 93 DYASYISDFINGYFLGNPIQCQDD--DKD-----VLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (512) T ss_pred chHHHHHHHHhhhhcccCceeccC--ChH-----HHHHHHHHHhh--c---CHHHHHHHHHHHHHhcCeEEEEEEeCCCC Confidence 556668888888888888775321 111 11234444432 2 23456677889999999999999998888 Q ss_pred ceeEEEEecCceeEEeecCce---E--EEEE-E--e-C-C--c----eEEecHhHEEEeecCC----------------- Q lcl|NC_019705. 141 DVISLLPLQSANMDVKLVGKK---V--VYRY-Q--R-D-S--E----YAEFSQKEIFHLKGFG----------------- 187 (424) Q Consensus 141 ~~~~l~~l~~~~v~~~~~~~~---~--~~~~-~--~-~-~--~----~~~~~~~eiih~r~~~----------------- 187 (424) .+ .+..++|..+.+..++.. . ..+| . . . . . ...+.++.|++++... T Consensus 161 ~~-~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 239 (512) T protein:vir:97 161 ET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHS 239 (512) T ss_pred ce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccc Confidence 75 577889988887776431 1 1111 1 0 0 0 0 0124455555543110 Q ss_pred ---------CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCC---cc Q lcl|NC_019705. 188 ---------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGG---PV 255 (424) Q Consensus 188 ---------~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~---~~ 255 (424) .+...|.|-+..+...++....+..-..+.+...+.|-.++.-... .+++.....+....-.... .+ T Consensus 240 ~g~vPvv~~~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 318 (512) T protein:vir:97 240 FERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN-LDPVEVRKQKEANVLFLEPTVYEN 318 (512) T ss_pred CcccceEeecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcc-CCchhhhhhhhcccccccccchhh Confidence 0123577777777777776666655555555666666666543222 2222222111111111110 01 Q ss_pred cCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHH----------HHHHHHH Q lcl|NC_019705. 256 KKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQY 325 (424) Q Consensus 256 ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~----------~~~~~~~ 325 (424) .....-.++|.+++-+........+....+...+.|+..-++|..-.+... ++.|+....-. ....+.. T Consensus 319 ~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-gn~Sg~Al~~~~~~l~~ka~~k~~~f~~ 397 (512) T protein:vir:97 319 RDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTK 397 (512) T ss_pred cccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111122345556655555444445567788888999999999864433222 23332222111 1111222 Q ss_pred HHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CC---- Q lcl|NC_019705. 326 TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GD---- 399 (424) Q Consensus 326 tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--gd---- 399 (424) .|.-.++.|...+...--.........+.+.+..-+..|..+.++.+.++. |+++..-+.++++.-+-|. .+ T Consensus 398 ~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl~--giiS~et~~~~l~~v~d~~~E~eri~~ 475 (512) T protein:vir:97 398 GLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKIEE 475 (512) T ss_pred HHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHHH Confidence 222222222222211110000011112333334555677888888888874 8899988888876532110 00 Q ss_pred --eeeecccc-----cchhh-ccccCCCcccCC Q lcl|NC_019705. 400 --VAMRQSQY-----VPITD-LGTNKEPRNNGA 424 (424) Q Consensus 400 --~~~~~~n~-----~~~~~-~~~~~~~~~~ga 424 (424) +-...... .+-.. ..++++..++.+ T Consensus 476 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (512) T protein:vir:97 476 DEKESIKKAQKGIYKDPRDINDDEQDDDTKDTV 508 (512) T ss_pred HHHHHHHHHhhcccCCCCCCCCCCCCCCccccc Confidence 00000000 00000 001111112222 No 202 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=97.11 E-value=0.00017 Score=41.30 Aligned_cols=396 Identities=9% Similarity=0.045 Sum_probs=167.5 Q ss_pred CCCCcccccCCC-----CCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhc Q lcl|NC_019705. 1 MEEPKYTIDLRT-----NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~~~~~~~~~~-----~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia 75 (424) +..++--.++-+ ++-.++++..+..+............... . ...=+.++...-+|+..++-+- T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~--------~---~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:96 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEY--------M---ADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred hcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccc--------c---CcceeecchHHHHHHHHhhhhc Confidence 000000001100 01123333333332211000000000000 0 0001223556667788888777 Q ss_pred cCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEE Q lcl|NC_019705. 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDV 155 (424) Q Consensus 76 ~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~ 155 (424) +-|+.+.- .++. ....+..++.. | .-..+...+...++.+|.||.++.++.+|.+ .+..++|..+.+ T Consensus 108 g~p~~~~~--~d~~-----~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~p~~~~~ 174 (511) T protein:vir:96 108 GNPIQYQD--DDKD-----VLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFI 174 (511) T ss_pred ccCceeec--CchH-----HHHHHHHHHhh--c---ChhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEcccceEE Confidence 77877521 1111 11234444432 2 2335667788899999999999999888875 567788888887 Q ss_pred eecCce---EE--EE-EEe---C-C--c----eEEecHhHEEEeecCC--------------------------CCCccc Q lcl|NC_019705. 156 KLVGKK---VV--YR-YQR---D-S--E----YAEFSQKEIFHLKGFG--------------------------FTGLVG 193 (424) Q Consensus 156 ~~~~~~---~~--~~-~~~---~-~--~----~~~~~~~eiih~r~~~--------------------------~~~~~G 193 (424) ..++.. .. .+ |.. . . . ...+.++.+.+++... .+...| T Consensus 175 v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g 254 (511) T protein:vir:96 175 IYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERR 254 (511) T ss_pred EEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCC Confidence 766421 11 11 110 0 0 0 0123455555443210 012247 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHH-HHHHhCCcc-cCcceecCCCceeeec Q lcl|NC_019705. 194 LSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN-FKEIAGGPV-KKRLWILEAGFSTSAI 271 (424) Q Consensus 194 ~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~-~~~~~~~~~-ag~~~~l~~g~~~~~l 271 (424) .|-+..+...++....+..-..+.+...+.|-.++.-... .+++..+..+.. .-....... ...-.-...+.+++-+ T Consensus 255 ~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 333 (511) T protein:vir:96 255 KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN-LDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYI 333 (511) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCcc-CCchhhcccccccceeccccceeccccccCCCCcceeEE Confidence 7777776666665555544444444555556555543222 222221111110 000000000 0000111223344444 Q ss_pred ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHH----------HHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019705. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTLQPYISRWENSIQRW 341 (424) Q Consensus 272 ~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~----------~~~~~~tl~P~~~~ie~~l~~~ 341 (424) ........+....+...+.|+..-++|....+... ++.++....... ...+...+.-.++.|...+... T Consensus 334 ~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~ 412 (511) T protein:vir:96 334 YKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNT 412 (511) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 44434455567788889999999999965443322 222322221111 1122222222222222222211 Q ss_pred ccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCC--------eeee-cccc--c Q lcl|NC_019705. 342 LIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGD--------VAMR-QSQY--V 408 (424) Q Consensus 342 l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~--ggd--------~~~~-~~n~--~ 408 (424) --.........+++.+..-+..|..+.++.+.++. |+++..-+.+++++-+-+ .-+ ..-. ..+. . T Consensus 413 ~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~ 490 (511) T protein:vir:96 413 RSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKD 490 (511) T ss_pred CCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccC Confidence 10001111122344445566778888888888885 789887777777552211 000 0000 0000 0 Q ss_pred chhhc-cccCCCcccCC Q lcl|NC_019705. 409 PITDL-GTNKEPRNNGA 424 (424) Q Consensus 409 ~~~~~-~~~~~~~~~ga 424 (424) +-+.- .++.+.+++.+ T Consensus 491 ~~~~~~~~~~~~~~~~~ 507 (511) T protein:vir:96 491 PRDINDDEQDDDTKDTV 507 (511) T ss_pred CCCCCCCCCCCCccCcc Confidence 00000 00111111111 No 203 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=97.11 E-value=0.00017 Score=41.30 Aligned_cols=396 Identities=9% Similarity=0.045 Sum_probs=167.5 Q ss_pred CCCCcccccCCC-----CCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhc Q lcl|NC_019705. 1 MEEPKYTIDLRT-----NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~~~~~~~~~~-----~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia 75 (424) +..++--.++-+ ++-.++++..+..+............... . ...=+.++...-+|+..++-+- T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~--------~---~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:78 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEY--------M---ADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred hcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccc--------c---CcceeecchHHHHHHHHhhhhc Confidence 000000001100 01123333333332211000000000000 0 0001223556667788888777 Q ss_pred cCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEE Q lcl|NC_019705. 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDV 155 (424) Q Consensus 76 ~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~ 155 (424) +-|+.+.- .++. ....+..++.. | .-..+...+...++.+|.||.++.++.+|.+ .+..++|..+.+ T Consensus 108 g~p~~~~~--~d~~-----~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~p~~~~~ 174 (511) T protein:vir:78 108 GNPIQYQD--DDKD-----VLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFI 174 (511) T ss_pred ccCceeec--CchH-----HHHHHHHHHhh--c---ChhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEcccceEE Confidence 77877521 1111 11234444432 2 2335667788899999999999999888875 567788888887 Q ss_pred eecCce---EE--EE-EEe---C-C--c----eEEecHhHEEEeecCC--------------------------CCCccc Q lcl|NC_019705. 156 KLVGKK---VV--YR-YQR---D-S--E----YAEFSQKEIFHLKGFG--------------------------FTGLVG 193 (424) Q Consensus 156 ~~~~~~---~~--~~-~~~---~-~--~----~~~~~~~eiih~r~~~--------------------------~~~~~G 193 (424) ..++.. .. .+ |.. . . . ...+.++.+.+++... .+...| T Consensus 175 v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g 254 (511) T protein:vir:78 175 IYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERR 254 (511) T ss_pred EEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCC Confidence 766421 11 11 110 0 0 0 0123455555443210 012247 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHH-HHHHhCCcc-cCcceecCCCceeeec Q lcl|NC_019705. 194 LSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN-FKEIAGGPV-KKRLWILEAGFSTSAI 271 (424) Q Consensus 194 ~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~-~~~~~~~~~-ag~~~~l~~g~~~~~l 271 (424) .|-+..+...++....+..-..+.+...+.|-.++.-... .+++..+..+.. .-....... ...-.-...+.+++-+ T Consensus 255 ~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 333 (511) T protein:vir:78 255 KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN-LDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYI 333 (511) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCcc-CCchhhcccccccceeccccceeccccccCCCCcceeEE Confidence 7777776666665555544444444555556555543222 222221111110 000000000 0000111223344444 Q ss_pred ccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHH----------HHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019705. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTLQPYISRWENSIQRW 341 (424) Q Consensus 272 ~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~----------~~~~~~tl~P~~~~ie~~l~~~ 341 (424) ........+....+...+.|+..-++|....+... ++.++....... ...+...+.-.++.|...+... T Consensus 334 ~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~ 412 (511) T protein:vir:78 334 YKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNT 412 (511) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 44434455567788889999999999965443322 222322221111 1122222222222222222211 Q ss_pred ccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--CCC--------eeee-cccc--c Q lcl|NC_019705. 342 LIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGD--------VAMR-QSQY--V 408 (424) Q Consensus 342 l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~--ggd--------~~~~-~~n~--~ 408 (424) --.........+++.+..-+..|..+.++.+.++. |+++..-+.+++++-+-+ .-+ ..-. ..+. . T Consensus 413 ~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~ 490 (511) T protein:vir:78 413 RSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKD 490 (511) T ss_pred CCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccC Confidence 10001111122344445566778888888888885 789887777777552211 000 0000 0000 0 Q ss_pred chhhc-cccCCCcccCC Q lcl|NC_019705. 409 PITDL-GTNKEPRNNGA 424 (424) Q Consensus 409 ~~~~~-~~~~~~~~~ga 424 (424) +-+.- .++.+.+++.+ T Consensus 491 ~~~~~~~~~~~~~~~~~ 507 (511) T protein:vir:78 491 PRDINDDEQDDDTKDTV 507 (511) T ss_pred CCCCCCCCCCCCccCcc Confidence 00000 00111111111 No 204 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=97.06 E-value=0.00019 Score=41.03 Aligned_cols=386 Identities=10% Similarity=-0.003 Sum_probs=166.0 Q ss_pred CCCCc-----cc--ccCCCCCchHHHHHhhccCcccCCccccchhhcccccccc---------Ccccc-c-HHHHhccHH Q lcl|NC_019705. 1 MEEPK-----YT--IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHL---------GDSSI-N-DERILQIST 62 (424) Q Consensus 1 ~~~~~-----~~--~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~v-s-~~~~~~~~~ 62 (424) ..||. -. .+..+..-++.++........ +.......-+.+.... .+... . +..=+.++. T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~---~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~ 85 (474) T protein:vir:96 9 WDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKL---KDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNF 85 (474) T ss_pred CCCCCCcchhhhccccccchHHHHHHHHHHHHHHH---HHHHHHHHHhcccCccccccchhhhcccccccccccccccch Confidence 11110 01 112223334444444332110 0000000000000000 00000 0 000122455 Q ss_pred HHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce Q lcl|NC_019705. 63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV 142 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~ 142 (424) ....|+..++-+-+-|+.+--. +.. ....+...+. | ........+..+++.+|.||..+.++.+|.+ T Consensus 86 ~k~Iv~~~~~yl~g~p~~~~~~--~~~-----~~~~l~~~~~---n---~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~ 152 (474) T protein:vir:96 86 HQNLVDQKVSYVAGKPVTYAHD--DDK-----VLDVIHQVLD---T---RWDNKLIDILTAASNKGIDWLQVYINEDGEL 152 (474) T ss_pred HHHHHHhhhhhhcccCceeccC--ChH-----HHHHHHHHHh---c---cHHHHHHHHHHHHhhCCeEEEEeeeCCCCce Confidence 6678888888888888775321 111 1123333332 2 2345566778999999999999989888875 Q ss_pred eEEEEecCceeEEeecCce---E---EEEEEeCCce--EEecHhHEEEeecC----------------------C----- Q lcl|NC_019705. 143 ISLLPLQSANMDVKLVGKK---V---VYRYQRDSEY--AEFSQKEIFHLKGF----------------------G----- 187 (424) Q Consensus 143 ~~l~~l~~~~v~~~~~~~~---~---~~~~~~~~~~--~~~~~~eiih~r~~----------------------~----- 187 (424) .+..++|..+.+..+++. . .+.|...+.. ..+.++.+.++..- . T Consensus 153 -~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP 231 (474) T protein:vir:96 153 -KLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVP 231 (474) T ss_pred -EEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccc Confidence 567788888887765421 0 1112222211 11223333332210 0 Q ss_pred ----CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC Q lcl|NC_019705. 188 ----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE 263 (424) Q Consensus 188 ----~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~ 263 (424) .+...|.|-+......++....+.....+.+...+.|-.+++--.. + ........ .+..+++.++ T Consensus 232 vv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~--~--~~~~~~~~-------~~~~~~i~~~ 300 (474) T protein:vir:96 232 FIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEG--E--DLSEFMEG-------LKYYKAINVS 300 (474) T ss_pred eEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCc--c--cccchhhh-------hhccceeecc Confidence 0223566766666666665555444444444555556555432111 1 11111111 1233566676 Q ss_pred CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHH----------HHHHHHHHHHHHHHH Q lcl|NC_019705. 264 AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISR 333 (424) Q Consensus 264 ~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ 333 (424) ++.++.-+..+..+..+....+...+.|...-++|..-.... .++.++...+.. ....+...+.-.++. T Consensus 301 ~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~ 379 (474) T protein:vir:96 301 SDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKF-GSATSGIALKFLYTNLNLKANKLKNKANVALQELMQF 379 (474) T ss_pred CCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666665555555555667788888899999999985332211 122222111111 111222222222222 Q ss_pred HHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeee-----eccc Q lcl|NC_019705. 334 WENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAM-----RQSQ 406 (424) Q Consensus 334 ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--gd~~~-----~~~n 406 (424) |...+. ...+.....+.| ..-+..+..+.++ .+.+.|+++.-.+++.++.-.-+. .+..- .... T Consensus 380 i~~~~g----~~~d~~~i~i~f--~~~~p~~~~e~a~---~~~~~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~ 450 (474) T protein:vir:96 380 ILDFNK----IKLDAKEIEITF--NFNVMVNDLEQSQ---IGAQSQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQ 450 (474) T ss_pred HHHHhC----CCcccceeeEEe--cCCCccCHHHHHH---HHHHcCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhh Confidence 222111 011111122333 3334445444444 455679999988888887633211 11000 0001 Q ss_pred ccchhhccccCCCcccCC Q lcl|NC_019705. 407 YVPITDLGTNKEPRNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~~~~~~~ga 424 (424) ...+.+.+...+..+.++ T Consensus 451 ~~~~~~~~~~~~~~~~~~ 468 (474) T protein:vir:96 451 LPNLDDGGADGAQQQQQS 468 (474) T ss_pred ccccccccCCCCCCcCCC Confidence 111111111111111111 No 205 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=97.06 E-value=0.00019 Score=41.03 Aligned_cols=386 Identities=10% Similarity=-0.003 Sum_probs=166.0 Q ss_pred CCCCc-----cc--ccCCCCCchHHHHHhhccCcccCCccccchhhcccccccc---------Ccccc-c-HHHHhccHH Q lcl|NC_019705. 1 MEEPK-----YT--IDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHL---------GDSSI-N-DERILQIST 62 (424) Q Consensus 1 ~~~~~-----~~--~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~v-s-~~~~~~~~~ 62 (424) ..||. -. .+..+..-++.++........ +.......-+.+.... .+... . +..=+.++. T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~---~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~ 85 (474) T protein:vir:95 9 WDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKL---KDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNF 85 (474) T ss_pred CCCCCCcchhhhccccccchHHHHHHHHHHHHHHH---HHHHHHHHHhcccCccccccchhhhcccccccccccccccch Confidence 11110 01 112223334444444332110 0000000000000000 00000 0 000122455 Q ss_pred HHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce Q lcl|NC_019705. 63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV 142 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~ 142 (424) ....|+..++-+-+-|+.+--. +.. ....+...+. | ........+..+++.+|.||..+.++.+|.+ T Consensus 86 ~k~Iv~~~~~yl~g~p~~~~~~--~~~-----~~~~l~~~~~---n---~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~ 152 (474) T protein:vir:95 86 HQNLVDQKVSYVAGKPVTYAHD--DDK-----VLDVIHQVLD---T---RWDNKLIDILTAASNKGIDWLQVYINEDGEL 152 (474) T ss_pred HHHHHHhhhhhhcccCceeccC--ChH-----HHHHHHHHHh---c---cHHHHHHHHHHHHhhCCeEEEEeeeCCCCce Confidence 6678888888888888775321 111 1123333332 2 2345566778999999999999989888875 Q ss_pred eEEEEecCceeEEeecCce---E---EEEEEeCCce--EEecHhHEEEeecC----------------------C----- Q lcl|NC_019705. 143 ISLLPLQSANMDVKLVGKK---V---VYRYQRDSEY--AEFSQKEIFHLKGF----------------------G----- 187 (424) Q Consensus 143 ~~l~~l~~~~v~~~~~~~~---~---~~~~~~~~~~--~~~~~~eiih~r~~----------------------~----- 187 (424) .+..++|..+.+..+++. . .+.|...+.. ..+.++.+.++..- . T Consensus 153 -~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP 231 (474) T protein:vir:95 153 -KLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVP 231 (474) T ss_pred -EEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccc Confidence 567788888887765421 0 1112222211 11223333332210 0 Q ss_pred ----CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC Q lcl|NC_019705. 188 ----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE 263 (424) Q Consensus 188 ----~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~ 263 (424) .+...|.|-+......++....+.....+.+...+.|-.+++--.. + ........ .+..+++.++ T Consensus 232 vv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~--~--~~~~~~~~-------~~~~~~i~~~ 300 (474) T protein:vir:95 232 FIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEG--E--DLSEFMEG-------LKYYKAINVS 300 (474) T ss_pred eEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCc--c--cccchhhh-------hhccceeecc Confidence 0223566766666666665555444444444555556555432111 1 11111111 1233566676 Q ss_pred CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHH----------HHHHHHHHHHHHHHH Q lcl|NC_019705. 264 AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISR 333 (424) Q Consensus 264 ~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ 333 (424) ++.++.-+..+..+..+....+...+.|...-++|..-.... .++.++...+.. ....+...+.-.++. T Consensus 301 ~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~ 379 (474) T protein:vir:95 301 SDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKF-GSATSGIALKFLYTNLNLKANKLKNKANVALQELMQF 379 (474) T ss_pred CCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666665555555555667788888899999999985332211 122222111111 111222222222222 Q ss_pred HHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeee-----eccc Q lcl|NC_019705. 334 WENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAM-----RQSQ 406 (424) Q Consensus 334 ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--gd~~~-----~~~n 406 (424) |...+. ...+.....+.| ..-+..+..+.++ .+.+.|+++.-.+++.++.-.-+. .+..- .... T Consensus 380 i~~~~g----~~~d~~~i~i~f--~~~~p~~~~e~a~---~~~~~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~ 450 (474) T protein:vir:95 380 ILDFNK----IKLDAKEIEITF--NFNVMVNDLEQSQ---IGAQSQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQ 450 (474) T ss_pred HHHHhC----CCcccceeeEEe--cCCCccCHHHHHH---HHHHcCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhh Confidence 222111 011111122333 3334445444444 455679999988888887633211 11000 0001 Q ss_pred ccchhhccccCCCcccCC Q lcl|NC_019705. 407 YVPITDLGTNKEPRNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~~~~~~~ga 424 (424) ...+.+.+...+..+.++ T Consensus 451 ~~~~~~~~~~~~~~~~~~ 468 (474) T protein:vir:95 451 LPNLDDGGADGAQQQQQS 468 (474) T ss_pred ccccccccCCCCCCcCCC Confidence 111111111111111111 No 206 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=97.02 E-value=0.00021 Score=40.79 Aligned_cols=382 Identities=9% Similarity=-0.009 Sum_probs=178.4 Q ss_pred CCCC------cccccCCC-CCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHh Q lcl|NC_019705. 1 MEEP------KYTIDLRT-NNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~------~~~~~~~~-~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ 73 (424) -++. ++-=+.++ ++--++++..+..+.... ..... ....+ ..=+.++....+|+..++- T Consensus 22 ~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i-~~~~~----------~~~~~---~~ki~~n~~~~Ivd~~~~~ 87 (470) T protein:vir:99 22 GEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKI-LTAPE----------KETGA---DNRIVVNSAKYVVDVYNGY 87 (470) T ss_pred CCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhcccccc-ccCcc----------cccCC---cceeecchHHHHHHHHhhh Confidence 0000 01001111 123456666665543211 10000 00000 0012345566788888887 Q ss_pred hccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCcee Q lcl|NC_019705. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v 153 (424) +-+-|+.+.-.++ .. ....+..++.. | ........+....+.+|.||+.+..+.+|.+ .+..++|..+ T Consensus 88 l~g~p~~~~~~~d-~~-----~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~-~i~~~~p~~~ 155 (470) T protein:vir:99 88 FCGIEPKLALLND-SS-----KIDEIARWNRQ--E---NFFDTINEISKQCDIFGRSIASIYQGEDARP-HLMYSSPNHA 155 (470) T ss_pred hccCCeeEeeCCc-hh-----HHHHHHHHHHh--c---CHhHHHHHHHHHHHhcCeeEEEEEeCCCCeE-EEEEEcccee Confidence 7777877532221 11 11234444432 2 3456778889999999999999988888876 4777899988 Q ss_pred EEeecCce---E--EE-EEEe-CCc-----eEEecHhHEEEeecC-------------------C----CCCcccCchHH Q lcl|NC_019705. 154 DVKLVGKK---V--VY-RYQR-DSE-----YAEFSQKEIFHLKGF-------------------G----FTGLVGLSPIA 198 (424) Q Consensus 154 ~~~~~~~~---~--~~-~~~~-~~~-----~~~~~~~eiih~r~~-------------------~----~~~~~G~s~i~ 198 (424) .+..++.. . .. .|.. ++. ...+.++.+++++.. + .+...|.|-+. T Consensus 156 ~~i~d~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e 235 (470) T protein:vir:99 156 FIIYDDTVQRQPLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAINPYGLVPAVEFFENEERQGIFD 235 (470) T ss_pred EEEEcCCCCcceEEEEEEEEEecCCeeEEEEEEEecCeEEEEEecccccccccccccccCCCccceEeecCCCCCCcchH Confidence 87776431 1 11 1111 111 011223333332110 0 12335777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeeccc Q lcl|NC_019705. 199 FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGV 273 (424) Q Consensus 199 ~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~ 273 (424) .+...++....+.......+...+.|-.++.-... ..++.-+.++.. . ..+++.+ +.+.++..+.. T Consensus 236 ~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~-~~~~~g~~~~~~-~-------~~~~~~~~~~~~~~~~~~~~l~~ 306 (470) T protein:vir:99 236 SIKTLINALDKVISQKANQVEYFDNAYMYMIGFKL-PEDDEGNPKFDF-K-------NNRVLYVSQLDPDTNPQIGFIAK 306 (470) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc-ccccccchhhhh-h-------hcceeeecCCCCCCCCcceEEee Confidence 77777776666655556566666667666653211 111111111111 1 1122322 23445555554 Q ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHH----------HHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_019705. 274 TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL----------GFLQYTLQPYISRWENSIQRWLI 343 (424) Q Consensus 274 ~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~----------~~~~~tl~P~~~~ie~~l~~~l~ 343 (424) ......+....+...+.|+..-++|+...+... ++.++...+.... ..+...|.-.++.+...+..+-- T Consensus 307 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~ 385 (470) T protein:vir:99 307 PDADQMQENLIQHLTDFIFMMAMVPNIQDKNFA-GNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQ 385 (470) T ss_pred cCChHHHHHHHHHHHHHHHHHhCCccccccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 444444566788889999999999964433222 2233222211111 22222333333323222222111 Q ss_pred ChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCC-CCCCCeee---------ecccccchhhc Q lcl|NC_019705. 344 PAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPP-LPGGDVAM---------RQSQYVPITDL 413 (424) Q Consensus 344 ~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p-~~ggd~~~---------~~~n~~~~~~~ 413 (424) ...+ ...+.+.+..-+..|....++.+.++. |+++...+.+.++.-. -.+-+.+. ......+.+.. T Consensus 386 ~~~~--~~~i~v~f~~~~p~~~~e~a~~~~kl~--giis~et~l~~l~~vd~~~E~eri~~E~~~~~~~~~~~~~~~d~~ 461 (470) T protein:vir:99 386 DQEL--WSELDFKFTRNLPEDMASAIDNAKNAE--GIVSKKTQLGMIPDIEPDAEMKQIAKEKADAIKQTQQLSMPIDIL 461 (470) T ss_pred cccc--cccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCCcC Confidence 1111 123344445666678888888888875 7899888888776531 10000000 00011122222 Q ss_pred cccCCCccc Q lcl|NC_019705. 414 GTNKEPRNN 422 (424) Q Consensus 414 ~~~~~~~~~ 422 (424) ....+++++ T Consensus 462 ~~d~~~ee~ 470 (470) T protein:vir:99 462 KRDNNAEEE 470 (470) T ss_pred CCCCCccCC Confidence 222222222 No 207 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=96.91 E-value=0.00026 Score=40.20 Aligned_cols=382 Identities=10% Similarity=0.051 Sum_probs=170.3 Q ss_pred CCCCccccc-----------------CCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHH Q lcl|NC_019705. 1 MEEPKYTID-----------------LRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTV 63 (424) Q Consensus 1 ~~~~~~~~~-----------------~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v 63 (424) |+.+|+-.- .++++.-+.++..+..+........ ....+.. . .=+.++.. T Consensus 3 ~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~----------~~~~~~~-~--~ki~~n~~ 69 (453) T protein:vir:73 3 LKPIKLMTYSRDEEITDKVVNDFMKKHQEEVERYEYLGNMYKGIMEISSQK----------AKDSWKP-D--NRLTNNFA 69 (453) T ss_pred cccceeeeccccccCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCC----------CCCccCc-c--ceeecchH Confidence 444443111 0111122222222222211000000 0000000 0 01224566 Q ss_pred HHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCcee Q lcl|NC_019705. 64 WRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVI 143 (424) Q Consensus 64 ~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~ 143 (424) ..+|+..++-+-+-|+.+-- + +. .....+..++.. | ........+..+.+.+|.||+.+..+.+|.+ T Consensus 70 ~~ivd~~~~~l~g~~~~~~~-~-d~-----~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~- 136 (453) T protein:vir:73 70 KYIVDTFVGYFNGIPIKKTH-D-DK-----SVLEAMQLFDNL--N---DMEDEESELAKIACVYGRAYELMYQNESTES- 136 (453) T ss_pred HHHHHHhhhhhcccCceeec-C-Ch-----HHHHHHHHHHHh--c---ChhHHHHHHHHHHHhcCeEEEEEEeCCCCce- Confidence 67788777777666766421 1 11 111234344332 2 2345667788999999999999999888876 Q ss_pred EEEEecCceeEEeecCce---E----EEEEEeCCce--EEecHhHEEEeecC-----------C----------CCCccc Q lcl|NC_019705. 144 SLLPLQSANMDVKLVGKK---V----VYRYQRDSEY--AEFSQKEIFHLKGF-----------G----------FTGLVG 193 (424) Q Consensus 144 ~l~~l~~~~v~~~~~~~~---~----~~~~~~~~~~--~~~~~~eiih~r~~-----------~----------~~~~~G 193 (424) .+..++|..+.+..++.. . .|.+..++.. ..+.++.+++++.. + .+...| T Consensus 137 ~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g 216 (453) T protein:vir:73 137 EVIYCSPLNVFMVYDDSIKQKPLFAVYYGFDEEGNLSGTVYTLLETISITGKAGEVKFGESTYNVYSDLPIVEYNFNEER 216 (453) T ss_pred EEEEEcccceEEEEeCCCCceeEEEEEEEEecCceEEEEEEeCCeEEEEEecCCceEEccceeccCCceeEEEecCCCCC Confidence 466678888876665421 0 1111111111 11233333333210 0 012357 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccc Q lcl|NC_019705. 194 LSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGV 273 (424) Q Consensus 194 ~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~ 273 (424) .|-+..+...++....+.....+.....+.|..++.-- ..+++....++..--........+.....+.+.+++-+.. T Consensus 217 ~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~ 294 (453) T protein:vir:73 217 QSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGA--EVDEEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDK 294 (453) T ss_pred CcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecC--CCCchhhhcccccccccccccccccccccccCceeEEeee Confidence 77777766666655555544455555556666665422 2233333333221111111122233334445555555554 Q ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHH----------HHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_019705. 274 TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL----------GFLQYTLQPYISRWENSIQRWLI 343 (424) Q Consensus 274 ~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~----------~~~~~tl~P~~~~ie~~l~~~l~ 343 (424) ...+..+....+...+.|+..-++|. +....-++.++...+.... ..+...+.-.++.+..-++.. - T Consensus 295 ~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~-~ 371 (453) T protein:vir:73 295 PDSDVQTENLLNRLERSIFQFTMAAN--ISDENFGNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNA-S 371 (453) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCcc--cCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-C Confidence 44455566778888888999888984 2222223333222222111 122222222222222111111 0 Q ss_pred ChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccccchhh--cc------- Q lcl|NC_019705. 344 PAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITD--LG------- 414 (424) Q Consensus 344 ~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~--~~------- 414 (424) ...+ ...+++.+..-+..|..+.++.+.++. |+++..-+.++++.-+-+..+ ...+.. .. T Consensus 372 ~~~~--~~~i~v~f~~~~p~~~~~~a~~~~k~~--giis~et~~~~~~~~~d~~~E-------~~ri~~E~~~~~~~~~~ 440 (453) T protein:vir:73 372 NKDA--WKDIEYTFTRNEPKDIKEQAETANILK--GITSEETALSVISVIPDVQAE-------MEKIKKKKLLQLSLTRT 440 (453) T ss_pred Cccc--cccceEEeCCCCCCCHHHHHHHHHHHh--ccCcHHHHHHhCCCCCCHHHH-------HHHHHHHHHHHHHHHHh Confidence 1111 122344445666788889999988886 789887777777663211110 000100 00 Q ss_pred --ccCCCcccCC Q lcl|NC_019705. 415 --TNKEPRNNGA 424 (424) Q Consensus 415 --~~~~~~~~ga 424 (424) -.+...+.|- T Consensus 441 ~~~~~~~~~~~~ 452 (453) T protein:vir:73 441 SNLVRMKQMRGN 452 (453) T ss_pred ccCCcchhhhcC Confidence 0000011111 No 208 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=96.60 E-value=0.00048 Score=38.78 Aligned_cols=395 Identities=10% Similarity=0.053 Sum_probs=166.0 Q ss_pred CCCCc----ccc----cCCCC----------------CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHH Q lcl|NC_019705. 1 MEEPK----YTI----DLRTN----------------NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDER 56 (424) Q Consensus 1 ~~~~~----~~~----~~~~~----------------~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~ 56 (424) -=.++ |.| +..+. +--++++..+..+............. ...+ .. T Consensus 20 ~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~--------~~~~---~~ 88 (511) T protein:vir:99 20 LFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKE--------EYMA---DN 88 (511) T ss_pred hhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccc--------cccC---cc Confidence 00000 000 00110 11222333333322110000000000 0000 00 Q ss_pred HhccHHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_019705. 57 ILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) Q Consensus 57 ~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r 136 (424) =+.++...-.|+..++-+.+-|+.+--. +.. ....+..++.. | ........+..+++.+|.||.++.+ T Consensus 89 ki~~n~~k~Iv~~~~~yl~g~p~~~~~~--d~~-----~~~~l~~~~~~--n---~~~~~~~~~~~~~~i~G~a~~~vy~ 156 (511) T protein:vir:99 89 RVAHDYASYISDFINGYFLGNPIQYQDD--DKD-----VLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIR 156 (511) T ss_pred eeecchHHHHHHHHHhhhcccCceeecC--chH-----HHHHHHHHHhh--c---CHhHHHHHHHHHHHhcCeeEEEEEe Confidence 0223455667777787777778775211 111 12334444432 2 2345667788999999999999999 Q ss_pred CCCCceeEEEEecCceeEEeecCc---eE--EEE-EEe---C---Cc----eEEecHhHEEEeecCC------------- Q lcl|NC_019705. 137 NSAGDVISLLPLQSANMDVKLVGK---KV--VYR-YQR---D---SE----YAEFSQKEIFHLKGFG------------- 187 (424) Q Consensus 137 ~~~G~~~~l~~l~~~~v~~~~~~~---~~--~~~-~~~---~---~~----~~~~~~~eiih~r~~~------------- 187 (424) +.+|.+ .+..++|..+.+..++. .. .++ |.. . .. ...+.++.+.+++... T Consensus 157 ded~~~-~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~ 235 (511) T protein:vir:99 157 NQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGF 235 (511) T ss_pred CCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCcccccccccccc Confidence 888865 56778888888776542 11 111 110 0 00 1124444554443110 Q ss_pred -------------CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHH---HHHHh Q lcl|NC_019705. 188 -------------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN---FKEIA 251 (424) Q Consensus 188 -------------~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~---~~~~~ 251 (424) .+...|.|-+..+...++....+.....+.+...+.|-.++.-... .++......++. +-... T Consensus 236 ~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~-~~~~~~~~~~~~~~~~~~~~ 314 (511) T protein:vir:99 236 ESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN-LDPVEVRKQKEANVLFLEPT 314 (511) T ss_pred ccCCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcc-cCchhhcccccccceecccc Confidence 0123577767766666665555444444444444445444432211 222221111110 00000 Q ss_pred CCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHHH--------- Q lcl|NC_019705. 252 GGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF--------- 322 (424) Q Consensus 252 ~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~--------- 322 (424) .. -.+......+|.+++-+........+....+...+.|+..-++|..-.+... ++.++....-....+ T Consensus 315 ~~-~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-gn~Sg~Alk~~~~~l~~ka~~k~~ 392 (511) T protein:vir:99 315 VY-ADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEG 392 (511) T ss_pred cc-cccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHH Confidence 00 0111222344556665555444555667788889999999999975443222 233322222111111 Q ss_pred -HHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--C- Q lcl|NC_019705. 323 -LQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--G- 398 (424) Q Consensus 323 -~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--g- 398 (424) +...+.-.++.|...+...--.........+++.+..-+..|..+.++.+.++. |+++.--+.++++.-+-+. - T Consensus 393 ~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~--GiiS~et~l~~l~~v~D~~~E~~ 470 (511) T protein:vir:99 393 LFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVK 470 (511) T ss_pred HHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhCCCCCCHHHHHH Confidence 111222222222221111100000001112334444555677888888888874 8899888888875522110 0 Q ss_pred -------Cee---eecccccchhhccccCCCcccCC Q lcl|NC_019705. 399 -------DVA---MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 399 -------d~~---~~~~n~~~~~~~~~~~~~~~~ga 424 (424) +.. ..+....+-....++++++++.. T Consensus 471 ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (511) T protein:vir:99 471 KIEEDEKESIKKAQKNMYQDPRNINDDEQDDSTKDS 506 (511) T ss_pred HHHHHHHHHHHHHhhcccccCCCCCCCCCCCCCcCc Confidence 000 00000000000000111111111 No 209 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=96.54 E-value=0.00053 Score=38.56 Aligned_cols=384 Identities=9% Similarity=0.001 Sum_probs=164.8 Q ss_pred CCCCcccccCC--------------CCCchH--------------HHHHhhccCcccCCccccchhhccccccccCcccc Q lcl|NC_019705. 1 MEEPKYTIDLR--------------TNNGWW--------------ARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSI 52 (424) Q Consensus 1 ~~~~~~~~~~~--------------~~~G~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 52 (424) |-+-.|-++.- +..-++ .++..+..+............ . ......+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~--~-~~~~~~~~-- 75 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRD--V-NGDYDETK-- 75 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhh--c-cccccccc-- Confidence 32222211111 111222 222323222110000000000 0 00000000 Q ss_pred cHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEE Q lcl|NC_019705. 53 NDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYA 132 (424) Q Consensus 53 s~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~ 132 (424) +..=+.++....+|+..++-+-+-|+.+.- .+.. ....+...+. | ........+...++.+|.+|+ T Consensus 76 -~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~--~~~~-----~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~ 141 (478) T protein:vir:10 76 -PDWRMYTNYHQNLVDQKVAYAVANPVTFGV--DNDK-----ALKQIQHTLN---H---KWDDKLVDILTAASNKGIEWV 141 (478) T ss_pred -ccceeccchHHHHHHHHhhhhcccCceeec--CChH-----HHHHHHHHHh---c---cHHHHHHHHHHHHhhCCeEEE Confidence 000123456677888888888888877521 1111 1123344442 2 235666777899999999999 Q ss_pred EEeeCCCCceeEEEEecCceeEEeecCc---eE---EEEEEeCCc--eEEecHhHEEEeecC------------------ Q lcl|NC_019705. 133 LVDRNSAGDVISLLPLQSANMDVKLVGK---KV---VYRYQRDSE--YAEFSQKEIFHLKGF------------------ 186 (424) Q Consensus 133 ~~~r~~~G~~~~l~~l~~~~v~~~~~~~---~~---~~~~~~~~~--~~~~~~~eiih~r~~------------------ 186 (424) .+..+.+|.+ .+..++|..+.+..++. .. .+.|...+. ...+.++.|.+.+.. T Consensus 142 ~v~~d~~~~~-~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 220 (478) T protein:vir:10 142 QPYVDEEGEF-KTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPH 220 (478) T ss_pred EEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeeeCceEEEEEeCCcEEEEEecCCeeeccccccccccccc Confidence 9988888875 46778888887766532 11 111221111 111233333332210 Q ss_pred -------C----------CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHH Q lcl|NC_019705. 187 -------G----------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKE 249 (424) Q Consensus 187 -------~----------~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~ 249 (424) + .+...|.|-+......++....+.....+.+...+.|-.+++--......+....++ T Consensus 221 ~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~----- 295 (478) T protein:vir:10 221 YYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLK----- 295 (478) T ss_pred eecccccccCCcceEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccchhhhhh----- Confidence 0 012347777776666666555554444444444455554544221111111111111 Q ss_pred HhCCcccCcceec--CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHH------- Q lcl|NC_019705. 250 IAGGPVKKRLWIL--EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL------- 320 (424) Q Consensus 250 ~~~~~~ag~~~~l--~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~------- 320 (424) ..+++.+ +.|.+++-+........+.+..+...+.|...-++|..-.... .++.++...+-... T Consensus 296 ------~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Ai~~~~~~l~~k~~ 368 (478) T protein:vir:10 296 ------YYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKF-GNSPSGIALKFMYSNLDLKAN 368 (478) T ss_pred ------hCceeEecCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCcccc-ccchHHHHHHHHHHHHHHHHH Confidence 1123333 2333444444444455567788888999999999985322211 12222222111111 Q ss_pred ---HHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Q lcl|NC_019705. 321 ---GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 321 ---~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g 397 (424) ..+...+.-.++.|... +... .....+.+.++.-+..|..+.++.+.++ .|+++..-+.++++.-.-+. T Consensus 369 ~~~~~~~~~l~~~~~li~~~-----~~~~-~d~~~i~i~f~~~~p~~~~e~~~~~~~~--~g~iS~et~i~~~~~v~d~~ 440 (478) T protein:vir:10 369 KLKNKTLTALQELLQYIIDF-----YRLD-VRVQDIEITFNFNVMVNELENSQIAMNS--TGLLSKETILGNHSWVQDPV 440 (478) T ss_pred HHHHHHHHHHHHHHHHHHHH-----hCCC-cccccceEEeCCCCCCCHHHHHHHHHHH--hCCCChHHHHHhCCCCCCHH Confidence 11122222222222111 1111 1111233333445556777778877766 58898877777776521110 Q ss_pred C-------Ceeeecccccchhh---ccccCCCcccCC Q lcl|NC_019705. 398 G-------DVAMRQSQYVPITD---LGTNKEPRNNGA 424 (424) Q Consensus 398 g-------d~~~~~~n~~~~~~---~~~~~~~~~~ga 424 (424) . ++--.......... -.+..++.++++ T Consensus 441 ~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~d~~~ 477 (478) T protein:vir:10 441 AEMERIEQENIELNQQLPDIEEGLNDEQQRQSEDNQS 477 (478) T ss_pred HHHHHHHHHHHHHHHhccccCCCCcccccccCcCCCC Confidence 0 00000000111111 112233444455 No 210 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=96.40 E-value=0.00066 Score=38.01 Aligned_cols=384 Identities=10% Similarity=0.006 Sum_probs=162.4 Q ss_pred CCCCccccc------------C--CCCCchHHHHH--------------hhccCcccCCccccchhhccccccccCcccc Q lcl|NC_019705. 1 MEEPKYTID------------L--RTNNGWWARLQ--------------SWFVGGRLVTPNQGSQTGPVSAHGHLGDSSI 52 (424) Q Consensus 1 ~~~~~~~~~------------~--~~~~G~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 52 (424) |-+-.|-++ - .+..=|+.++. .+..+.......... ...........+ T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~---~~~~~~~~~~~~- 76 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPPK---RDVNGDYDETKP- 76 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccc---cccccccccccc- Confidence 222211000 0 11122333332 222221100000000 000000000000 Q ss_pred cHHHHhccHHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEE Q lcl|NC_019705. 53 NDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYA 132 (424) Q Consensus 53 s~~~~~~~~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~ 132 (424) ..-+.++....+|+..+.-+-+-|+.+. .+. .. ....+..++. | ........+..+++.+|.||+ T Consensus 77 --~~ki~~n~~~~ivd~~~~~l~g~~~~~~-~~~-d~-----~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~ 141 (478) T protein:vir:10 77 --DWRMYTNYHQNLVDQKVAYAVANPVTFG-VDN-DK-----ALKQIQHTLN---H---KWDDKLVDILTAASNKGIEWV 141 (478) T ss_pred --cceeccchHHHHHHHHHhhhccCCeeee-cCC-hH-----HHHHHHHHHh---c---CHHHHHHHHHHHHHhcCeEEE Confidence 0012344566688888887777787752 211 11 1123444443 2 235566777889999999999 Q ss_pred EEeeCCCCceeEEEEecCceeEEeecCce---E---EEEEEeCCc--eEEecHhHEEEeecC------------------ Q lcl|NC_019705. 133 LVDRNSAGDVISLLPLQSANMDVKLVGKK---V---VYRYQRDSE--YAEFSQKEIFHLKGF------------------ 186 (424) Q Consensus 133 ~~~r~~~G~~~~l~~l~~~~v~~~~~~~~---~---~~~~~~~~~--~~~~~~~eiih~r~~------------------ 186 (424) .+..+.+|.+ .+..++|..+.+..+++. . .+.|...+. ...+.+++|.+.+.. T Consensus 142 ~~~~d~~g~~-~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 220 (478) T protein:vir:10 142 QPYVDEEGEF-KTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPH 220 (478) T ss_pred EEEecCCCee-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCeeeccccccccccccc Confidence 9988888876 466788888877665321 1 111111111 111222222222110 Q ss_pred -------C----------CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHH Q lcl|NC_019705. 187 -------G----------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKE 249 (424) Q Consensus 187 -------~----------~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~ 249 (424) + .+...|.|-+......++....+.....+.+...+.|-.++.--......+....+ T Consensus 221 ~~~~~~~~~~~~vPvv~~~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~------ 294 (478) T protein:vir:10 221 YYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNL------ 294 (478) T ss_pred eecccccccCCccceEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhhhhh------ Confidence 0 02335777676666666655555555555555555565554422111111111111 Q ss_pred HhCCcccCcceec--CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHH--------- Q lcl|NC_019705. 250 IAGGPVKKRLWIL--EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ--------- 318 (424) Q Consensus 250 ~~~~~~ag~~~~l--~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~--------- 318 (424) ..++++.+ +.|.+..-+........+.+..+...+.|...-++|..-.... .++.++...+.. T Consensus 295 -----~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~ 368 (478) T protein:vir:10 295 -----KYYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKF-GNSPSGIALKFMYSNLDLKAN 368 (478) T ss_pred -----hhcceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCcccc-ccccHHHHHHHHHHHHHHHHH Confidence 11233333 2333444343333445566778888888888888885332221 122222111111 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Q lcl|NC_019705. 319 -NLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 319 -~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g 397 (424) ....+...++-.++.|...+. ...+. ..+.+.+..-+..|..+.++.+.++ +|+++...+.+++++-.-+. T Consensus 369 ~~~~~~~~~l~~~~~li~~~~g----~~~~~--~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~ 440 (478) T protein:vir:10 369 KLKNKTLTALQELLQYIIDFYR----LDVKV--QDIEITFNFNVMVNELENSQIAMNS--TGLLSKETILSNHAWVEDPV 440 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHhC----CCccc--ccceEEecCCCCCCHHHHHHHHHHH--hCCCChHHHHHhCCCCCCHH Confidence 111122222222222211110 01111 1223333444557788888888876 68999988888887632111 Q ss_pred --CCeee-----ecccccch-hhcc-ccC-CCcccCC Q lcl|NC_019705. 398 --GDVAM-----RQSQYVPI-TDLG-TNK-EPRNNGA 424 (424) Q Consensus 398 --gd~~~-----~~~n~~~~-~~~~-~~~-~~~~~ga 424 (424) .+..- .......+ +... +.+ +..++.. T Consensus 441 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (478) T protein:vir:10 441 AEMERIEQENIELNQQLPDIEEGLNGEQQRQSENNQP 477 (478) T ss_pred HHHHHHHHHHHHHHhhccccccccCCCCCCCCCCCCC Confidence 00000 00001111 1111 111 1122222 No 211 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=96.17 E-value=0.00092 Score=37.24 Aligned_cols=381 Identities=7% Similarity=0.044 Sum_probs=166.2 Q ss_pred CC--------CCc------ccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHH Q lcl|NC_019705. 1 ME--------EPK------YTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRC 66 (424) Q Consensus 1 ~~--------~~~------~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~ 66 (424) || +.. +.=+++.+..-++++..+..+........ .. ..... ..-+.++....+ T Consensus 5 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~------~~----~~~~~---~~ki~~n~~~~I 71 (499) T protein:vir:10 5 IDKDLLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIEKHE------FD----NATVE---AANVMVNHAKYI 71 (499) T ss_pred hhhhHHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCC------cC----cCCCC---cceeecchHHHH Confidence 11 110 00012223333444444443321100000 00 00000 001123455667 Q ss_pred HHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce---- Q lcl|NC_019705. 67 VSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV---- 142 (424) Q Consensus 67 i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~---- 142 (424) |+..++-+-+-|+.+.-. +.. ....+.+++.. | .-..+...+....+.+|.||.++..+.+|.+ T Consensus 72 v~~~~~~l~g~p~~~~~~--~~~-----~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~ 139 (499) T protein:vir:10 72 TDMNVGFMTGNPVKYVAE--KGK-----NIDDILEVFNQ--I---DIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRD 139 (499) T ss_pred HHHHhhhhcccCceeecC--Chh-----HHHHHHHHHhh--c---CHhHHHHHHHHHHHhcCceEEEEEecccccccccc Confidence 888887777777764321 111 12335555432 2 2345678888999999999999888877743 Q ss_pred ------------eEEEEecCceeEEeecCce-------EEEEEEeC---Cce----EEecHhHEEEeecC---------- Q lcl|NC_019705. 143 ------------ISLLPLQSANMDVKLVGKK-------VVYRYQRD---SEY----AEFSQKEIFHLKGF---------- 186 (424) Q Consensus 143 ------------~~l~~l~~~~v~~~~~~~~-------~~~~~~~~---~~~----~~~~~~eiih~r~~---------- 186 (424) ..+..++|..+.+..++.. ..|.+..+ ... ..+.++.|.+++.. T Consensus 140 ~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~ 219 (499) T protein:vir:10 140 ELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDP 219 (499) T ss_pred cccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcce Confidence 3466777777765554321 11111111 111 12334444443210 Q ss_pred ------C-C---------CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH-HHHHHHHHHHH Q lcl|NC_019705. 187 ------G-F---------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKE 249 (424) Q Consensus 187 ------~-~---------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~-~~~~~~~~~~~ 249 (424) + + +...|.|-+..+...++....+.....+.+...+.|-.+++-. ...+.. ....+ T Consensus 220 ~~~~~~~~~g~vPvv~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~-~~~~~~~~~~~~------ 292 (499) T protein:vir:10 220 IVYDGENLFGAVPIIEFRNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGF-GLGDDKDDIQRL------ 292 (499) T ss_pred ecccccCCCCccceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC-ccccccchhhhh------ Confidence 0 0 1234666676666666665555555555556666676666522 211111 11111 Q ss_pred HhCCcccCccee--cCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHH----------H Q lcl|NC_019705. 250 IAGGPVKKRLWI--LEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------Q 317 (424) Q Consensus 250 ~~~~~~ag~~~~--l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e----------~ 317 (424) +.+++.. .++|.+++.+........+....+...+.|...-++|..-.... .++.++...+ . T Consensus 293 -----~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~gn~Sg~Al~~~~~~l~~k~~ 366 (499) T protein:vir:10 293 -----KRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKF-MGNVSGEAMKFKLFGLENLLS 366 (499) T ss_pred -----hhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhh-cccchHHHHHHHHHHHHHHHH Confidence 1122333 24555555555444444456677777888888888874211111 1222221111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Q lcl|NC_019705. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 318 ~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g 397 (424) .....+...+.-.++.+...++.+ ........+.+.+..-+..|..+.++.+.++ .|+++.--+.++++.-.-+. T Consensus 367 ~k~~~~~~~l~~~~~li~~~~~~~---~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~ 441 (499) T protein:vir:10 367 IKQRYFFDGLRRRLKLIQTIVNIK---GANDDASGCKISLVANIPSNLSDVVNNVKNA--DGIIPRKYTYSWLPDVDNPQ 441 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcc---CCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHH Confidence 111222333333333333222211 0111111233333444567788888988887 57898887777776522110 Q ss_pred --C--------------Ceeeecccccchhhccc----cCCCcccCC Q lcl|NC_019705. 398 --G--------------DVAMRQSQYVPITDLGT----NKEPRNNGA 424 (424) Q Consensus 398 --g--------------d~~~~~~n~~~~~~~~~----~~~~~~~ga 424 (424) . .+.....+..+.+.... +++..++++ T Consensus 442 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (499) T protein:vir:10 442 DVIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQDDSSENDKEAGS 488 (499) T ss_pred HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCcccCCCCCCCcc Confidence 0 01111111111111111 111111111 No 212 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=95.98 E-value=0.0012 Score=36.65 Aligned_cols=371 Identities=12% Similarity=0.062 Sum_probs=166.7 Q ss_pred CCCC---cccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccC Q lcl|NC_019705. 1 MEEP---KYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACL 77 (424) Q Consensus 1 ~~~~---~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~ 77 (424) |... ++.=.++++.--+.++..+..+.-..-..... ..... ..-+.++....+|+..++-+-+- T Consensus 1 l~~~~l~~~i~~~~~~~~r~~~l~~yy~g~~~il~~~~~----------~~~~~---~~ki~~n~~~~ivd~~~~~l~g~ 67 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFNLSYSAYKQLYEGDHAILQQKQK----------EQYKP---DNRLVVNFAKYIVDTFNGYFIGV 67 (429) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc----------ccCCC---cceeecchHHHHHHHHhhhhccc Confidence 1110 00001223333344444444332110000000 00000 01133566777888888888888 Q ss_pred ceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeEEee Q lcl|NC_019705. 78 PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKL 157 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~~~~ 157 (424) |+.+.-. +. .....+..++. . |. -......+..+++.+|.||+.+..+.+|.+ .+..++|..+.+.. T Consensus 68 ~~~~~~~--~~-----~~~~~l~~~~~-~-n~---~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~~~~~~p~~~~~v~ 134 (429) T protein:vir:98 68 PVQTSHE--NK-----QVSNYLELLDG-Y-ND---QDDNNAELSKICSIYGHGYELVFNDENAEA-GITYLTPLEAFIVY 134 (429) T ss_pred CceeecC--Ch-----HHHHHHHHHHh-h-cC---HhHHHHHHHHHHhhcCeEEEEEEecCCCcE-EEEEEcccceEEEE Confidence 8765321 11 11123444443 2 22 245667888999999999999999888876 46678888887665 Q ss_pred cCc---eEE--EEE-E-eCCce-EEecHhH--------------------------EEEeecCCCCCcccCchHHHHHHH Q lcl|NC_019705. 158 VGK---KVV--YRY-Q-RDSEY-AEFSQKE--------------------------IFHLKGFGFTGLVGLSPIAFACKS 203 (424) Q Consensus 158 ~~~---~~~--~~~-~-~~~~~-~~~~~~e--------------------------iih~r~~~~~~~~G~s~i~~~~~~ 203 (424) ++. ... .+| . .+... ..+...+ |++++ +...|.|-+..+... T Consensus 135 dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~----n~~~g~sd~e~v~~l 210 (429) T protein:vir:98 135 DDSIRQKPLFAVRYFYNKGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMIEYV----ENEERQSLLASVVTL 210 (429) T ss_pred eCCCCCceEEEEEEEEecCceEEEEEEeCceEEEEEecCCceEecccccccCCccceEEec----CCCCCCCcHHHHHHH Confidence 531 111 011 1 01000 0011111 22222 234677777777777 Q ss_pred HHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCC----ceeeecccChhHHH Q lcl|NC_019705. 204 AGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAG----FSTSAIGVTPQDAE 279 (424) Q Consensus 204 i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g----~~~~~l~~~~~d~~ 279 (424) ++....+.....+.+...+.|-.+++- .. .+++....+ ..++++.++.+ .+...+..+..... T Consensus 211 iD~~d~~~s~~~~~~~~~~~p~~~i~g-~~-~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 277 (429) T protein:vir:98 211 INAFNKAISEKANDVEYFADAYLKILG-AE-LDDETLKSL-----------RDTRIINLKDTDAQQLTVEFLQKPDADAT 277 (429) T ss_pred HHHHHHHHHHHHHHHHHhcCceeeeec-CC-CCcchhhhH-----------hhCceeeccCCCCCCcceeEEeecCCHHH Confidence 776666666666556666767766652 22 232222111 11234444321 23333333333343 Q ss_pred HHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHH----------HHHHHHHHHHHHHHHHHHhhccChhhh- Q lcl|NC_019705. 280 MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG----------FLQYTLQPYISRWENSIQRWLIPAKDV- 348 (424) Q Consensus 280 ~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~----------~~~~tl~P~~~~ie~~l~~~l~~~~~~- 348 (424) +....+...+.|+..-++|..-.. .-++.++...+.+... .+...+.-.++.+..- +...+. T Consensus 278 ~~~~~~~l~~~i~~~s~~p~~~~~--~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~-----~~~~~~~ 350 (429) T protein:vir:98 278 QEHLLDRLENLIFRTAMVANISDE--SFGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASY-----PTSKIGP 350 (429) T ss_pred HHHHHHHHHHHHHHHhCccccCcc--ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hccCCCc Confidence 556678889999999999843221 1123222222111111 1111122222111111 111111 Q ss_pred -cccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeeccccc-chh----hccccCCCccc Q lcl|NC_019705. 349 -GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYV-PIT----DLGTNKEPRNN 422 (424) Q Consensus 349 -~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~-~~~----~~~~~~~~~~~ 422 (424) ....+.+.+...+..|..+.++.+.++ .|+++..-+.++++.-+-|..+--.+..... .++ ....++.+.+ T Consensus 351 ~d~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~is~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~- 427 (429) T protein:vir:98 351 KDWIGIKYKFTRNLPANLLEESQIAGNL--AGIVSEETQVGVLSIVENPQKEIERKNSDKSTLISRQAGGLNGQNTTTI- 427 (429) T ss_pred cccccceEEeCCCCCcCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhcCCCCCCC- Confidence 112234444566677888888888887 5789887787788663321110000000000 000 0000000000 Q ss_pred CC Q lcl|NC_019705. 423 GA 424 (424) Q Consensus 423 ga 424 (424) .- T Consensus 428 ~~ 429 (429) T protein:vir:98 428 LE 429 (429) T ss_pred CC Confidence 00 No 213 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=95.43 E-value=0.0021 Score=35.27 Aligned_cols=379 Identities=7% Similarity=-0.040 Sum_probs=155.7 Q ss_pred CCCCcccccCCCC------------CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTN------------NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVS 68 (424) Q Consensus 1 ~~~~~~~~~~~~~------------~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~ 68 (424) .|++++...+..+ .--...+..+..+............ ......... +..=+.++....+++ T Consensus 17 ~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~---~~~~~~~~~---~~~ki~~n~~~~Iv~ 90 (468) T protein:vir:96 17 VEQIKPQYETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRN---VKGEIDPFK---PDWRMYTNYHQNLVD 90 (468) T ss_pred eecccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccc---ccccccccc---cccccccchHHHHHH Confidence 3444333322222 1222222222222211000000000 000000000 011123455666777 Q ss_pred HHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEe Q lcl|NC_019705. 69 LISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPL 148 (424) Q Consensus 69 ~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l 148 (424) ..++-+-+-|+.+-- .+. .....+..++. | +.......+..+++.+|.+|+.+..+.+|.+ .+..+ T Consensus 91 ~~~~~l~g~p~~~~~--~d~-----~~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~ 156 (468) T protein:vir:96 91 QKVAYAVANPVTYGT--EDE-----KSLKTIQEVLN---H---KWDDKLVDILTAASNKGVEWIQPYVDEQGEF-KTFRV 156 (468) T ss_pred HHHhhhccCCceecc--CCh-----HHHHHHHHHHh---c---CHHHHHHHHHHHHhhcCeEEEEEEEcCCCce-EEEEE Confidence 777777777776521 111 11233444443 2 2345567788999999999999888888865 46777 Q ss_pred cCceeEEeecCc---eE---EEEEEeCC--ceEEecHhHEEEeecC-------------------------C-------- Q lcl|NC_019705. 149 QSANMDVKLVGK---KV---VYRYQRDS--EYAEFSQKEIFHLKGF-------------------------G-------- 187 (424) Q Consensus 149 ~~~~v~~~~~~~---~~---~~~~~~~~--~~~~~~~~eiih~r~~-------------------------~-------- 187 (424) +|..+.+..++. .. .+.|...+ ....+.++.+.+++.. + T Consensus 157 ~p~~~~~v~~~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv 236 (468) T protein:vir:96 157 PAEQAIPIWTNKERDELKAFIRLYELDGGERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFI 236 (468) T ss_pred cccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCceeecccccccccccceeeccccccCCcccEE Confidence 888877665432 11 11111111 1111122222222110 0 Q ss_pred --CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-- Q lcl|NC_019705. 188 --FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-- 263 (424) Q Consensus 188 --~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-- 263 (424) .+...|.|-+..+...++....+.....+.+...+.|-.+++-- ...+.+ ...... ..++++.++ T Consensus 237 ~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~-~~~~~~---~~~~~~-------~~~~~i~~~~d 305 (468) T protein:vir:96 237 PFKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGY-EGEDLE---EFMYNL-------KYYKAINVDGD 305 (468) T ss_pred EecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC-Cccccc---hhhhhh-------hcCceEEecCC Confidence 02235777666666666655555554555555556676555422 111111 111111 122344443 Q ss_pred CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHH----------HHHHHHHHHHHHHH Q lcl|NC_019705. 264 AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTLQPYISR 333 (424) Q Consensus 264 ~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~----------~~~~~~tl~P~~~~ 333 (424) ++.+.+.+........+....+...+.|...-++|..-.. ...++.++...+... ...+...++-.++. T Consensus 306 ~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~l 384 (468) T protein:vir:96 306 GSGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQD-KFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQY 384 (468) T ss_pred CCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccccc-ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3333443443333444567788888889999898853211 111222222211111 11112222222222 Q ss_pred HHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccc---cch Q lcl|NC_019705. 334 WENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQY---VPI 410 (424) Q Consensus 334 ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~---~~~ 410 (424) |...+. ...+.....+.|+ .-+..|..+.+ ..+.+.|+++.-.+.+.++.-.-|..+.-.+...- ... T Consensus 385 i~~~~g----~~~d~~~i~i~f~--~~~p~d~~e~a---~~~~~~g~iS~et~i~~l~~v~D~~~E~~ri~~E~~~~~~~ 455 (468) T protein:vir:96 385 IIDFYK----LSIKVQDVEITFN--FNVMVNELEQS---QIGVNSQYLSKETVVTNHPWVDDPVAEMERIDQEELALPSI 455 (468) T ss_pred HHHHhC----CCcccceeeEEec--CCCCcCHHHHH---HHHHhcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Confidence 211111 0111112233343 33334444444 45566799998888877654221110000000000 000 Q ss_pred hh---ccccCCCc Q lcl|NC_019705. 411 TD---LGTNKEPR 420 (424) Q Consensus 411 ~~---~~~~~~~~ 420 (424) .. .+++.+|. T Consensus 456 ~~~~~~~~~~~~~ 468 (468) T protein:vir:96 456 EEGLNGKENNEPT 468 (468) T ss_pred hhccCCCCCCCCC Confidence 00 01111222 No 214 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=95.29 E-value=0.0024 Score=34.99 Aligned_cols=371 Identities=10% Similarity=0.063 Sum_probs=167.6 Q ss_pred ccCCCCCchHHHH--------------HhhccCcccCCccccchhhcccccccc--CcccccHHHHhccHHHHHHHHHHH Q lcl|NC_019705. 8 IDLRTNNGWWARL--------------QSWFVGGRLVTPNQGSQTGPVSAHGHL--GDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 8 ~~~~~~~G~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~vs~~~~~~~~~v~~~i~~ia 71 (424) |++.+=.-+.++. .....+....-. .............. .+.+ . .=+.++.....|+..+ T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~-~~~~~~~~~~~~~~~~~~~~-~--~ki~~n~~k~Iv~~~~ 76 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITT-RNNGKAKLNKEGKKDPLRSA-D--NRIPSNFYQLLVDQEA 76 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhc-cccchhcccccccccccccC-C--cccccchHHHHHHhhh Confidence 4444433333322 222222110000 00000000000000 0000 0 0122445556777777 Q ss_pred HhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCc Q lcl|NC_019705. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSA 151 (424) Q Consensus 72 ~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~ 151 (424) +-+-+-|+.+--.+ .. ....+...+.. +..+-...+..++..+|.||.++..+.+|.+ .+..++|. T Consensus 77 ~yl~G~p~~~~~~d--~~-----~~~~l~~~~~~------~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~-~~~~~~p~ 142 (470) T protein:vir:10 77 GYVASVFPDIDVGK--DA-----DNKKIIDVLGD------DRALTLNGLLVDSSNAGRAWLHYWIDEDGNF-RYGIIQPD 142 (470) T ss_pred hheeccceeeecCc--hH-----HHHHHHHHHhh------hHHHHHHHHHHHHhhcCeeEEEEEecCCCce-EEEEEccc Confidence 77778887753221 11 12335555542 2334445678899999999999999988875 46778888 Q ss_pred eeEEeecCce---E-----EEEEEeC-C-ce----EEecHhHEEEeecCC------------------------------ Q lcl|NC_019705. 152 NMDVKLVGKK---V-----VYRYQRD-S-EY----AEFSQKEIFHLKGFG------------------------------ 187 (424) Q Consensus 152 ~v~~~~~~~~---~-----~~~~~~~-~-~~----~~~~~~eiih~r~~~------------------------------ 187 (424) .+.+..++.. . +|..... + .. ..+.+..+.+++... T Consensus 143 ~~~~v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (470) T protein:vir:10 143 QITPIYATTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLK 222 (470) T ss_pred ceEEEEcCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccc Confidence 8887765431 1 1111111 0 00 112233333332100 Q ss_pred -----------CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCccc Q lcl|NC_019705. 188 -----------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVK 256 (424) Q Consensus 188 -----------~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~a 256 (424) .+...|.|-+......++....+.....+.+...+.|-.++.--.....++... .+.. T Consensus 223 ~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~----~~~~------- 291 (470) T protein:vir:10 223 HNFGRVPFIEFSKNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMN----DLRK------- 291 (470) T ss_pred cCCCeeeEEEeecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhh----hhhh------- Confidence 012357777777777777666666555656666666766665322211122211 1111 Q ss_pred CcceecC-------CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHH----------H Q lcl|NC_019705. 257 KRLWILE-------AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------N 319 (424) Q Consensus 257 g~~~~l~-------~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~----------~ 319 (424) .+.+.++ +++++..-..+ ...+....+...+.|...-++|.. .....++.|+....-. . T Consensus 292 ~~~i~~~~~~~~~~~~~~~lt~~~~--~~~~~~~~~~L~~~I~~~s~~p~~--~~~~~gn~Sg~Alk~~~~~l~~k~~~~ 367 (470) T protein:vir:10 292 YKSIKINNTGNGDNSGVDKLQIDIP--VEARDDALKITRKNIFLFGQGIDP--ANFESSNASGVAIKMLYSHLELKAAKT 367 (470) T ss_pred cCeEeccCCCCCcCceeEEEeecCC--hHHHHHHHHHHHHHHHHHhCCCCC--CccccccchHHHHHHHHHHHHHHHHHH Confidence 1223332 23455444443 333456777788888888888842 2222233332222111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC Q lcl|NC_019705. 320 LGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGD 399 (424) Q Consensus 320 ~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd 399 (424) ...+..+|+-.++.|...++ ..+.....+.+.+...+..|..+.++.+.++ .|+++.--+.+.+++-. +-+ T Consensus 368 ~~~~~~~l~~~~~~i~~~l~-----~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~--~g~iS~et~l~~~p~v~--D~~ 438 (470) T protein:vir:10 368 QTYFEHAINELVRAIMRYLN-----FSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVD--DWQ 438 (470) T ss_pred HHHHHHHHHHHHHHHHHHhc-----ccCcccceeeEEeccCCCCCHHHHHHHHHHH--hccCcHHHHHHhCCCCC--CHH Confidence 22223333333333332222 1122223345555666778888889888887 58999888887776522 111 Q ss_pred eeee----------cccccchhhccccCCCcccCC Q lcl|NC_019705. 400 VAMR----------QSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 400 ~~~~----------~~n~~~~~~~~~~~~~~~~ga 424 (424) .-+. +.+ ...++. ++...+.. T Consensus 439 ~E~eri~~E~~e~~~~~-~~~~~~---~~~~~dde 469 (470) T protein:vir:10 439 QELKDLAKDKEENDPYS-NQADEL---NGKGVNDE 469 (470) T ss_pred HHHHHHHHHHHHHHHhh-cccccc---CCCCCCCC Confidence 1110 000 000000 00000000 No 215 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=95.23 E-value=0.0025 Score=34.86 Aligned_cols=396 Identities=11% Similarity=0.055 Sum_probs=160.1 Q ss_pred CCCCcccccCCCC--------------------CchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhcc Q lcl|NC_019705. 1 MEEPKYTIDLRTN--------------------NGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQI 60 (424) Q Consensus 1 ~~~~~~~~~~~~~--------------------~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~ 60 (424) -..-.-++.++.+ +--++++..++.+............ . ..+. +..-+.. T Consensus 6 ~~~~~~~~~~~~~~~~l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~------~-~~~~---~~~ki~~ 75 (506) T protein:vir:94 6 TEHKQANLIYQESLENLTPNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQSRR------H-EDGK---ADHRATH 75 (506) T ss_pred hhhhcceeecccchhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccc------c-cccC---Ccceeec Confidence 0000000111111 0112333333332211000000000 0 0000 0011335 Q ss_pred HHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019705. 61 STVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G 140 (424) +.....|+..++-+-+-|+.+.-. ++. ....+.+++.. | ........+..+++.+|.||+.+..+.+| T Consensus 76 n~~~~Iv~~~~~~l~G~p~~~~~~--d~~-----~~~~l~~~~~~--N---~~~~~~~~~~~~~~~~G~a~~~v~~ded~ 143 (506) T protein:vir:94 76 SFAKYIADFQTSYSVGNPINVKLP--DDG-----SNSGFDTFNKA--N---DVDAENYDLFLDMSRYGRAYEYVYRGEDN 143 (506) T ss_pred chHHHHHHHhhhhhcccCceeecC--cch-----HHHHHHHHHhc--c---CHhHHHHHHHHHHHhcCeEEEEEEecCCC Confidence 567778888888877777765321 111 12334444432 2 23456677888899999999999888888 Q ss_pred ceeEEEEecCceeEEeecCce---EE---EEEE---eCCc--------eEEecHhHEEEeec-----------------C Q lcl|NC_019705. 141 DVISLLPLQSANMDVKLVGKK---VV---YRYQ---RDSE--------YAEFSQKEIFHLKG-----------------F 186 (424) Q Consensus 141 ~~~~l~~l~~~~v~~~~~~~~---~~---~~~~---~~~~--------~~~~~~~eiih~r~-----------------~ 186 (424) .+ .+..++|..+.+..++.. .. +.|. ..+. ...+.+..+.++.. . T Consensus 144 ~~-~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~v 222 (506) T protein:vir:94 144 EE-HLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMGKMQVDTTKPITTF 222 (506) T ss_pred ee-EEEEEcccceEEEecCCCCCceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCccceeccccccCCcc Confidence 65 466688888877665321 11 0010 0000 00112222211110 0 Q ss_pred C----CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCC---------------------CCCHHHHH Q lcl|NC_019705. 187 G----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK---------------------VLTEQQRS 241 (424) Q Consensus 187 ~----~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~---------------------~~~~~~~~ 241 (424) + .+.-.|.|.+......++....+.....+.....+.|-.+++-... .......+ T Consensus 223 Pvv~~~n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (506) T protein:vir:94 223 PVVEFKNSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLE 302 (506) T ss_pred ceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhH Confidence 0 0112355555554444443333332222222222222222221000 00111111 Q ss_pred HHHHHHHH-HhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHH--- Q lcl|NC_019705. 242 QVEENFKE-IAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ--- 317 (424) Q Consensus 242 ~~~~~~~~-~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~--- 317 (424) .++..... .......+.+.....+.+++-+..+.....+....+.....|...-++|..-.... .++.++..... T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Aik~~~~ 381 (506) T protein:vir:94 303 LIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENF-ASNSSGVAMQYKVL 381 (506) T ss_pred HHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccc-cccchHHHHHHHHH Confidence 11111111 11111112222333344555555555556667788889999999999996322111 12333222111 Q ss_pred -------HHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh Q lcl|NC_019705. 318 -------QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTD 390 (424) Q Consensus 318 -------~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~ 390 (424) .....+...++..++.|...++.. -.........+.+.+..-+..|..+.++.+.++ .|+++...+++++ T Consensus 382 ~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~-~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l 458 (506) T protein:vir:94 382 GTVELASTKRRMFERGLYARYQIISDIENSI-HGDWTFDPQELTFTFRDNLPADNISQIKALVQA--GATLPQKYLYQQL 458 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC Confidence 122233333444443333333211 000111112234444555667888888888887 4899999999887 Q ss_pred CCCCCCCC--Ceeee-----cccccchhhccccCCCcccCC Q lcl|NC_019705. 391 NLPPLPGG--DVAMR-----QSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 391 g~~p~~gg--d~~~~-----~~n~~~~~~~~~~~~~~~~ga 424 (424) +.-.-|.- +..-. ..........++.++. +..+ T Consensus 459 p~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~-~~~~ 498 (506) T protein:vir:94 459 PGVTNPQDIVDMMKEQSANGDYSFDQNGVISNDGQT-NTTA 498 (506) T ss_pred CCCCCHHHHHHHHHHHHHHHhhcchhhcCCCcccCc-cccc Confidence 65331110 00000 0000000000111111 1111 No 216 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=95.16 E-value=0.0026 Score=34.73 Aligned_cols=383 Identities=10% Similarity=0.057 Sum_probs=165.9 Q ss_pred CCCCcccccCCCCC----chHHHHHhhccCcccCCccccchhhcccccc-------ccCcccccHHHHhccHHHHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNN----GWWARLQSWFVGGRLVTPNQGSQTGPVSAHG-------HLGDSSINDERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~~~~~~~~~~~~~----G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~vs~~~~~~~~~v~~~i~~ 69 (424) .+.|++ +.|-.+- -.+.++....... .+.......-+.+.. ...... ..+ +..+....+|+. T Consensus 3 ~~~~~~-~~~~~~~~~~~~~i~~~i~~~~~~---~~r~~~~~~Yy~g~~~i~~~~~~~~~~~-~~k--i~~n~~~~ivd~ 75 (452) T protein:vir:36 3 YKPPKL-MTFSKDEPITVEVVTKFMEKHKLE---VARYEYLKNMYLGIMAIDDEPAKDSWKP-DNR--LAVNFTKYIVDT 75 (452) T ss_pred ccCcee-EEcCCccCCCHHHHHHHHHHHHHH---HHHHHHHHHHhccccccccCccccccCc-cce--eecchHHHHHHH Confidence 223332 2222222 2233332222110 000000000000000 000000 001 234566677888 Q ss_pred HHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEec Q lcl|NC_019705. 70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQ 149 (424) Q Consensus 70 ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~ 149 (424) .++-+-+-|+.+--. +.. ....+.+++.. | ........+..+.+.+|.||..+..+.+|.+ .+..++ T Consensus 76 ~~~~l~g~~~~~~~~--d~~-----~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~ 142 (452) T protein:vir:36 76 FTGYFNGIPVKKSHS--DKE-----ILTKLQEFDNL--N---DMEDEESELAKMACIYGRAFEFLYQDEDTQT-NVVYNS 142 (452) T ss_pred HhhhhcccCceeecC--Chh-----HHHHHHHHHhh--c---ChhHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEc Confidence 888777777765321 111 12234444432 2 2345667788999999999999988888876 467788 Q ss_pred CceeEEeecCce---EE--EEEEe-CCce---EEecHhHEEEeecC-----------------C----CCCcccCchHHH Q lcl|NC_019705. 150 SANMDVKLVGKK---VV--YRYQR-DSEY---AEFSQKEIFHLKGF-----------------G----FTGLVGLSPIAF 199 (424) Q Consensus 150 ~~~v~~~~~~~~---~~--~~~~~-~~~~---~~~~~~eiih~r~~-----------------~----~~~~~G~s~i~~ 199 (424) |..+.+..++.. .. .+|.. .... ..+.++.++++..- + .+...|.|-+.. T Consensus 143 p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~ 222 (452) T protein:vir:36 143 PENMFMVYDDTVKQEPLFAVRYGVDEDKKLQGEVYTLLETIKISGENDEISFGEGTYNPYPDLPVVEFYFNEERMSIFES 222 (452) T ss_pred ccceEEEEcCCCCCceEEEEEEEEecCceEEEEEEecCeEEEEEEcCCceEEecceeccCCcccEEEecCCCCCCcchHH Confidence 888877665421 11 11111 1101 11223333222110 0 012357676766 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC-----CceeeecccC Q lcl|NC_019705. 200 ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-----GFSTSAIGVT 274 (424) Q Consensus 200 ~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~-----g~~~~~l~~~ 274 (424) ....++....+.....+.+...+.|-.++. +.. .+++....++. ++++.++. +.++.-+..+ T Consensus 223 v~~liDa~d~~~s~~~~~~~~~~~p~~~~~-g~~-~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~l~~~ 289 (452) T protein:vir:36 223 VISLVNAFNKAISEKANDVDYFSDQYLTFL-GAA-VEEEDLKNIRS-----------NRVINYYADGEGKNVDVKFLEKP 289 (452) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeEee-cCC-cCchhhhhhhh-----------cceEEecCCCCccCCcceeEeec Confidence 666666555555555555556666766554 222 23322222111 12333322 1233333333 Q ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHH----------HHHHHHHHHHHHHHHHHHHHHhhccC Q lcl|NC_019705. 275 PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISRWENSIQRWLIP 344 (424) Q Consensus 275 ~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~----------~~~~~~~tl~P~~~~ie~~l~~~l~~ 344 (424) .....+....+...+.|+..-++|.. +....++.++...+.. ....+...+...++.|..-+..+ -. T Consensus 290 ~~~~~~~~~~~~l~~~I~~~s~~p~~--~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-~~ 366 (452) T protein:vir:36 290 DSDSQTENLLDRLTKLIFQTTMVANI--SDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNV-SN 366 (452) T ss_pred CCHHHHHHHHHHHHHHHHHHhCcccc--CcccccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-CC Confidence 34455567788888999999999852 2222233332222111 11222333333333332222211 01 Q ss_pred hhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC--CCeeee--------cccccchhhcc Q lcl|NC_019705. 345 AKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG--GDVAMR--------QSQYVPITDLG 414 (424) Q Consensus 345 ~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g--gd~~~~--------~~n~~~~~~~~ 414 (424) ..+ ...+.+.+..-+..|..+.++.+.++ .|+++..-+.++++.-.-+. .+..-. ..+..+ +..+ T Consensus 367 ~~~--~~~i~i~f~~~~p~d~~~~a~~~~k~--~g~iS~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~-~~~~ 441 (452) T protein:vir:36 367 KDS--WKDIEYTFTRNEPKDIKEQAETANIL--MGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQP-SEKG 441 (452) T ss_pred ccc--cccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccC-CCCc Confidence 111 11233333455667888888888887 57899877887776532111 000000 000000 0000 Q ss_pred ccCCCcccCC Q lcl|NC_019705. 415 TNKEPRNNGA 424 (424) Q Consensus 415 ~~~~~~~~ga 424 (424) .+++..++.- T Consensus 442 ~~~~~~~~~~ 451 (452) T protein:vir:36 442 TDTVVSETNE 451 (452) T ss_pred ccccCccccC Confidence 1111111111 No 217 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=94.22 E-value=0.005 Score=33.18 Aligned_cols=396 Identities=10% Similarity=0.021 Sum_probs=171.3 Q ss_pred CCCCcccccCCCCCchHHHHHhh-ccCcccCC---c------cccchhhccccccccC---cccccHHHHhccHHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSW-FVGGRLVT---P------NQGSQTGPVSAHGHLG---DSSINDERILQISTVWRCV 67 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~-~~~~~~~~---~------~~~~~~~~~~~~~~~~---~~~vs~~~~~~~~~v~~~i 67 (424) |-.|-.+|++-.-..++..-... +....... . .......+........ .....+..=+.+.-..-.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Iv 80 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELV 80 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHH Confidence 77777777775555665443222 11000000 0 0000000000000000 0000000012233445577 Q ss_pred HHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEE Q lcl|NC_019705. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..++-+-+-|+++--.+ ++ ...+...|+.--+ .........+..++..+|.||.++-.+.+|.+ .+.. T Consensus 81 d~~~~yl~G~Pv~~~~~d-~~-------~~e~~~~l~~~~~--~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~-~~~~ 149 (537) T protein:vir:78 81 DQLAQYLLSNGVEVKVKD-ED-------NTQLDEILQEYFD--EDFQATIDTLVTNASKKGFEGIFARTTSEGKL-KFQT 149 (537) T ss_pred HHHhhhhcccCceeecCc-ch-------hHHHHHHHHHHhh--ccHHHHHHHHHHHHhhcCeeEEEeeecCCCce-EEEE Confidence 777777778888753221 11 1223344432211 12234556778899999999999988888865 4677 Q ss_pred ecCceeEEeecCceE------EEEE-EeC-----Cc----eEEecHhHEEEeecCC------------------------ Q lcl|NC_019705. 148 LQSANMDVKLVGKKV------VYRY-QRD-----SE----YAEFSQKEIFHLKGFG------------------------ 187 (424) Q Consensus 148 l~~~~v~~~~~~~~~------~~~~-~~~-----~~----~~~~~~~eiih~r~~~------------------------ 187 (424) ++|..+-+..++... +|.. ... .. ...+.++.|.+.+... T Consensus 150 i~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 229 (537) T protein:vir:78 150 VDGLTLIPVFDDYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAI 229 (537) T ss_pred EccceeEEEEcCCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeec Confidence 788777666653211 1110 000 00 0123334443332100 Q ss_pred --------------------C---------CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH Q lcl|NC_019705. 188 --------------------F---------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 188 --------------------~---------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~ 238 (424) + +.-.|+|-+......++....+....++.+..-+.|-.++.--.....++ T Consensus 230 ~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~ 309 (537) T protein:vir:78 230 EESTDADFEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDK 309 (537) T ss_pred cccccccccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchh Confidence 0 12247777777777777666666666666666566655554222111222 Q ss_pred HHHHHHHHHHHHhCCcccCcceecC-CC--ceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhH Q lcl|NC_019705. 239 QRSQVEENFKEIAGGPVKKRLWILE-AG--FSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGI 315 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~l~-~g--~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~ 315 (424) .+..+ .. .+++.++ .+ ++|.....+ +.......+...+.|...-.+|. ......|+.|+... T Consensus 310 ~~~~l----~~-------~~~i~v~~d~~~v~~l~~~~~--~~~~e~~ld~L~~~I~~~s~~~~--~~~~~~gn~SGvAl 374 (537) T protein:vir:78 310 LRQNI----KA-------KKMIGVNGDNAGMEIQTVSIP--YEARKAKMDIDVENIYRSGMGFN--STAVGDGNVTNVVI 374 (537) T ss_pred HHHHH----hh-------cCceeecCCCCceeEEEecCC--HHHHHHHHHHHHHHHHHhcCCCC--CccccccCCcHHHH Confidence 22211 11 1233332 23 445443332 22223345555555555444432 12222233332221 Q ss_pred H----------HHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHH Q lcl|NC_019705. 316 E----------QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINE 385 (424) Q Consensus 316 e----------~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE 385 (424) . .....++...|+-.++.|...++.+-....+ ...+.+.+..-+..|..+.++.+.++++.|+++..- T Consensus 375 k~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~~d--~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT 452 (537) T protein:vir:78 375 KSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGEYD--SNDICFEIEPHVLANELDIATTRKTEAETEALKIGN 452 (537) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccc--cceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHH Confidence 1 1122333444444444444444332111112 223344445556788888999999999999999887 Q ss_pred HHHHhCCCCCC-------------------C-CCeeeecccccch--------hhcccc--CCCcccCC Q lcl|NC_019705. 386 MRRTDNLPPLP-------------------G-GDVAMRQSQYVPI--------TDLGTN--KEPRNNGA 424 (424) Q Consensus 386 ~R~~~g~~p~~-------------------g-gd~~~~~~n~~~~--------~~~~~~--~~~~~~ga 424 (424) +.+.+++-.-+ . .+.-.......|- ....++ -.+.+..| T Consensus 453 ~l~~~p~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 521 (537) T protein:vir:78 453 IMTVAPRIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVA 521 (537) T ss_pred HHHhCCCCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCC Confidence 77665441110 0 0000000000000 000000 00111111 No 218 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=94.14 E-value=0.0053 Score=33.07 Aligned_cols=402 Identities=14% Similarity=0.140 Sum_probs=172.1 Q ss_pred HHHHHhhccC--------cccCCccccchhhcccccc------ccCccc------c-cHHHHhccHHHHHHHHHHHHhhc Q lcl|NC_019705. 17 WARLQSWFVG--------GRLVTPNQGSQTGPVSAHG------HLGDSS------I-NDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 17 ~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~------~~~~~~------v-s~~~~~~~~~v~~~i~~ia~~ia 75 (424) +..|+++--+ .+.++|.......++...+ ...+.. | .-+..+.+|.|..||+.|.+.+. T Consensus 1 m~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai 80 (533) T protein:vir:10 1 MSQLFGFSLERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRYREMVLQPECDSAVDDIVNETI 80 (533) T ss_pred CccccccccccccccccCCCCCCCCcccccceeecccccceeeecccccchHHHHHHHHHHHhhccchhhHHHHhhccee Confidence 3333332111 1111111111111111110 111111 1 12556779999999999999876 Q ss_pred cC-----ceEEEEec-cCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC---CCceeEEE Q lcl|NC_019705. 76 CL-----PLDVFETD-QNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS---AGDVISLL 146 (424) Q Consensus 76 ~~-----~~~v~~~~-~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~---~G~~~~l~ 146 (424) -. |+.+--.+ +-.............++|+. ++...--+.++..|...|..|..++-+. ..-+.+|. T Consensus 81 v~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~l-----l~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr 155 (533) T protein:vir:10 81 CGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRL-----LDFENRSYEIFRRWYVDGRLFYHKVIDPDNPQGGLIELR 155 (533) T ss_pred eecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-----hccchhhhHHHhhhhhcceEEEEEEecCCCccccceeee Confidence 43 22221111 00000000111222333321 1122222455677788899998876543 34689999 Q ss_pred EecCceeEEeec------Cc--------------eEEEEEEeC------CceEEecHhHEEEee--cCCCCCcccCchHH Q lcl|NC_019705. 147 PLQSANMDVKLV------GK--------------KVVYRYQRD------SEYAEFSQKEIFHLK--GFGFTGLVGLSPIA 198 (424) Q Consensus 147 ~l~~~~v~~~~~------~~--------------~~~~~~~~~------~~~~~~~~~eiih~r--~~~~~~~~G~s~i~ 198 (424) .|+|.+|+..+. ++ ..+|.|.+. .....++.+-|.... ....++-.-+|-+. T Consensus 156 ~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~~~~~~vkI~~dAI~y~hSGl~d~~~~~i~syLh 235 (533) T protein:vir:10 156 YIDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKNSTTQGLKIAPDSICYVHSGIMDLNKNMTLSHLH 235 (533) T ss_pred eccccceeeeeeeeccCCCccceeecchhhhccceeeeeeccccccccCCCceecchhheeeeeccceeCCCCceeccch Confidence 999999885321 11 112334332 334567775554433 12334445567788 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH-HHHHHHHHHHHHhC--------Cc--ccCccee-c---- Q lcl|NC_019705. 199 FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKEIAG--------GP--VKKRLWI-L---- 262 (424) Q Consensus 199 ~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~-~~~~~~~~~~~~~~--------~~--~ag~~~~-l---- 262 (424) .+.+.+.....++....-+--.-+.-+-|+.++.+.++.. +.+-++....++.. |. +..+.+. | T Consensus 236 kAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyW 315 (533) T protein:vir:10 236 KAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFW 315 (533) T ss_pred HhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhc Confidence 8888877777776666555444555566777776665543 33444554444321 10 1111221 1 Q ss_pred ------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchh-H---HHHHHHHHHHHHHHHHH Q lcl|NC_019705. 263 ------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSG-I---EQQNLGFLQYTLQPYIS 332 (424) Q Consensus 263 ------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n-~---e~~~~~~~~~tl~P~~~ 332 (424) ..|.++..|.-. ..+.-++-.++..+.+.++++||.+-|....+-+...++ + |-....|+..-=.-+.. T Consensus 316 LPRReGgrgTEItTLpGg-qnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~ 394 (533) T protein:vir:10 316 LPRREGGRGTEITTLPGG-QNLGELEDVKYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFVARLRKRFSE 394 (533) T ss_pred ccccCCCCccceeecccc-CCcChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHH Confidence 135566655432 223334456677889999999999988654332221111 0 11111222222223444 Q ss_pred HHHHHHHhhccC-----hhhhcc--cch--hhhhhhh----hccC-HHHHHHHHHHHH--hCCCCCHHHHHH-HhCCCCC Q lcl|NC_019705. 333 RWENSIQRWLIP-----AKDVGR--IHA--EHNLDGL----LRGD-SASRAAFMKAMG--EAGLRTINEMRR-TDNLPPL 395 (424) Q Consensus 333 ~ie~~l~~~l~~-----~~~~~~--~~~--~fd~~~l----~~~d-~~~~~~~~~~~~--~~g~~t~NE~R~-~~g~~p~ 395 (424) .|.+-|..+|+. +.+... ..+ .|..|.. .... ...|...+..+- -+-.++.+=+|+ .|.+.-. T Consensus 395 lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDe 474 (533) T protein:vir:10 395 LFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDV 474 (533) T ss_pred HHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHH Confidence 445555555443 233221 222 2222222 1111 122333333321 112445544443 2332110 Q ss_pred ----------CCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 396 ----------PGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 396 ----------~ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) .+.+....+--..+.+......+|..+|+ T Consensus 475 ei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~ 513 (533) T protein:vir:10 475 EMKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAGGA 513 (533) T ss_pred HHHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCc Confidence 00011111100011111111112222222 No 219 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=92.82 E-value=0.0098 Score=31.59 Aligned_cols=369 Identities=8% Similarity=-0.001 Sum_probs=161.2 Q ss_pred ccCCCCCc--------------hHHHHHhhccCcccCCccccch-h----hccccccccCcccccHHHHhccHHHHHHHH Q lcl|NC_019705. 8 IDLRTNNG--------------WWARLQSWFVGGRLVTPNQGSQ-T----GPVSAHGHLGDSSINDERILQISTVWRCVS 68 (424) Q Consensus 8 ~~~~~~~G--------------~~~~~~~~~~~~~~~~~~~~~~-~----~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~ 68 (424) |++..-.= .+.++..+..+........... . ............ +..-+.++....+|+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~---~~~ki~~n~~~~Ivd 77 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRN---ADNRISHNWHQLLLD 77 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhccccccccccccccc---ccceeccchhHHHHH Confidence 44443332 2333333333321100000000 0 000000000000 001123455666788 Q ss_pred HHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC-CCceeEEEE Q lcl|NC_019705. 69 LISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS-AGDVISLLP 147 (424) Q Consensus 69 ~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~-~G~~~~l~~ 147 (424) ..++-+.+-|+.+-- .+.. ....+...+. |. .......+...++.+|.||.++.++. +|. ..+.. T Consensus 78 ~~~~yl~G~p~~~~~--~~~~-----~~~~l~~~~~---n~---~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~-~~~~~ 143 (471) T protein:vir:10 78 QKKAYALTYPPTFDV--DDKK-----VNDMIVDVLG---DD---YERISKQLCVNAGNAGIAWLHVWKDASDNS-FRYAC 143 (471) T ss_pred hhhhhhcccCceecc--CChH-----HHHHHHHHHh---cC---HHHHHHHHHHHHhhCCeEEEEEEeeCCCCe-eEEEE Confidence 878777778877521 1111 1112222221 22 23455667889999999999998875 465 45777 Q ss_pred ecCceeEEeecCce---E-----EEEEEe--CCce----EEecHhHEEEeecCC-------------------------- Q lcl|NC_019705. 148 LQSANMDVKLVGKK---V-----VYRYQR--DSEY----AEFSQKEIFHLKGFG-------------------------- 187 (424) Q Consensus 148 l~~~~v~~~~~~~~---~-----~~~~~~--~~~~----~~~~~~eiih~r~~~-------------------------- 187 (424) ++|..+.+..++.. . +|.... ++.. ..+..+.+.|++... T Consensus 144 ~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (471) T protein:vir:10 144 VDSKEVIPIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSD 223 (471) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCccccccccccccccccccccccccc Confidence 88888877665421 1 111110 1111 112334444432100 Q ss_pred ------C---------CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhC Q lcl|NC_019705. 188 ------F---------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAG 252 (424) Q Consensus 188 ------~---------~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~ 252 (424) + +...|.|-+......++....+.....+.+...+.|-.+++-......++... .+. T Consensus 224 ~~~~~~~g~iPvv~~~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~----~~~---- 295 (471) T protein:vir:10 224 NSFKHDFGLVPFIPFKNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLE----DLK---- 295 (471) T ss_pred ccccCCCCceeEEEeccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHH----Hhh---- Confidence 0 12246666666666666555555445555555555655554322221222111 111 Q ss_pred CcccCcceecC-------CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHH---- Q lcl|NC_019705. 253 GPVKKRLWILE-------AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG---- 321 (424) Q Consensus 253 ~~~ag~~~~l~-------~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~---- 321 (424) .++.+.++ ++++|..-.. ....+....+...+.|...-++|..-... .++.++...+.+... T Consensus 296 ---~~~~i~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~I~~~s~tp~~~~~~--~gn~Sg~Alk~~~~~l~~k 368 (471) T protein:vir:10 296 ---RYKMIKMDNDGMGDQSGVTTIAIDI--PTEARNLILERTKKQIFISGQGVNPETDK--LGNSSGVALKFLYSLLELK 368 (471) T ss_pred ---cCCeEEecCCCCccCccceEEeecC--ChHHHHHHHHHHHHHHHHHhCCcCCCccc--ccCccHHHHHHHHHHHHHH Confidence 11222221 2344444333 33445667778888888888888532221 133332222222111 Q ss_pred ------HHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC Q lcl|NC_019705. 322 ------FLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPL 395 (424) Q Consensus 322 ------~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~ 395 (424) .+...+.-.++.|.. ++...+.. .+.+.+...+..|..+.++.+.++ .|+++..-+.++++.-. T Consensus 369 ~~~~~~~~~~~l~~~~~li~~-----~~~~~d~~--~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~~p~v~- 438 (471) T protein:vir:10 369 AGNMETQFRSGYATLVKMILK-----HLGLSDKL--KIKQTWTRNSINNDTEMAQVVSTL--ATITSRENVAKSNPIVE- 438 (471) T ss_pred HHHHHHHHHHHHHHHHHHHHH-----HhccCCCc--eeEEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCC- Confidence 112222222222222 12222222 234444566678888888888887 57899888887765522 Q ss_pred CCCCeeeecccccchhh--ccc-cCCCcccCC Q lcl|NC_019705. 396 PGGDVAMRQSQYVPITD--LGT-NKEPRNNGA 424 (424) Q Consensus 396 ~ggd~~~~~~n~~~~~~--~~~-~~~~~~~ga 424 (424) +-+.- +..+.. ... ++.+.-.++ T Consensus 439 -D~~~E-----~eri~~E~~~~~~~~~~~~~~ 464 (471) T protein:vir:10 439 -DWQDE-----LRLQKAEQEGRSEKLYDMEEV 464 (471) T ss_pred -CHHHH-----HHHHHHHHHHHHhcccccCCC Confidence 11110 111100 000 001111111 No 220 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=89.00 E-value=0.029 Score=29.02 Aligned_cols=376 Identities=14% Similarity=0.099 Sum_probs=146.6 Q ss_pred CCCCcccccCCCCCch--------HHHHHhhccCcccCCccccc----hhhcc--ccccccCcccccHHHHhccHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGW--------WARLQSWFVGGRLVTPNQGS----QTGPV--SAHGHLGDSSINDERILQISTVWRC 66 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~--------~~~~~~~~~~~~~~~~~~~~----~~~~~--~~~~~~~~~~vs~~~~~~~~~v~~~ 66 (424) |+| +.|+ |+++++. +.. -...|. ...|. ...+..+... ....+ +++-..| T Consensus 1 ~~~---------~~~~~~~~~~~r~~~l~~~---R~~-~e~~w~e~~~y~lP~~~~~~~~~~~~~--~~~~~-dst~~~a 64 (522) T protein:vir:94 1 MAE---------REGFAAEGAKAVYDRLKNG---RQP-YETRAQNCAAVTIPSLFPKESDNSSTE--YTTPW-QAVGARC 64 (522) T ss_pred Ccc---------cchhhHHHHHHHHHHHHHH---hhH-HHHHHHHHHHHhcccccCCCCCccccc--ccccc-cccHHHH Confidence 544 2343 3332221 000 000000 00010 0000111110 11122 3444456 Q ss_pred HHHHHHhhccC-----ceEEEEeccCCc---cceeccchH-------HHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeE Q lcl|NC_019705. 67 VSLISTLTACL-----PLDVFETDQNDN---RKKVDLSNP-------LARLLRYSPNQYMTAQEFREAMTMQLCFYGNAY 131 (424) Q Consensus 67 i~~ia~~ia~~-----~~~v~~~~~~~~---~~~~~~~~~-------l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~ 131 (424) ++.+|+.+.+. ||-=....+... ..+.....+ +.+.+.. --..-+.+.-+..+..+++.+||+. T Consensus 65 ~~~Las~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~-~~~~snf~~~~~~~~~~L~~~G~a~ 143 (522) T protein:vir:94 65 LNNLAAKLMLALFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMA-YMETNSFRVPLFEALKQLIVSGNCL 143 (522) T ss_pred HHHHHHHHHhhcCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHH-HHHhcCcHHHHHHHHHHHHhhCcEe Confidence 67776655432 332111111000 000000011 1111111 0011234455667788999999999 Q ss_pred EEEeeCCCCce--eEEEEecCceeEEeecCce---------------------------------EEEE--EEeCCceE- Q lcl|NC_019705. 132 ALVDRNSAGDV--ISLLPLQSANMDVKLVGKK---------------------------------VVYR--YQRDSEYA- 173 (424) Q Consensus 132 ~~~~r~~~G~~--~~l~~l~~~~v~~~~~~~~---------------------------------~~~~--~~~~~~~~- 173 (424) .++..+..|.+ ...|||....|.....+.. ..|. +..++... T Consensus 144 l~~~~~~~~~~~~~~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~ 223 (522) T protein:vir:94 144 LYIPEPEQGTYSPMRMYRLVSYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDDEYLR 223 (522) T ss_pred EeeeccCCCceeeEEEEEcceEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEeeCCceeE Confidence 99877766654 4556665433332222211 0000 00111100 Q ss_pred -------Ee-------cHhH--EEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCC Q lcl|NC_019705. 174 -------EF-------SQKE--IFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT 236 (424) Q Consensus 174 -------~~-------~~~e--iih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~ 236 (424) .+ +.++ .+.+|+...++ .||.||...+...+.....+.+...........|..++. +.+... T Consensus 224 ~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~-~~g~~~ 302 (522) T protein:vir:94 224 YEEVEGIEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVN-PNGITQ 302 (522) T ss_pred EeeccCceecccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec-cccccc Confidence 00 0011 23444443344 899999999999999999999999999988888886554 333333 Q ss_pred HHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHH-HHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhH Q lcl|NC_019705. 237 EQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEM-MASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGI 315 (424) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~-~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~ 315 (424) +... .-++.+. -+.--.+++...++... .+.+. .+..+.....|..+|-+. .+...+....+ +.= T Consensus 303 ~~~~---------~~~~~g~-~v~g~~~~v~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~--~~~~~~~~r~T-AtE 368 (522) T protein:vir:94 303 PRRL---------NKAATGE-FVAGRVEDINFLQLTKG-QDFTIAKSVADAIEQRLGWAFLLN--SAVQRNAERVT-AEE 368 (522) T ss_pred chhe---------eccCCce-eecCCcccceeeecccc-cchhHHHHHHHHHHHHHHHHHhhh--hhccCCCcccc-HHH Confidence 3221 1111111 01111223344444432 23332 455666777788888665 23222222222 111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcc-------------ChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCC Q lcl|NC_019705. 316 EQQNLGFLQYTLQPYISRWENSIQRWLI-------------PAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRT 382 (424) Q Consensus 316 e~~~~~~~~~tl~P~~~~ie~~l~~~l~-------------~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t 382 (424) -..+..-....|.|....+.++|-.-|+ ++..... ++.++.+.+. ...|..-+..+.+ . T Consensus 369 V~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~--v~v~~~s~La--~~qr~~~~~~l~~----~ 440 (522) T protein:vir:94 369 IRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDLPKEA--VEPTVSTGLE--ALGRGQDLEKLTQ----A 440 (522) T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccc--EEeeEecHHH--HHHHHHHHHHHHH----H Confidence 1233355566777777777766643332 1111110 1111111110 0112222222211 0 Q ss_pred HHHHHHHhCCCCCCCCCeeeecccccch-hhccccCCCcccCC Q lcl|NC_019705. 383 INEMRRTDNLPPLPGGDVAMRQSQYVPI-TDLGTNKEPRNNGA 424 (424) Q Consensus 383 ~NE~R~~~g~~p~~ggd~~~~~~n~~~~-~~~~~~~~~~~~ga 424 (424) .+.+ -.+.|. -.|. ..|.-.+ +...+.- +-+... T Consensus 441 ~~~i---a~l~P~-~~~~---~id~d~~~~~~a~~~-Gv~~~~ 475 (522) T protein:vir:94 441 VNMM---TGLQPL-SQDP---DINLPTLKLRLLNAL-GIDTAG 475 (522) T ss_pred HHHH---Hhccch-hhhh---cCCHHHHHHHHHHHc-CCChhh Confidence 1111 112221 0110 0111110 0011100 001111 No 221 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=87.00 E-value=0.042 Score=28.15 Aligned_cols=403 Identities=12% Similarity=0.094 Sum_probs=176.3 Q ss_pred hHHHHHhhc-c-------CcccCCccccchhhcccc------ccccCcccc-------cHHHHhccHHHHHHHHHHHHhh Q lcl|NC_019705. 16 WWARLQSWF-V-------GGRLVTPNQGSQTGPVSA------HGHLGDSSI-------NDERILQISTVWRCVSLISTLT 74 (424) Q Consensus 16 ~~~~~~~~~-~-------~~~~~~~~~~~~~~~~~~------~~~~~~~~v-------s~~~~~~~~~v~~~i~~ia~~i 74 (424) +-+.|+++- . +.+.+.|........+.. .....+..- .-+..+.+|.|..||+.|.+.+ T Consensus 1 ~~~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVnea 80 (537) T protein:vir:10 1 MAQQLFGFSLQRAKKVPKGPSFVQKDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVLNPECDSAVDDVVNET 80 (537) T ss_pred CccccccceeecccccccCCcccCCCcccccceeecccccccccccccccchHHHHHHHHHHHhhccchhhHHHHhhcce Confidence 223333321 0 111112221111111111 111111111 1355677999999999999987 Q ss_pred ccC-----ceEEEEeccC-CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC---CCceeEE Q lcl|NC_019705. 75 ACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS---AGDVISL 145 (424) Q Consensus 75 a~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~---~G~~~~l 145 (424) .-. |+.+--.+-+ .............++|+. ++...--+.++..|...|..|..++-|. ..-+.+| T Consensus 81 iv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~l-----l~F~~~~~e~fR~WYVDgRi~fhKiid~k~pk~GI~EL 155 (537) T protein:vir:10 81 ICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRL-----LDFDNRAYEIFRRWYVDGRLFFHKVIDPKKPRQGLVEL 155 (537) T ss_pred eEecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-----hccchhhhHHHhhheeeeEEEEEEEEeCCCccccceee Confidence 643 3222111111 100001111223333321 1222222455677788899998776543 2358999 Q ss_pred EEecCceeEEeec-----C-ce--------------EEEEEEe------CCceEEecHhHEEEee--cCCCCCcccCchH Q lcl|NC_019705. 146 LPLQSANMDVKLV-----G-KK--------------VVYRYQR------DSEYAEFSQKEIFHLK--GFGFTGLVGLSPI 197 (424) Q Consensus 146 ~~l~~~~v~~~~~-----~-~~--------------~~~~~~~------~~~~~~~~~~eiih~r--~~~~~~~~G~s~i 197 (424) ..|+|.+|+.++. . .. .+|.|.+ .+....++.+-|.... -.+.++-+.+|-+ T Consensus 156 r~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI~~dAI~y~hSGl~d~n~~~i~syL 235 (537) T protein:vir:10 156 RYVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKIAPDSIAYCHSGIQDLNKNMVLSHL 235 (537) T ss_pred eeeCCccceeeEeecccCCccceEEecceeeeecccceeeeccccccccCCCceeccHhheeeecccceeCCCCeeeeee Confidence 9999999864332 1 11 0122332 2334567775554443 2345666788889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH-HHHHHHHHHHHHhC--------Cc--ccCccee-c--- Q lcl|NC_019705. 198 AFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKEIAG--------GP--VKKRLWI-L--- 262 (424) Q Consensus 198 ~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~-~~~~~~~~~~~~~~--------~~--~ag~~~~-l--- 262 (424) ..+.+.+.....++....-+--.-+.-+-|+.++.+.++.. +.+-++....++.. |. +..+.+. | T Consensus 236 hkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDy 315 (537) T protein:vir:10 236 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDF 315 (537) T ss_pred hhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhhhhh Confidence 98888888777777766655555555566777777765543 34444554444321 10 1111221 1 Q ss_pred -------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchh----HHHHHHHHHHHHHHHHH Q lcl|NC_019705. 263 -------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSG----IEQQNLGFLQYTLQPYI 331 (424) Q Consensus 263 -------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n----~e~~~~~~~~~tl~P~~ 331 (424) ..|.++..|.-. ..+.-++-.++..+.+.++++||.+-|....+-+...++ =|-....|+..-=.-+. T Consensus 316 WLPRReGgrgTEItTLpGg-qnlgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs 394 (537) T protein:vir:10 316 WLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFS 394 (537) T ss_pred cccccCCCcccceeecccc-CCcChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHH Confidence 135566655432 223334456677889999999999988654432221111 01111122222222344 Q ss_pred HHHHHHHHhhccC-----hhhhcc--cch--hhhhhhh----hccC-HHHHHHHHHHHHh--CCCCCHHHHHHH-hCC-- Q lcl|NC_019705. 332 SRWENSIQRWLIP-----AKDVGR--IHA--EHNLDGL----LRGD-SASRAAFMKAMGE--AGLRTINEMRRT-DNL-- 392 (424) Q Consensus 332 ~~ie~~l~~~l~~-----~~~~~~--~~~--~fd~~~l----~~~d-~~~~~~~~~~~~~--~g~~t~NE~R~~-~g~-- 392 (424) ..|.+-|...|+. +.+... ..+ .|..|.. .... ...|...+..+-. +-.++.+=+|+. |.+ T Consensus 395 ~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tD 474 (537) T protein:vir:10 395 ELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTE 474 (537) T ss_pred HHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCH Confidence 4445555555443 233321 222 2222221 1111 1223333332210 112333333321 111 Q ss_pred ----------------CCC--CC----C------Ceeeecccccchhhc--cccCCCcccCC Q lcl|NC_019705. 393 ----------------PPL--PG----G------DVAMRQSQYVPITDL--GTNKEPRNNGA 424 (424) Q Consensus 393 ----------------~p~--~g----g------d~~~~~~n~~~~~~~--~~~~~~~~~ga 424 (424) +.+ |. | +..+.|.+..|-.+. +..++....|- T Consensus 475 eeI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 536 (537) T protein:vir:10 475 SEIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTDPNSAVSPADQKRGE 536 (537) T ss_pred HHHHHHHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCcccCCccCCCCCCccCCC Confidence 111 11 1 122323332222111 11111112222 No 222 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=81.49 E-value=0.085 Score=26.45 Aligned_cols=399 Identities=10% Similarity=0.056 Sum_probs=181.3 Q ss_pred ccCCCCCchHHHHHh--------hccC--cccCCccccchhh----------ccccc-ccc-Cccc--------c-cHHH Q lcl|NC_019705. 8 IDLRTNNGWWARLQS--------WFVG--GRLVTPNQGSQTG----------PVSAH-GHL-GDSS--------I-NDER 56 (424) Q Consensus 8 ~~~~~~~G~~~~~~~--------~~~~--~~~~~~~~~~~~~----------~~~~~-~~~-~~~~--------v-s~~~ 56 (424) |-+ .-..+|..+.. .+.. .+.+.|....... +.++. ++. +... | .-+. T Consensus 1 m~~-~~l~lf~f~~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~ 79 (521) T protein:vir:10 1 MNP-IFLKLLQPWMKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRS 79 (521) T ss_pred CCc-chhHHhhhhhhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHH Confidence 444 22222222110 1111 1122222221110 00000 010 0000 1 1255 Q ss_pred HhccHHHHHHHHHHHHhhccC-----ceEEEEeccCC-ccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCe Q lcl|NC_019705. 57 ILQISTVWRCVSLISTLTACL-----PLDVFETDQND-NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNA 130 (424) Q Consensus 57 ~~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~~~-~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a 130 (424) .+.+|.|..||+.|.+.+.-. |+.+--.+-+. ............++|+. ++...--+.++..|...|.. T Consensus 80 ma~~pEvd~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~l-----l~F~~~~~~~fR~WYVDgRi 154 (521) T protein:vir:10 80 LSKYHEVDNAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILKL-----LKFEREGKRHFRRWYVDSRI 154 (521) T ss_pred HhhccchhhHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHHH-----hccchhhhHHHhhheeeeeE Confidence 677999999999999987643 23322111111 11001111223333321 12222224556778888999 Q ss_pred EEEEeeCC---CCceeEEEEecCceeEEeec------Cc-------eEEEEEEe--------C---CceEEecHhHEEEe Q lcl|NC_019705. 131 YALVDRNS---AGDVISLLPLQSANMDVKLV------GK-------KVVYRYQR--------D---SEYAEFSQKEIFHL 183 (424) Q Consensus 131 ~~~~~r~~---~G~~~~l~~l~~~~v~~~~~------~~-------~~~~~~~~--------~---~~~~~~~~~eiih~ 183 (424) |..++-+. ..-+.+|..|+|.+|+..+. ++ ..+|.|.. + +....++.+-|.|. T Consensus 155 ~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~~g~~~~~vkI~~daI~y~ 234 (521) T protein:vir:10 155 YFHKMIDPARPKDGIKELRLLDPRNVEYYRVNLKSNENGNDVYKGVKEFFTYGATEDNRYNISGNSNNLVQIPIDAIVYS 234 (521) T ss_pred EEEEEeeCCCccccceeeeeeCCcceeeeeeecCCCCCcchhhccceeeeeeccCCCceecCCCCCCcceeechhheeee Confidence 98876542 33589999999999975431 11 12334432 1 12345777777666 Q ss_pred e--cCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH-HHHHHHHHHHHHhC-------- Q lcl|NC_019705. 184 K--GFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKEIAG-------- 252 (424) Q Consensus 184 r--~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~-~~~~~~~~~~~~~~-------- 252 (424) . ..+.++-+.+|-+..|.+.+.....++....-+--.-+.-+-|+.++.+.++.. +.+-++....++.. T Consensus 235 hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~T 314 (521) T protein:vir:10 235 HSGKVDIDGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSST 314 (521) T ss_pred cccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccC Confidence 5 345667788888999988888877777776655555555566777777765543 33444554444321 Q ss_pred Cc--ccCcceec-----------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-Ccccchh---- Q lcl|NC_019705. 253 GP--VKKRLWIL-----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-STSWGSG---- 314 (424) Q Consensus 253 ~~--~ag~~~~l-----------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~-~~~~~~n---- 314 (424) |. +..+.+.+ ..|.++..|.-. ..+.-++-.++..+.+.++++||.+-|..... -+...++ T Consensus 315 Gev~ddrk~msMlEDyWLpRReGgrgTEI~TLpgg-qnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~EItR 393 (521) T protein:vir:10 315 GKVKNSSNNLAMTEDYWLMRRDGKATTEVSTLPGA-QSMGEMDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAGNDITR 393 (521) T ss_pred ceeccchhhhhhHhhhcccccCCCCccceeecccc-CCcChHHHHHHHHHHHHHHhCCCccccCCCCCceecccccchhH Confidence 10 11122211 135566655432 22333445667788999999999998865422 1221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccC-----hhhhcc--cch--hhhhhhh----hccC-HHHHHHHHHHHH---- Q lcl|NC_019705. 315 IEQQNLGFLQYTLQPYISRWENSIQRWLIP-----AKDVGR--IHA--EHNLDGL----LRGD-SASRAAFMKAMG---- 376 (424) Q Consensus 315 ~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~-----~~~~~~--~~~--~fd~~~l----~~~d-~~~~~~~~~~~~---- 376 (424) =|-....|+..-=.-+...+.+.|..+|+. +.+... ..+ .|..|.. .... ...|...+..+- T Consensus 394 DEikF~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~y 473 (521) T protein:vir:10 394 DELQFTKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEV 473 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccc Confidence 111111222222223444445555555443 333221 222 2222222 1111 123455554441 Q ss_pred hCCCCCHHHHHH-HhCCCCCC----------CCCeeeecccccchhhccccCCCcccC Q lcl|NC_019705. 377 EAGLRTINEMRR-TDNLPPLP----------GGDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 377 ~~g~~t~NE~R~-~~g~~p~~----------ggd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) -+-+++.+=+|+ .|.+.-.+ +......+ +.+++.++= T Consensus 474 vGky~s~dyi~k~ILr~tDeeik~~~k~I~~E~~~~~~~----------~p~~e~~df 521 (521) T protein:vir:10 474 TGKYLSHEYVMKNILRMSDEDIKTEREKIDGELKDSVYK----------NPEDPMEEF 521 (521) T ss_pred cccccchHHHHHHHhcCCHhHHHHHHHHHHHhhhCCCCC----------CCcchhhcC Confidence 123666666665 35554210 01110000 000000111 No 223 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=78.53 E-value=0.11 Score=25.77 Aligned_cols=370 Identities=12% Similarity=0.074 Sum_probs=161.7 Q ss_pred ccCCCCCch-------HHHHHhhccCcccCCccccchhhccccccccCccccc-HHHHhc----cHHHHHHHHHHHHhhc Q lcl|NC_019705. 8 IDLRTNNGW-------WARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSIN-DERILQ----ISTVWRCVSLISTLTA 75 (424) Q Consensus 8 ~~~~~~~G~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs-~~~~~~----~~~v~~~i~~ia~~ia 75 (424) |-+.|..-- |..++....|..... ...........++.-. .+.+++ .+++...++.++..+- T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G~~~~r------~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf 74 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLGQREVK------KKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVL 74 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcChHHHH------cCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhh Confidence 666666633 444444443321100 0000011111122111 122233 4556667777776666 Q ss_pred cCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecCceeE- Q lcl|NC_019705. 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMD- 154 (424) Q Consensus 76 ~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~~l~~l~~~~v~- 154 (424) +-|..+- ....+.++.. =-...+-.+|.+.++...+.+|-|++.+.....|.-=.+..++|..|. T Consensus 75 ~k~p~~~------------~p~~l~~~~~--D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii~ 140 (452) T protein:vir:94 75 DQPPVIT------------HPDAMSKYFE--DQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENILN 140 (452) T ss_pred cCCceec------------ccHHHHHHHh--cccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhcC Confidence 6665431 0122333322 145678899999999999999999999987766531123333332221 Q ss_pred --E----------------eecCc-------eEEEE-------------EEeCCceE-E----ecHh------HEEEee- Q lcl|NC_019705. 155 --V----------------KLVGK-------KVVYR-------------YQRDSEYA-E----FSQK------EIFHLK- 184 (424) Q Consensus 155 --~----------------~~~~~-------~~~~~-------------~~~~~~~~-~----~~~~------eiih~r- 184 (424) . ..++. ...|+ |...+... . ..++ ..|=|- T Consensus 141 W~~~~~g~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP~v~ 220 (452) T protein:vir:94 141 WEEDEDGRLLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQNVGVTMDYIPFFC 220 (452) T ss_pred ccccccCCeeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecCCCcccceeEEEE Confidence 0 11110 00111 11111110 0 0000 111111 Q ss_pred -c-CCCCCcccCchHHHHHHH-HHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCccee Q lcl|NC_019705. 185 -G-FGFTGLVGLSPIAFACKS-AGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWI 261 (424) Q Consensus 185 -~-~~~~~~~G~s~i~~~~~~-i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~ 261 (424) + .+.+...|.+|+..++.. +...+.... ....+...+.|-.+++-.... + .-..|. +.++. T Consensus 221 ~~~~~~~~~~~~pPLl~LA~ln~~hy~~~sd-~~~~l~~~~~P~l~~~g~~~~-~-----------~i~iG~---~~~~~ 284 (452) T protein:vir:94 221 ITPSGLSMTPAKPPMIDIVDINYSHYRTSAD-LEHGRHFTGLPTPWITGAESQ-S-----------TMHIGS---TKAWV 284 (452) T ss_pred EcCCCCCCCCCccchHHHHHHHHHHhcchhH-HHHHHHHcccceeEeecCcCC-C-----------ceEecc---ccccc Confidence 1 112335688888766544 333333333 334445667777666532211 1 112332 23567 Q ss_pred cCC-Cc--eeeecccChhHHH--HHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHH--HHHHHHHHHHHHHHHHH Q lcl|NC_019705. 262 LEA-GF--STSAIGVTPQDAE--MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ--QNLGFLQYTLQPYISRW 334 (424) Q Consensus 262 l~~-g~--~~~~l~~~~~d~~--~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~--~~~~~~~~tl~P~~~~i 334 (424) +++ |. .|.+.+-+.-... .++.++....++.+ .++-....++. +.+. ...+-....|.-++..+ T Consensus 285 lpe~~~~~~yie~~g~~i~~~~~~l~~le~~m~~~Ga------~ll~~~~~~~~---s~ea~~~~~~~~~s~L~~~a~~~ 355 (452) T protein:vir:94 285 IPEVAAKVGFLEFTGQGLQSLEKALSEKQAQLASLSA------RLIDNSTRGSE---ATETVKLRYMSETASLKSVTRAV 355 (452) T ss_pred CCCCCCcceEEccCchhHHHHHHHHHHHHHHHHHHHH------HhhccCCCcch---HHHHHHHHHHHhhHHHHHHHHHH Confidence 774 54 5556555443322 22222222222222 12222111111 1222 22233357777788888 Q ss_pred HHHHHhhccChh---h-hcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh---CCCCCCCCCeeeecccc Q lcl|NC_019705. 335 ENSIQRWLIPAK---D-VGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTD---NLPPLPGGDVAMRQSQY 407 (424) Q Consensus 335 e~~l~~~l~~~~---~-~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~---g~~p~~ggd~~~~~~n~ 407 (424) |+.++.-|---- + .....++.+.+=+.........+++.++++.|.++....++.+ |....+. +.-.+ ..- T Consensus 356 e~al~~~l~~~a~w~g~~~~~~v~~n~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~~-e~~~i-~~E 433 (452) T protein:vir:94 356 EALLNKAYSCIMDMESMGGTLNIKLNSAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLPPPG-ESMGV-IPD 433 (452) T ss_pred HHHHHHHHHHHHHHcCCCCceEEEeccccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCcc-CHHHH-HHH Confidence 877764331100 0 0112233333222333334566667788999999999998887 4432221 11100 001 Q ss_pred cchhhccccCCCcccCC Q lcl|NC_019705. 408 VPITDLGTNKEPRNNGA 424 (424) Q Consensus 408 ~~~~~~~~~~~~~~~ga 424 (424) .+...+....+|-++|+ T Consensus 434 ~~~~~~~~~~~~~~~~~ 450 (452) T protein:vir:94 434 PPAPEPSPSNTPPNPSS 450 (452) T ss_pred hhccCcccCCCCCCCcc Confidence 12112223334555555 No 224 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=76.87 E-value=0.13 Score=25.43 Aligned_cols=397 Identities=10% Similarity=0.044 Sum_probs=173.2 Q ss_pred CCCCCchHHHHHhhccC--------------cccCCccccchhhc---------cccccc-----cCccc------c-cH Q lcl|NC_019705. 10 LRTNNGWWARLQSWFVG--------------GRLVTPNQGSQTGP---------VSAHGH-----LGDSS------I-ND 54 (424) Q Consensus 10 ~~~~~G~~~~~~~~~~~--------------~~~~~~~~~~~~~~---------~~~~~~-----~~~~~------v-s~ 54 (424) ..+=..++ .+++++.+ .+.++|........ .++..+ .-+.. | .- T Consensus 1 ~~~~~~~~-~lf~f~~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~Y 79 (524) T protein:vir:10 1 MANFNTIL-SFLKPWANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTY 79 (524) T ss_pred CCchhhHH-HHhhhhhcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHH Confidence 11111122 22222222 11222222111100 011100 00100 0 12 Q ss_pred HHHhccHHHHHHHHHHHHhhccC-----ceEEEEeccC-CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcC Q lcl|NC_019705. 55 ERILQISTVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYG 128 (424) Q Consensus 55 ~~~~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G 128 (424) +..+.+|.|..||+.|.+.+.-. |+.+--.+-+ .............++|+. -|-...+ +.++..|...| T Consensus 80 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~l-l~F~~~~----~~~fR~WYVDg 154 (524) T protein:vir:10 80 RNLMNNYEVDNAVQEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNL-LNFQRKG----TDHFQRWYVDS 154 (524) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHH-hccchhh----hHHHhhheeec Confidence 45677999999999999986643 3332111110 000000111122333321 1112222 45567788889 Q ss_pred CeEEEEeeCC---CCceeEEEEecCceeEEeec------Cc-------eEEEEEE-------------eCCceEEecHhH Q lcl|NC_019705. 129 NAYALVDRNS---AGDVISLLPLQSANMDVKLV------GK-------KVVYRYQ-------------RDSEYAEFSQKE 179 (424) Q Consensus 129 ~a~~~~~r~~---~G~~~~l~~l~~~~v~~~~~------~~-------~~~~~~~-------------~~~~~~~~~~~e 179 (424) ..|..++-+. ..-+.+|..|+|.+|+..+. ++ ..+|.|. ..+....++.+. T Consensus 155 Ri~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~~dA 234 (524) T protein:vir:10 155 RIFFHKIINPKKMKDGVQELRRLDPRQVQYIREIVTRMEDGVKIVDGYREFFVYDTGHESYCADGRIYSAGTKVKIPRAA 234 (524) T ss_pred eEEEEEEeeCCCccccceeeeeeCCccceeeeeecccCcccchhhcchhhheeecCCCcccccCcceecCCcceecchhh Confidence 9998876542 33589999999999975331 11 1123332 223445688899 Q ss_pred EEEeec--CCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH-HHHHHHHHHHHHhC---- Q lcl|NC_019705. 180 IFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKEIAG---- 252 (424) Q Consensus 180 iih~r~--~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~-~~~~~~~~~~~~~~---- 252 (424) |+|... .+.++-.-+|-+..|.+.+.....++....-+--.-+.-+-|+.++.+.++.. +.+-++....++.. T Consensus 235 Ivy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvY 314 (524) T protein:vir:10 235 VVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVY 314 (524) T ss_pred eeeeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEE Confidence 988753 34454455677888888777777666666555444555566677776665543 33444544443321 Q ss_pred ----Cc--ccCcceec-----------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhH Q lcl|NC_019705. 253 ----GP--VKKRLWIL-----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGI 315 (424) Q Consensus 253 ----~~--~ag~~~~l-----------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~ 315 (424) |. +..+.+.+ ..|.++..|.-. ..+.-++-.++..+.+.++++||.+-|...+++..+.... T Consensus 315 Da~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGg-qnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~ 393 (524) T protein:vir:10 315 DASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGA-TGMSDMDDVLYFRTALYRALRIPESRIPSESNSGVMFDAG 393 (524) T ss_pred eccCCeeccchhhhhhHhhhcccccCCCCccceeecccc-CCcChHHHHHHHHHHHHHHhCCCchhccCCCCcccccccc Confidence 11 11122211 135566555432 2222344566778899999999999995443322222111 Q ss_pred HHH------HHHHHHHHHHHHHHHHHHHHHhhccC-----hhhhcc--cch--hhhhhhh----hccC-HHHHHHHHHHH Q lcl|NC_019705. 316 EQQ------NLGFLQYTLQPYISRWENSIQRWLIP-----AKDVGR--IHA--EHNLDGL----LRGD-SASRAAFMKAM 375 (424) Q Consensus 316 e~~------~~~~~~~tl~P~~~~ie~~l~~~l~~-----~~~~~~--~~~--~fd~~~l----~~~d-~~~~~~~~~~~ 375 (424) .+- ...|+..-=.-+...+.+.|..+|+. +.+... ..+ .|..|.. .... ...|...+..+ T Consensus 394 ~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~ 473 (524) T protein:vir:10 394 TAITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMA 473 (524) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHh Confidence 111 11222222223444445555555443 333221 222 2222221 1111 12233333332 Q ss_pred Hh--CCCCCHHHHHH-HhCCCCCC--C--------CCeeeecccccchhhccccCCCcccC Q lcl|NC_019705. 376 GE--AGLRTINEMRR-TDNLPPLP--G--------GDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 376 ~~--~g~~t~NE~R~-~~g~~p~~--g--------gd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) -. +-.++.+=+|+ .|.+.-.+ . -++...+ +.+++.++= T Consensus 474 dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~----------~~~~~~~~f 524 (524) T protein:vir:10 474 EPFIGKYISHQTAMKDFLQMTDEEINQEAKQIEEESKEARFQ----------NPDEEEEDF 524 (524) T ss_pred hhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCC----------CCChhhhcC Confidence 11 12345554443 23332110 0 0000000 000000000 No 225 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=75.17 E-value=0.15 Score=25.11 Aligned_cols=399 Identities=11% Similarity=0.097 Sum_probs=169.1 Q ss_pred HHHHHhhcc---------CcccCCccccchhhcccccc---cc---Ccccc-------cHHHHhccHHHHHHHHHHHHhh Q lcl|NC_019705. 17 WARLQSWFV---------GGRLVTPNQGSQTGPVSAHG---HL---GDSSI-------NDERILQISTVWRCVSLISTLT 74 (424) Q Consensus 17 ~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~---~~---~~~~v-------s~~~~~~~~~v~~~i~~ia~~i 74 (424) +..++++.- ..+.++|........+...+ .. .+..- .-+..+.+|.|..||+.|.+.+ T Consensus 1 m~~lfgf~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVnea 80 (558) T protein:vir:10 1 MAKLFGFSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEADGAIEDVVNEA 80 (558) T ss_pred CcchhcchhhhhhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHhhccchhhHHHHhhcce Confidence 222333221 01112222222111111111 11 11111 1255677999999999999986 Q ss_pred ccC-----ceEEEEeccCCc-cceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC---CCceeEE Q lcl|NC_019705. 75 ACL-----PLDVFETDQNDN-RKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS---AGDVISL 145 (424) Q Consensus 75 a~~-----~~~v~~~~~~~~-~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~---~G~~~~l 145 (424) .-. |+.+--.+-+.. .-.........++|+. ++...--+.++..|...|..|..++-|. ..-+.+| T Consensus 81 iv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~l-----l~F~~~~~e~fR~WYVDgRiyfHKiid~k~pk~GI~EL 155 (558) T protein:vir:10 81 IVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEM-----MDFDKKSHEIFRNWYVDGRVFYLKVIDTKNPQEGIQDL 155 (558) T ss_pred eEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHH-----hccchhhhHHHhhheeeeEEEEEEEEeCCCccccceee Confidence 643 332211111100 0011111223333331 2222222455777888899998776543 2358899 Q ss_pred EEecCceeEEeecC-----------------c-------eEEEEEEeCC-------------ceEEecHhHEEEeec--C Q lcl|NC_019705. 146 LPLQSANMDVKLVG-----------------K-------KVVYRYQRDS-------------EYAEFSQKEIFHLKG--F 186 (424) Q Consensus 146 ~~l~~~~v~~~~~~-----------------~-------~~~~~~~~~~-------------~~~~~~~~eiih~r~--~ 186 (424) ..|+|.+|+..+.- + ..+|.|...+ ....++.+-|....- . T Consensus 156 r~lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~~dAI~y~hSGL~ 235 (558) T protein:vir:10 156 RYIDPLKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIAKDSITMCTSGLV 235 (558) T ss_pred eeeCcccceeeeeeccccccccceeeeecccceeeccceeEeeeecCCcccccccceeecCCCceeechhheeeecccce Confidence 99999998643320 0 1123343321 123344444433321 2 Q ss_pred CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH-HHHHHHHHHHHHhC--------Cc--c Q lcl|NC_019705. 187 GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKEIAG--------GP--V 255 (424) Q Consensus 187 ~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~-~~~~~~~~~~~~~~--------~~--~ 255 (424) +.++-.-+|-+..+.+.+.....++....-+--.-+.-+-|+.++.+.++.. +.+-++....++.. |. + T Consensus 236 d~~~~~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~d 315 (558) T protein:vir:10 236 DRNKNRVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRD 315 (558) T ss_pred ecCCCeeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecc Confidence 3344455566787777777776666666555444455566677776665543 33444444444321 10 1 Q ss_pred cCccee-c----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHH-----H Q lcl|NC_019705. 256 KKRLWI-L----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE-----Q 317 (424) Q Consensus 256 ag~~~~-l----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e-----~ 317 (424) ..+.+. | ..|.++..|.- +..+| +-.++..+.+.++++||.+-|....+-+.. ..+| - T Consensus 316 drk~msMlEDyWLpRReGgrgTEItTLpGgqnLgem---~DV~YF~kKLy~aLnVP~SRl~~e~~f~~G-r~~EItRDEi 391 (558) T protein:vir:10 316 DRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGEL---SDVDYFQKKLYRALGVPESRIAAEGGFNLG-RSSEILRDEL 391 (558) T ss_pred cchhhhhHhhhcccccCCCCccceeeccccCCcchH---HHHHHHHHHHHHHhCCCccccCCCCccccc-ccchhhHHHH Confidence 111221 1 13556665543 33344 445667889999999999988754332221 1111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccC-----hhhhcc--cch--hhhhhhh----hccC-HHHHHHHHHHHHh--CCCC Q lcl|NC_019705. 318 QNLGFLQYTLQPYISRWENSIQRWLIP-----AKDVGR--IHA--EHNLDGL----LRGD-SASRAAFMKAMGE--AGLR 381 (424) Q Consensus 318 ~~~~~~~~tl~P~~~~ie~~l~~~l~~-----~~~~~~--~~~--~fd~~~l----~~~d-~~~~~~~~~~~~~--~g~~ 381 (424) ....|+..-=.-+...|.+-|...|+. +.+... ..+ .|..|.. .... ...|...+..+-. +-.+ T Consensus 392 KF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~ 471 (558) T protein:vir:10 392 KFAKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYY 471 (558) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 122222222223444445555555443 233221 222 2222221 1111 1223333333211 1133 Q ss_pred CHHHHHHH-hCC------------------CCCC--CCCeeee-----cccccchhhcccc-CCCcccCC Q lcl|NC_019705. 382 TINEMRRT-DNL------------------PPLP--GGDVAMR-----QSQYVPITDLGTN-KEPRNNGA 424 (424) Q Consensus 382 t~NE~R~~-~g~------------------~p~~--ggd~~~~-----~~n~~~~~~~~~~-~~~~~~ga 424 (424) +.+=+|+. |.+ +-++ ...+++. +.+-......+.+ .++.-.++ T Consensus 472 S~dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 541 (558) T protein:vir:10 472 STEYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDPAMEGMGEQPVDPDLEAQ 541 (558) T ss_pred chHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccChhhccccCccCCchhccCCCCCcccccccc Confidence 44333321 221 1122 1112221 1111111111221 12222222 No 226 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=73.37 E-value=0.17 Score=24.79 Aligned_cols=399 Identities=9% Similarity=0.043 Sum_probs=173.1 Q ss_pred ccCCCCCchHHHHHhhc--------c--CcccCCccccchhh-----------cccccc-ccCcc----c------c-cH Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWF--------V--GGRLVTPNQGSQTG-----------PVSAHG-HLGDS----S------I-ND 54 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~--------~--~~~~~~~~~~~~~~-----------~~~~~~-~~~~~----~------v-s~ 54 (424) |.+ +..|+|+.+..-- . ..+.++|....... ++.+.. ...+. . | .- T Consensus 1 m~~-~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~Y 79 (524) T protein:vir:72 1 MKF-NVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTY 79 (524) T ss_pred CCC-chhhHhhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHH Confidence 777 4666665442210 0 01122222211111 111100 00110 0 1 12 Q ss_pred HHHhccHHHHHHHHHHHHhhccC-----ceEEEEeccC-CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcC Q lcl|NC_019705. 55 ERILQISTVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYG 128 (424) Q Consensus 55 ~~~~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G 128 (424) +..+.+|.|..||+.|.+.+.-. |+.+--.+-+ ...-.........++|+. ++...--+.++..|...| T Consensus 80 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~l-----l~F~~~~~~~fR~WYVDg 154 (524) T protein:vir:72 80 RNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDVLNH-----LSFQRKGSDHFRRWYVDS 154 (524) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHH-----hccchhhhHHHhhheeee Confidence 55677999999999999987643 3332211111 000000111223333321 222222245567788889 Q ss_pred CeEEEEeeCC---CCceeEEEEecCceeEEeec------Cc-------eEEEEEEeCC-------------ceEEecHhH Q lcl|NC_019705. 129 NAYALVDRNS---AGDVISLLPLQSANMDVKLV------GK-------KVVYRYQRDS-------------EYAEFSQKE 179 (424) Q Consensus 129 ~a~~~~~r~~---~G~~~~l~~l~~~~v~~~~~------~~-------~~~~~~~~~~-------------~~~~~~~~e 179 (424) ..|..++-|. ..-+.+|..|+|.+|+..+. ++ ..+|.|..+. ....++.+- T Consensus 155 Ri~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dA 234 (524) T protein:vir:72 155 RIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAA 234 (524) T ss_pred EEEEEEEEeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhh Confidence 9998776543 23589999999999975321 11 1123343221 234555566 Q ss_pred EEEeec--CCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH-HHHHHHHHHHHHhC---- Q lcl|NC_019705. 180 IFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKEIAG---- 252 (424) Q Consensus 180 iih~r~--~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~-~~~~~~~~~~~~~~---- 252 (424) |.+... .+.++-.-+|-+..|.+.+.....++....-+--.-+.-+-|+.++.+.++.. +.+-++....++.. T Consensus 235 I~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvY 314 (524) T protein:vir:72 235 VVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVY 314 (524) T ss_pred eeeeeccceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEE Confidence 655541 23344455667788877777776666666555444455566677776665543 33444444444321 Q ss_pred ----Cc--ccCcceec-----------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchh- Q lcl|NC_019705. 253 ----GP--VKKRLWIL-----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSG- 314 (424) Q Consensus 253 ----~~--~ag~~~~l-----------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n- 314 (424) |. +..+.+.+ ..|.++..|.-. ..+.-++-.++..+.+..+++||.+-|.....+..+... T Consensus 315 Da~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGg-qnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~ 393 (524) T protein:vir:72 315 DASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGA-DNTGNMEDIRWFRQALYMALRVPLSRIPQDQQGGVMFDSG 393 (524) T ss_pred eCCCCeeccchhhhhhHhhhcccccCCCcccceeecccc-CCcChHHHHHHHHHHHHHHhCCchhhcCCCCCcccccccc Confidence 10 11122211 135566555432 222234456677889999999999988432211121111 Q ss_pred HH----H-HHHHHHHHHHHHHHHHHHHHHHhhccC-----hhhhcc--cch--hhhhhhh----hccC-HHHHHHHHHHH Q lcl|NC_019705. 315 IE----Q-QNLGFLQYTLQPYISRWENSIQRWLIP-----AKDVGR--IHA--EHNLDGL----LRGD-SASRAAFMKAM 375 (424) Q Consensus 315 ~e----~-~~~~~~~~tl~P~~~~ie~~l~~~l~~-----~~~~~~--~~~--~fd~~~l----~~~d-~~~~~~~~~~~ 375 (424) +| + ....|+..-=.-+...+.+-|..+|+. +.+... ..+ .|..|.. .... ...|...+..+ T Consensus 394 ~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~ 473 (524) T protein:vir:72 394 TSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMA 473 (524) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHh Confidence 11 1 111222222223444445555555443 333221 222 2222222 1111 12233333332 Q ss_pred Hh--CCCCCHHHHHH-HhCCCCCC----------CCCeeeecccccchhhccccCCCcccC Q lcl|NC_019705. 376 GE--AGLRTINEMRR-TDNLPPLP----------GGDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 376 ~~--~g~~t~NE~R~-~~g~~p~~----------ggd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) -. +-.++.+=+++ .|.+.-.+ +.++...+. .++..++= T Consensus 474 dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~----------~~~~~~~f 524 (524) T protein:vir:72 474 EPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQD----------PDQEQEDF 524 (524) T ss_pred hhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCC----------CchhhhcC Confidence 11 12345544443 23332110 000000000 00000000 No 227 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=72.07 E-value=0.19 Score=24.57 Aligned_cols=406 Identities=10% Similarity=0.091 Sum_probs=179.5 Q ss_pred ccCCCCCchHHHHHhh-----c--cCcccCCccccchhhcc---------ccc-c---ccCccc------c-cHHHHhcc Q lcl|NC_019705. 8 IDLRTNNGWWARLQSW-----F--VGGRLVTPNQGSQTGPV---------SAH-G---HLGDSS------I-NDERILQI 60 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~-----~--~~~~~~~~~~~~~~~~~---------~~~-~---~~~~~~------v-s~~~~~~~ 60 (424) |.+-.=.|||.+.-.. . ...+.++|.....+..+ ++. . ...+.. | .-+..+.+ T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 4333334554433221 0 11222233222211111 000 0 001111 1 12556789 Q ss_pred HHHHHHHHHHHHhhccC-----ceEEEEeccC-CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEE Q lcl|NC_019705. 61 STVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV 134 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~ 134 (424) |.|..||+.|.+.+.-. |+.+--.+-+ .............++|+. -|-...+ +.++..|...|..|... T Consensus 81 pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~l-l~F~~~~----~~~fR~WYVDgRi~fhK 155 (516) T protein:vir:10 81 PEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRL-LDASRKL----DTLFRRWYVDSRIFFHK 155 (516) T ss_pred cchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHH-hccchhh----hHHHhhhhhcceEEEEE Confidence 99999999999987643 3333111110 000000111122333321 1112223 45567778889988875 Q ss_pred ee-CCCCceeEEEEecCceeEEeec-----C-c-------eEEEEEEeC-------------CceEEecHhHEEEee--c Q lcl|NC_019705. 135 DR-NSAGDVISLLPLQSANMDVKLV-----G-K-------KVVYRYQRD-------------SEYAEFSQKEIFHLK--G 185 (424) Q Consensus 135 ~r-~~~G~~~~l~~l~~~~v~~~~~-----~-~-------~~~~~~~~~-------------~~~~~~~~~eiih~r--~ 185 (424) +- +...-+.+|..|+|.+|+..+. . + ..+|.|..+ +....++.+-|.+.. . T Consensus 156 iid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL 235 (516) T protein:vir:10 156 IMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGL 235 (516) T ss_pred EecCccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccc Confidence 43 4445689999999999875431 1 1 122333321 123456666665554 2 Q ss_pred CCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH-HHHHHHHHHHHHhC--------Cc-- Q lcl|NC_019705. 186 FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKEIAG--------GP-- 254 (424) Q Consensus 186 ~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~-~~~~~~~~~~~~~~--------~~-- 254 (424) .+.++-.-+|-+..|.+.+.....++....-+--.-+.-+-|+.++.+.++.. +.+-++....++.. |. T Consensus 236 ~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ 315 (516) T protein:vir:10 236 MDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVK 315 (516) T ss_pred eeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeec Confidence 34444444777888888887777777666555444555566777776665543 33444544444321 10 Q ss_pred ccCcceec-----------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccc-chhHHHHHH-- Q lcl|NC_019705. 255 VKKRLWIL-----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSW-GSGIEQQNL-- 320 (424) Q Consensus 255 ~ag~~~~l-----------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~-~~n~e~~~~-- 320 (424) +..+.+.+ ..|.++..|.-. ..+.-++-.++..+.+.++++||.+-|...++.+.. +.++|=.+. T Consensus 316 ddrk~msMlEDyWLpRReGgrgTEItTLpGg-qnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEi 394 (516) T protein:vir:10 316 NQKRNLSMTEDYWLMRRDGKSVTEVSSLPGA-QTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDEL 394 (516) T ss_pred cchhhhhhHhhhcccccCCCCccceeecccc-CCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHH Confidence 11122211 135566555432 222234456677889999999999988765543321 122221111 Q ss_pred ---HHHHHHHHHHHHHHHHHHHhhccC-----hhhhcc--cch--hhhhhhh----hccC-HHHHHHHHHHHH--hCCCC Q lcl|NC_019705. 321 ---GFLQYTLQPYISRWENSIQRWLIP-----AKDVGR--IHA--EHNLDGL----LRGD-SASRAAFMKAMG--EAGLR 381 (424) Q Consensus 321 ---~~~~~tl~P~~~~ie~~l~~~l~~-----~~~~~~--~~~--~fd~~~l----~~~d-~~~~~~~~~~~~--~~g~~ 381 (424) .|+..-=.-+...+.+.|..+|+. +.+... ..+ .|..|.. .... ...|...+..+- -+.++ T Consensus 395 KF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~ 474 (516) T protein:vir:10 395 DFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYV 474 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 222222223444445555555443 333321 222 2222221 1111 123444444332 23577 Q ss_pred CHHHHHH-HhCCCCCC--CCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 382 TINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 382 t~NE~R~-~~g~~p~~--ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) +.+=+|+ .|.+.-.+ .-++..-. .. +.+--+.|.+..- T Consensus 475 s~~yi~k~ILr~tDeei~~e~k~I~~----E~-~~~~~~~p~~~~~ 515 (516) T protein:vir:10 475 SHDYVMKNILQMTEEQIAQEEKQIEQ----EA-GIKRFQNPENEDD 515 (516) T ss_pred chHHHHHHHhcCCHhhHHHHHHHHHH----hh-hCCCCCCCCcccc Confidence 7777765 35553211 00000000 00 0000011111101 No 228 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=72.07 E-value=0.19 Score=24.57 Aligned_cols=406 Identities=10% Similarity=0.091 Sum_probs=179.5 Q ss_pred ccCCCCCchHHHHHhh-----c--cCcccCCccccchhhcc---------ccc-c---ccCccc------c-cHHHHhcc Q lcl|NC_019705. 8 IDLRTNNGWWARLQSW-----F--VGGRLVTPNQGSQTGPV---------SAH-G---HLGDSS------I-NDERILQI 60 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~-----~--~~~~~~~~~~~~~~~~~---------~~~-~---~~~~~~------v-s~~~~~~~ 60 (424) |.+-.=.|||.+.-.. . ...+.++|.....+..+ ++. . ...+.. | .-+..+.+ T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 4333334554433221 0 11222233222211111 000 0 001111 1 12556789 Q ss_pred HHHHHHHHHHHHhhccC-----ceEEEEeccC-CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEE Q lcl|NC_019705. 61 STVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV 134 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~ 134 (424) |.|..||+.|.+.+.-. |+.+--.+-+ .............++|+. -|-...+ +.++..|...|..|... T Consensus 81 pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~l-l~F~~~~----~~~fR~WYVDgRi~fhK 155 (516) T protein:vir:10 81 PEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRL-LDASRKL----DTLFRRWYVDSRIFFHK 155 (516) T ss_pred cchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHH-hccchhh----hHHHhhhhhcceEEEEE Confidence 99999999999987643 3333111110 000000111122333321 1112223 45567778889988875 Q ss_pred ee-CCCCceeEEEEecCceeEEeec-----C-c-------eEEEEEEeC-------------CceEEecHhHEEEee--c Q lcl|NC_019705. 135 DR-NSAGDVISLLPLQSANMDVKLV-----G-K-------KVVYRYQRD-------------SEYAEFSQKEIFHLK--G 185 (424) Q Consensus 135 ~r-~~~G~~~~l~~l~~~~v~~~~~-----~-~-------~~~~~~~~~-------------~~~~~~~~~eiih~r--~ 185 (424) +- +...-+.+|..|+|.+|+..+. . + ..+|.|..+ +....++.+-|.+.. . T Consensus 156 iid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL 235 (516) T protein:vir:10 156 IMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGL 235 (516) T ss_pred EecCccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccc Confidence 43 4445689999999999875431 1 1 122333321 123456666665554 2 Q ss_pred CCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH-HHHHHHHHHHHHhC--------Cc-- Q lcl|NC_019705. 186 FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKEIAG--------GP-- 254 (424) Q Consensus 186 ~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~-~~~~~~~~~~~~~~--------~~-- 254 (424) .+.++-.-+|-+..|.+.+.....++....-+--.-+.-+-|+.++.+.++.. +.+-++....++.. |. T Consensus 236 ~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ 315 (516) T protein:vir:10 236 MDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVK 315 (516) T ss_pred eeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeec Confidence 34444444777888888887777777666555444555566777776665543 33444544444321 10 Q ss_pred ccCcceec-----------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccc-chhHHHHHH-- Q lcl|NC_019705. 255 VKKRLWIL-----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSW-GSGIEQQNL-- 320 (424) Q Consensus 255 ~ag~~~~l-----------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~-~~n~e~~~~-- 320 (424) +..+.+.+ ..|.++..|.-. ..+.-++-.++..+.+.++++||.+-|...++.+.. +.++|=.+. T Consensus 316 ddrk~msMlEDyWLpRReGgrgTEItTLpGg-qnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEi 394 (516) T protein:vir:10 316 NQKRNLSMTEDYWLMRRDGKSVTEVSSLPGA-QTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDEL 394 (516) T ss_pred cchhhhhhHhhhcccccCCCCccceeecccc-CCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHH Confidence 11122211 135566555432 222234456677889999999999988765543321 122221111 Q ss_pred ---HHHHHHHHHHHHHHHHHHHhhccC-----hhhhcc--cch--hhhhhhh----hccC-HHHHHHHHHHHH--hCCCC Q lcl|NC_019705. 321 ---GFLQYTLQPYISRWENSIQRWLIP-----AKDVGR--IHA--EHNLDGL----LRGD-SASRAAFMKAMG--EAGLR 381 (424) Q Consensus 321 ---~~~~~tl~P~~~~ie~~l~~~l~~-----~~~~~~--~~~--~fd~~~l----~~~d-~~~~~~~~~~~~--~~g~~ 381 (424) .|+..-=.-+...+.+.|..+|+. +.+... ..+ .|..|.. .... ...|...+..+- -+.++ T Consensus 395 KF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~ 474 (516) T protein:vir:10 395 DFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYV 474 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 222222223444445555555443 333321 222 2222221 1111 123444444332 23577 Q ss_pred CHHHHHH-HhCCCCCC--CCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 382 TINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 382 t~NE~R~-~~g~~p~~--ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) +.+=+|+ .|.+.-.+ .-++..-. .. +.+--+.|.+..- T Consensus 475 s~~yi~k~ILr~tDeei~~e~k~I~~----E~-~~~~~~~p~~~~~ 515 (516) T protein:vir:10 475 SHDYVMKNILQMTEEQIAQEEKQIEQ----EA-GIKRFQNPENEDD 515 (516) T ss_pred chHHHHHHHhcCCHhhHHHHHHHHHH----hh-hCCCCCCCCcccc Confidence 7777765 35553211 00000000 00 0000011111101 No 229 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=71.45 E-value=0.2 Score=24.47 Aligned_cols=399 Identities=10% Similarity=0.042 Sum_probs=173.0 Q ss_pred ccCCCCCchHHHHHhhc--------c--CcccCCccccchhh-----------cccccc-ccCcc----c------c-cH Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWF--------V--GGRLVTPNQGSQTG-----------PVSAHG-HLGDS----S------I-ND 54 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~--------~--~~~~~~~~~~~~~~-----------~~~~~~-~~~~~----~------v-s~ 54 (424) |.+ +..|+|+.+..-- . ..+.++|....... ++.+.. ...+. . | .- T Consensus 1 m~~-~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~Y 79 (524) T protein:vir:10 1 MKF-NVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTY 79 (524) T ss_pred CCC-chhhHhhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHH Confidence 777 4666665442210 0 01122222211111 111100 10110 0 1 12 Q ss_pred HHHhccHHHHHHHHHHHHhhccC-----ceEEEEeccC-CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcC Q lcl|NC_019705. 55 ERILQISTVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYG 128 (424) Q Consensus 55 ~~~~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G 128 (424) +..+.+|.|..||+.|.+.+.-. |+.+--.+-+ ...-.........++|+. ++...--+.++..|...| T Consensus 80 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~l-----l~F~~~~~~~fR~WYVDg 154 (524) T protein:vir:10 80 RNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDVLNH-----LSFQRKGSDHFRRWYVDS 154 (524) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHH-----hccchhhhHHHhhheeee Confidence 55677999999999999987643 3332211111 000000111223333321 222222245567788889 Q ss_pred CeEEEEeeCC---CCceeEEEEecCceeEEeec------Cc-------eEEEEEEeCC-------------ceEEecHhH Q lcl|NC_019705. 129 NAYALVDRNS---AGDVISLLPLQSANMDVKLV------GK-------KVVYRYQRDS-------------EYAEFSQKE 179 (424) Q Consensus 129 ~a~~~~~r~~---~G~~~~l~~l~~~~v~~~~~------~~-------~~~~~~~~~~-------------~~~~~~~~e 179 (424) ..|..++-|. ..-+.+|..|+|.+|+..+. ++ ..+|.|..+. ....++.+- T Consensus 155 Ri~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dA 234 (524) T protein:vir:10 155 RIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAA 234 (524) T ss_pred EEEEEEEeeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhh Confidence 9998776543 23589999999999975321 11 1123343221 234455565 Q ss_pred EEEeec--CCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH-HHHHHHHHHHHHhC---- Q lcl|NC_019705. 180 IFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKEIAG---- 252 (424) Q Consensus 180 iih~r~--~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~-~~~~~~~~~~~~~~---- 252 (424) |.+... .+.++-.-+|-+..|.+.+.....++....-+--.-+.-+-|+.++.+.++.. +.+-++....++.. T Consensus 235 I~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvY 314 (524) T protein:vir:10 235 IVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVY 314 (524) T ss_pred eeeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEE Confidence 655541 23344455667788877777776666666555444455566677776665543 33444444444321 Q ss_pred ----Cc--ccCcceec-----------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchh- Q lcl|NC_019705. 253 ----GP--VKKRLWIL-----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSG- 314 (424) Q Consensus 253 ----~~--~ag~~~~l-----------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n- 314 (424) |. +..+.+.+ ..|.++..|.-. ..+.-++-.++..+.+..+++||.+-|.....+..+... T Consensus 315 Da~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGg-qnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~ 393 (524) T protein:vir:10 315 DASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGA-DNTGNMEDVRWFRQALYMALRVPLSRIPQDQQGGVMFDSG 393 (524) T ss_pred eCCCCeeccchhhhhhHhhhcccccCCCcccceeecccc-CCcChHHHHHHHHHHHHHHhCCchhhcCCCCCcccccccc Confidence 10 11122211 135566555432 222234456677889999999999988432211121111 Q ss_pred HH----H-HHHHHHHHHHHHHHHHHHHHHHhhccC-----hhhhcc--cch--hhhhhhh----hccC-HHHHHHHHHHH Q lcl|NC_019705. 315 IE----Q-QNLGFLQYTLQPYISRWENSIQRWLIP-----AKDVGR--IHA--EHNLDGL----LRGD-SASRAAFMKAM 375 (424) Q Consensus 315 ~e----~-~~~~~~~~tl~P~~~~ie~~l~~~l~~-----~~~~~~--~~~--~fd~~~l----~~~d-~~~~~~~~~~~ 375 (424) +| + ....|+..-=.-+...+.+-|..+|+. +.+... ..+ .|..|.. .... ...|...+..+ T Consensus 394 ~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~ 473 (524) T protein:vir:10 394 TSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMA 473 (524) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHh Confidence 11 1 111222222223444445555555443 333221 222 2222222 1111 12233333332 Q ss_pred Hh--CCCCCHHHHHH-HhCCCCCC----------CCCeeeecccccchhhccccCCCcccC Q lcl|NC_019705. 376 GE--AGLRTINEMRR-TDNLPPLP----------GGDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 376 ~~--~g~~t~NE~R~-~~g~~p~~----------ggd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) -. +-.++.+=+++ .|.+.-.+ +.++...+ +.++..++= T Consensus 474 dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~----------~~~~~~~~f 524 (524) T protein:vir:10 474 EPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQ----------DPDQEQEDF 524 (524) T ss_pred hhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCC----------CCchhhhcC Confidence 11 12344444443 23332110 00000000 000000000 No 230 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=69.15 E-value=0.23 Score=24.11 Aligned_cols=394 Identities=11% Similarity=0.084 Sum_probs=155.3 Q ss_pred CCCCcccccCC-----------------CCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHH Q lcl|NC_019705. 1 MEEPKYTIDLR-----------------TNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTV 63 (424) Q Consensus 1 ~~~~~~~~~~~-----------------~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v 63 (424) .++--+-+... .++.-++++..+..+..... ... ..... ..+. .+ +.++.. T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~-~~~-----~~~~~---~~~~-~k--i~~n~~ 69 (489) T protein:vir:99 2 LQEDFEAIDYESKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIK-YRP-----AKTDK---YAAD-NR--IASDFA 69 (489) T ss_pred CccceeeeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccc-ccc-----ccccc---cCCc-ce--eecchH Confidence 11111111111 11233344444443321100 000 00000 0000 00 234556 Q ss_pred HHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee----CCC Q lcl|NC_019705. 64 WRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR----NSA 139 (424) Q Consensus 64 ~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r----~~~ 139 (424) .-+|+..++-+-+-|+.+--. +.. ....+..++. . | ....+...+..+++.+|.+|..+.. +.. T Consensus 70 ~~iv~~~~~~l~g~~~~~~~~--d~~-----~~~~l~~~~~-~-n---~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~ 137 (489) T protein:vir:99 70 KYITVFEQGYMLGVPVEYKNE--NKD-----LQAAIDLMSV-R-N---NEDYHNVKIKTDLSIYGRAYELLTVEKIDDKK 137 (489) T ss_pred HHHHHHHhhhhccCCceeecC--Chh-----HHHHHHHHHh-h-c---ChhHHHHHHHHHHhhCCeEEEEEeeccCcCCC Confidence 678888888777777764221 111 1123444443 2 2 2235668889999999999977643 333 Q ss_pred CceeEEEEecCceeEEeecCce---EE-----EEEEeC-Cc----eEEecHhHEEEeecCC------------------- Q lcl|NC_019705. 140 GDVISLLPLQSANMDVKLVGKK---VV-----YRYQRD-SE----YAEFSQKEIFHLKGFG------------------- 187 (424) Q Consensus 140 G~~~~l~~l~~~~v~~~~~~~~---~~-----~~~~~~-~~----~~~~~~~eiih~r~~~------------------- 187 (424) |. ..+..++|.++.+..++.. .. |..... +. ...+.++.+++++... T Consensus 138 ~~-~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~v 216 (489) T protein:vir:99 138 TE-VKLYQLPAEQTFVIYDDTYQRNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGV 216 (489) T ss_pred cc-eEEEEEcccceEEEEcCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCce Confidence 33 4577788888776655321 11 111000 00 0122333333332100 Q ss_pred -----CCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHH-------hCCcc Q lcl|NC_019705. 188 -----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI-------AGGPV 255 (424) Q Consensus 188 -----~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~-------~~~~~ 255 (424) .+...|.|.+......++....+.....+.....+.|-.+++-- .. ..+........+... ..... T Consensus 217 Pvv~~~n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 294 (489) T protein:vir:99 217 PVNEYANNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGN-AY-TGADENDYLDDGRLNPNGRLAISIGFK 294 (489) T ss_pred eEEEeecCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccC-Cc-ccccchhhhhhcccccccccccccccc Confidence 01123555555554444444333333333333333344333211 11 111111111111111 01112 Q ss_pred cCcceecCCCc-------eeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHH----------H Q lcl|NC_019705. 256 KKRLWILEAGF-------STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ----------Q 318 (424) Q Consensus 256 ag~~~~l~~g~-------~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~----------~ 318 (424) .++++.++.+. +.+-+.....+..+....+...+.|...-++|..-.... .++.++...+. . T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~ 373 (489) T protein:vir:99 295 KAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKF-SGVQSGESMKYKLMASDNYREK 373 (489) T ss_pred cceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccccc-cccchHHHHHHHHHHHHHHHHH Confidence 33444444332 223333333333445667788889999999985322111 12222222211 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccChhhh-cccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Q lcl|NC_019705. 319 NLGFLQYTLQPYISRWENSIQRWLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 319 ~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~g 397 (424) ....+...++-.++.+...+...-...... ....+.+.++.-+..|..+.++.+.++. |+++...+.++++.=.-++ T Consensus 374 k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~--giis~et~~~~l~~v~~~d 451 (489) T protein:vir:99 374 QERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY--GIVSDQTIFEILNTVTGVD 451 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhcCCCCchh Confidence 112223333333333333222111000000 0112334445556677888888888874 8899888887764411011 Q ss_pred CCe----e-------eecccccchhhccccCCCcccCC Q lcl|NC_019705. 398 GDV----A-------MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 gd~----~-------~~~~n~~~~~~~~~~~~~~~~ga 424 (424) .+. . ..........+..+++++.++.- T Consensus 452 ~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 452 AEAELKRLKEEADKKQSLPEPRLVGDASGQEEPTAEKP 489 (489) T ss_pred HHHHHHHHHHHHHHHhccccccccCCCCCCcCCCCCCC Confidence 100 0 00001111111111112111111 No 231 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=68.86 E-value=0.23 Score=24.07 Aligned_cols=400 Identities=15% Similarity=0.147 Sum_probs=174.1 Q ss_pred HHHHHhhcc-------CcccCCccccchh-----hccccccccCcc--c------c-cHHHHhccHHHHHHHHHHHHhhc Q lcl|NC_019705. 17 WARLQSWFV-------GGRLVTPNQGSQT-----GPVSAHGHLGDS--S------I-NDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 17 ~~~~~~~~~-------~~~~~~~~~~~~~-----~~~~~~~~~~~~--~------v-s~~~~~~~~~v~~~i~~ia~~ia 75 (424) +..|+++-- +.+.++|...... +.++......|. . | .-+..+.+|.|..||+.|.+.+. T Consensus 1 m~~lfgf~i~~~~~~~~~S~vpp~~~~~~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~Av~eIVneaI 80 (564) T protein:vir:10 1 MSQLFGFLINEKEGQKGQSPVPPNDEASVSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEVDSAIDEIVNEFV 80 (564) T ss_pred CcchhcceeeeeccCCCCCcccCCcCCChhhhhccccceeeecccccchhhHHHHHHHHHHHhhccchhhHHHHhhccee Confidence 333333311 1222222221111 111111112221 1 1 12556779999999999999855 Q ss_pred cC-----ceEEEEeccC-CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC---CceeEEE Q lcl|NC_019705. 76 CL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA---GDVISLL 146 (424) Q Consensus 76 ~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~---G~~~~l~ 146 (424) -. |+.|--.+.+ +............++|+. -|-...+ +.++..|...|..|..++-|.. .-+.+|. T Consensus 81 v~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~l-l~F~~~~----~e~fR~WYVDgRi~fHkiid~~~pk~GI~eLr 155 (564) T protein:vir:10 81 VNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRM-MNFNVNA----HEIIRNWYVDGRSHYHKVIDLDNPKKGILELR 155 (564) T ss_pred EecCCCceEEEEecccCcchHHHHHHHHHHHHHHHH-hccchhh----hHHHhhhhhcceEEEEEEeeCCChhhhhhhhh Confidence 32 3322111100 000000011122333321 1222223 4556777888999987755422 2388999 Q ss_pred EecCceeEEeec------Cc-----------------eEEEEEEeC-----------------CceEEecHhHEEEeec- Q lcl|NC_019705. 147 PLQSANMDVKLV------GK-----------------KVVYRYQRD-----------------SEYAEFSQKEIFHLKG- 185 (424) Q Consensus 147 ~l~~~~v~~~~~------~~-----------------~~~~~~~~~-----------------~~~~~~~~~eiih~r~- 185 (424) .|+|..|+..+. .. ..+|.|.+. .....++.+-|.+... T Consensus 156 ~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~daI~y~hSG 235 (564) T protein:vir:10 156 YIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGSMDWSNQEGIKIASDAIAQSTSG 235 (564) T ss_pred hhcccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCcccccccccccccccceeechhhcceeccc Confidence 999998764431 10 013334321 1235677778877753 Q ss_pred -CCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH-HHHHHHHHHHHHhC--------Cc- Q lcl|NC_019705. 186 -FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKEIAG--------GP- 254 (424) Q Consensus 186 -~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~-~~~~~~~~~~~~~~--------~~- 254 (424) .+.++-.-+|-+..+.+.+.....++....-+--.-+.-+-|+.++.+.+... +.+-++....++.. |. T Consensus 236 L~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev 315 (564) T protein:vir:10 236 LMDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEI 315 (564) T ss_pred ceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCcee Confidence 34455556677888888877777776666555444455566677776665543 33444544444321 10 Q ss_pred -ccCcce-ec----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-Ccccchh---H- Q lcl|NC_019705. 255 -VKKRLW-IL----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-STSWGSG---I- 315 (424) Q Consensus 255 -~ag~~~-~l----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~-~~~~~~n---~- 315 (424) +..+.+ .| ..|.++..|.- +..+| +-.++..+.+.++++||.+-|..... -+...++ - T Consensus 316 rddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem---~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRD 392 (564) T protein:vir:10 316 RDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGEL---KDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRD 392 (564) T ss_pred cccchhhhhHhhhcccccCCCcccceeeccccCCcchH---HHHHHHHHHHHHHhCCCcccccCCCceeecccccchhHH Confidence 111122 11 13556666543 33444 44566788999999999998876422 1211111 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccC-----hhhhcc--cch--hhhhhhh----hccC-HHHHHHHHHHHHh--CC Q lcl|NC_019705. 316 EQQNLGFLQYTLQPYISRWENSIQRWLIP-----AKDVGR--IHA--EHNLDGL----LRGD-SASRAAFMKAMGE--AG 379 (424) Q Consensus 316 e~~~~~~~~~tl~P~~~~ie~~l~~~l~~-----~~~~~~--~~~--~fd~~~l----~~~d-~~~~~~~~~~~~~--~g 379 (424) |-....|+..-=.-+...|.+-|...|+. +.+... ..+ .|..|.. .... ...|...+..+-. +- T Consensus 393 EiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGk 472 (564) T protein:vir:10 393 ELKFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGK 472 (564) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 11112222222223444445555555443 233221 222 2222221 1111 1223333332210 11 Q ss_pred CCCHH------------HHHHH---------hCC--CCC--CCCCeeee-cccccchhhcc------ccCCCcccCC Q lcl|NC_019705. 380 LRTIN------------EMRRT---------DNL--PPL--PGGDVAMR-QSQYVPITDLG------TNKEPRNNGA 424 (424) Q Consensus 380 ~~t~N------------E~R~~---------~g~--~p~--~ggd~~~~-~~n~~~~~~~~------~~~~~~~~ga 424 (424) .++.+ |+-++ .|+ +|. ..||..-+ +....|.+... +.+....++| T Consensus 473 y~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~a 549 (564) T protein:vir:10 473 YFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQNQAFAPELQAAQDDLAAEREIKKLNSA 549 (564) T ss_pred ccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCCCcCCcchhhhccccccccChhhhccC Confidence 22333 33211 122 221 12432222 22223332111 1111112222 No 232 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=64.48 E-value=0.3 Score=23.46 Aligned_cols=398 Identities=11% Similarity=0.085 Sum_probs=177.6 Q ss_pred CchHHHHHhh-----ccCc--ccCCccccchhh--------cccc---cc---ccCccc-----c-cHHHHhccHHHHHH Q lcl|NC_019705. 14 NGWWARLQSW-----FVGG--RLVTPNQGSQTG--------PVSA---HG---HLGDSS-----I-NDERILQISTVWRC 66 (424) Q Consensus 14 ~G~~~~~~~~-----~~~~--~~~~~~~~~~~~--------~~~~---~~---~~~~~~-----v-s~~~~~~~~~v~~~ 66 (424) .-+|.+.-.. .... +.++|.....+. +..+ .+ ...+.. | .-+..+.+|.|..| T Consensus 1 ~~~w~~~de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~A 80 (511) T protein:vir:56 1 MKFWTKEEEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVKELIKSYRALAEYHEVDDA 80 (511) T ss_pred CCCccchhhhhhhhhccCCcccccCCCCCCCceEEecccccceecceeccccccccCccchHHHHHHHHHHhhccchhhH Confidence 1222221111 1111 122222211110 0000 00 011110 1 13556779999999 Q ss_pred HHHHHHhhccC-----ceEEEEeccC-CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019705. 67 VSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 67 i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G 140 (424) |+.|.+.+.-. |+.+--.+-+ .............++|+. -|-...+ +.++..|...|..|..++-+... T Consensus 81 v~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~l-l~F~~~~----~~~fR~WYVDgRi~fHkiid~k~ 155 (511) T protein:vir:56 81 IQEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSL-LQMRKHG----YKWFRKWYVDSRIYFHKILDKDN 155 (511) T ss_pred HHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHH-hccchhh----hHHHhhhhhcceEEEEEEecccc Confidence 99999987643 3333211110 000000111223333321 1112222 45567778889999888777666 Q ss_pred ceeEEEEecCceeEEeec-------C------ceEEEEEEeC--------------CceEEecHhHEEEeec--C--CCC Q lcl|NC_019705. 141 DVISLLPLQSANMDVKLV-------G------KKVVYRYQRD--------------SEYAEFSQKEIFHLKG--F--GFT 189 (424) Q Consensus 141 ~~~~l~~l~~~~v~~~~~-------~------~~~~~~~~~~--------------~~~~~~~~~eiih~r~--~--~~~ 189 (424) -+.+|..|+|.+|+..+. + -.-+|.|.+. .....++.+.|.|... . ..+ T Consensus 156 GI~eLr~lDPr~i~~vr~i~~~~~~~~~v~~~~~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~daI~y~hSGL~d~~~~ 235 (511) T protein:vir:56 156 NIIELRPLNPMKMELVREIQKETIDGVEVVKGTLEYYVYKQSDYKMPSWMSATNRAQTSFRIPKDAIVFAHSGLMRGCAD 235 (511) T ss_pred ceeehhhcCcccchhhhhhhcccccccccccceeeeeEecCCCcccCcccccccccccceeechhheeeecccceeccCC Confidence 789999999999875431 1 1123444321 1336788999966652 2 245 Q ss_pred CcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH-HHHHHHHHHHHHhC--------Cc--ccCc Q lcl|NC_019705. 190 GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKEIAG--------GP--VKKR 258 (424) Q Consensus 190 ~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~-~~~~~~~~~~~~~~--------~~--~ag~ 258 (424) .-+.+|-+..|.+.+.....++....-+--.-+.-+-|+.++.+.++.. +.+-++....++.. |. +..+ T Consensus 236 ~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk 315 (511) T protein:vir:56 236 DPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTN 315 (511) T ss_pred CCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchh Confidence 5678888999988888777777766655555555566777777765543 34444554444321 11 1112 Q ss_pred ceec-----------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-Ccccc-hhHH-----HHHH Q lcl|NC_019705. 259 LWIL-----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-STSWG-SGIE-----QQNL 320 (424) Q Consensus 259 ~~~l-----------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~-~~~~~-~n~e-----~~~~ 320 (424) .+.+ ..|.++..|.-. ..+.-++-.++..+.+..+++||.+-|...++ +..+. ..+| -... T Consensus 316 ~msMlEDyWLpRReGgrgTEItTLpGg-qnlgem~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~ 394 (511) T protein:vir:56 316 AMSMLEDYYLPRREGSKGTEVSTLPGG-QSLGDIEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKFT 394 (511) T ss_pred hhhhHhhhcccccCCCCccceeecccc-CCcChHHHHHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHHH Confidence 2211 135566655432 22233445667788999999999998875432 22221 1111 1111 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccC-----hhhhcc--cch--hhhhhhh----hccC-HHHHHHHHHHHHh--CCCCCHH Q lcl|NC_019705. 321 GFLQYTLQPYISRWENSIQRWLIP-----AKDVGR--IHA--EHNLDGL----LRGD-SASRAAFMKAMGE--AGLRTIN 384 (424) Q Consensus 321 ~~~~~tl~P~~~~ie~~l~~~l~~-----~~~~~~--~~~--~fd~~~l----~~~d-~~~~~~~~~~~~~--~g~~t~N 384 (424) .|+..-=.-+...+.+.|..+|+. +.+... ..+ .|..|.. .... ...|...+..+-. +-.++.+ T Consensus 395 KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~ 474 (511) T protein:vir:56 395 KFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHK 474 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchH Confidence 222222223444445555555443 333221 222 2222222 1111 1223333332211 1244555 Q ss_pred HHHH-HhCCCCCC--CCCeeeecccccchhhc-cccCCCcccC Q lcl|NC_019705. 385 EMRR-TDNLPPLP--GGDVAMRQSQYVPITDL-GTNKEPRNNG 423 (424) Q Consensus 385 E~R~-~~g~~p~~--ggd~~~~~~n~~~~~~~-~~~~~~~~~g 423 (424) =+++ .|.+.-.+ .-++.. .-+.. +--+.+.++= T Consensus 475 yi~k~ILr~tDeei~~~~k~I------~~E~k~~~~~~~e~~f 511 (511) T protein:vir:56 475 YIQKNILRLSDDQITAMQSEI------DEEETNPRFQQDDQGF 511 (511) T ss_pred HHHHHHhccCHHHHHHHHHHH------HHhhcCCCCCCcccCC Confidence 5554 24432110 000000 00000 0000111111 No 233 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=57.87 E-value=0.42 Score=22.62 Aligned_cols=388 Identities=15% Similarity=0.065 Sum_probs=143.9 Q ss_pred cCCCCCchHHHHHhhccCcccCCccccchhhc-----------------------cc-----------cccccCcccc-- Q lcl|NC_019705. 9 DLRTNNGWWARLQSWFVGGRLVTPNQGSQTGP-----------------------VS-----------AHGHLGDSSI-- 52 (424) Q Consensus 9 ~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~-----------~~~~~~~~~v-- 52 (424) --|-+.-+-++..+.--.+..++|.+.....+ +. +......... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~ 80 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLIPPQAPPTSGLGPSLPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDE 80 (535) T ss_pred CCcchhhhhhhhhhhcccCCCCcCCCCCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCc Confidence 11222233333333322222222211110000 00 0000010000 Q ss_pred -c---HHHHhcc----HHHHHHHHHHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHH Q lcl|NC_019705. 53 -N---DERILQI----STVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQL 124 (424) Q Consensus 53 -s---~~~~~~~----~~v~~~i~~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ 124 (424) + .+.+++- +++...++.++..+-+-|..+ .....+..++..---...+-.+|.+.++... T Consensus 81 E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~------------~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~ 148 (535) T protein:vir:80 81 EQRRRYETYLQRAIFYNVTARTLDGMMGQVFSRDPIR------------QLPPALEAIVEDIDGEGVSLDQQAKKALGYT 148 (535) T ss_pred CCHHHHHHHHhhccCCChhHHHHHHHhchhhcCCcce------------eccHHHHHHHhccCCCCCCHHHHHHHHHHHH Confidence 0 1222333 334444444444444333222 1123355555433335668899999999999 Q ss_pred HHcCCeEEEEeeCCCCce------------eEEEEecCceeE----------------------EeecCc---eE--EEE Q lcl|NC_019705. 125 CFYGNAYALVDRNSAGDV------------ISLLPLQSANMD----------------------VKLVGK---KV--VYR 165 (424) Q Consensus 125 ll~G~a~~~~~r~~~G~~------------~~l~~l~~~~v~----------------------~~~~~~---~~--~~~ 165 (424) +.+|-+++++.....|.. =.+..+.|..|. ...+++ .. .|+ T Consensus 149 l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~R 228 (535) T protein:vir:80 149 MGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWR 228 (535) T ss_pred HhcCeEEEEEeecCCCCcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecCCCcccceeEEEE Confidence 999999999976544421 012222222210 011211 00 011 Q ss_pred --------------EEeCCce-------EEecHh------HEEEeec---CCCCCcccCchHHHHHHH-HHHHHHHHHHH Q lcl|NC_019705. 166 --------------YQRDSEY-------AEFSQK------EIFHLKG---FGFTGLVGLSPIAFACKS-AGVAVAMEDQQ 214 (424) Q Consensus 166 --------------~~~~~~~-------~~~~~~------eiih~r~---~~~~~~~G~s~i~~~~~~-i~~~~~~~~~~ 214 (424) |...+.. ..++.+ ..|=|-. .+.+...|.+|+..++.. +...+.... . T Consensus 229 vL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd-~ 307 (535) T protein:vir:80 229 VLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNGNPFKEIPFQFIGPLDNNADIDHPPLLDLCEVNIGHYRNSAD-Y 307 (535) T ss_pred EEEecCCceEEEEEEEeecCCccccccceeecccCCCcccCeeEEEEeecCCCCCCCCccchHHHHHHHHHHhhchhH-H Confidence 1110000 011111 1111111 122335677887766544 333333333 3 Q ss_pred HHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCcee--eecccChhHHHHHHHHHHHHHHHH Q lcl|NC_019705. 215 RDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFST--SAIGVTPQDAEMMASRKFQVSELA 292 (424) Q Consensus 215 ~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~--~~l~~~~~d~~~~e~~~~~~~~Ia 292 (424) ...+...+.|-.+++-......+. ..+...-..| . ...+.++.+.++ .+++-+.-..+.++.++.....+. T Consensus 308 ~~il~~~~~P~l~i~G~~~~~~~~----~~~~~~i~iG-~--~~~~~lP~~~~~~~~e~~~~~~a~~~l~~~e~qM~~lG 380 (535) T protein:vir:80 308 EEMAFVAGQPTAFFTGLTKDWVED----VFKDFKVHLG-S--RAIIPLPQGATAGILQITPNSVPFEAMTHKESQMIAMG 380 (535) T ss_pred HHHHHHhcCceeeeecCchhhhhc----CCCCcceEec-C--cccccCCCCCCcceeeeccchhHHHHHHHHHHHHHHHH Confidence 333445667777766322211110 0000001122 2 235667665544 444433333333333332222222 Q ss_pred HHhCCCHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChh-------hhcccchhhhhhhhhccCH Q lcl|NC_019705. 293 RFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-------DVGRIHAEHNLDGLLRGDS 365 (424) Q Consensus 293 ~~fgVPp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~-------~~~~~~~~fd~~~l~~~d~ 365 (424) +.+-+. ....-+...+ .....--...|.-++..+|+.++.-|---- ......+..+.+=...... T Consensus 381 a~ll~~-----~~~~~Ta~~a---~~~~~~~~S~L~~~a~~le~al~~aL~~~A~w~G~~~~~~~~~i~~n~dF~~~~ld 452 (535) T protein:vir:80 381 ANLLVK-----SGGNRTFGEA---QQEEASEQSILSACTKNVSMAFRKALRWANQFQTGIVNDETVEYNLNTDFPAARLT 452 (535) T ss_pred HHhhcc-----CcccccHHHH---HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCceEEEeccccccccCC Confidence 222121 1111111111 111122245566777777777764332111 1011112222221122222 Q ss_pred HHHHHHHHHHHhCCCCCHHHHHHHhCCCCC-----CCC-----------CeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 366 ASRAAFMKAMGEAGLRTINEMRRTDNLPPL-----PGG-----------DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 366 ~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~-----~gg-----------d~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) ....+.+.++++.|.++....++.+..-.+ ++- +.....+-.......+++..+.+||. T Consensus 453 ~~~~~all~~~~~G~Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~~~~ 527 (535) T protein:vir:80 453 PNERAELILEWQQGAITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAKLNNGN 527 (535) T ss_pred HHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCcccCCc Confidence 335566667888888888888776633211 111 11111110011111122222222222 No 234 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=56.95 E-value=0.44 Score=22.51 Aligned_cols=397 Identities=10% Similarity=0.054 Sum_probs=171.0 Q ss_pred ccCCCCC---chHHHHHhh-----cc--CcccCCccccchhhcc--c---cc---c---c--cC---cccc-------cH Q lcl|NC_019705. 8 IDLRTNN---GWWARLQSW-----FV--GGRLVTPNQGSQTGPV--S---AH---G---H--LG---DSSI-------ND 54 (424) Q Consensus 8 ~~~~~~~---G~~~~~~~~-----~~--~~~~~~~~~~~~~~~~--~---~~---~---~--~~---~~~v-------s~ 54 (424) |.+ +-. |||.+--.. .. ..+.++|........+ . +. + + .. +..- .- T Consensus 1 m~f-~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~Y 79 (523) T protein:vir:68 1 MKF-NILSLFAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTY 79 (523) T ss_pred CCC-chhhhhhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHH Confidence 666 111 333321110 00 0122233222211111 0 10 0 0 01 1001 12 Q ss_pred HHHhccHHHHHHHHHHHHhhccC-----ceEEEEecc-CCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcC Q lcl|NC_019705. 55 ERILQISTVWRCVSLISTLTACL-----PLDVFETDQ-NDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYG 128 (424) Q Consensus 55 ~~~~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~-~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G 128 (424) +..+.+|.|..||+.|.+.+.-. |+.+--.+- -.............++|+. ++...--+.++..|...| T Consensus 80 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~l-----l~F~~~~~~~fR~WYVDg 154 (523) T protein:vir:68 80 RNLMTNYEVDNAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNH-----LSFQRKGSDHFRRWYVDS 154 (523) T ss_pred HHHhhccchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-----hccchhhhHHHHhheeee Confidence 55677999999999999987643 222211110 0000000111223333321 222222245567788889 Q ss_pred CeEEEEeeCC---CCceeEEEEecCceeEEeec-----Cc--------eEEEEEEeC-------------CceEEecHhH Q lcl|NC_019705. 129 NAYALVDRNS---AGDVISLLPLQSANMDVKLV-----GK--------KVVYRYQRD-------------SEYAEFSQKE 179 (424) Q Consensus 129 ~a~~~~~r~~---~G~~~~l~~l~~~~v~~~~~-----~~--------~~~~~~~~~-------------~~~~~~~~~e 179 (424) ..|..++-|. ..-+.+|..|+|.+|+..+. .. ..+|.|... +....++.+- T Consensus 155 Ri~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~g~~vi~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dA 234 (523) T protein:vir:68 155 RIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREVITTTEAGVKIVKGYKEYFIYDTSHESYACDGRIYEAGTKIKIPKAA 234 (523) T ss_pred EEEEEEEeeCCCccccceeeeeeCCcceeEEEeecCCCCcchhhhhhhhhheeeccccccccccccccCCCcceecchhh Confidence 9998776543 23589999999999975331 11 112334322 2334566666 Q ss_pred EEEeec--CCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH-HHHHHHHHHHHHhC---- Q lcl|NC_019705. 180 IFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKEIAG---- 252 (424) Q Consensus 180 iih~r~--~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~-~~~~~~~~~~~~~~---- 252 (424) |.+... .+.++-.-+|-+..|.+.+.....++....-+--.-+.-+-|+.++.+.++.. +.+-++....++.. T Consensus 235 I~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvY 314 (523) T protein:vir:68 235 IVYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAY 314 (523) T ss_pred eeeeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEE Confidence 655541 23444455667888877777777666666555444555566777777665543 33444554444321 Q ss_pred ----Cc--ccCcceec-----------CCCceeeeccc--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-Ccccc Q lcl|NC_019705. 253 ----GP--VKKRLWIL-----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-STSWG 312 (424) Q Consensus 253 ----~~--~ag~~~~l-----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~-~~~~~ 312 (424) |. +..+.+.+ ..|.++..|.- +.-+ ++-.++..+.+.++++||.+-|....+ -+... T Consensus 315 Da~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlge---m~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr 391 (523) T protein:vir:68 315 DATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGN---MEDVRWFRNALYMALRIPITRIPSDQGGIQFDA 391 (523) T ss_pred eccCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcCh---HHHHHHHHHHHHHHhCCcceeecCCCcceeccc Confidence 11 11122211 13556665543 3334 445667788999999999988854322 12111 Q ss_pred hh----HHHHHHHHHHHHHHHHHHHHHHHHHhhccC-----hhhhcc--cch--hhhhhhh----hccC-HHHHHHHHHH Q lcl|NC_019705. 313 SG----IEQQNLGFLQYTLQPYISRWENSIQRWLIP-----AKDVGR--IHA--EHNLDGL----LRGD-SASRAAFMKA 374 (424) Q Consensus 313 ~n----~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~-----~~~~~~--~~~--~fd~~~l----~~~d-~~~~~~~~~~ 374 (424) ++ =|-....|+..-=.-+...+.+-|..+|+. +.+... ..+ .|..|.. .... ...|...+.. T Consensus 392 ~~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~ 471 (523) T protein:vir:68 392 GTSITRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQM 471 (523) T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHH Confidence 11 111111222222223444445555555443 333221 222 2222222 1111 1223333333 Q ss_pred HHh--CCCCCHHHHHH-HhCCCCCC----------CCCeeeecccccchhhccccCCCcccC Q lcl|NC_019705. 375 MGE--AGLRTINEMRR-TDNLPPLP----------GGDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 375 ~~~--~g~~t~NE~R~-~~g~~p~~----------ggd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) +-. +-.++.+=+|+ .|.+.-.+ +.++...+ +.++..++= T Consensus 472 ~dpyvGky~s~~yi~k~ILr~tDeei~~~~kqI~~E~k~~~~~----------~p~~e~~~f 523 (523) T protein:vir:68 472 AEPFIGKYISHRTAMKDILQMSDEEIEQEAKQIEEESKEARFQ----------DPDQEQEDF 523 (523) T ss_pred hhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCC----------CCchhhhcC Confidence 211 12345544443 23332110 00000000 000000000 No 235 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=53.95 E-value=0.52 Score=22.16 Aligned_cols=378 Identities=10% Similarity=-0.002 Sum_probs=157.1 Q ss_pred CCCCc---ccccCCCCCchHHHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhccC Q lcl|NC_019705. 1 MEEPK---YTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACL 77 (424) Q Consensus 1 ~~~~~---~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~~ 77 (424) |.... +-=....++.-+.++.....+........... ..........+..=+.++-..-+|+..++-+-+- T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~------~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~ 74 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVV------QNRDENPLRNADNRISHNFHEILVDEKASYMFTY 74 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccc------cccccccccccccccccchHHHHHHhhhhheecc Confidence 10000 00001122333444444443321110000000 0000000000011122455666888888888788 Q ss_pred ceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC-------ceeEEEEecC Q lcl|NC_019705. 78 PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG-------DVISLLPLQS 150 (424) Q Consensus 78 ~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G-------~~~~l~~l~~ 150 (424) |+... ...+... ...+...+ . | ........+..+++.+|.||.++.++.+. ....+..++| T Consensus 75 p~~~~-~~~~~~~-----~~~~~~~~--~-n---~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p 142 (451) T protein:vir:10 75 PVLFD-IDNNKEL-----NEKVTDVL--G-N---EFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNT 142 (451) T ss_pred cceee-cCCcHHH-----HHHHHHHh--c-c---CHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcc Confidence 87642 2111110 01122222 1 2 23466677889999999999988777642 1234667788 Q ss_pred ceeEEeecCce---E-----EEEEEeC--C----c----eEEecHhHEEEeecCC----------------C-------- Q lcl|NC_019705. 151 ANMDVKLVGKK---V-----VYRYQRD--S----E----YAEFSQKEIFHLKGFG----------------F-------- 188 (424) Q Consensus 151 ~~v~~~~~~~~---~-----~~~~~~~--~----~----~~~~~~~eiih~r~~~----------------~-------- 188 (424) ..+.+..++.. . +|....+ + . ...+.++.+.+.+... + T Consensus 143 ~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 222 (451) T protein:vir:10 143 EEIIPIYRNGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVEF 222 (451) T ss_pred cceEEEEcCCCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEEe Confidence 88776654321 1 1110000 0 0 0112333443332100 0 Q ss_pred -CCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC---- Q lcl|NC_019705. 189 -TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE---- 263 (424) Q Consensus 189 -~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~---- 263 (424) +...|.|-++.....++....+.....+.+...+.|-.+++--....+++....+ . ..+++.++ T Consensus 223 ~nn~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~----~-------~~~~i~~~~~~~ 291 (451) T protein:vir:10 223 SNNIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKEL----K-------RYKTIKTETDSE 291 (451) T ss_pred ccCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHHH----h-------hCCeEEecCcCC Confidence 1223566666666666655555444444455555565555422111122211111 1 11233332 Q ss_pred ---CCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHH----------HHHHHHHHHH Q lcl|NC_019705. 264 ---AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL----------GFLQYTLQPY 330 (424) Q Consensus 264 ---~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~----------~~~~~tl~P~ 330 (424) ++++|..-.. ....+.+..+...+.|...-++|. +....-++.++....-... ..+...++-. T Consensus 292 ~~~~~~~~l~~~~--~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~ 367 (451) T protein:vir:10 292 GDSGGLKTMQIEI--PTEARKIILEILKKQIYESGQGLQ--QDTENFGNASGVALKFFYRKLELKSGLLETEFRTSFDKL 367 (451) T ss_pred ccCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCccc--ccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2344443333 334456678888899999999984 2221112333222111111 1112222222 Q ss_pred HHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCeeeecccc--- Q lcl|NC_019705. 331 ISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQY--- 407 (424) Q Consensus 331 ~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~--- 407 (424) ++.+...+ ...+.. .+.+.++.-+..|..+.++.+.++. |+++..-+.++++.-.-+ ++...-..- T Consensus 368 ~~li~~~~-----~~~d~~--~i~i~f~~~~p~n~~e~~~~~~kl~--g~iS~et~~~~~p~v~d~--~~e~~~~~ee~~ 436 (451) T protein:vir:10 368 IKAILYFL-----GVTDYK--KIQQTYTRNMMSNDLEDADIATKSV--GIIPTKIILRHHPWVDDV--EEAEKLYLEEKK 436 (451) T ss_pred HHHHHHHh-----CCCCcc--ceeEEecCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCH--HHHHHHHHHHHH Confidence 22222221 111212 2233334556678888899888885 789888887777653211 111100000 Q ss_pred cchhhccccCCCccc Q lcl|NC_019705. 408 VPITDLGTNKEPRNN 422 (424) Q Consensus 408 ~~~~~~~~~~~~~~~ 422 (424) .......+.-.+-++ T Consensus 437 ~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 437 IQASKVSDDYNNFTE 451 (451) T ss_pred HHHHHHHhhcCCCCC Confidence 000000000000111 No 236 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=49.36 E-value=0.64 Score=21.64 Aligned_cols=404 Identities=11% Similarity=0.105 Sum_probs=177.0 Q ss_pred ccCCCCCchHHHHHhh-----cc--CcccCCccccchhhcc---------ccc-ccc---Cccc------c-cHHHHhcc Q lcl|NC_019705. 8 IDLRTNNGWWARLQSW-----FV--GGRLVTPNQGSQTGPV---------SAH-GHL---GDSS------I-NDERILQI 60 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~-----~~--~~~~~~~~~~~~~~~~---------~~~-~~~---~~~~------v-s~~~~~~~ 60 (424) |.+-.=.|||.+.-.. .. ..+.++|.....+..+ ++. .+. .+.. | +-+..+.+ T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYRQLTNN 80 (516) T ss_pred CCchHhcccccchhhHHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHHHHHHHhhhc Confidence 4444335665442221 11 1222233222111111 000 000 0000 1 23566779 Q ss_pred HHHHHHHHHHHHhhccC-----ceEEEEeccC-CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEE Q lcl|NC_019705. 61 STVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV 134 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~ 134 (424) |.|..||+.|.+.+.-+ |+.+--.+-+ ...-.........++|+. -+-...+ +.++..|...|..|..+ T Consensus 81 pEvd~Av~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~l-l~F~~~~----~~~fR~WYVDgRi~fhK 155 (516) T protein:vir:10 81 PEVERAVANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRL-LDASRKL----DTLFRRWYIDSRIFFHK 155 (516) T ss_pred cchhHHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-hccchhh----hHHHHhhhhcceEEEEE Confidence 99999999999987643 3333110000 000000011122223321 1112222 44567778889988875 Q ss_pred ee-CCCCceeEEEEecCceeEEeec-----Cc--------eEEEEEEeCC-------------ceEEecHhHEEEee--c Q lcl|NC_019705. 135 DR-NSAGDVISLLPLQSANMDVKLV-----GK--------KVVYRYQRDS-------------EYAEFSQKEIFHLK--G 185 (424) Q Consensus 135 ~r-~~~G~~~~l~~l~~~~v~~~~~-----~~--------~~~~~~~~~~-------------~~~~~~~~eiih~r--~ 185 (424) +- +...-+.+|..|+|.+|+..+. .+ ..+|.|..+. ....++.+-|.... . T Consensus 156 iid~~k~GI~elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~daI~y~hSGl 235 (516) T protein:vir:10 156 IMPNPKEGIVELRRLDPRHVEYYREIVTSDVGGTSVVKGYREFFVYTTGNEGYAYNGRLFEPNTRIKIPRSAIVYAHSGL 235 (516) T ss_pred EecCcccceeeeeeeCCcceeeEEeeecccCcchhhhhceeeeeeeecCccceeccccccCCCCceecchhheeeeecCc Confidence 43 4445689999999999875431 11 1123332211 22344444443332 1 Q ss_pred CCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHH-HHHHHHHHHHHHhC--------Cc-- Q lcl|NC_019705. 186 FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKEIAG--------GP-- 254 (424) Q Consensus 186 ~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~-~~~~~~~~~~~~~~--------~~-- 254 (424) ...++-.=+|-+..|.+.+.....++....-+--.-+.-+-|+.++.+.++.. +.+-++....++.. |. T Consensus 236 ~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev~ 315 (516) T protein:vir:10 236 QDCSDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTVK 315 (516) T ss_pred ccCCCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeec Confidence 22333333667777777777776666666555444455566677776665543 33444544444321 10 Q ss_pred ccCcceec-----------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccc-chhHHHHH--- Q lcl|NC_019705. 255 VKKRLWIL-----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSW-GSGIEQQN--- 319 (424) Q Consensus 255 ~ag~~~~l-----------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~-~~n~e~~~--- 319 (424) +..+.+.+ ..|.+++.|.-. ..+.-++-.++..+.+.++++||.+-|...++.+.. +.++|=.+ T Consensus 316 ddrk~msMlEDyWLpRReGgrgTEItTLpGg-qnlgem~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEi 394 (516) T protein:vir:10 316 NQKRNLSMTEDYWLMRRDGKSVTEVTSLPGA-QTMGEMDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDEL 394 (516) T ss_pred cchhhhhhHhhhcccccCCCcccceeecccc-CCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHH Confidence 11122211 135566655432 222234456677889999999999988765443321 12222211 Q ss_pred --HHHHHHHHH-HHHHHHHHHHHhhcc-----Chhhhcc--cchhhh--hhhh----hccC-HHHHHHHHHHHH--hCCC Q lcl|NC_019705. 320 --LGFLQYTLQ-PYISRWENSIQRWLI-----PAKDVGR--IHAEHN--LDGL----LRGD-SASRAAFMKAMG--EAGL 380 (424) Q Consensus 320 --~~~~~~tl~-P~~~~ie~~l~~~l~-----~~~~~~~--~~~~fd--~~~l----~~~d-~~~~~~~~~~~~--~~g~ 380 (424) ..|+ ..|+ -+...|.+.|..+|+ ++.+... ..+.|+ .|.. .... ...|...+..+- -+.+ T Consensus 395 KF~KFI-~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky 473 (516) T protein:vir:10 395 DFRKFI-VQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKY 473 (516) T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 1222 2233 344445556666544 3333321 122222 2222 1111 123444444332 2357 Q ss_pred CCHHHHHH-HhCCCCCC--CCCeeeecccccchh-hccccCCCcccCC Q lcl|NC_019705. 381 RTINEMRR-TDNLPPLP--GGDVAMRQSQYVPIT-DLGTNKEPRNNGA 424 (424) Q Consensus 381 ~t~NE~R~-~~g~~p~~--ggd~~~~~~n~~~~~-~~~~~~~~~~~ga 424 (424) ++.+=+|+ .|.+.-.+ .-++.. .-+ +.+--++|.+..- T Consensus 474 ~s~~yi~k~ILr~tDeei~~~~k~I------~~E~~~~~~~~p~~e~~ 515 (516) T protein:vir:10 474 VSHDYVMKNILQMTDEQIAQEEKQI------EKEANVKRFQNPENEDD 515 (516) T ss_pred cchHHHHHHHhcCCHhHHHHHHHHH------HHhhhCCCCCCCCcccc Confidence 77777765 35553211 000000 000 0000011111111 No 237 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=43.63 E-value=0.84 Score=21.01 Aligned_cols=402 Identities=11% Similarity=0.065 Sum_probs=175.6 Q ss_pred CCCCCchHHHHHhhc--------cC--cccCCccccchhhcc--------ccccccCcccc--------------cHHHH Q lcl|NC_019705. 10 LRTNNGWWARLQSWF--------VG--GRLVTPNQGSQTGPV--------SAHGHLGDSSI--------------NDERI 57 (424) Q Consensus 10 ~~~~~G~~~~~~~~~--------~~--~~~~~~~~~~~~~~~--------~~~~~~~~~~v--------------s~~~~ 57 (424) .-++.-+|+.+.+.- .. .+.++|.....+..+ ...++..+..+ .-+.. T Consensus 1 ~~~~l~~~~~~~~~d~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~m 80 (521) T protein:vir:65 1 MFSRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGL 80 (521) T ss_pred CccchhhhhhccCchhhHHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHH Confidence 233333333322210 00 111222222111111 11111111111 12456 Q ss_pred hccHHHHHHHHHHHHhhccC-----ceEEEEecc-CCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeE Q lcl|NC_019705. 58 LQISTVWRCVSLISTLTACL-----PLDVFETDQ-NDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAY 131 (424) Q Consensus 58 ~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~-~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~ 131 (424) +.+|.|..||+.|.+.+.-. |+.+--.+- -.............++|+. -|-...+ +.++..|...|..| T Consensus 81 a~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~l-l~F~~~~----~~~fR~WYVDgRi~ 155 (521) T protein:vir:65 81 MNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNT-IQFDRRG----QDMFRRWYVDSRIF 155 (521) T ss_pred hhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-hccchhh----hHHHhhhhhcceeE Confidence 77999999999999987643 333211110 0000000111222333321 1112222 45567778889999 Q ss_pred EEEeeCCC--CceeEEEEecCceeEEeec-------Cc------eEEEEEEeC-------------CceEEecHhHEEEe Q lcl|NC_019705. 132 ALVDRNSA--GDVISLLPLQSANMDVKLV-------GK------KVVYRYQRD-------------SEYAEFSQKEIFHL 183 (424) Q Consensus 132 ~~~~r~~~--G~~~~l~~l~~~~v~~~~~-------~~------~~~~~~~~~-------------~~~~~~~~~eiih~ 183 (424) ..++-+.+ .-+.+|..|+|.+|+.++. +. ..+|.|..+ +....++.+-|... T Consensus 156 fhkiid~~pk~GI~ELr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~ 235 (521) T protein:vir:65 156 FHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYA 235 (521) T ss_pred EEEEEcCCccccceeeeeeCCcceeeeeeecccccCCcceecceeeeeeeecCCcceeccceeecCCcceeechhheeee Confidence 98875443 4689999999999985442 11 112334322 12244555555433 Q ss_pred e--cCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCH-HHHHHHHHHHHHHhC-------- Q lcl|NC_019705. 184 K--GFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE-QQRSQVEENFKEIAG-------- 252 (424) Q Consensus 184 r--~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~-~~~~~~~~~~~~~~~-------- 252 (424) . ....++-.-+|-+..|.+.+.....++....-+--.-|.-+-|+.++.+.++. .+.+-++....++.. T Consensus 236 hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~T 315 (521) T protein:vir:65 236 HSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDAST 315 (521) T ss_pred eccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccc Confidence 3 12334445567788888888777777766655544555556677777776554 334445555554432 Q ss_pred Cc--ccCcce-ec----------CCCceeeecc--cChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccch-hHH Q lcl|NC_019705. 253 GP--VKKRLW-IL----------EAGFSTSAIG--VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGS-GIE 316 (424) Q Consensus 253 ~~--~ag~~~-~l----------~~g~~~~~l~--~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~-n~e 316 (424) |. +..+.+ .| ..|.++..|. .+.-+| +-.++..+.+.++++||.+-++..+++..+.. .+| T Consensus 316 Gev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~E 392 (521) T protein:vir:65 316 GKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDI---DDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSE 392 (521) T ss_pred ccccccccccchhhhhcccccCCCCccceeecccCCCcChH---HHHHHHHHHHHHHhCCCceeccCCCCcceeccccch Confidence 21 111222 22 1355666654 344444 45566788999999999988755443332211 111 Q ss_pred ----H-HHHHHHHHHHHHHHHHHHHHHHhhccC-----hhhhc----ccchhhhhhhh----hccC-HHHHHHHHHHHHh Q lcl|NC_019705. 317 ----Q-QNLGFLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGL----LRGD-SASRAAFMKAMGE 377 (424) Q Consensus 317 ----~-~~~~~~~~tl~P~~~~ie~~l~~~l~~-----~~~~~----~~~~~fd~~~l----~~~d-~~~~~~~~~~~~~ 377 (424) + ....|+..-=.-+...+.+.|..+|+. +.+.. ...+.|..|.. .... ...|...+..+-. T Consensus 393 ItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dp 472 (521) T protein:vir:65 393 ITRDELEFSKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITP 472 (521) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhh Confidence 1 111222222223444445555555443 33321 12222222222 1111 1223333333221 Q ss_pred --CCCCCHHHHHH-HhCCCCCC--CCCeeeecccccchhhccccCCCc-ccCC Q lcl|NC_019705. 378 --AGLRTINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTNKEPR-NNGA 424 (424) Q Consensus 378 --~g~~t~NE~R~-~~g~~p~~--ggd~~~~~~n~~~~~~~~~~~~~~-~~ga 424 (424) +-.++.+=+|+ .|.+.-.+ .-++..-. .. ..+--+++. +-+. T Consensus 473 yvGky~S~dyi~k~ILr~tDeei~~~~k~I~~----E~-~~~~~~~p~~~~~~ 520 (521) T protein:vir:65 473 YIGKYFSNQTVMRDILKYTDDQMDTEKKQIEE----EA-NDPRFKQTPDEIED 520 (521) T ss_pred hhccccchHHHHHHHhccCHHHHHHHHHHHHH----hh-hCCCCCCCcccccC Confidence 22445555554 24432210 00000000 00 000000000 0011 No 238 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=41.64 E-value=0.92 Score=20.79 Aligned_cols=402 Identities=11% Similarity=0.068 Sum_probs=174.5 Q ss_pred CCCCCchHHHHHhhccC----------cccCCccccchh--------hccccccccCcccc--------------cHHHH Q lcl|NC_019705. 10 LRTNNGWWARLQSWFVG----------GRLVTPNQGSQT--------GPVSAHGHLGDSSI--------------NDERI 57 (424) Q Consensus 10 ~~~~~G~~~~~~~~~~~----------~~~~~~~~~~~~--------~~~~~~~~~~~~~v--------------s~~~~ 57 (424) .-+..-+|..+.+.--. .+.++|...... .+....+...+..+ .-+.. T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~m 80 (521) T protein:vir:81 1 MFSRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGL 80 (521) T ss_pred CcchhhhhHhhcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHHHH Confidence 22333333333221100 111122211111 01111111111111 12556 Q ss_pred hccHHHHHHHHHHHHhhccC-----ceEEEEecc-CCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeE Q lcl|NC_019705. 58 LQISTVWRCVSLISTLTACL-----PLDVFETDQ-NDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAY 131 (424) Q Consensus 58 ~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~-~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~ 131 (424) +.+|.|..||+.|.+.+.-. |+.+--.+- -.............++|+. ++...--+.++..|...|..| T Consensus 81 a~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~l-----l~F~~~~~~~fR~WYVDgRi~ 155 (521) T protein:vir:81 81 MNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNT-----IQFDRRGQDMFRRWYVDSRIF 155 (521) T ss_pred hhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-----hccchhhhHHHhhhhhcceEE Confidence 77999999999999987643 333211110 0000000111122333321 111222245567788889999 Q ss_pred EEEeeCCC--CceeEEEEecCceeEEeec-------Cc------eEEEEEEeC-------------CceEEecHhHEEEe Q lcl|NC_019705. 132 ALVDRNSA--GDVISLLPLQSANMDVKLV-------GK------KVVYRYQRD-------------SEYAEFSQKEIFHL 183 (424) Q Consensus 132 ~~~~r~~~--G~~~~l~~l~~~~v~~~~~-------~~------~~~~~~~~~-------------~~~~~~~~~eiih~ 183 (424) ..++-+.+ .-+.+|..|+|.+|+.++. +. ..+|.|.++ +....++.+-|... T Consensus 156 fhkiid~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~ 235 (521) T protein:vir:81 156 FHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYA 235 (521) T ss_pred EEEEEcCCccccceeeeeeCCcceeeeeeecccccCccceecceeeeeeeecCCccccccceeecCCcceeechhheeee Confidence 98875443 4589999999999875442 10 112344332 12234555555433 Q ss_pred e--cCCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCH-HHHHHHHHHHHHHhC-------- Q lcl|NC_019705. 184 K--GFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE-QQRSQVEENFKEIAG-------- 252 (424) Q Consensus 184 r--~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~-~~~~~~~~~~~~~~~-------- 252 (424) . ..+.++-.-+|-+..|.+.+.....++....-+--.-|.-+-|+.++.+.++. .+.+-++....++.. T Consensus 236 hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~T 315 (521) T protein:vir:81 236 HSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDAST 315 (521) T ss_pred eccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccc Confidence 3 12334444567788888888777777766655544555556677777776554 334445555554432 Q ss_pred Cc--ccCcce-ec----------CCCceeeecc--cChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchh-HH Q lcl|NC_019705. 253 GP--VKKRLW-IL----------EAGFSTSAIG--VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSG-IE 316 (424) Q Consensus 253 ~~--~ag~~~-~l----------~~g~~~~~l~--~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n-~e 316 (424) |. +..+.+ .| ..|.++..|. .+.-+| +-.++..+.+.++++||.+-|+..+++..+... +| T Consensus 316 Gev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~E 392 (521) T protein:vir:81 316 GKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDI---DDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSE 392 (521) T ss_pred ccccccccccchhhhhcccccCCCcccceeecccCCCCChH---HHHHHHHHHHHHHhCCccccccCCCCcceeccccch Confidence 21 111222 22 1355666654 344444 455667889999999999998544332222111 11 Q ss_pred ----H-HHHHHHHHHHHHHHHHHHHHHHhhccC-----hhhhc----ccchhhhhhhh----hccC-HHHHHHHHHHHHh Q lcl|NC_019705. 317 ----Q-QNLGFLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGL----LRGD-SASRAAFMKAMGE 377 (424) Q Consensus 317 ----~-~~~~~~~~tl~P~~~~ie~~l~~~l~~-----~~~~~----~~~~~fd~~~l----~~~d-~~~~~~~~~~~~~ 377 (424) + ....|+..-=.-+...+.+.|..+|+. +.+.. ...+.|..|.. .... ...|...+..+-. T Consensus 393 ItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dp 472 (521) T protein:vir:81 393 ITRDELEFSKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITP 472 (521) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhh Confidence 1 112222222223444445555555443 33321 12222222222 1111 1223333333221 Q ss_pred --CCCCCHHHHHH-HhCCCCCC--CCCeeeecccccchhhccccCCCc-ccCC Q lcl|NC_019705. 378 --AGLRTINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTNKEPR-NNGA 424 (424) Q Consensus 378 --~g~~t~NE~R~-~~g~~p~~--ggd~~~~~~n~~~~~~~~~~~~~~-~~ga 424 (424) +-.++.+=+|+ .|.+.-.+ .-++..-. .. ..+--++|. +-+. T Consensus 473 yvGky~s~dyi~k~ILr~tDeei~~~~k~I~~----E~-~~~~~~~p~~~~~~ 520 (521) T protein:vir:81 473 YIGKYFSNQTVMRDILKYTDDQMDTEKKQIEE----EA-NDPRFKQTPDEIED 520 (521) T ss_pred hhccccchHHHHHHHhccCHHHHHHHHHHHHH----Hh-hCCCCCCCcccccC Confidence 12445555554 23332110 00000000 00 000000000 0011 No 239 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=39.28 E-value=1 Score=20.52 Aligned_cols=352 Identities=15% Similarity=0.105 Sum_probs=144.9 Q ss_pred CCCCc---ccccCCCCCchH----HHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHh Q lcl|NC_019705. 1 MEEPK---YTIDLRTNNGWW----ARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~---~~~~~~~~~G~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ 73 (424) |+--+ .--.|+++|.-| +.+..+.. |..... +......++..-....-+-+++-..|++.+|+. T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~l------P~~~~~---~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~ 71 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIM------PMRSDF---FSDLRSEGSINWNQNREVFDSTAGDGLETLSSS 71 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhc------cccccc---ccCCCCCcccccccccccccchHHHHHHHHHHH Confidence 22211 011233333222 11111111 111000 000000000000000001234555566666665 Q ss_pred hcc------CceEEEEe-ccCCccce--ecc----chHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC- Q lcl|NC_019705. 74 TAC------LPLDVFET-DQNDNRKK--VDL----SNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA- 139 (424) Q Consensus 74 ia~------~~~~v~~~-~~~~~~~~--~~~----~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~- 139 (424) +-+ -||-=... +.+..... ... ...+...|. +-| .+.-+..+..+++.+|+|.+++..+.+ T Consensus 72 L~~~ltPp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~-~sn----f~~~~~~~~~~L~~~G~a~l~~~~d~~~ 146 (547) T protein:vir:10 72 LHGSLTSPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQ-DSN----FNLEANETYIDLCGYGNAIMVEEEDEDE 146 (547) T ss_pred HHHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHH-hcC----cHHHHHHHHHHHHhHCcEeEEeccCCCC Confidence 543 24321111 11111000 000 011222232 333 344466778999999999999876532 Q ss_pred Cce--eEEEEecCceeEEeecCceE------------------------------------------E--E--EEEe-CC Q lcl|NC_019705. 140 GDV--ISLLPLQSANMDVKLVGKKV------------------------------------------V--Y--RYQR-DS 170 (424) Q Consensus 140 G~~--~~l~~l~~~~v~~~~~~~~~------------------------------------------~--~--~~~~-~~ 170 (424) +.. ...+|+. ++.+..|..+. . + .+.. +. T Consensus 147 ~~~~r~~~~pl~--~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~ 224 (547) T protein:vir:10 147 EGSVVFQSSPIQ--DSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDK 224 (547) T ss_pred CCceeEEEeecc--eEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCC Confidence 122 3344443 33333222110 0 0 0000 00 Q ss_pred c-----------------eEEecHhH--------------EEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019705. 171 E-----------------YAEFSQKE--------------IFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFF 218 (424) Q Consensus 171 ~-----------------~~~~~~~e--------------iih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~ 218 (424) . ...+..++ .+.+|+...++ .||.||...+...+.....+.+...... T Consensus 225 ~~~~~~~~~~~~~~~p~~s~~~e~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~ 304 (547) T protein:vir:10 225 KQNRNAGTVLAPTERPFGKKWILKEGAVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSS 304 (547) T ss_pred CCCccccceeeccccceeEEEEEecCceeeeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 0 00011111 23334433344 8999999999999999999999888888 Q ss_pred hcCCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHH-HHHHHHHHHHHHHHhCC Q lcl|NC_019705. 219 ANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEM-MASRKFQVSELARFFGV 297 (424) Q Consensus 219 ~ng~~~~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~-~e~~~~~~~~Ia~~fgV 297 (424) .-...|...+. +.+...+ + +. ..|++.+....-.++++... .+.+. .+..+.....|-.+|=+ T Consensus 305 ~~~~~pp~~v~-~~g~~~~-----~----~~-----~pgg~~~~~~~~~v~pl~~~-~~~~~~~~~i~~~~~rI~~af~~ 368 (547) T protein:vir:10 305 EKVIDPAIMVT-ERGLISD-----I----DL-----GASGLTVVRDMESMKPFESR-ARFDVSSIQLTDLRSAVRRIYYV 368 (547) T ss_pred HHHhcCceecc-ccccccc-----c----ee-----cCCeeeecCCcccceeeecc-cchHHHHHHHHHHHHHHHHHhhh Confidence 88888876543 2222221 1 11 12445566666677777654 34443 45677778888999877 Q ss_pred CHHHhCCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhcccchhhhhhhhhccCHHHHHHHHHHHHh Q lcl|NC_019705. 298 PPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGE 377 (424) Q Consensus 298 Pp~~l~~~~~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~ 377 (424) ....+- +....+ +.=-.++..-....|.|....+.++|-.-|+.. .+..+.. T Consensus 369 d~~~~~--~~~~~T-AtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r-------------------------~~~il~r 420 (547) T protein:vir:10 369 DQLQMK--DSPAMT-ATEVQVRYELMQRLLGPTLGRLENDFLSPMIQR-------------------------TFNIRFR 420 (547) T ss_pred hhhhcC--CCcccc-HHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH-------------------------HHHHHHh Confidence 653332 222111 111223345566777788887777764332211 0011222 Q ss_pred CCCCCHHHHHHHhCCCCCC------CCCeeeecccccchhhccc------------------cCCCc-------ccCC Q lcl|NC_019705. 378 AGLRTINEMRRTDNLPPLP------GGDVAMRQSQYVPITDLGT------------------NKEPR-------NNGA 424 (424) Q Consensus 378 ~g~~t~NE~R~~~g~~p~~------ggd~~~~~~n~~~~~~~~~------------------~~~~~-------~~ga 424 (424) .|.+ ||+| +|..+-+. -..++..+.+ +-.|. +... T Consensus 421 ~g~l-----------P~~p~~l~~~~~~~~~v~-~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~ 486 (547) T protein:vir:10 421 AGKL-----------GELPSKLLESGKAAMDIV-YTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMV 486 (547) T ss_pred cCCC-----------CCCchhhhccCcceEEEE-eccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHH Confidence 2332 1121 11111000 0001110000 00000 0000 No 240 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=38.65 E-value=1.1 Score=20.45 Aligned_cols=402 Identities=11% Similarity=0.054 Sum_probs=178.0 Q ss_pred CCCCcccccCCCCCchHHHHHhhc--------c--CcccCCccccchhhcc---------ccc-cc-cC---ccc----- Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWF--------V--GGRLVTPNQGSQTGPV---------SAH-GH-LG---DSS----- 51 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~--------~--~~~~~~~~~~~~~~~~---------~~~-~~-~~---~~~----- 51 (424) |. -++|.+-.++|+.+...- . ..+.++|.....+..+ ++. .+ .. +.. T Consensus 1 ~~----~~~~~~~l~~~~~~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~~~~~e 76 (524) T protein:vir:98 1 MN----FLGFGNVLSFFKNFAREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAIQNKEQ 76 (524) T ss_pred CC----CcchhhHHHHhhhhhhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccccchHHH Confidence 11 123444444443332210 0 0111222222111000 000 00 00 100 Q ss_pred -c-cHHHHhccHHHHHHHHHHHHhhccC-----ceEEEEeccC-CccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHH Q lcl|NC_019705. 52 -I-NDERILQISTVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQ 123 (424) Q Consensus 52 -v-s~~~~~~~~~v~~~i~~ia~~ia~~-----~~~v~~~~~~-~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~ 123 (424) | .-+..+.+|.|..||+.|.+.+.-. |+.+--.+-+ .............++|+. -|-...+ +.++.. T Consensus 77 LI~~YR~ma~~pEvd~Av~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~l-l~F~~~~----~~~fR~ 151 (524) T protein:vir:98 77 LINTYRGIMSYPEVENAVSEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLNI-YDFDNMG----ARLFRD 151 (524) T ss_pred HHHHHHHHhhccchhhHHHhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHH-hccchhh----hHHHhh Confidence 1 1255677999999999999987532 3332111100 000000111122333321 1112222 455677 Q ss_pred HHHcCCeEEEEeeCCCCc--eeEEEEecCceeEEee-------cCc-------eEEEEEEe-------------CCceEE Q lcl|NC_019705. 124 LCFYGNAYALVDRNSAGD--VISLLPLQSANMDVKL-------VGK-------KVVYRYQR-------------DSEYAE 174 (424) Q Consensus 124 ~ll~G~a~~~~~r~~~G~--~~~l~~l~~~~v~~~~-------~~~-------~~~~~~~~-------------~~~~~~ 174 (424) |...|..|..++-+.+.. +.+|..|+|.+|+..+ +++ .-+|.|.. .+.... T Consensus 152 WYVDgRi~fhkiid~~~~kGI~ELr~lDPr~i~~vr~~~~~~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ik 231 (524) T protein:vir:98 152 WYVDSRIYFHKIMHKDESKGIRELRQLDPRCMELIRESITETLDGGVKVFRGYREFFVYSAPKAGYTYNGQIYQANQKIK 231 (524) T ss_pred hhhcceeEEEEEEcCCCCcceeeeeeeCCccceeeeeccccccccchhhccceeeeeeeccCCCccccccceecCCCcee Confidence 888899999888655443 8999999999997543 222 11233321 223456 Q ss_pred ecHhHEEEeec--CCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCH-HHHHHHHHHHHHHh Q lcl|NC_019705. 175 FSQKEIFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE-QQRSQVEENFKEIA 251 (424) Q Consensus 175 ~~~~eiih~r~--~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~-~~~~~~~~~~~~~~ 251 (424) ++.+-|+|... .+.++- -+|-+..|.+.+.....++....-+--.-+.-+-|+.++.+.++. .+.+-++....++. T Consensus 232 I~~dAIvy~hSGL~d~~~~-iisyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~k 310 (524) T protein:vir:98 232 IPRSAIVYAHSGLEDCSNN-IIGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLK 310 (524) T ss_pred echhheeeeccCcccCCCC-eeeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcC Confidence 88888988752 233321 246788888777777777766655544555556677777776554 34444565555543 Q ss_pred --------CCc--ccCccee-c----------CCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-Cc Q lcl|NC_019705. 252 --------GGP--VKKRLWI-L----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-ST 309 (424) Q Consensus 252 --------~~~--~ag~~~~-l----------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~~-~~ 309 (424) .|. +..+.+. | ..|.++..|.-. ..+.-++-.++..+.+.++++||.+-|...++ -+ T Consensus 311 NklvYDa~TGevrddrk~msMlEDyWLpRReGgrgTEItTLpgg-qnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~ 389 (524) T protein:vir:98 311 NRVVYDARTGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGG-QNFSDMDDIKWFNRKLYEALRVPLSRMPRDDGGMQ 389 (524) T ss_pred ceeEeeccCceeeccccccchhhhhcccccCCCCccceeecccc-CCcChHHHHHHHHHHHHHHhCCCceeccCCCCccc Confidence 121 1222222 2 135566665432 22223445667788999999999988864322 12 Q ss_pred ccchh----HHHHHHHHHHHHHHHHHHHHHHHHHhhccC-----hhhhc----ccchhhhhhhh----hccC-HHHHHHH Q lcl|NC_019705. 310 SWGSG----IEQQNLGFLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGL----LRGD-SASRAAF 371 (424) Q Consensus 310 ~~~~n----~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~-----~~~~~----~~~~~fd~~~l----~~~d-~~~~~~~ 371 (424) ...++ =|-....|+..-=.-+...+.+.|..+|+. +.+.. ...+.|..|.. .... ...|... T Consensus 390 ~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~ 469 (524) T protein:vir:98 390 IGGGGEITRDELKFSKFIRTLQIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNL 469 (524) T ss_pred cccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHH Confidence 11111 111112222222223444445555555443 33321 12222222222 1111 1223333 Q ss_pred HHHHHh--CCCCCHHHHHH-HhCCCCCC--C--------CCeeeecccccchhhccccCCCcccC Q lcl|NC_019705. 372 MKAMGE--AGLRTINEMRR-TDNLPPLP--G--------GDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 372 ~~~~~~--~g~~t~NE~R~-~~g~~p~~--g--------gd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) +..+-. +-+++.+=+|+ .|.+.-.+ . .+....+ +.+++.++= T Consensus 470 l~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~I~~E~k~~~~~----------~p~~e~~~f 524 (524) T protein:vir:98 470 MSQVEGVVGKYVSHKYIMKEILRMSDEDIDEQAKLIEEESKEERFK----------NPEAEEENF 524 (524) T ss_pred HHHhccccccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCc----------CCccccccC Confidence 333221 22555555554 23332110 0 0000000 000000000 No 241 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=31.98 E-value=1.5 Score=19.69 Aligned_cols=392 Identities=13% Similarity=0.066 Sum_probs=140.4 Q ss_pred CCCCcccccCCCCCchHHHHHhhccCcccCCccccc----hhhcc--ccccccCcccccHHHHhccHHHHHHHHHHHHhh Q lcl|NC_019705. 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGS----QTGPV--SAHGHLGDSSINDERILQISTVWRCVSLISTLT 74 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~----~~~~~--~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~i 74 (424) |-|-+--..-++=..-|+++++. +.. -...|. ...|. ...+..+... ....+ +++-..|++.+|+.+ T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~---R~~-~e~~w~e~~~~~lP~~~~~~~~~~~~~--~~~~~-dst~~~a~~~Laa~l 73 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKND---RAP-YETRAQNCAQYTIPSLFPKDSDNASTD--YQTPW-QAVGARGLNNLASKL 73 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHH---hhH-HHHHHHHHHHHhcccccCCCCCccccc--ccccc-cccHHHHHHHHHHHH Confidence 65532110000101112222110 000 000000 00000 0000111100 01122 334445666666654 Q ss_pred cc-C----ceEEEEeccCCccc---------ee-----ccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe Q lcl|NC_019705. 75 AC-L----PLDVFETDQNDNRK---------KV-----DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD 135 (424) Q Consensus 75 a~-~----~~~v~~~~~~~~~~---------~~-----~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~ 135 (424) .+ + ||-=....+..-.+ ++ ...+.+...|. +-| .+.-+..+..+++.+||+..++. T Consensus 74 ~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~L~~~G~a~ly~~ 148 (536) T protein:vir:21 74 MLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNS----YRVTLFEALKQLVVAGNVLLYLP 148 (536) T ss_pred HHhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHH-hcC----cHHHHHHHHHHHHhHCcEeEEEe Confidence 43 1 33211111101000 00 00112222332 333 34555677889999999999987 Q ss_pred eCCCCce--eEEEEecCceeEEeecCceE-----------------------------------EEE--EEe-CC-ce-- Q lcl|NC_019705. 136 RNSAGDV--ISLLPLQSANMDVKLVGKKV-----------------------------------VYR--YQR-DS-EY-- 172 (424) Q Consensus 136 r~~~G~~--~~l~~l~~~~v~~~~~~~~~-----------------------------------~~~--~~~-~~-~~-- 172 (424) .+..+.+ ...|||....|.....+... +|. +.. ++ .. T Consensus 149 e~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~ 228 (536) T protein:vir:21 149 EPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLR 228 (536) T ss_pred eCCCCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCCcEEE Confidence 6554333 45677754444332222100 010 001 10 00 Q ss_pred ------EEecH-------hH--EEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCC Q lcl|NC_019705. 173 ------AEFSQ-------KE--IFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT 236 (424) Q Consensus 173 ------~~~~~-------~e--iih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~ 236 (424) ..+.. ++ .+.+|+...++ .||.||...++..+.....+.+.......-...+...+. +.+... T Consensus 229 ~~e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~-p~g~~~ 307 (536) T protein:vir:21 229 YEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQ 307 (536) T ss_pred EeccCCeeeccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccC-cccccc Confidence 11111 11 24445443344 899999999999999998888888776666666554443 333333 Q ss_pred HHHHHHHHHHHHHHhCCcccCcce-ecCCCceeeecccChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchh Q lcl|NC_019705. 237 EQQRSQVEENFKEIAGGPVKKRLW-ILEAGFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSG 314 (424) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~ag~~~-~l~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n 314 (424) +... ..+.+ |.++ ...+.....++... .+++ ..+..+.....|..+|-+. .+...+....+ +. T Consensus 308 ~~~~----------~~~~~-g~~v~g~~~~v~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~--~l~~~~~~r~T-At 372 (536) T protein:vir:21 308 PRRL----------TKAQT-GDFVTGRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLN--SAVQRTGERVT-AE 372 (536) T ss_pred hhhh----------ccCCC-cceecCCcccceeeecccc-ccchHHHHHHHHHHHHHHHHHhhh--hcccCCCCCcc-HH Confidence 3211 11111 1111 12233334444433 3333 3345666777788888543 22222221111 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhh-------------ccChhhhcccchhh--hhhhhhccCH-HHHHHHHHHHHhC Q lcl|NC_019705. 315 IEQQNLGFLQYTLQPYISRWENSIQRW-------------LIPAKDVGRIHAEH--NLDGLLRGDS-ASRAAFMKAMGEA 378 (424) Q Consensus 315 ~e~~~~~~~~~tl~P~~~~ie~~l~~~-------------l~~~~~~~~~~~~f--d~~~l~~~d~-~~~~~~~~~~~~~ 378 (424) =-..+..-....|.|....+.++|-.- ++++.........+ -+..+.+.-. +....++..+.+- T Consensus 373 EV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~ 452 (536) T protein:vir:21 373 EIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAAL 452 (536) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhh Confidence 112222344455555555554444322 33322111111111 1222222111 1111111211111 Q ss_pred C------CCCHHHHH----HHhCCCCCCCCCeeeeccc-ccchh--------------hccc------cCCCcc------ Q lcl|NC_019705. 379 G------LRTINEMR----RTDNLPPLPGGDVAMRQSQ-YVPIT--------------DLGT------NKEPRN------ 421 (424) Q Consensus 379 g------~~t~NE~R----~~~g~~p~~ggd~~~~~~n-~~~~~--------------~~~~------~~~~~~------ 421 (424) + .+..+++- +.+|.+|.. ++.+.. ...+- .++. ...++. T Consensus 453 ~Pe~ld~~id~d~~~~~~a~~~Gv~p~~----~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 528 (536) T protein:vir:21 453 APMRDDPDINLAMIKLRIANAIGIDTSG----ILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAAD 528 (536) T ss_pred chhhhcccCCHHHHHHHHHHHcCCChhh----hcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHhhhh Confidence 1 12222222 234553310 000000 00000 0000 000111 Q ss_pred cCC Q lcl|NC_019705. 422 NGA 424 (424) Q Consensus 422 ~ga 424 (424) +++ T Consensus 529 ~~g 531 (536) T protein:vir:21 529 SVG 531 (536) T ss_pred ccc Confidence 111 No 242 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=30.97 E-value=1.5 Score=19.56 Aligned_cols=388 Identities=11% Similarity=-0.004 Sum_probs=150.9 Q ss_pred CCCCcccccCCCCC-------chHHHHHhhccCcccCCccccchhhccccccccCcccc-cHHHHhccH----HHHHHHH Q lcl|NC_019705. 1 MEEPKYTIDLRTNN-------GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSI-NDERILQIS----TVWRCVS 68 (424) Q Consensus 1 ~~~~~~~~~~~~~~-------G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v-s~~~~~~~~----~v~~~i~ 68 (424) |.. +.|.. --|..++..+.|..............+... ....+.- ..+.+++-+ ++...++ T Consensus 1 m~~------V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e-~~~~e~~~~Y~~rl~rA~~~n~~~~t~~ 73 (501) T protein:vir:95 1 MPN------VSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAE-DQSKENKARYEAYLKRAVFYNVARRTLF 73 (501) T ss_pred CCC------CCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCC-CCcccchHHHHHHhhccccCchHHHHHH Confidence 221 33332 335555555544321100000000000000 0011000 012233333 3344444 Q ss_pred HHHHhhccCceEEEEeccCCccceeccchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCc------- Q lcl|NC_019705. 69 LISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGD------- 141 (424) Q Consensus 69 ~ia~~ia~~~~~v~~~~~~~~~~~~~~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~------- 141 (424) .+...+-+-|..+ .....+..++..---...+-.+|.+.++...+.+|-+++.+.....+. T Consensus 74 ~l~G~vf~k~p~~------------~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a 141 (501) T protein:vir:95 74 GLVGQVFMRDPVV------------KVPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIA 141 (501) T ss_pred HHhhhhhcCCcce------------eCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHH Confidence 4444443333322 112335555543333566889999999999999999999997543211 Q ss_pred --------eeEEEEecCceeE----------------------EeecCc------------------eEEEEEE-eCCce Q lcl|NC_019705. 142 --------VISLLPLQSANMD----------------------VKLVGK------------------KVVYRYQ-RDSEY 172 (424) Q Consensus 142 --------~~~l~~l~~~~v~----------------------~~~~~~------------------~~~~~~~-~~~~~ 172 (424) | .+..+.|..|. ...++. ...+++. ....+ T Consensus 142 ~~~~~~~rP-y~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~ 220 (501) T protein:vir:95 142 DLEAGRIRP-TLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPT 220 (501) T ss_pred HHHhccCCc-EEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCc Confidence 1 13333332220 011111 1111111 10000 Q ss_pred ----EEec------------------HhHEEEeecC---CCCCcccCchHHHHHHH-HHHHHHHHHHHHHHHhcCCCCce Q lcl|NC_019705. 173 ----AEFS------------------QKEIFHLKGF---GFTGLVGLSPIAFACKS-AGVAVAMEDQQRDFFANGAKSPQ 226 (424) Q Consensus 173 ----~~~~------------------~~eiih~r~~---~~~~~~G~s~i~~~~~~-i~~~~~~~~~~~~~~~ng~~~~~ 226 (424) ..+. +=..|=|-.. +.+...+.+|+..++.. +...+....+ ...+...+.|-. T Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~-~~~l~~~~~P~l 299 (501) T protein:vir:95 221 KADGSKIPKGNYQQYVVYKPTDAQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASLNMAHYRNSADY-EESCYIVGQPTP 299 (501) T ss_pred ccCcceecCCcccccceeeeeccCCCcCCeeeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhHH-HHHHHHccccee Confidence 0000 0001111111 12234567777766533 2333333333 333445666776 Q ss_pred eEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC Q lcl|NC_019705. 227 ILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 227 vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~ 306 (424) +++-.. ++..+..... .-..| .+ ..+.++.|.++.=+..++.-+. .+..+...+++.. .| ..++.... T Consensus 300 ~i~G~~----~~~~~~~~~~-~i~~G-~~--~~~~lP~~~~~~~ie~~~~~i~-~~~l~~l~~~m~~-~G--a~ll~~~~ 367 (501) T protein:vir:95 300 VLIGLT----EEWVTNVLKG-SVNFG-SR--GGIPLPVGADAKLLQASENTML-KEAMDTKERQMVA-LG--AKLVEQKE 367 (501) T ss_pred eeeCCc----ccccccCCCC-ceeec-cc--ccccCCCCCceeEEecChhhHH-HHHHHHHHHHHHH-HH--HhhccCCc Confidence 665221 1111100000 01122 22 3567776655544443333332 2223333333322 23 22332211 Q ss_pred CCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc------ChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCC Q lcl|NC_019705. 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLI------PAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGL 380 (424) Q Consensus 307 ~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~------~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~ 380 (424) ++.+ .........--...|.-++..+|+.++.-|- ...+ ....++.+.+=..........+++.++++.|. T Consensus 368 -~~~T-a~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~w~g~~~-~~~~v~i~~df~~~~~~~~~~~al~~~~~~G~ 444 (501) T protein:vir:95 368 -VQRT-ATEAELEAASEGSTLSSATKNVSAAFEWALKWAARWVGQAD-SGVKFELNTDFDIARMTPDERRSLVEEWQKGA 444 (501) T ss_pred -cchh-HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCCC-CceEEEEecccccccCCHHHHHHHHHHHhCCC Confidence 1111 1112222333456677788888877775432 1111 11223333322222223445677778899999 Q ss_pred CCHHHHHHHhCCCCCCCCC-----eee--ecccccchhhccccCCCcccCC Q lcl|NC_019705. 381 RTINEMRRTDNLPPLPGGD-----VAM--RQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 381 ~t~NE~R~~~g~~p~~ggd-----~~~--~~~n~~~~~~~~~~~~~~~~ga 424 (424) ++..+.++.+-.--++..| +.. ..-+..+.+........+++|. T Consensus 445 is~~t~~~~L~~~~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~ 495 (501) T protein:vir:95 445 ITFEEMRTGLRKAGVATEDDSKAKEKIAKDTAEAMALATPANVPGDGSGGD 495 (501) T ss_pred CcHHHHHHHHHhCCCCChhHHHHHHHHHhhhcCcccccccCCCCCCCcccc Confidence 9999998876443332210 000 0001111111112222222111 No 243 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=30.91 E-value=1.5 Score=19.56 Aligned_cols=367 Identities=11% Similarity=0.046 Sum_probs=138.0 Q ss_pred ccCCCCCchHHHHHhhccCcc-----cCCccccchhhccccccccCcccccHHHHhccHHHHHHHHHHHHhhcc------ Q lcl|NC_019705. 8 IDLRTNNGWWARLQSWFVGGR-----LVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTAC------ 76 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~~i~~ia~~ia~------ 76 (424) |+++.+---++.-++.|-... -..|.... .......++.. ....+ +++--.|++.+|+.+.+ T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~----~~~~~~~~~~~--~~~~~-dstg~~a~~~LAa~l~~~ltpp~ 73 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYLID----DDISSRPNHKS--LTVPW-QSVGAKCCVTLAAKLMLAVLPPQ 73 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcccC----CCCCCCccccc--ccccc-cchHHHHHHHHHHHHHHhhcCCC Confidence 666644333222222221110 00111100 00000001111 01122 33444566666665543 Q ss_pred CceEEEEeccCCccceec-------------cchHHHHHhhcCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCcee Q lcl|NC_019705. 77 LPLDVFETDQNDNRKKVD-------------LSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVI 143 (424) Q Consensus 77 ~~~~v~~~~~~~~~~~~~-------------~~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll~G~a~~~~~r~~~G~~~ 143 (424) -||-=....+....+... ..+.+...|. +-| .+.-+..+..+++.+|||.+++..+. . T Consensus 74 ~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~-~sn----f~~~~~~~~~~L~~~G~a~ly~~~~~----~ 144 (522) T protein:vir:10 74 TSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIA-ASN----DRVAVHQALKHLIVGGNALIFMGKDG----L 144 (522) T ss_pred CccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHH-hcC----cHHHHHHHHHHHHhHCceeEEEcCCC----c Confidence 344222111110000000 0011222232 333 45666788889999999998875532 3 Q ss_pred EEEEecCceeEEeecCce-------------------------------------EEEE--EEe-C-CceEEe--cHhH- Q lcl|NC_019705. 144 SLLPLQSANMDVKLVGKK-------------------------------------VVYR--YQR-D-SEYAEF--SQKE- 179 (424) Q Consensus 144 ~l~~l~~~~v~~~~~~~~-------------------------------------~~~~--~~~-~-~~~~~~--~~~e- 179 (424) ..|||....|.....+.. ..|. +.. + +....+ ..+. T Consensus 145 ~~~pl~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~ 224 (522) T protein:vir:10 145 KTFPLTRYVINRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSSGRWVWHQEAFDKI 224 (522) T ss_pred eEEEcceEEEeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccCCceEEEEccCCcc Confidence 455664433332221210 0111 000 0 000000 0111 Q ss_pred --------------EEEeecCCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeEEcCCCCCCHHHHHHHH Q lcl|NC_019705. 180 --------------IFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVE 244 (424) Q Consensus 180 --------------iih~r~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~vl~~~~~~~~~~~~~~~~ 244 (424) .+..|+...++ .||.||...++..+.....+.+...........|..++.. .+...+.. T Consensus 225 ~~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~-~~~~~~~~----- 298 (522) T protein:vir:10 225 IPDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSP-SSTTKPAT----- 298 (522) T ss_pred ccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeecc-cccccccc----- Confidence 12223333234 8999999999999999999999999988888888765542 23223221 Q ss_pred HHHHHHhCCcccCcceec--CCCceeeecccChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCcccchhHHHHHHH Q lcl|NC_019705. 245 ENFKEIAGGPVKKRLWIL--EAGFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG 321 (424) Q Consensus 245 ~~~~~~~~~~~ag~~~~l--~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~l~~~~~~~~~~~n~e~~~~~ 321 (424) ...+.+ +.++. .+++...+++. ..+++ ..+..+..+..|..+|= +....++..-++.=-..+.. T Consensus 299 -----l~~~~~--~~~v~g~~~~v~~~~~~~-~~d~~~~~~~i~~~~~ri~~aFl-----~~~~~d~~rvTAtEV~~r~~ 365 (522) T protein:vir:10 299 -----IAKAGN--GAIVQGRPEDVAVIQVGK-TADFSTAANMATAIEKRLLEAFL-----VMNVRNAERVTAEEVRLTQL 365 (522) T ss_pred -----ccCCCC--cceecCCCccceeecccc-cccchHHHHHHHHHHHHHHHHHh-----hccCCCCCCCCHHHHHHHHH Confidence 111111 11121 22333344332 33444 23455666777777773 22222222221121123334 Q ss_pred HHHHHHHHHHHHHHHHHHhhc-------------cChhhhcccchhhhhhhhhccCHHHHHHHHHHHHhCCCCCHHHHHH Q lcl|NC_019705. 322 FLQYTLQPYISRWENSIQRWL-------------IPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRR 388 (424) Q Consensus 322 ~~~~tl~P~~~~ie~~l~~~l-------------~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~ 388 (424) -....|.|....+.++|-.-| +++...... ....+...+.-.|+..+..+.+. ...+-. T Consensus 366 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~----~~~~v~~is~Laraq~~~~l~~~----~~~i~~ 437 (522) T protein:vir:10 366 ELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQIPKLPKDIV----RPTIVAGVNALGRGQDRESLTAF----VGTIAQ 437 (522) T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcccc----ccccccchhHHHHHHHHHHHHHH----HHHHHH Confidence 555666666666665553322 211110000 00011111122222222222110 011111 Q ss_pred HhCCCCCCCCCeeeecccccc-hhhccccCCCcccCC Q lcl|NC_019705. 389 TDNLPPLPGGDVAMRQSQYVP-ITDLGTNKEPRNNGA 424 (424) Q Consensus 389 ~~g~~p~~ggd~~~~~~n~~~-~~~~~~~~~~~~~ga 424 (424) .+| | | ......|.-. ++...+.- +-+-.. T Consensus 438 ~~~--p-~---~~~~~id~d~~~~~~a~~~-Gvp~~~ 467 (522) T protein:vir:10 438 TLG--P-E---ALMQYLNPLEAIKRLAAAQ-GIDVLN 467 (522) T ss_pred hhC--c-h---hhhhcCCHHHHHHHHHHHh-CCChhh Confidence 111 1 0 0000111111 11111100 000001 No 244 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=26.35 E-value=2 Score=18.99 Aligned_cols=378 Identities=11% Similarity=0.035 Sum_probs=143.5 Q ss_pred CCCCccc-----------ccCCCCCchH----HHHHhhccCcccCCccccchhhccccccccCcccccHHHHhccHHHHH Q lcl|NC_019705. 1 MEEPKYT-----------IDLRTNNGWW----ARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWR 65 (424) Q Consensus 1 ~~~~~~~-----------~~~~~~~G~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~v~~ 65 (424) |-|-|-+ =.|+++|.-| +.+..+.. |. .+...+..++.. ....+ +++-.. T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~l------P~------~~~~~~~~~~~~--~~~~~-dst~~~ 65 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTI------PS------LFPKDSDNSSTD--YTTPW-QAVGAR 65 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhc------cc------cCCCCCCccccc--ccccc-cchHHH Confidence 5553310 0122222111 11111110 10 000000001110 01122 344456 Q ss_pred HHHHHHHhhccC-----ceEEEEeccCCcc---------ceecc-----chHHHHHhhcCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019705. 66 CVSLISTLTACL-----PLDVFETDQNDNR---------KKVDL-----SNPLARLLRYSPNQYMTAQEFREAMTMQLCF 126 (424) Q Consensus 66 ~i~~ia~~ia~~-----~~~v~~~~~~~~~---------~~~~~-----~~~l~~lL~~~pn~~~s~~~f~~~~~~~~ll 126 (424) |++.+|+.+.+. ||-=....+.... .++.. .+.+...|. +-| .+.-+..+..+++. T Consensus 66 a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~-~sn----f~~~~~~~~~~L~~ 140 (543) T protein:vir:88 66 GLNNLSAKVMLALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYME-ANS----YRVTLFELIRQLAL 140 (543) T ss_pred HHHHHHHHHHHhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHH-hcC----cHHHHHHHHHHHHh Confidence 777777665431 3311111110000 00000 011222222 333 44556677888999 Q ss_pred cCCeEEEEeeCCCC----ceeEEEEecCceeEEeecCc----------------------------------eEEEE--E Q lcl|NC_019705. 127 YGNAYALVDRNSAG----DVISLLPLQSANMDVKLVGK----------------------------------KVVYR--Y 166 (424) Q Consensus 127 ~G~a~~~~~r~~~G----~~~~l~~l~~~~v~~~~~~~----------------------------------~~~~~--~ 166 (424) +||+.+++..+... .+...|||....|.....+. ...|. + T Consensus 141 ~G~a~ly~~~~~~~~~~~~~~~~~pl~~y~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~V~ 220 (543) T protein:vir:88 141 AGTALIYLPPPDASSNSYNPMKLYTLHNHVVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTHIY 220 (543) T ss_pred hCceeeeeccCccccceecceEEeEcceEEEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEEEE Confidence 99999988655421 23556777544433222221 01111 1 Q ss_pred Ee-CC-c--------eEEec-------HhH--EEEeecCCCC-CcccCchHHHHHHHHHHHHHHHHHHHHHHhcCCCCce Q lcl|NC_019705. 167 QR-DS-E--------YAEFS-------QKE--IFHLKGFGFT-GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQ 226 (424) Q Consensus 167 ~~-~~-~--------~~~~~-------~~e--iih~r~~~~~-~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ 226 (424) .. ++ . +..+. .++ .+..|+...+ ..||.||...+...+.....+.+...........|.. T Consensus 221 pr~~~~~~~~~~~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~ 300 (543) T protein:vir:88 221 IDDESGDFLSYQEIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVG 300 (543) T ss_pred eecCCCcccccccccCeeeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 11 10 0 01111 111 2344444333 4899999999999999999999999998888888886 Q ss_pred eEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecccChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC Q lcl|NC_019705. 227 ILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 227 vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l~~~~ 306 (424) ++..+ ....+... .-++.+. -+.-..++....++...++=.-..+..+.....|..+|-+.. +...+ T Consensus 301 ~v~~~-g~~~~~~~---------~~~~~g~-~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~ 367 (543) T protein:vir:88 301 LVNPN-GITQVRRL---------VKAQTGD-FVAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNS--AVQRS 367 (543) T ss_pred eeccc-cccchhhc---------ccCCCce-eecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhh--hccCC Confidence 55322 33333211 1111111 011223444455555433222234566677778888885542 22222 Q ss_pred CCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccChh-h---hcccchhhhhhhhhccC------HHHHHHHHHHHH Q lcl|NC_019705. 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-D---VGRIHAEHNLDGLLRGD------SASRAAFMKAMG 376 (424) Q Consensus 307 ~~~~~~~n~e~~~~~~~~~tl~P~~~~ie~~l~~~l~~~~-~---~~~~~~~fd~~~l~~~d------~~~~~~~~~~~~ 376 (424) .. ..++.=-..+..-....|.|....+.++|-.-|+... . +.+..-... ++++..+ ...|..-...+. T Consensus 368 ~~-r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p-~~~v~~~~vs~l~~l~r~~~~~~l~ 445 (543) T protein:vir:88 368 GE-RVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLP-QEAVEPTVTTGAEALGRGQDLDKLT 445 (543) T ss_pred CC-cccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCc-hhceeeeEEecHHHHHHHHHHHHHH Confidence 22 2211111233355666777777777776643332110 0 000000000 0000111 011111111110 Q ss_pred hCCCCCHHHHHHHhCCCCCCCCCeeeecccccchhhccccCCCcccCC Q lcl|NC_019705. 377 EAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 377 ~~g~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~ga 424 (424) + ..+.+ ..+ .+ |+-.. ..|+-.+-+.-...-+-+-.. T Consensus 446 ~----~~~~v-~~~--~~-p~vld---~id~d~~~~~~a~~~Gv~~~~ 482 (543) T protein:vir:88 446 Q----FLNAV-ATV--SQ-LNGDP---DLNVNNIKLRLANAIGIDTAG 482 (543) T ss_pred H----HHHHH-Hhc--cc-hhhhc---cCCHHHHHHHHHHHhCCChhh Confidence 0 00000 011 11 11000 001111000000000111111 Done!