Query lcl|NC_019456.1_cdsid_YP_007002969.1 [gene=F367_gp17] [protein=putative portal protein] [protein_id=YP_007002969.1] [location=4058..5365] Match_columns 435 No_of_seqs 133 out of 998 Neff 9.9 Searched_HMMs 1612 Date Thu Nov 7 17:13:08 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_6 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_6_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:105002 Length: 432 100.0 1.9E-96 1.2E-99 545.4 46.3 414 1-422 1-432 (432) 2 protein:vir:102855 Length: 432 100.0 1.9E-96 1.2E-99 545.4 46.3 414 1-422 1-432 (432) 3 protein:vir:107605 Length: 432 100.0 1.9E-96 1.2E-99 545.4 46.3 414 1-422 1-432 (432) 4 protein:vir:102080 Length: 429 100.0 7.9E-96 4.9E-99 542.0 45.6 414 1-433 1-429 (429) 5 protein:vir:1380 Length: 422 # 100.0 7.1E-95 4.4E-98 536.7 43.8 404 1-424 1-422 (422) 6 protein:vir:81072 Length: 432 100.0 1.5E-93 9.3E-97 529.5 45.0 412 1-429 7-432 (432) 7 protein:vir:10362 Length: 432 100.0 1.8E-93 1.1E-96 529.1 45.0 412 1-429 7-432 (432) 8 protein:vir:81152 Length: 411 100.0 1.7E-93 1.1E-96 529.2 44.1 394 1-421 1-411 (411) 9 protein:vir:100249 Length: 431 100.0 1.3E-93 8.1E-97 529.8 43.2 399 1-414 1-431 (431) 10 protein:vir:97060 Length: 432 100.0 2.8E-93 1.7E-96 528.0 45.0 412 1-429 7-432 (432) 11 protein:vir:96980 Length: 409 100.0 5.3E-93 3.3E-96 526.5 43.6 404 1-426 4-409 (409) 12 protein:vir:6240 Length: 457 # 100.0 2.3E-92 1.4E-95 522.9 46.2 432 1-435 1-457 (457) 13 protein:vir:4454 Length: 414 # 100.0 1.8E-92 1.1E-95 523.5 44.3 402 1-429 1-414 (414) 14 protein:vir:1266 Length: 416 # 100.0 2E-92 1.3E-95 523.3 44.2 408 2-425 1-416 (416) 15 protein:vir:2683 Length: 412 # 100.0 1.9E-92 1.2E-95 523.4 43.4 408 1-426 1-412 (412) 16 protein:vir:1326 Length: 457 # 100.0 5.8E-92 3.6E-95 520.8 45.7 430 1-435 1-457 (457) 17 protein:vir:93610 Length: 454 100.0 5.2E-92 3.2E-95 521.0 44.7 426 3-435 1-446 (454) 18 protein:vir:4337 Length: 434 # 100.0 6.1E-92 3.8E-95 520.6 44.8 414 1-431 1-434 (434) 19 protein:vir:4509 Length: 424 # 100.0 3.3E-92 2.1E-95 522.1 43.3 398 1-423 16-424 (424) 20 protein:vir:1884 Length: 424 # 100.0 5.8E-92 3.6E-95 520.8 44.4 397 1-419 14-424 (424) 21 protein:vir:189 Length: 424 # 100.0 7.4E-92 4.6E-95 520.2 43.9 397 1-407 14-424 (424) 22 protein:vir:93943 Length: 409 100.0 8.6E-92 5.3E-95 519.8 43.8 404 1-426 4-409 (409) 23 protein:vir:8418 Length: 409 # 100.0 1.9E-91 1.2E-94 518.0 44.1 399 1-435 1-408 (409) 24 protein:vir:4598 Length: 416 # 100.0 1.8E-91 1.1E-94 518.0 43.4 405 1-423 1-416 (416) 25 protein:vir:81095 Length: 416 100.0 1.8E-91 1.1E-94 518.0 43.4 405 1-423 1-416 (416) 26 protein:vir:94426 Length: 409 100.0 2.1E-91 1.3E-94 517.7 43.4 404 1-426 4-409 (409) 27 protein:vir:105064 Length: 421 100.0 3.2E-91 2E-94 516.7 43.5 405 1-428 1-421 (421) 28 protein:vir:98396 Length: 441 100.0 4E-91 2.5E-94 516.2 43.6 408 1-423 23-441 (441) 29 protein:vir:102118 Length: 409 100.0 4.2E-91 2.6E-94 516.0 43.4 394 1-421 1-409 (409) 30 protein:vir:79984 Length: 441 100.0 7.7E-91 4.8E-94 514.6 43.8 411 1-423 11-441 (441) 31 protein:vir:9408 Length: 441 # 100.0 7.7E-91 4.8E-94 514.6 43.8 411 1-423 11-441 (441) 32 protein:vir:100150 Length: 437 100.0 2.9E-90 1.8E-93 511.5 45.8 414 1-432 1-437 (437) 33 protein:vir:101648 Length: 518 100.0 3.1E-90 1.9E-93 511.3 45.8 425 1-435 1-445 (518) 34 protein:vir:7853 Length: 518 # 100.0 2.4E-90 1.5E-93 511.9 44.1 425 1-435 1-445 (518) 35 protein:vir:483 Length: 413 # 100.0 5.8E-90 3.6E-93 509.8 43.8 401 1-429 1-413 (413) 36 protein:vir:5737 Length: 419 # 100.0 6.8E-90 4.2E-93 509.4 43.0 404 1-427 1-419 (419) 37 protein:vir:1431 Length: 419 # 100.0 1.6E-88 9.7E-92 502.0 42.8 404 2-430 1-419 (419) 38 protein:vir:9702 Length: 406 # 100.0 2.6E-88 1.6E-91 500.8 43.8 400 1-428 1-406 (406) 39 protein:vir:3868 Length: 417 # 100.0 2.8E-88 1.7E-91 500.6 43.6 407 1-435 1-417 (417) 40 protein:vir:80333 Length: 419 100.0 3E-87 1.8E-90 495.0 41.2 401 1-435 1-416 (419) 41 protein:vir:81218 Length: 423 100.0 4.2E-86 2.6E-89 488.6 42.5 396 1-428 1-423 (423) 42 protein:vir:101647 Length: 460 100.0 2.6E-85 1.6E-88 484.3 43.4 400 3-424 1-460 (460) 43 protein:vir:8317 Length: 409 # 100.0 4.4E-84 2.7E-87 477.6 39.6 373 1-391 1-409 (409) 44 protein:vir:9359 Length: 348 # 100.0 4E-83 2.5E-86 472.3 39.5 346 64-426 1-348 (348) 45 protein:vir:960 Length: 413 # 100.0 5.9E-83 3.7E-86 471.4 39.4 392 1-407 1-413 (413) 46 protein:vir:95378 Length: 406 100.0 3.6E-82 2.2E-85 467.1 42.7 393 1-429 1-406 (406) 47 protein:vir:80134 Length: 403 100.0 3.9E-82 2.4E-85 466.9 40.8 390 1-429 1-403 (403) 48 protein:vir:94666 Length: 723 100.0 2.1E-81 1.3E-84 462.9 43.0 408 1-435 1-445 (723) 49 protein:vir:3843 Length: 397 # 100.0 1.7E-81 1.1E-84 463.4 41.5 389 1-433 1-397 (397) 50 protein:vir:104259 Length: 403 100.0 1.6E-80 9.9E-84 458.1 41.1 383 1-433 1-403 (403) 51 protein:vir:102727 Length: 945 100.0 1.4E-79 8.7E-83 452.9 44.4 422 1-435 62-539 (945) 52 protein:vir:8100 Length: 466 # 100.0 8.9E-80 5.5E-83 454.0 42.1 409 1-424 1-466 (466) 53 protein:vir:100187 Length: 385 100.0 7.1E-79 4.4E-82 449.0 40.0 375 1-424 1-385 (385) 54 protein:vir:4089 Length: 395 # 100.0 1.2E-78 7.7E-82 447.7 39.0 383 1-428 1-395 (395) 55 protein:vir:6210 Length: 394 # 100.0 2.5E-78 1.5E-81 446.1 39.3 382 1-424 1-394 (394) 56 protein:vir:95965 Length: 385 100.0 5.3E-78 3.3E-81 444.2 37.5 372 1-424 1-385 (385) 57 protein:vir:4854 Length: 386 # 100.0 4.6E-77 2.9E-80 439.1 39.7 379 1-425 1-386 (386) 58 protein:vir:9507 Length: 395 # 100.0 9.1E-77 5.6E-80 437.5 39.6 383 1-427 1-395 (395) 59 protein:vir:100650 Length: 395 100.0 9.1E-77 5.6E-80 437.5 39.6 383 1-427 1-395 (395) 60 protein:vir:101289 Length: 395 100.0 9.1E-77 5.6E-80 437.5 39.6 383 1-427 1-395 (395) 61 protein:vir:100882 Length: 383 100.0 1.3E-76 7.8E-80 436.7 39.8 373 1-422 1-383 (383) 62 protein:vir:4952 Length: 386 # 100.0 5.1E-76 3.2E-79 433.3 40.2 379 1-425 1-386 (386) 63 protein:vir:80796 Length: 574 100.0 1.3E-74 8.2E-78 425.6 42.6 427 1-435 27-524 (574) 64 protein:vir:94002 Length: 378 100.0 6.4E-76 4E-79 432.8 35.3 357 1-424 1-378 (378) 65 protein:vir:80644 Length: 551 100.0 1.5E-74 9E-78 425.4 42.6 427 1-435 5-534 (551) 66 protein:vir:93867 Length: 378 100.0 9.4E-76 5.8E-79 431.9 35.8 356 1-424 1-378 (378) 67 protein:vir:1661 Length: 378 # 100.0 1.9E-75 1.2E-78 430.2 36.4 357 1-424 1-378 (378) 68 protein:vir:4828 Length: 382 # 100.0 7E-75 4.3E-78 427.1 39.2 373 1-425 1-382 (382) 69 protein:vir:9641 Length: 395 # 100.0 1.7E-75 1.1E-78 430.5 35.9 378 1-427 1-395 (395) 70 protein:vir:7407 Length: 392 # 100.0 6.7E-75 4.2E-78 427.2 39.0 380 1-414 3-392 (392) 71 protein:vir:4995 Length: 384 # 100.0 4.3E-75 2.7E-78 428.3 35.4 372 1-385 1-384 (384) 72 protein:vir:1023 Length: 392 # 100.0 2.4E-74 1.5E-77 424.2 39.2 378 1-414 3-392 (392) 73 protein:vir:3989 Length: 392 # 100.0 2.4E-74 1.5E-77 424.2 39.2 378 1-414 3-392 (392) 74 protein:vir:100691 Length: 535 100.0 1.2E-73 7.7E-77 420.3 42.5 427 1-435 1-528 (535) 75 protein:vir:63755 Length: 547 100.0 2.4E-73 1.5E-76 418.7 43.4 427 1-435 1-530 (547) 76 protein:vir:1082 Length: 359 # 100.0 3.3E-74 2.1E-77 423.4 38.3 350 1-377 1-359 (359) 77 protein:vir:78310 Length: 376 100.0 2.7E-74 1.7E-77 423.9 35.7 364 1-422 1-376 (376) 78 protein:vir:98643 Length: 395 100.0 5.4E-74 3.4E-77 422.3 36.9 378 1-427 1-395 (395) 79 protein:vir:858 Length: 378 # 100.0 6.9E-74 4.3E-77 421.7 36.0 356 1-424 1-378 (378) 80 protein:vir:94869 Length: 378 100.0 1.6E-73 1E-76 419.6 36.3 357 1-424 1-378 (378) 81 protein:vir:96579 Length: 576 100.0 6.8E-71 4.2E-74 405.3 41.7 424 1-435 32-537 (576) 82 protein:vir:79772 Length: 648 100.0 6.9E-71 4.3E-74 405.2 41.6 431 1-435 8-508 (648) 83 protein:vir:4194 Length: 540 # 100.0 3.4E-70 2.1E-73 401.4 38.7 408 1-435 6-462 (540) 84 protein:vir:3153 Length: 467 # 100.0 1.1E-69 7E-73 398.6 38.7 388 43-435 1-467 (467) 85 protein:vir:99312 Length: 563 100.0 9.2E-69 5.7E-72 393.6 43.2 420 1-435 43-548 (563) 86 protein:vir:95599 Length: 563 100.0 9.2E-69 5.7E-72 393.6 43.2 420 1-435 43-548 (563) 87 protein:vir:4156 Length: 542 # 100.0 3E-69 1.9E-72 396.3 38.6 413 3-435 1-462 (542) 88 protein:vir:99452 Length: 651 100.0 3.2E-67 2E-70 385.1 34.2 431 1-435 1-560 (651) 89 protein:vir:78641 Length: 278 100.0 8.2E-64 5.1E-67 366.4 32.5 276 64-341 1-278 (278) 90 protein:vir:79150 Length: 368 100.0 2.2E-51 1.3E-54 298.3 27.3 336 1-359 1-368 (368) 91 protein:vir:103971 Length: 376 100.0 1.8E-49 1.1E-52 287.9 30.4 313 1-348 26-376 (376) 92 protein:vir:267 Length: 348 # 100.0 4.7E-49 2.9E-52 285.5 31.6 322 1-352 1-348 (348) 93 protein:vir:79207 Length: 351 100.0 2.3E-48 1.5E-51 281.7 31.3 313 1-348 1-351 (351) 94 protein:vir:100328 Length: 346 100.0 2.7E-48 1.7E-51 281.4 30.8 323 1-346 1-346 (346) 95 protein:vir:78191 Length: 351 100.0 3.9E-48 2.4E-51 280.5 31.3 314 1-348 1-351 (351) 96 protein:vir:98567 Length: 340 100.0 2.4E-48 1.5E-51 281.7 26.9 323 1-345 1-340 (340) 97 protein:vir:78749 Length: 337 100.0 4.8E-48 3E-51 280.0 27.1 314 1-342 1-337 (337) 98 protein:vir:3780 Length: 345 # 100.0 1.7E-47 1.1E-50 277.0 29.9 328 1-343 1-345 (345) 99 protein:vir:4698 Length: 251 # 100.0 4.5E-48 2.8E-51 280.2 25.6 240 1-251 1-251 (251) 100 protein:vir:6058 Length: 344 # 100.0 6.3E-48 3.9E-51 279.4 26.3 320 1-346 1-344 (344) 101 protein:vir:3743 Length: 345 # 100.0 2.3E-47 1.4E-50 276.3 28.9 323 1-343 1-345 (345) 102 protein:vir:5691 Length: 344 # 100.0 3.6E-47 2.2E-50 275.2 27.8 320 1-346 1-344 (344) 103 protein:vir:1150 Length: 350 # 100.0 6.3E-47 3.9E-50 273.9 28.4 320 1-341 1-350 (350) 104 protein:vir:2013 Length: 344 # 100.0 4.8E-47 3E-50 274.5 26.7 320 1-346 1-344 (344) 105 protein:vir:98853 Length: 219 100.0 2.2E-38 1.4E-41 227.0 21.5 201 138-345 1-219 (219) 106 protein:vir:5249 Length: 437 # 100.0 7.2E-30 4.4E-33 180.4 32.4 393 1-432 1-437 (437) 107 protein:vir:94049 Length: 532 99.9 2.1E-26 1.3E-29 161.3 31.5 421 1-435 17-524 (532) 108 protein:vir:107742 Length: 537 99.9 4.4E-26 2.7E-29 159.6 32.5 417 1-435 25-531 (537) 109 protein:vir:79647 Length: 435 99.9 4E-25 2.5E-28 154.3 28.0 386 1-422 1-435 (435) 110 protein:vir:99563 Length: 862 99.9 3.5E-24 2.2E-27 149.2 31.2 415 1-435 93-594 (862) 111 protein:vir:96068 Length: 765 99.9 1.6E-23 1E-26 145.5 31.8 420 1-435 37-565 (765) 112 protein:vir:104338 Length: 422 99.9 7.3E-24 4.6E-27 147.4 29.6 381 1-431 1-422 (422) 113 protein:vir:107662 Length: 427 99.9 6.2E-24 3.9E-27 147.8 28.7 381 1-425 1-427 (427) 114 protein:vir:80040 Length: 461 99.9 9.1E-23 5.6E-26 141.4 26.0 392 1-434 1-461 (461) 115 protein:vir:108215 Length: 469 99.8 2.3E-20 1.4E-23 128.3 33.4 408 1-435 1-464 (469) 116 protein:vir:79538 Length: 502 99.8 2.1E-21 1.3E-24 133.9 25.8 422 1-427 1-502 (502) 117 protein:vir:103860 Length: 528 99.8 2.6E-19 1.6E-22 122.5 34.2 407 1-435 1-460 (528) 118 protein:vir:99232 Length: 526 99.8 3.2E-18 2E-21 116.5 34.3 396 1-435 1-448 (526) 119 protein:vir:79063 Length: 491 99.8 2E-18 1.2E-21 117.6 33.0 399 1-435 3-429 (491) 120 protein:vir:79233 Length: 526 99.8 5.3E-18 3.3E-21 115.3 34.3 405 1-435 1-458 (526) 121 protein:vir:107880 Length: 491 99.8 1.8E-17 1.1E-20 112.3 34.8 388 1-435 1-423 (491) 122 protein:vir:99853 Length: 488 99.8 8.6E-18 5.3E-21 114.1 32.5 390 1-435 14-420 (488) 123 protein:vir:96738 Length: 505 99.7 5.5E-19 3.4E-22 120.7 22.8 413 1-433 8-505 (505) 124 protein:vir:95542 Length: 548 99.7 2.2E-18 1.4E-21 117.4 25.2 431 1-435 1-544 (548) 125 protein:vir:1986 Length: 512 # 99.7 6.2E-17 3.9E-20 109.4 32.3 404 4-435 1-469 (512) 126 protein:vir:389 Length: 530 # 99.7 3E-17 1.9E-20 111.2 28.5 418 1-435 1-530 (530) 127 protein:vir:98816 Length: 446 99.7 1.8E-17 1.1E-20 112.4 24.7 367 1-380 3-446 (446) 128 protein:vir:6382 Length: 553 # 99.7 8E-17 5E-20 108.8 27.8 419 1-433 2-553 (553) 129 protein:vir:3420 Length: 533 # 99.7 4.7E-17 2.9E-20 110.1 24.5 422 1-435 3-533 (533) 130 protein:vir:10321 Length: 495 99.7 5.2E-17 3.2E-20 109.8 24.2 412 1-433 1-495 (495) 131 protein:vir:95254 Length: 488 99.7 1.9E-15 1.2E-18 101.3 32.1 408 1-427 1-488 (488) 132 protein:vir:105782 Length: 449 99.6 6.2E-16 3.8E-19 104.0 24.1 367 1-431 23-449 (449) 133 protein:vir:3648 Length: 695 # 99.6 1.6E-15 1E-18 101.6 25.7 401 1-435 67-550 (695) 134 protein:vir:77981 Length: 448 99.6 1.8E-14 1.1E-17 95.9 30.2 396 1-435 1-442 (448) 135 protein:vir:78589 Length: 695 99.6 2.6E-15 1.6E-18 100.5 25.5 407 1-435 67-550 (695) 136 protein:vir:101541 Length: 694 99.6 2.7E-15 1.7E-18 100.4 25.4 405 1-435 62-549 (694) 137 protein:vir:79511 Length: 448 99.6 3.8E-14 2.3E-17 94.2 30.3 400 1-418 1-448 (448) 138 protein:vir:106716 Length: 698 99.6 8.9E-15 5.5E-18 97.6 24.6 419 1-435 67-575 (698) 139 protein:vir:78161 Length: 355 99.5 1.4E-13 9E-17 91.0 27.3 309 115-434 1-355 (355) 140 protein:vir:102426 Length: 631 99.4 5.1E-13 3.1E-16 88.0 21.2 425 1-435 1-533 (631) 141 protein:vir:106491 Length: 646 99.3 2.8E-12 1.8E-15 83.9 21.8 422 1-435 1-504 (646) 142 protein:vir:99088 Length: 629 99.2 4.7E-12 2.9E-15 82.7 18.9 420 1-435 1-525 (629) 143 protein:vir:8654 Length: 629 # 99.2 7.7E-12 4.7E-15 81.5 19.0 420 1-435 1-521 (629) 144 protein:vir:99916 Length: 504 99.2 1.3E-10 7.9E-14 74.8 25.2 418 1-434 23-504 (504) 145 protein:vir:5839 Length: 533 # 99.1 8.8E-11 5.5E-14 75.7 22.4 423 1-435 4-522 (533) 146 protein:vir:106027 Length: 629 99.1 1.5E-10 9.2E-14 74.5 22.2 424 1-435 1-516 (629) 147 protein:vir:97900 Length: 639 99.1 1.2E-10 7.5E-14 75.0 20.5 422 1-435 1-531 (639) 148 protein:vir:107517 Length: 639 99.1 1.2E-10 7.5E-14 75.0 20.5 422 1-435 1-531 (639) 149 protein:vir:4782 Length: 522 # 99.0 5E-09 3.1E-12 66.1 25.4 402 1-434 1-522 (522) 150 protein:vir:9815 Length: 500 # 98.9 4.1E-09 2.6E-12 66.5 24.1 392 1-420 1-500 (500) 151 protein:vir:3028 Length: 500 # 98.9 4.1E-09 2.6E-12 66.5 24.1 392 1-420 1-500 (500) 152 protein:vir:8184 Length: 474 # 98.9 8.2E-09 5.1E-12 64.9 25.6 393 1-409 17-474 (474) 153 protein:vir:98883 Length: 517 98.9 1.1E-08 6.9E-12 64.2 25.6 397 1-433 1-517 (517) 154 protein:vir:94742 Length: 409 98.9 1.6E-08 1E-11 63.3 26.7 345 1-377 3-409 (409) 155 protein:vir:104082 Length: 485 98.9 1.9E-08 1.2E-11 63.0 27.6 396 1-435 8-483 (485) 156 protein:vir:98444 Length: 434 98.9 1.6E-08 9.9E-12 63.3 25.7 370 19-434 1-434 (434) 157 protein:vir:79703 Length: 505 98.9 2.1E-08 1.3E-11 62.7 27.2 393 1-420 1-505 (505) 158 protein:vir:7768 Length: 484 # 98.9 2.6E-08 1.6E-11 62.2 26.6 404 1-435 14-483 (484) 159 protein:vir:78907 Length: 518 98.8 3.2E-08 2E-11 61.7 28.7 403 1-425 1-518 (518) 160 protein:vir:1634 Length: 409 # 98.8 3.3E-08 2E-11 61.6 27.2 348 1-377 3-409 (409) 161 protein:vir:103219 Length: 201 98.8 5.4E-10 3.4E-13 71.4 15.1 187 213-428 1-201 (201) 162 protein:vir:2341 Length: 488 # 98.8 6.9E-09 4.3E-12 65.3 21.1 401 1-431 10-488 (488) 163 protein:vir:2427 Length: 485 # 98.8 6E-08 3.7E-11 60.2 26.6 397 1-435 6-485 (485) 164 protein:vir:7987 Length: 456 # 98.8 9E-09 5.6E-12 64.7 20.2 381 1-424 7-456 (456) 165 protein:vir:1587 Length: 508 # 98.7 6.6E-08 4.1E-11 59.9 30.6 392 1-424 1-508 (508) 166 protein:vir:4898 Length: 502 # 98.7 9E-08 5.6E-11 59.2 26.3 411 1-435 39-500 (502) 167 protein:vir:105819 Length: 456 98.7 3.7E-08 2.3E-11 61.3 21.3 390 1-424 7-456 (456) 168 protein:vir:102602 Length: 456 98.7 3.7E-08 2.3E-11 61.3 21.3 390 1-424 7-456 (456) 169 protein:vir:96494 Length: 501 98.7 1.3E-07 8.2E-11 58.3 25.7 412 1-435 38-499 (501) 170 protein:vir:5961 Length: 503 # 98.6 1.7E-07 1.1E-10 57.7 30.2 401 1-429 13-503 (503) 171 protein:vir:2732 Length: 501 # 98.6 2.3E-07 1.4E-10 57.0 26.0 409 1-431 38-501 (501) 172 protein:vir:2500 Length: 501 # 98.6 2.8E-07 1.7E-10 56.5 23.8 397 1-435 29-501 (501) 173 protein:vir:38 Length: 496 # N 98.6 2.9E-07 1.8E-10 56.4 25.3 394 1-425 3-496 (496) 174 protein:vir:99072 Length: 479 98.5 3.1E-07 1.9E-10 56.3 25.4 395 1-435 15-470 (479) 175 protein:vir:4223 Length: 486 # 98.5 3.2E-07 2E-10 56.2 27.6 402 1-435 15-484 (486) 176 protein:vir:99522 Length: 470 98.5 4.3E-07 2.7E-10 55.5 25.2 376 1-433 39-470 (470) 177 protein:vir:94101 Length: 474 98.5 5.4E-07 3.4E-10 54.9 27.4 393 1-433 1-474 (474) 178 protein:vir:105889 Length: 474 98.5 5.4E-07 3.4E-10 54.9 27.4 393 1-433 1-474 (474) 179 protein:vir:9306 Length: 511 # 98.5 6E-07 3.7E-10 54.7 23.8 407 1-433 39-511 (511) 180 protein:vir:78537 Length: 480 98.4 6.4E-07 4E-10 54.5 26.2 404 1-435 1-477 (480) 181 protein:vir:1236 Length: 483 # 98.4 6.5E-07 4.1E-10 54.5 25.2 389 1-433 34-483 (483) 182 protein:vir:9751 Length: 422 # 98.4 6.8E-07 4.2E-10 54.4 27.9 358 1-403 1-422 (422) 183 protein:vir:9568 Length: 410 # 98.4 7.7E-07 4.8E-10 54.1 23.1 352 1-404 1-410 (410) 184 protein:vir:95806 Length: 440 98.4 9.7E-07 6E-10 53.6 23.0 383 1-424 10-440 (440) 185 protein:vir:106639 Length: 481 98.3 1.2E-06 7.4E-10 53.1 28.3 386 1-434 44-481 (481) 186 protein:vir:80959 Length: 499 98.3 1.3E-06 8E-10 52.9 24.6 395 1-425 3-499 (499) 187 protein:vir:103951 Length: 511 98.3 1.4E-06 8.8E-10 52.6 25.7 406 1-433 39-511 (511) 188 protein:vir:94805 Length: 492 98.3 1.6E-06 9.7E-10 52.4 25.1 386 1-433 43-492 (492) 189 protein:vir:99781 Length: 511 98.3 1.8E-06 1.1E-09 52.1 23.6 404 1-433 39-511 (511) 190 protein:vir:96240 Length: 511 98.3 1.9E-06 1.2E-09 52.0 25.5 407 1-433 39-511 (511) 191 protein:vir:3964 Length: 453 # 98.3 2E-06 1.2E-09 51.9 24.4 389 1-433 19-453 (453) 192 protein:vir:78227 Length: 480 98.2 2.9E-06 1.8E-09 50.9 26.4 405 1-435 1-479 (480) 193 protein:vir:78805 Length: 511 98.2 3.3E-06 2.1E-09 50.6 24.5 408 1-433 39-511 (511) 194 protein:vir:96366 Length: 511 98.2 3.3E-06 2.1E-09 50.6 24.5 408 1-433 39-511 (511) 195 protein:vir:78083 Length: 537 98.2 3.3E-06 2.1E-09 50.6 29.0 409 1-434 8-537 (537) 196 protein:vir:97171 Length: 512 98.1 3.9E-06 2.4E-09 50.2 23.7 405 1-433 42-512 (512) 197 protein:vir:95113 Length: 474 98.1 4.4E-06 2.7E-09 50.0 24.9 380 1-433 33-474 (474) 198 protein:vir:97336 Length: 492 98.1 5E-06 3.1E-09 49.7 26.2 391 1-431 42-492 (492) 199 protein:vir:93747 Length: 472 98.1 5.6E-06 3.5E-09 49.4 27.0 389 1-433 23-472 (472) 200 protein:vir:3609 Length: 452 # 98.1 5.7E-06 3.6E-09 49.3 23.3 383 1-433 17-452 (452) 201 protein:vir:80680 Length: 441 98.0 7.3E-06 4.5E-09 48.7 25.9 372 1-418 6-441 (441) 202 protein:vir:106571 Length: 499 98.0 9.4E-06 5.8E-09 48.1 25.2 404 1-435 1-493 (499) 203 protein:vir:96266 Length: 474 97.9 1.3E-05 8.1E-09 47.4 26.2 386 1-433 26-474 (474) 204 protein:vir:95899 Length: 474 97.9 1.3E-05 8.1E-09 47.4 26.2 386 1-433 26-474 (474) 205 protein:vir:105292 Length: 478 97.9 1.3E-05 8.2E-09 47.3 27.7 379 1-433 42-478 (478) 206 protein:vir:79043 Length: 479 97.8 2E-05 1.2E-08 46.4 26.0 378 1-425 22-479 (479) 207 protein:vir:94498 Length: 474 97.7 2.7E-05 1.7E-08 45.6 27.3 379 1-435 29-473 (474) 208 protein:vir:97447 Length: 474 97.7 2.7E-05 1.7E-08 45.6 27.3 379 1-435 29-473 (474) 209 protein:vir:4073 Length: 279 # 97.4 4.3E-06 2.7E-09 50.0 9.6 273 48-382 1-279 (279) 210 protein:vir:9871 Length: 429 # 97.2 0.00013 7.8E-08 42.0 23.8 366 1-430 7-429 (429) 211 protein:vir:106999 Length: 564 97.2 0.00014 8.9E-08 41.7 23.1 416 1-428 1-564 (564) 212 protein:vir:107112 Length: 478 97.1 0.00016 9.9E-08 41.4 27.6 378 1-435 42-477 (478) 213 protein:vir:96839 Length: 474 97.1 0.00016 1E-07 41.3 28.4 377 1-431 28-474 (474) 214 protein:vir:94546 Length: 506 97.0 0.0002 1.2E-07 40.9 24.3 387 1-435 36-502 (506) 215 protein:vir:733 Length: 453 # 96.7 0.00037 2.3E-07 39.4 24.6 388 1-421 1-453 (453) 216 protein:vir:105461 Length: 470 96.4 0.00064 4E-07 38.1 26.1 375 1-424 18-470 (470) 217 protein:vir:5665 Length: 511 # 96.4 0.00068 4.2E-07 37.9 21.8 393 1-420 1-511 (511) 218 protein:vir:102950 Length: 471 96.3 0.00073 4.5E-07 37.8 26.4 378 1-426 1-471 (471) 219 protein:vir:104892 Length: 558 96.2 0.00084 5.2E-07 37.4 22.9 417 1-435 5-554 (558) 220 protein:vir:97265 Length: 513 95.9 0.0013 7.9E-07 36.5 26.9 402 1-432 20-513 (513) 221 protein:vir:96179 Length: 468 95.8 0.0015 9E-07 36.1 26.7 369 1-429 32-468 (468) 222 protein:vir:108049 Length: 524 95.2 0.0025 1.6E-06 34.8 22.6 397 1-421 1-524 (524) 223 protein:vir:97376 Length: 320 95.0 0.0011 6.6E-07 36.9 10.1 304 1-386 1-320 (320) 224 protein:vir:9922 Length: 489 # 94.4 0.0046 2.8E-06 33.4 24.2 402 1-432 13-489 (489) 225 protein:vir:104500 Length: 537 94.0 0.0056 3.5E-06 32.9 23.6 411 1-435 1-535 (537) 226 protein:vir:98265 Length: 524 93.8 0.0064 4E-06 32.6 24.5 397 1-421 4-524 (524) 227 protein:vir:80453 Length: 535 93.7 0.0067 4.1E-06 32.5 22.6 406 1-435 46-535 (535) 228 protein:vir:101806 Length: 516 93.6 0.0071 4.4E-06 32.4 23.0 396 1-421 1-516 (516) 229 protein:vir:101189 Length: 516 93.6 0.0071 4.4E-06 32.4 23.0 396 1-421 1-516 (516) 230 protein:vir:103177 Length: 533 92.3 0.012 7.5E-06 31.1 23.0 402 1-435 1-529 (533) 231 protein:vir:100598 Length: 516 91.7 0.015 9.2E-06 30.6 22.8 396 1-421 1-516 (516) 232 protein:vir:81017 Length: 521 90.6 0.02 1.2E-05 29.9 23.9 397 1-420 2-521 (521) 233 protein:vir:106282 Length: 521 90.1 0.023 1.4E-05 29.6 23.3 397 1-421 1-521 (521) 234 protein:vir:105154 Length: 525 89.8 0.024 1.5E-05 29.5 18.0 396 1-435 47-525 (525) 235 protein:vir:6596 Length: 521 # 88.1 0.035 2.2E-05 28.6 24.3 397 1-420 8-521 (521) 236 protein:vir:7208 Length: 524 # 87.5 0.038 2.4E-05 28.3 20.3 397 1-421 1-524 (524) 237 protein:vir:6896 Length: 523 # 87.2 0.04 2.5E-05 28.2 20.5 398 1-421 1-523 (523) 238 protein:vir:103458 Length: 524 85.8 0.05 3.1E-05 27.7 20.3 397 1-421 1-524 (524) 239 protein:vir:94956 Length: 452 85.7 0.051 3.2E-05 27.7 27.3 373 1-422 1-452 (452) 240 protein:vir:102330 Length: 451 85.3 0.054 3.3E-05 27.5 25.4 364 1-423 14-451 (451) 241 protein:vir:95149 Length: 501 83.9 0.064 4E-05 27.1 26.6 396 1-433 15-501 (501) 242 protein:vir:103765 Length: 549 80.8 0.091 5.7E-05 26.3 16.0 404 1-435 1-547 (549) 243 protein:vir:102668 Length: 547 69.7 0.22 0.00014 24.2 19.7 377 1-426 36-547 (547) 244 protein:vir:78393 Length: 489 68.1 0.24 0.00015 24.0 23.9 387 1-424 22-489 (489) 245 protein:vir:94709 Length: 522 63.8 0.31 0.00019 23.4 14.7 402 1-435 1-521 (522) 246 protein:vir:95315 Length: 559 59.0 0.4 0.00025 22.8 15.1 404 1-429 1-559 (559) 247 protein:vir:7430 Length: 563 # 58.0 0.42 0.00026 22.6 19.6 417 1-435 1-548 (563) 248 protein:vir:101494 Length: 527 57.9 0.42 0.00026 22.6 23.9 388 1-435 1-523 (527) 249 protein:vir:102239 Length: 527 57.8 0.43 0.00026 22.6 24.0 388 1-435 1-523 (527) 250 protein:vir:95014 Length: 491 54.6 0.5 0.00031 22.2 24.9 389 1-431 22-491 (491) 251 protein:vir:572 Length: 506 # 50.8 0.6 0.00037 21.8 10.1 394 1-435 30-502 (506) 252 protein:vir:3361 Length: 535 # 50.1 0.62 0.00038 21.7 15.7 385 1-435 56-535 (535) 253 protein:vir:8883 Length: 543 # 49.7 0.63 0.00039 21.7 12.4 401 1-435 1-541 (543) 254 protein:vir:94572 Length: 535 47.8 0.69 0.00043 21.5 14.2 383 1-430 45-535 (535) 255 protein:vir:7017 Length: 515 # 41.4 0.93 0.00058 20.8 17.6 381 1-425 41-515 (515) 256 protein:vir:100039 Length: 522 33.2 1.4 0.00085 19.8 12.3 392 1-433 1-522 (522) 257 protein:vir:98506 Length: 555 26.1 2 0.0012 19.0 16.9 410 1-435 1-552 (555) 258 protein:vir:107404 Length: 555 26.1 2 0.0012 19.0 16.9 410 1-435 1-552 (555) 259 protein:vir:107822 Length: 555 26.1 2 0.0012 19.0 16.9 410 1-435 1-552 (555) 260 protein:vir:103330 Length: 517 21.7 2.6 0.0016 18.3 13.4 399 1-427 1-517 (517) 261 protein:vir:1538 Length: 535 # 21.7 2.6 0.0016 18.3 18.7 382 1-430 40-535 (535) No 1 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=100.00 E-value=1.9e-96 Score=545.38 Aligned_cols=414 Identities=26% Similarity=0.427 Sum_probs=357.9 Q ss_pred CchHHHHHhhcccccccccccc---ccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc-- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQ---NPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY-- 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~-- 75 (435) ||||+|++++|+..+++..... .....++.+.|... ....++.+.++++++||+||++||++||++||++++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~-~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~ 79 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISP-STISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEY 79 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCc-CccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCC Confidence 9999999999987666543322 22345556666544 34567788899999999999999999999999998754 Q ss_pred --cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC-----ce Q lcl|NC_019456. 76 --KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN-----NS 148 (435) Q Consensus 76 --~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~-----~~ 148 (435) ....+|+++++|+.+||++||+++||+.++.+++++||+|++++++. .|+|++||||+|++|++..+..+ .. T Consensus 80 ~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~ 158 (432) T protein:vir:10 80 GIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR-KGKVQALWPIDASKVTVYIDDVGLLNSKTK 158 (432) T ss_pred ceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC-CCcEEEEEEEcCceeEEEEcCcccccccce Confidence 45678999999999999999999999999999999999999999874 48999999999999999887643 34 Q ss_pred EEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHH Q lcl|NC_019456. 149 YWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQ 226 (435) Q Consensus 149 ~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~ 226 (435) .+|.+..+|..++|+++||||++++++.++++|+||+..+...+..+.++++++.++|.|| +.+++++++.+++++.+ T Consensus 159 ~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~ 238 (432) T protein:vir:10 159 MWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKK 238 (432) T ss_pred EEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHH Confidence 5677778888899999999999988788999999999999999999999999999999997 67899999999999999 Q ss_pred HHHHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH-HHH Q lcl|NC_019456. 227 AMVNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV-THS 302 (435) Q Consensus 227 ~~~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~-~~~ 302 (435) ++++.|.... .|+++++|+++|++|+++++++.++|+.+.+++++++||++|||||.+||..+.+++++.|++ ..| T Consensus 239 ~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~ 318 (432) T protein:vir:10 239 VFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQF 318 (432) T ss_pred HHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHH Confidence 9999997654 567899999999999999999999999999999999999999999999999998888887654 567 Q ss_pred HHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCc Q lcl|NC_019456. 303 WTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAAD 382 (435) Q Consensus 303 ~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd 382 (435) +++||.|++.+|+++|+++||++.++..|++++||++++++.|.+++++++++++++|++|+||+|+++|+||+ |||| T Consensus 319 ~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi--~ggD 396 (432) T protein:vir:10 319 YTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE--AGGD 396 (432) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCCC Confidence 79999999999999999999999999999999999999999999999999999999999999999999999998 5899 Q ss_pred eeeecccccchhccccccccccccccccccccccCCCCCC Q lcl|NC_019456. 383 HLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENT 422 (435) Q Consensus 383 ~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 422 (435) ++++++|++|++...+....++.. .....++|++++ T Consensus 397 ~~~~~~n~~~~~~~~~~~~k~~~~----~~~~~~~~~~~~ 432 (432) T protein:vir:10 397 RLLVNGNMLPIDMAGQAYLKGGDT----NGEVSKEGNEGN 432 (432) T ss_pred eEeecccccchhhccccccCCCCC----CCCCCCCCCCCC Confidence 999999999998765533221111 111222222222 No 2 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=100.00 E-value=1.9e-96 Score=545.38 Aligned_cols=414 Identities=26% Similarity=0.427 Sum_probs=357.9 Q ss_pred CchHHHHHhhcccccccccccc---ccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc-- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQ---NPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY-- 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~-- 75 (435) ||||+|++++|+..+++..... .....++.+.|... ....++.+.++++++||+||++||++||++||++++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~-~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~ 79 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISP-STISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEY 79 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCc-CccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCC Confidence 9999999999987666543322 22345556666544 34567788899999999999999999999999998754 Q ss_pred --cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC-----ce Q lcl|NC_019456. 76 --KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN-----NS 148 (435) Q Consensus 76 --~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~-----~~ 148 (435) ....+|+++++|+.+||++||+++||+.++.+++++||+|++++++. .|+|++||||+|++|++..+..+ .. T Consensus 80 ~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~ 158 (432) T protein:vir:10 80 GIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR-KGKVQALWPIDASKVTVYIDDVGLLNSKTK 158 (432) T ss_pred ceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC-CCcEEEEEEEcCceeEEEEcCcccccccce Confidence 45678999999999999999999999999999999999999999874 48999999999999999887643 34 Q ss_pred EEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHH Q lcl|NC_019456. 149 YWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQ 226 (435) Q Consensus 149 ~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~ 226 (435) .+|.+..+|..++|+++||||++++++.++++|+||+..+...+..+.++++++.++|.|| +.+++++++.+++++.+ T Consensus 159 ~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~ 238 (432) T protein:vir:10 159 MWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKK 238 (432) T ss_pred EEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHH Confidence 5677778888899999999999988788999999999999999999999999999999997 67899999999999999 Q ss_pred HHHHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH-HHH Q lcl|NC_019456. 227 AMVNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV-THS 302 (435) Q Consensus 227 ~~~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~-~~~ 302 (435) ++++.|.... .|+++++|+++|++|+++++++.++|+.+.+++++++||++|||||.+||..+.+++++.|++ ..| T Consensus 239 ~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~ 318 (432) T protein:vir:10 239 VFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQF 318 (432) T ss_pred HHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHH Confidence 9999997654 567899999999999999999999999999999999999999999999999998888887654 567 Q ss_pred HHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCc Q lcl|NC_019456. 303 WTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAAD 382 (435) Q Consensus 303 ~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd 382 (435) +++||.|++.+|+++|+++||++.++..|++++||++++++.|.+++++++++++++|++|+||+|+++|+||+ |||| T Consensus 319 ~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi--~ggD 396 (432) T protein:vir:10 319 YTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE--AGGD 396 (432) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCCC Confidence 79999999999999999999999999999999999999999999999999999999999999999999999998 5899 Q ss_pred eeeecccccchhccccccccccccccccccccccCCCCCC Q lcl|NC_019456. 383 HLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENT 422 (435) Q Consensus 383 ~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 422 (435) ++++++|++|++...+....++.. .....++|++++ T Consensus 397 ~~~~~~n~~~~~~~~~~~~k~~~~----~~~~~~~~~~~~ 432 (432) T protein:vir:10 397 RLLVNGNMLPIDMAGQAYLKGGDT----NGEVSKEGNEGN 432 (432) T ss_pred eEeecccccchhhccccccCCCCC----CCCCCCCCCCCC Confidence 999999999998765533221111 111222222222 No 3 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=100.00 E-value=1.9e-96 Score=545.38 Aligned_cols=414 Identities=26% Similarity=0.427 Sum_probs=357.9 Q ss_pred CchHHHHHhhcccccccccccc---ccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc-- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQ---NPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY-- 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~-- 75 (435) ||||+|++++|+..+++..... .....++.+.|... ....++.+.++++++||+||++||++||++||++++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~-~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~ 79 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISP-STISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEY 79 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCc-CccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCC Confidence 9999999999987666543322 22345556666544 34567788899999999999999999999999998754 Q ss_pred --cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC-----ce Q lcl|NC_019456. 76 --KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN-----NS 148 (435) Q Consensus 76 --~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~-----~~ 148 (435) ....+|+++++|+.+||++||+++||+.++.+++++||+|++++++. .|+|++||||+|++|++..+..+ .. T Consensus 80 ~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~ 158 (432) T protein:vir:10 80 GIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR-KGKVQALWPIDASKVTVYIDDVGLLNSKTK 158 (432) T ss_pred ceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC-CCcEEEEEEEcCceeEEEEcCcccccccce Confidence 45678999999999999999999999999999999999999999874 48999999999999999887643 34 Q ss_pred EEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHH Q lcl|NC_019456. 149 YWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQ 226 (435) Q Consensus 149 ~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~ 226 (435) .+|.+..+|..++|+++||||++++++.++++|+||+..+...+..+.++++++.++|.|| +.+++++++.+++++.+ T Consensus 159 ~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~ 238 (432) T protein:vir:10 159 MWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKK 238 (432) T ss_pred EEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHH Confidence 5677778888899999999999988788999999999999999999999999999999997 67899999999999999 Q ss_pred HHHHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH-HHH Q lcl|NC_019456. 227 AMVNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV-THS 302 (435) Q Consensus 227 ~~~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~-~~~ 302 (435) ++++.|.... .|+++++|+++|++|+++++++.++|+.+.+++++++||++|||||.+||..+.+++++.|++ ..| T Consensus 239 ~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~ 318 (432) T protein:vir:10 239 VFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQF 318 (432) T ss_pred HHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHH Confidence 9999997654 567899999999999999999999999999999999999999999999999998888887654 567 Q ss_pred HHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCc Q lcl|NC_019456. 303 WTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAAD 382 (435) Q Consensus 303 ~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd 382 (435) +++||.|++.+|+++|+++||++.++..|++++||++++++.|.+++++++++++++|++|+||+|+++|+||+ |||| T Consensus 319 ~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi--~ggD 396 (432) T protein:vir:10 319 YTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE--AGGD 396 (432) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCCC Confidence 79999999999999999999999999999999999999999999999999999999999999999999999998 5899 Q ss_pred eeeecccccchhccccccccccccccccccccccCCCCCC Q lcl|NC_019456. 383 HLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENT 422 (435) Q Consensus 383 ~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 422 (435) ++++++|++|++...+....++.. .....++|++++ T Consensus 397 ~~~~~~n~~~~~~~~~~~~k~~~~----~~~~~~~~~~~~ 432 (432) T protein:vir:10 397 RLLVNGNMLPIDMAGQAYLKGGDT----NGEVSKEGNEGN 432 (432) T ss_pred eEeecccccchhhccccccCCCCC----CCCCCCCCCCCC Confidence 999999999998765533221111 111222222222 No 4 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=100.00 E-value=7.9e-96 Score=541.96 Aligned_cols=414 Identities=25% Similarity=0.413 Sum_probs=356.1 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc----c Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY----K 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~----~ 76 (435) ||||++++++++...............+..++|... ....++.+.++++++|++||++||++||++||++++++ + T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~-~~~~v~~~~al~~~~v~~~i~~ia~~ia~l~~~~~~~~~~~~~ 79 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISP-STISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYGIQ 79 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHhcCCC-CcceechhhhhccHHHHHHHHHHHHhhccCceEEEEecCCcee Confidence 999999887655433333333344556677777654 34557788899999999999999999999999999753 4 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC-----ceEEE Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN-----NSYWY 151 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~-----~~~~~ 151 (435) ...+|+++++|+.+||++||+++||+.++.+++++||+|+++.++. .|+|++|||++|++|++..+..+ ...+| T Consensus 80 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-~G~~~~L~~i~~~~v~v~~~~~~~~~~~~~~~~ 158 (429) T protein:vir:10 80 RGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR-KGKVQALWPIDASKVTVYIDDVGLLNSKTKMWY 158 (429) T ss_pred eccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC-CCcEEEEEEEcCceeEEEEcCcccccccceEEE Confidence 5678999999999999999999999999999999999999999875 48999999999999999988654 33467 Q ss_pred EEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHH Q lcl|NC_019456. 152 RVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMV 229 (435) Q Consensus 152 ~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~ 229 (435) .+..+|..++|+++||||+++.++.++++|+||+..+...+..+.++++++.++|+|+ +++++++++.+++++.++++ T Consensus 159 ~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~ 238 (429) T protein:vir:10 159 VVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFR 238 (429) T ss_pred EEccCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHH Confidence 7777888889999999999988888999999999999999999999999999999997 67899999999999999999 Q ss_pred HHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH-HHHHHH Q lcl|NC_019456. 230 NDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV-THSWTM 305 (435) Q Consensus 230 ~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~-~~~~~~ 305 (435) +.|.... .|+++++|+++|++|++++.++.++|+.+.+++++++||++|||||.+||+.+.++++|.+++ ..|++. T Consensus 239 ~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~f~~~ 318 (429) T protein:vir:10 239 ENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTD 318 (429) T ss_pred HHHHHHhccccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHH Confidence 9997654 567899999999999999999999999999999999999999999999999999998887654 557799 Q ss_pred HHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceee Q lcl|NC_019456. 306 TLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLY 385 (435) Q Consensus 306 ~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~ 385 (435) +|.|++.+|+++|+++||++.++..|++++||++.+++.|.+++++++++++++|+||+||+|+++|+||+ ||||+++ T Consensus 319 ~l~P~~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~--~ggD~~~ 396 (429) T protein:vir:10 319 TLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE--AGGDRLL 396 (429) T ss_pred HHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999998 5899999 Q ss_pred ecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 386 ISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPE 433 (435) Q Consensus 386 ~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (435) +++|++|++...+.... +|+++++..++..+.+ T Consensus 397 ~~~n~~~~d~~~~~~~k---------------~g~~~~~~~~~~~e~~ 429 (429) T protein:vir:10 397 VNGNMLPIDMAGQAYLK---------------GGDTNGEVSKEGNEGN 429 (429) T ss_pred ecccccchhhccccccC---------------CCCCCCCCCCCCCCCC Confidence 99999999875443222 2222222111111111 No 5 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=100.00 E-value=7.1e-95 Score=536.73 Aligned_cols=404 Identities=20% Similarity=0.352 Sum_probs=348.5 Q ss_pred CchHHHHHhhcccccccccc------ccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQI------VQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN 74 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 74 (435) ||||++|++.........+. .+.....+....|.. ....++...++++++|++||++||++||++|++++++ T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~--~~~~v~~~~al~~~~v~~ci~~ia~~iA~lp~~~~~~ 78 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKFGIK--LNFSVRGKRALKENTVYVCTKIRAESIGKLSLKIYKD 78 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcchhhhhcccc--CCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEec Confidence 99999986654332221111 111122333333332 3345677788999999999999999999999999999 Q ss_pred ccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCc------e Q lcl|NC_019456. 75 YKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNN------S 148 (435) Q Consensus 75 ~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~------~ 148 (435) ++.+.+|+++++|+.+||++||+++||+.++.+++++||||++|+++. .|+|++|+|++|++|++..+.++. . T Consensus 79 ~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~-~G~~~~L~~i~~~~v~~~~~~~~~~~~~~~~ 157 (422) T protein:vir:13 79 KEEYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDR-KGKIIGLYPINSDNVTKIIDDDNFLSSLSKV 157 (422) T ss_pred CcccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC-CCcEEEEEEECCcceEEEEcCCcceeccceE Confidence 999999999999999999999999999999999999999999999875 589999999999999999988762 2 Q ss_pred EEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHH Q lcl|NC_019456. 149 YWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQ 226 (435) Q Consensus 149 ~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~ 226 (435) .|.+...+|...+|++++|||++.+++.++++|+||+..+..++..+.++++++.++|+|+ +++++++++.+++++.+ T Consensus 158 ~y~~~~~~g~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~ 237 (422) T protein:vir:13 158 WYVVTDKNGKEHKLLPDEMLHFIGDITLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVGDLDEKAKK 237 (422) T ss_pred EEEEEeCCCeEEEEcccceEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHH Confidence 3334445677788999999999988788999999999999999999999999999999997 67999999999999999 Q ss_pred HHHHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHH-HHHH Q lcl|NC_019456. 227 AMVNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEH-VTHS 302 (435) Q Consensus 227 ~~~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~-~~~~ 302 (435) +++++|.... +|+++++||++|++|++++.++.++|++|.+++++++||++|||||.+||+.+.++++|.++ ...| T Consensus 238 ~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~f 317 (422) T protein:vir:13 238 IFKKEFESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQQKDF 317 (422) T ss_pred HHHHHHHHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHH Confidence 9999997654 46789999999999999999999999999999999999999999999999999888888765 5567 Q ss_pred HHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCc Q lcl|NC_019456. 303 WTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAAD 382 (435) Q Consensus 303 ~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd 382 (435) +++||.|++.+|+++|+++||++.++..|++|+||++++++.|.+++++++++++++|+||+||+|+++|+||+ |||| T Consensus 318 ~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~--~ggD 395 (422) T protein:vir:13 318 YVTTLQSSLTVYEQEIQDKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPV--EGGD 395 (422) T ss_pred HHHHHHHHHHHHHHHHHHhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcC Confidence 79999999999999999999999998889999999999999999999999999999999999999999999999 5899 Q ss_pred eeeecccccchhccccccccccccccccccccccCCCCCCCC Q lcl|NC_019456. 383 HLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 383 ~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (435) ++++++|++|++.+.+... +||+++.+ T Consensus 396 ~~~~~~n~~~l~~~~~~~~---------------~~g~~~g~ 422 (422) T protein:vir:13 396 RLLVNGNMIPIEMAGEQYK---------------KGGEKGGK 422 (422) T ss_pred eeeeccCccchhhcccccc---------------cCCCcCCC Confidence 9999999999987654321 23333333 No 6 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=100.00 E-value=1.5e-93 Score=529.47 Aligned_cols=412 Identities=23% Similarity=0.321 Sum_probs=343.9 Q ss_pred CchHHHHHhhcccccccccc---ccccchh-hhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec-- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQI---VQNPIPQ-PLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN-- 74 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~---~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~-- 74 (435) ||||+|++++|.+....... ...+... ...+.+.....+..++.+.|+++++||+||++||++||+|||++|++ T Consensus 7 mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:81 7 LGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTP 86 (432) T ss_pred cchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHhhhhCceeeEEecC Confidence 99999999888665432111 1111111 11223333445667888999999999999999999999999999874 Q ss_pred --ccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEE Q lcl|NC_019456. 75 --YKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYR 152 (435) Q Consensus 75 --~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~ 152 (435) +.++.+|+++++|+.+||++||+++||+.++.+++++||||++++++ +|++++||||+|+.|++..+.+|...|+. T Consensus 87 ~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~--~g~~~~L~~l~~~~v~v~~~~~g~~~y~~ 164 (432) T protein:vir:81 87 DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT--DGRIESLQYLANDRLTITTDPKGNTAYRY 164 (432) T ss_pred CcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec--CCcEEEEEEEcCCceEEEECCCCcEEEEE Confidence 34557899999999999999999999999999999999999999875 38999999999999999999988877777 Q ss_pred EecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHH Q lcl|NC_019456. 153 VTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVN 230 (435) Q Consensus 153 ~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~ 230 (435) ...+|....|++++|+|++++ +.++++|+||+..+...|..+.++++++.++|+|| +.+++++++.+++++++++++ T Consensus 165 ~~~~g~~~~~~~~~iih~r~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~ 243 (432) T protein:vir:81 165 RRTDGQMIDIPKQQIWKIMGY-SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFAK 243 (432) T ss_pred EecCceEEEEccccEEEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHH Confidence 777888889999999999976 67889999999999999999999999999999997 568999999999999999999 Q ss_pred HHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc---cHHHH-HHHHHHHH Q lcl|NC_019456. 231 DFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST---TNVEH-VTHSWTMT 306 (435) Q Consensus 231 ~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~---~~~e~-~~~~~~~~ 306 (435) +|.. ..++|++++|++|++|+++++++.++|++|.+++++++||++|||||.+||..+.+++ ++.|| ...|+++| T Consensus 244 ~~~~-~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~~~t 322 (432) T protein:vir:81 244 KVSG-SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMT 322 (432) T ss_pred HHhh-hhcCCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHHHHHH Confidence 9864 4678999999999999999999999999999999999999999999999998877554 56655 55677899 Q ss_pred HhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeee Q lcl|NC_019456. 307 LMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYI 386 (435) Q Consensus 307 i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~ 386 (435) |.||+..||++|+++|+++.+. .+++++||++.+++.|.+++++++.+++++|+||+||+|+++|+||++ ++++.+++ T Consensus 323 l~P~~~~ie~~l~~kLl~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~-g~~~~~~~ 400 (432) T protein:vir:81 323 LSPWLRRIEQSIALNLLSPAER-RRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLG-GNAAVLTV 400 (432) T ss_pred HHHHHHHHHHHHHhhccCcccc-CceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC-CCcceEee Confidence 9999999999999999998775 578999999999999999999999999999999999999999999996 35567778 Q ss_pred cccccchhccccccccccccccccccccccCCCCCCCCCCCCC Q lcl|NC_019456. 387 SKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQS 429 (435) Q Consensus 387 ~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (435) ++|++|++........ ++.++..+.+++..+. T Consensus 401 ~~~~~pl~~~~~~~~~-----------~~~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 401 QSAMVPLDSIGLQASP-----------EPASGLGNQQQDKVSK 432 (432) T ss_pred cCcccchhhhccCCCC-----------CCCCCCCCcccccccC Confidence 9999999876432211 1111111111111111 No 7 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=100.00 E-value=1.8e-93 Score=529.08 Aligned_cols=412 Identities=23% Similarity=0.321 Sum_probs=344.7 Q ss_pred CchHHHHHhhccccccccccc---cccchhh-hhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIV---QNPIPQP-LDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY- 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~---~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~- 75 (435) ||+|++++.+|.+........ ..+.... ..+.+.....+..++.+.|+++++||+||++||++||+|||++|++. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:10 7 LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTP 86 (432) T ss_pred cchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHHHHhhhhCceeEEEecC Confidence 999999999987654321111 1111111 12233344556778889999999999999999999999999998743 Q ss_pred ---cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEE Q lcl|NC_019456. 76 ---KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYR 152 (435) Q Consensus 76 ---~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~ 152 (435) ..+.+|+++++|+.+||++||+++||+.++.+++++||||++++++ +|++++||||+|++|++..+.+|...|+. T Consensus 87 ~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~--~g~~~~L~~l~~~~v~v~~~~~g~~~y~~ 164 (432) T protein:vir:10 87 DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT--DGRIESLQYLANDRLTITTDTKGNTAYRY 164 (432) T ss_pred CCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec--CCcEEEEEEEcCCceEEEEcCCCcEEEEE Confidence 4557899999999999999999999999999999999999999875 48999999999999999999988887777 Q ss_pred EecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHH Q lcl|NC_019456. 153 VTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVN 230 (435) Q Consensus 153 ~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~ 230 (435) ...+|..++|++++|||++++ +.++++|+||+..+.+.|..+.++++++.++|+|| +.+++++++.+++|+.+++++ T Consensus 165 ~~~~g~~~~~~~~~iih~~~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~ 243 (432) T protein:vir:10 165 RRTDGQMIDIPKQQIWKIMGY-SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFAK 243 (432) T ss_pred EecCceEEEEcCccEEEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHH Confidence 777888899999999999976 67889999999999999999999999999999997 568999999999999999999 Q ss_pred HHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc---cHHHH-HHHHHHHH Q lcl|NC_019456. 231 DFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST---TNVEH-VTHSWTMT 306 (435) Q Consensus 231 ~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~---~~~e~-~~~~~~~~ 306 (435) +|.. ..++|+++||++|++|+++++++.|+||+|.+++++++||++|||||.+||..+.+++ +|.|+ ...|+++| T Consensus 244 ~~~~-~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~~f~~~t 322 (432) T protein:vir:10 244 KVSG-SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLSMT 322 (432) T ss_pred HHhh-hhhCCCceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHHHHHHHH Confidence 9964 5678999999999999999999999999999999999999999999999998876544 56655 55677999 Q ss_pred HhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeee Q lcl|NC_019456. 307 LMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYI 386 (435) Q Consensus 307 i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~ 386 (435) |.||+..||++|+++|+++.++ .+++++||++.+++.|.+++++++.+++++|+||+||+|+++|+||++ ++++.+++ T Consensus 323 l~P~~~~ie~~ln~kL~~~~~~-~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~-g~~~~~~~ 400 (432) T protein:vir:10 323 LSPWLRRIEQSIALNLLSPAER-RRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLG-GNAAVLTV 400 (432) T ss_pred HHHHHHHHHHHHHhhhcCcccc-CceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC-CCcceEee Confidence 9999999999999999998775 468999999999999999999999999999999999999999999996 34566678 Q ss_pred cccccchhccccccccccccccccccccccCCCCCCCCCCCCC Q lcl|NC_019456. 387 SKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQS 429 (435) Q Consensus 387 ~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (435) ++|++|++.+...... ++..+.++.+++..+. T Consensus 401 ~~~~~pl~~~~~~~~~-----------~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 401 QSAMVPLDSIGLQASP-----------EPASGLGNQQQDKVSK 432 (432) T ss_pred cCcccchhhhcccCCC-----------CCCCCCCCcccccccC Confidence 9999999876432111 1111112221111111 No 8 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=100.00 E-value=1.7e-93 Score=529.16 Aligned_cols=394 Identities=27% Similarity=0.419 Sum_probs=344.3 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc----c Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY----K 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~----~ 76 (435) ||||+|++++|.......+.. .+..+.+.|.. .++.+.++++++|++||++||++||++||++++++ . T Consensus 1 MG~~~~~~~~~~~~~~~~~~~---~~~~~~~~g~~-----~~~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~ 72 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNETVDMT---NPLLLQWLGVD-----PDTPRNQLSEATYFACLKILSESLGKLPLKMYQKTERGIV 72 (411) T ss_pred CchHHHHHhhccCcccccccc---hHHHHHHhcCc-----ccChhhhhccHHHHHHHHHHHHhHhhCceeEEEecCCcee Confidence 999999998886554433322 23344444432 34567789999999999999999999999999743 3 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCce-------E Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNS-------Y 149 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-------~ 149 (435) +..+|+++++|+.+||++||+++||+.++.+++++||||++++++ +|.+.+|||++|+.|++..+.++.. + T Consensus 73 ~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~--~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~ 150 (411) T protein:vir:81 73 KSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS--GPQLQALWILPSQYVTIVVDDRGLLGEKNAIWY 150 (411) T ss_pred eecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec--CCceEEEEEECCceEEEEEcCcccccccceEEE Confidence 556899999999999999999999999999999999999999986 4899999999999999998876522 2 Q ss_pred EEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHH Q lcl|NC_019456. 150 WYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQA 227 (435) Q Consensus 150 ~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~ 227 (435) .|....+|...+|+++||||+|++++.++++|+||+..+...+..+.++++++.++|+|+ +.+++++++.+++++.++ T Consensus 151 ~~~~~~~g~~~~~~~~eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~ 230 (411) T protein:vir:81 151 RYNDPYDGKMYVFRNDEILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDR 230 (411) T ss_pred EEEecCCceEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHH Confidence 233344677788999999999987788999999999999999999999999999999997 678999999999999999 Q ss_pred HHHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH-HHH Q lcl|NC_019456. 228 MVNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT-HSW 303 (435) Q Consensus 228 ~~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~-~~~ 303 (435) ++++|.... +|+|+++|+++|++|+++++++.++|+.|.+++++++||++|||||.+||..+.++++|.|++. .|+ T Consensus 231 ~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~ 310 (411) T protein:vir:81 231 LVKGFEQFANGSKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQNLAFY 310 (411) T ss_pred HHHHHHHHhcCccccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHHHHH Confidence 999997654 4678899999999999999999999999999999999999999999999999999999887654 677 Q ss_pred HHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCce Q lcl|NC_019456. 304 TMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADH 383 (435) Q Consensus 304 ~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~ 383 (435) ++||.|++.+|+++|+++||++.++..|.+|+||++++++.|.+++++.+++++++|+||+||+|+++|+||+ ||||+ T Consensus 311 ~~~l~P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~--~ggD~ 388 (411) T protein:vir:81 311 VDTLLYVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPAD--DYGNN 388 (411) T ss_pred HHHHHHHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCCCe Confidence 9999999999999999999999999999999999999999999999999999999999999999999999998 58999 Q ss_pred eeecccccchhccccccccccccccccccccccCCCCC Q lcl|NC_019456. 384 LYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGEN 421 (435) Q Consensus 384 ~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 (435) +++++|++|++.+.+. ..+|||+ T Consensus 389 ~~~~~n~~pl~~~~~~---------------~~kgGd~ 411 (411) T protein:vir:81 389 LMANGNYIPLSMLGAN---------------YGKGGDS 411 (411) T ss_pred eeeccCccchhhhhhh---------------hccCCCC Confidence 9999999999875322 1234444 No 9 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=100.00 E-value=1.3e-93 Score=529.80 Aligned_cols=399 Identities=18% Similarity=0.203 Sum_probs=340.1 Q ss_pred CchHHHHHhhccccccccc----------cc-c--------ccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQ----------IV-Q--------NPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLS 61 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~----------~~-~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia 61 (435) ||||+||++.-...+.... .. . ..++.+..+.+.....+..++...++++++|++||++|| T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~ci~~Ia 80 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETRALRNMAVLRCVTLIS 80 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhhhhccHHHHHHHHHHH Confidence 9999998763222111100 00 0 011122333444444556678889999999999999999 Q ss_pred HHHhhCceeeeecc---cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCcee Q lcl|NC_019456. 62 NVLASLPLHEYQNY---KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTV 138 (435) Q Consensus 62 ~~ia~~~~~~~~~~---~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v 138 (435) ++||++||++++++ +...+|+++++|+.+||++||+++||+.++.+++++||+|++|+++ .|.+++|+|++|..| T Consensus 81 ~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~--~g~~~~L~pl~~~~v 158 (431) T protein:vir:10 81 GTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWS--GNRPIRLIPMDRGSA 158 (431) T ss_pred HhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEc--CCceEEEEEEcCcee Confidence 99999999998853 4567799999999999999999999999999999999999999986 378999999999999 Q ss_pred EEEEcCCCceEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEe Q lcl|NC_019456. 139 SILRNTDNNSYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQY 216 (435) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~ 216 (435) ++..+.++..+|++...+|..++|+++||||||++ +.++++|+||+..+.+.|..+.+++++..++|+|| +++++++ T Consensus 159 ~~~~~~~~~~~y~~~~~~g~~~~~~~~dViHir~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~ 237 (431) T protein:vir:10 159 KGRLTSTWQIVYDYTTPTGDKIELPAREVFHLRDL-SIDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEV 237 (431) T ss_pred EEEEcCCCeEEEEEEeCCceEEEEchhhEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEec Confidence 99998888888777778888889999999999986 46889999999999999999999999999999997 6789999 Q ss_pred CCcCCHHHHHHHHHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc Q lcl|NC_019456. 217 DRSISPEKRQAMVNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST 293 (435) Q Consensus 217 ~~~~~~e~~~~~~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~ 293 (435) ++.+++|+.++++++|.... +|+|+++||++|++|+++++++.++|++|++++++++||++|||||.+||+.+.+++ T Consensus 238 ~~~ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~t~ 317 (431) T protein:vir:10 238 PKELSDNAYGRMKASVQENHTGSENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWG 317 (431) T ss_pred CCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCCcc Confidence 99999999999999997654 567899999999999999999999999999999999999999999999999988888 Q ss_pred cHHHHHH-HHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCC----cCHHHHH Q lcl|NC_019456. 294 TNVEHVT-HSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGI----FKPNEIR 368 (435) Q Consensus 294 ~~~e~~~-~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~----~t~NE~R 368 (435) +|.||+. .|+++||.||+.+||++|+++||++.+. .+++|+||++++++.|.+++++.+++++..|+ ||+||+| T Consensus 318 sn~eq~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~NE~R 396 (431) T protein:vir:10 318 SGIEQLAIFFIQYGLSHWFVSWEQAAARAFLPEKML-GQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQNEVR 396 (431) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhc-CCceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHHHHH Confidence 8887655 5678999999999999999999987655 57899999999999999999999999987654 9999999 Q ss_pred HHhCCCCCCCcCCceeeecccccchhcccccccccccccccccccc Q lcl|NC_019456. 369 ELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAP 414 (435) Q Consensus 369 ~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~ 414 (435) +++|+||+++++||++++|.|+.+.+...++ +. +. T Consensus 397 ~~~gl~p~~~~~gD~~~~p~n~~~~~~~~~~------p~-----~~ 431 (431) T protein:vir:10 397 EMLDLPRADDPVADQLRNPMTQKQKGSGDEP------PA-----TT 431 (431) T ss_pred HHhCCCCCCCccccceecccccccCCCCCCC------CC-----CC Confidence 9999999999999999999987654322111 00 00 No 10 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=100.00 E-value=2.8e-93 Score=527.98 Aligned_cols=412 Identities=23% Similarity=0.324 Sum_probs=343.9 Q ss_pred CchHHHHHhhcccccccccc-cc--ccchhh-hhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQI-VQ--NPIPQP-LDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY- 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~-~~--~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~- 75 (435) ||||++++.+|.+..+.... .. .+.... ..+.+.....+..++.+.|+++++||+||++||++||+|||++|+++ T Consensus 7 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~~~v~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:97 7 LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAVAAMPLMMYMRTP 86 (432) T ss_pred CchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHhhhcchHHHHHHHHHHHhhccCceEEEEecC Confidence 99999999998765432111 11 111111 12233344556778899999999999999999999999999999743 Q ss_pred ---cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEE Q lcl|NC_019456. 76 ---KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYR 152 (435) Q Consensus 76 ---~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~ 152 (435) ..+.+|+++++|+.+||++||+++||+.++.+++++||||++++++ +|++++||||+|++|++..+.+|...|+. T Consensus 87 ~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~--~g~~~~L~~l~p~~v~v~~~~~g~~~y~~ 164 (432) T protein:vir:97 87 DGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT--DGRIESLQYLANDRLTITTDTKGNTAYRY 164 (432) T ss_pred CCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec--CCcEEEEEEEcCcceEEEEcCCCcEEEEE Confidence 4557899999999999999999999999999999999999999985 38999999999999999999888877777 Q ss_pred EecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHH Q lcl|NC_019456. 153 VTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVN 230 (435) Q Consensus 153 ~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~ 230 (435) ...+|..++|++++|||+|++ +.++++|+||+..+...+..+.+++++.+++|+|| +.+++++++.+++|+++++++ T Consensus 165 ~~~~g~~~~~~~~~iih~r~~-~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~ 243 (432) T protein:vir:97 165 RRTDGQMIDIPRQQIWKIMGY-SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFSK 243 (432) T ss_pred EecCceEEEEccccEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCCCCHHHHHHHHH Confidence 777888889999999999976 56889999999999999999999999999999997 568999999999999999999 Q ss_pred HHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc---cHHHH-HHHHHHHH Q lcl|NC_019456. 231 DFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST---TNVEH-VTHSWTMT 306 (435) Q Consensus 231 ~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~---~~~e~-~~~~~~~~ 306 (435) +|.. ..++|+++||++|++|+++++++.|+|++|.+++++++||++|||||.+||..+.+++ ++.|+ ...|+++| T Consensus 244 ~~~~-~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~~f~~~t 322 (432) T protein:vir:97 244 KVSG-SVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMT 322 (432) T ss_pred HHhh-hhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHHHHHHHH Confidence 9864 4678999999999999999999999999999999999999999999999998876554 55555 45677999 Q ss_pred HhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeee Q lcl|NC_019456. 307 LMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYI 386 (435) Q Consensus 307 i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~ 386 (435) |.||++.||++|+++|+++.++ .+++++||++.+++.|.+++++++.+++++|+||+||+|+++|+||++ ++++.+++ T Consensus 323 l~P~~~~ie~~ln~kLl~~~e~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~-g~~~~~~~ 400 (432) T protein:vir:97 323 LSPWLRRIEQSIALNLLTPAER-RRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLG-GNAAVLTV 400 (432) T ss_pred HHHHHHHHHHHHhhhccCcccc-CceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC-CCcceEee Confidence 9999999999999999998775 578999999999999999999999999999999999999999999995 34455668 Q ss_pred cccccchhccccccccccccccccccccccCCCCCCCCCCCCC Q lcl|NC_019456. 387 SKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQS 429 (435) Q Consensus 387 ~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (435) ++|++|++.+...... ++..+.++.+++..+. T Consensus 401 ~~~~~pl~~~~~~~~~-----------~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:97 401 QSAMVPLDSIGLQASP-----------EPASGLGNQQQDKVSK 432 (432) T ss_pred cccccchhhhcccCCC-----------CCCCCCCCcccccccC Confidence 9999999876432111 1111111111111111 No 11 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=100.00 E-value=5.3e-93 Score=526.48 Aligned_cols=404 Identities=34% Similarity=0.575 Sum_probs=343.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) |+|++|+++.+-..-...+.... ..+..+.+ .....++.+.|+++++|++||++||++||+|||+++++++ ..+ T Consensus 4 ~~~~~~~k~~~~~~~~~~~~~~~--~~~~~~~~---~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~lp~~~~~~~~-~~~ 77 (409) T protein:vir:96 4 ENIVTRIKKKLIDNWIDQSASKL--YDFSPWKN---KSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK-VVN 77 (409) T ss_pred ccchhhhhhHHhhhhhccccccc--cccccccC---ccccccchhhHhhhHHHHHHHHHHHHhhhhCceEEeeccc-ccc Confidence 89999998864211111111111 11111111 1223355678999999999999999999999999998765 457 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEE-ecCCee Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRV-TSDIYN 159 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~-~~~~~~ 159 (435) |+++++|+.+||++||+++||+.++.+++++||||++|+++. .|.+++|||++|++|++..+.++..++|.+ ..+|.. T Consensus 78 ~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~ 156 (409) T protein:vir:96 78 TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI-YHQPSKLFLLNPDVVEMLIENQSRELYYSIHAATGNK 156 (409) T ss_pred hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECC-CCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCCceE Confidence 999999999999999999999999999999999999999874 589999999999999999888776665554 445677 Q ss_pred EEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCC Q lcl|NC_019456. 160 FTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKEN 239 (435) Q Consensus 160 ~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~ 239 (435) .+|+++|||||+++++.++++|+||+..+...+..+.+++++....+.+++.++++.++.+++++.++++++|.+..+++ T Consensus 157 ~~~~~~evih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~n~ 236 (409) T protein:vir:96 157 LIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGSNVSTEKRQQVLEDFKQYYEEN 236 (409) T ss_pred EEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHhcCCCceeEEecCCCCCHHHHHHHHHHHHHHhhcC Confidence 89999999999988888999999999999999999999888876555556678889999999999999999999999999 Q ss_pred CccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH-HHHHHHHhHHHHHHHHHH Q lcl|NC_019456. 240 GGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT-HSWTMTLMPIIRQYESQF 318 (435) Q Consensus 240 ~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~-~~~~~~i~P~~~~i~~~l 318 (435) ++++++++|++|+++++++.++|+.|.+++++++||++|||||.+||+...++++|.|++. .|+++||.|++.+|+++| T Consensus 237 g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~l 316 (409) T protein:vir:96 237 GGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEF 316 (409) T ss_pred CCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999999888988887655 667899999999999999 Q ss_pred HHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccc Q lcl|NC_019456. 319 NMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYD 398 (435) Q Consensus 319 ~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~ 398 (435) +++||++.++..|.+|+||++++++.|.+++++++++++++|++|+||+|+++|+||+ ||||++++++|++|++...+ T Consensus 317 ~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi--~ggD~~~~~~n~~~~~~~~~ 394 (409) T protein:vir:96 317 NRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPV--EGGDKPLISGDLYPIDTPLE 394 (409) T ss_pred HhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC--CCcceeeecccccccccchh Confidence 9999999999999999999999999999999999999999999999999999999999 58999999999999876422 Q ss_pred ccccccccccccccccccCCCCCCCCCC Q lcl|NC_019456. 399 AILDNKIQTDASVAAPKQEGGENTNENG 426 (435) Q Consensus 399 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 426 (435) .. ...+||+++.+++ T Consensus 395 ~~-------------~~~~gG~~n~~e~ 409 (409) T protein:vir:96 395 LR-------------KSLKGGDKNVNES 409 (409) T ss_pred hc-------------ccccCCCCCcCCC Confidence 21 1234554444433 No 12 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=100.00 E-value=2.3e-92 Score=522.95 Aligned_cols=432 Identities=16% Similarity=0.203 Sum_probs=339.8 Q ss_pred CchHHHHHhhccccccccc--cccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc--- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQ--IVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY--- 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~--- 75 (435) ||||++|+++......... ....+....+..+|.....+..++.+.|+++++||+||++||++||+|||+++++. T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGT 80 (457) T ss_pred CchhhhhhccccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHhhCceEEEEecCCc Confidence 9999998654322211111 11111222223344444556778889999999999999999999999999999743 Q ss_pred cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCc---e--EE Q lcl|NC_019456. 76 KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNN---S--YW 150 (435) Q Consensus 76 ~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~---~--~~ 150 (435) ....+|+.+..|+.+||++||+++||+.++.+++++||||++|.++ .|.+.+||||+|.+|++..+..+. . +. T Consensus 81 ~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~--~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~ 158 (457) T protein:vir:62 81 RKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA--GPNIAGLDVLDPTKIHVHMVMVDGLRRKVFEA 158 (457) T ss_pred cccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC--CCcEEEEEEEcCcceEEEEeccCCccceeEEE Confidence 3345667677777899999999999999999999999999998654 489999999999999987765432 1 22 Q ss_pred EEEecCCe---eEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHH Q lcl|NC_019456. 151 YRVTSDIY---NFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKR 225 (435) Q Consensus 151 ~~~~~~~~---~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~ 225 (435) |.+..++. ...|++++||||+++++.+.++|+||+..+.+.|..+.+++++++++|+|| +++++++++.+++|+. T Consensus 159 y~~~~~g~~~~~~~~~~~eiih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~ 238 (457) T protein:vir:62 159 YDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTMSEEGL 238 (457) T ss_pred EEEccCCceeEEEeeCccceEEecCCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCCCHHHH Confidence 33333332 246899999999998777779999999999999999999999999999997 5689999999999999 Q ss_pred HHHHHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc--cHHHH-H Q lcl|NC_019456. 226 QAMVNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST--TNVEH-V 299 (435) Q Consensus 226 ~~~~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~--~~~e~-~ 299 (435) +++++.|.... +|+++++||++|++|+++++++.|+||+|++++++++||++|||||.+||..+.+++ +|.|+ . T Consensus 239 ~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~ 318 (457) T protein:vir:62 239 ARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQN 318 (457) T ss_pred HHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHH Confidence 99999997754 467899999999999999999999999999999999999999999999998888765 55554 5 Q ss_pred HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCc Q lcl|NC_019456. 300 THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDE 379 (435) Q Consensus 300 ~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~ 379 (435) ..|+++||.|++.+|+++|+++|+++.+. .+++++||++.+++.|.+++++++.+++++|+||+||+|+++|+||++++ T Consensus 319 ~~f~~~~l~P~~~~ie~~ln~~L~~~~~~-~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g 397 (457) T protein:vir:62 319 IAFTMFSLRPWLERIEAGFNRLLFAETAD-RFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDG 397 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcCcccc-CceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 57779999999999999999999998775 57899999999999999999999999999999999999999999999888 Q ss_pred CCceeeecccccchhccccccccccccccccccccccCC----CCCCCCCCCCCCCCCCC Q lcl|NC_019456. 380 AADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEG----GENTNENGLQSTEPEGS 435 (435) Q Consensus 380 ~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~ 435 (435) +||++++|+|+.+++................+..++..+ +..++.++++....|.+ T Consensus 398 ~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 457 (457) T protein:vir:62 398 LGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPADDEEPDNAEGDPDEGETEDDDDA 457 (457) T ss_pred CcceeeeccccccccccccccccCCCccCCCCccCCCCCCCCCCCCCCCccccccccccC Confidence 889999999999988765443333222211111111111 11111222222222222 No 13 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=100.00 E-value=1.8e-92 Score=523.54 Aligned_cols=402 Identities=22% Similarity=0.302 Sum_probs=339.4 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccc--cccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc--- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGV--KLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY--- 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~--- 75 (435) ||||++|++. ... .... ......+..+. ....+..++.+.++++++|++||++||++||++||++++.. T Consensus 1 Mg~f~~lf~r---~~~-~~~~--~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~ 74 (414) T protein:vir:44 1 MVFFSGLFQR---KSD-APVT--TPAELADAIGLSYDTYTGKQISSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSL 74 (414) T ss_pred Cchhhhhhcc---Ccc-Cccc--chhhHhHhhccCccccCCceechhhhhccHHHHHHHHHHHHHhccCceEEEEecCCc Confidence 9999987543 111 1111 11222232222 22334556778899999999999999999999999999743 Q ss_pred -cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEe Q lcl|NC_019456. 76 -KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVT 154 (435) Q Consensus 76 -~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 154 (435) ....+|+++++|+.+||++||+++||+.++.+++++||||++++++ .|.|.+||||+|..|++..+.++...|+... T Consensus 75 ~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~--~g~~~~L~~l~~~~v~~~~~~~~~~~y~~~~ 152 (414) T protein:vir:44 75 KQRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA--FGEVAELLPVDPGCVVPKLNSSWEPVYQVTF 152 (414) T ss_pred eeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC--CCcEEEEEEEcCceEEEEECCCCcEEEEEEe Confidence 3556799999999999999999999999999999999999999865 4899999999999999999988888887777 Q ss_pred cCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHH Q lcl|NC_019456. 155 SDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDF 232 (435) Q Consensus 155 ~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~ 232 (435) .+|...+|++++|||++++ +.++++|+||+..+...+..+.++++++.++|.|+ +++++++++.+++|+.++++++| T Consensus 153 ~~g~~~~~~~~evih~~~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~ 231 (414) T protein:vir:44 153 PDGSTDVLSQEDIWHVRTL-TLDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDF 231 (414) T ss_pred cCceEEEEccccEEEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHH Confidence 8888889999999999976 56889999999999999999999999999999997 57899999999999999999999 Q ss_pred HHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH-HHHHHHHHh Q lcl|NC_019456. 233 LRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV-THSWTMTLM 308 (435) Q Consensus 233 ~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~-~~~~~~~i~ 308 (435) .... +|+++++|+++|++|+++++++.++||+|.+++++++||++|||||.+||..+.++++|.|++ ..|+++||+ T Consensus 232 ~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~~~~~~~l~ 311 (414) T protein:vir:44 232 EERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLV 311 (414) T ss_pred HHHhcCccccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHH Confidence 7655 467889999999999999999999999999999999999999999999999998898887655 566789999 Q ss_pred HHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecc Q lcl|NC_019456. 309 PIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISK 388 (435) Q Consensus 309 P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~ 388 (435) |++++|+++|+++|+++.++ .+++|+||++++++.|.+++++++++++++|+||+||+|+++|+||+ ||||++++++ T Consensus 312 P~~~~ie~~ln~~L~~~~~~-~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~--~ggD~~~~~~ 388 (414) T protein:vir:44 312 PYLTRIEQRINTGLVRKSKQ-GVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPR--PGGDVYLTPM 388 (414) T ss_pred HHHHHHHHHHHhhcCCcccc-CceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcceecccc Confidence 99999999999999998875 47899999999999999999999999999999999999999999999 6899999999 Q ss_pred cccchhccccccccccccccccccccccCCCCCCCCCCCCC Q lcl|NC_019456. 389 DLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQS 429 (435) Q Consensus 389 n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (435) |+.+..... .+..++++++++++.++ T Consensus 389 n~~~~~~~~---------------~~~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 389 NMTTKPSDG---------------SKAGKQKDNANADETTS 414 (414) T ss_pred cccccCCcc---------------ccCCCCCCCCCCCCCCC Confidence 986543211 11111222333222222 No 14 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=100.00 E-value=2e-92 Score=523.26 Aligned_cols=408 Identities=24% Similarity=0.368 Sum_probs=349.9 Q ss_pred chHHHHHhhccccccccccccccchhhhhhcc-ccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc----c Q lcl|NC_019456. 2 SFMSKVRQFFGVHDQANQIVQNPIPQPLDMAG-VKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY----K 76 (435) Q Consensus 2 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~----~ 76 (435) =||++++ +.................++++ ..+.....++.+.++++++||+||++||++||+|||+++++. + T Consensus 1 m~~~~~f---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l~~~~~~~~~~~~~ 77 (416) T protein:vir:12 1 MLLERMF---EKRSGSSDHEDGFNNILLNMFGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAKLPIHTYKRTDGGIE 77 (416) T ss_pred Cccchhc---ccccCccccCccchhHHHHhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCccc Confidence 2444442 2233323333333333444443 334455678888999999999999999999999999998743 3 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecC Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSD 156 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~ 156 (435) ...+|+++++|+.+||++||+++||+.++.+++++||||+++.++. .|.|.+||||+|.+|++..+.++..++|.+..+ T Consensus 78 ~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~-~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~ 156 (416) T protein:vir:12 78 RKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGS-HGYPEALFPLRPDYTNAYVHPTTGMLWYQTVLN 156 (416) T ss_pred cccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC-CCcEEEEEEECCcceEEEEeCCCcEEEEEEecC Confidence 4567999999999999999999999999999999999999999874 489999999999999999998888888888889 Q ss_pred CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHH Q lcl|NC_019456. 157 IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLR 234 (435) Q Consensus 157 ~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~ 234 (435) |...+|++++|||++++ +.++++|+||+..+..++..+.+++++..++|+|+ +++++++++.+++|+.++++++|.. T Consensus 157 g~~~~~~~~eiih~~~~-~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~ 235 (416) T protein:vir:12 157 GKAIELYDYEVLHFKGL-STDGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAFLDEKPKENVRKEWKR 235 (416) T ss_pred CeEEEecCccEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHHHHHHH Confidence 99999999999999976 46789999999999999999999999999999997 6799999999999999999999975 Q ss_pred HhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH-HHHHHHHHhHHHHH Q lcl|NC_019456. 235 MVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV-THSWTMTLMPIIRQ 313 (435) Q Consensus 235 ~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~-~~~~~~~i~P~~~~ 313 (435) . .++++++|+++|++|+++++++.++|+.|.+++++++||++|||||.+||....++++|.|++ ..|+++||.|++.+ T Consensus 236 ~-~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ 314 (416) T protein:vir:12 236 V-NKVENIAIIDYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIEYVRNTLQPWIVN 314 (416) T ss_pred H-hcCCCeeecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHHHHHHHHHHHHHH Confidence 4 467889999999999999999999999999999999999999999999999998999988765 46779999999999 Q ss_pred HHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccch Q lcl|NC_019456. 314 YESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPL 393 (435) Q Consensus 314 i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l 393 (435) |+++|+++|+++.++..|++|+||++++++.|.+++++++.+++++|+||+||+|+++|+||+ ||||++++++|++++ T Consensus 315 ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi--~ggd~~~~~~n~~~~ 392 (416) T protein:vir:12 315 FEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPI--ENGDKYISSLNYVFL 392 (416) T ss_pred HHHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcceeeeccccccc Confidence 999999999999999999999999999999999999999999999999999999999999999 579999999999999 Q ss_pred hccccccccccccccccccccccCCCCCCCCC Q lcl|NC_019456. 394 DKYYDAILDNKIQTDASVAAPKQEGGENTNEN 425 (435) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (435) +...+....+.+ ...+|||+.++. T Consensus 393 ~~~~~~~~~~~~--------~~~~gge~~~~g 416 (416) T protein:vir:12 393 DFLEEYQRLKAG--------GAMKGGDNKNEG 416 (416) T ss_pred cccchhhccccc--------cccCCCCCcCCC Confidence 876544322211 123355544433 No 15 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=100.00 E-value=1.9e-92 Score=523.45 Aligned_cols=408 Identities=35% Similarity=0.585 Sum_probs=341.0 Q ss_pred CchHHH--HHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_019456. 1 MSFMSK--VRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQM 78 (435) Q Consensus 1 Mg~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 78 (435) |+||++ ++..+...-...... ......+++.+........++...++++|+|++||++||++||++||+++++++ . T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~-~ 78 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWID-QSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK-V 78 (412) T ss_pred CccchhhhhhhhhhhhHhhhhhc-ccccccccccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEeeccc-c Confidence 999955 332221110000000 011111222222222333456788999999999999999999999999998765 4 Q ss_pred ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEe-cCC Q lcl|NC_019456. 79 DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVT-SDI 157 (435) Q Consensus 79 ~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~-~~~ 157 (435) .+|+++++|+.+||++||+++||+.++.+++++||+|++++++. .|.+++|+||+|+.|++..+.++..++|.+. .+| T Consensus 79 ~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~g 157 (412) T protein:vir:26 79 VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI-YHQPSKLFLLNPDVVEMLIENQSRELYYSIHAATG 157 (412) T ss_pred ccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECC-CCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCCc Confidence 57899999999999999999999999999999999999999874 5899999999999999999887766655544 456 Q ss_pred eeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhc Q lcl|NC_019456. 158 YNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVK 237 (435) Q Consensus 158 ~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 237 (435) ....|+++|||||+++++.++++|+||+..+...+..+.++++++...+.+++.++++.++.+++++.++++++|+...+ T Consensus 158 ~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~~~~~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~ 237 (412) T protein:vir:26 158 NKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYE 237 (412) T ss_pred eEEEEccccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhh Confidence 67789999999999988889999999999999999999999888765556667788999999999999999999999889 Q ss_pred CCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH-HHHHHHHhHHHHHHHH Q lcl|NC_019456. 238 ENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT-HSWTMTLMPIIRQYES 316 (435) Q Consensus 238 ~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~-~~~~~~i~P~~~~i~~ 316 (435) ++++++||++|++|+++++++.++||.|.+++++++||++|||||.+||+...+++++.|++. .|+++||.|++.+|++ T Consensus 238 ~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~ 317 (412) T protein:vir:26 238 ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEE 317 (412) T ss_pred cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999999999999999999998888888887655 5678999999999999 Q ss_pred HHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcc Q lcl|NC_019456. 317 QFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKY 396 (435) Q Consensus 317 ~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~ 396 (435) +|+++|+++.++..+.+|+||++++++.|.+++++.+++++++|++|+||+|+++|+||+ ||||++++++|++|++.. T Consensus 318 ~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~--~ggD~~~~~~n~~~~~~~ 395 (412) T protein:vir:26 318 EFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPV--EGGDKPLISGDLYPIDTP 395 (412) T ss_pred HHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeeeecccccccccc Confidence 999999999999999999999999999999999999999999999999999999999999 589999999999998764 Q ss_pred ccccccccccccccccccccCCCCCCCCCC Q lcl|NC_019456. 397 YDAILDNKIQTDASVAAPKQEGGENTNENG 426 (435) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 426 (435) .+.+ ...+||+++...+ T Consensus 396 ~~~~-------------~~~~gG~~n~~e~ 412 (412) T protein:vir:26 396 LELR-------------KSLKGGDKNVNES 412 (412) T ss_pred hhhc-------------ccccCCCCCcCCC Confidence 3221 1123444333322 No 16 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=100.00 E-value=5.8e-92 Score=520.75 Aligned_cols=430 Identities=17% Similarity=0.225 Sum_probs=352.9 Q ss_pred CchHHHHHhhccccccccccc--cc-cchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc-- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIV--QN-PIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY-- 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~-- 75 (435) ||||++|++++........-. .. ....... .+.....+..++.+.++++++||+||++||++||+|||++|+++ T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~ 79 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYN-LGAVAASGETVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGG 79 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHh-hcccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEecCC Confidence 999999987654432111111 11 1222222 33344556778889999999999999999999999999999843 Q ss_pred --cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCc---e-- Q lcl|NC_019456. 76 --KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNN---S-- 148 (435) Q Consensus 76 --~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~---~-- 148 (435) +.+..|+++.+++..|| +||+++||+.++.+++++||+|++|.++ +|.|++||||+|.+|++..+.++. . T Consensus 80 ~~~~~~~~~l~~~ln~~~n-~~t~~~f~~~~~~~lll~Gna~~~i~~~--~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~ 156 (457) T protein:vir:13 80 SRKEIVTPEWLDYPNAEPG-GMGRIDILSQTVLSLLLQGNAFLAVRWQ--GPNIVGLDVLDPTKIHVHMVMVDGLRRKVF 156 (457) T ss_pred cccccccchHHHhccccCC-CCCHHHHHHHHHHHHhhcCCeEEEEEec--CCcEEEEEEEccCceEEEEecCCCccceeE Confidence 45667888888876666 7999999999999999999999998764 489999999999999998765543 1 Q ss_pred EEEEEecCCe---eEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHH Q lcl|NC_019456. 149 YWYRVTSDIY---NFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPE 223 (435) Q Consensus 149 ~~~~~~~~~~---~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e 223 (435) +.|.+..++. ...|++++|||++++++.+.++|+||+..+...|..+.++++++.++|+|| +.+++++++.++++ T Consensus 157 ~~y~~~~~~~~~~~~~~~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e 236 (457) T protein:vir:13 157 EAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEE 236 (457) T ss_pred EEEEEecCCceeeEEeeCccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHH Confidence 2233433332 246899999999998777779999999999999999999999999999997 56899999999999 Q ss_pred HHHHHHHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc--cHHHH Q lcl|NC_019456. 224 KRQAMVNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST--TNVEH 298 (435) Q Consensus 224 ~~~~~~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~--~~~e~ 298 (435) +.+++++.|.... .|+|+++||++|++|+++++++.++||++++++++++||++|||||.+||..+.+++ +|.|+ T Consensus 237 ~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq 316 (457) T protein:vir:13 237 GLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAE 316 (457) T ss_pred HHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHH Confidence 9999999997654 567899999999999999999999999999999999999999999999998887764 55554 Q ss_pred -HHHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCC Q lcl|NC_019456. 299 -VTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIP 377 (435) Q Consensus 299 -~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ 377 (435) ...|+++||.||+..|+++|+++|+++.+. .+++++||++++++.|.+++++++.+++++|+||+||+|+++|++|++ T Consensus 317 ~~~~f~~~tl~P~~~~ie~~ln~~L~~~~~~-~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ 395 (457) T protein:vir:13 317 QNIAFTMFSLRPWLERIEAGFNRLLFAETAD-RFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLP 395 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCcccc-CceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 556779999999999999999999998775 578999999999999999999999999999999999999999999998 Q ss_pred CcCCceeeecccccchhccccccccccccc----cccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 378 DEAADHLYISKDLYPLDKYYDAILDNKIQT----DASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 378 ~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) +++||++++|+|+.+++...+........+ ..++..++++.|.++++++.++.+.|.| T Consensus 396 ~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~~~ 457 (457) T protein:vir:13 396 DGLGEKYRVPLNLGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGATEEDDEDDA 457 (457) T ss_pred CCcccceeeccccccccccccccccCCCCCCCCCccccCCCCCCCCCCccccCCCCcccccC Confidence 888899999999999876654433222222 2233345555666777777777777777 No 17 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=100.00 E-value=5.2e-92 Score=521.02 Aligned_cols=426 Identities=16% Similarity=0.186 Sum_probs=342.9 Q ss_pred hHHHHHhhccccccccccccccchhhh----hhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc--- Q lcl|NC_019456. 3 FMSKVRQFFGVHDQANQIVQNPIPQPL----DMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY--- 75 (435) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~--- 75 (435) +|+.+++.-...+.+.......+...+ ..++..+..+..++.+.++++++|++||++||++||+|||++|++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~g 80 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADPEAVLSFHAVFACISLISQDIAKMRLRLMQTDAQG 80 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEeccCC Confidence 777665533333333333333333322 2234455566778889999999999999999999999999999743 Q ss_pred -cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEe Q lcl|NC_019456. 76 -KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVT 154 (435) Q Consensus 76 -~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 154 (435) .....++.+++|+.+||++||+++||+.++.+++++||+|++++++.. |+|++|||++|++|++..+.+|...|.... T Consensus 81 ~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~-G~~~~L~~i~~~~v~v~~~~~g~~~y~~~~ 159 (454) T protein:vir:93 81 IRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNAR-GQIKELRILDWNRVEPLVADDGEVFYRITP 159 (454) T ss_pred ccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCC-CcEEEEEEEcCcceEEEEcCCCcEEEEEEe Confidence 233445667778889999999999999999999999999999998755 899999999999999999888766554433 Q ss_pred cC----CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHH Q lcl|NC_019456. 155 SD----IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAM 228 (435) Q Consensus 155 ~~----~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~ 228 (435) .. +...+|+++||||+++..+.++++|+||+..+...+..+.++++++.++|+|| +++++++++.+++|+.+++ T Consensus 160 ~~~~~~~~~~~~~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~ 239 (454) T protein:vir:93 160 DRNCGITEAVTVPAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKL 239 (454) T ss_pred ccccccceeEEecCcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHH Confidence 32 33568999999999987788999999999999999999999999999999997 5689999999999999999 Q ss_pred HHHHHHHh--cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH-HHHHH Q lcl|NC_019456. 229 VNDFLRMV--KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT-HSWTM 305 (435) Q Consensus 229 ~~~~~~~~--~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~-~~~~~ 305 (435) +++|+... .|+|+++||++|++|+++++++.++||+|.+++++++||++|||||.+||..+.++++|.|++. .|+++ T Consensus 240 ~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~ 319 (454) T protein:vir:93 240 KSNWDSGYTGENAGKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQYYSQ 319 (454) T ss_pred HHHHHHHhcccccCCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHHHHH Confidence 99998765 4678999999999999999999999999999999999999999999999999999998877654 57799 Q ss_pred HHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceee Q lcl|NC_019456. 306 TLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLY 385 (435) Q Consensus 306 ~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~ 385 (435) ||.|++..|+++|+++|++. .+.+++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||+ +|||+++ T Consensus 320 ~l~P~~~~ie~~ln~~L~~~----~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi--~ggD~~~ 393 (454) T protein:vir:93 320 CLQTLIESIELLLDEALETG----ENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPL--AGGDALY 393 (454) T ss_pred HHHHHHHHHHHHHHHhhcCC----CCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCCCeee Confidence 99999999999999999865 35789999999999999999999999999999999999999999999 5899999 Q ss_pred ecccccchhcccccccccccccc-ccccccc--cCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 386 ISKDLYPLDKYYDAILDNKIQTD-ASVAAPK--QEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 386 ~~~n~~~l~~~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 435 (435) +++|+++++.+.+....+..... +.+.+.+ ..+++.+....+...++..+ T Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~e~~~d~~~~ 446 (454) T protein:vir:93 394 LQQQNYSLEALSRRDAREDPFASSGKTASVPQAVAASDGNKAITETEHDAVKA 446 (454) T ss_pred eccCccchHhhhccCcccCCCCCCccCCCCCCCCCCCCCCCCccCCccchhhh Confidence 99999999887765544332211 1111111 11111111111111122222 No 18 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=100.00 E-value=6.1e-92 Score=520.65 Aligned_cols=414 Identities=21% Similarity=0.287 Sum_probs=341.0 Q ss_pred Cc--hHHHHHhhccccccc-----cccccccchh-hhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeee Q lcl|NC_019456. 1 MS--FMSKVRQFFGVHDQA-----NQIVQNPIPQ-PLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEY 72 (435) Q Consensus 1 Mg--~~~~~~~~~~~~~~~-----~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~ 72 (435) |. |.+-+.++.+..+.+ .......+.. +..+.|..+..+..++.+.++++++||+||++||++||++||++| T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~~al~~~~V~~~i~~ia~~ia~lp~~~~ 80 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRESSSGKKVTVDKAMKLSAVWACVRLISTSVAGLPLGVY 80 (434) T ss_pred CccchhhhhhhcccccchhhhcccccccccCchHHHHHHhcCCccCCceechhhhhccHHHHHHHHHHHHhhhhCceEEE Confidence 42 222222222222111 0001111122 223456566667788899999999999999999999999999998 Q ss_pred ecc-----cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCc Q lcl|NC_019456. 73 QNY-----KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNN 147 (435) Q Consensus 73 ~~~-----~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~ 147 (435) +++ ....+|+++++|+.+||++||+++||+.++.+++++||+|++|.++ +|+|++|+||+|.+|++..+.+|. T Consensus 81 ~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~--~G~~~~L~~l~p~~v~~~~~~~g~ 158 (434) T protein:vir:43 81 ERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA--AGRPAALDFLLPSRVDLECDENGR 158 (434) T ss_pred EEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC--CCcEEEEEEEcCcceEEEEcCCCe Confidence 743 3457899999999999999999999999999999999999998764 599999999999999999999998 Q ss_pred eEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHH Q lcl|NC_019456. 148 SYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKR 225 (435) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~ 225 (435) ..|+++..+|..++|+++||||++++ +.++++|+||+..+...+..+.++++++.++|+|+ +.+++++++.+++++. T Consensus 159 ~~y~~~~~~g~~~~~~~~eVih~~~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~ 237 (434) T protein:vir:43 159 LKYFYTTKKGARREIERTNMLHIPAF-TLDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRILQPAQR 237 (434) T ss_pred EEEEEEecCceEEEEccccEEEecCc-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCCCHHHH Confidence 88888888888899999999999986 67889999999999999999999999999999997 6789999999999999 Q ss_pred HHHHHHHHHH--hcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCc--ccHHHH-HH Q lcl|NC_019456. 226 QAMVNDFLRM--VKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKS--TTNVEH-VT 300 (435) Q Consensus 226 ~~~~~~~~~~--~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~--~~~~e~-~~ 300 (435) +++++.|+.. ..|+|+++||++|++|+++++++.++||++.+++++++||++|||||.+||..+.++ +++.|+ .. T Consensus 238 ~~~r~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~ 317 (434) T protein:vir:43 238 EEFREYVKSVSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQML 317 (434) T ss_pred HHHHHHHHHhcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHH Confidence 9999988654 356789999999999999999999999999999999999999999999999877654 566555 45 Q ss_pred HHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcC Q lcl|NC_019456. 301 HSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEA 380 (435) Q Consensus 301 ~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~ 380 (435) .|+++||.|++.+||++|+++|+++.++ .+++++||++.+++.|.+++++++.+++++|+||+||+|+++|+||+ || T Consensus 318 ~f~~~~L~P~~~~ie~~ln~kL~~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~--~g 394 (434) T protein:vir:43 318 AFLTFSISSITNQIQQCVNKRLLTAPER-IRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPEL--PG 394 (434) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCChhhh-cCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CC Confidence 6779999999999999999999998775 47899999999999999999999999999999999999999999999 57 Q ss_pred CceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCC Q lcl|NC_019456. 381 ADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTE 431 (435) Q Consensus 381 gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (435) ||++++++|++|++.+++....+.........++..+. .| T Consensus 395 gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~ 434 (434) T protein:vir:43 395 GDILTVQSNLVPIDQLGQSNKSQAVRAALMNWFSQPEP-----------QE 434 (434) T ss_pred CCeEeeccCccchhhhhccCCCcchhhhhhccCCCCCC-----------CC Confidence 99999999999998775543333222111111111111 11 No 19 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=100.00 E-value=3.3e-92 Score=522.11 Aligned_cols=398 Identities=21% Similarity=0.372 Sum_probs=334.3 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec----cc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN----YK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~----~~ 76 (435) +.||++|++..+...++.... .......+ ....+.+++.+.|+++++|++||++||++||++||++|++ .+ T Consensus 16 ~~~~~~lf~~~~~~~~~~~~~----~~~~~~~~-~~~~~~~vs~~~al~~~~v~~cv~~Ia~~iA~lp~~v~~~~~~~~~ 90 (424) T protein:vir:45 16 RVLLDALFRSKSLENPSTPIT----GDAVDTDG-LFRADVYVSPETAMKLAAVYSCIYVLSSSLAQMPLHVMRRHKGKVE 90 (424) T ss_pred hHHHHhhccccCCCCCccccc----hhhhhhhc-cccCCceechHHhhccHHHHHHHHHHHHHHhhCceEEEEecCCcee Confidence 778888765433333222221 22222222 2334557888999999999999999999999999999973 24 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecC Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSD 156 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~ 156 (435) .+.+|+++++|+.+||++||+++||+.++.+++++||+|++|.++. .|.|++|+|++|..|++..+.+ .+.|.+... T Consensus 91 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~-~G~~~~L~~l~~~~v~i~~~~~--~~~y~~~~~ 167 (424) T protein:vir:45 91 PARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNR-RGEVISLDCCMPWETTLMNTGG--RYTYGLYNE 167 (424) T ss_pred ecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CCcEEEEEEecCceEEEEEcCC--eEEEEEEec Confidence 5568999999999999999999999999999999999999999874 4899999999999999876553 344555556 Q ss_pred CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHH Q lcl|NC_019456. 157 IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLR 234 (435) Q Consensus 157 ~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~ 234 (435) +...+|+++|||||++++ .++++|+||+..+.+.|+.+.+++++++++|+|| +.+++++++.+++|+.+++++.|.. T Consensus 168 ~~~~~~~~~eVih~r~~~-~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~ 246 (424) T protein:vir:45 168 YGAFAISPDDMIHIRALG-NNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLNKESWGWLKDQWQK 246 (424) T ss_pred CceEEECcccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHH Confidence 667789999999999864 6899999999999999999999999999999997 5689999999999999999999976 Q ss_pred Hh----cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH-HHHHHHHhH Q lcl|NC_019456. 235 MV----KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT-HSWTMTLMP 309 (435) Q Consensus 235 ~~----~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~-~~~~~~i~P 309 (435) .. +|+|+++|+++|++|+++++++.|+||+|.+++++++||++|||||.+||+.+.++++|.||+. .|+++||.| T Consensus 247 ~~~g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P 326 (424) T protein:vir:45 247 ASQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQFVRYTMMP 326 (424) T ss_pred HhccccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHH Confidence 43 4678999999999999999999999999999999999999999999999999989988887654 566899999 Q ss_pred HHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeeccc Q lcl|NC_019456. 310 IIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKD 389 (435) Q Consensus 310 ~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n 389 (435) ++..|+++|+++|+++.++..|++++||++++++.|.+++++.+.+++++|+||+||+|+++|+||+ +|||++++|+| T Consensus 327 ~~~~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi--~ggD~~~~~~n 404 (424) T protein:vir:45 327 WVTNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPV--EGLDEMLVSVN 404 (424) T ss_pred HHHHHHHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcceeeeccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999 58999999999 Q ss_pred ccchhccccccccccccccccccccccCCCCCCC Q lcl|NC_019456. 390 LYPLDKYYDAILDNKIQTDASVAAPKQEGGENTN 423 (435) Q Consensus 390 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 423 (435) +++..... ..++.+.+.+++ T Consensus 405 ~~~~~~~~--------------~~~~~~~~~~~~ 424 (424) T protein:vir:45 405 AANPAGDF--------------KPPKNDEGKTNE 424 (424) T ss_pred cccccccc--------------CCCCCCCCCCCC Confidence 87532110 001111111111 No 20 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=100.00 E-value=5.8e-92 Score=520.77 Aligned_cols=397 Identities=19% Similarity=0.295 Sum_probs=336.3 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccc---- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYK---- 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~---- 76 (435) =|||++++++|...+........ ...++... ....+..++.+.|+++++||+||++||++||+|||++|+.++ T Consensus 14 ~g~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~~~~~~~~~~~ 90 (424) T protein:vir:18 14 NGWWARLQSWFVGGRLVTPNQGS-QTGPVSAH--GHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) T ss_pred CchHHHHHhhhcccccccccccc-cccccccc--cccccccccHHHhhccHHHHHHHHHHHHhhccCceEEEEeecCCce Confidence 68999999999765443322111 11112211 222345578889999999999999999999999999987432 Q ss_pred --ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEe Q lcl|NC_019456. 77 --QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVT 154 (435) Q Consensus 77 --~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 154 (435) ...+|+++++|+.+||++||+++||+.++.+++++||+|++|+++. .|+|++|||++|.+|++..+.+ .++|.+. T Consensus 91 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-~G~~~~L~pl~~~~V~v~~~~~--~~~y~~~ 167 (424) T protein:vir:18 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS-AGDVISLLPLQSANMDVKLVGK--KVVYRYQ 167 (424) T ss_pred eeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECC-CCcEEEEEEecCcceEEEEcCC--eEEEEEE Confidence 2257999999999999999999999999999999999999999875 4899999999999999987643 4556667 Q ss_pred cCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCc-CCHHHHHHHHHH Q lcl|NC_019456. 155 SDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRS-ISPEKRQAMVND 231 (435) Q Consensus 155 ~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~-~~~e~~~~~~~~ 231 (435) .+|..++|+++||||+|+++ .++++|+||+..+.+.++.+.++++++.++|.|+ +.++++++.. +++++.+++++. T Consensus 168 ~~g~~~~~~~~eIih~r~~~-~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~ 246 (424) T protein:vir:18 168 RDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) T ss_pred eCCeEEEeccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHHHHH Confidence 77888899999999999874 6889999999999999999999999999999997 6689988764 799999999999 Q ss_pred HHHHh--cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc--cHHH-HHHHHHHHH Q lcl|NC_019456. 232 FLRMV--KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST--TNVE-HVTHSWTMT 306 (435) Q Consensus 232 ~~~~~--~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~--~~~e-~~~~~~~~~ 306 (435) |+... .++|+++||++|++|+++++++.|+||+|.+++++++||++|||||.+||..+.+++ +|.| +...|+++| T Consensus 247 ~~~~~~g~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~t 326 (424) T protein:vir:18 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) T ss_pred HHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHH Confidence 97654 467899999999999999999999999999999999999999999999998887764 5555 456778999 Q ss_pred HhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeee Q lcl|NC_019456. 307 LMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYI 386 (435) Q Consensus 307 i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~ 386 (435) |.|++..||++|+++|+++.++ .+++++||++++++.|.+++++++.+++++|+||+||+|+++|+||+ ||||++++ T Consensus 327 l~P~~~~ie~~l~~~L~~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi--~gGD~~~~ 403 (424) T protein:vir:18 327 LQPYISRWENSIQRWLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPL--PGGDVAMR 403 (424) T ss_pred HHHHHHHHHHHHHhhcCCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeeee Confidence 9999999999999999998876 47899999999999999999999999999999999999999999999 58999999 Q ss_pred cccccchhccccccccccccccccccccccCCC Q lcl|NC_019456. 387 SKDLYPLDKYYDAILDNKIQTDASVAAPKQEGG 419 (435) Q Consensus 387 ~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 419 (435) ++|++|++.+.+...... +|- T Consensus 404 ~~n~~~l~~~~~~~~p~~------------~ga 424 (424) T protein:vir:18 404 QSQYVPITDLGTNKEPRN------------NGA 424 (424) T ss_pred ccCccchHhhhccCCCcc------------CCC Confidence 999999987543211000 000 No 21 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=100.00 E-value=7.4e-92 Score=520.18 Aligned_cols=397 Identities=19% Similarity=0.298 Sum_probs=336.2 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccc---- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYK---- 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~---- 76 (435) =|||+++++||.+.+...... ......+ .+.....+..++.+.|+++++||+||++||++||+||+++|+..+ T Consensus 14 ~g~~~~~~~~f~~~~~~~~~~-~~~~~~~--~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~vy~~~~~~~~ 90 (424) T protein:vir:18 14 NGWWARLKSWFVGGRLVTPNQ-GSQTGPV--SAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) T ss_pred CchHHHHHhhccccccccccc-hhhcccc--ccccccccccccHHHhhccHHHHHHHHHHHHhhccCceEEEEeccCCce Confidence 689999999986664433221 1111111 122233345578889999999999999999999999999997432 Q ss_pred -c-cccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEe Q lcl|NC_019456. 77 -Q-MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVT 154 (435) Q Consensus 77 -~-~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 154 (435) . ..+|+++++|+.+||++||+++||+.++.+++++||+|++|+++ ..|++++|||++|.+|++..+.+ .++|.+. T Consensus 91 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~-~~G~~~~L~~l~~~~v~v~~~~~--~~~y~~~ 167 (424) T protein:vir:18 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN-SAGDVISLLPLQSANMDVKLVGK--KVVYRYQ 167 (424) T ss_pred eeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEC-CCCcEEEEEEecCcceEEEEcCC--eEEEEEE Confidence 1 25799999999999999999999999999999999999999987 45899999999999999987643 4456666 Q ss_pred cCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCc-CCHHHHHHHHHH Q lcl|NC_019456. 155 SDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRS-ISPEKRQAMVND 231 (435) Q Consensus 155 ~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~-~~~e~~~~~~~~ 231 (435) .+|...+|+++||||+|+++ .++++|+||+..+...|..+.++++++.++|.|+ +.++++++.. +++++.+++++. T Consensus 168 ~~g~~~~~~~~eVihir~~~-~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~ 246 (424) T protein:vir:18 168 RDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) T ss_pred eCCeEEEeccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHHHHHHHH Confidence 77888899999999999874 6889999999999999999999999999999997 5689998765 799999999999 Q ss_pred HHHHh--cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc--cHHH-HHHHHHHHH Q lcl|NC_019456. 232 FLRMV--KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST--TNVE-HVTHSWTMT 306 (435) Q Consensus 232 ~~~~~--~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~--~~~e-~~~~~~~~~ 306 (435) |.... .++|+++||++|++|+++++++.++||.|.+++++++||++|||||.+||..+.+++ ++.| +...|+++| T Consensus 247 ~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~t 326 (424) T protein:vir:18 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) T ss_pred HHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHH Confidence 97654 467899999999999999999999999999999999999999999999998887765 5555 456777999 Q ss_pred HhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeee Q lcl|NC_019456. 307 LMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYI 386 (435) Q Consensus 307 i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~ 386 (435) |.|++++||++|+++|+++.++ .+++++||++++++.|.+++++++.+++++|+||+||+|+++|+||+ ||||++++ T Consensus 327 l~P~~~~ie~~ln~~L~~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi--~ggD~~~~ 403 (424) T protein:vir:18 327 LQPYISRWENSIQRWLIPSKDV-GRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPL--PGGDVAMR 403 (424) T ss_pred HHHHHHHHHHHHHhhcCCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeeee Confidence 9999999999999999998876 47999999999999999999999999999999999999999999999 58999999 Q ss_pred cccccchhccccccccccccc Q lcl|NC_019456. 387 SKDLYPLDKYYDAILDNKIQT 407 (435) Q Consensus 387 ~~n~~~l~~~~~~~~~~~~~~ 407 (435) ++|++|++.+++........+ T Consensus 404 ~~n~~~l~~~~~~~~~~~n~a 424 (424) T protein:vir:18 404 QAQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred ccCccchhhhhccCCccccCC Confidence 999999987543211110000 No 22 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=100.00 E-value=8.6e-92 Score=519.84 Aligned_cols=404 Identities=34% Similarity=0.585 Sum_probs=341.6 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) =++++|+++.+...-.+.+... .+++.+........++...|+++++|++||++||++||++||+++++++ ..+ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~~~~-~~~ 77 (409) T protein:vir:93 4 ENIVTRIKKKLIDNWIDQSTSK-----LYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK-VVN 77 (409) T ss_pred cchhhhhhhhhhhhhhcccccc-----ccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeeccc-ccc Confidence 4677787775432211111111 1111121122223356678999999999999999999999999998765 457 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEE-ecCCee Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRV-TSDIYN 159 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~-~~~~~~ 159 (435) |+++++|+.+||++||+++||+.++.+++++||+|+++.++. .|++++||||+|+.|++..+.++..++|.+ ..+|.. T Consensus 78 ~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-~G~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~g~~ 156 (409) T protein:vir:93 78 TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI-YHQPSKLFLLNPDVVEMLIENQSRELYYSIHAATGNK 156 (409) T ss_pred chHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECC-CCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCCceE Confidence 899999999999999999999999999999999999999874 489999999999999999888776665554 455667 Q ss_pred EEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCC Q lcl|NC_019456. 160 FTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKEN 239 (435) Q Consensus 160 ~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~ 239 (435) ..|+++||||++++++.++++|+||+.++...+..+.+++++....+.+++.++++.+..+++++.++++++|++..+++ T Consensus 157 ~~~~~~eVih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~ 236 (409) T protein:vir:93 157 LIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEEN 236 (409) T ss_pred EEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhhcC Confidence 78999999999988888999999999999999999999988876555566778899999999999999999999988999 Q ss_pred CccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH-HHHHHHHhHHHHHHHHHH Q lcl|NC_019456. 240 GGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT-HSWTMTLMPIIRQYESQF 318 (435) Q Consensus 240 ~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~-~~~~~~i~P~~~~i~~~l 318 (435) ++++|+++|++|+++++++.++|+.|.+++++++||++|||||.+||+...++++|.|++. .|+++||.|++.+|+++| T Consensus 237 g~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l 316 (409) T protein:vir:93 237 GGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEF 316 (409) T ss_pred CCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999999888988887655 567899999999999999 Q ss_pred HHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccc Q lcl|NC_019456. 319 NMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYD 398 (435) Q Consensus 319 ~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~ 398 (435) +++|+++.++..+++|+||++++++.|.+++++++++++++|++|+||+|+++|+||+ ||||++++++|++|++.... T Consensus 317 ~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~--~ggD~~~~~~n~~~~~~~~~ 394 (409) T protein:vir:93 317 NRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPV--EGGDKPLISGDLYPIDTPLE 394 (409) T ss_pred HhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeeeecccccccccchh Confidence 9999999999899999999999999999999999999999999999999999999999 58999999999999876432 Q ss_pred ccccccccccccccccccCCCCCCCCCC Q lcl|NC_019456. 399 AILDNKIQTDASVAAPKQEGGENTNENG 426 (435) Q Consensus 399 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 426 (435) .. ...+||+++.+.+ T Consensus 395 ~~-------------~~~~gG~~n~~e~ 409 (409) T protein:vir:93 395 LR-------------KSLKGGDKNVNES 409 (409) T ss_pred hc-------------ccccCCCCCcCCC Confidence 21 1134444443333 No 23 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=100.00 E-value=1.9e-91 Score=518.00 Aligned_cols=399 Identities=22% Similarity=0.326 Sum_probs=328.6 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec--cccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN--YKQM 78 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~--~~~~ 78 (435) ||||+|+++.......... .... ..........+..++.+.++++++|++||++||++||+|||+++++ +... T Consensus 1 Mgl~~~~f~~~~~~~~~~~--~~~~---~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~ 75 (409) T protein:vir:84 1 MSLFTRIFSGPSEERTLTK--ISGI---PSPAEDWAMHGDRPGANSAMTLGAFYACVTLLADTVASLSIDAYRKKDNVRI 75 (409) T ss_pred CchhhhhhcCCCccccccc--cccc---ccccchhhccCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCccc Confidence 9999987553221111111 1111 1111112223445677889999999999999999999999999984 4456 Q ss_pred ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC--ceEEEEEecC Q lcl|NC_019456. 79 DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN--NSYWYRVTSD 156 (435) Q Consensus 79 ~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~~~~~~~~ 156 (435) ..|+++++|+.+||++||+++||+.++.+++++||+|++|.++..+|.|++||||+|.+|++....+. ..+++.+..+ T Consensus 76 ~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~~~~~~~~~~~ 155 (409) T protein:vir:84 76 PVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDGDWIEPVYRID 155 (409) T ss_pred ccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcceEEEEEecCC Confidence 68999999999999999999999999999999999999998777889999999999999998765444 3333333334 Q ss_pred CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHH Q lcl|NC_019456. 157 IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLR 234 (435) Q Consensus 157 ~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~ 234 (435) | ++|+++||||++++++.+.++|+||+..+...+..+.++++++.++|.|+ +++++++++.+++|+.++++++|.. T Consensus 156 g--~~~~~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~ 233 (409) T protein:vir:84 156 G--KVVPNHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDADLTPDQVKQTQKQWIQ 233 (409) T ss_pred c--eEEchhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHH Confidence 3 57999999999998777778999999999999999999999999999997 5789999999999999999999999 Q ss_pred HhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc--cHHHH-HHHHHHHHHhHHH Q lcl|NC_019456. 235 MVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST--TNVEH-VTHSWTMTLMPII 311 (435) Q Consensus 235 ~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~--~~~e~-~~~~~~~~i~P~~ 311 (435) ...|+|+++||++|++|+++++++.++||.|.+++++++||++|||||.+||..+.+++ ++.|+ ...|+++||.|++ T Consensus 234 ~~~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~ 313 (409) T protein:vir:84 234 SHHNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLPWL 313 (409) T ss_pred HhccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999998776654 55554 5577899999999 Q ss_pred HHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeeccccc Q lcl|NC_019456. 312 RQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLY 391 (435) Q Consensus 312 ~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~ 391 (435) ..||++|+++|. .|++|+||++.+++.|.+++++++.+++++|+||+||+|+++|+||+ ||||++++|+|++ T Consensus 314 ~~ie~~l~~~L~------~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~--~ggD~~~~~~n~~ 385 (409) T protein:vir:84 314 RCIEQALDTFLP------RGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPI--PEGDIHLQPMNFV 385 (409) T ss_pred HHHHHHHHHhcc------CCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcceeeeccccc Confidence 999999999873 47899999999999999999999999999999999999999999999 5799999999999 Q ss_pred chhccccccccccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 392 PLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 392 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) |++.....+..++.. +.++.+|+ T Consensus 386 ~~~~~~~~~~~~~~~---------------------~~~~~~gn 408 (409) T protein:vir:84 386 PLGYVPPEEPAQEPQ---------------------PNSATEGN 408 (409) T ss_pred ccccCCccccCcCCC---------------------CCCccCCC Confidence 988643321111111 11111111 No 24 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=100.00 E-value=1.8e-91 Score=518.02 Aligned_cols=405 Identities=19% Similarity=0.277 Sum_probs=341.1 Q ss_pred CchHHHHHhhccccccccccccccchhhhhh-ccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDM-AGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMD 79 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 79 (435) ||||++.. ++...........++.. .+........++...++++++||+||++||++||++||+++++++... T Consensus 1 Mg~f~~~~------~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~ 74 (416) T protein:vir:45 1 MGIFYKNE------KRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINY 74 (416) T ss_pred CCcccccc------cccccCCCcchhHHHHHhccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCccccc Confidence 99987543 22222222333334433 333333455677788999999999999999999999999999999889 Q ss_pred cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecC--- Q lcl|NC_019456. 80 NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSD--- 156 (435) Q Consensus 80 ~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~--- 156 (435) .|+++++|+.+||++||+++||+.++.+++++||||++++++. .|+|++|||++|++|++..+.+|..+|+....+ T Consensus 75 ~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~-~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~ 153 (416) T protein:vir:45 75 SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDK-TGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNG 153 (416) T ss_pred cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECC-CCcEEEEEEEcCceeEEEECCCccEEEEEEEecCCC Confidence 9999999999999999999999999999999999999999875 489999999999999999998887776554332 Q ss_pred -CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCC-HHHHHHHHHHH Q lcl|NC_019456. 157 -IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSIS-PEKRQAMVNDF 232 (435) Q Consensus 157 -~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~-~e~~~~~~~~~ 232 (435) +..+.|+++||||+|++ +.++++|+||+..+.+++..+.++++++.++|+|+ +++++++++.++ +++.++++++| T Consensus 154 ~~~~~~~~~~evihir~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~ 232 (416) T protein:vir:45 154 NNIERNVKFEDMLDIKFY-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEF 232 (416) T ss_pred ceeEEEEccccEEEeccC-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHH Confidence 23468999999999976 57889999999999999999999999999999997 678999998875 56788899999 Q ss_pred HHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhH Q lcl|NC_019456. 233 LRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMP 309 (435) Q Consensus 233 ~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P 309 (435) .... +++|+++||++|++|++++.++.++||.|.+++++++||++|||||.++|.... + ++.+++..+|.+||+| T Consensus 233 ~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~-~~~~~~~~~~~~~l~P 310 (416) T protein:vir:45 233 HKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-N-MSITDANLDYLSTLKP 310 (416) T ss_pred HHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-C-ccHHHHHHHHHHHHHH Confidence 7654 567899999999999999999999999999999999999999999999986443 3 3456677888889999 Q ss_pred HHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeeccc Q lcl|NC_019456. 310 IIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKD 389 (435) Q Consensus 310 ~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n 389 (435) ++.+|+++|+++|+++. .+++++||++.+++.|.+++++++++++++|+||+||+|+++|+||+|++.++.+++++| T Consensus 311 ~~~~ie~~ln~~l~~~~---~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n 387 (416) T protein:vir:45 311 YITCVCAELNFKFNDEY---VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLN 387 (416) T ss_pred HHHHHHHHHhhhccccc---cCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccc Confidence 99999999999998754 468999999999999999999999999999999999999999999998777778999999 Q ss_pred ccchhccccccccccccccccccccccCCCCCCC Q lcl|NC_019456. 390 LYPLDKYYDAILDNKIQTDASVAAPKQEGGENTN 423 (435) Q Consensus 390 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 423 (435) ++|++.+...+..+... ...+.+|||+++ T Consensus 388 ~~~~~~~~~~~~~~~~~-----~~~~~kgGe~n~ 416 (416) T protein:vir:45 388 HVNIELVDEYQMNKSRA-----TDKKLKGGEENE 416 (416) T ss_pred ccccccccccCcccccc-----cccccCCCCCCC Confidence 99999765433333222 235566777555 No 25 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=100.00 E-value=1.8e-91 Score=518.02 Aligned_cols=405 Identities=19% Similarity=0.277 Sum_probs=341.1 Q ss_pred CchHHHHHhhccccccccccccccchhhhhh-ccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDM-AGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMD 79 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 79 (435) ||||++.. ++...........++.. .+........++...++++++||+||++||++||++||+++++++... T Consensus 1 Mg~f~~~~------~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~ 74 (416) T protein:vir:81 1 MGIFYKNE------KRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINY 74 (416) T ss_pred CCcccccc------cccccCCCcchhHHHHHhccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCccccc Confidence 99987543 22222222333334433 333333455677788999999999999999999999999999999889 Q ss_pred cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecC--- Q lcl|NC_019456. 80 NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSD--- 156 (435) Q Consensus 80 ~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~--- 156 (435) .|+++++|+.+||++||+++||+.++.+++++||||++++++. .|+|++|||++|++|++..+.+|..+|+....+ T Consensus 75 ~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~-~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~ 153 (416) T protein:vir:81 75 SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDK-TGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNG 153 (416) T ss_pred cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECC-CCcEEEEEEEcCceeEEEECCCccEEEEEEEecCCC Confidence 9999999999999999999999999999999999999999875 489999999999999999998887776554332 Q ss_pred -CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCC-HHHHHHHHHHH Q lcl|NC_019456. 157 -IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSIS-PEKRQAMVNDF 232 (435) Q Consensus 157 -~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~-~e~~~~~~~~~ 232 (435) +..+.|+++||||+|++ +.++++|+||+..+.+++..+.++++++.++|+|+ +++++++++.++ +++.++++++| T Consensus 154 ~~~~~~~~~~evihir~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~ 232 (416) T protein:vir:81 154 NNIERNVKFEDMLDIKFY-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEF 232 (416) T ss_pred ceeEEEEccccEEEeccC-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHH Confidence 23468999999999976 57889999999999999999999999999999997 678999998875 56788899999 Q ss_pred HHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhH Q lcl|NC_019456. 233 LRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMP 309 (435) Q Consensus 233 ~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P 309 (435) .... +++|+++||++|++|++++.++.++||.|.+++++++||++|||||.++|.... + ++.+++..+|.+||+| T Consensus 233 ~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~-~~~~~~~~~~~~~l~P 310 (416) T protein:vir:81 233 HKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-N-MSITDANLDYLSTLKP 310 (416) T ss_pred HHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-C-ccHHHHHHHHHHHHHH Confidence 7654 567899999999999999999999999999999999999999999999986443 3 3456677888889999 Q ss_pred HHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeeccc Q lcl|NC_019456. 310 IIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKD 389 (435) Q Consensus 310 ~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n 389 (435) ++.+|+++|+++|+++. .+++++||++.+++.|.+++++++++++++|+||+||+|+++|+||+|++.++.+++++| T Consensus 311 ~~~~ie~~ln~~l~~~~---~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n 387 (416) T protein:vir:81 311 YITCVCAELNFKFNDEY---VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLN 387 (416) T ss_pred HHHHHHHHHhhhccccc---cCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccc Confidence 99999999999998754 468999999999999999999999999999999999999999999998777778999999 Q ss_pred ccchhccccccccccccccccccccccCCCCCCC Q lcl|NC_019456. 390 LYPLDKYYDAILDNKIQTDASVAAPKQEGGENTN 423 (435) Q Consensus 390 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 423 (435) ++|++.+...+..+... ...+.+|||+++ T Consensus 388 ~~~~~~~~~~~~~~~~~-----~~~~~kgGe~n~ 416 (416) T protein:vir:81 388 HVNIELVDEYQMNKSRA-----TDKKLKGGEENE 416 (416) T ss_pred ccccccccccCcccccc-----cccccCCCCCCC Confidence 99999765433333222 235566777555 No 26 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=100.00 E-value=2.1e-91 Score=517.68 Aligned_cols=404 Identities=34% Similarity=0.585 Sum_probs=342.6 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) =++++|+++.+-..-...+.. ...++.+........++.+.|+++++|++||++||++||+|||+++++++ ..+ T Consensus 4 ~~~~~~~k~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~-~~~ 77 (409) T protein:vir:94 4 ENIVTRIKKKLIDNWIDQSAS-----KLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK-VVN 77 (409) T ss_pred cccchhhhhHHhhhhhcCCcc-----cccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeeccc-ccc Confidence 357788877652111111111 11112221222223356678999999999999999999999999998765 456 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEE-ecCCee Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRV-TSDIYN 159 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~-~~~~~~ 159 (435) |+++++|+.+||++||+++||+.++.+++++||+|+++.++. .|.|++||||+|++|++..+.++..++|.+ ..+|.. T Consensus 78 ~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~ 156 (409) T protein:vir:94 78 TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI-YHQPSKLFLLNPDVVEMLIENQSRELYYSIHAATGNK 156 (409) T ss_pred hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC-CCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcCCceE Confidence 999999999999999999999999999999999999999874 589999999999999999888776666555 455677 Q ss_pred EEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCC Q lcl|NC_019456. 160 FTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKEN 239 (435) Q Consensus 160 ~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~ 239 (435) ..|+++||||+|++++.++++|+||+..+...+....++++++...+.+++.++++.++.+++++.+++++.|++..+++ T Consensus 157 ~~~~~~dvih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~ 236 (409) T protein:vir:94 157 LIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEEN 236 (409) T ss_pred EEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHhcCCCCeeEEecCCCCCHHHHHHHHHHHHHHhhcC Confidence 88999999999988888999999999999999999999988876666666778999999999999999999999988999 Q ss_pred CccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH-HHHHHHHhHHHHHHHHHH Q lcl|NC_019456. 240 GGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT-HSWTMTLMPIIRQYESQF 318 (435) Q Consensus 240 ~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~-~~~~~~i~P~~~~i~~~l 318 (435) ++++|+++|++|+++++++.++|+.|.+++++++||++|||||.+||+...++++|.|++. .|+++||.|++..|+++| T Consensus 237 g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l 316 (409) T protein:vir:94 237 GGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEF 316 (409) T ss_pred CCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999998888888877654 567899999999999999 Q ss_pred HHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccc Q lcl|NC_019456. 319 NMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYD 398 (435) Q Consensus 319 ~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~ 398 (435) +++|+++.++..+++|+||++++++.|.+++++++++++++|++|+||+|+++|+||+ ||||++++++|++|++...+ T Consensus 317 n~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~--~ggD~~~~~~n~~~~~~~~~ 394 (409) T protein:vir:94 317 NRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPV--EGGDKPLISGDLYPIDTPLE 394 (409) T ss_pred HHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeEeecccccccccchh Confidence 9999999999899999999999999999999999999999999999999999999999 58999999999999886432 Q ss_pred ccccccccccccccccccCCCCCCCCCC Q lcl|NC_019456. 399 AILDNKIQTDASVAAPKQEGGENTNENG 426 (435) Q Consensus 399 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 426 (435) .+ ...+||+++.+.+ T Consensus 395 ~~-------------~~~kGG~~n~~e~ 409 (409) T protein:vir:94 395 LR-------------KSLKGGDKNVNES 409 (409) T ss_pred hc-------------ccccCCCCCcCCC Confidence 21 1134444333333 No 27 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=100.00 E-value=3.2e-91 Score=516.73 Aligned_cols=405 Identities=21% Similarity=0.302 Sum_probs=336.1 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhc-cccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc---- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMA-GVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY---- 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~---- 75 (435) |.|+.++++ .+++.+. ...+...+... +..+..+..++.+.++++++||+||++||++||+|||++|+++ T Consensus 1 m~~~~~~~~----~~~~~s~-~~~w~~~~~~~~~~~~~~g~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~g~ 75 (421) T protein:vir:10 1 MFIPQMFEG----KKRSVSG-GGFWEAMLGGVRSSHSKAGVMITPETALALSAVRACVTLLAESVAQLPVELYRRDKNGG 75 (421) T ss_pred CCCcchhcc----cccccCc-chhhHHHhhhhccCcccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCc Confidence 998877653 2332221 12222222222 2333445678899999999999999999999999999998743 Q ss_pred -cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEe Q lcl|NC_019456. 76 -KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVT 154 (435) Q Consensus 76 -~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 154 (435) ..+.+|+++++|+.+||++||+++||+.++.+++++||||++++++.. |+|++||||+|..|++..+.+|..+| .+. T Consensus 76 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-G~~~~L~~l~~~~v~v~~~~~g~~~y-~~~ 153 (421) T protein:vir:10 76 RQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGK-GYPKELIPINPKKVIVLKGPDGMPYY-EIP 153 (421) T ss_pred eeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC-CcEEEEEEecCceEEEEECCCceEEE-EEc Confidence 345689999999999999999999999999999999999999998754 89999999999999999988877654 444 Q ss_pred cCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcC----CHHHHHHH Q lcl|NC_019456. 155 SDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSI----SPEKRQAM 228 (435) Q Consensus 155 ~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~----~~e~~~~~ 228 (435) ..| .++++++|+|++++ +.++++|+||+..+...+..+.++++++.++|.|| +++++++++.+ ++|+.+++ T Consensus 154 ~~g--~~~~~~eiih~~~~-~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~ 230 (421) T protein:vir:10 154 EIG--ETLPMRMMHHVKVF-SLDGYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQL 230 (421) T ss_pred CCC--cEEchhhEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHH Confidence 444 47999999999986 57899999999999999999999999999999997 66899887654 88999999 Q ss_pred HHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH-HHHH Q lcl|NC_019456. 229 VNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT-HSWT 304 (435) Q Consensus 229 ~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~-~~~~ 304 (435) +++|.+.. +++++++||++|++|+++++++.++||.|.+++++++||++|||||.+||..+.++++|.|++. .|++ T Consensus 231 ~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~ 310 (421) T protein:vir:10 231 LAKWTDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEHQGLQFVM 310 (421) T ss_pred HHHHHHHhcCccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHHHHHHHHH Confidence 99998765 4678999999999999999999999999999999999999999999999999999998887654 6678 Q ss_pred HHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCcee Q lcl|NC_019456. 305 MTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHL 384 (435) Q Consensus 305 ~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~ 384 (435) +||.|++.+||++|+++|+++.++ .+.+++||++.+++.|.+++++++++++++|+||+||+|+++|+||+ ||||++ T Consensus 311 ~tl~P~~~~ie~~ln~kL~~~~~~-~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~--~ggD~~ 387 (421) T protein:vir:10 311 YTLLAWLKRHEGALQRDLLLPSER-RDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPI--AGGDKY 387 (421) T ss_pred HHHHHHHHHHHHHHhhhccCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCccee Confidence 999999999999999999998775 58899999999999999999999999999999999999999999999 589999 Q ss_pred eecccccchhccccccccccccccccccccccCCCCCCCCCCCC Q lcl|NC_019456. 385 YISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQ 428 (435) Q Consensus 385 ~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (435) ++|+|+++++...... +. ...+++++.++-..++ T Consensus 388 ~~~~n~~~~~~~~~~~---~~-------~~~~~~~e~d~~~~~~ 421 (421) T protein:vir:10 388 LTPLNMVDSAQIIPGD---KK-------PTAQQMAEIDTILSRT 421 (421) T ss_pred eeccccccccccccCC---CC-------cccccCcccccccccC Confidence 9999998766542211 11 1111222222222222 No 28 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=100.00 E-value=4e-91 Score=516.17 Aligned_cols=408 Identities=18% Similarity=0.262 Sum_probs=337.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhc-cccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMA-GVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMD 79 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 79 (435) |+++.-++ ..++++......+...++... +........++...|+++++|++||++||++||++|++++++++... T Consensus 23 ~~~~~~f~---~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~acv~~Ia~~iA~lpl~~~~~~~~~~ 99 (441) T protein:vir:98 23 LVVVGIFY---KNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINY 99 (441) T ss_pred hhcccccc---ccccccccCCCcchHHHHHHhhcccccCccccchhhhhccHHHHHHHHHHHHhhccCceEEecCCcccc Confidence 33332221 112333333333434444332 22333345577788999999999999999999999999999999889 Q ss_pred cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEec---- Q lcl|NC_019456. 80 NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTS---- 155 (435) Q Consensus 80 ~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~---- 155 (435) .|+++++|+.+||++||+++||+.++.+++++||||++|+++. .|+|++|||++|+.|++..+.+|..+|+.... T Consensus 100 ~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~-~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~ 178 (441) T protein:vir:98 100 SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDK-TGEPMNLTFRKTSEIELKLDARGRLYYFHQRIDSNG 178 (441) T ss_pred cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CCcEEEEEEEcCceeEEEECCCCcEEEEEEEeccCc Confidence 9999999999999999999999999999999999999999875 48999999999999999999888777665542 Q ss_pred CCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCC-HHHHHHHHHHH Q lcl|NC_019456. 156 DIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSIS-PEKRQAMVNDF 232 (435) Q Consensus 156 ~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~-~e~~~~~~~~~ 232 (435) .+..+.|+++||||++++ +.++++|+||+..+.++|..+.++++++.++|.|| +++++++++.++ +++.++++++| T Consensus 179 ~~~~~~~~~~dviHir~~-~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~~~~~ 257 (441) T protein:vir:98 179 NNIERNVKFEDMLDIKFY-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEF 257 (441) T ss_pred ceeeEEEccccEEEeccC-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHH Confidence 234578999999999976 57889999999999999999999999999999997 679999999875 67788899999 Q ss_pred HHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhH Q lcl|NC_019456. 233 LRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMP 309 (435) Q Consensus 233 ~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P 309 (435) .... +|+|+++||++|++|+++++++.++||+|.+++++++||++|||||.+||.... + ++.+++..+|.+||+| T Consensus 258 ~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~-~-~s~~q~~~~y~~tl~P 335 (441) T protein:vir:98 258 HKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-N-MSITDANLDYLSTLKP 335 (441) T ss_pred HHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC-C-ccHHHHHHHHHHHHHH Confidence 7654 567899999999999999999999999999999999999999999999986443 3 3456667778789999 Q ss_pred HHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeeccc Q lcl|NC_019456. 310 IIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKD 389 (435) Q Consensus 310 ~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n 389 (435) ++.+|+++|+++|+++. .+++++||++.+++.|.+++++++++++++|+||+||+|+++|+||++++.++.+++++| T Consensus 336 ~~~~ie~~ln~~L~~~~---~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGd~~~~~~~~n 412 (441) T protein:vir:98 336 YITCVCAELNFKFNDEY---VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLN 412 (441) T ss_pred HHHHHHHHHHhhccccc---cCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccc Confidence 99999999999998754 468899999999999999999999999999999999999999999997544557889999 Q ss_pred ccchhccccccccccccccccccccccCCCCCCC Q lcl|NC_019456. 390 LYPLDKYYDAILDNKIQTDASVAAPKQEGGENTN 423 (435) Q Consensus 390 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 423 (435) ++|++.+...+..+... .....+|||+++ T Consensus 413 ~~~~~~~~~~q~~~~~~-----~~~~~kgGe~ne 441 (441) T protein:vir:98 413 HVNIELVDEYQMNKSRA-----TDKKLKGGEENE 441 (441) T ss_pred ccccccccccccccccc-----cccccCCCCCCC Confidence 99999875543332222 234456666555 No 29 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=100.00 E-value=4.2e-91 Score=516.05 Aligned_cols=394 Identities=27% Similarity=0.425 Sum_probs=341.4 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc---cc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY---KQ 77 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~---~~ 77 (435) |-|.++++ +.....+..+..+..+.|... ...+++.+.++++++|++||++||++||+|||+++++. +. T Consensus 1 m~f~~~~~-------~~~~~~~~~~~~~~~~~g~~~-~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~ 72 (409) T protein:vir:10 1 MLFRKGFK-------NQSQEISIDDKKILEWLGINP-SETYVNGKSCLKQATVFGCIRILSDNISKLPIKIYQKKDGIKR 72 (409) T ss_pred Cccccccc-------CcCCCCCCChHHHHHHhcCCc-CcceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCeee Confidence 87654332 222233344556667776544 35677888999999999999999999999999998743 45 Q ss_pred cccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCc------eEEE Q lcl|NC_019456. 78 MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNN------SYWY 151 (435) Q Consensus 78 ~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~------~~~~ 151 (435) ..+|+++++|+.+||++||+++||+.++.+++++||||++++++. .|.+++|||++|++|++..+.++. ..|. T Consensus 73 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~-~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~~~y~ 151 (409) T protein:vir:10 73 VPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKK-NGEIKGLYPLKSDGMKIFVDDTGLLNSENNVWYL 151 (409) T ss_pred ccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CCcEEEEEEEcCCceEEEEcCCccccccceEEEE Confidence 668999999999999999999999999999999999999999874 489999999999999998876542 2344 Q ss_pred EEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHH Q lcl|NC_019456. 152 RVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMV 229 (435) Q Consensus 152 ~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~ 229 (435) +....|..+.|+++||||+++++ .++++|+||+..+...+....++++++.++|+|+ +++++++++.+++++.++++ T Consensus 152 ~~~~~g~~~~~~~~evih~r~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~ 230 (409) T protein:vir:10 152 YTDDLGQRHKFMSDEILHFKGLT-ADGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGDLNPEAEEVFK 230 (409) T ss_pred EEeCCceeEEeccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCHHHHHHHH Confidence 44556778899999999999875 5789999999999999999999999999999997 57899999999999999999 Q ss_pred HHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHH-HHHHHHH Q lcl|NC_019456. 230 NDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEH-VTHSWTM 305 (435) Q Consensus 230 ~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~-~~~~~~~ 305 (435) +.|.... +|+++++|+++|++|++++.++.++|+.+.+++++++||++|||||.+||..+.++++|.++ ...|+++ T Consensus 231 ~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~~f~~~ 310 (409) T protein:vir:10 231 ENFERMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNREFYID 310 (409) T ss_pred HHHHHHhccccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHHHHHHH Confidence 9997654 46789999999999999999999999999999999999999999999999998888888765 5567799 Q ss_pred HHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceee Q lcl|NC_019456. 306 TLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLY 385 (435) Q Consensus 306 ~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~ 385 (435) ||.|++++|+++|+++|+++.++..|++++||++++++.|.+++++++.+++++|++|+||+|+++|+||+ ||||+++ T Consensus 311 ~l~P~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~--~ggD~~~ 388 (409) T protein:vir:10 311 TLQSILNMYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPL--EGGDVLL 388 (409) T ss_pred HHHHHHHHHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999 5899999 Q ss_pred ecccccchhccccccccccccccccccccccCCCCC Q lcl|NC_019456. 386 ISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGEN 421 (435) Q Consensus 386 ~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 (435) +++|++|++.+++... +||+. T Consensus 389 ~~~n~~~~~~~~~~~~---------------kgGe~ 409 (409) T protein:vir:10 389 INGNMIPVKMAGEQYS---------------KGGEK 409 (409) T ss_pred eccCccchhhcccccc---------------ccCCC Confidence 9999999987643222 22222 No 30 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=100.00 E-value=7.7e-91 Score=514.62 Aligned_cols=411 Identities=18% Similarity=0.282 Sum_probs=335.9 Q ss_pred CchHHHHHh---------hccccccccccccccchhhhhhc-cccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCcee Q lcl|NC_019456. 1 MSFMSKVRQ---------FFGVHDQANQIVQNPIPQPLDMA-GVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLH 70 (435) Q Consensus 1 Mg~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 70 (435) |+|.+|-.+ |...++++......+...++... +........++...|+++++||+||++||++||+||++ T Consensus 11 ~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~Ia~~iA~lp~~ 90 (441) T protein:vir:79 11 VDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIR 90 (441) T ss_pred ccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHHHHhhccCcee Confidence 222221111 11122333333333334444332 22233344566778999999999999999999999999 Q ss_pred eeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEE Q lcl|NC_019456. 71 EYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYW 150 (435) Q Consensus 71 ~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~ 150 (435) ++++++....|+++++|+.+||++||+++||+.++.+++++||||++|+++. .|+|++|+|++|+.|++..+.+|..+| T Consensus 91 ~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-~G~~~~L~~i~~~~v~v~~d~~g~~~~ 169 (441) T protein:vir:79 91 VTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDK-TGEPMNLTFRKTSEIELKSDARGRLYY 169 (441) T ss_pred eecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECC-CCcEEEEEEEcCceeEEEECCCccEEE Confidence 9999988899999999999999999999999999999999999999999875 489999999999999999998887776 Q ss_pred EEEec----CCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCC-HH Q lcl|NC_019456. 151 YRVTS----DIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSIS-PE 223 (435) Q Consensus 151 ~~~~~----~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~-~e 223 (435) +.... .+..+.|+++||||+|++ +.++++|+||+..+.++|..+.++++++.++|+|| +++++++++.++ ++ T Consensus 170 ~~~~~~~~~~~~~~~~~~~dvih~k~~-~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e 248 (441) T protein:vir:79 170 FHQRIDSNGNNIERNVKFEDMLDIKFY-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKK 248 (441) T ss_pred EEEEeccCCceeEEEEccccEEEeccC-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHH Confidence 55532 234568999999999975 67889999999999999999999999999999997 679999999875 67 Q ss_pred HHHHHHHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH Q lcl|NC_019456. 224 KRQAMVNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT 300 (435) Q Consensus 224 ~~~~~~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~ 300 (435) +.++++++|.... +|+|+++||++|++|+++++++.++||+|.+++++++||++|||||.+||.... ++ +.+++. T Consensus 249 ~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~~-s~~q~~ 326 (441) T protein:vir:79 249 ARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-NM-SITDAN 326 (441) T ss_pred HHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC-Cc-cHHHHH Confidence 7888999997654 567899999999999999999999999999999999999999999999986433 33 446667 Q ss_pred HHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcC Q lcl|NC_019456. 301 HSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEA 380 (435) Q Consensus 301 ~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~ 380 (435) .+|..||+|++.+|+++|+++|+++. .+++++||++.+++.|.+++++++++++++|+||+||+|+++|+||++++. T Consensus 327 ~~~~~tl~P~~~~ie~eln~kl~~~~---~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd 403 (441) T protein:vir:79 327 LDYLSTLKPYITCVCAELNFKFNDEY---VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGN 403 (441) T ss_pred HHHHHHHHHHHHHHHHHHhhhccccc---cCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 78888999999999999999998653 478999999999999999999999999999999999999999999997544 Q ss_pred CceeeecccccchhccccccccccccccccccccccCCCCCCC Q lcl|NC_019456. 381 ADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTN 423 (435) Q Consensus 381 gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 423 (435) ++.+++++|++|++.+...+..+... .....+|||+++ T Consensus 404 ~~~~~~~~n~~~~~~~~~~~~~~~~~-----~~~~~kgGe~~e 441 (441) T protein:vir:79 404 GSIHRVDLNHVNIELVDEYQMNKSRA-----TDKKLKGGEENE 441 (441) T ss_pred cceEeecccccccccccccccccccc-----cccccCCCCCCC Confidence 55789999999999865433332222 234456666665 No 31 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=100.00 E-value=7.7e-91 Score=514.62 Aligned_cols=411 Identities=18% Similarity=0.282 Sum_probs=335.9 Q ss_pred CchHHHHHh---------hccccccccccccccchhhhhhc-cccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCcee Q lcl|NC_019456. 1 MSFMSKVRQ---------FFGVHDQANQIVQNPIPQPLDMA-GVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLH 70 (435) Q Consensus 1 Mg~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 70 (435) |+|.+|-.+ |...++++......+...++... +........++...|+++++||+||++||++||+||++ T Consensus 11 ~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~Ia~~iA~lp~~ 90 (441) T protein:vir:94 11 VDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIR 90 (441) T ss_pred ccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHHHHhhccCcee Confidence 222221111 11122333333333334444332 22233344566778999999999999999999999999 Q ss_pred eeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEE Q lcl|NC_019456. 71 EYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYW 150 (435) Q Consensus 71 ~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~ 150 (435) ++++++....|+++++|+.+||++||+++||+.++.+++++||||++|+++. .|+|++|+|++|+.|++..+.+|..+| T Consensus 91 ~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-~G~~~~L~~i~~~~v~v~~d~~g~~~~ 169 (441) T protein:vir:94 91 VTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDK-TGEPMNLTFRKTSEIELKSDARGRLYY 169 (441) T ss_pred eecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECC-CCcEEEEEEEcCceeEEEECCCccEEE Confidence 9999988899999999999999999999999999999999999999999875 489999999999999999998887776 Q ss_pred EEEec----CCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCC-HH Q lcl|NC_019456. 151 YRVTS----DIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSIS-PE 223 (435) Q Consensus 151 ~~~~~----~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~-~e 223 (435) +.... .+..+.|+++||||+|++ +.++++|+||+..+.++|..+.++++++.++|+|| +++++++++.++ ++ T Consensus 170 ~~~~~~~~~~~~~~~~~~~dvih~k~~-~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e 248 (441) T protein:vir:94 170 FHQRIDSNGNNIERNVKFEDMLDIKFY-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKK 248 (441) T ss_pred EEEEeccCCceeEEEEccccEEEeccC-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHH Confidence 55532 234568999999999975 67889999999999999999999999999999997 679999999875 67 Q ss_pred HHHHHHHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH Q lcl|NC_019456. 224 KRQAMVNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT 300 (435) Q Consensus 224 ~~~~~~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~ 300 (435) +.++++++|.... +|+|+++||++|++|+++++++.++||+|.+++++++||++|||||.+||.... ++ +.+++. T Consensus 249 ~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~~-s~~q~~ 326 (441) T protein:vir:94 249 ARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-NM-SITDAN 326 (441) T ss_pred HHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC-Cc-cHHHHH Confidence 7888999997654 567899999999999999999999999999999999999999999999986433 33 446667 Q ss_pred HHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcC Q lcl|NC_019456. 301 HSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEA 380 (435) Q Consensus 301 ~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~ 380 (435) .+|..||+|++.+|+++|+++|+++. .+++++||++.+++.|.+++++++++++++|+||+||+|+++|+||++++. T Consensus 327 ~~~~~tl~P~~~~ie~eln~kl~~~~---~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd 403 (441) T protein:vir:94 327 LDYLSTLKPYITCVCAELNFKFNDEY---VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGN 403 (441) T ss_pred HHHHHHHHHHHHHHHHHHhhhccccc---cCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 78888999999999999999998653 478999999999999999999999999999999999999999999997544 Q ss_pred CceeeecccccchhccccccccccccccccccccccCCCCCCC Q lcl|NC_019456. 381 ADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTN 423 (435) Q Consensus 381 gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 423 (435) ++.+++++|++|++.+...+..+... .....+|||+++ T Consensus 404 ~~~~~~~~n~~~~~~~~~~~~~~~~~-----~~~~~kgGe~~e 441 (441) T protein:vir:94 404 GSIHRVDLNHVNIELVDEYQMNKSRA-----TDKKLKGGEENE 441 (441) T ss_pred cceEeecccccccccccccccccccc-----cccccCCCCCCC Confidence 55789999999999865433332222 234456666665 No 32 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=100.00 E-value=2.9e-90 Score=511.50 Aligned_cols=414 Identities=24% Similarity=0.323 Sum_probs=340.3 Q ss_pred Cc-----hHHHHH----hhccccccccccccccchhhhhh-ccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCcee Q lcl|NC_019456. 1 MS-----FMSKVR----QFFGVHDQANQIVQNPIPQPLDM-AGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLH 70 (435) Q Consensus 1 Mg-----~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 70 (435) |+ ++++++ +|++.+ .+...+.+.+. .+.....+..++.+.|+++++||+||++||++||+|||+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~~------~s~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~lp~~ 74 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGVP------ISLTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATLPLN 74 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCCc------ccCCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhhCcee Confidence 87 333332 233332 12222333333 334444456688889999999999999999999999999 Q ss_pred eeecc-----cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC Q lcl|NC_019456. 71 EYQNY-----KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD 145 (435) Q Consensus 71 ~~~~~-----~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~ 145 (435) +++.+ ....+|+++++|+.+||++||+++||+.++.+++++||+|++|+++ . |.+++|||++|+.|++..+.+ T Consensus 75 ~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~-~-g~~~~L~~l~p~~v~i~~~~~ 152 (437) T protein:vir:10 75 LYQTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRS-A-GVLIGLELMLPQRTTVKRLTS 152 (437) T ss_pred EEEEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec-C-CcEEEEEEEcCcceEEEECCC Confidence 98743 3446899999999999999999999999999999999999999986 3 899999999999999999888 Q ss_pred CceEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHH Q lcl|NC_019456. 146 NNSYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPE 223 (435) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e 223 (435) +...|+....+|....|+++||||||+++ .++++|+||+..+..++..+.++++++.++|.|+ +++++++++.++++ T Consensus 153 g~~~y~~~~~~g~~~~~~~~dIih~r~~~-~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e 231 (437) T protein:vir:10 153 GALQYTYRNVDGTVSTLAEDDVFHVRGFS-LDGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQKE 231 (437) T ss_pred CeEEEEEEecCceEEEEccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHH Confidence 88777777778888899999999999874 6889999999999999999999999999999997 57999999999999 Q ss_pred HHHHHHHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc--cHHHH Q lcl|NC_019456. 224 KRQAMVNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST--TNVEH 298 (435) Q Consensus 224 ~~~~~~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~--~~~e~ 298 (435) +.+++++.|.... .|+|+++||++|++|+++++++.++||.|++++++++||++|||||.+||..+.+++ +|.|+ T Consensus 232 ~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~ 311 (437) T protein:vir:10 232 KRAEIRTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQ 311 (437) T ss_pred HHHHHHHHHHHHhcCccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHH Confidence 9999999997654 567899999999999999999999999999999999999999999999999887654 56555 Q ss_pred -HHHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCC Q lcl|NC_019456. 299 -VTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIP 377 (435) Q Consensus 299 -~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ 377 (435) ...|+++||.||+..|+++|+++||++.++. +.+|+||++++++.|.+++++++++++++|+||+||+|+++|+||++ T Consensus 312 ~~~~f~~~tl~P~~~~ie~~l~~kll~~~e~~-~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ 390 (437) T protein:vir:10 312 QTLGFLTFTLRPWLTRIEQAARRSLLRPGERD-QFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMG 390 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCccccC-ceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 5567899999999999999999999987764 57899999999999999999999999999999999999999999996 Q ss_pred CcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCC Q lcl|NC_019456. 378 DEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEP 432 (435) Q Consensus 378 ~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (435) ++++.+++++|++|++..++......++. ..++++.+.+......|. T Consensus 391 -gg~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 391 -GNAAVLTVQSALLPIDKLGEHTTATAAQD-------ALKAWLYQEEKTRATQER 437 (437) T ss_pred -CCcceEeecCcccchhhccCcCCCcchhc-------cccccCCCCCCCCccccC Confidence 34455668999999987655433322211 112222222222222222 No 33 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=100.00 E-value=3.1e-90 Score=511.31 Aligned_cols=425 Identities=19% Similarity=0.272 Sum_probs=330.4 Q ss_pred CchHHHHHhhccccccccccccccchhhhh------hccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLD------MAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN 74 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 74 (435) |=+ + +++..++........+..+ ..+.............|+++++||+||++||++||++||++++. T Consensus 1 ~~~------~-~~~~~~~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~~ 73 (518) T protein:vir:10 1 MLL------A-NGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFT 73 (518) T ss_pred Ccc------c-CceeecCchhhhhhhhhhcccccccccceecccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEEEE Confidence 221 1 1111111110011111111 12222223333445678999999999999999999999999874 Q ss_pred ---ccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEE Q lcl|NC_019456. 75 ---YKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWY 151 (435) Q Consensus 75 ---~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~ 151 (435) +.....++.+++|+.+||++||+++||+.++.+++++||+|+++.++.. |+|++||||+|.+|++..+.++..++| T Consensus 74 ~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~-G~~~~L~~l~p~~v~v~~~~~~~~~~y 152 (518) T protein:vir:10 74 SGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS-GTPEKLMPMHPSRVAIKRNSRTGRYEY 152 (518) T ss_pred cCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-CcEEEEEEECCCceEEEEcCCCCEEEE Confidence 2333345556778889999999999999999999999999999998754 899999999999999999877665655 Q ss_pred EEecC----CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHH Q lcl|NC_019456. 152 RVTSD----IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKR 225 (435) Q Consensus 152 ~~~~~----~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~ 225 (435) .+... +...+|+++||||||++++.+..+|+||+..+...|....++++++.++|+|| +++++++++.+++++. T Consensus 153 ~~~~~~~~~~~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ls~e~~ 232 (518) T protein:vir:10 153 YFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQ 232 (518) T ss_pred EEEecCCccceEEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHH Confidence 55432 24568999999999988765556899999999999999999999999999997 6789999999999999 Q ss_pred HHHHHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH-HH Q lcl|NC_019456. 226 QAMVNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV-TH 301 (435) Q Consensus 226 ~~~~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~-~~ 301 (435) +++++.|.... .|+|+++||++|++|+++++++.|+||+|.+++++++||++|||||.+||..+.++++|.|++ .. T Consensus 233 ~~~k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~~~ 312 (518) T protein:vir:10 233 QRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRA 312 (518) T ss_pred HHHHHHHHHHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHH Confidence 99999998765 567899999999999999999999999999999999999999999999999998999987765 56 Q ss_pred HHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCC Q lcl|NC_019456. 302 SWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAA 381 (435) Q Consensus 302 ~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~g 381 (435) |+++||.|++..|+++|+++|++..+ .+++++||++.+++.|.+++++++++++++|++|+||+|+++|+||++++|| T Consensus 313 f~~~tL~P~l~~ie~~ln~~L~~~~~--~~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~g 390 (518) T protein:vir:10 313 FYRDTMAIPIARIQSAMDKYVGQYWV--RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKA 390 (518) T ss_pred HHHHHHHHHHHHHHHHHHHhhccccc--CCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC Confidence 77999999999999999999988754 4689999999999999999999999999999999999999999999999999 Q ss_pred ceeeecccccchhccccccccc-cccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 382 DHLYISKDLYPLDKYYDAILDN-KIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 382 d~~~~~~n~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) |++++++|++|++...+....+ ..++.+++.+.+....+.....+....+.+.+ T Consensus 391 D~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (518) T protein:vir:10 391 DELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNS 445 (518) T ss_pred CeeeecccceecccccccccCCCCCCCCCCCCccccccccccccccCCCCCcccc Confidence 9999999999998655433222 22222222222111111111111111111101 No 34 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=100.00 E-value=2.4e-90 Score=511.95 Aligned_cols=425 Identities=19% Similarity=0.282 Sum_probs=331.3 Q ss_pred CchHHHHHhhccccccccccccccchhhhhh------ccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDM------AGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN 74 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 74 (435) |=+ . +++..+++....-..+..+. .+.............|+++++||+||++||++||++||+++++ T Consensus 1 ~~~----~---~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~~ 73 (518) T protein:vir:78 1 MLL----A---NGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFT 73 (518) T ss_pred Ccc----c---CceeeccchhhhhhhhhhhcccccceeceecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEEEE Confidence 211 0 11111111100111111111 2222222334445678999999999999999999999999974 Q ss_pred c---cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEE Q lcl|NC_019456. 75 Y---KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWY 151 (435) Q Consensus 75 ~---~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~ 151 (435) + .....++.+.+|+.+||++||+++||+.++.+++++||+|+++.++. .|.|++||||+|.+|++..+.++..+.| T Consensus 74 ~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~-~G~~~~L~~l~p~~Vtv~~~~~~~~~~y 152 (518) T protein:vir:78 74 SGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK-SGTPEKLMPMHPSRVAIKRNSRTGRYEY 152 (518) T ss_pred cCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CCcEEEEEEECCCceEEEEcCCCCEEEE Confidence 3 23335566777888999999999999999999999999999999874 4899999999999999999877665555 Q ss_pred EEecC----CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHH Q lcl|NC_019456. 152 RVTSD----IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKR 225 (435) Q Consensus 152 ~~~~~----~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~ 225 (435) .+... +...+|+++|||||+++++.+..+|+||+..+...|....++++++.++|+|| +++++++++.+++++. T Consensus 153 ~~~~~~~~~~~~~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~ls~e~~ 232 (518) T protein:vir:78 153 YFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSPEAQ 232 (518) T ss_pred EEEecCCccceeEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHH Confidence 54432 24567999999999988765556899999999999999999999999999997 5689999999999999 Q ss_pred HHHHHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH-HH Q lcl|NC_019456. 226 QAMVNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV-TH 301 (435) Q Consensus 226 ~~~~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~-~~ 301 (435) +++++.|.... .|+|+++||++|++|+++++++.++||+|.+++++++||++|||||.+||..+.++++|.|++ .. T Consensus 233 ~~~k~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~~ 312 (518) T protein:vir:78 233 QRLREQFDRAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRA 312 (518) T ss_pred HHHHHHHHHHhcCcccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHH Confidence 99999998655 467899999999999999999999999999999999999999999999999998899887765 45 Q ss_pred HHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCC Q lcl|NC_019456. 302 SWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAA 381 (435) Q Consensus 302 ~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~g 381 (435) ||++||.|++.+|+++|+++|++..+ .+++++||++.+++.|.+++++++.+++++|+||+||+|+++|+||++++|| T Consensus 313 f~~~tL~P~~~~ie~eln~~L~~~~~--~~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~g 390 (518) T protein:vir:78 313 FYRDTMAIPIARIQSAMDKYVGQYWV--RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKA 390 (518) T ss_pred HHHHHHHHHHHHHHHHHHHhhccccc--CcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC Confidence 77899999999999999999987654 4689999999999999999999999999999999999999999999999999 Q ss_pred ceeeecccccchhcccccccc-ccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 382 DHLYISKDLYPLDKYYDAILD-NKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 382 d~~~~~~n~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) |++++++|++|++...+.... +..++.+++...+...++.....+....+.+.+ T Consensus 391 D~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (518) T protein:vir:78 391 DELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPASVPGLSPTNS 445 (518) T ss_pred ceeeecccceecccccccccCCCCCCCCCCCCcccccccccCccccCCCCCcccc Confidence 999999999999866544322 222222222222221222211111111111111 No 35 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=100.00 E-value=5.8e-90 Score=509.81 Aligned_cols=401 Identities=21% Similarity=0.292 Sum_probs=335.8 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccc--cccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc--- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGV--KLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY--- 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~--- 75 (435) |-|.+++++ .... . ........+..+. ....+..++.+.|+++++|++||++||++||+|||++++.. T Consensus 1 ~~f~~~f~r----~~~~-~--~~~~~~~~~~~~~~~~~~~g~~v~~~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~ 73 (413) T protein:vir:48 1 MFFSGLFQR----KSDA-P--VTTPAELAEAIGLSYDTYTGKRISSQRAMRLTAVYSCVRVLAESVGMLPCSLYKISGTL 73 (413) T ss_pred Cccchhhcc----CccC-C--ccchHHHHHhhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCc Confidence 655444332 1111 1 1111223333322 23344567788899999999999999999999999999743 Q ss_pred -cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEe Q lcl|NC_019456. 76 -KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVT 154 (435) Q Consensus 76 -~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 154 (435) ..+.+|+++++|+.+||++||+++||+.++.+++++||+|++++++ .|+|++|||++|++|++..+.++...|+... T Consensus 74 ~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~--~g~~~~L~~l~~~~v~~~~~~~~~~~y~~~~ 151 (413) T protein:vir:48 74 KTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA--LGEVVELLPIDPGCVEPKLNSQWQPVYQVTF 151 (413) T ss_pred ceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC--CCcEEEEEEEcCceEEEEEcCCceEEEEEEe Confidence 3456899999999999999999999999999999999999999875 4899999999999999999988887777777 Q ss_pred cCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHH Q lcl|NC_019456. 155 SDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDF 232 (435) Q Consensus 155 ~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~ 232 (435) .+|...+|++++|||+++++ .++++|+||+..+...|..+.++++++.++|+|+ +++++++++.+++|+.++++++| T Consensus 152 ~~g~~~~~~~~evih~~~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~ 230 (413) T protein:vir:48 152 PDGSVDVLTQDEIWHVRTLT-LDGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDF 230 (413) T ss_pred cCceEEEEccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHH Confidence 78888899999999999874 6789999999999999999999999999999997 67999999999999999999999 Q ss_pred HHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH-HHHHHHHHh Q lcl|NC_019456. 233 LRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV-THSWTMTLM 308 (435) Q Consensus 233 ~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~-~~~~~~~i~ 308 (435) .... .|+|+++|+++|++|++++.++.++|+.|.+++++++||++|||||.+||..+.++++|.|++ ..|+++||. T Consensus 231 ~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~i~ 310 (413) T protein:vir:48 231 EERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLV 310 (413) T ss_pred HHHhcCccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHHHHHHHHH Confidence 7654 567899999999999999999999999999999999999999999999999888888887654 567789999 Q ss_pred HHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecc Q lcl|NC_019456. 309 PIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISK 388 (435) Q Consensus 309 P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~ 388 (435) |++.+|+++|+++|+++.+. .+++++||++++++.|.+++++++++++++|++|+||+|+++|+||+ ||||++++++ T Consensus 311 P~~~~ie~~l~~~L~~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~--~ggD~~~~~~ 387 (413) T protein:vir:48 311 PYLTRIEQRINTGLVRESKQ-GKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPR--PGGDVYLTPM 387 (413) T ss_pred HHHHHHHHHHHhhccCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcceeeccc Confidence 99999999999999988775 58899999999999999999999999999999999999999999999 6899999999 Q ss_pred cccchhccccccccccccccccccccccCCCCCCCCCCCCC Q lcl|NC_019456. 389 DLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQS 429 (435) Q Consensus 389 n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (435) |+.+++.+.+.... .+++++++++.+ T Consensus 388 n~~~~~~~~~~~~~---------------~~~~~~~~~~~~ 413 (413) T protein:vir:48 388 NMTTSPSAGDDNGK---------------KKESGDADKTAS 413 (413) T ss_pred cccccccccccCCC---------------CCCCCCccccCC Confidence 99877653222111 111111111111 No 36 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=100.00 E-value=6.8e-90 Score=509.44 Aligned_cols=404 Identities=22% Similarity=0.291 Sum_probs=332.3 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc----- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY----- 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~----- 75 (435) |+|++++++. ... ...........+.+.....+..++.+.++++++|++||++||++||+|||++|+.. T Consensus 1 m~~~~~~~~~---~~~---~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~ 74 (419) T protein:vir:57 1 MFIPQFWKGR---PSE---NRVNWQVVPGGMRSSSSQAGVIITPETALALSAVRACVTLLAESVAQLPCVLYRRTENGGR 74 (419) T ss_pred CcchhhhccC---Ccc---ccccccccccccccccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCce Confidence 9999987643 111 11111111122233444555678888999999999999999999999999998743 Q ss_pred cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEec Q lcl|NC_019456. 76 KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTS 155 (435) Q Consensus 76 ~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~ 155 (435) ....+|+++++|+.+||++||+++||+.++.+++++||+|++|+++. .|+|++||||+|++|++..+.++..+ |.+.. T Consensus 75 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~-~G~~~~L~pl~~~~v~v~~~~~g~~~-y~~~~ 152 (419) T protein:vir:57 75 EIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNG-RGDITELIPINPHKVIVLKGPDGMPY-YDIPS 152 (419) T ss_pred eccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECC-CCcEEEEEEEcCcceEEEECCCceEE-EEEcC Confidence 34468999999999999999999999999999999999999999874 48999999999999999998887654 44444 Q ss_pred CCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCC----cCCHHHHHHHH Q lcl|NC_019456. 156 DIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDR----SISPEKRQAMV 229 (435) Q Consensus 156 ~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~----~~~~e~~~~~~ 229 (435) .+ .+++.++|+|++++ +.++++|+||+..+...+....++++++.++|.|+ ++++++++. .+++++.++++ T Consensus 153 ~~--~~~~~~~vih~r~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~ 229 (419) T protein:vir:57 153 IG--EILPMRMVHHIKSF-SLDGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAIL 229 (419) T ss_pred Cc--eEEchhhEEEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHH Confidence 33 57999999999986 57889999999999999999999999999999997 568888754 46789999999 Q ss_pred HHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH-HHHHH Q lcl|NC_019456. 230 NDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT-HSWTM 305 (435) Q Consensus 230 ~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~-~~~~~ 305 (435) +.|.... .|+|+++|+++|++|+++++++.++||.|++++++++||++|||||.+||..+.++++|.|++. .|+++ T Consensus 230 ~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~ 309 (419) T protein:vir:57 230 AKWTERYGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQGLQYVIY 309 (419) T ss_pred HHHHHHhccccccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHHHHH Confidence 9997654 4678999999999999999999999999999999999999999999999999988888877654 56799 Q ss_pred HHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceee Q lcl|NC_019456. 306 TLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLY 385 (435) Q Consensus 306 ~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~ 385 (435) ||.|++..|+++|+++||++.++ .|++++||++.+++.|.+++++++++++++|+||+||+|+++|+||+ ||||+++ T Consensus 310 ~l~P~~~~ie~~l~~~ll~~~~~-~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~--~ggD~~~ 386 (419) T protein:vir:57 310 TMLAILKRHESAMMRDLLLPSER-RDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPI--PGGDKYL 386 (419) T ss_pred HHHHHHHHHHHHHHhhccCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeee Confidence 99999999999999999998765 58999999999999999999999999999999999999999999999 5899999 Q ss_pred ecccccchhccccccccccccccccccccccCCCCCCCCCCC Q lcl|NC_019456. 386 ISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGL 427 (435) Q Consensus 386 ~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (435) +|+|+++++.+.........+. +..+..+.... T Consensus 387 ~~~n~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~ 419 (419) T protein:vir:57 387 TPLNMVDSKALTGIGKATPQQL---------KDIEAILCTRN 419 (419) T ss_pred eccccccccccccccCCCcccC---------cchhhhhhccC Confidence 9999988765443211111110 01110000000 No 37 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=100.00 E-value=1.6e-88 Score=501.96 Aligned_cols=404 Identities=20% Similarity=0.324 Sum_probs=328.0 Q ss_pred chHHHHHhhccccccccccccccchhhhhhccc-cccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc----c Q lcl|NC_019456. 2 SFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGV-KLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY----K 76 (435) Q Consensus 2 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~----~ 76 (435) =||+|... .....+ ......+...+.|. .+..+..++.+.|+++++||+||++||++||++||++|+++ . T Consensus 1 ~~~~r~~~---~~~~~~--~~~~~~~~~~~~g~~~s~~~~~vt~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~ 75 (419) T protein:vir:14 1 MFFSRQLL---SNLGQT--QMSAGGWVSALLGSSRSDSGQVVTPASALALTVLQNCVTLLAESIAQLPIELYERSGEDRK 75 (419) T ss_pred Cccccccc---cccccc--ccCcchhhHHhhcCCCccCCcccchHHhhccHHHHHHHHHHHHhhccCceEEEEecCCccc Confidence 23333321 111111 11122233334433 34456778899999999999999999999999999998743 4 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecC Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSD 156 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~ 156 (435) .+.+|+++++|+.+||++||+++||+.++.+++++||+|++|+++.. |.|++||||+|++|++..+.++..+|.. ... T Consensus 76 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~-G~~~~l~pl~~~~v~v~~~~~~~~~y~~-~~~ 153 (419) T protein:vir:14 76 PATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSD-GVIQGLYPLDNEAVTVMRGSDLKPVYRV-RGS 153 (419) T ss_pred cccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-CcEEEEEEecCceEEEEECCCceEEEEE-ccC Confidence 56689999999999999999999999999999999999999998754 8999999999999999998887655443 322 Q ss_pred CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcC----CHHHHHHHHH Q lcl|NC_019456. 157 IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSI----SPEKRQAMVN 230 (435) Q Consensus 157 ~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~----~~e~~~~~~~ 230 (435) ..++.++|+|++++ +.++++|+||+..+...+..+.++++++.++|.|| +++++++++.+ ++++.+++++ T Consensus 154 ---~~~~~~~i~h~~~~-~~dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~ 229 (419) T protein:vir:14 154 ---DPMPQRLVHHVRWM-SINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITD 229 (419) T ss_pred ---cccchhheeEecCc-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHH Confidence 24889999999976 56899999999999999999999999999999997 57899988765 5788999999 Q ss_pred HHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH-HHHHHHH Q lcl|NC_019456. 231 DFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV-THSWTMT 306 (435) Q Consensus 231 ~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~-~~~~~~~ 306 (435) .|+... .|+|+++++++|++|+++++++.|+|++|.+++++++||++|||||.+||..+.++++|.|++ ..|+++| T Consensus 230 ~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~~~f~~~~ 309 (419) T protein:vir:14 230 GWNAKFGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYT 309 (419) T ss_pred HHHHHhcCccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHH Confidence 997654 456889999999999999999999999999999999999999999999999988888887754 5667899 Q ss_pred HhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeee Q lcl|NC_019456. 307 LMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYI 386 (435) Q Consensus 307 i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~ 386 (435) |.|++.+|+++|+++||++.++ .+++++||++++++.|.+++++++++++++|++|+||+|+++|+||+ ||||++++ T Consensus 310 L~P~~~~ie~~l~~kll~~~~~-~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~--~gGD~~~~ 386 (419) T protein:vir:14 310 LLPWVKRHEQAKTRDLLLPSER-KQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPV--KGGDIYLS 386 (419) T ss_pred HHHHHHHHHHHHhhhccCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeeee Confidence 9999999999999999998776 58899999999999999999999999999999999999999999999 58999999 Q ss_pred cccccchhccccccccccccccccccccccCCCCCCCCCCCCCC Q lcl|NC_019456. 387 SKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQST 430 (435) Q Consensus 387 ~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (435) |+|+++++........+..++.+. .++.++--. T Consensus 387 ~~n~~~~~~~~~~~~~~~~~~~~~-----------~~e~~~~l~ 419 (419) T protein:vir:14 387 PMNMVDASKPQQLPVGKSEPTKAA-----------IDEIGRILS 419 (419) T ss_pred ccccccccccccccCCCCCCcccc-----------ccchhcccC Confidence 999988775433211111111100 000000000 No 38 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=100.00 E-value=2.6e-88 Score=500.75 Aligned_cols=400 Identities=17% Similarity=0.188 Sum_probs=326.9 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec-ccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN-YKQMD 79 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~-~~~~~ 79 (435) ||||+.... .... .......+.+.... ..++. ..|+++++||+||++||++||+|||+++++ +..+. T Consensus 1 m~~f~~~~~--------~~~~--~~~~~~~~~~~~~~-~~~~~-~~Al~~~~V~~~i~~Ia~~iA~lp~~~~~~~g~~~~ 68 (406) T protein:vir:97 1 MSFFQPLGT--------SKVS--YDDYISSVLAGDVS-QKYLG-VSALKNSDILTATSIIAGDIARFPLVKKDVNGDIIH 68 (406) T ss_pred CccccccCC--------CCCC--cchHHHHHhcCCCC-ccccc-chhhccHHHHHHHHHHHHhhhhCeeEEEecCccccc Confidence 999874211 1111 11222222332222 22233 358999999999999999999999999874 55667 Q ss_pred cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEE-EecCCe Q lcl|NC_019456. 80 NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYR-VTSDIY 158 (435) Q Consensus 80 ~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~-~~~~~~ 158 (435) +|+++++|+.+||++||+++||+.++.+|+++||+|++++++...|.+.+|+|++|+.|++..+.++...|.+ ...++. T Consensus 69 ~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~~~~~y~~~~~~~~~ 148 (406) T protein:vir:97 69 DEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDNHEIVYTFTDMLTAK 148 (406) T ss_pred cchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCCceEEEEEEecCCce Confidence 8999999999999999999999999999999999999999988889999999999999999988777655533 345667 Q ss_pred eEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHh Q lcl|NC_019456. 159 NFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMV 236 (435) Q Consensus 159 ~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~ 236 (435) ...|+++||||||++ +.+++.|+||+..+...|..+.+++++..++|+|| +.+++..+..+++++.++++++|+... T Consensus 149 ~~~~~~~evih~r~~-~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~ 227 (406) T protein:vir:97 149 QVKCFAHDVIHWKFF-SHDTILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKGAQLSGDARQRARQEFEKMR 227 (406) T ss_pred EEEEccccEEEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecCCCCCHHHHHHHHHHHHHHh Confidence 788999999999975 67889999999999999999999999999999997 446778888899999999999998665 Q ss_pred --cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhHHHHHH Q lcl|NC_019456. 237 --KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMPIIRQY 314 (435) Q Consensus 237 --~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P~~~~i 314 (435) .++|+++||++|++|+++++++.++|++|.+++++++||++|||||.+||... ++.+.+++...|+++||.|++..| T Consensus 228 ~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~-~~~~~e~~~~~f~~~~l~P~~~~i 306 (406) T protein:vir:97 228 EGSVGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNS-PNQSVAQLMEDYVTNDLPFYFDAI 306 (406) T ss_pred cccccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCC-CcchHHHHHHHHHHHHHHHHHHHH Confidence 35688999999999999999999999999999999999999999999998633 233445666778899999999999 Q ss_pred HHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchh Q lcl|NC_019456. 315 ESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLD 394 (435) Q Consensus 315 ~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~ 394 (435) +++|+++|+++.++ .+++++||++++ .+.+++.+.+++++|+||+||+|+++|+||+++++||++++|+|++|++ T Consensus 307 e~~l~~kll~~~~~-~~~~i~fd~~~~----~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~ 381 (406) T protein:vir:97 307 TSELGLKTLNDKDR-RLYHIEFDTRSV----TGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDRYQSSLNYVFLD 381 (406) T ss_pred HHHHhhhhcChhhc-cceeEEEecCcc----chhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeEeeccCccchh Confidence 99999999998764 568899998764 4555677888999999999999999999999999999999999999998 Q ss_pred ccccccccccccccccccccccCCCCCCCCCCCC Q lcl|NC_019456. 395 KYYDAILDNKIQTDASVAAPKQEGGENTNENGLQ 428 (435) Q Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (435) ...+.+ +......+||+++++.+++ T Consensus 382 ~~~~~~---------~~~~~~~~gg~~~~~~~~~ 406 (406) T protein:vir:97 382 KKEEYQ---------DKVGIKGKGGEVNAEEDKS 406 (406) T ss_pred cccccc---------cccccccCCCCCCCCCCCC Confidence 753331 1122234566655554444 No 39 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=100.00 E-value=2.8e-88 Score=500.62 Aligned_cols=407 Identities=19% Similarity=0.237 Sum_probs=325.4 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhcccccc-CcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc--cc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLE-QATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY--KQ 77 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~--~~ 77 (435) |+||+.+... .. ..+.......+.... .+.++ ...++++++||+||++||++||++|+++++++ .. T Consensus 1 m~~~~~~~~~----~~------~~~~~~~~~~~~~~~~~g~~~-~~~Al~~~~V~~cv~~ia~~iA~lp~~~~~~~~~~~ 69 (417) T protein:vir:38 1 MKLFRGLATE----VD------PHWADHLLDSGVIPSFRGGYL-GISALRNSDVLTAVSIVSGDVSRFPLVITDSSTDEV 69 (417) T ss_pred CccccccccC----CC------ccchhhhcccccccccCCcee-chhhcccHHHHHHHHHHHHhhccCeeEEEEcCCcce Confidence 9998543211 11 111111111122222 22233 34689999999999999999999999999854 34 Q ss_pred cccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecC- Q lcl|NC_019456. 78 MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSD- 156 (435) Q Consensus 78 ~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~- 156 (435) ...|+++++|+.+||++||+++||+.++.+++++||+|++|+|+..+|.|..|+|++|+.|++..+..+...|++...+ T Consensus 70 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~~~~y~~~~~~~ 149 (417) T protein:vir:38 70 IDLANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPDNIIYRFTPYNS 149 (417) T ss_pred eccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCCeEEEEEEEcCC Confidence 4578999999999999999999999999999999999999999888899999999999999998887776665544444 Q ss_pred CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHH Q lcl|NC_019456. 157 IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLR 234 (435) Q Consensus 157 ~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~ 234 (435) +....++++||||||++ +.++++|+||+.++.++|..+.++++++.++|+|| +.++++.++.+++++.++++++|.. T Consensus 150 ~~~~~~~~~dviH~r~~-~~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~l~~e~~~~~~~~~~~ 228 (417) T protein:vir:38 150 SMQKVCGFEDVIHWKFF-SYDTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKESRLSAEARQKIREDFER 228 (417) T ss_pred cEEEEecCcceEEecCC-CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCHHHHHHHHHHHHH Confidence 44567999999999976 57889999999999999999999999999999997 5689999999999999999999976 Q ss_pred Hh--cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHH-HHHHHHHHHHhHHH Q lcl|NC_019456. 235 MV--KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVE-HVTHSWTMTLMPII 311 (435) Q Consensus 235 ~~--~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e-~~~~~~~~~i~P~~ 311 (435) .. .|+|+++||++|++|+++++++.++||+|.+++++++||++|||||++||. .+++++.+ +...|+++||.|++ T Consensus 229 ~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~--~~~~s~~e~~~~~~~~~tl~P~~ 306 (417) T protein:vir:38 229 AQAGADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRLAQ--NSPNQSVKQLADDYIRNDLPFYF 306 (417) T ss_pred HhcccccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCC--CCcchhHHHHHHHHHHHHHHHHH Confidence 54 468899999999999999999999999999999999999999999999984 34556654 55667789999999 Q ss_pred HHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeeccccc Q lcl|NC_019456. 312 RQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLY 391 (435) Q Consensus 312 ~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~ 391 (435) ..|+++|+++||++.++ .+++++||++.+...+. ..+++++++|+||+||+|+++|+||++++++|++++++|++ T Consensus 307 ~~ie~~l~~~Ll~~~~~-~~~~~~fd~~~l~~~~~----~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~ 381 (417) T protein:vir:38 307 EPITSEFELKLLDDAQR-HQYCIGFDTKSVNGLPI----ADVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTV 381 (417) T ss_pred HHHHHHHHhhhcChhhc-ccceEEechhhhhHHHH----HHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeeccccc Confidence 99999999999998775 46899999988765443 34677899999999999999999999888889999999999 Q ss_pred chhccccccccccccccccccccccCCCCCCCCCCCCCCCC-CCC Q lcl|NC_019456. 392 PLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEP-EGS 435 (435) Q Consensus 392 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 435 (435) +++.....+..+ ....+||+++.++.+..+.+ +-| T Consensus 382 ~~d~~~~~~~~~---------~~~~kgg~~~~~~~~~~~~~~~~~ 417 (417) T protein:vir:38 382 FLDQKEAYQAEH---------AAELKGGDTNAKGNQNGSGTNANS 417 (417) T ss_pred cccccccccccc---------ccccCCCCCCCCCCCcCCCCcCCC Confidence 999765433221 22233444433332222221 122 No 40 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=100.00 E-value=3e-87 Score=494.96 Aligned_cols=401 Identities=22% Similarity=0.342 Sum_probs=328.2 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccc-cccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc---- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGV-KLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY---- 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~---- 75 (435) |-|-++.++..+.. ... ...+...++|. .+..+..++.+.++++++||+||++||++||++||++|+++ T Consensus 1 m~~~~~~~~~~~~~----~~~--~~~~~~~~~g~~~s~~~~~v~~~~al~~~~v~~cv~~ia~~ia~lp~~~~~~~~~~~ 74 (419) T protein:vir:80 1 MFFSRQLLSNLGQT----QPG--SGGWVSALLGSARSEAGQVVTPASALSLTVLQNCVTLLAESIAQLPVELYERSGDDR 74 (419) T ss_pred CCcccccccccCcC----CCC--cchhhHHhhcccccccCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEecCCCc Confidence 76654433222111 111 12233333333 33456778889999999999999999999999999998743 Q ss_pred cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEec Q lcl|NC_019456. 76 KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTS 155 (435) Q Consensus 76 ~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~ 155 (435) +.+.+|+++++|+.+||++||+++||+.++.+++++||||++++++. .|+|++||||+|++|++..+.++...|. +.. T Consensus 75 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~-~G~~~~L~~i~~~~v~i~~~~~~~~~y~-~~~ 152 (419) T protein:vir:80 75 KPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQ-DGVIQGLYPLDNEAVTVMKGPDLKPMYR-VAG 152 (419) T ss_pred ccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC-CCcEEEEEEecCceEEEEECCCceEEEE-EcC Confidence 34568999999999999999999999999999999999999999875 4899999999999999999888765543 322 Q ss_pred CCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCc----CCHHHHHHHH Q lcl|NC_019456. 156 DIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRS----ISPEKRQAMV 229 (435) Q Consensus 156 ~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~----~~~e~~~~~~ 229 (435) ...+++++|+|++++ +.++++|+|++..+...|..+.++++++.++|.|| +++++++++. .++++.++++ T Consensus 153 ---~~~~~~~~i~h~~~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~ 228 (419) T protein:vir:80 153 ---ADPLPQRLVHHVRWM-SINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRIT 228 (419) T ss_pred ---ccccchhheEEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHH Confidence 235899999999986 56889999999999999999999999999999997 5688887654 4688899999 Q ss_pred HHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH-HHHHHH Q lcl|NC_019456. 230 NDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV-THSWTM 305 (435) Q Consensus 230 ~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~-~~~~~~ 305 (435) +.|.... .++|+++||++|++|+++++++.|+|+.|.+++++++||++|||||.+||..+.++++|.|++ ..|+++ T Consensus 229 ~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~ 308 (419) T protein:vir:80 229 DGWNAKFGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIY 308 (419) T ss_pred HHHHHHhcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHH Confidence 9997654 456889999999999999999999999999999999999999999999999988888887765 466799 Q ss_pred HHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceee Q lcl|NC_019456. 306 TLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLY 385 (435) Q Consensus 306 ~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~ 385 (435) ||.|++..|+++|+++||++.++ .+++++||++++++.|.+++++.+++++++|++|+||+|+++|+||+ ||||+++ T Consensus 309 ~l~P~~~~ie~~l~~kll~~~~~-~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~--~gGD~~~ 385 (419) T protein:vir:80 309 TLLPWVKRHEQAKTRDLLLPSER-KQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPV--KGGDIYL 385 (419) T ss_pred HHHHHHHHHHHHHhhhccCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcceee Confidence 99999999999999999998775 57899999999999999999999999999999999999999999999 5899999 Q ss_pred ecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 386 ISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 386 ~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) +|+|+++++.......++..++++. .+...- T Consensus 386 ~~~n~~~~~~~~~~~~~~~~~~~~~-------------------~~~~~~ 416 (419) T protein:vir:80 386 SPMNMVDASKPQPIPMGKTEPTKAA-------------------LDEIGR 416 (419) T ss_pred eccccccccccccccCCCCCchhhh-------------------HHHHHh Confidence 9999987665432221111111100 000000 No 41 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=100.00 E-value=4.2e-86 Score=488.64 Aligned_cols=396 Identities=19% Similarity=0.238 Sum_probs=316.2 Q ss_pred CchHHHHHhhccccccccccccccchhhhhh-c-cccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc--- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDM-A-GVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY--- 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~--- 75 (435) ||||+||..+- ...+ .+. ..+.... . +........+....++++|+|++||++||++||++|+++|++. T Consensus 1 Mg~~~~~~~~~--~~~~-~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg 74 (423) T protein:vir:81 1 MGFLQKLGLAP--SVVA-TPE---PIELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDG 74 (423) T ss_pred CchhHhhcccc--cccc-Ccc---ccccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCC Confidence 99999985321 1111 111 1111111 1 1111122234566678899999999999999999999998632 Q ss_pred --cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeC-CCCcEEEEEEeCCceeEEEEcCCC-ceEEE Q lcl|NC_019456. 76 --KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSL-STGEPIALWPLDPNTVSILRNTDN-NSYWY 151 (435) Q Consensus 76 --~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~-~~g~~~~l~~l~~~~v~~~~~~~~-~~~~~ 151 (435) +.+.+|+++.+|. +||++||+++||+.++.+++++||+|+++.++. ..+.+..|+|+++..+.+....++ ..++| T Consensus 75 ~~~~~~~~~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~~~~~~~~~~Y 153 (423) T protein:vir:81 75 GRERVREGHLARVCK-LANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRAYKDGWGSLDY 153 (423) T ss_pred ceeeeccchHHHHhh-cCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeeeccCCCcceEE Confidence 3456788887765 999999999999999999999999999998863 234677888888888877665443 22222 Q ss_pred EEe----cCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCC-----cC Q lcl|NC_019456. 152 RVT----SDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDR-----SI 220 (435) Q Consensus 152 ~~~----~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~-----~~ 220 (435) .+. .+|...++++++|||+|.+++.+..+|+||+..++++++.+.++++++.++|+|| ++++++++. .+ T Consensus 154 ~~~~~~~~~g~~~~~~~~evih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l 233 (423) T protein:vir:81 154 IIIESGDNDGRSVKVPGERVIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKW 233 (423) T ss_pred EEEEecCCCceEEEEcccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccC Confidence 222 3566778999999999988877778999999999999999999999999999997 567887653 58 Q ss_pred CHHHHHHHHHHHHHHh----cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHH Q lcl|NC_019456. 221 SPEKRQAMVNDFLRMV----KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNV 296 (435) Q Consensus 221 ~~e~~~~~~~~~~~~~----~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~ 296 (435) ++|+.++++++|+..+ +++|+++||++|++|+++++++.|+||.|.+++++++||++|||||.+||..+.++++|. T Consensus 234 ~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~ 313 (423) T protein:vir:81 234 DAESRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNV 313 (423) T ss_pred CHHHHHHHHHHHHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccH Confidence 8999999999997654 457899999999999999999999999999999999999999999999999988898887 Q ss_pred HHHH-HHHHHHHhHHHHHHHHHHHHhhcccccc-cCcceeeechhhhhccCHHHHHHHHHHHHh-cCCcCHHHHHHHhCC Q lcl|NC_019456. 297 EHVT-HSWTMTLMPIIRQYESQFNMKLFTPGKR-VKGFYFSFNVNGLLRGDTAARTQYYQTLTR-NGIFKPNEIRELEGQ 373 (435) Q Consensus 297 e~~~-~~~~~~i~P~~~~i~~~l~~~l~~~~~~-~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~-~g~~t~NE~R~~~g~ 373 (435) |++. .|+++||.|++..||++|+++|+++.+. ..+++++||++++++.|.+++++++.+++. .|+||+||+|+++|+ T Consensus 314 e~~~~~f~~~~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl 393 (423) T protein:vir:81 314 REFRKALYGDNLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMDNL 393 (423) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCC Confidence 7654 6778999999999999999999998764 468899999999999999999999999885 699999999999999 Q ss_pred CCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCC Q lcl|NC_019456. 374 APIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQ 428 (435) Q Consensus 374 ~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (435) ||+ ||||++++|.|+.+.+....+ +++.+ + T Consensus 394 ~p~--~gGD~~~~p~n~~~~~~~~~~-------------------~~~~~----t 423 (423) T protein:vir:81 394 PSI--DGGDDLARPLNTEFGDSEDAP-------------------GEEVE----T 423 (423) T ss_pred CCC--CCcceeecccccccCccCCCC-------------------CCCCC----C Confidence 999 589999999998764431100 00000 0 No 42 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=100.00 E-value=2.6e-85 Score=484.30 Aligned_cols=400 Identities=15% Similarity=0.193 Sum_probs=325.0 Q ss_pred hHHHHHhhccccccccccccccchhhhhhcccccc----CcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccc- Q lcl|NC_019456. 3 FMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLE----QATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQ- 77 (435) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~- 77 (435) +.+.+.+++. ++++........+..+.|.... ....++...|+++|+||+||++||++||+|||++|+.... T Consensus 1 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~~g 77 (460) T protein:vir:10 1 MANRIIRALR---ELTGLDNKFNDAFIKYIGQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIKVVKDTK 77 (460) T ss_pred CchhHHHHHh---hhhccCCCchHHHHHhhccccCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEEeccCCc Confidence 5555555542 2222222233334444443222 2344677889999999999999999999999999975322 Q ss_pred ------------------------------cccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC---C Q lcl|NC_019456. 78 ------------------------------MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS---T 124 (435) Q Consensus 78 ------------------------------~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~---~ 124 (435) ...+++..+|+.+||++||+++||+.++.+++++||||++|+++.. . T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~~~~ 157 (460) T protein:vir:10 78 AYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDGINA 157 (460) T ss_pred cchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCccC Confidence 2344566778899999999999999999999999999999998643 4 Q ss_pred CcEEEEEEeCCceeEEEEcCCCceE-------EEEEecCCeeEEEchhheEEeccCCCc-----cccccCcHHHHHHHHH Q lcl|NC_019456. 125 GEPIALWPLDPNTVSILRNTDNNSY-------WYRVTSDIYNFTIPINDVIHVKHVVPS-----NSWYGVSPIDVLSSSL 192 (435) Q Consensus 125 g~~~~l~~l~~~~v~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~iih~~~~~~~-----~~~~G~s~l~~~~~~i 192 (435) |.|.+||||+|+.|++..+.++... .|.+..++....|+++||||||++++. ++++|+||+..+.+.| T Consensus 158 G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i 237 (460) T protein:vir:10 158 GVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQFIEFNEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNI 237 (460) T ss_pred ceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCceeEEecccceEEEecCCCCcccccCccccccHHHHHHHHH Confidence 8999999999999999998877543 344556777889999999999987654 4679999999999999 Q ss_pred HHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHH Q lcl|NC_019456. 193 KFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVE 267 (435) Q Consensus 193 ~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~ 267 (435) ..+.++++++.++|.|| +.++++.++.+++++.+++++.|.... .|+|++++|++|++|+++++++.++|+++.+ T Consensus 238 ~~~~~~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~ 317 (460) T protein:vir:10 238 NSQNSTIDNNVKTMQNGGVFGFIHGGSTGLTQPQADSLKQRLTEMDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYL 317 (460) T ss_pred HHHHHHHHHHHHHHhcCCCcceeeecCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHH Confidence 99999999999999997 467889999999999999999998765 4678999999999999999999999999999 Q ss_pred HHHHHHHHHHhCCCHHHhCCcccC--cccHHHH-HHHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhh-- Q lcl|NC_019456. 268 QISRIRIATAFNVPISFLNDDQAK--STTNVEH-VTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLL-- 342 (435) Q Consensus 268 ~~~~~~Ia~~fgvP~~~lg~~~~~--~~~~~e~-~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~-- 342 (435) ++++++||++|||||.+||..+.+ +++|.|+ ...|+++||.|++..|+++|+++|+++.+...+++++||++.+. T Consensus 318 ~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l 397 (460) T protein:vir:10 318 KYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEM 397 (460) T ss_pred HHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhH Confidence 999999999999999999987654 4667665 55677999999999999999999999998888999999999884 Q ss_pred ccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCC Q lcl|NC_019456. 343 RGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENT 422 (435) Q Consensus 343 ~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 422 (435) +.|.+ ....++++|++|+||+|+++|+||++++|||++++|+|++|++..++...+ +++|. T Consensus 398 ~~d~~----~~~~~~~~g~~T~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~~~~~~~---------------~~~nq 458 (460) T protein:vir:10 398 QTDMV----AMASWLNTIPVTPNEIRIAMKYETLNQDGMDIVFMPSNKVRIDDVSNNLID---------------SAFNQ 458 (460) T ss_pred HHHHH----HHHHHHhCCCCCHHHHHHHhCCCCCCCCCCCeeeecccccchhhcccccCC---------------CcccC Confidence 33433 344577899999999999999999999999999999999999865432111 11111 Q ss_pred CC Q lcl|NC_019456. 423 NE 424 (435) Q Consensus 423 ~~ 424 (435) ++ T Consensus 459 ~~ 460 (460) T protein:vir:10 459 NQ 460 (460) T ss_pred CC Confidence 11 No 43 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=100.00 E-value=4.4e-84 Score=477.59 Aligned_cols=373 Identities=17% Similarity=0.226 Sum_probs=309.0 Q ss_pred CchHHHHHhh------------------------ccccccccccccccchhhhhhccc----cccCcccccHHHHhhhHH Q lcl|NC_019456. 1 MSFMSKVRQF------------------------FGVHDQANQIVQNPIPQPLDMAGV----KLEQATFSREHILESNEY 52 (435) Q Consensus 1 Mg~~~~~~~~------------------------~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~ 52 (435) ||||+++++. |..+..........+..+..+.|. .+.....++.+.++++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~ 80 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLIDV 80 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccchhhHhhhHH Confidence 9999999875 111111111111112222222222 223345677889999999 Q ss_pred HHHHHHHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEE Q lcl|NC_019456. 53 IFSIVTRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWP 132 (435) Q Consensus 53 v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~ 132 (435) ||+||++||++||+||++++++++.. +.+.++++.+||++||+.+||+.++.+|++ ||+|++++.++..|.|++|+| T Consensus 81 v~acV~~Ia~~iA~lpl~~~~~~~~~--~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay~~~i~r~~~G~~~~L~p 157 (409) T protein:vir:83 81 AWACIDLNASVLSSMPIYRMRNGRII--DSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEAFVLPMAHGSDGYPIRFRV 157 (409) T ss_pred HHHHHHHHHHhhccCceEEeeCCccc--cchhhhcccCCCCCCCHHHHHHHHHHHHhh-CCcEEEEEEECCCCcEEEEEE Confidence 99999999999999999999877654 455668899999999999999999999987 999998775567799999999 Q ss_pred eCCceeEEEEcCCCceEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--C Q lcl|NC_019456. 133 LDPNTVSILRNTDNNSYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--D 210 (435) Q Consensus 133 l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~ 210 (435) |+|++|++..+.++... |++.. .+.+++|||||++++.++++|+||+..+...+....++++++.++|.|+ + T Consensus 158 l~p~~v~v~~~~~g~~~-y~~~~-----~~~~~eiiHir~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga~p 231 (409) T protein:vir:83 158 VPPWLVNVELKKGARRE-YRIGG-----LNVTDEILHIRYQGNTADAHGHGPLESAAPRQVVIGLLQKYVQNLAETGGVP 231 (409) T ss_pred ECCcceEEEEcCCceEE-EEEcc-----ccCccceEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Confidence 99999999988776544 44432 2446899999998888999999999999999999999999999999997 6 Q ss_pred ceEEEeCCcCCHHHHHHHHHHHHHHh-cCCCccccccCCceee-eccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCc Q lcl|NC_019456. 211 KFVLQYDRSISPEKRQAMVNDFLRMV-KENGGAVVQEAGWKVD-RYESKFEPADLSSVEQISRIRIATAFNVPISFLNDD 288 (435) Q Consensus 211 ~~~~~~~~~~~~e~~~~~~~~~~~~~-~~~~~~~vl~~g~~~~-~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~ 288 (435) ++++++++.+++|+.++++++|+... +|+|++++|.+|+++. ++++++.|+||+|.+++++++||++|||||++||.. T Consensus 232 ~gil~~~~~ls~e~~~~~~~~~~~~~~~nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~llg~~ 311 (409) T protein:vir:83 232 LYWLGVERRLSETEAVDLMDRWIESRSKYAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPPFLVGLP 311 (409) T ss_pred ceEeecCCCCCHHHHHHHHHHHHHhhCCccCccceecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHHHccCC Confidence 79999999999999999999997654 5788899999999874 689999999999999999999999999999999975 Q ss_pred cc---CcccHHHHH-HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCH Q lcl|NC_019456. 289 QA---KSTTNVEHV-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKP 364 (435) Q Consensus 289 ~~---~~~~~~e~~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~ 364 (435) .. .+++|.||+ ..|+++||.|++.+||++|+++|+++ +.+++||++.+++.|.+++++++++++++|+||+ T Consensus 312 ~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~~~Ll~~-----~~~~~f~~~~llr~d~~~r~~~~~~~~~~G~lT~ 386 (409) T protein:vir:83 312 GATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALDRWALPS-----PQHLELNRDDYTRPSLVERATAYKIMIEAGVMEP 386 (409) T ss_pred CCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC-----CcEEEeehhhhhccCHHHHHHHHHHHHhCCCcCH Confidence 44 356777665 45678999999999999999999875 5689999999999999999999999999999999 Q ss_pred HHHHHHhCCCCCCCcCCceeeeccccc Q lcl|NC_019456. 365 NEIRELEGQAPIPDEAADHLYISKDLY 391 (435) Q Consensus 365 NE~R~~~g~~p~~~~~gd~~~~~~n~~ 391 (435) ||+|+++||||+ +|||++- .+-+ T Consensus 387 NE~R~~~glpp~--~ggd~l~--~~gv 409 (409) T protein:vir:83 387 NEARAMERLHSE--AAAVRLS--GGGV 409 (409) T ss_pred HHHHHHhCCCCC--CCCcccC--CCCC Confidence 999999999997 6899872 2212 No 44 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=100.00 E-value=4e-83 Score=472.34 Aligned_cols=346 Identities=36% Similarity=0.653 Sum_probs=310.6 Q ss_pred HhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEc Q lcl|NC_019456. 64 LASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRN 143 (435) Q Consensus 64 ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~ 143 (435) ||++||+++++++ ..+|+++++|+.+||++||+.+||+.++.+++++||||++++++. .|+|++||||+|.+|++..+ T Consensus 1 ia~lp~~~~~~~~-~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~-~G~~~~L~~l~~~~v~~~~~ 78 (348) T protein:vir:93 1 MASLPLKMYEDYK-VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI-YHQPSKLFLLNPDVVEMLIE 78 (348) T ss_pred CcccceEeEecCc-CcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECC-CCcEEEEEEEcCCceEEEEe Confidence 9999999998765 557999999999999999999999999999999999999999874 58999999999999999998 Q ss_pred CCCceEEEEE-ecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCH Q lcl|NC_019456. 144 TDNNSYWYRV-TSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISP 222 (435) Q Consensus 144 ~~~~~~~~~~-~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~ 222 (435) .++..++|.+ ..+|...+|+++||||||++++.++++|+|++..+..++..+.++++++...+.+++.++++.++.+++ T Consensus 79 ~~~~~~~y~~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~ 158 (348) T protein:vir:93 79 NQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGSNVST 158 (348) T ss_pred CCCcEEEEEEEcCCCeEEEEccccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHhcCCCceeEEecCCCCCH Confidence 8876665544 445667889999999999988889999999999999999999999999877777777889999999999 Q ss_pred HHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH-H Q lcl|NC_019456. 223 EKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT-H 301 (435) Q Consensus 223 e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~-~ 301 (435) |+.++++++|.+..+|+++++++++|++|+++++++.++||.|++++++++||++|||||.+||....+++++.|++. . T Consensus 159 e~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~~ 238 (348) T protein:vir:93 159 EKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRF 238 (348) T ss_pred HHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999999999999999888999877654 5 Q ss_pred HHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCC Q lcl|NC_019456. 302 SWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAA 381 (435) Q Consensus 302 ~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~g 381 (435) |+++||.|+++.|+++|+++||++.++..|.+|+||.+++++.|.+++++++++++++|++|+||+|+++|++|+ ||| T Consensus 239 ~~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~--~gg 316 (348) T protein:vir:93 239 YLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPV--EGG 316 (348) T ss_pred HHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC--CCc Confidence 668999999999999999999999999999999999999999999999999999999999999999999999999 579 Q ss_pred ceeeecccccchhccccccccccccccccccccccCCCCCCCCCC Q lcl|NC_019456. 382 DHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENG 426 (435) Q Consensus 382 d~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 426 (435) |++++++|++|++...+.+ ...+||+++++++ T Consensus 317 D~~~~~~n~~~~~~~~~~~-------------~~~~gg~~n~~~~ 348 (348) T protein:vir:93 317 DKPLISGDLYPIDTPLELR-------------KSLKGGDKNVNES 348 (348) T ss_pred CeEeecccccccccchhhc-------------ccccCCCCCcCCC Confidence 9999999999987643332 1123444333333 No 45 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=100.00 E-value=5.9e-83 Score=471.40 Aligned_cols=392 Identities=17% Similarity=0.212 Sum_probs=311.4 Q ss_pred CchHHHHH-----hhccccccccccccc----cch-hhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCcee Q lcl|NC_019456. 1 MSFMSKVR-----QFFGVHDQANQIVQN----PIP-QPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLH 70 (435) Q Consensus 1 Mg~~~~~~-----~~~~~~~~~~~~~~~----~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 70 (435) |.+|+-++ ++|+..+......+. +.. ....................++++++|++||++||++||++||+ T Consensus 1 ~~~~~~~~~~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cI~~ia~~ia~~~~~ 80 (413) T protein:vir:96 1 MPGVSEIRKDKNLKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFFKELISDGYTKLSDSPEVRMAVDCIADLVSNMTIQ 80 (413) T ss_pred CCccchhhhhhcCCccccCCCcchhhhhhccccccccccccchhhHhhhccchhHHHhhchHHHHHHHHHHHhhccCceE Confidence 44444222 123222211111000 000 00000000000000111234678999999999999999999999 Q ss_pred eeecc---cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCc Q lcl|NC_019456. 71 EYQNY---KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNN 147 (435) Q Consensus 71 ~~~~~---~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~ 147 (435) +++++ ....+|+++++|+.+||++||+++||+.++.+++++||||++++++..++.+++|||++|++|++..+.+.. T Consensus 81 ~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~~l~~~~v~~~~~~~~~ 160 (413) T protein:vir:96 81 LMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLTPISPYKVTFNVSDDDL 160 (413) T ss_pred EEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEEEecCceeEEEEcCCeE Confidence 98743 455679999999999999999999999999999999999999999877778999999999999998876543 Q ss_pred eEEEEEecCCeeEEEchhheEEecc-CCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHH Q lcl|NC_019456. 148 SYWYRVTSDIYNFTIPINDVIHVKH-VVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEK 224 (435) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~iih~~~-~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~ 224 (435) .|.+..++ .+++++|||||++ +++.++++|+||+.++...+..+.++++++.++|.|+ +++++++++.+++++ T Consensus 161 --~y~~~~~~--~~~~~~evih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~ 236 (413) T protein:vir:96 161 --DYSITFDN--KEYDPSTLLHFVLNPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSDSDELS 236 (413) T ss_pred --EEEEeecC--cEEchhhEEEEeccCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHH Confidence 34444444 4789999999996 4556788999999999999999999999999999997 578999999999999 Q ss_pred HHHHHHHHHHHh---cCCCccccccCCc-eeeec-cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH Q lcl|NC_019456. 225 RQAMVNDFLRMV---KENGGAVVQEAGW-KVDRY-ESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV 299 (435) Q Consensus 225 ~~~~~~~~~~~~---~~~~~~~vl~~g~-~~~~~-~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~ 299 (435) .++++++|+... .++|+++|+++|. ++..+ ++++.++|+++.+++++++||++|||||.+||..+ ++.++. T Consensus 237 ~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~----~~~~~~ 312 (413) T protein:vir:96 237 DEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVGT----YNKDEF 312 (413) T ss_pred HHHHHHHHHHHhcCccccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCc----chHHHH Confidence 999999998754 4578888887665 55555 46899999999999999999999999999998643 345667 Q ss_pred HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCc Q lcl|NC_019456. 300 THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDE 379 (435) Q Consensus 300 ~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~ 379 (435) ..|+++||.|+++.|+++|+++|+++ +.+++||++.+++.|.+++++++++++++|++|+||+|+++|+||+ | T Consensus 313 ~~~~~~~l~P~~~~ie~~ln~~ll~~-----~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~--~ 385 (413) T protein:vir:96 313 NNFINTKIMSIAQVIQQTYNKLIVEE-----DMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWVGMPPD--A 385 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCC-----CcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--C Confidence 78899999999999999999999864 6789999999999999999999999999999999999999999998 5 Q ss_pred CCceeeecccccchhccccccccccccc Q lcl|NC_019456. 380 AADHLYISKDLYPLDKYYDAILDNKIQT 407 (435) Q Consensus 380 ~gd~~~~~~n~~~l~~~~~~~~~~~~~~ 407 (435) |||++++++|++|++...+.+..+++.+ T Consensus 386 ~gd~~~~~~n~~~~~~~~~~~~~~~~dt 413 (413) T protein:vir:96 386 EMDDLLVLENYLQQKDLVNQKKLIQDET 413 (413) T ss_pred CcceeeecccccchhhcccccCCCCCCC Confidence 7999999999999987655443333333 No 46 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=100.00 E-value=3.6e-82 Score=467.10 Aligned_cols=393 Identities=19% Similarity=0.243 Sum_probs=318.9 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc---cc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY---KQ 77 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~---~~ 77 (435) ||||++++.+.. + ......+....+... ........+....++++++|++||++||++||++||++++.+ .. T Consensus 1 Mg~f~~~~~~~~---~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 75 (406) T protein:vir:95 1 MGLFDRWRRTKR---K-SKIRADTGYVGLFMS-GEDVSFLVPGYVRLSDNPEVRMAVHKIADLISSMTIYLMQNTEDGDI 75 (406) T ss_pred Ccchhhhccccc---c-ccccccchhhhhhcc-CcccCccccCHHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Confidence 999998765422 2 222222222222222 223334456677899999999999999999999999998743 34 Q ss_pred cccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceE--EeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEec Q lcl|NC_019456. 78 MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAW--IQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTS 155 (435) Q Consensus 78 ~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~--i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~ 155 (435) ...|+++++|+.+||++||+++||+.++.+++++|++|++ +.++ ..|.|++||||+|.+|++..+.++..+. . T Consensus 76 ~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~-~~g~~~~l~~i~~~~v~~~~~~~~~~~~----~ 150 (406) T protein:vir:95 76 RIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYT-ADGLIDELVPLTPSKVNFLDTPDGYQVL----Y 150 (406) T ss_pred eecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEEC-CCCcEEEEEEEcCceeEEEEcCCeEEEE----e Confidence 5578999999999999999999999999999999877555 4554 4589999999999999999988764332 2 Q ss_pred CCeeEEEchhheEEecc-CCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHH Q lcl|NC_019456. 156 DIYNFTIPINDVIHVKH-VVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDF 232 (435) Q Consensus 156 ~~~~~~~~~~~iih~~~-~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~ 232 (435) ++ ++|+++||||+++ +++.++++|.|++..+...+....++++++.++|.|+ +.+++++++.+++++.++++++| T Consensus 151 ~~--~~~~~~evih~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~ 228 (406) T protein:vir:95 151 GG--QTFNYDEVLHFIYNPDPERPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAV 228 (406) T ss_pred cc--EEEchhHEEEeeccCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHH Confidence 22 5799999999996 4667889999999999999999999999999999997 56899999999999999999999 Q ss_pred HHHh---cCCCcccccc-CCceeeec-cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHH Q lcl|NC_019456. 233 LRMV---KENGGAVVQE-AGWKVDRY-ESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTL 307 (435) Q Consensus 233 ~~~~---~~~~~~~vl~-~g~~~~~~-~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i 307 (435) .... .++++++|+. ++.+++++ .+++.++|+.|.+++++++||++|||||.+||..+ .+.++...||++|| T Consensus 229 ~~~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~----~~~~~~~~~~~~~l 304 (406) T protein:vir:95 229 FKKYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIGE----FNRDEYNNFINSTI 304 (406) T ss_pred HHHhccccccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC----chHHHHHHHHHHHH Confidence 7654 4668887775 45677775 47899999999999999999999999999998643 34566778899999 Q ss_pred hHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeec Q lcl|NC_019456. 308 MPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYIS 387 (435) Q Consensus 308 ~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~ 387 (435) .|++++|+++|+++||++. +++++||++++++.|.+++++.+.+++++|+||+||+|+++|+||+ ||||+++++ T Consensus 305 ~P~~~~ie~~l~~~l~~~~----~~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~--~~gd~~~~~ 378 (406) T protein:vir:95 305 LPIAKGIEQELTRKLLISP----DLYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPK--EGLSELVIL 378 (406) T ss_pred HHHHHHHHHHHHHhcCCCC----CcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcceeeec Confidence 9999999999999999753 5689999999999999999999999999999999999999999998 579999999 Q ss_pred ccccchhccccccccccccccccccccccCCCCCCCCCCCCC Q lcl|NC_019456. 388 KDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQS 429 (435) Q Consensus 388 ~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (435) +|++|++.+.+... .+||+++++++++. T Consensus 379 ~n~~~~~~~~~~~~--------------~k~g~~~~~~~~~~ 406 (406) T protein:vir:95 379 ENYIPLDKIGDQSK--------------LKGGDNSGADGQTD 406 (406) T ss_pred cCccchhhcccccc--------------cCCCCCCCCCCCCC Confidence 99999987644322 23444433332222 No 47 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=100.00 E-value=3.9e-82 Score=466.87 Aligned_cols=390 Identities=19% Similarity=0.291 Sum_probs=309.7 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec---ccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN---YKQ 77 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~---~~~ 77 (435) ||||+++++. .+ +.+... ...+ ..+.............+..+|+||+||++||++||++|+++|++ +.. T Consensus 1 Mg~~~~f~~k----~~-~~~~~~-~~~~--~~~~~~~~~~~~~~~~~~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~~ 72 (403) T protein:vir:80 1 MGLFNFFRRK----TR-SEPTNA-ISWF--LTQEAYDTLAIPGYTRLSDNPEVRMAVHKIAELISSMTIHLMQNTDNGDI 72 (403) T ss_pred Cccccccccc----cc-ccccch-hhhh--cccccccccccchhhhhhhhHHHHHHHHHHHHhhhhCceEEEEecCCcee Confidence 9999876532 11 111111 1111 11111111112222345678999999999999999999999874 334 Q ss_pred cccchHHHhhhccccccCCHHHHHHHHHHHHHhc--CCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEec Q lcl|NC_019456. 78 MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVS--GNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTS 155 (435) Q Consensus 78 ~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~--G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~ 155 (435) ...|+++++|+.+||++||+++||+.+++++++. ||||+++.++ ..|+|++||||+|..|++..+.++..++|. T Consensus 73 ~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~-~~g~~~~L~~l~p~~v~~~~~~~g~~~~y~--- 148 (403) T protein:vir:80 73 RIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYT-TSGLIDELIPLAPSKVSFVDTDTGYQIWYQ--- 148 (403) T ss_pred ecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEc-CCCcEEEEEEEcCCeeEEEEcCCceEEEEe--- Confidence 4578999999999999999999999999999985 7788888876 458999999999999999999888665543 Q ss_pred CCeeEEEchhheEEecc-CCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHH Q lcl|NC_019456. 156 DIYNFTIPINDVIHVKH-VVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDF 232 (435) Q Consensus 156 ~~~~~~~~~~~iih~~~-~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~ 232 (435) .+.|+++|||||+. +++.++++|+||+..+...+....++++++.++|+|| +++++++++.+++++.++++++| T Consensus 149 ---~~~~~~~eiih~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~ 225 (403) T protein:vir:80 149 ---GKAYNYDEVLHFIVNPDPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAV 225 (403) T ss_pred ---ecccchhhEEEEeccCCCcCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHHHHHHH Confidence 24699999999995 5677889999999999999999999999999999997 57899999999999999988888 Q ss_pred HHH---hcCCCccccccCC-ceeeecc-CChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHH Q lcl|NC_019456. 233 LRM---VKENGGAVVQEAG-WKVDRYE-SKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTL 307 (435) Q Consensus 233 ~~~---~~~~~~~~vl~~g-~~~~~~~-~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i 307 (435) ... ..++|++++++.+ .+++++. +++.++|++|.+++++++||++|||||++||..+. +.++...|+.+|| T Consensus 226 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~----~~~~~~~f~~~~l 301 (403) T protein:vir:80 226 FKKYLEASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLLGVGKY----DKDEYNNFINSTI 301 (403) T ss_pred HHHHhhhhhcCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCc----cHHHHHHHHHHHH Confidence 643 3467888887655 4555544 58899999999999999999999999999986432 2345567999999 Q ss_pred hHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeec Q lcl|NC_019456. 308 MPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYIS 387 (435) Q Consensus 308 ~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~ 387 (435) .|++.+|+++|+++|+++. +++++||.+.+++.|.+++++++.+++++|+||+||+|+++|+||+ +|||+++++ T Consensus 302 ~P~~~~ie~~l~~kll~~~----~~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~--~ggd~~~~~ 375 (403) T protein:vir:80 302 LPIAKGIEQELTRKLLISP----DLYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPK--EGLSELVIL 375 (403) T ss_pred HHHHHHHHHHHHHhccCCC----CcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCCCeEeec Confidence 9999999999999999764 4689999999999999999999999999999999999999999998 589999999 Q ss_pred ccccchhccccccccccccccccccccccCCCCCCCCCCCCC Q lcl|NC_019456. 388 KDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQS 429 (435) Q Consensus 388 ~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (435) +|++|++..++.. ..++|++++++++.. T Consensus 376 ~n~~pl~~~~~~~--------------~~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 376 ENYIPLDKIGDQN--------------KLKGGEKGGADGQTD 403 (403) T ss_pred ccccchhhccchh--------------hccCCCCCCCCCCCC Confidence 9999998654432 222333333322222 No 48 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=100.00 E-value=2.1e-81 Score=462.92 Aligned_cols=408 Identities=14% Similarity=0.166 Sum_probs=306.6 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec-ccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN-YKQMD 79 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~-~~~~~ 79 (435) |+-+ .+.... +...........+.++|+++++||+||++||++||++||+++++ ++... T Consensus 1 ~~~~----------------~~~~g~----~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~~ 60 (723) T protein:vir:94 1 MTTF----------------PSGAGG----WNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDGELDE 60 (723) T ss_pred Cccc----------------ccCCCc----cccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCCccch Confidence 2111 000000 00111112222456789999999999999999999999999875 44556 Q ss_pred cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeC--CCCcEEEEEEeCCceeEEEEcCCCc------eE-E Q lcl|NC_019456. 80 NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSL--STGEPIALWPLDPNTVSILRNTDNN------SY-W 150 (435) Q Consensus 80 ~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~--~~g~~~~l~~l~~~~v~~~~~~~~~------~~-~ 150 (435) .|+++++|+.+||++||+++||+.++.+|+++||+|+++++++ ..|.|.+|+|+++..+.+....++. .+ | T Consensus 61 ~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y 140 (723) T protein:vir:94 61 LHPLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGY 140 (723) T ss_pred hhHHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEE Confidence 7999999999999999999999999999999999999999754 4589999999999887776554432 12 2 Q ss_pred EEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHH Q lcl|NC_019456. 151 YRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAM 228 (435) Q Consensus 151 ~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~ 228 (435) .....+|....|+++||||||++++.++++|+||+..+.+.|..+.++++++.++|+|| ++++++.+ .+++++.+++ T Consensus 141 ~~~~~~G~~~~~~~~dIiHir~~~~~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~-~l~~e~~~~~ 219 (723) T protein:vir:94 141 VIERTDGVRVPVLADEMLWLRFSDPYDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVNLG-DMDEQTFTKT 219 (723) T ss_pred EEEecCceeEEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC-CCCHHHHHHH Confidence 23345677788999999999998888999999999999999999999999999999998 56899876 5999999999 Q ss_pred HHHHHHHh---cCCCcccccc----------CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccH Q lcl|NC_019456. 229 VNDFLRMV---KENGGAVVQE----------AGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTN 295 (435) Q Consensus 229 ~~~~~~~~---~~~~~~~vl~----------~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~ 295 (435) +++|.... +|+|+++||+ .|++|+++++++.|+||+|.+++++++||++|||||.+|++. ++++| T Consensus 220 ~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~--st~sN 297 (723) T protein:vir:94 220 VAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLGG--STYEN 297 (723) T ss_pred HHHHHHHhhchhhcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCC--CCccc Confidence 99997543 5788888875 699999999999999999999999999999999999999753 45666 Q ss_pred HHH-HHHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCC Q lcl|NC_019456. 296 VEH-VTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQA 374 (435) Q Consensus 296 ~e~-~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~ 374 (435) .++ ...||++||.||+..|+++|+++|++.... ..+++||...+++.|.+++++++.+++++|+||+||+|+++|+| T Consensus 298 ~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~g~--~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglp 375 (723) T protein:vir:94 298 QAEAKAAVWTETLIPQMEVMASITDLQLLPDIGW--TVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLD 375 (723) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhHhhcccccC--ceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 654 556789999999999999999999975432 35677888889999999999999999999999999999999999 Q ss_pred CCCCcCCceeeecc--cccchhccccccccccccccccccccccCCCCCCCC---------CCCCCCCCCCC Q lcl|NC_019456. 375 PIPDEAADHLYISK--DLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNE---------NGLQSTEPEGS 435 (435) Q Consensus 375 p~~~~~gd~~~~~~--n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~ 435 (435) |+|+..|+.++.|. |+.|.+.......++.... ........+.++..+ ..+++.+++.+ T Consensus 376 Pi~gGd~~~~~~p~~~~~a~~~~~~p~~~e~~~~~--~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~ 445 (723) T protein:vir:94 376 PLPGGIGQMTLTPYRAQFAPAPAPAPAVEEGAARM--LALLERVAADRPLPELPVRATTVLHHDPGPDPQQT 445 (723) T ss_pred CCCCCcccceeccccccccCCCCCCccchhhhHhh--hhhccccccccCcCCCCCCCCCCCCCCcccCCchh Confidence 99654444555554 4444433221111111000 000000011111111 11222222222 No 49 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=100.00 E-value=1.7e-81 Score=463.36 Aligned_cols=389 Identities=18% Similarity=0.237 Sum_probs=316.8 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) ||||++.+. .++..+... ..+..+.+. .....+++.+.++++++|++||++||++||++||++ . T Consensus 1 M~~f~~~~~----~~~~~~~~~---~~~~~~~~~-~~~~~~v~~~~al~~~~V~~~v~~ia~~ia~~p~~~--------~ 64 (397) T protein:vir:38 1 MPLLKLNKS----HSQGFSLND---PDWVNFLTG-GEAQKYVSADTALKNSDIFSLIMQLSGDLAMVRYTS--------E 64 (397) T ss_pred Ccchhhhhc----ccCcccCCc---hhhhhhhcC-CcCCceechHHhhccHHHHHHHHHHHHHHhhCcccc--------c Confidence 999987642 122222222 222222222 234566888899999999999999999999999975 4 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEec----C Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTS----D 156 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~----~ 156 (435) |+.+++|+.+||++||+++||+.++.+++++||||++++++. .|.+++|+|++|++|++..+.++...+|.+.. . T Consensus 65 ~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~-~g~~~~l~~l~~~~v~i~~~~~~~~~~y~~~~~~~~~ 143 (397) T protein:vir:38 65 SDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNT-NGVDLSWEYLRPSQVQPMLLQDGSGLIYNINFDEPAI 143 (397) T ss_pred ccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECC-CCcEEEEEEEcCceeEEEEcCCCceEEEEEEeccccc Confidence 677888999999999999999999999999999999999875 58999999999999999998887666655543 3 Q ss_pred CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHH Q lcl|NC_019456. 157 IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLR 234 (435) Q Consensus 157 ~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~ 234 (435) +..++|+++||||++++++.+.++|.|++.++...+....++++++.++|+|+ +++++++++.+++++.+++++.|+. T Consensus 144 ~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~ 223 (397) T protein:vir:38 144 GYMENVPAADVIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEI 223 (397) T ss_pred cceeEecCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHH Confidence 45678999999999998887788999999999999999999999999999997 5789999999999999999999864 Q ss_pred H--hcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhHHHH Q lcl|NC_019456. 235 M--VKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMPIIR 312 (435) Q Consensus 235 ~--~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P~~~ 312 (435) . ..|+++++|++.|++|++++.++.++||.+.+++.+++||++|||||.+||+...++ ++.+++..||++||.|++. T Consensus 224 ~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~-~~~e~~~~~~~~~l~P~~~ 302 (397) T protein:vir:38 224 SKQIHNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQ-SSITQISGQYAKSLNRYVQ 302 (397) T ss_pred HhcccccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc-cHHHHHHHHHHHHHHHHHH Confidence 4 356889999999999999999999999999999999999999999999999877554 6778888899999999999 Q ss_pred HHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccc Q lcl|NC_019456. 313 QYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYP 392 (435) Q Consensus 313 ~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~ 392 (435) .|+++|+++|+++. +|++..+...|.+++++.+++++++|++|+||+|+++|++|+ ++||.+.......+ T Consensus 303 ~ie~~ln~~l~~~~--------~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~--~~~d~~~~~~~~~~ 372 (397) T protein:vir:38 303 AIVGELNDKLHANI--------SANIRFAIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGY--LAKDLPDPEKEPQQ 372 (397) T ss_pred HHHHHHHHhccChh--------cccccccccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCCccccccccccc Confidence 99999999999753 345555677899999999999999999999999999999998 46776543322222 Q ss_pred hhccccccccccccccccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 393 LDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPE 433 (435) Q Consensus 393 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (435) ... ..+...|.+++++.++++.++| T Consensus 373 ~~~----------------~~~~~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 373 AIQ----------------LIQQEGGENDGNNSDERGSDPE 397 (397) T ss_pred ccc----------------ccccccCCCCCCCCCCCCCCCC Confidence 111 1122233334444444555555 No 50 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=100.00 E-value=1.6e-80 Score=458.05 Aligned_cols=383 Identities=18% Similarity=0.273 Sum_probs=306.3 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec------ Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN------ 74 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~------ 74 (435) |||++|+...+++..+..... ... ...+......+.+.|+++++|++||++||++||+||++++++ T Consensus 1 mg~~~~~~~~~~~~~~~~~~~-----~~~---~~~~~~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~~~~ 72 (403) T protein:vir:10 1 MGFKSWITEKLNPGQRIIRDM-----EPV---SHRTNRKPFTTGQAYSKIEILNRTANMVIDSAAECSYTVGDKYNIVTY 72 (403) T ss_pred Ccchhhhhhccchhhhhhhcc-----ccc---ccccCCcccccHHHHHHHHHHHHHHHHHHHHHhhCceeEeeccccccc Confidence 999999987765443221111 011 111223333456788999999999999999999999999863 Q ss_pred ccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEe Q lcl|NC_019456. 75 YKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVT 154 (435) Q Consensus 75 ~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~ 154 (435) +..+..|++.++|+.+||++||+++||+.++.+++++||||+++.+ ..|++++++.|++..+.++..+++. . T Consensus 73 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~~-------~~l~~l~~~~~~v~~~~~~~~~~~~-~ 144 (403) T protein:vir:10 73 ANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWDG-------TSLYHVPAALMQVEADANKFIKKFI-F 144 (403) T ss_pred ccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEeC-------ceeEeecCcceEEEEcCCceEEEEE-e Confidence 2344578899999999999999999999999999999999987642 2599999999999888766554443 3 Q ss_pred cCCeeEEEchhheEEeccCC----CccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHH Q lcl|NC_019456. 155 SDIYNFTIPINDVIHVKHVV----PSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAM 228 (435) Q Consensus 155 ~~~~~~~~~~~~iih~~~~~----~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~ 228 (435) .+ ...+++++|+|++... +.++++|+||+.++.+.+..+.++++++.++|+|| +.+++++++.+++++.+++ T Consensus 145 ~~--~~~~~~~eiih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~ 222 (403) T protein:vir:10 145 NN--QINYRVDEIIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDEILNKKLRERK 222 (403) T ss_pred cC--ceeecccceEEecccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHH Confidence 32 3568899999999543 35789999999999999999999999999999997 5689999999999999999 Q ss_pred HHHHHHHh---cCCCccccccCCceeeeccC--ChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHH-HHHHHH Q lcl|NC_019456. 229 VNDFLRMV---KENGGAVVQEAGWKVDRYES--KFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNV-EHVTHS 302 (435) Q Consensus 229 ~~~~~~~~---~~~~~~~vl~~g~~~~~~~~--~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~-e~~~~~ 302 (435) ++.|.... +|+|+++||++|++|++++. ++.|+||+|.+++++++||++|||||.+||... ++|. ++...| T Consensus 223 ~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~---~sn~e~~~~~f 299 (403) T protein:vir:10 223 QEELQLDYNPSTGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGN---NANIRPNIELF 299 (403) T ss_pred HHHHHHHhCCcccCcceeecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC---CcCHHHHHHHH Confidence 99998654 56789999999999999985 577999999999999999999999999998643 4454 455678 Q ss_pred HHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhh--hccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcC Q lcl|NC_019456. 303 WTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGL--LRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEA 380 (435) Q Consensus 303 ~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l--~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~ 380 (435) +++||.|++..|+++|+++| +.+++||++.+ ++.|.+++++++++++++|++|+||+|+++|+||+|+++ T Consensus 300 ~~~tl~P~~~~ie~~l~~~L--------~~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~~~ 371 (403) T protein:vir:10 300 YYMTIIPMLNKLTSSLTFFF--------GYKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDDEQ 371 (403) T ss_pred HHHHHHHHHHHHHHHHHHhc--------CceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCccc Confidence 89999999999999999987 35688888866 788999999999999999999999999999999999999 Q ss_pred CceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 381 ADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPE 433 (435) Q Consensus 381 gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (435) ||++++|+|+...... ..+++.+++ +.+++.| T Consensus 372 ~d~~~~p~n~~~~~~~-------------------~~~~e~~~~--~~~~~g~ 403 (403) T protein:vir:10 372 MNKIRIPANVAGSATG-------------------VSGQEGGRP--KGSTEGD 403 (403) T ss_pred cccccccccccccccc-------------------CCCCcCCCC--CCCcCCC Confidence 9999999997542211 111111111 1111111 No 51 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=100.00 E-value=1.4e-79 Score=452.90 Aligned_cols=422 Identities=13% Similarity=0.136 Sum_probs=311.7 Q ss_pred CchHH--HH-H-----hhccccccccccccccchhhhhhccccc-----------cCcccccHHHHhhhHHHHHHHHHHH Q lcl|NC_019456. 1 MSFMS--KV-R-----QFFGVHDQANQIVQNPIPQPLDMAGVKL-----------EQATFSREHILESNEYIFSIVTRLS 61 (435) Q Consensus 1 Mg~~~--~~-~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~v~~~i~~ia 61 (435) .=||+ ++ + .-|+.+..+ .....++..+-.. .....+....++++++|++||++|| T Consensus 62 ~~~~~~~~~~kk~~i~~pfkkk~~~------~~~d~f~~s~es~s~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~IA 135 (945) T protein:vir:10 62 IIIFRKNQVLKKEKIIVPYNHQEPP------FKFNLFEYSPESLMYLPSISDPDAFFLINLFRKYRFNNDSKLIKVSEIP 135 (945) T ss_pred eeeehhhhHHHhhcccccccccccc------hhhhhhhccCccceecccccCccceeeehhhhhhhhccHHHHHHHHHHH Confidence 11221 11 1 111111000 0001111111000 0112245577889999999999999 Q ss_pred HHHhhCceeeeeccc----------ccccchHHHhhhccccccCCHHH----HHHHHHHHHHhcCCcceEEeeeCCCCcE Q lcl|NC_019456. 62 NVLASLPLHEYQNYK----------QMDNEPLADLLKTSPNPNMTAFE----FIARLETDRNVSGNGYAWIQKSLSTGEP 127 (435) Q Consensus 62 ~~ia~~~~~~~~~~~----------~~~~~~l~~~l~~~Pn~~~~~~~----f~~~~~~~~~~~G~~~~~i~~~~~~g~~ 127 (435) ++||++|+++|++.+ ....|+++.+|. +||++||+++ |++.++.+++++||+|++++++ ..|+| T Consensus 136 ~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~-rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd-~~G~i 213 (945) T protein:vir:10 136 KKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLE-RPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRD-EQGNL 213 (945) T ss_pred hhhccCceEEEEecccCcccccccccccchHHHHHHh-CCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEEC-CCCcE Confidence 999999999987432 234678887775 9999999998 5567889999999999999987 55899 Q ss_pred EEEEEeCCceeEEEEcCCCceEE-EEEecCC-eeEEEchhheEE-eccCCCccc--cccCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 128 IALWPLDPNTVSILRNTDNNSYW-YRVTSDI-YNFTIPINDVIH-VKHVVPSNS--WYGVSPIDVLSSSLKFQRSVENFS 202 (435) Q Consensus 128 ~~l~~l~~~~v~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~iih-~~~~~~~~~--~~G~s~l~~~~~~i~~~~~~~~~~ 202 (435) ++|+|++|.+|++..+.++...+ |....+| ....|+++|+|| ++.+++.+. .+|+||+.++.+.+..+.++++++ T Consensus 214 i~L~pLdPs~Vti~~ddDG~~~y~Yv~~idG~~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~a 293 (945) T protein:vir:10 214 VAITPVDGTTIKPILSEDTGIVVGYVQEVDGAIVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGN 293 (945) T ss_pred EEEEEECCcceEEEEcCCCcEEEEEEEecCCceEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHH Confidence 99999999999999988876554 3434343 455788888765 555544332 359999999999999999999999 Q ss_pred HHHhh-cC--CceEEEeC----------CcCCHHHHHHHHHHHHHHhc--CCCccccccCCceeeeccCChhhHHHHHHH Q lcl|NC_019456. 203 QNEME-KK--DKFVLQYD----------RSISPEKRQAMVNDFLRMVK--ENGGAVVQEAGWKVDRYESKFEPADLSSVE 267 (435) Q Consensus 203 ~~~~~-n~--~~~~~~~~----------~~~~~e~~~~~~~~~~~~~~--~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~ 267 (435) +++|. || ++++++++ +.+++++.+++++.|..... ++++++++++|++|+++++++.++|+.+.+ T Consensus 294 ar~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG~NnG~piVLdeGmef~pLs~s~~DaQfLEsr 373 (945) T protein:vir:10 294 LDYYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMGDYTQVPILSGGKFTWIDFKGKRRDMQFKELA 373 (945) T ss_pred HHHHHhCCCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCCcccccceecCCCceEEEccCChhHHHHHHHH Confidence 99995 55 56888754 56899999999999987653 457788999999999999999999999999 Q ss_pred HHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH-HHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCH Q lcl|NC_019456. 268 QISRIRIATAFNVPISFLNDDQAKSTTNVEHVT-HSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDT 346 (435) Q Consensus 268 ~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~-~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~ 346 (435) ++++++||++|||||.+||..+.+++++.|++. .|++++|.|++.+|+++|+++|+.... +.+++|+++.+...|. T Consensus 374 kfs~eeIArAFGVPP~lLG~~e~st~SNiEqq~~~Fv~~tL~Pil~~IEqeLNrkLl~~~e---g~~i~fdFd~ldl~D~ 450 (945) T protein:vir:10 374 EFVARKICAVYQVSPQDVGILEGSNKATAEVMASLTKAKGLEPLMATISKGFDEVVSEFRN---EKDIKLWFKEDDLEKE 450 (945) T ss_pred HHHHHHHHHHhCCCHHHcccCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc---CceeEEEecchhccCH Confidence 999999999999999999999988888877654 566899999999999999999875432 4456666666677889 Q ss_pred HHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecc-cccchhcccccccccc-ccccccccccccCCCCCCCC Q lcl|NC_019456. 347 AARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISK-DLYPLDKYYDAILDNK-IQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 347 ~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~-n~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 424 (435) +++++++.+++++|+||+||+|+++|+||+ +|||+++++. |+.|.+...+...+.. .+..+....++.+.+++.++ T Consensus 451 ksraEal~kli~sGiLTiNEvRe~lGLpPI--eGGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dE 528 (945) T protein:vir:10 451 RDWWNIIQGQLNTGFRSINEARMEKGLEPV--PWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDE 528 (945) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcceeeeccccccccccccccccCCCCcccccCCCCCCCCCCCCCCC Confidence 999999999999999999999999999999 5899999987 4667765543332222 12222222333333333344 Q ss_pred CCCCCCCCCCC Q lcl|NC_019456. 425 NGLQSTEPEGS 435 (435) Q Consensus 425 ~~~~~~~~~~~ 435 (435) +++.+++.+.+ T Consensus 529 ns~~psE~kda 539 (945) T protein:vir:10 529 NSSVPSEQKNA 539 (945) T ss_pred CCCCCCcccch Confidence 45555555555 No 52 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=100.00 E-value=8.9e-80 Score=453.98 Aligned_cols=409 Identities=14% Similarity=0.145 Sum_probs=316.8 Q ss_pred CchHHHHHhhccccccccccc-c----------------ccchhhhhh-ccccc----cCcccccHHHHhhhHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIV-Q----------------NPIPQPLDM-AGVKL----EQATFSREHILESNEYIFSIVT 58 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~-~----------------~~~~~~~~~-~~~~~----~~~~~~~~~~~~~~~~v~~~i~ 58 (435) ||||+||++.++...+..... . ...+....+ .|... .....++.+.|+++++|++||+ T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~~~a~~~~~v~~~i~ 80 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLATQAYQANGPVFACML 80 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccchhhhhccHHHHHHHH Confidence 999999998775442221110 0 011111111 11111 2344578889999999999999 Q ss_pred HHHHHHhhCceeeeecc----cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeC-------CCCcE Q lcl|NC_019456. 59 RLSNVLASLPLHEYQNY----KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSL-------STGEP 127 (435) Q Consensus 59 ~ia~~ia~~~~~~~~~~----~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~-------~~g~~ 127 (435) +||++||+|||++++++ .++.+|++ +.|+.+||++||+++||+.++.+++++||||++|+++. ..|.+ T Consensus 81 ~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~-~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~g~~ 159 (466) T protein:vir:81 81 VRQLVFSSVRFRWQRLRDGKPSDTFGSRD-LQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWVDVV 159 (466) T ss_pred HHHHhhccCceEEEEecCCceeeccccHH-HHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccCcce Confidence 99999999999999853 23445555 45667999999999999999999999999999999864 34678 Q ss_pred EEEEEeCCceeEEEEcCCCce-EEEEEecCC-----eeEEEchhheEEeccC-CCccccccCcHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 128 IALWPLDPNTVSILRNTDNNS-YWYRVTSDI-----YNFTIPINDVIHVKHV-VPSNSWYGVSPIDVLSSSLKFQRSVEN 200 (435) Q Consensus 128 ~~l~~l~~~~v~~~~~~~~~~-~~~~~~~~~-----~~~~~~~~~iih~~~~-~~~~~~~G~s~l~~~~~~i~~~~~~~~ 200 (435) ++|+|++|.+|++..+.++.. +.|.+..++ ...+|+++||||||++ ++.++++|+||+..+.++|..+.++++ T Consensus 160 ~~l~~l~~~~v~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~dviHir~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~ 239 (466) T protein:vir:81 160 VEERMVRGGRGELGGGQLGWRKVGYLYTEGGRQSGNESVGFLAEDVVHFAPIPDPLASYRGMSWLTPILREIRADQAMSK 239 (466) T ss_pred eEEEEecCcceEEEEcCCCceEEEEEEEecCcccccceeeeccccEEEEcCCCCcccccccccHHHHHHHHHHHHHHHHH Confidence 999999999999999887643 334443332 3568999999999975 567999999999999999999999999 Q ss_pred HHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHH Q lcl|NC_019456. 201 FSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIA 275 (435) Q Consensus 201 ~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia 275 (435) ++.++|+|+ +.+++++++.+++|+.+++++.|.... +|+|+++||++|++|+++++++.++||+|++++++++|| T Consensus 240 ~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia 319 (466) T protein:vir:81 240 HQAKFFDNGATVNLVIKHNPMADPAAVKKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGETRIA 319 (466) T ss_pred HHHHHHhcCCCcceEEecCCCCCHHHHHHHHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHHHHH Confidence 999999997 578999999999999999999997654 467899999999999999999999999999999999999 Q ss_pred HHhCCCHHHhCCcc---cCcccHHHHH-HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHH Q lcl|NC_019456. 276 TAFNVPISFLNDDQ---AKSTTNVEHV-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQ 351 (435) Q Consensus 276 ~~fgvP~~~lg~~~---~~~~~~~e~~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~ 351 (435) ++|||||++||+.+ .++++|.||+ ..|+++||.|++.+||++|+++|+++.+.. +.+++||.+.+++.|.+++++ T Consensus 320 ~~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~llr~d~~~r~~ 398 (466) T protein:vir:81 320 AAAGVPPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNLSGCIGHVMPDMGPDV-RLWYDADDVPFLREDEKDAAD 398 (466) T ss_pred HHhCCCHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCc-ceEEEecchhhhccCHHHHHH Confidence 99999999999764 4567777665 467899999999999999999999876654 578999999999999998876 Q ss_pred H-------HHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeee-cccccchhccccccccccccccccccccccCCCCCCC Q lcl|NC_019456. 352 Y-------YQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYI-SKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTN 423 (435) Q Consensus 352 ~-------~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~-~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 423 (435) + +..++++|+ |+||+|..+ ++||.+++ +.++.+++...... ........+..+|+++++ T Consensus 399 ~~~~~~~~~~~~~~~g~-t~nE~r~~~-------~~gd~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~Gg~~ng 465 (466) T protein:vir:81 399 IQKVRAETINTLITAGY-EPESVVAAV-------NSGDLRLLKHTGLTSVQLLPPGV-----SASASSDTPTSGGADDNG 465 (466) T ss_pred HHHHHHHHHHHHHHcCC-Chhhccccc-------cCCccccccCCCcchhhhccccc-----ccccCCCCcccCCCCcCC Confidence 5 677888985 999999632 46776554 44555555432211 111122223334443222 Q ss_pred C Q lcl|NC_019456. 424 E 424 (435) Q Consensus 424 ~ 424 (435) . T Consensus 466 n 466 (466) T protein:vir:81 466 N 466 (466) T ss_pred C Confidence 2 No 53 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=100.00 E-value=7.1e-79 Score=449.01 Aligned_cols=375 Identities=14% Similarity=0.208 Sum_probs=310.3 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) ||||+|+... ....++..... ....+.... .......++.+.|+++++|++||++||++||++||++++ T Consensus 1 Mg~~~~~~~~-~~~~~~~~~~~--~~~~~~~~~-~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~v~~------- 69 (385) T protein:vir:10 1 MGLLTPRNFN-KRKAKNMVYPS--NPAFFTTTV-GGMQLSYVSALSALQNTNVYSVINRIASDVASAHFKTEN------- 69 (385) T ss_pred Cccccchhcc-ccccccccccc--chhhhhhhc-cccCccccCHHHhhccHHHHHHHHHHHHHHhhCceeeec------- Confidence 9999986422 11222211111 111222111 122245677889999999999999999999999999964 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCCeeE Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDIYNF 160 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~ 160 (435) |....+ +.+||++||+++||+.++.+++++||||++++++ +.+++|+++.+|++..+.++..+++....++..+ T Consensus 70 ~~~~~l-l~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~-----~~~~~p~~~~~v~~~~~~~~~~~~~~~~~~~~~~ 143 (385) T protein:vir:10 70 TATLNR-LESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ-----NLEHIPNSDVQINYLPGNMGIVYTVLESNDRPQM 143 (385) T ss_pred cchhhh-hhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC-----ceeEeecCCceEEEEEcCCceEEEEEEcCCceEE Confidence 344444 4599999999999999999999999999999864 4689999999999999888877777777777888 Q ss_pred EEchhheEEeccCCC--ccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCC-HHHHHHHHHHHHHH Q lcl|NC_019456. 161 TIPINDVIHVKHVVP--SNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSIS-PEKRQAMVNDFLRM 235 (435) Q Consensus 161 ~~~~~~iih~~~~~~--~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~-~e~~~~~~~~~~~~ 235 (435) +|+++||||||+.++ .++++|+||+..+...+....++++++.++|.|+ +++++++++.+. +++.+++++.|+.. T Consensus 144 ~~~~~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~ 223 (385) T protein:vir:10 144 VLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKA 223 (385) T ss_pred EEccccEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHH Confidence 999999999998653 5678999999999999999999999999999997 678999987764 77899999999765 Q ss_pred h--cCCCccccccCCceeeeccCChhhHHHH-HHHHHHHHHHHHHhCCCHHHhCCcc--cCcccHHHHHHHHHHHHHhHH Q lcl|NC_019456. 236 V--KENGGAVVQEAGWKVDRYESKFEPADLS-SVEQISRIRIATAFNVPISFLNDDQ--AKSTTNVEHVTHSWTMTLMPI 310 (435) Q Consensus 236 ~--~~~~~~~vl~~g~~~~~~~~~~~~~~~~-e~~~~~~~~Ia~~fgvP~~~lg~~~--~~~~~~~e~~~~~~~~~i~P~ 310 (435) . .++|+++|+++|++|++++.++.++|++ |.+++++++||++|||||.+||+.+ .+++++.||+..+|..||.|+ T Consensus 224 ~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~~~~~~l~P~ 303 (385) T protein:vir:10 224 NTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYLANLNSY 303 (385) T ss_pred hCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHHHHHHHHHHH Confidence 5 4578899999999999999999999975 9999999999999999999999765 456778899999999999999 Q ss_pred HHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccc Q lcl|NC_019456. 311 IRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDL 390 (435) Q Consensus 311 ~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~ 390 (435) ++.|+++|+++|+++ +++|+++.+++.|.+++++.+++++++|++|+||+|+++|++|+|.+++|++.++.+. T Consensus 304 ~~~ie~~l~~~l~~~-------~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~~~~~~~~~~~ 376 (385) T protein:vir:10 304 VNPIVDELRLKMNAP-------DLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFKPLTTQ 376 (385) T ss_pred HHHHHHHHHHhhCCc-------eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCCCccccCcccc Confidence 999999999999753 5899999999999999999999999999999999999999999998888888766653 Q ss_pred cchhccccccccccccccccccccccCCCCCCCC Q lcl|NC_019456. 391 YPLDKYYDAILDNKIQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 391 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (435) + ++|++++. T Consensus 377 ~-------------------------~~g~~~dn 385 (385) T protein:vir:10 377 V-------------------------KGGDEGDN 385 (385) T ss_pred c-------------------------CCCCCCCC Confidence 2 12222222 No 54 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=100.00 E-value=1.2e-78 Score=447.68 Aligned_cols=383 Identities=15% Similarity=0.189 Sum_probs=290.6 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) |||++|++.+|+...+...... ..+ ......+..+.++++++|++||++||++||++||+++++++. .. T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~-----~~~-----~~~~~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~~-~~ 69 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQRTLNLTD-----TVW-----CSIPSEKLKELSIKKWAIDSCANKIANTLSCAEVLTYEKGEE-VR 69 (395) T ss_pred CchHHHHHhhhccccccccccc-----chh-----hccccccchhhhhhhHHHHHHHHHHHHHHhhCceeeccCCcc-cc Confidence 9999999999986554433221 111 111223456789999999999999999999999999987655 45 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCC--e Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDI--Y 158 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~--~ 158 (435) |+++++|+.+||++||+++||+.++.+++++||||+++.++. +++.+...+... ......+..+..++ . T Consensus 70 ~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~-------~~~~~~~~~~~~--~~~~~~~~~v~~~~~~~ 140 (395) T protein:vir:40 70 KKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEY-------IYVADSFTKNDK--SLYENTYTEVTLKDLTL 140 (395) T ss_pred chHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCc-------eeecCCcccccc--ccccceeeeeeecCcee Confidence 889999999999999999999999999999999999887542 334333322211 11111222222222 3 Q ss_pred eEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceE--EEeCCcCCHHHHHHHHHHHHHHh Q lcl|NC_019456. 159 NFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFV--LQYDRSISPEKRQAMVNDFLRMV 236 (435) Q Consensus 159 ~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~--~~~~~~~~~e~~~~~~~~~~~~~ 236 (435) .++|+++||||+++.+......+.+.+..+...+.... ....+.++++++ +..+..+++++.+++++.|...+ T Consensus 141 ~~~~~~~evih~r~~~~~~~~~~~~l~~~~~~~~~~~~-----~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 215 (395) T protein:vir:40 141 KKEFKESEVLHLTLNNESIKSIIDGFYLLYGDLLTAAV-----NKYKKLNSRKIIVKLKAMFGQTPEAEEKLRLMLSERM 215 (395) T ss_pred eeeeccccEEEeecCCCCccccchhHHHHHHHHHHHHH-----HHHHhcCCCCceEEEecccCCCHHHHHHHHHHHHHHH Confidence 46799999999997654333333333343333332222 222345565554 44577799999999988886543 Q ss_pred ----cCCCccccccCCceeeeccCChhhHHHHHHHHHHH---HHHHHHhCCCHHHhCCcccCcccHH-HHHHHHHHHHHh Q lcl|NC_019456. 237 ----KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISR---IRIATAFNVPISFLNDDQAKSTTNV-EHVTHSWTMTLM 308 (435) Q Consensus 237 ----~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~---~~Ia~~fgvP~~~lg~~~~~~~~~~-e~~~~~~~~~i~ 308 (435) .++++++++++|++|+++++++.++|++|.++++. ++||++|||||.+||+ +++|. ++...|+++||. T Consensus 216 ~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~~----~~sn~e~~~~~f~~~~L~ 291 (395) T protein:vir:40 216 KKFLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAKG----DTVGLSEQVNSFLMFSIN 291 (395) T ss_pred HHhhccCCceeecCCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC----CCcCHHHHHHHHHHHHHH Confidence 46778999999999999999999999999998874 7899999999999973 34554 556677899999 Q ss_pred HHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecc Q lcl|NC_019456. 309 PIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISK 388 (435) Q Consensus 309 P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~ 388 (435) |++.+|+++|+++||++.++..|++|+||++.+++.|.+++++.+.+++++|+||+||+|+++|+||+++++||++++++ T Consensus 292 P~~~~ie~~l~~kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD~~~~~~ 371 (395) T protein:vir:40 292 PIAEMFTDEGNRKFYGRDSVLERTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQERFVTK 371 (395) T ss_pred HHHHHHHHHHHHhcCChhhhcCCceEEEechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCceeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccchhccccccccccccccccccccccCCCCCCCCCCCC Q lcl|NC_019456. 389 DLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQ 428 (435) Q Consensus 389 n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (435) |++|++...+ ..+||++++++++. T Consensus 372 n~~~~~~~~~----------------~~kgge~~~~~~~~ 395 (395) T protein:vir:40 372 NYAPLGENEE----------------DLKGGDINENKGDS 395 (395) T ss_pred cccccccccc----------------ccCCCCCCCCcCCC Confidence 9998875322 23455544444333 No 55 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=100.00 E-value=2.5e-78 Score=446.07 Aligned_cols=382 Identities=14% Similarity=0.235 Sum_probs=297.5 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhc-cccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec-cccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMA-GVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN-YKQM 78 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~-~~~~ 78 (435) ||||+|+++++........ ...+.. +.....+..++.+.++++++|++||++||+.||+|||+++++ ++++ T Consensus 1 MGl~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g~~~ 73 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAEKRG-------YLDNVLGKSIRYSGVYVTDSNILQSSDVYELLQDISNQMVLADIVVEDEFGNEI 73 (394) T ss_pred CchhhhhhhhccCCCCchh-------hhhhhhhcccccCccccChhhhhccHHHHHHHHHHHHhhcccceEEEcCCCccc Confidence 9999999876543322211 111222 222334456778889999999999999999999999999974 4556 Q ss_pred ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCCe Q lcl|NC_019456. 79 DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDIY 158 (435) Q Consensus 79 ~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 158 (435) .+|+++ .|+.+||++||+++||+.++.+++++||+|+++.++ +..+ + ..+.+..+.++.. .+..+ T Consensus 74 ~~~~~~-~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-----~~~~--~--~~~~~~~~~~~~~---~~~~~-- 138 (394) T protein:vir:62 74 KDDIAL-QILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNGA-----QIHL--A--SNVFTELDDNLVE---HFNIG-- 138 (394) T ss_pred chhhHH-HHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecc-----eeec--c--ccceEEECCceEE---EEeeC-- Confidence 666665 456799999999999999999999999999998643 2222 2 3455555554432 22222 Q ss_pred eEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCC--HHHHHHHHHHHHH Q lcl|NC_019456. 159 NFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSIS--PEKRQAMVNDFLR 234 (435) Q Consensus 159 ~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~--~e~~~~~~~~~~~ 234 (435) .++|++++|+|+|+++ .++++|+||+..+..+|..+.++++++.++|.|| +++++++++.++ +++.++++++|.. T Consensus 139 ~~~~~~~eiih~r~~~-~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~ 217 (394) T protein:vir:62 139 GHEIPPCMIRHVKNIG-ADHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILD 217 (394) T ss_pred CEEechhheEEecCcC-CCCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHH Confidence 3679999999999874 6889999999999999999999999999999997 678999988876 4557888999976 Q ss_pred Hh---cCCCccccccCCc--eeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHH-HHHHHHHHHHh Q lcl|NC_019456. 235 MV---KENGGAVVQEAGW--KVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVE-HVTHSWTMTLM 308 (435) Q Consensus 235 ~~---~~~~~~~vl~~g~--~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e-~~~~~~~~~i~ 308 (435) .. .++|+++|++.|. ++++++.++.++|++|.+++++++||++|||||.+||+... +|.| +...|+++||. T Consensus 218 ~~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~---sn~e~~~~~~~~~~l~ 294 (394) T protein:vir:62 218 QLESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELIK---EDIEKAMMYIHNKAVR 294 (394) T ss_pred HhccccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC---cCHHHHHHHHHHHHHH Confidence 54 4568888887665 66688899999999999999999999999999999986543 4445 56677899999 Q ss_pred HHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecc Q lcl|NC_019456. 309 PIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISK 388 (435) Q Consensus 309 P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~ 388 (435) |++.+|+++|+++||++.+. .+.+|+||...+. +..++++++.+++++|+||+||+|+++|+||+++++||++++++ T Consensus 295 P~~~~ie~~l~~kll~~~~~-~~~~~~fd~~~~~--~~~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~gd~~~~~~ 371 (394) T protein:vir:62 295 PIMKNFEDHLSLLFYAQNSG-KRIKFKINILDFV--TYSNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKESQAIYISN 371 (394) T ss_pred HHHHHHHHHHhhhhcCcccc-CceEEEechhhhc--CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeccc Confidence 99999999999999988664 3566777766655 45578899999999999999999999999999999999999999 Q ss_pred cccchhccccccccccccccccccccccCCCCCCCC Q lcl|NC_019456. 389 DLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 389 n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (435) |++|++..... ..+.+||++++. T Consensus 372 n~~~~~~~~~~-------------~~~~kgge~~en 394 (394) T protein:vir:62 372 DVTEIGKKEAT-------------DGSLGGGEENEN 394 (394) T ss_pred ccccccccccc-------------cccCCCCCCCCC Confidence 99887643111 112334443222 No 56 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=100.00 E-value=5.3e-78 Score=444.21 Aligned_cols=372 Identities=15% Similarity=0.186 Sum_probs=299.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) ||||+++++. .. ......... ....+..+.|+++++|++||++||++||++||++++++.. .. T Consensus 1 Mg~f~~~f~~---~~---~~~~~~~~~----------~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~-~~ 63 (385) T protein:vir:95 1 MGLFDSVFKR---HS---ELSWMYDLE----------FLQDKSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNTK-EK 63 (385) T ss_pred Cchhhhhhcc---Cc---ccccccchh----------hhhccchhhhhhhHHHHHHHHHHHHHHcccceeeeecCcc-cc Confidence 9999998542 11 111111111 1122345678999999999999999999999999987765 46 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCCeeE Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDIYNF 160 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~ 160 (435) |++.++|+.+||++||+++||+.++.+++++||||+++.++ ++.+..++++.+..+...... .+...+...+..+ T Consensus 64 ~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~--~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~ 138 (385) T protein:vir:95 64 GTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDE--GHFFVADDFEKEDELGLYSHR---FTNVLVNDFEFKR 138 (385) T ss_pred chHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEecC--CCeeecccccccccccccccc---ceeeeecccceee Confidence 89999999999999999999999999999999999877654 356666666666665543322 2223333344567 Q ss_pred EEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCC--cCCHHHHHHHHHHHHHHhc- Q lcl|NC_019456. 161 TIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDR--SISPEKRQAMVNDFLRMVK- 237 (435) Q Consensus 161 ~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~--~~~~e~~~~~~~~~~~~~~- 237 (435) +|+++||||++++++.+..+|.||+..+...+..+.+++ .+.++++++++++. .+++++.+++++.|....+ T Consensus 139 ~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~-----~~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~~g 213 (385) T protein:vir:95 139 VFTMDDVIYLKYNNQKLDAFSLGLFEDYGEIFGRMIDLQ-----MLNNQIRGILKVDATKFYNKEKQKELQAYIDTLFDA 213 (385) T ss_pred eeccccEEEecCCCCCcccccchHHHHHHHHHHHHHHHH-----HhcCCCceEEEeCCccCCCHHHHHHHHHHHHHHhhh Confidence 899999999999877777899999999999887655533 45677888888754 5789999999999976543 Q ss_pred ---CCCccccccCCceeeeccC------ChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHH-HHHHHHHHHHH Q lcl|NC_019456. 238 ---ENGGAVVQEAGWKVDRYES------KFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNV-EHVTHSWTMTL 307 (435) Q Consensus 238 ---~~~~~~vl~~g~~~~~~~~------~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~-e~~~~~~~~~i 307 (435) +.++++++++|++|+++++ ++.|+||.|.+++++++||++|||||.+|+ ++++|. ++...|+++|| T Consensus 214 ~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~----~~~sn~e~~~~~~~~~~l 289 (385) T protein:vir:95 214 FQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVL----GEMADLEKTIESYLQFCI 289 (385) T ss_pred hhhcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhc----CCCcCHHHHHHHHHHHHH Confidence 3456889999999999875 567999999999999999999999999996 345555 45677889999 Q ss_pred hHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeec Q lcl|NC_019456. 308 MPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYIS 387 (435) Q Consensus 308 ~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~ 387 (435) .|++.+|+++|+++|+++.++. +.+++||++.+++.|.+++++++.+++++|+||+||+|+++|+||++++|||+++++ T Consensus 290 ~P~~~~ie~~l~~~L~~~~~~~-~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~gd~~~~~ 368 (385) T protein:vir:95 290 NPLLRKIEAELNSKFFYQDEYL-NDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPELDKFIIT 368 (385) T ss_pred HHHHHHHHHHHHhhcCChhhcc-cceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeec Confidence 9999999999999999988764 558999999999999999999999999999999999999999999999999999999 Q ss_pred ccccchhccccccccccccccccccccccCCCCCCCC Q lcl|NC_019456. 388 KDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 388 ~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (435) +|++|++. .+||++++| T Consensus 369 ~n~~~~~~--------------------~kgge~~~e 385 (385) T protein:vir:95 369 KNLQSADA--------------------FKGGESNEE 385 (385) T ss_pred ccceeccc--------------------ccCCCCCCC Confidence 99998764 245555444 No 57 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=100.00 E-value=4.6e-77 Score=439.07 Aligned_cols=379 Identities=17% Similarity=0.205 Sum_probs=307.7 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) ||||++++..-. ................+ ......+.+++.+.++++|+|++||++||++||+||+++++.. T Consensus 1 M~~f~~~~~~~~--~~~~~~~~~~~~~~~~~-~~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~~~----- 72 (386) T protein:vir:48 1 MPIFNITNLATE--SPPISQGGFFDITDPDF-LSTLNGSEWVSAESALRNSDLFSIINQLSNDLATVKLTASRKQ----- 72 (386) T ss_pred Cccccccccccc--ccccccccccccccchh-cccccCCceechhhhhcchHHHHHHHHHHHhhccCceeeccch----- Confidence 999987653211 11111111011111111 1223445567888899999999999999999999999998642 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCC--- Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDI--- 157 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~--- 157 (435) .+.|+.+||++||+.+||+.++.+++++||+|++++++. .|++++|+|++|++|++..+.++..++|.+..++ T Consensus 73 ---~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~-~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~~~~ 148 (386) T protein:vir:48 73 ---LQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNE-NGRDMKWEYLRPSQVSFNRLDNKDGIYYNITFDDPRI 148 (386) T ss_pred ---hHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECC-CCcEEEEEEecCceeEEEEcCCCceEEEEEEecCccc Confidence 456788999999999999999999999999999999874 5899999999999999999988877776665433 Q ss_pred -eeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHH Q lcl|NC_019456. 158 -YNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLR 234 (435) Q Consensus 158 -~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~ 234 (435) ..++|+++||||++++++.+.++|+||+..+...+..+.++++++.++|.|+ +.++++.++.+++|+.+++++.|.. T Consensus 149 ~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e~~~~~~~~~~~ 228 (386) T protein:vir:48 149 PPKQHVPQGDVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKLSRSRQA 228 (386) T ss_pred cceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHH Confidence 4568999999999998877779999999999999999999999999999997 6789999999999999999999999 Q ss_pred HhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccH-HHHHHHHHHHHHhHHHHH Q lcl|NC_019456. 235 MVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTN-VEHVTHSWTMTLMPIIRQ 313 (435) Q Consensus 235 ~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~-~e~~~~~~~~~i~P~~~~ 313 (435) ..+++|+++||++|++|+++++++.++||.|++++++++||++|||||.+||... ++++ +++...|++.||.|++.. T Consensus 229 ~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~--~~~~~e~~~~~~~~~~l~P~~~~ 306 (386) T protein:vir:48 229 MKQMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQG--DQQSSLEMSLDLYNKAVSRYLRP 306 (386) T ss_pred hhcCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC--CcccHHHHHHHHHHHHHHHHHHH Confidence 8999999999999999999999999999999999999999999999999998643 3344 456677889999999999 Q ss_pred HHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccch Q lcl|NC_019456. 314 YESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPL 393 (435) Q Consensus 314 i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l 393 (435) |+++|+++|++. ++++.......|...++..+++++++|++|+||+|+++|++|+. .+|..... T Consensus 307 ie~~l~~~l~~~--------~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~--~~~~~~~~------ 370 (386) T protein:vir:48 307 FLSELSQKLSCD--------VDADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEIL--PKELPEGE------ 370 (386) T ss_pred HHHHHHHhhcch--------hhcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCC--Cccchhhc------ Confidence 999999999863 45667777788888899999999999999999999999999984 35532110 Q ss_pred hccccccccccccccccccccccCCCCCCCCC Q lcl|NC_019456. 394 DKYYDAILDNKIQTDASVAAPKQEGGENTNEN 425 (435) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (435) ....++.+||++++++ T Consensus 371 ----------------~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:48 371 ----------------NPNKTTLKGGEINGED 386 (386) T ss_pred ----------------CCCCCccCCCCCCCCC Confidence 0112233455544433 No 58 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=100.00 E-value=9.1e-77 Score=437.47 Aligned_cols=383 Identities=15% Similarity=0.162 Sum_probs=299.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) ||||+||++. .. .... .. .......++...|+++++|++||++||++||++||++++++. ... T Consensus 1 Mg~f~~lf~~---~~---~~~~-----~~-----~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~-~~~ 63 (395) T protein:vir:95 1 MSILEKIFKT---RK---DITY-----ML-----DLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR-IQK 63 (395) T ss_pred Cchhhhhhcc---Cc---cccc-----cc-----cchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCc-ccc Confidence 9999998642 11 1110 00 111223455677899999999999999999999999998764 557 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCCeeE Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDIYNF 160 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~ 160 (435) |++.++|+.+||++||+++||+.++.++++.|++|+++.++ . .++++++..+++....+.....+.+...+..+ T Consensus 64 ~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~-~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 137 (395) T protein:vir:95 64 NDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS-K-----ELLIADSFYREEYALYDDIFKDVTVKDYTYQR 137 (395) T ss_pred chHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecC-C-----CeEecCCccceeEeecCcceeEEEEcCceeee Confidence 99999999999999999999999999999999988765432 2 36777777777666555555556666666678 Q ss_pred EEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC-CceEEEeCCc-CCHHHHHHHHHHHHHHhc- Q lcl|NC_019456. 161 TIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK-DKFVLQYDRS-ISPEKRQAMVNDFLRMVK- 237 (435) Q Consensus 161 ~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~-~~~~~~~~~~-~~~e~~~~~~~~~~~~~~- 237 (435) +++++||||++++++.+..+|.||+..+...+....+ .++.++ +++++..++. +++++.+++++.|+...+ T Consensus 138 ~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~~------~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~ 211 (395) T protein:vir:95 138 TFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMIG------AQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNT 211 (395) T ss_pred eeccccEEEEccCCCCcccccchHHHHHHHHHHHHHH------HHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhcc Confidence 8999999999998888888999999999888876543 234444 6678877654 688999999998876543 Q ss_pred -CCCc--cccccCCceeeeccCChhhH-----HHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccH-HHHHHHHHHHHHh Q lcl|NC_019456. 238 -ENGG--AVVQEAGWKVDRYESKFEPA-----DLSSVEQISRIRIATAFNVPISFLNDDQAKSTTN-VEHVTHSWTMTLM 308 (435) Q Consensus 238 -~~~~--~~vl~~g~~~~~~~~~~~~~-----~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~-~e~~~~~~~~~i~ 308 (435) ++++ ++++++|++|+++++++.++ ||.|.+++++++||++|||||++||. ++++ +++...|+++||. T Consensus 212 ~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~----~~sn~e~~~~~~~~~~l~ 287 (395) T protein:vir:95 212 FNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG----ETADLEKNTLVFEKFCLT 287 (395) T ss_pred ccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC----cccCHHHHHHHHHHHHHH Confidence 2333 55689999999999887765 89999999999999999999999973 3444 4566778899999 Q ss_pred HHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecc Q lcl|NC_019456. 309 PIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISK 388 (435) Q Consensus 309 P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~ 388 (435) |++.+|+++|+++|+++.++..+ ++|+++.+++.|.+++++++.+++++|++|+||+|+++|+||+++++||++++++ T Consensus 288 P~~~~ie~~l~~kL~~~~~~~~~--~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~ 365 (395) T protein:vir:95 288 PLLKKIQNELNAKLITQSMYLKD--TRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITK 365 (395) T ss_pred HHHHHHHHHHHHhhcChhhhccc--ceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecc Confidence 99999999999999998776554 5789999999999999999999999999999999999999999998999999999 Q ss_pred cccchhccccccccccccccccccccccCCCCCCCCCCC Q lcl|NC_019456. 389 DLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGL 427 (435) Q Consensus 389 n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (435) |+++++.......... ....+||++ +++|+ T Consensus 366 n~~~~~~~~~~~~~~~--------~~~~kgg~~-~~~g~ 395 (395) T protein:vir:95 366 NYEKANSGENDEKEKD--------ENTLKGGDE-DESGD 395 (395) T ss_pred ccccccccccccCccc--------ccccCCCCC-CCCCC Confidence 9999876533221111 112223333 23333 No 59 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=100.00 E-value=9.1e-77 Score=437.47 Aligned_cols=383 Identities=15% Similarity=0.162 Sum_probs=299.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) ||||+||++. .. .... .. .......++...|+++++|++||++||++||++||++++++. ... T Consensus 1 Mg~f~~lf~~---~~---~~~~-----~~-----~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~-~~~ 63 (395) T protein:vir:10 1 MSILEKIFKT---RK---DITY-----ML-----DLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR-IQK 63 (395) T ss_pred Cchhhhhhcc---Cc---cccc-----cc-----cchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCc-ccc Confidence 9999998642 11 1110 00 111223455677899999999999999999999999998764 557 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCCeeE Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDIYNF 160 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~ 160 (435) |++.++|+.+||++||+++||+.++.++++.|++|+++.++ . .++++++..+++....+.....+.+...+..+ T Consensus 64 ~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~-~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 137 (395) T protein:vir:10 64 NDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS-K-----ELLIADSFYREEYALYDDIFKDVTVKDYTYQR 137 (395) T ss_pred chHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecC-C-----CeEecCCccceeEeecCcceeEEEEcCceeee Confidence 99999999999999999999999999999999988765432 2 36777777777666555555556666666678 Q ss_pred EEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC-CceEEEeCCc-CCHHHHHHHHHHHHHHhc- Q lcl|NC_019456. 161 TIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK-DKFVLQYDRS-ISPEKRQAMVNDFLRMVK- 237 (435) Q Consensus 161 ~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~-~~~~~~~~~~-~~~e~~~~~~~~~~~~~~- 237 (435) +++++||||++++++.+..+|.||+..+...+....+ .++.++ +++++..++. +++++.+++++.|+...+ T Consensus 138 ~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~~------~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~ 211 (395) T protein:vir:10 138 TFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMIG------AQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNT 211 (395) T ss_pred eeccccEEEEccCCCCcccccchHHHHHHHHHHHHHH------HHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhcc Confidence 8999999999998888888999999999888876543 234444 6678877654 688999999998876543 Q ss_pred -CCCc--cccccCCceeeeccCChhhH-----HHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccH-HHHHHHHHHHHHh Q lcl|NC_019456. 238 -ENGG--AVVQEAGWKVDRYESKFEPA-----DLSSVEQISRIRIATAFNVPISFLNDDQAKSTTN-VEHVTHSWTMTLM 308 (435) Q Consensus 238 -~~~~--~~vl~~g~~~~~~~~~~~~~-----~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~-~e~~~~~~~~~i~ 308 (435) ++++ ++++++|++|+++++++.++ ||.|.+++++++||++|||||++||. ++++ +++...|+++||. T Consensus 212 ~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~----~~sn~e~~~~~~~~~~l~ 287 (395) T protein:vir:10 212 FNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG----ETADLEKNTLVFEKFCLT 287 (395) T ss_pred ccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC----cccCHHHHHHHHHHHHHH Confidence 2333 55689999999999887765 89999999999999999999999973 3444 4566778899999 Q ss_pred HHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecc Q lcl|NC_019456. 309 PIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISK 388 (435) Q Consensus 309 P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~ 388 (435) |++.+|+++|+++|+++.++..+ ++|+++.+++.|.+++++++.+++++|++|+||+|+++|+||+++++||++++++ T Consensus 288 P~~~~ie~~l~~kL~~~~~~~~~--~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~ 365 (395) T protein:vir:10 288 PLLKKIQNELNAKLITQSMYLKD--TRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITK 365 (395) T ss_pred HHHHHHHHHHHHhhcChhhhccc--ceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecc Confidence 99999999999999998776554 5789999999999999999999999999999999999999999998999999999 Q ss_pred cccchhccccccccccccccccccccccCCCCCCCCCCC Q lcl|NC_019456. 389 DLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGL 427 (435) Q Consensus 389 n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (435) |+++++.......... ....+||++ +++|+ T Consensus 366 n~~~~~~~~~~~~~~~--------~~~~kgg~~-~~~g~ 395 (395) T protein:vir:10 366 NYEKANSGENDEKEKD--------ENTLKGGDE-DESGD 395 (395) T ss_pred ccccccccccccCccc--------ccccCCCCC-CCCCC Confidence 9999876533221111 112223333 23333 No 60 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=100.00 E-value=9.1e-77 Score=437.47 Aligned_cols=383 Identities=15% Similarity=0.162 Sum_probs=299.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) ||||+||++. .. .... .. .......++...|+++++|++||++||++||++||++++++. ... T Consensus 1 Mg~f~~lf~~---~~---~~~~-----~~-----~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~-~~~ 63 (395) T protein:vir:10 1 MSILEKIFKT---RK---DITY-----ML-----DLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR-IQK 63 (395) T ss_pred Cchhhhhhcc---Cc---cccc-----cc-----cchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCc-ccc Confidence 9999998642 11 1110 00 111223455677899999999999999999999999998764 557 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCCeeE Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDIYNF 160 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~ 160 (435) |++.++|+.+||++||+++||+.++.++++.|++|+++.++ . .++++++..+++....+.....+.+...+..+ T Consensus 64 ~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~-~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 137 (395) T protein:vir:10 64 NDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS-K-----ELLIADSFYREEYALYDDIFKDVTVKDYTYQR 137 (395) T ss_pred chHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecC-C-----CeEecCCccceeEeecCcceeEEEEcCceeee Confidence 99999999999999999999999999999999988765432 2 36777777777666555555556666666678 Q ss_pred EEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC-CceEEEeCCc-CCHHHHHHHHHHHHHHhc- Q lcl|NC_019456. 161 TIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK-DKFVLQYDRS-ISPEKRQAMVNDFLRMVK- 237 (435) Q Consensus 161 ~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~-~~~~~~~~~~-~~~e~~~~~~~~~~~~~~- 237 (435) +++++||||++++++.+..+|.||+..+...+....+ .++.++ +++++..++. +++++.+++++.|+...+ T Consensus 138 ~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~~------~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~ 211 (395) T protein:vir:10 138 TFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMIG------AQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNT 211 (395) T ss_pred eeccccEEEEccCCCCcccccchHHHHHHHHHHHHHH------HHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhcc Confidence 8999999999998888888999999999888876543 234444 6678877654 688999999998876543 Q ss_pred -CCCc--cccccCCceeeeccCChhhH-----HHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccH-HHHHHHHHHHHHh Q lcl|NC_019456. 238 -ENGG--AVVQEAGWKVDRYESKFEPA-----DLSSVEQISRIRIATAFNVPISFLNDDQAKSTTN-VEHVTHSWTMTLM 308 (435) Q Consensus 238 -~~~~--~~vl~~g~~~~~~~~~~~~~-----~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~-~e~~~~~~~~~i~ 308 (435) ++++ ++++++|++|+++++++.++ ||.|.+++++++||++|||||++||. ++++ +++...|+++||. T Consensus 212 ~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~----~~sn~e~~~~~~~~~~l~ 287 (395) T protein:vir:10 212 FNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG----ETADLEKNTLVFEKFCLT 287 (395) T ss_pred ccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC----cccCHHHHHHHHHHHHHH Confidence 2333 55689999999999887765 89999999999999999999999973 3444 4566778899999 Q ss_pred HHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecc Q lcl|NC_019456. 309 PIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISK 388 (435) Q Consensus 309 P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~ 388 (435) |++.+|+++|+++|+++.++..+ ++|+++.+++.|.+++++++.+++++|++|+||+|+++|+||+++++||++++++ T Consensus 288 P~~~~ie~~l~~kL~~~~~~~~~--~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~ 365 (395) T protein:vir:10 288 PLLKKIQNELNAKLITQSMYLKD--TRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITK 365 (395) T ss_pred HHHHHHHHHHHHhhcChhhhccc--ceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecc Confidence 99999999999999998776554 5789999999999999999999999999999999999999999998999999999 Q ss_pred cccchhccccccccccccccccccccccCCCCCCCCCCC Q lcl|NC_019456. 389 DLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGL 427 (435) Q Consensus 389 n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (435) |+++++.......... ....+||++ +++|+ T Consensus 366 n~~~~~~~~~~~~~~~--------~~~~kgg~~-~~~g~ 395 (395) T protein:vir:10 366 NYEKANSGENDEKEKD--------ENTLKGGDE-DESGD 395 (395) T ss_pred ccccccccccccCccc--------ccccCCCCC-CCCCC Confidence 9999876533221111 112223333 23333 No 61 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=100.00 E-value=1.3e-76 Score=436.70 Aligned_cols=373 Identities=15% Similarity=0.230 Sum_probs=304.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) ||||+++. |........ ............ .......+++.+.|+++++|++||++||++||++||++++ T Consensus 1 Mg~~~~~~--~~k~~~~~~-~~~~~~~~~~~~-~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~------- 69 (383) T protein:vir:10 1 MGLLTPKN--FSKRNAKNM-VYPSNPAFFTTT-VGGMQLSYVSALSALQNTNVYSVINRIASDVSSAHFKTEN------- 69 (383) T ss_pred CCcccccc--ccccccccc-ccccchhhhhhh-ccCccccccchhHhhcchHHHHHHHHHHHhhccCceeecc------- Confidence 99998742 111111111 111111112111 1222345677888999999999999999999999999864 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCCeeE Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDIYNF 160 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~ 160 (435) |+...+| .+||++||+++||+.++.+++++||||++++++ +.+++|+++.+|++..+.++..+++....++..+ T Consensus 70 ~~~~~ll-~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~-----~~~~~p~~~~~v~~~~~~~~~~~~~~~~~~~~~~ 143 (383) T protein:vir:10 70 TATLNRL-ESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ-----NLEHIPNSDVQINYLPGNMGIVYTVLESNDRPKM 143 (383) T ss_pred cchhhhh-hCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC-----ceeEeecCcceEEEEEcCCceEEEEEEcCCceEE Confidence 3444554 599999999999999999999999999998864 4679999999999999888877777777788889 Q ss_pred EEchhheEEeccCCC--ccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCC-HHHHHHHHHHHHHH Q lcl|NC_019456. 161 TIPINDVIHVKHVVP--SNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSIS-PEKRQAMVNDFLRM 235 (435) Q Consensus 161 ~~~~~~iih~~~~~~--~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~-~e~~~~~~~~~~~~ 235 (435) +|+++||||||+.++ .++++|+||+.++...+....++++++.++|.|+ +.+++++++.++ +++.+++++.|+.. T Consensus 144 ~~~~~evih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~~~ 223 (383) T protein:vir:10 144 VLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKA 223 (383) T ss_pred EEcccceEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHH Confidence 999999999997654 4568999999999999999999999999999997 568999998774 78889999999765 Q ss_pred h--cCCCccccccCCceeeeccCChhhHHH-HHHHHHHHHHHHHHhCCCHHHhCCcc--cCcccHHHHHHHHHHHHHhHH Q lcl|NC_019456. 236 V--KENGGAVVQEAGWKVDRYESKFEPADL-SSVEQISRIRIATAFNVPISFLNDDQ--AKSTTNVEHVTHSWTMTLMPI 310 (435) Q Consensus 236 ~--~~~~~~~vl~~g~~~~~~~~~~~~~~~-~e~~~~~~~~Ia~~fgvP~~~lg~~~--~~~~~~~e~~~~~~~~~i~P~ 310 (435) . .|+|+++|+++|++|++++.++.++|+ .+++++++++||++|||||.+||+.+ ..++++.||+..+|..||.|+ T Consensus 224 ~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~~~~~~~l~P~ 303 (383) T protein:vir:10 224 NTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYLANLNSY 303 (383) T ss_pred hCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHHHHHHHHHHHH Confidence 4 467899999999999999999999997 58999999999999999999999765 456778899998998999999 Q ss_pred HHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccc Q lcl|NC_019456. 311 IRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDL 390 (435) Q Consensus 311 ~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~ 390 (435) ++.|+++|+++|+. .+++||++.+++.|.+++++++.+++++|+||+||+|+++|++|++ +||.+....+. T Consensus 304 ~~~ie~~l~~~l~~-------~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~--~~d~~~~~~~~ 374 (383) T protein:vir:10 304 VNPIVDELRLKMNA-------PDLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFL--PDNLPEFKPLT 374 (383) T ss_pred HHHHHHHHHHhhCC-------ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccc--CCcccccCCCc Confidence 99999999999974 3689999999999999999999999999999999999999999994 66654332221 Q ss_pred cchhccccccccccccccccccccccCCCCCC Q lcl|NC_019456. 391 YPLDKYYDAILDNKIQTDASVAAPKQEGGENT 422 (435) Q Consensus 391 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 422 (435) ++.+|||+. T Consensus 375 -----------------------~~~~gGd~e 383 (383) T protein:vir:10 375 -----------------------NETKGGDDK 383 (383) T ss_pred -----------------------ccCCCCCCC Confidence 112233332 No 62 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=100.00 E-value=5.1e-76 Score=433.35 Aligned_cols=379 Identities=17% Similarity=0.234 Sum_probs=312.5 Q ss_pred CchHHHHHhhccccccccccccccchhhhh-hccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLD-MAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMD 79 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 79 (435) ||||+++++. +.+............. ........+..++.+.++++++|++||++||++||++|+++++.. T Consensus 1 M~~f~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~~~~~~---- 72 (386) T protein:vir:49 1 MPIFNITNLA----TESPPINQESFFDIADSDFLASLNSSEWVSAENALKNSDLFSIISQLSNDLATAKITTSRKQ---- 72 (386) T ss_pred CchhhhhccC----CCCcccchhhhhhhhhccccccccCCceechhhhhccHHHHHHHHHHHHHhhhCceeeccch---- Confidence 9999987542 1111111111111111 111223345567888899999999999999999999999998754 Q ss_pred cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEec---- Q lcl|NC_019456. 80 NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTS---- 155 (435) Q Consensus 80 ~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~---- 155 (435) ...|+.+||++||+++||+.++.+++++||||++|+++. .|++++|||++|++|++..+.++..++|.+.. T Consensus 73 ----~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~-~g~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~~~ 147 (386) T protein:vir:49 73 ----LQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRND-NGRDMKWEYLRPSQVSFNRLDNQNGLYYNITFDDPH 147 (386) T ss_pred ----hhhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECC-CCcEEEEEEecCceeEEEEcCCCceEEEEEEEcCcc Confidence 235788999999999999999999999999999999875 58999999999999999998887766665542 Q ss_pred CCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHH Q lcl|NC_019456. 156 DIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFL 233 (435) Q Consensus 156 ~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~ 233 (435) .+..++|+++||||++++++.+.++|+|++.++...+....++++++.++|+|+ +++++++++.+++++.+++++.|. T Consensus 148 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~ 227 (386) T protein:vir:49 148 IAPKQHVPQNDILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDFKTKVSRSRQ 227 (386) T ss_pred ccceeEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHHHHHHHHHH Confidence 345678999999999998777779999999999999999999999999999997 678999999999999999999999 Q ss_pred HHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhHHHHH Q lcl|NC_019456. 234 RMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMPIIRQ 313 (435) Q Consensus 234 ~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P~~~~ 313 (435) ...+++|+++|+++|++|++++.++.++|+.|++++++++||++|||||.+||+.. +++++.++...+|.++|.|++.. T Consensus 228 ~~~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~~~~~~~~~~~~~i~~~l~~ 306 (386) T protein:vir:49 228 AMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDG-DQQSSLEMIYNIYFKSVSRYLRP 306 (386) T ss_pred HhccCCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-CccchHHHHHHHHHHHHHHHHHH Confidence 88899999999999999999999999999999999999999999999999999744 45567778888999999999999 Q ss_pred HHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccch Q lcl|NC_019456. 314 YESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPL 393 (435) Q Consensus 314 i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l 393 (435) ++++|+++|+. +++|+.+.+.+.|...++..+.+++++|++|+||+|++++..++. ..+.. . T Consensus 307 i~~~~~~~l~~--------~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~--~~~~~---~----- 368 (386) T protein:vir:49 307 FVSEMSKKLSC--------EVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEIL--PKELP---D----- 368 (386) T ss_pred HHHHHHHHhcc--------hhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCC--CCcCc---c----- Confidence 99999999863 478999999999999999999999999999999999999866542 11111 0 Q ss_pred hccccccccccccccccccccccCCCCCCCCC Q lcl|NC_019456. 394 DKYYDAILDNKIQTDASVAAPKQEGGENTNEN 425 (435) Q Consensus 394 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (435) ......++.+|||+++++ T Consensus 369 --------------~~~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:49 369 --------------GKNPNRTSLKGGEINEQD 386 (386) T ss_pred --------------hhccCCCCCCCCCCCCCC Confidence 011122445677766655 No 63 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=100.00 E-value=1.3e-74 Score=425.61 Aligned_cols=427 Identities=11% Similarity=0.074 Sum_probs=299.4 Q ss_pred CchHHHHHhhcccccc------------ccccccccchhhhhhc--cccc---------------cCcccccHHHHhhhH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQ------------ANQIVQNPIPQPLDMA--GVKL---------------EQATFSREHILESNE 51 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~--~~~~---------------~~~~~~~~~~~~~~~ 51 (435) |++-+--++....... ..+..+......-... +..+ .....++.-...... T Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~iv~~~i~~~~~ 106 (574) T protein:vir:80 27 MHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKKFGNNIILNAIINTRSN 106 (574) T ss_pred cccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCCcccHHHHHHhhccChhHHHHHHHHHH Confidence 5443211111110000 0000000000000000 0000 111223333445567 Q ss_pred HHHHHHHHHHHHHhhCceeeeecc--------cccccchHHHhhhc---cccccC-CHHHHHHHHHHHHHhcCCcceEEe Q lcl|NC_019456. 52 YIFSIVTRLSNVLASLPLHEYQNY--------KQMDNEPLADLLKT---SPNPNM-TAFEFIARLETDRNVSGNGYAWIQ 119 (435) Q Consensus 52 ~v~~~i~~ia~~ia~~~~~~~~~~--------~~~~~~~l~~~l~~---~Pn~~~-~~~~f~~~~~~~~~~~G~~~~~i~ 119 (435) .|++|+.+|+.++|++||+|+.++ .....|++..+|.. .|||++ ++.+|++.++.+++++||+|++++ T Consensus 107 ~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~ 186 (574) T protein:vir:80 107 QVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKV 186 (574) T ss_pred HHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEE Confidence 788899999999999999998642 23445777777653 456665 788999999999999999999999 Q ss_pred eeCCCCcEEEEEEeCCceeEEEEcCCCc-----eEEEEEecCCeeEEEchhheEEeccCCCc---cccccCcHHHHHHHH Q lcl|NC_019456. 120 KSLSTGEPIALWPLDPNTVSILRNTDNN-----SYWYRVTSDIYNFTIPINDVIHVKHVVPS---NSWYGVSPIDVLSSS 191 (435) Q Consensus 120 ~~~~~g~~~~l~~l~~~~v~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~iih~~~~~~~---~~~~G~s~l~~~~~~ 191 (435) ++. .|+|++||||+|.+|++..+.++. ..||++..++....|+++||||++++... ++.+|+||+..+... T Consensus 187 r~~-~G~~~~L~pl~p~~V~v~~d~~~~~~~~~~~y~~~~~g~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi~~a~~~ 265 (574) T protein:vir:80 187 FDK-DGNFIKFDTVDPTTIFLATNGEGKLIKNGERFVQVIDNRIVAKFNERELAFAVRNPRADIEVGQYGYPELEIALKQ 265 (574) T ss_pred ECC-CCcEEEEEEEcCceeEEEEcCccccccCceEEEEEeCCceEEEEccccEEEEeccCCCCcccccccccHHHHHHHH Confidence 874 489999999999999999887653 45677777778889999999999976433 467899999999999 Q ss_pred HHHHHHHHHHHHHHhhcC--CceEEEeC--CcCCHHHHHHHHHHHHHHh---cCCCcc-ccccCCceeeeccCChhhHHH Q lcl|NC_019456. 192 LKFQRSVENFSQNEMEKK--DKFVLQYD--RSISPEKRQAMVNDFLRMV---KENGGA-VVQEAGWKVDRYESKFEPADL 263 (435) Q Consensus 192 i~~~~~~~~~~~~~~~n~--~~~~~~~~--~~~~~e~~~~~~~~~~~~~---~~~~~~-~vl~~g~~~~~~~~~~~~~~~ 263 (435) |..+.++++++.++|.|| ++++++++ ..+++++.+++++.|...+ .|+|++ +++++|++|+++++++.|+|| T Consensus 266 i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s~~D~qf 345 (574) T protein:vir:80 266 FIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPSANDMQF 345 (574) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCChhHHHH Confidence 999999999999999997 67888875 4489999999999997654 567776 455789999999999999999 Q ss_pred HHHHHHHHHHHHHHhCCCHHHhCCcccC----------cccHHHH-HHHHHHHHHhHHHHHHHHHHHHhhcccccccCcc Q lcl|NC_019456. 264 SSVEQISRIRIATAFNVPISFLNDDQAK----------STTNVEH-VTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGF 332 (435) Q Consensus 264 ~e~~~~~~~~Ia~~fgvP~~~lg~~~~~----------~~~~~e~-~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~ 332 (435) +|++++++++||++|||||.+||..+.+ +++|.|+ ...|+++||.|++..|+++|+++|++..+ .++ T Consensus 346 le~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~--~~~ 423 (574) T protein:vir:80 346 EKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAEFG--EKY 423 (574) T ss_pred HHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcC--Cce Confidence 9999999999999999999999986654 3466665 45677999999999999999999997644 457 Q ss_pred eeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccccccccccccccc Q lcl|NC_019456. 333 YFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVA 412 (435) Q Consensus 333 ~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~ 412 (435) +++|+..++...+. ++. +..++.+|+||+||+|+++|+||+ +|||++++++|+++++.................. T Consensus 424 ~~~f~~~d~~~~~~--~~~-~~~~~~~G~lT~NE~R~~lgl~Pi--~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~ 498 (574) T protein:vir:80 424 QFQFRGGDLSAQLD--KLK-IIEQEGKVFRTVNEIRHDKGLEPI--KGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLN 498 (574) T ss_pred EEEecccchhhHHH--HHH-HHHHHhCCccCHHHHHHHhCCCCC--CCCCEeeeccceeecccccccccCCccchhcccc Confidence 88888777654432 222 345788999999999999999999 6899999999999998654333222211111111 Q ss_pred ccccCCC--CCCCCCCCCC-CCCCCC Q lcl|NC_019456. 413 APKQEGG--ENTNENGLQS-TEPEGS 435 (435) Q Consensus 413 ~~~~~~~--~~~~~~~~~~-~~~~~~ 435 (435) .+....+ ++.++...++ .+.|.+ T Consensus 499 ~~~~~~~~~~~~~~~~~p~~~~~d~~ 524 (574) T protein:vir:80 499 RLLELSGGDVEQPEPEEPKDSQNDTD 524 (574) T ss_pred ccccccCCCCCCCCCCCCCCcccccc Confidence 1111111 1111111100 011111 No 64 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=100.00 E-value=6.4e-76 Score=432.82 Aligned_cols=357 Identities=14% Similarity=0.156 Sum_probs=282.4 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc----- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY----- 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~----- 75 (435) ||||+++.++.+.... .... .. ..+.....++..++|++||++||++||++||++++.. T Consensus 1 Mg~f~~~~~~~~~~~~-~~~~-----~~----------~~~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~ 64 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLN-NDTQ-----RV----------TAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVG 64 (378) T ss_pred CCccccchhccccccc-CCcc-----ee----------eeeccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCcc Confidence 9999998764322211 1100 00 1112233466788999999999999999999987632 Q ss_pred ----cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEE Q lcl|NC_019456. 76 ----KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWY 151 (435) Q Consensus 76 ----~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~ 151 (435) ....+|+++++|+.+||++||+++||+.++.+++++||+|+++++++..|++..++|.. T Consensus 65 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~l~p~~----------------- 127 (378) T protein:vir:94 65 SDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLDLLFAD----------------- 127 (378) T ss_pred cccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEEEEecC----------------- Confidence 23457999999999999999999999999999999999999998888778877776532 Q ss_pred EEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCHHHHHHHHHH Q lcl|NC_019456. 152 RVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISPEKRQAMVND 231 (435) Q Consensus 152 ~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~ 231 (435) ..++|+++||||++++ .++..|+|+++.+...+.... ..+.+++++++++.+++++.++++++ T Consensus 128 ------~~~~~~~~diiH~~~~--~~~~~g~s~l~~~~~~i~~~~---------~~~~~~gil~~~~~l~~~~~~~~~~~ 190 (378) T protein:vir:94 128 ------DKKEYKPEELVRLTSP--FYINEDTSILDNALASIQTKL---------EQGKLRGLLKINAFLDIDNTQEYREK 190 (378) T ss_pred ------CeeEeeeeeeEEecCc--CCccchhHHHHHHHHHHHHHH---------hcccccceeeeCCcCCHHHHHHHHHH Confidence 2346889999999964 567789999988887764322 12347789999999999988888877 Q ss_pred HHHHh------cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHH Q lcl|NC_019456. 232 FLRMV------KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTM 305 (435) Q Consensus 232 ~~~~~------~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~ 305 (435) |...+ .+++++++|++|++|++++.++.++|+ +.+++++++||++|||||.+|+. + ++.++...||++ T Consensus 191 ~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVP~~~l~~----~-~se~~~~~f~~~ 264 (378) T protein:vir:94 191 ALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG----T-ASQEQQIYFYNS 264 (378) T ss_pred HHHHHHHhhcccccccceecCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC----C-hHHHHHHHHHHH Confidence 75543 356789999999999999999999997 55689999999999999999953 2 234677788999 Q ss_pred HHhHHHHHHHHHHHHhhcccccccCcc------eeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCc Q lcl|NC_019456. 306 TLMPIIRQYESQFNMKLFTPGKRVKGF------YFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDE 379 (435) Q Consensus 306 ~i~P~~~~i~~~l~~~l~~~~~~~~g~------~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~ 379 (435) ||.|++.+|+++|+++||++.++..|. .++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||+ | T Consensus 265 tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~--~ 342 (378) T protein:vir:94 265 TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPI--E 342 (378) T ss_pred HHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--C Confidence 999999999999999999998776664 37899999999999999999999999999999999999999999 5 Q ss_pred CCceeeecccccchhccccccccccccccccccccccCCCCCCCC Q lcl|NC_019456. 380 AADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 380 ~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (435) |||++++++|++|++...+....+. ...++++++++ T Consensus 343 gGD~~~~~~n~~~~~~~~~~~~~~~---------~~~~~~e~~n~ 378 (378) T protein:vir:94 343 GGDVYIANLNAVAVKNLSDLQGSRK---------DVTSTDETNNQ 378 (378) T ss_pred CCCeeeecccccccccchhhcCCcC---------CCCCCCCCCCC Confidence 7999999999999887644422211 11222333333 No 65 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=100.00 E-value=1.5e-74 Score=425.39 Aligned_cols=427 Identities=10% Similarity=0.020 Sum_probs=297.5 Q ss_pred CchHHHHH-------hhcccccc-----------------------ccccccccchhhhhhc---cccccCccc----cc Q lcl|NC_019456. 1 MSFMSKVR-------QFFGVHDQ-----------------------ANQIVQNPIPQPLDMA---GVKLEQATF----SR 43 (435) Q Consensus 1 Mg~~~~~~-------~~~~~~~~-----------------------~~~~~~~~~~~~~~~~---~~~~~~~~~----~~ 43 (435) ||||++++ .++..... .....+...+. ..+. ...+...++ .. T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~-~~~~~~~~~r~~~~~~~~l~~~ 83 (551) T protein:vir:80 5 LGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGS-MSANPGFKTKPSIRNNQDLHGV 83 (551) T ss_pred hhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccc-eecCcccccCccccChhHHHHH Confidence 99999987 22210000 00000000000 0111 111111111 11 Q ss_pred HHHHhhhHHHHHHHHHHHHHHhhCc-----------eeeeecc---cccc----cchHHHhhhcccccc-----CCHHHH Q lcl|NC_019456. 44 EHILESNEYIFSIVTRLSNVLASLP-----------LHEYQNY---KQMD----NEPLADLLKTSPNPN-----MTAFEF 100 (435) Q Consensus 44 ~~~~~~~~~v~~~i~~ia~~ia~~~-----------~~~~~~~---~~~~----~~~l~~~l~~~Pn~~-----~~~~~f 100 (435) .+.+..+|+|++||+.||+.||+++ |.+.-+. +... .......++.+||+. +|+.+| T Consensus 84 ~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~~f 163 (551) T protein:vir:80 84 LKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSF 163 (551) T ss_pred HHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchHHHH Confidence 2346678999999999999999854 4332111 1111 111233345689887 488899 Q ss_pred HHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCc-----eEEEEEecCCeeEEEchhheEEeccCCC Q lcl|NC_019456. 101 IARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNN-----SYWYRVTSDIYNFTIPINDVIHVKHVVP 175 (435) Q Consensus 101 ~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~iih~~~~~~ 175 (435) ++.++.+++++||+|++++++. .|+|++||||+|.+|++..+.+|. ..|+++..++....|+++||||+++++. T Consensus 164 ~~~lv~dlll~Gnay~~i~rd~-~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~eiiH~~~n~~ 242 (551) T protein:vir:80 164 VKKIVRDTYMYDQVNFEKVFNR-NQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAVRNPR 242 (551) T ss_pred HHHHHHHHHhcCCEEEEEEECC-CCcEEEEEEeCCceeEEEECCccccccCceEEEEEeCCcEEEEEcccceEEecccCC Confidence 9999999999999999999875 589999999999999999888774 3455555666677899999999997543 Q ss_pred ---ccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCC--cCCHHHHHHHHHHHHHHh---cCCCccccc Q lcl|NC_019456. 176 ---SNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDR--SISPEKRQAMVNDFLRMV---KENGGAVVQ 245 (435) Q Consensus 176 ---~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~--~~~~e~~~~~~~~~~~~~---~~~~~~~vl 245 (435) .++.+|+||+.++...|..+.++++++.++|.|| ++++|++++ .+++++.+++++.|.... +|+|+++++ T Consensus 243 ~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl 322 (551) T protein:vir:80 243 SDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVV 322 (551) T ss_pred CCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCccccc Confidence 3467899999999999999999999999999997 578888754 489999999999997754 577887665 Q ss_pred -cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccC----------cccHHHHHH-HHHHHHHhHHHHH Q lcl|NC_019456. 246 -EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAK----------STTNVEHVT-HSWTMTLMPIIRQ 313 (435) Q Consensus 246 -~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~----------~~~~~e~~~-~~~~~~i~P~~~~ 313 (435) ++|++|+++++++.++||+|++++++++||++|||||.+||....+ +++|.+++. .|+++||+|++.. T Consensus 323 ~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ 402 (551) T protein:vir:80 323 SAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGF 402 (551) T ss_pred cCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHH Confidence 6899999999999999999999999999999999999999975543 567776654 6779999999999 Q ss_pred HHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccch Q lcl|NC_019456. 314 YESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPL 393 (435) Q Consensus 314 i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l 393 (435) |+++|+++|++... ..+.|+++.+...+..+++++++ ++..|+||+||+|+++|+||. .+|||+++.+.++.++ T Consensus 403 ie~~ln~~L~~~~~----~~~~f~f~~~~~~~~~~~~~~~~-~~~~g~lT~NE~R~~~gl~P~-~egGD~~~~~~~~~~~ 476 (551) T protein:vir:80 403 IEDFINKHIVAEFG----DKYTFQFVGGDIKSELESVKILA-EKAKVAMTVNEVRKELNLPGD-VIGGDIPLNGVIVQRI 476 (551) T ss_pred HHHHHHhhhccccC----CceEEEeeccChhhHHHHHHHHH-HHhcCCcCHHHHHHHhCCCCC-CCCCceeecccccccc Confidence 99999999987542 34566667777888888887665 667899999999999999983 2799999999998877 Q ss_pred hcccccccccccc-----------cc-ccccccccCCCCC---CCCCCCCCCCCC-CC Q lcl|NC_019456. 394 DKYYDAILDNKIQ-----------TD-ASVAAPKQEGGEN---TNENGLQSTEPE-GS 435 (435) Q Consensus 394 ~~~~~~~~~~~~~-----------~~-~~~~~~~~~~~~~---~~~~~~~~~~~~-~~ 435 (435) +........+... .. .....++.++.+. ++.+++....++ ++ T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~ 534 (551) T protein:vir:80 477 GQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNA 534 (551) T ss_pred cccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCccccCCCccccccccCcccc Confidence 6543222111000 00 0001111111111 111111111111 11 No 66 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=100.00 E-value=9.4e-76 Score=431.92 Aligned_cols=356 Identities=15% Similarity=0.174 Sum_probs=281.5 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccc---- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYK---- 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~---- 76 (435) ||||+++.++... ....... .. ..+.....++..++|++||++||++||++||++++... T Consensus 1 Mg~f~~~~~f~~~-~~~~~~~-----~~----------~~~~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~ 64 (378) T protein:vir:93 1 MNLFGKVVSFSRG-KLNNDTQ-----RV----------TAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVG 64 (378) T ss_pred Cccchhhhhhhcc-ccCCCcc-----ee----------eecccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccccc Confidence 9999999764222 1111110 00 11122334668889999999999999999999887432 Q ss_pred -----ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEE Q lcl|NC_019456. 77 -----QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWY 151 (435) Q Consensus 77 -----~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~ 151 (435) ...+|+++++|+.+||++||+++||+.++.+++++||+|+++++++..|++..++|.. T Consensus 65 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~l~~~~----------------- 127 (378) T protein:vir:93 65 SDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFAD----------------- 127 (378) T ss_pred cccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEecC----------------- Confidence 2356999999999999999999999999999999999999999887777777666532 Q ss_pred EEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC-CceEEEeCCcCCHHHHHHHHH Q lcl|NC_019456. 152 RVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK-DKFVLQYDRSISPEKRQAMVN 230 (435) Q Consensus 152 ~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~-~~~~~~~~~~~~~e~~~~~~~ 230 (435) ...+|+++||||++.+ .++..|.|++..+...+. .++.++ +++++++++.+++++.+++++ T Consensus 128 ------~~~~~~~~diih~r~~--~~~~~~~s~l~~~~~~i~----------~~~~~~~~~g~l~~~~~l~~~~~~~~~~ 189 (378) T protein:vir:93 128 ------DKKEYKTEELVRLTSP--FYINEDTSILDNALASIQ----------TKLEQGKLRGLLKINAFLDIDNTQEYRE 189 (378) T ss_pred ------CeeEeccceeEEecCc--cccchhhHHHHHHHHHHH----------HHHhcCcccceeeeCCcCCHHHHHHHHH Confidence 2346889999999954 567778999887766553 234444 789999999999998888888 Q ss_pred HHHHHhc------CCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHH Q lcl|NC_019456. 231 DFLRMVK------ENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWT 304 (435) Q Consensus 231 ~~~~~~~------~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~ 304 (435) +|....+ +++++++|++|++|++++.++.++|+ +.+++++++||++|||||.+|++ ..+.++...|++ T Consensus 190 ~~~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~g-----~~~e~~~~~f~~ 263 (378) T protein:vir:93 190 KALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG-----TATQEQQIYFYN 263 (378) T ss_pred HHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC-----CcHHHHHHHHHH Confidence 8765432 56789999999999999999999997 55689999999999999999953 223466778889 Q ss_pred HHHhHHHHHHHHHHHHhhcccccccCcc------eeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCC Q lcl|NC_019456. 305 MTLMPIIRQYESQFNMKLFTPGKRVKGF------YFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPD 378 (435) Q Consensus 305 ~~i~P~~~~i~~~l~~~l~~~~~~~~g~------~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~ 378 (435) +||.|++.+|+++|+++||++.++..++ .++||++.+++.|.+++++++.+++++|++|+||+|+++|+||+ T Consensus 264 ~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~-- 341 (378) T protein:vir:93 264 STIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPI-- 341 (378) T ss_pred HHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-- Confidence 9999999999999999999998776664 48899999999999999999999999999999999999999999 Q ss_pred cCCceeeecccccchhccccccccccccccccccccccCCCCCCCC Q lcl|NC_019456. 379 EAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 379 ~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (435) ||||++++++|++|++...+....+... .++++++++ T Consensus 342 ~ggD~~~~~~n~~~~~~~~~~~~~~~~~---------~~~~e~~n~ 378 (378) T protein:vir:93 342 EGGDVYIANLNAVAVKNLSDLQGSRKDV---------TSTDETNNQ 378 (378) T ss_pred CCCCeeeeccccccccchhhhcCccCCC---------CCCCCCCCC Confidence 5799999999999998765443222111 122222222 No 67 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=100.00 E-value=1.9e-75 Score=430.17 Aligned_cols=357 Identities=15% Similarity=0.160 Sum_probs=281.3 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccc---- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYK---- 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~---- 76 (435) ||||+++.++...... .... .. ..+.....++..++|++||++||++||+|||++++..+ T Consensus 1 Mg~f~~~~~~~~~~~~-~~~~-----~~----------~~~~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~ 64 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLN-NDTQ-----RV----------TAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVG 64 (378) T ss_pred Cccchhhhhhhccccc-CCcc-----ee----------eecccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccccc Confidence 9999998764322211 1100 00 11222334668889999999999999999999986432 Q ss_pred -----ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEE Q lcl|NC_019456. 77 -----QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWY 151 (435) Q Consensus 77 -----~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~ 151 (435) ...+|+++++|+.+||++||+++||+.++.+++++||+|+++++++..|++..++|.. T Consensus 65 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~~l~~~~----------------- 127 (378) T protein:vir:16 65 SDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFAD----------------- 127 (378) T ss_pred cccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEecC----------------- Confidence 2457999999999999999999999999999999999999999987777776666532 Q ss_pred EEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCHHHHHHHHHH Q lcl|NC_019456. 152 RVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISPEKRQAMVND 231 (435) Q Consensus 152 ~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~ 231 (435) ..+.|+++||||+|.+ .++..|.|++..+...+... +..+.++++++.++.+++++.++++++ T Consensus 128 ------~~~~~~~~diih~r~~--~~~~~~~s~l~~~~~~i~~~---------~~~~~~~g~l~~~~~l~~~~~~~~~~~ 190 (378) T protein:vir:16 128 ------DKKEYKPEELVRLTSP--FYINEDTSILDNALASIQTK---------LEQGKLRGLLKINAFLDIDNTQEYREK 190 (378) T ss_pred ------CeeEecccceEEecCc--cCccchhHHHHHHHHHHHHH---------HhcCccceeeEeCCcCCHHHHHHHHHH Confidence 2346889999999964 46677889888777655321 223447899999999999888887777 Q ss_pred HHHHh------cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHH Q lcl|NC_019456. 232 FLRMV------KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTM 305 (435) Q Consensus 232 ~~~~~------~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~ 305 (435) |.... .+++++++|++|++|+++++++.++|+.+ .++++++||++|||||.+|++ .++.++...||++ T Consensus 191 ~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~-~~~~~~~Ia~~fgVPp~~l~g-----~~~e~~~~~f~~~ 264 (378) T protein:vir:16 191 ALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDE-IDLIKSELLTGYFMNENILLG-----TASQEQQIYFYNS 264 (378) T ss_pred HHHHHHHhhcccccccceEcCCCceEEEccCChhhhhHHH-HHHHHHHHHHHhCCCHHHhcC-----CchHHHHHHHHHH Confidence 76543 35678999999999999999999999755 578999999999999999953 2234667788899 Q ss_pred HHhHHHHHHHHHHHHhhcccccccCcc------eeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCc Q lcl|NC_019456. 306 TLMPIIRQYESQFNMKLFTPGKRVKGF------YFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDE 379 (435) Q Consensus 306 ~i~P~~~~i~~~l~~~l~~~~~~~~g~------~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~ 379 (435) ||.|++.+|+++|+++||++.++..+. .++|+++.+++.|.+++++++.+++++|++|+||+|+++|+||+ | T Consensus 265 tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~--~ 342 (378) T protein:vir:16 265 TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPI--E 342 (378) T ss_pred HHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--C Confidence 999999999999999999988776654 47899999999999999999999999999999999999999999 5 Q ss_pred CCceeeecccccchhccccccccccccccccccccccCCCCCCCC Q lcl|NC_019456. 380 AADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 380 ~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (435) |||++++|+|++|++...+....+.. ..++++++++ T Consensus 343 ggD~~~~~~n~~~~~~~~~~~~~~~~---------~~~~~e~~ne 378 (378) T protein:vir:16 343 GGDVYIANLNAVAVKNLSDLQGSRKD---------VTSTDETNNQ 378 (378) T ss_pred CCCeEeeccccccccchhhhcCccCC---------CCCCCCCCCC Confidence 89999999999999876544322111 1223333333 No 68 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=100.00 E-value=7e-75 Score=427.12 Aligned_cols=373 Identities=17% Similarity=0.237 Sum_probs=300.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) ||||+|+... ...+........... ..........++.+.++++++|++||++||++||++||++++... T Consensus 1 Mg~f~~~~~~---~~~~~~~~~~~~~~~---~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~---- 70 (382) T protein:vir:48 1 MPIFNLATES---PPDNQGGFFDVVDSD---FLASLKGNEWVSAETALRNSDLFSIINQLSNDLATVKLITSRKKL---- 70 (382) T ss_pred CccccccccC---Ccccccccccchhhh---ccccccCCcccchHhhhccHHHHHHHHHHHHhhccCceeeecchh---- Confidence 9999987532 111111111111111 112233445677888999999999999999999999999987642 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCC--- Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDI--- 157 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~--- 157 (435) ..|+.+||++||+++||+.++.+|+++||||++++++ ..|++++|+||+|++|++..+.++..++|.+..++ T Consensus 71 ----~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd-~~G~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~~~~ 145 (382) T protein:vir:48 71 ----QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRN-ENGRDMKWEYLRPSQVSFNRLDNKDGIYYNITFDDPRI 145 (382) T ss_pred ----hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEEC-CCCcEEEEEEEcCceeEEEEcCCCCeEEEEEEecCccc Confidence 3578999999999999999999999999999999987 45899999999999999999888877777665443 Q ss_pred -eeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHH Q lcl|NC_019456. 158 -YNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLR 234 (435) Q Consensus 158 -~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~ 234 (435) ..+.|+++||||++++++.+.++|.||+.++..++....+++++..++|.|+ +++++++++.+++|+.++++++|.. T Consensus 146 ~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~ 225 (382) T protein:vir:48 146 PPKQHVPQNDVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLDFKTKLSRSRQA 225 (382) T ss_pred cceeEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHh Confidence 4568999999999998877779999999999999999999999999999997 6789999999999999999999998 Q ss_pred HhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhHHHHHH Q lcl|NC_019456. 235 MVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMPIIRQY 314 (435) Q Consensus 235 ~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P~~~~i 314 (435) ..+++|+++|+++|++|++++.++.++|+.|.+++.+++||++|||||.+||+...++ +++++...|++.+|.|+++.| T Consensus 226 ~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~-~~~~~~~~~~~~~l~p~~~~i 304 (382) T protein:vir:48 226 MKQMQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQ-SSLEMSSDLYSKAVSRYLRPF 304 (382) T ss_pred hccCCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc-cHHHHHHHHHHHHHHHHHHHH Confidence 8899999999999999999999999999999999999999999999999999866543 566778889999999999999 Q ss_pred HHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCC---CCcCCceeeeccccc Q lcl|NC_019456. 315 ESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPI---PDEAADHLYISKDLY 391 (435) Q Consensus 315 ~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~---~~~~gd~~~~~~n~~ 391 (435) +++|+++|+++.+. +.......+.......+++++++|++|+||+|+.++...+ +.++++. T Consensus 305 ~~~l~~~l~~~~~~--------~~~~~~~~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~~-------- 368 (382) T protein:vir:48 305 LSELSQKLSCDVDA--------DIFPAVDPTGSNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPNGEN-------- 368 (382) T ss_pred HHHHHHHhcChhhh--------hhhhhhccchhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhhc-------- Confidence 99999999876433 2222223344455667788999999999999999853322 1111111 Q ss_pred chhccccccccccccccccccccccCCCCCCCCC Q lcl|NC_019456. 392 PLDKYYDAILDNKIQTDASVAAPKQEGGENTNEN 425 (435) Q Consensus 392 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (435) ..++.+|||+++++ T Consensus 369 --------------------~~~~~~GGd~~~~~ 382 (382) T protein:vir:48 369 --------------------PNSTLKGGEEDGQD 382 (382) T ss_pred --------------------CCCCCCCCCCCCCC Confidence 12345677766655 No 69 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=100.00 E-value=1.7e-75 Score=430.47 Aligned_cols=378 Identities=16% Similarity=0.193 Sum_probs=283.1 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccc-cc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQ-MD 79 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~-~~ 79 (435) ||||++++... ......... . .....+....|+++++|++||++||++||+|||+++++++. .. T Consensus 1 Mgl~d~~~~~~-----~~~~~~~~~---~-------~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~~~~ 65 (395) T protein:vir:96 1 MGILDFFSFKK-----SGTLSDDDS---G-------STTSEKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKLTEN 65 (395) T ss_pred CcchhhhcCCC-----Ccccccccc---c-------cchhhhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCccccc Confidence 99999885421 111111111 1 11122445678999999999999999999999999987644 46 Q ss_pred cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCCee Q lcl|NC_019456. 80 NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDIYN 159 (435) Q Consensus 80 ~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~ 159 (435) +|++.++|+.+||++||+++||+.++.+++++||+|+++.++. . +++.+...+.... .......+.+...... T Consensus 66 ~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~-~-----~~~~~~~~~~~~~-~~~~~~~v~~~~~~~~ 138 (395) T protein:vir:96 66 QKDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGK-G-----IYVADAFTQDKKL-SGNKFKVSRVQGQTYE 138 (395) T ss_pred cchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCC-c-----eecCCcccccccc-ccceeeeeeeccceee Confidence 7899999999999999999999999999999999999998753 2 2332222111111 1111111222222235 Q ss_pred EEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHH------HHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHH Q lcl|NC_019456. 160 FTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRS------VENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVND 231 (435) Q Consensus 160 ~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~------~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~ 231 (435) ++++++||+|||+.++....++.+++..+...+....+ +.++..+++.++ +.+++..++...++..++..++ T Consensus 139 ~~~~~~dvih~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (395) T protein:vir:96 139 KIFTFDQVIYLKNDNSDLMLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQENSDGGRQPKSDKDFFKR 218 (395) T ss_pred eEeccCceEEecccCCccccccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccccceeeccCchhhHHHHHHHHHH Confidence 68999999999987665555555555555544444332 224555666665 3567788888888888888888 Q ss_pred HHHHhc-CCCccccccCCceeeeccCChhhHHHHHHHHHH------HHHHHHHhCCCHHHhCCcccCcccHHH-HHHHHH Q lcl|NC_019456. 232 FLRMVK-ENGGAVVQEAGWKVDRYESKFEPADLSSVEQIS------RIRIATAFNVPISFLNDDQAKSTTNVE-HVTHSW 303 (435) Q Consensus 232 ~~~~~~-~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~------~~~Ia~~fgvP~~~lg~~~~~~~~~~e-~~~~~~ 303 (435) +..... ++++++++++|++|++++.++.++|+.+.+++. .++||++|||||.+|| ++++|.| +...|| T Consensus 219 ~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~----~~~sn~e~~~~~f~ 294 (395) T protein:vir:96 219 TIEKIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH----GDIADNQKNYELLL 294 (395) T ss_pred HHHHhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CCCccHHHHHHHHH Confidence 765553 455688899999999999999999999988876 4789999999999997 3445554 566788 Q ss_pred HHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCce Q lcl|NC_019456. 304 TMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADH 383 (435) Q Consensus 304 ~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~ 383 (435) ++||.||+.+||++|+++|+++.++..+.+ |+++.+++.|.+++++++++++++|++|+||+|+++|+||+++++||+ T Consensus 295 ~~~L~P~~~~ie~~l~~~Ll~~~e~~~~~~--f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~~~gD~ 372 (395) T protein:vir:96 295 EGPIESLITNIVDGLEYAIFDKSETLEGSF--IKVTGLKNYDLFSISSQADKLISSGFVFIDEVREEIGLPELPDGLGKV 372 (395) T ss_pred HHHHHHHHHHHHHHHHhhcCChhhhcCcee--EeecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCce Confidence 999999999999999999999887766654 677899999999999999999999999999999999999999999999 Q ss_pred eeecccccchhccccccccccccccccccccccCCCCCCCCCCC Q lcl|NC_019456. 384 LYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGL 427 (435) Q Consensus 384 ~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (435) +++++|++|++. +||+++++++. T Consensus 373 ~~~~~N~~~~~~---------------------~gge~~~~~~~ 395 (395) T protein:vir:96 373 LYMTKNYESVLE---------------------RGGEVDEEVET 395 (395) T ss_pred eeecccceechh---------------------ccCCCCCCCCC Confidence 999999998763 35565555555 No 70 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=100.00 E-value=6.7e-75 Score=427.22 Aligned_cols=380 Identities=18% Similarity=0.205 Sum_probs=293.9 Q ss_pred CchHHHHHhhcccccccccccccc--chhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNP--IPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQM 78 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 78 (435) ||||+++++.-.... +..+.... ......+.......+..++...++++++|++||++||++||++|+++++... T Consensus 3 m~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~~-- 79 (392) T protein:vir:74 3 LPILNFINQTNDPPE-AGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN-- 79 (392) T ss_pred chhhhhhhcccCccc-ccccccccccCchhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccchh-- Confidence 999998765321111 11111100 0111111222223455678889999999999999999999999999987542 Q ss_pred ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCC- Q lcl|NC_019456. 79 DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDI- 157 (435) Q Consensus 79 ~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~- 157 (435) ..|+.+||++||+++||+.++.+++++||||++++++. .|++++|+||+|++|++..+.+++.++|.+...+ T Consensus 80 ------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~-~G~~~~L~~i~~~~v~v~~~~~~~~~~y~~~~~~~ 152 (392) T protein:vir:74 80 ------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNA-NGADMKWEYLRPSQVNTYYFEYENGMYYNITFDDP 152 (392) T ss_pred ------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECC-CCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCC Confidence 34678999999999999999999999999999999875 5899999999999999999888877666665433 Q ss_pred ---eeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcC--CHHHHHHHHH Q lcl|NC_019456. 158 ---YNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSI--SPEKRQAMVN 230 (435) Q Consensus 158 ---~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~--~~e~~~~~~~ 230 (435) ....++++||||++++++.+.++|+||+.++...|..+.++++++.++|+|+ +++++++++.. ++++.+++++ T Consensus 153 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~~ 232 (392) T protein:vir:74 153 KIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSR 232 (392) T ss_pred ccceeEEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHH Confidence 3567999999999988766668999999999999999999999999999998 57899987654 3444455555 Q ss_pred HHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhHH Q lcl|NC_019456. 231 DFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMPI 310 (435) Q Consensus 231 ~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P~ 310 (435) .+. +..++|+++||++|++|+++++++.++||+|.+++++++||++|||||.+||+...+ +++.++...||+++|.|+ T Consensus 233 ~~~-~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-~~~~e~~~~~~~~~l~p~ 310 (392) T protein:vir:74 233 SFM-KRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQ-QSSIQQISGMYASALNRY 310 (392) T ss_pred HHh-ccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc-ccHHHHHHHHHHHHHHHH Confidence 443 445788999999999999999999999999999999999999999999999986544 467788888999999999 Q ss_pred HHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccc Q lcl|NC_019456. 311 IRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDL 390 (435) Q Consensus 311 ~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~ 390 (435) ++.|+++|+++|++. ++||...+.+.|.+++++.+++++++|++|+||+|+++....+. + ++.....|+ T Consensus 311 ~~~ie~~l~~~l~~~--------~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~-p--ne~r~~enl 379 (392) T protein:vir:74 311 LRPAISELEYKLSDH--------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYI-P--KDLPAPENT 379 (392) T ss_pred HHHHHHHHHHhccch--------hcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCC-c--cccchhcCC Confidence 999999999999753 67899999999999999999999999999999999987322211 1 122222232 Q ss_pred cchhcccccccccccccccccccc Q lcl|NC_019456. 391 YPLDKYYDAILDNKIQTDASVAAP 414 (435) Q Consensus 391 ~~l~~~~~~~~~~~~~~~~~~~~~ 414 (435) -|+... . .....| T Consensus 380 ~~~~~G---------d--~~~p~p 392 (392) T protein:vir:74 380 NKKTTG---------Q--SNEPVP 392 (392) T ss_pred CCCCCC---------C--CCCCCC Confidence 222110 0 000011 No 71 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=100.00 E-value=4.3e-75 Score=428.30 Aligned_cols=372 Identities=17% Similarity=0.236 Sum_probs=303.8 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) ||||+++... ..................+.+ ....+.+++.+.++++++|++||++||++||+|||+++++.. T Consensus 1 Mglf~~~~~~--~~~~~~~~~~~~~~~~~~~~~-~~~~~~~v~~~~al~~~~V~~~i~~Ia~~ia~l~~~~~~~~~---- 73 (384) T protein:vir:49 1 MPIFNITNLA--TESPPSNQDSFFDITDPEFLD-ALNGSEWVSAETALKNSDLFSIISQLSNDLATAKITTSRKQL---- 73 (384) T ss_pred CccccccccC--cccccccchhhccccchhhcc-cccCCceechhhhhccHHHHHHHHHHHHHHhhCceeeecchh---- Confidence 9999875321 111111111111000001111 234456678889999999999999999999999999987542 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecC---- Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSD---- 156 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~---- 156 (435) ..|+.+||++||+++|++.++.+++++||+|++++++. .|+|++|+||+|++|++..+.++..++|.+..+ T Consensus 74 ----~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~~~~ 148 (384) T protein:vir:49 74 ----QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNE-NGRDMKWEYLRPSQVSFNRLDNQNGLYYNITFDDPRI 148 (384) T ss_pred ----hhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECC-CCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCccc Confidence 34778999999999999999999999999999999874 489999999999999999888777776666543 Q ss_pred CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHH Q lcl|NC_019456. 157 IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLR 234 (435) Q Consensus 157 ~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~ 234 (435) +..++|+++||||++++++.+.++|+||+.++...+..+.++++++.++|.|+ +++++++++.+++++.++...++.. T Consensus 149 ~~~~~~~~~eVih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~ 228 (384) T protein:vir:49 149 PPKQHVPQGDILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQSRSRQA 228 (384) T ss_pred cceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHh Confidence 34578999999999998777779999999999999999999999999999997 6689999999998887777777777 Q ss_pred HhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCc--ccHHHH-HHHHHHHHHhHHH Q lcl|NC_019456. 235 MVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKS--TTNVEH-VTHSWTMTLMPII 311 (435) Q Consensus 235 ~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~--~~~~e~-~~~~~~~~i~P~~ 311 (435) +..++++++++++|++|+++++++.++|+.|.+++++++||++|||||.+||+...++ +.+.++ ...+++.++.|++ T Consensus 229 ~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l~pi~ 308 (384) T protein:vir:49 229 MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRFLRPFV 308 (384) T ss_pred cccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHHHHHHHH Confidence 7788999999999999999999999999999999999999999999999999855432 233333 3456678899999 Q ss_pred HHHHHHHHHhh---cccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceee Q lcl|NC_019456. 312 RQYESQFNMKL---FTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLY 385 (435) Q Consensus 312 ~~i~~~l~~~l---~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~ 385 (435) ..++++|++++ +....+..+.+++|+++.+.+.+..++.++..++.+.|+++ ||+|+.+|++|++....++.| T Consensus 309 ~~i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~-ne~r~~~~~~p~~gGd~~~~~ 384 (384) T protein:vir:49 309 SELSKKLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILP-KDLPEGETDSTLKGGETNEQY 384 (384) T ss_pred HHHHHHhchhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCC-hhHHHHcCCCCCCCCCCCCCC Confidence 99999999887 33444556778999999999999999999999999999986 999999999999754445555 No 72 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=100.00 E-value=2.4e-74 Score=424.17 Aligned_cols=378 Identities=18% Similarity=0.217 Sum_probs=293.2 Q ss_pred CchHHHHHhhccccccccccccccchhhhhh-ccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDM-AGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMD 79 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 79 (435) ||||+++++.-.....+.............+ .......+..++.+.++++++|++||++||++||++|+++++... T Consensus 3 m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~--- 79 (392) T protein:vir:10 3 LPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN--- 79 (392) T ss_pred chhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchh--- Confidence 9999988653222111111111111111111 112223345677888999999999999999999999999987542 Q ss_pred cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCC-- Q lcl|NC_019456. 80 NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDI-- 157 (435) Q Consensus 80 ~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~-- 157 (435) ..|+.+||++||+++||+.++.+++++||||++++++. .|++++|+|++|++|++..+.++..++|.+...+ T Consensus 80 -----~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~-~g~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~~~ 153 (392) T protein:vir:10 80 -----QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNA-NGADMKWEYLRPSQVNTYYFEYENGMYYNITFDDPK 153 (392) T ss_pred -----hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECC-CCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCcc Confidence 34678999999999999999999999999999999874 5899999999999999999888877776665443 Q ss_pred --eeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcC--CHHHHHHHHHH Q lcl|NC_019456. 158 --YNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSI--SPEKRQAMVND 231 (435) Q Consensus 158 --~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~--~~e~~~~~~~~ 231 (435) ....|+++||||++++++.+.++|+||+.++...+..+.++++++.++|.|+ +++++++++.. ++++.++++++ T Consensus 154 ~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~ 233 (392) T protein:vir:10 154 IEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRS 233 (392) T ss_pred cceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHH Confidence 3467999999999998776668999999999999999999999999999997 56899887654 34444445554 Q ss_pred HHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhHHH Q lcl|NC_019456. 232 FLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMPII 311 (435) Q Consensus 232 ~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P~~ 311 (435) +. +..++|+++||++|++|++++.++.++||.+.+++++++||++|||||.+||+.... +++.++...||++||.|++ T Consensus 234 ~~-~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~-~~~~~~~~~f~~~~l~P~~ 311 (392) T protein:vir:10 234 FM-KRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQ-QSSIQQISGMYASALNRYL 311 (392) T ss_pred Hh-ccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc-ccHHHHHHHHHHHHHHHHH Confidence 44 345778999999999999999999999999999999999999999999999976544 4667888889999999999 Q ss_pred HHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHh---CCCCCCCcCCceeeecc Q lcl|NC_019456. 312 RQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELE---GQAPIPDEAADHLYISK 388 (435) Q Consensus 312 ~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~---g~~p~~~~~gd~~~~~~ 388 (435) +.|+++++++|++. ++||...+.+.|..++++.+++++++|++|+||+|+++ |+.|. +..... T Consensus 312 ~~ie~~l~~~L~~~--------~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~------e~r~~e 377 (392) T protein:vir:10 312 RPAISELEYKLSDH--------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK------DLPAPE 377 (392) T ss_pred HHHHHHHHHhcccc--------ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc------ccchhc Confidence 99999999999753 67888888899999999999999999999999999987 55431 111112 Q ss_pred cccchhcccccccccccccccccccc Q lcl|NC_019456. 389 DLYPLDKYYDAILDNKIQTDASVAAP 414 (435) Q Consensus 389 n~~~l~~~~~~~~~~~~~~~~~~~~~ 414 (435) |+-|+... .+ ....| T Consensus 378 ~l~~~~~G---------d~--~~p~p 392 (392) T protein:vir:10 378 NTNKKTTG---------QS--NEPVP 392 (392) T ss_pred CCCCCCCC---------CC--CCCCC Confidence 22221110 00 00001 No 73 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=100.00 E-value=2.4e-74 Score=424.17 Aligned_cols=378 Identities=18% Similarity=0.217 Sum_probs=293.2 Q ss_pred CchHHHHHhhccccccccccccccchhhhhh-ccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDM-AGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMD 79 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 79 (435) ||||+++++.-.....+.............+ .......+..++.+.++++++|++||++||++||++|+++++... T Consensus 3 m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~--- 79 (392) T protein:vir:39 3 LPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN--- 79 (392) T ss_pred chhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchh--- Confidence 9999988653222111111111111111111 112223345677888999999999999999999999999987542 Q ss_pred cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCC-- Q lcl|NC_019456. 80 NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDI-- 157 (435) Q Consensus 80 ~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~-- 157 (435) ..|+.+||++||+++||+.++.+++++||||++++++. .|++++|+|++|++|++..+.++..++|.+...+ T Consensus 80 -----~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~-~g~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~~~ 153 (392) T protein:vir:39 80 -----QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNA-NGADMKWEYLRPSQVNTYYFEYENGMYYNITFDDPK 153 (392) T ss_pred -----hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECC-CCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCcc Confidence 34678999999999999999999999999999999874 5899999999999999999888877776665443 Q ss_pred --eeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcC--CHHHHHHHHHH Q lcl|NC_019456. 158 --YNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSI--SPEKRQAMVND 231 (435) Q Consensus 158 --~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~--~~e~~~~~~~~ 231 (435) ....|+++||||++++++.+.++|+||+.++...+..+.++++++.++|.|+ +++++++++.. ++++.++++++ T Consensus 154 ~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~ 233 (392) T protein:vir:39 154 IEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRS 233 (392) T ss_pred cceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHH Confidence 3467999999999998776668999999999999999999999999999997 56899887654 34444445554 Q ss_pred HHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhHHH Q lcl|NC_019456. 232 FLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMPII 311 (435) Q Consensus 232 ~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P~~ 311 (435) +. +..++|+++||++|++|++++.++.++||.+.+++++++||++|||||.+||+.... +++.++...||++||.|++ T Consensus 234 ~~-~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~-~~~~~~~~~f~~~~l~P~~ 311 (392) T protein:vir:39 234 FM-KRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQ-QSSIQQISGMYASALNRYL 311 (392) T ss_pred Hh-ccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc-ccHHHHHHHHHHHHHHHHH Confidence 44 345778999999999999999999999999999999999999999999999976544 4667888889999999999 Q ss_pred HHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHh---CCCCCCCcCCceeeecc Q lcl|NC_019456. 312 RQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELE---GQAPIPDEAADHLYISK 388 (435) Q Consensus 312 ~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~---g~~p~~~~~gd~~~~~~ 388 (435) +.|+++++++|++. ++||...+.+.|..++++.+++++++|++|+||+|+++ |+.|. +..... T Consensus 312 ~~ie~~l~~~L~~~--------~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~------e~r~~e 377 (392) T protein:vir:39 312 RPAISELEYKLSDH--------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK------DLPAPE 377 (392) T ss_pred HHHHHHHHHhcccc--------ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc------ccchhc Confidence 99999999999753 67888888899999999999999999999999999987 55431 111112 Q ss_pred cccchhcccccccccccccccccccc Q lcl|NC_019456. 389 DLYPLDKYYDAILDNKIQTDASVAAP 414 (435) Q Consensus 389 n~~~l~~~~~~~~~~~~~~~~~~~~~ 414 (435) |+-|+... .+ ....| T Consensus 378 ~l~~~~~G---------d~--~~p~p 392 (392) T protein:vir:39 378 NTNKKTTG---------QS--NEPVP 392 (392) T ss_pred CCCCCCCC---------CC--CCCCC Confidence 22221110 00 00001 No 74 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=100.00 E-value=1.2e-73 Score=420.27 Aligned_cols=427 Identities=12% Similarity=0.077 Sum_probs=299.7 Q ss_pred CchHHHHHhhccccccccccccccchh---------------------hhhhcccc-----cc--Cc------------- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQ---------------------PLDMAGVK-----LE--QA------------- 39 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~---------------------~~~~~~~~-----~~--~~------------- 39 (435) |.+++-++++|....++.........+ ..+...+. .. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~~~~ 80 (535) T protein:vir:10 1 MAILKDLRNAFSLSNKKSTSYIELGDYDKDIVNKAIRPGRASARDTVDGIDIADGNVAGQYSVASISDVLSTKKLLKAYA 80 (535) T ss_pred ChhhHHHHHHHHhhhhhhhhhHHHhhhhHHHHHhhhhhhhhhhhccccccccccCCcccccccCccccccCHHHHHHHhc Confidence 999988888775332222111000000 00000000 00 00 Q ss_pred --cccc--HHHHhhhHHHHHHHHHHHHHHhhCceeeeecc------cccccchHHHhhhccccccCCHHH----HHHHHH Q lcl|NC_019456. 40 --TFSR--EHILESNEYIFSIVTRLSNVLASLPLHEYQNY------KQMDNEPLADLLKTSPNPNMTAFE----FIARLE 105 (435) Q Consensus 40 --~~~~--~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~------~~~~~~~l~~~l~~~Pn~~~~~~~----f~~~~~ 105 (435) +.+. -.++....++|+||.++++.++++|+++++.. .....|+++++|+.+||++|++++ |++.++ T Consensus 81 ~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv 160 (535) T protein:vir:10 81 DNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKII 160 (535) T ss_pred cChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHH Confidence 0000 12344566777888888888889999998643 234568889999999999999876 555666 Q ss_pred HHHHhc-CCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC---ceEEEEEecCCeeEEEchhheEEeccCCCc---cc Q lcl|NC_019456. 106 TDRNVS-GNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN---NSYWYRVTSDIYNFTIPINDVIHVKHVVPS---NS 178 (435) Q Consensus 106 ~~~~~~-G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~---~~~~~~~~~~~~~~~~~~~~iih~~~~~~~---~~ 178 (435) .+++++ |++|++|+++ ..|+|++||||+|.+|++..+.++ ..++|++..++....|+++|||||++++.. ++ T Consensus 161 ~d~l~~~g~ay~~i~r~-~~G~~~~L~~l~p~~V~v~~d~~~~~~~~~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~~ 239 (535) T protein:vir:10 161 NDMYVQDQINIERIFKN-DSNELDHFNAVDASKVVISYSPRSKDQPRKFEQFVSETKSVKFSERNLTFINYWNLSDTDRR 239 (535) T ss_pred HHHHhhCCceEEEEEEC-CCCcEEEEEEeCCceeEEEEcCccccCceEEEEEecCceeEEECcccEEEEeccCCCCcccc Confidence 665555 5789999876 458999999999999999887655 467777778888889999999999986543 46 Q ss_pred cccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCC----cCCHHHHHHHHHHHHHHh---cCCCcccccc-CC Q lcl|NC_019456. 179 WYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDR----SISPEKRQAMVNDFLRMV---KENGGAVVQE-AG 248 (435) Q Consensus 179 ~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~----~~~~e~~~~~~~~~~~~~---~~~~~~~vl~-~g 248 (435) .+|+||+.++...|..+.++++++.++|+|| +++++++++ .+++++.+++++.|.... .|+++++|+. .| T Consensus 240 ~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g 319 (535) T protein:vir:10 240 GYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPILAAKD 319 (535) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCcccccccccccCCC Confidence 7899999999999999999999999999997 568998865 478899999999998754 4667876665 79 Q ss_pred ceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHH------------HHHH-HHHHHHHhHHHHHHH Q lcl|NC_019456. 249 WKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNV------------EHVT-HSWTMTLMPIIRQYE 315 (435) Q Consensus 249 ~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~------------e~~~-~~~~~~i~P~~~~i~ 315 (435) ++|+++++++.|+||+|++++++++||++|||||.+||..+.++++|+ |++. .|++.||.||+..|+ T Consensus 320 ~~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie 399 (535) T protein:vir:10 320 AKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIE 399 (535) T ss_pred ceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999999999888776543 4333 455889999999999 Q ss_pred HHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecc---cccc Q lcl|NC_019456. 316 SQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISK---DLYP 392 (435) Q Consensus 316 ~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~---n~~~ 392 (435) ++|+++|++.. +.+++|+++.+++.|.++++++++.++ .|+||+||+|+++|+||+ +|||++++.. +++. T Consensus 400 ~~ln~~Ll~~~----~~~~~f~f~~l~~~d~~~r~~~~~~~~-~g~lT~NE~R~~~gl~pi--egGD~~~~~~~~~~~~~ 472 (535) T protein:vir:10 400 QVINDKIMRYV----DTDYRFSFTLGDAQDKLQEEQVWKLKL-ANGYFINEYRKDHGLKTV--DGLDVPGFIGSAENFIN 472 (535) T ss_pred HHHhhhccccc----CCeEEEEeccccccCHHHHHHHHHHHH-cCCCCHHHHHHHhCCCCC--CCccccccccchhhccc Confidence 99999999754 345778888999999999999887665 678999999999999999 6899876533 2221 Q ss_pred hhcccccc-cccccccc----c---cccccc---cCCCCCCCC--CCCCCCCCCCC Q lcl|NC_019456. 393 LDKYYDAI-LDNKIQTD----A---SVAAPK---QEGGENTNE--NGLQSTEPEGS 435 (435) Q Consensus 393 l~~~~~~~-~~~~~~~~----~---~~~~~~---~~~~~~~~~--~~~~~~~~~~~ 435 (435) .....+.. +......+ . +..... .++|.++.+ ...++++.+.| T Consensus 473 ~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 528 (535) T protein:vir:10 473 ATGFGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPKSPLPKPSESDDVS 528 (535) T ss_pred ccccccccCCCCCCCccccCCccccCcccccccccccCCCCCCCCCCcCCCCCccc Confidence 11111110 00000000 0 000000 001111111 01122222223 No 75 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=100.00 E-value=2.4e-73 Score=418.73 Aligned_cols=427 Identities=11% Similarity=0.042 Sum_probs=296.2 Q ss_pred CchHHHHHhhcccccc------cc------------------------ccccccchhhhhhcccccc--Cccc----ccH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQ------AN------------------------QIVQNPIPQPLDMAGVKLE--QATF----SRE 44 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~------~~------------------------~~~~~~~~~~~~~~~~~~~--~~~~----~~~ 44 (435) ||||++++..+..... +. ...+...+......|.... ...+ ... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~ 80 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVL 80 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHH Confidence 9999999876541110 00 0000000000000111111 1111 112 Q ss_pred HHHhhhHHHHHHHHHHHHHHhhC-------------ceeeeeccccc------ccchHHHhhhcccccc-----CCHHHH Q lcl|NC_019456. 45 HILESNEYIFSIVTRLSNVLASL-------------PLHEYQNYKQM------DNEPLADLLKTSPNPN-----MTAFEF 100 (435) Q Consensus 45 ~~~~~~~~v~~~i~~ia~~ia~~-------------~~~~~~~~~~~------~~~~l~~~l~~~Pn~~-----~~~~~f 100 (435) +.|..+|+|++||+.||+.||.+ ++++....... ..+.+.. ++.+||++ +|+.+| T Consensus 81 ~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~-~l~~pn~~~~p~~~s~~~f 159 (547) T protein:vir:63 81 KKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIES-FIEKTGVDNDINRDSFSSF 159 (547) T ss_pred HHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHH-HHHhhCCCCCCccchHHHH Confidence 34567899999999999999974 22332211111 1123333 34588877 488999 Q ss_pred HHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCc-----eEEEEEecCCeeEEEchhheEEeccCCC Q lcl|NC_019456. 101 IARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNN-----SYWYRVTSDIYNFTIPINDVIHVKHVVP 175 (435) Q Consensus 101 ~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~iih~~~~~~ 175 (435) ++.++.+++++||+|++++++. .|+|++||||+|.+|++..+.++. ..|+++..++....|+++||||+++++. T Consensus 160 ~~~lv~d~ll~Gn~~~~i~rd~-~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~eiih~r~n~~ 238 (547) T protein:vir:63 160 VKKIVRDTYMYDQVNFEKVFNR-NQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAVRNPR 238 (547) T ss_pred HHHHHHHHHhhCCEEEEEEECC-CCcEEEEEEecCceeEEEECCccccccCceEEEEEcCCcEEEEeccccEEEecccCC Confidence 9999999999999999999874 589999999999999999887763 3455666666677899999999997643 Q ss_pred c---cccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCC--cCCHHHHHHHHHHHHHHh---cCCCccccc Q lcl|NC_019456. 176 S---NSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDR--SISPEKRQAMVNDFLRMV---KENGGAVVQ 245 (435) Q Consensus 176 ~---~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~--~~~~e~~~~~~~~~~~~~---~~~~~~~vl 245 (435) . .+.+|+||+..+...|..+.++++++.++|.|| ++++|++++ .+++++.+++++.|.... +|+|+++|+ T Consensus 239 ~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl 318 (547) T protein:vir:63 239 SDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVV 318 (547) T ss_pred CCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCcccccccccc Confidence 3 366899999999999999999999999999998 678888754 489999999999997754 567887655 Q ss_pred -cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccC----------cccHHHHH-HHHHHHHHhHHHHH Q lcl|NC_019456. 246 -EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAK----------STTNVEHV-THSWTMTLMPIIRQ 313 (435) Q Consensus 246 -~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~----------~~~~~e~~-~~~~~~~i~P~~~~ 313 (435) ++|++|+++++++.++||+|++++++++||++|||||++||....+ +++|.+++ ..||++||.|++.. T Consensus 319 ~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~ 398 (547) T protein:vir:63 319 SAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGF 398 (547) T ss_pred cCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHH Confidence 6889999999999999999999999999999999999999975543 56676654 46779999999999 Q ss_pred HHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccch Q lcl|NC_019456. 314 YESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPL 393 (435) Q Consensus 314 i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l 393 (435) |+++|+++|++... ..++|+++.+...+..+++++.+ ++..|+||+||+|+++|++|. .+|||+++.+.++.++ T Consensus 399 ie~~ln~~L~~~~~----~~~~~~f~~~~~~~~~~~~~~~~-~~~~g~lT~NE~R~~~gl~P~-~egGD~~~~~~~~~~~ 472 (547) T protein:vir:63 399 IEDFINKHIVAEFG----DKYTFQFVGGDIKSELESVKILA-EKAKVAMTVNEVRKELNLPGD-VIGGDIPLNGVIVQRI 472 (547) T ss_pred HHHHHHhhcccccC----CceEEEeeccccccHHHHHHHHH-HHhCCCcCHHHHHHHhCCCCC-CCCCceeecccccccc Confidence 99999999986532 23455566777778777777654 777899999999999999983 2799999999988877 Q ss_pred hccccccccc-cccccc-----------cccccccCCC--CCCCCCCCCCC-CCC-CC Q lcl|NC_019456. 394 DKYYDAILDN-KIQTDA-----------SVAAPKQEGG--ENTNENGLQST-EPE-GS 435 (435) Q Consensus 394 ~~~~~~~~~~-~~~~~~-----------~~~~~~~~~~--~~~~~~~~~~~-~~~-~~ 435 (435) +......... ..+... ....++.++. ++..+.+++++ .++ ++ T Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~ 530 (547) T protein:vir:63 473 GQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNA 530 (547) T ss_pred cccccccCCccccchhhccccccccCCCCCCCCCCCCCCcccCCCcCccccccCcccc Confidence 5433221111 000000 0001111111 11111111111 111 11 No 76 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=100.00 E-value=3.3e-74 Score=423.42 Aligned_cols=350 Identities=17% Similarity=0.226 Sum_probs=284.8 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) ||||+++++. + ..... ........+........++.+.++++++|++||++||++||++|+. . T Consensus 1 M~~~~~f~~r------~-~~~~~-~~~~~~~~~~~~~~~~~v~~~~al~~~av~~cv~~ia~~ia~~p~~---------~ 63 (359) T protein:vir:10 1 MSILNPFERR------S-SITPN-NYYPFMVQNGSIVPNSLVDATEALKNSDLYAVTSLISSDIAGTRFI---------G 63 (359) T ss_pred Ccccchhhcc------c-cCCCC-cchhhhhccccccCCcccCHHHhhcchHHHHHHHHHHHhhhcCccc---------c Confidence 9999865431 1 11111 1222333344555667788889999999999999999999999984 4 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCCeeE Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDIYNF 160 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~ 160 (435) ++++++|+.+||++||+++||+.++.+++++||+|++|+++. .|.|.+|+|++|+.|++..++++..+.+....++..+ T Consensus 64 ~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-~g~~~~l~~l~~~~v~i~~~~~~~~y~~~~~~~~~~~ 142 (359) T protein:vir:10 64 NQVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGD-NSLMKELRLIPSNAITIDLTDDTLTYEVNQFDDYPSA 142 (359) T ss_pred chHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECC-CCeEEEEEEeCCceEEEEEcCCeEEEEEEecCCceEE Confidence 778889999999999999999999999999999999999874 5899999999999999988876655555555667788 Q ss_pred EEchhheEEeccCC----CccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCC-cCCHHHHHHHHHHHH Q lcl|NC_019456. 161 TIPINDVIHVKHVV----PSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDR-SISPEKRQAMVNDFL 233 (435) Q Consensus 161 ~~~~~~iih~~~~~----~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~-~~~~e~~~~~~~~~~ 233 (435) +|+++|||||++++ +.++++|+||+..+...+..+.+++++..++|+|| +++++++++ .+++++.+++++.|+ T Consensus 143 ~~~~~evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~ 222 (359) T protein:vir:10 143 KYNASEMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKEFE 222 (359) T ss_pred EEcccceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHH Confidence 99999999999764 34788999999999999999999999999999997 678999875 789999999999997 Q ss_pred HHh--cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhHHH Q lcl|NC_019456. 234 RMV--KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMPII 311 (435) Q Consensus 234 ~~~--~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P~~ 311 (435) ... .|+|+++||++|++|+++++++.|+|++|.+++++++||++|||||++||+....+ ++.++...++..++.|.+ T Consensus 223 ~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~-~~~~~~e~~~~~~l~~~l 301 (359) T protein:vir:10 223 KANGGNNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQ-SSLDQIKDLYVNALNRFI 301 (359) T ss_pred HHhCccccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCccc-ccHHHHHHHHHHHHHHHH Confidence 654 56789999999999999999999999999999999999999999999998765432 344555556666666666 Q ss_pred HHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCC Q lcl|NC_019456. 312 RQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIP 377 (435) Q Consensus 312 ~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~ 377 (435) ..++++|+.+|... +.++...+...|...+...+.+++++|++|+||+|+++|++|+= T Consensus 302 ~p~~~~l~~~l~~~--------~~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 302 EPLISELRIKCDSS--------IGVDMSPITDYSNSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred HHHHHHHHHHhhhh--------hcccchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 66666666666543 23444444444555556677889999999999999999999993 No 77 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=100.00 E-value=2.7e-74 Score=423.93 Aligned_cols=364 Identities=14% Similarity=0.129 Sum_probs=279.3 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) ||||+++++.. ....... + ......++.+.|+++++|++||++||++||++||++++++. ..+ T Consensus 1 Mg~f~~l~~~~----~~~~~~~-------~-----~~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~~-~~~ 63 (376) T protein:vir:78 1 MGFFSELFKRN----KEIEWMW-------D-----LDFLEDKTTKVYLKKMALNTCVKHIARTIAKSDFRLKNGET-SVR 63 (376) T ss_pred CchhhhhhccC----Ccccccc-------c-----hhhccccchhhhhhhHHHHHHHHHHHHhhcccceeeccccc-ccc Confidence 99999986431 1111110 0 01122345677899999999999999999999999998654 457 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCCeeE Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDIYNF 160 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~ 160 (435) |+++++|+.+||++||+++||+.++.+++++||+|+++.++. .|.+..++|+.+..+..... ..+.+...+... T Consensus 64 ~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~-~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~ 137 (376) T protein:vir:78 64 DKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTD-DFLIADSYVRKEFAFFPDVF-----EGVTVKDYRYNR 137 (376) T ss_pred chHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCC-Ceeeccceeecccceeeeee-----eeeeeecceeee Confidence 999999999999999999999999999999999999998875 48899999999876543321 112222223356 Q ss_pred EEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCce--EEEeCCcCCHHHHHHHHHHHHHHhc- Q lcl|NC_019456. 161 TIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKF--VLQYDRSISPEKRQAMVNDFLRMVK- 237 (435) Q Consensus 161 ~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~--~~~~~~~~~~e~~~~~~~~~~~~~~- 237 (435) .|+++||+|+++........+.++...+...+... .....+.++.++ ++..++.+++++.+++++.|....+ T Consensus 138 ~~~~~evih~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~g 212 (376) T protein:vir:78 138 NFSMDDVIFLEYGNERLSAFTDGMFEDYGELFGKM-----IRAQMRNFQIRGAVNFKMAGVADKDKQTKLQEYIDKVYAS 212 (376) T ss_pred eeccccEEEeccCCCCchhhhhHHHHHHHHHHHHH-----HHHHHhcCCCceeEEEccCCCCCHHHHHHHHHHHHHHhcc Confidence 79999999999765422222222222222222111 122234555554 4555778999999999999976643 Q ss_pred ---CCCccccccCCceeeeccCChhh-----HHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHH-HHHHHHHHHHh Q lcl|NC_019456. 238 ---ENGGAVVQEAGWKVDRYESKFEP-----ADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVE-HVTHSWTMTLM 308 (435) Q Consensus 238 ---~~~~~~vl~~g~~~~~~~~~~~~-----~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e-~~~~~~~~~i~ 308 (435) +.++++++++|++|+++++++.+ +||.|.+++++++||++|||||.+||+ +++|.| +...|+++||. T Consensus 213 ~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~----~~s~~e~~~~~f~~~~l~ 288 (376) T protein:vir:78 213 FNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHG----DMADLSNNMKAYMEYCID 288 (376) T ss_pred ccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCC----CCCCHHHHHHHHHHHHHH Confidence 23457889999999999988865 499999999999999999999999974 455555 45677899999 Q ss_pred HHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecc Q lcl|NC_019456. 309 PIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISK 388 (435) Q Consensus 309 P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~ 388 (435) |++.+|+++|+++|+++.+ +++.|+++.+++.|.+++++++.+++++|++|+||+|+++|+||+++++||++++|+ T Consensus 289 P~~~~ie~~l~~kll~~~~----~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~~d~~~~~~ 364 (376) T protein:vir:78 289 PLTKKLEDELNAKLFTFSE----FLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPELDKYLITK 364 (376) T ss_pred HHHHHHHHHHHhhhCCccc----ceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecc Confidence 9999999999999998753 567889899999999999999999999999999999999999999988899999999 Q ss_pred cccchhccccccccccccccccccccccCCCCCC Q lcl|NC_019456. 389 DLYPLDKYYDAILDNKIQTDASVAAPKQEGGENT 422 (435) Q Consensus 389 n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 422 (435) |++|++..++. + T Consensus 365 n~~~~~~~~e~----------------------g 376 (376) T protein:vir:78 365 NYQSADEGGED----------------------G 376 (376) T ss_pred CceehhccccC----------------------C Confidence 99998753221 1 No 78 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=100.00 E-value=5.4e-74 Score=422.25 Aligned_cols=378 Identities=15% Similarity=0.155 Sum_probs=273.4 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc-cccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY-KQMD 79 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~-~~~~ 79 (435) ||||+|+... +........+. .....+..+.++++++|++||++||++||++||++++++ .... T Consensus 1 MGlf~~~~~~-----~~~~~~~~~~~----------~~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~~~ 65 (395) T protein:vir:98 1 MGILDFFSFK-----KSGTLSDDDSG----------STTSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKLTEN 65 (395) T ss_pred CcchhhhcCC-----Ccccccccccc----------hhhhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCcccc Confidence 9999998421 11111111111 111224456788999999999999999999999999865 4456 Q ss_pred cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCCee Q lcl|NC_019456. 80 NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDIYN 159 (435) Q Consensus 80 ~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~ 159 (435) +|+++++|+.+||++||+++||+.++.+++++||||++++++.. +++.+ ..+............+.+...... T Consensus 66 ~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~------~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 138 (395) T protein:vir:98 66 QKDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKG------IYVAD-SFTQDKKISGSQFKVSRVQGQTYE 138 (395) T ss_pred cchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCc------eecCC-cccccccccCcccceeeecCceee Confidence 78999999999999999999999999999999999999987532 22222 222222111111222222222335 Q ss_pred EEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHH--HHHHHHHhhcCCc--eEEEeCCc-CCHHHHHHHHHHHHH Q lcl|NC_019456. 160 FTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSV--ENFSQNEMEKKDK--FVLQYDRS-ISPEKRQAMVNDFLR 234 (435) Q Consensus 160 ~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~--~~~~~~~~~n~~~--~~~~~~~~-~~~e~~~~~~~~~~~ 234 (435) ++|++++|||+|+.+.....++.+++......+...... .....+++.++.. +++..... .++++.++.++.+.. T Consensus 139 ~~~~~~evih~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (395) T protein:vir:98 139 KTFTFDQVIYLKNDNSDLMSKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQENSDGGRQSKSDKDFFKR 218 (395) T ss_pred eEecCccEEEecCCCCCccccccchhhhHHHHHHHHHHHHHHHHHHHHhhccccccccccccccCCcHHHHHHHHHHHHH Confidence 789999999999876544455556666666655544433 3334455555432 33333333 334444444444433 Q ss_pred ---H-hcCCCccccccCCceeeeccC------ChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHH-HHHHHH Q lcl|NC_019456. 235 ---M-VKENGGAVVQEAGWKVDRYES------KFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVE-HVTHSW 303 (435) Q Consensus 235 ---~-~~~~~~~~vl~~g~~~~~~~~------~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e-~~~~~~ 303 (435) . ..++++++++++|++|++++. ++.++|+.+++++++++||++|||||.+|| +++++.| +...|+ T Consensus 219 ~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~----~~~sn~e~~~~~f~ 294 (395) T protein:vir:98 219 TVEKIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH----GDIADNQKNYELLL 294 (395) T ss_pred HHhhhhcCCcceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CCcccHHHHHHHHH Confidence 2 234556888999999999985 467789999999999999999999999997 3455544 566788 Q ss_pred HHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCce Q lcl|NC_019456. 304 TMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADH 383 (435) Q Consensus 304 ~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~ 383 (435) ++||.|++.+|+++|+++||++.++..|.+ |+++.+++.|.+++++++.+++++|++|+||+|+++|+||+++++||+ T Consensus 295 ~~tl~P~~~~ie~~l~~kll~~~~~~~g~~--f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~~gD~ 372 (395) T protein:vir:98 295 EGPIESLITNIVDGLEYAIFDKSETLQGSF--IKVTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPDGLGKV 372 (395) T ss_pred HHHHHHHHHHHHHHHHHhcCChhhhcCcce--eeehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCce Confidence 999999999999999999999888777655 666799999999999999999999999999999999999999999999 Q ss_pred eeecccccchhccccccccccccccccccccccCCCCCCCCCCC Q lcl|NC_019456. 384 LYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGL 427 (435) Q Consensus 384 ~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (435) +++++|++|++. +||++++++++ T Consensus 373 ~~~~~n~~~~~~---------------------~gge~~~~~~~ 395 (395) T protein:vir:98 373 LYMTKNYESVLE---------------------RGGEVDEEVET 395 (395) T ss_pred eeecccceeccc---------------------ccCCCCCCCCC Confidence 999999999863 35555555444 No 79 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=100.00 E-value=6.9e-74 Score=421.67 Aligned_cols=356 Identities=15% Similarity=0.165 Sum_probs=274.3 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccc---- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYK---- 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~---- 76 (435) ||||++++.++........ ....+++....++.+++|++||++||++||++||+++++.. T Consensus 1 M~~f~k~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~ 64 (378) T protein:vir:85 1 MNLFGKVVSFSRGKLNNDT----------------QRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVG 64 (378) T ss_pred CchhhhhhhhhhcccccCC----------------cceeeeeccchhhhhHHHHHHHHHHHHhHhhCceeEEEEeccccc Confidence 9999999876543322111 11112334456788899999999999999999999987432 Q ss_pred -----ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEE Q lcl|NC_019456. 77 -----QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWY 151 (435) Q Consensus 77 -----~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~ 151 (435) ...+|+++++|+.+||++||+++||+.++.+++++||||++++.+...|.+..+++. T Consensus 65 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~------------------ 126 (378) T protein:vir:85 65 SDTLISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLDLLFA------------------ 126 (378) T ss_pred cccccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEEEEec------------------ Confidence 346799999999999999999999999999999999999997766666655444322 Q ss_pred EEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhc-CCceEEEeCCcCCHHHHHHHHH Q lcl|NC_019456. 152 RVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEK-KDKFVLQYDRSISPEKRQAMVN 230 (435) Q Consensus 152 ~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n-~~~~~~~~~~~~~~e~~~~~~~ 230 (435) . ..++|.++||||++.+...+ .+.+.+..+...+. ..+.+ .++++++.++.+++++.+++++ T Consensus 127 ---~--~~~~~~~~dvih~~~~~~~~--~~~~~~~~a~~~~~----------~~~~~~~~~g~l~~~~~l~~~~~~~~~~ 189 (378) T protein:vir:85 127 ---N--DKKEYKPEELVRLVSPFYIN--EDTSILDNALASIQ----------TKLEQGKLRGLLKINAFLDIDNTQEYRE 189 (378) T ss_pred ---C--CCEEEcccceEEEecCcCcc--chhhHHHHHHHHHH----------HHHhcCCcceEEEeCCcCCHHHHHHHHH Confidence 1 12467889999999654323 23344443333222 22334 4789999999999999888888 Q ss_pred HHHHHh------cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHH Q lcl|NC_019456. 231 DFLRMV------KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWT 304 (435) Q Consensus 231 ~~~~~~------~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~ 304 (435) +|...+ .++++++||++|++|+++++++.++|+ +.+++..++||++|||||.+|+. .++.++...|++ T Consensus 190 ~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~~-----s~~e~~~~~f~~ 263 (378) T protein:vir:85 190 KALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIELIKSELLTGYFMNENILLG-----TATQEQQIYFYN 263 (378) T ss_pred HHHHHHHHhhcccccccceecCCCceEEeccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC-----CchHHHHHHHHH Confidence 885543 356789999999999999999999996 56789999999999999999953 223456677899 Q ss_pred HHHhHHHHHHHHHHHHhhcccccccCcc------eeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCC Q lcl|NC_019456. 305 MTLMPIIRQYESQFNMKLFTPGKRVKGF------YFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPD 378 (435) Q Consensus 305 ~~i~P~~~~i~~~l~~~l~~~~~~~~g~------~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~ 378 (435) +||.||+.+|+++|+++||++.++..++ .+.||.+.+++.|.+++++.+.+++++|+||+||+|+++|+||+ T Consensus 264 ~tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~-- 341 (378) T protein:vir:85 264 STIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPI-- 341 (378) T ss_pred HHHHHHHHHHHHHHHhhcCChhhhhhhhhccccceeeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-- Confidence 9999999999999999999998777665 37899999999999999999999999999999999999999999 Q ss_pred cCCceeeecccccchhccccccccccccccccccccccCCCCCCCC Q lcl|NC_019456. 379 EAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 379 ~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (435) +|||++++|+|++|++...+.+..+... .++++++++ T Consensus 342 ~gGD~~~~~~N~~~~~~~~~~~~~~~~~---------~~~~e~~n~ 378 (378) T protein:vir:85 342 EGGDIYIANLNAVAVKNLSDLQGSRKDV---------ASTDETNNQ 378 (378) T ss_pred CCCCeEeecccccccccchhhcCccCCC---------CCCCCCCCC Confidence 5899999999999998765443222111 112222222 No 80 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=100.00 E-value=1.6e-73 Score=419.64 Aligned_cols=357 Identities=15% Similarity=0.164 Sum_probs=277.7 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccc---- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYK---- 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~---- 76 (435) ||||+++++++......... ....++.+..++..++|++||++||++||++||++++... T Consensus 1 M~if~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~ 64 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQ----------------RVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVG 64 (378) T ss_pred CchhHHhHhhhhcccccCcc----------------eeeeeecchhhhhhHHHHHHHHHHHHhHhhCceeeeeecccccc Confidence 99999999876543322110 1112333445677889999999999999999999887432 Q ss_pred -----ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEE Q lcl|NC_019456. 77 -----QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWY 151 (435) Q Consensus 77 -----~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~ 151 (435) ...+|+++++|+.+||++||+++||+.++.+++++|+||++++.+...|.+..+++.. T Consensus 65 ~~~~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~~----------------- 127 (378) T protein:vir:94 65 SDTLISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDLLFAN----------------- 127 (378) T ss_pred cccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEEEEec----------------- Confidence 3467999999999999999999999999999999999999977766767765554321 Q ss_pred EEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCHHHHHHHHHH Q lcl|NC_019456. 152 RVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISPEKRQAMVND 231 (435) Q Consensus 152 ~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~ 231 (435) + .++|+++||+|++.+...+ .+.+++..+...+... +..+.++++++.++.+++++.++.+++ T Consensus 128 ----~--~~~~~~~dvih~~~~~~~~--~~~~~~~~~~~~~~~~---------~~~~~~~g~l~~~~~l~~~~~~~~~e~ 190 (378) T protein:vir:94 128 ----D--KKEYKPEELVRLTSPFYIN--EDTSILDNALASIQTK---------LEQGKLRGLLKINAFLDIDNTQEYREK 190 (378) T ss_pred ----C--cEEechhceeeecCcCCcc--cchhHHHHHHHHHHHH---------HhhCCcccceeeCCcCCHHHHHHHHHH Confidence 1 2468999999999664333 3445655554433221 123347889999999998887777777 Q ss_pred HHHHh------cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHH Q lcl|NC_019456. 232 FLRMV------KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTM 305 (435) Q Consensus 232 ~~~~~------~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~ 305 (435) |...+ .++++++||++|++|+++++++.++|+ +.+++..++||++|||||.+|++ ..+.++...|+++ T Consensus 191 ~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgvPp~~l~g-----~~~e~~~~~f~~~ 264 (378) T protein:vir:94 191 ALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG-----TATQEQQIYFYNS 264 (378) T ss_pred HHHHHHHhhcccccccceeccCCceEEEccCChHHhhH-HHHHHHHHHHHHHhCCCHHHhcC-----CchHHHHHHHHHH Confidence 75543 356789999999999999999999997 66688999999999999999953 2234566778899 Q ss_pred HHhHHHHHHHHHHHHhhcccccccCcc------eeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCc Q lcl|NC_019456. 306 TLMPIIRQYESQFNMKLFTPGKRVKGF------YFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDE 379 (435) Q Consensus 306 ~i~P~~~~i~~~l~~~l~~~~~~~~g~------~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~ 379 (435) ||.||+..|+++|+++||++.++..|+ .++|+++.+++.|.+++++++.+++++|++|+||+|+++|+||+ | T Consensus 265 tl~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~--~ 342 (378) T protein:vir:94 265 TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPI--E 342 (378) T ss_pred HHHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--C Confidence 999999999999999999988776654 47799999999999999999999999999999999999999999 5 Q ss_pred CCceeeecccccchhccccccccccccccccccccccCCCCCCCC Q lcl|NC_019456. 380 AADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 380 ~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (435) |||++++|+|++|++.+......+.. ..++++++++ T Consensus 343 ggd~~~~~~n~~~~~~~~~~~~~~~~---------~~~~~e~~n~ 378 (378) T protein:vir:94 343 GGDVYIANLNAVAVKNLSDLQGNRKD---------VTSTDETNNQ 378 (378) T ss_pred CCCeeeecccccchhcchhcccccCC---------CCCCCCCCCC Confidence 89999999999999876554332221 1223333333 No 81 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=100.00 E-value=6.8e-71 Score=405.25 Aligned_cols=424 Identities=12% Similarity=0.067 Sum_probs=286.6 Q ss_pred CchHHHHHh---hccccccccccccccchhh-hhhccccccCccc----------ccHHHHhhhHHHHHHHHHHHHHHhh Q lcl|NC_019456. 1 MSFMSKVRQ---FFGVHDQANQIVQNPIPQP-LDMAGVKLEQATF----------SREHILESNEYIFSIVTRLSNVLAS 66 (435) Q Consensus 1 Mg~~~~~~~---~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~----------~~~~~~~~~~~v~~~i~~ia~~ia~ 66 (435) =++|..+.. -......+.. ......+ +.+.|.......+ ..-.++..+|+|++||++||+.||+ T Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~--~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~ 109 (576) T protein:vir:96 32 QANIRNIEEKSKELNKSLYGKQ--QAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVLKQFGNNPILNAIILTRSNQVAM 109 (576) T ss_pred hHHHHHhhhhhhhhccccCCcc--chhhcceeeeeecCCCccccCcchhhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHh Confidence 222222211 0111110000 0000000 0111111100001 0113355689999999999999997 Q ss_pred C-------------ceeeeecccccc------cchH---HHhhhcccccc-CCHHHHHHHHHHHHHhcCCcceEEeeeCC Q lcl|NC_019456. 67 L-------------PLHEYQNYKQMD------NEPL---ADLLKTSPNPN-MTAFEFIARLETDRNVSGNGYAWIQKSLS 123 (435) Q Consensus 67 ~-------------~~~~~~~~~~~~------~~~l---~~~l~~~Pn~~-~~~~~f~~~~~~~~~~~G~~~~~i~~~~~ 123 (435) + +++++..+.... .+.+ +..++..|||+ +|+.+||+.++.+++++||+|++++++.. T Consensus 110 ~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd 189 (576) T protein:vir:96 110 YCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKK 189 (576) T ss_pred hhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEEecC Confidence 3 334443322111 1222 22223455555 58999999999999999999999986543 Q ss_pred -CCcEEEEEEeCCceeEEEEcCCCceE-----EEEEecCCeeEEEchhheEEeccCCCcc---ccccCcHHHHHHHHHHH Q lcl|NC_019456. 124 -TGEPIALWPLDPNTVSILRNTDNNSY-----WYRVTSDIYNFTIPINDVIHVKHVVPSN---SWYGVSPIDVLSSSLKF 194 (435) Q Consensus 124 -~g~~~~l~~l~~~~v~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~iih~~~~~~~~---~~~G~s~l~~~~~~i~~ 194 (435) .|++++||||+|.+|++..+.++..+ ++++..++....|++++|||+++....+ +.+|+||+.++...|.. T Consensus 190 ~~g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~dii~~~~~~~~d~~~~~~G~Spi~~a~~~i~~ 269 (576) T protein:vir:96 190 NATTMDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVINKKVVASFTSREMAMGIRNPRTELSSSGYGLSEVEIAMKQFIA 269 (576) T ss_pred CCCceEEEEEeCCceeEEEECCCCceeeeeeEEEEecCCceEEEecccceEEEeecCCCCcccCcccccHHHHHHHHHHH Confidence 57899999999999999999988654 3444455667789999999877654443 67899999999999999 Q ss_pred HHHHHHHHHHHhhcC--CceEEEeCC--cCCHHHHHHHHHHHHHHh---cCCCc-cccccCCceeeeccCChhhHHHHHH Q lcl|NC_019456. 195 QRSVENFSQNEMEKK--DKFVLQYDR--SISPEKRQAMVNDFLRMV---KENGG-AVVQEAGWKVDRYESKFEPADLSSV 266 (435) Q Consensus 195 ~~~~~~~~~~~~~n~--~~~~~~~~~--~~~~e~~~~~~~~~~~~~---~~~~~-~~vl~~g~~~~~~~~~~~~~~~~e~ 266 (435) +.++++++.++|.|| +++++++++ .+++++++++++.|.... .|+|+ ++||++|++|+++++++.++||+|+ T Consensus 270 ~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d~qfle~ 349 (576) T protein:vir:96 270 YNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNMTPTANDMQFEKW 349 (576) T ss_pred HHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEeccCChhhHHHHHH Confidence 999999999999997 678888765 579999999999998765 45677 5899999999999999999999999 Q ss_pred HHHHHHHHHHHhCCCHHHhCCcccC-----------cccHHHH-HHHHHHHHHhHHHHHHHHHHHHhhcccccccCccee Q lcl|NC_019456. 267 EQISRIRIATAFNVPISFLNDDQAK-----------STTNVEH-VTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYF 334 (435) Q Consensus 267 ~~~~~~~Ia~~fgvP~~~lg~~~~~-----------~~~~~e~-~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i 334 (435) +++++++||++|||||.+||..+.+ +++|.|+ ...||++||.|++..|+++|+++|++... .++ T Consensus 350 ~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~----~~~ 425 (576) T protein:vir:96 350 LTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHIISEYS----DKY 425 (576) T ss_pred HHHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcc----Cce Confidence 9999999999999999999986644 5567665 45677999999999999999999997542 233 Q ss_pred eechhhhhccCHHHHHHHHHHH--HhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccc--- Q lcl|NC_019456. 335 SFNVNGLLRGDTAARTQYYQTL--TRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDA--- 409 (435) Q Consensus 335 ~fd~~~l~~~d~~~~~~~~~~~--~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~--- 409 (435) .|+ +++.|.+++++.++.+ +.+|+||+||+|+++|+||+ +|||+++.+.++.+++............... T Consensus 426 ~~~---f~r~d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~pi--egGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~ 500 (576) T protein:vir:96 426 VFQ---FVGGDTKSELDKIKILQEEVKTYKTVNEARKEKGLKPI--EGGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFD 500 (576) T ss_pred EEE---eccCCHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCC--CCcceeccccccccccccccCCCCCCcccccccc Confidence 443 3456777777766543 56799999999999999999 6899999999988876543222111111100 Q ss_pred ---------cccccccC--CCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 410 ---------SVAAPKQE--GGENTNENGLQSTEPEGS 435 (435) Q Consensus 410 ---------~~~~~~~~--~~~~~~~~~~~~~~~~~~ 435 (435) .+..+..+ ...++.+.+++++..++. T Consensus 501 ~~~~~~~~~~~~~~~~~s~~~~~~g~~~~~~~~~~~~ 537 (576) T protein:vir:96 501 MIQQFLNSPDDEEPQQESTEDKVDGRESNDPTKIDSP 537 (576) T ss_pred ccccccCCCCCCCCCCCCCCCcccccccccCCCCCCc Confidence 00000010 111112222222222221 No 82 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=100.00 E-value=6.9e-71 Score=405.24 Aligned_cols=431 Identities=10% Similarity=0.073 Sum_probs=303.9 Q ss_pred CchHHHHHhhcccccccc----------------c----------cccccchhhhh--------hccccccC-ccccc-- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQAN----------------Q----------IVQNPIPQPLD--------MAGVKLEQ-ATFSR-- 43 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~----------------~----------~~~~~~~~~~~--------~~~~~~~~-~~~~~-- 43 (435) -|||+++..+|..+..-. . ...++-...+. ..+..... .++.+ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g~~~~~epp~d~~ 87 (648) T protein:vir:79 8 RGFWSRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGGGRDFEEPEFDFN 87 (648) T ss_pred chhhhhhhhhccCccccccccccccccccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcCCccccccCCcCHH Confidence 789999988876221111 0 00011111110 01111111 12222 Q ss_pred --HHHHhhhHHHHHHHHHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeee Q lcl|NC_019456. 44 --EHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKS 121 (435) Q Consensus 44 --~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~ 121 (435) .+.+.++|.|++||++||++||++||++..++....++.-..+++.+||++||.++||+.++.+++++||+|++++++ T Consensus 88 ~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd 167 (648) T protein:vir:79 88 EITSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPNAVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRA 167 (648) T ss_pred HHHHHHhcChHHHHHHHHHHHHHhhCcceEEecCCccchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEec Confidence 245668999999999999999999999877554443333344566799999999999999999999999999999998 Q ss_pred CCCC--------------cEEEEEEeCCceeEEEEcCCCceEEEEEec--CCeeEEEchhheEEeccCCCccccccCcHH Q lcl|NC_019456. 122 LSTG--------------EPIALWPLDPNTVSILRNTDNNSYWYRVTS--DIYNFTIPINDVIHVKHVVPSNSWYGVSPI 185 (435) Q Consensus 122 ~~~g--------------~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~--~~~~~~~~~~~iih~~~~~~~~~~~G~s~l 185 (435) ..++ .+.+|||++|.+|++..+.+|...+|.+.. ++..+.|++++||||++.++.++++|+||+ T Consensus 168 ~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~~~Y~y~~~g~~~~~~~~~~dIIHik~~~~~d~~~GlSpi 247 (648) T protein:vir:79 168 KDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMIKGWQQEQEGQDKPQKFKPEDIVHIYYKREKGRAFGTPWL 247 (648) T ss_pred CCCccchhhhhhhhccccceeeeEeecCceeEEEEcCCCceeeeEEEecCCceeEEecCccEEEEccCCCCCCceeccHH Confidence 6643 357899999999999999888776655544 344567899999999988888999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeC-CcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeecc----CCh Q lcl|NC_019456. 186 DVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYD-RSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYE----SKF 258 (435) Q Consensus 186 ~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~-~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~----~~~ 258 (435) .++...|....+++++++++|.|+ +.++++++ .....+..++++++|....++ ..+.+.+.++..+. .++ T Consensus 248 ~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~~~~~e~~k~~~e~~~~~~~~---~~i~gg~v~~~~~~i~~~~s~ 324 (648) T protein:vir:79 248 LPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLEQEGFGAEEGEVDLVRGEVEN---MDVEGGMVTTERVNISSIASN 324 (648) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCccchHHHHHHHHHHHHhccc---ccccccccccceeeccccCCH Confidence 999999999999999999999998 46777764 344556677777777766544 23333443443333 256 Q ss_pred hhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhHHHHHHHHHHHHhhc----cccc----ccC Q lcl|NC_019456. 259 EPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMPIIRQYESQFNMKLF----TPGK----RVK 330 (435) Q Consensus 259 ~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P~~~~i~~~l~~~l~----~~~~----~~~ 330 (435) +++||++++++++++||++|||||.+||....+++++.+++..+|..++.|++..++..++.++. .+.. ... T Consensus 325 ~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~~ 404 (648) T protein:vir:79 325 QIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTASRSTGDNLSSDFKDRIKALQKVMATFINEFMVKEILMEGGFDPVLNP 404 (648) T ss_pred HHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 89999999999999999999999999999888888999888888988888887766655554332 2211 112 Q ss_pred cceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccccccccccccc Q lcl|NC_019456. 331 GFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDAS 410 (435) Q Consensus 331 g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~ 410 (435) .++++|+++++++.|.+++++.+.+++++|+||+||+|+++|+||+|+.+ +..++..++.+.............++... T Consensus 405 d~~ieF~~~~Llr~D~~~~a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 483 (648) T protein:vir:79 405 DDKVEFRFNEIDMDSKIKLENQAVFLYEHNAISEDEMRELIGRDPVDDGE-GRAKMHLQMVTIAQATALAALAPTPAGGS 483 (648) T ss_pred cceEEEeecccchhhHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC-CccccccccccchhccccccCCCCCCCCC Confidence 45789999999999999999999999999999999999999999998543 44556677766554332222222222111 Q ss_pred ccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 411 VAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 411 ~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) ..+...++.....++.+.+++..|. T Consensus 484 ~~~a~~eg~~~e~~~~~~~~~~~g~ 508 (648) T protein:vir:79 484 SASASGDKKKKATDNKTKPTNQHGT 508 (648) T ss_pred CCCccccccccccCCCCCCCCCCCc Confidence 1122222222222333334444443 No 83 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=100.00 E-value=3.4e-70 Score=401.42 Aligned_cols=408 Identities=13% Similarity=0.079 Sum_probs=283.9 Q ss_pred CchHH-----HHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc Q lcl|NC_019456. 1 MSFMS-----KVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY 75 (435) Q Consensus 1 Mg~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~ 75 (435) |++.+ .+++. ..+...... .+..+...+ .........+..+++|++||++||+.||++|++++.+. T Consensus 6 ~~~~~~~~~~~~~~~----~~~~~~~~~---~~~~~~~pp--~~~~~La~~~~~n~~v~scI~~ia~~ia~~~~~i~~~~ 76 (540) T protein:vir:41 6 LSIKSLEKYRAIKGD----TDSQALKED---RFEEYVEPK--VHPLVLLSLLQVNPYHASACSIKANDILRTGYLIDGDD 76 (540) T ss_pred cChhhccchhhhhcc----ccccccccC---CCCccccCC--CCHHHHHHHHHhcHHHHHHHHHHHHHHhcCCceEecCc Confidence 77754 22221 111111111 111111111 11112245677899999999999999999999997653 Q ss_pred cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCce------- Q lcl|NC_019456. 76 KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNS------- 148 (435) Q Consensus 76 ~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~------- 148 (435) . .+.. ..||++||+.+||+.++.+++++||||++++++. .|+|++|+||+|.+|++..+..+.. T Consensus 77 ~-----~~~~---~lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~-~G~~~~L~~i~~~~V~v~~~~~~~~~~~d~~~ 147 (540) T protein:vir:41 77 G-----GVEE---LLRACRPSFEFILLQALEDLQVFNYCTLEVVRDD-QGEPVRLDYIPAHTVRVHRDGSRYMQTWDGIH 147 (540) T ss_pred c-----chhh---hccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECC-CCcEEEEEEeCCcceEEeEcCceeEeeecCce Confidence 2 3322 3499999999999999999999999999999875 5899999999999999876654311 Q ss_pred EEEEE----------ecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEe Q lcl|NC_019456. 149 YWYRV----------TSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQY 216 (435) Q Consensus 149 ~~~~~----------~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~ 216 (435) .+|+. ........|++++|||+|.+++.++++|+||+.++...+..+.++++++.++|+|| +++++++ T Consensus 148 ~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~ 227 (540) T protein:vir:41 148 VTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITV 227 (540) T ss_pred eeeeecccccceeeccccccceeecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEe Confidence 11111 11223457899999999998889999999999999999999999999999999998 6789998 Q ss_pred CCcCCHHH------H----HHHHHHHHHH----hcCCCcccccc------CCceeeeccCChhhHHHHHHHHHHHHHHHH Q lcl|NC_019456. 217 DRSISPEK------R----QAMVNDFLRM----VKENGGAVVQE------AGWKVDRYESKFEPADLSSVEQISRIRIAT 276 (435) Q Consensus 217 ~~~~~~e~------~----~~~~~~~~~~----~~~~~~~~vl~------~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~ 276 (435) ++.+++++ . +++++.|... .+|+|+++||+ +|++|+++++++.++||.+++++++++||+ T Consensus 228 ~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~ 307 (540) T protein:vir:41 228 TGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAA 307 (540) T ss_pred CcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcccceeEEecccchhHHHHHHHHHHHHHHHHH Confidence 87766543 2 3344444433 24678899884 799999999999999999999999999999 Q ss_pred HhCCCHHHhCCcccC--cccHHHHH-HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHH Q lcl|NC_019456. 277 AFNVPISFLNDDQAK--STTNVEHV-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYY 353 (435) Q Consensus 277 ~fgvP~~~lg~~~~~--~~~~~e~~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~ 353 (435) +|||||.+||..+.+ +++|.|++ ..|+++||.|+++.|+++|+++|++.. ..+++|+|+.+.+++.|.++ .+ T Consensus 308 afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~--~~~~~i~f~~~~ll~~D~~~---~~ 382 (540) T protein:vir:41 308 AHMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQEIVSSVLTDFIQLKL--DPGARFVFNEEILMESEFVH---NY 382 (540) T ss_pred HhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc--CCceEEEecchhhcchHHHH---HH Confidence 999999999987654 45677665 556789999999999999999988754 35789999999999887554 46 Q ss_pred HHHHhcCCcCHHHHHHHh-CCCCCCCcCCceeeecccccchhcccccccccccc-ccccccccccCCCCCCCCCCCCCCC Q lcl|NC_019456. 354 QTLTRNGIFKPNEIRELE-GQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQ-TDASVAAPKQEGGENTNENGLQSTE 431 (435) Q Consensus 354 ~~~~~~g~~t~NE~R~~~-g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (435) .+++++|++|+||+|+.+ |++| ++|.++.|.|+...+........+..+ .+........+..-+...+.+.+.+ T Consensus 383 ~~lv~~G~lT~NE~Re~L~g~e~----gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~ 458 (540) T protein:vir:41 383 ALLVQCGVLTPSEVREKLFGLDG----GPDMFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLE 458 (540) T ss_pred HHHHhCCCCCHHHHHHHhCcCcC----CCcccccccccccccccccccccCCCCccccccccchhcccccCccccccccc Confidence 678999999999999854 5543 557777787775533222111111100 0000000000111111111222222 Q ss_pred CCCC Q lcl|NC_019456. 432 PEGS 435 (435) Q Consensus 432 ~~~~ 435 (435) +.++ T Consensus 459 ~~~~ 462 (540) T protein:vir:41 459 DKKK 462 (540) T ss_pred cccc Confidence 2222 No 84 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=100.00 E-value=1.1e-69 Score=398.57 Aligned_cols=388 Identities=15% Similarity=0.102 Sum_probs=283.4 Q ss_pred cHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccc--------ccccchHHHhhhccccccC--------CHHHHHHHHHH Q lcl|NC_019456. 43 REHILESNEYIFSIVTRLSNVLASLPLHEYQNYK--------QMDNEPLADLLKTSPNPNM--------TAFEFIARLET 106 (435) Q Consensus 43 ~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~--------~~~~~~l~~~l~~~Pn~~~--------~~~~f~~~~~~ 106 (435) -...+..+++|++||++||+.||++|++++.... ....+....++..+||+.| ++.+||+.++. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 2345667899999999999999999999974321 1112223345566788755 67789999999 Q ss_pred HHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCc-------eEEEEE----------------------ecCC Q lcl|NC_019456. 107 DRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNN-------SYWYRV----------------------TSDI 157 (435) Q Consensus 107 ~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~-------~~~~~~----------------------~~~~ 157 (435) +++++||+|++++++. .|.|++|+||+|.+|++..+..+. ..++.+ ...+ T Consensus 81 ~l~l~Gn~~i~~~r~~-~G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (467) T protein:vir:31 81 DYEAIGWLTIEILTQT-DGTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDPVFVDADDGSTG 159 (467) T ss_pred HHHhcCCeEEEEEECC-CCcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccceeeeeeeecccccc Confidence 9999999999999874 489999999999999987765421 111111 1224 Q ss_pred eeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeC-CcCCHHHHHHHHHHHHH Q lcl|NC_019456. 158 YNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYD-RSISPEKRQAMVNDFLR 234 (435) Q Consensus 158 ~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~-~~~~~e~~~~~~~~~~~ 234 (435) ....+++++|||++.+++.++++|+||+.++...+..+.+++++++++|+|| +++++.++ +.+++++.+++++.|.. T Consensus 160 ~~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~ 239 (467) T protein:vir:31 160 TSVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGAELTEKGREEMRNLIED 239 (467) T ss_pred ceeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHh Confidence 4567999999999998888999999999999999999999999999999998 56888764 57999999999999976 Q ss_pred Hhc--------------CCCccccccCCceeeecc--------CChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCc Q lcl|NC_019456. 235 MVK--------------ENGGAVVQEAGWKVDRYE--------SKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKS 292 (435) Q Consensus 235 ~~~--------------~~~~~~vl~~g~~~~~~~--------~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~ 292 (435) ..+ +++++++++.|.++.+++ .++.|+||.+++++++++||++|||||.+||..+.++ T Consensus 240 ~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~ 319 (467) T protein:vir:31 240 NNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGA 319 (467) T ss_pred hhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCC Confidence 443 456778887776555543 3578999999999999999999999999999887766 Q ss_pred c-cHHHH-HHHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHH Q lcl|NC_019456. 293 T-TNVEH-VTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIREL 370 (435) Q Consensus 293 ~-~~~e~-~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~ 370 (435) + ++.++ ...|++++|.|++..|+++|+++|++......+++++|+++.+++.|.++++++++.++++|++|+||+|++ T Consensus 320 ~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~ 399 (467) T protein:vir:31 320 FSTDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDE 399 (467) T ss_pred cccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 5 45554 566779999999999999999999998888889999999999999999999999999999999999999999 Q ss_pred hCCCCCCCcCCceeeecccccchhccccccc--cccccccccccccccC---CCCCCCCCCCCCCCC--CCC Q lcl|NC_019456. 371 EGQAPIPDEAADHLYISKDLYPLDKYYDAIL--DNKIQTDASVAAPKQE---GGENTNENGLQSTEP--EGS 435 (435) Q Consensus 371 ~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~--~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~--~~~ 435 (435) +|+||+++ ..+.+.+.......+.... ..+....+....+..+ .-.++-+..+..+.. ..| T Consensus 400 ~Gl~pi~d----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (467) T protein:vir:31 400 FGFEPFPE----EHVYGGETLVAEVTGGSGPGGGIGDQIEQLVEDRADEIIDSYQADLETEQLIEIGANADS 467 (467) T ss_pred hCCCCCCc----ccccCCcccccccccccCCCCcccCcCCCCCCCcccchHhhhhhccccchhhhhccccCC Confidence 99999853 2333332221111110000 0000000000000000 000000111111111 111 No 85 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=100.00 E-value=9.2e-69 Score=393.58 Aligned_cols=420 Identities=13% Similarity=0.115 Sum_probs=287.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhc-cccccC--c-c----cccHHHHhhhHHHHHHHHHHHHHHhh------ Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMA-GVKLEQ--A-T----FSREHILESNEYIFSIVTRLSNVLAS------ 66 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~--~-~----~~~~~~~~~~~~v~~~i~~ia~~ia~------ 66 (435) +.++++-.+ ...++ ..+ +.....+.. |....+ . . ...-.++..+++|.+||+.+++.||. T Consensus 43 ~~~~~~~~~---~~~~a--~~~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~ 116 (563) T protein:vir:99 43 YQDLTKSLY---GQQQA--YAE-PFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPAR 116 (563) T ss_pred HHHHHhhhc---cCCCc--chh-hhHhhhcccccccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhh Confidence 333333221 11111 111 111111111 111111 1 1 11123455678999999999999995 Q ss_pred -------Cceeeeeccc------ccccchHHHhh---hcccccc-CCHHHHHHHHHHHHHhcCCcceEEee-eCCCCcEE Q lcl|NC_019456. 67 -------LPLHEYQNYK------QMDNEPLADLL---KTSPNPN-MTAFEFIARLETDRNVSGNGYAWIQK-SLSTGEPI 128 (435) Q Consensus 67 -------~~~~~~~~~~------~~~~~~l~~~l---~~~Pn~~-~~~~~f~~~~~~~~~~~G~~~~~i~~-~~~~g~~~ 128 (435) +++++++... ....|++..+| +..|+++ +|+.+|++.++.+++++||+|+++++ +...|+|+ T Consensus 117 ~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~ 196 (563) T protein:vir:99 117 YSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLE 196 (563) T ss_pred hhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceE Confidence 4666665322 12224443333 1233333 58899999999999999999998764 23458999 Q ss_pred EEEEeCCceeEEEEcCCCceE-----EEEEecCCeeEEEchhheEEeccCCCcc---ccccCcHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 129 ALWPLDPNTVSILRNTDNNSY-----WYRVTSDIYNFTIPINDVIHVKHVVPSN---SWYGVSPIDVLSSSLKFQRSVEN 200 (435) Q Consensus 129 ~l~~l~~~~v~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~iih~~~~~~~~---~~~G~s~l~~~~~~i~~~~~~~~ 200 (435) +||||+|.+|++..+.++..+ |+++..++....|++++|||++.....+ +.+|+||+.++...|..+.++++ T Consensus 197 ~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~ 276 (563) T protein:vir:99 197 KFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTES 276 (563) T ss_pred EEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHH Confidence 999999999999999887654 3455555666789999988766544433 77899999999999999999999 Q ss_pred HHHHHhhcC--CceEEEeCC--cCCHHHHHHHHHHHHHHh---cCCCcc-ccccCCceeeeccCChhhHHHHHHHHHHHH Q lcl|NC_019456. 201 FSQNEMEKK--DKFVLQYDR--SISPEKRQAMVNDFLRMV---KENGGA-VVQEAGWKVDRYESKFEPADLSSVEQISRI 272 (435) Q Consensus 201 ~~~~~~~n~--~~~~~~~~~--~~~~e~~~~~~~~~~~~~---~~~~~~-~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~ 272 (435) ++.++|.|| +++++++++ .+++++.+++++.|.... .|+|++ +|+++|++|+++++++.++||+++++++++ T Consensus 277 ~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~ 356 (563) T protein:vir:99 277 FNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLIN 356 (563) T ss_pred HHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHH Confidence 999999997 678888764 479999999999998765 456775 789999999999999999999999999999 Q ss_pred HHHHHhCCCHHHhCCcccC-----------cccHHHHH-HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhh Q lcl|NC_019456. 273 RIATAFNVPISFLNDDQAK-----------STTNVEHV-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNG 340 (435) Q Consensus 273 ~Ia~~fgvP~~~lg~~~~~-----------~~~~~e~~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~ 340 (435) +||++|||||.+||..+.+ +++|.+++ ..|+++||.||+..|+++|+++|++... .++.|+ T Consensus 357 ~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~----~~~~~~--- 429 (563) T protein:vir:99 357 IISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYG----DKYTFQ--- 429 (563) T ss_pred HHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcc----cccEEE--- Confidence 9999999999999986654 33455554 4577999999999999999999997543 234443 Q ss_pred hhccCHHHHHHHHH--HHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccc--------- Q lcl|NC_019456. 341 LLRGDTAARTQYYQ--TLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDA--------- 409 (435) Q Consensus 341 l~~~d~~~~~~~~~--~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~--------- 409 (435) +++.|.+++++.++ +++++|+||+||+|+++|+||+ +|||+++++.++.+++............... T Consensus 430 f~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi--~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (563) T protein:vir:99 430 FVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPI--EGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLL 507 (563) T ss_pred eccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCC--CCcceeecccccccccccccccCCCccccchhhhhccccc Confidence 35667777777654 4688999999999999999999 5899999999988877654332221111000 Q ss_pred --cccccccCCCCCCCCCCCCCC-----CCC--------CC Q lcl|NC_019456. 410 --SVAAPKQEGGENTNENGLQST-----EPE--------GS 435 (435) Q Consensus 410 --~~~~~~~~~~~~~~~~~~~~~-----~~~--------~~ 435 (435) ....++.++..+++++..+.+ +.+ |+ T Consensus 508 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 548 (563) T protein:vir:99 508 EGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSN 548 (563) T ss_pred CCCCCCCCCCCCCCCCCCccccccccccccccccccccCcc Confidence 000111111111111000000 000 00 No 86 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=100.00 E-value=9.2e-69 Score=393.58 Aligned_cols=420 Identities=13% Similarity=0.115 Sum_probs=287.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhc-cccccC--c-c----cccHHHHhhhHHHHHHHHHHHHHHhh------ Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMA-GVKLEQ--A-T----FSREHILESNEYIFSIVTRLSNVLAS------ 66 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~--~-~----~~~~~~~~~~~~v~~~i~~ia~~ia~------ 66 (435) +.++++-.+ ...++ ..+ +.....+.. |....+ . . ...-.++..+++|.+||+.+++.||. T Consensus 43 ~~~~~~~~~---~~~~a--~~~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~ 116 (563) T protein:vir:95 43 YQDLTKSLY---GQQQA--YAE-PFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPAR 116 (563) T ss_pred HHHHHhhhc---cCCCc--chh-hhHhhhcccccccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhh Confidence 333333221 11111 111 111111111 111111 1 1 11123455678999999999999995 Q ss_pred -------Cceeeeeccc------ccccchHHHhh---hcccccc-CCHHHHHHHHHHHHHhcCCcceEEee-eCCCCcEE Q lcl|NC_019456. 67 -------LPLHEYQNYK------QMDNEPLADLL---KTSPNPN-MTAFEFIARLETDRNVSGNGYAWIQK-SLSTGEPI 128 (435) Q Consensus 67 -------~~~~~~~~~~------~~~~~~l~~~l---~~~Pn~~-~~~~~f~~~~~~~~~~~G~~~~~i~~-~~~~g~~~ 128 (435) +++++++... ....|++..+| +..|+++ +|+.+|++.++.+++++||+|+++++ +...|+|+ T Consensus 117 ~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~ 196 (563) T protein:vir:95 117 YSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLE 196 (563) T ss_pred hhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceE Confidence 4666665322 12224443333 1233333 58899999999999999999998764 23458999 Q ss_pred EEEEeCCceeEEEEcCCCceE-----EEEEecCCeeEEEchhheEEeccCCCcc---ccccCcHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 129 ALWPLDPNTVSILRNTDNNSY-----WYRVTSDIYNFTIPINDVIHVKHVVPSN---SWYGVSPIDVLSSSLKFQRSVEN 200 (435) Q Consensus 129 ~l~~l~~~~v~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~iih~~~~~~~~---~~~G~s~l~~~~~~i~~~~~~~~ 200 (435) +||||+|.+|++..+.++..+ |+++..++....|++++|||++.....+ +.+|+||+.++...|..+.++++ T Consensus 197 ~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~ 276 (563) T protein:vir:95 197 KFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTES 276 (563) T ss_pred EEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHH Confidence 999999999999999887654 3455555666789999988766544433 77899999999999999999999 Q ss_pred HHHHHhhcC--CceEEEeCC--cCCHHHHHHHHHHHHHHh---cCCCcc-ccccCCceeeeccCChhhHHHHHHHHHHHH Q lcl|NC_019456. 201 FSQNEMEKK--DKFVLQYDR--SISPEKRQAMVNDFLRMV---KENGGA-VVQEAGWKVDRYESKFEPADLSSVEQISRI 272 (435) Q Consensus 201 ~~~~~~~n~--~~~~~~~~~--~~~~e~~~~~~~~~~~~~---~~~~~~-~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~ 272 (435) ++.++|.|| +++++++++ .+++++.+++++.|.... .|+|++ +|+++|++|+++++++.++||+++++++++ T Consensus 277 ~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~ 356 (563) T protein:vir:95 277 FNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLIN 356 (563) T ss_pred HHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHH Confidence 999999997 678888764 479999999999998765 456775 789999999999999999999999999999 Q ss_pred HHHHHhCCCHHHhCCcccC-----------cccHHHHH-HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhh Q lcl|NC_019456. 273 RIATAFNVPISFLNDDQAK-----------STTNVEHV-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNG 340 (435) Q Consensus 273 ~Ia~~fgvP~~~lg~~~~~-----------~~~~~e~~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~ 340 (435) +||++|||||.+||..+.+ +++|.+++ ..|+++||.||+..|+++|+++|++... .++.|+ T Consensus 357 ~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~----~~~~~~--- 429 (563) T protein:vir:95 357 IISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYG----DKYTFQ--- 429 (563) T ss_pred HHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcc----cccEEE--- Confidence 9999999999999986654 33455554 4577999999999999999999997543 234443 Q ss_pred hhccCHHHHHHHHH--HHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccc--------- Q lcl|NC_019456. 341 LLRGDTAARTQYYQ--TLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDA--------- 409 (435) Q Consensus 341 l~~~d~~~~~~~~~--~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~--------- 409 (435) +++.|.+++++.++ +++++|+||+||+|+++|+||+ +|||+++++.++.+++............... T Consensus 430 f~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi--~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (563) T protein:vir:95 430 FVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPI--EGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLL 507 (563) T ss_pred eccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCC--CCcceeecccccccccccccccCCCccccchhhhhccccc Confidence 35667777777654 4688999999999999999999 5899999999988877654332221111000 Q ss_pred --cccccccCCCCCCCCCCCCCC-----CCC--------CC Q lcl|NC_019456. 410 --SVAAPKQEGGENTNENGLQST-----EPE--------GS 435 (435) Q Consensus 410 --~~~~~~~~~~~~~~~~~~~~~-----~~~--------~~ 435 (435) ....++.++..+++++..+.+ +.+ |+ T Consensus 508 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 548 (563) T protein:vir:95 508 EGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSN 548 (563) T ss_pred CCCCCCCCCCCCCCCCCCccccccccccccccccccccCcc Confidence 000111111111111000000 000 00 No 87 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=100.00 E-value=3e-69 Score=396.26 Aligned_cols=413 Identities=12% Similarity=0.071 Sum_probs=283.4 Q ss_pred hHH---HHHhhcccccccc-cccccc--chhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 3 FMS---KVRQFFGVHDQAN-QIVQNP--IPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 3 ~~~---~~~~~~~~~~~~~-~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) +|+ .|+......+... +..+.. ...+..+...+ .........+..+++|++||++||++||++||+++++.. T Consensus 1 ~~~~~~~i~s~~~~~~i~~~~~~s~~~~~~~~~~~~~pp--~~~~~la~l~~~n~~v~scI~~ia~~IA~l~~~~~~~~~ 78 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKREEVESQALGETRFEEYVEPK--VNPLVLLSLLQVNPYHASACSIKANDIIRTGYILEGDDE 78 (542) T ss_pred CccccccccccccchhhhhccccccccccccCCccccCC--CCHHHHHHHHhhcHHHHHHHHHHHHHHhhCceeeecccc Confidence 343 1111111111100 100000 00000010000 011111244667899999999999999999999976644 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEE----- Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWY----- 151 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~----- 151 (435) .. +.+..||++||+++||+.++.+++++||||++++++.. |+|++|+||+|..|++..+.++...++ T Consensus 79 ~~-------l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~-G~~~~L~~l~~~~v~v~~d~~~~~~~~~~~~~ 150 (542) T protein:vir:41 79 GV-------VDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDR-GDPIRFEYIPSHTIRVHKDGSRYRQTWDGVNI 150 (542) T ss_pred hh-------hhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC-CcEEEEEEEcCcceEEEEcCCeeEeeecCCcc Confidence 32 23446999999999999999999999999999998754 899999999999999987765432211 Q ss_pred ----EEe--------cCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeC Q lcl|NC_019456. 152 ----RVT--------SDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYD 217 (435) Q Consensus 152 ----~~~--------~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~ 217 (435) .+. .+.....++++||||+|.+++.++++|+||+..+...+....++++++.++|.|| ++++++++ T Consensus 151 ~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir~~~~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~ 230 (542) T protein:vir:41 151 THFKDYRYEGEINPETGEDQDSVGANELVFIHIPSPVCSYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVT 230 (542) T ss_pred eeEEeecccccccccccccccccCcccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC Confidence 111 1112245889999999998888999999999999999999999999999999998 56888776 Q ss_pred C----------cCCHHHHHHHHHHHHHHh----cCCCcccccc------CCceeeeccCChhhHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 218 R----------SISPEKRQAMVNDFLRMV----KENGGAVVQE------AGWKVDRYESKFEPADLSSVEQISRIRIATA 277 (435) Q Consensus 218 ~----------~~~~e~~~~~~~~~~~~~----~~~~~~~vl~------~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~ 277 (435) + .+++++.+++++.|...+ +|+|+++||+ +|++|+++++++.++||.+.+++++++||++ T Consensus 231 ~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~a 310 (542) T protein:vir:41 231 GEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAA 310 (542) T ss_pred CccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHH Confidence 4 457788899999986543 4667888884 7999999999999999999999999999999 Q ss_pred hCCCHHHhCCcccCcc--cHHHH-HHHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHH Q lcl|NC_019456. 278 FNVPISFLNDDQAKST--TNVEH-VTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQ 354 (435) Q Consensus 278 fgvP~~~lg~~~~~~~--~~~e~-~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~ 354 (435) |||||.+||..+.+++ ++.|+ ...|+++||.|++..|+++|+++|+++.+ .+++++|+.+.+++.|.. +.+. T Consensus 311 fgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~--~~~~~~f~~~~ll~~d~~---~~~~ 385 (542) T protein:vir:41 311 HMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQNIISSILTDFFQVKFN--PKTRFKFNDETLLESDSV---RNCA 385 (542) T ss_pred hCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccC--CceEEEecchhhcchHHH---HHHH Confidence 9999999999876643 56654 56677999999999999999999887654 468999999999887744 4567 Q ss_pred HHHhcCCcCHHHHHHHh-CCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 355 TLTRNGIFKPNEIRELE-GQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPE 433 (435) Q Consensus 355 ~~~~~g~~t~NE~R~~~-g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (435) .++++|++|+||+|+.+ |++| ++|.++.+.|+.. ................+......++++.-++..+++...+ T Consensus 386 ~~v~~GilT~NE~Re~L~g~~p----gdd~~l~p~~~~~-~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~ 460 (542) T protein:vir:41 386 LLVQSGVLTPAEARERLFGLDG----GPDIFMVPSKGAA-KSVKRQERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAE 460 (542) T ss_pred HHHhCCCCCHHHHHHhhCCCCC----CCccccccccccc-cccccCCcCCCCCchhhhhhcccccCccccccccccccch Confidence 78999999999999854 5554 3455556666432 2221111111111101111111122221222111111111 Q ss_pred CC Q lcl|NC_019456. 434 GS 435 (435) Q Consensus 434 ~~ 435 (435) -+ T Consensus 461 ~~ 462 (542) T protein:vir:41 461 EK 462 (542) T ss_pred hh Confidence 11 No 88 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=100.00 E-value=3.2e-67 Score=385.15 Aligned_cols=431 Identities=12% Similarity=0.096 Sum_probs=303.7 Q ss_pred CchHH-HHHhh----ccc--c---ccccccccccchhhhhh-ccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCce Q lcl|NC_019456. 1 MSFMS-KVRQF----FGV--H---DQANQIVQNPIPQPLDM-AGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPL 69 (435) Q Consensus 1 Mg~~~-~~~~~----~~~--~---~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~ 69 (435) |.=-+ .+++. -++ . .++....+.+...++.- ..+.+.+....-...+..++++++||+++++.||.++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~L~~~~e~~~~~~~~i~~~~~~iag~g~ 80 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLAKSPNSTQIPDHRIQSHNVGVNPPYNPDRLAAFLELNETLATGIRKKSRYEVGFGF 80 (651) T ss_pred CCCccceeeeeEEEeecccccccccccccccccchhhhcccCCCCCCCCCHHHHHHHHhcChHHHHHHHHHhhhhhccCc Confidence 33221 11110 000 0 00001111111112111 11222222222223345689999999999999999999 Q ss_pred eeee----ccccc-------------ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEE Q lcl|NC_019456. 70 HEYQ----NYKQM-------------DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWP 132 (435) Q Consensus 70 ~~~~----~~~~~-------------~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~ 132 (435) .+.. +.... ..++.+......+|+.+++.++++.++.+++.+|++|+.+++++. |+|..|+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~-g~pv~L~~ 159 (651) T protein:vir:99 81 DLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIE-GRPVGLAY 159 (651) T ss_pred eeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCc-cchhhhhh Confidence 8753 11110 123444455567899999999999999999999999999998755 78888988 Q ss_pred eCCceeEEEEcCCC--------------------------------ce-------------------------------- Q lcl|NC_019456. 133 LDPNTVSILRNTDN--------------------------------NS-------------------------------- 148 (435) Q Consensus 133 l~~~~v~~~~~~~~--------------------------------~~-------------------------------- 148 (435) +++..+++..+... .. T Consensus 160 lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~~~~d~~ 239 (651) T protein:vir:99 160 VPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIRYREDEE 239 (651) T ss_pred cChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEEeccCcc Confidence 88876654322100 00 Q ss_pred ------------EEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEE Q lcl|NC_019456. 149 ------------YWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVL 214 (435) Q Consensus 149 ------------~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~ 214 (435) ..+.....+...+++++||||||++++.++++|+||+..+..++..+.++++++.++|.|| +++++ T Consensus 240 ~~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG~~p~gil 319 (651) T protein:vir:99 240 SEREPIFVDRETGDVTTGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRDFFDNDTIPRMVI 319 (651) T ss_pred eeeeeecccceeeeEEEcCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE Confidence 0011222334456889999999998888999999999999999999999999999999997 67899 Q ss_pred EeCC-cCCHHHHHHHHHHHHHHhcCCCccccccC-----------CceeeeccCCh-hhHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019456. 215 QYDR-SISPEKRQAMVNDFLRMVKENGGAVVQEA-----------GWKVDRYESKF-EPADLSSVEQISRIRIATAFNVP 281 (435) Q Consensus 215 ~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~vl~~-----------g~~~~~~~~~~-~~~~~~e~~~~~~~~Ia~~fgvP 281 (435) ++++ .+++++.+++++.|+...+|+|+++||+. |++|++++.++ .|+||.|++++++++||++|||| T Consensus 320 ~~~~~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~eIa~afgVP 399 (651) T protein:vir:99 320 KVTGGELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHEIAKVLEVP 399 (651) T ss_pred EecCCCCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHHHHHHHHHhCCC Confidence 8864 59999999999999999999999988865 99999999876 59999999999999999999999 Q ss_pred HHHhCCcccCcccHHHHHH-HHHHHHHhHHHHHHHHHHHHhhcccccccCcc--eeeechhhhhccCHHHHHHHHHHHHh Q lcl|NC_019456. 282 ISFLNDDQAKSTTNVEHVT-HSWTMTLMPIIRQYESQFNMKLFTPGKRVKGF--YFSFNVNGLLRGDTAARTQYYQTLTR 358 (435) Q Consensus 282 ~~~lg~~~~~~~~~~e~~~-~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~--~i~fd~~~l~~~d~~~~~~~~~~~~~ 358 (435) |.+||..+.++++|+|++. .|+++||.|++..|+++||++|+++.+...++ +++|+.+.+++.|.+++++.+..+++ T Consensus 400 p~~lG~~~~~~~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef~~~~llr~D~~~~~e~~~~~i~ 479 (651) T protein:vir:99 400 PVKIGVTDSANRSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALGVTDWTIEYELRGADQPKQEAQLAEQRVRAMRL 479 (651) T ss_pred HHHhccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCceEEEEeccchhhhccHHHHHHHHHHHHh Confidence 9999999999999987654 56789999999999999999999998887776 45677778999999999999999999 Q ss_pred cCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCC---C- Q lcl|NC_019456. 359 NGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPE---G- 434 (435) Q Consensus 359 ~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~- 434 (435) +|+||+||+|+++|+||+++++||.++.+.++...+.. ...+..++..++...+..+..+.+.....-...| - T Consensus 480 ~G~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~---~~gge~~~~~~~~~~~~~~~~e~~~~~~~~~~~e~~~~~ 556 (651) T protein:vir:99 480 AGVGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDV---AGGGETEAVHEPPEENKIGEREWDTVKSELTTKDPIEQM 556 (651) T ss_pred CCCcCHHHHHHHhCCCCCCCcccccccccccccccccc---ccCCCCcccccCccccccccchhhhhhhhhcccchhhhh Confidence 99999999999999999999999998877665543321 1111111111111111111111110000000000 0 Q ss_pred ---C Q lcl|NC_019456. 435 ---S 435 (435) Q Consensus 435 ---~ 435 (435) | T Consensus 557 ~v~s 560 (651) T protein:vir:99 557 QFSS 560 (651) T ss_pred hHHH Confidence 0 No 89 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=100.00 E-value=8.2e-64 Score=366.43 Aligned_cols=276 Identities=35% Similarity=0.613 Sum_probs=256.9 Q ss_pred HhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEc Q lcl|NC_019456. 64 LASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRN 143 (435) Q Consensus 64 ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~ 143 (435) ||+|||+++++++. .+|+++++|+.+||++||+.+||+.++.+++++||||++++++ ..|++++|||++|++|++..+ T Consensus 1 ia~l~~~~~~~~~~-~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~-~~G~~~~l~~l~~~~v~v~~~ 78 (278) T protein:vir:78 1 MASLPLKMYEDYKV-VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD-IYHQPSKLFLLNPDVVEMLIE 78 (278) T ss_pred CccceeEEEecCcc-cccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEEC-CCCcEEEEEEECCceeEEEEc Confidence 99999999987654 4689999999999999999999999999999999999999987 558999999999999999998 Q ss_pred CCCceEEEEEec-CCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCH Q lcl|NC_019456. 144 TDNNSYWYRVTS-DIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISP 222 (435) Q Consensus 144 ~~~~~~~~~~~~-~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~ 222 (435) .++...+|.+.. +|...+|+++||||++++++.++++|+|++.++..++..+.++++++...+.+++++++..++.+++ T Consensus 79 ~~~~~~~y~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~ 158 (278) T protein:vir:78 79 NQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGSNVGK 158 (278) T ss_pred CCCceEEEEEEcCCceEEEEccccEEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeCCCCCH Confidence 888777665554 4567899999999999988889999999999999999999999999988888899999999999999 Q ss_pred HHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHH-HHH Q lcl|NC_019456. 223 EKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEH-VTH 301 (435) Q Consensus 223 e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~-~~~ 301 (435) |+.++++++|+...+++|+++++++|++|+++++++.++|+.+.+++++++||++|||||.++|..+.++++|.++ ... T Consensus 159 e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~~ 238 (278) T protein:vir:78 159 EKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRF 238 (278) T ss_pred HHHHHHHHHHHHHhccCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH Confidence 9999999999999999999999999999999999999999999999999999999999999999999899998766 456 Q ss_pred HHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhh Q lcl|NC_019456. 302 SWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGL 341 (435) Q Consensus 302 ~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l 341 (435) |+++||+|+++.|+++|+++||++.++..|++|+||++.| T Consensus 239 ~~~~~l~P~~~~i~~~ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 239 YLQHTLLPIVKQYEEEFNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred HHHHHHHHHHHHHHHHHHhhcCChhHhcCCceEEEecccC Confidence 7799999999999999999999999999999999999999 No 90 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=100.00 E-value=2.2e-51 Score=298.34 Aligned_cols=336 Identities=14% Similarity=0.068 Sum_probs=238.0 Q ss_pred CchHHHHH--hhccccccccccccc--------cchhhhhhccccccCcc---------cccHHHHhhhHHHHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVR--QFFGVHDQANQIVQN--------PIPQPLDMAGVKLEQAT---------FSREHILESNEYIFSIVTRLS 61 (435) Q Consensus 1 Mg~~~~~~--~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~v~~~i~~ia 61 (435) |+=..+-. +......+++..... .....|. +|.+..... ....+.+.+.|.-+.|+..+. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~~~ 79 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPTEHHTDRAAQAEVFS-FGDPVEVLDRRELLDYVECMRMGQWYEPPMPWDGLARSF 79 (368) T ss_pred CCccccccchhccCcccccccccCcchhhccccCceEEEE-cCCceeecchhhHHHHHHHHhccchhccCcCHHHHHHHH Confidence 66543211 111111111100000 0001111 111110000 000011223333333333322 Q ss_pred HHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEE Q lcl|NC_019456. 62 NVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSIL 141 (435) Q Consensus 62 ~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~ 141 (435) ...+ +........|+++.+ +.+||++||+++|++ ++.+++++||||++++++. .|+|++|+|++|..|++. T Consensus 80 ---~~~~---~h~~~~~~~~n~l~l-~~~Pn~~~t~~~f~~-l~~d~ll~Gnay~~~~r~~-~G~~~~L~~l~~~~v~~~ 150 (368) T protein:vir:79 80 ---RAAA---HHSSAVYVKRNILVS-TFIPHPLLSRATFER-LVLDWQVFGNAYLERRENV-LGGTIRLDTPLAKYVRRG 150 (368) T ss_pred ---hhcc---ccchhhhhhcchhhh-hcCCCcCCCHHHHHH-HHHHHhhcCCeEEEEEEcC-CCCEEEEEEeCcccceee Confidence 2222 112233345777654 569999999999975 7899999999999999875 589999999999999876 Q ss_pred EcCCCceEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeC-C Q lcl|NC_019456. 142 RNTDNNSYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYD-R 218 (435) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~-~ 218 (435) .+.+ .++++..++...+|++++|||++.+++.++++|+|++..+...+....+++.+.+++|.|| +.+++.++ . T Consensus 151 ~~~~---~~~~~~~~~~~~~~~~~dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~~ 227 (368) T protein:vir:79 151 LDLN---TYFFVQNWQQPYTFAAGSVFHLQEPDINQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMTDA 227 (368) T ss_pred ccCC---EEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC Confidence 6543 3445566778889999999999998889999999999999999999999999999999998 56788765 5 Q ss_pred cCCHHHHHHHHHHHHHH--hcCCCccccc-----cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccC Q lcl|NC_019456. 219 SISPEKRQAMVNDFLRM--VKENGGAVVQ-----EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAK 291 (435) Q Consensus 219 ~~~~e~~~~~~~~~~~~--~~~~~~~~vl-----~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~ 291 (435) .+++|+.++++++|+.. ..|+++++|+ ++|++|++++.++.++||.+++++++++||++|||||.++|+...+ T Consensus 228 ~l~~e~~~~lk~~~~~~~G~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~ 307 (368) T protein:vir:79 228 AQKQEDVDTLREAMKSAKGPGNFRNLFMYAPNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNN 307 (368) T ss_pred CCCHHHHHHHHHHHHHhcCCcccCceeEecCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCC Confidence 79999999999999763 3567888888 6899999999999999999999999999999999999999987654 Q ss_pred c--ccHHHHH-HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhc Q lcl|NC_019456. 292 S--TTNVEHV-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRN 359 (435) Q Consensus 292 ~--~~~~e~~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~ 359 (435) + ++|.|++ ..|++++|.|++..|+ +++.+|.. .+++|+...|++.|.+.++.....- . T Consensus 308 t~~~sn~e~~~~~f~~~~l~Pl~~~ie-~ln~~l~~-------e~~rF~~~~l~~~D~~a~a~~~~rs--a 368 (368) T protein:vir:79 308 TGGFGDVEKAAMVFARNEVKPLQDRLL-AINDWIGD-------EVVRFAPYALGGHDQPAAAPGGQRS--A 368 (368) T ss_pred CCccccHHHHHHHHHHHHHHHHHHHHH-HHHhccCc-------ceeeechhHhhcccccccCCccccc--C Confidence 3 6677654 5677899999999998 67776532 3689999999999988777622211 1 No 91 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=100.00 E-value=1.8e-49 Score=287.85 Aligned_cols=313 Identities=14% Similarity=0.070 Sum_probs=225.8 Q ss_pred CchHHHHHhhccccccc-cccccccc-hhhhhhccccc-------------------cCcccccH----HHHhhhHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQA-NQIVQNPI-PQPLDMAGVKL-------------------EQATFSRE----HILESNEYIFS 55 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~-------------------~~~~~~~~----~~~~~~~~v~~ 55 (435) |+-.++..+.-...... +.....+. ...|. +|.+. -+.+++.. +.+..++...+ T Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~-fg~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~La~~~~~~~~h~s 104 (376) T protein:vir:10 26 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFT-FDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSS 104 (376) T ss_pred chhccCCCcccchhhhhHhhhccCcceeEEEE-cCCceeccCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHHhhh Confidence 65543221110000000 00000000 00000 01000 00011111 11122222233 Q ss_pred HHHHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCC Q lcl|NC_019456. 56 IVTRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDP 135 (435) Q Consensus 56 ~i~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~ 135 (435) ||...++.++. ..+||++||+.+|++ ++.+++++||+|++++++. .|.|++|+|++| T Consensus 105 ~l~~k~n~l~~---------------------~~~Pnp~lT~~~f~~-~v~d~ll~Gnay~~~~rn~-~G~~~~L~pl~~ 161 (376) T protein:vir:10 105 ALFFKANVLAS---------------------TFRPHRWLSRHAFER-WALDFLTFGNGYLERRRNM-VGGTLRLEPALA 161 (376) T ss_pred hHHHHhHHHHh---------------------ccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECC-CCCEEEEEEeCC Confidence 33322222211 247999999999974 5679999999999999875 589999999999 Q ss_pred ceeEEEEcCCCceEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceE Q lcl|NC_019456. 136 NTVSILRNTDNNSYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFV 213 (435) Q Consensus 136 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~ 213 (435) .+|++..+.++ ++++..++....|+++||||++.+++.++++|+|++.++...+....+++.+++++|+|| +.++ T Consensus 162 ~~vr~~~d~~~---~~~~~~~~~~~~~~~~eViHir~~~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa~pggI 238 (376) T protein:vir:10 162 KYVRRKADFNG---FVYVNGWQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFI 238 (376) T ss_pred cceEEEeeCCe---EEEEEcCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Confidence 99999887653 344556677788999999999999888999999999999999999999999999999998 5678 Q ss_pred EEeC-CcCCHHHHHHHHHHHHHH--hcCCCccccc-----cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_019456. 214 LQYD-RSISPEKRQAMVNDFLRM--VKENGGAVVQ-----EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFL 285 (435) Q Consensus 214 ~~~~-~~~~~e~~~~~~~~~~~~--~~~~~~~~vl-----~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~l 285 (435) +.++ ..+++|+.++++++|+.. ..|.++++|+ ++|++|++++.++.++||.+++++++++||++|||||.++ T Consensus 239 l~~~d~~l~~e~~~~lr~~~~~~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~ll 318 (376) T protein:vir:10 239 LYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLL 318 (376) T ss_pred EEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHh Confidence 8765 579999999999999753 3566778887 5789999999999999999999999999999999999999 Q ss_pred CCcccC--cccHHHHH-HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHH Q lcl|NC_019456. 286 NDDQAK--STTNVEHV-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAA 348 (435) Q Consensus 286 g~~~~~--~~~~~e~~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~ 348 (435) |....+ +++|.|++ ..|++++|.|++..|+ +++.+|.. .+++|+...|++.|.++ T Consensus 319 Gi~~~~t~~~sn~eq~~~~f~~~~L~Pl~~~ie-eln~~L~~-------~~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 319 GIVPSNSGGFGTPDTAARVFGRNEIRPLQARFA-ELNDWLGE-------EVVRFDDYEIPPAPVAA 376 (376) T ss_pred cccCCCCCCcccHHHHHHHHHHHHHHHHHHHHH-HHHhhccc-------cccccChhHhhcccccC Confidence 987754 46777655 5667899999999998 57877632 35899999999999887 No 92 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=100.00 E-value=4.7e-49 Score=285.55 Aligned_cols=322 Identities=13% Similarity=0.108 Sum_probs=225.4 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhC---ceeeee---- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASL---PLHEYQ---- 73 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~---~~~~~~---- 73 (435) |.= .+.+.-.... . .....|. +|.. .+..+...+++.|+.++.+..... |+.... T Consensus 1 ~~~--~~~~~~~~~~-----~--~~~~~~~-~~~~--------p~~~~~~~~~~~~~~~~~~~~~~~~epp~~~~~La~l 62 (348) T protein:vir:26 1 MTE--QLIHSHTTDG-----T--ESKSVYS-FDPN--------PEPVDTNSWMTRYCELFYNDFDDYWEPPISLKGLAEI 62 (348) T ss_pred CCc--cccchhhccc-----c--CCceEEE-ecCC--------CeeecCcchHHHHHHHHhcCCCccccCCCCHHHHHHH Confidence 331 1110000000 0 0001111 1111 111233334555555554433322 332110 Q ss_pred -cccccccchHHHhh-----hccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCc Q lcl|NC_019456. 74 -NYKQMDNEPLADLL-----KTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNN 147 (435) Q Consensus 74 -~~~~~~~~~l~~~l-----~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~ 147 (435) +........+.... ..+||++||+.+|++ ++.+++++||||++++++. .|+|++|+|+++.+|++..+.+ T Consensus 63 ~~~n~~h~~~i~~k~N~l~~~~~Pn~~~t~~~f~~-~~~d~ll~Gnay~~~~rn~-~G~~~~L~~l~~~~v~~~~d~~-- 138 (348) T protein:vir:26 63 ANANGYHGSLLKARANYVAGRFMNGGGLPMYKMNS-ACWDYFGLGMSAFVKIRSY-LKNVIALEPLPMVHMRKRKNGD-- 138 (348) T ss_pred HhhhhhhhhhHhhhhhHHhhcccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcC-CCcEEEEEEecCceeEeeecCc-- Confidence 00000000111111 237999999999966 5679999999999999874 5899999999999998876532 Q ss_pred eEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEe-CCcCCHHH Q lcl|NC_019456. 148 SYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQY-DRSISPEK 224 (435) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~-~~~~~~e~ 224 (435) +|++..++....|+++||+|++.+++.++++|+|++..+.+++....+++.+++++|+|| +.+++.. +..+++|+ T Consensus 139 --~~~~~~~g~~~~f~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~~~ls~e~ 216 (348) T protein:vir:26 139 --FVQLLRNNEQKVFKAKDVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYATDPNLSEAD 216 (348) T ss_pred --EEEEEecCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHH Confidence 345556778889999999999998888999999999999999999999999999999998 5677765 45799999 Q ss_pred HHHHHHHHHHHh--cCCCccccc-----cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccc--CcccH Q lcl|NC_019456. 225 RQAMVNDFLRMV--KENGGAVVQ-----EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQA--KSTTN 295 (435) Q Consensus 225 ~~~~~~~~~~~~--~~~~~~~vl-----~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~--~~~~~ 295 (435) .++++++|+... +|.++++|+ ++|+++++++.++.++||++++++++++||++|||||.++|.... +++++ T Consensus 217 ~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn 296 (348) T protein:vir:26 217 EKALKEKIASSKGIGNFRSMFVNIPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPD 296 (348) T ss_pred HHHHHHHHHHhcCcccccceeEEcCCCCccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCcccc Confidence 999999997643 456778888 789999999999999999999999999999999999999997654 45677 Q ss_pred HHHH-HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHH Q lcl|NC_019456. 296 VEHV-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQY 352 (435) Q Consensus 296 ~e~~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~ 352 (435) .|++ ..|++++|.|++..|+++||++|..+ .+.+++||++.....+. .+.+ T Consensus 297 ~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~----~~~~~~fdl~~~~e~~~--~~a~ 348 (348) T protein:vir:26 297 PLKVSQVYDFYEVIPVCKRFMDAVNNDPEIP----DNLKLKFNLNPGVESAN--GSAV 348 (348) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhCCC----CccEEEEecCcccccch--hhcC Confidence 6655 55678999999999999999987643 34578888774332221 1111 No 93 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=100.00 E-value=2.3e-48 Score=281.71 Aligned_cols=313 Identities=14% Similarity=0.070 Sum_probs=223.8 Q ss_pred CchHHHHHhhcccccccc-cccccc-chhhhhhccccc-------------------cCcccccH----HHHhhhHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQAN-QIVQNP-IPQPLDMAGVKL-------------------EQATFSRE----HILESNEYIFS 55 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~-------------------~~~~~~~~----~~~~~~~~v~~ 55 (435) |+=.+...+.-....... .....+ ....|.+ |.+. -+.++++. +.+..++...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~ 79 (351) T protein:vir:79 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTF-DDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSS 79 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEc-CCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHhhhh Confidence 554332221110000000 000000 0000100 0000 00011111 11111222222 Q ss_pred HHHHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCC Q lcl|NC_019456. 56 IVTRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDP 135 (435) Q Consensus 56 ~i~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~ 135 (435) ||...++.++ -..+||+.||..+|+ .++.+++++||||++++++. .|++++|+|++| T Consensus 80 ~l~~k~n~l~---------------------~~~~Pnp~~t~~~f~-~~v~d~ll~Gnay~~~~r~~-~G~~~~L~~l~~ 136 (351) T protein:vir:79 80 ALFFKANVLA---------------------STFRPHRWLSRHAFE-RWALDFLTFGNGYLERRRNM-VGGTLRLEPALA 136 (351) T ss_pred hhhhhhhHHh---------------------hcccCCCCCCHHHHH-HHHHHHHhcCCeEEEEEECC-CCCEEEEEEeCC Confidence 2221111111 124799999999996 57789999999999999874 589999999999 Q ss_pred ceeEEEEcCCCceEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceE Q lcl|NC_019456. 136 NTVSILRNTDNNSYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFV 213 (435) Q Consensus 136 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~ 213 (435) .+|++..+.++ |+++..++....|+++||||++.+++.++++|+|++..+..++....+++.+.+++|+|| +.++ T Consensus 137 ~~v~~~~~~~~---~~~~~~~g~~~~~~~~eIihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~i 213 (351) T protein:vir:79 137 KYVRRKADFSG---FVYVNGWQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFI 213 (351) T ss_pred cceeeeecCCe---EEEEecCceEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Confidence 99998776654 455667777889999999999999888999999999999999999999999999999998 5678 Q ss_pred EEeC-CcCCHHHHHHHHHHHHHH--hcCCCccccc-----cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh Q lcl|NC_019456. 214 LQYD-RSISPEKRQAMVNDFLRM--VKENGGAVVQ-----EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFL 285 (435) Q Consensus 214 ~~~~-~~~~~e~~~~~~~~~~~~--~~~~~~~~vl-----~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~l 285 (435) +..+ ..+++|+.++++++|+.. .+|.++++|+ ++|+++++++.++.++||.+++++++++||++|||||.++ T Consensus 214 l~~~~~~ls~e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~ll 293 (351) T protein:vir:79 214 LYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLL 293 (351) T ss_pred EEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHh Confidence 8764 579999999999999764 3566778877 5789999999999999999999999999999999999999 Q ss_pred CCcccC--cccHHHH-HHHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHH Q lcl|NC_019456. 286 NDDQAK--STTNVEH-VTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAA 348 (435) Q Consensus 286 g~~~~~--~~~~~e~-~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~ 348 (435) |....+ ++++.|+ ...|++++|.|++..|++ ++.+|. ..+++|+...+++.|.++ T Consensus 294 Gi~~~~t~~~~n~e~~~~~f~~~~l~Pl~~~ie~-ln~~lg-------~~~~~F~~~~llr~d~~a 351 (351) T protein:vir:79 294 GIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLG-------DEVVTFDDYEIPPAPVAA 351 (351) T ss_pred cccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcC-------cceeeeChhhhccccccC Confidence 987653 4667665 456678999999999975 776552 235899999999999877 No 94 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=100.00 E-value=2.7e-48 Score=281.36 Aligned_cols=323 Identities=14% Similarity=0.096 Sum_probs=229.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcc------c--c--cHHHHhhhHHHHHHHHHHHHHHhhCcee Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQAT------F--S--REHILESNEYIFSIVTRLSNVLASLPLH 70 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~--~--~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 70 (435) |+=.. +.-.....+....+. .+.|. +|-+..-.. + + ..+.+.+-|.-+ .-||+.+...+.+ T Consensus 1 m~~~~---~~~~~~~~~~~~~~~--~~~~~-~~~p~~~~~~~~~~~~~~~~~~~~~~~~pp~~~---~~la~l~~~~~~h 71 (346) T protein:vir:10 1 MKKQL---RKNLTQNDRLQPQAQ--TEIFS-FGDPIPVLDRADILNYLECSAMYEKWYNPPMSF---DGLAKSLRSSTHH 71 (346) T ss_pred CCccc---CCCCCcccccccccC--eEEEe-cCCcceecCchhHHHHHHHhhcCCceEecCCCH---HHHHHHHHhhhhc Confidence 54432 211111111111100 01111 111000000 0 0 000011111111 1233333333322 Q ss_pred eeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEE Q lcl|NC_019456. 71 EYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYW 150 (435) Q Consensus 71 ~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~ 150 (435) - ..-....|.+..+ +.+||++||+.+|++ ++.+++++||||++++++. .|++++|+|++|..|++..+.++. .+ T Consensus 72 ~--~~i~~k~n~l~~l-~~~Pn~~~t~~~f~~-~~~d~ll~Gnay~~i~r~~-~G~~~~L~pl~~~~v~~~~~~~~~-~~ 145 (346) T protein:vir:10 72 E--SAIITKANILLST-CEVDSRYLSRRDLSS-FVKDYLVFGNAYFEVVRNR-LGQVQRIESPLAKYVRKGLEAGQF-YY 145 (346) T ss_pred c--hhhhhhhhhHHHH-HhCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcC-CCcEEEEEEecCCceEEEEcCCeE-EE Confidence 1 1112234555554 458999999999986 6789999999999999875 589999999999999987776554 44 Q ss_pred EEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeC-CcCCHHHHHH Q lcl|NC_019456. 151 YRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYD-RSISPEKRQA 227 (435) Q Consensus 151 ~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~-~~~~~e~~~~ 227 (435) +....+|...+|+++||||+|.+++.++++|+|++..+...+....+++.+++++|.|| +.+++.++ ..+++|+.++ T Consensus 146 ~~~~~~g~~~~~~~~dIih~r~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~~~ 225 (346) T protein:vir:10 146 VPQRFDHQEHEFAKGSIYHLLEPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDVEN 225 (346) T ss_pred EEEccCCeEEEEecccEEEecCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHH Confidence 55566778889999999999998888999999999999999999999999999999998 56788764 5799999999 Q ss_pred HHHHHHHHh--cCCCcccccc-----CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccC--cccHHHH Q lcl|NC_019456. 228 MVNDFLRMV--KENGGAVVQE-----AGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAK--STTNVEH 298 (435) Q Consensus 228 ~~~~~~~~~--~~~~~~~vl~-----~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~--~~~~~e~ 298 (435) ++++|+... .|+++++|+. .|+++++++.++.++||.+++++++++||++|||||.++|....+ ++++.|+ T Consensus 226 i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~ 305 (346) T protein:vir:10 226 IRQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVAD 305 (346) T ss_pred HHHHHHHhcCccccCceeEecCCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHH Confidence 999997654 4567788874 478999999999999999999999999999999999999987654 4667665 Q ss_pred H-HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCH Q lcl|NC_019456. 299 V-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDT 346 (435) Q Consensus 299 ~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~ 346 (435) + ..|++++|.|+++.|++ ++.+|.. .+|+|+...|++.|. T Consensus 306 ~~~~f~~~~l~P~~~~iee-~n~~L~~-------e~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 306 AAEVFFITEIEPLQERLKE-FNQWLGQ-------EVIKFKPSKLLQRTQ 346 (346) T ss_pred HHHHHHHHHHHHHHHHHHH-HHhhccc-------ceeeechhhhcccCC Confidence 5 56779999999999985 6766632 368999999999887 No 95 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=100.00 E-value=3.9e-48 Score=280.51 Aligned_cols=314 Identities=15% Similarity=0.089 Sum_probs=223.2 Q ss_pred CchHHHHHhhccccccc--cc--------cccccchhh-------hhhccccc---cCcccccH----HHHhhhHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQA--NQ--------IVQNPIPQP-------LDMAGVKL---EQATFSRE----HILESNEYIFSI 56 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~--~~--------~~~~~~~~~-------~~~~~~~~---~~~~~~~~----~~~~~~~~v~~~ 56 (435) |+=.+...+.-...... .. ..+-.++++ ++...... -+.++++. +.+..++...+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~~ 80 (351) T protein:vir:78 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSSA 80 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhccCceecCCCCHHHHHHHHhhhHhhhhh Confidence 55433222111000000 00 000000000 00000000 00011111 111112222222 Q ss_pred HHHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCc Q lcl|NC_019456. 57 VTRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPN 136 (435) Q Consensus 57 i~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~ 136 (435) |...++.++. ..+||+.||+++|+ .++.+++++||+|++++++. .|+|++|+|+++. T Consensus 81 l~~k~n~l~~---------------------~~~Pn~~~t~~~f~-~~~~d~ll~Gnay~~~~rn~-~G~~~~L~pl~~~ 137 (351) T protein:vir:78 81 LFFKANVLAS---------------------TFRPHRWLSRHAFE-RWALDFLTFGNGYLERRRNM-VGGTLRLEPALAK 137 (351) T ss_pred hhhhhhHHhh---------------------cccCCCCCCHHHHH-HHHHHHHhcCCeEEEEEECC-CCCEEEEEEecCc Confidence 2221111111 24799999999996 56679999999999999875 5899999999999 Q ss_pred eeEEEEcCCCceEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEE Q lcl|NC_019456. 137 TVSILRNTDNNSYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVL 214 (435) Q Consensus 137 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~ 214 (435) +|++..+.++ |+++..++....|+++||||++.+++.++++|+|++..+...+....++..+++++|+|| +.+++ T Consensus 138 ~v~~~~~~~~---~~~~~~~~~~~~~~~~eVihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pggIl 214 (351) T protein:vir:78 138 YVRRKADFSG---FVYVNGWQERHEFAPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFIL 214 (351) T ss_pred ceEEeeeCCe---EEEEecCCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE Confidence 9998877654 344456677889999999999999888999999999999999999999999999999998 56788 Q ss_pred EeC-CcCCHHHHHHHHHHHHHH--hcCCCccccc-----cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhC Q lcl|NC_019456. 215 QYD-RSISPEKRQAMVNDFLRM--VKENGGAVVQ-----EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLN 286 (435) Q Consensus 215 ~~~-~~~~~e~~~~~~~~~~~~--~~~~~~~~vl-----~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg 286 (435) ..+ ..+++|+.++++++|+.. ..|+++++|+ ++|+++++++.++.++||.+++++++++||++|||||.++| T Consensus 215 ~~~~~~ls~e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llG 294 (351) T protein:vir:78 215 YMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLG 294 (351) T ss_pred EecCCCCCHHHHHHHHHHHHHhcCcccccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhc Confidence 764 579999999999999754 4566778887 57899999999999999999999999999999999999999 Q ss_pred CcccC--cccHHHHH-HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHH Q lcl|NC_019456. 287 DDQAK--STTNVEHV-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAA 348 (435) Q Consensus 287 ~~~~~--~~~~~e~~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~ 348 (435) ....+ ++++.|++ ..|++++|.|+++.|++ ++.+|. ..+|+|+...|++.|.++ T Consensus 295 i~~~~t~~~sn~e~~~~~f~~~~l~P~~~~iee-~n~~l~-------~~~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 295 IVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLG-------DEVVRFDDYEIPPAPVAA 351 (351) T ss_pred ccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcC-------ccceecChhhhccccccC Confidence 87654 46776654 56678999999999985 666653 236899999999999887 No 96 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=100.00 E-value=2.4e-48 Score=281.67 Aligned_cols=323 Identities=15% Similarity=0.132 Sum_probs=223.5 Q ss_pred CchHHHHHhhcccccccccc--ccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhh--Cceeeeeccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQI--VQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLAS--LPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~--~~~~~~~~~~ 76 (435) |+-.+.....-.....++.. ..-..+.+..-......+......+.+++.|.-+.++-.+.++-+. -+++..+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~a~~~h~s~i~~k~--- 77 (340) T protein:vir:98 1 MSKRKPRKAVAMTASAPQKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSFSGLAKSLRSAVHHSSPIYVKR--- 77 (340) T ss_pred CCCCCCCccccccccCccceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHHhccccchhhhhhh--- Confidence 66322111100000000000 0000000000000000011111112233333333333222211110 0111111 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecC Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSD 156 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~ 156 (435) +-+.. .-+||++||..+|+ +++.+++++||+|++++++. .|++++|+|+++..|++..+.+ .+|++..+ T Consensus 78 ----n~l~~--~~~Pn~~lt~~~f~-~~~~d~ll~Gnay~~~~rn~-~G~~~~L~pl~~~~vr~~~~~~---~~~~~~~~ 146 (340) T protein:vir:98 78 ----NVLAS--TYIPHPLLSRQDFS-RFALDYLVFGNAFLEQRHSV-TGQLIKLLTSPAKYTRRGVDDS---VFWFVENF 146 (340) T ss_pred ----hHHhh--ccCCCCCCCHHHHH-HHHHHHHhcCCeEEEEEECC-CCcEEEEEEeCCceEEEcccCc---EEEEEecC Confidence 11111 23899999999996 56789999999999999874 5899999999999988765443 45666778 Q ss_pred CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeC-CcCCHHHHHHHHHHHH Q lcl|NC_019456. 157 IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYD-RSISPEKRQAMVNDFL 233 (435) Q Consensus 157 ~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~-~~~~~e~~~~~~~~~~ 233 (435) +....|+++||+|++.+++.++++|+|++..+..++....+++.+++++|.|| +.+++.++ ..+++|+.++++++|+ T Consensus 147 ~~~~~~~~~eViHir~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~il~~~~~~ls~e~~~~lk~~~~ 226 (340) T protein:vir:98 147 TQPHEFAPDTVFHLLEPDINQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMYVTDPAQSATDVESLRDAMR 226 (340) T ss_pred CeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHH Confidence 88889999999999998888999999999999999999999999999999998 56788765 5799999999999997 Q ss_pred HH--hcCCCccccc-----cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccC--cccHHHHH-HHHH Q lcl|NC_019456. 234 RM--VKENGGAVVQ-----EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAK--STTNVEHV-THSW 303 (435) Q Consensus 234 ~~--~~~~~~~~vl-----~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~--~~~~~e~~-~~~~ 303 (435) .. ..|.++++|+ ++|+++++++.++.++||.+++++++++||++|||||.++|..+.+ ++++.|++ ..|+ T Consensus 227 ~~~G~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~ 306 (340) T protein:vir:98 227 NSKGLGNFKNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGDVEKVAKVFV 306 (340) T ss_pred HhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHHH Confidence 63 3456778887 5799999999999999999999999999999999999999987653 46676654 5566 Q ss_pred HHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccC Q lcl|NC_019456. 304 TMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGD 345 (435) Q Consensus 304 ~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d 345 (435) +++|.|++..|++ ++.+|..+ .++|+...+++.| T Consensus 307 ~~~l~Pl~~~iee-~n~~L~~e-------~~rF~~~~l~~~d 340 (340) T protein:vir:98 307 RNELSPLQDRFRE-VNDWLGME-------VIRFKEYTLDNPE 340 (340) T ss_pred HHHHHHHHHHHHH-HHhccccc-------ccccCccccccCC Confidence 8999999999985 88876432 3789988999988 No 97 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=100.00 E-value=4.8e-48 Score=279.99 Aligned_cols=314 Identities=11% Similarity=0.101 Sum_probs=224.5 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHh---hCceeeee--cc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLA---SLPLHEYQ--NY 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia---~~~~~~~~--~~ 75 (435) |+-.+. . ++.... ......|.+ |. .+..+...++..|+...-+.++ ..|+.... +- T Consensus 1 m~~~~~---~-----~~~~~~-~~~~~~~~~-~~---------p~~~~~~~~~~~~~~~~~~~~~~~~~pP~~~~~La~l 61 (337) T protein:vir:78 1 MTKRQQ---Q-----PAQAAA-SSPRPSVVF-SM---------PEAIDPTAWMTDYTGVFYNPYGEYYQPPIDRKGLAKV 61 (337) T ss_pred CCCccc---C-----cccccc-cCceeEEEe-cC---------cccccCcchhHhhhhhhhccCcceecCCCCHHHHHHH Confidence 443221 1 110000 000111111 11 1112233345555555443333 22433211 00 Q ss_pred cccccchHHHhhhccccccCCH----HHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEE Q lcl|NC_019456. 76 KQMDNEPLADLLKTSPNPNMTA----FEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWY 151 (435) Q Consensus 76 ~~~~~~~l~~~l~~~Pn~~~~~----~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~ 151 (435) -....+ ....|..+||..+++ .++++.++.+++++||||++++++. .|+|++|+|++|.+|++..+ +. ++ T Consensus 62 ~~~~~~-h~~~L~~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~-~G~~~~L~pl~~~~v~~~~d--~~-~~- 135 (337) T protein:vir:78 62 ARANAH-HGAILMARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNS-FGQVVGLHPLSSVYLRRRED--GC-FV- 135 (337) T ss_pred hhcchh-hhhHHHhhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECC-CCcEEEEEEeCCceeEeeeC--Ce-EE- Confidence 011111 134567799976665 4789999999999999999999875 58999999999999987754 33 22 Q ss_pred EEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeC-CcCCHHHHHHH Q lcl|NC_019456. 152 RVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYD-RSISPEKRQAM 228 (435) Q Consensus 152 ~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~-~~~~~e~~~~~ 228 (435) ++..++....|+++||||+|.+++.++++|+|++..+..++....+++.+++++|+|| +.+++..+ ..+++++.+++ T Consensus 136 ~~~~~~~~~~~~~~eIiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~~l~~e~~~~l 215 (337) T protein:vir:78 136 YLQQGKPNLIYRPDDVIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATDPNMDDDTEEEM 215 (337) T ss_pred EEEcCCceEEECCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHH Confidence 3344567788999999999998888999999999999999999999999999999998 56788765 47999999999 Q ss_pred HHHHHHHh--cCCCccccc-----cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccc---CcccHHHH Q lcl|NC_019456. 229 VNDFLRMV--KENGGAVVQ-----EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQA---KSTTNVEH 298 (435) Q Consensus 229 ~~~~~~~~--~~~~~~~vl-----~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~---~~~~~~e~ 298 (435) ++.|+... .|.++++++ ++|+++++++.++.++||++++++++++||++|||||.++|+... +++++.|+ T Consensus 216 k~~~~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~ 295 (337) T protein:vir:78 216 KEMIANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEK 295 (337) T ss_pred HHHHHHhcCcccccceEEEcCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHH Confidence 99997643 455677776 678999999999999999999999999999999999999997554 45666766 Q ss_pred HH-HHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhh Q lcl|NC_019456. 299 VT-HSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLL 342 (435) Q Consensus 299 ~~-~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~ 342 (435) +. .|++++|.|+++.|+++++++|++.... .+++++...++ T Consensus 296 ~~~~f~~~~L~P~~~~ie~~~n~~ll~~~~~---~~f~~~~~~~~ 337 (337) T protein:vir:78 296 YDATYARNEVLPLCELVQDAINSAGLPRALW---VTFRETIGAAV 337 (337) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcCChhhc---eeccccccccC Confidence 55 5679999999999999999988865432 23455555555 No 98 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=100.00 E-value=1.7e-47 Score=276.96 Aligned_cols=328 Identities=12% Similarity=0.050 Sum_probs=224.8 Q ss_pred CchHHHHHhhcccccc--ccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQ--ANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQM 78 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 78 (435) |.-....-+.-..... +....+-+++.+.+ ...+....+.....+++-|.-+.+ +|+.+-.-+.+ ..... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~--~~~y~~~~~~~~~~~~epp~~~~~---la~l~~~~~~h---~~~i~ 72 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLNEISASP--ALDYVGIGFDENYNCYLPPVNRHA---LAKLPHQNAQH---GGILH 72 (345) T ss_pred CCCCccccchhhcccCcceeEEeecCCccccc--chhhhhhhhcCCccccCCCCCHHH---HHHHhhccccc---cccee Confidence 4432211110000000 00001111111110 000000001111123333332222 22222111211 00000 Q ss_pred ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEE--EEEecC Q lcl|NC_019456. 79 DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYW--YRVTSD 156 (435) Q Consensus 79 ~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~--~~~~~~ 156 (435) ..++.+.. ..+||+.||+++|++ ++.+++++||||++++++. .|+|++|+|+++..|++..+.+....+ +....+ T Consensus 73 ~k~n~l~~-~~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~-~G~~~~L~pl~~~~vr~~~d~~~~~~~~~~~~~~~ 149 (345) T protein:vir:37 73 SRANMVSS-LYEGGKALSRMDMRA-LCLNLIQFGDVGLLKVRNG-FGQVVRLVPLSSLYLRVRKDGGYSYLMKKSLYDTA 149 (345) T ss_pred eechHHHh-hccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcC-CCcEEEEEEEcCceeEEEEeCCeeEEEEEeEecCC Confidence 01222222 348999999999975 6679999999999999875 489999999999999988776554332 233455 Q ss_pred CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeC-CcCCHHHHHHHHHHHH Q lcl|NC_019456. 157 IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYD-RSISPEKRQAMVNDFL 233 (435) Q Consensus 157 ~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~-~~~~~e~~~~~~~~~~ 233 (435) +...+|+++||||++.+++.++++|+|++..+..++....+++.+++++|.|| +.+++.++ ..+++|+.++++++|+ T Consensus 150 g~~~~~~~~dVihir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d~~l~~e~~~~lk~~~~ 229 (345) T protein:vir:37 150 QEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEIARKIS 229 (345) T ss_pred ceEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEecCCCCCHHHHHHHHHHHH Confidence 67788999999999998888999999999999999999999999999999998 56788764 6799999999999997 Q ss_pred HH--hcCCCccccc-----cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccc--CcccHHHHH-HHHH Q lcl|NC_019456. 234 RM--VKENGGAVVQ-----EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQA--KSTTNVEHV-THSW 303 (435) Q Consensus 234 ~~--~~~~~~~~vl-----~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~--~~~~~~e~~-~~~~ 303 (435) .. ..|.++++++ ++|+++++++.++.++||.+++++++++||++|||||.++|.... +++++.|++ ..|+ T Consensus 230 ~~~g~~n~~~~~i~~p~g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~~~~f~ 309 (345) T protein:vir:37 230 ESKGVGNFRSMFVNIANGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYH 309 (345) T ss_pred HhcCcccccceEEEcCCCcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHHHHHHH Confidence 64 3455667766 589999999999999999999999999999999999999998665 446676654 5667 Q ss_pred HHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhc Q lcl|NC_019456. 304 TMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLR 343 (435) Q Consensus 304 ~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~ 343 (435) +++|.|+++.|++++++.+ +...+.+++|+..+|.. T Consensus 310 ~~~l~P~~~~ie~~ln~~~----~~~~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 310 YDEVMPLQEIIAETINQDP----EIKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHHHHHHhhhhc----cCCCcceEEecchhhcC Confidence 9999999999999999743 23456789998888777 No 99 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=100.00 E-value=4.5e-48 Score=280.16 Aligned_cols=240 Identities=16% Similarity=0.249 Sum_probs=193.1 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhc-cccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMA-GVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMD 79 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 79 (435) ||||++.. +++............... +........++.+.|+++++|++||++||++||++|++++++++... T Consensus 1 MglF~~~~------~r~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~~ 74 (251) T protein:vir:46 1 MGIFYKNE------KRDLQYNEDDLQMMVQTLPSFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINY 74 (251) T ss_pred CCcccccc------ccccCCCccchhhhhhhhccccCcCcceechhhhhccHHHHHHHHHHHHhHhhCceEEeeCccccc Confidence 99986542 222222222222222222 22222334567788999999999999999999999999999999999 Q ss_pred cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEec---- Q lcl|NC_019456. 80 NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTS---- 155 (435) Q Consensus 80 ~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~---- 155 (435) +|+++++|+.+||++||+++||+.++.+++++||||++++++.. |+|++|+||+|++|++..+.++...++.... T Consensus 75 ~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~-G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~ 153 (251) T protein:vir:46 75 SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT-GEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNG 153 (251) T ss_pred cchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-CcEEEEEEECCceEEEEECCCCcEEEEEEEeccCC Confidence 99999999999999999999999999999999999999998755 8999999999999999999888766554432 Q ss_pred CCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCC-HHHHHHHHHHH Q lcl|NC_019456. 156 DIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSIS-PEKRQAMVNDF 232 (435) Q Consensus 156 ~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~-~e~~~~~~~~~ 232 (435) ++....|+++||||||++ +.++++|+||+..+..+|..+.++++++.++|+|| +.+++++++.++ +++.++++++| T Consensus 154 ~g~~~~~~~~diiH~r~~-~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~~e~~~~~~~~~ 232 (251) T protein:vir:46 154 NNIERNVKFEDMLDIKFY-SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEF 232 (251) T ss_pred cceeEEECCccEEEecCc-CCCCeeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCCHHHHHHHHHHH Confidence 355678999999999987 57899999999999999999999999999999997 679999998875 55678899999 Q ss_pred HHHhc---CCCccccccCCcee Q lcl|NC_019456. 233 LRMVK---ENGGAVVQEAGWKV 251 (435) Q Consensus 233 ~~~~~---~~~~~~vl~~g~~~ 251 (435) ..... |+|++++ |++= T Consensus 233 ~~~~~g~~n~g~~~~---gm~~ 251 (251) T protein:vir:46 233 PKVLVELNKLGKLSY---SMNQ 251 (251) T ss_pred HHHhcCccccccccc---ccCC Confidence 76654 4566554 3221 No 100 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=100.00 E-value=6.3e-48 Score=279.35 Aligned_cols=320 Identities=15% Similarity=0.140 Sum_probs=221.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccc---------cCcccccHHHHhhhHHHHHHHHHHHHHH--hhCce Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKL---------EQATFSREHILESNEYIFSIVTRLSNVL--ASLPL 69 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~v~~~i~~ia~~i--a~~~~ 69 (435) |+=. .++........+.. .......|.+ |.+. .+......+.+++-|.-+.++-.+.+.- ..-++ T Consensus 1 m~~~--~~~~~~~~~~~~~~-~~~~~~~~~f-~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~~~i 76 (344) T protein:vir:60 1 MSKK--KGKTLQPAAKKMTA-SAPKMEAFTF-GEPVPVLDRRDILDYVECISNGRWYEPPISFTGLAKSLRAAVHHSSPI 76 (344) T ss_pred CCcc--cCCCCCchHHhhcC-CcCcEEEEEc-CCceeecCCcchhHHHHhhhcCccccCCCCHHHHHHHHHhhhhhccch Confidence 4332 11111111000000 0000011111 1100 0000000011111111122211111111 11122 Q ss_pred eeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceE Q lcl|NC_019456. 70 HEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSY 149 (435) Q Consensus 70 ~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~ 149 (435) +.++ +-+.. ..+||++||+.+| +.++.+++++||||++++++. .|+|++|+|+++.+|++..+.+ . T Consensus 77 ~~k~-------n~l~~--~~~Pn~~~t~~~f-~~~~~d~ll~Gnay~~i~rn~-~G~~~~L~~l~~~~vr~~~~~~---~ 142 (344) T protein:vir:60 77 YVKR-------NILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYST-TGKVIRLETSPAKYTRRGVEED---V 142 (344) T ss_pred hhhh-------hHHHh--hccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECC-CCcEEEEEEcCcceEEEeecCC---e Confidence 2211 11111 3489999999999 678899999999999999874 4899999999999999877654 3 Q ss_pred EEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeC-CcCCHHHHH Q lcl|NC_019456. 150 WYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYD-RSISPEKRQ 226 (435) Q Consensus 150 ~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~-~~~~~e~~~ 226 (435) ||++..++....|+++||||++.+++.++++|+|++..+..++.+..+++.+++++|.|| +.+++.++ ..+++|+.+ T Consensus 143 ~~~v~~~~~~~~~~~~eIiHir~~~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~~~ls~e~~~ 222 (344) T protein:vir:60 143 YWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIE 222 (344) T ss_pred EEEEccCCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHH Confidence 566777888889999999999998888999999999999999999999999999999998 56888765 579999999 Q ss_pred HHHHHHHHHh-cCCCccccc------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccC--cccHHH Q lcl|NC_019456. 227 AMVNDFLRMV-KENGGAVVQ------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAK--STTNVE 297 (435) Q Consensus 227 ~~~~~~~~~~-~~~~~~~vl------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~--~~~~~e 297 (435) +++++|+... .++++.+++ .+|+++++++.++.++||.+++++++++||++|||||.++|....+ +++|.| T Consensus 223 ~ik~~~~~~~g~~~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e 302 (344) T protein:vir:60 223 MLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIE 302 (344) T ss_pred HHHHHHHHhcCCCCCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHH Confidence 9999997643 456677776 4799999999999999999999999999999999999999987654 467766 Q ss_pred HH-HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCH Q lcl|NC_019456. 298 HV-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDT 346 (435) Q Consensus 298 ~~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~ 346 (435) ++ ..|++++|.|++..++ +++.+|.. ..++|+.-.+...|- T Consensus 303 ~~~~~f~~~~L~Pl~~~~e-~ln~~lg~-------~~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 303 KVAKVFVRNELIPLQDRIR-EINGWLGQ-------EVIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHhcCC-------cccccCccccCCCCC Confidence 55 4566899999999998 58887632 235677777777665 No 101 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=100.00 E-value=2.3e-47 Score=276.25 Aligned_cols=323 Identities=13% Similarity=0.085 Sum_probs=220.2 Q ss_pred CchHHHHHhhcccccc--ccccccccchhh---hhhccccccCcccccHHHHhhhHHHHHHHHHH--HHHHhhCceeeee Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQ--ANQIVQNPIPQP---LDMAGVKLEQATFSREHILESNEYIFSIVTRL--SNVLASLPLHEYQ 73 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~--~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~i--a~~ia~~~~~~~~ 73 (435) |+-...--..-+.... .....+.+.+.+ ++..+.. ......+++-|.-+.+.-.+ |+..-.-+++..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~-----~~~~~~~~epp~~~~~la~~~~~~~~h~~~i~~k~ 75 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLSEITASPALDYVGIG-----FDENYNCYLPPVNRHALAKLPHQNAQHGGILHSRA 75 (345) T ss_pred CCccccccchhhhcCCCceEEEeecCCcccchhhccccee-----eecCCccccCCCCHHHHHHHhhcchhhcchhhhhh Confidence 4332211110000000 001111111111 1111111 01111122222211111111 1111111111100 Q ss_pred cccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEE--E Q lcl|NC_019456. 74 NYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYW--Y 151 (435) Q Consensus 74 ~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~--~ 151 (435) +-+ ....+||++||+.+|+ .++.+++++||||++++++. .|++++|+|++|..|++..+.+....+ + T Consensus 76 -------n~l--~~~~~Pn~~~t~~~f~-~~v~d~ll~Gnay~~i~rn~-~G~~~~L~pl~~~~vr~~~d~~~~~~~~~~ 144 (345) T protein:vir:37 76 -------NMV--SATYEGGKALSKMEMR-ALCLNLIQFGDVGLLKVRNG-FGQVVRLVPLSSLYLRVHKDGGYSYLMKKS 144 (345) T ss_pred -------hHH--hhccCCCCCCCHHHHH-HHHHHHHhcCCeEEEEEECC-CCCEEEEEEecCceeEEeecCCeeEEEeee Confidence 111 1234899999999996 56689999999999999875 589999999999999987766543322 2 Q ss_pred EEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEe-CCcCCHHHHHHH Q lcl|NC_019456. 152 RVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQY-DRSISPEKRQAM 228 (435) Q Consensus 152 ~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~-~~~~~~e~~~~~ 228 (435) .+...+...+|+++||||++.+++.++++|+|++..+...+....+++.+++++|.|| +.+++.. +..+++|+.+++ T Consensus 145 ~~~~~g~~~~~~~~eViHir~~~~~~~~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t~~~l~~e~~~~l 224 (345) T protein:vir:37 145 LYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEI 224 (345) T ss_pred eeccCceEEEEccccEEEEcCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHH Confidence 3344566788999999999998888999999999999999999999999999999998 5678865 457999999999 Q ss_pred HHHHHHHh--cCCCccccc-----cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccc--CcccHHHHH Q lcl|NC_019456. 229 VNDFLRMV--KENGGAVVQ-----EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQA--KSTTNVEHV 299 (435) Q Consensus 229 ~~~~~~~~--~~~~~~~vl-----~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~--~~~~~~e~~ 299 (435) +++|+... .|.+.++++ ++|+++++++.++.++||.+++++++++||++|||||.++|..+. +++++.|++ T Consensus 225 k~~~~~~~g~~n~~~~~i~~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~liGi~~~~t~~~s~~e~~ 304 (345) T protein:vir:37 225 ARKISESKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKY 304 (345) T ss_pred HHHHHHhcCccccCceeEecCCCCccceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhccccCCCCCcccHHHH Confidence 99998654 233444444 568999999999999999999999999999999999999998765 346777654 Q ss_pred -HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhc Q lcl|NC_019456. 300 -THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLR 343 (435) Q Consensus 300 -~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~ 343 (435) ..|++++|.|++..|++++++.+ +...+.+++||...|.+ T Consensus 305 ~~~f~~~~l~P~~~~ie~~ln~~~----e~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 305 REVYHYDEVMPLQEIIAETINQDP----EIKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhh----ccCCcceEEECchhhcC Confidence 55678999999999999999732 23457889999999988 No 102 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=100.00 E-value=3.6e-47 Score=275.20 Aligned_cols=320 Identities=15% Similarity=0.140 Sum_probs=218.9 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccc---------cCcccccHHHHhhhHHHHHHHHHHHHHHh--hCce Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKL---------EQATFSREHILESNEYIFSIVTRLSNVLA--SLPL 69 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~v~~~i~~ia~~ia--~~~~ 69 (435) |+=.++-... ...............|.+ |.+. .+......+.+++=|.-+.++-.+.++-+ .-++ T Consensus 1 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~~-~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~s~i 76 (344) T protein:vir:56 1 MSKKKGKTPQ---PAAKTMTASAPKMEAFTF-GEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHSSPI 76 (344) T ss_pred CCCCCCCCCc---hhhHHhhcCCCceEEEEc-CCceeecCcchhhhHHHhhhcCccccCCCCHHHHHHHHhhhhhhCccc Confidence 4443221000 000000000000011110 1000 00000000111111211222211111111 1122 Q ss_pred eeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceE Q lcl|NC_019456. 70 HEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSY 149 (435) Q Consensus 70 ~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~ 149 (435) +.++ +-+.. .-+||++||+.+| +.++.+++++||||++++++. .|+|++|+|+++.+|++..+.+ . T Consensus 77 ~~k~-------n~l~~--~~~Pnp~~t~~~f-~~~~~d~ll~Gnay~~~~rn~-~G~~~~L~pl~~~~v~~~~~~~---~ 142 (344) T protein:vir:56 77 YVKR-------NILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYST-TGKVIRLETSPAKYTRRGVEED---V 142 (344) T ss_pred eehh-------hhHHh--hcCCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECC-CCcEEEEEEeCCceeEEeecCC---E Confidence 2211 11111 3489999999999 678899999999999999874 4899999999999999876654 3 Q ss_pred EEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeC-CcCCHHHHH Q lcl|NC_019456. 150 WYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYD-RSISPEKRQ 226 (435) Q Consensus 150 ~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~-~~~~~e~~~ 226 (435) +|++..+|....|+++||||++.+++.++++|+|++..+...+....+++.+.+++|.|| +.+++.++ ..+++|+.+ T Consensus 143 ~~~~~~~g~~~~~~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d~~ls~e~~~ 222 (344) T protein:vir:56 143 YWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIE 222 (344) T ss_pred EEEEecCCeEEEEcCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHH Confidence 456677788889999999999998888999999999999999999999999999999998 56788765 579999999 Q ss_pred HHHHHHHHHh-cCCCccccc------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccC--cccHHH Q lcl|NC_019456. 227 AMVNDFLRMV-KENGGAVVQ------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAK--STTNVE 297 (435) Q Consensus 227 ~~~~~~~~~~-~~~~~~~vl------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~--~~~~~e 297 (435) +++++|.... .+++++++| ++|+++++++.++.++||.|++++++++||++|||||.++|....+ ++++.| T Consensus 223 ~lk~~~~~~~g~~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~e 302 (344) T protein:vir:56 223 MLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIE 302 (344) T ss_pred HHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHH Confidence 9999997643 467788887 4799999999999999999999999999999999999999987654 466766 Q ss_pred HH-HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCH Q lcl|NC_019456. 298 HV-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDT 346 (435) Q Consensus 298 ~~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~ 346 (435) ++ ..|++++|.|++..+++ ++.+|..+ .++|+.-.+...|- T Consensus 303 q~~~~f~~~tL~Pl~~~ie~-~n~~l~~~-------~~~F~~y~l~~~~~ 344 (344) T protein:vir:56 303 KVAKVFVRNELIPLQDRIRE-INGWIGQE-------VIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHhhhccc-------cccCCCccccccCC Confidence 55 45668999999999985 77777532 34555445544443 No 103 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=100.00 E-value=6.3e-47 Score=273.88 Aligned_cols=320 Identities=14% Similarity=0.077 Sum_probs=217.4 Q ss_pred CchHHHHHhhcccccccccccc-----ccchhhhhhccccccCcccccH------------HHHhhhHHHHHHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQ-----NPIPQPLDMAGVKLEQATFSRE------------HILESNEYIFSIVTRLSNV 63 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~v~~~i~~ia~~ 63 (435) |+=.++.........+...... ......|. +| .+. +..+. +.+.+-|.-+.+ ||+. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~p~--~v~~~~~~~~y~~~~~~~~~~~pp~~~~~---la~~ 73 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFT-FG-DPM--PVLDGRGILDYLECWPNGRWYEPPLSMEG---LAKS 73 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccccceEEEE-eC-Cce--eecCcchhhHHHHHhhcCccccCCCCHHH---HHHH Confidence 5443322111100000000000 00001111 11 100 00000 001111111111 1222 Q ss_pred HhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEc Q lcl|NC_019456. 64 LASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRN 143 (435) Q Consensus 64 ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~ 143 (435) +..-+.+ ...-...++.+- ...+||++||+++|++ ++.+++++||||++++++. .|+|++|+|++|.+|++..+ T Consensus 74 ~~~~~~h---~~~l~~k~n~l~-~~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~~~rn~-~G~~~~L~~l~~~~vr~~~~ 147 (350) T protein:vir:11 74 VGSSVYL---QSGLKFKRNMLA-KTFIPHRLLSRATFEQ-FSLDWLTFGSAYLEQPRSR-LGTRMPLQAPLAKYMRRGTD 147 (350) T ss_pred Hhhhhhh---ccchhhhhhhhh-hcccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcC-CCCEEEEEEeCCceeEeeec Confidence 1111111 000000011111 1248999999999975 6789999999999999875 48999999999999988765 Q ss_pred CCCceEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeC-CcC Q lcl|NC_019456. 144 TDNNSYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYD-RSI 220 (435) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~-~~~ 220 (435) .+ .+|++..++...+|+++||||++.+++.++++|+|++..+...+....+++.+.+++|.|| +.+++.++ ..+ T Consensus 148 ~~---~~~~~~~~~~~~~~~~~eVihir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~~~l 224 (350) T protein:vir:11 148 LE---TFYQVRSWKDEHEFEKGSVIQLREADINQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSHAGFILYMTDAAQ 224 (350) T ss_pred CC---eEEEEeeCCeEEEECcccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCC Confidence 54 3456677788889999999999998888999999999999999999999999999999998 56888775 579 Q ss_pred CHHHHHHHHHHHHHH--hcCCCccccc-----cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccc--C Q lcl|NC_019456. 221 SPEKRQAMVNDFLRM--VKENGGAVVQ-----EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQA--K 291 (435) Q Consensus 221 ~~e~~~~~~~~~~~~--~~~~~~~~vl-----~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~--~ 291 (435) ++|+.+++++.|+.. ..|+++++|+ ++|+++++++.++.++||.+++++++++||++|||||.++|+... + T Consensus 225 s~e~~~~l~~~~~~~~G~~N~~~~~v~~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~ 304 (350) T protein:vir:11 225 NEEDIDALRTALKTAKGPGNFRNLFVYAPNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAG 304 (350) T ss_pred CHHHHHHHHHHHHHhcCccccCceeeecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCC Confidence 999999999999764 3566778877 468999999999999999999999999999999999999998766 4 Q ss_pred cccHHHHH-HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhh Q lcl|NC_019456. 292 STTNVEHV-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGL 341 (435) Q Consensus 292 ~~~~~e~~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l 341 (435) ++++.|++ ..|++++|.|++..|++ ++.+|..+. ..+.+|+..+| T Consensus 305 ~~sn~e~~~~~f~~~~L~P~~~~ie~-ln~~l~~~~----~~F~~~~~~~l 350 (350) T protein:vir:11 305 GFGSISDAAAVWASLELAPMQTRLQQ-VNEMIGEEV----VRFAQFDAPGL 350 (350) T ss_pred CcCCHHHHHHHHHHHHHHHHHHHHHH-HHhhcCccc----cccCcccccCC Confidence 46777655 56678999999999985 787775321 22346777777 No 104 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=100.00 E-value=4.8e-47 Score=274.53 Aligned_cols=320 Identities=16% Similarity=0.153 Sum_probs=220.2 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccc---------cCcccccHHHHhhhHHHHHHHHHH--HHHHhhCce Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKL---------EQATFSREHILESNEYIFSIVTRL--SNVLASLPL 69 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~v~~~i~~i--a~~ia~~~~ 69 (435) |+=.+..... ....+..........|.+ |.+. .+......+.+++-|.-+.++-.+ |+....-++ T Consensus 1 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~f-~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~~~i 76 (344) T protein:vir:20 1 MSKKKGKTPQ---PAAKTMTASGPKMEAFTF-GEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHSSPI 76 (344) T ss_pred CCcccCCCCc---chhhhhhccCCceEEEEc-CCceEecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhhhhCccc Confidence 5443211100 000000000000011111 1100 000000001111111111221111 111111122 Q ss_pred eeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceE Q lcl|NC_019456. 70 HEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSY 149 (435) Q Consensus 70 ~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~ 149 (435) +.+++ -+.. ..+||+.||+.+| +.++.+++++||||++++++. .|+|++|+|+++.+|++..+.+ . T Consensus 77 ~~k~n-------~l~~--~~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~i~rn~-~G~~~~L~pl~~~~vr~~~~~~---~ 142 (344) T protein:vir:20 77 YVKRN-------ILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYST-TGKVIRLETSPAKYTRRGVEED---V 142 (344) T ss_pred eehhh-------hHHH--hccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECC-CCcEEEEEEcCCceeEeeecCC---E Confidence 22111 1111 2489999999999 678899999999999999874 4899999999999999876654 3 Q ss_pred EEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEe-CCcCCHHHHH Q lcl|NC_019456. 150 WYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQY-DRSISPEKRQ 226 (435) Q Consensus 150 ~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~-~~~~~~e~~~ 226 (435) |+++..++....|+++||||++.+++.++++|+|++..+..++....+++.+++++|.|| +.+++.+ +..+++|+.+ T Consensus 143 ~~~~~~~~~~~~~~~~eIiHir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d~~l~~e~~~ 222 (344) T protein:vir:20 143 YWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIE 222 (344) T ss_pred EEEEccCCeEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHH Confidence 456677788899999999999998888999999999999999999999999999999998 5678876 4679999999 Q ss_pred HHHHHHHHHh-cCCCccccc------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccC--cccHHH Q lcl|NC_019456. 227 AMVNDFLRMV-KENGGAVVQ------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAK--STTNVE 297 (435) Q Consensus 227 ~~~~~~~~~~-~~~~~~~vl------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~--~~~~~e 297 (435) +++++|+... .++++.++| .+|++|++++.++.++||.|++++++++||++|||||.++|....+ ++++.| T Consensus 223 ~ik~~~~~~~g~~n~r~l~l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~e 302 (344) T protein:vir:20 223 MLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIE 302 (344) T ss_pred HHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHH Confidence 9999997643 456777776 4699999999999999999999999999999999999999987654 466766 Q ss_pred HH-HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCH Q lcl|NC_019456. 298 HV-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDT 346 (435) Q Consensus 298 ~~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~ 346 (435) ++ ..|++++|.|++..++ +++.+|.. ..++|+...+...|. T Consensus 303 ~~~~~f~~~~l~P~~~~~e-~in~~lg~-------~~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 303 KVAKVFVRNELIPLQDRIR-EINGWLGQ-------EVIRFKNYSLDTDND 344 (344) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHhcCC-------cccccCccccccCCC Confidence 54 5567899999999998 57776632 236777777777665 No 105 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=100.00 E-value=2.2e-38 Score=226.98 Aligned_cols=201 Identities=14% Similarity=0.138 Sum_probs=159.3 Q ss_pred eEEEEcCCCceEEEEE-----ecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--C Q lcl|NC_019456. 138 VSILRNTDNNSYWYRV-----TSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--D 210 (435) Q Consensus 138 v~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~ 210 (435) |++ ..+|. ++|.+ ...|...+|+++||+|+|.+++.++++|+||+..+..++....++++++.++|.|| + T Consensus 1 ~r~--~~dg~-~~y~~~~~~~~~~g~~~~~~~~eilH~r~~~~~~~~~Glspi~~a~~~i~~~~aa~~~~~~~f~Ng~~p 77 (219) T protein:vir:98 1 MRV--CKDGN-YKYLMKKSLYDTKSEIYEYNKNDVIFIKLYDPMQQVYGSPDYVGGITSALLNSDATIFRRRYYSNGAHM 77 (219) T ss_pred Cce--eecCe-EEEEEecceecCCceeEEeccccEEEecCCCCCCCcceecHHHHHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 222 22222 22222 22356788999999999998888999999999999999999999999999999998 6 Q ss_pred ceEEEeCC-cCCHHHHHHHHHHHHHHh--cCCCccccc-----cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCH Q lcl|NC_019456. 211 KFVLQYDR-SISPEKRQAMVNDFLRMV--KENGGAVVQ-----EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPI 282 (435) Q Consensus 211 ~~~~~~~~-~~~~e~~~~~~~~~~~~~--~~~~~~~vl-----~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~ 282 (435) ++++.+++ .+++++.+++++.|+... .|+++++++ ++|++|+++++++.|+||+|++++++++||++||||| T Consensus 78 ~gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fgVPp 157 (219) T protein:vir:98 78 GFILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEFANIKNISAQDVLTSHRFPP 157 (219) T ss_pred ceEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCH Confidence 68887654 799999999999997643 344555555 5689999999999999999999999999999999999 Q ss_pred HHhCCccc--CcccHHHHH-HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccC Q lcl|NC_019456. 283 SFLNDDQA--KSTTNVEHV-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGD 345 (435) Q Consensus 283 ~~lg~~~~--~~~~~~e~~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d 345 (435) .+||..+. ++++|.|++ ..|+++||.||+..||++||++++.+ .+.+++|+.+...-.+ T Consensus 158 ~~lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~~~~----~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 158 GLSGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDYEIK----SALKVNFKQPEKRDKN 219 (219) T ss_pred HHcccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCC----CccEEeecCcccccCC Confidence 99997653 457777665 56678999999999999999986543 3456777755544333 No 106 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.96 E-value=7.2e-30 Score=180.36 Aligned_cols=393 Identities=14% Similarity=0.121 Sum_probs=237.8 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) |.+.+.+.++...-..... ..... .+.........-...|.+++.++++|+.+|+++.+.++.+..++.+... T Consensus 1 ~~~~D~~~~~~~~~g~~~~---~~~~~----~~~~~~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d~~~~~ 73 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSKQE---QTYYS----PSLSLTDDLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYSNDLNSKQ 73 (437) T ss_pred CchhhhhHhHHhcCCCccc---cceee----cCccccccHHHHHHHHHhCchhhHHhhcchHHhhcCCceEecCCCCHHH Confidence 9999998886421111111 00111 1111111111123457889999999999999999999998654322211 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC--------CCcEEEEEEeCCceeEEEEc--------C Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS--------TGEPIALWPLDPNTVSILRN--------T 144 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~--------~g~~~~l~~l~~~~v~~~~~--------~ 144 (435) -..+.-...+ ....+-+...+.+.-++|.|++++..++. .|.+..+.+++++.+++... . T Consensus 74 ~~~~~~~~~~----l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~~ 149 (437) T protein:vir:52 74 LDLFTKFERS----LKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSPN 149 (437) T ss_pred HHHHHHHHHh----hcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhccccccccccccccc Confidence 1111111111 22456666667777799999999988764 47889999999988874221 1 Q ss_pred CCceEEEEEecCCeeEEEchhheEEeccC---CCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCC--- Q lcl|NC_019456. 145 DNNSYWYRVTSDIYNFTIPINDVIHVKHV---VPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDR--- 218 (435) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~iih~~~~---~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~--- 218 (435) .|.+.+|.+..++....|.++.|+||... .+.+.+.|.|.++.+...|.....+.......+......++++++ T Consensus 150 fg~p~~y~v~~~~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~ 229 (437) T protein:vir:52 150 FGRYSEYSILGGSQSITVHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSD 229 (437) T ss_pred cCcceEEEEecCCcceeEccceeEEecCccCCCccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCCceecchHHH Confidence 36667788877766778999999999742 345678899999999999998887777766665443333344432 Q ss_pred cCC---HHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccH Q lcl|NC_019456. 219 SIS---PEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTN 295 (435) Q Consensus 219 ~~~---~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~ 295 (435) .++ ++...+..+.+ ...++.+++++++.+.+|+.++.+..+ +.++.....++||++++||..+|.+...++.++ T Consensus 230 ~l~~~~~~~~~~~~~~~-~~~~~~~~~~~~d~~~~~e~~~~~~sg--l~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glas 306 (437) T protein:vir:52 230 KIAAGMENEVASVISAV-QEIKSATNSLLLDAENEYDRKELTFTG--LKDLLTEFRNAVAGAADMPVTILFGQSVSGLAS 306 (437) T ss_pred HhcCCcHHHHHHHHHHH-HHhcCCCceEEEcCCcceEEEecCcCC--HHHHHHHHHHHHHHHhcCchhhhcCcCcccccc Confidence 232 23333333333 344667889999999999999888754 567888889999999999999997777776665 Q ss_pred HH-HHHHHHH-------HHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHH-------HHHHHHHHHHhcC Q lcl|NC_019456. 296 VE-HVTHSWT-------MTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTA-------ARTQYYQTLTRNG 360 (435) Q Consensus 296 ~e-~~~~~~~-------~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~-------~~~~~~~~~~~~g 360 (435) .+ +...||. ..+.|+++.+.+.+-+..+.. ...++.++| ++|...|.+ ++++++.+++++| T Consensus 307 ge~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~--~~~~~~~~f--~pL~~~s~kekae~~~~~a~a~~~~~~~g 382 (437) T protein:vir:52 307 GDEDIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGG--LPADWWFEF--VPLTTVKQEQQINMLNTFATAANTLIQNG 382 (437) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CCCcceEEe--CCcCCcCHHHHHHHHHHHHHHHHHHHhcC Confidence 44 4444442 246677777666666655542 223455555 466666644 4556688999999 Q ss_pred CcCHHHHHHHhC----CCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCC Q lcl|NC_019456. 361 IFKPNEIRELEG----QAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEP 432 (435) Q Consensus 361 ~~t~NE~R~~~g----~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (435) +++++|+|++|. ++.++++ |. +...+. .... . ...++.+.. ++...++.+. T Consensus 383 ~i~~~e~r~~L~~~g~~~~i~~~--~~----------~~~~~~----~~~~-~-~~~~~~~~~---~~~~~~~~~~ 437 (437) T protein:vir:52 383 VLNEYQIANELRESGLFANISAE--HI----------EELKNA----DEFA-G-NFEEPEKME---GAQVQNSEDQ 437 (437) T ss_pred CCCHHHHHHHHHhcCCCCCCCcc--cc----------ccccCC----CCCC-C-ccCCCCCCC---CCCCCCCCCC Confidence 999999999873 2223211 00 000000 0000 0 000000000 0001111111 No 107 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.94 E-value=2.1e-26 Score=161.32 Aligned_cols=421 Identities=11% Similarity=0.083 Sum_probs=228.8 Q ss_pred CchHHHHH------------hhcccccccccc-ccccchhhhhh---c------------cccccCcccccHHHHhhhHH Q lcl|NC_019456. 1 MSFMSKVR------------QFFGVHDQANQI-VQNPIPQPLDM---A------------GVKLEQATFSREHILESNEY 52 (435) Q Consensus 1 Mg~~~~~~------------~~~~~~~~~~~~-~~~~~~~~~~~---~------------~~~~~~~~~~~~~~~~~~~~ 52 (435) .+--.|.+ +..........+ ..+...+.+.+ . +.......+-....|..++. T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l 96 (532) T protein:vir:94 17 LQQAQRVDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEATSWPGFPTLALLAQLPE 96 (532) T ss_pred hhhHhhhhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCcccccccccccccccccchHHHHHHHHcCch Confidence 11000000 000000000000 00000000000 0 00111111111235678899 Q ss_pred HHHHHHHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeC---------- Q lcl|NC_019456. 53 IFSIVTRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSL---------- 122 (435) Q Consensus 53 v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~---------- 122 (435) ++.+|+.+|+++.+-.+++..++...........+....... ...+-+..++.+..++|.+++++..++ T Consensus 97 ~r~~Vd~~aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~~~l-~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p 175 (532) T protein:vir:94 97 YRTMHETPADECVRAWGKITCSSKDELAADKATRITQKLEQY-NVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAP 175 (532) T ss_pred hhhhhccchHHHhhCCceEeeCCccccchHHHHHHHHHHHhh-hHHHHHHHHHHhhhcccceEEEEEeccCCcccccccc Confidence 999999999999999999876544333333333333333322 234555666666679999988875432 Q ss_pred --------CCCcEEEEEEeCCceeEEEEc--------CCCceEEEEEecCCeeEEEchhheEEeccCCC------ccccc Q lcl|NC_019456. 123 --------STGEPIALWPLDPNTVSILRN--------TDNNSYWYRVTSDIYNFTIPINDVIHVKHVVP------SNSWY 180 (435) Q Consensus 123 --------~~g~~~~l~~l~~~~v~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~------~~~~~ 180 (435) ..|.+.+|.+++|..|++... ..|.+.+|++..+ +.|.++.|+||..... ...+. T Consensus 176 ~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v~~g---~~iH~SRli~f~g~~~p~~~~~~~~~~ 252 (532) T protein:vir:94 176 LLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFYKPDSWIATSG---KKIHSSRIHTVVGRPVGDMLKAAYSFR 252 (532) T ss_pred ccccccccccceeeEEEeechheecccccccccccccccCCceeEEEccC---eeeccceEEEecCCCchhhhccccccc Confidence 123457899999998876532 1344445555432 4688999999975432 23457 Q ss_pred cCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeC--CcCCHHHHHHHHHHHH--HHhcCCCccccccC-Cceeeecc Q lcl|NC_019456. 181 GVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYD--RSISPEKRQAMVNDFL--RMVKENGGAVVQEA-GWKVDRYE 255 (435) Q Consensus 181 G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~--~~~~~e~~~~~~~~~~--~~~~~~~~~~vl~~-g~~~~~~~ 255 (435) |.|.++.+...|............+.......+++++ ..++.+..+++.+++. ...++..++++++. +.+|++++ T Consensus 253 G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~a~~ls~~~~~~~~~r~~~~~~~~~n~g~~~id~~~e~~e~~~ 332 (532) T protein:vir:94 253 GVSISQLAMPYVDNWLRTRQSVSDTVKQFSMTNLATDMAQLLAPGGAQSLDARLQLFNLYRDNRNIGALDKGTEEIQQTN 332 (532) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcchhHHHHHHHHHHHHhhcCCccceEEcCCCceeEEEe Confidence 9999999999998887766666655433333333333 3455556677777765 33455667777775 57899988 Q ss_pred CChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCccc-HHH-HHHHHHH-------HHHhHHHHHHHHHHHHhhcccc Q lcl|NC_019456. 256 SKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTT-NVE-HVTHSWT-------MTLMPIIRQYESQFNMKLFTPG 326 (435) Q Consensus 256 ~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~-~~e-~~~~~~~-------~~i~P~~~~i~~~l~~~l~~~~ 326 (435) .+... +.++.....++||++.|||...|-+.+.++.+ +.+ ....||. ..+.|.++.+.+.|.+..+.. T Consensus 333 ~~lsg--l~~~l~~~~~~iAaa~~IP~t~LfG~sp~GlnstGe~D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~- 409 (532) T protein:vir:94 333 TPLSG--LDSLQAQSQEQMAAVSHIPLVKLLGITPNGLNASSDGEIRVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQ- 409 (532) T ss_pred cccCC--HHHHHHHHHHHHHhHhCCCeeeeecCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC- Confidence 77764 56788888999999999999977655555443 333 3344442 236777777777777655532 Q ss_pred cccCcceeeechhhhhccCHHHH-------HHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccc Q lcl|NC_019456. 327 KRVKGFYFSFNVNGLLRGDTAAR-------TQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDA 399 (435) Q Consensus 327 ~~~~g~~i~fd~~~l~~~d~~~~-------~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~ 399 (435) ...+++|+| ++|...|.+++ ++++.+++++|++++||+|++++..|. .+.+......+ .++..... T Consensus 410 -~~~d~~~~f--~pL~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~~~~--~~~~~~~~~~~--~~~~~~~~ 482 (532) T protein:vir:94 410 -IDPGLAWEW--SPLMELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAADPT--SGYAGALGERD--ELDDVEEI 482 (532) T ss_pred -CCCCceEEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcCCc--ccccccccccc--ccccccch Confidence 233555555 46766666554 455788999999999999999999876 34443322211 11111111 Q ss_pred ----cccccccccccccccccCCCCCCCCCCCCCCCCC--CC Q lcl|NC_019456. 400 ----ILDNKIQTDASVAAPKQEGGENTNENGLQSTEPE--GS 435 (435) Q Consensus 400 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 435 (435) ......+....+..++..++-.+++....+..+. |+ T Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 524 (532) T protein:vir:94 483 AKQLMAAALNPPATAPQTPNPQPDSEDDQTDNQPDAQADPAQ 524 (532) T ss_pred hhhhcccccCCCCCCCCCCCCCCCCCCCCCCCccCCCccccc Confidence 1111111111111111111111111111111111 11 No 108 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.93 E-value=4.4e-26 Score=159.60 Aligned_cols=417 Identities=9% Similarity=0.033 Sum_probs=229.2 Q ss_pred CchHHH---H---Hhhccccc------cc-c--ccccccchhhhhhc-------------cccc-----------cCccc Q lcl|NC_019456. 1 MSFMSK---V---RQFFGVHD------QA-N--QIVQNPIPQPLDMA-------------GVKL-----------EQATF 41 (435) Q Consensus 1 Mg~~~~---~---~~~~~~~~------~~-~--~~~~~~~~~~~~~~-------------~~~~-----------~~~~~ 41 (435) |++|.. . .....+.. .+ . ....+.....+... +... ....+ T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 104 (537) T protein:vir:10 25 VGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFIGH 104 (537) T ss_pred cCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhccccCCccH Confidence 888732 1 11110000 00 0 00000000001000 0000 00011 Q ss_pred ccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeee Q lcl|NC_019456. 42 SREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKS 121 (435) Q Consensus 42 ~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~ 121 (435) -....|.+++.++.+|+.+|+++.+-++++...+...........+.........+ +-+...+.+..++|.+++++.-. T Consensus 105 ~l~a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~~l~~~-~~l~~a~~~~rlyG~~~i~i~v~ 183 (537) T protein:vir:10 105 QMCALIATHWLVNKACSQMPRDAMRKGYKIISDDGNELDPKDAKFIDRYDRAFNIK-KHAIQFVRKGRIFGIRIALFKVD 183 (537) T ss_pred HHHHHHHhCchhhhhhhhhhHHhhcCCceeecCCcccccHHHHHHHHHHHHHhhHH-HHHHHHHHhcccccceEEEEeec Confidence 12244678899999999999999999998876544333333344444444444334 44455555556789998887643 Q ss_pred CC---------------CCcEEEEEEeCCceeEEEE------c----CCCceEEEEEecCCeeEEEchhheEEeccCC-- Q lcl|NC_019456. 122 LS---------------TGEPIALWPLDPNTVSILR------N----TDNNSYWYRVTSDIYNFTIPINDVIHVKHVV-- 174 (435) Q Consensus 122 ~~---------------~g~~~~l~~l~~~~v~~~~------~----~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~-- 174 (435) .. .|.+..|.+++|..+++.. + ..|.+.+|.+. .+.|.++.|+||.... T Consensus 184 ~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v~----g~~iH~SRli~f~g~~~p 259 (537) T protein:vir:10 184 SPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLIN----GKKYHRSHLAIYINDEVV 259 (537) T ss_pred CcCCcccccccccccccccceeEEEEechhhcccccchhhhccCCccccCCceeeeec----CeEecceeEEEecCCCCc Confidence 22 2345678899988777532 1 12444455543 2478899999997542 Q ss_pred ----CccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeC--CcC-CHHHHHHHHHHHHHHhcCCCccccccC Q lcl|NC_019456. 175 ----PSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYD--RSI-SPEKRQAMVNDFLRMVKENGGAVVQEA 247 (435) Q Consensus 175 ----~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~--~~~-~~e~~~~~~~~~~~~~~~~~~~~vl~~ 247 (435) +...+.|.|.++.++..|.............+......+++++ ..+ ++++..+..+.+... ++..++++++. T Consensus 260 ~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~~~~r~~~~~~~-r~n~g~~~id~ 338 (537) T protein:vir:10 260 DFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQFDETMSWWTAT-RDNYQVRVVDK 338 (537) T ss_pred hhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHHHHHHHHHHHhh-cCCcceeEecC Confidence 2345679999999999998877766666655544333333333 222 344444444455433 44556677765 Q ss_pred -CceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc-cHHH-HHHHHH------HHHHhHHHHHHHHHH Q lcl|NC_019456. 248 -GWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST-TNVE-HVTHSW------TMTLMPIIRQYESQF 318 (435) Q Consensus 248 -g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~-~~~e-~~~~~~------~~~i~P~~~~i~~~l 318 (435) +.+|++++.+... +.++.....+.||.+.|||...|-+...++. ++.+ ....|| +..|.|.+..+.+.+ T Consensus 339 e~e~~e~~~~~lsg--l~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd~I~~~Qe~l~p~l~~l~~ll 416 (537) T protein:vir:10 339 DNEDVVQIDTTLND--LDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHEECESTQDDMRPLIDRHHQLV 416 (537) T ss_pred CCceeEEEeccCCC--HHHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5899998877765 4578888889999999999997655443333 3233 334444 234788888888888 Q ss_pred HHhhcccccccCcceeeechhhhhccCHHHHHH-------HHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeeccccc Q lcl|NC_019456. 319 NMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQ-------YYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLY 391 (435) Q Consensus 319 ~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~-------~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~ 391 (435) .+..+.+. .+ +.|.++.|...|.+++++ ++.+++++|++++||+|+.|+.+|. .+-+.+ .+ .. T Consensus 417 ~~~~~~~~---~~--~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~--~g~~~l-~~--~~ 486 (537) T protein:vir:10 417 CRSHLRKR---IR--VKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPT--LGFTSI-TP--AM 486 (537) T ss_pred HHhcCCCC---cc--eEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCc--cccccc-cC--CC Confidence 77666532 23 455566888877777665 5899999999999999999998764 222222 11 11 Q ss_pred chhccccccccccccccccccccccCCCC-CCCCCCCCCCCCCCC Q lcl|NC_019456. 392 PLDKYYDAILDNKIQTDASVAAPKQEGGE-NTNENGLQSTEPEGS 435 (435) Q Consensus 392 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 435 (435) +.+...+...+...........++.+++. ..+..+++..++..+ T Consensus 487 ~~ed~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 531 (537) T protein:vir:10 487 RPTDAEDIDVDDEGKPVRIIEDQPAPSEMFGATSSGESANDPRDS 531 (537) T ss_pred ChhhhhcccCCccCCcCCCCCCCCCccccCCCCccccccCCCccC Confidence 22222221111111111111111111111 111112222222222 No 109 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.91 E-value=4e-25 Score=154.32 Aligned_cols=386 Identities=10% Similarity=0.035 Sum_probs=213.5 Q ss_pred CchHHHHHhhccccccccccccccchhhhh-hcccccc---Cc----ccccHHHHhhhHHHHHHHHHHHHHHhhCceeee Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLD-MAGVKLE---QA----TFSREHILESNEYIFSIVTRLSNVLASLPLHEY 72 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~---~~----~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~ 72 (435) ||+|-+-+. ++ ....+...+.+. ..|+... .. ...-...|.+++.++.+|+.+|+++-+..+.+. T Consensus 1 ~~~~m~~~~------~~-~~~~D~~~~~~~~~~g~~~~~~~~~~~~~~~~l~~~Y~~~~l~~~~Vd~~aed~~r~g~~i~ 73 (435) T protein:vir:79 1 MGVFMSDKV------KA-ITKEDGYNEIFGSKDGTFRPNAFYMQRAAFKALSQFYEEDGMARRIVDVIPEEMVTPGFKVD 73 (435) T ss_pred CCccccccc------cc-chhhcchhhhhcccccccccCcccCCcCCHHHHHHHHhcCchhhhhhccchHHhhcCCceec Confidence 999754321 11 111122222111 1111110 11 111124567889999999999999999999885 Q ss_pred ecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEee-eC--------CCCcEEEEEEeCCceeEEEEc Q lcl|NC_019456. 73 QNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQK-SL--------STGEPIALWPLDPNTVSILRN 143 (435) Q Consensus 73 ~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~-~~--------~~g~~~~l~~l~~~~v~~~~~ 143 (435) ...... .+...+. . ....+-+...+.+..++|.+++++.. ++ ..|.+..+.++++..+++... T Consensus 74 g~~~~~---~~~~~~~----~-l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~~ 145 (435) T protein:vir:79 74 GVKNEK---SFKSRWD----E-LRLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQITIHER 145 (435) T ss_pred CCChHH---HHHHHHH----H-hhHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhccchhh Confidence 432211 1111111 1 12345566667777889998887764 21 245677899999988865432 Q ss_pred -------CCCceEEEEEecCC--eeEEEchhheEEeccC------CCccccccCcHH-HHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019456. 144 -------TDNNSYWYRVTSDI--YNFTIPINDVIHVKHV------VPSNSWYGVSPI-DVLSSSLKFQRSVENFSQNEME 207 (435) Q Consensus 144 -------~~~~~~~~~~~~~~--~~~~~~~~~iih~~~~------~~~~~~~G~s~l-~~~~~~i~~~~~~~~~~~~~~~ 207 (435) ..|.+.+|.+.+.+ ..+.|.++.|+||... .+...++|.|++ +.+++.|.............+. T Consensus 146 ~~dp~sp~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~ 225 (435) T protein:vir:79 146 ETNARSVRYGEPKLYKISPGGDIPEFFVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLR 225 (435) T ss_pred ccCCcccccCcceEEEEecCCCCCceEEcceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23556667776543 3568999999999642 234667899987 5788888887776666666543 Q ss_pred cCCceEEEeCC---cCC-HHHHHHHHHHHH--HHhcC-CCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019456. 208 KKDKFVLQYDR---SIS-PEKRQAMVNDFL--RMVKE-NGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNV 280 (435) Q Consensus 208 n~~~~~~~~~~---~~~-~e~~~~~~~~~~--~~~~~-~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgv 280 (435) ....-++++++ .++ +.....+++++. ...++ .+.+++...+.+|+.++.+... +.++.....++||++.|| T Consensus 226 ~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~e~~e~~~~~lsg--l~~~~~~~~~~iaaa~~I 303 (435) T protein:vir:79 226 RKQQAVWKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDATDEEYEVLNSDVSG--VPEFLQEKIDRIVALTGI 303 (435) T ss_pred HhcCccccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEecCCcceEEEecccCC--HHHHHHHHHHHHHhhhCC Confidence 32222233321 121 122233333332 22333 4456565666789998887754 578888889999999999 Q ss_pred CHHHhCCcccCccc-HHHHHHHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHH-------HHHH Q lcl|NC_019456. 281 PISFLNDDQAKSTT-NVEHVTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAA-------RTQY 352 (435) Q Consensus 281 P~~~lg~~~~~~~~-~~e~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~-------~~~~ 352 (435) |...|.+...++.+ +.+.-...|.+.|.-..+.....+..+|+.-..+..+++++| ++|...|.++ ++++ T Consensus 304 P~t~L~G~s~~glnstgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~li~~s~d~~~~f--~pL~~~sekEkAei~~~~a~a 381 (435) T protein:vir:79 304 HEIIIKNKNTGGVSASQNTALETFYKLIDRKRVEDYKPILEFLLPFMISETEWSIEF--EPLSVPSDKDKAEIMAKNVES 381 (435) T ss_pred CeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCeEEe--CCCCCCCHHHHHHHHHHHHHH Confidence 99988666666543 344444445554444433322222222222222234566665 5677666644 4566 Q ss_pred HHHHHhcCCcCHHHHHHHh-CCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCC Q lcl|NC_019456. 353 YQTLTRNGIFKPNEIRELE-GQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENT 422 (435) Q Consensus 353 ~~~~~~~g~~t~NE~R~~~-g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 422 (435) +.+++++|+++++|+|+.+ ...+.-.-.++.. ..++ ..+.....+..++|+|. T Consensus 382 ~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~------~~~~-----------~~~d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 382 VVKLKAEQAINLKETRDTLRSICPDLKIMDNDN------IELP-----------EPEDLDPEPGQEGGLNK 435 (435) T ss_pred HHHHHhcCCCCHHHHHHHHHHhccccCCCCccc------ccCC-----------ccccCCCCCCCCCCCCC Confidence 7888999999999999987 2111100001000 0010 01111122233333333 No 110 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.91 E-value=3.5e-24 Score=149.16 Aligned_cols=415 Identities=12% Similarity=0.081 Sum_probs=217.6 Q ss_pred CchH----HHHHhh---cccccc-ccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeee Q lcl|NC_019456. 1 MSFM----SKVRQF---FGVHDQ-ANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEY 72 (435) Q Consensus 1 Mg~~----~~~~~~---~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~ 72 (435) |.+. +.+.+. +|.... +....... ..++.... ....+-....|.+++.++.+|+.+|+++.+-.+.+. T Consensus 93 ~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~---~~~~~~~~-~f~gyql~alY~~~~larkiVd~pAeDatR~g~~I~ 168 (862) T protein:vir:99 93 KAITGFAMDDGGGAPVPIGAEGKQSSYAVPEA---LQDWYLSQ-GFIGHQACALIAQHWLVDKACSLAGEDAIRNGWHLK 168 (862) T ss_pred hhhhhhhhhcchhhhhhccccccccccccchh---cccccccc-CcccHHHHHHHHhCchhhhhhhhhhHHHhhCCceEe Confidence 1111 111111 111111 00000000 00110000 001111124577899999999999999999999887 Q ss_pred eccc--ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC---------------CCcEEEEEEeCC Q lcl|NC_019456. 73 QNYK--QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS---------------TGEPIALWPLDP 135 (435) Q Consensus 73 ~~~~--~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~---------------~g~~~~l~~l~~ 135 (435) ..+. ... ......+...-... ...+-+...+.+.-++|.+++++..+.. .|.+.+|.+|+| T Consensus 169 ~~~d~~e~~-~e~~~~ie~~~~rL-~v~~~l~eair~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp 246 (862) T protein:vir:99 169 SLGEGEEID-EESLEKFKAIDVEF-KVKENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDP 246 (862) T ss_pred ecCcccccC-HHHHHHHHHHHHHh-hHHHHHHHHHHhcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEech Confidence 5321 111 11122222122111 2344455566666788888777653222 245678889998 Q ss_pred ceeEEEE------cC----CCceEEEEEecCCeeEEEchhheEEeccCC------CccccccCcHHHHHHHHHHHHHHHH Q lcl|NC_019456. 136 NTVSILR------NT----DNNSYWYRVTSDIYNFTIPINDVIHVKHVV------PSNSWYGVSPIDVLSSSLKFQRSVE 199 (435) Q Consensus 136 ~~v~~~~------~~----~~~~~~~~~~~~~~~~~~~~~~iih~~~~~------~~~~~~G~s~l~~~~~~i~~~~~~~ 199 (435) ..+.+.. +. .+.+..|.+.. ..|.++.|+||.... +...+.|.|.++.++..|.....+. T Consensus 247 ~w~~p~~v~~~~~Dp~sp~yGkP~~y~I~g----~~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~ 322 (862) T protein:vir:99 247 YWMMPMLTAESTADPSSQFFYEPEFWIISG----QKYHRSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTA 322 (862) T ss_pred hhhcccccccccccccccccCCceeeeecC----eeeccceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHH Confidence 8776522 21 13444454432 468889999987543 2334579999999999999888777 Q ss_pred HHHHHHhhcCCceEEEeCCc--CCHHHHHHHHHHH--HHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHH Q lcl|NC_019456. 200 NFSQNEMEKKDKFVLQYDRS--ISPEKRQAMVNDF--LRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIA 275 (435) Q Consensus 200 ~~~~~~~~n~~~~~~~~~~~--~~~e~~~~~~~~~--~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia 275 (435) .....++......+++++.. +..+ +++.+++ ...+++..++++++.+.+|+.++.+... +.++.....++|| T Consensus 323 ~saa~Ll~ka~l~v~ktd~l~~l~~e--d~l~~r~~~~~~~rdN~Gi~liD~eEe~e~ls~slSG--L~dll~~~~q~IA 398 (862) T protein:vir:99 323 NEAPLLAMNKRTTAIHTDTAKAIANE--DKFIQRLMFWVRYRDNHAVKVLGTDETMEQFDTSLAD--FDAVIMGQYQLVA 398 (862) T ss_pred HHHHHHHHHhccceeechhHhhhccH--HHHHHHHHHHHhccCcceeEEecCCCceeEEecccCC--hHHHHHHHHHHHH Confidence 77777765554444444332 2222 2333333 2344556678999999999999888764 4677778888999 Q ss_pred HHhCCCHHHhCCcccCc-ccHHH-HHHHHHH-------HHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCH Q lcl|NC_019456. 276 TAFNVPISFLNDDQAKS-TTNVE-HVTHSWT-------MTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDT 346 (435) Q Consensus 276 ~~fgvP~~~lg~~~~~~-~~~~e-~~~~~~~-------~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~ 346 (435) ++.+||...|.+.+.++ .++.+ ....||. ..|.|.+..+...+..++.. ..++.++| +.|...|. T Consensus 399 aas~IP~tiLfGqspaGlnATGE~D~~nYyD~I~s~QE~~L~P~LerL~~li~~~lg~----~~d~~ieF--npL~~~se 472 (862) T protein:vir:99 399 SIAKTPATKLLGTAPKGFNSTGEFETISYHEELESIQEHVYMPFLQRHYLISRLSLGI----QHEIDVVM--EPVASMTA 472 (862) T ss_pred hhhCCCceeecccCcccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----CCcceEEe--CCCCCCCH Confidence 99999999766555333 34434 4444442 24667777777666554432 23455555 58887777 Q ss_pred HHHHH-------HHHHHHhcCCcCHHHHHHHh------CCCCCCCcCCce--eeecccccchhccccccccc---ccccc Q lcl|NC_019456. 347 AARTQ-------YYQTLTRNGIFKPNEIRELE------GQAPIPDEAADH--LYISKDLYPLDKYYDAILDN---KIQTD 408 (435) Q Consensus 347 ~~~~~-------~~~~~~~~g~~t~NE~R~~~------g~~p~~~~~gd~--~~~~~n~~~l~~~~~~~~~~---~~~~~ 408 (435) +++++ ++.+++++|+++++|+|++| |++.++++..++ ...+.++..+....+..... ...+. T Consensus 473 kEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~~~ap~de~~ag 552 (862) T protein:vir:99 473 QQQADLNKTKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQKAGAAQETASAKETQAG 552 (862) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcccccccccCCcccccccccccccc Confidence 76664 47899999999999999976 555554332111 11122222221111110000 00000 Q ss_pred cc--------ccccccC---CCC-CCCCCCCCC---CCCCCC Q lcl|NC_019456. 409 AS--------VAAPKQE---GGE-NTNENGLQS---TEPEGS 435 (435) Q Consensus 409 ~~--------~~~~~~~---~~~-~~~~~~~~~---~~~~~~ 435 (435) .+ +..+..+ +|. ..++.+-+. .....+ T Consensus 553 a~~~~~e~d~~~~p~~~~~~~g~~~~~t~~~~a~~p~~~~~~ 594 (862) T protein:vir:99 553 AAVTTAEGDQPNVQMVPSMKPGQMVGPEVGITAPMPEDDAPV 594 (862) T ss_pred cCCccccCCcccccccCCCCCCCccccccccccCCCcccccc Confidence 00 0000001 111 111100000 000000 No 111 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.90 E-value=1.6e-23 Score=145.54 Aligned_cols=420 Identities=14% Similarity=0.131 Sum_probs=215.7 Q ss_pred CchHHHHHhhcccccccc---cc----------cc------ccchhhhhhccccccCc---------------ccccHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQAN---QI----------VQ------NPIPQPLDMAGVKLEQA---------------TFSREHI 46 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~---~~----------~~------~~~~~~~~~~~~~~~~~---------------~~~~~~~ 46 (435) |.=+.+++.+.....++. .. .. ...+.+..+.+...... .+-.... T Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~al 116 (765) T protein:vir:96 37 MIKLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQACAI 116 (765) T ss_pred chhHHHHhhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchhhHHHhhhcccCCccHHHHHH Confidence 333333333221111110 00 00 00001111111110000 0111234 Q ss_pred HhhhHHHHHHHHHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC--- Q lcl|NC_019456. 47 LESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS--- 123 (435) Q Consensus 47 ~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~--- 123 (435) |.+++.++.+|+.+|++.-+-.+.+...+.+... .....|...-.. ....+-+...+.+..++|.+|+++..+.. T Consensus 117 Y~~~~l~rkiVd~pAeDa~R~g~~I~~~~~e~~~-~~~~~l~~~~~r-l~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~ 194 (765) T protein:vir:96 117 ISQHWLVDKACSMSGEDAARNGWELKSDGRKLSD-EQSALIARRDME-FRVKDNLVELNRFKNVFGVRIALFVVESDDPD 194 (765) T ss_pred HHhCchhhhhhhcchHHhhcCCceeecCccccCH-HHHHHHHHHHHH-hhHHHHHHHHHHHhhhceeeEEEEEecccCcc Confidence 7788999999999999999988888665443322 222223222211 22456667777777899999987764322 Q ss_pred ------------CCcEEEEEEeCCceeEEEE------cC----CCceEEEEEecCCeeEEEchhheEEeccCC------C Q lcl|NC_019456. 124 ------------TGEPIALWPLDPNTVSILR------NT----DNNSYWYRVTSDIYNFTIPINDVIHVKHVV------P 175 (435) Q Consensus 124 ------------~g~~~~l~~l~~~~v~~~~------~~----~~~~~~~~~~~~~~~~~~~~~~iih~~~~~------~ 175 (435) .|.+..|.+++|..+.+.. +. .+....|.+.. +.|.++.||||.... + T Consensus 195 ~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~i~g----~~IH~SRli~~~g~~lpd~lk~ 270 (765) T protein:vir:96 195 YYEKPFNPDGIAPGSYKGISQIDPYWAMPQLTAESTADPSAEHFYEPDFWIISG----KKYHRSHLVVVRGPQPPDILKP 270 (765) T ss_pred hhhccccccccccceeeEEEEechhhcccccchhccccccccccCcceeeeecC----ceeccceEEEecCCCchhhhcc Confidence 2355678888887666532 11 23333444432 367889999996543 2 Q ss_pred ccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCC--cCC-HHHHHHHHHHHHHHhcCCCccccccCCceee Q lcl|NC_019456. 176 SNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDR--SIS-PEKRQAMVNDFLRMVKENGGAVVQEAGWKVD 252 (435) Q Consensus 176 ~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~--~~~-~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~ 252 (435) ...++|.|.++.++..|............++.....-++.++. .+. +++..+-.+.+. ..++..++++++.+.+|+ T Consensus 271 ~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l~~~~~l~~r~~~~~-~~r~n~g~~~id~ee~~e 349 (765) T protein:vir:96 271 TYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTSTIHVDVEKAIANEDAFNARLAFWI-ANRDNHGVKVIGIDETME 349 (765) T ss_pred ccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeeechHhhhccHHHHHHHHHHHH-HhcCCceeEEecCCccee Confidence 3456799999999999988877766666665443333333332 222 233332223333 345556788999999999 Q ss_pred eccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc-cHHH-HHHHHHH-------HHHhHHHHHHHHHHHHhhc Q lcl|NC_019456. 253 RYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST-TNVE-HVTHSWT-------MTLMPIIRQYESQFNMKLF 323 (435) Q Consensus 253 ~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~-~~~e-~~~~~~~-------~~i~P~~~~i~~~l~~~l~ 323 (435) .++.+... +.++.....++||++.+||...|-+...++. ++.| ....||. ..+.|.++.+-+.|-. T Consensus 350 ~~s~~lsg--l~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD~I~s~Qe~~l~p~le~L~~li~~--- 424 (765) T protein:vir:96 350 QFDTNLSD--FDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHEELESIQEHIFDPLLERHYLLLAK--- 424 (765) T ss_pred EEecccCC--HHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--- Confidence 99988764 5688888899999999999987766554433 3434 3333432 2244444444444332 Q ss_pred ccccccCcceeeechhhhhccCHHHHH-------HHHHHHHhcCCcCHHHHHHHhCCCC------CCCcCCc--eeeecc Q lcl|NC_019456. 324 TPGKRVKGFYFSFNVNGLLRGDTAART-------QYYQTLTRNGIFKPNEIRELEGQAP------IPDEAAD--HLYISK 388 (435) Q Consensus 324 ~~~~~~~g~~i~fd~~~l~~~d~~~~~-------~~~~~~~~~g~~t~NE~R~~~g~~p------~~~~~gd--~~~~~~ 388 (435) ......++.++| +.|...|.++++ +++.+++++|+++++|+|+.++.++ ++++..+ ...-+. T Consensus 425 -s~~i~~d~~i~F--npL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe 501 (765) T protein:vir:96 425 -SESIDVQLEIVW--NPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPE 501 (765) T ss_pred -hcCCCCcceEEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCccccccccCCCcc Confidence 223334545554 577777766655 5588999999999999999986543 3221111 111111 Q ss_pred cccchhcccccccccccccccccccc-ccCCCC-----------CCCC-----CCCCCCCCCCC Q lcl|NC_019456. 389 DLYPLDKYYDAILDNKIQTDASVAAP-KQEGGE-----------NTNE-----NGLQSTEPEGS 435 (435) Q Consensus 389 n~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~-----------~~~~-----~~~~~~~~~~~ 435 (435) +...++..........+........+ ..++.. +..+ .+...++...+ T Consensus 502 ~~~~~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~~~~~p~~~ 565 (765) T protein:vir:96 502 NLAELEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAPRGTKPLAKAAEEGAGEAATPPSRP 565 (765) T ss_pred ccccccCCCcccccccCccccccCCCCccCCCCcccccCCcccCCccccccccCccccCccccc Confidence 11111111111111111111110000 011100 0000 01111110011 No 112 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.90 E-value=7.3e-24 Score=147.41 Aligned_cols=381 Identities=11% Similarity=0.049 Sum_probs=217.9 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccc-cc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQ-MD 79 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~-~~ 79 (435) |--.+.+.+++.....+... .+.........-...|.+++.++.+|+.+|+++.+..|.+..+... .. T Consensus 1 ~~~~D~~~n~~~gg~~~~~~-----------~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~~~~~~~~ 69 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDGSEI-----------YGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDEPAF 69 (422) T ss_pred CccchhhHHHHcCCCCCccc-----------cCcccccCHHHHHHHHHhChhhHHHHhhhhHHHhcCCccccCCCHHHHH Confidence 77777777765321110000 0111111111223457789999999999999999999998654322 11 Q ss_pred cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeC---------CCCcEEEEEEeCCceeEEEEc------- Q lcl|NC_019456. 80 NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSL---------STGEPIALWPLDPNTVSILRN------- 143 (435) Q Consensus 80 ~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~---------~~g~~~~l~~l~~~~v~~~~~------- 143 (435) ... ...| ...+-+...+.+..++|.+++++..++ ..|.+..+.++++..+++... T Consensus 70 ~~~-~~~l--------~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~~dp~s~ 140 (422) T protein:vir:10 70 WSR-WDDL--------EMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTREENPRNA 140 (422) T ss_pred HHH-HHHh--------hHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcccCcccc Confidence 111 1122 245566677777789999998877422 246778899999998876431 Q ss_pred CCCceEEEEEecCC--eeEEEchhheEEeccCC------CccccccCcHHHH-HHHHHHHHHHHHHHHHHHhhcCCceEE Q lcl|NC_019456. 144 TDNNSYWYRVTSDI--YNFTIPINDVIHVKHVV------PSNSWYGVSPIDV-LSSSLKFQRSVENFSQNEMEKKDKFVL 214 (435) Q Consensus 144 ~~~~~~~~~~~~~~--~~~~~~~~~iih~~~~~------~~~~~~G~s~l~~-~~~~i~~~~~~~~~~~~~~~n~~~~~~ 214 (435) ..|.+.+|.+...+ ..+.|.++.|+||.... +...++|.|++.. +.+.|.....+.......+......++ T Consensus 141 ~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~v~ 220 (422) T protein:vir:10 141 RFGEPLTYRITTNESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQAVW 220 (422) T ss_pred ccCcceEEEEecCCCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 23566666666543 23678899999996432 3456789999986 678888777766666665543333333 Q ss_pred EeCC---cCC-HHHHHHHHHHHHH--Hhc-CCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCC Q lcl|NC_019456. 215 QYDR---SIS-PEKRQAMVNDFLR--MVK-ENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLND 287 (435) Q Consensus 215 ~~~~---~~~-~e~~~~~~~~~~~--~~~-~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~ 287 (435) ++++ .++ ......+++++.. ..+ +.+.+++.+.+.+|++++.+... +.++.....++||++.|||...|.+ T Consensus 221 ~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsg--l~~~~~~~~~~iaaa~~IP~t~L~G 298 (422) T protein:vir:10 221 KAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSGIHEIILKN 298 (422) T ss_pred cchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEecccCC--hHHHHHHHHHHHHhhhCCCeeeecc Confidence 3332 111 2223333444432 223 34445566667899999888764 5788888899999999999998876 Q ss_pred cccCccc-HHHHHHHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHH-------HHHHHHHHHhc Q lcl|NC_019456. 288 DQAKSTT-NVEHVTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAA-------RTQYYQTLTRN 359 (435) Q Consensus 288 ~~~~~~~-~~e~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~-------~~~~~~~~~~~ 359 (435) .+.++.+ +.+.-...|.+.|.-..+.....+..+|+....+..+++++|+ +|...|.++ +++++++++++ T Consensus 299 ~s~~Glnatgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~~s~~~~~~f~--pL~~~sekekaei~~~~a~a~~~~~~~ 376 (422) T protein:vir:10 299 KNVGGVSSSQNTALETFHKLVDRKRNAELLPILEFLIPFIVNAEEWSVEFN--PLAQESSKDKAEILEKNVNSIAALIAA 376 (422) T ss_pred CCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcEEEeC--CCCCCCHHHHHHHHHHHHHHHHHHHhc Confidence 6666553 3444344454444443333222222223222223346667765 666666554 45678899999 Q ss_pred CCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCC Q lcl|NC_019456. 360 GIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTE 431 (435) Q Consensus 360 g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (435) |+++++|+|+.|--... ...+..+..+.+. . +.+ ....|.. +++.+ T Consensus 377 g~i~~~e~r~~L~~~~~------~~~~~~~~~~~~~----~-----~~~-~~~~~~~----------~~~~d 422 (422) T protein:vir:10 377 GAMDIDEARDTLRTIAP------EVKINDGSVETEV----T-----ISE-TSNDPLE----------VPTDD 422 (422) T ss_pred CCCCHHHHHHHhhhhcc------cccCCCCCCcccc----c-----hhh-cCCCCCC----------CCCCC Confidence 99999999998842211 1111111111110 0 000 0000000 00001 No 113 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.90 E-value=6.2e-24 Score=147.80 Aligned_cols=381 Identities=9% Similarity=0.039 Sum_probs=218.0 Q ss_pred CchHH--HHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccc Q lcl|NC_019456. 1 MSFMS--KVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQM 78 (435) Q Consensus 1 Mg~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 78 (435) |..|. .+.++++.....+... .......+--...|.+++.++.+|+.+|+++.+..+.+....... T Consensus 1 ~~~~~~d~~~~~~~~~~~~~~~~------------~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~~~~ 68 (427) T protein:vir:10 1 MKIVKHDGYNDIFNGGADGSPKP------------FFMSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFKMSGVKDEK 68 (427) T ss_pred CCccccchHHHHhhcCCCCcccC------------ccccCchHHHHHHHHcCchhhhhhccchHHhhcCCccccCccHHH Confidence 88774 4556655432222110 001111112234678899999999999999999999986533221 Q ss_pred ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeC---------CCCcEEEEEEeCCceeEEEEc------ Q lcl|NC_019456. 79 DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSL---------STGEPIALWPLDPNTVSILRN------ 143 (435) Q Consensus 79 ~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~---------~~g~~~~l~~l~~~~v~~~~~------ 143 (435) .-......| ...+-+...+.+..++|.+++++..++ ..|.+..|.+++++.+++... T Consensus 69 ~~~~~~~~l--------~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~~dp~s 140 (427) T protein:vir:10 69 EFKSLWDSY--------KLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRVTNARS 140 (427) T ss_pred HHHHHHHHh--------hHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccccccCccc Confidence 111111122 244556777777789999998875322 357788999999988876532 Q ss_pred -CCCceEEEEEecCCe--eEEEchhheEEeccCC------CccccccCcHHH-HHHHHHHHHHHHHHHHHHHhhcCCceE Q lcl|NC_019456. 144 -TDNNSYWYRVTSDIY--NFTIPINDVIHVKHVV------PSNSWYGVSPID-VLSSSLKFQRSVENFSQNEMEKKDKFV 213 (435) Q Consensus 144 -~~~~~~~~~~~~~~~--~~~~~~~~iih~~~~~------~~~~~~G~s~l~-~~~~~i~~~~~~~~~~~~~~~n~~~~~ 213 (435) ..|.+.+|.+.+.+. .+.+.++.|+||.... +...++|.|++. .+...|.............+......+ T Consensus 141 ~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v 220 (427) T protein:vir:10 141 PRYGEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQAV 220 (427) T ss_pred cccCcceEEEEecCCCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 235666677765442 3689999999996442 345678999985 567777777666666555554333333 Q ss_pred EEeCCc---CC-HHHHHHHHHHHH---HHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhC Q lcl|NC_019456. 214 LQYDRS---IS-PEKRQAMVNDFL---RMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLN 286 (435) Q Consensus 214 ~~~~~~---~~-~e~~~~~~~~~~---~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg 286 (435) +++++- ++ .+....+++++. ....+.+.+++...+.+|++++.+... +.++.....++||++.+||...|. T Consensus 221 ~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsg--l~~~~~~~~~~iaaa~~IP~t~L~ 298 (427) T protein:vir:10 221 WKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNSDISG--VPEFLSSKMDRIVSLSGIHEIIIK 298 (427) T ss_pred ccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCCceeEEecccCC--hHHHHHHHHHHHHhhhCCCeeeec Confidence 333321 11 111222333332 223344556666677899999888764 567888889999999999999887 Q ss_pred CcccCccc-HHHHHHHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHH-------HHHHHHHHHh Q lcl|NC_019456. 287 DDQAKSTT-NVEHVTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAA-------RTQYYQTLTR 358 (435) Q Consensus 287 ~~~~~~~~-~~e~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~-------~~~~~~~~~~ 358 (435) +.+.++.+ +.+.-...|.+.|.-..+.....+-.+|+....+..+++++|+ +|...+.++ +++++.++++ T Consensus 299 G~sp~Glnstgd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~~s~~~~~~f~--pL~~~s~kEkaei~~~~a~a~~~~~~ 376 (427) T protein:vir:10 299 NKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVDEEEWSIEFE--PLSVPSKKEESEITKNNVESVTKAIT 376 (427) T ss_pred cCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcEEEeC--CCCCCCHHHHHHHHHHHHHHHHHHHh Confidence 66666553 3343344444444433322222222222222222346667765 666555444 4567889999 Q ss_pred cCCcCHHHHHHHhC----CCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCC Q lcl|NC_019456. 359 NGIFKPNEIRELEG----QAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNEN 425 (435) Q Consensus 359 ~g~~t~NE~R~~~g----~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (435) +|+++++|+|+.|- ...+. +.+....+.. ++....++..+.++.+++ T Consensus 377 ~gvi~~~e~r~~L~~~~~~~~~~---------~~~~~~~e~~-----------~~~~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 377 EQIIDLEEARDTLRSIAPEFKLK---------DGNNINIREP-----------EETTEPEPGLGEKLEDEN 427 (427) T ss_pred cCCCCHHHHHHHHHhhhccccCC---------CCcccccccc-----------chhcCCCCCCCCCCCCCC Confidence 99999999999872 22221 1111111110 011112222233333333 No 114 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.87 E-value=9.1e-23 Score=141.43 Aligned_cols=392 Identities=13% Similarity=0.121 Sum_probs=213.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhcccc-------------ccCcccc-cHHHHhhhHHHHHHHHHHHHHHhh Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVK-------------LEQATFS-REHILESNEYIFSIVTRLSNVLAS 66 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~-~~~~~~~~~~v~~~i~~ia~~ia~ 66 (435) |+=.+--.... ....+ ....++..-.|.. +....+. -...|..++.++.+|+.+|+.+-+ T Consensus 1 ~~~~~~a~~~~-~~~~a-----~~~~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~~r 74 (461) T protein:vir:80 1 MYSIDKAKQAK-IDSKI-----VNRNDFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDIISEDMVR 74 (461) T ss_pred Cccchhhhhhh-hhhhh-----hhhhHHHhhcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccchHHhhc Confidence 44333111000 00000 0000111111100 0001111 124567888899999999999999 Q ss_pred CceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCC------------CcEEEEEEeC Q lcl|NC_019456. 67 LPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLST------------GEPIALWPLD 134 (435) Q Consensus 67 ~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~------------g~~~~l~~l~ 134 (435) .++++..... .....+...-+.. ...+-+...+.+..++|.+++++...... +.+.++..|+ T Consensus 75 ~g~~i~~~~~-----~~~~~~~~~~~~l-~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~~~~~~l~ 148 (461) T protein:vir:80 75 AGWSLKTDNK-----EMKKNIESKWRKL-KTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTIKSIPYIN 148 (461) T ss_pred CCeeeecCCH-----HHHHHHHHHHHHh-hHHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCcccccccceeEEE Confidence 8887754322 1111222221111 23455666677778999999988642211 1222333333 Q ss_pred ---CceeEE---EEc----CCCceEEEEEecC-------------CeeEEEchhheEEeccCCCccccccCcHHHHHHHH Q lcl|NC_019456. 135 ---PNTVSI---LRN----TDNNSYWYRVTSD-------------IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSS 191 (435) Q Consensus 135 ---~~~v~~---~~~----~~~~~~~~~~~~~-------------~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~ 191 (435) +..+.. ..+ ..|.+.+|.+... ...+.+.++.|+||.+....+..+|.|.++.+... T Consensus 149 ~~~~~~i~~~~~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~~~~~~~G~S~le~~~~~ 228 (461) T protein:vir:80 149 TFNTQKVTQLYLNQDMFSEHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLRFEGETKGRSIFESLYDI 228 (461) T ss_pred eccccccchhhhcccCcCcccccceEEEEeccccccccccccccCccceEEccccEEEecCCCCCccccCcchHHHHHHH Confidence 332221 111 2356666666432 23467999999999987777888999999999999 Q ss_pred HHHHHHHHHHHHHHhhcCCceEEEeCC--cCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHH Q lcl|NC_019456. 192 LKFQRSVENFSQNEMEKKDKFVLQYDR--SISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQI 269 (435) Q Consensus 192 i~~~~~~~~~~~~~~~n~~~~~~~~~~--~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~ 269 (435) |.....+......+.......++++++ .+..+...++.+++... .+..++++++.+.+++.++.+..+ +.++... T Consensus 229 l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~~~~-~~~~g~~~~d~~e~~e~~~~~lsg--l~~~l~~ 305 (461) T protein:vir:80 229 ITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDKANLTAMLDFM-FRTEALAIIKGDEQLTKESTNVSG--MKDLLDY 305 (461) T ss_pred HHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHHHHHHHHHHHh-cCCceEEEEcCCcceEEEecCcCC--HHHHHHH Confidence 988877666666555443333445442 23334445555566543 345568889999999999887764 5688889 Q ss_pred HHHHHHHHhCCCHHHhCCcccCcccHHH-HHHHHHH-------HHHhHHHHHHHHHHHHhhcccccc--cCcceeeechh Q lcl|NC_019456. 270 SRIRIATAFNVPISFLNDDQAKSTTNVE-HVTHSWT-------MTLMPIIRQYESQFNMKLFTPGKR--VKGFYFSFNVN 339 (435) Q Consensus 270 ~~~~Ia~~fgvP~~~lg~~~~~~~~~~e-~~~~~~~-------~~i~P~~~~i~~~l~~~l~~~~~~--~~g~~i~fd~~ 339 (435) ....||++.+||...|.+...+..++.+ +...||. ..+.|+++.+.+.+-+..+..... .....+.|.++ T Consensus 306 ~~~~iaa~s~iP~t~L~G~s~g~~asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~ 385 (461) T protein:vir:80 306 GWDYLAGAVRMPKTVLKGQEAGTLTGAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFN 385 (461) T ss_pred HHHHHhhhhcCCeeeeecccCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeC Confidence 9999999999999987666665555544 4444431 235566666666555543321111 11123555566 Q ss_pred hhhccCHHHHH-------HHHHHHHhcCCcCHHHHHHHh-CCCCCCCcCCceeeecccccchhccccccccccccccccc Q lcl|NC_019456. 340 GLLRGDTAART-------QYYQTLTRNGIFKPNEIRELE-GQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASV 411 (435) Q Consensus 340 ~l~~~d~~~~~-------~~~~~~~~~g~~t~NE~R~~~-g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~ 411 (435) +|...|.++++ +++.+++++|+++++|+|+.+ +.-.++++ .++.-++. ...+. . T Consensus 386 ~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~--------~~~~~~~~-----~~~~~-~---- 447 (461) T protein:vir:80 386 PLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENS--------SKFSGDSA-----EIDKL-A---- 447 (461) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCC--------ccCCCCCc-----hhhhh-h---- Confidence 88777766665 458899999999999999866 22111111 01100000 00000 0 Q ss_pred cccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 412 AAPKQEGGENTNENGLQSTEPEG 434 (435) Q Consensus 412 ~~~~~~~~~~~~~~~~~~~~~~~ 434 (435) ....+...+ .+.+| T Consensus 448 -~~~~~~~~~--------e~~~g 461 (461) T protein:vir:80 448 -KLVYDAYAK--------KNADG 461 (461) T ss_pred -hhccccccc--------cCCCC Confidence 000000000 11111 No 115 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=99.85 E-value=2.3e-20 Score=128.26 Aligned_cols=408 Identities=12% Similarity=0.032 Sum_probs=232.8 Q ss_pred CchHHHHHhhccccccccccccccchhhh---hhc----cccccC--cccccHHHH-hhhHHHHHHHHHHHHHHhhCcee Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPL---DMA----GVKLEQ--ATFSREHIL-ESNEYIFSIVTRLSNVLASLPLH 70 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~----~~~~~~--~~~~~~~~~-~~~~~v~~~i~~ia~~ia~~~~~ 70 (435) |-=+ . .+..++............ .+. .+..-+ ......+.. .+-+.|.+|++.+...|.+++|. T Consensus 1 ~~~~--~----~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~~w~ 74 (469) T protein:vir:10 1 MTER--V----KTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPELQWPQSVAVYSRMDNEDSRVTSLLEAISLPIRSTPWR 74 (469) T ss_pred CCCc--c----cCCCCccchhhhhhcccccchhhccccccccccccccchHHHHHHHhhChHHHHHHHHHHHHHhcCCce Confidence 2111 0 000000000000000000 000 000000 000012223 35789999999999999999999 Q ss_pred eeeccccccc-chHHHhhhccc-------------cccCCHHHHHHHHHHHHHhcCCcceEEeeeCC----CCc--EEEE Q lcl|NC_019456. 71 EYQNYKQMDN-EPLADLLKTSP-------------NPNMTAFEFIARLETDRNVSGNGYAWIQKSLS----TGE--PIAL 130 (435) Q Consensus 71 ~~~~~~~~~~-~~l~~~l~~~P-------------n~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~----~g~--~~~l 130 (435) |...+..... ..+...| ..+ +...++.+++..++.+.+.+|.++.++++... .|. +..| T Consensus 75 v~p~~~~~e~~~~~~~~L-~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~~~~l 153 (469) T protein:vir:10 75 IRANGASDEVTEFVSRNL-MVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFWLRKL 153 (469) T ss_pred EecCCCCHHHHHHHHHHH-HhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCceeeeee Confidence 9764432111 1122222 121 12346788888888888899999999998643 232 5567 Q ss_pred EEeCCcee-EEEEcCCCceEEEEEe------------cCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHH Q lcl|NC_019456. 131 WPLDPNTV-SILRNTDNNSYWYRVT------------SDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRS 197 (435) Q Consensus 131 ~~l~~~~v-~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~ 197 (435) .+.++..+ ....+.++....+... .++....+|+...|++++....+.++|.|.+..++-....... T Consensus 154 ~~rp~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~~~~~fK~~ 233 (469) T protein:vir:10 154 APRPQWTISKFNVAPDGGLESIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNKRPGQWQGKSILRSAYKHWLLKDK 233 (469) T ss_pred eecCcccceeeeeccCCceeeeeecCcccccccccccCCCCccccccCcEEEEEecCCCCCcccchhHHHHHHHHHHHHH Confidence 77776654 2334444444433221 1223456888887877777667788999999999998888887 Q ss_pred HHHHHHHHhhc-C-CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHH Q lcl|NC_019456. 198 VENFSQNEMEK-K-DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIA 275 (435) Q Consensus 198 ~~~~~~~~~~n-~-~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia 275 (435) ..++-..|... | +-.+.+++...++++.+++.+.......+....++++.|++++-++.+.....+.++.++..++|+ T Consensus 234 ~~~~w~~f~EryG~P~~vgky~~~a~~~ek~~l~~a~~~~~~g~~a~~iip~~~~ie~~ea~g~~~~~~~li~~~d~~Is 313 (469) T protein:vir:10 234 LLRIEAATAERNGMGIPVGTASSATDEDEVRKMAALARSVRGGINAGVGLAQGQILELLGVSGNLPDIRRAIEGHDRSIA 313 (469) T ss_pred HHHHHHHHHHHcCCcceEEecCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEeecCCCchHHHHHHHHHHHHHH Confidence 77777777755 4 335677888888888888888777665444557778999999888877666778999999889998 Q ss_pred HHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhHHHHHHHHHHHHhhcccccccC----c--ceeeechhhhhccCHHHH Q lcl|NC_019456. 276 TAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMPIIRQYESQFNMKLFTPGKRVK----G--FYFSFNVNGLLRGDTAAR 349 (435) Q Consensus 276 ~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~----g--~~i~fd~~~l~~~d~~~~ 349 (435) .+.-- ..+-.....++++..+.+.....+.+.-.++.++..||+.|+.+.-... . .+++|+ ... .+.+.. T Consensus 314 k~iLG-~tlTs~~~gGS~a~~~vh~ev~~d~~~sDa~~i~~tln~~li~~l~~lN~g~~~~~P~~~~~--~~e-~~~~~~ 389 (469) T protein:vir:10 314 LSGLA-HFLNLDGKGGSYALASVLEDPFTQAVHAYATSICRIANQHIIEDLVDINFGVDTPAPVLTFD--PIG-SRQDLT 389 (469) T ss_pred HHHhc-ccccccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEec--CCC-CcHHHH Confidence 87521 1111112223444456666777788888899999999987766432221 2 245553 333 456778 Q ss_pred HHHHHHHHhcCCc-----CHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCC Q lcl|NC_019456. 350 TQYYQTLTRNGIF-----KPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 350 ~~~~~~~~~~g~~-----t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (435) ++.++++++.|++ +.+.+|+.+|+|+.. + ++....+.. |.. . + .+...+....+.+.+ + T Consensus 390 a~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~-~-~~~~~~~~~--~~~---~-------~-~~~~~~~~~~~~~~~-~ 453 (469) T protein:vir:10 390 AAAVKLLYDAGVFDDDPAVKRAIRQRFNLPSEL-N-DTPSAEPEE--PAA---V-------P-NQSAAPARTRSSGNA-D 453 (469) T ss_pred HHHHHHHHhcCCccCccccHHHHHHHhCCCCCC-C-Ccccccchh--ccc---C-------C-CCCccccccCCCCCc-c Confidence 9999999999984 567899999999653 2 233221110 000 0 0 000000000001111 1 Q ss_pred CCCCCCCCCCC Q lcl|NC_019456. 425 NGLQSTEPEGS 435 (435) Q Consensus 425 ~~~~~~~~~~~ 435 (435) ...++...+.. T Consensus 454 ~~~~~~~~~~~ 464 (469) T protein:vir:10 454 ARARAPKADQG 464 (469) T ss_pred cccccCCChHH Confidence 11111111111 No 116 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.84 E-value=2.1e-21 Score=133.94 Aligned_cols=422 Identities=11% Similarity=0.067 Sum_probs=223.5 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccc------cCcc---cc----------cHHHHhhhHHHHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKL------EQAT---FS----------REHILESNEYIFSIVTRLS 61 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~---~~----------~~~~~~~~~~v~~~i~~ia 61 (435) |+|++++..++.+.....-.........++..+... ...+ .+ ..+.+..+|.+..||+.+. T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 80 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLE 80 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 999999998876432211111000001111100000 0000 00 0234567899999998777 Q ss_pred HHHhhC-ceeeee--cccc-----cccc---hHHHhhhcc--ccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC----- Q lcl|NC_019456. 62 NVLASL-PLHEYQ--NYKQ-----MDNE---PLADLLKTS--PNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS----- 123 (435) Q Consensus 62 ~~ia~~-~~~~~~--~~~~-----~~~~---~l~~~l~~~--Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~----- 123 (435) +.+-.. -+.+.- .... .... .+...+..+ ....++.+++...++..++..|++|+.++++.. T Consensus 81 ~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~ 160 (502) T protein:vir:79 81 ERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTP 160 (502) T ss_pred HhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCC Confidence 666643 333321 1100 0011 122222222 334678999999999999999999999876432 Q ss_pred -CCcEEEEEEeCCceeE------------EEEcCCCceEEEEEec-------CCeeEEEchhheEEeccCCCccccccCc Q lcl|NC_019456. 124 -TGEPIALWPLDPNTVS------------ILRNTDNNSYWYRVTS-------DIYNFTIPINDVIHVKHVVPSNSWYGVS 183 (435) Q Consensus 124 -~g~~~~l~~l~~~~v~------------~~~~~~~~~~~~~~~~-------~~~~~~~~~~~iih~~~~~~~~~~~G~s 183 (435) .+.+..|..|+|+++. |..|..|.++.|.+.. ......+|+++|+|+..+...+...|+| T Consensus 161 g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~~rvpA~~vlH~f~~~r~gQ~RGis 240 (502) T protein:vir:79 161 SAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAERMLHLKFVRRLHQMRGTS 240 (502) T ss_pred CcccceEEEEecchhcCCCCCCCCeeEeeeEECCCCceEEEEEeecCCCCCcccceeEechhheEEeecccCCccccCCc Confidence 2346789999998775 3455666666665542 1234679999999999877778999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh--hcCCceEEEeCCcCC--HHHHHHHHHHHHHHhcCCCccc-cccCCceeeeccCCh Q lcl|NC_019456. 184 PIDVLSSSLKFQRSVENFSQNEM--EKKDKFVLQYDRSIS--PEKRQAMVNDFLRMVKENGGAV-VQEAGWKVDRYESKF 258 (435) Q Consensus 184 ~l~~~~~~i~~~~~~~~~~~~~~--~n~~~~~~~~~~~~~--~e~~~~~~~~~~~~~~~~~~~~-vl~~g~~~~~~~~~~ 258 (435) .+..+...+.-.....+...... .....++++.+..-. .+....-...-...+ ..|.++ .|..|.+++..+.+. T Consensus 241 ~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~l-~pG~i~~~L~pGe~i~~~~p~~ 319 (502) T protein:vir:79 241 LLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGNGSKENERELTI-QPGIIYDDLKPGEEIGMVKSDR 319 (502) T ss_pred hHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccccCCCCCccccccc-cCCccccccCCCceeeeeCCCC Confidence 99999888876655444333222 222334444322110 000000000000011 235444 589999999988775 Q ss_pred hhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHH-HHHHH------HH-HHhHHHHH-----HHHHHHHhhccc Q lcl|NC_019456. 259 EPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEH-VTHSW------TM-TLMPIIRQ-----YESQFNMKLFTP 325 (435) Q Consensus 259 ~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~-~~~~~------~~-~i~P~~~~-----i~~~l~~~l~~~ 325 (435) ....|.+..+...+.||+.+|||.+.|.+.-+.||++.-+ ...++ +. -+..+|.. +++++....++. T Consensus 320 p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~ 399 (502) T protein:vir:79 320 PNPNLETFRNGQLRAVAAGSRLSFSSTARNYNGTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRL 399 (502) T ss_pred CCCCHHHHHHHHHHHHHhhcCCCHHHHhccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC Confidence 5567899999999999999999999887665557665422 11111 11 12223332 233322222221 Q ss_pred -c--cccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccc- Q lcl|NC_019456. 326 -G--KRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAIL- 401 (435) Q Consensus 326 -~--~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~- 401 (435) . ....-..+.|-.-.....|+.+-++....++++|+.|.-|+-++.|.+|- +--+++-.-. .-+...+-... T Consensus 400 p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~--~v~~q~a~e~--~~~~~~Gl~~~~ 475 (502) T protein:vir:79 400 PRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGRNPD--DVKRRRKAEI--DENRKLDLVFDT 475 (502) T ss_pred CCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHH--HHHHHHHHHH--HHHHHcCCCCCC Confidence 1 01111234555556667899999999999999999999999999998863 2222221100 00000000000 Q ss_pred -cccccccccccccccCCCCCCCCCCC Q lcl|NC_019456. 402 -DNKIQTDASVAAPKQEGGENTNENGL 427 (435) Q Consensus 402 -~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (435) ....+...+...+..++.+.++++++ T Consensus 476 ~~~~~~~~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 476 DPASDKGGSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 00000000000111111111111111 No 117 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=99.83 E-value=2.6e-19 Score=122.52 Aligned_cols=407 Identities=12% Similarity=0.040 Sum_probs=223.9 Q ss_pred CchHHHHHhhccccccccccccccchh------h---hhhccccccC----------ccc-----ccHHHHhhhHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQ------P---LDMAGVKLEQ----------ATF-----SREHILESNEYIFSI 56 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~------~---~~~~~~~~~~----------~~~-----~~~~~~~~~~~v~~~ 56 (435) |+- |...+|.+-+.......-... . ....|..+.. +.. .-++...+.+.|.+| T Consensus 1 ~~~---~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~D~~i~s~ 77 (528) T protein:vir:10 1 MAA---IVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEERDAHLFAE 77 (528) T ss_pred CCe---eECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHH Confidence 432 332333222211111100000 0 0001111100 000 001111257889999 Q ss_pred HHHHHHHHhhCceeeeeccc-ccccchHHHhhhccccccCC-HHHHHHHHHHHHHhcCCcceEEeeeCCCCc--EEEEEE Q lcl|NC_019456. 57 VTRLSNVLASLPLHEYQNYK-QMDNEPLADLLKTSPNPNMT-AFEFIARLETDRNVSGNGYAWIQKSLSTGE--PIALWP 132 (435) Q Consensus 57 i~~ia~~ia~~~~~~~~~~~-~~~~~~l~~~l~~~Pn~~~~-~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~--~~~l~~ 132 (435) ++.+...|.+++|.|..... ...+......+...- .... +.+++.. +.+.+++|.++.++++...+|. |..+.+ T Consensus 78 l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l-~~~~~f~~~i~~-~lda~~~G~s~~Ei~w~~~~g~~~~~~~~~ 155 (528) T protein:vir:10 78 MSKRKRAVLGLDWTIEPPRNASAAEKADAEYLHELL-LDLEGIEDLMLD-CMDGVGHGYSAIELDWSLQGREWLPQAFDH 155 (528) T ss_pred HHHHHHHHhcCCceEecCCCCCHHHHHHHHHHHHHH-hCCccHHHHHHH-HHhhhhhcceeEEEEEeecCCceeEEEeee Confidence 99999999999999975322 222222222222111 1222 3344433 4556789999999998765553 557888 Q ss_pred eCCceeEEEEcCCCceEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhc-C-C Q lcl|NC_019456. 133 LDPNTVSILRNTDNNSYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEK-K-D 210 (435) Q Consensus 133 l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n-~-~ 210 (435) .++.++.+..+. ...+...........+++...++.++....+.++|.+.+..++-.........+.-..|... | | T Consensus 156 r~~~~f~~~~~~--~~~l~~~~~~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P 233 (528) T protein:vir:10 156 RPQSWFQLNPDD--QDELRLRDNSIAGEVLQPFGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLP 233 (528) T ss_pred ecccceeeccCC--CcEEeccCCCCCceeecCCCeEEEeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCC Confidence 888876654433 22222222223345677766666566656677899999999988888877777777777765 4 3 Q ss_pred ceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCC-hhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcc Q lcl|NC_019456. 211 KFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESK-FEPADLSSVEQISRIRIATAFNVPISFLNDDQ 289 (435) Q Consensus 211 ~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~-~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~ 289 (435) -.+.+++...++++.+++.+.+.....+++ +|++.|++++-+..+ .....|.++.++..++|+.+. ||..- T Consensus 234 ~~igky~~~a~~~ek~~L~~al~~i~~~~~--~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i------LGqtl 305 (528) T protein:vir:10 234 IRLGKYPPGTPDEEKVTLLRAVTGLGHAAA--GIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAI------LGGTL 305 (528) T ss_pred eEEEecCCCCCHHHHHHHHHHHHHHhhCcE--EEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHH------hhhhh Confidence 356778877889999999988887766654 455556555555432 222347888888888988875 44322 Q ss_pred c--------CcccHHHHHHHHHHHHHhHHHHHHHHHHHHhhcccccccC---------cceeeechhhhhccCHHHHHHH Q lcl|NC_019456. 290 A--------KSTTNVEHVTHSWTMTLMPIIRQYESQFNMKLFTPGKRVK---------GFYFSFNVNGLLRGDTAARTQY 352 (435) Q Consensus 290 ~--------~~~~~~e~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~---------g~~i~fd~~~l~~~d~~~~~~~ 352 (435) . ++++-.+-......+.+.-.++.++..||+.|+.+.-... ..+++|+. ....|.+++++. T Consensus 306 Ts~~~~g~~gS~Alg~vh~~v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~--~e~eDl~~~a~~ 383 (528) T protein:vir:10 306 TSQTSESGGGAYALGQVHNEVRHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDL--KDRADLAAMATS 383 (528) T ss_pred hccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecC--CCcccHHHHHHH Confidence 1 2233334455666777788888999999888754432221 12445544 458888999999 Q ss_pred HHHHHhcCC-cCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCC Q lcl|NC_019456. 353 YQTLTRNGI-FKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTE 431 (435) Q Consensus 353 ~~~~~~~g~-~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (435) +.+++..|+ ++..++|+.+|+|.. . .++.++.+....+..... ..+..... ......+. ...+++..+ T Consensus 384 ~~~L~~~G~~i~~~~i~e~~gip~p-~-~~e~~~~~~~~~~~~~~~------~~~~~~~~-~~~~~~~~--~~~~~~~~d 452 (528) T protein:vir:10 384 LPPLVKLGVQVPVNWVQEQLGIPLP-A-NGEAVLGDQAGAGIAQLS------RRPGPRIA-ALAQVIGP--RYRDQEALD 452 (528) T ss_pred HHHHHhCCCCCCHHHHHHHhCCCCC-C-CCcccccCCCcccccccC------cccccccc-cccccccc--cccccchHH Confidence 999999998 899999999999753 2 345544332211111000 00000000 00000000 000000000 Q ss_pred ----CCCC Q lcl|NC_019456. 432 ----PEGS 435 (435) Q Consensus 432 ----~~~~ 435 (435) .... T Consensus 453 ~~~~~~~~ 460 (528) T protein:vir:10 453 QVLASLPA 460 (528) T ss_pred HHHHHHHH Confidence 0000 No 118 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=99.79 E-value=3.2e-18 Score=116.48 Aligned_cols=396 Identities=11% Similarity=0.040 Sum_probs=223.9 Q ss_pred CchHHHHHhhccccccccccccccch------hhh---hhccccccC----------cc-c----ccHHHHhhhHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIP------QPL---DMAGVKLEQ----------AT-F----SREHILESNEYIFSI 56 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~------~~~---~~~~~~~~~----------~~-~----~~~~~~~~~~~v~~~ 56 (435) |+ +|....|.+-........-.. ..+ ...|..+.. +. . .-++...+.+.|.+| T Consensus 1 ~~---~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s~ 77 (526) T protein:vir:99 1 MA---QIVDVYGNPIRTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAE 77 (526) T ss_pred CC---eeECCCCCccccccccchhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHH Confidence 33 233222222111111000000 000 001111100 00 0 111111257899999 Q ss_pred HHHHHHHHhhCceeeeecccc-cccc----hHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc--EEE Q lcl|NC_019456. 57 VTRLSNVLASLPLHEYQNYKQ-MDNE----PLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE--PIA 129 (435) Q Consensus 57 i~~ia~~ia~~~~~~~~~~~~-~~~~----~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~--~~~ 129 (435) ++.+...|.+++|.|...... ..+. .+...+...| .+.+++..+. +.+.+|.++.++++...+|. |.. T Consensus 78 l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~----~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~~~~ 152 (526) T protein:vir:99 78 MSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLE----GLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLA 152 (526) T ss_pred HHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhccc----CHHHHHHHHH-HhhhhcceeEEEEEeecCCceeEEE Confidence 999999999999999753221 2122 2333333223 3556666555 57889999999998766554 557 Q ss_pred EEEeCCceeEEEEcCCCceEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhc- Q lcl|NC_019456. 130 LWPLDPNTVSILRNTDNNSYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEK- 208 (435) Q Consensus 130 l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n- 208 (435) +.+.++.++.+..+.... ............+++...+..++....+.++|.+.+..++-.........+.-..|... T Consensus 153 l~~r~~~~f~~~~~~~~~--l~~~~~~~~g~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~y 230 (526) T protein:vir:99 153 FHHRPQSWFQLNPEDQNE--LRLRDNSPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIY 230 (526) T ss_pred eeeecccceeeccCCCcE--EEecCCCCCceeecCCCeEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHc Confidence 888988877654443322 22223333445677665555555556688899999999988888777777777777755 Q ss_pred C-CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCC-hhhHHHHHHHHHHHHHHHHHhCCCHHHhC Q lcl|NC_019456. 209 K-DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESK-FEPADLSSVEQISRIRIATAFNVPISFLN 286 (435) Q Consensus 209 ~-~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~-~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg 286 (435) | |-.+.+++...++++.+++.+.+.....++ .+|++.|++++-+..+ .....|.++.++..++|+.+. || T Consensus 231 G~P~~igky~~~a~~~ek~~L~~av~~i~~d~--~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i------LG 302 (526) T protein:vir:99 231 GLPIRLGKYPPGTADEEKATLLRAVTGLGHAA--AGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV------LG 302 (526) T ss_pred CCceEEEecCCCCCHHHHHHHHHHHHHHhhCc--EEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH------hh Confidence 4 335667777778999999998887776654 5556666665555432 222347888888889998874 44 Q ss_pred Cccc--------CcccHHHHHHHHHHHHHhHHHHHHHHHHHHhhcccccccC---------cceeeechhhhhccCHHHH Q lcl|NC_019456. 287 DDQA--------KSTTNVEHVTHSWTMTLMPIIRQYESQFNMKLFTPGKRVK---------GFYFSFNVNGLLRGDTAAR 349 (435) Q Consensus 287 ~~~~--------~~~~~~e~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~---------g~~i~fd~~~l~~~d~~~~ 349 (435) ..-. ++++..+.......+-+.-.++.++..||+.|+.+.-... ..+++|+ .....|.+.+ T Consensus 303 qtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~--~~e~eDl~~~ 380 (526) T protein:vir:99 303 GTLTSTTSQSGGGAFALGQVHNEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFD--LREQADITSM 380 (526) T ss_pred hhhccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeC--CCCcccHHHH Confidence 3221 2233344455556677778888999999887765433222 1244554 4458889999 Q ss_pred HHHHHHHHhcCC-cCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCC Q lcl|NC_019456. 350 TQYYQTLTRNGI-FKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQ 428 (435) Q Consensus 350 ~~~~~~~~~~g~-~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (435) ++.+.++++.|+ ++..++|+.+|+|... .++..+.+..- ............ ...... ..+ T Consensus 381 a~~~~~L~~~G~~i~~~~i~e~~Gip~~~--~~e~~l~~~~~--------~~~~~~~~~~~~--~~~~~~-------~~~ 441 (526) T protein:vir:99 381 AQSIPALVNVGLEIPSAWVYDKLGIPQPA--KNEPVLRSAAQ--------PAILSRQHGQRV--AALATI-------VGP 441 (526) T ss_pred HHHHHHHHhCCCccCHHHHHHHhCCCCCC--CcccccCCCCC--------Cccccccccccc--cccccc-------ccc Confidence 999999999997 8999999999997632 23443322110 000000000000 000000 000 Q ss_pred CCCCCCC Q lcl|NC_019456. 429 STEPEGS 435 (435) Q Consensus 429 ~~~~~~~ 435 (435) ....... T Consensus 442 ~~~~~~~ 448 (526) T protein:vir:99 442 RYGDQQA 448 (526) T ss_pred cCcchhh Confidence 0000001 No 119 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=99.79 E-value=2e-18 Score=117.63 Aligned_cols=399 Identities=11% Similarity=0.008 Sum_probs=221.6 Q ss_pred CchHHHHHhhcccccccccccc--ccchh---hhhhccccccCc--------ccccHHHHhhhHHHHHHHHHHHHHHhhC Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQ--NPIPQ---PLDMAGVKLEQA--------TFSREHILESNEYIFSIVTRLSNVLASL 67 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~--~~~~~---~~~~~~~~~~~~--------~~~~~~~~~~~~~v~~~i~~ia~~ia~~ 67 (435) =+|+..-.+.+........... ..... .....+..+... .....+..++.+.|.+|++.+...|.++ T Consensus 3 ~~i~~~~g~~~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~p~~~~il~~~~~~~~~y~~m~~D~~i~s~l~~Rk~av~~~ 82 (491) T protein:vir:79 3 KGLWVSPTEFVKFGEPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVYRELRADAHVGGCVRRRKAAVKAL 82 (491) T ss_pred CeeeCCCCCcccccccchhHHHHHhhhccccccccccccCcchhHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHhCC Confidence 1121111111111000000000 00000 000111111110 1112233567899999999999999999 Q ss_pred ceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc--EEEEEEeCCceeEEEEcCC Q lcl|NC_019456. 68 PLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE--PIALWPLDPNTVSILRNTD 145 (435) Q Consensus 68 ~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~--~~~l~~l~~~~v~~~~~~~ 145 (435) +|.+...+......+...-++.++ .+.+++..+ .+.+++|.++.++++...+|. |..+.+.++.++.+. .+ T Consensus 83 ~w~i~~~~~~~~~a~~i~e~l~~~----~~~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d--~~ 155 (491) T protein:vir:79 83 EWGLDRGKAKSRVAKSIADVFADL----DLSRIATEM-LDAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVYD--PE 155 (491) T ss_pred CcEEecCCCCHHHHHHHHHHHhcC----CHHHHHHHH-HHhhhhcceeEEEEEeecCCeeeEEeeeeecccceeec--cC Confidence 999976443322223333333343 466666666 457889999999998766554 457889998876643 34 Q ss_pred CceEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhc-C-CceEEEeCCcCCHH Q lcl|NC_019456. 146 NNSYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEK-K-DKFVLQYDRSISPE 223 (435) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n-~-~~~~~~~~~~~~~e 223 (435) +...+...........+++...+++++....+.++|.|.+..++-.........+.-..|.+. | |-.+.+++...+++ T Consensus 156 ~~l~l~~~~~~~~g~~lp~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~a~~~ 235 (491) T protein:vir:79 156 NQLRFRSKEHWVQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDA 235 (491) T ss_pred CceEEeecCCCCCceeecCCCeEEEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHH Confidence 444433333333456788888888887766778999999999988888877777777777755 4 33577888888899 Q ss_pred HHHHHHHHHHHHhcCCCccccccCCceeeeccC--C-hhhHHHHHHHHHHHHHHHHHhCCCHHHhCCc----ccCcccHH Q lcl|NC_019456. 224 KRQAMVNDFLRMVKENGGAVVQEAGWKVDRYES--K-FEPADLSSVEQISRIRIATAFNVPISFLNDD----QAKSTTNV 296 (435) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~--~-~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~----~~~~~~~~ 296 (435) +.+++.+.+.+...+++ +|++.|++++-+.. . .....+.++.++..++|+.+. ||.. ..++++.. T Consensus 236 ek~~l~~al~~~~~~a~--~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i------LGqtlTt~~~gs~a~~ 307 (491) T protein:vir:79 236 ETNLLLDRLEDMVQDAV--AVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIAL------LGQNQTTEATSTRASA 307 (491) T ss_pred HHHHHHHHHHHHhcCeE--EEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHH------hhhhhccCcccchhhH Confidence 99999988887766554 55555655555432 2 222237788888888888764 4433 22344445 Q ss_pred HHHHHHHHHHHhHHHHHHHHHHHHhhcccccccC---cceeeechhhhhccCHHHHHHHHHHHHhcCC-cCHHHHHHHhC Q lcl|NC_019456. 297 EHVTHSWTMTLMPIIRQYESQFNMKLFTPGKRVK---GFYFSFNVNGLLRGDTAARTQYYQTLTRNGI-FKPNEIRELEG 372 (435) Q Consensus 297 e~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~---g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~-~t~NE~R~~~g 372 (435) +.+.....+-+.-.++.+++.||+ |+.+.-... ...++|.+..... +.+.+++.++++++.|+ ++..++|+.+| T Consensus 308 ~vh~~v~~~i~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~e~ee-~~~~~a~~~~~L~~~G~~i~~~~~~e~~G 385 (491) T protein:vir:79 308 QAGLEVTDDIRDGDKAIVVEAMNM-LIRWICDLNFDGAARPVFDMWEQEQ-VDEIQAGRDEKLTRAGARFTPAYFKRAYN 385 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEeecCcCc-hhHHHHHHHHHHHhCCCccCHHHHHHHhC Confidence 555555566667777777777774 544332221 1234555544322 23567899999999987 78999999999 Q ss_pred CCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 373 QAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 373 ~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) +|+.. .++....... ..... . .+..+...++.+..+...+.-.. T Consensus 386 ip~~~--~~e~~~~~~~--------~~~~~---~------~~~~~~~~~~~~~~d~~~~~~~~ 429 (491) T protein:vir:79 386 LQDGD--LDERPLPVSA--------VDAVG---A------ASFAEFEAPDQDALDAALNALSA 429 (491) T ss_pred CCCCC--CCccccCcCc--------ccccc---c------ccccccCCCCCcchHHHHHHHHH Confidence 98642 2333211100 00000 0 00000000000000000000000 No 120 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=99.79 E-value=5.3e-18 Score=115.30 Aligned_cols=405 Identities=11% Similarity=0.067 Sum_probs=224.3 Q ss_pred CchHHHHHhhccccccccccccccch------hhh---hhccccccC-----------cc----cccHHHHhhhHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIP------QPL---DMAGVKLEQ-----------AT----FSREHILESNEYIFSI 56 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~------~~~---~~~~~~~~~-----------~~----~~~~~~~~~~~~v~~~ 56 (435) |+ +|....|.+-+.......-.. ..+ ...|..+.. .. .+-++...+.+.|.+| T Consensus 1 ~~---~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~edm~e~D~~i~s~ 77 (526) T protein:vir:79 1 MA---QIVDVYGNPIRPQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAE 77 (526) T ss_pred CC---eeeCCCCCccCccccchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHH Confidence 33 222222221111111000000 000 001111100 00 0111111256889999 Q ss_pred HHHHHHHHhhCceeeeecccc-cccch----HHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc--EEE Q lcl|NC_019456. 57 VTRLSNVLASLPLHEYQNYKQ-MDNEP----LADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE--PIA 129 (435) Q Consensus 57 i~~ia~~ia~~~~~~~~~~~~-~~~~~----l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~--~~~ 129 (435) +..+...|.+++|.|...... ..+.. +...|...| .+.+++..+.. .+.+|.++.++++...+|. |.. T Consensus 78 l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~----~~~~~i~~~ld-A~~~G~s~~Ei~w~~~~g~~~~~~ 152 (526) T protein:vir:79 78 MSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLE----GLEDLLLDALD-GIGHGYSCIELEWALQGREWMPLA 152 (526) T ss_pred HHHHHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhccc----CHHHHHHHHHh-hhhhcceeEEEEEeecCCceeEEE Confidence 999999999999999753221 11112 333333223 35555655544 6789999999998776553 557 Q ss_pred EEEeCCceeEEEEcCCCceEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhc- Q lcl|NC_019456. 130 LWPLDPNTVSILRNTDNNSYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEK- 208 (435) Q Consensus 130 l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n- 208 (435) +.+.++.++.+..+.... + ...........+++...+..++....+.++|.+.+..++-.........+.-..|... T Consensus 153 l~~r~~~~F~~~~~~~~~-l-~~~~~~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~y 230 (526) T protein:vir:79 153 FHHRPQSWFQLNPEDQNE-L-RLRDNSPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIY 230 (526) T ss_pred eeeecccceEeccCCCcE-E-EecCCCCCceeecCCceEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHc Confidence 888888876654433322 2 2222333445677776655566656678899999999988888777666666667655 Q ss_pred C-CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCC-hhhHHHHHHHHHHHHHHHHHhCCCHHHhC Q lcl|NC_019456. 209 K-DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESK-FEPADLSSVEQISRIRIATAFNVPISFLN 286 (435) Q Consensus 209 ~-~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~-~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg 286 (435) | |-.+.+++...++++.+++.+.+.....++ .++++.|++++-+..+ .....|.++.++..++|+.+. || T Consensus 231 G~P~~igky~~~a~~~ek~~L~~av~~i~~da--~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i------LG 302 (526) T protein:vir:79 231 GLPIRLGKYPPGTADEEKATLLRAVTGLGHAA--AGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV------LG 302 (526) T ss_pred CCceEEEecCCCCCHHHHHHHHHHHHHHhcCc--EEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH------hh Confidence 4 335667777788898899998887776554 5566666666655532 222347888888889998874 44 Q ss_pred Cccc--------CcccHHHHHHHHHHHHHhHHHHHHHHHHHHhhcccccccCc---------ceeeechhhhhccCHHHH Q lcl|NC_019456. 287 DDQA--------KSTTNVEHVTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKG---------FYFSFNVNGLLRGDTAAR 349 (435) Q Consensus 287 ~~~~--------~~~~~~e~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g---------~~i~fd~~~l~~~d~~~~ 349 (435) ..-. ++++..+.......+-+.-.++.++..||+.|+.+.-.... .+++|++ ....|.+++ T Consensus 303 qtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~--~e~eDl~~~ 380 (526) T protein:vir:79 303 GTLTSTTSQSGGGAFALGQVHNEVRHDILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDL--REQADITSM 380 (526) T ss_pred hhhccccccCcchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCC--CCcccHHHH Confidence 3221 22333444556667777888999999999887654433321 2445544 458889999 Q ss_pred HHHHHHHHhcCC-cCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCC- Q lcl|NC_019456. 350 TQYYQTLTRNGI-FKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGL- 427 (435) Q Consensus 350 ~~~~~~~~~~g~-~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 427 (435) ++.+.++++.|+ ++..++|+.+|+|. |.+ ++.++.+.. .+.......................+.+.- T Consensus 381 a~~~~~L~~~G~~i~~~~i~e~~gip~-~~~-~e~~l~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 450 (526) T protein:vir:79 381 AQSIPALVNVGLEIPSAWVYDKLGIPQ-PAK-NEPVLRPAA--------QPAILSRQHGQRVAALATIVGPRYGDQQALD 450 (526) T ss_pred HHHHHHHHhCCCcCCHHHHHHHhCCCC-CCC-chhhccccC--------CccccccccccccccccccccccCchhhHHH Confidence 999999999997 78899999999965 333 333332211 000000000000000000000000000000 Q ss_pred CCCCCCCC Q lcl|NC_019456. 428 QSTEPEGS 435 (435) Q Consensus 428 ~~~~~~~~ 435 (435) .-.+.... T Consensus 451 ~~l~~~~~ 458 (526) T protein:vir:79 451 KALADLPA 458 (526) T ss_pred HHHHHHHH Confidence 00000000 No 121 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=99.77 E-value=1.8e-17 Score=112.34 Aligned_cols=388 Identities=12% Similarity=0.036 Sum_probs=218.9 Q ss_pred Cc--hHHHHHhhccccc-------------c-----ccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHH Q lcl|NC_019456. 1 MS--FMSKVRQFFGVHD-------------Q-----ANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRL 60 (435) Q Consensus 1 Mg--~~~~~~~~~~~~~-------------~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~i 60 (435) |. |+.--.+.+.... + +....-......+...+ +.....+..++.+.|.+|++.+ T Consensus 1 m~~~i~~~~g~p~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~~~~~~iLr~~~-----~~~~~y~~m~~D~~i~s~l~~R 75 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALG-----KDIRVYRELRADAHVGGCVRRR 75 (491) T ss_pred CCCceeCCCCCccCcccCChHHHHHHHhhhcccccccccCCccchHHHHHhcC-----CCHHHHHHHhhChHHHHHHHHH Confidence 21 1110000000000 0 00000000000000000 0111223355788999999999 Q ss_pred HHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc--EEEEEEeCCcee Q lcl|NC_019456. 61 SNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE--PIALWPLDPNTV 138 (435) Q Consensus 61 a~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~--~~~l~~l~~~~v 138 (435) ...|.+++|.+...+......+...-++.++ .+.+++..+. +.+++|.++.++++...+|. |..+.+.++.++ T Consensus 76 k~av~~~~w~i~~~~~~~~~~e~v~e~l~~~----~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f 150 (491) T protein:vir:10 76 KAAVKALEWGLDRGKAKSRVAKSIADVFADL----DLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWF 150 (491) T ss_pred HHHHhCCCcEEecCCCCHHHHHHHHHHHhcC----CHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeeecccce Confidence 9999999999976433322223333344443 4677777775 67899999999999766664 558999998876 Q ss_pred EEEEcCCCceEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhc-C-CceEEEe Q lcl|NC_019456. 139 SILRNTDNNSYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEK-K-DKFVLQY 216 (435) Q Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n-~-~~~~~~~ 216 (435) .+ +.++...+...........+++...|++++......++|.|.+..++-.+.......+.-..|... | |-.+.++ T Consensus 151 ~~--d~~~~l~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky 228 (491) T protein:vir:10 151 VY--DPENQLRFRSKDHWMQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKH 228 (491) T ss_pred ee--ccCCceEEecCCCCCCcceecCCCEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEec Confidence 64 334444333222233456788888888777666678899999999998888877777777777654 4 3456788 Q ss_pred CCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccC--Chhh-HHHHHHHHHHHHHHHHHhCCCHHHhCCc----c Q lcl|NC_019456. 217 DRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYES--KFEP-ADLSSVEQISRIRIATAFNVPISFLNDD----Q 289 (435) Q Consensus 217 ~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~--~~~~-~~~~e~~~~~~~~Ia~~fgvP~~~lg~~----~ 289 (435) +...++++.+++.+.+.+...++ .+|++.|++++-+.. +... ..|.++.++..++|+.+. ||.. . T Consensus 229 ~~~a~~~ek~~l~~al~~~~~~a--~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i------LGqtlTt~~ 300 (491) T protein:vir:10 229 PRSASDGEKNLLLDCLEDMVQDA--VAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIAL------LGQNQTTEA 300 (491) T ss_pred CCCCCHHHHHHHHHHHHHHhcCc--EEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHH------hhhhcccCc Confidence 88889999999998888776654 455666666555533 2222 237788888888887773 4433 2 Q ss_pred cCcccHHHHHHHHHHHHHhHHHHHHHHHHHHhhccccccc---CcceeeechhhhhccCHHHHHHHHHHHHhcCC-cCHH Q lcl|NC_019456. 290 AKSTTNVEHVTHSWTMTLMPIIRQYESQFNMKLFTPGKRV---KGFYFSFNVNGLLRGDTAARTQYYQTLTRNGI-FKPN 365 (435) Q Consensus 290 ~~~~~~~e~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~---~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~-~t~N 365 (435) .++++..+.+.....+-+.-.++.++..+|. |+.+.-.. ...+.+|.+.... .+.+.+++.+.++++.|+ ++.. T Consensus 301 ~gs~a~~~vh~~v~~di~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~~~~-e~~~~~a~~~~~L~~~G~~i~~~ 378 (491) T protein:vir:10 301 TSTRASAQAGLEVTDDIRDGDKAVVSEAMNM-LIRWICDLNFDGADRPVFDMWEQE-QVDEIQAGRDQKLTQAGARFTPA 378 (491) T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEEecCcC-chhHHHHHHHHHHHhCCCcCCHH Confidence 2334444545555566666677777777764 44322111 1112334444332 334778999999999987 7889 Q ss_pred HHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 366 EIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 366 E~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) ++|+.+|+|+-. .++...-. ...... ..+.... .....++ +.+.. T Consensus 379 ~i~e~~Gip~~~--~~~~~~~~--------~~~~~~-----~~~~~~~--------~~~~~~~--~~d~~ 423 (491) T protein:vir:10 379 YFKRAYNLQDGD--LDERPLPV--------SAVDTV-----GAASFAE--------FEAPDQD--ALDAA 423 (491) T ss_pred HHHHHhCCCCCC--cCcccccc--------CCCCCc-----ccccccc--------cCCCCCC--chHHH Confidence 999999998642 22221100 000000 0000000 0000000 00111 No 122 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=99.77 E-value=8.6e-18 Score=114.15 Aligned_cols=390 Identities=11% Similarity=0.012 Sum_probs=216.9 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) .+..+.+..++..-. .+........+ .+.....+..++.+.|.+|++.+...|.+++|.+...+....+ T Consensus 14 ~~~~d~~~~~~~~l~-------~~~~~il~~a~----~g~~~~y~~l~~D~~i~s~l~~rk~av~~~~w~i~p~~~~~~~ 82 (488) T protein:vir:99 14 GDGRDITRPFISGLQ-------VPNDSILQRRG----GNDLRVYEEILSDAQVKTVWGQRQLAVVSREWKVEAGGDRPID 82 (488) T ss_pred HhhhhhhccccCCCC-------CCChHHHHhhc----cCCHHHHHHHhhChHHHHHHHHHHHHHhcCCceEEcCCCChHH Confidence 111111111110000 00001100000 0001112334677899999999999999999999754433222 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc--EEEEEEeCCceeEEEEcCCCceEEEEEecCCe Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE--PIALWPLDPNTVSILRNTDNNSYWYRVTSDIY 158 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~--~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 158 (435) ......+...- ....+.+++..+. +.+++|.++.++++...+|. |..+.+.++.++.+ +.++...+........ T Consensus 83 ~~~ae~v~~~l-~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~--d~~~~l~~~~~~~~~~ 158 (488) T protein:vir:99 83 QAAAEHLEQQL-QRVGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRFRY--DQDGGLRLLTPNNMFE 158 (488) T ss_pred HHHHHHHHHHH-hCCCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccceee--cCCCceEEeccCCCCC Confidence 22222222111 1235777777776 57889999999998765554 45788888887654 3334433332222223 Q ss_pred eEEEchhh--eEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhc-C-CceEEEeCC-cCCHHHHHHHHHHHH Q lcl|NC_019456. 159 NFTIPIND--VIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEK-K-DKFVLQYDR-SISPEKRQAMVNDFL 233 (435) Q Consensus 159 ~~~~~~~~--iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n-~-~~~~~~~~~-~~~~e~~~~~~~~~~ 233 (435) ...++... |+|. +....+.++|.|.+..++-.........+.-..|... | |-.+.+++. ..++++.+++.+.+. T Consensus 159 g~~lp~~~~~i~~~-~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek~~l~~av~ 237 (488) T protein:vir:99 159 GEPCPAPYFWHFST-GADNDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDKAKLLAALH 237 (488) T ss_pred ccccccCceEEEEe-ecCCCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHHHHHHHHHHH Confidence 44565432 3333 3334567899999999988887777777777767654 4 334556664 567788888888887 Q ss_pred HHhcCCCccccccCCceeeeccCC-hhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhHHHH Q lcl|NC_019456. 234 RMVKENGGAVVQEAGWKVDRYESK-FEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMPIIR 312 (435) Q Consensus 234 ~~~~~~~~~~vl~~g~~~~~~~~~-~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P~~~ 312 (435) ....++ .+|++.|++++-++.+ .....|.++.++..++|+.+.- -..+......++++..+.+.....+.+...++ T Consensus 238 ~~~~~~--~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iL-Gqtlts~~~~Gs~a~~~vh~~v~~d~~~aDa~ 314 (488) T protein:vir:99 238 AIQTDS--AIIMPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVGL-GQVASTQGTPGRLGNDDLQADVRLDLVKADAD 314 (488) T ss_pred HHhcCc--EEEecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHHh-hhhhcccccccchhhHHHHHHHHHHHHHHHHH Confidence 766554 4555666666555432 2223478888888899987741 11222222223444555566667788888899 Q ss_pred HHHHHHHHhhcccccccC-----cceeeechhhhhccCHHHHHHHHHHHHhc-CC-cCHHHHHHHhCCCCCCCcCCceee Q lcl|NC_019456. 313 QYESQFNMKLFTPGKRVK-----GFYFSFNVNGLLRGDTAARTQYYQTLTRN-GI-FKPNEIRELEGQAPIPDEAADHLY 385 (435) Q Consensus 313 ~i~~~l~~~l~~~~~~~~-----g~~i~fd~~~l~~~d~~~~~~~~~~~~~~-g~-~t~NE~R~~~g~~p~~~~~gd~~~ 385 (435) .+++.||+.|+.+..... ..++.|++. ...|.+++++.+.++++. |+ ++..++|+.+|+|+-. + ++... T Consensus 315 ~i~~tln~~li~~l~~~N~~~~~~p~~~~~~~--e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gip~~~-~-~~~~~ 390 (488) T protein:vir:99 315 LICESFNLGPARWLTEWNFPGAQPPRVYRVIE--EPEDITAKAERDEKVFRMSGFRPTRGYVQETYGVEVES-T-QAEAT 390 (488) T ss_pred HHHHHHHHHHHHHHHHhCcCCcCCceeEecCC--CcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCCCCcc-c-ccccc Confidence 999999888765443222 124555544 478889999999999986 64 6888899999999742 2 22221 Q ss_pred ecccccchhccccccccccccccccccccccCCCCCCCCCCC--CCCCCCCC Q lcl|NC_019456. 386 ISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGL--QSTEPEGS 435 (435) Q Consensus 386 ~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 435 (435) .+.. ... ..++..+.+.... +..+.+.. T Consensus 391 ~~~~-----------~~~-----------~~~~~~~~~~~~~~~~~~~~~~~ 420 (488) T protein:vir:99 391 APTP-----------STE-----------FAEGDQPSDPAAAMAPQLAEAMQ 420 (488) T ss_pred cCCC-----------ccc-----------CCCCCCCCCchHHHHHHHHHHHH Confidence 1100 000 0000000000000 00000000 No 123 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.75 E-value=5.5e-19 Score=120.67 Aligned_cols=413 Identities=10% Similarity=0.028 Sum_probs=223.9 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhcccc---------ccCc---cc----------ccHHHHhhhHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVK---------LEQA---TF----------SREHILESNEYIFSIVT 58 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~---~~----------~~~~~~~~~~~v~~~i~ 58 (435) |+++++...+........... ....++..+.. +... .. -..+.+..++.+..+|+ T Consensus 8 ~~~~dr~i~~~~~~~~~~~~~---~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~ 84 (505) T protein:vir:96 8 PSLAQRMVNWAWYRYVEPQKN---AARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAKRFYQ 84 (505) T ss_pred cchhhcccchhhhhhHHHHHH---hhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 888887765432211111000 00001100000 0000 00 01133567899999999 Q ss_pred HHHHHHhh-Cceeeeec--cc-----ccc---cchHHHhhhcccc----ccCCHHHHHHHHHHHHHhcCCcceEEeeeCC Q lcl|NC_019456. 59 RLSNVLAS-LPLHEYQN--YK-----QMD---NEPLADLLKTSPN----PNMTAFEFIARLETDRNVSGNGYAWIQKSLS 123 (435) Q Consensus 59 ~ia~~ia~-~~~~~~~~--~~-----~~~---~~~l~~~l~~~Pn----~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~ 123 (435) .+.+.+-. ..++..-. .. +.. -..+...+..++| ..++.+++...++..++..|++|+...++.. T Consensus 85 ~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~ 164 (505) T protein:vir:96 85 LLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREHRGYP 164 (505) T ss_pred HHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEeecCC Confidence 77766664 45544321 10 001 1123444444554 4467889999999999999999998876544 Q ss_pred CCcEEEEEEeCCceeE----------------EEEcCCCceEEEEEecC-------------CeeEEEchhheEEeccCC Q lcl|NC_019456. 124 TGEPIALWPLDPNTVS----------------ILRNTDNNSYWYRVTSD-------------IYNFTIPINDVIHVKHVV 174 (435) Q Consensus 124 ~g~~~~l~~l~~~~v~----------------~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~iih~~~~~ 174 (435) ...+..|..|+|+++. |..+..|..+.|.+... .....+|+++|+|+..+. T Consensus 165 ~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~~~~rvpa~~vlH~f~~~ 244 (505) T protein:vir:96 165 NKWGYALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYAGQTYERVPADEIIHTFVPW 244 (505) T ss_pred CCcceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeecCCCccccccccccccccccCHhHhhhhhccc Confidence 4456789999988774 23344565555555321 123458999999998877 Q ss_pred CccccccCcHHHHHHHHHHHHHHHHHHHHHHh--hcCCceEEEeCCc-CCHHHHHHHHHHHHHHhcCCCccccccCCcee Q lcl|NC_019456. 175 PSNSWYGVSPIDVLSSSLKFQRSVENFSQNEM--EKKDKFVLQYDRS-ISPEKRQAMVNDFLRMVKENGGAVVQEAGWKV 251 (435) Q Consensus 175 ~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~--~n~~~~~~~~~~~-~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~ 251 (435) ..+...|+|.+..+...+.-.....+...... .....++++.+.. ..+...+.- ..... .-..|.+..|..|.++ T Consensus 245 r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~-~~~~~-~l~pG~i~~L~pGe~i 322 (505) T protein:vir:96 245 RPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPEDDQ-GEIVE-EVEAGTYQLLPYGIRF 322 (505) T ss_pred CCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCcccccc-Ccccc-ccCCceeeecCCCCee Confidence 78899999999998888776555444333222 2223345554322 111100000 00000 1135678889999999 Q ss_pred eeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhC-CcccCcccHHHH-HHHHH-----------HHHHhHHHHH-HHHH Q lcl|NC_019456. 252 DRYESKFEPADLSSVEQISRIRIATAFNVPISFLN-DDQAKSTTNVEH-VTHSW-----------TMTLMPIIRQ-YESQ 317 (435) Q Consensus 252 ~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg-~~~~~~~~~~e~-~~~~~-----------~~~i~P~~~~-i~~~ 317 (435) +.++.+....+|.+..+...+.||+.+|||.+.|- ..+.+||++.-+ ...++ ...+.|+.+. ++.+ T Consensus 323 ~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a 402 (505) T protein:vir:96 323 KEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMS 402 (505) T ss_pred eeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99888765678999999999999999999999774 445567765422 11111 1223343322 3333 Q ss_pred HHHhhcccccccCc--ceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhc Q lcl|NC_019456. 318 FNMKLFTPGKRVKG--FYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDK 395 (435) Q Consensus 318 l~~~l~~~~~~~~g--~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~ 395 (435) +....++....... ..+.|-.-.....|+.+-++....++++|+.|+-|+-++.|.++- +--+++..-.. -+.. T Consensus 403 ~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~~--~v~~q~a~e~~--~~~~ 478 (505) T protein:vir:96 403 LLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDDPE--DVFDEIAWEEQ--LMRD 478 (505) T ss_pred HHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHH--HHHHHHHHHHH--HHHH Confidence 33333321111111 135565556667899999999999999999999999988998863 22222211000 0000 Q ss_pred cccccccccccccccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 396 YYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPE 433 (435) Q Consensus 396 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (435) . +. ....+... .....++.++++..+| T Consensus 479 ~-------Gl-~~~~~~~~---~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 479 K-------GV-NPTPPEQE---SKDATTDEEDDSASDD 505 (505) T ss_pred c-------CC-CCCCCCCC---CCCCCCCCCCCCCCCC Confidence 0 00 00000000 0000000011111111 No 124 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.74 E-value=2.2e-18 Score=117.36 Aligned_cols=431 Identities=13% Similarity=0.080 Sum_probs=219.5 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccc------cCccc-------------ccHHHHhhhHHHHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKL------EQATF-------------SREHILESNEYIFSIVTRLS 61 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~-------------~~~~~~~~~~~v~~~i~~ia 61 (435) |+|++++..+|.+.....-.........++..+... ...+. -..+.+..++.+..||+.+. T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 80 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLE 80 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 999999998875432111000000000011000000 00000 01133557789999999876 Q ss_pred HHHhh---Cceeeee---ccccc--cc---chHHHhhhccc--cccCCHHHHHHHHHHHHHhcCCcceEEeeeCCC---- Q lcl|NC_019456. 62 NVLAS---LPLHEYQ---NYKQM--DN---EPLADLLKTSP--NPNMTAFEFIARLETDRNVSGNGYAWIQKSLST---- 124 (435) Q Consensus 62 ~~ia~---~~~~~~~---~~~~~--~~---~~l~~~l~~~P--n~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~---- 124 (435) +.+-. +-+.-.- +.... .. ..+...+..++ ...++.+++...++..++..|++++.+.++... T Consensus 81 ~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~ 160 (548) T protein:vir:95 81 ERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTF 160 (548) T ss_pred HhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccC Confidence 66554 2222111 11100 00 11222333332 345789999999999999999999988765322 Q ss_pred --CcEEEEEEeCCceeE-------------EEEcCCCceEEEEEecC-----------CeeEEEchhheEEeccCCCccc Q lcl|NC_019456. 125 --GEPIALWPLDPNTVS-------------ILRNTDNNSYWYRVTSD-----------IYNFTIPINDVIHVKHVVPSNS 178 (435) Q Consensus 125 --g~~~~l~~l~~~~v~-------------~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~iih~~~~~~~~~ 178 (435) ..+..|..|+|+++. |..+..|.++.|.+... ...+.+|+++|+|+..+...+. T Consensus 161 g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~~~~~~rvpA~~VlHif~~~r~gQ 240 (548) T protein:vir:95 161 ATSVPFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHLLKDHPGNLQTLGGSLAVKRVEAERIIHIAYRKRIGQ 240 (548) T ss_pred CcccceEEEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEEeecCCCcccccccccceeeechhHheecccccCCcc Confidence 245688999998774 23344555555544321 1245699999999988777788 Q ss_pred cccCcHHHHHHHHHHHHHHHHHHHHHHh--hcCCceEEEeCCcCCHHHHHHHHHHHHHHh-cCCCcc-ccccCCceeeec Q lcl|NC_019456. 179 WYGVSPIDVLSSSLKFQRSVENFSQNEM--EKKDKFVLQYDRSISPEKRQAMVNDFLRMV-KENGGA-VVQEAGWKVDRY 254 (435) Q Consensus 179 ~~G~s~l~~~~~~i~~~~~~~~~~~~~~--~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~-~~~~~~-~vl~~g~~~~~~ 254 (435) ..|+|.+..+...+.-.....+...... .....++++.+..-.... ......-.... -..|.+ ..|..|.+++.+ T Consensus 241 ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~-~~~~~~~~~~~~~~pG~iv~~L~pGe~i~~~ 319 (548) T protein:vir:95 241 NRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTV-EPGKDRKNRTIPIAPGMVFDDLEPGEDVGMI 319 (548) T ss_pred ccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccC-CCCcccccccccccCCccccccCCCceeeec Confidence 9999999998888876655444433222 222344444332211000 00000000000 123444 358899999988 Q ss_pred cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHH-HHHHH------HH-HHhHHHHH-----HHHHHHHh Q lcl|NC_019456. 255 ESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEH-VTHSW------TM-TLMPIIRQ-----YESQFNMK 321 (435) Q Consensus 255 ~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~-~~~~~------~~-~i~P~~~~-----i~~~l~~~ 321 (435) +.+....+|.+..+...+.||+.+|||.+.|....+.||++.-+ +..++ +. .+..+|.. +++++..- T Consensus 320 ~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G 399 (548) T protein:vir:95 320 ESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYDGTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLAR 399 (548) T ss_pred CCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC Confidence 87755567999999999999999999999887665567766432 11111 11 12233332 22333322 Q ss_pred hcc-ccc--ccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceee------ecccccc Q lcl|NC_019456. 322 LFT-PGK--RVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLY------ISKDLYP 392 (435) Q Consensus 322 l~~-~~~--~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~------~~~n~~~ 392 (435) .++ +.. ......++|-.-.....|+.+-++....++++|+.|.-|+-++.|.++-. --+++. .-.++ + T Consensus 400 ~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~~e--v~~q~a~E~~~~~~~GL-~ 476 (548) T protein:vir:95 400 KERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARARGRDPRE--LKKSRETEIKANRAAGL-V 476 (548) T ss_pred CcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHH--HHHHHHHHHHHHHHcCC-C Confidence 221 110 01112344444555668999999999999999999999988888887531 111110 00010 0 Q ss_pred hhc--ccccccccccccccc-----c------------------cccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 393 LDK--YYDAILDNKIQTDAS-----V------------------AAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 393 l~~--~~~~~~~~~~~~~~~-----~------------------~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) ++. ..........+.+.. . +.-+-.|-|..+++.+-..+..-| T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 544 (548) T protein:vir:95 477 FSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLPVPGPDFPNESNNGGADGQPS 544 (548) T ss_pred CCCcccccccccccCCCCchhhhccccccccccchhHHhhccCCCCCcCCCCCCCcccccCCCCCCCC Confidence 000 000000000000000 0 000111111112221111111111 No 125 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=99.74 E-value=6.2e-17 Score=109.42 Aligned_cols=404 Identities=12% Similarity=0.080 Sum_probs=224.0 Q ss_pred HHHHHhhccccccccccccccchh------h---hhhccccccC----------ccc-----ccHHHHhhhHHHHHHHHH Q lcl|NC_019456. 4 MSKVRQFFGVHDQANQIVQNPIPQ------P---LDMAGVKLEQ----------ATF-----SREHILESNEYIFSIVTR 59 (435) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~------~---~~~~~~~~~~----------~~~-----~~~~~~~~~~~v~~~i~~ 59 (435) |++|....|.+-........-... . ....|..+.. +.. .-++..++.+.|.+|++. T Consensus 1 m~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L~~dm~~~D~hi~s~l~~ 80 (512) T protein:vir:19 1 MGRILDISGQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADLAFDMEEKDTHLFSELSK 80 (512) T ss_pred CcceeCCCCCccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 333433333222111111110000 0 0011111110 000 011222357889999999 Q ss_pred HHHHHhhCceeeeeccc-ccccch----HHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCC--cEEEEEE Q lcl|NC_019456. 60 LSNVLASLPLHEYQNYK-QMDNEP----LADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTG--EPIALWP 132 (435) Q Consensus 60 ia~~ia~~~~~~~~~~~-~~~~~~----l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g--~~~~l~~ 132 (435) +...|.+++|.|..... ...+.. +...|...| .+.+++..+. +.+++|.+++++++...+| .|..+.+ T Consensus 81 Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~----~f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~~~~~~~ 155 (512) T protein:vir:19 81 RRLAIQALEWRIAPARDASAQEKKDADMLNEYLHDAA----WFEDALFDAG-DAILKGYSMQEIEWGWLGKMRVPVALHH 155 (512) T ss_pred HHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCC----CHHHHHHHHH-hhhhhcceeeeeEeeeeCCceeeeeeee Confidence 99999999999975322 222222 223333333 3556666654 5788999999999865444 4667888 Q ss_pred eCCceeEEEEcCCCceEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhc-C-C Q lcl|NC_019456. 133 LDPNTVSILRNTDNNSYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEK-K-D 210 (435) Q Consensus 133 l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n-~-~ 210 (435) .++.++....+..+... ..........+++...+..++....+.++|.+.+..++-.........+.-..|... | | T Consensus 156 r~~~~f~~~~~~~~~lr--~~~~~~~G~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P 233 (512) T protein:vir:19 156 RDPALFCANPDNLNELR--LRDASYHGLELQPFGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLP 233 (512) T ss_pred eccccceeccCCCcEEE--ecCCCCCceeecCCceEEEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCC Confidence 98887665444333222 222223345677776666666656778899999999988888777777777777655 4 3 Q ss_pred ceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCC-hhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcc Q lcl|NC_019456. 211 KFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESK-FEPADLSSVEQISRIRIATAFNVPISFLNDDQ 289 (435) Q Consensus 211 ~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~-~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~ 289 (435) -.+.+++...++++.+++.+.+.....++ .+|++.|++++-+..+ .....|.++.++..++|+.+. ||..- T Consensus 234 ~~igky~~~a~~~ek~~L~~al~~~~~~a--~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~i------LGqtl 305 (512) T protein:vir:19 234 MRVGKYPTGSTNREKATLMQAVMDIGRRA--GGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAI------LGGTL 305 (512) T ss_pred eeEEecCCCCCHHHHHHHHHHHHHHhhCc--EEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHH------hhhhh Confidence 34667888888899899998888776554 5556666666555433 222347888888889998773 44332 Q ss_pred ------cCcccHHHHHHHHHHHHHhHHHHHHHHHHHHhhcccccccCc---------ceeeechhhhhccCHHHHHHHHH Q lcl|NC_019456. 290 ------AKSTTNVEHVTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKG---------FYFSFNVNGLLRGDTAARTQYYQ 354 (435) Q Consensus 290 ------~~~~~~~e~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g---------~~i~fd~~~l~~~d~~~~~~~~~ 354 (435) .++++..+-+.....+-+...++.++..||+.|+.+.-.... .+++|++. ...|.+..++.+. T Consensus 306 Ts~~g~~Gs~a~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~~--e~eDl~~~a~~~~ 383 (512) T protein:vir:19 306 TTEAGDKGARSLGEVHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFDTS--EAGDITALSDAIP 383 (512) T ss_pred cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCC--ChhhHHHHHHHHH Confidence 122334455666667778888999999999888765422221 24555544 4788888999888 Q ss_pred HHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 355 TLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEG 434 (435) Q Consensus 355 ~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (435) ++...--++..++|+.+|+|. |.++ +....+.-..+ ..................+.-+.-....++-.. T Consensus 384 ~l~~G~~i~~~~i~e~~Gip~-~~~~-e~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~ 452 (512) T protein:vir:19 384 KLAAGMRIPVSWIQEKLHIPQ-PVGD-EAVFTIQPVVP---------DNGSQKEAALSAEDIPQEDDIDRMGVSPEDWQR 452 (512) T ss_pred HHhcCCCCCHHHHHHHhCCCC-CCCc-cccccCCCccc---------cccccccccccccCCCchhhHhHHhhhHHHHHH Confidence 887544568899999999974 3333 33221110000 000000000000000000000000000000000 Q ss_pred ----------------C Q lcl|NC_019456. 435 ----------------S 435 (435) Q Consensus 435 ----------------~ 435 (435) | T Consensus 453 ~~~~~~~~i~~~~~~~s 469 (512) T protein:vir:19 453 SVDPLLKPVIFSVLKDG 469 (512) T ss_pred HHHHHHHHHHHHHHhCC Confidence 0 No 126 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.72 E-value=3e-17 Score=111.15 Aligned_cols=418 Identities=12% Similarity=0.038 Sum_probs=218.4 Q ss_pred CchHHHHHhhcccccccccccc--------ccchhhhhhccccccC----------cccccHHHHhhhHHHHHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQ--------NPIPQPLDMAGVKLEQ----------ATFSREHILESNEYIFSIVTRLSN 62 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~v~~~i~~ia~ 62 (435) |..-. ..++..+...... ........+.....+. ...-..+.+..+|.+..||+.+.+ T Consensus 1 ~~~~~----~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~ 76 (530) T protein:vir:38 1 MKIPS----LVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQD 76 (530) T ss_pred Cccce----eecCccccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHH Confidence 43321 1111110000000 0000000000000000 000112345678999999998888 Q ss_pred HHhhCceeeeec---------cccc--cc---chHHHhhhcccc------ccCCHHHHHHHHHHHHHhcCCcceEEeeeC Q lcl|NC_019456. 63 VLASLPLHEYQN---------YKQM--DN---EPLADLLKTSPN------PNMTAFEFIARLETDRNVSGNGYAWIQKSL 122 (435) Q Consensus 63 ~ia~~~~~~~~~---------~~~~--~~---~~l~~~l~~~Pn------~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~ 122 (435) .+-...|.+.-+ ++.. .. ..+...+...|+ ..++.+++...++..++..|++|+.+.++. T Consensus 77 nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~ 156 (530) T protein:vir:38 77 HIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDS 156 (530) T ss_pred HhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeecc Confidence 887777776432 1111 01 123333444443 357899999999999999999999988765 Q ss_pred CCCc--EEEEEEeCCceeE--------------EEEcCCCceEEEEEecC---Ce----------eEEEchhheEEeccC Q lcl|NC_019456. 123 STGE--PIALWPLDPNTVS--------------ILRNTDNNSYWYRVTSD---IY----------NFTIPINDVIHVKHV 173 (435) Q Consensus 123 ~~g~--~~~l~~l~~~~v~--------------~~~~~~~~~~~~~~~~~---~~----------~~~~~~~~iih~~~~ 173 (435) ..|. +..|..|+|+++. |..+..|....|.+... +. ...++.++|+|+..+ T Consensus 157 ~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~ 236 (530) T protein:vir:38 157 DSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEP 236 (530) T ss_pred CCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccChhHeEeeccc Confidence 5554 4578999987764 33345565555544321 11 133667799999988 Q ss_pred CCccccccCcHHHHHHHHHHHHHHHHHHHHHHh--hcCCceEEEeCCcC-----------CHHHHHHHHH------HHHH Q lcl|NC_019456. 174 VPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEM--EKKDKFVLQYDRSI-----------SPEKRQAMVN------DFLR 234 (435) Q Consensus 174 ~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~--~n~~~~~~~~~~~~-----------~~e~~~~~~~------~~~~ 234 (435) ...+...|+|.+..+...+.-.....+...... .....++++.+..- .+++...+.. .... T Consensus 237 ~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 316 (530) T protein:vir:38 237 MEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYS 316 (530) T ss_pred cCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchhhhhccc Confidence 777899999999998887776554443332221 22223333322110 0111111111 1100 Q ss_pred ---HhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCcccCcccHHHHH-HHHH------ Q lcl|NC_019456. 235 ---MVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFL-NDDQAKSTTNVEHV-THSW------ 303 (435) Q Consensus 235 ---~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~l-g~~~~~~~~~~e~~-~~~~------ 303 (435) ..-..|.+..|..|.+++.++.+-...+|.+..+.....||+.+|||.+.| |..+..||++.-+. ..++ T Consensus 317 ~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~ 396 (530) T protein:vir:38 317 AAPVRLGGARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMGR 396 (530) T ss_pred ccceeccCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHH Confidence 011356788899999999888775556789999999999999999999977 44556677754321 1111 Q ss_pred H-HHHhHHHHH-----HHHHHHHhhccccc-c--------cCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHH Q lcl|NC_019456. 304 T-MTLMPIIRQ-----YESQFNMKLFTPGK-R--------VKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIR 368 (435) Q Consensus 304 ~-~~i~P~~~~-----i~~~l~~~l~~~~~-~--------~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R 368 (435) + ..+.|+|+. +++++....++... . ..-..++|-.-.....|+.+-++....++++|+.|.-|+- T Consensus 397 q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~ 476 (530) T protein:vir:38 397 RKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKEC 476 (530) T ss_pred HHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHH Confidence 1 112233322 33333333332110 0 0001245555566678999999999999999999999999 Q ss_pred HHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 369 ELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 369 ~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) ++.|.++- +--+++.--. .-+. ..+...+.....++..+.. +..++++....+| T Consensus 477 a~~G~D~~--~v~~q~a~e~--~~~~-------~~Gl~~~~~~~~~~~~~~~--~~~~~~~d~~~~a 530 (530) T protein:vir:38 477 AKRGDDYQ--EIFAQQVRES--MERR-------AAGLNPPAWAAAAFEAGVK--KSNEEEQDGARAA 530 (530) T ss_pred HHcCCCHH--HHHHHHHHHH--HHHH-------HcCCCCCCCcccccCCCCC--CCCCCCCCCCCCC Confidence 99998863 2222221000 0000 0111111111111111111 1111112222222 No 127 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=99.70 E-value=1.8e-17 Score=112.40 Aligned_cols=367 Identities=10% Similarity=0.063 Sum_probs=209.0 Q ss_pred CchHHHHHhhcccccc----cccccc---ccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeee Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQ----ANQIVQ---NPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQ 73 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~----~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~ 73 (435) |-+.+.-.-.+..+.. ...... ++++-.- ..|......-.+-++...+.+.|++|+..+...|.+++|.|.. T Consensus 3 ~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~D~~lr-~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~~w~V~p 81 (446) T protein:vir:98 3 MEVRNAPTPAIRRRTIYAMEHLGLATSYLSEDGGYK-RAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNKVGPYQH 81 (446) T ss_pred ccccCCCchhhhhhhhhccccchhhcccCCcchHhh-hcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcCCceecC Confidence 3222100000000000 000000 0000000 0010000000111222335789999999999999999999976 Q ss_pred cccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc---------EEEEEEeCCceeEEEEcC Q lcl|NC_019456. 74 NYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE---------PIALWPLDPNTVSILRNT 144 (435) Q Consensus 74 ~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~---------~~~l~~l~~~~v~~~~~~ 144 (435) ..+++.+ -+...|.... .++....+.+.+.+|.++.++++....|. ++.+.|+++ ....+. T Consensus 82 ~~~~~a~-~v~~~l~~~~------~~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~~~~~~---r~~~~~ 151 (446) T protein:vir:98 82 GDKRIKK-FIDDQLRNRA------KTWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNYHPLQV---MLIAND 151 (446) T ss_pred ccHHHHH-HHHHHHhhcC------chhHHHHHHHHHhhCceeeeEEEeecccccccchhhccccccccccc---eeeecc Confidence 5433321 2333333221 24455557899999999999998654332 112333332 222222 Q ss_pred CCceEE--------------------------EEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHH Q lcl|NC_019456. 145 DNNSYW--------------------------YRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSV 198 (435) Q Consensus 145 ~~~~~~--------------------------~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~ 198 (435) ++.... ......+....+|....+++++....+.++|.|.+..++-........ T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi~~~~~~~~~~p~G~gLlr~~~w~~~fK~~~ 231 (446) T protein:vir:98 152 NGRIVDGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRLFINYNTKGNNPWGTSCLTSVLDYSIFKRAF 231 (446) T ss_pred CCccccccccchhhcccccccCcccchhhhhhhhcccCcccccccccceEEEEecCCCCCccccchHHHHHHHHHHHHhh Confidence 211100 001112334568899999999887778899999999999988888877 Q ss_pred HHHHHHHhhc-C-CceEEEeCCcCCHH---------HHHHHHHHHHHHhc----CCCccc---cccCCceeeeccCChh- Q lcl|NC_019456. 199 ENFSQNEMEK-K-DKFVLQYDRSISPE---------KRQAMVNDFLRMVK----ENGGAV---VQEAGWKVDRYESKFE- 259 (435) Q Consensus 199 ~~~~~~~~~n-~-~~~~~~~~~~~~~e---------~~~~~~~~~~~~~~----~~~~~~---vl~~g~~~~~~~~~~~- 259 (435) .+.-..|... | |-.+.+++...+++ ..+...+.+.+.+. +++.++ +++.|++++-++.... T Consensus 232 ~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g~eie~~ea~~~~ 311 (446) T protein:vir:98 232 RDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQPVQVGALTTGNNF 311 (446) T ss_pred HHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCCceEEeeccccCC Confidence 7777777755 4 33455665443322 22233344444432 333322 3488999988765433 Q ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHhCCccc--CcccHHHHHHHHHHHHHhHHHHHHHHHHHHhhcccccccCc------ Q lcl|NC_019456. 260 PADLSSVEQISRIRIATAFNVPISFLNDDQA--KSTTNVEHVTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKG------ 331 (435) Q Consensus 260 ~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~--~~~~~~e~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g------ 331 (435) ..++.+..++..++|+.+.....-.++.... ++++-.+.+...+.+.+.-.++++++.+|+.|+.+.-...+ T Consensus 312 ~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~V~~d~~~aDa~~i~~tln~~Li~~l~~lNf~~~~~~ 391 (446) T protein:vir:98 312 SDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLELFDGKINSIFDTVIHAFTEQVIGNLIRLNFDPALYP 391 (446) T ss_pred hhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccc Confidence 2358888899899999987655444443322 33333455666677788889999999999988654422221 Q ss_pred -----ceeeechhhhhccCHHHHHHHHHHHHhcCCcCH---HHHHHHhCCCCCCCcC Q lcl|NC_019456. 332 -----FYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKP---NEIRELEGQAPIPDEA 380 (435) Q Consensus 332 -----~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~---NE~R~~~g~~p~~~~~ 380 (435) .+++|++. ...|.+..++.+.++++.|..++ +.+|+.+|+|+-++.- T Consensus 392 ~~~~~~~~~~~~~--e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~~~ 446 (446) T protein:vir:98 392 LASNTGYITRLPG--RATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAISST 446 (446) T ss_pred cccccccceeccC--ChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCCCC Confidence 13344443 36788999999999999998765 4599999998754321 No 128 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.70 E-value=8e-17 Score=108.83 Aligned_cols=419 Identities=9% Similarity=-0.020 Sum_probs=215.4 Q ss_pred CchHHHHHhhccccccccccccccchhhhh--------hccccccCcc---c----------ccHHHHhhhHHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLD--------MAGVKLEQAT---F----------SREHILESNEYIFSIVTR 59 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~---~----------~~~~~~~~~~~v~~~i~~ 59 (435) |.-..+....+........... ....++ +.+..+...+ . -..+.+..++.+..+|+. T Consensus 2 ~~~~~r~~~~~a~~~~~~~~~~--~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~ 79 (553) T protein:vir:63 2 TKVTVRKLSEVTSGRPEQSASL--GGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGY 79 (553) T ss_pred cchhhhhhcccccccchhhhhh--hcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 2222222221111110000000 000000 0000000000 0 011335677999999998 Q ss_pred HHHHHhhCceeeeecc--------cccc-------cchHHHhhhccc------cccCCHHHHHHHHHHHHHhcCCcceEE Q lcl|NC_019456. 60 LSNVLASLPLHEYQNY--------KQMD-------NEPLADLLKTSP------NPNMTAFEFIARLETDRNVSGNGYAWI 118 (435) Q Consensus 60 ia~~ia~~~~~~~~~~--------~~~~-------~~~l~~~l~~~P------n~~~~~~~f~~~~~~~~~~~G~~~~~i 118 (435) +.+.+-.-.|.+.-.- .... -..+...+...| ...++.+++...++..++..|++|+.+ T Consensus 80 ~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~ 159 (553) T protein:vir:63 80 QRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATA 159 (553) T ss_pred HHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEe Confidence 8877776677764220 0000 112334444444 355688999999999999999999988 Q ss_pred eeeCCCCc--EEEEEEeCCceeE--------------EEEcCCCceEEEEEecC--C----------------eeEEEch Q lcl|NC_019456. 119 QKSLSTGE--PIALWPLDPNTVS--------------ILRNTDNNSYWYRVTSD--I----------------YNFTIPI 164 (435) Q Consensus 119 ~~~~~~g~--~~~l~~l~~~~v~--------------~~~~~~~~~~~~~~~~~--~----------------~~~~~~~ 164 (435) ++....|. +..|..|+|+++. |..|..|.++.|.+... | ....++. T Consensus 160 ~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~r~~~~~~v~a 239 (553) T protein:vir:63 160 EWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKWKFVQQSKPWGR 239 (553) T ss_pred eeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCccccccccccceeeeccccccCh Confidence 76554443 4578889988774 23345555555554321 1 1124789 Q ss_pred hheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHh--hcCCceEEEeCCcCCHHHHHHHHH------------ Q lcl|NC_019456. 165 NDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEM--EKKDKFVLQYDRSISPEKRQAMVN------------ 230 (435) Q Consensus 165 ~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~--~n~~~~~~~~~~~~~~e~~~~~~~------------ 230 (435) ++|||+..+...+...|+|.+..+...+.-.....+...... .....++++.+ ...++...... T Consensus 240 ~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~--~~~~~~~~~~~~~~~~~~~~~~~ 317 (553) T protein:vir:63 240 RQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESE--LPPEFIHSQMSGGSPNADMVGIF 317 (553) T ss_pred hHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecC--CChhhhhhhcccccccccccccc Confidence 999999887777899999999999888876655544433322 22233444433 22222111100 Q ss_pred -----HH---HH----HhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCcccCcccHHH Q lcl|NC_019456. 231 -----DF---LR----MVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFL-NDDQAKSTTNVE 297 (435) Q Consensus 231 -----~~---~~----~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~l-g~~~~~~~~~~e 297 (435) .. .. ..-+.|.+..|..|.+++.++.+-...+|.+..+...+.||+.+|||.+.| |..+..||++.- T Consensus 318 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R 397 (553) T protein:vir:63 318 GKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQ 397 (553) T ss_pred cccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHH Confidence 00 00 001256788899999999988875556799999999999999999999977 444566777642 Q ss_pred HH-HHH----------H-HHHHhHHHHH-HHHHHHHhhcc-ccccc-----------CcceeeechhhhhccCHHHHHHH Q lcl|NC_019456. 298 HV-THS----------W-TMTLMPIIRQ-YESQFNMKLFT-PGKRV-----------KGFYFSFNVNGLLRGDTAARTQY 352 (435) Q Consensus 298 ~~-~~~----------~-~~~i~P~~~~-i~~~l~~~l~~-~~~~~-----------~g~~i~fd~~~l~~~d~~~~~~~ 352 (435) +. ..+ + ...++|+.+. +++++-...++ +.... .-..++|-.-.....|+.+-++. T Consensus 398 ~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A 477 (553) T protein:vir:63 398 AGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQA 477 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHH Confidence 21 111 1 1122332222 22222222211 11000 00123455555566799999999 Q ss_pred HHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeeccc-----ccchhccccccccccccccccccccccCCCCCCCCCCC Q lcl|NC_019456. 353 YQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKD-----LYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGL 427 (435) Q Consensus 353 ~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n-----~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (435) ...++++|+.|.-|+-++.|.++- +--+++..-.. -.+++..... ....+..... +..++....+ T Consensus 478 ~~~~i~~G~~t~~~~~a~~G~D~~--~v~~q~a~e~~~~~~~Gl~~~~~~~~-~~~~~~~~~~-------~~~~~~~~~~ 547 (553) T protein:vir:63 478 AVMRIDAGLSTYEREIARLGGDFR--KSFAQRAREDALLKKYGLTFNLSAKR-SLGDGRDAAT-------GIAEDPAAAQ 547 (553) T ss_pred HHHHHHcCCCCHHHHHHHhCCCHH--HHHHHHHHHHHHHHHcCCCCCCCCcc-ccCCCcccCC-------CCCCCCCCCC Confidence 999999999999999999998753 22222110000 0011100000 0000000000 0001111111 Q ss_pred CCCCCC Q lcl|NC_019456. 428 QSTEPE 433 (435) Q Consensus 428 ~~~~~~ 433 (435) +..+.| T Consensus 548 ~~~~~e 553 (553) T protein:vir:63 548 TSQQGE 553 (553) T ss_pred cccccC Confidence 111111 No 129 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.68 E-value=4.7e-17 Score=110.11 Aligned_cols=422 Identities=12% Similarity=0.037 Sum_probs=219.1 Q ss_pred CchHHHHHhhccccccc--cccc---cccchhhhhhccccccCc----------ccccHHHHhhhHHHHHHHHHHHHHHh Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQA--NQIV---QNPIPQPLDMAGVKLEQA----------TFSREHILESNEYIFSIVTRLSNVLA 65 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~--~~~~---~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~v~~~i~~ia~~ia 65 (435) |--+..+...-+..... .... .........|.....+.. ..-..+.+..+|.+..||+.+.+.+- T Consensus 3 ~p~~~~~~~~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvV 82 (533) T protein:vir:34 3 TPTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDHIV 82 (533) T ss_pred CchhhhhhcccccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhh Confidence 21111111100000000 0000 000000000000000000 00112335678999999998888876 Q ss_pred hCceeeeec---------cccc--cc---chHHHhhhcccc------ccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCC Q lcl|NC_019456. 66 SLPLHEYQN---------YKQM--DN---EPLADLLKTSPN------PNMTAFEFIARLETDRNVSGNGYAWIQKSLSTG 125 (435) Q Consensus 66 ~~~~~~~~~---------~~~~--~~---~~l~~~l~~~Pn------~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g 125 (435) .-.|++.-. ++.. .. ..+...+...|+ ..++.+++...++..++..|++|+.+.++...| T Consensus 83 G~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~g 162 (533) T protein:vir:34 83 GSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTSSS 162 (533) T ss_pred CCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeeeccCCC Confidence 667766432 1110 11 123334444443 456899999999999999999999988765544 Q ss_pred c--EEEEEEeCCceeE--------------EEEcCCCceEEEEEecC---Ce----------eEEEchhheEEeccCCCc Q lcl|NC_019456. 126 E--PIALWPLDPNTVS--------------ILRNTDNNSYWYRVTSD---IY----------NFTIPINDVIHVKHVVPS 176 (435) Q Consensus 126 ~--~~~l~~l~~~~v~--------------~~~~~~~~~~~~~~~~~---~~----------~~~~~~~~iih~~~~~~~ 176 (435) . +..|..|+|+++. |..+..|..+-|.+... +. ...++.++|+|+..+... T Consensus 163 ~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f~~~r~ 242 (533) T protein:vir:34 163 RLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFEPVED 242 (533) T ss_pred CccceEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeeccChhHeeeeccccCC Confidence 3 4578889987664 33344555555554321 11 123678899999988778 Q ss_pred cccccCcHHHHHHHHHHHHHHHHHHHHHH--hhcCCceEEEeCCc-----------CCHHHHHHHH------HHHHHH-- Q lcl|NC_019456. 177 NSWYGVSPIDVLSSSLKFQRSVENFSQNE--MEKKDKFVLQYDRS-----------ISPEKRQAMV------NDFLRM-- 235 (435) Q Consensus 177 ~~~~G~s~l~~~~~~i~~~~~~~~~~~~~--~~n~~~~~~~~~~~-----------~~~e~~~~~~------~~~~~~-- 235 (435) +...|+|.+..+...+.-.....+..... ......++++.+.. ..++....+. ..+... T Consensus 243 gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 322 (533) T protein:vir:34 243 GQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAP 322 (533) T ss_pred CcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccCcce Confidence 89999999999888777655444333222 22223334433211 0111111111 111110 Q ss_pred -hcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCC-cccCcccHHHHH-HHH---------- Q lcl|NC_019456. 236 -VKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLND-DQAKSTTNVEHV-THS---------- 302 (435) Q Consensus 236 -~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~-~~~~~~~~~e~~-~~~---------- 302 (435) .-+.|.+..|..|.+++.++.+-...+|.+..+.....||+.+|||.+.|-+ .+..||++.-+. ..+ T Consensus 323 ~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~ 402 (533) T protein:vir:34 323 VRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKF 402 (533) T ss_pred eeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHH Confidence 0135678889999999998887666788999999999999999999997754 456777764321 111 Q ss_pred H-HHHHhHHHHH-HHHHHHHhhcc-ccc------ccC--cceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHh Q lcl|NC_019456. 303 W-TMTLMPIIRQ-YESQFNMKLFT-PGK------RVK--GFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELE 371 (435) Q Consensus 303 ~-~~~i~P~~~~-i~~~l~~~l~~-~~~------~~~--g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~ 371 (435) + ...+.|+... +++++....++ +.. ... -..+.|-.-.....|+.+-++....++++|+.|.-|+-++. T Consensus 403 ~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~ 482 (533) T protein:vir:34 403 VASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKR 482 (533) T ss_pred HHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc Confidence 1 1122333222 33333332222 100 000 01345555666678999999999999999999999999999 Q ss_pred CCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 372 GQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 372 g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) |.++- +.-+++..-. +.....+...+..+......+ ...++.+++++..+| T Consensus 483 G~D~~--ev~~q~a~e~---------~~~~~~gl~~~~~~~~~~~s~--~~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 483 GDDYQ--EIFAQQVRET---------MERRAAGLKPPAWAAAAFESG--LRQSTEEEKSDSRAA 533 (533) T ss_pred CCCHH--HHHHHHHHHH---------HHHHhcCCCCCCCCCcCccCC--CCCCCCCCcccCCCC Confidence 98863 2223221000 000111111111111111111 111222333334444 No 130 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.67 E-value=5.2e-17 Score=109.84 Aligned_cols=412 Identities=11% Similarity=0.018 Sum_probs=204.4 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccc--------cCccc----------ccHHHHhhhHHHHHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKL--------EQATF----------SREHILESNEYIFSIVTRLSN 62 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~----------~~~~~~~~~~~v~~~i~~ia~ 62 (435) |+++.+-. ........ .......++..+... +.... -..+.+..++.+..||+.+.+ T Consensus 1 m~~~~~~~--~a~~~~~~---~~~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~ 75 (495) T protein:vir:10 1 MNMTPSGY--QSLASGLL---VPVGASAYEGASGGHRWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVA 75 (495) T ss_pred CCcccccc--cccchhhh---hHHHhhhhhccccCcccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHH Confidence 88876521 11111000 000000111000000 00000 011335677999999998888 Q ss_pred HHhhCceeeeecc-ccccc---chHHHhhhccc--cccCCHHHHHHHHHHHHHhcCCcceEEeeeCC-C--CcEEEEEEe Q lcl|NC_019456. 63 VLASLPLHEYQNY-KQMDN---EPLADLLKTSP--NPNMTAFEFIARLETDRNVSGNGYAWIQKSLS-T--GEPIALWPL 133 (435) Q Consensus 63 ~ia~~~~~~~~~~-~~~~~---~~l~~~l~~~P--n~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~-~--g~~~~l~~l 133 (435) .+-.-.|+..-.. .+... ..+...+..++ ...++.+++...++..++..|++|+.+..... . ..+..|..| T Consensus 76 ~vVG~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqli 155 (495) T protein:vir:10 76 AAVGNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQII 155 (495) T ss_pred hhcCCCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEEEEe Confidence 8755566543211 11111 12233333332 34578999999999999999999988765422 2 246789999 Q ss_pred CCceeEE-----------------EEcCCCceEEEEEec--CC---------eeEEEchhheEEeccCCCccccccCcHH Q lcl|NC_019456. 134 DPNTVSI-----------------LRNTDNNSYWYRVTS--DI---------YNFTIPINDVIHVKHVVPSNSWYGVSPI 185 (435) Q Consensus 134 ~~~~v~~-----------------~~~~~~~~~~~~~~~--~~---------~~~~~~~~~iih~~~~~~~~~~~G~s~l 185 (435) +|+++.. ..+..|....|.+.. .| ....+|+++|+|+.. ...+...|+|.+ T Consensus 156 epd~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~-~r~gQ~RGis~l 234 (495) T protein:vir:10 156 EPDMLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAEHVLHVTV-LTVRSDAGAPWF 234 (495) T ss_pred chhhcCCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechhheEeccc-cCCCcccCcchh Confidence 9988742 122334444444322 11 235699999999964 456788999977 Q ss_pred HHHHHHHHHHHHHHHHHHH--HhhcCCceEEEeCCcCCHHHHHHH----HHH-H--HHHhcCCCccccccCCceeeeccC Q lcl|NC_019456. 186 DVLSSSLKFQRSVENFSQN--EMEKKDKFVLQYDRSISPEKRQAM----VND-F--LRMVKENGGAVVQEAGWKVDRYES 256 (435) Q Consensus 186 ~~~~~~i~~~~~~~~~~~~--~~~n~~~~~~~~~~~~~~e~~~~~----~~~-~--~~~~~~~~~~~vl~~g~~~~~~~~ 256 (435) ..+.+ +.-.....+.... ....-..++++.+.. ++..... .+. - ....-+.|.+..|..|.+++.++. T Consensus 235 a~i~~-l~~l~~y~dael~~a~i~A~~~~fi~~~~~--~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p 311 (495) T protein:vir:10 235 QLLLR-LNELDQYEDAELVRKKTAALFAAFIQEATA--DSTGGPTIGQPKRSKGGKRITGLNPGTLQYLQPGQEVKFSNP 311 (495) T ss_pred HHHHH-HHHhhHHHHHHHHHHHHhhhheeeeecCCC--ccccccccCccccccCcccceecCCceeeecCCCCeeeeeCC Confidence 65443 3322222222111 111222334432211 1110000 000 0 000013567888999999999887 Q ss_pred ChhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCcccCcccHHHH-----HHHHH--HH-HHhH-HHHH-----HHHHHHHh Q lcl|NC_019456. 257 KFEPADLSSVEQISRIRIATAFNVPISFL-NDDQAKSTTNVEH-----VTHSW--TM-TLMP-IIRQ-----YESQFNMK 321 (435) Q Consensus 257 ~~~~~~~~e~~~~~~~~Ia~~fgvP~~~l-g~~~~~~~~~~e~-----~~~~~--~~-~i~P-~~~~-----i~~~l~~~ 321 (435) +.....|.+..+.....||+.+|||.+.| |..+..||++.-+ ...+- +. -+.| +|.. ++.++... T Consensus 312 ~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G 391 (495) T protein:vir:10 312 ADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASG 391 (495) T ss_pred CCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC Confidence 75556789999999999999999999977 5555667776432 11121 11 1222 2322 22232222 Q ss_pred hcc-cccc-cCc--ceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccc Q lcl|NC_019456. 322 LFT-PGKR-VKG--FYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYY 397 (435) Q Consensus 322 l~~-~~~~-~~g--~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~ 397 (435) .+. +... ... ..++|-.-.....|+.+-++....++++|+.|+-|+-++.|.++- +--+++--- ..-+...+ T Consensus 392 ~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~--~v~~q~a~e--~~~~~~~G 467 (495) T protein:vir:10 392 AVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGYDME--ELFDMISDA--NQLIDEYD 467 (495) T ss_pred CCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHH--HHHHHHHHH--HHHHHHcC Confidence 221 1100 000 124455556667899999999999999999999999989998863 222221100 00001110 Q ss_pred cccccccccccccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 398 DAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPE 433 (435) Q Consensus 398 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (435) ..-+.++.... ..+...+...++..++| T Consensus 468 -------l~~~~~p~~~~-~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 468 -------LRLDSDPRYVN-GSGAEQKSVMEAALNNE 495 (495) T ss_pred -------CCCCCCCCcCC-CccCCCCCCCCCCCCCC Confidence 00000000000 00000000011111111 No 131 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=99.67 E-value=1.9e-15 Score=101.33 Aligned_cols=408 Identities=12% Similarity=0.076 Sum_probs=206.7 Q ss_pred CchHH---------HHHhhcccccc--ccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCce Q lcl|NC_019456. 1 MSFMS---------KVRQFFGVHDQ--ANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPL 69 (435) Q Consensus 1 Mg~~~---------~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~ 69 (435) |.=-+ |+...+....+ ...... +....+. ........+..++.+.|.+|++.+...|.+++| T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~~~~~~~~~~~~----~~~~~Lr---~~~~~~ly~~m~~D~hi~s~l~~Rk~av~~~~w 73 (488) T protein:vir:95 1 MADITETQESLPPFRMGEVGSLGLKVKNGRIYE----EPRQALR---FPESIKTFQLMMRDPAVAASVNIIKMFVRKVNW 73 (488) T ss_pred CCCccccCCCCCHHHHHHHHHHhhccccchhhc----cchhhhc---ccchHHHHHHHhhChHHHHHHHHHHHHHhcCCc Confidence 22111 11111100000 000000 0000000 001111233445678999999999999999999 Q ss_pred eeeeccccccc---chHHHhhhccc-cccCCHHHHHHHHHHHHHhcCCcceEEeeeCC------------CCc--EEEEE Q lcl|NC_019456. 70 HEYQNYKQMDN---EPLADLLKTSP-NPNMTAFEFIARLETDRNVSGNGYAWIQKSLS------------TGE--PIALW 131 (435) Q Consensus 70 ~~~~~~~~~~~---~~l~~~l~~~P-n~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~------------~g~--~~~l~ 131 (435) .|...+....+ ......+...- +-..++.+++..+. +.+.+|.+++++++... +|. +..+. T Consensus 74 ~v~p~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~i~ 152 (488) T protein:vir:95 74 RFVPPKGKEQDPKMLERADFFNSLMDDMEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAKLP 152 (488) T ss_pred eEecCCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHHH-HhhcccceeeeeeeeccccccccccccccCCeeeeeeee Confidence 99753322111 11122222111 11234667776665 67899999999998643 222 34455 Q ss_pred EeCCcee-EEEEcCCCceEE-EEEe--------------cCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHH Q lcl|NC_019456. 132 PLDPNTV-SILRNTDNNSYW-YRVT--------------SDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQ 195 (435) Q Consensus 132 ~l~~~~v-~~~~~~~~~~~~-~~~~--------------~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~ 195 (435) +.++.+. .+..+.++.... .... .......+|+...++.++....+.++|.+.+..++-..... T Consensus 153 ~Rpq~~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~~~g~p~g~gLlr~~~w~~~fK 232 (488) T protein:vir:95 153 IRNQSTLDKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDDEYGNPEGRSPLLNAYVPWKYK 232 (488) T ss_pred ecCcccccceeeccCCCceeecccccccccccccccccccccccccccccceEEEeecCCCCccchhhHHHHHHHHHHHH Confidence 5555321 122233332211 1100 01123457777777777666677889999999998888777 Q ss_pred HHHHHHHHHHhhc-C-CceEEEeC----CcCCHHHHHHHHHHHHHHhcC----CCccccccCCceee---------eccC Q lcl|NC_019456. 196 RSVENFSQNEMEK-K-DKFVLQYD----RSISPEKRQAMVNDFLRMVKE----NGGAVVQEAGWKVD---------RYES 256 (435) Q Consensus 196 ~~~~~~~~~~~~n-~-~~~~~~~~----~~~~~e~~~~~~~~~~~~~~~----~~~~~vl~~g~~~~---------~~~~ 256 (435) ....++-..|... + +..+...+ ...++++.+.+.+.......+ ....++++.|+++. .++. T Consensus 233 ~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~~~k~~~~e~~l~~~ 312 (488) T protein:vir:95 233 VQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDPDTKEDIFEFSLVSR 312 (488) T ss_pred HHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhheeeccccccccchhhhhhhcccc Confidence 7777777767654 3 22333332 234445555566665544433 22345666655432 2333 Q ss_pred C-hhhHHHHHHHHHHHHHHHHHhCCCHHHhCCc------ccCcccHHHHHHHHHHHHHhHHHHHHHHHHHHhhccccccc Q lcl|NC_019456. 257 K-FEPADLSSVEQISRIRIATAFNVPISFLNDD------QAKSTTNVEHVTHSWTMTLMPIIRQYESQFNMKLFTPGKRV 329 (435) Q Consensus 257 ~-~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~------~~~~~~~~e~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~ 329 (435) . .....+.++.++..++|+.+. ||.. ..++++..+-+.....+.+.-.++.+++.||++|+.+.-.. T Consensus 313 ~~~~~~~~~~li~~~d~~Isk~i------LGqtLT~~~~~~Gs~Al~~vh~ev~~~i~~aDa~~i~~tln~~li~~l~~~ 386 (488) T protein:vir:95 313 QGAKAYDTGSIIDRYSKQIMMAF------MSDVLAMGQSKYGSFSLADSKTSLLAMSVDILLKQIKNVINRDLVAQTYAL 386 (488) T ss_pred ccCCchhHHHHHHHHHHHHHHHH------hccccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 2 222346777777778888775 4442 12333344556666777888889999999998877654211 Q ss_pred ----CcceeeechhhhhccCHHHHHHHHHHHHhcCCcCH-----HHHHHHhCCCCCCCcCCceeeecccccchhcccccc Q lcl|NC_019456. 330 ----KGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKP-----NEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAI 400 (435) Q Consensus 330 ----~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~-----NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~ 400 (435) ...+.+|-++.....|.++.++++.++++.|+.-+ +.+|+.+|+|+-+ .++....+. +........ T Consensus 387 Nfg~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~--~~e~~~~~~---~~~~~~~~~ 461 (488) T protein:vir:95 387 NMWDDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPAD--ESQPVSEKL---SPNSQSRSG 461 (488) T ss_pred cCCCCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCC--CCccccccC---CCCCCCCCC Confidence 12223344445567888999999999999998765 5699999999642 233332221 111000000 Q ss_pred ccccccccccccccccCCCCCCCCCCC Q lcl|NC_019456. 401 LDNKIQTDASVAAPKQEGGENTNENGL 427 (435) Q Consensus 401 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (435) ......+......+..+.+...+...+ T Consensus 462 ~~~~~~~~~~~~~~~~~~~~~a~~~~~ 488 (488) T protein:vir:95 462 DGYKTAGEGTAKTPSAKDPSTANKANK 488 (488) T ss_pred cccCCCcccCCcccccccchhhhhccC Confidence 000001111111111111111111111 No 132 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=99.62 E-value=6.2e-16 Score=103.96 Aligned_cols=367 Identities=11% Similarity=0.046 Sum_probs=184.9 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) |+|.+-+.+. |.++. ..+.-.|.+..-....-...|..++.+..+|+.+++.+-.--..+..+.+.... T Consensus 23 d~l~~~~~gl-g~~r~----------~~~~~~g~~~~~~~~~l~~~Yr~~~ia~~iVd~~~d~~~~~~~~i~~g~~~~~~ 91 (449) T protein:vir:10 23 MGLMVPTMGL-DNKRH----------SAWCEYGFPELVTYENLYSLYRRGGIAHGAVEKLVGKCWQTNPEIIEGDDADDS 91 (449) T ss_pred HHHHHHHhcC-Ccccc----------hhhhhcCCcccCCHHHHHHHHhcCchhHHHHHhhhhhhhhcCcccccCccccch Confidence 4443322111 11111 111111222111111113346677888899999998663221122222111110 Q ss_pred ------chHHHhhhccccccCCHHHHHHH---HHHHHHhcCCcceEEee-eC--------CCCcEEEEEEeCCceeEEEE Q lcl|NC_019456. 81 ------EPLADLLKTSPNPNMTAFEFIAR---LETDRNVSGNGYAWIQK-SL--------STGEPIALWPLDPNTVSILR 142 (435) Q Consensus 81 ------~~l~~~l~~~Pn~~~~~~~f~~~---~~~~~~~~G~~~~~i~~-~~--------~~g~~~~l~~l~~~~v~~~~ 142 (435) ......++ ...+|.. ....-.++|-+++++.- ++ ..+.+..|.|+....+++.. T Consensus 92 ~~~~~~e~~~~~l~--------~~~~~~~l~ea~~~~rl~Gga~i~i~v~d~~~l~~Pl~~~~~i~~i~v~~~~~i~~~~ 163 (449) T protein:vir:10 92 EDETSWEKKSKQVF--------TNRLWRSFAEADRRRLVGRYAGILLHIRDEKDWNLPATKGRGLQKVSVSWAGSLKVAE 163 (449) T ss_pred hhhHHHHHHHHHHH--------HHHHHHHHHHHHHhhhccCcEEEEEEecCCCCCCcccccCcceeeEEeeccccCChhh Confidence 01111111 1123332 23334567877776643 22 12356777777765555332 Q ss_pred -------cCCCceEEEEEecC-----CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHH-HHHHHHhhcC Q lcl|NC_019456. 143 -------NTDNNSYWYRVTSD-----IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVE-NFSQNEMEKK 209 (435) Q Consensus 143 -------~~~~~~~~~~~~~~-----~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~-~~~~~~~~n~ 209 (435) ...|.+.+|.+... ...+.|.++.|+||-.. ...|.|.++.+++.+.....+. .+...++++. T Consensus 164 ~~~dp~sp~yg~P~~y~v~~~~~g~~~~~~~iH~SRl~~~~~~----~~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~ 239 (449) T protein:vir:10 164 WDTGINSKTYGQPKLWKYTERLPNGSSRRVDIHPDRVFILGDY----SEDAIGFLEPAYNAFVSLEKVEGGSGESFLKNA 239 (449) T ss_pred hhcCCCCCCCCCceEEEEeeeccCCCccceeeccceeEeecCC----CCCChhHHHHHHHHhhhHHHhhhhHHHHHHHHH Confidence 22356666665431 23356889999998532 2347888888887654333221 2222222221 Q ss_pred C-------------ceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHH Q lcl|NC_019456. 210 D-------------KFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIAT 276 (435) Q Consensus 210 ~-------------~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~ 276 (435) . ..+....+.-.++..+++.+......++.+ .+.++.+.+|+.++.++.+ +.+......++||+ T Consensus 240 ~rq~~~~~~~~~~~~~l~~~~~~~~e~~~~~~~~~~~~~~~~~~-~~~i~~~~d~~~~~~~~sg--l~d~l~~~~q~iaa 316 (449) T protein:vir:10 240 ARQLNVNFEKEIDFTNLASLYGVSIDELQDKFNEVAGEINRGND-VLMTTQGATVTPLVTSVAD--PTATYNVNLQTAAA 316 (449) T ss_pred HHHHhhhhhhhhhhhhhhHHhhCCchHHHHHHHHHHHHHhccch-heeecCCcceEEEecccCC--hhHHHHHHHHHHHH Confidence 1 111111111122223344444333333333 4456677789998888764 45677777888999 Q ss_pred HhCCCHHHhCCcccCcccHHHHHHHHH------HHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHH Q lcl|NC_019456. 277 AFNVPISFLNDDQAKSTTNVEHVTHSW------TMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAART 350 (435) Q Consensus 277 ~fgvP~~~lg~~~~~~~~~~e~~~~~~------~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~ 350 (435) +.|||...|-+.+.++.+..+....|| +..+.|.++.+.+.|-+.-+... .. .+.|.+++|...+.++++ T Consensus 317 a~~IP~t~L~Gqsp~glnst~D~~nyyd~i~~~Q~~l~p~le~l~~~l~~s~~g~~--~~--d~~i~f~pL~~~t~kEkA 392 (449) T protein:vir:10 317 GVDIPTRILIGNQQAERSSTEDQKYFNARCQSRRVDLSFEIEDFCDKLIELKIIDA--VA--KKAVIWDDLNEQTGTEKL 392 (449) T ss_pred HhCCCeeeeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC--CC--ceeEEeCCCCCCCHHHHH Confidence 999999988777776655444444444 23366777777766655443221 12 355556688888877775 Q ss_pred H-------HHHHHHhcC---CcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCC Q lcl|NC_019456. 351 Q-------YYQTLTRNG---IFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGE 420 (435) Q Consensus 351 ~-------~~~~~~~~g---~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (435) + ++++++++| +++++|+|+.+|++|.. ++.+ . .+..+ T Consensus 393 ei~k~~A~a~~~~~~ag~~~~~~~~EiR~~~~~~~~~---~~~~---------~---------------------~e~~d 439 (449) T protein:vir:10 393 TNAKTMGEINQTMLGSGDNPAFSREEIRTAAGYDNDD---EEPL---------G---------------------EEDGD 439 (449) T ss_pred HHHHHHHHHHHHHHHccccCCcCHHHHHHHhcccCCC---CCCC---------C---------------------CCCCc Confidence 4 555677676 89999999999999853 2210 0 00000 Q ss_pred CCCCCCCCCCC Q lcl|NC_019456. 421 NTNENGLQSTE 431 (435) Q Consensus 421 ~~~~~~~~~~~ 431 (435) +. +.+.++.. T Consensus 440 e~-~~~~d~~a 449 (449) T protein:vir:10 440 EE-DKATDSAA 449 (449) T ss_pred cc-cccCCcCC Confidence 00 00001111 No 133 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=99.61 E-value=1.6e-15 Score=101.64 Aligned_cols=401 Identities=9% Similarity=0.089 Sum_probs=215.2 Q ss_pred CchHHHHHhhcccccccccccc-ccchhhhhhccccccCcccc---------cHHHHhhhHHHHHHHHHHHHHHhhCcee Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQ-NPIPQPLDMAGVKLEQATFS---------REHILESNEYIFSIVTRLSNVLASLPLH 70 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 70 (435) .+| ...|........+.. ......+++.++......++ .--...++|.+++|+..||+.+.+-=+. T Consensus 67 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~ 142 (695) T protein:vir:36 67 LRL----ARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWGE 142 (695) T ss_pred ccc----ceeceecccccCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcccce Confidence 222 122221111111111 11112223223322222111 1112346788999999999988665222 Q ss_pred eeecccc-------------cc--cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC------------ Q lcl|NC_019456. 71 EYQNYKQ-------------MD--NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS------------ 123 (435) Q Consensus 71 ~~~~~~~-------------~~--~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~------------ 123 (435) +.....+ .. +-.....|..+-. ...-.+-+...+.+--++|-+.+++..++. T Consensus 143 ~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~e-rL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~ 221 (695) T protein:vir:36 143 AIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIE-RLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRP 221 (695) T ss_pred ecccchhhhhhccccccccccccCchHHHHHHHHHHH-HHHHHHHHHHHHHhhccccceEEEEEeccCcccccccccccc Confidence 2211000 00 1012223332222 222334455556666778888776654331 Q ss_pred ----CCcEEEEEEeCCceeEEEEc--------CCCceEEEEEecCCeeEEEchhheEEeccCCC------ccccccCcHH Q lcl|NC_019456. 124 ----TGEPIALWPLDPNTVSILRN--------TDNNSYWYRVTSDIYNFTIPINDVIHVKHVVP------SNSWYGVSPI 185 (435) Q Consensus 124 ----~g~~~~l~~l~~~~v~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~------~~~~~G~s~l 185 (435) .|...+|.+|+|..|++... ..+.+.+|.+.. .+|..+.++.|..... ...+.|+|.. T Consensus 222 ~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G----~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~ 297 (695) T protein:vir:36 222 YTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIG----TEVHATRLHTIVSRPVGDMLKPTYSFAGISMT 297 (695) T ss_pred ccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEec----eEEeeeeEEEecCCCchhhhhcccccCcccHH Confidence 35667799999998887542 234555666642 3577777777664332 2346799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcC-CceEEEeC--CcCCHHHHHHHH--HHHHHHhcCCCcccccc-CCceeeeccCChh Q lcl|NC_019456. 186 DVLSSSLKFQRSVENFSQNEMEKK-DKFVLQYD--RSISPEKRQAMV--NDFLRMVKENGGAVVQE-AGWKVDRYESKFE 259 (435) Q Consensus 186 ~~~~~~i~~~~~~~~~~~~~~~n~-~~~~~~~~--~~~~~e~~~~~~--~~~~~~~~~~~~~~vl~-~g~~~~~~~~~~~ 259 (435) +.+...+............+.... ..++ ..+ ..+.......+. -++.+.+++..++.+++ +..+|++.+.+.. T Consensus 298 q~~~e~V~~~~rT~~~v~~Li~~~~v~~l-k~dla~aL~~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~stslS 376 (695) T protein:vir:36 298 QLAMPYIDNWLRTRQSVSDIVKQFSVSGI-LMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTPLS 376 (695) T ss_pred HHHHHHHHHHHHHHhHHHHHHHhhhHHHH-HHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEecCCcceEEEecccC Confidence 999988887766555555444321 1111 111 112222222222 23345567777888888 5789999988775 Q ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCccc-HHH-HHHHHH-------HHHHhHHHHHHHHHHHHhhcccccccC Q lcl|NC_019456. 260 PADLSSVEQISRIRIATAFNVPISFLNDDQAKSTT-NVE-HVTHSW-------TMTLMPIIRQYESQFNMKLFTPGKRVK 330 (435) Q Consensus 260 ~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~-~~e-~~~~~~-------~~~i~P~~~~i~~~l~~~l~~~~~~~~ 330 (435) . +.++.....++||.+.+||...|-+.++.+.+ +.| ....|| ...++|.++.+-+.|-+..|... T Consensus 377 G--LddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i---- 450 (695) T protein:vir:36 377 G--LDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV---- 450 (695) T ss_pred C--HHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---- Confidence 4 45777778899999999999988887777653 434 233444 23578888888888877776542 Q ss_pred cceeeechhhhhccCHHHHHHH-------HHHHHhcCCcCHHHHHHHhCCCCCCCcC------Cceeeecccccchhccc Q lcl|NC_019456. 331 GFYFSFNVNGLLRGDTAARTQY-------YQTLTRNGIFKPNEIRELEGQAPIPDEA------ADHLYISKDLYPLDKYY 397 (435) Q Consensus 331 g~~i~fd~~~l~~~d~~~~~~~-------~~~~~~~g~~t~NE~R~~~g~~p~~~~~------gd~~~~~~n~~~l~~~~ 397 (435) ...|.|.++.|...+.++++++ ...+++.|+++++|+|.++.-+|-. .. .|++.++... T Consensus 451 dpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s-~Y~~~~D~~d~p~~~~~~------- 522 (695) T protein:vir:36 451 DPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDG-PYAGKLDANDDPGVPADD------- 522 (695) T ss_pred CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCc-ccccccccccCCCcCccc------- Confidence 2346677778888777666554 6678899999999999998765421 11 1111111110 Q ss_pred cccccccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 398 DAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 398 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) ..+ ...+..++..++++.+.+....+|+ T Consensus 523 --------~~~--~~~~~~~~~~~~~~~~~~~~~~~g~ 550 (695) T protein:vir:36 523 --------DID--GVLTYVQRLAEGGDTGAPGGARAGA 550 (695) T ss_pred --------hhh--hhHhhhcCcccccccCCCCcccccc Confidence 000 0011111222222222233333333 No 134 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=99.60 E-value=1.8e-14 Score=95.92 Aligned_cols=396 Identities=13% Similarity=0.089 Sum_probs=207.9 Q ss_pred CchHHHHHhhccccccccccccccc--------hhhh--hhcccccc--------CcccccHHHHhhhHHHHHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPI--------PQPL--DMAGVKLE--------QATFSREHILESNEYIFSIVTRLSN 62 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~--------~~~~--~~~~~~~~--------~~~~~~~~~~~~~~~v~~~i~~ia~ 62 (435) |.--++--... . +.+... .... .... .+.|..+. .......+..+..+.|.+|++.+.. T Consensus 1 m~kk~~k~~~~-~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m~~D~hi~s~l~~Rk~ 77 (448) T protein:vir:77 1 MAKRGRKPKEL-V-PGPGSI-DPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKMLSDGTVKNALNYIFG 77 (448) T ss_pred CCCCCCCCccc-C-Cccccc-chhhhhhhccchhhhcccccccccccchhHhhccccchHHHHHHhhChHHHHHHHHHHH Confidence 43321110000 0 000000 0000 0000 00111100 1111112335567889999999999 Q ss_pred HHhhCceeeeecccccccchH---HHhhhcccc---ccCCHHHHHHHHHHHHHhcCCcceEEeeeC-CCCc--EEEEEEe Q lcl|NC_019456. 63 VLASLPLHEYQNYKQMDNEPL---ADLLKTSPN---PNMTAFEFIARLETDRNVSGNGYAWIQKSL-STGE--PIALWPL 133 (435) Q Consensus 63 ~ia~~~~~~~~~~~~~~~~~l---~~~l~~~Pn---~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~-~~g~--~~~l~~l 133 (435) .|.+++|.|...+....+... ...++..+. ...++.+++..+ .+.+.+|.+++++++.. ..|. +..|.+. T Consensus 78 av~~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~~~dg~~~~~~l~~r 156 (448) T protein:vir:77 78 RIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKLILDKIVPI 156 (448) T ss_pred HHhcCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEeecCCCceeecccccc Confidence 999999999754433322222 222222222 223566777766 57899999999999863 3454 3356666 Q ss_pred CCceeE-EEEcCCCceEEEEEecC-------CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 134 DPNTVS-ILRNTDNNSYWYRVTSD-------IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNE 205 (435) Q Consensus 134 ~~~~v~-~~~~~~~~~~~~~~~~~-------~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~ 205 (435) ++..+. +..+.++.......... .....+|...++|..+. ..+.++|.|.+..++-.........+.-..| T Consensus 157 ~~~~~~~f~~~~~~~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~~-~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f 235 (448) T protein:vir:77 157 HPFNIDEVLYDEEGGPKALKLSGEVKGGSQFVNGLEIPIWKTVVFLHN-DDGSFTGQSALRAAVPHWLAKRALILLINHG 235 (448) T ss_pred CCCccceeeeecCCceEEEecCCcccccccCCCccccccceEEEEecC-CcCCcccchHHHHHHHHHHHHHhhHHHHHHH Confidence 665332 22334443333222211 12345688899998764 4567899999999988887777777776767 Q ss_pred hhc-C-CceEEEeCCcC--CHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019456. 206 MEK-K-DKFVLQYDRSI--SPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVP 281 (435) Q Consensus 206 ~~n-~-~~~~~~~~~~~--~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP 281 (435) .+. | |-.+.+++... ++++.+++.+.......+....++++.|++++-++.+....++.+..++..++|+.+..- T Consensus 236 ~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLG- 314 (448) T protein:vir:77 236 LERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGI- 314 (448) T ss_pred HHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCccCHHHHHHHHHHHHHHHHhc- Confidence 665 4 33455665443 345666666666554333444677888888887776655566778888888888887531 Q ss_pred HHHhCCcccCcccHHH-HHHHHHHHHHhHHHHHHHHHHHHhhccccccc------CcceeeechhhhhccCHHHHHHHHH Q lcl|NC_019456. 282 ISFLNDDQAKSTTNVE-HVTHSWTMTLMPIIRQYESQFNMKLFTPGKRV------KGFYFSFNVNGLLRGDTAARTQYYQ 354 (435) Q Consensus 282 ~~~lg~~~~~~~~~~e-~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~------~g~~i~fd~~~l~~~d~~~~~~~~~ 354 (435) ..+--....++++... +........+.-.++.+++.||+.|+.+.-.. .-.++.|+.. ...|.++.++.+. T Consensus 315 qtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lNfg~~~~~P~~~f~~~--e~eDl~~~a~~~~ 392 (448) T protein:vir:77 315 DFNTVQLNMGVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPGATRFPRLTFEME--ERNDFSAAANLMG 392 (448) T ss_pred cccccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCC--ChhhHHHHHHHhH Confidence 1111111112222222 22234456667788889998888876543111 1125666544 4678888899888 Q ss_pred HHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 355 TLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEG 434 (435) Q Consensus 355 ~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (435) +++ +-+|+.+|+|.- .+.+ .-+ . + . . ..+.....++..+....++++.- T Consensus 393 ~l~-------~~~~~~~~ip~~-~~~~---~~~------~----~--~--~------~~~~~~~~~~~~~~~~~~~~~~~ 441 (448) T protein:vir:77 393 MLI-------NAVKDSEDIPTE-LKAL---IDA------L----P--S--K------MRRALGVVDEVREAVRQPADSRY 441 (448) T ss_pred HHH-------HHHHHHhcCCcc-CCcC---CCC------C----c--h--h------cccccCCCCCCCchhhcchhhHH Confidence 886 458999998742 1111 000 0 0 0 0 00000011111111111111111 Q ss_pred C Q lcl|NC_019456. 435 S 435 (435) Q Consensus 435 ~ 435 (435) . T Consensus 442 ~ 442 (448) T protein:vir:77 442 L 442 (448) T ss_pred H Confidence 1 No 135 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=99.59 E-value=2.6e-15 Score=100.51 Aligned_cols=407 Identities=9% Similarity=0.080 Sum_probs=216.6 Q ss_pred CchHHHHHhhcccccccccccc-ccchhhhhhccccccCcccc---------cHHHHhhhHHHHHHHHHHHHHHhhCcee Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQ-NPIPQPLDMAGVKLEQATFS---------REHILESNEYIFSIVTRLSNVLASLPLH 70 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 70 (435) .+| ...|........+.. ......+++.++......++ .--...++|.+++|+..||+.+.+-=+. T Consensus 67 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~ 142 (695) T protein:vir:78 67 LRL----ARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWGE 142 (695) T ss_pred ccc----ceeceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcccce Confidence 222 222221111111111 11112223223322222111 1112346788999999999988665222 Q ss_pred eeecccc-------------cc--cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC------------ Q lcl|NC_019456. 71 EYQNYKQ-------------MD--NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS------------ 123 (435) Q Consensus 71 ~~~~~~~-------------~~--~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~------------ 123 (435) +.....+ .. +-.....|..+-.... -.+-+...+.+--++|-+.+++..++. T Consensus 143 ~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~-V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~ 221 (695) T protein:vir:78 143 AIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLR-IRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRP 221 (695) T ss_pred eccccchhhhhhcccccccccccccHHHHHHHHHHHHHHH-HHHHHHHHHHhhccccceEEEEEeccCcccccccccccc Confidence 2211000 00 1122233333322222 334445555566678888776654331 Q ss_pred ----CCcEEEEEEeCCceeEEEEc--------CCCceEEEEEecCCeeEEEchhheEEeccCCC------ccccccCcHH Q lcl|NC_019456. 124 ----TGEPIALWPLDPNTVSILRN--------TDNNSYWYRVTSDIYNFTIPINDVIHVKHVVP------SNSWYGVSPI 185 (435) Q Consensus 124 ----~g~~~~l~~l~~~~v~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~------~~~~~G~s~l 185 (435) .|...+|.+|+|..|++... ..+.+.+|.+.. .+|..+.++.|..... ...+.|+|.. T Consensus 222 ~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G----~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~ 297 (695) T protein:vir:78 222 YTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIG----TEVHATRLHTIVSRPVGDMLKPTYSFAGISMT 297 (695) T ss_pred ccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEec----eEEeeeeEEEecCCCchhhhhcccccCcccHH Confidence 35667799999998887542 234555666642 3577777777664332 2346799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhc-CCceEEEeC--CcCCHHHHHHHH--HHHHHHhcCCCcccccc-CCceeeeccCChh Q lcl|NC_019456. 186 DVLSSSLKFQRSVENFSQNEMEK-KDKFVLQYD--RSISPEKRQAMV--NDFLRMVKENGGAVVQE-AGWKVDRYESKFE 259 (435) Q Consensus 186 ~~~~~~i~~~~~~~~~~~~~~~n-~~~~~~~~~--~~~~~e~~~~~~--~~~~~~~~~~~~~~vl~-~g~~~~~~~~~~~ 259 (435) +.+...+............+... ...++ ..+ ..+.......+. -++.+.+++..++.+++ +..+|++.+.+.. T Consensus 298 q~~~e~V~~~~rT~~~v~~Li~~~~v~~l-k~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~stslS 376 (695) T protein:vir:78 298 QLAMPYIDNWLRTRQSVSDIVKQFSVSGI-LMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTPLS 376 (695) T ss_pred HHHHHHHHHHHHHHhHHHHHHHhhhhHHH-HHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEecCCcceEEEecccC Confidence 99998888877655555544432 22221 111 122222222232 23345567777888888 5789999988775 Q ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCccc-HHH-HHHHHH-------HHHHhHHHHHHHHHHHHhhcccccccC Q lcl|NC_019456. 260 PADLSSVEQISRIRIATAFNVPISFLNDDQAKSTT-NVE-HVTHSW-------TMTLMPIIRQYESQFNMKLFTPGKRVK 330 (435) Q Consensus 260 ~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~-~~e-~~~~~~-------~~~i~P~~~~i~~~l~~~l~~~~~~~~ 330 (435) . +.++.....++||.+.+||...|-+.++.+.+ +.| ....|| ...++|.++.+-+.|-+..|... T Consensus 377 G--LddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i---- 450 (695) T protein:vir:78 377 G--LDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV---- 450 (695) T ss_pred C--HHHHHHHHHHHHHhhhcCchhhhhccCCccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---- Confidence 4 45777778899999999999988887777653 434 233444 23578888888888877776542 Q ss_pred cceeeechhhhhccCHHHHHHH-------HHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccc Q lcl|NC_019456. 331 GFYFSFNVNGLLRGDTAARTQY-------YQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDN 403 (435) Q Consensus 331 g~~i~fd~~~l~~~d~~~~~~~-------~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~ 403 (435) ...|.|.++.|...+.++++++ ...+++.|+++++|+|.++.-+|-. ..+.. +..|-.|.- .. T Consensus 451 dpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s-~Y~~~--~D~~d~p~~-------~~ 520 (695) T protein:vir:78 451 DPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDG-PYAGK--LDANDDPGV-------PA 520 (695) T ss_pred CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCc-ccccc--cccccCCCc-------Cc Confidence 2346777778888777666554 6678899999999999998766421 11100 001111100 00 Q ss_pred cccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 404 KIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 404 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) +...+ ...+..++..++++.+.+.....|+ T Consensus 521 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~g~ 550 (695) T protein:vir:78 521 DDDID--GVLTYVQRLAEGGDTGAPGGARAGA 550 (695) T ss_pred cchhh--hhHhhhcCcccccccCCCCCCCCCC Confidence 00000 0011111222222223333333343 No 136 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=99.59 E-value=2.7e-15 Score=100.43 Aligned_cols=405 Identities=10% Similarity=0.078 Sum_probs=215.7 Q ss_pred CchHHHHHhhcccccccccccc-ccchhhhhhccccccCcccc---------cHHHHhhhHHHHHHHHHHHHHHhhCcee Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQ-NPIPQPLDMAGVKLEQATFS---------REHILESNEYIFSIVTRLSNVLASLPLH 70 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 70 (435) =+=--+|...|........+.. ......+++.|+......++ .--...++|.+++|+..||+.+.+-=+. T Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~ 141 (694) T protein:vir:10 62 PSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWGE 141 (694) T ss_pred CCcchhhhhhccccccCCCccccchhhhhhccCcccccchhhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcccce Confidence 0000012222221111111111 11112222223322222111 1112346788999999999988665222 Q ss_pred eeecccc-------------cc--cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC------------ Q lcl|NC_019456. 71 EYQNYKQ-------------MD--NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS------------ 123 (435) Q Consensus 71 ~~~~~~~-------------~~--~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~------------ 123 (435) +.....+ .. +-.....|..+-.... -.+-+...+.+--++|-+.+++..++. T Consensus 142 ~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~-V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~ 220 (694) T protein:vir:10 142 AIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLR-IRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRP 220 (694) T ss_pred eccccchhhhhhcccccccccccccHHHHHHHHHHHHHHH-HHHHHHHHHHhhccccceEEEEEeecCcccccccccccc Confidence 2211000 00 1122233333322222 334445555666678888776654331 Q ss_pred ----CCcEEEEEEeCCceeEEEEc--------CCCceEEEEEecCCeeEEEchhheEEeccCCC------ccccccCcHH Q lcl|NC_019456. 124 ----TGEPIALWPLDPNTVSILRN--------TDNNSYWYRVTSDIYNFTIPINDVIHVKHVVP------SNSWYGVSPI 185 (435) Q Consensus 124 ----~g~~~~l~~l~~~~v~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~------~~~~~G~s~l 185 (435) .|...+|.+|+|..|++... ..+.+.+|.+.. .+|..+.++.|..... ...+.|+|.. T Consensus 221 ~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G----~~IH~SRL~~f~g~plPd~LKp~y~~~G~Sv~ 296 (694) T protein:vir:10 221 YTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIG----TEVHATRLHTIVSRPVGDMLKPTYSFAGISMT 296 (694) T ss_pred ccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEec----eEEeeeeEEEecCCCchhhhhcccccCcccHH Confidence 35667799999998887542 234555666642 3577777777664332 2346799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhc-CCceEEEeC--CcCCHHHHHHHH--HHHHHHhcCCCcccccc-CCceeeeccCChh Q lcl|NC_019456. 186 DVLSSSLKFQRSVENFSQNEMEK-KDKFVLQYD--RSISPEKRQAMV--NDFLRMVKENGGAVVQE-AGWKVDRYESKFE 259 (435) Q Consensus 186 ~~~~~~i~~~~~~~~~~~~~~~n-~~~~~~~~~--~~~~~e~~~~~~--~~~~~~~~~~~~~~vl~-~g~~~~~~~~~~~ 259 (435) +.+...+............+... ...++ ..+ ..+.......+. -++.+.+++..++.+++ +..+|++.+.+.. T Consensus 297 q~~~e~V~~~~rT~~~v~~Li~~~~v~~l-k~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~stslS 375 (694) T protein:vir:10 297 QLAMPYIDNWLRTRQSVSDIVKQFSVSGI-LMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTPLS 375 (694) T ss_pred HHHHHHHHHHHHHHhHHHHHHHhhhhHHH-HHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEecCCcceEEEecccC Confidence 99998888876655555544432 22221 111 122222222222 23345567777888888 5789999988775 Q ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCccc-HHH-HHHHHH-------HHHHhHHHHHHHHHHHHhhcccccccC Q lcl|NC_019456. 260 PADLSSVEQISRIRIATAFNVPISFLNDDQAKSTT-NVE-HVTHSW-------TMTLMPIIRQYESQFNMKLFTPGKRVK 330 (435) Q Consensus 260 ~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~-~~e-~~~~~~-------~~~i~P~~~~i~~~l~~~l~~~~~~~~ 330 (435) . +.++.....++||.+.+||...|-+.++.+.+ +.| ....|| ...++|.++.+-+.|-+..|... T Consensus 376 G--LddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i---- 449 (694) T protein:vir:10 376 G--LDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV---- 449 (694) T ss_pred C--HHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---- Confidence 4 45777778899999999999988887777653 434 233444 23578888888888877776542 Q ss_pred cceeeechhhhhccCHHHHHHH-------HHHHHhcCCcCHHHHHHHhCCCCCCCcC------Cceeeecccccchhccc Q lcl|NC_019456. 331 GFYFSFNVNGLLRGDTAARTQY-------YQTLTRNGIFKPNEIRELEGQAPIPDEA------ADHLYISKDLYPLDKYY 397 (435) Q Consensus 331 g~~i~fd~~~l~~~d~~~~~~~-------~~~~~~~g~~t~NE~R~~~g~~p~~~~~------gd~~~~~~n~~~l~~~~ 397 (435) ...|.|.++.|...+.++++++ ...+++.|+++++|+|.++.-+|-. .. .|.+.++... T Consensus 450 dp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s-~Y~~~~D~~d~p~~~~~~------- 521 (694) T protein:vir:10 450 DPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDG-PYAGKLDANDDPGVPADD------- 521 (694) T ss_pred CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCc-ccccccccccCCCcCccc------- Confidence 2346677778888777666554 6678899999999999998765421 11 1111111110 Q ss_pred cccccccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 398 DAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 398 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) ..+ ...+..++..++++.+.+....+|+ T Consensus 522 --------~~~--~~~~~~~~~~~~~~~~~~~~~~~g~ 549 (694) T protein:vir:10 522 --------DID--GVLTYVQRLAEGGDTGAPGGARAGA 549 (694) T ss_pred --------hhh--hhHhhhcCcccccccCCCCcccccc Confidence 000 0011111222222222233333333 No 137 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=99.58 E-value=3.8e-14 Score=94.18 Aligned_cols=400 Identities=12% Similarity=0.073 Sum_probs=208.4 Q ss_pred CchHHHHHhhcccccccccccccc-ch------hhhhhccccccC--------cccccHHHHhhhHHHHHHHHHHHHHHh Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNP-IP------QPLDMAGVKLEQ--------ATFSREHILESNEYIFSIVTRLSNVLA 65 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~-~~------~~~~~~~~~~~~--------~~~~~~~~~~~~~~v~~~i~~ia~~ia 65 (435) |.-..+.-....+..........+ .. .-..+.|..+.. ......+..++.+.|.+|++.+...|. T Consensus 1 m~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m~~D~hi~s~l~~Rk~av~ 80 (448) T protein:vir:79 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKMLSDGTVKNALNYIFGRIR 80 (448) T ss_pred CCCCCCCCccccCcccccccccchhhhhhhhhhcccccccccccchhHhhccccchHHHHHHhhChHHHHHHHHHHHHHh Confidence 433211100000000000000000 00 000011111110 011123335567889999999999999 Q ss_pred hCceeeeecccccccchH---HHhhhccccc---cCCHHHHHHHHHHHHHhcCCcceEEeeeC-CCCc--EEEEEEeCCc Q lcl|NC_019456. 66 SLPLHEYQNYKQMDNEPL---ADLLKTSPNP---NMTAFEFIARLETDRNVSGNGYAWIQKSL-STGE--PIALWPLDPN 136 (435) Q Consensus 66 ~~~~~~~~~~~~~~~~~l---~~~l~~~Pn~---~~~~~~f~~~~~~~~~~~G~~~~~i~~~~-~~g~--~~~l~~l~~~ 136 (435) +++|.|...+....+... ....+..+.. ..++.+++.. +.+.+++|.+++++++.. ..|. +..|.+.++. T Consensus 81 ~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~-~lda~~~G~s~~Eivw~~~~~g~~~~~~l~~r~~~ 159 (448) T protein:vir:79 81 SAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAI-YENAYIYGMAAGEIVLTLGADGKLILDKIVPIHPF 159 (448) T ss_pred cCCceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHH-HHHhhhhcceeEEEEeeecCCCceecccccccCCc Confidence 999999754333222222 2222333332 2244454444 455789999999999853 3454 3356666665 Q ss_pred eeE-EEEcCCCceEEEEEecC-------CeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019456. 137 TVS-ILRNTDNNSYWYRVTSD-------IYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEK 208 (435) Q Consensus 137 ~v~-~~~~~~~~~~~~~~~~~-------~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n 208 (435) .+. +..+.++.......... .....+|...++|..+. ..+.++|.+.+..++-.........+.-..|.+. T Consensus 160 ~~~~f~~~~d~~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~~-~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~ 238 (448) T protein:vir:79 160 NIDEVLYDEEGGPKALKLSGEVKGGSQFVSGLEIPIWKTVVFLHN-DDGSFTGQSALRAAVPHWLAKRALILLINHGLER 238 (448) T ss_pred cccceeeecCCceEEeecCCcccccccCCCccccccceEEEEecC-ccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 332 22333343333322211 12345688889998764 4567899999999998888777777777777765 Q ss_pred -C-CceEEEeCCcCC--HHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_019456. 209 -K-DKFVLQYDRSIS--PEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISF 284 (435) Q Consensus 209 -~-~~~~~~~~~~~~--~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~ 284 (435) | |-.+.+++...+ +++.+++.+.......+....+|++.|++++-++......++.++.++..++|+.+. T Consensus 239 yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~~Isk~i------ 312 (448) T protein:vir:79 239 FMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARAL------ 312 (448) T ss_pred cCCceEEEecCCCCCcCHHHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCcccHHHHHHHHHHHHHHHH------ Confidence 4 334566654433 455666666655543334446778899888887766555667788888888888775 Q ss_pred hCCccc-----CcccHHH-HHHHHHHHHHhHHHHHHHHHHHHhhccccccc------CcceeeechhhhhccCHHHHHHH Q lcl|NC_019456. 285 LNDDQA-----KSTTNVE-HVTHSWTMTLMPIIRQYESQFNMKLFTPGKRV------KGFYFSFNVNGLLRGDTAARTQY 352 (435) Q Consensus 285 lg~~~~-----~~~~~~e-~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~------~g~~i~fd~~~l~~~d~~~~~~~ 352 (435) ||..-. ++++... .......+.+.-.++.++..||+.|+.+.-.. .-.+++|++. ...|.++.++. T Consensus 313 LGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~lNfg~~~~~P~~~f~~~--e~~Dl~~~a~~ 390 (448) T protein:vir:79 313 GIDFNTVQLNMGVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPSATRFPRLTFEME--ERNDFSAAANL 390 (448) T ss_pred hhhhhccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCCCcEEEecCC--ChHHHHHHHHH Confidence 343221 1222221 12233456667788888888888777543211 1125566544 57788889999 Q ss_pred HHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCC Q lcl|NC_019456. 353 YQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEG 418 (435) Q Consensus 353 ~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 418 (435) +.+++..+-...+-+|+.+|+|. +.++ +....+. ..........++++..-.-....- T Consensus 391 ~~~l~~~~~~~~~~~~~~~~~p~-~~~~-~~~~a~~------~~~~~~~~~~~~~~~~~~~~~~~~ 448 (448) T protein:vir:79 391 MGMLINAVKDSEDIPTELKALID-ALPS-KMRRALG------VVDEVREAVRQPADSRYLYTRRRR 448 (448) T ss_pred hhhhhccchhhHHHHHHhhcCCC-CCCC-ccccccC------CCCcccccccCCccccchhhcccC Confidence 99998876555555777788773 2222 1110000 000000000000000000000000 No 138 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=99.55 E-value=8.9e-15 Score=97.61 Aligned_cols=419 Identities=10% Similarity=0.081 Sum_probs=213.1 Q ss_pred CchHHHHHhhcccccccccccc-ccchhhhhhccccccCcccc---------cHHHHhhhHHHHHHHHHHHHHHhhCcee Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQ-NPIPQPLDMAGVKLEQATFS---------REHILESNEYIFSIVTRLSNVLASLPLH 70 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 70 (435) .+| ...|........+.. ......+++.++......++ .--...++|.+++|+..||+.+.+-=+. T Consensus 67 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~ 142 (698) T protein:vir:10 67 LRL----ARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWGE 142 (698) T ss_pred ccc----cccceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcccce Confidence 222 222221111111111 11112223333332222111 1112346788999999999988665222 Q ss_pred eeecccc-------------cc--cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC------------ Q lcl|NC_019456. 71 EYQNYKQ-------------MD--NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS------------ 123 (435) Q Consensus 71 ~~~~~~~-------------~~--~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~------------ 123 (435) +.....+ .. +-.....|..+-.... -.+-+...+.+--++|-+.+++..++. T Consensus 143 ~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~-V~~~l~eai~~aRlfGGa~~~i~I~gdd~~l~~PL~~~~ 221 (698) T protein:vir:10 143 AIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLR-IRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRP 221 (698) T ss_pred eccccchhhhhhcccccccccccccHHHHHHHHHHHHHHH-HHHHHHHHHHhcccccceEEEEEeecCcccccccccccc Confidence 2211000 00 1122233333322222 334444555556677887766654332 Q ss_pred ----CCcEEEEEEeCCceeEEEEc--------CCCceEEEEEecCCeeEEEchhheEEeccCCC------ccccccCcHH Q lcl|NC_019456. 124 ----TGEPIALWPLDPNTVSILRN--------TDNNSYWYRVTSDIYNFTIPINDVIHVKHVVP------SNSWYGVSPI 185 (435) Q Consensus 124 ----~g~~~~l~~l~~~~v~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~------~~~~~G~s~l 185 (435) .|...+|.+|+|..|++... ..+.+.+|++.. .+|..+.++.|..... ...+.|.|.+ T Consensus 222 ~~I~kGslKGL~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G----~~IH~SRL~~~vg~pvpd~LKp~y~f~G~Sv~ 297 (698) T protein:vir:10 222 YTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIG----SEVHATRLHTIVSRPVGDMLKPTYSFAGISMT 297 (698) T ss_pred ccccCccceeeeeecccccccchhhhccchhhccCCCceEEEec----ceecceeEEEecCCCchhhhcchhccCCccHH Confidence 34566799999998887542 234555666643 2577777776664322 2346799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhc-CCceEE-EeCCcCCHHHHHHHHH--HHHHHhcCCCcccccc-CCceeeeccCChhh Q lcl|NC_019456. 186 DVLSSSLKFQRSVENFSQNEMEK-KDKFVL-QYDRSISPEKRQAMVN--DFLRMVKENGGAVVQE-AGWKVDRYESKFEP 260 (435) Q Consensus 186 ~~~~~~i~~~~~~~~~~~~~~~n-~~~~~~-~~~~~~~~e~~~~~~~--~~~~~~~~~~~~~vl~-~g~~~~~~~~~~~~ 260 (435) +.+...+............+... ....+. .....++......+.. ++.+.+++..++.+++ ++.+|++.+.+... T Consensus 298 q~~~e~V~~~~rT~~~v~~Li~~~~~~~l~~dla~aL~~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~st~lSG 377 (698) T protein:vir:10 298 QLAMPYIDNWLRTRQSVSDIVKQFSVSGILMDLAQALTPGANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTPLSG 377 (698) T ss_pred HHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHHhcCChhhHHHHHHHHHHHHhcCccceEEEecCCcceEEEecCcCC Confidence 99999888877655554444322 111110 0111122222222333 3345667777888888 57899999887754 Q ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhCCcccCccc-HHH-HHHHHH-------HHHHhHHHHHHHHHHHHhhcccccccCc Q lcl|NC_019456. 261 ADLSSVEQISRIRIATAFNVPISFLNDDQAKSTT-NVE-HVTHSW-------TMTLMPIIRQYESQFNMKLFTPGKRVKG 331 (435) Q Consensus 261 ~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~-~~e-~~~~~~-------~~~i~P~~~~i~~~l~~~l~~~~~~~~g 331 (435) +.++.....++||.+.+||...|-+.++.+.+ +.| ....|| ...|+|.++.+-+.|-+..|... . T Consensus 378 --LddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~i----d 451 (698) T protein:vir:10 378 --LDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAV----D 451 (698) T ss_pred --HHHHHHHHHHHHHhhhcCchhhhhccCCcccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC----C Confidence 45777778899999999999988888877653 434 233444 23578888888888887776542 2 Q ss_pred ceeeechhhhhccCHHHHHHH-------HHHHHhcCCcCHHHHHHHhCCCCCCCcC------Cceeeecc-cccchhccc Q lcl|NC_019456. 332 FYFSFNVNGLLRGDTAARTQY-------YQTLTRNGIFKPNEIRELEGQAPIPDEA------ADHLYISK-DLYPLDKYY 397 (435) Q Consensus 332 ~~i~fd~~~l~~~d~~~~~~~-------~~~~~~~g~~t~NE~R~~~g~~p~~~~~------gd~~~~~~-n~~~l~~~~ 397 (435) ..|.|.++.|...|.++++++ ...++..|+++++|+|.+|.-+|-. .. -|++..|. |.+...... T Consensus 452 p~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s-~Y~~~~d~~d~p~~~~~~~~~~~~~~ 530 (698) T protein:vir:10 452 PSIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDG-PYAGKLDANDDPGAPADDDIDGVLTY 530 (698) T ss_pred CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCC-ccccccCCcccCCCCCCCcchHHHhh Confidence 346777778888877666654 5677789999999999998654311 11 11111111 111100000 Q ss_pred -cccccccccccccccccccCCC------CCCCCCCCCCCCCCCC Q lcl|NC_019456. 398 -DAILDNKIQTDASVAAPKQEGG------ENTNENGLQSTEPEGS 435 (435) Q Consensus 398 -~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~ 435 (435) .....++..++.+...-..+|. -+-.-+..++..+... T Consensus 531 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 575 (698) T protein:vir:10 531 VQRMAEGGDTGAPTAPGGARAGATAPPAAANVNANANPREAGAQD 575 (698) T ss_pred hcCCcCCCCcccccccccccCCCCCCcccccccCCCCccccCccc Confidence 0000000000000000000000 0001111111111111 No 139 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=99.50 E-value=1.4e-13 Score=90.97 Aligned_cols=309 Identities=12% Similarity=0.040 Sum_probs=170.3 Q ss_pred ceEEeeeCCCCc--EEEEEEeCCceeE-EEEcCCCceEE-EEEecCC-eeEEEchhheEEeccCCCccccccCcHHHHHH Q lcl|NC_019456. 115 YAWIQKSLSTGE--PIALWPLDPNTVS-ILRNTDNNSYW-YRVTSDI-YNFTIPINDVIHVKHVVPSNSWYGVSPIDVLS 189 (435) Q Consensus 115 ~~~i~~~~~~g~--~~~l~~l~~~~v~-~~~~~~~~~~~-~~~~~~~-~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~ 189 (435) +.++++....|. |..|.+.++.++. ...+.++.... .+....| ....+|+...|++++....+.++|.+.+..++ T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~~g~~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~ 80 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPDGGLVAIEQWGVFGKATVRIPVDRLVVFVNEREGANWLGQSLLRQAY 80 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccCCceeEEEecCCCCCCcceeccCCEEEEEeCCCCCCccchhhHHHHH Confidence 788998777664 5578888887554 33444444333 3333333 44678888888877766677789999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcC--CceEEEeCCc--CCHHH-------HHHHHHHHHHHhc----CCCccccccCCceeeec Q lcl|NC_019456. 190 SSLKFQRSVENFSQNEMEKK--DKFVLQYDRS--ISPEK-------RQAMVNDFLRMVK----ENGGAVVQEAGWKVDRY 254 (435) Q Consensus 190 ~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~--~~~e~-------~~~~~~~~~~~~~----~~~~~~vl~~g~~~~~~ 254 (435) -.........++-..|.+.- +..+...+.. ..+++ .+..++.+..... +....+|++.|++++-+ T Consensus 81 w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a~~iip~g~~ie~~ 160 (355) T protein:vir:78 81 KNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAAGGYIPHGANFTLT 160 (355) T ss_pred HHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcceeEeecCCceEEEe Confidence 98888777777777777643 3334443322 11111 1222233322222 22357789999998888 Q ss_pred cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhC-Cccc-CcccHHHHHHHHHHHHHhHHHHHHHHHHHHhhccccccc--- Q lcl|NC_019456. 255 ESKFEPADLSSVEQISRIRIATAFNVPISFLN-DDQA-KSTTNVEHVTHSWTMTLMPIIRQYESQFNMKLFTPGKRV--- 329 (435) Q Consensus 255 ~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg-~~~~-~~~~~~e~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~--- 329 (435) +.+....++.++.++..++|+.+..-. .+.. .... ++++-.+.......+.+.-.++.+++.||+.|+.+.-.. T Consensus 161 ea~g~~~~~~~~i~~~d~~Isk~iLGq-tlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~ 239 (355) T protein:vir:78 161 GVQGKLPEMDGPIRYHDEQIARAVLAH-FLTLGGDKSTGSYALGDTFASFFTGSLNAVMKHIADVTQQHVVEDLVDQNWG 239 (355) T ss_pred ecCCCcccHHHHHHHHHHHHHHHHhhh-hhccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 776666678888888889998886322 1211 1111 333445556666777888888899998988776543111 Q ss_pred ---CcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHH-----HHHHHhCCCCCCCcCCceeeeccc-ccchhcccccc Q lcl|NC_019456. 330 ---KGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPN-----EIRELEGQAPIPDEAADHLYISKD-LYPLDKYYDAI 400 (435) Q Consensus 330 ---~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~N-----E~R~~~g~~p~~~~~gd~~~~~~n-~~~l~~~~~~~ 400 (435) .-.+++|+ ... .+.+++++.+++++..|+.-++ .+|+.+|+|. |.+ +++...+.. ..+.... T Consensus 240 ~~~~~P~~~~~--~~~-~~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~-p~~-~~~~~~~~~~~~~~~~~---- 310 (355) T protein:vir:78 240 PEEPAPRLVPA--QLG-KEQPVTAEAIRALVECGAFTADPELEKDLRARYGLPA-PAE-RDDGADAAAAKAAGRRR---- 310 (355) T ss_pred CCCCCCEEEec--CcC-hhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCC-CCC-CCcccCCcccccccccc---- Confidence 11245554 333 3556789999999999998764 4799999975 333 333222111 0110000 Q ss_pred ccccccccccccccccCCCCCCCCCCC------------CCCCCCC Q lcl|NC_019456. 401 LDNKIQTDASVAAPKQEGGENTNENGL------------QSTEPEG 434 (435) Q Consensus 401 ~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~ 434 (435) ...+++..+....+.......+.+.. .--..+| T Consensus 311 -~~~~~~~~~~~~~~a~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (355) T protein:vir:78 311 -AKRLPGQRQGAALPSRSPRADPPRRRGPLRRRPRHPAHRRCAPDG 355 (355) T ss_pred -ccccCCccccccccccCCCCCChhhhHHHHHHhhccccCCCCCCC Confidence 00000000000000000000000000 0001111 No 140 >protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741 Probab=99.36 E-value=5.1e-13 Score=87.99 Aligned_cols=425 Identities=12% Similarity=0.126 Sum_probs=213.6 Q ss_pred CchHHHHH---hhccccccc-ccccc------ccchhhhhhccccccCcccccH--HHHhhhHHHHHHHHHHHHHHhhCc Q lcl|NC_019456. 1 MSFMSKVR---QFFGVHDQA-NQIVQ------NPIPQPLDMAGVKLEQATFSRE--HILESNEYIFSIVTRLSNVLASLP 68 (435) Q Consensus 1 Mg~~~~~~---~~~~~~~~~-~~~~~------~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~v~~~i~~ia~~ia~~~ 68 (435) |.--..|+ +-+|....+ .+... ++...+-...|. .....|..+ +.|-..|.++..+.-|++++++|. T Consensus 1 ~~a~~~lr~~rrpkg~~~a~~r~L~aAs~~~~dpg~~~~~~~g~-~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~r 79 (631) T protein:vir:10 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKSTGI-SRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRCR 79 (631) T ss_pred CCcccceeeeecCCCCCccchhhhhhhhccccchhhhhhhhcCC-cccchhhHHHHHHHHhhhhHHHHhhhhhhhhceee Confidence 44332221 112211100 01100 111111111111 111111111 234455888999999999999999 Q ss_pred eeeeecc-------ccccc-----chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc---------- Q lcl|NC_019456. 69 LHEYQNY-------KQMDN-----EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE---------- 126 (435) Q Consensus 69 ~~~~~~~-------~~~~~-----~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~---------- 126 (435) +..-+-+ ..+.+ ....+....=+...+...++++.++.++-+-|++|+.++.....|- T Consensus 80 L~as~idpDtg~ptg~iee~~~~~~~v~~~~~~i~gG~lgQ~~llkrl~~~ltV~GE~wiv~l~~p~~~~~~~pd~~~r~ 159 (631) T protein:vir:10 80 LVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVRT 159 (631) T ss_pred eEeeeeccCCCCCccccccCCchhHHHHHHHHhcCCCcchHHHHHHHHHhheecccceEEEEEeccCcCCCCCccccccc Confidence 8875411 11222 2344444546778889999999999999999999988764322211 Q ss_pred EEEEEEeCCceeEEEEcCCCceEEEEEecCCeeEEEc-hhheEEec--cCCCccccccCcHHHHHHHHHHHHHHHHHH-- Q lcl|NC_019456. 127 PIALWPLDPNTVSILRNTDNNSYWYRVTSDIYNFTIP-INDVIHVK--HVVPSNSWYGVSPIDVLSSSLKFQRSVENF-- 201 (435) Q Consensus 127 ~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~-~~~iih~~--~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~-- 201 (435) ..+++++....++.....+|..+ ... .|..-+|- ..| +.|| .+++.....--||+.++...+.-....... T Consensus 160 ~~~W~~vt~~ei~~~~~g~g~~v--~lp-~g~~h~~~~~~D-~l~RiW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~ 235 (631) T protein:vir:10 160 RQEWYAVSKEEIKKSNKGSGTNI--VLP-TGEEHEFVKGTD-IIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTTKTIA 235 (631) T ss_pred ccceeeccHHHHhcccCccccee--ecC-CCCccceecCCc-eEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHH Confidence 33677777776654433433322 222 23333332 233 3333 455666677788888877766654433322 Q ss_pred ---HHHHhhcCCceE---EEeCCc---------------CCHHHHHHHHHHHH----HHhcCCC-----cccccc----- Q lcl|NC_019456. 202 ---SQNEMEKKDKFV---LQYDRS---------------ISPEKRQAMVNDFL----RMVKENG-----GAVVQE----- 246 (435) Q Consensus 202 ---~~~~~~n~~~~~---~~~~~~---------------~~~e~~~~~~~~~~----~~~~~~~-----~~~vl~----- 246 (435) ..+++.||..++ ++++.. ..+.....+.+.+. ..+.+.+ -++|+. T Consensus 236 aaakSRl~gnGvlflP~els~P~~~~~~~~~~g~~v~~~~g~pa~~~l~~~l~q~a~tai~De~S~aA~vPii~~~p~E~ 315 (631) T protein:vir:10 236 NASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQ 315 (631) T ss_pred HHHHHHHhhCceeEeccccccCCCCCCCCCcCCccCCccccchhHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHH Confidence 223445553221 222211 12234555555554 3333322 133332 Q ss_pred -CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCcccCcccHHHH-HHHHHHHHHhHHHHHHHHHHHHhhc Q lcl|NC_019456. 247 -AGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFL-NDDQAKSTTNVEH-VTHSWTMTLMPIIRQYESQFNMKLF 323 (435) Q Consensus 247 -~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~l-g~~~~~~~~~~e~-~~~~~~~~i~P~~~~i~~~l~~~l~ 323 (435) .+++.-.+.....+. .+.+++..+..||....|||+.| |.+.++|-.+.=| -...++-.|.|.+..||++|++.+| T Consensus 316 i~~i~hlkf~~ei~e~-aiktR~daI~RlA~glDi~pE~LLGlGsd~NHWsAWqI~dedVrlHI~P~l~lic~AlT~q~L 394 (631) T protein:vir:10 316 IKDVKHIRFDNEITEV-AIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQIL 394 (631) T ss_pred hcCeeEEeecCchhHH-HHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHhhHH Confidence 234444444444443 56778888899999999999955 4445665443211 1234677899999999999999977 Q ss_pred cccc-----ccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCc-------ee----eec Q lcl|NC_019456. 324 TPGK-----RVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAAD-------HL----YIS 387 (435) Q Consensus 324 ~~~~-----~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd-------~~----~~~ 387 (435) .+.. +...|.+.||.+.|... -.+.+-+..+++.|.+|-...|+.+|+.- +.+-| .. .+. T Consensus 395 rp~Le~eGvDp~kYvvW~DaS~Lt~d--Pdr~deA~qa~drGAIt~eAlrk~lGf~e--Dd~yd~~t~e~~~~~a~~av~ 470 (631) T protein:vir:10 395 RVTLAREGIDPSKYVVWYDPSQLTID--PDKSDEAKFAYENGAINGEALRKYLGLGD--DAGYDFTTREGWVMWAQDAVS 470 (631) T ss_pred HHHHHHhCCCHHHhEeeecCcccccC--CCCcHHHHHHHHcCCcCHHHHHHHhcCch--hcccCcCchHHHHHHHHHHhh Confidence 6543 33345678888887543 22445556688899999999999999873 12212 00 011 Q ss_pred c------cccch-----hcccccccc---ccccccccccccccCCCCCCC-CCCCCCCCCCCC Q lcl|NC_019456. 388 K------DLYPL-----DKYYDAILD---NKIQTDASVAAPKQEGGENTN-ENGLQSTEPEGS 435 (435) Q Consensus 388 ~------n~~~l-----~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 435 (435) + ++.|+ ....-+++. ..++.+++...+..++++..+ ++..+..+..+. T Consensus 471 ~dpaLip~lApl~~~~~~~v~~P~~~a~~~~g~ed~~~~~~~~~g~~epdt~d~~p~~~~a~~ 533 (631) T protein:vir:10 471 KDPTLIPMLAPLIAGVLKQIEFPQQQAIDSGGNEDTSDADDLDDGEQEPDTEDDDDGTQKAGL 533 (631) T ss_pred cccCcchhhHHHHHHHhhhccCCCCCCCCCCCCCccccccccccCCCCCCCCCCCCccccccc Confidence 1 12221 111111110 011111111111111111111 111111111111 No 141 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=99.29 E-value=2.8e-12 Score=83.89 Aligned_cols=422 Identities=10% Similarity=0.052 Sum_probs=200.9 Q ss_pred CchHHHHHhhccccccccc----ccc-----ccchhhhhhccccc--cCcccccHHHHhhhHHHHHHHHHHHHHHhhCce Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQ----IVQ-----NPIPQPLDMAGVKL--EQATFSREHILESNEYIFSIVTRLSNVLASLPL 69 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~----~~~-----~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~ 69 (435) |.++.--.+--..+..+.. ... ......+.-.+... .+....+ +.|-..|.++..+.-|++++++|.+ T Consensus 1 ~~~~rPk~~p~~p~~~~~arrr~LtaAsa~l~~~~~~~~kt~~~~~~~WQ~eAW-~~~d~vpELry~vgW~~~a~SR~rL 79 (646) T protein:vir:10 1 MALLKPKSAPPEPFGAEVARRIALAGATAQVDLGASSSWKTWKFGNKDWQTEGW-RLYDIIPEHHFLAGRIGDSVAQARL 79 (646) T ss_pred CcccCCCCCCCCcccccccchhhhhhccccccCCCcceeecCCCcchhhhHHHH-HHHhhhhhHhhHhhhhhhhhceeee Confidence 7765211000000000000 000 00000000011111 1111111 2244558888999999999999998 Q ss_pred eeee---cc---cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEee--eCCCCcEEEEEEeCCceeEEE Q lcl|NC_019456. 70 HEYQ---NY---KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQK--SLSTGEPIALWPLDPNTVSIL 141 (435) Q Consensus 70 ~~~~---~~---~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~--~~~~g~~~~l~~l~~~~v~~~ 141 (435) ..-+ .+ ..+.++....+....-....-..++++.+..++-+-|++|+.... ....+-=..++++....|.. T Consensus 80 ~aseiddtG~~tg~v~~~~v~~iv~~~~Gg~~gQ~qlLkr~~~~ltV~GE~wiv~~~~~~~~~~~~~~W~vvt~~Ev~~- 158 (646) T protein:vir:10 80 YVTEVDDTGEETGEVQDERIKRLAAVPLGTGSQRDDNLRLAGLDLAVGGECWIVGEGAATSPEAAEGSWFVVTGSAISR- 158 (646) T ss_pred eeeeecCCCCCcCccchHHHHHHhhhhccchhhHHHHHHHHHhheecccceEEeeccccCCCCCCccceeeecHHHhcc- Confidence 8754 11 222334444444433344445678899999999999998875311 11111112466776666532 Q ss_pred EcCCCceEEEEEec---CCeeEEEchhheEEec--cCCCccccccCcHHHHHHHHHHHHHHHHHHH-----HHHhhcCCc Q lcl|NC_019456. 142 RNTDNNSYWYRVTS---DIYNFTIPINDVIHVK--HVVPSNSWYGVSPIDVLSSSLKFQRSVENFS-----QNEMEKKDK 211 (435) Q Consensus 142 ~~~~~~~~~~~~~~---~~~~~~~~~~~iih~~--~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~-----~~~~~n~~~ 211 (435) . |..+...-.. ++.....+..++ .|| .+++.....--||+.++...+.-........ .+++.||. T Consensus 159 --t-g~~~~i~~p~~~~g~~~v~~~~~d~-lvRiW~P~Prr~~epDSpvra~l~~l~Ei~~lt~~I~aaakSRL~GnGv- 233 (646) T protein:vir:10 159 --T-GDEIAVRRPQQRGGSKLVLVDGQDI-LIRCWRPHPNDTDQADSFTRSAIVPLREIELLTKREFAELDSRLTGAGI- 233 (646) T ss_pred --C-CCeeeeecCccCCCCCcceecCCce-EEEEecCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHHHHhcCce- Confidence 1 2222222111 223334455666 335 3455566777899888877766544433322 23344443 Q ss_pred eEEEeCCcCC-------HHHHHHHHHHH----HHHhcCCC-----ccccccC-Cc------eeeeccC--ChhhHHHHHH Q lcl|NC_019456. 212 FVLQYDRSIS-------PEKRQAMVNDF----LRMVKENG-----GAVVQEA-GW------KVDRYES--KFEPADLSSV 266 (435) Q Consensus 212 ~~~~~~~~~~-------~e~~~~~~~~~----~~~~~~~~-----~~~vl~~-g~------~~~~~~~--~~~~~~~~e~ 266 (435) +.++..++ +.....+.+.| ...+.+.+ -++|+.. |. +++.+.. ...+ -.+.+ T Consensus 234 --LfvP~e~s~p~~~~~~a~~~~l~~~l~qaa~tAi~De~S~aA~vPiia~~P~E~i~~~~~ik~l~f~~eite-~aikt 310 (646) T protein:vir:10 234 --MFLPEGVDFPRGEEDPAGLAGFMAYLQRAAAASMADQSRASAMVPIMATIPNEMMEHLDKIKPLTFWSELSA-EITPM 310 (646) T ss_pred --eeeccccccCCCCCCCcchhHHHHHHHHHHHhhhcCCCCccceeeeEEeeChHHHhhhhcceeeccCchhhH-HHhhh Confidence 33332221 11223333333 33443322 1333322 11 3333333 2222 25677 Q ss_pred HHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH-HHHHHHHHhHHHHHHHHHHHHhhcccccccCc------ceeeechh Q lcl|NC_019456. 267 EQISRIRIATAFNVPISFLNDDQAKSTTNVEHV-THSWTMTLMPIIRQYESQFNMKLFTPGKRVKG------FYFSFNVN 339 (435) Q Consensus 267 ~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~-~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g------~~i~fd~~ 339 (435) ++..+..||...-|||+.|-+..++|-.+.=|. ...++ .|.|.+..|+++|++.+|.+.....| |.+.||.+ T Consensus 311 R~daI~RlA~glDIppE~LLGlgd~NHWtAWqI~de~vr-HI~P~l~~ic~AlT~~~Lrp~Le~eGi~dp~kyvvW~DaS 389 (646) T protein:vir:10 311 KDKAIARLASSAEIPGEVLTGIGDANHWTAWLISDEGIR-WIRGYLGLIADALTRGFLRRALESMGVTNPERYAFAFDTS 389 (646) T ss_pred HHHHHHHHHhccCCchhheeeccccceeeeeeeccccch-hhhhHHHHHHHHHHhhHHHHHHHHcCCCChhHeEEeecCc Confidence 888889999999999996655556654443221 23356 69999999999999998876543333 56789988 Q ss_pred hhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceee-------ecc--ccc--chhcccc--ccccc--- Q lcl|NC_019456. 340 GLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLY-------ISK--DLY--PLDKYYD--AILDN--- 403 (435) Q Consensus 340 ~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~-------~~~--n~~--~l~~~~~--~~~~~--- 403 (435) .|... -.+.+-+..+++.|.+|-...|+.+|+.--+.+.-++.. +.+ +++ |+-.... +.... T Consensus 390 ~Lt~~--pd~~deA~qa~drGAIt~eAlrk~~Gf~~dd~pt~~E~~~~~~~~~v~~~P~Lil~P~~qa~~~~P~~~~~~l 467 (646) T protein:vir:10 390 TLASK--PNRLDEAIQLHERNLIKDEEVVKAGAFSVDQMPTVQERAVQILLGLVKTQPDLILDPAIQAALGLPAVQSVGL 467 (646) T ss_pred ccccC--CCCcHHHHHHHHcCCccHHHHHHHhcccccccCChHHHHHHHHHHHhcCCccccccchhhccccCCCcCcccc Confidence 87543 224455566888999999999999998632211111110 001 111 1111100 00000 Q ss_pred ----cccccccccccccCCCCCCCCCCCCC-CCCCCC Q lcl|NC_019456. 404 ----KIQTDASVAAPKQEGGENTNENGLQS-TEPEGS 435 (435) Q Consensus 404 ----~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 435 (435) ...++.....+..+|..+++|..+.+ .+.+++ T Consensus 468 pp~~~~~~dg~~~~~e~~g~~~~~E~~~~pda~~~~a 504 (646) T protein:vir:10 468 PPTAAQRTDGDLDDDESEGAPNGGEAPDQPDADEARA 504 (646) T ss_pred CCcccccccCCCCChhhcCCCCCCccCCCCCCCcccc Confidence 00011111111122222222211111 111111 No 142 >protein:vir:99088 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655692;genbank:gi:109521770;genbank:GeneID:4157810 Probab=99.21 E-value=4.7e-12 Score=82.66 Aligned_cols=420 Identities=13% Similarity=0.087 Sum_probs=205.3 Q ss_pred CchHH-HHHhhcccccccc---------ccccccchhhhhhcccccc--CcccccHHHHhhhHHHHHHHHHHHHHHhhCc Q lcl|NC_019456. 1 MSFMS-KVRQFFGVHDQAN---------QIVQNPIPQPLDMAGVKLE--QATFSREHILESNEYIFSIVTRLSNVLASLP 68 (435) Q Consensus 1 Mg~~~-~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~ 68 (435) |.--+ |+.+.-.....++ .+...+...+-...++... +....+ +.|-..|.++..+.-|++++++|. T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~eAW-~~~d~v~Elry~vgW~~~s~Sr~r 79 (629) T protein:vir:99 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKAMGSSTRTDWQDDAW-KAYDAVGELRYYVGWRSSSASRVR 79 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhcccccchhhhhhcCCCchhhhhHHHH-HHHHhhhhHHHHhhhhhhhhceee Confidence 54322 2211111100000 0000111111111111111 111111 223347888899999999999999 Q ss_pred eeeeecc-------cccc-cc----hHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc------E-EE Q lcl|NC_019456. 69 LHEYQNY-------KQMD-NE----PLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE------P-IA 129 (435) Q Consensus 69 ~~~~~~~-------~~~~-~~----~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~------~-~~ 129 (435) +..-+-+ ..+. ++ ...+....=-...+...++++.+..++-+-|++|+.+..... |. + .+ T Consensus 80 L~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~-~~~d~~~~~~~e 158 (629) T protein:vir:99 80 LIASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDK-SRLDSNGNPVPE 158 (629) T ss_pred eEeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCC-CccCCCCcchhh Confidence 8875411 1111 11 222223322335566789999999999999999998885422 22 1 24 Q ss_pred EEEeCCceeEEEEcCCCceEEEEEecCCeeEEE-chhheEEec--cCCCccccccCcHHHHHHHHHHHHHHHHH-----H Q lcl|NC_019456. 130 LWPLDPNTVSILRNTDNNSYWYRVTSDIYNFTI-PINDVIHVK--HVVPSNSWYGVSPIDVLSSSLKFQRSVEN-----F 201 (435) Q Consensus 130 l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~-~~~~iih~~--~~~~~~~~~G~s~l~~~~~~i~~~~~~~~-----~ 201 (435) ++.+.+..++- ..++.. ...+.|...+| +..+++ || .+++.....--||+.++...+.-...... . T Consensus 159 W~~vt~~ei~~---~~~~~~--i~lP~g~~~e~~~~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaa 232 (629) T protein:vir:99 159 WLALTPEEVRA---SEKKTI--IELPTGDKHEFRDGLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANAS 232 (629) T ss_pred heeechHHhhh---ccCcee--EEcCCCCccceeCCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHH Confidence 55565554442 122222 22233333333 344444 55 34555667778888887776665443322 2 Q ss_pred HHHHhhcCCceE---EEeCC----------------cCCHHHHHHHHHHHH----HHhcCCC-----cccccc------C Q lcl|NC_019456. 202 SQNEMEKKDKFV---LQYDR----------------SISPEKRQAMVNDFL----RMVKENG-----GAVVQE------A 247 (435) Q Consensus 202 ~~~~~~n~~~~~---~~~~~----------------~~~~e~~~~~~~~~~----~~~~~~~-----~~~vl~------~ 247 (435) ..+++.||..++ ++++. -+.....+.+.+.|. ..+.+.+ -++|+. . T Consensus 233 kSRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~i~ 312 (629) T protein:vir:99 233 KSRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGELIK 312 (629) T ss_pred HHHHhhCceeEeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhc Confidence 234455654322 12211 011123455666664 3343322 133332 2 Q ss_pred CceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCcccCcccHHHH-HHHHHHHHHhHHHHHHHHHHHHhhccc Q lcl|NC_019456. 248 GWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFL-NDDQAKSTTNVEH-VTHSWTMTLMPIIRQYESQFNMKLFTP 325 (435) Q Consensus 248 g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~l-g~~~~~~~~~~e~-~~~~~~~~i~P~~~~i~~~l~~~l~~~ 325 (435) +++.-.+.....+. .+.+++..+..||....|||+.| |.+.++|-.+.=| -...++-.|.|.+..||++|++.+|.+ T Consensus 313 ~i~hlkf~~ei~e~-aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp 391 (629) T protein:vir:99 313 NVTHLKFDNQVTEV-AIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQVLRT 391 (629) T ss_pred CeeEEeecCchhHH-HHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchhHHHHHHHHHhhHHHH Confidence 34444444444443 56778888899999999999955 4445665443311 123467789999999999999997765 Q ss_pred cc-----ccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCc-------------eeeec Q lcl|NC_019456. 326 GK-----RVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAAD-------------HLYIS 387 (435) Q Consensus 326 ~~-----~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd-------------~~~~~ 387 (435) .. +...|.+.||.+.|... -.+.+-+..+++.|.+|-...|+.+|+.- +.+-| ..... T Consensus 392 ~Le~eGiDp~kYvvW~DaS~Lt~d--Pd~~deA~~a~drGAIt~eAlrk~lGf~e--D~~yd~tt~E~~~~~a~d~V~~~ 467 (629) T protein:vir:99 392 VLMREGIDPNAYVVWHDASQLTVD--PDKTDEARDAFDRGAITAEAMVKMLGLAD--DTVYDFTTPEGWAQWARDRVGQD 467 (629) T ss_pred HHHHhCCCHHHhEeeecCcccccC--CCCcHHHHHHHHcCCccHHHHHHHhcCcc--ccccCCCchHHHHHHHHHhhhhC Confidence 43 33345678888887543 22445556688899999999999999873 11221 11111 Q ss_pred cccc----chh-ccccccccccccccccccccccCCCCCCC---CCCCCCCCCCC----C Q lcl|NC_019456. 388 KDLY----PLD-KYYDAILDNKIQTDASVAAPKQEGGENTN---ENGLQSTEPEG----S 435 (435) Q Consensus 388 ~n~~----~l~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~----~ 435 (435) .++. ++- .++... ...+..+.+..+..+++++.+ ..++++++.+- + T Consensus 468 P~Li~~~a~l~~~~a~~~--~P~~~~~~pp~~e~~~~dE~sga~~~~ep~te~d~~~~~a 525 (629) T protein:vir:99 468 PNLLPTLAVLIPELADVE--FPTPTVALPPAEEQDGDEEASGASRREEPDTEDDAGTDDS 525 (629) T ss_pred cchhhhhhhhhhhhcccc--cCccCCCCCccccCCCcccccCCCcCCCCCCCCCCccccc Confidence 1111 110 011100 011111111111111111111 00112222221 1 No 143 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=99.19 E-value=7.7e-12 Score=81.53 Aligned_cols=420 Identities=12% Similarity=0.090 Sum_probs=205.7 Q ss_pred CchHH-HHHhhcccccccc---------ccccccchhhhhhcccccc--CcccccHHHHhhhHHHHHHHHHHHHHHhhCc Q lcl|NC_019456. 1 MSFMS-KVRQFFGVHDQAN---------QIVQNPIPQPLDMAGVKLE--QATFSREHILESNEYIFSIVTRLSNVLASLP 68 (435) Q Consensus 1 Mg~~~-~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~ 68 (435) |.--+ |+.+.-.....++ .+...+...+-...++... +....+ +.|-..|.++..+.-|++++++|. T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~eAW-~~~d~v~Elry~vgW~~~s~Sr~r 79 (629) T protein:vir:86 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKAMGSSTRTDWQEDAW-KAYDAVGELRYYVGWRSSSASRVR 79 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhccccccchhhhhcCCCchhhhhHHHH-HHHHhhhhHHHHhhhhhhhhceee Confidence 54322 2211111100000 0000111111111111111 111111 223347888899999999999999 Q ss_pred eeeeecc-------cccc-cc----hHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc------E-EE Q lcl|NC_019456. 69 LHEYQNY-------KQMD-NE----PLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE------P-IA 129 (435) Q Consensus 69 ~~~~~~~-------~~~~-~~----~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~------~-~~ 129 (435) +..-+-+ ..+. ++ ...+....=-...+...++++.+..++-+-|++|+.+..... |. + .+ T Consensus 80 L~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~-~~~d~~~~~~~e 158 (629) T protein:vir:86 80 LIASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDK-SRLDSNGNPVPE 158 (629) T ss_pred eEeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCC-CccCCCCcchhh Confidence 8875411 1111 11 222223322335566789999999999999999998885422 22 1 24 Q ss_pred EEEeCCceeEEEEcCCCceEEEEEecCCeeEEE-chhheEEec--cCCCccccccCcHHHHHHHHHHHHHHHHH-----H Q lcl|NC_019456. 130 LWPLDPNTVSILRNTDNNSYWYRVTSDIYNFTI-PINDVIHVK--HVVPSNSWYGVSPIDVLSSSLKFQRSVEN-----F 201 (435) Q Consensus 130 l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~-~~~~iih~~--~~~~~~~~~G~s~l~~~~~~i~~~~~~~~-----~ 201 (435) ++.+.+..++- ..++.. ...+.|...+| +..+++ || .+++.....--||+.++...+.-.....+ . T Consensus 159 W~~vt~~ei~~---~~~~~~--i~lP~g~~~e~~~~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaa 232 (629) T protein:vir:86 159 WLALTPEEVRA---SEKKTI--IELPTGDKHEFRDGLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANAS 232 (629) T ss_pred heeechHHhhh---ccCcee--eEcCCCCcceeeCCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHH Confidence 55565554442 222222 22333443344 334444 55 34555667778888887776665443322 2 Q ss_pred HHHHhhcCCceE---EEeCC----------------cCCHHHHHHHHHHHH----HHhcCCCc-----ccccc------C Q lcl|NC_019456. 202 SQNEMEKKDKFV---LQYDR----------------SISPEKRQAMVNDFL----RMVKENGG-----AVVQE------A 247 (435) Q Consensus 202 ~~~~~~n~~~~~---~~~~~----------------~~~~e~~~~~~~~~~----~~~~~~~~-----~~vl~------~ 247 (435) ..+++.||..++ ++++. -+.....+.+.+.|. ..+.+.+. ++|+. . T Consensus 233 kSRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~i~ 312 (629) T protein:vir:86 233 KSRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGELIK 312 (629) T ss_pred HHHHhhCceeeeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhc Confidence 234455654322 12211 011123455666664 33433221 33332 2 Q ss_pred CceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCcccCcccHHHH-HHHHHHHHHhHHHHHHHHHHHHhhccc Q lcl|NC_019456. 248 GWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFL-NDDQAKSTTNVEH-VTHSWTMTLMPIIRQYESQFNMKLFTP 325 (435) Q Consensus 248 g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~l-g~~~~~~~~~~e~-~~~~~~~~i~P~~~~i~~~l~~~l~~~ 325 (435) +++.-.+.....+. .+.+++..+..||....|||+.| |.+.++|-.+.=| -...++-.|.|.+..||++|++.+|.+ T Consensus 313 ~i~hlkf~~ei~e~-aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp 391 (629) T protein:vir:86 313 NVTHLKFDNQVTEV-AIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQVLRT 391 (629) T ss_pred CeeEEeecCchhHH-HHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHhhHHHH Confidence 34444444444443 56778888899999999999955 4445665443312 123467789999999999999997765 Q ss_pred cc-----ccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCc-------------eeeec Q lcl|NC_019456. 326 GK-----RVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAAD-------------HLYIS 387 (435) Q Consensus 326 ~~-----~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd-------------~~~~~ 387 (435) .. +...|.+.||.+.|... -.+.+-+..+++.|.+|-...|+.+|+.- +.+-| ..... T Consensus 392 ~Le~eGiDp~kYvvW~DaS~Lt~d--Pd~~deA~~a~drGAIt~eAlrk~lGf~e--D~~yd~tt~E~~~~~a~d~V~~~ 467 (629) T protein:vir:86 392 VLMREGIDPNAYVVWHDASQLTVD--PDKTDEARDAFDRGAITAEAMVKMLGLAD--DTVYDFTTPEGWAQWARDRVGQD 467 (629) T ss_pred HHHHhCCCHHHhEeeecCcccccC--CCCcHHHHHHHHcCCcCHHHHHHHhcCcc--ccccCCCchHHHHHHHHHhhhhC Confidence 43 33345678888887543 22445556688899999999999999873 11221 11111 Q ss_pred cccc----chh-ccccccccccccccccccccccCCCCCCC---CCCCCCCCCCCC Q lcl|NC_019456. 388 KDLY----PLD-KYYDAILDNKIQTDASVAAPKQEGGENTN---ENGLQSTEPEGS 435 (435) Q Consensus 388 ~n~~----~l~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~ 435 (435) .++. ++- .++... ...+..+.+..+..+++++.+ ..++++++.+-+ T Consensus 468 P~Li~~~a~l~~~~a~~~--~P~~~~~~pp~~e~~~~dE~sga~~~~ep~te~d~~ 521 (629) T protein:vir:86 468 PNLLPTLAVLIPELADVE--FPTPTVALPPAEEQDGDEEASGASRREEPDTEDDAG 521 (629) T ss_pred cchhhhhhhhhhhhcccc--cCccCCCCCccccCCCcccccCCCcCCCCCCCCCCc Confidence 1111 110 011100 001111111111111111110 001122222221 No 144 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.18 E-value=1.3e-10 Score=74.84 Aligned_cols=418 Identities=10% Similarity=-0.033 Sum_probs=166.8 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcc-cccH---HHHhhhHHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQAT-FSRE---HILESNEYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) +-+++.|.+.+......... ...++..-. ...... .... .....+.+..-+|+.+|..+.--.|.+- +. T Consensus 23 ~~~i~~L~~~~~~~~~r~~~----l~~YY~G~~-~i~~~~~~~p~~~~~~~~v~n~~~~iVd~~a~rl~~~Gf~~~--d~ 95 (504) T protein:vir:99 23 VDKVNGLYQQLVDRTPRNLL----RASFYDGKY-AIRQIGNLIPPEYLRTATVLGWSAKAVDTLARRCNLESFVWP--DG 95 (504) T ss_pred HHHHHHHHHHHHHHhHHHHH----HHHHHhccc-cchhccccccHHHHHHhhccCcHHHHHHHHHhhhccceeeCC--CC Confidence 44444444433222111100 001111000 000000 0011 1111223344456666654432233321 11 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEE-EEEEeCCceeEEEEcCCCceE-----E Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPI-ALWPLDPNTVSILRNTDNNSY-----W 150 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~-~l~~l~~~~v~~~~~~~~~~~-----~ 150 (435) ...+..+ +.+. +-| +.......+..+.+.+|.||+.|..+. .|.+. -+.+++|..+.+.+++..... + T Consensus 96 ~~~~~~l-~~i~-~~N---~ld~~~~~~~~~a~iyG~af~~v~~~~-d~~~~~~I~~~sP~~~~~iyD~~~~~~~~a~~~ 169 (504) T protein:vir:99 96 DYGSIGG-PDVW-DEN---FFATKANNAMVSSLIHGPAFLINTEGG-AGEPDSLIHVKSAMQATGEWNSRRNAMDSLLSI 169 (504) T ss_pred ChhhHHH-HHHH-Hhc---ChhhHHHHHHHHHHhhCceeEEEecCC-CCCceeEEEEeccceeEEEEeCCCCceeEEEEE Confidence 1111222 2222 223 234566788999999999999887654 36553 577889998887766532211 1 Q ss_pred EEEecCCee---EEEchhh------------------------eEEeccCCCccccccCcHHH----HHHHHHHHHHHHH Q lcl|NC_019456. 151 YRVTSDIYN---FTIPIND------------------------VIHVKHVVPSNSWYGVSPID----VLSSSLKFQRSVE 199 (435) Q Consensus 151 ~~~~~~~~~---~~~~~~~------------------------iih~~~~~~~~~~~G~s~l~----~~~~~i~~~~~~~ 199 (435) +....+|.. ..|.++. |++|.+....+..+|.|.+. .+.+.+.....-. T Consensus 170 ~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~gvPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~ 249 (504) T protein:vir:99 170 TSRDAEGHPTGIALYEDGVTVTADMDDDGDWHADVRTHKLGVPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGCIRM 249 (504) T ss_pred EEecCCCeEEEEEEEcCCcEEEEEEcCCceeeeccccCCCCcceEEecccccCccccCcccchhhHHHHHHHHHHHHHHH Confidence 111222221 1233333 34444332234456777542 2333332222222 Q ss_pred HHHHHHhhcCCceEEEeC-CcCCHHH---HHHHH---HHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHH Q lcl|NC_019456. 200 NFSQNEMEKKDKFVLQYD-RSISPEK---RQAMV---NDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRI 272 (435) Q Consensus 200 ~~~~~~~~n~~~~~~~~~-~~~~~e~---~~~~~---~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~ 272 (435) .....+|....+.++-.+ ....++. ...++ .++...-.+......-....++.+++...-+ .+.+..+.... T Consensus 250 ~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l~-~~~~~l~~~i~ 328 (504) T protein:vir:99 250 DGHADVYSFPQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAARARADVKQFPASSPQ-PHIEMLEQIAM 328 (504) T ss_pred HHHHHHhcchhhhhccCCccccccccccccchhhhhhhhhhcCCCccccccccCccceeeecCCCChH-HHHHHHHHHHH Confidence 223334444333333221 1111111 11111 1111111111111111234566655544322 47888888999 Q ss_pred HHHHHhCCCHHHhCCcccCcccHHHHHHH---HHHHHHhHHHHHHHHHHHHh------hccc--ccccCcceeeechhhh Q lcl|NC_019456. 273 RIATAFNVPISFLNDDQAKSTTNVEHVTH---SWTMTLMPIIRQYESQFNMK------LFTP--GKRVKGFYFSFNVNGL 341 (435) Q Consensus 273 ~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~---~~~~~i~P~~~~i~~~l~~~------l~~~--~~~~~g~~i~fd~~~l 341 (435) +|+..-++|+..+|.....+.++.+++.. -+...+.-..+.+.+.|.+- +... ........+++.+.+. T Consensus 329 ~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w~d~ 408 (504) T protein:vir:99 329 MFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTIDSKFRSP 408 (504) T ss_pred HHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceeEecCC Confidence 99999999999999766544334333221 11222222222233322221 1000 0011123345455667 Q ss_pred hccCHHHHHHHHHHHHhcCCcC--H-HHHHHHhCCCCCCCc-CCceeeecccccchhcccccccccccccccc-cccccc Q lcl|NC_019456. 342 LRGDTAARTQYYQTLTRNGIFK--P-NEIRELEGQAPIPDE-AADHLYISKDLYPLDKYYDAILDNKIQTDAS-VAAPKQ 416 (435) Q Consensus 342 ~~~d~~~~~~~~~~~~~~g~~t--~-NE~R~~~g~~p~~~~-~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~-~~~~~~ 416 (435) ...+..+.++++.++++.|... . .-+++++|+.+-.-+ +-++-........++.+.......++..... ...... T Consensus 409 ~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~ 488 (504) T protein:vir:99 409 LYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQGAGEP 488 (504) T ss_pred CccCHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCCCC Confidence 7888899999999999988632 3 335677888653110 0000000001111121111111000000000 000000 Q ss_pred CCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 417 EGGENTNENGLQSTEPEG 434 (435) Q Consensus 417 ~~~~~~~~~~~~~~~~~~ 434 (435) .+...+...+.+ ..+| T Consensus 489 a~~~~~~~~~~p--~~~~ 504 (504) T protein:vir:99 489 PANEPPAALGRP--TLVG 504 (504) T ss_pred CCCCCCccCCCc--ccCC Confidence 011111111111 1222 No 145 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=99.13 E-value=8.8e-11 Score=75.70 Aligned_cols=423 Identities=14% Similarity=0.146 Sum_probs=199.4 Q ss_pred CchHHHHHh----------hccc--cccccc--cccccchhhhh-------hccccccC-cccc--cHHHHhhhHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQ----------FFGV--HDQANQ--IVQNPIPQPLD-------MAGVKLEQ-ATFS--REHILESNEYIFSI 56 (435) Q Consensus 1 Mg~~~~~~~----------~~~~--~~~~~~--~~~~~~~~~~~-------~~~~~~~~-~~~~--~~~~~~~~~~v~~~ 56 (435) |.+|.+... .++. ++...+ .........+. +.|+.... ...+ ..+.++.+|.|..| T Consensus 4 ~~~w~~~de~~~~~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma~~~pEVd~A 83 (533) T protein:vir:58 4 LEKYKKLNEAVNFTNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGIEFNRFFLYDMYDRMDYTDPLISTV 83 (533) T ss_pred cchhhhhhHHHHHHHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhccccccHHHHHHHHHHhhccCcchhhH Confidence 555544422 1221 111111 11100000100 11111000 0001 12334568999999 Q ss_pred HHHHHHHHhhCc-----eeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEE Q lcl|NC_019456. 57 VTRLSNVLASLP-----LHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALW 131 (435) Q Consensus 57 i~~ia~~ia~~~-----~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~ 131 (435) |+.|++.+.-+. +.+.-+..+.. ......++ ..++...--..++..|++.|..|..+..+...+-+.+|. T Consensus 84 ideIvneaiv~d~~~~pV~v~l~~~e~s-~~iK~kI~----~lldf~~~~~~~fR~WYVDGriy~Hkiik~~k~GI~elr 158 (533) T protein:vir:58 84 LDIIADECTIPNENGNIVDVVTKDIELA-KAILSYLD----YVINIEKNAYPIIRNMIKYGDMFLHILEKGSDGTIEKFQ 158 (533) T ss_pred HHhhhceeeEecCCCceeEeeccccccc-HHHHHHHH----HHhcchhhhhHHHHhhhhcceeEEEeccCCcccchhhhe Confidence 999998876553 33332222221 11112222 123333334566777888999999887655556688999 Q ss_pred EeCCceeEEEEcCCCceEEEEEe-------cCCeeEEEchhheEEeccC-CCccccccCcHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 132 PLDPNTVSILRNTDNNSYWYRVT-------SDIYNFTIPINDVIHVKHV-VPSNSWYGVSPIDVLSSSLKFQRSVENFSQ 203 (435) Q Consensus 132 ~l~~~~v~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~iih~~~~-~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~ 203 (435) .|+|..++.+........||.+. ..+....++.+.|+|+... ....+.+++|-|..+.+.+.....++...- T Consensus 159 ~lDPr~i~~vr~~~t~~eyyvy~~~~~~~~s~~~~~kI~~daI~y~~SGl~d~~~~~iisyLhkAiKp~NQLkmiEDAlV 238 (533) T protein:vir:58 159 VVSPYIFSKRYNPETDTWYYVITDVYRNVVSGYFNEDIPEEDVIHFSHKIDTNFFPYGRSYLESARAIWNQLRLMEDALM 238 (533) T ss_pred ecCCeeeEEEEeeccceEEEeecccccccccCccccccchhheeeeeeccccCCCCceehhhhHHHHHHHHHHHHHHHHH Confidence 99999998887765555555555 2334567999999999865 344677888999999888887777666544 Q ss_pred HH-hhcCC-c--eEEEeCC---cCCHHHHHHHHHHHHHHh--c-CCCcc----------ccc----------cCCceeee Q lcl|NC_019456. 204 NE-MEKKD-K--FVLQYDR---SISPEKRQAMVNDFLRMV--K-ENGGA----------VVQ----------EAGWKVDR 253 (435) Q Consensus 204 ~~-~~n~~-~--~~~~~~~---~~~~e~~~~~~~~~~~~~--~-~~~~~----------~vl----------~~g~~~~~ 253 (435) .+ +...| + +.+-++. .-.++-.+.+..++++.+ . +.|.+ ..| ..|.+++. T Consensus 239 IYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~T 318 (533) T protein:vir:58 239 LYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDI 318 (533) T ss_pred HHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeee Confidence 33 23332 1 2333332 122222344444443321 1 11222 122 24678888 Q ss_pred ccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH-HHHHHHHhHHHHHHHHHHHHhhcccccccC-c Q lcl|NC_019456. 254 YESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT-HSWTMTLMPIIRQYESQFNMKLFTPGKRVK-G 331 (435) Q Consensus 254 ~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~-~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~-g 331 (435) |.-. . +.-.+-..+..+.+.++++||.+-|+.....+.+++=... .-|...|..+-..+.+.|.++|........ . T Consensus 319 LpGg-~-lgemeDV~YF~kkLy~ALnVP~sRl~~e~~fgr~~eItRDEiKF~KFI~rLR~rF~~ll~~qLilk~iit~ee 396 (533) T protein:vir:58 319 LQGS-K-VDLAEDVEYMLNRLISALKVPKAFIGYEGDVNAKNTLATQDIKFNNTIKRIQGFFVEELERMVRMNKEFADQD 396 (533) T ss_pred cCCC-C-CCcHHHHHHHHHHHHHHhCCCeeecCCCCCCccchhhhHHHHHHHHHHHHHHHHHHHHHhcccccccCcchhh Confidence 7753 3 4444555677888999999999999876655444321111 225666777777788888888766443322 2 Q ss_pred ceeeechhhhh----ccC-HHHHHHHHHHH---Hhc-----CC--cCHHHHHHH------h-C--CCCCCCcCCceeeec Q lcl|NC_019456. 332 FYFSFNVNGLL----RGD-TAARTQYYQTL---TRN-----GI--FKPNEIREL------E-G--QAPIPDEAADHLYIS 387 (435) Q Consensus 332 ~~i~fd~~~l~----~~d-~~~~~~~~~~~---~~~-----g~--~t~NE~R~~------~-g--~~p~~~~~gd~~~~~ 387 (435) |++.|..|..- ... +..+++++..+ |.. -+ || +|+... . + +-+-++++++. .| T Consensus 397 w~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~t-dei~~q~e~ie~E~~~~~~~~~~~~~e~--~~ 473 (533) T protein:vir:58 397 FRLVMNRSNSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIP-YDLKPQEEVAEAAGGGGLFDTGGFGEET--TP 473 (533) T ss_pred eeeeeeccchHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCC-hhhhHHHHHHHHhhcCCCCCCCCccccc--CC Confidence 44555444321 111 12222222211 110 01 22 232221 1 1 11212222221 12 Q ss_pred ccccchhcccccccc-ccccccccccccccC-CCC-CCCCCCCCCCCCCCC Q lcl|NC_019456. 388 KDLYPLDKYYDAILD-NKIQTDASVAAPKQE-GGE-NTNENGLQSTEPEGS 435 (435) Q Consensus 388 ~n~~~l~~~~~~~~~-~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~~~~~ 435 (435) .-.-+. ..++... ......+....+..+ |+. +-....++..+.+|+ T Consensus 474 ~~~~~~--~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g~ 522 (533) T protein:vir:58 474 ADFLGE--RGSPIESPRGRTEFDFGTEGGEELGGELNLGGAFEEFEEETGG 522 (533) T ss_pred cccCcc--ccCcccCCCChhhHhcccCCcccccccccccccchhhhhhcCC Confidence 111110 0111111 011111111111111 111 111222222233333 No 146 >protein:vir:106027 Length: 629 # NCBI annotation: gp9 # Family: family:all:2798 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654906;genbank:gi:109392362;genbank:GeneID:4157055 Probab=99.10 E-value=1.5e-10 Score=74.46 Aligned_cols=424 Identities=11% Similarity=0.082 Sum_probs=202.7 Q ss_pred CchHH-HHHhhcccccccccccc---ccchhh-h--hhcccc--ccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceee Q lcl|NC_019456. 1 MSFMS-KVRQFFGVHDQANQIVQ---NPIPQP-L--DMAGVK--LEQATFSREHILESNEYIFSIVTRLSNVLASLPLHE 71 (435) Q Consensus 1 Mg~~~-~~~~~~~~~~~~~~~~~---~~~~~~-~--~~~~~~--~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~ 71 (435) |.--+ |+.+.-.....+.++.. ...+.- + ...|.. ..+....+ +.|-..|.++..+.-+++++++|.+.. T Consensus 1 ma~~~lrv~rrpk~~p~~r~l~aasqp~~P~~~~~~~~~g~~~~~~WQ~eAW-~~~d~VgElryyvgW~~ss~Sr~rL~a 79 (629) T protein:vir:10 1 MAASTLRVSRRPKGSPARRSLTAASQPMEPGRTPSRQVAGTVVRTSWQNEAW-ECMDLVGELRYYVGWRASSCSRVELIA 79 (629) T ss_pred CCccceeEEecCCCccceeeeccccCCCCcchhhchhhhhhhhhhhhhHHHH-HHHHhhhhHHHHhhhhhhhheeeeEEE Confidence 44322 11111111111111110 011110 0 111111 11111111 223445888888999999999999877 Q ss_pred ee---c-c---ccc-ccch----HHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC---CCcEE-EEEEeCC Q lcl|NC_019456. 72 YQ---N-Y---KQM-DNEP----LADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS---TGEPI-ALWPLDP 135 (435) Q Consensus 72 ~~---~-~---~~~-~~~~----l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~---~g~~~-~l~~l~~ 135 (435) -+ + + ..+ .+++ ..+.+..=-..-+...++++.+..++-+-|+.|++++.... .+.+. .++.+.. T Consensus 80 s~idpDtg~ptg~i~ed~p~~~~v~~~v~~iagG~lGqaqLlkr~~~~ltV~GE~~i~il~~~~~~pd~~~r~~W~vVt~ 159 (629) T protein:vir:10 80 SELDPDTGKPTGGIRDDDPDGLRFLEIVKTMAGGPLGQAQLQKRAAECLTVPGEHRICLLDQGDKNPDGSVRHNWYVVTN 159 (629) T ss_pred eeecCCCCCCccccccCchhHHHHHHHHHHhcCccchHHHHHHHHHhheeccCceEEEEeecCCCCCCcccccceeeecH Confidence 54 1 1 111 1222 22223333445577889999999999999999999885332 12233 4555555 Q ss_pred ceeEEEEcCCCceEEEEEecCCeeEEEchhheEEec--cCCCccccccCcHHHHHHHHHHHHHHHHH-----HHHHHhhc Q lcl|NC_019456. 136 NTVSILRNTDNNSYWYRVTSDIYNFTIPINDVIHVK--HVVPSNSWYGVSPIDVLSSSLKFQRSVEN-----FSQNEMEK 208 (435) Q Consensus 136 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~iih~~--~~~~~~~~~G~s~l~~~~~~i~~~~~~~~-----~~~~~~~n 208 (435) ..++ .+.++... ...++|..-+|..+.=+.|| .+++.....--||+.++...+.-.....+ ...+++.| T Consensus 160 ~Ei~---~kg~g~~~-i~lpdg~~he~~~~~D~l~RvW~P~Prr~~e~DSpvra~l~~lrEi~r~tk~i~~aakSRL~gn 235 (629) T protein:vir:10 160 DEVK---NKGAGKTD-IELPDGTIHEYSKGRDVMFRVWNPRPRRAKEPDSPVRACLDSLREIIRTTKKIRNASKSRLIGN 235 (629) T ss_pred HHhc---cccCceeE-EEcCCCceeeeeCCCeeEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHhHHhhC Confidence 5433 12212222 23344444555444333444 34555666778888887776665443322 22334556 Q ss_pred CCceE---EEeCC------cC----------CHHHHHHHHHHHH----HHhcCCCc-----ccccc------CCceeeec Q lcl|NC_019456. 209 KDKFV---LQYDR------SI----------SPEKRQAMVNDFL----RMVKENGG-----AVVQE------AGWKVDRY 254 (435) Q Consensus 209 ~~~~~---~~~~~------~~----------~~e~~~~~~~~~~----~~~~~~~~-----~~vl~------~g~~~~~~ 254 (435) |..++ ++++. .- .....+.+.+.|. ..+.+.+. ++++. .+++.-.+ T Consensus 236 GvlflP~e~slp~~~ap~~~~~Pg~~~p~~~g~aa~d~l~~~l~q~a~aAi~De~S~aA~vPiia~vP~E~l~~ikhLkf 315 (629) T protein:vir:10 236 GVVFLPQELSLPRATAPVADNQPGAPVPIVDGVAAADELSNLLFQTAAAAVDDEDSQAALIPLLATVPGEHLQKIFHLKI 315 (629) T ss_pred ceeEeccCcccccccCCCCCCCCcccccccCCCcchHHHHHHHHHHHHhhhcCCCCccceeeeEEeechHHhcCeeeeee Confidence 54322 12221 00 1112334444443 33433221 23321 22333334 Q ss_pred cCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCcccCcccHHHH-HHHHHHHHHhHHHHHHHHHHHHhhccccc----- Q lcl|NC_019456. 255 ESKFEPADLSSVEQISRIRIATAFNVPISFL-NDDQAKSTTNVEH-VTHSWTMTLMPIIRQYESQFNMKLFTPGK----- 327 (435) Q Consensus 255 ~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~l-g~~~~~~~~~~e~-~~~~~~~~i~P~~~~i~~~l~~~l~~~~~----- 327 (435) .....+. -+.+++..+..+|....|||+.| |.+.++|-.+.=| -...++-.|.|.+..++++|++.+|.+.. T Consensus 316 ~~eite~-~iktR~daI~RlAmglDispErLLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~Ait~~~Lrp~L~~eGi 394 (629) T protein:vir:10 316 GNEITEV-EIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVQLHIKPVMEVLCAAIYREVLVATLRAEGI 394 (629) T ss_pred cCchhHH-HHhhHHHHHHHHHhccCCChhheeeccCCccceeeEEecccceeeecchHHHHHHHHHHhHHHHHHHHHhCC Confidence 4444433 46678888889999999999955 4445665444322 12346778999999999999998776543 Q ss_pred ccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCcee-----------eecccccchhc- Q lcl|NC_019456. 328 RVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHL-----------YISKDLYPLDK- 395 (435) Q Consensus 328 ~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~-----------~~~~n~~~l~~- 395 (435) +...|.+.||.+.|... + ++.+-+..+.+.|.+|-...|+.+|+.--+..--+.+ ..+-.+.+.-. T Consensus 395 Dp~~Yvvw~DaS~Lt~d-P-d~~deA~~a~drGaIt~eAlRr~lG~~~dd~y~~~t~~~~q~~A~~~v~~~P~Li~~~ap 472 (629) T protein:vir:10 395 DPDRYVLWYDASGLTVD-P-DKTDEATAAKEQGAITHEAYRRYLGLADEDGYDLETLEGAQAWARDAIVADPSLIKVLAP 472 (629) T ss_pred CHHHhEeeecCcccccC-C-CCcHHHHHHHHcCCccHHHHHHHhccccccCCCcCCcHHHHHHHHHHhcCCCchhhhhhh Confidence 33345678888877532 2 2345556678899999999999999863211000111 00111111000 Q ss_pred c-ccccccccccccccccccccCCCCCCCCC------CCCCCCCCCC Q lcl|NC_019456. 396 Y-YDAILDNKIQTDASVAAPKQEGGENTNEN------GLQSTEPEGS 435 (435) Q Consensus 396 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~ 435 (435) . ......-+-+. +.....++++..+++ ++++++.|.. T Consensus 473 ll~~~l~~i~~P~---p~~a~~~~~~~~~~~E~~~~~~e~~~e~dA~ 516 (629) T protein:vir:10 473 LLTDELAEIDWPE---PPAALPPGEDDQADEEQDTTGSEPSTEDDAE 516 (629) T ss_pred hcCCccccccccC---CCCcCCCCCcccCccccCCCCCCcCCCcchh Confidence 0 00000000000 000111111111111 1122222211 No 147 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=99.07 E-value=1.2e-10 Score=74.96 Aligned_cols=422 Identities=13% Similarity=0.086 Sum_probs=203.5 Q ss_pred CchHH-HHHhhccccccccc---------cccccchhhhhhcccc--ccCcccccHHHHhhhHHHHHHHHHHHHHHhhCc Q lcl|NC_019456. 1 MSFMS-KVRQFFGVHDQANQ---------IVQNPIPQPLDMAGVK--LEQATFSREHILESNEYIFSIVTRLSNVLASLP 68 (435) Q Consensus 1 Mg~~~-~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~ 68 (435) |.--+ |+.+.-....+.+. +.-.+...+-...+.+ ..+....+ +.|-..|.++..+.-|++++++|. T Consensus 1 ma~~~lr~~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW-~~~d~v~Elry~vgW~~~s~sr~r 79 (639) T protein:vir:97 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAW-DFSESIGELSYYVSWRANSCSRTT 79 (639) T ss_pred CCccceeeeecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhh-hhhhhhhhHHHHhhhhhhhhceee Confidence 54332 11111111110000 0000000111111111 11111122 335556888999999999999999 Q ss_pred eeeee---c-ccc-----ccc----chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc------EE- Q lcl|NC_019456. 69 LHEYQ---N-YKQ-----MDN----EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE------PI- 128 (435) Q Consensus 69 ~~~~~---~-~~~-----~~~----~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~------~~- 128 (435) +..-+ + +.. +.+ +.+.+....=-..-+...++++.+..++-+-|++|+.++.....+- +. T Consensus 80 L~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~~~~~ 159 (639) T protein:vir:97 80 LIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRA 159 (639) T ss_pred eEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCccccccc Confidence 88754 1 111 112 2233333323345567889999999999999999987665333221 22 Q ss_pred EEEEeCCceeEEEEcCCCceEEEEEecCCeeEEEchhheEEec--cCCCccccccCcHHHHHHHHHHHHHHHHHH----- Q lcl|NC_019456. 129 ALWPLDPNTVSILRNTDNNSYWYRVTSDIYNFTIPINDVIHVK--HVVPSNSWYGVSPIDVLSSSLKFQRSVENF----- 201 (435) Q Consensus 129 ~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~iih~~--~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~----- 201 (435) .++.+....+. ...++...... ++|..-+|..+.=+.|| .+++.....--||+.++...+.-.....+. T Consensus 160 ~W~vvs~~Ei~---~~~~~~~~i~l-PdG~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aaa 235 (639) T protein:vir:97 160 RWYAVTREEIK---SKAGETAEISL-PDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (639) T ss_pred ceeeeeHHHhc---ccCCCeeEeec-CCCCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHH Confidence 35555555443 22222222222 24444445433333344 455556677788888877766654433322 Q ss_pred HHHHhhcCCceE---EEeCCcC-------------------CHHHHHHHHHHH----HHHhcCCCc-----ccccc---- Q lcl|NC_019456. 202 SQNEMEKKDKFV---LQYDRSI-------------------SPEKRQAMVNDF----LRMVKENGG-----AVVQE---- 246 (435) Q Consensus 202 ~~~~~~n~~~~~---~~~~~~~-------------------~~e~~~~~~~~~----~~~~~~~~~-----~~vl~---- 246 (435) ..+++.||..++ ++++..- +....+.+.+.| ...+.+.+. ++++. T Consensus 236 kSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~~p~E 315 (639) T protein:vir:97 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (639) T ss_pred HHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEeechH Confidence 233445553221 1111110 111233444444 333433221 33332 Q ss_pred --CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHH-HHHHHHHHHhHHHHHHHHHHHHhhc Q lcl|NC_019456. 247 --AGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEH-VTHSWTMTLMPIIRQYESQFNMKLF 323 (435) Q Consensus 247 --~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~-~~~~~~~~i~P~~~~i~~~l~~~l~ 323 (435) .+++.-.+.....+. .+.+++..+..||..+.|||+.|-+..++|-.+.=| -...++..|.|.+..|+++|++.+| T Consensus 316 ~l~~ikhl~f~~ei~e~-aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~dedvrlHI~P~l~~icdAlT~~~L 394 (639) T protein:vir:97 316 HLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDIL 394 (639) T ss_pred HhcCeeeeeecCchhHH-HHhhHHHHHHHHHhccCCchhheeecccccceEEEEecccceeeecchhHHHHHHHHHhhHH Confidence 233333344444333 567788888999999999999776655555433212 1234677899999999999999977 Q ss_pred cccc-----ccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCce-------------ee Q lcl|NC_019456. 324 TPGK-----RVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADH-------------LY 385 (435) Q Consensus 324 ~~~~-----~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~-------------~~ 385 (435) .+.. +...|.+.||.+.|... -.+.+-+..+++.|.+|-.-.|+.+|+.-- ++=|. .. T Consensus 395 rp~Le~eGvDp~kYvvW~DaS~Lt~d--Pd~~deA~qa~drGAIt~eAlR~~lG~~ed--d~yd~~t~e~~~~~A~~~V~ 470 (639) T protein:vir:97 395 TPLLAREGIDPTKYILWYDASGLTSD--PDLSDEAVEAHDRGAITSAALRRLLNVGED--SGYDLTTLDGCREFAADVVT 470 (639) T ss_pred HHHHHHhCCCHHHhEeeecCcccccC--CCCcHHHHHHHHcCCccHHHHHHHhccccc--cCCCCCCcHHHHHHHHHHhc Confidence 6543 33345678988887543 224455566888999999999999998632 11120 00 Q ss_pred ecccc----cchhccccc------cccccccccccccccccCCCCCCCCCCCCCCCCC----CC Q lcl|NC_019456. 386 ISKDL----YPLDKYYDA------ILDNKIQTDASVAAPKQEGGENTNENGLQSTEPE----GS 435 (435) Q Consensus 386 ~~~n~----~~l~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~ 435 (435) .+-.+ .|+-...-. ......+++.+...+.++|..+..+ +.++.+ .+ T Consensus 471 ~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga~~~~e---Pdte~~~~~~~a 531 (639) T protein:vir:97 471 KNPELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSGARQQRE---PQTEDERSTEEA 531 (639) T ss_pred CCcchhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccCCCCCcC---CCcccccCCccc Confidence 00111 111100000 0000111111111111111111111 111111 11 No 148 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=99.07 E-value=1.2e-10 Score=74.96 Aligned_cols=422 Identities=13% Similarity=0.086 Sum_probs=203.5 Q ss_pred CchHH-HHHhhccccccccc---------cccccchhhhhhcccc--ccCcccccHHHHhhhHHHHHHHHHHHHHHhhCc Q lcl|NC_019456. 1 MSFMS-KVRQFFGVHDQANQ---------IVQNPIPQPLDMAGVK--LEQATFSREHILESNEYIFSIVTRLSNVLASLP 68 (435) Q Consensus 1 Mg~~~-~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~ 68 (435) |.--+ |+.+.-....+.+. +.-.+...+-...+.+ ..+....+ +.|-..|.++..+.-|++++++|. T Consensus 1 ma~~~lr~~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~~ar~~WQ~eAW-~~~d~v~Elry~vgW~~~s~sr~r 79 (639) T protein:vir:10 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMGTARNEWQSEAW-DFSESIGELSYYVSWRANSCSRTT 79 (639) T ss_pred CCccceeeeecCCCCCcchhhHHHhhhhhccCCcccchhhhccccchhhhhhhhh-hhhhhhhhHHHHhhhhhhhhceee Confidence 54332 11111111110000 0000000111111111 11111122 335556888999999999999999 Q ss_pred eeeee---c-ccc-----ccc----chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc------EE- Q lcl|NC_019456. 69 LHEYQ---N-YKQ-----MDN----EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE------PI- 128 (435) Q Consensus 69 ~~~~~---~-~~~-----~~~----~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~------~~- 128 (435) +..-+ + +.. +.+ +.+.+....=-..-+...++++.+..++-+-|++|+.++.....+- +. T Consensus 80 L~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~~~~~ 159 (639) T protein:vir:10 80 LIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRA 159 (639) T ss_pred eEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCccccccc Confidence 88754 1 111 112 2233333323345567889999999999999999987665333221 22 Q ss_pred EEEEeCCceeEEEEcCCCceEEEEEecCCeeEEEchhheEEec--cCCCccccccCcHHHHHHHHHHHHHHHHHH----- Q lcl|NC_019456. 129 ALWPLDPNTVSILRNTDNNSYWYRVTSDIYNFTIPINDVIHVK--HVVPSNSWYGVSPIDVLSSSLKFQRSVENF----- 201 (435) Q Consensus 129 ~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~iih~~--~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~----- 201 (435) .++.+....+. ...++...... ++|..-+|..+.=+.|| .+++.....--||+.++...+.-.....+. T Consensus 160 ~W~vvs~~Ei~---~~~~~~~~i~l-PdG~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aaa 235 (639) T protein:vir:10 160 RWYAVTREEIK---SKAGETAEISL-PDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKIKNAA 235 (639) T ss_pred ceeeeeHHHhc---ccCCCeeEeec-CCCCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHH Confidence 35555555443 22222222222 24444445433333344 455556677788888877766654433322 Q ss_pred HHHHhhcCCceE---EEeCCcC-------------------CHHHHHHHHHHH----HHHhcCCCc-----ccccc---- Q lcl|NC_019456. 202 SQNEMEKKDKFV---LQYDRSI-------------------SPEKRQAMVNDF----LRMVKENGG-----AVVQE---- 246 (435) Q Consensus 202 ~~~~~~n~~~~~---~~~~~~~-------------------~~e~~~~~~~~~----~~~~~~~~~-----~~vl~---- 246 (435) ..+++.||..++ ++++..- +....+.+.+.| ...+.+.+. ++++. T Consensus 236 kSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~~p~E 315 (639) T protein:vir:10 236 KSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVASVAAE 315 (639) T ss_pred HHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEeechH Confidence 233445553221 1111110 111233444444 333433221 33332 Q ss_pred --CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHH-HHHHHHHHHhHHHHHHHHHHHHhhc Q lcl|NC_019456. 247 --AGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEH-VTHSWTMTLMPIIRQYESQFNMKLF 323 (435) Q Consensus 247 --~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~-~~~~~~~~i~P~~~~i~~~l~~~l~ 323 (435) .+++.-.+.....+. .+.+++..+..||..+.|||+.|-+..++|-.+.=| -...++..|.|.+..|+++|++.+| T Consensus 316 ~l~~ikhl~f~~ei~e~-aiktR~daI~RlA~glDi~pE~LLGl~d~NHWsAWqI~dedvrlHI~P~l~~icdAlT~~~L 394 (639) T protein:vir:10 316 HLEKVQHIKFGNEVTEV-EIKTRIDAITRLAMGLDVSPERLLGMSKGNHWSAWAIGDEDVQLHIKPVMDLICQAIYNDIL 394 (639) T ss_pred HhcCeeeeeecCchhHH-HHhhHHHHHHHHHhccCCchhheeecccccceEEEEecccceeeecchhHHHHHHHHHhhHH Confidence 233333344444333 567788888999999999999776655555433212 1234677899999999999999977 Q ss_pred cccc-----ccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCce-------------ee Q lcl|NC_019456. 324 TPGK-----RVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADH-------------LY 385 (435) Q Consensus 324 ~~~~-----~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~-------------~~ 385 (435) .+.. +...|.+.||.+.|... -.+.+-+..+++.|.+|-.-.|+.+|+.-- ++=|. .. T Consensus 395 rp~Le~eGvDp~kYvvW~DaS~Lt~d--Pd~~deA~qa~drGAIt~eAlR~~lG~~ed--d~yd~~t~e~~~~~A~~~V~ 470 (639) T protein:vir:10 395 TPLLAREGIDPTKYILWYDASGLTSD--PDLSDEAVEAHDRGAITSAALRRLLNVGED--SGYDLTTLDGCREFAADVVT 470 (639) T ss_pred HHHHHHhCCCHHHhEeeecCcccccC--CCCcHHHHHHHHcCCccHHHHHHHhccccc--cCCCCCCcHHHHHHHHHHhc Confidence 6543 33345678988887543 224455566888999999999999998632 11120 00 Q ss_pred ecccc----cchhccccc------cccccccccccccccccCCCCCCCCCCCCCCCCC----CC Q lcl|NC_019456. 386 ISKDL----YPLDKYYDA------ILDNKIQTDASVAAPKQEGGENTNENGLQSTEPE----GS 435 (435) Q Consensus 386 ~~~n~----~~l~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~ 435 (435) .+-.+ .|+-...-. ......+++.+...+.++|..+..+ +.++.+ .+ T Consensus 471 ~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga~~~~e---Pdte~~~~~~~a 531 (639) T protein:vir:10 471 KNPELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSGARQQRE---PQTEDERSTEEA 531 (639) T ss_pred CCcchhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccCCCCCcC---CCcccccCCccc Confidence 00111 111100000 0000111111111111111111111 111111 11 No 149 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=98.96 E-value=5e-09 Score=66.11 Aligned_cols=402 Identities=10% Similarity=0.005 Sum_probs=179.6 Q ss_pred CchHHHHHhhccccccc---ccccc-----------ccchhhhh---hccccccCc------ccccHHHHhhhHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQA---NQIVQ-----------NPIPQPLD---MAGVKLEQA------TFSREHILESNEYIFSIV 57 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~---~~~~~-----------~~~~~~~~---~~~~~~~~~------~~~~~~~~~~~~~v~~~i 57 (435) ||+|+++++++...... .+... ........ +.-...... .....+.......-..++ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTAS 80 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHH Confidence 99999999987533211 11100 00000000 100010000 000111122333444455 Q ss_pred HHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCce Q lcl|NC_019456. 58 TRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNT 137 (435) Q Consensus 58 ~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~ 137 (435) +.+|+-+..=|..+.-++. ..+ ..+..++ ........+...+......|.+++.+..+. |.+ .+-.+++.. T Consensus 81 ~~~A~lv~~e~~~i~v~d~-~~~-~~l~~~l----~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~--~~~-~i~~v~ad~ 151 (522) T protein:vir:47 81 KKIASLVYNEQATITTKNE-ILQ-KFLDDML----TNDRFNKNFERYLESCLALGGLAMRPYIDG--DKV-RVAFIQAPV 151 (522) T ss_pred HHHhhhhcCCcceeecCCh-HHH-HHHHHHH----hhcchHHHHHHHHHHhhccCCEEEEEEEcC--Cce-EEEEEcCCc Confidence 5666555543333322221 111 2222222 123355556777888888898888877753 332 333344433 Q ss_pred eEEE-EcCC----------------CceEEEE------------------------------Eec-C----CeeE---EE Q lcl|NC_019456. 138 VSIL-RNTD----------------NNSYWYR------------------------------VTS-D----IYNF---TI 162 (435) Q Consensus 138 v~~~-~~~~----------------~~~~~~~------------------------------~~~-~----~~~~---~~ 162 (435) +-+. .+.. ....||. +.. + |... .+ T Consensus 152 ~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~ 231 (522) T protein:vir:47 152 FFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSEL 231 (522) T ss_pred eEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCcccccccc Confidence 3322 1111 1111111 000 0 0000 00 Q ss_pred ------ch---------hheEEeccCCCc----cccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCC----- Q lcl|NC_019456. 163 ------PI---------NDVIHVKHVVPS----NSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDR----- 218 (435) Q Consensus 163 ------~~---------~~iih~~~~~~~----~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~----- 218 (435) .+ --..||+.+-++ +.++|+|.+..+...+...+..-..-..-|+.+-..++.-.. T Consensus 232 ~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~ 311 (522) T protein:vir:47 232 DKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQRQ 311 (522) T ss_pred ccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHHhccC Confidence 00 112245543221 456799999999988877665444444445554332222111 Q ss_pred -cCCHHHHHHHHHHHH---HHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCccc Q lcl|NC_019456. 219 -SISPEKRQAMVNDFL---RMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTT 294 (435) Q Consensus 219 -~~~~e~~~~~~~~~~---~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~ 294 (435) ........ ....|. ..+..-+ .-.+++-+++.++....+-++.+..+...+.|+..+|+++..++....+.. T Consensus 312 ~~~~~g~~~-~~~~fd~~~~~f~~~~--~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~k- 387 (522) T protein:vir:47 312 YQRPDGTID-FRPRFDVEQNVYMQIG--GSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMK- 387 (522) T ss_pred CCCCCcccc-cccccCcccceEeecC--CCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccc- Confidence 10000000 000000 0000000 001233457777777777788888888899999999999999987655322 Q ss_pred HHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcC Q lcl|NC_019456. 295 NVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNG 360 (435) Q Consensus 295 ~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g 360 (435) ++.+. ...++.+|..++..+.+....--+..........+.+++++....|.++.++...+++.+| T Consensus 388 TAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG 467 (522) T protein:vir:47 388 TATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAG 467 (522) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhcC Confidence 22111 1122334444444444333211111112223456888889999999999999999999999 Q ss_pred CcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 361 IFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEG 434 (435) Q Consensus 361 ~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (435) +|++-+++.++ ..++++.+++-+ ..+.. .+.. . .+...+-..+++.....++-|| T Consensus 468 ~~s~e~~i~~~--~g~~eeea~~el--------~ri~~---E~~~---~---~~~~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 468 FSTKKRAIGKT--LNISGVEAEKEL--------NAINS---ELLP---M---NDAELAIYGMHDQNEEKADDKG 522 (522) T ss_pred CCCHHHHHHhc--CCCChHHHHHHH--------HHHHH---hhcc---C---CCCCCCCCCCCCcccccCCCCC Confidence 99999987664 223333332211 11110 0000 0 0000001111112222223333 No 150 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.94 E-value=4.1e-09 Score=66.54 Aligned_cols=392 Identities=9% Similarity=0.011 Sum_probs=178.8 Q ss_pred CchHHHHHhhccccccc--------------cccccccchh---hhhhccccccCcc------cccHHHHhhhHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQA--------------NQIVQNPIPQ---PLDMAGVKLEQAT------FSREHILESNEYIFSIV 57 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~--------------~~~~~~~~~~---~~~~~~~~~~~~~------~~~~~~~~~~~~v~~~i 57 (435) ||||++|++++...... .......... ...+......... ....+.....+.-..++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 99999999998542110 0111000000 0001111111000 00011122223334455 Q ss_pred HHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCce Q lcl|NC_019456. 58 TRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNT 137 (435) Q Consensus 58 ~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~ 137 (435) +.+|+-+..=|..+.-++.. . ...+..++ ........+...+...+..|.+++.+..++ +.+ .+..+++.. T Consensus 81 ~~~A~lv~~e~~~i~~~d~~-~-~~~l~~il----~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~--~~~-~I~~v~ad~ 151 (500) T protein:vir:98 81 KKIASLVFNEQAEIKVDDDA-A-NEFISETL----KNDRFNKNFERYLESCLALGGLAMRPYVDG--DKV-RVAFVQAPV 151 (500) T ss_pred HHHhhhhcCCcceEecCChH-H-HHHHHHHH----hhccHHHHHHHHHHHHhhcCCEEEEEEEeC--Cce-EEEEEcCCe Confidence 55665554433333222211 1 11222222 223355666777888888999988887764 333 345556655 Q ss_pred eEEEEcC-CC-----------------ceEEEEE----ecCC--eeE---EEc--------------------hh----- Q lcl|NC_019456. 138 VSILRNT-DN-----------------NSYWYRV----TSDI--YNF---TIP--------------------IN----- 165 (435) Q Consensus 138 v~~~~~~-~~-----------------~~~~~~~----~~~~--~~~---~~~--------------------~~----- 165 (435) +.+...+ .+ ..+|... ..++ ... .|. .. T Consensus 152 ~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~ 231 (500) T protein:vir:98 152 FLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVT 231 (500) T ss_pred eEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEec Confidence 5442211 11 1111000 0001 000 010 00 Q ss_pred -----heEEeccCCC----ccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCHH----HHHHHH-HH Q lcl|NC_019456. 166 -----DVIHVKHVVP----SNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISPE----KRQAMV-ND 231 (435) Q Consensus 166 -----~iih~~~~~~----~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~e----~~~~~~-~~ 231 (435) -..||+.+.+ .+.+.|+|.+..+...+...+.......+-++.+...++.-...+... ..+.+. -. T Consensus 232 ~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~ 311 (500) T protein:vir:98 232 DVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPRPR 311 (500) T ss_pred cCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCCcc Confidence 1224443222 145679999999999988877655444455565544332211111000 000000 00 Q ss_pred HHHHhcCC-C-ccc-cccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH--------- Q lcl|NC_019456. 232 FLRMVKEN-G-GAV-VQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV--------- 299 (435) Q Consensus 232 ~~~~~~~~-~-~~~-vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------- 299 (435) |. .... . .+- -.+++..++.++....+-++.+..+...++|+...|+++..+|....+.. ++.+. T Consensus 312 ~d--~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~-TAtei~s~~~~~~~ 388 (500) T protein:vir:98 312 FE--SDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMK-TATEIVSENSDTYQ 388 (500) T ss_pred cC--CCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccc-cHHHHHHHHHHHHH Confidence 00 0000 0 000 01233457777777666678888888889999999999999987665432 22221 Q ss_pred -----HHHHHHHHhHHHHHHHHHHHH-hhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHh-C Q lcl|NC_019456. 300 -----THSWTMTLMPIIRQYESQFNM-KLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELE-G 372 (435) Q Consensus 300 -----~~~~~~~i~P~~~~i~~~l~~-~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~-g 372 (435) ...++.+|..++..+...... .++ .........+.+++++-...|.++.++...+++.+|+|+.-+++.++ | T Consensus 389 t~~~~~~~~~~al~~lv~~il~~~~~~~~~-~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g 467 (500) T protein:vir:98 389 MRNSIVALVEQSLKELVISIFEIAKAYDLY-QSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLN 467 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-CCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCC Confidence 112233333333333322221 111 11222345677888888889999999999999999999999988654 5 Q ss_pred CCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCC Q lcl|NC_019456. 373 QAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGE 420 (435) Q Consensus 373 ~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (435) ++ ++.+++.+ ..+... . .+..........-.|+ T Consensus 468 ~~---eeea~~~l--------~~i~~E---~-~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 468 VT---EEKAQEIA--------AEINTG---I-VDEINQQRTDTHLYGE 500 (500) T ss_pred CC---HHHHHHHH--------HHHHHh---c-cccCCCCCccccccCC Confidence 43 33332221 111000 0 0000000001111111 No 151 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.94 E-value=4.1e-09 Score=66.54 Aligned_cols=392 Identities=9% Similarity=0.011 Sum_probs=178.8 Q ss_pred CchHHHHHhhccccccc--------------cccccccchh---hhhhccccccCcc------cccHHHHhhhHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQA--------------NQIVQNPIPQ---PLDMAGVKLEQAT------FSREHILESNEYIFSIV 57 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~--------------~~~~~~~~~~---~~~~~~~~~~~~~------~~~~~~~~~~~~v~~~i 57 (435) ||||++|++++...... .......... ...+......... ....+.....+.-..++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 99999999998542110 0111000000 0001111111000 00011122223334455 Q ss_pred HHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCce Q lcl|NC_019456. 58 TRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNT 137 (435) Q Consensus 58 ~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~ 137 (435) +.+|+-+..=|..+.-++.. . ...+..++ ........+...+...+..|.+++.+..++ +.+ .+..+++.. T Consensus 81 ~~~A~lv~~e~~~i~~~d~~-~-~~~l~~il----~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~--~~~-~I~~v~ad~ 151 (500) T protein:vir:30 81 KKIASLVFNEQAEIKVDDDA-A-NEFISETL----KNDRFNKNFERYLESCLALGGLAMRPYVDG--DKV-RVAFVQAPV 151 (500) T ss_pred HHHhhhhcCCcceEecCChH-H-HHHHHHHH----hhccHHHHHHHHHHHHhhcCCEEEEEEEeC--Cce-EEEEEcCCe Confidence 55665554433333222211 1 11222222 223355666777888888999988887764 333 345556655 Q ss_pred eEEEEcC-CC-----------------ceEEEEE----ecCC--eeE---EEc--------------------hh----- Q lcl|NC_019456. 138 VSILRNT-DN-----------------NSYWYRV----TSDI--YNF---TIP--------------------IN----- 165 (435) Q Consensus 138 v~~~~~~-~~-----------------~~~~~~~----~~~~--~~~---~~~--------------------~~----- 165 (435) +.+...+ .+ ..+|... ..++ ... .|. .. T Consensus 152 ~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~ 231 (500) T protein:vir:30 152 FLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVT 231 (500) T ss_pred eEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEec Confidence 5442211 11 1111000 0001 000 010 00 Q ss_pred -----heEEeccCCC----ccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCHH----HHHHHH-HH Q lcl|NC_019456. 166 -----DVIHVKHVVP----SNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISPE----KRQAMV-ND 231 (435) Q Consensus 166 -----~iih~~~~~~----~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~e----~~~~~~-~~ 231 (435) -..||+.+.+ .+.+.|+|.+..+...+...+.......+-++.+...++.-...+... ..+.+. -. T Consensus 232 ~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~ 311 (500) T protein:vir:30 232 DVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPRPR 311 (500) T ss_pred cCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCCcc Confidence 1224443222 145679999999999988877655444455565544332211111000 000000 00 Q ss_pred HHHHhcCC-C-ccc-cccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH--------- Q lcl|NC_019456. 232 FLRMVKEN-G-GAV-VQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV--------- 299 (435) Q Consensus 232 ~~~~~~~~-~-~~~-vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------- 299 (435) |. .... . .+- -.+++..++.++....+-++.+..+...++|+...|+++..+|....+.. ++.+. T Consensus 312 ~d--~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~-TAtei~s~~~~~~~ 388 (500) T protein:vir:30 312 FE--SDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMK-TATEIVSENSDTYQ 388 (500) T ss_pred cC--CCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccc-cHHHHHHHHHHHHH Confidence 00 0000 0 000 01233457777777666678888888889999999999999987665432 22221 Q ss_pred -----HHHHHHHHhHHHHHHHHHHHH-hhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHh-C Q lcl|NC_019456. 300 -----THSWTMTLMPIIRQYESQFNM-KLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELE-G 372 (435) Q Consensus 300 -----~~~~~~~i~P~~~~i~~~l~~-~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~-g 372 (435) ...++.+|..++..+...... .++ .........+.+++++-...|.++.++...+++.+|+|+.-+++.++ | T Consensus 389 t~~~~~~~~~~al~~lv~~il~~~~~~~~~-~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g 467 (500) T protein:vir:30 389 MRNSIVALVEQSLKELVISIFEIAKAYDLY-QSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLN 467 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-CCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCC Confidence 112233333333333322221 111 11222345677888888889999999999999999999999988654 5 Q ss_pred CCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCC Q lcl|NC_019456. 373 QAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGE 420 (435) Q Consensus 373 ~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (435) ++ ++.+++.+ ..+... . .+..........-.|+ T Consensus 468 ~~---eeea~~~l--------~~i~~E---~-~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 468 VT---EEKAQEIA--------AEINTG---I-VDEINQQRTDTHLYGE 500 (500) T ss_pred CC---HHHHHHHH--------HHHHHh---c-cccCCCCCccccccCC Confidence 43 33332221 111000 0 0000000001111111 No 152 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=98.93 E-value=8.2e-09 Score=64.91 Aligned_cols=393 Identities=10% Similarity=-0.033 Sum_probs=170.4 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcc-cccH---HHHhhhHHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQAT-FSRE---HILESNEYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) +.++..|.+.+.......... ..++..-. ...... .... .......+..-||+.+|+.+.--.|.+- + . T Consensus 17 ~~~~~~L~~~~~~~~~~~~~~----~~Yy~G~~-~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~rl~~~Gf~~~-d-~ 89 (474) T protein:vir:81 17 NALINGLLAQIENLRWKNLLR----TSYYENKR-TIQYVGTLIPPQYFNLGLVLGWTGKAVDALARRCNLEGFVWP-D-G 89 (474) T ss_pred HHHHHHHHHHHHHHhhHHHHH----HHHhccCC-ChhhccccccHHHHHHHhhcChHHHHHHHHHhhhcccceECC-C-C Confidence 444444444332222111100 01111000 000000 0001 0011234555567777665544444431 1 1 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcE-EEEEEeCCceeEEEEcCCCceE-----E Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEP-IALWPLDPNTVSILRNTDNNSY-----W 150 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~-~~l~~l~~~~v~~~~~~~~~~~-----~ 150 (435) ...+..+...+ +-| ........+..+.+.+|.||+.|..+.. |.+ ..+.+++|.++....|+..... . T Consensus 90 ~~~~~~l~~iw--~~N---~ld~~~~~~~~~al~~G~sf~~V~~~~d-~~~~~~i~~~sp~~~~~~~D~~~~~~~~al~~ 163 (474) T protein:vir:81 90 DLDSLGGTEVV--DDN---HLLSEIDSAIVAAMQHGPAFLINTVGED-DEPEALIHVKDASEATGEWNRRRRGLNNLLSI 163 (474) T ss_pred CccchHHHHHH--Hhc---ChhHHHHHHHHHHHhhCceeEEEecCCC-CCceeEEEEeccceEEEEEeCCCCcceeeeEE Confidence 11122232322 222 2335567778899999999988876533 543 4578889988887766533221 0 Q ss_pred EEEecCCee---EEEchhh-------------------------eEEeccCCCccccccCcHH----HHHHHHHHHHHHH Q lcl|NC_019456. 151 YRVTSDIYN---FTIPIND-------------------------VIHVKHVVPSNSWYGVSPI----DVLSSSLKFQRSV 198 (435) Q Consensus 151 ~~~~~~~~~---~~~~~~~-------------------------iih~~~~~~~~~~~G~s~l----~~~~~~i~~~~~~ 198 (435) +....+|.. ..|.++. |++|.+.......+|.|.+ ..+.+.+.....- T Consensus 164 ~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvPvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~~ 243 (474) T protein:vir:81 164 IDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVPAQVLPYKPAPKRPFGQSRITKPMMGLQDAGVRELAR 243 (474) T ss_pred EEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcceEEecccccccCcCCccccchhHHHHHHHHHHHHHH Confidence 111111111 1122222 4444443333455777754 2333333333332 Q ss_pred HHHHHHHhhcCCceEEEeCC-cCCHHH------HHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHH Q lcl|NC_019456. 199 ENFSQNEMEKKDKFVLQYDR-SISPEK------RQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISR 271 (435) Q Consensus 199 ~~~~~~~~~n~~~~~~~~~~-~~~~e~------~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~ 271 (435) ......|+....+.++-.+. ...+++ .+....++...-++..+......+.++.+++...- ..|.+..+... T Consensus 244 ~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~l-~~~~~~l~~~~ 322 (474) T protein:vir:81 244 REGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAASP-DAHWSDINGLA 322 (474) T ss_pred HHHHHHHhcchhheeecCChhhcccccccccchhhhhHHHHhcCCCcccccccccccccccccCCCCh-hHHHHHHHHHH Confidence 33344455444444443321 111111 11112222222222222222334566666655432 24788888899 Q ss_pred HHHHHHhCCCHHHhCCcccCcccHHHHHHH---HHHHHHhHHHHHH----HHHHHHhhccccc------ccCcceeeech Q lcl|NC_019456. 272 IRIATAFNVPISFLNDDQAKSTTNVEHVTH---SWTMTLMPIIRQY----ESQFNMKLFTPGK------RVKGFYFSFNV 338 (435) Q Consensus 272 ~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~---~~~~~i~P~~~~i----~~~l~~~l~~~~~------~~~g~~i~fd~ 338 (435) ..||..-++|+..+|.....|-++.++... -+...+.-..+.+ ++.+-.-+.-... ......+++.+ T Consensus 323 ~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W 402 (474) T protein:vir:81 323 KLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKW 402 (474) T ss_pred HHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeEe Confidence 999999999999999765443333333211 1111122222222 2222111111110 01123455555 Q ss_pred hhhhccCHHHHHHHHHHHHhcCC-cC-HHHHHHHhCCCCCCCc-CCceeeecccccchhccccccccccccccc Q lcl|NC_019456. 339 NGLLRGDTAARTQYYQTLTRNGI-FK-PNEIRELEGQAPIPDE-AADHLYISKDLYPLDKYYDAILDNKIQTDA 409 (435) Q Consensus 339 ~~l~~~d~~~~~~~~~~~~~~g~-~t-~NE~R~~~g~~p~~~~-~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~ 409 (435) .+....+...+++++.|+++.|. +. ..=+++++|+.+-.-+ +-++..-.....+++.+.. ....+.+.+ T Consensus 403 ~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~~~l~~--~~~~~~~aq 474 (474) T protein:vir:81 403 RDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTLQALID--RSNNGATAQ 474 (474) T ss_pred cCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHHHHHHh--cCCCCCCCC Confidence 66778889999999999999874 33 3446788898753210 0000000001112222110 001111111 No 153 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=98.91 E-value=1.1e-08 Score=64.17 Aligned_cols=397 Identities=12% Similarity=0.045 Sum_probs=172.5 Q ss_pred CchHHHHHhhccccccc--------------cccccccchh---hhhhccccccCccccc------HHHHhhhHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQA--------------NQIVQNPIPQ---PLDMAGVKLEQATFSR------EHILESNEYIFSIV 57 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~--------------~~~~~~~~~~---~~~~~~~~~~~~~~~~------~~~~~~~~~v~~~i 57 (435) |++|++++++|..-... .......... ...+.........+.+ .....+.+.-..+. T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~ 80 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSA 80 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHH Confidence 99999999988432110 0000000000 0011111111110000 00111222223333 Q ss_pred HHHHHHHhh--Cceeeeecccc-------cccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEE Q lcl|NC_019456. 58 TRLSNVLAS--LPLHEYQNYKQ-------MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPI 128 (435) Q Consensus 58 ~~ia~~ia~--~~~~~~~~~~~-------~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~ 128 (435) +.+|+-+.. +.+.+-..... ......+..+. ........+...+.+.+..|.+++.+..+. |.+ T Consensus 81 ~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~----~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~--~~~- 153 (517) T protein:vir:98 81 DVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVF----QHNKFIKNLSDYLEPTFALGGLTVRPYVDN--GEI- 153 (517) T ss_pred HHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHH----HhccHHHHHHHHHHHHhhhCCEEEEEEEeC--Cee- Confidence 444444332 22333211100 00111222222 223345556667788888899988887764 332 Q ss_pred EEEEeCCceeEEEEc------------------CCCceEEEEEe-----------------------cCCe--eEEEc-- Q lcl|NC_019456. 129 ALWPLDPNTVSILRN------------------TDNNSYWYRVT-----------------------SDIY--NFTIP-- 163 (435) Q Consensus 129 ~l~~l~~~~v~~~~~------------------~~~~~~~~~~~-----------------------~~~~--~~~~~-- 163 (435) .+..+++..+-+... .++..+|.... .+.. ...++ T Consensus 154 ~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~ 233 (517) T protein:vir:98 154 EFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLE 233 (517) T ss_pred EEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCcccccccccc Confidence 244455544432111 11111111000 0000 00010 Q ss_pred ------hhh----------eEEeccCCCc----cccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCH- Q lcl|NC_019456. 164 ------IND----------VIHVKHVVPS----NSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISP- 222 (435) Q Consensus 164 ------~~~----------iih~~~~~~~----~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~- 222 (435) +++ ..||+.+-++ ..+.|+|.+..+...+...+..-..-..-++.|-..+..-...+.. T Consensus 234 ~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~i~vp~~~l~~~ 313 (517) T protein:vir:98 234 ELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRTVFVSDVMLRTV 313 (517) T ss_pred ccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcceecChhhhccc Confidence 011 2244443222 3567999999998888877655444444555553332221111100 Q ss_pred -HHHHH-HHHHHHHHhcCCCc----cccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHH Q lcl|NC_019456. 223 -EKRQA-MVNDFLRMVKENGG----AVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNV 296 (435) Q Consensus 223 -e~~~~-~~~~~~~~~~~~~~----~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~ 296 (435) +.-.. ...-| ..... +..-.++-.++.++....+-++.+..+...+.|+..+|+++..++....+.. ++ T Consensus 314 ~~~~g~~~~~~~----d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~k-TA 388 (517) T protein:vir:98 314 PDESGMPPPQVF----DPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMK-TA 388 (517) T ss_pred cCCCCcccCCCC----CcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccccc-cH Confidence 00000 00000 00000 0001123346666777666788888889999999999999999997665432 22 Q ss_pred HHHH---HHHHHHHhHHHHHHHHHHHH------------hhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCC Q lcl|NC_019456. 297 EHVT---HSWTMTLMPIIRQYESQFNM------------KLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGI 361 (435) Q Consensus 297 e~~~---~~~~~~i~P~~~~i~~~l~~------------~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~ 361 (435) .+.. .-...++.-+...++..|.. .++. .....+..+.+++++....|.++.++...+++..|+ T Consensus 389 TEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~-~~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG~ 467 (517) T protein:vir:98 389 TEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFG-GEIPSAEHIGVDFDDGVFQDRSALLRFYGQAKTFGF 467 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC-CCCCCCcceEEEcCCCCCCCHHHHHHHHHHHHhcCC Confidence 2111 11111222222222211111 1222 222334568888999999999999999999999999 Q ss_pred cCHHHHHHHh-CCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 362 FKPNEIRELE-GQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPE 433 (435) Q Consensus 362 ~t~NE~R~~~-g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (435) |++-+++.++ |+. ++.+++.+.. +. .......... ......++ ..++.| T Consensus 468 ms~~~~i~~~~g~~---eeeA~~e~~~-----i~---~E~~~~~~~~----~~~~~~~~--------~~gd~e 517 (517) T protein:vir:98 468 IPTVEAIQRIFKVP---KKTAEQWLEE-----IR---KDQIELDPVT----ISQRAQKR--------MFGDEE 517 (517) T ss_pred CCHHHHHHHhCCCC---hHHHHHHHHH-----HH---HhccccCCCC----ccccccCC--------CCCCCC Confidence 9999987654 754 3334332111 10 0000000000 00011111 111111 No 154 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=98.91 E-value=1.6e-08 Score=63.29 Aligned_cols=345 Identities=11% Similarity=0.030 Sum_probs=160.3 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHH---HH-hhhHHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREH---IL-ESNEYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~-~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) ..++++|.+.+.......... ..++..-..-..-+..+.++ .+ .-..+..-+|+.+|..+.=-.|.. T Consensus 3 ~~~i~~L~~~~~~~~~r~~~~----~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~----- 73 (409) T protein:vir:94 3 EKGIGYLRFKLSVHKRRAEMR----YDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFEN----- 73 (409) T ss_pred HHHHHHHHHHHHHHhHHHHHH----HHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcccCcccC----- Confidence 334555555443322211111 01111000000000001111 01 112344445565554333222221 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceE--EEEEe Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSY--WYRVT 154 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~--~~~~~ 154 (435) .+..+-..+. ..+.......+..+.+.+|.||+.|..+. .|.| .+.+++|..+....|...... .+.+. T Consensus 74 --~d~~l~~i~~-----~N~ld~~~~~~~~~aliyG~sf~~v~~~~-dg~~-~i~~~sp~~~~~i~D~~~~~~~~a~~~~ 144 (409) T protein:vir:94 74 --DDFTVNEIFE-----ENNPDIFFDSAVLSSLIASCSFTYISKGE-NDAV-RLQVIEAVNATGIIDPITGLLTEGYAVL 144 (409) T ss_pred --CchHHHHHHH-----hcChhHHHHHHHHHHHHhcceeEEEecCC-CCce-EEEEeccceEEEEEecCCCceeeeEEEE Confidence 1122322221 22334556788899999999999888654 4665 678899988887776643211 11111 Q ss_pred --c-CCee---EEEchhh----------------------eEEeccCCCccccccCcHH----HHHHHHHHHHHHHHHHH Q lcl|NC_019456. 155 --S-DIYN---FTIPIND----------------------VIHVKHVVPSNSWYGVSPI----DVLSSSLKFQRSVENFS 202 (435) Q Consensus 155 --~-~~~~---~~~~~~~----------------------iih~~~~~~~~~~~G~s~l----~~~~~~i~~~~~~~~~~ 202 (435) + .+.. ..+.+++ |++|.+....+..+|.|.+ ..+.+.+.....-.... T Consensus 145 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~ 224 (409) T protein:vir:94 145 ERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVT 224 (409) T ss_pred EecCCCceEEEEEEecCcEEEEEecCceeEeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHHHHH Confidence 1 1110 1122222 3344433233556787754 33334443333333344 Q ss_pred HHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCCCcccccc-----CCceeeeccCChhhHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 203 QNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQE-----AGWKVDRYESKFEPADLSSVEQISRIRIATA 277 (435) Q Consensus 203 ~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~-----~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~ 277 (435) ..++....+.++-.+.+-. ..+ .|+.. .++++.++ .+.++.+++...- ..|++..+....++|.. T Consensus 225 ~e~~a~pqr~i~G~d~d~~--~~~----~~~~~---~~~i~~~~~d~dg~~~~v~q~~~~~l-~~~~~~l~~~~~~~a~~ 294 (409) T protein:vir:94 225 AEFYSFPQKYVTGLSDDAE--PME----TWKAT---VSSMLQFTKDEDGDKPTLGQFTQPSM-SPFTEQLRTAAAGFAGE 294 (409) T ss_pred HHHhcChhheeEecCCCCc--ccc----hhhhh---HHHhhcCCCCCCCCCceEEecCCCCh-hHHHHHHHHHHHHHhhh Confidence 4555555555544432211 111 22211 12333332 3356666554432 25889999999999999 Q ss_pred hCCCHHHhCCcccCcccHHHHHH--------------HHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhc Q lcl|NC_019456. 278 FNVPISFLNDDQAKSTTNVEHVT--------------HSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLR 343 (435) Q Consensus 278 fgvP~~~lg~~~~~~~~~~e~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~ 343 (435) -++|+..+|....+.. +.++.. ..|...++..++........ .. ........+++.+..+.. T Consensus 295 t~lP~~~lg~~~~Nps-Sa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~--~~-~~~~~~~~~~v~W~p~~~ 370 (409) T protein:vir:94 295 TGLTLDDLGFVSDNPS-SVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDD--AP-YLREQFRKTKPKWEPLFE 370 (409) T ss_pred cCCCHHHhccccCchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CC-ccccccccceEEeccCCC Confidence 9999999998665322 122211 11222222222211111111 00 000111223333344433 Q ss_pred cC---HHHHHHHHHHHHhcC--CcCHHHHHHHhCCCCCC Q lcl|NC_019456. 344 GD---TAARTQYYQTLTRNG--IFKPNEIRELEGQAPIP 377 (435) Q Consensus 344 ~d---~~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~ 377 (435) .+ ....++.+.|+++.| +...+-+++++|+..-+ T Consensus 371 ~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 371 ADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred cchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 33 456678899999999 55668899999999642 No 155 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=98.89 E-value=1.9e-08 Score=62.97 Aligned_cols=396 Identities=12% Similarity=0.030 Sum_probs=163.1 Q ss_pred Cc-------hHHHHHhhccccccccccccccchhhhhh------ccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhC Q lcl|NC_019456. 1 MS-------FMSKVRQFFGVHDQANQIVQNPIPQPLDM------AGVKLEQATFSREHILESNEYIFSIVTRLSNVLASL 67 (435) Q Consensus 1 Mg-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~ 67 (435) |. +++.|.+.+......- .....++.. .+... ............+...+|+.++..+--- T Consensus 8 ~~~~~~~~~~~~~l~~~~~~~~~r~----~~~~~Yy~G~~~i~~~~~~~---~~~~~~~~~~~n~~~~ivd~~~~~l~~~ 80 (485) T protein:vir:10 8 QEEIEDPAIARDEMVSAFEDSTQNL----KTNTSYYEAERRPEAIGVTV---PIQMQSLLAHVGYPRLYVDSIAERQAVE 80 (485) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHH----HHHHHHHhcCCcchhcCCCC---ChhhhhhhhhcCcHHHHHHHHHhhhccc Confidence 11 1222222221111000 000011100 00000 0000001111234455666666554322 Q ss_pred ceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC-------CCcEEEEEEeCCceeEE Q lcl|NC_019456. 68 PLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS-------TGEPIALWPLDPNTVSI 140 (435) Q Consensus 68 ~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~-------~g~~~~l~~l~~~~v~~ 140 (435) .|.+ .+....+..+..++. ......+...+..+++.+|.||.++.++.. .|. ..+.+++|..+.+ T Consensus 81 g~~~--~~~~~~~~~~~~i~~-----~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~-~~i~~~~p~~~~~ 152 (485) T protein:vir:10 81 GFRF--GDADEADEELWQWWQ-----ANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNT-PIIRVEPPTRMYA 152 (485) T ss_pred ceec--CCCchhHHHHHHHHH-----hcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCe-eEEEEEccceeEE Confidence 3332 111111222333332 234556778899999999999998876532 122 2477788888877 Q ss_pred EEcCCCc-eE---EEEEecC-Ce---eEEEchhhe-------------------------EEeccCCCccccccCcHHHH Q lcl|NC_019456. 141 LRNTDNN-SY---WYRVTSD-IY---NFTIPINDV-------------------------IHVKHVVPSNSWYGVSPIDV 187 (435) Q Consensus 141 ~~~~~~~-~~---~~~~~~~-~~---~~~~~~~~i-------------------------ih~~~~~~~~~~~G~s~l~~ 187 (435) ..+.... .. .+..... +. ...|..+.+ ++|.+.....+.+|.|-+.. T Consensus 153 ~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~ 232 (485) T protein:vir:10 153 EIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITP 232 (485) T ss_pred EEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCcccEEEeccccccCCCCCccchhH Confidence 7664321 11 1111111 11 111223333 34443322344567776542 Q ss_pred -HHHHHHH---HHHHHHHHHHHhhcCCceEEEeCCcCCHHHHH--HHHHHHHHHhcCCCcccccc-CCceeeeccCChhh Q lcl|NC_019456. 188 -LSSSLKF---QRSVENFSQNEMEKKDKFVLQYDRSISPEKRQ--AMVNDFLRMVKENGGAVVQE-AGWKVDRYESKFEP 260 (435) Q Consensus 188 -~~~~i~~---~~~~~~~~~~~~~n~~~~~~~~~~~~~~e~~~--~~~~~~~~~~~~~~~~~vl~-~g~~~~~~~~~~~~ 260 (435) +...+.. ..+-......++......+.-. ...+...+ .-...|. ...+.+..++ ++.++.++....-+ T Consensus 233 ~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~--~~~~~~~~~~~~~~~~~---~~~~~i~~~~~~d~k~~q~~~~~~~ 307 (485) T protein:vir:10 233 ELRSMTDAAARILMLMQATAELMGVPQRLIFGI--KPEEIGVDPETGQTLFD---AYLARILAFEDAEGKIQQFSAAELA 307 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcC--Ccccccccccccchhhh---hcccceeccCCCCceEEeecccchH Confidence 2333332 2221111222333222211111 11110000 0011111 1124455554 56777776654432 Q ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccc Q lcl|NC_019456. 261 ADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPG 326 (435) Q Consensus 261 ~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~ 326 (435) .+++..+....+|+..-++|+..+|....+.. +.+++ ...|...+...++.+.. +.. ... T Consensus 308 -~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~-Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~-~~~---~~~ 381 (485) T protein:vir:10 308 -NFTNALDQIAKQVAAYTGLPPQYLSTAADNPA-SAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYR-MMK---GGD 381 (485) T ss_pred -HHHHHHHHHHHHHhcccCCCHHHhccccCchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhC---CCC Confidence 37777778888999999999999987553321 22221 12222233332222211 111 011 Q ss_pred cccCcceeeechhhhhccCHHHHHHHHHHHHhcC--CcCHHHHHHHhCCCCCCCcCCceeeecccc---cchhccccccc Q lcl|NC_019456. 327 KRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNG--IFKPNEIRELEGQAPIPDEAADHLYISKDL---YPLDKYYDAIL 401 (435) Q Consensus 327 ~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~---~~l~~~~~~~~ 401 (435) .......+++.+......+..+.++++.++++.| +++..-+++.+|+.+-+-+...+..--... ..++.+.... T Consensus 382 ~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~- 460 (485) T protein:vir:10 382 VPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLIGTMVDPN- 460 (485) T ss_pred CcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhccC- Confidence 1111234556666677888999999999999866 888888999999875321111111000000 0011111000 Q ss_pred cccccccccccccccCCCCCC-CCCCCCCCCCCCC Q lcl|NC_019456. 402 DNKIQTDASVAAPKQEGGENT-NENGLQSTEPEGS 435 (435) Q Consensus 402 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 435 (435) . ..++..+. .+.+.+..++-|+ T Consensus 461 --~----------~~~~~~~~~~~~~~~~~~~~~~ 483 (485) T protein:vir:10 461 --P----------TVPGSPSPAPAPKPAALESGGD 483 (485) T ss_pred --C----------CCCCCCCccccccCcCCCCCCC Confidence 0 00011111 1111111122222 No 156 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.89 E-value=1.6e-08 Score=63.33 Aligned_cols=370 Identities=9% Similarity=-0.016 Sum_probs=159.7 Q ss_pred ccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHH Q lcl|NC_019456. 19 QIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAF 98 (435) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~ 98 (435) .......+.+-. ..+.+ ...+..-+|+.++..+---.|.+ .....+..+..++. + | ... T Consensus 1 ~l~~~~~~~~~~------------~~~~~-v~n~~~~ivd~~~~~l~~~gf~~---~d~~~~~~~~~i~~-~-N---~~d 59 (434) T protein:vir:98 1 MLPKNAEQAFLD------------FQRKA-RTNFCGLIANASVHRLLALGVTG---PDGEPDTRASRWWQ-A-N---RLD 59 (434) T ss_pred CCCCCccHHHHH------------hhhhh-hccchHHHHHHHHhhhccCceec---CCCchHHHHHHHHH-h-c---Chh Confidence 000000000000 00001 12344556666665443223332 11112222333322 2 2 345 Q ss_pred HHHHHHHHHHHhcCCcceEEeeeCC----CCcEE-EEEEeCCceeEEEEcCCCceE-----EEEEecCCee--EEEc--- Q lcl|NC_019456. 99 EFIARLETDRNVSGNGYAWIQKSLS----TGEPI-ALWPLDPNTVSILRNTDNNSY-----WYRVTSDIYN--FTIP--- 163 (435) Q Consensus 99 ~f~~~~~~~~~~~G~~~~~i~~~~~----~g~~~-~l~~l~~~~v~~~~~~~~~~~-----~~~~~~~~~~--~~~~--- 163 (435) .....+..+++.+|.+|.++..+.. .|.+. .+.+++|..+.+..+...... +|....++.. ..+. T Consensus 60 ~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~~~~~~~~~~~~ 139 (434) T protein:vir:98 60 SRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDIDGFGYARVFFDDT 139 (434) T ss_pred HHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccCCceEEEEEEeCc Confidence 6677889999999999998876432 12222 377789988887776532111 1111111110 0000 Q ss_pred ------------------------------------hhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHH---H Q lcl|NC_019456. 164 ------------------------------------INDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQ---N 204 (435) Q Consensus 164 ------------------------------------~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~---~ 204 (435) .=-|+||++....+. .|.|-+..+...+.....+..... . T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~-~g~sd~e~vi~liDa~~~~~s~~~~~~~ 218 (434) T protein:vir:98 140 SFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGE-DPEPEFAGVLDIQDRVNLGILNRMAASR 218 (434) T ss_pred EEEEEEeeccccccccccccceecccccccccCCCCccceEEeccCCCcCc-CCcchhhhHHHHHHHHHHHHHHHHHHHH Confidence 011444544322223 588888877777776554332222 2 Q ss_pred HhhcCCceEEEeCC-cCCHHHHHHHHHHHHHHhcCCCcccccc-CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCH Q lcl|NC_019456. 205 EMEKKDKFVLQYDR-SISPEKRQAMVNDFLRMVKENGGAVVQE-AGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPI 282 (435) Q Consensus 205 ~~~n~~~~~~~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~vl~-~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~ 282 (435) ++......+.-.+. ...++. ......+.......+++.+++ .+.++.++.....+ .+.+..+..+..|+..-++|+ T Consensus 219 ~~a~p~~~i~G~~~~~~~~~~-~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~~-~~~~~l~~~i~~~~~~~~~p~ 296 (434) T protein:vir:98 219 FSGFRQKWIKGHKFAKRTDPA-TGMTVVDQPFVPSPSAVWASEGENTQFGQLDATDLS-GFLKEHASDVRDMLTISQTPT 296 (434) T ss_pred HhcchhhhhcCCCcccccccc-cccchhhhhhhccccccccCCCCCceEEEecCcchH-HHHHHHHHHHHHHhcccCCCH Confidence 22222222211111 111111 111111222222334555655 45777776654332 377777788899999999999 Q ss_pred HHhCCcccCcccHHHHHHHH---HHHHHhHHHHHHHHHHHHh---hcc-cccccCcceeeechhhhhccCHHHHHHHHHH Q lcl|NC_019456. 283 SFLNDDQAKSTTNVEHVTHS---WTMTLMPIIRQYESQFNMK---LFT-PGKRVKGFYFSFNVNGLLRGDTAARTQYYQT 355 (435) Q Consensus 283 ~~lg~~~~~~~~~~e~~~~~---~~~~i~P~~~~i~~~l~~~---l~~-~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~ 355 (435) ..+|.... +. +.+++..- +...+.-..+.+.+.|.+. ++. .........+++.+......+..+.++++.+ T Consensus 297 ~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~~~~~~~~~v~w~~~~~~s~~~~ada~~k 374 (434) T protein:vir:98 297 YLYATDLV-NI-SADTIGALDILHVAKVREHIASFSEGLESVLALAAAQAGVPEDYTEAEVRWANPAHVTMAVKADAATK 374 (434) T ss_pred HHhccccC-Ch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhheeeeEEecCCCCCCHHHHHHHHHH Confidence 99986322 21 22221111 1111111111122121110 000 0001112235555566778889999999999 Q ss_pred HHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccc-ccccccccccccccccCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 356 LTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAI-LDNKIQTDASVAAPKQEGGENTNENGLQSTEPEG 434 (435) Q Consensus 356 ~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (435) ++..|+ +..-+++++|+++- + -+++. .-...+ ........+ ...+..|.+++ +.+..+| T Consensus 375 l~~~g~-~~e~~~~~lg~~~~--e-~~r~~---------~e~~~~~~~~~~~~~~--~~~~~~g~~~~-----~~~~~dg 434 (434) T protein:vir:98 375 LKSIGY-PLDVIAEELDESPA--R-VRRIV---------AGAASQALLAASLLPA--PGAPSAGNVPD-----SGGAVDG 434 (434) T ss_pred HHhcCC-cHHHHHHhCCCCHH--H-HHHHH---------HHHHHHHHHHHhhhcc--CCCCCCCCCCc-----ccCCCCC Confidence 998885 77778888887642 1 11110 000000 000000000 01111122221 2222233 No 157 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.88 E-value=2.1e-08 Score=62.71 Aligned_cols=393 Identities=13% Similarity=0.070 Sum_probs=181.9 Q ss_pred CchHHHHHhhcccccccc----ccc-------cccchhh-------hhhccccccCcccc------cHHHHhhhHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQAN----QIV-------QNPIPQP-------LDMAGVKLEQATFS------REHILESNEYIFSI 56 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~----~~~-------~~~~~~~-------~~~~~~~~~~~~~~------~~~~~~~~~~v~~~ 56 (435) ||+|+++++++..-.... ... -...+.. ..+........... ..........-..+ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i 80 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLA 80 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchHHHH Confidence 999999998875421111 000 0000100 00111111100000 00011222333445 Q ss_pred HHHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCc Q lcl|NC_019456. 57 VTRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPN 136 (435) Q Consensus 57 i~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~ 136 (435) ++.+|+-+..=|..+.-++... ...+..++ .......-....+.+....|.+++.+..+. |.+ .+..++|. T Consensus 81 ~~~~A~ll~~e~~~i~~~d~~~-~e~l~~i~-----~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~--~~~-~i~~v~ad 151 (505) T protein:vir:79 81 SAKLASLIFNEQCQVTVSDETA-NDFLDDVF-----QQNDFYTTFEEKLEEWIALGSGCVRPYVDS--GKI-KLAWATAD 151 (505) T ss_pred HHHHHhhhcCCCceeecCChHH-HHHHHHHH-----HhccHHHHHHHHHHHHhhcCCeEEEEEEeC--Cce-EEEEEcCC Confidence 5666665554443433232211 11222222 222345566778888888999988887763 433 34455555 Q ss_pred eeEEEE-cCCC-----------------ceEEE------------E-----Eec-C----Cee---------------EE Q lcl|NC_019456. 137 TVSILR-NTDN-----------------NSYWY------------R-----VTS-D----IYN---------------FT 161 (435) Q Consensus 137 ~v~~~~-~~~~-----------------~~~~~------------~-----~~~-~----~~~---------------~~ 161 (435) .+-+.. +.++ ..+|. . +.. + |.. .+ T Consensus 152 ~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~ 231 (505) T protein:vir:79 152 QVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVK 231 (505) T ss_pred eeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCccee Confidence 544432 1111 11110 0 000 0 000 00 Q ss_pred E---chhheEEeccCCCc----cccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCC------cCCHHHHHHH Q lcl|NC_019456. 162 I---PINDVIHVKHVVPS----NSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDR------SISPEKRQAM 228 (435) Q Consensus 162 ~---~~~~iih~~~~~~~----~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~------~~~~e~~~~~ 228 (435) + +.--..||+.+.++ ..+.|+|.+..+...+...+..-....+-|+.+-..++.-.. .-..+..... T Consensus 232 ~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~~~ 311 (505) T protein:vir:79 232 ITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQASETH 311 (505) T ss_pred ecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCccccccc Confidence 0 11123355543222 346799999999988887766554444455555333222111 1111100000 Q ss_pred HHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH--------- Q lcl|NC_019456. 229 VNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV--------- 299 (435) Q Consensus 229 ~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------- 299 (435) ...|....+--..+..-+++..++.++.....-++.+..+...++|+...|+++..++....+.. ++.+. T Consensus 312 ~~~fd~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~-TAtei~s~~~~l~~ 390 (505) T protein:vir:79 312 PPMFDPDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQ-TATEVVTNNSQTYQ 390 (505) T ss_pred ccCCCccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccc-hHHHHHHHHhHHHH Confidence 00000000000011111234467777777766778888888889999999999999987655432 22221 Q ss_pred -----HHHHHHHHhHHHHHHHHHHHHhhcccc------cccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHH Q lcl|NC_019456. 300 -----THSWTMTLMPIIRQYESQFNMKLFTPG------KRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIR 368 (435) Q Consensus 300 -----~~~~~~~i~P~~~~i~~~l~~~l~~~~------~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R 368 (435) ...++.+|..++..+......-.+... .......+.+++++-...|.++.++...+++.+|+|+.-+++ T Consensus 391 t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~Gi~s~e~~l 470 (505) T protein:vir:79 391 TRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQVMPKKQFL 470 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHH Confidence 111233333333333322221111111 112234678888898899999999999999999999999888 Q ss_pred HHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCC Q lcl|NC_019456. 369 ELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGE 420 (435) Q Consensus 369 ~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (435) ... +-++++.+++-+ ..+ ..++. ...+....-||| T Consensus 471 ~~~--~~~~eeea~~el--------~ri---~~E~~----~~~p~~~~~gg~ 505 (505) T protein:vir:79 471 MRN--YGLDEEEADEWL--------AQI---DAENS----TAEPEFNQFGGD 505 (505) T ss_pred Hhc--CCCChHHHHHHH--------HHH---HHhcc----ccCCCchhccCC Confidence 764 223333332211 111 11111 111233455566 No 158 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=98.86 E-value=2.6e-08 Score=62.18 Aligned_cols=404 Identities=11% Similarity=0.007 Sum_probs=164.3 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccH---HHHhhhHHHHHHHHHHHHHHhhCceeeeecccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSRE---HILESNEYIFSIVTRLSNVLASLPLHEYQNYKQ 77 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 77 (435) .-+.+++.+.+....+.-. ....++.--..-......... .....+.+..-+|+.++..+--..|.+-. .. T Consensus 14 ~~~~~~l~~~~~~~~~rl~----~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~--~~ 87 (484) T protein:vir:77 14 EKAREEMLNLFTERTQDLG----DNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQELEGFRLGG--AD 87 (484) T ss_pred HHHHHHHHHHHHHHHHHHH----HHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHHHHHhhhccCceecCC--cc Confidence 2233333332221111000 001111100000000000001 11112344555666666555434444321 11 Q ss_pred cccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcE-------EEEEEeCCceeEEEEcCCCceE- Q lcl|NC_019456. 78 MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEP-------IALWPLDPNTVSILRNTDNNSY- 149 (435) Q Consensus 78 ~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~-------~~l~~l~~~~v~~~~~~~~~~~- 149 (435) ..+..+..++ ...........+..+++.+|.||..|.++.. |.+ ..+.+++|..+.+..+...... T Consensus 88 ~~~~~l~~i~-----~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~-~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~ 161 (484) T protein:vir:77 88 KADEQLWDWW-----QANDLDIESTLGHTDSLVHGRSYITISKPDP-NIDPGVDPEVPIIRVEPPTNLYAQIDPRTRQVM 161 (484) T ss_pred hhHHHHHHHH-----HhcCHhHHHHHHHHHHhhcCceEEEEecCCC-CcccccccccceEEEeccceeEEEecCCCCceE Confidence 1122233322 2234556778899999999999998876543 332 2467788888877665431110 Q ss_pred ---EEEEec-CCee---EEEchh-------------------------heEEeccCCCccccccCcHHHH-HHHHHHHHH Q lcl|NC_019456. 150 ---WYRVTS-DIYN---FTIPIN-------------------------DVIHVKHVVPSNSWYGVSPIDV-LSSSLKFQR 196 (435) Q Consensus 150 ---~~~~~~-~~~~---~~~~~~-------------------------~iih~~~~~~~~~~~G~s~l~~-~~~~i~~~~ 196 (435) .+.... ++.. ..|.++ -|++|.+........|.|.+.. +...+.... T Consensus 162 ~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~ 241 (484) T protein:vir:77 162 RAIRAIEDEEGNEVIGATLYLPNNTVIWNREDGQWVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAA 241 (484) T ss_pred EEEEEEEeecCCcEEEEEEEecCeEEEEEecCCceEeeccccCCCCCcceEEeccccccCccCCcccchHHHHHHHHHHH Confidence 000000 1100 011111 1355554333445577776542 222223222 Q ss_pred ---HHHHHHHHHhhcCCceEEEeC-CcCCHHHHHHHHHHHHHHhcCCCcccccc-CCceeeeccCChhhHHHHHHHHHHH Q lcl|NC_019456. 197 ---SVENFSQNEMEKKDKFVLQYD-RSISPEKRQAMVNDFLRMVKENGGAVVQE-AGWKVDRYESKFEPADLSSVEQISR 271 (435) Q Consensus 197 ---~~~~~~~~~~~n~~~~~~~~~-~~~~~e~~~~~~~~~~~~~~~~~~~~vl~-~g~~~~~~~~~~~~~~~~e~~~~~~ 271 (435) +-......++......++-.. .....+. .+-...|. ...+.++.++ ++.++.++....-+ .+++..+... T Consensus 242 ~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~-~~~~~~~~---~~~~~~~~~~~~~~~~~q~~~~~~e-~~~~~l~~~i 316 (484) T protein:vir:77 242 RTLMLMQATAELMGVPQRLLFGVKGEELGVDP-ETGQTLFD---AYLARILAFEDHESKAQQFSAAELR-NFVDALDALD 316 (484) T ss_pred HHHHHHHHHHHhhhhhHHHHhCCCcchhcccc-cccchhhh---hhhhhhcccCCCCceeEeecCCChH-HHHHHHHHHH Confidence 211122223322222221111 0111010 00111121 1123455555 46788777655433 3777777888 Q ss_pred HHHHHHhCCCHHHhCCcccCcccHHHHHHH--------------HHHHHHhHHHHHHHHHHHHhhcccccccCcceeeec Q lcl|NC_019456. 272 IRIATAFNVPISFLNDDQAKSTTNVEHVTH--------------SWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFN 337 (435) Q Consensus 272 ~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~--------------~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd 337 (435) ..|+..-++|+..+|+...+. ++.+++.. .|...+.-.++.+....+. .........+++. T Consensus 317 ~~~s~~~~~p~~~fg~~~~n~-~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~----~~~~~~~~~i~v~ 391 (484) T protein:vir:77 317 RKAAAYTGLPPYYLSFSSENP-ASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMNG----GDIPPEYYRMESI 391 (484) T ss_pred HHHhcccCCCHHHhccccCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCcccccccceEE Confidence 889999999999998755331 22222211 1222222222222111110 0001111235555 Q ss_pred hhhhhccCHHHHHHHHHHHHhcC--CcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccc Q lcl|NC_019456. 338 VNGLLRGDTAARTQYYQTLTRNG--IFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPK 415 (435) Q Consensus 338 ~~~l~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~ 415 (435) +......+..+.++.+.+++++| +++..-+++++|+.+-+.+......-........ .........++..+ T Consensus 392 w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~-~~~~~~~~~~~~~~------ 464 (484) T protein:vir:77 392 WRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLG-LMGTMFGTDPSGGG------ 464 (484) T ss_pred ecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHH-HHhhhccccccCCC------ Confidence 56666788899999999999876 8888889999988653322111110000000000 00000000000000 Q ss_pred cCCCCCCCCC-CCCCCCCCCC Q lcl|NC_019456. 416 QEGGENTNEN-GLQSTEPEGS 435 (435) Q Consensus 416 ~~~~~~~~~~-~~~~~~~~~~ 435 (435) .++..+++ +.+..+++-+ T Consensus 465 --~~~~~~~~~~~~~~~~~~~ 483 (484) T protein:vir:77 465 --NPDNPETPEPQPNPAEEAA 483 (484) T ss_pred --CCCCCCcccccCCCccccC Confidence 00001111 1111111111 No 159 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=98.83 E-value=3.2e-08 Score=61.66 Aligned_cols=403 Identities=10% Similarity=0.004 Sum_probs=176.3 Q ss_pred CchHHHHHh----hcccccccccccccc--------chhhhh-hc--ccc--ccCcccccHHHHhhhHHHHHHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQ----FFGVHDQANQIVQNP--------IPQPLD-MA--GVK--LEQATFSREHILESNEYIFSIVTRLSNV 63 (435) Q Consensus 1 Mg~~~~~~~----~~~~~~~~~~~~~~~--------~~~~~~-~~--~~~--~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 63 (435) ||+|+.+++ |+............. ....+. .. +.. ..+...+. ...+..+.-..+++.+|+- T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~-~~~~~~~l~~~i~~~~A~l 79 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVH-DKLMNSGTGNEIVVVAAEY 79 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccc-cccccCChHHHHHHHHHHh Confidence 999987765 444433211111100 000000 00 000 00011111 1122333445567777777 Q ss_pred HhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEc Q lcl|NC_019456. 64 LASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRN 143 (435) Q Consensus 64 ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~ 143 (435) +..=+..+.=.+....+++.++..+.+--.......-+...+......|.+++.+..+. |++ .+..+++..+-+... T Consensus 80 l~~e~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~--~~~-~i~~v~ad~~~P~~~ 156 (518) T protein:vir:78 80 ISGKPLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINILN--GRP-SISVHSSSQFWIDFK 156 (518) T ss_pred hcCCCceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEEC--Cee-EEEEEcCCeeEEEee Confidence 76544433211111111111111111111223345555667788888899887776653 443 455556655544321 Q ss_pred C---------------CCceEEEEE-------------------------ecC-CeeEE-------------Echh---- Q lcl|NC_019456. 144 T---------------DNNSYWYRV-------------------------TSD-IYNFT-------------IPIN---- 165 (435) Q Consensus 144 ~---------------~~~~~~~~~-------------------------~~~-~~~~~-------------~~~~---- 165 (435) . +...+|+.. ..+ +.... ...+ T Consensus 157 ~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e 236 (518) T protein:vir:78 157 NNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDIQL 236 (518) T ss_pred cCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCcccccccccccccccccccccccCcc Confidence 1 111111100 000 00000 0000 Q ss_pred -----------heEEeccCCCc----cccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCHH-HHHHHH Q lcl|NC_019456. 166 -----------DVIHVKHVVPS----NSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISPE-KRQAMV 229 (435) Q Consensus 166 -----------~iih~~~~~~~----~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~e-~~~~~~ 229 (435) -+.|++...++ +.+.|+|.+..+...+...+..-.....-|+.+-..+......+... ...... T Consensus 237 ~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~ 316 (518) T protein:vir:78 237 NHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEKTKTKIAASERMFRKKVNKSTDK 316 (518) T ss_pred ceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHhCCceeeechhHhccCCCCCCCc Confidence 01223322111 34669999999999998877655555555666543333222111000 000000 Q ss_pred HHHHHHhcCCCccc-----cccCCc----eeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHH---H Q lcl|NC_019456. 230 NDFLRMVKENGGAV-----VQEAGW----KVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNV---E 297 (435) Q Consensus 230 ~~~~~~~~~~~~~~-----vl~~g~----~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~---e 297 (435) ..+. +....+.. -.+.|. .++.++....+.++.+..+...+.|...+|++|..+|......++++ + T Consensus 317 ~~~~--fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~TATei~s~ 394 (518) T protein:vir:78 317 EEWS--MNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNREVKATEIWSL 394 (518) T ss_pred cccc--cCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccccccHHHHHHH Confidence 0000 00000100 112222 36677777777788888888899999999999999986433322321 1 Q ss_pred HHH---------HHHHHHHhHHHHHHHHHHHHhhccc---ccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHH Q lcl|NC_019456. 298 HVT---------HSWTMTLMPIIRQYESQFNMKLFTP---GKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPN 365 (435) Q Consensus 298 ~~~---------~~~~~~i~P~~~~i~~~l~~~l~~~---~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~N 365 (435) .+. ..+..+|.-++..+...+.. ++.. ........+.+++++....|.++.++...+++.+|+|++- T Consensus 395 ~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~-~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~aGimS~e 473 (518) T protein:vir:78 395 QDATVRKIEKKKRLIQNVYEQMLWDFLYLLTG-GTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNSALAMSVE 473 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHH Confidence 111 11122222222222222221 1111 0111234688888999999999999999999999999999 Q ss_pred HHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCC Q lcl|NC_019456. 366 EIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNEN 425 (435) Q Consensus 366 E~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (435) ++-+++... ..++..++-+ ..+. . .+.. .....|..-+|.+..+- T Consensus 474 ~~i~~~~~~-~~deea~~e~-----~ri~---~---E~~~---~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 474 EKVKLIHPK-WEDEEIQAEV-----KRIY---L---ENAI---GEVPDPEAIGGMETKGG 518 (518) T ss_pred HHHHHhCCC-CCHHHHHHHH-----HHHH---H---Hhcc---cCCCCCccccCCCCCCC Confidence 865555322 2333333211 1111 0 0000 00011111111111111 No 160 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=98.83 E-value=3.3e-08 Score=61.60 Aligned_cols=348 Identities=9% Similarity=0.026 Sum_probs=159.5 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccC-cccccHHH---H-hhhHHHHHHHHHHHHHHhhCceeeeecc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQ-ATFSREHI---L-ESNEYIFSIVTRLSNVLASLPLHEYQNY 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~---~-~~~~~v~~~i~~ia~~ia~~~~~~~~~~ 75 (435) ..++++|...+.......... ..++..-. .... ......+. + ....+..-+|+.+|..+.=-.|.. T Consensus 3 ~~~i~~L~~~~~~~~~r~~~~----~~yY~g~~-~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~---- 73 (409) T protein:vir:16 3 EKGIGYLRFKLSVHKRRAEMR----YEQYAMKH-VDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFEN---- 73 (409) T ss_pred HHHHHHHHHHHHHHhHHHHHH----HHHHhccC-chhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhcccccccC---- Confidence 233555554433222111111 01111000 0000 00011100 0 112344445555554333222221 Q ss_pred cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceE-----E Q lcl|NC_019456. 76 KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSY-----W 150 (435) Q Consensus 76 ~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~-----~ 150 (435) .+..+-..+ ...+.......+..+.+.+|.||+.|..+. .|.| .+.+++|.++....|+..... + T Consensus 74 ---~d~~l~~i~-----~~N~ld~~~~~~~~~al~yG~sf~~v~~~~-dg~~-~i~~~sP~~~~~i~D~~~~~~~~a~~~ 143 (409) T protein:vir:16 74 ---DDFTVNEIF-----EENNPDIFFDSTVLSALIASCSFTYISKGE-NDAV-RLQVIEATNATGIIDPITGLLTEGYAV 143 (409) T ss_pred ---cchHHHHHH-----HhcChhHHHHHHHHHHHHhCceeEEEecCC-CCce-EEEEEcccceEEEeecccccceeeeEE Confidence 112232222 223344566788899999999999888654 4664 677888888877765532211 0 Q ss_pred EEEecCCee---EEEchhh----------------------eEEeccCCCccccccCcHH----HHHHHHHHHHHHHHHH Q lcl|NC_019456. 151 YRVTSDIYN---FTIPIND----------------------VIHVKHVVPSNSWYGVSPI----DVLSSSLKFQRSVENF 201 (435) Q Consensus 151 ~~~~~~~~~---~~~~~~~----------------------iih~~~~~~~~~~~G~s~l----~~~~~~i~~~~~~~~~ 201 (435) +.....+.. ..+.+++ |++|.+.......+|.|-+ ..+.+.+.....-... T Consensus 144 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~ 223 (409) T protein:vir:16 144 LERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADV 223 (409) T ss_pred EEecCCCceEEEEEEecCcEEEEEecCccccceecCCCCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHHH Confidence 110011111 0111222 4444443333556787744 3444444444443444 Q ss_pred HHHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCCCcccccc-----CCceeeeccCChhhHHHHHHHHHHHHHHHH Q lcl|NC_019456. 202 SQNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQE-----AGWKVDRYESKFEPADLSSVEQISRIRIAT 276 (435) Q Consensus 202 ~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~-----~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~ 276 (435) ...|+.+..+.++-.+.+-.+ .+ .|+.. .++++.++ .+.++.+++...-. .|.+..+....++|. T Consensus 224 ~~e~~a~pqr~i~G~d~d~~~--~~----~~~~~---~~~i~~~~~d~~g~~~~v~q~~~~~l~-~~~~~l~~~~~~~a~ 293 (409) T protein:vir:16 224 TAEFYSFPQKYVTGLSDDAEP--ME----TWKAT---VSSMLQFTKDEDGDKPTLGQFTQPSMS-PFTEQLRTAAAGFAG 293 (409) T ss_pred HHHHhcChhheeEecCCCCCc--cc----hhhhh---hhHhhccCCCCCCCCceEEecCCCChh-HHHHHHHHHHHHHhh Confidence 555666655555544322111 11 22211 13344443 33566666554332 588999999999999 Q ss_pred HhCCCHHHhCCcccC-ccc-HHHHHHHHHHHHHhHHHHHHHHHHH----Hhhcccc---c-ccCcceeeechhhhh---c Q lcl|NC_019456. 277 AFNVPISFLNDDQAK-STT-NVEHVTHSWTMTLMPIIRQYESQFN----MKLFTPG---K-RVKGFYFSFNVNGLL---R 343 (435) Q Consensus 277 ~fgvP~~~lg~~~~~-~~~-~~e~~~~~~~~~i~P~~~~i~~~l~----~~l~~~~---~-~~~g~~i~fd~~~l~---~ 343 (435) .-++|+..+|....+ .++ ...+...-+...+.-..+.+.+.+. .-+.... . ......+++.+..+. . T Consensus 294 ~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~v~W~~~~~~~~ 373 (409) T protein:vir:16 294 ETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLREQFSKTKPKWEPLFEADA 373 (409) T ss_pred hcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhccceEEecCCCCcch Confidence 999999999987643 111 1111111111111111111111111 1111000 0 011122333334333 3 Q ss_pred cCHHHHHHHHHHHHhcC-Cc-CHHHHHHHhCCCCCC Q lcl|NC_019456. 344 GDTAARTQYYQTLTRNG-IF-KPNEIRELEGQAPIP 377 (435) Q Consensus 344 ~d~~~~~~~~~~~~~~g-~~-t~NE~R~~~g~~p~~ 377 (435) .+....++.+.|+++.| .+ ..+-+++++|+..-+ T Consensus 374 ~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 374 SMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred hhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 44677889999999997 33 346679999998642 No 161 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=98.81 E-value=5.4e-10 Score=71.38 Aligned_cols=187 Identities=9% Similarity=0.006 Sum_probs=96.1 Q ss_pred EEEeCC---cCCHHHHHHHHHHHHH--HhcC-CCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhC Q lcl|NC_019456. 213 VLQYDR---SISPEKRQAMVNDFLR--MVKE-NGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLN 286 (435) Q Consensus 213 ~~~~~~---~~~~e~~~~~~~~~~~--~~~~-~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg 286 (435) |++.++ .++.. ..++++++.. ..++ .+.+.+...+-+|+.++.+... +.++.....+.||++.|||...|- T Consensus 1 V~k~~~l~~~~~~~-~~~~~~r~~~~~~~~~~~~~~~ld~~~e~~e~~~~~lsG--l~d~l~~~~~~iaa~s~iP~t~Lf 77 (201) T protein:vir:10 1 MWKAKGLADLCDDS-DGAARLRLAQVDNNSGVGQAIGIDADSEEYNVLNSDIGG--IDTFLSQKFDRIVALSGIHEIILK 77 (201) T ss_pred CccchHHHHHhcCC-hHHHHHHHHHHHHhhhhhhhheeecCCcceeeeecCcCC--hHHHHHHHHHHHHhHhcCchhhhc Confidence 333322 11111 1233344432 2222 3445555566889998888764 557777888999999999999998 Q ss_pred CcccCccc-HHHHHHHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHH-------HHHHHHHHh Q lcl|NC_019456. 287 DDQAKSTT-NVEHVTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAAR-------TQYYQTLTR 358 (435) Q Consensus 287 ~~~~~~~~-~~e~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~-------~~~~~~~~~ 358 (435) +.+.++.+ +.+.-...|.+.|.-.++.....+..+|+.-..+..++.|+| .+|...+.+++ ++.+.++++ T Consensus 78 G~sp~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~~~~~~~~~~~~f--~pL~~~s~kekAei~~~~a~a~~~~~~ 155 (201) T protein:vir:10 78 GKNVGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLPFIVTEQEWSVEF--NPLSQVSDKDKSEILEKNVNSVAALIA 155 (201) T ss_pred CCCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCceEee--CCCCCCCHHHHHHHHHHHHHHHHHHHH Confidence 88877664 334333334333333333222222222222222334555554 57777665554 566888999 Q ss_pred cCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCC Q lcl|NC_019456. 359 NGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQ 428 (435) Q Consensus 359 ~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (435) +|+++++|+|+.|--.+. .+ +++.+.+..+. ...+..++.+.+++. T Consensus 156 ~g~i~~~e~r~~L~~~~~--~~----~~~~~~~~~~~------------------~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 156 AGIIDADEARDTLRAIST--EV----KIGEGSIQTEV------------------VINESEDPLDVSANN 201 (201) T ss_pred cCCCCHHHHHHHHHhcCC--cC----CCCCCCCCccc------------------cccccCCCCCCCCCC Confidence 999999999998854432 11 11111111000 000011111111111 No 162 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.81 E-value=6.9e-09 Score=65.34 Aligned_cols=401 Identities=11% Similarity=0.028 Sum_probs=160.9 Q ss_pred CchHHHHHhhccccccccccccccchhhhhh------ccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDM------AGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN 74 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 74 (435) +.++++|.+.+......-... ..++.. .+... ..-.........+...+|+.+++.+---.|.+-.. T Consensus 10 ~~~i~~L~~~~~~~~~r~~~~----~~Yy~g~~~i~~~~~~~---~~~~~~~~~~~n~~~~ivd~~a~~l~~~Gf~~~~~ 82 (488) T protein:vir:23 10 EKLRDQLLDAFENKQNELKSS----KAYYDAERRPDAIGLAV---PLDMRKYLAHVGYPRTYVDAIAERQELEGFRIPSA 82 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHH----HHHHhcccchhhcCccc---chhhhhhhhhcchHHHHHHHHHHhhhccceeccCC Confidence 444444444332221111000 011110 00000 00000111223445556666665443334443211 Q ss_pred --------ccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeC-------CCCcEEEEEEeCCceeE Q lcl|NC_019456. 75 --------YKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSL-------STGEPIALWPLDPNTVS 139 (435) Q Consensus 75 --------~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~-------~~g~~~~l~~l~~~~v~ 139 (435) +.......+...+ ...........+..+++.+|.||.++..+. ..|. ..+.+++|..+. T Consensus 83 ~~~~~~~~~d~~~~~~l~~i~-----~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~-~~i~~~~p~~~~ 156 (488) T protein:vir:23 83 NGEEPESGGENDPASELWDWW-----QANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEV-PLIRVEPPTALY 156 (488) T ss_pred cccccccccchhHHHHHHHHH-----HhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCc-ceEEEeccceeE Confidence 1111111222222 222456677888999999999998876532 1121 236677888777 Q ss_pred EEEcCC-CceEE---EEEecC-Cee---EEEchhh-------------------------eEEeccCCCccccccCcHHH Q lcl|NC_019456. 140 ILRNTD-NNSYW---YRVTSD-IYN---FTIPIND-------------------------VIHVKHVVPSNSWYGVSPID 186 (435) Q Consensus 140 ~~~~~~-~~~~~---~~~~~~-~~~---~~~~~~~-------------------------iih~~~~~~~~~~~G~s~l~ 186 (435) +..+.. +...+ +....+ +.. ..|.++. |++|++.......+|.|-+. T Consensus 157 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~ 236 (488) T protein:vir:23 157 AEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGTSEIS 236 (488) T ss_pred EEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcceEEeccccccCCcCCccchh Confidence 666542 11111 111111 111 1222222 34444332334456777654 Q ss_pred H-HHHHHHHHHHH-HH--HHHHHhhcCCceEEEeCCcCCHHHH--HHHHHHHHHHhcCCCccccccCC--ceeeeccCCh Q lcl|NC_019456. 187 V-LSSSLKFQRSV-EN--FSQNEMEKKDKFVLQYDRSISPEKR--QAMVNDFLRMVKENGGAVVQEAG--WKVDRYESKF 258 (435) Q Consensus 187 ~-~~~~i~~~~~~-~~--~~~~~~~n~~~~~~~~~~~~~~e~~--~~~~~~~~~~~~~~~~~~vl~~g--~~~~~~~~~~ 258 (435) . +...+...... .+ ....++......+. +....+... ..-...|.. ..+++..++.| .++.++.... T Consensus 237 ~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G~~~~~~~~~~~~~~~~~~~---~~~~v~~~~~g~~~~~~q~~~~~ 311 (488) T protein:vir:23 237 PELRSVTDAAAQILMNMQGTANLMAIPQRLIF--GAKPEELGINAETGQRMFDA---YMARILAFEGGEGAHAEQFSAAE 311 (488) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHh--CCCcccccccccccchhhhh---hhhhhccCCCCCCceeEecCCCC Confidence 2 22223222211 11 11122222111111 111111100 000111111 12456666655 5666665544 Q ss_pred hhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH--------------HHHHHHHhHHHHHHHHHHHHhhcc Q lcl|NC_019456. 259 EPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT--------------HSWTMTLMPIIRQYESQFNMKLFT 324 (435) Q Consensus 259 ~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~ 324 (435) .+ .+++..+....+|+..-++|+..+|....+. ++.+++. ..|...+.-.++.+...+...-. T Consensus 312 ~~-~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~- 388 (488) T protein:vir:23 312 LR-NFVDALDALDRKAASYSGLPPQYLSSSSDNP-ASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMVKGGDI- 388 (488) T ss_pred hH-HHHHHHHHHHHHHhcccCCCHHHhccccCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc- Confidence 33 3777778888999999999999998755432 2222221 12222233333322221111100 Q ss_pred cccccCcceeeechhhhhccCHHHHHHHHHHHHhcC--CcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccccc Q lcl|NC_019456. 325 PGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNG--IFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILD 402 (435) Q Consensus 325 ~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~ 402 (435) ......+++.+......+..+.++.+.+++++| +++..-+++++|+-+-+.+..+...-.......+.. +.... T Consensus 389 ---~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~-~~~~~ 464 (488) T protein:vir:23 389 ---PTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQKQGLGLI-GSLYG 464 (488) T ss_pred ---chhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHHHHHHHHH-HHHhc Confidence 011123444445556778888999999999876 788888999998754322111111000000000000 00000 Q ss_pred ccccccccccccccCCCCCCCCCCCCCCC Q lcl|NC_019456. 403 NKIQTDASVAAPKQEGGENTNENGLQSTE 431 (435) Q Consensus 403 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (435) ..... ....+...|+..+++ +... T Consensus 465 ~~~~~--~~~~~~~~~~~~~~e---~~~a 488 (488) T protein:vir:23 465 ASTPE--GKPGEAPVGEPPAPE---PDAA 488 (488) T ss_pred cCCCc--ccCCCCCCCCCCCCC---CCCC Confidence 00000 000011111111111 1111 No 163 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=98.76 E-value=6e-08 Score=60.19 Aligned_cols=397 Identities=13% Similarity=0.032 Sum_probs=160.3 Q ss_pred Cch---------HHHHHhhccccccccccccccchhhhh------hccccccCcccccHHHHhhhHHHHHHHHHHHHHHh Q lcl|NC_019456. 1 MSF---------MSKVRQFFGVHDQANQIVQNPIPQPLD------MAGVKLEQATFSREHILESNEYIFSIVTRLSNVLA 65 (435) Q Consensus 1 Mg~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia 65 (435) +|+ ++.|.+.+......-. ....++. ..+..... ........+.+...+|+..+..+. T Consensus 6 ~~~~~~~~~~~~~~~L~~~~~~~~~r~~----~~~~YY~G~~~i~~~~~~~~~---~~~~~~~~~n~~~~ivd~~~~~l~ 78 (485) T protein:vir:24 6 PGQEEIADPAIARDEMVSAFEDQNQNLR----SNTSYYEAERRPEAIGVTVPV---QMQSLLAHVGYPRLYVDSIAERQA 78 (485) T ss_pred CCCCcccchHHHHHHHHHHHHHHHHHHH----HHHHHHhccCchhhcCcccch---hhhhhhhccchHHHHHHHHhhhhc Confidence 111 0111111100000000 0000110 00000000 000111122344556666665554 Q ss_pred hCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcE-------EEEEEeCCcee Q lcl|NC_019456. 66 SLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEP-------IALWPLDPNTV 138 (435) Q Consensus 66 ~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~-------~~l~~l~~~~v 138 (435) -.+|.+-. ....+..+..++. + | ........+..+++.+|.||.++..+.. +.. ..+.+++|..+ T Consensus 79 ~~g~~~~~--~~~~~~~l~~i~~-~-N---~~d~~~~~~~~~a~i~G~ay~~v~~~~~-~~~~~~~~~~~~i~~~~p~~~ 150 (485) T protein:vir:24 79 VEGFRLGD--ADEADEELWQWWQ-A-N---NLDIEAPLGYTDAYVHGRSYITISRPDP-QIDLGWDPNVPLIRVEPPTRM 150 (485) T ss_pred cCceecCC--CchhHHHHHHHHH-h-c---ChhHHHHHHHHHHhhcCceEEEEecCCc-ccccccCCCcceEEEecccee Confidence 44554321 1112222333332 1 2 3456678999999999999998876533 211 24777888887 Q ss_pred EEEEcCCCc-eE---EEEEecC-Cee---EEEchhh-------------------------eEEeccCCCccccccCcHH Q lcl|NC_019456. 139 SILRNTDNN-SY---WYRVTSD-IYN---FTIPIND-------------------------VIHVKHVVPSNSWYGVSPI 185 (435) Q Consensus 139 ~~~~~~~~~-~~---~~~~~~~-~~~---~~~~~~~-------------------------iih~~~~~~~~~~~G~s~l 185 (435) .+..+.... .. .+..... +.. ..|..+. |++|++.....+.+|.|-+ T Consensus 151 ~~i~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i 230 (485) T protein:vir:24 151 YAEIDPRIGRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEI 230 (485) T ss_pred EEEeeCCcCceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCcccEEEeccCcccCCcCCcccc Confidence 777654321 11 0001111 100 1122222 3444433333456787766 Q ss_pred HH-HHHHHHHHHHH-HH--HHHHHhhcCCceEEEeCCcCCHHHHH--HHHHHHHHHhcCCCcccccc-CCceeeeccCCh Q lcl|NC_019456. 186 DV-LSSSLKFQRSV-EN--FSQNEMEKKDKFVLQYDRSISPEKRQ--AMVNDFLRMVKENGGAVVQE-AGWKVDRYESKF 258 (435) Q Consensus 186 ~~-~~~~i~~~~~~-~~--~~~~~~~n~~~~~~~~~~~~~~e~~~--~~~~~~~~~~~~~~~~~vl~-~g~~~~~~~~~~ 258 (435) .. +...+.....+ .+ ....++.. +..++. +....+...+ .-...|. ...+.+..++ ++.++.++.... T Consensus 231 ~~~v~~liDa~~~~~s~~~~~~~~~a~-p~~~i~-G~~~~~~~~~~~~~~~~~~---~~~~~i~~~~~~~~~~~q~~~~~ 305 (485) T protein:vir:24 231 TPELRSMTDAAARILMLMQATAELMGV-PQRLIF-GIKPEEIGVDPETGQTLFD---AYLARILAFEDAEGKIQQFSAAE 305 (485) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhcc-hhhhhc-cCCccccccccccccchhh---hcccceeccCCCCceEEeecccc Confidence 53 33333332221 11 12222222 222221 1111110000 0011111 1234455554 567777766544 Q ss_pred hhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcc Q lcl|NC_019456. 259 EPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFT 324 (435) Q Consensus 259 ~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~ 324 (435) .+ .+.+..+....+++..-++|+..+|....+.. +.+++ ...|...+...++.+....+. T Consensus 306 ~e-~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~-Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~---- 379 (485) T protein:vir:24 306 LA-NFTNALDQIAKQVAAYTGLPPQYLSTAADNPA-SAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKG---- 379 (485) T ss_pred hH-HHHHHHHHHHHHHhcccCCCHHHhccccCcch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---- Confidence 33 36777777788888889999999986653321 22221 222233333333332221111 Q ss_pred cccccCcceeeechhhhhccCHHHHHHHHHHHHhcC--CcCHHHHHHHhCCCCCCCcCCceeeeccc---ccchhccccc Q lcl|NC_019456. 325 PGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNG--IFKPNEIRELEGQAPIPDEAADHLYISKD---LYPLDKYYDA 399 (435) Q Consensus 325 ~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~~~~gd~~~~~~n---~~~l~~~~~~ 399 (435) .........+++.+......+..+.++.+.+++.+| +++..-+++++|+.+-+.+...+..--.. ...++..... T Consensus 380 ~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~ 459 (485) T protein:vir:24 380 GDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDA 459 (485) T ss_pred CCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHHhhccc Confidence 000111123455555566778888999999998866 67777788888886432111111100000 0000110000 Q ss_pred cccccccccccccccccCCCCCCCCCC-CCCCC-CCCC Q lcl|NC_019456. 400 ILDNKIQTDASVAAPKQEGGENTNENG-LQSTE-PEGS 435 (435) Q Consensus 400 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~ 435 (435) ....+ ..++.++.++ ++.++ .++. T Consensus 460 ----------~~~~~--~~~~~~e~~~~~~~~~~~~~a 485 (485) T protein:vir:24 460 ----------DPTVP--GSPNPTPAPKPQPAIEGGDSA 485 (485) T ss_pred ----------CCCCC--CCCCCCCCCCCccCCCCCCCC Confidence 00000 0001111111 11111 1222 No 164 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.76 E-value=9e-09 Score=64.70 Aligned_cols=381 Identities=9% Similarity=-0.003 Sum_probs=168.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhh------ccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDM------AGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN 74 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 74 (435) +-+.+++.+........-. ....++.. .+..... ..-.......+.+...+|+..+..+-.-||.+... T Consensus 7 ~~~~~~l~~~~~~~~~r~~----~l~~Yy~g~~~i~~~~~~~~~-~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~ 81 (456) T protein:vir:79 7 AEWLPVLTKRIDDGMSRVR----LLARYSNGDAPLPELTRNTSA-AWRSFQREARTNWGLMVRDSVADRIIPNGITVGGS 81 (456) T ss_pred HHHHHHHHHHHHHHHHHHH----HHHHHHhccCChhhcCcccCh-hhchhhhhhhcchHHHHHHHHHhhhccCCeecCCC Confidence 3333333332211111000 00011110 0000000 00000001123456678888888777778875432 Q ss_pred ccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCc-eE---- Q lcl|NC_019456. 75 YKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNN-SY---- 149 (435) Q Consensus 75 ~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~-~~---- 149 (435) ........+..++. + | ....+...+..+++.+|.||.++-.+ ..|.+ .+..++|..+.+..++... .+ T Consensus 82 ~d~~~~~~~~~~~~-~-n---~~d~~~~~~~~~a~~~G~a~~~~~~~-edg~~-~i~~~~p~~~~~i~d~~~~~~~~~~~ 154 (456) T protein:vir:79 82 ADSDLALRARRIWR-D-N---RMDSVCKQWVKYGLDFGESYLTCWRR-DDGTA-TITADSPETMVVSVDPLQPWRIRSAM 154 (456) T ss_pred CCccHHHHHHHHHH-h-c---ChhHHHHHHHHHHhhcCeeEEEEeeC-CCCce-EEEEeccceeEEEEcCCCCCceEEEE Confidence 22222223333332 2 2 34466788999999999999877665 45766 5788899888877764321 10 Q ss_pred EEEEecCCee---EEEchh-------------------------------heEEeccCC---CccccccCcHHHHHHHHH Q lcl|NC_019456. 150 WYRVTSDIYN---FTIPIN-------------------------------DVIHVKHVV---PSNSWYGVSPIDVLSSSL 192 (435) Q Consensus 150 ~~~~~~~~~~---~~~~~~-------------------------------~iih~~~~~---~~~~~~G~s~l~~~~~~i 192 (435) .++...++.. ..+..+ ++-|.-... +.....|+|-+..+...+ T Consensus 155 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~~~gd~e~v~~li 234 (456) T protein:vir:79 155 RWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDII 234 (456) T ss_pred EEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEecCCCCCchhhhhHHHH Confidence 0111111100 000001 111110000 012235777777766655 Q ss_pred HHHHHHHHH---HHHHhhcCCceEEEe--CCcCCHHHHHHH--HHHHHHHhcCCCccccccCCceeeeccCChhhHHHHH Q lcl|NC_019456. 193 KFQRSVENF---SQNEMEKKDKFVLQY--DRSISPEKRQAM--VNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSS 265 (435) Q Consensus 193 ~~~~~~~~~---~~~~~~n~~~~~~~~--~~~~~~e~~~~~--~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e 265 (435) +....+..- ...++......+.-. +....++.-+.+ .+.|. ...+.+..++.+.++.++....- -.+.+ T Consensus 235 D~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~---~~~~~~~~~~~~~~~~q~~~~~~-~~~~~ 310 (456) T protein:vir:79 235 NRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFE---AAPGALWELPPGVDIWESQTNDF-TPMLS 310 (456) T ss_pred HHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhh---hhccccccCCCCcceeeecccCh-HHHHH Confidence 544322111 111111111111000 000111111111 11221 23355667788888877665433 23778 Q ss_pred HHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH--------------HHHHHHHhHHHHHHHHHHHHhhcccccccCc Q lcl|NC_019456. 266 VEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT--------------HSWTMTLMPIIRQYESQFNMKLFTPGKRVKG 331 (435) Q Consensus 266 ~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g 331 (435) ..+....+|+..-++|+..+|....+. +.+++. ..|...+...++.+. .+.. .... T Consensus 311 ~l~~~i~~i~~~t~~p~~~~~~~~~N~--Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~-----~~~g---~~~~ 380 (456) T protein:vir:79 311 AIKEHIRQLSSATKTPLPMLMPDSANQ--SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL-----QIEG---ESVE 380 (456) T ss_pred HHHHHHHHHHhhcCCChhHhcccccCc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhcC---CCcc Confidence 888889999999999999998654332 112211 122222222222111 1111 1122 Q ss_pred ceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccc Q lcl|NC_019456. 332 FYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASV 411 (435) Q Consensus 332 ~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~ 411 (435) ..++..+......+..+.++++.+++..|+++..-+++.+|+.+-.-+-. . .+...+... +.. T Consensus 381 ~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~~i~~~-------e---~~r~~~e~~---~~~---- 443 (456) T protein:vir:79 381 DTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQD-------D---LDRAREQIT---LFA---- 443 (456) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHHHHHHH-------H---HHHHHHHHH---HHh---- Confidence 23444445566778899999999999999999888888888865310000 0 000000000 000 Q ss_pred cccccCCCCCCCC Q lcl|NC_019456. 412 AAPKQEGGENTNE 424 (435) Q Consensus 412 ~~~~~~~~~~~~~ 424 (435) ..+.+.++.++.. T Consensus 444 ~~~~~~~~~~~~~ 456 (456) T protein:vir:79 444 GNPVQRPQEDGSR 456 (456) T ss_pred hhHhhcCCCCCCC Confidence 0000111111111 No 165 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.75 E-value=6.6e-08 Score=59.93 Aligned_cols=392 Identities=11% Similarity=0.079 Sum_probs=178.2 Q ss_pred CchHHHHHhhccccccc---------------cccccccchhhh---h-hcccccc--Cccc---ccHHHHhhhHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQA---------------NQIVQNPIPQPL---D-MAGVKLE--QATF---SREHILESNEYIFSI 56 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~---~-~~~~~~~--~~~~---~~~~~~~~~~~v~~~ 56 (435) ||+|++++++|...-.. ...........- . +.|-... .... ...........-..+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i 80 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTA 80 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHHH Confidence 99999999987431111 000000000000 0 1111000 0000 000111122334455 Q ss_pred HHHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCc Q lcl|NC_019456. 57 VTRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPN 136 (435) Q Consensus 57 i~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~ 136 (435) ++..|+-+..=|..+.-++....+..+..++. ......-....+.+.+..|.+++.+..+. |. ..+..++|. T Consensus 81 ~~~~A~lv~~e~~~i~v~~~~~~~e~l~~il~-----~n~f~~~~~~~~e~a~a~G~~~~k~~~d~--~~-~~i~~v~ad 152 (508) T protein:vir:15 81 ARRIASVVFNEKAEIHVKDNNEADKFLNDVLE-----DNDFKNKFEEALEKGVALGGFAMRPYIDG--NH-IKIAWVRAD 152 (508) T ss_pred HHHHHhhhhCCCceEEeCCchHHHHHHHHHHH-----hccHHHHHHHHHHHHhhcCceEEEEEEeC--Ce-eEEEEEcCC Confidence 56666555443433332222222222222221 22244556677888889999988887763 33 234555555 Q ss_pred eeEEEE-cCC-----------------CceEEEEE-----ecCC-ee---------------EEEch----------hh- Q lcl|NC_019456. 137 TVSILR-NTD-----------------NNSYWYRV-----TSDI-YN---------------FTIPI----------ND- 166 (435) Q Consensus 137 ~v~~~~-~~~-----------------~~~~~~~~-----~~~~-~~---------------~~~~~----------~~- 166 (435) .+-+.. +.. +..+|... ..++ .. ..++- ++ T Consensus 153 ~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~ 232 (508) T protein:vir:15 153 QFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQV 232 (508) T ss_pred eeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCcce Confidence 443321 111 11111100 0000 00 00110 01 Q ss_pred ---------eEEeccCCCc----cccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCH-HHHHHHHHHH Q lcl|NC_019456. 167 ---------VIHVKHVVPS----NSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISP-EKRQAMVNDF 232 (435) Q Consensus 167 ---------iih~~~~~~~----~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~-e~~~~~~~~~ 232 (435) ..||+.+-++ ..+.|+|.+..+...+...+........-|+.+...++.....+.. +.-.. .| T Consensus 233 ~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~d~~~~~---~~ 309 (508) T protein:vir:15 233 TISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLGQKHIAVQPGMLRFDDEHKP---TF 309 (508) T ss_pred EecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcCCCCCcc---cc Confidence 1244432221 3567999999999988887765555555556654333332211110 00000 01 Q ss_pred HHHhcCCCcc---cc--ccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH------- Q lcl|NC_019456. 233 LRMVKENGGA---VV--QEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT------- 300 (435) Q Consensus 233 ~~~~~~~~~~---~v--l~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~------- 300 (435) ....+. +- .++|..++.++....+-++.+..+...+.|....|++|..+|....+.. ++.+.. T Consensus 310 ----~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~-TAtei~s~~~~~~ 384 (508) T protein:vir:15 310 ----DTEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVK-TATEVVSNNSMTY 384 (508) T ss_pred ----CCCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccc-cHHHHHHHHHHHH Confidence 111111 11 1344567777777666678888888899999999999999987655432 222211 Q ss_pred -------HHHHHHHhHHHHHHHHHHHH-hhccc-------ccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHH Q lcl|NC_019456. 301 -------HSWTMTLMPIIRQYESQFNM-KLFTP-------GKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPN 365 (435) Q Consensus 301 -------~~~~~~i~P~~~~i~~~l~~-~l~~~-------~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~N 365 (435) ..++.+|..++..+...... .++.. ........+.+++++-...|.++.++...+++..|+|+.- T Consensus 385 ~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s~e 464 (508) T protein:vir:15 385 QTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALSKQ 464 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHH Confidence 11222333333333222211 11111 0112234577888888899999999999999999999999 Q ss_pred HHHHHh-CCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCC Q lcl|NC_019456. 366 EIRELE-GQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 366 E~R~~~-g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (435) +++... |++ ++.+++.+- .+. .++...........+..|++. | T Consensus 465 ~~i~~~~g~~---deea~~el~-----ri~------~E~~~~~~~~~~~~~~~g~~g--e 508 (508) T protein:vir:15 465 TFLQRNYGMT---DEQAAEELA-----KIQ------SEAPTDTFEGGRSAILNGGDG--E 508 (508) T ss_pred HHHHhcCCCC---hHHHHHHHH-----HHH------HhccccCccccccccCCCCCC--C Confidence 988654 433 333332211 010 000000000001111111111 1 No 166 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=98.71 E-value=9e-08 Score=59.21 Aligned_cols=411 Identities=13% Similarity=0.044 Sum_probs=174.2 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhcc--ccccC--cccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAG--VKLEQ--ATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~--~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) --.++++..+..........+-.....++.-.. ..... ........-..++.....|+..+.-+-.-|+.+.-... T Consensus 39 ~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~ 118 (502) T protein:vir:48 39 VNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDN 118 (502) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccceeecchHHHHHHHHhhhhcccCeeEecCCc Confidence 111112222111110000000000000000000 00000 00000000112345556677777777777877654332 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC--CceE----E Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD--NNSY----W 150 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~~~----~ 150 (435) .. ...+...|. +.............+..+++.+|.||.++..+. .|.+ .+..++|..+.+..+.. +... + T Consensus 119 ~~-~~~~~~~l~-~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~de-dg~~-~i~~~~p~~~~~vydd~~~~~~~~~ir~ 194 (502) T protein:vir:48 119 ED-NSQNDDAIK-RIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSE-YDET-RIKRLSPLETFVIYDNSLEDNSIAAVRY 194 (502) T ss_pred cc-hhHHHHHHH-HHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCC-CCce-EEEEEcccceEEEEcCCCCCceEEEEEE Confidence 21 122222222 223334566778899999999999998887764 4654 57788998888776643 2221 1 Q ss_pred EEEe-cCCe---eEEEchhheEEeccCC----------C---------ccccccCcHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019456. 151 YRVT-SDIY---NFTIPINDVIHVKHVV----------P---------SNSWYGVSPIDVLSSSLKFQRSVENFSQNEME 207 (435) Q Consensus 151 ~~~~-~~~~---~~~~~~~~iih~~~~~----------~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ 207 (435) |... ..+. ...+..+.++++.... + .+...|.|.+..+...+.....+.....+.+. T Consensus 195 ~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~ 274 (502) T protein:vir:48 195 YNRGTLQNAKDVVEIYTNQHIYTLDASDSFNEISVTPHAFGTVPITEFLNNADGIGDYETELYLIDLYDSAESDTANHMS 274 (502) T ss_pred EEEeecCCcEEEEEEEeCCeEEEEEeCCceeeccceecCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHH Confidence 2111 1111 1234455554443211 1 12346788888777777765544433333333 Q ss_pred cCCc--eEEEeCCcCC-HHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_019456. 208 KKDK--FVLQYDRSIS-PEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISF 284 (435) Q Consensus 208 n~~~--~~~~~~~~~~-~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~ 284 (435) .... .++....... ++....+++...-.....+.....+.+.+++.++.......+....+...+.|+..-++|... T Consensus 275 ~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~ 354 (502) T protein:vir:48 275 DMADAILAIYGDLALPQGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMS 354 (502) T ss_pred HhcCceeeeecCcccccccchhhhhhcceeeccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcC Confidence 3322 2222221211 122222211110000111111112344555555544444456666778888999999999765 Q ss_pred hCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHH Q lcl|NC_019456. 285 LNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAART 350 (435) Q Consensus 285 lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~ 350 (435) .+.... +. +.++. ...|...+.-.++.+...+...--...... ..+.+.+......|..+.+ T Consensus 355 ~~~~~~-n~-Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~--~~i~i~f~~~~p~d~~e~a 430 (502) T protein:vir:48 355 DNHFSG-NA-SGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDE--SRLKITFTPNLPKSLYEQV 430 (502) T ss_pred cccccc-Cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc--ccceEEeCCCCCcCHHHHH Confidence 543321 11 11221 223344444444444444443211111111 2344555667778899999 Q ss_pred HHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCC-CCCCCCC Q lcl|NC_019456. 351 QYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENT-NENGLQS 429 (435) Q Consensus 351 ~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 429 (435) +++.++ .|+++..-+.+++++- +++-. + +..+.+..... ............+++.. +..++.+ T Consensus 431 ~~~~kl--~g~iS~et~l~~l~~v--~D~~~-E---------~~ri~~E~~~~--~~~~~~~~~~~~~~~~~d~~~e~~~ 494 (502) T protein:vir:48 431 SILNDL--GGQVSQETALSLSGLV--ENPTE-E---------LDKINEESSKI--DFKGYPSYFYDNVGKYTDEVKETHT 494 (502) T ss_pred HHHHHH--hccCcHHHHHHhCCCC--CCHHH-H---------HHHHHHHHHhh--hhhcccccccccccccCCCccCCCC Confidence 999988 4889988888887643 21110 1 11111100000 00000000111111111 1111111 Q ss_pred CCCCCC Q lcl|NC_019456. 430 TEPEGS 435 (435) Q Consensus 430 ~~~~~~ 435 (435) .+.|.- T Consensus 495 ~~~~~~ 500 (502) T protein:vir:48 495 DDFERV 500 (502) T ss_pred cCcCCC Confidence 111111 No 167 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.68 E-value=3.7e-08 Score=61.34 Aligned_cols=390 Identities=9% Similarity=-0.008 Sum_probs=170.8 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhcc----ccccC-cccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAG----VKLEQ-ATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~ 75 (435) ..+.++|.+.+......-. ....++.... ..... ...-....-..+.+...+|+..+..+-.-||.+.... T Consensus 7 ~~~~~~l~~~~~~~~~r~~----~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~~~ 82 (456) T protein:vir:10 7 AEWLPVLTKRIDDGMSRVR----LLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSA 82 (456) T ss_pred HHHHHHHHHHHHHHHHHHH----HHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCCCC Confidence 3333434332221111000 0011111000 00000 0000000112234666788888887777788764322 Q ss_pred cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCce-----EE Q lcl|NC_019456. 76 KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNS-----YW 150 (435) Q Consensus 76 ~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-----~~ 150 (435) .......+...+. + | ....+...+..+++.+|.||.++..+ ..|.+ .+..++|..+.+..++.... .. T Consensus 83 d~~~~~~~~~i~~-~-N---~~d~~~~~~~~~a~i~G~ay~~v~~d-~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~i~ 155 (456) T protein:vir:10 83 DSDLALRARRIWR-D-N---RMDSVCKQWVKYGLDFGESYLTCWRR-DDGTA-TITADSPETMVVSVDPLQPWRIRAAMR 155 (456) T ss_pred CcchHHHHHHHHH-h-c---ChhhHHHHHHHHHhhcCeeEEEEeeC-CCCce-EEEEEccceeEEEEcCCCCcceEEEEE Confidence 1111222333332 2 2 34556678899999999999887765 44665 46788898888777654311 01 Q ss_pred EEEecCCeeE---EE-------------------------chhh------eEEeccCC---CccccccCcHHHHHHHHHH Q lcl|NC_019456. 151 YRVTSDIYNF---TI-------------------------PIND------VIHVKHVV---PSNSWYGVSPIDVLSSSLK 193 (435) Q Consensus 151 ~~~~~~~~~~---~~-------------------------~~~~------iih~~~~~---~~~~~~G~s~l~~~~~~i~ 193 (435) +....++... .+ .... .-|+-... +.....|+|.+......+. T Consensus 156 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~gd~e~vi~liD 235 (456) T protein:vir:10 156 WWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIIN 235 (456) T ss_pred EEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCCCCchhhhhHHHHH Confidence 1111111000 00 0000 00100000 0122357888877766666 Q ss_pred HHHHHHHH--HHHHhhcCCceEEEe-C--CcCCHHHHHHH--HHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHH Q lcl|NC_019456. 194 FQRSVENF--SQNEMEKKDKFVLQY-D--RSISPEKRQAM--VNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSV 266 (435) Q Consensus 194 ~~~~~~~~--~~~~~~n~~~~~~~~-~--~~~~~e~~~~~--~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~ 266 (435) ....+..- ....+..-+..++.. . ....++....+ ...|. ...+.+..++.+.++.+++...- -.+.+. T Consensus 236 a~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~---~~~~~~~~~~~~~~~~q~~~~~~-~~~~~~ 311 (456) T protein:vir:10 236 RINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFE---AAPGALWELPPGVDIWESQANDF-TPMLSA 311 (456) T ss_pred HHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhh---hhccccccCCCCcceEEecccCh-hHHHHH Confidence 54432221 111111111111111 0 00111111111 11121 22345667788899887765432 247888 Q ss_pred HHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHH---HHHHHhHHHHHHHHHHHH---hhcccccccCcceeeechhh Q lcl|NC_019456. 267 EQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHS---WTMTLMPIIRQYESQFNM---KLFTPGKRVKGFYFSFNVNG 340 (435) Q Consensus 267 ~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~---~~~~i~P~~~~i~~~l~~---~l~~~~~~~~g~~i~fd~~~ 340 (435) .+....+|+..-++|+..+|....+. +.+++..- +...+.-..+.+...|.+ .++.-........+++.+.. T Consensus 312 l~~~i~~~~~~s~~p~~~~~~~~~N~--Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~~~~~~v~w~~ 389 (456) T protein:vir:10 312 IKEHIRQLSSATKTPLPMLMPDSANQ--SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFES 389 (456) T ss_pred HHHHHHHHHhccCCChHHhcccccCh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecC Confidence 88899999999999999998754322 12221111 111111111111111111 01100111112234444456 Q ss_pred hhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCC Q lcl|NC_019456. 341 LLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGE 420 (435) Q Consensus 341 l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (435) ....+..+.++++.++++.|+.+..-+++++|+.+-.-+ + ..++...+... +.+ ..+.+.+.+ T Consensus 390 ~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~---~-------~e~er~~~e~~---~~~----~~~~~~~~~ 452 (456) T protein:vir:10 390 PDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIK---Q-------DDLDRAREQIT---LFA----GNPVQRPQE 452 (456) T ss_pred CCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHH---H-------HHHHHHHHHHH---HHh----hhhhhcCCC Confidence 667888999999999999999998888888888652100 0 00111111100 000 001111111 Q ss_pred CCCC Q lcl|NC_019456. 421 NTNE 424 (435) Q Consensus 421 ~~~~ 424 (435) ++.. T Consensus 453 ~~~~ 456 (456) T protein:vir:10 453 DGSR 456 (456) T ss_pred CCCC Confidence 1111 No 168 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.68 E-value=3.7e-08 Score=61.34 Aligned_cols=390 Identities=9% Similarity=-0.008 Sum_probs=170.8 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhcc----ccccC-cccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAG----VKLEQ-ATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNY 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~ 75 (435) ..+.++|.+.+......-. ....++.... ..... ...-....-..+.+...+|+..+..+-.-||.+.... T Consensus 7 ~~~~~~l~~~~~~~~~r~~----~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~~~ 82 (456) T protein:vir:10 7 AEWLPVLTKRIDDGMSRVR----LLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSA 82 (456) T ss_pred HHHHHHHHHHHHHHHHHHH----HHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCCCC Confidence 3333434332221111000 0011111000 00000 0000000112234666788888887777788764322 Q ss_pred cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCce-----EE Q lcl|NC_019456. 76 KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNS-----YW 150 (435) Q Consensus 76 ~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-----~~ 150 (435) .......+...+. + | ....+...+..+++.+|.||.++..+ ..|.+ .+..++|..+.+..++.... .. T Consensus 83 d~~~~~~~~~i~~-~-N---~~d~~~~~~~~~a~i~G~ay~~v~~d-~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~i~ 155 (456) T protein:vir:10 83 DSDLALRARRIWR-D-N---RMDSVCKQWVKYGLDFGESYLTCWRR-DDGTA-TITADSPETMVVSVDPLQPWRIRAAMR 155 (456) T ss_pred CcchHHHHHHHHH-h-c---ChhhHHHHHHHHHhhcCeeEEEEeeC-CCCce-EEEEEccceeEEEEcCCCCcceEEEEE Confidence 1111222333332 2 2 34556678899999999999887765 44665 46788898888777654311 01 Q ss_pred EEEecCCeeE---EE-------------------------chhh------eEEeccCC---CccccccCcHHHHHHHHHH Q lcl|NC_019456. 151 YRVTSDIYNF---TI-------------------------PIND------VIHVKHVV---PSNSWYGVSPIDVLSSSLK 193 (435) Q Consensus 151 ~~~~~~~~~~---~~-------------------------~~~~------iih~~~~~---~~~~~~G~s~l~~~~~~i~ 193 (435) +....++... .+ .... .-|+-... +.....|+|.+......+. T Consensus 156 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~gd~e~vi~liD 235 (456) T protein:vir:10 156 WWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIIN 235 (456) T ss_pred EEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCCCCchhhhhHHHHH Confidence 1111111000 00 0000 00100000 0122357888877766666 Q ss_pred HHHHHHHH--HHHHhhcCCceEEEe-C--CcCCHHHHHHH--HHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHH Q lcl|NC_019456. 194 FQRSVENF--SQNEMEKKDKFVLQY-D--RSISPEKRQAM--VNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSV 266 (435) Q Consensus 194 ~~~~~~~~--~~~~~~n~~~~~~~~-~--~~~~~e~~~~~--~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~ 266 (435) ....+..- ....+..-+..++.. . ....++....+ ...|. ...+.+..++.+.++.+++...- -.+.+. T Consensus 236 a~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~---~~~~~~~~~~~~~~~~q~~~~~~-~~~~~~ 311 (456) T protein:vir:10 236 RINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFE---AAPGALWELPPGVDIWESQANDF-TPMLSA 311 (456) T ss_pred HHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhh---hhccccccCCCCcceEEecccCh-hHHHHH Confidence 54432221 111111111111111 0 00111111111 11121 22345667788899887765432 247888 Q ss_pred HHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHH---HHHHHhHHHHHHHHHHHH---hhcccccccCcceeeechhh Q lcl|NC_019456. 267 EQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHS---WTMTLMPIIRQYESQFNM---KLFTPGKRVKGFYFSFNVNG 340 (435) Q Consensus 267 ~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~---~~~~i~P~~~~i~~~l~~---~l~~~~~~~~g~~i~fd~~~ 340 (435) .+....+|+..-++|+..+|....+. +.+++..- +...+.-..+.+...|.+ .++.-........+++.+.. T Consensus 312 l~~~i~~~~~~s~~p~~~~~~~~~N~--Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~~~~~~v~w~~ 389 (456) T protein:vir:10 312 IKEHIRQLSSATKTPLPMLMPDSANQ--SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFES 389 (456) T ss_pred HHHHHHHHHhccCCChHHhcccccCh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecC Confidence 88899999999999999998754322 12221111 111111111111111111 01100111112234444456 Q ss_pred hhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCC Q lcl|NC_019456. 341 LLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGE 420 (435) Q Consensus 341 l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (435) ....+..+.++++.++++.|+.+..-+++++|+.+-.-+ + ..++...+... +.+ ..+.+.+.+ T Consensus 390 ~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~---~-------~e~er~~~e~~---~~~----~~~~~~~~~ 452 (456) T protein:vir:10 390 PDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIK---Q-------DDLDRAREQIT---LFA----GNPVQRPQE 452 (456) T ss_pred CCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHH---H-------HHHHHHHHHHH---HHh----hhhhhcCCC Confidence 667888999999999999999998888888888652100 0 00111111100 000 001111111 Q ss_pred CCCC Q lcl|NC_019456. 421 NTNE 424 (435) Q Consensus 421 ~~~~ 424 (435) ++.. T Consensus 453 ~~~~ 456 (456) T protein:vir:10 453 DGSR 456 (456) T ss_pred CCCC Confidence 1111 No 169 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=98.66 E-value=1.3e-07 Score=58.30 Aligned_cols=412 Identities=13% Similarity=0.051 Sum_probs=174.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccc--cC--cccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKL--EQ--ATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~--~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) +..+.++.++...........-.....++.-..... .. ........-..++....+|+..+.-+-.-|+++.-.+. T Consensus 38 ~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~~ 117 (501) T protein:vir:96 38 VNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDN 117 (501) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHHHHHHhhhhcccCeeEeeCCc Confidence 333333333221111100000000000110000000 00 00000001123455566777777666666776644322 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC--CceE----E Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD--NNSY----W 150 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~~~----~ 150 (435) .. ...+...| .+.............+..+++.+|.||.++.++. .|.+ .+..++|..+.+..+.. +... + T Consensus 118 ~~-~~~~~~~l-~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~de-dg~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~ 193 (501) T protein:vir:96 118 DD-NSQNDDAI-KRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSE-YDET-RIKRLSPLETFVIYDNSLEDNSIAAVRY 193 (501) T ss_pred cc-hhHHHHHH-HHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcC-CCce-EEEEEccceeEEEEcCCCCCceEEEEEE Confidence 11 12222222 2333334566778899999999999999888764 4654 57788999888877653 2221 1 Q ss_pred EEEec-CCe---eEEEchhheEEeccC----------CC---------ccccccCcHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019456. 151 YRVTS-DIY---NFTIPINDVIHVKHV----------VP---------SNSWYGVSPIDVLSSSLKFQRSVENFSQNEME 207 (435) Q Consensus 151 ~~~~~-~~~---~~~~~~~~iih~~~~----------~~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ 207 (435) |.... .+. ...+.++.|.++... ++ .+...|.|.+..+...+.....+.....+.+. T Consensus 194 ~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~ 273 (501) T protein:vir:96 194 YNRGTLQSAKDVVEIYTDEHIYTLDASDDFNEISVTTHAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAESDTANHMS 273 (501) T ss_pred EEeecCCCcEEEEEEEcCCcEEEEeeCCCceeccccccCCCccceEEecCCccCCCchhhhHHHHHHHHHHHHHHHHHHH Confidence 11111 111 112344444443311 00 12245788887777777665543333333332 Q ss_pred cC--CceEEEeCCcC-CHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_019456. 208 KK--DKFVLQYDRSI-SPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISF 284 (435) Q Consensus 208 n~--~~~~~~~~~~~-~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~ 284 (435) .. +..++...... .++....++....-.....+.......+.++.-+........+....+...+.|...-++|... T Consensus 274 ~~~~~~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~ 353 (501) T protein:vir:96 274 DMADAILAIYGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMS 353 (501) T ss_pred HhcCceeeeecccccCcccchhhhhhcCeeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccC Confidence 22 22222221111 1122222211110011111222223344455555544444556777778888999999998765 Q ss_pred hCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHH Q lcl|NC_019456. 285 LNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAART 350 (435) Q Consensus 285 lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~ 350 (435) .+.... +.+ .+++ ...|...+...++.+...+...--...... ..+++.+......|..+.+ T Consensus 354 ~~~~~~-n~S-g~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~--~~i~i~f~~~~p~n~~e~a 429 (501) T protein:vir:96 354 DTNFSG-NTS-GEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDE--SLLKITFTPNLPKSLNEQV 429 (501) T ss_pred cccccc-cch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc--ccceEEeCCCCCcCHHHHH Confidence 543322 111 1111 123334444444444444333211111111 1244445667788899999 Q ss_pred HHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCC Q lcl|NC_019456. 351 QYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQST 430 (435) Q Consensus 351 ~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (435) +.+.++. |+++..-+.+++++- +++-. + ++.+.+.......... .....+..+...++.++.++. T Consensus 430 d~~~kl~--g~iS~et~~~~l~~v--~D~~~-E---------~~ri~~E~~~~~~~~~-~~~~~~~~~~~~~~~~e~~~d 494 (501) T protein:vir:96 430 SILTGLG--GQVSQETALSLSGLV--ESPNE-E---------LDKINKEMSEIDFKGY-SNDFNEHVGKYTDEVKETHTD 494 (501) T ss_pred HHHHHHh--ccCchHHHHHhCCCC--CCHHH-H---------HHHHHHHHHHhhcccc-ccchhhcccccCCcCCCCCCC Confidence 9999984 789987788877542 21111 1 1111111000000000 000001111111111111111 Q ss_pred CCCCC Q lcl|NC_019456. 431 EPEGS 435 (435) Q Consensus 431 ~~~~~ 435 (435) +.|.- T Consensus 495 ~~e~~ 499 (501) T protein:vir:96 495 DFERE 499 (501) T ss_pred ccccc Confidence 11111 No 170 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=98.63 E-value=1.7e-07 Score=57.67 Aligned_cols=401 Identities=9% Similarity=0.042 Sum_probs=168.2 Q ss_pred CchHHHH---------------Hhhccccccccccccccchhhhh----h----------ccccccCcccccHHHHhhhH Q lcl|NC_019456. 1 MSFMSKV---------------RQFFGVHDQANQIVQNPIPQPLD----M----------AGVKLEQATFSREHILESNE 51 (435) Q Consensus 1 Mg~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~----~----------~~~~~~~~~~~~~~~~~~~~ 51 (435) |.+...+ ..+......+. . .....++. + .+.........+. -..++ T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~-~--~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~--ri~~n 87 (503) T protein:vir:59 13 EELNEIIVESAKEIAEPDTTMIQKLIDEHNPEP-L--LKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNN--RTSHA 87 (503) T ss_pred HhHHHhhhhhhhhccchhHHHHHHHHHhhcHHH-H--HHHHHHhccccchhhccchhcccccccccccccccc--eeecc Confidence 2211111 11100000000 0 00000000 0 0000000000000 11245 Q ss_pred HHHHHHHHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEE Q lcl|NC_019456. 52 YIFSIVTRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALW 131 (435) Q Consensus 52 ~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~ 131 (435) ....+|+..+.-+-.-|+.+..++... ......+.. | ........+..+++.+|.+|.++..+.. |++ .+. T Consensus 88 ~~~~ivd~~~~yl~g~~~~~~~~d~~~--~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~d-g~~-~i~ 158 (503) T protein:vir:59 88 WHKLFVDQKTQYLVGEPVTFTSDNKTL--LEYVNELAD--D---DFDDILNETVKNMSNKGIEYWHPFVDEE-GEF-DYV 158 (503) T ss_pred hHHHHHHHHHhhhhcCCeeeccCcHHH--HHHHHHHHh--c---CHHHHHHHHHHHHhhCCeEEEEEeecCC-Cce-EEE Confidence 566677888877777777754333221 122333331 2 4556677789999999999998887644 665 588 Q ss_pred EeCCceeEEEEcCC--CceE----EEEEe-cCCe----eEEEchhheEEeccCC-------------------------C Q lcl|NC_019456. 132 PLDPNTVSILRNTD--NNSY----WYRVT-SDIY----NFTIPINDVIHVKHVV-------------------------P 175 (435) Q Consensus 132 ~l~~~~v~~~~~~~--~~~~----~~~~~-~~~~----~~~~~~~~iih~~~~~-------------------------~ 175 (435) .++|..+.+..++. +... +|... ..+. ...+.++.|.++.... + T Consensus 159 ~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (503) T protein:vir:59 159 IFPAEEMIVVYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIG 238 (503) T ss_pred EEccceeEEEEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceecc Confidence 89998888776643 2211 12111 1111 1234445444433110 0 Q ss_pred ---------ccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCcccc Q lcl|NC_019456. 176 ---------SNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVV 244 (435) Q Consensus 176 ---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~v 244 (435) .+...|.|-+..+...+.....+.....+.+... +..+++.-.....+ .....+ ...+++. T Consensus 239 ~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~---~~~~~~-----~~~~~~~ 310 (503) T protein:vir:59 239 WGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPK---EFTANL-----RYHSVIK 310 (503) T ss_pred CCccceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccc---hhhhhh-----hccccee Confidence 1224577777777777766544332222223222 22333221111111 111111 1234555 Q ss_pred ccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHH---HhCCcccCcccH----------HHHHHHHHHHHHhHHH Q lcl|NC_019456. 245 QEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPIS---FLNDDQAKSTTN----------VEHVTHSWTMTLMPII 311 (435) Q Consensus 245 l~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~---~lg~~~~~~~~~----------~e~~~~~~~~~i~P~~ 311 (435) ++++.+++.+........+....+...+.|....++|.. ..+... ++.+- .+.....|...|..++ T Consensus 311 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~ 389 (503) T protein:vir:59 311 VSGDGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGGGA-TGPALENLYALLDLKANMAERKIRAGLRLFF 389 (503) T ss_pred ccCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccccc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 555555554444433344556666666677666666643 222211 11110 0111223344444444 Q ss_pred HHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeeccccc Q lcl|NC_019456. 312 RQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLY 391 (435) Q Consensus 312 ~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~ 391 (435) +.+...+...-- ........+.+.+......|..+.++.+.+++..|+++...+.+++++-+-++...+.+ -.... T Consensus 390 ~~i~~~~~~~~~--~~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~v~d~~~E~~ri--~~E~~ 465 (503) T protein:vir:59 390 WFFAEYLRNTGK--GDFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARNPFVQDPEEELARI--EEEMN 465 (503) T ss_pred HHHHHHHHhccC--cccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhCCCCCCHHHHHHHH--HHHHH Confidence 444444433211 11111123555566777889999999999999999999988988876532111111110 00000 Q ss_pred -chhccccccccccccccccccccccCCCCCCCCCCCCC Q lcl|NC_019456. 392 -PLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQS 429 (435) Q Consensus 392 -~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (435) ..+...... ......+......+..+++..+.+|+.. T Consensus 466 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 503 (503) T protein:vir:59 466 QYAEMQGNLL-DDEGGDDDLEEDDPNAGAAESGGAGQVS 503 (503) T ss_pred HHHhhhcccc-CccCCCCCCCcCCCCCCcccCCCCCCcC Confidence 000000000 0000000000001111111111112111 No 171 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=98.59 E-value=2.3e-07 Score=56.98 Aligned_cols=409 Identities=12% Similarity=0.050 Sum_probs=176.2 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccc-cC---cccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKL-EQ---ATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~---~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) +..+++++.+...........-.....++.--.... .+ ........-..++....+|+..+.-+-.-|+.+..... T Consensus 38 ~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~ 117 (501) T protein:vir:27 38 VNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDN 117 (501) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHHHHHHhhhhcccCeeEecCCc Confidence 333333433322111110000000001110000000 00 00000001123455666777777777666776654332 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC--CceE----E Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD--NNSY----W 150 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~~~----~ 150 (435) ... ..+...| .+-............+..+++.+|.+|.++.++.. |.+ .+..++|..+.+..+.. +... + T Consensus 118 ~~~-~~~~~~l-~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded-~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~ 193 (501) T protein:vir:27 118 DNN-SQNDDTI-KRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEY-DET-RIKRLNPLETFVIYDNSLEDNSIAAVRY 193 (501) T ss_pred cch-HHHHHHH-HHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCC-Cce-EEEEEccceeEEEecCCCCCceEEEEEE Confidence 211 1122222 12222335667788899999999999999887644 654 57778898888776653 2111 1 Q ss_pred EEEec-CCe---eEEEchhheEEeccC----------CC---------ccccccCcHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019456. 151 YRVTS-DIY---NFTIPINDVIHVKHV----------VP---------SNSWYGVSPIDVLSSSLKFQRSVENFSQNEME 207 (435) Q Consensus 151 ~~~~~-~~~---~~~~~~~~iih~~~~----------~~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ 207 (435) |.... .+. ...+..+.|.++... ++ .+...|.|.+..+...+.....+.....+.+. T Consensus 194 ~~~~~~~~~~~~~~vyt~~~v~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~ 273 (501) T protein:vir:27 194 YNRGTLQNAKDVVEIYTNEHIYTLDASDDFNEISVTTHAFGTVPITEFLNNVDGIGDYETELYLIDLYDSAESDTANHMS 273 (501) T ss_pred EEeeecCCcEEEEEEEeCCeEEEEEeCCceeeccccccCCCcccEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHH Confidence 11111 111 112334444333211 10 12345788888777777766544333333333 Q ss_pred cCCc--eEEEeC-CcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_019456. 208 KKDK--FVLQYD-RSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISF 284 (435) Q Consensus 208 n~~~--~~~~~~-~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~ 284 (435) .... .++... ..-.++....++....-.....+.....+.+.++..+........+....+...+.|+..-++|... T Consensus 274 ~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~ 353 (501) T protein:vir:27 274 DMADAILAIYGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMS 353 (501) T ss_pred HhcCceeeeecCccCCcccchhhhhhcCceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccC Confidence 2222 222221 1112223232222111111222223334455566665555555556777788888999999998755 Q ss_pred hCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHH Q lcl|NC_019456. 285 LNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAART 350 (435) Q Consensus 285 lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~ 350 (435) .+.... +. +..++ ...|...+...++.+...+...--..... ...+.+.+......+..+.+ T Consensus 354 ~~~~~~-n~-Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d--~~~i~v~f~~~~p~n~~e~a 429 (501) T protein:vir:27 354 DTNFSG-NT-SGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFD--ESLLKITFTPNLPKSLNEQV 429 (501) T ss_pred cccccc-Cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--cccceEEeCCCCCcCHHHHH Confidence 443221 11 11111 12333444444444444433221111111 12345555677788889999 Q ss_pred HHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccc-cccccCCCC----CCCCC Q lcl|NC_019456. 351 QYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASV-AAPKQEGGE----NTNEN 425 (435) Q Consensus 351 ~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~~----~~~~~ 425 (435) +.+.++ .|+++..-+.+++++-.=|+... +.+.+..........+.. ..+...+++ ..+++ T Consensus 430 d~~~kl--~g~iS~et~l~~l~~v~D~~~E~------------eri~~E~~e~~~~~~~~~~~~~~~~~~d~~~~~~~d~ 495 (501) T protein:vir:27 430 SILTGL--GGQVSQETALSLSGLVESPNEEL------------DKINKEVSEIDFKGYSNDFNEHVGKYTDEVKETHTDD 495 (501) T ss_pred HHHHHH--hccCcHHHHHHhCCCCCCHHHHH------------HHHHHHHHhhhHhhhcCccccccccccCCCCCCcccc Confidence 999988 58899888888775422111111 111111000000000000 000011111 11111 Q ss_pred CCCCCC Q lcl|NC_019456. 426 GLQSTE 431 (435) Q Consensus 426 ~~~~~~ 431 (435) .++.+| T Consensus 496 ~e~~~~ 501 (501) T protein:vir:27 496 FERAYE 501 (501) T ss_pred ccccCC Confidence 111122 No 172 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=98.56 E-value=2.8e-07 Score=56.51 Aligned_cols=397 Identities=9% Similarity=0.010 Sum_probs=155.5 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccH--H--HH-hhhHHHHHHHHHHHHHHhhCceeeeecc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSRE--H--IL-ESNEYIFSIVTRLSNVLASLPLHEYQNY 75 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~--~~-~~~~~v~~~i~~ia~~ia~~~~~~~~~~ 75 (435) .-+...+.+.+....+.-. -...++..-............ . .. ..+.+..-+|+.++..+---.|.+ . T Consensus 29 ~~l~~~l~~~~~~~~~rl~----~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~l~~~gf~~---~ 101 (501) T protein:vir:25 29 GALVADMWRLHISERQWLD----RIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQNLSVVGYRN---A 101 (501) T ss_pred HHHHHHHHHHHHHHHHHHH----HHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhhhcccceec---C Confidence 1112222211111100000 000111100000000000000 0 00 112344455665555442223332 1 Q ss_pred cccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEE-cCCCc--eE--- Q lcl|NC_019456. 76 KQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILR-NTDNN--SY--- 149 (435) Q Consensus 76 ~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~-~~~~~--~~--- 149 (435) .......+...+ .-| ........+..+++.+|.||.++..+.. |. .+..++|..|.+.. ++... .. T Consensus 102 d~~~~~~l~~i~--~~N---~~d~~~~~~~~~a~i~G~ay~~v~~de~-~~--~i~~~sp~~~~~iy~D~~~~~~~~~ai 173 (501) T protein:vir:25 102 LAKENDPAWEMW--QRN---RMDARQAEVHRPALTYGASYVTVTPTDE-GP--VFRTRSPRQILAVYADPSVDAWPQYAL 173 (501) T ss_pred CccchHHHHHHH--Hhc---ChhHHHHHHHHHHhhcCceEEEEecCCC-CC--eEEEeccccEEEEEecCCCCcceeEEE Confidence 111223333222 112 2445567889999999999988876643 53 46677888887654 32211 11 Q ss_pred -EEEEecC-Ce---eEEEchhh----------------------------------------------eEEeccCCCccc Q lcl|NC_019456. 150 -WYRVTSD-IY---NFTIPIND----------------------------------------------VIHVKHVVPSNS 178 (435) Q Consensus 150 -~~~~~~~-~~---~~~~~~~~----------------------------------------------iih~~~~~~~~~ 178 (435) ++..... +. ...+.+.. |+||++.. ... T Consensus 174 ~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~-~~~ 252 (501) T protein:vir:25 174 ETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGR-DAD 252 (501) T ss_pred EEEeeccccCcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCCccceeeEeccCcc-ccC Confidence 1111111 00 01111111 23333211 112 Q ss_pred cccCcHHHHHHHHHHHHHHHH---HHHHHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCCCcccccc-CCceeeec Q lcl|NC_019456. 179 WYGVSPIDVLSSSLKFQRSVE---NFSQNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQE-AGWKVDRY 254 (435) Q Consensus 179 ~~G~s~l~~~~~~i~~~~~~~---~~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~-~g~~~~~~ 254 (435) ..|.|-+..+...+....... .....++......++-.+. ++.+. |+ ...+++++++ ++.++.++ T Consensus 253 ~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~~----~~~~~----~~---~~~~~i~~~~~~~~~~~q~ 321 (501) T protein:vir:25 253 DMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVISGWTG----SKAEV----LK---ASALRVWTFEDPEVKAQAF 321 (501) T ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCCC----Cccch----hh---hcccceeccCCCCceEEEe Confidence 347776665554444433321 1222233332222222211 11111 11 1234566665 46677665 Q ss_pred cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHH---HHHHHhHHHHHHHHHHHH------hhccc Q lcl|NC_019456. 255 ESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHS---WTMTLMPIIRQYESQFNM------KLFTP 325 (435) Q Consensus 255 ~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~---~~~~i~P~~~~i~~~l~~------~l~~~ 325 (435) ....- -.+.+..+....+|+..-++|+..+|....+. +.+++..- +...+.-..+.+...|.+ .+... T Consensus 322 ~~~~~-~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~--Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~ 398 (501) T protein:vir:25 322 PPASV-EPYNLILEEMLQHVAMVAQISPAQVTGKMINV--SAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDD 398 (501) T ss_pred cccCh-HHHHHHHHHHHHHHHhhcCCChhhhccccCCh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 44322 23778888889999999999999998654432 22221111 111111111112222211 11111 Q ss_pred ccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHH-HhCCCCCCCcCCceeeecccccchhcccccccccc Q lcl|NC_019456. 326 GKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRE-LEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNK 404 (435) Q Consensus 326 ~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~-~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~ 404 (435) ........+++.+......+..+.++++.++++.|+ +.-.+.. +.|+.+-. -++..--..-.......+.... . T Consensus 399 ~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gi-s~et~~~~~~g~~~~~---ie~~~~~~~e~~~~~~~~~~~~-~ 473 (501) T protein:vir:25 399 PDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGI-PIEHLLSMVPGMTQQT---IQAIKDSLRGGEVKSLVDKLLS-N 473 (501) T ss_pred CccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCC-CHHHHHHHcCCCCHHH---HHHHHHHHHHHhHHHHHHHhhc-c Confidence 111122346666677778899999999999998886 5555444 45776421 1111000000000000000000 0 Q ss_pred ccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 405 IQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 405 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) . ..+....+..+..+.++ .-..+.+.|+ T Consensus 474 ~-~~~~~~~~~~~~~~~~~--~~~~~~~~g~ 501 (501) T protein:vir:25 474 E-PAPVPPPPPQAAAQALN--EGGVNGNGGA 501 (501) T ss_pred C-cCCCCCCCCCCCccccc--cccCCCCCCC Confidence 0 00001111111111111 1122334444 No 173 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.56 E-value=2.9e-07 Score=56.42 Aligned_cols=394 Identities=11% Similarity=0.012 Sum_probs=172.1 Q ss_pred CchHHHHHhhcccc-----------ccccccccccchhh---hhh-ccccc-------cCcccccHHHHhhhHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVH-----------DQANQIVQNPIPQP---LDM-AGVKL-------EQATFSREHILESNEYIFSIVT 58 (435) Q Consensus 1 Mg~~~~~~~~~~~~-----------~~~~~~~~~~~~~~---~~~-~~~~~-------~~~~~~~~~~~~~~~~v~~~i~ 58 (435) =+|+.|+++++..- .............. -.+ .|-.. ...................+++ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~i~~ 82 (496) T protein:vir:38 3 NQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAK 82 (496) T ss_pred hHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHHHHH Confidence 12222222222110 00000000000000 000 11000 0000000011122344455677 Q ss_pred HHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCcee Q lcl|NC_019456. 59 RLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTV 138 (435) Q Consensus 59 ~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v 138 (435) ..|+-+..=|..+.-++.... ..+..+. ......+-...++.+...+|.+|+.+..+.. |.+ .+..++|..+ T Consensus 83 ~~a~~l~~~p~~i~~~d~~~~--e~l~~~~----~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~-~~~-~i~~v~~~~~ 154 (496) T protein:vir:38 83 YMSKLLFNEKVKINIDDKAAE--EFVLNVL----KTNGFTKNMERYIEYGEAMGGFVIKVYHDGN-KNV-KVSFATADCM 154 (496) T ss_pred HHhhhhhCCcceEeeCChHHH--HHHHHHH----hccCHHHHHHHHHHHHhhhCcEEEEEEEcCC-CcE-EEEEEcccce Confidence 777777666666543332221 2222222 2334666678889999999999999888754 554 4566677666 Q ss_pred EEEEcCCCce--------------EEEE-----------------Eec-CCe--eEEEch------------------hh Q lcl|NC_019456. 139 SILRNTDNNS--------------YWYR-----------------VTS-DIY--NFTIPI------------------ND 166 (435) Q Consensus 139 ~~~~~~~~~~--------------~~~~-----------------~~~-~~~--~~~~~~------------------~~ 166 (435) -+.....+.. +|+. +.. ++. ...++. -- T Consensus 155 ~P~~~~~~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~ 234 (496) T protein:vir:38 155 YPLSNDSENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVPLPDFTRPT 234 (496) T ss_pred EEEEecCCcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccccccccceeecCCCcce Confidence 5543333321 1100 000 000 000000 01 Q ss_pred eEEeccCC----CccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeC------CcCCHHHHHHHHHHHHHHh Q lcl|NC_019456. 167 VIHVKHVV----PSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYD------RSISPEKRQAMVNDFLRMV 236 (435) Q Consensus 167 iih~~~~~----~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~------~~~~~e~~~~~~~~~~~~~ 236 (435) +.|++.+- ......|+|.+..+...+.....+.....+-|..+...++.-. .....+.. ..+..-. T Consensus 235 f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~----~~~~~~~ 310 (496) T protein:vir:38 235 FIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTT----QYFDSTD 310 (496) T ss_pred EEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhccCCCCCccc----cCCCCcc Confidence 22333221 1245679999998888888766544444444554433222211 00000000 0000000 Q ss_pred cC--CCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHH---H----------HH Q lcl|NC_019456. 237 KE--NGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEH---V----------TH 301 (435) Q Consensus 237 ~~--~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~---~----------~~ 301 (435) +- .....-.+++..++.++.....-++.+..+....+|+...|+||..+|....+..+..+- . .. T Consensus 311 ~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~~~~ 390 (496) T protein:vir:38 311 EAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQ 390 (496) T ss_pred ceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHHHHHHH Confidence 00 000111223345666666555556777778888899999999999998765443211111 1 11 Q ss_pred HHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHh-CCCCCCCcC Q lcl|NC_019456. 302 SWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELE-GQAPIPDEA 380 (435) Q Consensus 302 ~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~-g~~p~~~~~ 380 (435) .+..+|..+++.+................+..+.+.++.-...|..+.++.+.+++..|+++.-.++..+ |.+ ++. T Consensus 391 ~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~~~~~---d~e 467 (496) T protein:vir:38 391 LIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRAWNIT---EAE 467 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCC---hHH Confidence 1223344444433332222111112222345577777777888999999999999999999988887654 332 222 Q ss_pred CceeeecccccchhccccccccccccccccccccccCCCCCCCCC Q lcl|NC_019456. 381 ADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNEN 425 (435) Q Consensus 381 gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (435) +++. ++.+.+ ..... .+.+..++..+++. T Consensus 468 a~~e--------l~ri~~---E~~~~-----~~~~d~~~~~~~~e 496 (496) T protein:vir:38 468 ADEW--------AEMLAK---EKQAE-----MPNNDMNGIFGEEE 496 (496) T ss_pred HHHH--------HHHHHH---hhhcc-----CccccccCCCCCCC Confidence 2221 111111 00000 00011111111111 No 174 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=98.55 E-value=3.1e-07 Score=56.26 Aligned_cols=395 Identities=9% Similarity=0.020 Sum_probs=160.5 Q ss_pred CchHH-HHHhhccccccccccccccchhhhhhc-ccccc--CcccccHHH---HhhhHHHHHHHHHHHHHHhhCceeeee Q lcl|NC_019456. 1 MSFMS-KVRQFFGVHDQANQIVQNPIPQPLDMA-GVKLE--QATFSREHI---LESNEYIFSIVTRLSNVLASLPLHEYQ 73 (435) Q Consensus 1 Mg~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~--~~~~~~~~~---~~~~~~v~~~i~~ia~~ia~~~~~~~~ 73 (435) ..++. .+.+.+......- .....++.-- ..... ......... ...+.+...+|+.++..+---.|.+ T Consensus 15 ~~~~~~~l~~~~~~~~~r~----~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~gf~~-- 88 (479) T protein:vir:99 15 AKYLETKVFPKMNTECERL----DDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQQLIVDGYRK-- 88 (479) T ss_pred HHHHHHHHHHHHHHHhHHH----HHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHhhcccccccC-- Confidence 11110 1111000000000 0000000000 00000 000000000 1122345556666665443222322 Q ss_pred cccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeee----CCCCcEEEEEEeCCceeEEEEcCCCc-- Q lcl|NC_019456. 74 NYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKS----LSTGEPIALWPLDPNTVSILRNTDNN-- 147 (435) Q Consensus 74 ~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~----~~~g~~~~l~~l~~~~v~~~~~~~~~-- 147 (435) ......+.+...+. + | ........+..+++.+|.||.++-.. ...|.+ .+..++|..+.+..+.... T Consensus 89 -~d~~~~~~~~~i~~-~-N---~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~-~i~~~~p~~~~~iydd~~~~~ 161 (479) T protein:vir:99 89 -TGTNENAKGWDTWR-L-N---QMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVA-RIKCIDPRDAFAIWEDPYWDE 161 (479) T ss_pred -CCchhhHHHHHHHH-h-c---ChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCce-EEEEechhheEEEecCCcccc Confidence 11112233444433 2 2 23456677889999999999887641 122333 4677788888766543221 Q ss_pred e--EEEEEecCCeeEEE-------------------------chhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 148 S--YWYRVTSDIYNFTI-------------------------PINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVEN 200 (435) Q Consensus 148 ~--~~~~~~~~~~~~~~-------------------------~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~ 200 (435) . +.+.....+....+ ..==|++|++.... ...|.|-+..+...+........ T Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~-~~~g~sd~e~v~~liDa~~~~~s 240 (479) T protein:vir:99 162 WPKYLLERQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDL-RGVCYGDVEPLVTVAKAIDKTGL 240 (479) T ss_pred eeeEEEeecCceeEEEEecceEEEEEecCCceeeccccccCCCCcceEEeecCCCc-CcCCcchhHHHHHHHHHHHHHHH Confidence 1 11111111111001 11114555543222 34688888777766666544322 Q ss_pred HH---HHHhhcCCceEEEeCCcCCHH-HHHHHHHHHHHHhcCCCccc-cccCCceeeeccCChhhHHHHHHHHHHHHHHH Q lcl|NC_019456. 201 FS---QNEMEKKDKFVLQYDRSISPE-KRQAMVNDFLRMVKENGGAV-VQEAGWKVDRYESKFEPADLSSVEQISRIRIA 275 (435) Q Consensus 201 ~~---~~~~~n~~~~~~~~~~~~~~e-~~~~~~~~~~~~~~~~~~~~-vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia 275 (435) -. ..++......++-. .+.++ ..+. ..+. -..++++ ..+++.++.+++.... ..+.+..+....+|+ T Consensus 241 ~~~~~~~~~a~p~~~i~G~--~~~~~~~~~~--~~~~---~~~~~i~~~~~~~~~~~q~~~~~~-~~~~~~l~~~i~~i~ 312 (479) T protein:vir:99 241 DILLVQHHQSFQIRWATGL--MLPEGANADQ--EKMR---FAQESMLISQNEKASFGAIPAAPL-DGLLNAYKESLLEFL 312 (479) T ss_pred HHHHHHHHhhchhhhhcCC--Ccccccccch--hccc---cccccceeecCCCceEEEecccch-HHHHHHHHHHHHHHh Confidence 22 12222222222211 11111 1110 0111 0112333 3456677776654332 246677777788899 Q ss_pred HHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhh Q lcl|NC_019456. 276 TAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGL 341 (435) Q Consensus 276 ~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l 341 (435) ..-++|+..+|...+. +.+++ ...|...+...++.+.... ..........+++.+... T Consensus 313 ~~t~~p~~~~g~~~n~---Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~-----~~~~~~~~~~i~~~w~~~ 384 (479) T protein:vir:99 313 ALAQLPPHIAGQIVNV---AADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIE-----GRTEEATDLDFTITWQDV 384 (479) T ss_pred ccCCCCHHHcccccch---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc-----CCCccccceeeeEEecCC Confidence 9999999999854332 11221 1222233333333222211 111111123355555556 Q ss_pred hccCHHHHHHHHHHHHhcCCcCHHHHHHHh-CCCCCCCcCCceeeeccc-ccchhccccccccccccccccccccccCCC Q lcl|NC_019456. 342 LRGDTAARTQYYQTLTRNGIFKPNEIRELE-GQAPIPDEAADHLYISKD-LYPLDKYYDAILDNKIQTDASVAAPKQEGG 419 (435) Q Consensus 342 ~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~-g~~p~~~~~gd~~~~~~n-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 419 (435) ...+..+.++.+.+++++|+++.-.+.+++ |+.+-. -+....-.. -...+...+....+..+. .+.++ T Consensus 385 ~~~s~~~~ad~~~kl~~ag~is~et~l~~l~gv~~~~---~e~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~ 454 (479) T protein:vir:99 385 TIQSLAQFADAWAKMVESLKIPAEGVWDMIPNLDQST---VNGWKEIYDREGDFGKYMRKLQNGPDPA-------EQRGG 454 (479) T ss_pred CCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHH---HHHHHHHHHHHHHHHHHHHHHhcccCcc-------cccCC Confidence 677888999999999999999998888877 765421 111100000 000010001100000111 11111 Q ss_pred CCCCCCCCCCCCCCCC Q lcl|NC_019456. 420 ENTNENGLQSTEPEGS 435 (435) Q Consensus 420 ~~~~~~~~~~~~~~~~ 435 (435) .++..+.++.++++|. T Consensus 455 ~~~~~~~~~~~~~~~~ 470 (479) T protein:vir:99 455 PNGATNMQQANNKTGE 470 (479) T ss_pred CCCCCCCCCCCCCCcc Confidence 1222222223332322 No 175 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=98.54 E-value=3.2e-07 Score=56.17 Aligned_cols=402 Identities=12% Similarity=0.023 Sum_probs=162.2 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCc-ccccHH---HHhhhHHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQA-TFSREH---ILESNEYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~---~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) --+.+.|...+......... ...++..-. ..... ...... ......+..-+|+.++..+--..|.+- +. T Consensus 15 ~~~~~~l~~~~~~~~~r~~~----l~~YY~G~~-~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l~~~g~~~~--~~ 87 (486) T protein:vir:42 15 AVVREEMISAFEDASKDLAS----NTSYYDAER-RPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQAVEGFRLG--DA 87 (486) T ss_pred HHHHHHHHHHHHHHHHHHHH----HHHHhcccC-cchhcccccchhHhhhhhccchHHHHHHHHHhhhcccceecC--CC Confidence 11233333332211110000 001111000 00000 000000 011223455566666655533444431 21 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCC-----C-cEEEEEEeCCceeEEEEcCCCceEE Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLST-----G-EPIALWPLDPNTVSILRNTDNNSYW 150 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~-----g-~~~~l~~l~~~~v~~~~~~~~~~~~ 150 (435) ...+..+.. +.. -| ........+..+++.+|.||+++.++... + ....+.+++|..+.+..+....... T Consensus 88 ~~~~~~~~~-i~~-~N---~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~d~~~~~~~ 162 (486) T protein:vir:42 88 DEADEELWQ-WWQ-AN---NLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEIDPRINRVS 162 (486) T ss_pred chhHHHHHH-HHH-hc---ChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEEeCCCCCeE Confidence 112222333 322 23 23455678899999999999988754211 1 1235677888888777664221110 Q ss_pred ----EEEecCCee----EEEchhhe-------------------------EEeccCCCccccccCcHHHH-H---HHHHH Q lcl|NC_019456. 151 ----YRVTSDIYN----FTIPINDV-------------------------IHVKHVVPSNSWYGVSPIDV-L---SSSLK 193 (435) Q Consensus 151 ----~~~~~~~~~----~~~~~~~i-------------------------ih~~~~~~~~~~~G~s~l~~-~---~~~i~ 193 (435) +.....+.. ..|.++.+ ++|++.....+.+|.|-+.. + .+.+. T Consensus 163 ~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~ 242 (486) T protein:vir:42 163 KAIRVAYDKEGNEIQAATLYTPMETIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAA 242 (486) T ss_pred EEEEEEEecCCCeEEEEEEEcCCcEEEEEecCCcEEeecceecCCCCceEEEeccccccCCCCCcccchhhHHHHHHHHH Confidence 111111110 01222222 33333222344567775542 2 23333 Q ss_pred HHHHHHHHHHHHhhcCCceEEEeC-CcCCHHHHHHHHHHHHHHhcCCCcccccc-CCceeeeccCChhhHHHHHHHHHHH Q lcl|NC_019456. 194 FQRSVENFSQNEMEKKDKFVLQYD-RSISPEKRQAMVNDFLRMVKENGGAVVQE-AGWKVDRYESKFEPADLSSVEQISR 271 (435) Q Consensus 194 ~~~~~~~~~~~~~~n~~~~~~~~~-~~~~~e~~~~~~~~~~~~~~~~~~~~vl~-~g~~~~~~~~~~~~~~~~e~~~~~~ 271 (435) ...+-......++......+.-.+ .....+. .+....|. ...+++.+++ ++.++.++.....+ .+++..+... T Consensus 243 ~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~-~~~~~~~~---~~~~~~~~~~~~~~~~~q~~~~~~e-~~~~~l~~~i 317 (486) T protein:vir:42 243 RILMLMQATAELMGVPQRLIFGIKPEEIGVDS-ETGQTLFD---AYLARILAFEDAEGKIQQFSAAELA-NFTNALDQIA 317 (486) T ss_pred HHHHHHHHHHHhhcchHHHhhcCCcccccccc-ccccchhh---hhhchhcccCCCCceEEeecccCHH-HHHHHHHHHH Confidence 222211112222222211111111 0111000 00011111 1224455554 45677666554333 3777777888 Q ss_pred HHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeec Q lcl|NC_019456. 272 IRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFN 337 (435) Q Consensus 272 ~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd 337 (435) .+++..-++|+..+|....+.. +.+++ ...|...+...++.+....+..-. + .....+++. T Consensus 318 ~~~s~~~~~p~~~fg~~~~n~~-Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~-~---~d~~~i~v~ 392 (486) T protein:vir:42 318 KQVAAYTGLPPQYLSTAADNPA-SAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGDV-P---PDMLRMETV 392 (486) T ss_pred HHHhcccCCCHHHhccccCchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-c---ccceeeeEE Confidence 8899999999999987653321 22221 222333333333322221111000 1 111235555 Q ss_pred hhhhhccCHHHHHHHHHHHHhc--CCcCHHHHHHHhCCCCCCCcCCceeeecc---cccchhcccccccccccccccccc Q lcl|NC_019456. 338 VNGLLRGDTAARTQYYQTLTRN--GIFKPNEIRELEGQAPIPDEAADHLYISK---DLYPLDKYYDAILDNKIQTDASVA 412 (435) Q Consensus 338 ~~~l~~~d~~~~~~~~~~~~~~--g~~t~NE~R~~~g~~p~~~~~gd~~~~~~---n~~~l~~~~~~~~~~~~~~~~~~~ 412 (435) +......+..+.++.+.+++++ |+++..-+++.+|+.+-+.+...++.--. ....++.......... T Consensus 393 w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~-------- 464 (486) T protein:vir:42 393 WRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTVP-------- 464 (486) T ss_pred ecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCC-------- Confidence 5666678888999999999986 67887778888887643321111110000 0000111100000000 Q ss_pred ccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 413 APKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) .....++ +..+++.+++.|. T Consensus 465 ~~~~~~~---~~~~~~~~~~~~~ 484 (486) T protein:vir:42 465 GSPSPTA---PPKPQPAIESSGG 484 (486) T ss_pred CCCCCCC---CCCCCcccCCCCC Confidence 0001111 1122222233322 No 176 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=98.50 E-value=4.3e-07 Score=55.47 Aligned_cols=376 Identities=12% Similarity=0.026 Sum_probs=165.9 Q ss_pred Cch---HHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccc Q lcl|NC_019456. 1 MSF---MSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQ 77 (435) Q Consensus 1 Mg~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 77 (435) ... ++++.....+.. .... ..... ...+ .-..++....+|+..+.-+-.-|+.+.-.... T Consensus 39 ~~~~~~~~~l~~Yy~g~~---~i~~-----------~~~~~-~~~~--~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~~d~ 101 (470) T protein:vir:99 39 TVLKPRYRENMKLYLGKH---KILT-----------APEKE-TGAD--NRIVVNSAKYVVDVYNGYFCGIEPKLALLNDS 101 (470) T ss_pred HhhHHHHHHHHHHhcccc---cccc-----------Ccccc-cCCc--ceeecchHHHHHHHHhhhhccCCeeEeeCCch Confidence 111 122222211110 0000 00000 0000 01123445556666666665557665433222 Q ss_pred cccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCce--E----EE Q lcl|NC_019456. 78 MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNS--Y----WY 151 (435) Q Consensus 78 ~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~--~----~~ 151 (435) .....+...+. ..........+....+.+|.+|.++..+. .|.+ .+..++|..+.+..+..... . +| T Consensus 102 ~~~~~l~~~~~-----~n~~~~~~~~~~~~~~~~G~~~~~v~~d~-dg~~-~i~~~~p~~~~~i~d~~~~~~~~~~vr~~ 174 (470) T protein:vir:99 102 SKIDEIARWNR-----QENFFDTINEISKQCDIFGRSIASIYQGE-DARP-HLMYSSPNHAFIIYDDTVQRQPLAFVHYQ 174 (470) T ss_pred hHHHHHHHHHH-----hcCHhHHHHHHHHHHHhcCeeEEEEEeCC-CCeE-EEEEEccceeEEEEcCCCCcceEEEEEEE Confidence 22223333332 33566778899999999999999887764 4765 57889999988877765321 1 11 Q ss_pred EEecCCee----EEEchhheEEeccC-------------CC---------ccccccCcHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 152 RVTSDIYN----FTIPINDVIHVKHV-------------VP---------SNSWYGVSPIDVLSSSLKFQRSVENFSQNE 205 (435) Q Consensus 152 ~~~~~~~~----~~~~~~~iih~~~~-------------~~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~ 205 (435) ....++.. ..+..+.++++... ++ .+...|.|-+..+...+.....+.....+. T Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~ 254 (470) T protein:vir:99 175 IDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAINPYGLVPAVEFFENEERQGIFDSIKTLINALDKVISQKANQ 254 (470) T ss_pred EEecCCeeEEEEEEEecCeEEEEEecccccccccccccccCCCccceEeecCCCCCCcchHhHHHHHHHHHHHHHHHHHH Confidence 11112111 12334444443211 11 123457777777777666655433222222 Q ss_pred hhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccc-----cCCceeeeccCChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_019456. 206 MEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQ-----EAGWKVDRYESKFEPADLSSVEQISRIRIATAF 278 (435) Q Consensus 206 ~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl-----~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~f 278 (435) +... +..++.. ....+++.-+....+ .. .+++.+ +.+.++..+........+....+...+.|+..- T Consensus 255 ~~~~~~~~~~i~g-~~~~~~~~g~~~~~~----~~-~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s 328 (470) T protein:vir:99 255 VEYFDNAYMYMIG-FKLPEDDEGNPKFDF----KN-NRVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMA 328 (470) T ss_pred HHHhcCceeeeec-CCcccccccchhhhh----hh-cceeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHh Confidence 2222 2222222 122221111111111 11 122222 344555556555444556677788889999999 Q ss_pred CCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhcc Q lcl|NC_019456. 279 NVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRG 344 (435) Q Consensus 279 gvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~ 344 (435) ++|....+... ++. +..+. ...|...|...++.+...+..+--... ....+++.+..-... T Consensus 329 ~~p~~~~~~~~-~n~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~---~~~~i~v~f~~~~p~ 403 (470) T protein:vir:99 329 MVPNIQDKNFA-GNS-SGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQE---LWSELDFKFTRNLPE 403 (470) T ss_pred CCccccccccc-cCc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccc---ccccceEEeCCCCCc Confidence 99976443322 211 11221 122333444444444433332211111 112345555666778 Q ss_pred CHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCC Q lcl|NC_019456. 345 DTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 345 d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (435) |..+.++.+.++. |+++...++++++.-. ++... +.+.+..........+. ..+ .+.. T Consensus 404 ~~~e~a~~~~kl~--giis~et~l~~l~~vd-~~~E~------------eri~~E~~~~~~~~~~~--~~~-----~d~~ 461 (470) T protein:vir:99 404 DMASAIDNAKNAE--GIVSKKTQLGMIPDIE-PDAEM------------KQIAKEKADAIKQTQQL--SMP-----IDIL 461 (470) T ss_pred CHHHHHHHHHHHh--ccCCHHHHHHhCCCCC-HHHHH------------HHHHHHHHHHHHHHHhh--cCC-----CCcC Confidence 8899999999885 7899888888875431 11111 11111100000000000 000 0001 Q ss_pred CCCCCCCCC Q lcl|NC_019456. 425 NGLQSTEPE 433 (435) Q Consensus 425 ~~~~~~~~~ 433 (435) .+++.++.| T Consensus 462 ~~d~~~ee~ 470 (470) T protein:vir:99 462 KRDNNAEEE 470 (470) T ss_pred CCCCCccCC Confidence 111111111 No 177 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=98.47 E-value=5.4e-07 Score=54.94 Aligned_cols=393 Identities=11% Similarity=0.052 Sum_probs=175.7 Q ss_pred CchHHHHHhh-------------ccccccccccccccchhhhh---------------------hccccccCcccccHHH Q lcl|NC_019456. 1 MSFMSKVRQF-------------FGVHDQANQIVQNPIPQPLD---------------------MAGVKLEQATFSREHI 46 (435) Q Consensus 1 Mg~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~---------------------~~~~~~~~~~~~~~~~ 46 (435) |.|.+-+-.. .........-. ......+. ............+.. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~-~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k- 78 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERM-VNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNK- 78 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHH-HHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccc- Confidence 5554333211 00000000000 00000000 000000000000000 Q ss_pred HhhhHHHHHHHHHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc Q lcl|NC_019456. 47 LESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE 126 (435) Q Consensus 47 ~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~ 126 (435) ..++....+|+..+.-+-.-|+.+.-......+..+...|. +-............+..++..+|.||.++..+. .|. T Consensus 79 -i~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~-~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~-~~~ 155 (474) T protein:vir:94 79 -LNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFIT-NFAIRNSVDDEDSEIGKMAAICGYGARLAYIDT-NGD 155 (474) T ss_pred -cccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHH-HHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCC-CCe Confidence 12345555666666666566776543222222222222222 222333566777888999999999998877654 465 Q ss_pred EEEEEEeCCceeEEEEcCCCceEE----EEEec--CCee----EEEchhheEEeccC------------CC--------- Q lcl|NC_019456. 127 PIALWPLDPNTVSILRNTDNNSYW----YRVTS--DIYN----FTIPINDVIHVKHV------------VP--------- 175 (435) Q Consensus 127 ~~~l~~l~~~~v~~~~~~~~~~~~----~~~~~--~~~~----~~~~~~~iih~~~~------------~~--------- 175 (435) + .+..++|..+.+..+..+...+ |.... ++.. ..+....+.+++.. ++ T Consensus 156 ~-~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 234 (474) T protein:vir:94 156 I-RIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGV 234 (474) T ss_pred e-EEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEe Confidence 4 6778889888777766553321 11111 1110 12233333333311 00 Q ss_pred ccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeee Q lcl|NC_019456. 176 SNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDR 253 (435) Q Consensus 176 ~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~ 253 (435) .+...|.|-+..+...+.....+..-..+.+... +..++ .+..++++....+ ...+.+.+.+.+.+++. T Consensus 235 ~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i-~g~~~~~~~~~~~--------~~~~~i~~~~~~~~~~~ 305 (474) T protein:vir:94 235 PNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVL-RGMGMSEEMIQET--------QKSGAFELFDKDMDVKY 305 (474) T ss_pred cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh-ccCCCCchhhhhh--------hhcceeEecCCCCceeE Confidence 1223577777776666665544333333222222 22222 2223444433322 22344555666666666 Q ss_pred ccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHH--------------HHHHHHHHHhHHHHHHHHHHH Q lcl|NC_019456. 254 YESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEH--------------VTHSWTMTLMPIIRQYESQFN 319 (435) Q Consensus 254 ~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~--------------~~~~~~~~i~P~~~~i~~~l~ 319 (435) +........+....+...+.|...-++|....+.... +. +..+ ....|...+...++.+...+. T Consensus 306 l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~ 383 (474) T protein:vir:94 306 LTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNG-NV-PIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALK 383 (474) T ss_pred EeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 6655445557777888889999999998754432221 11 1111 122344445555555555444 Q ss_pred HhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccc Q lcl|NC_019456. 320 MKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDA 399 (435) Q Consensus 320 ~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~ 399 (435) .+-..... ..-..+++.+..-...|..+.++.+.++ .|+++..-+.+++++-.=+... ++.+.+. T Consensus 384 ~~~~~~~~-~~~~~i~~~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~d~~~E------------~eri~~E 448 (474) T protein:vir:94 384 RKGYNLDD-DSYLNLIFKFTRNIPVNKLEESQVLINL--KGQVSERTRLGQSQLVDDVDYE------------LDEMEKE 448 (474) T ss_pred hccCCCCc-cccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHH------------HHHHHHH Confidence 33111100 0111345555666678899999999988 4889998888888643211111 1111111 Q ss_pred cccccccccccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 400 ILDNKIQTDASVAAPKQEGGENTNENGLQSTEPE 433 (435) Q Consensus 400 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (435) ... ... ......+++.++.+ ...++| T Consensus 449 ~~e---~~~--~~~~~~~~~~~~~~---~~~~s~ 474 (474) T protein:vir:94 449 SLE---FND--KLPDIDEGDANDKS---QNNQSE 474 (474) T ss_pred HHH---HHh--hcccccCCCcCCCC---ccccCC Confidence 000 000 00111111111111 122222 No 178 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=98.47 E-value=5.4e-07 Score=54.94 Aligned_cols=393 Identities=11% Similarity=0.052 Sum_probs=175.7 Q ss_pred CchHHHHHhh-------------ccccccccccccccchhhhh---------------------hccccccCcccccHHH Q lcl|NC_019456. 1 MSFMSKVRQF-------------FGVHDQANQIVQNPIPQPLD---------------------MAGVKLEQATFSREHI 46 (435) Q Consensus 1 Mg~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~---------------------~~~~~~~~~~~~~~~~ 46 (435) |.|.+-+-.. .........-. ......+. ............+.. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~-~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k- 78 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERM-VNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNK- 78 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHH-HHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccc- Confidence 5554333211 00000000000 00000000 000000000000000 Q ss_pred HhhhHHHHHHHHHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc Q lcl|NC_019456. 47 LESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE 126 (435) Q Consensus 47 ~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~ 126 (435) ..++....+|+..+.-+-.-|+.+.-......+..+...|. +-............+..++..+|.||.++..+. .|. T Consensus 79 -i~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~-~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~-~~~ 155 (474) T protein:vir:10 79 -LNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFIT-NFAIRNSVDDEDSEIGKMAAICGYGARLAYIDT-NGD 155 (474) T ss_pred -cccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHH-HHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCC-CCe Confidence 12345555666666666566776543222222222222222 222333566777888999999999998877654 465 Q ss_pred EEEEEEeCCceeEEEEcCCCceEE----EEEec--CCee----EEEchhheEEeccC------------CC--------- Q lcl|NC_019456. 127 PIALWPLDPNTVSILRNTDNNSYW----YRVTS--DIYN----FTIPINDVIHVKHV------------VP--------- 175 (435) Q Consensus 127 ~~~l~~l~~~~v~~~~~~~~~~~~----~~~~~--~~~~----~~~~~~~iih~~~~------------~~--------- 175 (435) + .+..++|..+.+..+..+...+ |.... ++.. ..+....+.+++.. ++ T Consensus 156 ~-~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 234 (474) T protein:vir:10 156 I-RIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGV 234 (474) T ss_pred e-EEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEe Confidence 4 6778889888777766553321 11111 1110 12233333333311 00 Q ss_pred ccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeee Q lcl|NC_019456. 176 SNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDR 253 (435) Q Consensus 176 ~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~ 253 (435) .+...|.|-+..+...+.....+..-..+.+... +..++ .+..++++....+ ...+.+.+.+.+.+++. T Consensus 235 ~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i-~g~~~~~~~~~~~--------~~~~~i~~~~~~~~~~~ 305 (474) T protein:vir:10 235 PNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVL-RGMGMSEEMIQET--------QKSGAFELFDKDMDVKY 305 (474) T ss_pred cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh-ccCCCCchhhhhh--------hhcceeEecCCCCceeE Confidence 1223577777776666665544333333222222 22222 2223444433322 22344555666666666 Q ss_pred ccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHH--------------HHHHHHHHHhHHHHHHHHHHH Q lcl|NC_019456. 254 YESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEH--------------VTHSWTMTLMPIIRQYESQFN 319 (435) Q Consensus 254 ~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~--------------~~~~~~~~i~P~~~~i~~~l~ 319 (435) +........+....+...+.|...-++|....+.... +. +..+ ....|...+...++.+...+. T Consensus 306 l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~ 383 (474) T protein:vir:10 306 LTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNG-NV-PIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALK 383 (474) T ss_pred EeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 6655445557777888889999999998754432221 11 1111 122344445555555555444 Q ss_pred HhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccc Q lcl|NC_019456. 320 MKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDA 399 (435) Q Consensus 320 ~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~ 399 (435) .+-..... ..-..+++.+..-...|..+.++.+.++ .|+++..-+.+++++-.=+... ++.+.+. T Consensus 384 ~~~~~~~~-~~~~~i~~~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~d~~~E------------~eri~~E 448 (474) T protein:vir:10 384 RKGYNLDD-DSYLNLIFKFTRNIPVNKLEESQVLINL--KGQVSERTRLGQSQLVDDVDYE------------LDEMEKE 448 (474) T ss_pred hccCCCCc-cccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHH------------HHHHHHH Confidence 33111100 0111345555666678899999999988 4889998888888643211111 1111111 Q ss_pred cccccccccccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 400 ILDNKIQTDASVAAPKQEGGENTNENGLQSTEPE 433 (435) Q Consensus 400 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (435) ... ... ......+++.++.+ ...++| T Consensus 449 ~~e---~~~--~~~~~~~~~~~~~~---~~~~s~ 474 (474) T protein:vir:10 449 SLE---FND--KLPDIDEGDANDKS---QNNQSE 474 (474) T ss_pred HHH---HHh--hcccccCCCcCCCC---ccccCC Confidence 000 000 00111111111111 122222 No 179 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=98.45 E-value=6e-07 Score=54.70 Aligned_cols=407 Identities=10% Similarity=0.068 Sum_probs=172.9 Q ss_pred CchHHHHHhhccccccccccccccchhhhhh----ccc-cccC-cccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDM----AGV-KLEQ-ATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN 74 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~-~~~~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 74 (435) ....+.+..++.......-.+-.....++.- ... .... ....+.. ..++.....++..+.-+-.-|+.+..+ T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~k--i~~n~~k~Iv~~~~~yl~g~p~~~~~~ 116 (511) T protein:vir:93 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR--VAHDYASYISDFINGYFLGNPIQYQDD 116 (511) T ss_pred hccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcce--eecchHHHHHHHHhhhhcccCeeeccC Confidence 1111111111110000000000000000000 000 0000 0000000 112444556666666666667666433 Q ss_pred ccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC--CceE--- Q lcl|NC_019456. 75 YKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD--NNSY--- 149 (435) Q Consensus 75 ~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~~~--- 149 (435) .... . ..+..+. ...........+..+++.+|.||.++..+. .|.+ .+..++|..+.+..+.. +... T Consensus 117 d~~~-~-~~l~~~~----~~n~~~~~~~~~~~~~~~~G~ay~~vy~de-~~~~-~i~~~~p~~~~~vydd~~~~~~~~~v 188 (511) T protein:vir:93 117 DKDV-L-EVIEAFN----DLNDVESHNRSLGLDLSIYGKAYELMIRNQ-DDET-RLYKSDAMSTFVIYDNTIERNSIAGV 188 (511) T ss_pred ChHH-H-HHHHHHH----hhcCHhHHHHHHHHHHHhcCeeEEEEEeCC-CCce-EEEEEccceeEEEEcCCCCCceEEEE Confidence 2221 1 1222222 233466777888999999999999988764 4654 57889999888777653 2211 Q ss_pred -EEEEe-cCC---e----eEEEchhheEEeccCCC-------------------------ccccccCcHHHHHHHHHHHH Q lcl|NC_019456. 150 -WYRVT-SDI---Y----NFTIPINDVIHVKHVVP-------------------------SNSWYGVSPIDVLSSSLKFQ 195 (435) Q Consensus 150 -~~~~~-~~~---~----~~~~~~~~iih~~~~~~-------------------------~~~~~G~s~l~~~~~~i~~~ 195 (435) +|... ..+ . ...+.++.|.+++.... .+...|.|-+..+...+... T Consensus 189 r~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~gd~e~v~~liDa~ 268 (511) T protein:vir:93 189 RYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLY 268 (511) T ss_pred EEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCCCccceEEecCCCCCCCchhhHHHHHHHH Confidence 11111 111 0 12355566665532110 01235777777777777765 Q ss_pred HHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcC----CCccccccCCceeeeccCChhhHHHHHHHHH Q lcl|NC_019456. 196 RSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKE----NGGAVVQEAGWKVDRYESKFEPADLSSVEQI 269 (435) Q Consensus 196 ~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~----~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~ 269 (435) ..+..-..+.+... +..++......+.+..++..+........ .+...-.+.+.++..++.......+....+. T Consensus 269 d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 348 (511) T protein:vir:93 269 DNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDR 348 (511) T ss_pred HHHHHHHHHHHHHhhCcceeeecCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHH Confidence 54433333333322 22333332333444433322221110100 1111223445566555554444556777778 Q ss_pred HHHHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceee Q lcl|NC_019456. 270 SRIRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFS 335 (435) Q Consensus 270 ~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~ 335 (435) ..+.|...-++|..-.+.... +- +..++ ...|...+...++.+...+..+.-..... .-..++ T Consensus 349 L~~~I~~~s~~P~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~-d~~~i~ 425 (511) T protein:vir:93 349 LNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANK-DFNTVR 425 (511) T ss_pred HHHHHHHHhCCcccccccccc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc-ccccce Confidence 889999999998765433221 11 11221 22334444444444444333221111000 011245 Q ss_pred echhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccc-ccccccccccc Q lcl|NC_019456. 336 FNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDN-KIQTDASVAAP 414 (435) Q Consensus 336 fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~-~~~~~~~~~~~ 414 (435) +.+..-...|..+.++.+.++ .|+++.--+++++++-+-+++..++ +..-.+..... .....+. .. T Consensus 426 ~~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~~r---------i~~E~~~~~~~~~~~~~~~--~~ 492 (511) T protein:vir:93 426 YVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKK---------IEEDEKESIKKAQKGIYKD--PR 492 (511) T ss_pred EEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHH---------HHHHHHHHHHHHhhhcccC--CC Confidence 545666678889899998888 5889988888887543211111111 11000000000 0000000 00 Q ss_pred ccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 415 KQEGGENTNENGLQSTEPE 433 (435) Q Consensus 415 ~~~~~~~~~~~~~~~~~~~ 433 (435) ..+.++++++++..+++.| T Consensus 493 ~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 493 DINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCCcccccccccC Confidence 0111111122222222222 No 180 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=98.44 E-value=6.4e-07 Score=54.55 Aligned_cols=404 Identities=12% Similarity=0.052 Sum_probs=163.6 Q ss_pred Cch----HHHHHhhccccccccccccccchhhhhhccccccCcccccH---HHHhhhHHHHHHHHHHHHHHhhCceeeee Q lcl|NC_019456. 1 MSF----MSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSRE---HILESNEYIFSIVTRLSNVLASLPLHEYQ 73 (435) Q Consensus 1 Mg~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~v~~~i~~ia~~ia~~~~~~~~ 73 (435) |+- +++|.+.+......- .....++.--..-......... ..-....+...+|+..+..+---.|.+- T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~----~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~- 75 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNL----LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS- 75 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHH----HHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhhccCceecC- Confidence 543 122222111110000 0000111000000000000000 0001123444566666655433334321 Q ss_pred cccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeee-----CCCCcEEEEEEeCCceeEEEEcCCC-- Q lcl|NC_019456. 74 NYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKS-----LSTGEPIALWPLDPNTVSILRNTDN-- 146 (435) Q Consensus 74 ~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~-----~~~g~~~~l~~l~~~~v~~~~~~~~-- 146 (435) ++ ......+..++. + | ........+..+++.+|.||.++.+. ...|.+ .+.+++|..+.+..+... T Consensus 76 ~d-~~~~~~l~~i~~-~-N---~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~i~D~~~~~ 148 (480) T protein:vir:78 76 ED-SEGLEELWNWWQ-A-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTR 148 (480) T ss_pred CC-chhHHHHHHHHH-h-c---CHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCee-EEEEEcccceEEEEcCCCcc Confidence 11 112233433332 1 2 34566788899999999999887642 123444 577888988888776531 Q ss_pred ceEE----EEEecC-Ce---eEEEchhhe-----------------------------EEeccCCCccccccCcHHHH-H Q lcl|NC_019456. 147 NSYW----YRVTSD-IY---NFTIPINDV-----------------------------IHVKHVVPSNSWYGVSPIDV-L 188 (435) Q Consensus 147 ~~~~----~~~~~~-~~---~~~~~~~~i-----------------------------ih~~~~~~~~~~~G~s~l~~-~ 188 (435) ...+ +....+ +. ...+.++.+ +||.+.......+|.|-+.. + T Consensus 149 ~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i 228 (480) T protein:vir:78 149 RVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPEL 228 (480) T ss_pred ceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCccchhHHH Confidence 1111 111111 11 112223333 33433322344567776543 3 Q ss_pred HHHHHHHHHH-HH--HHHHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCCCcccccc-CCceeeeccCChhhHHHH Q lcl|NC_019456. 189 SSSLKFQRSV-EN--FSQNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQE-AGWKVDRYESKFEPADLS 264 (435) Q Consensus 189 ~~~i~~~~~~-~~--~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~-~g~~~~~~~~~~~~~~~~ 264 (435) ...+...... .+ ....++.+... ++. +..+.+...+.-...|. ...+.++.++ ++.++.++.....+ .+. T Consensus 229 ~~l~Da~~~~~s~~~~~~~~~a~p~~-~i~-G~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 302 (480) T protein:vir:78 229 RKVTDAASRTLMNLQSASQILGTPLR-VIS-GVTTDELTNDGENTTLD---IYYGRILTLASEAAKISEFKAAELR-NFA 302 (480) T ss_pred HHHHHHHHHHHHHHHHHHHhhcchhh-hhh-CCCccccccccccchhh---hhhhhhccCCCCCceEEecCccCHH-HHH Confidence 3333332221 11 12222222211 111 11111111010001111 1123344443 45677766654433 377 Q ss_pred HHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH--------------HHHHHHHhHHHHHHHHHHHHhhcccccccC Q lcl|NC_019456. 265 SVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT--------------HSWTMTLMPIIRQYESQFNMKLFTPGKRVK 330 (435) Q Consensus 265 e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~ 330 (435) +..+..+.+|+..-++|+..+|....+.. +.++.. ..|...+.-.++.+.. +........ T Consensus 303 ~~l~~~i~~~~~~~~~p~~~fg~~~~n~~-Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~-----~~~~~~~~~ 376 (480) T protein:vir:78 303 EEMEVFRKEAASITGLPPQYLSSSSENPA-SAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ-----IMGREVTEE 376 (480) T ss_pred HHHHHHHHHHhcccCCCHHHhccccCchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HcCCCcccc Confidence 77788889999999999999987543211 222211 1122222222222211 111111112 Q ss_pred cceeeechhhhhccCHHHHHHHHHHHHhcC--CcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccccccccccc Q lcl|NC_019456. 331 GFYFSFNVNGLLRGDTAARTQYYQTLTRNG--IFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTD 408 (435) Q Consensus 331 g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~ 408 (435) ...+++.+......+..+.++.+.+++.+| +++..-+++++|+.+-+.+.-++..-.....+++..... .. T Consensus 377 ~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~-------~~ 449 (480) T protein:vir:78 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYST-------TK 449 (480) T ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhcc-------cc Confidence 234555556666777888898899898876 567666788898875332111111111111111211111 11 Q ss_pred ccccc-cccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 409 ASVAA-PKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 409 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 435 (435) ++..+ +....++..++....+.....| T Consensus 450 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (480) T protein:vir:78 450 AQADATPKPTVTETKTETQTSPSGFNRT 477 (480) T ss_pred CCCccccCCCCCCCCCccCCCcccCCCc Confidence 11111 1111222222222222222222 No 181 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=98.44 E-value=6.5e-07 Score=54.49 Aligned_cols=389 Identities=9% Similarity=-0.031 Sum_probs=166.7 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhcc-ccccC--------cccccHHHHhhhHHHHHHHHHHHHHHhhCceee Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAG-VKLEQ--------ATFSREHILESNEYIFSIVTRLSNVLASLPLHE 71 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~--------~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~ 71 (435) +-+.+.|..+......... .-.....++.--. +.... ........-..++....+|+..+.-+-.-|+.+ T Consensus 34 e~~~~~i~~~i~~~~~~~~-r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~ 112 (483) T protein:vir:12 34 ETLEEMIVRYIKQHLEKLP-EISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF 112 (483) T ss_pred hhHHHHHHHHHHHHHHHHH-HHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHhhhhcccCcee Confidence 2222232222211110000 0000000000000 00000 000000001124556667777777766667665 Q ss_pred eecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC--CceE Q lcl|NC_019456. 72 YQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD--NNSY 149 (435) Q Consensus 72 ~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~~~ 149 (435) ..++.... ..+..++. | ........+..+++.+|.+|.++..+.. |.+ .+..++|..+.+..+.. +... T Consensus 113 ~~~d~~~~--~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~y~~v~~d~d-~~~-~i~~~~p~~~~~v~d~~~~~~~~ 183 (483) T protein:vir:12 113 KHTDDEVV--KRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEE-GEF-KLFRVPAEQGIPIWTDKEHEELE 183 (483) T ss_pred ccCChHHH--HHHHHHHh--c---cHHHHHHHHHHHHhhCCeEEEEEEEcCC-Cce-EEEEEcccceEEEEcCCCCCceE Confidence 43332221 12222321 2 2345556678899999999998877644 665 58889999888776532 2111 Q ss_pred ----EEEEecCCeeEEEchhheEEeccC---------------------CCc---------cccccCcHHHHHHHHHHHH Q lcl|NC_019456. 150 ----WYRVTSDIYNFTIPINDVIHVKHV---------------------VPS---------NSWYGVSPIDVLSSSLKFQ 195 (435) Q Consensus 150 ----~~~~~~~~~~~~~~~~~iih~~~~---------------------~~~---------~~~~G~s~l~~~~~~i~~~ 195 (435) +|..........+.+..+.|+... ++. +...|.|-+..+...+... T Consensus 184 ~~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~ 263 (483) T protein:vir:12 184 AFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAY 263 (483) T ss_pred EEEEEEEeecceEEEEEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEEecCCCCCCCchhhHHHHHHHH Confidence 122222222233444444443211 000 1235777777777776655 Q ss_pred HHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHH Q lcl|NC_019456. 196 RSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIR 273 (435) Q Consensus 196 ~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~ 273 (435) ..+.....+.+... +..+++.-. .+........+ + ..+++.++.+.++..+........+....+...+. T Consensus 264 d~~~S~~~~~~~~~~~~~lv~~g~~---~~~~~~~~~~~----~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~ 335 (483) T protein:vir:12 264 NRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLL----R-YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQK 335 (483) T ss_pred HHHHHHHHHHHHHhcCceeeeecCC---cccchhHHHhh----h-hccccccCCCCcceEEeecCCHHHHHHHHHHHHHH Confidence 54333222222222 222332211 11112111111 1 22344455555555555444455677777788888 Q ss_pred HHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechh Q lcl|NC_019456. 274 IATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVN 339 (435) Q Consensus 274 Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~ 339 (435) |+..-++|....+.... +. +.++. ...|...+...++.+...+.. ... ...+++.+. T Consensus 336 I~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~-----~~~--~~~i~v~f~ 406 (483) T protein:vir:12 336 IMLFGQAVDFSSDKFGS-AP-SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI-----KGE--HKDVDISFN 406 (483) T ss_pred HHHHhCCCCCCcccccc-Cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-----CCc--cceeeEEeC Confidence 99999998654432211 11 11221 222333444444444333221 111 233455556 Q ss_pred hhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCC Q lcl|NC_019456. 340 GLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGG 419 (435) Q Consensus 340 ~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 419 (435) .-...|..+.++.+.++ .|+++..-++++++.-.-++...++ +.+..... ............+ T Consensus 407 ~~~p~~~~~~a~~~~kl--~GiiS~et~~~~~~~v~d~~~E~~r------------i~~E~~~~---~~~~~~~~~~~~d 469 (483) T protein:vir:12 407 YNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELER------------IEQEQMEY---NKQLPNLDDGGAD 469 (483) T ss_pred CCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHH------------HHHHHHHH---HhhcccccccccC Confidence 66778899999999988 5899998888888653211111111 11100000 0000000000000 Q ss_pred CCCCCCCCCCCCCC Q lcl|NC_019456. 420 ENTNENGLQSTEPE 433 (435) Q Consensus 420 ~~~~~~~~~~~~~~ 433 (435) ....++.....++| T Consensus 470 ~~~~~~~~~~~e~e 483 (483) T protein:vir:12 470 GAQQQERSNNKESE 483 (483) T ss_pred CcccCCCCCcccCC Confidence 00011111112222 No 182 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=98.43 E-value=6.8e-07 Score=54.38 Aligned_cols=358 Identities=11% Similarity=0.074 Sum_probs=157.8 Q ss_pred Cch--HHHHHhhccccccccccccccchhhhhhccccccCcc-cccHH--HHh--hhHHHHHHHHHHHHHHhhCceeeee Q lcl|NC_019456. 1 MSF--MSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQAT-FSREH--ILE--SNEYIFSIVTRLSNVLASLPLHEYQ 73 (435) Q Consensus 1 Mg~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~--~~~--~~~~v~~~i~~ia~~ia~~~~~~~~ 73 (435) |-. +++|...+......... ...++..- -...... .+... ... -..+..-+|+.+|..+.=-.|.+ T Consensus 1 m~~~~i~~L~~~~~~~~~r~~~----~~~yy~g~-~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~Gf~~-- 73 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTGVDK----RYRYYAMD-DRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRIIFREFTN-- 73 (422) T ss_pred CChHHHHHHHHHHHHHHHHHHH----HHHHHhcC-CChhhcCccccHHHHHHHHhhcchhHHHHHHHHhccccceeeC-- Confidence 432 23444433222211110 01111100 0000000 01111 010 11233344454444222222221 Q ss_pred cccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceE---- Q lcl|NC_019456. 74 NYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSY---- 149 (435) Q Consensus 74 ~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~---- 149 (435) .+..+...+. + |. .......+..+.+.+|.||++|..+...|.| .+.+++|.++....|+..... T Consensus 74 -----~d~~l~~~w~-~-N~---ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p-~i~~~sp~~~~~i~D~~~~~~~~a~ 142 (422) T protein:vir:97 74 -----DDFNAWEIFK-A-NN---PDIFFDTAIQSALIASCCFVYIMPGAEDGLP-KMQVIEASKATGILDPTTFLLTEGY 142 (422) T ss_pred -----CchhHHHHHH-h-cC---hHHHHHHHHHHHHHhcceeEEEeeCCCCCee-EEEEechhhEEEEEeCCCCcceeeE Confidence 1222333332 2 32 3445567889999999999999876555665 588889999888776543221 Q ss_pred -EEEEecCCee---EEEchh---------------------heEEeccCCCccccccCcHH-HHH---HHHHHHHHHHHH Q lcl|NC_019456. 150 -WYRVTSDIYN---FTIPIN---------------------DVIHVKHVVPSNSWYGVSPI-DVL---SSSLKFQRSVEN 200 (435) Q Consensus 150 -~~~~~~~~~~---~~~~~~---------------------~iih~~~~~~~~~~~G~s~l-~~~---~~~i~~~~~~~~ 200 (435) ++....++.. ..++.. =|++|.+.......+|.|.+ ..+ .+.+.....-.. T Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~ 222 (422) T protein:vir:97 143 AILESDSNGNPTLEAYFTDKDIWYYPKKGKPYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLERAE 222 (422) T ss_pred EEEEecCCCcEEEEEEEcCceEEEEcCCCccccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHHHH Confidence 1111112211 111111 13444433333456788754 233 333333333223 Q ss_pred HHHHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCCCcccccc-----CCceeeeccCChhhHHHHHHHHHHHHHHH Q lcl|NC_019456. 201 FSQNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQE-----AGWKVDRYESKFEPADLSSVEQISRIRIA 275 (435) Q Consensus 201 ~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~-----~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia 275 (435) ....++....+.++-.+..- ... +.|... .++++.++ .+.++.+++...-+ .|.+..+.....|| T Consensus 223 ~~~e~~a~pqr~i~G~d~d~--~~~----~~~~~~---~~~i~~~~~de~~~~~~v~q~~~~~l~-~~~~~l~~~~~~~a 292 (422) T protein:vir:97 223 VTAEFYSFPQKYVLGMDPDA--KPM----EKWRAT---VSTLLEISKDEDGDKPTVGQFTTASMA-PFMEHLKMYASLFA 292 (422) T ss_pred HHHHHhcchhhhhcccCccc--ccC----chhhhh---hhhhhccCCCCCCCcceeeecCCCChh-HHHHHHHHHHHHHh Confidence 34444444433333322111 111 122211 12444443 23566666554432 48899999999999 Q ss_pred HHhCCCHHHhCCcccCcccHHHH---HH-----------HHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhh Q lcl|NC_019456. 276 TAFNVPISFLNDDQAKSTTNVEH---VT-----------HSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGL 341 (435) Q Consensus 276 ~~fgvP~~~lg~~~~~~~~~~e~---~~-----------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l 341 (435) ..-++|+..+|....+..+ .++ .. ..|...++-.++.+...... .-.......+..+.| ... T Consensus 293 ~~s~lP~~~lg~~~~NpsS-a~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~-~~~~~~~~~~~~~~w--~p~ 368 (422) T protein:vir:97 293 GGSGLTLDDLGFPSDNPSS-VESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDE-FPYLRNQFMDTVIKW--EPL 368 (422) T ss_pred cccCCCHHHhccccCchhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CcccchhhccceEEE--ccC Confidence 9999999999987653221 121 11 11222222222221111111 000000111223444 333 Q ss_pred hccC---HHHHHHHHHHHHhc--CCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccc Q lcl|NC_019456. 342 LRGD---TAARTQYYQTLTRN--GIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDN 403 (435) Q Consensus 342 ~~~d---~~~~~~~~~~~~~~--g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~ 403 (435) ...+ ....++.+.|++++ |++...-+++++|+...+.+ . . .+.+...++ T Consensus 369 ~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~~~-~--~----------~~~~~~~d~ 422 (422) T protein:vir:97 369 FEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGVKGADKP-I--P----------AITEVTTDG 422 (422) T ss_pred CCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCCCchhHH-H--H----------HHHhhhccC Confidence 3444 45566777888888 78888889999999653211 0 0 111111111 No 183 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=98.41 E-value=7.7e-07 Score=54.10 Aligned_cols=352 Identities=11% Similarity=0.077 Sum_probs=158.0 Q ss_pred CchHH-HHH---hhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 1 MSFMS-KVR---QFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~-~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) |.+.. ++. ++......-.. .+...... .....-.-..+..-+|+.+|..+.=-.|. . T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~------------~~~~~p~~--~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~---~-- 61 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAP------------TGITIPAH--IRAKYQAVLGWAAKGVDSLADRLIFRAFA---N-- 61 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccc------------cchhccHH--HHhHHHhhcchhHHHHHHhHhhhcccccc---C-- Confidence 44332 111 11111100000 00000000 00000011234444555555433322222 1 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEE----E Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWY----R 152 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~----~ 152 (435) .+..+...+ ...+.......+..+.+.+|.||+.|..+. .|.| .+.+++|.++....|+......+ . T Consensus 62 --~d~~l~~i~-----~~N~ld~~~~~~~~~al~~G~sf~~v~~~~-d~~~-~i~~~sP~~~~~i~Dp~~~~~~~al~~~ 132 (410) T protein:vir:95 62 --DDFNVTEIF-----DRNNPDIFFDSAILSALIGSCSFVYISKGE-DDEV-RLQVIESSNATGVIDPITGLLVEGYAVL 132 (410) T ss_pred --CCchHHHHH-----hhcChHHHHHHHHHHHHHhCceeEEEecCC-CCce-EEEEEcccceEEEEeCCCCceEEEEEEE Confidence 122233332 223345566788899999999999987654 4554 67889999888877664322211 1 Q ss_pred EecC-Ce---eEEEchhhe---------------------EEeccCCCccccccCcHH----HHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 153 VTSD-IY---NFTIPINDV---------------------IHVKHVVPSNSWYGVSPI----DVLSSSLKFQRSVENFSQ 203 (435) Q Consensus 153 ~~~~-~~---~~~~~~~~i---------------------ih~~~~~~~~~~~G~s~l----~~~~~~i~~~~~~~~~~~ 203 (435) .... +. ...|.++.+ ++|.+....+..+|.|-+ ..+.+.+.....-..... T Consensus 133 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~ 212 (410) T protein:vir:95 133 ARDDYNRPTLEAYFEPNATHFIPKDGEPYSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITA 212 (410) T ss_pred EecCCCeEEEEEEEeCCcEEEEeeCCccccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHH Confidence 1111 11 112333333 333332223455677743 344444433333333445 Q ss_pred HHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccC-----CceeeeccCChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_019456. 204 NEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEA-----GWKVDRYESKFEPADLSSVEQISRIRIATAF 278 (435) Q Consensus 204 ~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~-----g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~f 278 (435) .|+....+.++-.+.+- +..+ .|.. ..++++.++. +.++.+++...-. .|++..+....+||..- T Consensus 213 e~~a~pqr~i~G~d~d~--~~~~----~~~~---~~~~i~~~~~~~~~~~~~v~q~~~~~l~-~~~~~l~~l~~~~a~~s 282 (410) T protein:vir:95 213 EFYSWPQKYILGLDPDA--EPME----KWKA---TVSSLLTISSSDKGVKPSVGQFTTASMS-PFTEQLRTAAAGFAGEM 282 (410) T ss_pred HHhcchhheeeccCCCC--CcCc----hhhh---hhhhheeccCCCCCCcceEEecCCCChH-HHHHHHHHHHHHHhhhc Confidence 55555545444433211 1111 2221 1234555442 3566666544332 58899999999999999 Q ss_pred CCCHHHhCCcccCcccHHHH---HHHHHHHHHhHHHHHHHHHHHH----hhc--ccc--cccCcceeeechh---hhhcc Q lcl|NC_019456. 279 NVPISFLNDDQAKSTTNVEH---VTHSWTMTLMPIIRQYESQFNM----KLF--TPG--KRVKGFYFSFNVN---GLLRG 344 (435) Q Consensus 279 gvP~~~lg~~~~~~~~~~e~---~~~~~~~~i~P~~~~i~~~l~~----~l~--~~~--~~~~g~~i~fd~~---~l~~~ 344 (435) ++|+..+|....+.. +.++ ...-+...+.-..+.+.+.+.+ -+. ... .......+++... +.... T Consensus 283 ~lP~~~lg~~~~Nps-Sa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~ 361 (410) T protein:vir:95 283 GLTLDDLGFVSDNPS-SVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKWEPLFEADAN 361 (410) T ss_pred CCCHHHhccccCchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEeeecCCcchh Confidence 999999997664322 1121 1111111111111112221111 110 100 0111122333333 33345 Q ss_pred CHHHHHHHHHHHHhc--CCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccccccc Q lcl|NC_019456. 345 DTAARTQYYQTLTRN--GIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNK 404 (435) Q Consensus 345 d~~~~~~~~~~~~~~--g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~ 404 (435) +....++.+.|+++. |+....-+++++|+.+- ++.. . + ++ .....+. T Consensus 362 s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~-----~~~~-~--~--~~---e~~~~g~ 410 (410) T protein:vir:95 362 TMTMIGDGVVKLNQALPGYINAETIRDLTGIAGD-----MSAK-P--V--VS---EGGSNGE 410 (410) T ss_pred hHHHHHHHHHHHHHhccCCccHHHHHHhcCCChH-----HHHH-H--H--HH---HHHhCCC Confidence 678888899999988 67777779999999752 1110 0 0 00 0000000 No 184 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.38 E-value=9.7e-07 Score=53.56 Aligned_cols=383 Identities=9% Similarity=0.007 Sum_probs=167.8 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) ..-++++.+...+........ ...... ...+. -..++.....|+..+.-+-.-|+.+.-.... . T Consensus 10 ~~r~~~l~~yy~g~~~~~~~~-----------~~~~~~-~~~~~--ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~--~ 73 (440) T protein:vir:95 10 KQRLAILASYAQGDNFSILSG-----------HRRLDD-EKADY--RVRHKWGGYISSFATGYVIGNPVSIGVMEGG--S 73 (440) T ss_pred HHHHHHHHHHhccCCcccccc-----------cccccc-cCCcc--eeecchHHHHHHhhhhheeccCceEeeCCCc--c Confidence 112222222221110000000 000000 00000 1123444555666665554445554322111 1 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCc--eE----EEEEe Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNN--SY----WYRVT 154 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~--~~----~~~~~ 154 (435) ......| .+-............+..+.+.+|.+|.++..+. .|.+ .+..++|..+.+..+.... .. ++... T Consensus 74 ~~~~~~l-~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~-~~~~-~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~ 150 (440) T protein:vir:95 74 ADQLSTI-KDIEWQNDINALNSDLAFDASVYGRAYEYHFRDK-DKVD-RVVLISPLEMFVIRDLTVEQNIIAAVHLPIYA 150 (440) T ss_pred HHHHHHH-HHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecC-CCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEec Confidence 1111111 1211223455666788899999999999988764 4665 4777899999888776432 21 11111 Q ss_pred cCCeeEEEchhheEEeccC--------------CC---------ccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC-- Q lcl|NC_019456. 155 SDIYNFTIPINDVIHVKHV--------------VP---------SNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK-- 209 (435) Q Consensus 155 ~~~~~~~~~~~~iih~~~~--------------~~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~-- 209 (435) .......|..+.++++... ++ .+...|.|-+..+...+.....+.....+..... T Consensus 151 ~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~lida~~~~~s~~~~~~~~~~~ 230 (440) T protein:vir:95 151 DKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWWNNRFRMGDYESEISLIDAYDAGQSDTANYMSDLND 230 (440) T ss_pred CceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEeeCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhc Confidence 1112223555555554211 11 1123467777776666665544322222222222 Q ss_pred CceEEEe---CCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhC Q lcl|NC_019456. 210 DKFVLQY---DRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLN 286 (435) Q Consensus 210 ~~~~~~~---~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg 286 (435) +..+++. +...+++....+++.-.............+.+.+++.+........+....+...+.|+..-++|..-.+ T Consensus 231 ~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~ 310 (440) T protein:vir:95 231 AMLLVKGDLDGIKLSPEDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDD 310 (440) T ss_pred ceeeeecccccCCCCccchhhhhhccceecccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc Confidence 2223332 1333455555444332211111111222233334444444333445667777888999999999975443 Q ss_pred CcccCcccHHHH--------------HHHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHH Q lcl|NC_019456. 287 DDQAKSTTNVEH--------------VTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQY 352 (435) Q Consensus 287 ~~~~~~~~~~e~--------------~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~ 352 (435) ....+ -+ ..+ ....|...+...++.+...+...--.. .....+++.+..-...|..+.++. T Consensus 311 ~~~~n-~S-g~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~---~~~~~v~i~f~~~~p~~~~~~ad~ 385 (440) T protein:vir:95 311 RFNST-SS-GIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPV---IEANKLTFTFHPNIPQDVWTEIKA 385 (440) T ss_pred ccccc-ch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc---cccccceEEeCCCCCCCHHHHHHH Confidence 32211 11 111 122334445555544444443321111 112234555566678889999999 Q ss_pred HHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCC Q lcl|NC_019456. 353 YQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 353 ~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (435) +.++ .|+++.--+.++++.-..+. + +..+.+.......+. ....+..+++..++| T Consensus 386 ~~kl--~g~iS~et~~~~l~~~d~~~----E---------~~ri~~E~~~~~~~~--~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 386 YIEA--GGEISQETLMENASFTDYKT----E---------HSRILKQGGSSDLEI--GQIVGDADVGQADTE 440 (440) T ss_pred HHHH--hccCcHHHHHHhCCCCCcHH----H---------HHHHHHHHHHhhhhH--HhhccCCCCCCcCCC Confidence 9988 57888877777775421111 1 111111111000000 001111122222222 No 185 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=98.34 E-value=1.2e-06 Score=53.07 Aligned_cols=386 Identities=11% Similarity=0.103 Sum_probs=167.9 Q ss_pred Cch---HHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccc Q lcl|NC_019456. 1 MSF---MSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQ 77 (435) Q Consensus 1 Mg~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 77 (435) +.. +.++.+...+........ ..... ......+. -..++....+|+..+.-+-.-|+.+..++.. T Consensus 44 ~~~~~~~~~~~~yY~g~~~~i~~~---~~~~~-------~~~~~~~~--ki~~n~~~~ivd~~~~~l~g~~~~~~~~d~~ 111 (481) T protein:vir:10 44 TEQVPRLEMLESYYLNRNTDILAG---ERRLQ-------KYGDKADH--RAVHNYAKYVSRFIVGYLTGNPITITHQDNQ 111 (481) T ss_pred HHHHHHHHHHHHHhcCCCcccccC---ccccc-------cccccccc--eeecchHHHHHHHHHhhhccCCceEecCChh Confidence 111 111111111110000000 00000 00000000 1233455667777777666667655433322 Q ss_pred cccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC--ceE----EE Q lcl|NC_019456. 78 MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN--NSY----WY 151 (435) Q Consensus 78 ~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~----~~ 151 (435) . .+.+..++. ......+...+..+.+.+|.+|.++..+. .|.+ .+..++|..+.+..+... ... +| T Consensus 112 ~-~~~l~~~~~-----~n~~~~~~~~~~~~~~~~G~~~~~~~~d~-dg~~-~i~~~~p~~~~~v~d~~~~~~~~~~i~~~ 183 (481) T protein:vir:10 112 T-NDKIIELND-----LNDADEVNSDLALNLSIYGRAYEIVYRDF-EDRD-TFKVLDPKSTFVVYDQTLDKKVVAGVRYF 183 (481) T ss_pred H-HHHHHHHHH-----hcChhHHHHHHHHHHHhcCeEEEEEEeCC-CCeE-EEEEEcccceEEEEcCCCCCceEEEEEEE Confidence 2 233444333 23466788889999999999999887764 4665 577889998887776542 111 12 Q ss_pred EEec-CCee----EEEchhheEEeccCC-----------C---------ccccccCcHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019456. 152 RVTS-DIYN----FTIPINDVIHVKHVV-----------P---------SNSWYGVSPIDVLSSSLKFQRSVENFSQNEM 206 (435) Q Consensus 152 ~~~~-~~~~----~~~~~~~iih~~~~~-----------~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~ 206 (435) .... ++.. ..+.++.|.++.... + .+...|.|-+..+...+...........+.+ T Consensus 184 ~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~lida~~~~~s~~~~~~ 263 (481) T protein:vir:10 184 EKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEVEHYYNDVPIIEYLNDQFKQGDFENVIALIDLYDSAQSDTANYM 263 (481) T ss_pred EEeeCCCceEEEEEEEecCeEEEEEecCCceeecccccccCCceeEEEeecCCCCCCchhhHHHHHHHHHHHHHHHHHHH Confidence 1111 1111 134555555553221 0 0123567776666665554433222212222 Q ss_pred hcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccc--cccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCH Q lcl|NC_019456. 207 EKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAV--VQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPI 282 (435) Q Consensus 207 ~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~--vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~ 282 (435) ... +..++......+++..+.++..-. ..-..+.. ..+.+.++.-+........+.+..+...+.|...-++|. T Consensus 264 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 341 (481) T protein:vir:10 264 TDLNDAMLAIIGNVDLDSEDAKAFRDANM--IHLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPD 341 (481) T ss_pred HHhcCceeEeecCcCCCccchhhhhhccc--eeccccccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc Confidence 211 223333222333333333322110 00011111 122334444444443445577778888889999999997 Q ss_pred HHhCCcccCcccHHHHHH--------------HHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHH Q lcl|NC_019456. 283 SFLNDDQAKSTTNVEHVT--------------HSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAA 348 (435) Q Consensus 283 ~~lg~~~~~~~~~~e~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~ 348 (435) ...+....+ . +.++.. ..|...+.-.++.+...+...-.. ......+++.+..-...|..+ T Consensus 342 ~~~~~~~~n-~-Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~---~~~~~~i~v~f~~~~~~~~~~ 416 (481) T protein:vir:10 342 LNDEQFSGV-Q-SGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLK---QHNYAELTITFTPNLPKSMME 416 (481) T ss_pred ccccccccc-c-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC---ccccceeeEEeCCCCCcCHHH Confidence 666543322 1 112211 111222222222222222211110 111123555556667888899 Q ss_pred HHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCC Q lcl|NC_019456. 349 RTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQ 428 (435) Q Consensus 349 ~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (435) .++.+.++ .|+++.-.+.+++++-. ++- ++ ++.+.+......... . ..+-....+++.+ T Consensus 417 ~a~~~~kl--~g~is~et~~~~l~~i~--d~~-~E---------~~ri~~E~~~~~~~~------~-~~~~~~~~~~~~~ 475 (481) T protein:vir:10 417 SINAFNAL--SGGVSESTRLSLLDFID--NPK-EE---------LEKMQEEEAQREKQA------D-KRGYGEAFENHLN 475 (481) T ss_pred HHHHHHHH--hccCChHHHHHhCCCCC--CHH-HH---------HHHHHHHHHHHHhhh------h-hccCCccCCCCCC Confidence 99999988 47888877888776421 110 01 111111110000000 0 0011111222333 Q ss_pred CCCCCC Q lcl|NC_019456. 429 STEPEG 434 (435) Q Consensus 429 ~~~~~~ 434 (435) +.+++| T Consensus 476 ~dd~~g 481 (481) T protein:vir:10 476 VDDSNG 481 (481) T ss_pred CCCCCC Confidence 344444 No 186 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.33 E-value=1.3e-06 Score=52.88 Aligned_cols=395 Identities=11% Similarity=0.016 Sum_probs=174.0 Q ss_pred CchHHHHHhhcccccccc-----------ccccccchh---hhhh-ccccccC--------cccccHHHHhhhHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQAN-----------QIVQNPIPQ---PLDM-AGVKLEQ--------ATFSREHILESNEYIFSIV 57 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~-----------~~~~~~~~~---~~~~-~~~~~~~--------~~~~~~~~~~~~~~v~~~i 57 (435) =+|+.|++.++..-.-.. ......... ...+ .|-.... .... ............++ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~-~~~~~s~n~~~~iv 81 (499) T protein:vir:80 3 NQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPV-NRRQLSMNLPKVTA 81 (499) T ss_pred hHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCcc-ccceeecchHHHHH Confidence 133334444432100000 000000000 0000 1100000 0000 01112233444556 Q ss_pred HHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCce Q lcl|NC_019456. 58 TRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNT 137 (435) Q Consensus 58 ~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~ 137 (435) +..|+-+..=|..+.-+++.. ...+..++ .......-+..++......|.+|+.+..+.. |.+ .+..++|.. T Consensus 82 ~~~a~~l~~ep~~i~~~d~~~--~e~l~~~~----~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~-~~~-~i~~v~a~~ 153 (499) T protein:vir:80 82 KYMSKLLFNEKVKINIDDETA--EEFVLNVL----KTNGFTKNMERYIEYGEAMGGFVIKVYHDGN-KNV-KVSFATADC 153 (499) T ss_pred HHHHHhhhCCcceEeeCCHHH--HHHHHHHH----hhccHHHHHHHHHHHHhhcCcEEEEEEECCC-CcE-EEEEEcCCc Confidence 666666655555543333221 11222222 2233556667788888899999999888754 554 456677766 Q ss_pred eEEEEcCCCce--------------EE---------------EEEe------cCCe--eEEEchhh-------------- Q lcl|NC_019456. 138 VSILRNTDNNS--------------YW---------------YRVT------SDIY--NFTIPIND-------------- 166 (435) Q Consensus 138 v~~~~~~~~~~--------------~~---------------~~~~------~~~~--~~~~~~~~-------------- 166 (435) +-+...+.+.. +| |.+. .++. ...++..+ T Consensus 154 ~~Pi~~d~~~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~ 233 (499) T protein:vir:80 154 MYPLSNDSENVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSL 233 (499) T ss_pred eEEEEecCCCeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCC Confidence 65533322211 00 0000 0000 00111111 Q ss_pred ----eEEeccCCC----ccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCHH-HH-HHHHHHHHHHh Q lcl|NC_019456. 167 ----VIHVKHVVP----SNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISPE-KR-QAMVNDFLRMV 236 (435) Q Consensus 167 ----iih~~~~~~----~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~e-~~-~~~~~~~~~~~ 236 (435) +.||+.+-+ .+.+.|.|.+..+...+...........+-+..+...++.-...+... .. -.....|.. . T Consensus 234 ~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~-~ 312 (499) T protein:vir:80 234 TRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTTQYFDS-T 312 (499) T ss_pred CccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCCCCCcccCCCc-c Confidence 334544321 244679999999988888776554444445565543333211111000 00 000000000 0 Q ss_pred cCCCc-ccc--ccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH------------- Q lcl|NC_019456. 237 KENGG-AVV--QEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT------------- 300 (435) Q Consensus 237 ~~~~~-~~v--l~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~------------- 300 (435) ....+ +.. -+++-.++.++.....-++.+..+...++|....|+++..+|....+.. ++.+.. T Consensus 313 ~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~-TAtei~s~~~~l~~~~~~~ 391 (499) T protein:vir:80 313 DEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLK-TATEVVSEKSETYQTKNSH 391 (499) T ss_pred cceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccch-hHHHHHHHHHHHHHHHHHH Confidence 00000 111 1223346666666555567777788888999999999999987655432 222211 Q ss_pred -HHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHh-CCCCCCC Q lcl|NC_019456. 301 -HSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELE-GQAPIPD 378 (435) Q Consensus 301 -~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~-g~~p~~~ 378 (435) ..++.+|..++..+........+..........+.++++.-...|.++.++...+++..|+|+.-.++... |.+ + T Consensus 392 ~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~---d 468 (499) T protein:vir:80 392 SQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIALQRAWNIT---E 468 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhhcCCCC---h Confidence 11122233333332222111111122223345678888888888999999999999999999999887654 432 2 Q ss_pred cCCceeeecccccchhccccccccccccccccccccccCCCCCCCCC Q lcl|NC_019456. 379 EAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNEN 425 (435) Q Consensus 379 ~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (435) +.+++.+ ..+. ...... .+.+..+|..+++. T Consensus 469 ~ea~~el--------~~i~---~E~~~~-----~~~~d~~g~~ge~e 499 (499) T protein:vir:80 469 AEADEWA--------EMLA---KEKQAE-----IPNNDMTGIFGEEE 499 (499) T ss_pred HHHHHHH--------HHHH---HHhhcC-----CCCCCccccCCCCC Confidence 2222211 1110 000000 00111112111111 No 187 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=98.31 E-value=1.4e-06 Score=52.64 Aligned_cols=406 Identities=11% Similarity=0.054 Sum_probs=175.3 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhc-cc----cccC-cccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMA-GV----KLEQ-ATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN 74 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~----~~~~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 74 (435) |...+.|.+++.......-.+-.....++.-. .. .... ....+.. ..++.....++..+.-+-.-|+.+..+ T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~k--i~~n~~k~Iv~~~~~yl~g~p~~~~~~ 116 (511) T protein:vir:10 39 LQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR--VAHDYASYISDFINGYFLGNPIQYQDD 116 (511) T ss_pred ccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcce--eecchHHHHHHHHhhhhcccCceeecC Confidence 22222222222111000000000000010000 00 0000 0000000 112344555666666666667766433 Q ss_pred ccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC--ceE--- Q lcl|NC_019456. 75 YKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN--NSY--- 149 (435) Q Consensus 75 ~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~--- 149 (435) .... ...+.. +. ...........+..+++.+|.||.++..+. .|.+ .+.+++|..+.+..+... ... T Consensus 117 d~~~-~~~l~~-~~----~~n~~~~~~~~~~~~~~i~G~ay~~vy~de-dg~~-~i~~~~p~~~~~vydd~~~~~~~~~v 188 (511) T protein:vir:10 117 DKDV-LEAIEA-FN----DLNDVESHNRSLGLDLSIYGKAYEIMIRNQ-DDET-RLYKSDAMSTFVIYDNTIERNSIAGV 188 (511) T ss_pred chHH-HHHHHH-HH----hhcCHHHHHHHHHHHHHhcCeeEEEEEeCC-CCce-EEEEEccceeEEEEcCCCCCceEEEE Confidence 3222 122222 22 223355666788899999999999888764 3554 677889988888776542 111 Q ss_pred -EEEEe-cCC---e----eEEEchhheEEeccCCC-------------------------ccccccCcHHHHHHHHHHHH Q lcl|NC_019456. 150 -WYRVT-SDI---Y----NFTIPINDVIHVKHVVP-------------------------SNSWYGVSPIDVLSSSLKFQ 195 (435) Q Consensus 150 -~~~~~-~~~---~----~~~~~~~~iih~~~~~~-------------------------~~~~~G~s~l~~~~~~i~~~ 195 (435) +|... .++ . ...+.++.|.++..... .+...|.|-+..+...+... T Consensus 189 r~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~nn~~g~gd~e~v~~liDa~ 268 (511) T protein:vir:10 189 RYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLY 268 (511) T ss_pred EEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeEEEecCCCCCCCchhhhHHHHHHH Confidence 11111 111 0 12345555555432110 01235778777777777765 Q ss_pred HHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcC----CCccccccCCceeeeccCChhhHHHHHHHHH Q lcl|NC_019456. 196 RSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKE----NGGAVVQEAGWKVDRYESKFEPADLSSVEQI 269 (435) Q Consensus 196 ~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~----~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~ 269 (435) ..+..-..+.+... +..++......+.++.....+........ .+...-.+.+.+++.+........+....+. T Consensus 269 d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~ 348 (511) T protein:vir:10 269 DNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDR 348 (511) T ss_pred HHHHHHHHHHHHHhhCceeeeeccccCCchhhccchhccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHH Confidence 54333333333332 22333332334444443332221111111 1112223445666666555555566777788 Q ss_pred HHHHHHHHhCCCHHHhCCcccCcccHHHH--------------HHHHHHHHHhHHHHHHHHHHHHhhcccccccCcceee Q lcl|NC_019456. 270 SRIRIATAFNVPISFLNDDQAKSTTNVEH--------------VTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFS 335 (435) Q Consensus 270 ~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~--------------~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~ 335 (435) ..+.|...-++|..-.+.... +. +..+ ....|...+.-.++.+...+....-... ...-..++ T Consensus 349 L~~~I~~~s~~P~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~-~~d~~~i~ 425 (511) T protein:vir:10 349 LNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA-NKDFNTVR 425 (511) T ss_pred HHHHHHHHhCCcccccccccc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccc-ccccceee Confidence 888999999998764433221 11 1122 1223334444444444444332211100 00111355 Q ss_pred echhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccc-cccc-cccccc Q lcl|NC_019456. 336 FNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDN-KIQT-DASVAA 413 (435) Q Consensus 336 fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~-~~~~-~~~~~~ 413 (435) +.+..-...|..+.++.+.+++ |+++.--+.+++++- +++- ++ ++.+.+..... .... ...... T Consensus 426 i~f~~~~p~d~~~~~~~~~kl~--G~iS~et~~~~l~~v--~d~~-~E---------~~ri~~E~~~~~~~~~~~~~~~~ 491 (511) T protein:vir:10 426 YVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFF--QDPE-LE---------VKKIEEDEKESIKKAQKGIYKDP 491 (511) T ss_pred EEeCCCCCcCHHHHHHHHHHHh--ccCcHHHHHHhCCCC--CCHH-HH---------HHHHHHHHHHHHHHHhhhcccCC Confidence 6666777888999999999984 889987788887542 2211 11 11111100000 0000 000001 Q ss_pred cccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 414 PKQEGGENTNENGLQSTEPE 433 (435) Q Consensus 414 ~~~~~~~~~~~~~~~~~~~~ 433 (435) ...+.++..+++...+++.| T Consensus 492 ~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 492 RDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCCCcccCcccccC Confidence 11111122222222222333 No 188 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=98.30 E-value=1.6e-06 Score=52.42 Aligned_cols=386 Identities=9% Similarity=-0.020 Sum_probs=164.5 Q ss_pred CchHHHHHhhccccccccccccccchhhhhh-----------ccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCce Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDM-----------AGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPL 69 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~ 69 (435) |-..+.|.++........ ..-.....++.- .+.........+ .-..++....+|+..+.-+-.-|+ T Consensus 43 ~~~~~~i~~~i~~~~~~~-~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~--~ri~~n~~k~Ivd~~~~yl~G~p~ 119 (492) T protein:vir:94 43 ETLEEMIVRYIKQHLEKL-PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPD--DRMITNFHANLVDQKVSYIVGKPI 119 (492) T ss_pred hhHHHHHHHHHHHHHHHH-HHHHHHHHHhccccccccccccccccccccccccc--cccccchHHHHHHHHHhhhcccCc Confidence 222222222211110000 000000000000 000000000000 012245566677777777666676 Q ss_pred eeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC--Cc Q lcl|NC_019456. 70 HEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD--NN 147 (435) Q Consensus 70 ~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~ 147 (435) .+..++... ...+ ..+.. | ........+..+++.+|.+|.++..+. .|.+ .+..++|..+.+..+.. +. T Consensus 120 ~~~~~d~~~-~~~l-~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~-dg~~-~~~~~~p~~~~~v~d~~~~~~ 190 (492) T protein:vir:94 120 AFKHTDDEV-VKRI-DEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDE-EGEF-KLFRVPAEQGIPIWTDKEHEE 190 (492) T ss_pred eeccCchHH-HHHH-HHHHh--c---cHHHHHHHHHHHHhhCCeEEEEEEecC-CCce-EEEEEcccceEEEEcCCCCCc Confidence 654333222 1122 22321 2 344566778899999999999887764 4665 57788998888776532 21 Q ss_pred eE----EEEEecCCeeEEEchhheEEeccC---------------------CCc---------cccccCcHHHHHHHHHH Q lcl|NC_019456. 148 SY----WYRVTSDIYNFTIPINDVIHVKHV---------------------VPS---------NSWYGVSPIDVLSSSLK 193 (435) Q Consensus 148 ~~----~~~~~~~~~~~~~~~~~iih~~~~---------------------~~~---------~~~~G~s~l~~~~~~i~ 193 (435) .. +|..........+....|.++... ++. +...|.|-+..+...+. T Consensus 191 ~~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~sd~e~v~~liD 270 (492) T protein:vir:94 191 LEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLID 270 (492) T ss_pred eEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeeccccccccccccccccCCCccceEEecCCCCCCCchHHHHHHHH Confidence 11 122112222223333444443211 000 12357788877777777 Q ss_pred HHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHH Q lcl|NC_019456. 194 FQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISR 271 (435) Q Consensus 194 ~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~ 271 (435) ....+..-..+.+... +..++..- ..+........+ ...+++.++.+.+++.+........+....+... T Consensus 271 a~d~~~S~~~~~~~~~~~p~lv~~g~---~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~ 342 (492) T protein:vir:94 271 AYNRRLSDLSNTFKDSNELTYVLKNY---DDQELPEFKRLL-----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELY 342 (492) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecC---CcccchhhHHHH-----hhccceecCCCCcceeEeccCCHHHHHHHHHHHH Confidence 6554333333333322 22233221 112112221111 1234455555555544444444445666777778 Q ss_pred HHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeec Q lcl|NC_019456. 272 IRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFN 337 (435) Q Consensus 272 ~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd 337 (435) +.|+...++|..-.+... ++. +.++. ...|...+...++.+...+.. ... ...+.+. T Consensus 343 ~~I~~~s~~p~~~~~~~~-~n~-Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~-----~~~--~~~i~v~ 413 (492) T protein:vir:94 343 QKIMLFGQAVDFSSDKFG-SAP-SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI-----KGE--HKDVDIS 413 (492) T ss_pred HHHHHHhCCcCCCccccc-cCc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-----Ccc--cceeeEE Confidence 888888888864332211 111 12222 122233333333333333221 111 1234555 Q ss_pred hhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccc-ccccccccccc Q lcl|NC_019456. 338 VNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKI-QTDASVAAPKQ 416 (435) Q Consensus 338 ~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~-~~~~~~~~~~~ 416 (435) +..-...|..+.++.+.+++ |+++..-++++++.-.-+++..++ +.+....... ........+ T Consensus 414 f~~~~p~~~~e~~~~~~kl~--giiS~et~~~~l~~v~d~~~E~er------------i~~E~~~~~~~~~~~~~~~~-- 477 (492) T protein:vir:94 414 FNYNKVANTELQVQTAQQSM--GIVSHETVLENHPFVEDLQAELER------------IEQEQMEYNKQLPNLDDGGA-- 477 (492) T ss_pred ecCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHH------------HHHHHHHHHhhccccccccC-- Confidence 56666788999999999884 889988888888653211111111 1100000000 000000000 Q ss_pred CCCCCCCCCCCCCCCCC Q lcl|NC_019456. 417 EGGENTNENGLQSTEPE 433 (435) Q Consensus 417 ~~~~~~~~~~~~~~~~~ 433 (435) .++...+++ ...++| T Consensus 478 ~~~~~~~~~--~~~e~e 492 (492) T protein:vir:94 478 DSAQQQERS--NNKESE 492 (492) T ss_pred CCCccccCC--ccccCC Confidence 000011111 111112 No 189 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=98.27 E-value=1.8e-06 Score=52.08 Aligned_cols=404 Identities=11% Similarity=0.068 Sum_probs=167.6 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhcc-c----cccCc-ccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAG-V----KLEQA-TFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN 74 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~----~~~~~-~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 74 (435) +-..+.|..++.......-.+-.....++.-.. . ..... ...+.. ..++.....|+..+.-+-.-|+.+..+ T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~k--i~~n~~k~Iv~~~~~yl~g~p~~~~~~ 116 (511) T protein:vir:99 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR--VAHDYASYISDFINGYFLGNPIQYQDD 116 (511) T ss_pred hccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcce--eecchHHHHHHHHHhhhcccCceeecC Confidence 111122222111100000000000000000000 0 00000 000000 112344445666666565667665433 Q ss_pred ccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC--ceE--- Q lcl|NC_019456. 75 YKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN--NSY--- 149 (435) Q Consensus 75 ~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~--- 149 (435) .... ...+.. ++ ...........+...++.+|.+|.++.++. .|.+ .+..++|..+.+..+... ... T Consensus 117 d~~~-~~~l~~-~~----~~n~~~~~~~~~~~~~~i~G~a~~~vy~de-d~~~-~i~~~~p~~~~~vyd~~~~~~~~~~v 188 (511) T protein:vir:99 117 DKDV-LEAIEA-FN----DLNDVESHNRSLGLDLSIYGKAYELMIRNQ-DDET-RLYKSDAMSTFVIYDNTIERNSIAGV 188 (511) T ss_pred chHH-HHHHHH-HH----hhcCHhHHHHHHHHHHHhcCeeEEEEEeCC-CCce-EEEEEccceeEEEEcCCCCCceEEEE Confidence 2222 122222 22 222455667888899999999999888764 3554 678889998888776532 211 Q ss_pred -EEEE-ecCC---e----eEEEchhheEEeccCCC-------------------------ccccccCcHHHHHHHHHHHH Q lcl|NC_019456. 150 -WYRV-TSDI---Y----NFTIPINDVIHVKHVVP-------------------------SNSWYGVSPIDVLSSSLKFQ 195 (435) Q Consensus 150 -~~~~-~~~~---~----~~~~~~~~iih~~~~~~-------------------------~~~~~G~s~l~~~~~~i~~~ 195 (435) +|.. ..++ . ...+.++.+.+++.... .+...|.|.+..+...+... T Consensus 189 r~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~ 268 (511) T protein:vir:99 189 RYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLY 268 (511) T ss_pred EEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccceEEecCCCCCCCchhhhHHHHHHH Confidence 1111 1111 0 12455666666542110 01235777777777776655 Q ss_pred HHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHH----HHhcCCCccccccCCceeeeccCChhhHHHHHHHHH Q lcl|NC_019456. 196 RSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFL----RMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQI 269 (435) Q Consensus 196 ~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~----~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~ 269 (435) ..+..-..+.+... +..++......+.+......+.-. ......+...-.+.+.+++.++.......+....+. T Consensus 269 d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~ 348 (511) T protein:vir:99 269 DNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDR 348 (511) T ss_pred HHHHHHHHHHHHHhhchhhhhccCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHH Confidence 44333222222222 222222222233333332222110 000011112223455666666555445556777788 Q ss_pred HHHHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhccc-ccccCccee Q lcl|NC_019456. 270 SRIRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTP-GKRVKGFYF 334 (435) Q Consensus 270 ~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~-~~~~~g~~i 334 (435) ..+.|+..-++|....+.... +. +..+. ...|...+.-.++.+...+...--.. ..... .+ T Consensus 349 L~~~I~~~s~~P~~~~~~~~g-n~-Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~--~i 424 (511) T protein:vir:99 349 LNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFN--TV 424 (511) T ss_pred HHHHHHHHhCCcccccccccc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccc--cc Confidence 888999999998865533321 11 11221 12233333333333333333211000 01111 23 Q ss_pred eechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccccc---ccccccccc Q lcl|NC_019456. 335 SFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILD---NKIQTDASV 411 (435) Q Consensus 335 ~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~---~~~~~~~~~ 411 (435) ++.+..-...|..+.++.+.++. |+++.--+++++++- +++- ++ ++.+.+.... ..... ... T Consensus 425 ~i~f~~~~p~n~~e~~~~~~kl~--GiiS~et~l~~l~~v--~D~~-~E---------~~ri~~E~~~~~~~~~~~-~~~ 489 (511) T protein:vir:99 425 RYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFF--QDPE-LE---------VKKIEEDEKESIKKAQKN-MYQ 489 (511) T ss_pred eEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhCCCC--CCHH-HH---------HHHHHHHHHHHHHHHhhc-ccc Confidence 44445556788899999998884 889998888887542 2211 01 1111110000 00000 000 Q ss_pred cccccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 412 AAPKQEGGENTNENGLQSTEPE 433 (435) Q Consensus 412 ~~~~~~~~~~~~~~~~~~~~~~ 433 (435) .....+.++.+++++.+..+.| T Consensus 490 ~~~~~~~~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 490 DPRNINDDEQDDSTKDSIDKKE 511 (511) T ss_pred cCCCCCCCCCCCCCcCcccccC Confidence 0001111111111111122222 No 190 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=98.27 E-value=1.9e-06 Score=52.00 Aligned_cols=407 Identities=11% Similarity=0.077 Sum_probs=174.8 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhc-cc----cccCc-ccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMA-GV----KLEQA-TFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN 74 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~----~~~~~-~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 74 (435) +...+.|..++.........+-.....++.-. .. ..... ...+.. ..+......++..+.-+-.-|+.+..+ T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~k--i~~n~~k~Iv~~~~~yl~g~p~~~~~~ 116 (511) T protein:vir:96 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR--VAHDYASYISDFINGYFLGNPIQYQDD 116 (511) T ss_pred hccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcce--eecchHHHHHHHHHhhhccCCceeecC Confidence 11112222211110000000000000000000 00 00000 000000 112344556666666666667766433 Q ss_pred ccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC--ceE--- Q lcl|NC_019456. 75 YKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN--NSY--- 149 (435) Q Consensus 75 ~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~--- 149 (435) .... ...+..+ +...........+..++..+|.+|.++.++. .|. ..+.+++|..+.+..+... ... T Consensus 117 ~~~~-~~~l~~~-----~~~n~~~~~~~~~~~~~~i~G~a~~~vy~de-d~~-~~i~~~~p~~~~~vydd~~~~~~~~~v 188 (511) T protein:vir:96 117 DKDV-LEAIEAF-----NDLNDVESHNRSLGLDLSIYGKAYELMIRNQ-DDE-TRLYKSDAMSTFVIYDNTIERNSIAGV 188 (511) T ss_pred chHH-HHHHHHH-----HhhcCHHHHHHHHHHHHHhcCeeEEEEEeCC-CCc-eEEEEEccceeEEEEcCCCCCceEEEE Confidence 3222 1222222 2233466677888999999999999888764 355 4678899998888776532 111 Q ss_pred -EEEEe-cCC---e----eEEEchhheEEeccCCC-------------------------ccccccCcHHHHHHHHHHHH Q lcl|NC_019456. 150 -WYRVT-SDI---Y----NFTIPINDVIHVKHVVP-------------------------SNSWYGVSPIDVLSSSLKFQ 195 (435) Q Consensus 150 -~~~~~-~~~---~----~~~~~~~~iih~~~~~~-------------------------~~~~~G~s~l~~~~~~i~~~ 195 (435) +|... .++ . ...+.++.+.++..... .+...|.|-+..+...+... T Consensus 189 r~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~gd~e~v~~liDa~ 268 (511) T protein:vir:96 189 RYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLY 268 (511) T ss_pred EEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceeeEEecCCCCCCCchhhhHHHHHHH Confidence 11111 111 0 11345555555432110 01235778787777777765 Q ss_pred HHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhc----CCCccccccCCceeeeccCChhhHHHHHHHHH Q lcl|NC_019456. 196 RSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVK----ENGGAVVQEAGWKVDRYESKFEPADLSSVEQI 269 (435) Q Consensus 196 ~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~----~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~ 269 (435) ..+..-..+.+... +..++......+.++.....+....... ..+...-.+.+.++..+........+....+. T Consensus 269 d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~ 348 (511) T protein:vir:96 269 DNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDR 348 (511) T ss_pred HHHHHHHHHHHHHhhCceeeeecCccCCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHH Confidence 54433333333332 2233333233344433322221110010 11112223445556655555445566777788 Q ss_pred HHHHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccc-cccCccee Q lcl|NC_019456. 270 SRIRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPG-KRVKGFYF 334 (435) Q Consensus 270 ~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~-~~~~g~~i 334 (435) ..+.|....++|..-.+.... +. +..+. ...|...+...++.+...+..+.-... .... .+ T Consensus 349 L~~~I~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~--~i 424 (511) T protein:vir:96 349 LNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFN--TV 424 (511) T ss_pred HHHHHHHHhCCcccccccccc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccc--cc Confidence 889999999998865543321 11 11221 223334444444444443332211100 0111 24 Q ss_pred eechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccccccccccccccccc Q lcl|NC_019456. 335 SFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAP 414 (435) Q Consensus 335 ~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~ 414 (435) ++.+..-...|..+.++.+.++ .|+++.-.+.+++++-.-+.+..++ +..-.+.......... ..... T Consensus 425 ~~~f~~~~p~n~~e~~~~~~kl--~G~iS~et~l~~l~~v~D~~~E~~r---------i~~E~~~~~~~~~~~~-~~~~~ 492 (511) T protein:vir:96 425 RYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKK---------IEEDEKESIKKAQKGI-YKDPR 492 (511) T ss_pred eEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHH---------HHHHHHHHHHHHhhcc-ccCCC Confidence 4445566677888899998887 5899998888888643211111111 1100000000000000 00011 Q ss_pred ccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 415 KQEGGENTNENGLQSTEPE 433 (435) Q Consensus 415 ~~~~~~~~~~~~~~~~~~~ 433 (435) ..+.+++.++++..+++.| T Consensus 493 ~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 493 DINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCCcccccccccC Confidence 1111222222233333333 No 191 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=98.26 E-value=2e-06 Score=51.85 Aligned_cols=389 Identities=9% Similarity=0.041 Sum_probs=167.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhh----hccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLD----MAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) ..++.++.+.......+. .....++. +...........+.+ ...+.....|+..+.-+-.-|+.+..+++ T Consensus 19 ~~~l~~~i~~~~~~~~r~----~~~~~yy~g~~~i~~~~~~~~~~~~~k--i~~n~~~~ivd~~~~~l~g~~~~~~~~d~ 92 (453) T protein:vir:39 19 NEVVTKFMEKHRLEVARY----EYLKNMYRGIMAIDAEPTKDLWKPDNR--LTVNFTKYIVDTFTGYFNGIPVKKSHSDK 92 (453) T ss_pred HHHHHHHHHHHHHHHHHH----HHHHHHhhccCchhcCCCccccCccce--eecchHHHHHHHHhhhhcccCceeccCCh Confidence 111222211110000000 00000000 000000000000111 12345566677777766666766543332 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCc--eE---EE Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNN--SY---WY 151 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~--~~---~~ 151 (435) .. ...+..++. ..........+..+.+.+|.||.++..+.. |.+ .+..++|..+.+..+.... .. .+ T Consensus 93 ~~-~~~l~~i~~-----~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~-g~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~ 164 (453) T protein:vir:39 93 ET-LSKLQEFDN-----LNDMEDEESELAKMACIYGRAFELLYQNEE-TQT-NVIYNTPENMFMVYDDTIKQEPLFAVRY 164 (453) T ss_pred HH-HHHHHHHHH-----hcChhHHHHHHHHHHhhcCeEEEEEEecCC-Cce-EEEEEcccceEEEecCCCCCeEEEEEEE Confidence 22 222333333 234556678889999999999998887644 654 5667888888877754321 11 11 Q ss_pred EEecCCe--eEEEchhheEEeccCC-----------C---------ccccccCcHHHHHHHHHHHHHHHHHHHHHHhhc- Q lcl|NC_019456. 152 RVTSDIY--NFTIPINDVIHVKHVV-----------P---------SNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEK- 208 (435) Q Consensus 152 ~~~~~~~--~~~~~~~~iih~~~~~-----------~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n- 208 (435) ....+.. ...+.++.+.++.... + .+...|.|.+..+...+.....+.....+.+.. T Consensus 165 ~~~~~~~~~~~~yt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~ 244 (453) T protein:vir:39 165 GYDDDYKLYGEVYTKETTYALNGTMGFYNMTEQAPNPFDDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVDYF 244 (453) T ss_pred EEeCCeEEEEEEEeCCeEEEEEecCCceeeecccccCCCceeEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHh Confidence 1111111 1123344444433211 0 012357777776666665544322222222222 Q ss_pred CCceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCc Q lcl|NC_019456. 209 KDKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDD 288 (435) Q Consensus 209 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~ 288 (435) .....+..+..+.++..+.++..- .-........+.+.++..++.......+.+..+...+.|+...++|..-.+.. T Consensus 245 ~~p~~~~~g~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~ 321 (453) T protein:vir:39 245 SDQYLTFLGAAVEEEDLKNIRSNR---VINYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESF 321 (453) T ss_pred hCceeeeecCCCCchhhhhhhhcc---eeeecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc Confidence 222222333445555544332211 00000011112333444444433445566777778888888888875432221 Q ss_pred ccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHH Q lcl|NC_019456. 289 QAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQ 354 (435) Q Consensus 289 ~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~ 354 (435) ++. +.++. ...|...+...++.+...+...- ..... ..+.+.+..-...|..+.++.+. T Consensus 322 --gn~-Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~--~~~~~--~~i~v~f~~~~p~~~~~~a~~~~ 394 (453) T protein:vir:39 322 --GSS-SGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVS--NKEAW--KDIEYTFTRNEPKDIKEQAETAN 394 (453) T ss_pred --cCC-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC--Ccccc--ccceEEeCCCCCcCHHHHHHHHH Confidence 111 11111 12333444444444433332211 11111 12344445666788899999999 Q ss_pred HHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 355 TLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPE 433 (435) Q Consensus 355 ~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (435) ++ .|+++.--+.+++++-+ ++- ++ ++.+.+...... ...+....+..+.+++.++.+.| T Consensus 395 kl--~g~is~et~l~~l~~v~--D~~-~E---------~~ri~~E~~~~~------~~~~~~~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 395 IL--MGITSQETALSVISVIP--DVQ-AE---------MEKIKKEEASTA------IFDKDKQPSEKGTDTVVPETNEE 453 (453) T ss_pred HH--hccCChHHHHHhCCCCC--CHH-HH---------HHHHHHHHHHHH------HHHHhccCCCCCCCCCCCCcCCC Confidence 88 57899888888886432 111 11 111111100000 01112222233333334444444 No 192 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=98.19 E-value=2.9e-06 Score=50.92 Aligned_cols=405 Identities=12% Similarity=0.047 Sum_probs=158.5 Q ss_pred CchH----HHHHhhccccccccccccccchhhhhhccccccCcccccHH---HHhhhHHHHHHHHHHHHHHhhCceeeee Q lcl|NC_019456. 1 MSFM----SKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREH---ILESNEYIFSIVTRLSNVLASLPLHEYQ 73 (435) Q Consensus 1 Mg~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~v~~~i~~ia~~ia~~~~~~~~ 73 (435) |+-- ++|.+.+......- .....++.--..-.........+ .-..+.+...+|+..+..+--..|.+- T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~----~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~- 75 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNL----LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS- 75 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHH----HHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhccCceecC- Confidence 5532 22222211110000 00001110000000000000000 001223445566666655433334321 Q ss_pred cccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeC-----CCCcEEEEEEeCCceeEEEEcCC--C Q lcl|NC_019456. 74 NYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSL-----STGEPIALWPLDPNTVSILRNTD--N 146 (435) Q Consensus 74 ~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~-----~~g~~~~l~~l~~~~v~~~~~~~--~ 146 (435) ++. ...+.+...+. + | ........+..+++.+|.||.++-+.. ..|.+ .+.+++|..+.+..+.. + T Consensus 76 ~d~-~~~~~l~~i~~-~-N---~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~-~i~~~~p~~~~~~~D~~~~~ 148 (480) T protein:vir:78 76 EDS-EGLEELWNWWQ-A-N---DLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTR 148 (480) T ss_pred CCc-hhHHHHHHHHH-h-c---CHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCee-EEEEEcccceEEEEcCCCcc Confidence 111 11233333332 1 2 345667888999999999998876421 23443 47788888888777642 2 Q ss_pred ceEE----E-EEecCCee---EEEchhh-----------------------------eEEeccCCCccccccCcHHHH-H Q lcl|NC_019456. 147 NSYW----Y-RVTSDIYN---FTIPIND-----------------------------VIHVKHVVPSNSWYGVSPIDV-L 188 (435) Q Consensus 147 ~~~~----~-~~~~~~~~---~~~~~~~-----------------------------iih~~~~~~~~~~~G~s~l~~-~ 188 (435) ...+ | .....+.. ..+.++. |++|++....+..+|.|-+.. + T Consensus 149 ~~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v 228 (480) T protein:vir:78 149 RVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPEL 228 (480) T ss_pred ceEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeecccccCCccCcccchhhH Confidence 1111 1 00011110 1122222 334443323344677776553 3 Q ss_pred HHHHHHHHHH-HHH--HHHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccc-cCCceeeeccCChhhHHHH Q lcl|NC_019456. 189 SSSLKFQRSV-ENF--SQNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQ-EAGWKVDRYESKFEPADLS 264 (435) Q Consensus 189 ~~~i~~~~~~-~~~--~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl-~~g~~~~~~~~~~~~~~~~ 264 (435) ...+...... .+. ...++.. +..++. +....+...+.....|.. ..+.+..+ +++.++.++.....+ .+. T Consensus 229 ~~l~Da~~~~~s~~~~~~~~~a~-p~~~i~-G~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 302 (480) T protein:vir:78 229 RKVTDAASRTLMNLQSASQILGT-PLRVIS-GVTTDELTNDGENTTLDI---YYGRILTLASEAAKISEFKAAELR-NFA 302 (480) T ss_pred HHHHHHHHHHHHHHHHHHHhhcc-hhhhhh-cCCccccccccccchhhh---hhhhhccCCCCCceEEecCccCHH-HHH Confidence 3444433221 111 1122222 221221 111111111100111111 11233344 345677776654433 366 Q ss_pred HHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHH--------------HHHHHHhHHHHHHHHHHHHhhcccccccC Q lcl|NC_019456. 265 SVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTH--------------SWTMTLMPIIRQYESQFNMKLFTPGKRVK 330 (435) Q Consensus 265 e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~--------------~~~~~i~P~~~~i~~~l~~~l~~~~~~~~ 330 (435) +..+.....|+..-++|+..+|....+. ++.+++.. .|...|.-.++.+.. +........ T Consensus 303 ~~l~~~i~~~~~~~~~p~~~~g~~~~n~-~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~-----~~g~~~~~~ 376 (480) T protein:vir:78 303 EEMEVFRKEAASITGLPPQYLSSSSENP-ASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ-----IMGREVTEE 376 (480) T ss_pred HHHHHHHHHHhcccCCChHHhccccCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HcCCCcccc Confidence 7777788889999999999998755432 22222211 111122222221111 111111111 Q ss_pred cceeeechhhhhccCHHHHHHHHHHHHhcC--CcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccccccccccc Q lcl|NC_019456. 331 GFYFSFNVNGLLRGDTAARTQYYQTLTRNG--IFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTD 408 (435) Q Consensus 331 g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~ 408 (435) ...+++.+......+..+.++.+.+++.+| +++..-+++.+|+.+-+.+..++.--...-.+++...... .+.+. T Consensus 377 ~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~~~~~~---~~~~~ 453 (480) T protein:vir:78 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTT---KAQAD 453 (480) T ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccc---cccCC Confidence 123445455555677888888888888876 6777778888888653221111100000000111111000 00000 Q ss_pred ccccccccCCCCCCCCCCCCCCCCC--CC Q lcl|NC_019456. 409 ASVAAPKQEGGENTNENGLQSTEPE--GS 435 (435) Q Consensus 409 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 435 (435) ..+....++..++....+...- ++ T Consensus 454 ---~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (480) T protein:vir:78 454 ---ATPKPTVTETKTETQTSPSGFNRTKT 479 (480) T ss_pred ---CCCCCCCCCCCCccccccCCCCcccC Confidence 0011111111111111111111 11 No 193 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=98.16 E-value=3.3e-06 Score=50.62 Aligned_cols=408 Identities=10% Similarity=0.052 Sum_probs=170.6 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhc-cc---c-ccC-cccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMA-GV---K-LEQ-ATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN 74 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~---~-~~~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 74 (435) |-..+.+..+........-..-.....++.-. .. . ... ....+.. ..++.....|+..+.-+-.-|+.+..+ T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~k--i~~n~~k~Iv~~~~~yl~g~p~~~~~~ 116 (511) T protein:vir:78 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR--VAHDYASYISDFINGYFLGNPIQYQDD 116 (511) T ss_pred hcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcce--eecchHHHHHHHHhhhhcccCceeecC Confidence 22222222221110000000000000000000 00 0 000 0000000 112444556666666666667665433 Q ss_pred ccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC--ceE--- Q lcl|NC_019456. 75 YKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN--NSY--- 149 (435) Q Consensus 75 ~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~--- 149 (435) .... ...+..++ .......+...+...++.+|.+|.++.++. .|.+ .+..++|..+.+..+... ... T Consensus 117 d~~~-~~~l~~~~-----~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~-dg~~-~i~~~~p~~~~~v~dd~~~~~~~~~v 188 (511) T protein:vir:78 117 DKDV-LEAIEAFN-----DLNDVESHNRSLGLDLSIYGKAYELMIRNQ-DDET-RLYKSDAMSTFIIYDNTVERNSIAGV 188 (511) T ss_pred chHH-HHHHHHHH-----hhcChhHHHHHHHHHHHhcCeeEEEEEeCC-CCce-EEEEEcccceEEEEcCCCCCceEEEE Confidence 2221 22222222 223355667888899999999999888764 3654 678899998888776532 211 Q ss_pred -EEEE-ecCCe-------eEEEchhheEEeccCCC-------------------------ccccccCcHHHHHHHHHHHH Q lcl|NC_019456. 150 -WYRV-TSDIY-------NFTIPINDVIHVKHVVP-------------------------SNSWYGVSPIDVLSSSLKFQ 195 (435) Q Consensus 150 -~~~~-~~~~~-------~~~~~~~~iih~~~~~~-------------------------~~~~~G~s~l~~~~~~i~~~ 195 (435) +|.. ...+. ...|.++.+.++..... .+...|.|-+..+...+... T Consensus 189 r~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~ 268 (511) T protein:vir:78 189 RYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLY 268 (511) T ss_pred EEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHH Confidence 1111 11110 12356666666532210 01234777777777777655 Q ss_pred HHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCC----CccccccCCceeeeccCChhhHHHHHHHHH Q lcl|NC_019456. 196 RSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKEN----GGAVVQEAGWKVDRYESKFEPADLSSVEQI 269 (435) Q Consensus 196 ~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~----~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~ 269 (435) ..+..-..+.+... +..++......+.++.+...+......... ....-.+.+.++..++.......+....+. T Consensus 269 ~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~ 348 (511) T protein:vir:78 269 DNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDR 348 (511) T ss_pred HHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHH Confidence 54333322223222 223333333344444333222111000000 001112233444444444444556777778 Q ss_pred HHHHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceee Q lcl|NC_019456. 270 SRIRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFS 335 (435) Q Consensus 270 ~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~ 335 (435) ..+.|+..-++|....+.... +. +..++ ...|...+...++.+...+...--... ...-..++ T Consensus 349 L~~~I~~~s~~P~~~~~~~~~-n~-Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~-~~~~~~i~ 425 (511) T protein:vir:78 349 LNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA-NKDFNTVR 425 (511) T ss_pred HHHHHHHHhCCcccccccccc-cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-ccccccce Confidence 888999999998765543322 11 11221 222333444444444333332111000 00011244 Q ss_pred echhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccc Q lcl|NC_019456. 336 FNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPK 415 (435) Q Consensus 336 fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~ 415 (435) +.+..-...|..+.++.+.+++ |+++.--+.+++++- +++- +++ ..+..-.+.......... ...... T Consensus 426 ~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~v--~d~~-~El------~ri~~E~~~~~~~~~~~~-~~~~~~ 493 (511) T protein:vir:78 426 YVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFF--QDPE-LEV------KKIEEDEKESIKKAQKGI-YKDPRD 493 (511) T ss_pred EEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCC--CCHH-HHH------HHHHHHHHHHHHHHhhcc-ccCCCC Confidence 5556666788888999999884 889987788877542 2111 111 111110000000000000 000111 Q ss_pred cCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 416 QEGGENTNENGLQSTEPE 433 (435) Q Consensus 416 ~~~~~~~~~~~~~~~~~~ 433 (435) .+.+++.++++..+++.| T Consensus 494 ~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 494 INDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCCccCcccccC Confidence 111222233333333333 No 194 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=98.16 E-value=3.3e-06 Score=50.62 Aligned_cols=408 Identities=10% Similarity=0.052 Sum_probs=170.6 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhc-cc---c-ccC-cccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMA-GV---K-LEQ-ATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN 74 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~---~-~~~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 74 (435) |-..+.+..+........-..-.....++.-. .. . ... ....+.. ..++.....|+..+.-+-.-|+.+..+ T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~k--i~~n~~k~Iv~~~~~yl~g~p~~~~~~ 116 (511) T protein:vir:96 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNR--VAHDYASYISDFINGYFLGNPIQYQDD 116 (511) T ss_pred hcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcce--eecchHHHHHHHHhhhhcccCceeecC Confidence 22222222221110000000000000000000 00 0 000 0000000 112444556666666666667665433 Q ss_pred ccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC--ceE--- Q lcl|NC_019456. 75 YKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN--NSY--- 149 (435) Q Consensus 75 ~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~--- 149 (435) .... ...+..++ .......+...+...++.+|.+|.++.++. .|.+ .+..++|..+.+..+... ... T Consensus 117 d~~~-~~~l~~~~-----~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~-dg~~-~i~~~~p~~~~~v~dd~~~~~~~~~v 188 (511) T protein:vir:96 117 DKDV-LEAIEAFN-----DLNDVESHNRSLGLDLSIYGKAYELMIRNQ-DDET-RLYKSDAMSTFIIYDNTVERNSIAGV 188 (511) T ss_pred chHH-HHHHHHHH-----hhcChhHHHHHHHHHHHhcCeeEEEEEeCC-CCce-EEEEEcccceEEEEcCCCCCceEEEE Confidence 2221 22222222 223355667888899999999999888764 3654 678899998888776532 211 Q ss_pred -EEEE-ecCCe-------eEEEchhheEEeccCCC-------------------------ccccccCcHHHHHHHHHHHH Q lcl|NC_019456. 150 -WYRV-TSDIY-------NFTIPINDVIHVKHVVP-------------------------SNSWYGVSPIDVLSSSLKFQ 195 (435) Q Consensus 150 -~~~~-~~~~~-------~~~~~~~~iih~~~~~~-------------------------~~~~~G~s~l~~~~~~i~~~ 195 (435) +|.. ...+. ...|.++.+.++..... .+...|.|-+..+...+... T Consensus 189 r~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~ 268 (511) T protein:vir:96 189 RYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLY 268 (511) T ss_pred EEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHH Confidence 1111 11110 12356666666532210 01234777777777777655 Q ss_pred HHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCC----CccccccCCceeeeccCChhhHHHHHHHHH Q lcl|NC_019456. 196 RSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKEN----GGAVVQEAGWKVDRYESKFEPADLSSVEQI 269 (435) Q Consensus 196 ~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~----~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~ 269 (435) ..+..-..+.+... +..++......+.++.+...+......... ....-.+.+.++..++.......+....+. T Consensus 269 ~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~ 348 (511) T protein:vir:96 269 DNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDR 348 (511) T ss_pred HHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHH Confidence 54333322223222 223333333344444333222111000000 001112233444444444444556777778 Q ss_pred HHHHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceee Q lcl|NC_019456. 270 SRIRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFS 335 (435) Q Consensus 270 ~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~ 335 (435) ..+.|+..-++|....+.... +. +..++ ...|...+...++.+...+...--... ...-..++ T Consensus 349 L~~~I~~~s~~P~~~~~~~~~-n~-Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~-~~~~~~i~ 425 (511) T protein:vir:96 349 LNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA-NKDFNTVR 425 (511) T ss_pred HHHHHHHHhCCcccccccccc-cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-ccccccce Confidence 888999999998765543322 11 11221 222333444444444333332111000 00011244 Q ss_pred echhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccc Q lcl|NC_019456. 336 FNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPK 415 (435) Q Consensus 336 fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~ 415 (435) +.+..-...|..+.++.+.+++ |+++.--+.+++++- +++- +++ ..+..-.+.......... ...... T Consensus 426 ~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~v--~d~~-~El------~ri~~E~~~~~~~~~~~~-~~~~~~ 493 (511) T protein:vir:96 426 YVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFF--QDPE-LEV------KKIEEDEKESIKKAQKGI-YKDPRD 493 (511) T ss_pred EEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCC--CCHH-HHH------HHHHHHHHHHHHHHhhcc-ccCCCC Confidence 5556666788888999999884 889987788877542 2111 111 111110000000000000 000111 Q ss_pred cCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 416 QEGGENTNENGLQSTEPE 433 (435) Q Consensus 416 ~~~~~~~~~~~~~~~~~~ 433 (435) .+.+++.++++..+++.| T Consensus 494 ~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 494 INDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCCccCcccccC Confidence 111222233333333333 No 195 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=98.16 E-value=3.3e-06 Score=50.60 Aligned_cols=409 Identities=12% Similarity=-0.001 Sum_probs=164.6 Q ss_pred Cch---HHHH----Hhhccccccccccccccchhhhh-----------hcccc---ccCcccccHHHHhhhHHHHHHHHH Q lcl|NC_019456. 1 MSF---MSKV----RQFFGVHDQANQIVQNPIPQPLD-----------MAGVK---LEQATFSREHILESNEYIFSIVTR 59 (435) Q Consensus 1 Mg~---~~~~----~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~---~~~~~~~~~~~~~~~~~v~~~i~~ 59 (435) |-+ -..+ .+++.....+. .. ....++. ..+.. .......+.+ ..+....-.|+. T Consensus 8 ~~~~~~~~~~~~~i~~~~~~~~~~~-~~--~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnk--i~~nf~k~Ivd~ 82 (537) T protein:vir:78 8 KPIDQLGGLLNTEITTYMASNHIKW-AH--IGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVK--ISHGFFTELVDQ 82 (537) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHH-HH--HHHHHhcccchhhhcccccccccccccccccccccc--cccchHHHHHHH Confidence 222 1111 11111110000 00 0000000 00000 0000000001 112233334555 Q ss_pred HHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeE Q lcl|NC_019456. 60 LSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVS 139 (435) Q Consensus 60 ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~ 139 (435) .+.-+-+-|+++.-.... ...+...|.. -+. .........+..++..+|.+|.++-.+.. |.+ .+..++|..+- T Consensus 83 ~~~yl~G~Pv~~~~~d~~--~~e~~~~l~~-~~~-~~~~~~~~el~~~~s~~G~ay~~~y~de~-~~~-~~~~i~p~~~~ 156 (537) T protein:vir:78 83 LAQYLLSNGVEVKVKDED--NTQLDEILQE-YFD-EDFQATIDTLVTNASKKGFEGIFARTTSE-GKL-KFQTVDGLTLI 156 (537) T ss_pred HhhhhcccCceeecCcch--hHHHHHHHHH-Hhh-ccHHHHHHHHHHHHhhcCeeEEEeeecCC-Cce-EEEEEccceeE Confidence 555555667765432211 1122233321 111 23344567788899999999998877644 654 57788888887 Q ss_pred EEEcCCCceE-----EEEEecC-----C----eeEEEchhheEEeccCC------------------------------- Q lcl|NC_019456. 140 ILRNTDNNSY-----WYRVTSD-----I----YNFTIPINDVIHVKHVV------------------------------- 174 (435) Q Consensus 140 ~~~~~~~~~~-----~~~~~~~-----~----~~~~~~~~~iih~~~~~------------------------------- 174 (435) +..+..+... |+..... + ....++++.|.+++... T Consensus 157 pv~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 236 (537) T protein:vir:78 157 PVFDDYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDAD 236 (537) T ss_pred EEEcCCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeecccccccc Confidence 7776554321 1111110 0 01234555555543211 Q ss_pred ------------Cc---------cccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCC-cCCHHHHHHHHHHH Q lcl|NC_019456. 175 ------------PS---------NSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDR-SISPEKRQAMVNDF 232 (435) Q Consensus 175 ------------~~---------~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~-~~~~e~~~~~~~~~ 232 (435) ++ +...|.|-+..+...+.....+....++.+......++.+.+ .+.+ ....+.. T Consensus 237 ~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~--~~~~~~~- 313 (537) T protein:vir:78 237 FEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDS--TDKLRQN- 313 (537) T ss_pred ccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCcc--chhHHHH- Confidence 00 122467777777777776655444444444333223333222 2211 1111111 Q ss_pred HHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHH-------------- Q lcl|NC_019456. 233 LRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEH-------------- 298 (435) Q Consensus 233 ~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~-------------- 298 (435) ++..+.+.+-+.+.+++-+.............+...+.|.....+|.. .....++.+. .+ T Consensus 314 ---l~~~~~i~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~--~~~~~gn~SG-vAlk~~~~~l~~ka~~ 387 (537) T protein:vir:78 314 ---IKAKKMIGVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNS--TAVGDGNVTN-VVIKSRYTLLAMKARK 387 (537) T ss_pred ---HhhcCceeecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCC--ccccccCCcH-HHHHHHHhhHHHHHHH Confidence 222232333333434444333333333444455556666655444422 1111122121 11 Q ss_pred HHHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCC Q lcl|NC_019456. 299 VTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPD 378 (435) Q Consensus 299 ~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~ 378 (435) ....|...++-.++.|..-+..+-.... ....+.+.+..-...|..+.++.+.++++.|+++..-+.+.+++- ++ T Consensus 388 ke~~f~~~l~~~~~~i~~~~~~~~~~~~---d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~l~~~p~v--dd 462 (537) T protein:vir:78 388 METSLRKVLRWCADMVVSDIALRGLGEY---DSNDICFEIEPHVLANELDIATTRKTEAETEALKIGNIMTVAPRI--GD 462 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCccc---ccceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHHHHhCCCC--CC Confidence 1233444555555555555443211111 123456666666788999999999999999999998888877442 22 Q ss_pred cCCceeee--------------cccccc-hhccccccccccccc---cccccccccCCCCCCCC-CCCCCCCCCC Q lcl|NC_019456. 379 EAADHLYI--------------SKDLYP-LDKYYDAILDNKIQT---DASVAAPKQEGGENTNE-NGLQSTEPEG 434 (435) Q Consensus 379 ~~gd~~~~--------------~~n~~~-l~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 434 (435) +.-++..- ..+... .+...+......+.+ ++.+..+.+..|+++.. +++++.-+-+ T Consensus 463 ~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:78 463 DETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVADPNVVPPTDPNAVPQT 537 (537) T ss_pred HHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCCCCCCCCCCCCccCCCC Confidence 11000000 000000 000000001011111 11111122222222221 1222222222 No 196 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=98.13 E-value=3.9e-06 Score=50.22 Aligned_cols=405 Identities=10% Similarity=0.051 Sum_probs=171.1 Q ss_pred CchHHHHHhhccccccccccccccchhhhhh----ccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDM----AGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) +..+.++.........+ +-.....++.- ...............-..++.....|+..+.-+-.-|+.+..++. T Consensus 42 ~~~i~~~i~~~~~~~~~---r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~ 118 (512) T protein:vir:97 42 INEVSKYIEHHMDYQRP---RLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDK 118 (512) T ss_pred HHHHHHHHHHHHHhhHH---HHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeccCCh Confidence 11111111110000000 00000001000 000000000000000011244455666666666666776643332 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC--ceE----E Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN--NSY----W 150 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~----~ 150 (435) .. ...+ ..+. ...........+..+++.+|.+|.++..+. .|.+ .+..++|..+.+..+... ... + T Consensus 119 ~~-~~~l-~~~~----~~n~~~~~~~~~~~~~~i~G~ay~~vy~de-d~~~-~i~~~~p~~~~~iyd~~~~~~~~~~vr~ 190 (512) T protein:vir:97 119 DV-LEAI-EAFN----DLNDVESHNRSLGLDLSIYGKAYELMIRNQ-DDET-RLYKSDAMSTFVIYDNTIERNSIAGVRY 190 (512) T ss_pred HH-HHHH-HHHH----hhcCHHHHHHHHHHHHHhcCeEEEEEEeCC-CCce-EEEEEcccceEEEEcCCCCCceEEEEEE Confidence 22 1222 2222 223455667888999999999999988764 4554 578899998888876542 111 1 Q ss_pred EEE-ecCCe-------eEEEchhheEEeccCC----------------C---------ccccccCcHHHHHHHHHHHHHH Q lcl|NC_019456. 151 YRV-TSDIY-------NFTIPINDVIHVKHVV----------------P---------SNSWYGVSPIDVLSSSLKFQRS 197 (435) Q Consensus 151 ~~~-~~~~~-------~~~~~~~~iih~~~~~----------------~---------~~~~~G~s~l~~~~~~i~~~~~ 197 (435) |.. ...+. ...+..+.|.+++... + .+...|.|-+..+...+..... T Consensus 191 ~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~gd~e~v~~liDa~d~ 270 (512) T protein:vir:97 191 LRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDN 270 (512) T ss_pred EEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcccceEeecCCCCCCCchhhhHHHHHHHHH Confidence 111 11110 1245566666654211 0 0123577888877777776654 Q ss_pred HHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhc-----CCCccccccCCceeeeccCChhhHHHHHHHHHH Q lcl|NC_019456. 198 VENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVK-----ENGGAVVQEAGWKVDRYESKFEPADLSSVEQIS 270 (435) Q Consensus 198 ~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~-----~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~ 270 (435) +..-..+.+... +..++......+.+..........-... +.....-.+.|.+++.+........+....+.. T Consensus 271 ~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L 350 (512) T protein:vir:97 271 AESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRL 350 (512) T ss_pred HHHHHHHHHHHhcCceeeeecCccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHH Confidence 433333333322 2233333233334443333222211111 111112234555666555544445566777778 Q ss_pred HHHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccc-cccCcceee Q lcl|NC_019456. 271 RIRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPG-KRVKGFYFS 335 (435) Q Consensus 271 ~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~-~~~~g~~i~ 335 (435) .+.|+..-++|..-.+.... +. +.+++ ...|...+.-.++.+...+...--... .... .++ T Consensus 351 ~~~I~~~s~~p~~~~~~~~g-n~-Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~--~i~ 426 (512) T protein:vir:97 351 NSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFN--TVR 426 (512) T ss_pred HHHHHHHhCCcccCcccccc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccc--cce Confidence 88999999998765543321 11 11221 222333333333333333332110000 0111 244 Q ss_pred echhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccc-cccccccccccccc Q lcl|NC_019456. 336 FNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAI-LDNKIQTDASVAAP 414 (435) Q Consensus 336 fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~-~~~~~~~~~~~~~~ 414 (435) +.+..-...|..+.++.+.++ .|+++.--+++++++-.-+....++ +..-.+.. ........+. .. T Consensus 427 ~~f~~~~p~~~~e~~~~~~kl--~giiS~et~~~~l~~v~d~~~E~er---------i~~E~~~~~~~~~~~~~~~--~~ 493 (512) T protein:vir:97 427 YVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKK---------IEEDEKESIKKAQKGIYKD--PR 493 (512) T ss_pred EEeCCCCCcCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHH---------HHHHHHHHHHHHhhcccCC--CC Confidence 444555677888889988888 4889987788887543211111111 11000000 0000000000 00 Q ss_pred ccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 415 KQEGGENTNENGLQSTEPE 433 (435) Q Consensus 415 ~~~~~~~~~~~~~~~~~~~ 433 (435) ..+.+++.++++..+++.| T Consensus 494 ~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 494 DINDDEQDDDTKDTVDKKE 512 (512) T ss_pred CCCCCCCCCCccccccccC Confidence 0111111122222222222 No 197 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=98.11 E-value=4.4e-06 Score=49.95 Aligned_cols=380 Identities=10% Similarity=0.028 Sum_probs=158.0 Q ss_pred CchH----------HHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCcee Q lcl|NC_019456. 1 MSFM----------SKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLH 70 (435) Q Consensus 1 Mg~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 70 (435) ..+. .++.++..+.. ..... .......+..... . ...-..++....+|+..+.-+-.-|+. T Consensus 33 ~~~i~~~~~~~~~~~~~~~Yy~g~~---~i~~r--~~~~~~~~~~~~~-~---~~~ki~~n~~~~Ivd~~~~~l~g~p~~ 103 (474) T protein:vir:95 33 IRLIDDHRKQLDKITVGQRYYDKDN---DIVKQ--MKKVDVYGNIDYD-K---PDWRITTNFHQNLVDQKVSYVASKPVT 103 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccC---chhcc--ccccccccccccc-c---ccceeccchHHHHHHHHHhhhccCCce Confidence 1110 11111111100 00000 0000000000000 0 000112345555677777666666766 Q ss_pred eeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC--Cce Q lcl|NC_019456. 71 EYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD--NNS 148 (435) Q Consensus 71 ~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~~ 148 (435) +.-++... . .....++. | ........+......+|.+|.++..+.. |++ .+..++|..+-+..+.. +.. T Consensus 104 ~~~~d~~~-~-~~l~~~~~--n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~-~~~-~i~~~~p~~~~~v~d~~~~~~~ 174 (474) T protein:vir:95 104 YSCEDESV-L-KIIHDVLD--T---RWDNKLIDILTATSNKGIDWLQVYINEN-GEM-KLFRVPAEQAIPIWVDKEREEL 174 (474) T ss_pred eccCchHH-H-HHHHHHHh--c---cHHHHHHHHHHHHhhcCcEEEEEEecCC-Cce-EEEEEcccceEEEEcCCCCCce Confidence 54333222 1 22333332 2 3455567788999999999988877644 665 57788888887776543 211 Q ss_pred ----EEEEEecCCeeEEEchhheEEeccCC---------------------C---------ccccccCcHHHHHHHHHHH Q lcl|NC_019456. 149 ----YWYRVTSDIYNFTIPINDVIHVKHVV---------------------P---------SNSWYGVSPIDVLSSSLKF 194 (435) Q Consensus 149 ----~~~~~~~~~~~~~~~~~~iih~~~~~---------------------~---------~~~~~G~s~l~~~~~~i~~ 194 (435) .+|..........+..+.+.++.... + .+...|.|-+..+...+.. T Consensus 175 ~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa 254 (474) T protein:vir:95 175 KSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSLIDA 254 (474) T ss_pred EEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeecCCCCCCCcHHHHHHHHHH Confidence 12222222223344455554443110 0 1124577877777777766 Q ss_pred HHHHHHHHHHHhhcCCceEEEeCC-cCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHH Q lcl|NC_019456. 195 QRSVENFSQNEMEKKDKFVLQYDR-SISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIR 273 (435) Q Consensus 195 ~~~~~~~~~~~~~n~~~~~~~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~ 273 (435) ...+.....+.+......++.+.+ ...+ ...... .. ...+++.++++.+++.+........+....+...+. T Consensus 255 ~d~~~S~~~~~~~~~~~p~lv~~g~~~~~--~~~~~~----~~-~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~ 327 (474) T protein:vir:95 255 IDKRLSDAQNMFDESVELIYILKGYEGQD--LEEFMR----GL-KYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRAY 327 (474) T ss_pred HHHHHHHHHHHHHHhcCceeeeecCCccc--chhhhh----hh-hccceeeccCCCceeEEeecCCHHHHHHHHHHHHHH Confidence 554332222222222222222222 2211 111111 11 124556666666666555554555677777888899 Q ss_pred HHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechh Q lcl|NC_019456. 274 IATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVN 339 (435) Q Consensus 274 Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~ 339 (435) |+...++|....+... ++. +..+. ...|...+...++.+.+.+.. .... ..+.+.++ T Consensus 328 i~~~s~~p~~~~~~~~-~n~-Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~-----~~d~--~~i~v~f~ 398 (474) T protein:vir:95 328 IMEFGQGVDFQTDKFG-SAP-SGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNL-----KMDV--KDIEISFN 398 (474) T ss_pred HHHHhCCccccccccc-ccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-----Cccc--ceeeEEec Confidence 9999999864322111 111 11121 122333344444333332221 1112 23344444 Q ss_pred hhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccc-cccccccccccccCC Q lcl|NC_019456. 340 GLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDN-KIQTDASVAAPKQEG 418 (435) Q Consensus 340 ~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~ 418 (435) .-...|..+.++. +++.|+++...+.+++++-.-+++.. +.+.+..... ........... ++ T Consensus 399 ~~~p~d~~e~a~~---~~~~g~iS~et~i~~l~~v~d~~~E~------------~ri~~E~~~~~~~~~~~~~~~~--d~ 461 (474) T protein:vir:95 399 FNRMMNDAEQSQI---IAQSQYLSRETLVKSSPLVDDYKAEL------------ERIEQEQMEYNKQLPNLDDGGA--DG 461 (474) T ss_pred cCCCcCHHHHHHH---HHhcCCCchHHHHHhCCCCCCHHHHH------------HHHHHHHHHHHhcccccccccC--CC Confidence 4445666555554 55679999888888875432111111 1111100000 00000000000 00 Q ss_pred CCCCCCCCCCCCCCC Q lcl|NC_019456. 419 GENTNENGLQSTEPE 433 (435) Q Consensus 419 ~~~~~~~~~~~~~~~ 433 (435) .+..++++ ..++| T Consensus 462 ~~~~~~~~--~~~~~ 474 (474) T protein:vir:95 462 AQQQERSN--DKESE 474 (474) T ss_pred CcCCCCCc--cCCCC Confidence 00000000 00111 No 198 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=98.08 E-value=5e-06 Score=49.66 Aligned_cols=391 Identities=8% Similarity=-0.028 Sum_probs=164.4 Q ss_pred Cch-HHHHHhhccccccccccccccchhhhh----hcccccc--C--c-ccccHHHHhhhHHHHHHHHHHHHHHhhCcee Q lcl|NC_019456. 1 MSF-MSKVRQFFGVHDQANQIVQNPIPQPLD----MAGVKLE--Q--A-TFSREHILESNEYIFSIVTRLSNVLASLPLH 70 (435) Q Consensus 1 Mg~-~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~--~--~-~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 70 (435) +-+ .+.|.++........ ..-.....++. +...... . . .......-..++....+|+..+.-+-.-|+. T Consensus 42 ~~~~~~~i~~~i~~~~~~~-~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~ 120 (492) T protein:vir:97 42 PETLEEMIVRYIKQHLEKL-PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 120 (492) T ss_pred hhhHHHHHHHHHHHHHHHH-HHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHHhhhhcccCce Confidence 111 111221111100000 00000000000 0000000 0 0 0000000112355666777777766666766 Q ss_pred eeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC--Cce Q lcl|NC_019456. 71 EYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD--NNS 148 (435) Q Consensus 71 ~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~~ 148 (435) +..++... . ..+..++. | ...+....+..+++.+|.||.++..+.. |.+ .+..++|..+.+..+.. +.. T Consensus 121 ~~~~d~~~-~-~~l~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~d-g~~-~~~~~~p~~~~~i~d~~~~~~~ 191 (492) T protein:vir:97 121 FKHTDDEV-V-KRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEE-GEF-KLFRVPAEQGIPIWTDKEHEEL 191 (492) T ss_pred eccCchHH-H-HHHHHHHh--c---cHHHHHHHHHHHHhhcCeEEEEEEecCC-Cce-EEEEEcccceEEEEcCCCCCce Confidence 54333222 1 12222321 2 2345556788999999999998887644 654 57788999888877642 211 Q ss_pred E----EEEEecCCeeEEEchhheEEeccC---------------------CCc---------cccccCcHHHHHHHHHHH Q lcl|NC_019456. 149 Y----WYRVTSDIYNFTIPINDVIHVKHV---------------------VPS---------NSWYGVSPIDVLSSSLKF 194 (435) Q Consensus 149 ~----~~~~~~~~~~~~~~~~~iih~~~~---------------------~~~---------~~~~G~s~l~~~~~~i~~ 194 (435) . +|..........+.+..+.++... ++. +...|.|-+..+...+.. T Consensus 192 ~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa 271 (492) T protein:vir:97 192 EAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDA 271 (492) T ss_pred EEEEEEEeeccceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCCCCCCchHhHHHHHHH Confidence 1 122112222223344444443211 000 123477888777777766 Q ss_pred HHHHHHHHHHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHH Q lcl|NC_019456. 195 QRSVENFSQNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRI 274 (435) Q Consensus 195 ~~~~~~~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~I 274 (435) ...+..-..+.+......++.+.+. ..+........ .+ ..+++.++.+.+.+.+........+....+...+.| T Consensus 272 ~d~~~S~~~~~~~~~~~~~l~~~g~-~~~~~~~~~~~----~~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I 345 (492) T protein:vir:97 272 YNRRLSDLSNTFKDSNELTYVLKNY-DDQELPEFKRL----LR-YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKI 345 (492) T ss_pred HHHHHHHHHHHHHHhccceeeeecC-CcccchhHHHH----Hh-hccceecCCCCcceeEeccCCHHHHHHHHHHHHHHH Confidence 5543333333333222222222211 11111111111 11 223455555555555554444556777778888899 Q ss_pred HHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhh Q lcl|NC_019456. 275 ATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNG 340 (435) Q Consensus 275 a~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~ 340 (435) +...++|......... + .+.++. ...|...+...++.+...+.. .. ....+.+.+.. T Consensus 346 ~~~s~~p~~~~~~~~~-n-~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~-----~~--~~~~i~v~f~~ 416 (492) T protein:vir:97 346 MLFGQAVDFSSDKFGS-A-PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI-----KG--EHKDVDISFNY 416 (492) T ss_pred HHHhCCCCCCcccccc-C-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-----Cc--ccceeeEEecC Confidence 9999988654322211 1 122222 122233333333333332221 11 12334555566 Q ss_pred hhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCC Q lcl|NC_019456. 341 LLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGE 420 (435) Q Consensus 341 l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (435) -...|..+.++++.++ .|+++..-+.+++++-.-+++. ++.+.+.......... .......+.+. T Consensus 417 ~~p~~~~e~a~~~~kl--~G~iS~et~l~~l~~v~d~~~E------------leri~~E~~~~~~~~~-~~~~~~~~~~~ 481 (492) T protein:vir:97 417 NKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAE------------LERIEQEQTEYNKQLP-NLDDGGADSAQ 481 (492) T ss_pred CCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHH------------HHHHHHHHHHHHHhhh-ccccCCCCCCc Confidence 6678889999999988 5889988888888653211111 1111110000000000 00000000111 Q ss_pred CCCCCCCCCCC Q lcl|NC_019456. 421 NTNENGLQSTE 431 (435) Q Consensus 421 ~~~~~~~~~~~ 431 (435) .++++++...+ T Consensus 482 ~~~~~~~~~~e 492 (492) T protein:vir:97 482 QQERSNNKESE 492 (492) T ss_pred ccccccccccC Confidence 11111111111 No 199 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=98.06 E-value=5.6e-06 Score=49.39 Aligned_cols=389 Identities=9% Similarity=-0.021 Sum_probs=164.6 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhc----ccccc-Cc----ccccHHHHhhhHHHHHHHHHHHHHHhhCceee Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMA----GVKLE-QA----TFSREHILESNEYIFSIVTRLSNVLASLPLHE 71 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~-~~----~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~ 71 (435) +-+.+.|..+......... .-.....++.-- .-... .. .......-..++....+|+..+.-+-.-|+.+ T Consensus 23 ~~~~~~i~~~i~~~~~~~~-~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~~ 101 (472) T protein:vir:93 23 ETLEEMIVRYIKQHLEKLP-EISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF 101 (472) T ss_pred hhHHHHHHHHHHHHHHHHH-HHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHHhhhhcccCeee Confidence 1222222222111111000 000000000000 00000 00 00000001123566667787777776667665 Q ss_pred eecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC--CceE Q lcl|NC_019456. 72 YQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD--NNSY 149 (435) Q Consensus 72 ~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~~~ 149 (435) ..++... . ..+..+.. | ........+..+++.+|.||.++..+.. |.+ .+..++|..+.+..+.. +... T Consensus 102 ~~~d~~~-~-~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~d-~~~-~i~~~~p~~~~~i~d~~~~~~~~ 172 (472) T protein:vir:93 102 KHTDDEV-V-KRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEE-GEF-KLFRVPAEQGIPIWTDKEHEELE 172 (472) T ss_pred ccCChHH-H-HHHHHHHh--c---cHHHHHHHHHHHHhhcCeEEEEEEECCC-Cce-EEEEEcccceEEEEcCCCCCceE Confidence 4333222 1 22222321 2 2445566778999999999998877644 664 57778998888877532 2111 Q ss_pred ----EEEEecCCeeEEEchhheEEeccC---------------------CC---------ccccccCcHHHHHHHHHHHH Q lcl|NC_019456. 150 ----WYRVTSDIYNFTIPINDVIHVKHV---------------------VP---------SNSWYGVSPIDVLSSSLKFQ 195 (435) Q Consensus 150 ----~~~~~~~~~~~~~~~~~iih~~~~---------------------~~---------~~~~~G~s~l~~~~~~i~~~ 195 (435) +|..........+....+.++... ++ .+...|.|-+..+...+... T Consensus 173 ~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~s~~e~v~~liDa~ 252 (472) T protein:vir:93 173 AFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAY 252 (472) T ss_pred EEEEEEEeecceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCCCCCCchhhhHHHHHHH Confidence 111111122222333333333211 00 01235778888777777655 Q ss_pred HHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHH Q lcl|NC_019456. 196 RSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIR 273 (435) Q Consensus 196 ~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~ 273 (435) ..+.....+.+... +..++..- . .+........+ + ..+++.++.+.+...+........+....+...+. T Consensus 253 ~~~~s~~~~~~~~~~~~~~~~~g~-~--~~~~~~~~~~~----~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~ 324 (472) T protein:vir:93 253 NRRLSDLSNTFKDSNELTYVLTNY-D--DQELPEFKRLL----R-YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQK 324 (472) T ss_pred HHHHHHHHHHHHHhcCceeEeecC-C--cccchhhHHHH----h-hccccccCCCCcceeEeecCCHHHHHHHHHHHHHH Confidence 54332222222222 22233221 1 11111111111 1 23455555555555555444556677888888899 Q ss_pred HHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechh Q lcl|NC_019456. 274 IATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVN 339 (435) Q Consensus 274 Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~ 339 (435) |+...++|..-.+.... +. +.++. ...|...+...++.+...+.. ... ...+.+.+. T Consensus 325 i~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~-----~~~--~~~i~v~f~ 395 (472) T protein:vir:93 325 IMLFGQAVDFSSDKFGS-AP-SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI-----KGE--HKDVDISFN 395 (472) T ss_pred HHHHhCCCCCCcccccc-Cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-----Ccc--cceeeEEeC Confidence 99999998654433221 11 11221 122233333333333332221 111 123455556 Q ss_pred hhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCC Q lcl|NC_019456. 340 GLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGG 419 (435) Q Consensus 340 ~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 419 (435) .-...|..+.++.+.++ .|+++.--+.+++++-.-++...+.+. -+........... .. .. .+++ T Consensus 396 ~~~p~~~~~~~~~~~k~--~giis~et~l~~l~~~~d~~~E~~ri~-------~E~~~~~~~~~~~--~~---~~-~d~~ 460 (472) T protein:vir:93 396 YNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIE-------QEQMEYNKQLPNL--DD---GG-ADGA 460 (472) T ss_pred CCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHH-------HHHHHHHHhccCc--Cc---cc-CCCC Confidence 66678889999999887 588998888888765321111111110 0000000000000 00 00 0011 Q ss_pred CCCCCCCCCCCCCC Q lcl|NC_019456. 420 ENTNENGLQSTEPE 433 (435) Q Consensus 420 ~~~~~~~~~~~~~~ 433 (435) .++++++++.. | T Consensus 461 ~~~~~~~~~~~--e 472 (472) T protein:vir:93 461 QQQERSNNKES--E 472 (472) T ss_pred CCCCCCCcccC--C Confidence 11111111111 1 No 200 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=98.06 E-value=5.7e-06 Score=49.31 Aligned_cols=383 Identities=9% Similarity=0.040 Sum_probs=165.5 Q ss_pred Cc--hHHHHHhhccccccccccccccchhhhh----hccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec Q lcl|NC_019456. 1 MS--FMSKVRQFFGVHDQANQIVQNPIPQPLD----MAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN 74 (435) Q Consensus 1 Mg--~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 74 (435) |. .+.++.+.......+ . .....++. +...........+.+ ...+.....|+..+.-+-.-|+.+.-. T Consensus 17 ~~~~~i~~~i~~~~~~~~r--~--~~~~~Yy~g~~~i~~~~~~~~~~~~~k--i~~n~~~~ivd~~~~~l~g~~~~~~~~ 90 (452) T protein:vir:36 17 ITVEVVTKFMEKHKLEVAR--Y--EYLKNMYLGIMAIDDEPAKDSWKPDNR--LAVNFTKYIVDTFTGYFNGIPVKKSHS 90 (452) T ss_pred CCHHHHHHHHHHHHHHHHH--H--HHHHHHhccccccccCccccccCccce--eecchHHHHHHHHhhhhcccCceeecC Confidence 21 122221111000000 0 00000000 000000000000111 123455556666666665666665433 Q ss_pred ccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC--ceE--- Q lcl|NC_019456. 75 YKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN--NSY--- 149 (435) Q Consensus 75 ~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~--- 149 (435) +... ...+...+. ..........+....+.+|.+|.++..+. .|.+ .+..++|..+.+..+... ... T Consensus 91 d~~~-~~~l~~~~~-----~n~~~~~~~~~~~~~~~~G~~~~~v~~d~-~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~i 162 (452) T protein:vir:36 91 DKEI-LTKLQEFDN-----LNDMEDEESELAKMACIYGRAFEFLYQDE-DTQT-NVVYNSPENMFMVYDDTVKQEPLFAV 162 (452) T ss_pred ChhH-HHHHHHHHh-----hcChhHHHHHHHHHHHhcCeEEEEEEecC-CCee-EEEEEcccceEEEEcCCCCCceEEEE Confidence 3222 222222322 23455667888899999999998887764 4665 577788988887766532 121 Q ss_pred EEEEecCCe--eEEEchhheEEeccCC-----------C---------ccccccCcHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019456. 150 WYRVTSDIY--NFTIPINDVIHVKHVV-----------P---------SNSWYGVSPIDVLSSSLKFQRSVENFSQNEME 207 (435) Q Consensus 150 ~~~~~~~~~--~~~~~~~~iih~~~~~-----------~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ 207 (435) +++...++. ...+..+.++++.... + .+...|.|-+..+...+.....+.....+.+. T Consensus 163 ~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~ 242 (452) T protein:vir:36 163 RYGVDEDKKLQGEVYTLLETIKISGENDEISFGEGTYNPYPDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVD 242 (452) T ss_pred EEEEecCceEEEEEEecCeEEEEEEcCCceEEecceeccCCcccEEEecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHH Confidence 111111111 1234444444432110 0 11235677776666666554433222222222 Q ss_pred cC-CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCcccccc-----CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019456. 208 KK-DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQE-----AGWKVDRYESKFEPADLSSVEQISRIRIATAFNVP 281 (435) Q Consensus 208 n~-~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~-----~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP 281 (435) .. ....+..+..+.++....++. ++++.++ .+.++..+........+....+...+.|+...++| T Consensus 243 ~~~~p~~~~~g~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p 313 (452) T protein:vir:36 243 YFSDQYLTFLGAAVEEEDLKNIRS---------NRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVA 313 (452) T ss_pred HhcCceeEeecCCcCchhhhhhhh---------cceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCcc Confidence 22 222222333444444333221 1222222 12233333433334456677778888899999998 Q ss_pred HHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHH Q lcl|NC_019456. 282 ISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTA 347 (435) Q Consensus 282 ~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~ 347 (435) ..-.+.. ++. +.++. ...|...+...++.+..-+..+ - ..... ..+.+.+......|.. T Consensus 314 ~~~~~~~--gn~-Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-~-~~~~~--~~i~i~f~~~~p~d~~ 386 (452) T protein:vir:36 314 NISDESF--GSS-SGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNV-S-NKDSW--KDIEYTFTRNEPKDIK 386 (452) T ss_pred ccCcccc--cCC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-C-Ccccc--ccceEEeCCCCCcCHH Confidence 6433221 221 11221 1223333444333333322221 1 11111 1244444556678889 Q ss_pred HHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCC Q lcl|NC_019456. 348 ARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGL 427 (435) Q Consensus 348 ~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (435) +.++.+.++ .|+++.--+.+++++-.-++. -++.+.+... . ........++++++.+... T Consensus 387 ~~a~~~~k~--~g~iS~et~~~~~~~~~d~~~------------E~~ri~~E~~---~---~~~~~~~~~~~~~~~~~~~ 446 (452) T protein:vir:36 387 EQAETANIL--MGITSQETALSVISVIPDVQA------------EMEKIKKEEA---S---TAIFDKDKQPSEKGTDTVV 446 (452) T ss_pred HHHHHHHHH--hccCChHHHHHhCCCCCCHHH------------HHHHHHHHHH---H---HHHHHhhccCCCCcccccC Confidence 999999887 578998778888764321111 1111111110 0 0011222334444444444 Q ss_pred CCCCCC Q lcl|NC_019456. 428 QSTEPE 433 (435) Q Consensus 428 ~~~~~~ 433 (435) +.++.| T Consensus 447 ~~~~~e 452 (452) T protein:vir:36 447 SETNEE 452 (452) T ss_pred ccccCC Confidence 444444 No 201 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=98.01 E-value=7.3e-06 Score=48.75 Aligned_cols=372 Identities=10% Similarity=0.016 Sum_probs=154.4 Q ss_pred CchHHHHHhhccccccccccccccchhhhhh------ccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDM------AGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN 74 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 74 (435) .-++++|...+......- .....++.- .+.... ......-..+.+..-+|+..+..+--..|. . T Consensus 6 ~~~i~~l~~~~~~~~~r~----~~l~~Yy~G~~~i~~~~~~~~---~~~~~~k~~~n~~~~ivd~~~~~l~~~g~~---~ 75 (441) T protein:vir:80 6 LALIEGMYDRIQRLSSWH----CCIEGYYEGSNRVRDLGVAIP---PELQRVQTVVSWPGIAVDALEERLDWLGWT---N 75 (441) T ss_pred HHHHHHHHHHHHHHHHHH----HHHHHHHhcCCcchhcCcccc---hhhhhhhhhcchHHHHHHHHHhhhcccccc---C Confidence 112222222111110000 000000000 000000 000001112233444555554443211122 1 Q ss_pred ccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCc-eE---- Q lcl|NC_019456. 75 YKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNN-SY---- 149 (435) Q Consensus 75 ~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~-~~---- 149 (435) .. ...+..++. .-+.......+..+++.+|.||.++..+. .|.+ .+.+++|..+.+..+.... .. T Consensus 76 ~d---~~~l~~i~~-----~n~~~~~~~~~~~~~~~~G~a~~~v~~d~-~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~~ 145 (441) T protein:vir:80 76 GD---GYGLDGVYA-----ANRLATASCDVHLDALIFGLSFVAIIPHG-DGTV-SVRPQSPKNCTGKFSADGSRLDAGLV 145 (441) T ss_pred CC---hHHHHHHHH-----hcCHHHHHHHHHHHHhhcCeeEEEEEeCC-CCce-EEEEEccceEEEEEeCCCCceeEEEE Confidence 11 122333322 23467778889999999999999887764 4766 5788999988877664321 11 Q ss_pred EEEEecCCe--eEEEchhh--------------------------eEEeccCCCccccccCcHHHH-HHHHHHHHHHHH- Q lcl|NC_019456. 150 WYRVTSDIY--NFTIPIND--------------------------VIHVKHVVPSNSWYGVSPIDV-LSSSLKFQRSVE- 199 (435) Q Consensus 150 ~~~~~~~~~--~~~~~~~~--------------------------iih~~~~~~~~~~~G~s~l~~-~~~~i~~~~~~~- 199 (435) .|....++. ...|.++. |+||.+.......+|.|-+.. +...+....... T Consensus 146 ~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s 225 (441) T protein:vir:80 146 VQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLL 225 (441) T ss_pred EEEEecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHH Confidence 010001110 11122222 344443333344567775532 333333322211 Q ss_pred H--HHHHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccC-----CceeeeccCChhhHHHHHHHHHHHH Q lcl|NC_019456. 200 N--FSQNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEA-----GWKVDRYESKFEPADLSSVEQISRI 272 (435) Q Consensus 200 ~--~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~-----g~~~~~~~~~~~~~~~~e~~~~~~~ 272 (435) + ....++..... ++. +..++++.... +. ...+++..++. +.++..+.....+ .+.+..+.... T Consensus 226 ~~~~~~~~~~~~~~-~i~-G~~~~~~~~~~----~~---~~~~~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~l~~~i~ 295 (441) T protein:vir:80 226 GQSVNRDFYAYPQR-WVT-GVSADEFSQPG----WV---LSMASVWAVDKDDDGDTPNVGSFPVNSPT-PYSDQMRLLAQ 295 (441) T ss_pred HHHHHHHhhcCcee-eee-cCCccccccch----hh---hcccccccCCCCCCCCcceeEecCccchH-HHHHHHHHHHH Confidence 1 12223332222 222 22222221111 11 11233443332 2455544433322 36777777889 Q ss_pred HHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeech Q lcl|NC_019456. 273 RIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNV 338 (435) Q Consensus 273 ~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~ 338 (435) .|+..-++|+..+|....+..+ .+++ ...|...|.-.++.+...+....- .......+++.+ T Consensus 296 ~~~~~~~~p~~~~g~~~~~~~S-g~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~---~~~~~~~i~~~f 371 (441) T protein:vir:80 296 LTAGEAAVPERYFGFITSNPPS-GEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVD---EADFFGDVGLRW 371 (441) T ss_pred HHhcccCCCHHHhccCCCcchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc---ccccceeeeEEe Confidence 9999999999999876543221 2221 122222333333222222221111 011123455556 Q ss_pred hhhhccCHHHHHHHHHHHHhcCCcC--HHHHHHHhCCCCCCCcCCceeeecccccchhcccccccccccccccccccccc Q lcl|NC_019456. 339 NGLLRGDTAARTQYYQTLTRNGIFK--PNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQ 416 (435) Q Consensus 339 ~~l~~~d~~~~~~~~~~~~~~g~~t--~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~ 416 (435) ......+..+.++.+.+++.+|+.. ..-+++.+|+.+- + ..++. . +.. ...+.-.+........+. T Consensus 372 ~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~--e-~~~~~--~-----e~~--e~~~~~~~~~~~~~~~~~ 439 (441) T protein:vir:80 372 RDASTPTRAATADAVTKLVGAGILPADSRTVLEMLGLDDV--Q-VEAVM--R-----HRA--ESSDPLAVLAGAISRQTN 439 (441) T ss_pred CCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHH--H-HHHHH--H-----HHH--HHHHHHHHHhhhhhcccc Confidence 6677888999999999999999764 3347777777542 1 11110 0 000 000000000000000111 Q ss_pred CC Q lcl|NC_019456. 417 EG 418 (435) Q Consensus 417 ~~ 418 (435) +. T Consensus 440 ~~ 441 (441) T protein:vir:80 440 EV 441 (441) T ss_pred cC Confidence 11 No 202 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=97.95 E-value=9.4e-06 Score=48.15 Aligned_cols=404 Identities=10% Similarity=0.088 Sum_probs=159.8 Q ss_pred CchH--------------HHHHhhccccccccccccccchhhhh----hccccccCcccccHHHHhhhHHHHHHHHHHHH Q lcl|NC_019456. 1 MSFM--------------SKVRQFFGVHDQANQIVQNPIPQPLD----MAGVKLEQATFSREHILESNEYIFSIVTRLSN 62 (435) Q Consensus 1 Mg~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~ 62 (435) |.+. ++|............ .-.....++. +...........+. -..++....+|+..+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~-~~~~l~~Yy~g~~~i~~~~~~~~~~~~~--ki~~n~~~~Iv~~~~~ 77 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAINYAIRELQNRKK-RLDKLSDYYNGKQEIEKHEFDNATVEAA--NVMVNHAKYITDMNVG 77 (499) T ss_pred CccchhhhHHhhhhcCCHHHHHHHHHHHHHHHH-HHHHHHHHhccccchhcCCcCcCCCCcc--eeecchHHHHHHHHhh Confidence 2221 111111100000000 0000000000 00000000000011 1123444556666666 Q ss_pred HHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcE--------------- Q lcl|NC_019456. 63 VLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEP--------------- 127 (435) Q Consensus 63 ~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~--------------- 127 (435) -+-.-|+.+.-+.... .+.+...+. ......+...+....+.+|.+|.++..+.. |.+ T Consensus 78 ~l~g~p~~~~~~~~~~-~~~l~~~~~-----~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~-g~~~~~~~~~~~~~~~~~ 150 (499) T protein:vir:10 78 FMTGNPVKYVAEKGKN-IDDILEVFN-----QIDIHKHDIELEKDLSVFGYGYELLYLKKT-DPISVRDELGNEKLTPNT 150 (499) T ss_pred hhcccCceeecCChhH-HHHHHHHHh-----hcCHhHHHHHHHHHHHhcCceEEEEEeccc-cccccccccccccccccc Confidence 6655666654332222 223333332 224556788889999999999998876643 322 Q ss_pred -EEEEEeCCceeEEEEcCCCce------EEEEEe-cC-Ce----eEEEchhheEEeccCC----------------Cc-- Q lcl|NC_019456. 128 -IALWPLDPNTVSILRNTDNNS------YWYRVT-SD-IY----NFTIPINDVIHVKHVV----------------PS-- 176 (435) Q Consensus 128 -~~l~~l~~~~v~~~~~~~~~~------~~~~~~-~~-~~----~~~~~~~~iih~~~~~----------------~~-- 176 (435) ..+..++|..+.+..+..... .+|... .. +. ...+.++.|.+++... +. T Consensus 151 ~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 230 (499) T protein:vir:10 151 ELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYDGENLFGA 230 (499) T ss_pred ceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcceecccccCCCCc Confidence 346778887776666543211 112111 11 11 1234555555543111 10 Q ss_pred -------cccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCcccc--c Q lcl|NC_019456. 177 -------NSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVV--Q 245 (435) Q Consensus 177 -------~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~v--l 245 (435) +...|.|-+..+...+.....+.....+.+... +..+++ +..+.++... ...+ ..+.+.. . T Consensus 231 vPvv~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~-G~~~~~~~~~--~~~~-----~~~~~~~~~~ 302 (499) T protein:vir:10 231 VPIIEFRNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTF-GFGLGDDKDD--IQRL-----KRGAIEAPPR 302 (499) T ss_pred cceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeee-cCccccccch--hhhh-----hhcceeccCC Confidence 123466666666666665443322222222222 222222 2222221100 0011 1122322 3 Q ss_pred cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHH--------------HHHHHHHHHhHHH Q lcl|NC_019456. 246 EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEH--------------VTHSWTMTLMPII 311 (435) Q Consensus 246 ~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~--------------~~~~~~~~i~P~~ 311 (435) +++.+++.+........+....+...+.|....++|..-..... ++. +..+ ....|...+...+ T Consensus 303 ~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-gn~-Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~ 380 (499) T protein:vir:10 303 EEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFM-GNV-SGEAMKFKLFGLENLLSIKQRYFFDGLRRRL 380 (499) T ss_pred CCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhc-ccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 45566666655444455667777778888888888753221111 111 1111 1223334444444 Q ss_pred HHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeeccccc Q lcl|NC_019456. 312 RQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLY 391 (435) Q Consensus 312 ~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~ 391 (435) +.+...++.+- ...... .+++.+..-...|..+.++.+.++ .|+++.--++++++.-.-+++..+++.--.. . T Consensus 381 ~li~~~~~~~~--~~~d~~--~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~-~ 453 (499) T protein:vir:10 381 KLIQTIVNIKG--ANDDAS--GCKISLVANIPSNLSDVVNNVKNA--DGIIPRKYTYSWLPDVDNPQDVIDEMNQQDA-E 453 (499) T ss_pred HHHHHHHhccC--Cccccc--cceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHH-H Confidence 44444333211 111111 234444555577889999999998 5889998888887543211111111100000 0 Q ss_pred chhccccccccccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 392 PLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 392 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) ......+.. .+..+........+.+..+ +++...+++.+| T Consensus 454 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~ 493 (499) T protein:vir:10 454 TIKKNQEAL-RGQDPDRLELEDKQDDSSE---NDKEAGSNHNQS 493 (499) T ss_pred HHHHHHhhh-ccCCCCCCCCCCCCcccCC---CCCCCccccccC Confidence 000000000 0000000000000000000 111111122222 No 203 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=97.88 E-value=1.3e-05 Score=47.36 Aligned_cols=386 Identities=9% Similarity=-0.020 Sum_probs=158.5 Q ss_pred CchHHHHHhhccccccccccccccchhhhh-------------hccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhC Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLD-------------MAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASL 67 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~ 67 (435) .-..++|.++......... .-.....++. ..+..... ..+. -..++.....|+..+.-+-.- T Consensus 26 ~~~~~~i~~~i~~~~~~~~-~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~--~~~~--ki~~n~~k~Iv~~~~~yl~g~ 100 (474) T protein:vir:96 26 ETQEEMIIRLINNHKQKLK-DINVGQKYYDKDNDINYQAYKQDLHGNIDYT--KPDW--RITTNFHQNLVDQKVSYVAGK 100 (474) T ss_pred cchHHHHHHHHHHHHHHHH-HHHHHHHHhcccCccccccchhhhccccccc--cccc--ccccchHHHHHHhhhhhhccc Confidence 0011111111100000000 0000000000 00000000 0000 012344555667777666677 Q ss_pred ceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC-- Q lcl|NC_019456. 68 PLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD-- 145 (435) Q Consensus 68 ~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~-- 145 (435) |+.+..+.... ...+.. ++. .........+..+++.+|.||.++.++. .|.+ .+..++|..+-+..+.. T Consensus 101 p~~~~~~~~~~-~~~l~~-~~~-----n~~~~~~~~l~~~~~~~G~~~~~~~~d~-~~~~-~i~~~~p~~~~~v~d~~~~ 171 (474) T protein:vir:96 101 PVTYAHDDDKV-LDVIHQ-VLD-----TRWDNKLIDILTAASNKGIDWLQVYINE-DGEL-KLFRVPAEQAIPIWTDKER 171 (474) T ss_pred CceeccCChHH-HHHHHH-HHh-----ccHHHHHHHHHHHHhhCCeEEEEeeeCC-CCce-EEEEEcccceEEEEcCCCC Confidence 77764333222 222222 321 2355666778899999999999887764 4664 67788998888776543 Q ss_pred CceE----EEEEecCCeeEEEchhheEEeccCC---------------------C---------ccccccCcHHHHHHHH Q lcl|NC_019456. 146 NNSY----WYRVTSDIYNFTIPINDVIHVKHVV---------------------P---------SNSWYGVSPIDVLSSS 191 (435) Q Consensus 146 ~~~~----~~~~~~~~~~~~~~~~~iih~~~~~---------------------~---------~~~~~G~s~l~~~~~~ 191 (435) +..+ +|..........+..+.|.++.... + .+...|.|-+..+... T Consensus 172 ~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~l 251 (474) T protein:vir:96 172 EQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSF 251 (474) T ss_pred CceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHH Confidence 2211 1111111112234455555543210 0 0223467777777776 Q ss_pred HHHHHHHHHHHHHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHH Q lcl|NC_019456. 192 LKFQRSVENFSQNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISR 271 (435) Q Consensus 192 i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~ 271 (435) +.....+..-..+.+......++.+.+. ..+........+ ...+++.++++.++..+........+....+... T Consensus 252 iDa~d~~~S~~~~~~~~~~~p~lv~~g~-~~~~~~~~~~~~-----~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~ 325 (474) T protein:vir:96 252 VDAIDKRLSDVQNMFDESVELIYILRGY-EGEDLSEFMEGL-----KYYKAINVSSDGGVETIQVEVPVASTKEYLDMMR 325 (474) T ss_pred HHHHHHHHHHHHHHHHHhhcchhhhcCC-Ccccccchhhhh-----hccceeeccCCCceeEEeccCCHHHHHHHHHHHH Confidence 6665433322222222221112222221 111111111211 1234555666655555555545556777778888 Q ss_pred HHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeec Q lcl|NC_019456. 272 IRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFN 337 (435) Q Consensus 272 ~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd 337 (435) ..|....++|........ ++. +..+. ...|...+...++.+...+. ..... ..+.+. T Consensus 326 ~~I~~~s~~p~~~~~~~~-~n~-Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g-----~~~d~--~~i~i~ 396 (474) T protein:vir:96 326 AYIVEFGQGVDFQTDKFG-SAT-SGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNK-----IKLDA--KEIEIT 396 (474) T ss_pred HHHHHHhCCcCccccccc-ccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCccc--ceeeEE Confidence 899999999865432221 111 22221 12223333333333322211 11111 234444 Q ss_pred hhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccC Q lcl|NC_019456. 338 VNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQE 417 (435) Q Consensus 338 ~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 417 (435) +..-...+..+.++.+ .+.|+++.-.+++++++-.-++... +.+.+.... ............ T Consensus 397 f~~~~p~~~~e~a~~~---~~~giiS~et~~~~lp~v~D~~~E~------------eri~~E~~~---~~~~~~~~~~~~ 458 (474) T protein:vir:96 397 FNFNVMVNDLEQSQIG---AQSQYLSKETLVRHHPWVDDPKAEL------------ERLDEEQLE---LNKQLPNLDDGG 458 (474) T ss_pred ecCCCccCHHHHHHHH---HHcCCCChHHHHHhCCCCCCHHHHH------------HHHHHHHHH---HHhhcccccccc Confidence 4555566666666654 4579999988988886432111111 111110000 000000000000 Q ss_pred CCCCCCCCCCCCCCCC Q lcl|NC_019456. 418 GGENTNENGLQSTEPE 433 (435) Q Consensus 418 ~~~~~~~~~~~~~~~~ 433 (435) .+...++.+.+.++++ T Consensus 459 ~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 459 ADGAQQQQQSENNQSK 474 (474) T ss_pred CCCCCCcCCCCccccC Confidence 0000011111111111 No 204 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=97.88 E-value=1.3e-05 Score=47.36 Aligned_cols=386 Identities=9% Similarity=-0.020 Sum_probs=158.5 Q ss_pred CchHHHHHhhccccccccccccccchhhhh-------------hccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhC Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLD-------------MAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASL 67 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~ 67 (435) .-..++|.++......... .-.....++. ..+..... ..+. -..++.....|+..+.-+-.- T Consensus 26 ~~~~~~i~~~i~~~~~~~~-~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~--~~~~--ki~~n~~k~Iv~~~~~yl~g~ 100 (474) T protein:vir:95 26 ETQEEMIIRLINNHKQKLK-DINVGQKYYDKDNDINYQAYKQDLHGNIDYT--KPDW--RITTNFHQNLVDQKVSYVAGK 100 (474) T ss_pred cchHHHHHHHHHHHHHHHH-HHHHHHHHhcccCccccccchhhhccccccc--cccc--ccccchHHHHHHhhhhhhccc Confidence 0011111111100000000 0000000000 00000000 0000 012344555667777666677 Q ss_pred ceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC-- Q lcl|NC_019456. 68 PLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD-- 145 (435) Q Consensus 68 ~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~-- 145 (435) |+.+..+.... ...+.. ++. .........+..+++.+|.||.++.++. .|.+ .+..++|..+-+..+.. T Consensus 101 p~~~~~~~~~~-~~~l~~-~~~-----n~~~~~~~~l~~~~~~~G~~~~~~~~d~-~~~~-~i~~~~p~~~~~v~d~~~~ 171 (474) T protein:vir:95 101 PVTYAHDDDKV-LDVIHQ-VLD-----TRWDNKLIDILTAASNKGIDWLQVYINE-DGEL-KLFRVPAEQAIPIWTDKER 171 (474) T ss_pred CceeccCChHH-HHHHHH-HHh-----ccHHHHHHHHHHHHhhCCeEEEEeeeCC-CCce-EEEEEcccceEEEEcCCCC Confidence 77764333222 222222 321 2355666778899999999999887764 4664 67788998888776543 Q ss_pred CceE----EEEEecCCeeEEEchhheEEeccCC---------------------C---------ccccccCcHHHHHHHH Q lcl|NC_019456. 146 NNSY----WYRVTSDIYNFTIPINDVIHVKHVV---------------------P---------SNSWYGVSPIDVLSSS 191 (435) Q Consensus 146 ~~~~----~~~~~~~~~~~~~~~~~iih~~~~~---------------------~---------~~~~~G~s~l~~~~~~ 191 (435) +..+ +|..........+..+.|.++.... + .+...|.|-+..+... T Consensus 172 ~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~l 251 (474) T protein:vir:95 172 EQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSF 251 (474) T ss_pred CceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHH Confidence 2211 1111111112234455555543210 0 0223467777777776 Q ss_pred HHHHHHHHHHHHHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHH Q lcl|NC_019456. 192 LKFQRSVENFSQNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISR 271 (435) Q Consensus 192 i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~ 271 (435) +.....+..-..+.+......++.+.+. ..+........+ ...+++.++++.++..+........+....+... T Consensus 252 iDa~d~~~S~~~~~~~~~~~p~lv~~g~-~~~~~~~~~~~~-----~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~ 325 (474) T protein:vir:95 252 VDAIDKRLSDVQNMFDESVELIYILRGY-EGEDLSEFMEGL-----KYYKAINVSSDGGVETIQVEVPVASTKEYLDMMR 325 (474) T ss_pred HHHHHHHHHHHHHHHHHhhcchhhhcCC-Ccccccchhhhh-----hccceeeccCCCceeEEeccCCHHHHHHHHHHHH Confidence 6665433322222222221112222221 111111111211 1234555666655555555545556777778888 Q ss_pred HHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeec Q lcl|NC_019456. 272 IRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFN 337 (435) Q Consensus 272 ~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd 337 (435) ..|....++|........ ++. +..+. ...|...+...++.+...+. ..... ..+.+. T Consensus 326 ~~I~~~s~~p~~~~~~~~-~n~-Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g-----~~~d~--~~i~i~ 396 (474) T protein:vir:95 326 AYIVEFGQGVDFQTDKFG-SAT-SGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNK-----IKLDA--KEIEIT 396 (474) T ss_pred HHHHHHhCCcCccccccc-ccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCccc--ceeeEE Confidence 899999999865432221 111 22221 12223333333333322211 11111 234444 Q ss_pred hhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccC Q lcl|NC_019456. 338 VNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQE 417 (435) Q Consensus 338 ~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 417 (435) +..-...+..+.++.+ .+.|+++.-.+++++++-.-++... +.+.+.... ............ T Consensus 397 f~~~~p~~~~e~a~~~---~~~giiS~et~~~~lp~v~D~~~E~------------eri~~E~~~---~~~~~~~~~~~~ 458 (474) T protein:vir:95 397 FNFNVMVNDLEQSQIG---AQSQYLSKETLVRHHPWVDDPKAEL------------ERLDEEQLE---LNKQLPNLDDGG 458 (474) T ss_pred ecCCCccCHHHHHHHH---HHcCCCChHHHHHhCCCCCCHHHHH------------HHHHHHHHH---HHhhcccccccc Confidence 4555566666666654 4579999988988886432111111 111110000 000000000000 Q ss_pred CCCCCCCCCCCCCCCC Q lcl|NC_019456. 418 GGENTNENGLQSTEPE 433 (435) Q Consensus 418 ~~~~~~~~~~~~~~~~ 433 (435) .+...++.+.+.++++ T Consensus 459 ~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 459 ADGAQQQQQSENNQSK 474 (474) T ss_pred CCCCCCcCCCCccccC Confidence 0000011111111111 No 205 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=97.88 E-value=1.3e-05 Score=47.34 Aligned_cols=379 Identities=12% Similarity=0.074 Sum_probs=157.1 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) +.-+.++.+...+... ........ ...+ .... ...+ .-..++....+|+..+.-+-.-|+.+..+.... . T Consensus 42 ~~~~~~~~~yY~g~~~---i~~~~~~~--~~~~-~~~~-~~~~--~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~-~ 111 (478) T protein:vir:10 42 IDNITMGERYYNHHPD---ILDAPPKR--DVNG-DYDE-TKPD--WRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKA-L 111 (478) T ss_pred HHHHHHHHHHhcCCCc---hhcccccc--cccc-cccc-cccc--ceeccchHHHHHHHHHhhhccCCeeeecCChHH-H Confidence 1111122222221100 00000000 0000 0000 0000 001233445566766666666676654333222 2 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC--CceE----EEEEe Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD--NNSY----WYRVT 154 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~~~----~~~~~ 154 (435) ..+...+. | ...+....++.+++.+|.+|.++..+. .|.+ .+..++|..+.+..+.. +... +|... T Consensus 112 ~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~~~~d~-~g~~-~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~ 183 (478) T protein:vir:10 112 KQIQHTLN---H---KWDDKLVDILTAASNKGIEWVQPYVDE-EGEF-KTFRVPAEQAVPIWTNKERDELQAFIRVYELD 183 (478) T ss_pred HHHHHHHh---c---CHHHHHHHHHHHHHhcCeEEEEEEecC-CCee-EEEEEcccceEEEEcCCCCCceEEEEEEEEec Confidence 22333221 2 456667778899999999998877664 4665 57778898888776542 2111 12211 Q ss_pred cCCeeEEEchhheEEeccC-------------------------CCc---------cccccCcHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 155 SDIYNFTIPINDVIHVKHV-------------------------VPS---------NSWYGVSPIDVLSSSLKFQRSVEN 200 (435) Q Consensus 155 ~~~~~~~~~~~~iih~~~~-------------------------~~~---------~~~~G~s~l~~~~~~i~~~~~~~~ 200 (435) .......+.++.|.++... ++. +...|.|-+..+...+.....+.. T Consensus 184 ~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~sd~~~v~~liDa~~~~~S 263 (478) T protein:vir:10 184 GAERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLS 263 (478) T ss_pred CceEEEEEeCCeEEEEEEcCCeeeccccccccccccceecccccccCCccceEEeccCCCCCCcHHHHHHHHHHHHHHHH Confidence 1112223334444433221 110 234578877776666665554332 Q ss_pred HHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccc--cCCceeeeccCChhhHHHHHHHHHHHHHHHH Q lcl|NC_019456. 201 FSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQ--EAGWKVDRYESKFEPADLSSVEQISRIRIAT 276 (435) Q Consensus 201 ~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl--~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~ 276 (435) ...+.+... +..++. +....+ .+..... .+. .+++.+ +.|.++..+........+....+...+.|.. T Consensus 264 ~~~~~~~~~~~p~~~~~-g~~~~~--~~~~~~~----~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~ 335 (478) T protein:vir:10 264 DTQNTFDESVELIYILK-GYEGED--MKDFMHN----LKY-YKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIE 335 (478) T ss_pred HHHHHHHHhhCceeeee-cCCccc--cchhhhh----hhh-cceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHH Confidence 222222222 222222 211111 1111111 111 222222 2333433343333344566777788888999 Q ss_pred HhCCCHHHhCCcccCcccHHHHHH--------------HHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhh Q lcl|NC_019456. 277 AFNVPISFLNDDQAKSTTNVEHVT--------------HSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLL 342 (435) Q Consensus 277 ~fgvP~~~lg~~~~~~~~~~e~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~ 342 (435) ..++|..-..... ++. +..+.. ..|...+...++.+... ........ .+++.+..-. T Consensus 336 ~s~~p~~~~~~~~-~n~-Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~-----~g~~~~~~--~i~i~f~~~~ 406 (478) T protein:vir:10 336 FGQGVDFQQDKFG-NSP-SGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDF-----YRLDVKVQ--DIEITFNFNV 406 (478) T ss_pred HhCccccCccccc-ccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hCCCcccc--cceEEecCCC Confidence 9998865432221 111 222211 12222233322222221 11111112 2344445556 Q ss_pred ccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCC Q lcl|NC_019456. 343 RGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENT 422 (435) Q Consensus 343 ~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 422 (435) ..|..+.++.+.++ .|+++...+++++++-.-++... +.+.+........ ..+..++...+ T Consensus 407 p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~------------~ri~~E~~~~~~~-----~~~~~~~~~~~ 467 (478) T protein:vir:10 407 MVNELENSQIAMNS--TGLLSKETILSNHAWVEDPVAEM------------ERIEQENIELNQQ-----LPDIEEGLNGE 467 (478) T ss_pred CCCHHHHHHHHHHH--hCCCChHHHHHhCCCCCCHHHHH------------HHHHHHHHHHHhh-----ccccccccCCC Confidence 67889999998887 68999988999886532111111 1111110000000 00000111111 Q ss_pred CCCCCCCCCCC Q lcl|NC_019456. 423 NENGLQSTEPE 433 (435) Q Consensus 423 ~~~~~~~~~~~ 433 (435) .+++++..++| T Consensus 468 ~~~~~~~~~~~ 478 (478) T protein:vir:10 468 QQRQSENNQPE 478 (478) T ss_pred CCCCCCCCCCC Confidence 11111222222 No 206 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=97.78 E-value=2e-05 Score=46.37 Aligned_cols=378 Identities=11% Similarity=0.021 Sum_probs=166.5 Q ss_pred CchHHHHHhhccccccccccccccchhhhh----h--------c-cccccCcccccHHHHhhhHHHHHHHHHHHHHHhhC Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLD----M--------A-GVKLEQATFSREHILESNEYIFSIVTRLSNVLASL 67 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~--------~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~ 67 (435) .-+.+.+..+......+. . .....++. . . +.........+ .-..++....+|+..+.-+-.- T Consensus 22 ~~~~~~i~~~~~~~~~~~-~--~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~--~ki~~~~~~~Ivd~~~~~l~g~ 96 (479) T protein:vir:79 22 INLVKVIEHYILKHRPEK-Y--KQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVN--NKAINNYHKLLVDQKVGYSVGN 96 (479) T ss_pred hHHHHHHHHHHhhhhHHH-H--HHHHHHhccCCcccccccccccccccccccccCc--ceeecchHHHHHHHHHhhhhcC Confidence 111222222211111000 0 00000000 0 0 00000000000 0112445555677777666666 Q ss_pred ceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC- Q lcl|NC_019456. 68 PLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN- 146 (435) Q Consensus 68 ~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~- 146 (435) |+.+..+.... ..+...+.. | ........++...+.+|.+|.++..+. .|.+ .+..++|..+.+..+... T Consensus 97 p~~~~~~~~~~--~~~~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~-~~~~-~i~~~~p~~~~~v~d~~~~ 167 (479) T protein:vir:79 97 PIVFNADDDNL--TKLLNDLLG--E---EFDDTITELYLNASNKGVEWLHPYINR-KGEF-KYVIIPAEEAIPIWDSKRQ 167 (479) T ss_pred CceeccCCHHH--HHHHHHHHh--c---CHHHHHHHHHHHHHhcCeEEEEEEeCC-CCce-EEEEEccceeEEEEeCCCC Confidence 76664333221 223344432 2 355666788899999999998887664 4665 477889988888765432 Q ss_pred -ceE----EEE-EecCCee----EEEchhheEEeccCC------------------------------C---------cc Q lcl|NC_019456. 147 -NSY----WYR-VTSDIYN----FTIPINDVIHVKHVV------------------------------P---------SN 177 (435) Q Consensus 147 -~~~----~~~-~~~~~~~----~~~~~~~iih~~~~~------------------------------~---------~~ 177 (435) ... +|. ...++.. ..+..+.+.|++... + .+ T Consensus 168 ~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n 247 (479) T protein:vir:79 168 RELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKN 247 (479) T ss_pred CceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCcccEEEecC Confidence 111 122 2122211 134455555543110 0 01 Q ss_pred ccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEe-CCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeec Q lcl|NC_019456. 178 SWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQY-DRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRY 254 (435) Q Consensus 178 ~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~-~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~ 254 (435) ...|.|-+..+...+.....+.....+.+... +-.++.. +....++.... . ..++++.++++.+++.+ T Consensus 248 n~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~--------~-~~~~~i~~~~~~~~~~l 318 (479) T protein:vir:79 248 NEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFIDN--------I-RYYKSIKVDGGGGVDKL 318 (479) T ss_pred CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchhh--------h-hhccceecCCCCcceEE Confidence 23477777776666665544332222223322 2222222 11222221111 1 12345556666555555 Q ss_pred cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHH Q lcl|NC_019456. 255 ESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNM 320 (435) Q Consensus 255 ~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~ 320 (435) ........+....+...+.|....++|..-.+.. ++. +.++. ...|...+...++.+...+.. T Consensus 319 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--gn~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 395 (479) T protein:vir:79 319 EINIPVEAKKELLDRLEKNIIIFGQGVNPESQNT--GDK-SGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKI 395 (479) T ss_pred eccCCHHHHHHHHHHHHHHHHHHhCccccccccc--cch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 5444445566777777888988888887644322 221 11221 222333444444444443332 Q ss_pred hhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccc Q lcl|NC_019456. 321 KLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAI 400 (435) Q Consensus 321 ~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~ 400 (435) +-.. ......+.+.+..-...|.++.++.+.++ .|+++...+.+++++- +++- + -++.+.+.. T Consensus 396 ~~~~---~~~~~~i~i~f~~~~p~~~~~~a~~~~kl--~g~iS~et~l~~l~~v--~d~~-~---------E~~ri~~E~ 458 (479) T protein:vir:79 396 SGNK---SYDYKTVQITFNHSMIINEAEKIDMAAKS--TGIVSDETIVSNHPWV--EDVN-D---------ELERLKKQE 458 (479) T ss_pred cCCC---ccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCC--CCHH-H---------HHHHHHHHH Confidence 2111 11122345555566677889999999888 4889988888887542 2111 0 111111110 Q ss_pred ccccccccccccccccCCCCCCCCC Q lcl|NC_019456. 401 LDNKIQTDASVAAPKQEGGENTNEN 425 (435) Q Consensus 401 ~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (435) . ...+..... ....++..+++ T Consensus 459 ~---~~~~~~~~~-~~~~~~~~~e~ 479 (479) T protein:vir:79 459 D---TQKEYDDLI-PNNQDGVIDET 479 (479) T ss_pred H---HHHHHHhcc-CcccCCCcCcC Confidence 0 000000000 01111111111 No 207 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=97.70 E-value=2.7e-05 Score=45.64 Aligned_cols=379 Identities=11% Similarity=0.064 Sum_probs=159.4 Q ss_pred CchHH--------------HHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhh Q lcl|NC_019456. 1 MSFMS--------------KVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLAS 66 (435) Q Consensus 1 Mg~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~ 66 (435) =-++. ++.+...+.. ..... .......+ .... ...+ .-..++....+|+..+.-+-. T Consensus 29 ~~~i~~~i~~~~~~~~~~~~~~~YY~g~~---~i~~~--~~~~~~~~-~~~~-~~~~--~ki~~n~~k~Ivd~~~~~l~g 99 (474) T protein:vir:94 29 EEMIVRLIDDHRKQLDKITVGQRYYDKDN---DIVKQ--MKKVDVHG-NIDY-DKPD--WRITTNFHQNLVDQKVSYVAS 99 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccc---chhcc--cchhcccc-cccc-ccCc--ceeecchHHHHHHHHHhhhhc Confidence 01111 1111111100 00000 00000000 0000 0000 001234555677777777767 Q ss_pred CceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC- Q lcl|NC_019456. 67 LPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD- 145 (435) Q Consensus 67 ~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~- 145 (435) -|+.+..++... ......++. .........+....+.+|.+|.++..+. .|.+ .+..++|..+.+..+.. T Consensus 100 ~p~~~~~~d~~~--~~~l~~~~~-----n~~~~~~~e~~~~~~~~G~~~~~~~~d~-~~~~-~i~~~~p~~~~~v~d~~~ 170 (474) T protein:vir:94 100 KPVTYSCEDENV--LKVIHDVLD-----TRWDNKLIDILTATSNKGIDWLQVYINE-NGEM-KLFRVPAEQAIPIWVDKE 170 (474) T ss_pred CCceeccCcHHH--HHHHHHHHh-----ccHHHHHHHHHHHHhhcCceEEEEEecC-CCee-EEEEEcccceEEEEcCCC Confidence 777764333221 122333331 2345666777899999999998887764 4654 57778898888877643 Q ss_pred -CceE----EEEEecCCeeEEEchhheEEeccCC---------------------C---------ccccccCcHHHHHHH Q lcl|NC_019456. 146 -NNSY----WYRVTSDIYNFTIPINDVIHVKHVV---------------------P---------SNSWYGVSPIDVLSS 190 (435) Q Consensus 146 -~~~~----~~~~~~~~~~~~~~~~~iih~~~~~---------------------~---------~~~~~G~s~l~~~~~ 190 (435) +... +|..........+..+.+.+++... + .+...|.|-+..+.. T Consensus 171 ~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~ 250 (474) T protein:vir:94 171 REELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKS 250 (474) T ss_pred CCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecCCcCCCCcHHHHHH Confidence 2211 1111111112233344443332100 0 012367888877777 Q ss_pred HHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHH Q lcl|NC_019456. 191 SLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQ 268 (435) Q Consensus 191 ~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~ 268 (435) .+.....+.....+.+... +..++. +....+ .+..... . ...+++.++++.+++.+........+....+ T Consensus 251 liDa~n~~~s~~~~~~~~~~~~~lv~~-g~~~~~--~~~~~~~----~-~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~ 322 (474) T protein:vir:94 251 IIDAIDKRLSDAQNMFDESVELIYILK-GYEGED--LEEFMRG----L-KYYKAINVDGDGGVETIQVEVPVSSTKEYID 322 (474) T ss_pred HHHHHHHHHHHHHHHHHHhcCceeeee-cCCccc--chhhhhh----h-hccceeeccCCCceeEEeecCCHHHHHHHHH Confidence 7776554333333233222 222222 111111 1111111 1 1345666666666665555544455667777 Q ss_pred HHHHHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCccee Q lcl|NC_019456. 269 ISRIRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYF 334 (435) Q Consensus 269 ~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i 334 (435) ...+.|...-++|..-..... ++. +..+. ...|...+...++.+.+.+.. ........+ T Consensus 323 ~l~~~I~~~s~~p~~~~~~~~-~n~-Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~-----~~d~~~i~v 395 (474) T protein:vir:94 323 LMRVYIMEFGQGVDFQTDKFG-SAP-SGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNL-----KTDVKDIEI 395 (474) T ss_pred HHHHHHHHHhCccccCccccc-ccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-----CcccceeeE Confidence 778889998888854322111 111 11221 223333444444443332221 111122234 Q ss_pred eechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccccccccccccccccc Q lcl|NC_019456. 335 SFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAP 414 (435) Q Consensus 335 ~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~ 414 (435) .| +.-...+..+.++ .+++.|+++.--++++++.- +++- ++ ++.+.+..... . ...+ T Consensus 396 ~f--~~~~p~~~~e~a~---~~~~~g~iS~et~l~~l~~v--~D~~-~E---------~eri~~E~~~~---~---~~~~ 452 (474) T protein:vir:94 396 SF--NFNRMMNDAEQSQ---IIAQSQYLSRETLVKSSPLV--DDYK-AE---------LERIEQEQMEY---N---KQLP 452 (474) T ss_pred Ee--ccCcccCHHHHHH---HHHHcCCCCHHHHHHhCCCC--CCHH-HH---------HHHHHHHHHHH---H---hhcc Confidence 44 4444455554444 45567999998888888542 2111 01 11111100000 0 0011 Q ss_pred ccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 415 KQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 415 ~~~~~~~~~~~~~~~~~~~~~ 435 (435) ...+++.+++..+...+++.+ T Consensus 453 ~~~~~~~~~~~~~~~~~~~~~ 473 (474) T protein:vir:94 453 NLDDGGADGAQQQEGSNNKES 473 (474) T ss_pred ccCCCCCCCcccCCCCccccc Confidence 111111111111222222222 No 208 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=97.70 E-value=2.7e-05 Score=45.64 Aligned_cols=379 Identities=11% Similarity=0.064 Sum_probs=159.4 Q ss_pred CchHH--------------HHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhh Q lcl|NC_019456. 1 MSFMS--------------KVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLAS 66 (435) Q Consensus 1 Mg~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~ 66 (435) =-++. ++.+...+.. ..... .......+ .... ...+ .-..++....+|+..+.-+-. T Consensus 29 ~~~i~~~i~~~~~~~~~~~~~~~YY~g~~---~i~~~--~~~~~~~~-~~~~-~~~~--~ki~~n~~k~Ivd~~~~~l~g 99 (474) T protein:vir:97 29 EEMIVRLIDDHRKQLDKITVGQRYYDKDN---DIVKQ--MKKVDVHG-NIDY-DKPD--WRITTNFHQNLVDQKVSYVAS 99 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccc---chhcc--cchhcccc-cccc-ccCc--ceeecchHHHHHHHHHhhhhc Confidence 01111 1111111100 00000 00000000 0000 0000 001234555677777777767 Q ss_pred CceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC- Q lcl|NC_019456. 67 LPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD- 145 (435) Q Consensus 67 ~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~- 145 (435) -|+.+..++... ......++. .........+....+.+|.+|.++..+. .|.+ .+..++|..+.+..+.. T Consensus 100 ~p~~~~~~d~~~--~~~l~~~~~-----n~~~~~~~e~~~~~~~~G~~~~~~~~d~-~~~~-~i~~~~p~~~~~v~d~~~ 170 (474) T protein:vir:97 100 KPVTYSCEDENV--LKVIHDVLD-----TRWDNKLIDILTATSNKGIDWLQVYINE-NGEM-KLFRVPAEQAIPIWVDKE 170 (474) T ss_pred CCceeccCcHHH--HHHHHHHHh-----ccHHHHHHHHHHHHhhcCceEEEEEecC-CCee-EEEEEcccceEEEEcCCC Confidence 777764333221 122333331 2345666777899999999998887764 4654 57778898888877643 Q ss_pred -CceE----EEEEecCCeeEEEchhheEEeccCC---------------------C---------ccccccCcHHHHHHH Q lcl|NC_019456. 146 -NNSY----WYRVTSDIYNFTIPINDVIHVKHVV---------------------P---------SNSWYGVSPIDVLSS 190 (435) Q Consensus 146 -~~~~----~~~~~~~~~~~~~~~~~iih~~~~~---------------------~---------~~~~~G~s~l~~~~~ 190 (435) +... +|..........+..+.+.+++... + .+...|.|-+..+.. T Consensus 171 ~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~ 250 (474) T protein:vir:97 171 REELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKS 250 (474) T ss_pred CCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecCCcCCCCcHHHHHH Confidence 2211 1111111112233344443332100 0 012367888877777 Q ss_pred HHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHH Q lcl|NC_019456. 191 SLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQ 268 (435) Q Consensus 191 ~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~ 268 (435) .+.....+.....+.+... +..++. +....+ .+..... . ...+++.++++.+++.+........+....+ T Consensus 251 liDa~n~~~s~~~~~~~~~~~~~lv~~-g~~~~~--~~~~~~~----~-~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~ 322 (474) T protein:vir:97 251 IIDAIDKRLSDAQNMFDESVELIYILK-GYEGED--LEEFMRG----L-KYYKAINVDGDGGVETIQVEVPVSSTKEYID 322 (474) T ss_pred HHHHHHHHHHHHHHHHHHhcCceeeee-cCCccc--chhhhhh----h-hccceeeccCCCceeEEeecCCHHHHHHHHH Confidence 7776554333333233222 222222 111111 1111111 1 1345666666666665555544455667777 Q ss_pred HHHHHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCccee Q lcl|NC_019456. 269 ISRIRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYF 334 (435) Q Consensus 269 ~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i 334 (435) ...+.|...-++|..-..... ++. +..+. ...|...+...++.+.+.+.. ........+ T Consensus 323 ~l~~~I~~~s~~p~~~~~~~~-~n~-Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~-----~~d~~~i~v 395 (474) T protein:vir:97 323 LMRVYIMEFGQGVDFQTDKFG-SAP-SGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNL-----KTDVKDIEI 395 (474) T ss_pred HHHHHHHHHhCccccCccccc-ccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-----CcccceeeE Confidence 778889998888854322111 111 11221 223333444444443332221 111122234 Q ss_pred eechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccccccccccccccccc Q lcl|NC_019456. 335 SFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAP 414 (435) Q Consensus 335 ~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~ 414 (435) .| +.-...+..+.++ .+++.|+++.--++++++.- +++- ++ ++.+.+..... . ...+ T Consensus 396 ~f--~~~~p~~~~e~a~---~~~~~g~iS~et~l~~l~~v--~D~~-~E---------~eri~~E~~~~---~---~~~~ 452 (474) T protein:vir:97 396 SF--NFNRMMNDAEQSQ---IIAQSQYLSRETLVKSSPLV--DDYK-AE---------LERIEQEQMEY---N---KQLP 452 (474) T ss_pred Ee--ccCcccCHHHHHH---HHHHcCCCCHHHHHHhCCCC--CCHH-HH---------HHHHHHHHHHH---H---hhcc Confidence 44 4444455554444 45567999998888888542 2111 01 11111100000 0 0011 Q ss_pred ccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 415 KQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 415 ~~~~~~~~~~~~~~~~~~~~~ 435 (435) ...+++.+++..+...+++.+ T Consensus 453 ~~~~~~~~~~~~~~~~~~~~~ 473 (474) T protein:vir:97 453 NLDDGGADGAQQQEGSNNKES 473 (474) T ss_pred ccCCCCCCCcccCCCCccccc Confidence 111111111111222222222 No 209 >protein:vir:4073 Length: 279 # NCBI annotation: minor structural protein # Family: family:all:11744 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043552;genbank:gi:9628686;genbank:GeneID:1261159 Probab=97.36 E-value=4.3e-06 Score=50.00 Aligned_cols=273 Identities=11% Similarity=0.097 Sum_probs=126.9 Q ss_pred hhhHHHHHHHHHHHHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcE Q lcl|NC_019456. 48 ESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEP 127 (435) Q Consensus 48 ~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~ 127 (435) +. .......|++++-..|.+..-.....-..++-++.---|...+-..-++.++.+. +.|.- ++-+|-+ |-- T Consensus 1 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~--~~~ 72 (279) T protein:vir:40 1 MS----LFNLSRRAEDVSFSTFTVQDPTTDLLLGKLLGLVSYFDNVDYSEASKLEDLFYWA-LQGKE-VYRVWYG--GFK 72 (279) T ss_pred Cc----ccccchhhcccceeeeeecCcchhHHHHHHHHHHHHhhcccchhhhhhhhhhhhh-hccce-eehhhhh--hHH Confidence 00 0112334444444444442221111112222232223343444333344433332 23332 1112211 100 Q ss_pred EEEEEeCCceeEEEEcCCCceEEEEEecCCeeEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019456. 128 IALWPLDPNTVSILRNTDNNSYWYRVTSDIYNFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEME 207 (435) Q Consensus 128 ~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ 207 (435) .--..+.. |.... ....+.....++|-.|+..|-++. +|.-+- .....++.+. +....++ . T Consensus 73 ~~~~~~~~-------d~fn~---~vr~~~~~~vtVP~~Dv~IieNPl-----v~v~~e-e~~kM~~la~--nai~~KL-D 133 (279) T protein:vir:40 73 YYAQRVNA-------DQFNI---VVREPNRREVTIRTNDYEMLLNPF-----YGANPQ-RFGVMFGMAS--NGIGRRL-D 133 (279) T ss_pred HHHhhcCc-------chhhh---heecCCcceeEeecchhhhhhcch-----heeccc-hhhHHHHHHH--hhhhhhh-c Confidence 00001111 11110 111222334567777777776542 444332 3333333221 1122222 3 Q ss_pred cC--CceEEEeCCcCC-HHHHHHHHHHHHHH---hcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019456. 208 KK--DKFVLQYDRSIS-PEKRQAMVNDFLRM---VKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVP 281 (435) Q Consensus 208 n~--~~~~~~~~~~~~-~e~~~~~~~~~~~~---~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP 281 (435) +. .+++++++.+.. ++..++.+.+++++ .++-+++.+++.|.++++++.+.+.+. .+-..+.+.+.+..+||| T Consensus 134 ~~~qIk~fIKTd~d~glee~kekaR~rIk~mlalAk~~nGityid~~ddItQL~kDYStsl-k~die~lkS~l~Sq~Gin 212 (279) T protein:vir:40 134 SQAQIKIYWKTKVSSGLKEVWDRIRERLTQQQQLAREFNGVSVIGSDDDIKQIQPDYSGSL-QNDANLAIEIALSEYGMP 212 (279) T ss_pred ccceeeeEEecCcchhHHHHHHHHHHHHHHHHHHHHhcCCeeeecCCceeEeecccccccc-HHHHHHHHHHHHhhcCCc Confidence 32 567777776544 34455566666543 344478999999999999998766654 344467788999999999 Q ss_pred HHHhCCcccCcccHHHHHHHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCC Q lcl|NC_019456. 282 ISFLNDDQAKSTTNVEHVTHSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGI 361 (435) Q Consensus 282 ~~~lg~~~~~~~~~~e~~~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~ 361 (435) -.+|- +.++.+++..||..+|.|++++.+..|..+ + +|...-+.. -...|. T Consensus 213 ekIL~-----GsAtE~q~iAyy~rtVePILkQyek~liY~---~---------E~fv~y~tt------------ta~gg~ 263 (279) T protein:vir:40 213 RELLY-----GQSNEVTIIAFAIQKVLPLLKQHDKNIIFN---Q---------ENFVAYIST------------TAKGGA 263 (279) T ss_pred hhhcc-----ccCchhhhhhHHHhhHHHHHHHhcccccch---h---------hhhhhhhee------------cccCcc Confidence 99883 345778999999999999999977655431 1 111110000 001121 Q ss_pred cCHHHHHHHhCCCCCCCcCCc Q lcl|NC_019456. 362 FKPNEIRELEGQAPIPDEAAD 382 (435) Q Consensus 362 ~t~NE~R~~~g~~p~~~~~gd 382 (435) + |-.-...+-+|+.+ | T Consensus 264 ~--~s~~~~~~~~~~~~---~ 279 (279) T protein:vir:40 264 I--ESKSSKRDSEPVGN---D 279 (279) T ss_pred c--ccccccccCCCCCC---C Confidence 1 11111223344422 2 No 210 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=97.22 E-value=0.00013 Score=41.97 Aligned_cols=366 Identities=12% Similarity=0.059 Sum_probs=156.8 Q ss_pred CchH----------HHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCcee Q lcl|NC_019456. 1 MSFM----------SKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLH 70 (435) Q Consensus 1 Mg~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 70 (435) ..|. .++.++..... ++...........+ .-..++....+|+..+.-+-.-|+. T Consensus 7 ~~~i~~~~~~~~r~~~l~~yy~g~~--------------~il~~~~~~~~~~~--~ki~~n~~~~ivd~~~~~l~g~~~~ 70 (429) T protein:vir:98 7 SELIQKHRSFNLSYSAYKQLYEGDH--------------AILQQKQKEQYKPD--NRLVVNFAKYIVDTFNGYFIGVPVQ 70 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccc--------------ccccccccccCCCc--ceeecchHHHHHHHHhhhhcccCce Confidence 1111 11111111110 00000000000011 1123456667788877777667766 Q ss_pred eeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC--ce Q lcl|NC_019456. 71 EYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN--NS 148 (435) Q Consensus 71 ~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~ 148 (435) +.-+.... . ..++.+.. ..........+..+++.+|.+|.++..+. .|.+ .+..++|..+.+..+... .. T Consensus 71 ~~~~~~~~-~-~~l~~~~~----~n~~~~~~~~~~~~~~~~G~~~~~v~~d~-~g~~-~~~~~~p~~~~~v~dd~~~~~~ 142 (429) T protein:vir:98 71 TSHENKQV-S-NYLELLDG----YNDQDDNNAELSKICSIYGHGYELVFNDE-NAEA-GITYLTPLEAFIVYDDSIRQKP 142 (429) T ss_pred eecCChHH-H-HHHHHHHh----hcCHhHHHHHHHHHHhhcCeEEEEEEecC-CCcE-EEEEEcccceEEEEeCCCCCce Confidence 54332221 1 22222322 22345667888999999999999888764 4765 577788888877665432 11 Q ss_pred EE---EEEecCCe-eEEEchhheEEe-c-----------cCCC---------ccccccCcHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 149 YW---YRVTSDIY-NFTIPINDVIHV-K-----------HVVP---------SNSWYGVSPIDVLSSSLKFQRSVENFSQ 203 (435) Q Consensus 149 ~~---~~~~~~~~-~~~~~~~~iih~-~-----------~~~~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~ 203 (435) .+ +....++. ...+...+.++. . .+++ .+...|.|-+..+...+.....+..... T Consensus 143 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liD~~d~~~s~~~ 222 (429) T protein:vir:98 143 LFAVRYFYNKGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMIEYVENEERQSLLASVVTLINAFNKAISEKA 222 (429) T ss_pred EEEEEEEEecCceEEEEEEeCceEEEEEecCCceEecccccccCCccceEEecCCCCCCCcHHHHHHHHHHHHHHHHHHH Confidence 11 11111111 111111111111 0 0000 1234677878777777766554333332 Q ss_pred HHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccC----CceeeeccCChhhHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 204 NEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEA----GWKVDRYESKFEPADLSSVEQISRIRIATA 277 (435) Q Consensus 204 ~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~----g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~ 277 (435) +.+... +..++ .+...+++..+.++ . .+++.++. +.++..+........+....+...+.|+.. T Consensus 223 ~~~~~~~~p~~~i-~g~~~~~~~~~~~~--------~-~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 292 (429) T protein:vir:98 223 NDVEYFADAYLKI-LGAELDDETLKSLR--------D-TRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRT 292 (429) T ss_pred HHHHHhcCceeee-ecCCCCcchhhhHh--------h-CceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHH Confidence 222222 22232 23334444333221 1 12333321 223444443333344566677888899999 Q ss_pred hCCCHHHhCCcccCcccHHHHHH--------------HHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhhc Q lcl|NC_019456. 278 FNVPISFLNDDQAKSTTNVEHVT--------------HSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLR 343 (435) Q Consensus 278 fgvP~~~lg~~~~~~~~~~e~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~ 343 (435) .++|..-.+.. ++ .+.++.. ..|...+...++.+..-+...- ..... ..+.+.+..... T Consensus 293 s~~p~~~~~~~--gn-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~--~~~d~--~~i~v~f~~~~p 365 (429) T protein:vir:98 293 AMVANISDESF--GT-ASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKI--GPKDW--IGIKYKFTRNLP 365 (429) T ss_pred hCccccCcccc--cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC--Ccccc--ccceEEeCCCCC Confidence 99985433221 22 1222221 1222233333333333222111 01111 124455566777 Q ss_pred cCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCC Q lcl|NC_019456. 344 GDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTN 423 (435) Q Consensus 344 ~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 423 (435) .|..+.++.+.++ .|+++..-+.++++.-+ ++-. -++.+...... ..+.+.++-+.+ T Consensus 366 ~~~~~~a~~~~kl--~g~is~et~~~~l~~v~--d~~~----------E~~ri~~E~~~---------~~~~~~~~~~~~ 422 (429) T protein:vir:98 366 ANLLEESQIAGNL--AGIVSEETQVGVLSIVE--NPQK----------EIERKNSDKST---------LISRQAGGLNGQ 422 (429) T ss_pred cCHHHHHHHHHHH--hccCchHHHHHhCCCCC--CHHH----------HHHHHHHHHHH---------HHHHHHhhhcCC Confidence 8889999999888 58898877888886532 2111 11111110000 000001111111 Q ss_pred CCCCCCC Q lcl|NC_019456. 424 ENGLQST 430 (435) Q Consensus 424 ~~~~~~~ 430 (435) ++.+... T Consensus 423 ~~~~~~~ 429 (429) T protein:vir:98 423 NTTTILE 429 (429) T ss_pred CCCCCCC Confidence 1000000 No 211 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=97.17 E-value=0.00014 Score=41.66 Aligned_cols=416 Identities=13% Similarity=0.131 Sum_probs=170.4 Q ss_pred CchHHHHHhhc-----cccccccccccccchhhhhhccc----cccCcc---c-------ccHHHHhhhHHHHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFF-----GVHDQANQIVQNPIPQPLDMAGV----KLEQAT---F-------SREHILESNEYIFSIVTRLS 61 (435) Q Consensus 1 Mg~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~---~-------~~~~~~~~~~~v~~~i~~ia 61 (435) |+ .|+.+. +....+..+..+ ........+. ...... . -..+....+|.|..||+.|. T Consensus 1 m~---~lfgf~i~~~~~~~~~S~vpp~~-~~~~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~Av~eIV 76 (564) T protein:vir:10 1 MS---QLFGFLINEKEGQKGQSPVPPND-EASVSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEVDSAIDEIV 76 (564) T ss_pred Cc---chhcceeeeeccCCCCCcccCCc-CCChhhhhccccceeeecccccchhhHHHHHHHHHHHhhccchhhHHHHhh Confidence 43 222211 111111111111 1111111111 000000 0 11234567899999999999 Q ss_pred HHHhhCc-----eeeeeccccc-------ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC---CCc Q lcl|NC_019456. 62 NVLASLP-----LHEYQNYKQM-------DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS---TGE 126 (435) Q Consensus 62 ~~ia~~~-----~~~~~~~~~~-------~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~---~g~ 126 (435) +.+.-+. +.+.-++.+. ..+.+.++++ =-+-..+++ .++..|.+.|..|..++.+.. .| T Consensus 77 neaIv~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~-ll~F~~~~~----e~fR~WYVDgRi~fHkiid~~~pk~G- 150 (564) T protein:vir:10 77 NEFVVNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILR-MMNFNVNAH----EIIRNWYVDGRSHYHKVIDLDNPKKG- 150 (564) T ss_pred cceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhhhhcceEEEEEEeeCCChhhh- Confidence 8865443 2222121111 1122333332 222334444 455566777888888776533 24 Q ss_pred EEEEEEeCCceeEEEE----cC--CCc---------------eEEEEEecCC-----------------eeEEEchhheE Q lcl|NC_019456. 127 PIALWPLDPNTVSILR----NT--DNN---------------SYWYRVTSDI-----------------YNFTIPINDVI 168 (435) Q Consensus 127 ~~~l~~l~~~~v~~~~----~~--~~~---------------~~~~~~~~~~-----------------~~~~~~~~~ii 168 (435) +.+|..|+|..++.++ +. .+. .-||.+.+.+ ....++.+.|. T Consensus 151 I~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~daI~ 230 (564) T protein:vir:10 151 ILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGSMDWSNQEGIKIASDAIA 230 (564) T ss_pred hhhhhhhcccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCcccccccccccccccceeechhhcc Confidence 8899999998766543 11 111 1133333221 23578888888 Q ss_pred EeccC-CCccccccCcHHHHHHHHHHHHHHHHHHHHHH-hhcCC--ce-EEEeCCcCCHHHHHHHHHHHHHHhcC----- Q lcl|NC_019456. 169 HVKHV-VPSNSWYGVSPIDVLSSSLKFQRSVENFSQNE-MEKKD--KF-VLQYDRSISPEKRQAMVNDFLRMVKE----- 238 (435) Q Consensus 169 h~~~~-~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~-~~n~~--~~-~~~~~~~~~~e~~~~~~~~~~~~~~~----- 238 (435) |...- ...+.-.-+|-|..+.+.+.....++...-.+ +...| +. .+-+ +.+....+++....+...++| T Consensus 231 y~hSGL~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDV-GnLPk~KAeqYlr~iM~k~KNklVYD 309 (564) T protein:vir:10 231 QSTSGLMDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDV-GNLPKVKAEQYLRDVMSRYRNKLVYD 309 (564) T ss_pred eecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEec-CCCCchhHHHHHHHHHHhcCceEEEe Confidence 87642 22344445778888888888777776655433 23333 22 2223 233333333333333222222 Q ss_pred --------CCc-cccc----------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccc-C--cccHH Q lcl|NC_019456. 239 --------NGG-AVVQ----------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQA-K--STTNV 296 (435) Q Consensus 239 --------~~~-~~vl----------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~-~--~~~~~ 296 (435) ..+ ...| ..|.+++.|.-...-.++.+. .+....+.++++||.+-|..... - +.+++ T Consensus 310 a~TGevrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV-~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~E 388 (564) T protein:vir:10 310 GQTGEIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKDV-EYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTE 388 (564) T ss_pred ccCceecccchhhhhHhhhcccccCCCcccceeeccccCCcchHHHH-HHHHHHHHHHhCCCcccccCCCceeecccccc Confidence 111 2222 246777777655443444444 45577799999999999975432 1 12221 Q ss_pred HHH-HHHHHHHHhHHHHH----HHHHHHHhhcc-----cccccC-cceeeechhhhhccC-------HHHHHHHHHHHH- Q lcl|NC_019456. 297 EHV-THSWTMTLMPIIRQ----YESQFNMKLFT-----PGKRVK-GFYFSFNVNGLLRGD-------TAARTQYYQTLT- 357 (435) Q Consensus 297 e~~-~~~~~~~i~P~~~~----i~~~l~~~l~~-----~~~~~~-g~~i~fd~~~l~~~d-------~~~~~~~~~~~~- 357 (435) =.. ..-|...|..+-.. +.+.|...|+. +.+|.. ...|.|++..--... +..|++++..+- T Consensus 389 ItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dp 468 (564) T protein:vir:10 389 ILRDELKFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDP 468 (564) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhh Confidence 111 11233333333333 33334444332 233321 123444333111111 122333333220 Q ss_pred -hcCCcCHHHHHHH-h--------------------C--CCCCCCcCCceeeecc-cccchhcccccccccccccccccc Q lcl|NC_019456. 358 -RNGIFKPNEIREL-E--------------------G--QAPIPDEAADHLYISK-DLYPLDKYYDAILDNKIQTDASVA 412 (435) Q Consensus 358 -~~g~~t~NE~R~~-~--------------------g--~~p~~~~~gd~~~~~~-n~~~l~~~~~~~~~~~~~~~~~~~ 412 (435) -+-.++.+=+|+. | | .+|...+.||..-+.. .+.|.+.................. T Consensus 469 yvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~ 548 (564) T protein:vir:10 469 FVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQNQAFAPELQAAQDDLAAEREIKKLNS 548 (564) T ss_pred hhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCCCcCCcchhhhccccccccChhhhcc Confidence 0112233322221 1 1 2343344454332222 223333221111111111111111 Q ss_pred ccccCCCCCCCCCCCC Q lcl|NC_019456. 413 APKQEGGENTNENGLQ 428 (435) Q Consensus 413 ~~~~~~~~~~~~~~~~ 428 (435) ++..+.+.++.+.... T Consensus 549 a~~~~~~~~~~~~~~~ 564 (564) T protein:vir:10 549 APKPPPSQQSKSQSNK 564 (564) T ss_pred CCCCCCCCCCcCcCCC Confidence 1111111111111111 No 212 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=97.13 E-value=0.00016 Score=41.41 Aligned_cols=378 Identities=12% Similarity=0.074 Sum_probs=158.8 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeeccccccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMDN 80 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 80 (435) +.-+.++.+...+... ........ . ...... ....+ .-..++....+|+..+.-+-.-|+.+..++... . T Consensus 42 ~~r~~~~~~Yy~g~~~---i~~~~~~~--~-~~~~~~-~~~~~--~ki~~n~~k~ivd~~~~yl~g~p~~~~~~~~~~-~ 111 (478) T protein:vir:10 42 IDNITMGERYYNHHPD---ILDAPFKR--D-VNGDYD-ETKPD--WRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKA-L 111 (478) T ss_pred HHHHHHHHHHhccccc---ccccchhh--h-cccccc-ccccc--ceeccchHHHHHHHHhhhhcccCceeecCChHH-H Confidence 1111222222211100 00000000 0 000000 00000 011234556677777777767777764433222 1 Q ss_pred chHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC--CceE----EEEEe Q lcl|NC_019456. 81 EPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD--NNSY----WYRVT 154 (435) Q Consensus 81 ~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~~~----~~~~~ 154 (435) ..+...+. .........+...+..+|.+|.++..+. .|.+ .+..++|..+.+..+.. +... +|... T Consensus 112 ~~l~~~~~------n~~~~~~~~~~~~~~~~G~~~~~v~~d~-~~~~-~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~ 183 (478) T protein:vir:10 112 KQIQHTLN------HKWDDKLVDILTAASNKGIEWVQPYVDE-EGEF-KTFRVPAEQAVPIWTNKERDELQAFIRVYELD 183 (478) T ss_pred HHHHHHHh------ccHHHHHHHHHHHHhhCCeEEEEEEecC-CCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeee Confidence 22333222 2456667778899999999998877664 4654 57788998887776532 2221 12222 Q ss_pred cCCeeEEEchhheEEeccCC-------------------------Cc---------cccccCcHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 155 SDIYNFTIPINDVIHVKHVV-------------------------PS---------NSWYGVSPIDVLSSSLKFQRSVEN 200 (435) Q Consensus 155 ~~~~~~~~~~~~iih~~~~~-------------------------~~---------~~~~G~s~l~~~~~~i~~~~~~~~ 200 (435) .......+.++.|.+++... +. +...|.|.+..+...+.....+.. T Consensus 184 ~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~S 263 (478) T protein:vir:10 184 GAERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLS 263 (478) T ss_pred CceEEEEEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcceEEEeccCCCCCCcHHHHHHHHHHHHHHHH Confidence 22222334555555443210 00 123477877777777766554322 Q ss_pred HHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccc--cCCceeeeccCChhhHHHHHHHHHHHHHHHH Q lcl|NC_019456. 201 FSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQ--EAGWKVDRYESKFEPADLSSVEQISRIRIAT 276 (435) Q Consensus 201 ~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl--~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~ 276 (435) ...+.+... +..++. +....+ .......+ +.. +++.+ +.|.++..+........+.+..+...+.|.. T Consensus 264 ~~~~~~~~~~~~~~~~~-g~~~~~--~~~~~~~~----~~~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~ 335 (478) T protein:vir:10 264 DTQNTFDESVELIYILK-GYEGED--MKDFMHNL----KYY-KAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIE 335 (478) T ss_pred HHHHHHHHhhCcceeee-cCCccc--ccchhhhh----hhC-ceeEecCCCCCcceEEeecCCHHHHHHHHHHHHHHHHH Confidence 222222221 222222 111111 11111111 122 23222 2333444444444445567777888889999 Q ss_pred HhCCCHHHhCCcccCcccHHHHHH--------------HHHHHHHhHHHHHHHHHHHHhhcccccccCcceeeechhhhh Q lcl|NC_019456. 277 AFNVPISFLNDDQAKSTTNVEHVT--------------HSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLL 342 (435) Q Consensus 277 ~fgvP~~~lg~~~~~~~~~~e~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~ 342 (435) ..++|..-..... ++. +..+.. ..|...+...++.+... +........ +.+.+..-. T Consensus 336 ~s~~p~~~~~~~~-~n~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~-----~~~~~d~~~--i~i~f~~~~ 406 (478) T protein:vir:10 336 FGQGVDFQQDKFG-NSP-SGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDF-----YRLDVRVQD--IEITFNFNV 406 (478) T ss_pred HhCCcCcCccccc-cch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hCCCccccc--ceEEeCCCC Confidence 9999864332211 111 122211 12222233322222222 221222222 344445555 Q ss_pred ccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCC Q lcl|NC_019456. 343 RGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENT 422 (435) Q Consensus 343 ~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 422 (435) ..|..+.++.+.++ .|+++.--+.++++.- +++- .-++.+.+....... ..+...+++ T Consensus 407 p~~~~e~~~~~~~~--~g~iS~et~i~~~~~v--~d~~----------~E~~ri~~E~~~~~~------~~~~~~~~~-- 464 (478) T protein:vir:10 407 MVNELENSQIAMNS--TGLLSKETILGNHSWV--QDPV----------AEMERIEQENIELNQ------QLPDIEEGL-- 464 (478) T ss_pred CCCHHHHHHHHHHH--hCCCChHHHHHhCCCC--CCHH----------HHHHHHHHHHHHHHH------hccccCCCC-- Confidence 67888888888877 6888887787777542 1110 011111111000000 001111111 Q ss_pred CCCCCCCCCCCCC Q lcl|NC_019456. 423 NENGLQSTEPEGS 435 (435) Q Consensus 423 ~~~~~~~~~~~~~ 435 (435) ++..+++++++++ T Consensus 465 ~d~~~~~~~d~~~ 477 (478) T protein:vir:10 465 NDEQQRQSEDNQS 477 (478) T ss_pred cccccccCcCCCC Confidence 1111122222222 No 213 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=97.12 E-value=0.00016 Score=41.34 Aligned_cols=377 Identities=10% Similarity=0.051 Sum_probs=158.2 Q ss_pred CchHHHHH--------------hhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhh Q lcl|NC_019456. 1 MSFMSKVR--------------QFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLAS 66 (435) Q Consensus 1 Mg~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~ 66 (435) =.++.++. +...+.. ...... .... ...... ....+ .-..++....+++..+.-+-. T Consensus 28 ~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~---~i~~~~--~~~~-~~~~~~-~~~~~--~ki~~n~~~~Ivd~~~~~l~g 98 (474) T protein:vir:96 28 EEMIIRLINDHKPKIDDITVGERYYNHDP---DVLRLA--PKLD-NKGEID-PLKPD--WRMFTNYHQNLVDQKVAYAVA 98 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCC---cchhcc--chhc-cccccc-ccccc--hhcccchHHHHHHhhhhhhcc Confidence 01111111 1111100 000000 0000 000000 00000 011234555667777666666 Q ss_pred CceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC- Q lcl|NC_019456. 67 LPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD- 145 (435) Q Consensus 67 ~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~- 145 (435) -|+.+.-++... ...+...+. | +.......+...+..+|.+|.++..+. .|++ .+..++|..+.+..+.. T Consensus 99 ~p~~~~~~d~~~-~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~~y~d~-~~~~-~i~~~~p~~~~~v~d~~~ 169 (474) T protein:vir:96 99 NPVTFSSDDDKS-LKTIQEVLN---H---KWDDKLVDILTAASNKGIEWLQPYIDE-NGEF-KTFRVPAEQAIPIWTNKE 169 (474) T ss_pred cCceeecCchHH-HHHHHHHHh---c---CHHHHHHHHHHHHHhcCeeEEEEEecC-CCce-EEEEEcccceEEEEcCCC Confidence 676654333222 233333332 1 344556667788999999998877664 4665 48889999888887642 Q ss_pred -CceE----EEEEecCCeeEEEchhheEEeccC-------------------------CCc---------cccccCcHHH Q lcl|NC_019456. 146 -NNSY----WYRVTSDIYNFTIPINDVIHVKHV-------------------------VPS---------NSWYGVSPID 186 (435) Q Consensus 146 -~~~~----~~~~~~~~~~~~~~~~~iih~~~~-------------------------~~~---------~~~~G~s~l~ 186 (435) +... +|..........+..+.|.|+... ++. +...|.|-+. T Consensus 170 ~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e 249 (474) T protein:vir:96 170 RDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFIPFKNNPQEMSDLF 249 (474) T ss_pred CCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeeccccccccccccccccccccCCCceeEEEeccCCCCCCcHH Confidence 2211 222222222223444444444221 110 1234777777 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCCCcccccc-CCceeeeccCChhhHHHHH Q lcl|NC_019456. 187 VLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQE-AGWKVDRYESKFEPADLSS 265 (435) Q Consensus 187 ~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~-~g~~~~~~~~~~~~~~~~e 265 (435) .+...+.....+.....+.+......++.+.+... +..+..... .+ ..+++.++ .|.+++.+........+.. T Consensus 250 ~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~-~~~~~~~~~----~~-~~~~i~~~~~~~~~~~l~~~~~~~~~~~ 323 (474) T protein:vir:96 250 MYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEG-QDLDEFMRN----LK-YYKAINVDGDGSGVDTIQIEVPVQSSKE 323 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCc-ccccchhhh----hh-cCceEEecCCCCceeEEeecCChHHHHH Confidence 77777766554433333333333222222222111 111111111 11 23455554 4556655555444445677 Q ss_pred HHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH--------------HHHHHHHhHHHHHHHHHHHHhhcccccccCc Q lcl|NC_019456. 266 VEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT--------------HSWTMTLMPIIRQYESQFNMKLFTPGKRVKG 331 (435) Q Consensus 266 ~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g 331 (435) ..+...+.|+...++|..-..... ++ .+..+.. ..|...+...++.+..-+ ....... T Consensus 324 ~~~~l~~~i~~~s~~p~~~~~~~~-~n-~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~-----~~~~~~~- 395 (474) T protein:vir:96 324 YLDMLRDYVIEFGQGVDFQQDKFG-NS-PSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFY-----KLNIKVQ- 395 (474) T ss_pred HHHHHHHHHHHHhCCccccccccc-cc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCcccc- Confidence 778888999999999875432211 11 1222221 122223333332222221 1111112 Q ss_pred ceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccc Q lcl|NC_019456. 332 FYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASV 411 (435) Q Consensus 332 ~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~ 411 (435) .+.+.++.-...|..+.++. +.+.|+++...++++++.- +++- .-++.+.+....... T Consensus 396 -~i~i~f~~~~p~~~~e~~~~---~~~ag~iS~et~~~~~~~v--~d~~----------~E~~ri~~E~~e~~~------ 453 (474) T protein:vir:96 396 -DVEITFNFNVMVNELEQSQI---GVQSQYLSKETVVTNHPWV--DDPV----------AELERIEQDNIDFNK------ 453 (474) T ss_pred -eeeEEeccCCCcCHHHHHHH---HHhcCCCchHHHHHhCCCC--CCHH----------HHHHHHHHHHHHHHh------ Confidence 23333444455666555554 5668999999999887532 2211 011111111110000 Q ss_pred cccccCCCCCCC-CCCCCCCC Q lcl|NC_019456. 412 AAPKQEGGENTN-ENGLQSTE 431 (435) Q Consensus 412 ~~~~~~~~~~~~-~~~~~~~~ 431 (435) ..++.+++.++. ++.++.++ T Consensus 454 ~~~~~~~~~~~~~~d~~~e~~ 474 (474) T protein:vir:96 454 QLPPLEGDANGRAQDNESETN 474 (474) T ss_pred cccccccccccccCCCcccCC Confidence 011111111111 11111111 No 214 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=97.04 E-value=0.0002 Score=40.88 Aligned_cols=387 Identities=9% Similarity=0.052 Sum_probs=157.0 Q ss_pred CchHH---HHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccc Q lcl|NC_019456. 1 MSFMS---KVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQ 77 (435) Q Consensus 1 Mg~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 77 (435) +.... ++.+...+.... +..-............-...+.....|+..+.-+-.-|+++.-.... T Consensus 36 ~~~~~r~~~l~~YY~g~~~~-------------i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~~~~~d~~ 102 (506) T protein:vir:94 36 NYQRPRLEMLDDYYQGYNLK-------------ILDKQSRRHEDGKADHRATHSFAKYIADFQTSYSVGNPINVKLPDDG 102 (506) T ss_pred HHHHHHHHHHHHHhcCCCcc-------------ccccccccccccCCcceeecchHHHHHHHhhhhhcccCceeecCcch Confidence 11111 111111111000 00000000000000011233455666777766665666665433222 Q ss_pred cccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC--ceE----EE Q lcl|NC_019456. 78 MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN--NSY----WY 151 (435) Q Consensus 78 ~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~--~~~----~~ 151 (435) . ...+-. +. ...........+...++.+|.||.++..+. .|.+ .+..++|..+.+..+... ... +| T Consensus 103 ~-~~~l~~-~~----~~N~~~~~~~~~~~~~~~~G~a~~~v~~de-d~~~-~i~~~~p~~~~~v~dd~~~~~~~~~v~~~ 174 (506) T protein:vir:94 103 S-NSGFDT-FN----KANDVDAENYDLFLDMSRYGRAYEYVYRGE-DNEE-HLAKLDPLDTFVIYSTDVDPKPIMAVRYH 174 (506) T ss_pred H-HHHHHH-HH----hccCHhHHHHHHHHHHHhcCeEEEEEEecC-CCee-EEEEEcccceEEEecCCCCCceEEEEEEE Confidence 2 222222 22 223455667788899999999999888764 4654 577789988888776432 111 11 Q ss_pred EEe-cCC-e-------eEEEchhheEEeccC-----------CC---------ccccccCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_019456. 152 RVT-SDI-Y-------NFTIPINDVIHVKHV-----------VP---------SNSWYGVSPIDVLSSSLKFQRSVENFS 202 (435) Q Consensus 152 ~~~-~~~-~-------~~~~~~~~iih~~~~-----------~~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~ 202 (435) ... .++ . ...+....+.++... ++ .+.-.|.|.+......+.....+.... T Consensus 175 ~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~~~~liDa~d~~~S~~ 254 (506) T protein:vir:94 175 QIELVDDNQVSTINYVPETWTADTYTLYNPTPIMGKMQVDTTKPITTFPVVEFKNSNFRLGDFENVLPLIDLYDAAQSDT 254 (506) T ss_pred eeeeccCCceeEEEEEEEEEeCceEEEeccccCccceeccccccCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHH Confidence 111 111 0 011233333222110 00 011235565655555554433221111 Q ss_pred HH---HhhcCCceEEEe----------------C-----CcCCHHHHHHHHHHHHH----HhcCCCccccccCCceeeec Q lcl|NC_019456. 203 QN---EMEKKDKFVLQY----------------D-----RSISPEKRQAMVNDFLR----MVKENGGAVVQEAGWKVDRY 254 (435) Q Consensus 203 ~~---~~~n~~~~~~~~----------------~-----~~~~~e~~~~~~~~~~~----~~~~~~~~~vl~~g~~~~~~ 254 (435) .+ ++.+. ..+++. . .....+........+.. .....+.+...+.+.+++-+ T Consensus 255 ~~~~~~~~~~-~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l 333 (506) T protein:vir:94 255 ANYMTDLNEA-MLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYI 333 (506) T ss_pred HHHHHHhhhH-HHHHhcCccccccchhccccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccceee Confidence 11 11111 000000 0 00001111112222211 01112223333344555555 Q ss_pred cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHH--------------HHHHHHHHHhHHHHHHHHHHHH Q lcl|NC_019456. 255 ESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEH--------------VTHSWTMTLMPIIRQYESQFNM 320 (435) Q Consensus 255 ~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~--------------~~~~~~~~i~P~~~~i~~~l~~ 320 (435) ........+....+.....|...-++|..-..... ++.+ ..+ ....|...+...++.+...+.. T Consensus 334 ~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~S-g~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~ 411 (506) T protein:vir:94 334 NKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFA-SNSS-GVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENS 411 (506) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCcccccccccc-ccch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 55555566777788888999999999874332211 1111 111 1223344444444444443332 Q ss_pred hhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccc Q lcl|NC_019456. 321 KLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAI 400 (435) Q Consensus 321 ~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~ 400 (435) . ..........+.+.+..-...|..+.++.+.++ .|+++...+++++++-. ++-. -++.+.+.. T Consensus 412 ~--~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~lp~v~--d~~~----------E~~ri~~E~ 475 (506) T protein:vir:94 412 I--HGDWTFDPQELTFTFRDNLPADNISQIKALVQA--GATLPQKYLYQQLPGVT--NPQD----------IVDMMKEQS 475 (506) T ss_pred c--CCccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHHH----------HHHHHHHHH Confidence 1 110001112344555666678889999999988 58999999999875432 1110 111111111 Q ss_pred ccccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 401 LDNKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 401 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) ....... ...+..+++...+..+.++. T Consensus 476 ~~~~~~~--------~~~~~~~~~~~~~~~~~~~~ 502 (506) T protein:vir:94 476 ANGDYSF--------DQNGVISNDGQTNTTATQTD 502 (506) T ss_pred HHHhhcc--------hhhcCCCcccCccccccccc Confidence 0000000 00000111101111111111 No 215 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=96.74 E-value=0.00037 Score=39.40 Aligned_cols=388 Identities=12% Similarity=0.073 Sum_probs=163.3 Q ss_pred CchH---------------HHHHhhccccccccccccccchhhhh----hccccccCcccccHHHHhhhHHHHHHHHHHH Q lcl|NC_019456. 1 MSFM---------------SKVRQFFGVHDQANQIVQNPIPQPLD----MAGVKLEQATFSREHILESNEYIFSIVTRLS 61 (435) Q Consensus 1 Mg~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia 61 (435) |.+. +.|..+........ ..-.....++. +...........+.. ..++.....|+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~-~r~~~~~~yy~g~~~i~~~~~~~~~~~~~k--i~~n~~~~ivd~~~ 77 (453) T protein:vir:73 1 MNLKPIKLMTYSRDEEITDKVVNDFMKKHQEEV-ERYEYLGNMYKGIMEISSQKAKDSWKPDNR--LTNNFAKYIVDTFV 77 (453) T ss_pred CccccceeeeccccccCCHHHHHHHHHHHHHHH-HHHHHHHHHhccccchhcCCCCCccCccce--eecchHHHHHHHhh Confidence 1110 00000000000000 00000000000 000000000000001 12344555566666 Q ss_pred HHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEE Q lcl|NC_019456. 62 NVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSIL 141 (435) Q Consensus 62 ~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~ 141 (435) .-+-.-|+.+..++.. ..+.+...+ ...........+..+.+.+|.+|.++..+. .|.+ .+..++|..+.+. T Consensus 78 ~~l~g~~~~~~~~d~~-~~~~l~~~~-----~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~-~~~~-~i~~~~p~~~~~v 149 (453) T protein:vir:73 78 GYFNGIPIKKTHDDKS-VLEAMQLFD-----NLNDMEDEESELAKIACVYGRAYELMYQNE-STES-EVIYCSPLNVFMV 149 (453) T ss_pred hhhcccCceeecCChH-HHHHHHHHH-----HhcChhHHHHHHHHHHHhcCeEEEEEEeCC-CCce-EEEEEcccceEEE Confidence 5554556554332221 112222222 223455667788999999999999888764 4665 4677888888776 Q ss_pred EcCCC-ceE----EEEEecCCe--eEEEchhheEEeccCC-----------C---------ccccccCcHHHHHHHHHHH Q lcl|NC_019456. 142 RNTDN-NSY----WYRVTSDIY--NFTIPINDVIHVKHVV-----------P---------SNSWYGVSPIDVLSSSLKF 194 (435) Q Consensus 142 ~~~~~-~~~----~~~~~~~~~--~~~~~~~~iih~~~~~-----------~---------~~~~~G~s~l~~~~~~i~~ 194 (435) .+... ... ++....++. ...+..+.++++.... + .+...|.|-+..+...+.. T Consensus 150 ~dd~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~s~~~~v~~liDa 229 (453) T protein:vir:73 150 YDDSIKQKPLFAVYYGFDEEGNLSGTVYTLLETISITGKAGEVKFGESTYNVYSDLPIVEYNFNEERQSIFEPVHSLINS 229 (453) T ss_pred EeCCCCceeEEEEEEEEecCceEEEEEEeCCeEEEEEecCCceEEccceeccCCceeEEEecCCCCCCcchhhHHHHHHH Confidence 65532 211 122222222 1234555555543211 1 0123577777776666665 Q ss_pred HHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHH--HHhcCCCccccccCCceeeeccCChhhHHHHHHHHHH Q lcl|NC_019456. 195 QRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFL--RMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQIS 270 (435) Q Consensus 195 ~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~--~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~ 270 (435) ...+.....+..... +..++ .+..+.++..+.++..-. ......+.....+.+.++..+........+....+.. T Consensus 230 ~~~~~S~~~~~~~~~~~~~l~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l 308 (453) T protein:vir:73 230 YNKVTSEKANDVEYFSDQYLVF-LGAEVDEEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRL 308 (453) T ss_pred HHHHHHHHHHHHHHhccceeee-ecCCCCchhhhcccccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHH Confidence 444322222222222 22222 333444555444433211 1111222333444555555555544455567777788 Q ss_pred HHHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhcccccccCcceeee Q lcl|NC_019456. 271 RIRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTPGKRVKGFYFSF 336 (435) Q Consensus 271 ~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~~i~f 336 (435) .+.|+...++|..-... .++. +.++. ...|...+...++.+..-+... -.. .... .+++ T Consensus 309 ~~~I~~~s~~p~~~~~~--~gn~-Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~-~~~-~~~~--~i~v 381 (453) T protein:vir:73 309 ERSIFQFTMAANISDEN--FGNS-SGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNA-SNK-DAWK--DIEY 381 (453) T ss_pred HHHHHHHhCCcccCccc--ccCc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-CCc-cccc--cceE Confidence 88899988888543222 1221 21221 1223333444433333222221 111 1112 3444 Q ss_pred chhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccc-cccccccccccc Q lcl|NC_019456. 337 NVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDN-KIQTDASVAAPK 415 (435) Q Consensus 337 d~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~-~~~~~~~~~~~~ 415 (435) .+..-...|..+.++.+.+++ |+++.--+.+++++-. ++- ++ ++.+.+..... ..........++ T Consensus 382 ~f~~~~p~~~~~~a~~~~k~~--giis~et~~~~~~~~~--d~~-~E---------~~ri~~E~~~~~~~~~~~~~~~~~ 447 (453) T protein:vir:73 382 TFTRNEPKDIKEQAETANILK--GITSEETALSVISVIP--DVQ-AE---------MEKIKKKKLLQLSLTRTSNLVRMK 447 (453) T ss_pred EeCCCCCCCHHHHHHHHHHHh--ccCcHHHHHHhCCCCC--CHH-HH---------HHHHHHHHHHHHHHHHhccCCcch Confidence 445666788999999999886 7888877877776532 111 11 11111110000 000000111111 Q ss_pred cCCCCC Q lcl|NC_019456. 416 QEGGEN 421 (435) Q Consensus 416 ~~~~~~ 421 (435) ++-|+= T Consensus 448 ~~~~~~ 453 (453) T protein:vir:73 448 QMRGNL 453 (453) T ss_pred hhhcCC Confidence 111111 No 216 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=96.42 E-value=0.00064 Score=38.09 Aligned_cols=375 Identities=11% Similarity=0.056 Sum_probs=158.9 Q ss_pred CchH---HHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccc Q lcl|NC_019456. 1 MSFM---SKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQ 77 (435) Q Consensus 1 Mg~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 77 (435) +..+ .++.+...+.- ........................+. -..++.....|+..+.-+-.-|+.+.-+... T Consensus 18 ~~~~~~~~~~~~Yy~g~~---~I~~~~~~~~~~~~~~~~~~~~~~~~--ki~~n~~k~Iv~~~~~yl~G~p~~~~~~d~~ 92 (470) T protein:vir:10 18 NDLINNYKQAVNYYENKT---DITTRNNGKAKLNKEGKKDPLRSADN--RIPSNFYQLLVDQEAGYVASVFPDIDVGKDA 92 (470) T ss_pred HHHHHHHHHHHHHhcccc---chhccccchhcccccccccccccCCc--ccccchHHHHHHhhhhheeccceeeecCchH Confidence 1111 11122111110 00000000000000000000000000 1123334445566665555667665433322 Q ss_pred cccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC--CceE----EE Q lcl|NC_019456. 78 MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD--NNSY----WY 151 (435) Q Consensus 78 ~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~~~----~~ 151 (435) . ...+...+. .+..+-...+..++..+|.+|.++..+.. |.+ .+..++|..+.+..+.. +... +| T Consensus 93 ~-~~~l~~~~~------~~~~~~~~~l~~~~~~~G~a~~~~y~d~~-~~~-~~~~~~p~~~~~v~d~~~~~~~~a~ir~y 163 (470) T protein:vir:10 93 D-NKKIIDVLG------DDRALTLNGLLVDSSNAGRAWLHYWIDED-GNF-RYGIIQPDQITPIYATTLDNKLLGILRSY 163 (470) T ss_pred H-HHHHHHHHh------hhHHHHHHHHHHHHhhcCeeEEEEEecCC-Cce-EEEEEcccceEEEEcCCCCCceEEEEEEE Confidence 2 234444443 12444556778899999999999887644 665 57788998888877653 2111 22 Q ss_pred EEe-cCCe-----eEEEchhheEEeccCC-------------------------------C---------ccccccCcHH Q lcl|NC_019456. 152 RVT-SDIY-----NFTIPINDVIHVKHVV-------------------------------P---------SNSWYGVSPI 185 (435) Q Consensus 152 ~~~-~~~~-----~~~~~~~~iih~~~~~-------------------------------~---------~~~~~G~s~l 185 (435) ... ..+. ...+....+.|++... + .+...|.|-+ T Consensus 164 ~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~ 243 (470) T protein:vir:10 164 KQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSKNKYRLPEL 243 (470) T ss_pred EeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCCeeeEEEeecCCCCCCch Confidence 211 1111 1223444554443110 0 0123577778 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccc-------cCCceeeeccC Q lcl|NC_019456. 186 DVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQ-------EAGWKVDRYES 256 (435) Q Consensus 186 ~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl-------~~g~~~~~~~~ 256 (435) ..+...+.....+.....+.+... +..++..-...+.++ ....+ +..+ ++.+ ++++++..... T Consensus 244 e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~---~~~~~----~~~~-~i~~~~~~~~~~~~~~~lt~~~ 315 (470) T protein:vir:10 244 NKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQ---FMNDL----RKYK-SIKINNTGNGDNSGVDKLQIDI 315 (470) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccch---hhhhh----hhcC-eEeccCCCCCcCceeEEEeecC Confidence 777777766554333333333322 222332211111111 11111 1222 2222 12344544434 Q ss_pred ChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhh Q lcl|NC_019456. 257 KFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKL 322 (435) Q Consensus 257 ~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l 322 (435) + ...+....+.....|...-++|..-... .++. +..+. ...|..++...++.|...+..+ T Consensus 316 ~--~~~~~~~~~~L~~~I~~~s~~p~~~~~~--~gn~-Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~- 389 (470) T protein:vir:10 316 P--VEARDDALKITRKNIFLFGQGIDPANFE--SSNA-SGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS- 389 (470) T ss_pred C--hHHHHHHHHHHHHHHHHHhCCCCCCccc--cccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc- Confidence 3 3345666677788888888888542221 1221 11221 2223334444444343333211 Q ss_pred cccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccccc Q lcl|NC_019456. 323 FTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILD 402 (435) Q Consensus 323 ~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~ 402 (435) ......+.+.+......|..+.++.+.++ .|+++.--+++++++- +++- ++ ++.+.+.... T Consensus 390 -----~~d~~~i~i~f~~~~p~d~~e~~~~~~~~--~g~iS~et~l~~~p~v--~D~~-~E---------~eri~~E~~e 450 (470) T protein:vir:10 390 -----DADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIV--DDWQ-QE---------LKDLAKDKEE 450 (470) T ss_pred -----CcccceeeEEeccCCCCCHHHHHHHHHHH--hccCcHHHHHHhCCCC--CCHH-HH---------HHHHHHHHHH Confidence 11123456666777788999999999887 5899988888887542 2210 11 1111111100 Q ss_pred ccccccccccccccCCCCCCCC Q lcl|NC_019456. 403 NKIQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 403 ~~~~~~~~~~~~~~~~~~~~~~ 424 (435) ...... .......++.++++ T Consensus 451 ~~~~~~--~~~~~~~~~~dde~ 470 (470) T protein:vir:10 451 NDPYSN--QADELNGKGVNDEQ 470 (470) T ss_pred HHHhhc--cccccCCCCCCCCC Confidence 000000 01111112222222 No 217 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=96.38 E-value=0.00068 Score=37.95 Aligned_cols=393 Identities=11% Similarity=0.099 Sum_probs=174.3 Q ss_pred CchHHHHHhh-----ccccccccccccccch--------hhhhhccccc----cC-ccc------ccHHHHhhhHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQF-----FGVHDQANQIVQNPIP--------QPLDMAGVKL----EQ-ATF------SREHILESNEYIFSI 56 (435) Q Consensus 1 Mg~~~~~~~~-----~~~~~~~~~~~~~~~~--------~~~~~~~~~~----~~-~~~------~~~~~~~~~~~v~~~ 56 (435) |.||.+.-.. ......+..+....++ ..+...|... .. ... -..+....+|.|..| T Consensus 1 ~~~w~~~de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~A 80 (511) T protein:vir:56 1 MKFWTKEEEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVKELIKSYRALAEYHEVDDA 80 (511) T ss_pred CCCccchhhhhhhhhccCCcccccCCCCCCCceEEecccccceecceeccccccccCccchHHHHHHHHHHhhccchhhH Confidence 8877654321 1111111111111111 0111111100 00 000 112445678999999 Q ss_pred HHHHHHHHhhCc-----eeeeeccccc-------ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCC Q lcl|NC_019456. 57 VTRLSNVLASLP-----LHEYQNYKQM-------DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLST 124 (435) Q Consensus 57 i~~ia~~ia~~~-----~~~~~~~~~~-------~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~ 124 (435) |+.|.+.+.-+. +.+.-++.+. ..+.+.++++ =-+-...+ ..++..|.+.|..|..++.+... T Consensus 81 v~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYVDgRi~fHkiid~k~ 155 (511) T protein:vir:56 81 IQEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVS-LLQMRKHG----YKWFRKWYVDSRIYFHKILDKDN 155 (511) T ss_pred HHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhhhhcceEEEEEEecccc Confidence 999998877554 2232222111 1122333332 22233334 44556677789999988888776 Q ss_pred CcEEEEEEeCCceeEEEEc-----CC------CceEEEEEecC--------------CeeEEEchhheEEeccCC---Cc Q lcl|NC_019456. 125 GEPIALWPLDPNTVSILRN-----TD------NNSYWYRVTSD--------------IYNFTIPINDVIHVKHVV---PS 176 (435) Q Consensus 125 g~~~~l~~l~~~~v~~~~~-----~~------~~~~~~~~~~~--------------~~~~~~~~~~iih~~~~~---~~ 176 (435) | +.+|..|||..++..+. .+ +..-||.+.+. +....++.+.|.|..... +. T Consensus 156 G-I~eLr~lDPr~i~~vr~i~~~~~~~~~v~~~~~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~daI~y~hSGL~d~~~ 234 (511) T protein:vir:56 156 N-IIELRPLNPMKMELVREIQKETIDGVEVVKGTLEYYVYKQSDYKMPSWMSATNRAQTSFRIPKDAIVFAHSGLMRGCA 234 (511) T ss_pred c-eeehhhcCcccchhhhhhhcccccccccccceeeeeEecCCCcccCcccccccccccceeechhheeeecccceeccC Confidence 5 88999999987764321 11 12223333321 134678999997766432 24 Q ss_pred cccccCcHHHHHHHHHHHHHHHHHHHHHH-hhcCC--ce-EEEeCCcCCHHHHHHHHHHHHHHhcC-------------C Q lcl|NC_019456. 177 NSWYGVSPIDVLSSSLKFQRSVENFSQNE-MEKKD--KF-VLQYDRSISPEKRQAMVNDFLRMVKE-------------N 239 (435) Q Consensus 177 ~~~~G~s~l~~~~~~i~~~~~~~~~~~~~-~~n~~--~~-~~~~~~~~~~e~~~~~~~~~~~~~~~-------------~ 239 (435) +..+.+|-|..+.+.+.....++...-.+ +...| +. -+-+ +.+....+++....+...++| . T Consensus 235 ~~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDV-GnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~dd 313 (511) T protein:vir:56 235 DDPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDV-GNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNT 313 (511) T ss_pred CCCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEec-CCCCchhHHHHHHHHHHhcCceEEEeccCceeccc Confidence 55567899999999988887777665533 23333 22 2223 233333333333333222222 1 Q ss_pred Cc-cccc----------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccc-Cccc---HHH-HH-HHH Q lcl|NC_019456. 240 GG-AVVQ----------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQA-KSTT---NVE-HV-THS 302 (435) Q Consensus 240 ~~-~~vl----------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~-~~~~---~~e-~~-~~~ 302 (435) .+ ...| +.|.+++.|.-...--++.+. .+....+.++++||.+-|..... ++.+ ..| .. ..- T Consensus 314 rk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiK 392 (511) T protein:vir:56 314 TNAMSMLEDYYLPRREGSKGTEVSTLPGGQSLGDIEDV-LYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELK 392 (511) T ss_pred hhhhhhHhhhcccccCCCCccceeeccccCCcChHHHH-HHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHH Confidence 11 1222 246777777655443343344 45677799999999999974432 1111 112 11 112 Q ss_pred HHHHHhHHHHH----HHHHHHHhhcc-----cccccC-cceeeechhhhhccCH-------HHHHHHHHHHHh--cCCcC Q lcl|NC_019456. 303 WTMTLMPIIRQ----YESQFNMKLFT-----PGKRVK-GFYFSFNVNGLLRGDT-------AARTQYYQTLTR--NGIFK 363 (435) Q Consensus 303 ~~~~i~P~~~~----i~~~l~~~l~~-----~~~~~~-g~~i~fd~~~l~~~d~-------~~~~~~~~~~~~--~g~~t 363 (435) |...|..+-.. +.+.|..+|+. +.++.. ...|.|++..--.... ..+++++..+-. +-.++ T Consensus 393 F~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S 472 (511) T protein:vir:56 393 FTKFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYS 472 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccc Confidence 33333333333 33344444332 233221 1234444332112111 222332222210 11334 Q ss_pred HHHHHHH-hCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCC Q lcl|NC_019456. 364 PNEIREL-EGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGE 420 (435) Q Consensus 364 ~NE~R~~-~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (435) .+=+++. |.+.-. + +....+.... ..+.+..++.+.+. T Consensus 473 ~~yi~k~ILr~tDe--e-------------i~~~~k~I~~----E~k~~~~~~~e~~f 511 (511) T protein:vir:56 473 HKYIQKNILRLSDD--Q-------------ITAMQSEIDE----EETNPRFQQDDQGF 511 (511) T ss_pred hHHHHHHHhccCHH--H-------------HHHHHHHHHH----hhcCCCCCCcccCC Confidence 4444432 333210 0 0000000000 00011122222222 No 218 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=96.34 E-value=0.00073 Score=37.79 Aligned_cols=378 Identities=10% Similarity=0.026 Sum_probs=158.4 Q ss_pred Cch--HHHH------------------HhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHH Q lcl|NC_019456. 1 MSF--MSKV------------------RQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRL 60 (435) Q Consensus 1 Mg~--~~~~------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~i 60 (435) |.+ ...+ .....+...--............ ...............-..++....+|+.. T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~ 79 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAE-NEAKAEDNAFRNADNRISHNWHQLLLDQK 79 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhccc-ccccccccccccccceeccchhHHHHHhh Confidence 222 1111 11111110000000000000000 00000000000000011234455556666 Q ss_pred HHHHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEE Q lcl|NC_019456. 61 SNVLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSI 140 (435) Q Consensus 61 a~~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~ 140 (435) +.-+-.-|+.+....... ...+..+.. | ........+...+..+|.+|.++..+...|.+ .+..++|..+-+ T Consensus 80 ~~yl~G~p~~~~~~~~~~--~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~-~~~~~~p~~~~~ 151 (471) T protein:vir:10 80 KAYALTYPPTFDVDDKKV--NDMIVDVLG--D---DYERISKQLCVNAGNAGIAWLHVWKDASDNSF-RYACVDSKEVIP 151 (471) T ss_pred hhhhcccCceeccCChHH--HHHHHHHHh--c---CHHHHHHHHHHHHhhCCeEEEEEEeeCCCCee-EEEEEcccceEE Confidence 655556666654333221 122232321 2 34455677789999999999999887666764 677889988887 Q ss_pred EEcCCC--ceE----EEEEe--cCCee----EEEchhheEEeccCC-------------------------------Cc- Q lcl|NC_019456. 141 LRNTDN--NSY----WYRVT--SDIYN----FTIPINDVIHVKHVV-------------------------------PS- 176 (435) Q Consensus 141 ~~~~~~--~~~----~~~~~--~~~~~----~~~~~~~iih~~~~~-------------------------------~~- 176 (435) ..+... ... +|... .++.. ..+..+.+.|++... +. T Consensus 152 i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 231 (471) T protein:vir:10 152 IYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFG 231 (471) T ss_pred EEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCC Confidence 776543 111 12211 11111 234455555554211 10 Q ss_pred --------cccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccc--- Q lcl|NC_019456. 177 --------NSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQ--- 245 (435) Q Consensus 177 --------~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl--- 245 (435) +...|.|-+..+...+.....+.....+.+......++.+.+... +..+.....+ .. .+++.+ T Consensus 232 ~iPvv~~~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~-~~~~~~~~~~----~~-~~~i~~~~~ 305 (471) T protein:vir:10 232 LVPFIPFKNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGG-QDKQEFLEDL----KR-YKMIKMDND 305 (471) T ss_pred ceeEEEeccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCc-cccchhHHHh----hc-CCeEEecCC Confidence 112466777776666665554332223232222222222222110 1111111111 11 122222 Q ss_pred ----cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH--------------HHHHHHH Q lcl|NC_019456. 246 ----EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT--------------HSWTMTL 307 (435) Q Consensus 246 ----~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~--------------~~~~~~i 307 (435) +++++|.....+ ...+....+...+.|....++|..-.... ++.+. .+.. ..|...+ T Consensus 306 ~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~I~~~s~tp~~~~~~~--gn~Sg-~Alk~~~~~l~~k~~~~~~~~~~~l 380 (471) T protein:vir:10 306 GMGDQSGVTTIAIDIP--TEARNLILERTKKQIFISGQGVNPETDKL--GNSSG-VALKFLYSLLELKAGNMETQFRSGY 380 (471) T ss_pred CCccCccceEEeecCC--hHHHHHHHHHHHHHHHHHhCCcCCCcccc--cCccH-HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 123444443333 34466677777888988888886533221 12111 2211 1222223 Q ss_pred hHHHHHHHHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeec Q lcl|NC_019456. 308 MPIIRQYESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYIS 387 (435) Q Consensus 308 ~P~~~~i~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~ 387 (435) ...++.+...+ ... ....+.+.+......|..+.++.+.++ .|+++.--++++++.- +++- T Consensus 381 ~~~~~li~~~~-----~~~---d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~~p~v--~D~~------- 441 (471) T protein:vir:10 381 ATLVKMILKHL-----GLS---DKLKIKQTWTRNSINNDTEMAQVVSTL--ATITSRENVAKSNPIV--EDWQ------- 441 (471) T ss_pred HHHHHHHHHHh-----ccC---CCceeEEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCC--CCHH------- Confidence 33332222222 111 122355556677788999999999987 5889988888877432 1110 Q ss_pred ccccchhccccccccccccccccccccccCCCCCCCCCC Q lcl|NC_019456. 388 KDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENG 426 (435) Q Consensus 388 ~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 426 (435) .-++.+.+... .. ....+...+.+++++.+ T Consensus 442 ---~E~eri~~E~~---~~---~~~~~~~~~~~~~~e~~ 471 (471) T protein:vir:10 442 ---DELRLQKAEQE---GR---SEKLYDMEEVEHESEVE 471 (471) T ss_pred ---HHHHHHHHHHH---HH---HhcccccCCCCCccccC Confidence 11222211110 00 01111222333333322 No 219 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=96.24 E-value=0.00084 Score=37.45 Aligned_cols=417 Identities=11% Similarity=0.081 Sum_probs=163.9 Q ss_pred CchHHHHHhhcccccccccc--ccccchh-hhhhccccccCc----cc-------ccHHHHhhhHHHHHHHHHHHHHHhh Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQI--VQNPIPQ-PLDMAGVKLEQA----TF-------SREHILESNEYIFSIVTRLSNVLAS 66 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~--~~~~~~~-~~~~~~~~~~~~----~~-------~~~~~~~~~~~v~~~i~~ia~~ia~ 66 (435) |||. +...-....+..++ ....+.. .+...|...... .. -..+....+|.|..||+.|.+.+.- T Consensus 5 fgf~--~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv 82 (558) T protein:vir:10 5 FGFS--IEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEADGAIEDVVNEAIV 82 (558) T ss_pred hcch--hhhhhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHhhccchhhHHHHhhcceeE Confidence 4553 11110111111111 1111110 010111111000 00 1124456789999999999988766 Q ss_pred Cc-----eeeeeccccc-------ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC--CCcEEEEEE Q lcl|NC_019456. 67 LP-----LHEYQNYKQM-------DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS--TGEPIALWP 132 (435) Q Consensus 67 ~~-----~~~~~~~~~~-------~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~--~g~~~~l~~ 132 (435) +. +.+.-++.+. ..+.+.++++ =-+-...+ ..++..|.+.|..|..++.|.. ..-+.+|.. T Consensus 83 ~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~-ll~F~~~~----~e~fR~WYVDgRiyfHKiid~k~pk~GI~ELr~ 157 (558) T protein:vir:10 83 SDLYDSPVEVELSNLNASNTLKKKIREEFRYIKE-MMDFDKKS----HEIFRNWYVDGRVFYLKVIDTKNPQEGIQDLRY 157 (558) T ss_pred ecCCCceEEEEecccCcchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhheeeeEEEEEEEEeCCCccccceeeee Confidence 54 2222222111 1122333332 22233334 4556667788999988876544 223889999 Q ss_pred eCCceeEEEEcC----------------C------CceEEEEEecCC-------------eeEEEchhheEEeccC-CCc Q lcl|NC_019456. 133 LDPNTVSILRNT----------------D------NNSYWYRVTSDI-------------YNFTIPINDVIHVKHV-VPS 176 (435) Q Consensus 133 l~~~~v~~~~~~----------------~------~~~~~~~~~~~~-------------~~~~~~~~~iih~~~~-~~~ 176 (435) |+|..++.++.- . +...||.+...+ ....++.+-|.+.... ... T Consensus 158 lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~~dAI~y~hSGL~d~ 237 (558) T protein:vir:10 158 IDPLKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIAKDSITMCTSGLVDR 237 (558) T ss_pred eCcccceeeeeeccccccccceeeeecccceeeccceeEeeeecCCcccccccceeecCCCceeechhheeeecccceec Confidence 999887543321 1 112233333321 1234554444444321 112 Q ss_pred cccccCcHHHHHHHHHHHHHHHHHHHHHH-hhcCC--ce-EEEeCCcCCHHHHHHHHHHHHHHhcC-------------C Q lcl|NC_019456. 177 NSWYGVSPIDVLSSSLKFQRSVENFSQNE-MEKKD--KF-VLQYDRSISPEKRQAMVNDFLRMVKE-------------N 239 (435) Q Consensus 177 ~~~~G~s~l~~~~~~i~~~~~~~~~~~~~-~~n~~--~~-~~~~~~~~~~e~~~~~~~~~~~~~~~-------------~ 239 (435) +.-.-+|-|..+.+.+.....++...-.+ +...| +. -+-+ +.+....+++....+...++| . T Consensus 238 ~~~~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDV-GnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~dd 316 (558) T protein:vir:10 238 NKNRVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDV-GNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDD 316 (558) T ss_pred CCCeeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEec-CCCCchhHHHHHHHHHHhccceEEEeccCceeccc Confidence 33344678888888888777766655433 23333 22 2223 233333333333333222222 1 Q ss_pred Cc-cccc----------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc-cHHHH-H-HHHHHH Q lcl|NC_019456. 240 GG-AVVQ----------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST-TNVEH-V-THSWTM 305 (435) Q Consensus 240 ~~-~~vl----------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~-~~~e~-~-~~~~~~ 305 (435) .+ ...| ..|.+++.|.-...-.++.+. .+....+.++++||.+-|.....-+. ...|= . ..-|.. T Consensus 317 rk~msMlEDyWLpRReGgrgTEItTLpGgqnLgem~DV-~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~K 395 (558) T protein:vir:10 317 RKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGELSDV-DYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFAK 395 (558) T ss_pred chhhhhHhhhcccccCCCCccceeeccccCCcchHHHH-HHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHH Confidence 11 2222 246777777655443444444 45577799999999999975433221 11121 1 112333 Q ss_pred HHhHHHHH----HHHHHHHhhcc-----cccccC-cceeeechhhhhccC-------HHHHHHHHHHHHh-cC-CcCHHH Q lcl|NC_019456. 306 TLMPIIRQ----YESQFNMKLFT-----PGKRVK-GFYFSFNVNGLLRGD-------TAARTQYYQTLTR-NG-IFKPNE 366 (435) Q Consensus 306 ~i~P~~~~----i~~~l~~~l~~-----~~~~~~-g~~i~fd~~~l~~~d-------~~~~~~~~~~~~~-~g-~~t~NE 366 (435) .|..+-.. +.+.|...|+. +.++.. ...|.|++..--... +..|++++..+-. .| .++.+= T Consensus 396 FI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dy 475 (558) T protein:vir:10 396 FVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEY 475 (558) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHH Confidence 33333333 33334444332 233321 123444433111111 2223333332211 11 233333 Q ss_pred HHHH-hC------------------CCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCC Q lcl|NC_019456. 367 IREL-EG------------------QAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGL 427 (435) Q Consensus 367 ~R~~-~g------------------~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (435) +|+. |. -+..++|...++ +.+...|-. ++. ..+... ..+..+.-+-.....+ T Consensus 476 i~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~-~~~~~~~~~--~~~----~~~~~~--~~~~~~~~~~~~~~~~ 546 (558) T protein:vir:10 476 VRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDP-ITGEPLPQE--GDP----AMEGMG--EQPVDPDLEAQAQAVD 546 (558) T ss_pred HHHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccCh-hhccccCcc--CCc----hhccCC--CCCcccccccchhhhh Confidence 3332 22 223332221111 111111100 000 000000 0000000000001111 Q ss_pred CCCCCCCC Q lcl|NC_019456. 428 QSTEPEGS 435 (435) Q Consensus 428 ~~~~~~~~ 435 (435) ..++.+.+ T Consensus 547 ~~~~~~~~ 554 (558) T protein:vir:10 547 AQYSKDTK 554 (558) T ss_pred hhhhhhhh Confidence 11111111 No 220 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=95.91 E-value=0.0013 Score=36.46 Aligned_cols=402 Identities=11% Similarity=0.019 Sum_probs=152.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhh----hHHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILES----NEYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) +.-|..++...++...=........+.+ ..-.......++. .+++...++.++..+.+-|..+. T Consensus 20 ~~~W~~ird~~~G~~~~r~~g~~YLPk~--------~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vf~k~p~~~---- 87 (513) T protein:vir:97 20 LPRWHVIETLLGGTEAMREAGETYLPRH--------QEETDKGYQERLASAVLLNMVEQTLDTLSGKPFSEPIKLN---- 87 (513) T ss_pred HHHHHHHHHHhcChHHHHhhcccCCCCC--------CCCCHHHHHHHHhcccCCChHHHHHHHHhhhhhhcCcccC---- Confidence 3445555444433311000000000000 0000011122222 23444555555544444444331 Q ss_pred ccccchHHH-hhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCC-----------------cEEEEEEeCCcee Q lcl|NC_019456. 77 QMDNEPLAD-LLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTG-----------------EPIALWPLDPNTV 138 (435) Q Consensus 77 ~~~~~~l~~-~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g-----------------~~~~l~~l~~~~v 138 (435) ......+.. ++..---...+-.+|.+.++...+.+|.++++|......+ +|. +..+.|..| T Consensus 88 ~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rPy-~~~~~~e~I 166 (513) T protein:vir:97 88 EDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLRPY-WVMIKPECL 166 (513) T ss_pred cCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccCce-EEEecHhhh Confidence 111223333 3333344677899999999999999999998885432211 111 222222211 Q ss_pred E----------------------EEEcCCCceEE--EEEecCCeeEEEch---------hheEEecc------------- Q lcl|NC_019456. 139 S----------------------ILRNTDNNSYW--YRVTSDIYNFTIPI---------NDVIHVKH------------- 172 (435) Q Consensus 139 ~----------------------~~~~~~~~~~~--~~~~~~~~~~~~~~---------~~iih~~~------------- 172 (435) - ...|+.+...+ |.+-..|..+.+.. .-+++-.. T Consensus 167 inW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~~~~~e~~~~~~g~~~l~~IP~v~~~ 246 (513) T protein:vir:97 167 LFARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSNAQKEEWALADEWATGLNYVPLVTFY 246 (513) T ss_pred cCcceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCCccccceEEecCCCCcCCceeEEEEe Confidence 0 01122221111 11111111111100 00111111 Q ss_pred CCCccccccCcHHHHHH-HHHHHHHHHHHHHHHHhhcC-CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccC-Cc Q lcl|NC_019456. 173 VVPSNSWYGVSPIDVLS-SSLKFQRSVENFSQNEMEKK-DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEA-GW 249 (435) Q Consensus 173 ~~~~~~~~G~s~l~~~~-~~i~~~~~~~~~~~~~~~n~-~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~-g~ 249 (435) ....+...|.+|+..++ ..+........+...++.-+ +..++..- +++..+ ...-+++.++.+++ |. T Consensus 247 ~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~---~~~~~~-------~i~iG~~~~~~lpe~~~ 316 (513) T protein:vir:97 247 ADRQGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSGA---SGEDSD-------PVVVGPNKVLYNPDPAG 316 (513) T ss_pred cCCCCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeecC---CcCCCC-------ceEeeccccccCCCCCC Confidence 11223445777876544 35666666566655555444 33344311 111100 11224456666764 44 Q ss_pred --eeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHH---HHHHhHHHHHHHHHHHHhhcc Q lcl|NC_019456. 250 --KVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSW---TMTLMPIIRQYESQFNMKLFT 324 (435) Q Consensus 250 --~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~---~~~i~P~~~~i~~~l~~~l~~ 324 (435) +|.+.+-+...... +..+....++ +..|. .+|..... + .+.++...-+ ++.|.-++..+++.++..|-. T Consensus 317 ~~~yie~~g~~i~~~~-~~l~~le~qm-~~~Ga--~ll~~~~~-~-~Ta~a~~~~~~~~~S~L~~~a~~le~al~~~l~~ 390 (513) T protein:vir:97 317 RFYYVEHTGQAIAAGR-TDLKDLEEQM-AGYGA--EFLKRKTG-G-QTATARALDSAEATSDLSAMTGLFEDALAQALDI 390 (513) T ss_pred cceeeccCchhHHHHH-HHHHHHHHHH-HHHHH--HhhccCCc-c-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44455444333222 2222223333 33332 23322221 1 2222222222 344555666666666554322 Q ss_pred ccc----ccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCC-CCCC-CcCCceee---e---cccccc Q lcl|NC_019456. 325 PGK----RVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQ-APIP-DEAADHLY---I---SKDLYP 392 (435) Q Consensus 325 ~~~----~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~-~p~~-~~~gd~~~---~---~~n~~~ 392 (435) -.. ...+..|..+.+-....-....++++.+++..|.++.-..++.+-. .-++ +.--|+.+ . ...... T Consensus 391 ~a~wlg~~~~~~~v~in~dF~~~~~~~~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~ 470 (513) T protein:vir:97 391 TADWLRLGPNGGTVELVKDYDLEEMDAPGLQALQVAREKRDISRKTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMGR 470 (513) T ss_pred HHHHhCCCCCccEEEeccccCcccCCHHHHHHHHHHHhCCCCCHHHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccCC Confidence 111 1113445555544333333445677777889999998777766532 1111 00001000 0 000000 Q ss_pred ----hhccccccccccccccccccccccCCCCCCCCCCCCCCCC Q lcl|NC_019456. 393 ----LDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEP 432 (435) Q Consensus 393 ----l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (435) ++.+. ...+.++.+........-++++-++.-+.+.+++ T Consensus 471 ~~~d~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) T protein:vir:97 471 AGLDLDPAQ-KNPPEGGEGEGEGEGEGGEGGEGGEGGGNPGGES 513 (513) T ss_pred CCccccccC-CCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCC Confidence 00000 0001111111111111112222223334444444 No 221 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=95.79 E-value=0.0015 Score=36.14 Aligned_cols=369 Identities=10% Similarity=0.032 Sum_probs=147.9 Q ss_pred Cch-------HHHH---HhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCcee Q lcl|NC_019456. 1 MSF-------MSKV---RQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLH 70 (435) Q Consensus 1 Mg~-------~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 70 (435) -.+ ..++ .+...+.. ............ ..... ...+. -..++.....++..+.-+-.-|+. T Consensus 32 ~~~i~~~~~~~~~~~~~~~yY~g~~---~i~~~~~~~~~~---~~~~~-~~~~~--ki~~n~~~~Iv~~~~~~l~g~p~~ 102 (468) T protein:vir:96 32 LRLITKHKENVEDITVGERYYNHQP---DVLFNAPKRNVK---GEIDP-FKPDW--RMYTNYHQNLVDQKVAYAVANPVT 102 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCC---cccccccccccc---ccccc-ccccc--ccccchHHHHHHHHHhhhccCCce Confidence 000 0111 11111100 000000000000 00000 00000 012344445566655555555666 Q ss_pred eeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCC--Cce Q lcl|NC_019456. 71 EYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTD--NNS 148 (435) Q Consensus 71 ~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~--~~~ 148 (435) +.-++... ...+...+. .+..+....+..++..+|.+|.++..+.. |.+ .+..++|..+.+..+.. +.. T Consensus 103 ~~~~d~~~-~~~l~~~~~------n~~~~~~~~~~~~~~~~G~~~~~v~~d~~-~~~-~i~~~~p~~~~~v~~~~~~~~~ 173 (468) T protein:vir:96 103 YGTEDEKS-LKTIQEVLN------HKWDDKLVDILTAASNKGVEWIQPYVDEQ-GEF-KTFRVPAEQAIPIWTNKERDEL 173 (468) T ss_pred eccCChHH-HHHHHHHHh------cCHHHHHHHHHHHHhhcCeEEEEEEEcCC-Cce-EEEEEcccceEEEEcCCCCCce Confidence 53333222 223333332 23456667788999999999988777644 554 57788888877766532 221 Q ss_pred E----EEEEecCCeeEEEchhheEEeccC-------------------------CC---------ccccccCcHHHHHHH Q lcl|NC_019456. 149 Y----WYRVTSDIYNFTIPINDVIHVKHV-------------------------VP---------SNSWYGVSPIDVLSS 190 (435) Q Consensus 149 ~----~~~~~~~~~~~~~~~~~iih~~~~-------------------------~~---------~~~~~G~s~l~~~~~ 190 (435) . +|..........+..+.+.|++.. ++ .+...|.|-+..+.. T Consensus 174 ~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~g~sd~e~v~~ 253 (468) T protein:vir:96 174 KAFIRLYELDGGERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFKNNPQEVSDLFMYKT 253 (468) T ss_pred EEEEEEEEecCceEEEEEeCCeEEEEEEcCCceeecccccccccccceeeccccccCCcccEEEecCCCCCCCchHHHHH Confidence 1 122211122223334444443221 00 012357777777666 Q ss_pred HHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCcccccc--CCceeeeccCChhhHHHHHH Q lcl|NC_019456. 191 SLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQE--AGWKVDRYESKFEPADLSSV 266 (435) Q Consensus 191 ~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~--~g~~~~~~~~~~~~~~~~e~ 266 (435) .+.....+.....+.+... +..+++ +....+ .+.... ..+. ++++.++ ++.+++.+........+... T Consensus 254 liDa~d~~~S~~~~~~~~~~~p~lv~~-g~~~~~--~~~~~~----~~~~-~~~i~~~~d~~~~~~~l~~~~~~~~~~~~ 325 (468) T protein:vir:96 254 IIDAMDKRLSDTQNTFDEATELIYVLK-GYEGED--LEEFMY----NLKY-YKAINVDGDGSGGVDTIQIDVPVQSAKEY 325 (468) T ss_pred HHHHHHHHHHHHHHHHHHhcCceeeee-cCCccc--cchhhh----hhhc-CceEEecCCCCCcceEEeecCChHHHHHH Confidence 6666554333323233222 222322 212211 111111 1122 2333332 33344444443334456677 Q ss_pred HHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHH--------------HHHHHHhHHHHHHHHHHHHhhcccccccCcc Q lcl|NC_019456. 267 EQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTH--------------SWTMTLMPIIRQYESQFNMKLFTPGKRVKGF 332 (435) Q Consensus 267 ~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~--------------~~~~~i~P~~~~i~~~l~~~l~~~~~~~~g~ 332 (435) .+...+.|...-++|....... .++ .+.++... .|...+...++.+... +....... T Consensus 326 ~~~l~~~I~~~s~~p~~~~~~~-~~n-~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~-----~g~~~d~~-- 396 (468) T protein:vir:96 326 LDMLRDYVIEFGQGVDFQQDKF-GNS-PSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDF-----YKLSIKVQ-- 396 (468) T ss_pred HHHHHHHHHHHhCccccccccc-ccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hCCCcccc-- Confidence 7778888999999886432211 111 12222211 1222222222222221 11111112 Q ss_pred eeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccccccccccccccc Q lcl|NC_019456. 333 YFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVA 412 (435) Q Consensus 333 ~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~ 412 (435) .+.+.++.-...|..+.++. +.+.|+++.-.++++++.- +++- .-++.+.+ .+.... T Consensus 397 ~i~i~f~~~~p~d~~e~a~~---~~~~g~iS~et~i~~l~~v--~D~~----------~E~~ri~~---E~~~~~----- 453 (468) T protein:vir:96 397 DVEITFNFNVMVNELEQSQI---GVNSQYLSKETVVTNHPWV--DDPV----------AEMERIDQ---EELALP----- 453 (468) T ss_pred eeeEEecCCCCcCHHHHHHH---HHhcCCCchHHHHHhCCCC--CCHH----------HHHHHHHH---HHHHHH----- Confidence 23333344445666555554 5567999988888877432 1110 11111111 110000 Q ss_pred ccccCCCCCCCCCCCCC Q lcl|NC_019456. 413 APKQEGGENTNENGLQS 429 (435) Q Consensus 413 ~~~~~~~~~~~~~~~~~ 429 (435) ..+.+-++.++.+++ T Consensus 454 --~~~~~~~~~~~~~~~ 468 (468) T protein:vir:96 454 --SIEEGLNGKENNEPT 468 (468) T ss_pred --HHhhccCCCCCCCCC Confidence 011111222222222 No 222 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=95.20 E-value=0.0025 Score=34.81 Aligned_cols=397 Identities=11% Similarity=0.054 Sum_probs=169.0 Q ss_pred CchHHHHHhhcccccc-------------ccccc--cccch--------hhhhhcccccc-C----ccc-------ccHH Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQ-------------ANQIV--QNPIP--------QPLDMAGVKLE-Q----ATF-------SREH 45 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~-------------~~~~~--~~~~~--------~~~~~~~~~~~-~----~~~-------~~~~ 45 (435) |.=|+.....|+.-.+ +.++. ...++ ..+...+.... + ... -..+ T Consensus 1 ~~~~~~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR 80 (524) T protein:vir:10 1 MANFNTILSFLKPWANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYR 80 (524) T ss_pred CCchhhHHHHhhhhhcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHHH Confidence 6555554444432211 11111 10000 01111111000 0 000 1123 Q ss_pred HHhhhHHHHHHHHHHHHHHhhCc-----eeeeeccccc-------ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCC Q lcl|NC_019456. 46 ILESNEYIFSIVTRLSNVLASLP-----LHEYQNYKQM-------DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGN 113 (435) Q Consensus 46 ~~~~~~~v~~~i~~ia~~ia~~~-----~~~~~~~~~~-------~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~ 113 (435) ....+|.|..||+.|.+.+.-+. +.+.-++.+. ..+.+.++++ =-+-...+ ..++..|.+.|. T Consensus 81 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYVDgR 155 (524) T protein:vir:10 81 NLMNNYEVDNAVQEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLN-LLNFQRKG----TDHFQRWYVDSR 155 (524) T ss_pred HHhhccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhheeece Confidence 45678999999999998876554 2222222111 1122333332 22233334 445666778899 Q ss_pred cceEEeeeCCC--CcEEEEEEeCCceeEEEE----cC-CC------ceEEEEEe-------------cCCeeEEEchhhe Q lcl|NC_019456. 114 GYAWIQKSLST--GEPIALWPLDPNTVSILR----NT-DN------NSYWYRVT-------------SDIYNFTIPINDV 167 (435) Q Consensus 114 ~~~~i~~~~~~--g~~~~l~~l~~~~v~~~~----~~-~~------~~~~~~~~-------------~~~~~~~~~~~~i 167 (435) .|..++.+... .-+.+|..|+|..++..+ +. ++ ...+|.+. ..+....++.+.| T Consensus 156 i~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~~dAI 235 (524) T protein:vir:10 156 IFFHKIINPKKMKDGVQELRRLDPRQVQYIREIVTRMEDGVKIVDGYREFFVYDTGHESYCADGRIYSAGTKVKIPRAAV 235 (524) T ss_pred EEEEEEeeCCCccccceeeeeeCCccceeeeeecccCcccchhhcchhhheeecCCCcccccCcceecCCcceecchhhe Confidence 99887765332 237899999999886532 11 11 11122222 2233567899999 Q ss_pred EEeccC-CCccccccCcHHHHHHHHHHHHHHHHHHHHHH-hhcCC--ce-EEEeCCcCCHHHHHHHHHHHHHHhcC---- Q lcl|NC_019456. 168 IHVKHV-VPSNSWYGVSPIDVLSSSLKFQRSVENFSQNE-MEKKD--KF-VLQYDRSISPEKRQAMVNDFLRMVKE---- 238 (435) Q Consensus 168 ih~~~~-~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~-~~n~~--~~-~~~~~~~~~~e~~~~~~~~~~~~~~~---- 238 (435) .|.... .+.++-.-+|-|..+.+.+.....++...-.+ +...| +. -+-+ +.+....+++....+...++| T Consensus 236 vy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDV-GnlPk~KAeqYl~~im~k~kNKlvY 314 (524) T protein:vir:10 236 VYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDT-GNMPSRKAAAQMQHIMNTMKNRVVY 314 (524) T ss_pred eeeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEec-CCCCchhHHHHHHHHHHhcCceeEE Confidence 997643 22333345688888888888777776655433 23333 22 2223 333333333333333222222 Q ss_pred ---------CCc-cccc----------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCc----cc Q lcl|NC_019456. 239 ---------NGG-AVVQ----------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKS----TT 294 (435) Q Consensus 239 ---------~~~-~~vl----------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~----~~ 294 (435) ..+ ...| +.|.+++.|.-...--++.+. .+....+.++++||.+-|+....+. .+ T Consensus 315 Da~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~ 393 (524) T protein:vir:10 315 DASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGMSDMDDV-LYFRTALYRALRIPESRIPSESNSGVMFDAG 393 (524) T ss_pred eccCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHH-HHHHHHHHHHhCCCchhccCCCCcccccccc Confidence 112 1222 246777777655443343344 4567779999999999996443322 22 Q ss_pred HHHHHH-HHHHHHHhHHHHH----HHHHHHHhhcc-----cccccC-cceeeechhhhhccC-------HHHHHHHHHHH Q lcl|NC_019456. 295 NVEHVT-HSWTMTLMPIIRQ----YESQFNMKLFT-----PGKRVK-GFYFSFNVNGLLRGD-------TAARTQYYQTL 356 (435) Q Consensus 295 ~~e~~~-~~~~~~i~P~~~~----i~~~l~~~l~~-----~~~~~~-g~~i~fd~~~l~~~d-------~~~~~~~~~~~ 356 (435) ++=... .-|...|..+-.. +.+.|..+|+. +.++.. ...|.|++..--... +..+++++..+ T Consensus 394 ~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~ 473 (524) T protein:vir:10 394 TAITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMA 473 (524) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHh Confidence 211111 1233333333333 33444444332 233221 123444433211111 12233333322 Q ss_pred Hh-cC-CcCHHHHHHH-hCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCC Q lcl|NC_019456. 357 TR-NG-IFKPNEIREL-EGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGEN 421 (435) Q Consensus 357 ~~-~g-~~t~NE~R~~-~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 (435) -. .| .++.+=+|+. |.+.-. + +....+....... . .--..|..+-.+. T Consensus 474 dpyvGky~s~~yi~k~ILr~tDe--e-------------i~~~~k~I~~E~k-~-~~~~~~~~~~~~f 524 (524) T protein:vir:10 474 EPFIGKYISHQTAMKDFLQMTDE--E-------------INQEAKQIEEESK-E-ARFQNPDEEEEDF 524 (524) T ss_pred hhhhcccchhHHHHHHHhccCHH--H-------------HHHHHHHHHHHhh-c-CCCCCCChhhhcC Confidence 11 11 2344444432 333210 0 0000000000000 0 0000111111111 No 223 >protein:vir:97376 Length: 320 # NCBI annotation: putative portal protein # Family: family:all:11744 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762589;genbank:gi:115304290;genbank:GeneID:5130579 Probab=95.01 E-value=0.0011 Score=36.89 Aligned_cols=304 Identities=16% Similarity=0.199 Sum_probs=134.3 Q ss_pred CchHHHHHh-hccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccccc Q lcl|NC_019456. 1 MSFMSKVRQ-FFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQMD 79 (435) Q Consensus 1 Mg~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 79 (435) ||+|+.-++ .+.+.-+..-.++.....-..+.|+ .-|.+.+-+|++||.-- -.|+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~~~~~~-~~~~~----- 57 (320) T protein:vir:97 1 MGIFNFKKRETLTPELKESIIRQVTIEDESPFTGT-----------------TDFNVRNEVAESIATYL-GAYKT----- 57 (320) T ss_pred CCccccccccccChhHHhhhhheeeeccCCCcccc-----------------cccchhhHHHHHHHHHh-hhhcc----- Confidence 999874432 1111111111111111100011111 11122344555555321 11111 Q ss_pred cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCceEEEEEecCCe- Q lcl|NC_019456. 80 NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNSYWYRVTSDIY- 158 (435) Q Consensus 80 ~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~- 158 (435) ....+.+|. +-..|++.++.+.+..-..|++.-. .-|+ ...++..++-.+ -++. +.-++.. T Consensus 58 ~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~----~~~~~~~~~~~~----~~~~-~~~~D~FN 119 (320) T protein:vir:97 58 SAKRLSLLT-------NNPSFLRRLVKHALHNKTTYVYKSP--TYGW----LITDSMTIEGLR----ARLT-FTLPDPFN 119 (320) T ss_pred ccceeeeee-------CCHHHHHHHHHHhhcccceEEeeCC--ccce----eeecceeeeeee----eeEE-EecCcccc Confidence 111222332 2337999999999998887776543 2232 223322222111 0000 1011110 Q ss_pred ---eEEEchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCC--ceEEEeCCcCC-HHHHHHHHHHH Q lcl|NC_019456. 159 ---NFTIPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKD--KFVLQYDRSIS-PEKRQAMVNDF 232 (435) Q Consensus 159 ---~~~~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~--~~~~~~~~~~~-~e~~~~~~~~~ 232 (435) ..++|-.|+=. ..+.++|..+- .+...++. +....-+.+.+.. +..+..+-+.. +|-.++..+.+ T Consensus 120 ~~V~mtvpfyD~~I-----Ldnpl~gv~tq-e~gkM~g~---a~~~v~kkL~~~~~IKafi~Tdid~GLee~kD~~~~kI 190 (320) T protein:vir:97 120 SAVTMTVPFYDVGI-----IDSPLVEVDTE-EANKMLEA---AYSAVMKKLHNTGAIKAFISSDIDVGLEKMKEESDSKI 190 (320) T ss_pred eeEEEEeeeechhh-----hhhhhcccChH-HhhHHHHH---HhhhhhhhccccceeEEEEecccchhHHHHHHHHHHHH Confidence 11111111111 12344565553 22222222 2222333344443 44555544432 33344444444 Q ss_pred HHHh---cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhH Q lcl|NC_019456. 233 LRMV---KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMP 309 (435) Q Consensus 233 ~~~~---~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P 309 (435) .++. +--.+.-+++.+-+++++.....-+.-.+. ...++..+.-|+||-.+|-+. ++.++.-.|+...+.| T Consensus 191 k~mq~~A~~~nG~T~i~~~dDI~Qi~pDYS~sn~~D~-~l~~t~alS~y~m~~~IL~Gs-----Ate~~~Iaf~~~~V~P 264 (320) T protein:vir:97 191 KAMLATAELLSGYTYIQRGDDVTQMMPDYTTSNVTDF-AAMRTFAASQLSVSDKILDGS-----ATDGEKVAVMFRFVEP 264 (320) T ss_pred HHHHHHHHHhcCcccccCCcceeeecccccccchhHH-HHHHHHHHhhcCCchhhcccc-----CCcceeeehhhHhHHH Confidence 3322 223568888999999998776554433332 455677888999999988543 3446777888999999 Q ss_pred HHHHH---HHHHHHhhcccccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCC--cCCcee Q lcl|NC_019456. 310 IIRQY---ESQFNMKLFTPGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPD--EAADHL 384 (435) Q Consensus 310 ~~~~i---~~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~--~~gd~~ 384 (435) ++.++ +..|..++-. ...+.|-.. .|-+..|.+- -.|.+..|+ .|||+- T Consensus 265 LL~Q~~~~Ek~Lvy~m~~------E~FVs~mtT-------------------GG~l~S~~~~-~~~~~~~~~~~~~~~~~ 318 (320) T protein:vir:97 265 ILEQFREYEPSLIYAMRD------EFFVSFMTT-------------------GGMLNSNRVD-GWGKEKAPNESKGGDVG 318 (320) T ss_pred HHHHhhhcCcceeeeecc------ceeeeeeec-------------------Cceeeccccc-ccccccCCccccCCccc Confidence 99996 5566554421 112222111 2222222221 123332222 344443 Q ss_pred ee Q lcl|NC_019456. 385 YI 386 (435) Q Consensus 385 ~~ 386 (435) -+ T Consensus 319 ~~ 320 (320) T protein:vir:97 319 DV 320 (320) T ss_pred CC Confidence 33 No 224 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=94.39 E-value=0.0046 Score=33.42 Aligned_cols=402 Identities=9% Similarity=0.049 Sum_probs=155.5 Q ss_pred Cch-HHHHHhhccccccccccccccchhhhh----hccccccCc-ccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeec Q lcl|NC_019456. 1 MSF-MSKVRQFFGVHDQANQIVQNPIPQPLD----MAGVKLEQA-TFSREHILESNEYIFSIVTRLSNVLASLPLHEYQN 74 (435) Q Consensus 1 Mg~-~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~-~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 74 (435) |.+ -+++..++.......-.+-.-...++. +........ ...+.. ..++....+|+..+.-+-.-|+.+..+ T Consensus 13 ~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~k--i~~n~~~~iv~~~~~~l~g~~~~~~~~ 90 (489) T protein:vir:99 13 SKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNR--IASDFAKYITVFEQGYMLGVPVEYKNE 90 (489) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccCCcce--eecchHHHHHHHHhhhhccCCceeecC Confidence 332 111111111000000000000000110 000000000 000001 123455566777766665566665433 Q ss_pred ccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeee---CCCCcEEEEEEeCCceeEEEEcCCC--ceE Q lcl|NC_019456. 75 YKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKS---LSTGEPIALWPLDPNTVSILRNTDN--NSY 149 (435) Q Consensus 75 ~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~---~~~g~~~~l~~l~~~~v~~~~~~~~--~~~ 149 (435) .... ...+..++. ......+...+...++.+|.+|.++... +..|. ..+..++|..+.+..+... ... T Consensus 91 d~~~-~~~l~~~~~-----~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~-~~i~~~~p~~~~~v~dd~~~~~~~ 163 (489) T protein:vir:99 91 NKDL-QAAIDLMSV-----RNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTE-VKLYQLPAEQTFVIYDDTYQRNSL 163 (489) T ss_pred ChhH-HHHHHHHHh-----hcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcc-eEEEEEcccceEEEEcCCCCCceE Confidence 3222 122222222 2234566788899999999999877542 12233 4577888888877766432 111 Q ss_pred ----EEEEec-CCe----eEEEchhheEEeccCC--------------Cc---------cccccCcHHHHHHHHHHHHHH Q lcl|NC_019456. 150 ----WYRVTS-DIY----NFTIPINDVIHVKHVV--------------PS---------NSWYGVSPIDVLSSSLKFQRS 197 (435) Q Consensus 150 ----~~~~~~-~~~----~~~~~~~~iih~~~~~--------------~~---------~~~~G~s~l~~~~~~i~~~~~ 197 (435) +|.... .+. ...+.++.+.+++... +. +...|.|.+..+...+..... T Consensus 164 ~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~s~~~~v~~liDa~d~ 243 (489) T protein:vir:99 164 MAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPVNEYANNEERTGAYESVLDNIDAYDL 243 (489) T ss_pred EEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeEEEeecCCCCCCchhhhHHHHHHHHH Confidence 111111 111 1234445555543211 00 112356666655555554433 Q ss_pred HHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHh---------cCCCccccccC-------CceeeeccCChh Q lcl|NC_019456. 198 VENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMV---------KENGGAVVQEA-------GWKVDRYESKFE 259 (435) Q Consensus 198 ~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~---------~~~~~~~vl~~-------g~~~~~~~~~~~ 259 (435) +.....+..... +..+++ +.....+...+....+.... ...++++.++. +.++..+..... T Consensus 244 ~~s~~~~~~~~~~~~~l~i~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 322 (489) T protein:vir:99 244 SQSELANFQQDSVNALLVIA-GNAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYD 322 (489) T ss_pred HHHHHHHHHHHhhhhhhhhc-cCCcccccchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCC Confidence 222211111111 111111 11122222122211111000 01122333322 223333333333 Q ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH--------------HHHHHHHHhHHHHHHHHHHHHhhccc Q lcl|NC_019456. 260 PADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV--------------THSWTMTLMPIIRQYESQFNMKLFTP 325 (435) Q Consensus 260 ~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~--------------~~~~~~~i~P~~~~i~~~l~~~l~~~ 325 (435) ...+....+...+.|...-++|..-..... ++. +..+. ...|...+...++.+...+...-... T Consensus 323 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~-Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~ 400 (489) T protein:vir:99 323 TAGSEAYKNRLVADILRFTFTPDTQDMKFS-GVQ-SGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEA 400 (489) T ss_pred hHHHHHHHHHHHHHHHHHhCCccccccccc-ccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc Confidence 334556667778889999999864321111 111 11221 12233344444444433332211110 Q ss_pred ccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccccccc Q lcl|NC_019456. 326 GKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKI 405 (435) Q Consensus 326 ~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~ 405 (435) ........+.+.++.-...|..+.++.+.+++ |+++.-.+.++++. +.++-.+ . -++.+.+....... T Consensus 401 ~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~--giis~et~~~~l~~--v~~~d~~-----~---E~~ri~~E~~~~~~ 468 (489) T protein:vir:99 401 TTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY--GIVSDQTIFEILNT--VTGVDAE-----A---ELKRLKEEADKKQS 468 (489) T ss_pred ccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhcCC--CCchhHH-----H---HHHHHHHHHHHHhc Confidence 00000112445556666788899999999884 88998888887632 1110000 0 01111111100000 Q ss_pred cccccccccccCCCCCCCCCCCCCCCC Q lcl|NC_019456. 406 QTDASVAAPKQEGGENTNENGLQSTEP 432 (435) Q Consensus 406 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (435) ..+....++.+++.+.++.++ T Consensus 469 ------~~~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 469 ------LPEPRLVGDASGQEEPTAEKP 489 (489) T ss_pred ------cccccccCCCCCCcCCCCCCC Confidence 011112222222222222222 No 225 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=94.03 E-value=0.0056 Score=32.93 Aligned_cols=411 Identities=11% Similarity=0.066 Sum_probs=169.5 Q ss_pred Cc--hHHHHHhhcccccccccccccc--chhhh-h---hccccccCcc-c-------ccHHHHhhhHHHHHHHHHHHHHH Q lcl|NC_019456. 1 MS--FMSKVRQFFGVHDQANQIVQNP--IPQPL-D---MAGVKLEQAT-F-------SREHILESNEYIFSIVTRLSNVL 64 (435) Q Consensus 1 Mg--~~~~~~~~~~~~~~~~~~~~~~--~~~~~-~---~~~~~~~~~~-~-------~~~~~~~~~~~v~~~i~~ia~~i 64 (435) |. ||..-...-....+..+...+. +.... . ..|....... . -..+....+|.|..||+-|.+.+ T Consensus 1 ~~~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVnea 80 (537) T protein:vir:10 1 MAQQLFGFSLQRAKKVPKGPSFVQKDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVLNPECDSAVDDVVNET 80 (537) T ss_pred CccccccceeecccccccCCcccCCCcccccceeecccccccccccccccchHHHHHHHHHHHhhccchhhHHHHhhcce Confidence 32 1110000001111111111111 10000 0 0111100000 0 11244567899999999999887 Q ss_pred hhCce-----eeeeccccc-------ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC--CCcEEEE Q lcl|NC_019456. 65 ASLPL-----HEYQNYKQM-------DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS--TGEPIAL 130 (435) Q Consensus 65 a~~~~-----~~~~~~~~~-------~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~--~g~~~~l 130 (435) .-+.- .+.-+..+. ..+.+.++++ =-+....+ ..++..|.+.|..|..++.|.. ..-+.+| T Consensus 81 iv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~-ll~F~~~~----~e~fR~WYVDgRi~fhKiid~k~pk~GI~EL 155 (537) T protein:vir:10 81 ICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILR-LLDFDNRA----YEIFRRWYVDGRLFFHKVIDPKKPRQGLVEL 155 (537) T ss_pred eEecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhheeeeEEEEEEEEeCCCccccceee Confidence 76542 222111111 1122333332 22233334 4556667788999888876544 2238899 Q ss_pred EEeCCceeEEEEc-----CCCce-------------EEEEEec------CCeeEEEchhheEEecc-CCCccccccCcHH Q lcl|NC_019456. 131 WPLDPNTVSILRN-----TDNNS-------------YWYRVTS------DIYNFTIPINDVIHVKH-VVPSNSWYGVSPI 185 (435) Q Consensus 131 ~~l~~~~v~~~~~-----~~~~~-------------~~~~~~~------~~~~~~~~~~~iih~~~-~~~~~~~~G~s~l 185 (435) ..|+|..++.++. .++.. -|+.+.+ .+....++.+-|.+... .-..+..+.+|-| T Consensus 156 r~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI~~dAI~y~hSGl~d~n~~~i~syL 235 (537) T protein:vir:10 156 RYVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKIAPDSIAYCHSGIQDLNKNMVLSHL 235 (537) T ss_pred eeeCCccceeeEeecccCCccceEEecceeeeecccceeeeccccccccCCCceeccHhheeeecccceeCCCCeeeeee Confidence 9999988754432 11111 1222332 23345677755554442 2233456678889 Q ss_pred HHHHHHHHHHHHHHHHHHHH-hhcCC--ce-EEEeCCcCCHHHHHHHHHHHHHHhcC-------------CCc-cccc-- Q lcl|NC_019456. 186 DVLSSSLKFQRSVENFSQNE-MEKKD--KF-VLQYDRSISPEKRQAMVNDFLRMVKE-------------NGG-AVVQ-- 245 (435) Q Consensus 186 ~~~~~~i~~~~~~~~~~~~~-~~n~~--~~-~~~~~~~~~~e~~~~~~~~~~~~~~~-------------~~~-~~vl-- 245 (435) ..+.+.+.....++...-.+ +...| +. .+-+ +.+....+++....+...++| ..+ ...| T Consensus 236 hkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDV-GnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlED 314 (537) T protein:vir:10 236 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDV-GNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLED 314 (537) T ss_pred hhhhHHHHhhHHHHhhHHHHhhhccccceEEEEec-CCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhhhh Confidence 99999888877777665433 23333 22 2223 233333333333333222222 111 2222 Q ss_pred --------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc-cHHH-HH-HHHHHHHHhHHHHH- Q lcl|NC_019456. 246 --------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST-TNVE-HV-THSWTMTLMPIIRQ- 313 (435) Q Consensus 246 --------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~-~~~e-~~-~~~~~~~i~P~~~~- 313 (435) ..|.+++.|.-...--++.+. .+....+.++++||.+-|......+. ...| .. ..-|...|..+-.. T Consensus 315 yWLPRReGgrgTEItTLpGgqnlgem~DV-~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rF 393 (537) T protein:vir:10 315 FWLPRREGGRGTEISTLPGGQNLGELEDV-KYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRF 393 (537) T ss_pred hcccccCCCcccceeeccccCCcChHHHH-HHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHH Confidence 246777777655443343343 45677799999999999965433221 1112 11 11233333333333 Q ss_pred ---HHHHHHHhhcc-----cccccC-cceeeechhhhhccC-------HHHHHHHHHHHH--hcCCcCHHHHHHH-hCC- Q lcl|NC_019456. 314 ---YESQFNMKLFT-----PGKRVK-GFYFSFNVNGLLRGD-------TAARTQYYQTLT--RNGIFKPNEIREL-EGQ- 373 (435) Q Consensus 314 ---i~~~l~~~l~~-----~~~~~~-g~~i~fd~~~l~~~d-------~~~~~~~~~~~~--~~g~~t~NE~R~~-~g~- 373 (435) +.+.|...|+. +.+|.. ...|.|++..--... +..|++++..+- -+-.++.+=+|+. |.+ T Consensus 394 s~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~t 473 (537) T protein:vir:10 394 SELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQT 473 (537) T ss_pred HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccC Confidence 33333344332 233321 123444433111111 122333333221 0112344434332 222 Q ss_pred -----------------CCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 374 -----------------APIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 374 -----------------~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) +..++|..++- ...+.++..+..+..+.-+.+...+.+|..++|- T Consensus 474 DeeI~~~~k~I~~E~k~~~~~~p~~~~~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 535 (537) T protein:vir:10 474 ESEIKEIDKEIKQEIADGVIMDPQAMQA-----------------MEMGIGDEEPVPEGGEEPQTDPNSAVSPADQKRG 535 (537) T ss_pred HHHHHHHHHHHHHHhhCCCCCCcccccc-----------------cccCCCCcccCCCCCCCcccCCccCCCCCCccCC Confidence 11111111100 0000011111111111111111223344444444 No 226 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=93.77 E-value=0.0064 Score=32.60 Aligned_cols=397 Identities=13% Similarity=0.071 Sum_probs=166.5 Q ss_pred CchHHHHH--hhcc------------ccccccccccccchh--------hhhhcccccc-Cc---c--------cccHHH Q lcl|NC_019456. 1 MSFMSKVR--QFFG------------VHDQANQIVQNPIPQ--------PLDMAGVKLE-QA---T--------FSREHI 46 (435) Q Consensus 1 Mg~~~~~~--~~~~------------~~~~~~~~~~~~~~~--------~~~~~~~~~~-~~---~--------~~~~~~ 46 (435) |||++.+. .+|. ....+..+....+.. .+...|.... +. . .-..+. T Consensus 4 ~~~~~~l~~~~~~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~~~~~eLI~~YR~ 83 (524) T protein:vir:98 4 LGFGNVLSFFKNFAREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAIQNKEQLINTYRG 83 (524) T ss_pred cchhhHHHHhhhhhhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccccchHHHHHHHHHH Confidence 55555332 1111 111111111111000 0111221111 00 0 011244 Q ss_pred HhhhHHHHHHHHHHHHHHhhCc-----eeeeecccc-------cccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCc Q lcl|NC_019456. 47 LESNEYIFSIVTRLSNVLASLP-----LHEYQNYKQ-------MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNG 114 (435) Q Consensus 47 ~~~~~~v~~~i~~ia~~ia~~~-----~~~~~~~~~-------~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~ 114 (435) ...+|.|..||+-|.+.+.-+. +.+.-+..+ ...+.+.++++ =-+-...+ ..++..|.+.|.. T Consensus 84 ma~~pEvd~Av~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYVDgRi 158 (524) T protein:vir:98 84 IMSYPEVENAVSEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLN-IYDFDNMG----ARLFRDWYVDSRI 158 (524) T ss_pred HhhccchhhHHHhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhhhhccee Confidence 5678999999999998875443 222211111 11122333332 22233334 4455667788999 Q ss_pred ceEEeee--CCCCcEEEEEEeCCceeEEEE-----c-CCCc------eEEEEEe-------------cCCeeEEEchhhe Q lcl|NC_019456. 115 YAWIQKS--LSTGEPIALWPLDPNTVSILR-----N-TDNN------SYWYRVT-------------SDIYNFTIPINDV 167 (435) Q Consensus 115 ~~~i~~~--~~~g~~~~l~~l~~~~v~~~~-----~-~~~~------~~~~~~~-------------~~~~~~~~~~~~i 167 (435) |..++.+ ...| +.+|..|+|..++..+ + ..+. .-+|.+. .-+....++.+.| T Consensus 159 ~fhkiid~~~~kG-I~ELr~lDPr~i~~vr~~~~~~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI 237 (524) T protein:vir:98 159 YFHKIMHKDESKG-IRELRQLDPRCMELIRESITETLDGGVKVFRGYREFFVYSAPKAGYTYNGQIYQANQKIKIPRSAI 237 (524) T ss_pred EEEEEEcCCCCcc-eeeeeeeCCccceeeeeccccccccchhhccceeeeeeeccCCCccccccceecCCCceeechhhe Confidence 9988854 2333 8899999999886543 1 1121 1122222 1123467899999 Q ss_pred EEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHH-hhcCC--ceEEEeCCcCCHHHHHHHHHHHHHHhc------- Q lcl|NC_019456. 168 IHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNE-MEKKD--KFVLQYDRSISPEKRQAMVNDFLRMVK------- 237 (435) Q Consensus 168 ih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~-~~n~~--~~~~~~~~~~~~e~~~~~~~~~~~~~~------- 237 (435) .|.........-.-+|-|..+.+.+.....++...-.+ +...| +.+..--+.+....+++....+...++ T Consensus 238 vy~hSGL~d~~~~iisyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa 317 (524) T protein:vir:98 238 VYAHSGLEDCSNNIIGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDA 317 (524) T ss_pred eeeccCcccCCCCeeeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeec Confidence 99764321111112577888888887777766655433 23333 222222233433333333333323222 Q ss_pred ------CCCc-cccc----------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccc---CcccHHH Q lcl|NC_019456. 238 ------ENGG-AVVQ----------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQA---KSTTNVE 297 (435) Q Consensus 238 ------~~~~-~~vl----------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~---~~~~~~e 297 (435) +..+ ...| +.|.+++.|.-...--++.+. .+....+.++++||.+-|..... -+.+++= T Consensus 318 ~TGevrddrk~msMlEDyWLpRReGgrgTEItTLpggqnlgem~DV-~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EI 396 (524) T protein:vir:98 318 RTGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGGQNFSDMDDI-KWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEI 396 (524) T ss_pred cCceeeccccccchhhhhcccccCCCCccceeeccccCCcChHHHH-HHHHHHHHHHhCCCceeccCCCCccccccccch Confidence 2222 2222 246778877655443344444 45677799999999999964321 1222211 Q ss_pred HHH-HHHHHHHhHHH----HHHHHHHHHhhcccc-----cccC-cceeeechhhhhccC-------HHHHHHHHHHHHh- Q lcl|NC_019456. 298 HVT-HSWTMTLMPII----RQYESQFNMKLFTPG-----KRVK-GFYFSFNVNGLLRGD-------TAARTQYYQTLTR- 358 (435) Q Consensus 298 ~~~-~~~~~~i~P~~----~~i~~~l~~~l~~~~-----~~~~-g~~i~fd~~~l~~~d-------~~~~~~~~~~~~~- 358 (435) ... .-|...|..+- ..+.+.|..+|+... ++.. ...|.|++..--... +..+++++..+-. T Consensus 397 tRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpy 476 (524) T protein:vir:98 397 TRDELKFSKFIRTLQIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGV 476 (524) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccc Confidence 111 11333333332 334444444444322 2211 113444333211111 1223333333221 Q ss_pred cC-CcCHHHHHHH-hCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCC Q lcl|NC_019456. 359 NG-IFKPNEIREL-EGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGEN 421 (435) Q Consensus 359 ~g-~~t~NE~R~~-~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 (435) .| +++.+=+|+. |.+.-. + + ....+.... ....+ --..|..+-++. T Consensus 477 vGky~s~dyi~k~ILr~tDe--e----i---------~~~~k~I~~-E~k~~-~~~~p~~e~~~f 524 (524) T protein:vir:98 477 VGKYVSHKYIMKEILRMSDE--D----I---------DEQAKLIEE-ESKEE-RFKNPEAEEENF 524 (524) T ss_pred cccccchHHHHHHHhccCHH--H----H---------HHHHHHHHH-HHhCC-CCcCCccccccC Confidence 12 4444444442 333210 0 0 000000000 00000 000111111222 No 227 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=93.69 E-value=0.0067 Score=32.51 Aligned_cols=406 Identities=10% Similarity=0.038 Sum_probs=147.1 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhh----HHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESN----EYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) +.-|..++..+++...=........+.+ .+.....-.......++.. +++...++.++..+-+-|..+ T Consensus 46 ~~~W~~ird~~~G~~~~r~~g~~YLP~~---~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~----- 117 (535) T protein:vir:80 46 LPKWRKIMDCLSGQEAIKAKREEYLPMP---SVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMMGQVFSRDPIR----- 117 (535) T ss_pred HHHHHHHHHHhcChHHHHhcccccCCCC---CcccCCcCCHHHHHHHHhhccCCChhHHHHHHHhchhhcCCcce----- Confidence 3345555554443321111110011110 0000000001112223322 233334443333333323222 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc-------------EEEEEEeCCcee----- Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE-------------PIALWPLDPNTV----- 138 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~-------------~~~l~~l~~~~v----- 138 (435) . ....+..++..---...+-.+|.+.++...+.+|.++++|..... |. |. +..+.|..| T Consensus 118 ~-~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~-~~~~t~ade~~~~~rPy-~~~y~ae~IinW~~ 194 (535) T protein:vir:80 118 Q-LPPALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNV-GRPVTVLEQKLGLYRPT-ITLVHPTSIINWRT 194 (535) T ss_pred e-ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCC-CCcccHHHHHhcCCCcE-EEEechhhccCccc Confidence 1 122344444333446678999999999999999999998864322 21 11 111111111 Q ss_pred ------------------EEEEcCCCceE--EEEE-ec--CCee------------------EEEchh------heEEec Q lcl|NC_019456. 139 ------------------SILRNTDNNSY--WYRV-TS--DIYN------------------FTIPIN------DVIHVK 171 (435) Q Consensus 139 ------------------~~~~~~~~~~~--~~~~-~~--~~~~------------------~~~~~~------~iih~~ 171 (435) ....+.++... .|.+ .. +|.. ..++.+ ..|=|- T Consensus 195 ~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv 274 (535) T protein:vir:80 195 KLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNGNPFKEIPFQ 274 (535) T ss_pred cccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEEEEEEeecCCccccccceeecccCCCcccCeeEEE Confidence 01011111111 0000 00 0000 011111 011110 Q ss_pred --cCCCccccccCcHHHHHHH-HHHHHHHHHHHHHHHhhcC-CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccC Q lcl|NC_019456. 172 --HVVPSNSWYGVSPIDVLSS-SLKFQRSVENFSQNEMEKK-DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEA 247 (435) Q Consensus 172 --~~~~~~~~~G~s~l~~~~~-~i~~~~~~~~~~~~~~~n~-~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~ 247 (435) +....+...|.+|+..++. .+........+...++.-+ +..+++.-....++... + -....-+++..+.++. T Consensus 275 ~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~G~~~~~~~~~~---~-~~~i~iG~~~~~~lP~ 350 (535) T protein:vir:80 275 FIGPLDNNADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAFFTGLTKDWVEDVF---K-DFKVHLGSRAIIPLPQ 350 (535) T ss_pred EeecCCCCCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcCceeeeecCchhhhhcCC---C-CcceEecCcccccCCC Confidence 1112344567777665444 4555444444444444333 33344322111111000 0 0001123445667776 Q ss_pred CceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHH--HHHHhHHHHHHHHHHHHhhccc Q lcl|NC_019456. 248 GWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSW--TMTLMPIIRQYESQFNMKLFTP 325 (435) Q Consensus 248 g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~--~~~i~P~~~~i~~~l~~~l~~~ 325 (435) +.++.-+..++.-+.. +..+...+++++ .|. .++.....+-.+. +....+- ++.|.-++.++++.++..|-.- T Consensus 351 ~~~~~~~e~~~~~~a~-~~l~~~e~qM~~-lGa--~ll~~~~~~~Ta~-~a~~~~~~~~S~L~~~a~~le~al~~aL~~~ 425 (535) T protein:vir:80 351 GATAGILQITPNSVPF-EAMTHKESQMIA-MGA--NLLVKSGGNRTFG-EAQQEEASEQSILSACTKNVSMAFRKALRWA 425 (535) T ss_pred CCCcceeeeccchhHH-HHHHHHHHHHHH-HHH--HhhccCcccccHH-HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH Confidence 6555444444333332 222222333332 221 1221111111111 1111111 3445556666666665433211 Q ss_pred cccc------CcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCC---CcCCceeeecccccchhcc Q lcl|NC_019456. 326 GKRV------KGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIP---DEAADHLYISKDLYPLDKY 396 (435) Q Consensus 326 ~~~~------~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~---~~~gd~~~~~~n~~~l~~~ 396 (435) ..+. ....|..+.+-....-....+..+.++++.|.++....++.+..--+- .++-++.- -...+.. T Consensus 426 A~w~G~~~~~~~~~i~~n~dF~~~~ld~~~~~all~~~~~G~Is~et~~~~L~r~gvl~~~~~~eee~~----ri~~E~~ 501 (535) T protein:vir:80 426 NQFQTGIVNDETVEYNLNTDFPAARLTPNERAELILEWQQGAITFKEMRAGLRRAGVASEDDAKAETEG----KATVEFI 501 (535) T ss_pred HHHcCCccCCCceEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcccchHHHHH----HHHhhhh Confidence 1111 123355555544333223345666778899999999988877432221 11222110 0000000 Q ss_pred ccccccccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 397 YDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 397 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) . .....+...+.....++..+-+| ++..++..|. T Consensus 502 ~-~~~~~g~~~d~~~~g~~~~~~~~----~~~~~~~~~~ 535 (535) T protein:vir:80 502 A-KTAAAGKVGDAASGGTNKAKLNN----GNGGGNQAGN 535 (535) T ss_pred h-ccccCCCCCCCCCCCCCcCcccC----CccccccCCC Confidence 0 00000000011111122222222 3333333333 No 228 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=93.56 E-value=0.0071 Score=32.36 Aligned_cols=396 Identities=10% Similarity=0.054 Sum_probs=169.7 Q ss_pred CchHHHHHhhcccc---------ccccccc--cccchh--h------hhhccccc---cCc-cc-------ccHHHHhhh Q lcl|NC_019456. 1 MSFMSKVRQFFGVH---------DQANQIV--QNPIPQ--P------LDMAGVKL---EQA-TF-------SREHILESN 50 (435) Q Consensus 1 Mg~~~~~~~~~~~~---------~~~~~~~--~~~~~~--~------~~~~~~~~---~~~-~~-------~~~~~~~~~ 50 (435) |.+.+-++-|.... .+..++. ...+.. . ....|... ... .. -..+....+ T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 66654433221100 1111111 111100 0 00111100 000 00 112445678 Q ss_pred HHHHHHHHHHHHHHhhCc-----eeeeeccccc-------ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEE Q lcl|NC_019456. 51 EYIFSIVTRLSNVLASLP-----LHEYQNYKQM-------DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWI 118 (435) Q Consensus 51 ~~v~~~i~~ia~~ia~~~-----~~~~~~~~~~-------~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i 118 (435) |.|..||+.|.+.+.-+. +.+.-++.+. ..+.+.++++ =-+-...++ .++..|.+.|..|..+ T Consensus 81 pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~-ll~F~~~~~----~~fR~WYVDgRi~fhK 155 (516) T protein:vir:10 81 PEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCR-LLDASRKLD----TLFRRWYVDSRIFFHK 155 (516) T ss_pred cchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhhhhcceEEEEE Confidence 999999999998876554 2222121111 1122333332 222333444 4555667778988886 Q ss_pred eeeCCCCcEEEEEEeCCceeEEEE-----cCCCce------EEEEEec-------------CCeeEEEchhheEEeccC- Q lcl|NC_019456. 119 QKSLSTGEPIALWPLDPNTVSILR-----NTDNNS------YWYRVTS-------------DIYNFTIPINDVIHVKHV- 173 (435) Q Consensus 119 ~~~~~~g~~~~l~~l~~~~v~~~~-----~~~~~~------~~~~~~~-------------~~~~~~~~~~~iih~~~~- 173 (435) +.+....-+.+|..|+|..++.++ +.+|.. .+|.+.. .+....++.+-|.|.... T Consensus 156 iid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL 235 (516) T protein:vir:10 156 IMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGL 235 (516) T ss_pred EecCccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccc Confidence 666545558999999998876543 222211 1221111 112355666666665521 Q ss_pred CCccccccCcHHHHHHHHHHHHHHHHHHHHHH-hhcCC--ce-EEEeCCcCCHHHHHHHHHHHHHHhcC----------- Q lcl|NC_019456. 174 VPSNSWYGVSPIDVLSSSLKFQRSVENFSQNE-MEKKD--KF-VLQYDRSISPEKRQAMVNDFLRMVKE----------- 238 (435) Q Consensus 174 ~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~-~~n~~--~~-~~~~~~~~~~e~~~~~~~~~~~~~~~----------- 238 (435) ...+...-+|-|..+.+.+.....++...-.+ +...| +. -+-+ +.+....+++-...+...++| T Consensus 236 ~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDv-GnlPk~KAeqYl~~im~k~kNklvYDa~TGev 314 (516) T protein:vir:10 236 MDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDV-GNMNNRKATEYVNGIMQSLKNRVVYDSNTGTV 314 (516) T ss_pred eeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEec-CCCCchhHHHHHHHHHHhcCceeEEeCCCCee Confidence 12233333788888888888877776665433 23333 22 2222 233333333333333222222 Q ss_pred --CCc-cccc----------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc---cHHH-HHH- Q lcl|NC_019456. 239 --NGG-AVVQ----------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST---TNVE-HVT- 300 (435) Q Consensus 239 --~~~-~~vl----------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~---~~~e-~~~- 300 (435) ..+ ...| +.|.+++.|.-...--++.+. .+....+.++++||.+-|......+. .+.| ... T Consensus 315 ~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDE 393 (516) T protein:vir:10 315 KNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDDV-RWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDE 393 (516) T ss_pred ccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHH-HHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHH Confidence 111 1222 246777777655443343444 45677799999999999975544321 2222 121 Q ss_pred HHHHHHHhHHHHH----HHHHHHHhhcc-----cccccC-cceeeechhhhhccC-------HHHHHHHHHHHH--hcCC Q lcl|NC_019456. 301 HSWTMTLMPIIRQ----YESQFNMKLFT-----PGKRVK-GFYFSFNVNGLLRGD-------TAARTQYYQTLT--RNGI 361 (435) Q Consensus 301 ~~~~~~i~P~~~~----i~~~l~~~l~~-----~~~~~~-g~~i~fd~~~l~~~d-------~~~~~~~~~~~~--~~g~ 361 (435) .-|...|..+-.. +.+.|..+|+. +.++.. ...|.|++..--... +..+++++..+- -+.+ T Consensus 394 iKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky 473 (516) T protein:vir:10 394 LDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKY 473 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 1233334333333 33344444332 233322 123444433211111 223333333332 2345 Q ss_pred cCHHHHHHH-hCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCC Q lcl|NC_019456. 362 FKPNEIREL-EGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGEN 421 (435) Q Consensus 362 ~t~NE~R~~-~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 (435) ++.+=+|+. |.+.-- + . .. ..+........+ --..|..+ .+. T Consensus 474 ~s~~yi~k~ILr~tDe--e----i--~~-------e~k~I~~E~~~~--~~~~p~~~-~~f 516 (516) T protein:vir:10 474 VSHDYVMKNILQMTEE--Q----I--AQ-------EEKQIEQEAGIK--RFQNPENE-DDF 516 (516) T ss_pred cchHHHHHHHhcCCHh--h----H--HH-------HHHHHHHhhhCC--CCCCCCcc-ccC Confidence 566666653 444311 0 0 00 000000000000 00011111 122 No 229 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=93.56 E-value=0.0071 Score=32.36 Aligned_cols=396 Identities=10% Similarity=0.054 Sum_probs=169.7 Q ss_pred CchHHHHHhhcccc---------ccccccc--cccchh--h------hhhccccc---cCc-cc-------ccHHHHhhh Q lcl|NC_019456. 1 MSFMSKVRQFFGVH---------DQANQIV--QNPIPQ--P------LDMAGVKL---EQA-TF-------SREHILESN 50 (435) Q Consensus 1 Mg~~~~~~~~~~~~---------~~~~~~~--~~~~~~--~------~~~~~~~~---~~~-~~-------~~~~~~~~~ 50 (435) |.+.+-++-|.... .+..++. ...+.. . ....|... ... .. -..+....+ T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 66654433221100 1111111 111100 0 00111100 000 00 112445678 Q ss_pred HHHHHHHHHHHHHHhhCc-----eeeeeccccc-------ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEE Q lcl|NC_019456. 51 EYIFSIVTRLSNVLASLP-----LHEYQNYKQM-------DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWI 118 (435) Q Consensus 51 ~~v~~~i~~ia~~ia~~~-----~~~~~~~~~~-------~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i 118 (435) |.|..||+.|.+.+.-+. +.+.-++.+. ..+.+.++++ =-+-...++ .++..|.+.|..|..+ T Consensus 81 pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~-ll~F~~~~~----~~fR~WYVDgRi~fhK 155 (516) T protein:vir:10 81 PEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCR-LLDASRKLD----TLFRRWYVDSRIFFHK 155 (516) T ss_pred cchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHH-Hhccchhhh----HHHhhhhhcceEEEEE Confidence 999999999998876554 2222121111 1122333332 222333444 4555667778988886 Q ss_pred eeeCCCCcEEEEEEeCCceeEEEE-----cCCCce------EEEEEec-------------CCeeEEEchhheEEeccC- Q lcl|NC_019456. 119 QKSLSTGEPIALWPLDPNTVSILR-----NTDNNS------YWYRVTS-------------DIYNFTIPINDVIHVKHV- 173 (435) Q Consensus 119 ~~~~~~g~~~~l~~l~~~~v~~~~-----~~~~~~------~~~~~~~-------------~~~~~~~~~~~iih~~~~- 173 (435) +.+....-+.+|..|+|..++.++ +.+|.. .+|.+.. .+....++.+-|.|.... T Consensus 156 iid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL 235 (516) T protein:vir:10 156 IMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGL 235 (516) T ss_pred EecCccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccc Confidence 666545558999999998876543 222211 1221111 112355666666665521 Q ss_pred CCccccccCcHHHHHHHHHHHHHHHHHHHHHH-hhcCC--ce-EEEeCCcCCHHHHHHHHHHHHHHhcC----------- Q lcl|NC_019456. 174 VPSNSWYGVSPIDVLSSSLKFQRSVENFSQNE-MEKKD--KF-VLQYDRSISPEKRQAMVNDFLRMVKE----------- 238 (435) Q Consensus 174 ~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~-~~n~~--~~-~~~~~~~~~~e~~~~~~~~~~~~~~~----------- 238 (435) ...+...-+|-|..+.+.+.....++...-.+ +...| +. -+-+ +.+....+++-...+...++| T Consensus 236 ~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDv-GnlPk~KAeqYl~~im~k~kNklvYDa~TGev 314 (516) T protein:vir:10 236 MDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDV-GNMNNRKATEYVNGIMQSLKNRVVYDSNTGTV 314 (516) T ss_pred eeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEec-CCCCchhHHHHHHHHHHhcCceeEEeCCCCee Confidence 12233333788888888888877776665433 23333 22 2222 233333333333333222222 Q ss_pred --CCc-cccc----------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc---cHHH-HHH- Q lcl|NC_019456. 239 --NGG-AVVQ----------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST---TNVE-HVT- 300 (435) Q Consensus 239 --~~~-~~vl----------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~---~~~e-~~~- 300 (435) ..+ ...| +.|.+++.|.-...--++.+. .+....+.++++||.+-|......+. .+.| ... T Consensus 315 ~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDE 393 (516) T protein:vir:10 315 KNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDDV-RWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDE 393 (516) T ss_pred ccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHH-HHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHH Confidence 111 1222 246777777655443343444 45677799999999999975544321 2222 121 Q ss_pred HHHHHHHhHHHHH----HHHHHHHhhcc-----cccccC-cceeeechhhhhccC-------HHHHHHHHHHHH--hcCC Q lcl|NC_019456. 301 HSWTMTLMPIIRQ----YESQFNMKLFT-----PGKRVK-GFYFSFNVNGLLRGD-------TAARTQYYQTLT--RNGI 361 (435) Q Consensus 301 ~~~~~~i~P~~~~----i~~~l~~~l~~-----~~~~~~-g~~i~fd~~~l~~~d-------~~~~~~~~~~~~--~~g~ 361 (435) .-|...|..+-.. +.+.|..+|+. +.++.. ...|.|++..--... +..+++++..+- -+.+ T Consensus 394 iKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky 473 (516) T protein:vir:10 394 LDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKY 473 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 1233334333333 33344444332 233322 123444433211111 223333333332 2345 Q ss_pred cCHHHHHHH-hCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCC Q lcl|NC_019456. 362 FKPNEIREL-EGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGEN 421 (435) Q Consensus 362 ~t~NE~R~~-~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 (435) ++.+=+|+. |.+.-- + . .. ..+........+ --..|..+ .+. T Consensus 474 ~s~~yi~k~ILr~tDe--e----i--~~-------e~k~I~~E~~~~--~~~~p~~~-~~f 516 (516) T protein:vir:10 474 VSHDYVMKNILQMTEE--Q----I--AQ-------EEKQIEQEAGIK--RFQNPENE-DDF 516 (516) T ss_pred cchHHHHHHHhcCCHh--h----H--HH-------HHHHHHHhhhCC--CCCCCCcc-ccC Confidence 566666653 444311 0 0 00 000000000000 00011111 122 No 230 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=92.27 E-value=0.012 Score=31.09 Aligned_cols=402 Identities=13% Similarity=0.117 Sum_probs=163.2 Q ss_pred CchHHHHHhhc----ccccccccccccc-chhhhhhccc-----cccCc-cc-------ccHHHHhhhHHHHHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFF----GVHDQANQIVQNP-IPQPLDMAGV-----KLEQA-TF-------SREHILESNEYIFSIVTRLSN 62 (435) Q Consensus 1 Mg~~~~~~~~~----~~~~~~~~~~~~~-~~~~~~~~~~-----~~~~~-~~-------~~~~~~~~~~~v~~~i~~ia~ 62 (435) |+= |+.+- ....+..++..+. ........+. ..... .. -..+....+|.|..||+-|.+ T Consensus 1 m~~---lfg~~i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVn 77 (533) T protein:vir:10 1 MSQ---LFGFSLERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRYREMVLQPECDSAVDDIVN 77 (533) T ss_pred Ccc---ccccccccccccccCCCCCCCCcccccceeecccccceeeecccccchHHHHHHHHHHHhhccchhhHHHHhhc Confidence 432 22110 0111111111000 0000000000 00000 00 112445678999999999998 Q ss_pred HHhhCce-----eeeecccc-------cccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC--CCcEE Q lcl|NC_019456. 63 VLASLPL-----HEYQNYKQ-------MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS--TGEPI 128 (435) Q Consensus 63 ~ia~~~~-----~~~~~~~~-------~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~--~g~~~ 128 (435) .+.-+.- .+.-+..+ ...+.+.++++ =-+-...+ ..++..|.+.|..|..++.+.. ..-+. T Consensus 78 eaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~-ll~F~~~~----~e~fR~WYVDgRi~fHkiid~~~pk~GI~ 152 (533) T protein:vir:10 78 ETICGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILR-LLDFENRS----YEIFRRWYVDGRLFYHKVIDPDNPQGGLI 152 (533) T ss_pred ceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhhhhcceEEEEEEecCCCccccce Confidence 8776542 22212111 11122333332 22233334 4455566777888888776533 23488 Q ss_pred EEEEeCCceeEEEEc-----CCC-------------ceEEEEEecC------CeeEEEchhheEEeccC-CCccccccCc Q lcl|NC_019456. 129 ALWPLDPNTVSILRN-----TDN-------------NSYWYRVTSD------IYNFTIPINDVIHVKHV-VPSNSWYGVS 183 (435) Q Consensus 129 ~l~~l~~~~v~~~~~-----~~~-------------~~~~~~~~~~------~~~~~~~~~~iih~~~~-~~~~~~~G~s 183 (435) +|..|||..++.++. .++ ..-|+.+.+. +....++.+-|.+.... ...+.-.-+| T Consensus 153 ELr~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~~~~~~vkI~~dAI~y~hSGl~d~~~~~i~s 232 (533) T protein:vir:10 153 ELRYIDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKNSTTQGLKIAPDSICYVHSGIMDLNKNMTLS 232 (533) T ss_pred eeeeccccceeeeeeeeccCCCccceeecchhhhccceeeeeeccccccccCCCceecchhheeeeeccceeCCCCceec Confidence 999999988876431 111 1113333333 33456777555554421 1223334467 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-hhcCC--ce-EEEeCCcCCHHHHHHHHHHHHHHhcC-------------CCc-cccc Q lcl|NC_019456. 184 PIDVLSSSLKFQRSVENFSQNE-MEKKD--KF-VLQYDRSISPEKRQAMVNDFLRMVKE-------------NGG-AVVQ 245 (435) Q Consensus 184 ~l~~~~~~i~~~~~~~~~~~~~-~~n~~--~~-~~~~~~~~~~e~~~~~~~~~~~~~~~-------------~~~-~~vl 245 (435) -|..+.+.+.....++...-.+ +...| +. -+-+ +.+....+++....+...++| ..+ ...| T Consensus 233 yLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDV-GnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMl 311 (533) T protein:vir:10 233 HLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDV-GNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSML 311 (533) T ss_pred cchHhHHHHHhhHHHHhhHHHHhhhccccceEEEEec-CCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhH Confidence 8888888888777776655433 23333 22 2223 233333333333333222222 111 2222 Q ss_pred ----------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc-cHHH-HH-HHHHHHHHhHHHH Q lcl|NC_019456. 246 ----------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST-TNVE-HV-THSWTMTLMPIIR 312 (435) Q Consensus 246 ----------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~-~~~e-~~-~~~~~~~i~P~~~ 312 (435) ..|.+++.|.-...--++.+. .+....+.++++||.+-|.....-+. ...| .. ..-|...|..+-. T Consensus 312 EDyWLPRReGgrgTEItTLpGgqnLgem~DV-~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~ 390 (533) T protein:vir:10 312 EDFWLPRREGGRGTEITTLPGGQNLGELEDV-KYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFVARLRK 390 (533) T ss_pred hhhcccccCCCCccceeeccccCCcChHHHH-HHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHH Confidence 246777777655443343343 45677799999999999965433221 1112 11 1123333333333 Q ss_pred H----HHHHHHHhhcc-----cccccC-cceeeechhhhhccC-------HHHHHHHHHHH--HhcCCcCHHHHHHH-hC Q lcl|NC_019456. 313 Q----YESQFNMKLFT-----PGKRVK-GFYFSFNVNGLLRGD-------TAARTQYYQTL--TRNGIFKPNEIREL-EG 372 (435) Q Consensus 313 ~----i~~~l~~~l~~-----~~~~~~-g~~i~fd~~~l~~~d-------~~~~~~~~~~~--~~~g~~t~NE~R~~-~g 372 (435) . +.+.|...|+. +.++.. ...|.|++..--... +..|++++..+ +-+-.++.+=+|+. |. T Consensus 391 rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr 470 (533) T protein:vir:10 391 RFSELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLK 470 (533) T ss_pred HHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhc Confidence 3 33334444332 233322 123444433111111 22233333322 11123344444432 32 Q ss_pred C------------------CCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCC-CCCCCCCC Q lcl|NC_019456. 373 Q------------------APIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNEN-GLQSTEPE 433 (435) Q Consensus 373 ~------------------~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 433 (435) + +..++|..+ +.|... ...+..+|...++- .+.+..++ T Consensus 471 ~tDeei~~~~kqI~~E~k~~~~~~p~~~-------~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~ 527 (533) T protein:vir:10 471 QTDVEMKEIDKQIESEMESGIIADPAAE-------MDPAMA----------------AGDPDAGGAPAEEVAPEGPDPSD 527 (533) T ss_pred cCHHHHHHHHHHHHHHHhCCCCCCCcch-------hhHHhc----------------CCCCCcCCcccccCCCCCCCcch Confidence 2 222222111 000000 00111111111100 00000000 Q ss_pred CC Q lcl|NC_019456. 434 GS 435 (435) Q Consensus 434 ~~ 435 (435) -. T Consensus 528 ~~ 529 (533) T protein:vir:10 528 ER 529 (533) T ss_pred hh Confidence 00 No 231 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=91.66 E-value=0.015 Score=30.61 Aligned_cols=396 Identities=9% Similarity=0.053 Sum_probs=167.1 Q ss_pred CchHHHHHhhc-----------cccccccccccccchh--------hhhhcccccc-----Ccc-----c-ccHHHHhhh Q lcl|NC_019456. 1 MSFMSKVRQFF-----------GVHDQANQIVQNPIPQ--------PLDMAGVKLE-----QAT-----F-SREHILESN 50 (435) Q Consensus 1 Mg~~~~~~~~~-----------~~~~~~~~~~~~~~~~--------~~~~~~~~~~-----~~~-----~-~~~~~~~~~ 50 (435) |.+.+-++-|. +....+..+....+.. .+...|.... ... . ...+....+ T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYRQLTNN 80 (516) T ss_pred CCchHhcccccchhhHHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHHHHHHHhhhc Confidence 55544332211 1111111111111100 0001111100 000 0 112445678 Q ss_pred HHHHHHHHHHHHHHhhCc-----eeeeeccccc-------ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEE Q lcl|NC_019456. 51 EYIFSIVTRLSNVLASLP-----LHEYQNYKQM-------DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWI 118 (435) Q Consensus 51 ~~v~~~i~~ia~~ia~~~-----~~~~~~~~~~-------~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i 118 (435) |.|..||+.|.+.+.-+. +.+.-++.+. ..+.+.++++ =-+-...++ .++..|.+.|..|..+ T Consensus 81 pEvd~Av~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~-ll~F~~~~~----~~fR~WYVDgRi~fhK 155 (516) T protein:vir:10 81 PEVERAVANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICR-LLDASRKLD----TLFRRWYIDSRIFFHK 155 (516) T ss_pred cchhHHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhhh----HHHHhhhhcceEEEEE Confidence 999999999998876554 3332221111 1122333322 222333444 4555667778988886 Q ss_pred eeeCCCCcEEEEEEeCCceeEEEEc-----CC------CceEEEEEec-------C------CeeEEEchhheEEeccC- Q lcl|NC_019456. 119 QKSLSTGEPIALWPLDPNTVSILRN-----TD------NNSYWYRVTS-------D------IYNFTIPINDVIHVKHV- 173 (435) Q Consensus 119 ~~~~~~g~~~~l~~l~~~~v~~~~~-----~~------~~~~~~~~~~-------~------~~~~~~~~~~iih~~~~- 173 (435) +.+....-+.+|..|+|..+..++. .+ +...++.+.. + +....++.+-|.+.... T Consensus 156 iid~~k~GI~elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~daI~y~hSGl 235 (516) T protein:vir:10 156 IMPNPKEGIVELRRLDPRHVEYYREIVTSDVGGTSVVKGYREFFVYTTGNEGYAYNGRLFEPNTRIKIPRSAIVYAHSGL 235 (516) T ss_pred EecCcccceeeeeeeCCcceeeEEeeecccCcchhhhhceeeeeeeecCccceeccccccCCCCceecchhheeeeecCc Confidence 6665455689999999988765432 11 1111222211 1 12345555555554321 Q ss_pred CCccccccCcHHHHHHHHHHHHHHHHHHHHHH-hhcCC--ce-EEEeCCcCCHHHHHHHHHHHHHHhcC----------- Q lcl|NC_019456. 174 VPSNSWYGVSPIDVLSSSLKFQRSVENFSQNE-MEKKD--KF-VLQYDRSISPEKRQAMVNDFLRMVKE----------- 238 (435) Q Consensus 174 ~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~-~~n~~--~~-~~~~~~~~~~e~~~~~~~~~~~~~~~----------- 238 (435) .+.+.-.=+|-|..+.+.+.....++...-.+ +...| +. -+-+ +.+....+++....+...++| T Consensus 236 ~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDV-GnLPk~KAeqYl~~iM~k~KNklvYDa~TGev 314 (516) T protein:vir:10 236 QDCSDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDV-GNMPNRKATEYVNGIMQSLKNRVVYDSNTGTV 314 (516) T ss_pred ccCCCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEec-CCCCchhHHHHHHHHHHhcCceeEEeCCCCee Confidence 11222222677888888887777666654433 23333 22 2223 233333333333333222222 Q ss_pred --CCc-cccc----------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcc---cHHHH-HH- Q lcl|NC_019456. 239 --NGG-AVVQ----------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKST---TNVEH-VT- 300 (435) Q Consensus 239 --~~~-~~vl----------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~---~~~e~-~~- 300 (435) ..+ ...| +.|.+++.|.-...--++.+. .+....+.++++||.+-|......+. .+.|- .. T Consensus 315 ~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDE 393 (516) T protein:vir:10 315 KNQKRNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGEMDDV-RWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDE 393 (516) T ss_pred ccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHH-HHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHH Confidence 111 1222 246777777655443344444 45677799999999999975544321 22221 11 Q ss_pred HHHH---HHHhH-HHHHHHHHHHHhhcc-----cccccC-cceeeechhhhhccC-------HHHHHHHHHHHH--hcCC Q lcl|NC_019456. 301 HSWT---MTLMP-IIRQYESQFNMKLFT-----PGKRVK-GFYFSFNVNGLLRGD-------TAARTQYYQTLT--RNGI 361 (435) Q Consensus 301 ~~~~---~~i~P-~~~~i~~~l~~~l~~-----~~~~~~-g~~i~fd~~~l~~~d-------~~~~~~~~~~~~--~~g~ 361 (435) .-|. ..|+- +...+.+.|.++|+. +.++.. ...|.|++..--... +..+++++..+- -+.+ T Consensus 394 iKF~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky 473 (516) T protein:vir:10 394 LDFRKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKY 473 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 1233 33332 233445555554443 333322 113444433211111 223333333332 2345 Q ss_pred cCHHHHHHH-hCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCC Q lcl|NC_019456. 362 FKPNEIREL-EGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGEN 421 (435) Q Consensus 362 ~t~NE~R~~-~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 (435) ++.+=+|+. |.+.-- + . ...-. .+.+.... + --..|..+ .+. T Consensus 474 ~s~~yi~k~ILr~tDe--e----i--~~~~k---~I~~E~~~--~----~~~~p~~e-~~f 516 (516) T protein:vir:10 474 VSHDYVMKNILQMTDE--Q----I--AQEEK---QIEKEANV--K----RFQNPENE-DDF 516 (516) T ss_pred cchHHHHHHHhcCCHh--H----H--HHHHH---HHHHhhhC--C----CCCCCCcc-ccC Confidence 566666653 444311 0 0 00000 00000000 0 00011111 111 No 232 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=90.63 E-value=0.02 Score=29.92 Aligned_cols=397 Identities=11% Similarity=0.075 Sum_probs=162.1 Q ss_pred CchHHHHHhhccccc---------cccc--cccccchh---------hhh-hccccccCcc----c-------ccHHHHh Q lcl|NC_019456. 1 MSFMSKVRQFFGVHD---------QANQ--IVQNPIPQ---------PLD-MAGVKLEQAT----F-------SREHILE 48 (435) Q Consensus 1 Mg~~~~~~~~~~~~~---------~~~~--~~~~~~~~---------~~~-~~~~~~~~~~----~-------~~~~~~~ 48 (435) ...++.++.|++... +..+ +....+.. +.. .+|....... . -..+... T Consensus 2 ~~~l~~~~~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~ma 81 (521) T protein:vir:81 2 FSRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGLM 81 (521) T ss_pred cchhhhhHhhcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHHHHh Confidence 111222222222111 1111 11111100 000 0111110000 0 1123456 Q ss_pred hhHHHHHHHHHHHHHHhhCc-----eeeeeccccc-------ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcce Q lcl|NC_019456. 49 SNEYIFSIVTRLSNVLASLP-----LHEYQNYKQM-------DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYA 116 (435) Q Consensus 49 ~~~~v~~~i~~ia~~ia~~~-----~~~~~~~~~~-------~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~ 116 (435) .+|.|..||+-|.+.+.-+. +.+.-+..+. ..+.+.++++ =-+-...+ ..++..|.+.|..|. T Consensus 82 ~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYVDgRi~f 156 (521) T protein:vir:81 82 NNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLN-TIQFDRRG----QDMFRRWYVDSRIFF 156 (521) T ss_pred hccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhhhhcceEEE Confidence 78999999999998876554 2222111111 1122333332 22233334 445566778899999 Q ss_pred EEeeeCC-CCcEEEEEEeCCceeEEEEc-----CCC------ceEEEEEecC-------------CeeEEEchhheEEec Q lcl|NC_019456. 117 WIQKSLS-TGEPIALWPLDPNTVSILRN-----TDN------NSYWYRVTSD-------------IYNFTIPINDVIHVK 171 (435) Q Consensus 117 ~i~~~~~-~g~~~~l~~l~~~~v~~~~~-----~~~------~~~~~~~~~~-------------~~~~~~~~~~iih~~ 171 (435) .++.+.. ..-+.+|..|+|..++.++. ..+ ..-+|.+..+ +....++.+-|.+.. T Consensus 157 hkiid~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~h 236 (521) T protein:vir:81 157 HKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYAH 236 (521) T ss_pred EEEEcCCccccceeeeeeCCcceeeeeeecccccCccceecceeeeeeeecCCccccccceeecCCcceeechhheeeee Confidence 8885422 23388999999988765431 111 1112222221 223456655555544 Q ss_pred cC-CCccccccCcHHHHHHHHHHHHHHHHHHHHHH-hhcCC--ce-EEEeCCcCCHHHHHHHHHHHHHHhcC-------- Q lcl|NC_019456. 172 HV-VPSNSWYGVSPIDVLSSSLKFQRSVENFSQNE-MEKKD--KF-VLQYDRSISPEKRQAMVNDFLRMVKE-------- 238 (435) Q Consensus 172 ~~-~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~-~~n~~--~~-~~~~~~~~~~e~~~~~~~~~~~~~~~-------- 238 (435) .. ...++-.-+|-|..+.+.+.....++...-.+ +...| +. -+-+ +.+....+++....+...++| T Consensus 237 SGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDv-Gnlpk~KAeqYl~~im~k~kNklvYDa~T 315 (521) T protein:vir:81 237 SGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDT-GNMNNRKAAQHMNSVAQSFKNRVVYDAST 315 (521) T ss_pred ccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEec-CCCCchhHHHHHHHHHHhcCceeEeeccc Confidence 21 12233334678888888888777776665433 23333 22 2223 333333333333333222222 Q ss_pred -----CCc-cccc----------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCccc---HHH-H Q lcl|NC_019456. 239 -----NGG-AVVQ----------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTT---NVE-H 298 (435) Q Consensus 239 -----~~~-~~vl----------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~---~~e-~ 298 (435) ..+ +..| +.|.+++.|.-...--++.+. .+....+.++++||.+-|+....+..+ ..| . T Consensus 316 Gev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EIt 394 (521) T protein:vir:81 316 GKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDI-RYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEIT 394 (521) T ss_pred ccccccccccchhhhhcccccCCCcccceeecccCCCCChHHHH-HHHHHHHHHHhCCccccccCCCCcceeccccchhh Confidence 222 2222 246778777654433343343 456777999999999999644432211 112 1 Q ss_pred HH-HHHHHHHhHHH----HHHHHHHHHhhcccc-----cccC-cceeeechhhhhccC-------HHHHHHHHHHHHh-c Q lcl|NC_019456. 299 VT-HSWTMTLMPII----RQYESQFNMKLFTPG-----KRVK-GFYFSFNVNGLLRGD-------TAARTQYYQTLTR-N 359 (435) Q Consensus 299 ~~-~~~~~~i~P~~----~~i~~~l~~~l~~~~-----~~~~-g~~i~fd~~~l~~~d-------~~~~~~~~~~~~~-~ 359 (435) .. .-|...|..+- ..+.+.|..+|+... ++.. ...|.|++..--... +..+++++..+-. . T Consensus 395 RDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyv 474 (521) T protein:vir:81 395 RDELEFSKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYI 474 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhh Confidence 11 11333333333 334444444444322 2211 113444333211111 1223333332211 1 Q ss_pred C-CcCHHHHHHH-hCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCC Q lcl|NC_019456. 360 G-IFKPNEIREL-EGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGE 420 (435) Q Consensus 360 g-~~t~NE~R~~-~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (435) | .++.+=+++. |.+.-. + +....+... +....+--..+....++. T Consensus 475 Gky~s~dyi~k~ILr~tDe--e-------------i~~~~k~I~-~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:81 475 GKYFSNQTVMRDILKYTDD--Q-------------MDTEKKQIE-EEANDPRFKQTPDEIEDF 521 (521) T ss_pred ccccchHHHHHHHhccCHH--H-------------HHHHHHHHH-HHhhCCCCCCCcccccCC Confidence 1 2344444432 333210 0 000000000 000000001111122222 No 233 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=90.07 E-value=0.023 Score=29.58 Aligned_cols=397 Identities=11% Similarity=0.062 Sum_probs=169.7 Q ss_pred Cch-HHHHHhhccc------------cccccccccccchh---------hhhhccccccC-------c-c---cccHHHH Q lcl|NC_019456. 1 MSF-MSKVRQFFGV------------HDQANQIVQNPIPQ---------PLDMAGVKLEQ-------A-T---FSREHIL 47 (435) Q Consensus 1 Mg~-~~~~~~~~~~------------~~~~~~~~~~~~~~---------~~~~~~~~~~~-------~-~---~~~~~~~ 47 (435) |.| +-.++.++.. ...+..+....+.. .+...+..... . . .-..+.. T Consensus 1 m~~~~l~lf~f~~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~m 80 (521) T protein:vir:10 1 MNPIFLKLLQPWMKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSL 80 (521) T ss_pred CCcchhHHhhhhhhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHHH Confidence 554 1122222211 11111111111100 00001100000 0 0 0112445 Q ss_pred hhhHHHHHHHHHHHHHHhhCc-----eeeeeccccc-------ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcc Q lcl|NC_019456. 48 ESNEYIFSIVTRLSNVLASLP-----LHEYQNYKQM-------DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGY 115 (435) Q Consensus 48 ~~~~~v~~~i~~ia~~ia~~~-----~~~~~~~~~~-------~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~ 115 (435) ..+|.|..||+.|.+.+.-+. +.+.-++.+. ..+.+.++++ =-+-...+ ..++..|.+.|..| T Consensus 81 a~~pEvd~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYVDgRi~ 155 (521) T protein:vir:10 81 SKYHEVDNAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILK-LLKFEREG----KRHFRRWYVDSRIY 155 (521) T ss_pred hhccchhhHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhheeeeeEE Confidence 678999999999998877654 2222111111 1122333332 22233334 45566677889998 Q ss_pred eEEeeeCCC--CcEEEEEEeCCceeEEEEc-----CCC------ceEEEEEec--------C---CeeEEEchhheEEec Q lcl|NC_019456. 116 AWIQKSLST--GEPIALWPLDPNTVSILRN-----TDN------NSYWYRVTS--------D---IYNFTIPINDVIHVK 171 (435) Q Consensus 116 ~~i~~~~~~--g~~~~l~~l~~~~v~~~~~-----~~~------~~~~~~~~~--------~---~~~~~~~~~~iih~~ 171 (435) ..++.+... .-+.+|..|+|..++..+. .++ ..-+|.+.+ + +....++.+.|.|.. T Consensus 156 fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~~g~~~~~vkI~~daI~y~h 235 (521) T protein:vir:10 156 FHKMIDPARPKDGIKELRLLDPRNVEYYRVNLKSNENGNDVYKGVKEFFTYGATEDNRYNISGNSNNLVQIPIDAIVYSH 235 (521) T ss_pred EEEEeeCCCccccceeeeeeCCcceeeeeeecCCCCCcchhhccceeeeeeccCCCceecCCCCCCcceeechhheeeec Confidence 887765332 2388999999998854431 111 111222221 1 123568887776665 Q ss_pred c-CCCccccccCcHHHHHHHHHHHHHHHHHHHHHH-hhcCC--ce-EEEeCCcCCHHHHHHHHHHHHHHhcC-------- Q lcl|NC_019456. 172 H-VVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNE-MEKKD--KF-VLQYDRSISPEKRQAMVNDFLRMVKE-------- 238 (435) Q Consensus 172 ~-~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~-~~n~~--~~-~~~~~~~~~~e~~~~~~~~~~~~~~~-------- 238 (435) . ....+..+.+|-|..+.+.+.....++...-.+ +...| +. -+-++ .+....+++....+....+| T Consensus 236 SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvG-nlpk~KAeqYl~~iM~k~kNklVYDa~T 314 (521) T protein:vir:10 236 SGKVDIDGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVG-TMPNKKATQHLNNVMQGLKNRVVYDSST 314 (521) T ss_pred ccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecC-CCCchhHHHHHHHHHHhcCceEEEeccC Confidence 3 223456778899999999988887777665533 23333 22 22232 33333333333333222222 Q ss_pred -----CCc-cccc----------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccc---CcccHHHHH Q lcl|NC_019456. 239 -----NGG-AVVQ----------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQA---KSTTNVEHV 299 (435) Q Consensus 239 -----~~~-~~vl----------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~---~~~~~~e~~ 299 (435) ..+ ...| +.|.+++.|.-...--++.+. .+....+.++++||.+-|..... -+.+++=.. T Consensus 315 Gev~ddrk~msMlEDyWLpRReGgrgTEI~TLpggqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~EItR 393 (521) T protein:vir:10 315 GKVKNSSNNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGEMDDV-RWFNRKLYESMKIPLSRLPQEGAGVTFGAGNDITR 393 (521) T ss_pred ceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHH-HHHHHHHHHHhCCCccccCCCCCceecccccchhH Confidence 111 1222 246777777655443343444 45677799999999999965422 112221111 Q ss_pred -HHHHHHHHhHHHHH----HHHHHHHhhcc-----cccccC-cceeeechhhhhccC-------HHHHHHHHHHHH---- Q lcl|NC_019456. 300 -THSWTMTLMPIIRQ----YESQFNMKLFT-----PGKRVK-GFYFSFNVNGLLRGD-------TAARTQYYQTLT---- 357 (435) Q Consensus 300 -~~~~~~~i~P~~~~----i~~~l~~~l~~-----~~~~~~-g~~i~fd~~~l~~~d-------~~~~~~~~~~~~---- 357 (435) ..-|...|..+-.. +.+.|..+|+. +.++.. ...|.|++..--... +..+++++..+- T Consensus 394 DEikF~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~y 473 (521) T protein:vir:10 394 DELQFTKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEV 473 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccc Confidence 11233333333333 33344444332 233221 123444433211111 233444444431 Q ss_pred hcCCcCHHHHHHH-hCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCC Q lcl|NC_019456. 358 RNGIFKPNEIREL-EGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGEN 421 (435) Q Consensus 358 ~~g~~t~NE~R~~-~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 (435) -+-+++.+=+++. |.+.-. + + ...+. .... ....+ --..|..+-++. T Consensus 474 vGky~s~dyi~k~ILr~tDe--e----i------k~~~k---~I~~-E~~~~-~~~~p~~e~~df 521 (521) T protein:vir:10 474 TGKYLSHEYVMKNILRMSDE--D----I------KTERE---KIDG-ELKDS-VYKNPEDPMEEF 521 (521) T ss_pred cccccchHHHHHHHhcCCHh--H----H------HHHHH---HHHH-hhhCC-CCCCCcchhhcC Confidence 1224455555543 443311 0 0 00000 0000 00000 000111111222 No 234 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=89.83 E-value=0.024 Score=29.45 Aligned_cols=396 Identities=11% Similarity=0.147 Sum_probs=149.1 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccH----HHHh--hhHHHHHHHHHHHHHHhhCce---ee Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSRE----HILE--SNEYIFSIVTRLSNVLASLPL---HE 71 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~--~~~~v~~~i~~ia~~ia~~~~---~~ 71 (435) =||...+.+-..... ........+| .. +..+.+. -.|. ..+.|+...+ .|-++|- ++ T Consensus 47 ~gfv~~~~~ng~i~~----v~~~~l~~~f---~n---pd~~~~~i~~l~~y~yi~~~~v~ql~~----li~~lp~l~y~i 112 (525) T protein:vir:10 47 DGFVMDLCNNGKIKT----VNLDTLQLWF---NN---PDKYINNIVNLLTYYYIIDGNVFQLYD----LIFSLPPLDYQI 112 (525) T ss_pred HHHHHHhhcCCceee----eeHHHHHhhh---cC---hHHHHHHHHHHHHHhhhhcchHHHHHH----HHHhcCCcceee Confidence 222222221100000 0000111111 00 0000000 0111 1122333333 3445553 22 Q ss_pred --eecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcce--------------------EEeeeCCCCcEEE Q lcl|NC_019456. 72 --YQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYA--------------------WIQKSLSTGEPIA 129 (435) Q Consensus 72 --~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~--------------------~i~~~~~~g~~~~ 129 (435) ..+.+.-..| ..+++..-..-.-..++-+.+..++...|.-.. ++-.+ ..|..+. T Consensus 113 ~~~~~~k~~~~~--~s~~n~~l~k~i~hk~ltrdll~q~a~~gtlig~wlg~~~~py~~vf~~~kyvfp~~r-~~g~~v~ 189 (525) T protein:vir:10 113 KVLKRDKDYKED--LSTINLYLEKKIQHKQLTRDLLVQLAHSGTLIGTWLGSKREPYFNVFNNLKYVFPYGR-AKGKMVA 189 (525) T ss_pred hhhhhccchhhH--HHHHHHHHHHhHHHHHHHHHHHHHhhccCceeEeeecCCCCcchhhhhhhhhhccccc-cCCceEE Confidence 2222222221 222332222222233444444444444443111 11111 1122221 Q ss_pred EEEeCCceeEEEEc---------------CCCceEEEEEecCC----eeEEEchhheEEeccCCCcc-ccccCcHHHHHH Q lcl|NC_019456. 130 LWPLDPNTVSILRN---------------TDNNSYWYRVTSDI----YNFTIPINDVIHVKHVVPSN-SWYGVSPIDVLS 189 (435) Q Consensus 130 l~~l~~~~v~~~~~---------------~~~~~~~~~~~~~~----~~~~~~~~~iih~~~~~~~~-~~~G~s~l~~~~ 189 (435) . ++-...+...+ ......+..+.+.. ....+|-+.++|.|...+.. .-.|.|...+.. T Consensus 190 v--id~~~f~~~~~~~r~~~~~~lsp~i~~~~y~~~~~~~~~~~~~~r~i~LP~e~t~~lr~~tl~rnqrlG~s~vtp~l 267 (525) T protein:vir:10 190 V--IDLQWFDEMSELERKLTFENLSPLITENKYKKWKEYNGENEDALRYIMLPISKTLVARIHTLSRNQRLGIPYGTQTL 267 (525) T ss_pred E--EehHHhhhhhHHHHHHHHHhhchhhhhhhhhHHhhcccccchhheeeecccceeEEeeecccccCcccCcchhhhHH Confidence 1 11111110000 00000011111111 12467889999999765443 334888888777 Q ss_pred HHHHHHHHHHHHHHHHhhc--CCceEEEeCCcC------CHHHHHHHHHHHHHH----hcCCCcccc--ccCCceee--e Q lcl|NC_019456. 190 SSLKFQRSVENFSQNEMEK--KDKFVLQYDRSI------SPEKRQAMVNDFLRM----VKENGGAVV--QEAGWKVD--R 253 (435) Q Consensus 190 ~~i~~~~~~~~~~~~~~~n--~~~~~~~~~~~~------~~e~~~~~~~~~~~~----~~~~~~~~v--l~~g~~~~--~ 253 (435) ..|.......+...+.... .+..++++.++. .+...+++.+..+.+ ++...++.+ ++.=+.++ . T Consensus 268 ~dI~hk~klrd~EqsIA~kii~a~avLk~gg~~gn~mk~p~~~kqkil~gVk~aleK~~kdK~Gi~vi~~Pdfa~~efp~ 347 (525) T protein:vir:10 268 FDIQHKQKLRDLEQSIADKIIKAMAVLKFRGKDDNDSKVKESAKRKVLAGVKRALEKGVKDKNGIACIAMPDFATFEFPE 347 (525) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhheeeeeccccCccccCchHHHHHHHHHHHHHHhcccccccCeEEEeccceeeccccc Confidence 7777655544443333322 234556655442 222334443333333 333334444 23323333 2 Q ss_pred c-----cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHH-HHHHHHHHHHhHHHHHHH---HHHHHhhcc Q lcl|NC_019456. 254 Y-----ESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVE-HVTHSWTMTLMPIIRQYE---SQFNMKLFT 324 (435) Q Consensus 254 ~-----~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e-~~~~~~~~~i~P~~~~i~---~~l~~~l~~ 324 (435) + +..++ .+ +...++|-.++|++.+++++... +++++. .+.-|| ..|.=.++.|+ +.|-...|+ T Consensus 348 ik~~~~glDg~--K~----d~I~~DI~~A~GlS~sL~nGdgg-NyAtaslnld~fy-kkigVm~e~Iee~y~kL~d~Vl~ 419 (525) T protein:vir:10 348 IKNGDKTLDPK--KY----DSIDNDITNATGISQVLTNGTKG-NYASAKLNLDVFY-KKIGVMLEIIEEIYNQLIDIILG 419 (525) T ss_pred ccCcccCCCch--hh----hhhhhhhhhhhccceeeecCCCC-ceeeeeeeHHHHH-HHHHHHHHHHHHHHHHHHhhhcC Confidence 2 22222 12 23457899999999999987654 455443 334444 34444555555 333333333 Q ss_pred cccccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccch-----hccccc Q lcl|NC_019456. 325 PGKRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPL-----DKYYDA 399 (435) Q Consensus 325 ~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l-----~~~~~~ 399 (435) . ..+--+.|+++.-...+.+++.+.+-++...|+. .--+....|+.-- .++-..++..- +++..+ T Consensus 420 ~---~k~~nyifnydkd~pi~~kkk~d~LIkL~d~g~s-~k~vldl~gis~e------~y~E~s~yEtE~lkl~EKi~pp 489 (525) T protein:vir:10 420 E---EKGCNYIFQYNKDTPIEREKKLDTLIKLEAQGYS-AKYVLDILGISSE------EYFEESIYEIEKLKLREKIMPP 489 (525) T ss_pred c---ccCcceEEecCCCchhhhhhhhhhhhhhhccchh-hhhhhhhhccCcc------hHHHHHHHHHHHHHHhhhcccc Confidence 2 2233344666666677888888899999988874 3334445554321 11111111100 000000 Q ss_pred cccc--cccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 400 ILDN--KIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 400 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) .... .+........|..++.++++. .-++-+. |- T Consensus 490 ~~~~v~SGk~~n~iG~P~~dd~~~~da-ti~s~~~-~~ 525 (525) T protein:vir:10 490 LNTNVLSGKDGNDIGSPKLDDSDSSDA-TIESKER-GV 525 (525) T ss_pred ccceeeeccccccccCCccCCCcchhh-hhhhhhc-CC Confidence 0000 001111111122222111110 0000000 00 No 235 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=88.06 E-value=0.035 Score=28.58 Aligned_cols=397 Identities=11% Similarity=0.057 Sum_probs=163.1 Q ss_pred CchHHHHHh-----hccccccccccccccch---------hhhh-hccccccCcc----c-------ccHHHHhhhHHHH Q lcl|NC_019456. 1 MSFMSKVRQ-----FFGVHDQANQIVQNPIP---------QPLD-MAGVKLEQAT----F-------SREHILESNEYIF 54 (435) Q Consensus 1 Mg~~~~~~~-----~~~~~~~~~~~~~~~~~---------~~~~-~~~~~~~~~~----~-------~~~~~~~~~~~v~ 54 (435) |++|.+... -+.....+..+....+. .+.. ..|....... . -..+....+|.|. T Consensus 8 ~~~~~~~d~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd 87 (521) T protein:vir:65 8 LARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVE 87 (521) T ss_pred hhhccCchhhHHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHHhhccchh Confidence 444322211 01111111111111000 0000 0111111100 0 1123456789999 Q ss_pred HHHHHHHHHHhhCc-----eeeeeccccc-------ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeC Q lcl|NC_019456. 55 SIVTRLSNVLASLP-----LHEYQNYKQM-------DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSL 122 (435) Q Consensus 55 ~~i~~ia~~ia~~~-----~~~~~~~~~~-------~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~ 122 (435) .||+-|.+.+.-+. +.+.-+..+. ..+.+.++++ =-+-...+ ..++..|.+.|..|..++.+. T Consensus 88 ~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYVDgRi~fhkiid~ 162 (521) T protein:vir:65 88 NAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLN-TIQFDRRG----QDMFRRWYVDSRIFFHKIIGK 162 (521) T ss_pred hHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhhhhcceeEEEEEEcC Confidence 99999998876554 2222111111 1122333332 22233334 445566778899999888542 Q ss_pred C-CCcEEEEEEeCCceeEEEEc-----CCC------ceEEEEEec-------------CCeeEEEchhheEEeccC-CCc Q lcl|NC_019456. 123 S-TGEPIALWPLDPNTVSILRN-----TDN------NSYWYRVTS-------------DIYNFTIPINDVIHVKHV-VPS 176 (435) Q Consensus 123 ~-~g~~~~l~~l~~~~v~~~~~-----~~~------~~~~~~~~~-------------~~~~~~~~~~~iih~~~~-~~~ 176 (435) . ..-+.+|..|+|..++.++. ..+ ..-+|.+.. .+....++.+-|.+.... ... T Consensus 163 ~pk~GI~ELr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~hSGl~d~ 242 (521) T protein:vir:65 163 NPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYAHSGLMDC 242 (521) T ss_pred CccccceeeeeeCCcceeeeeeecccccCCcceecceeeeeeeecCCcceeccceeecCCcceeechhheeeeeccceeC Confidence 2 23388999999988775431 111 111222221 122345665555554421 122 Q ss_pred cccccCcHHHHHHHHHHHHHHHHHHHHHH-hhcCC--ce-EEEeCCcCCHHHHHHHHHHHHHHhcC-------------C Q lcl|NC_019456. 177 NSWYGVSPIDVLSSSLKFQRSVENFSQNE-MEKKD--KF-VLQYDRSISPEKRQAMVNDFLRMVKE-------------N 239 (435) Q Consensus 177 ~~~~G~s~l~~~~~~i~~~~~~~~~~~~~-~~n~~--~~-~~~~~~~~~~e~~~~~~~~~~~~~~~-------------~ 239 (435) ++-.-+|-|..+.+.+.....++...-.+ +...| +. -+-+ +.+....+++....+...++| . T Consensus 243 ~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDv-GnlPk~KAeqYl~~im~k~kNklvYDa~TGev~dd 321 (521) T protein:vir:65 243 DDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDT-GNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQ 321 (521) T ss_pred CCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEec-CCCCchhHHHHHHHHHHhcCceeEeeccccccccc Confidence 33344688888888888777776665433 23333 22 2223 333333333333333222222 2 Q ss_pred Cc-cccc----------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCccc---HHH-HHH-HHH Q lcl|NC_019456. 240 GG-AVVQ----------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTT---NVE-HVT-HSW 303 (435) Q Consensus 240 ~~-~~vl----------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~---~~e-~~~-~~~ 303 (435) .+ +..| +.|.+++.|.-...--++.+. .+....+.++++||.+-++....+..+ ..| ... .-| T Consensus 322 rk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF 400 (521) T protein:vir:65 322 QANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDI-RYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEF 400 (521) T ss_pred ccccchhhhhcccccCCCCccceeecccCCCcChHHHH-HHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHH Confidence 22 2222 246778777654433343343 456777999999999998655443221 112 111 123 Q ss_pred HHHHhHHH----HHHHHHHHHhhcccc-----cccC-cceeeechhhhhccC-------HHHHHHHHHHHHh-c-CCcCH Q lcl|NC_019456. 304 TMTLMPII----RQYESQFNMKLFTPG-----KRVK-GFYFSFNVNGLLRGD-------TAARTQYYQTLTR-N-GIFKP 364 (435) Q Consensus 304 ~~~i~P~~----~~i~~~l~~~l~~~~-----~~~~-g~~i~fd~~~l~~~d-------~~~~~~~~~~~~~-~-g~~t~ 364 (435) ...|..+- ..+.+.|..+|+... ++.. ...|.|++..--... +..+++++..+-. . -.++. T Consensus 401 ~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~ 480 (521) T protein:vir:65 401 SKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSN 480 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccch Confidence 33333333 334444444444322 2211 113444333211111 1223333332211 1 13344 Q ss_pred HHHHHH-hCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCC Q lcl|NC_019456. 365 NEIREL-EGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGE 420 (435) Q Consensus 365 NE~R~~-~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (435) +=+++. |.+.-. + +....+... +....+--..++..+++. T Consensus 481 dyi~k~ILr~tDe--e-------------i~~~~k~I~-~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:65 481 QTVMRDILKYTDD--Q-------------MDTEKKQIE-EEANDPRFKQTPDEIEDF 521 (521) T ss_pred HHHHHHHhccCHH--H-------------HHHHHHHHH-HhhhCCCCCCCcccccCC Confidence 444442 333210 0 000000000 000000001111122222 No 236 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=87.49 E-value=0.038 Score=28.34 Aligned_cols=397 Identities=11% Similarity=0.053 Sum_probs=162.6 Q ss_pred Cch--HHHHHhhccccc------------cccccccccch-hhh---------hhcccccc-Ccc-----------cccH Q lcl|NC_019456. 1 MSF--MSKVRQFFGVHD------------QANQIVQNPIP-QPL---------DMAGVKLE-QAT-----------FSRE 44 (435) Q Consensus 1 Mg~--~~~~~~~~~~~~------------~~~~~~~~~~~-~~~---------~~~~~~~~-~~~-----------~~~~ 44 (435) |.| ++.++ +|+... .+..+....+. ... ...|.... .+. .-.. T Consensus 1 m~~~~L~~~~-~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~Y 79 (524) T protein:vir:72 1 MKFNVLSLFA-PWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTY 79 (524) T ss_pred CCCchhhHhh-ccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHH Confidence 777 33332 222111 01111111000 000 00111000 000 0112 Q ss_pred HHHhhhHHHHHHHHHHHHHHhhCc-----eeeeecccc-------cccchHHHhhhccccccCCHHHHHHHHHHHHHhcC Q lcl|NC_019456. 45 HILESNEYIFSIVTRLSNVLASLP-----LHEYQNYKQ-------MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSG 112 (435) Q Consensus 45 ~~~~~~~~v~~~i~~ia~~ia~~~-----~~~~~~~~~-------~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G 112 (435) +..+.+|.|..||+.|.+.+.-+. +.+.-++.+ ...+.+.++++ --+-...+ ..++..|.+.| T Consensus 80 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYVDg 154 (524) T protein:vir:72 80 RNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDVLN-HLSFQRKG----SDHFRRWYVDS 154 (524) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhheeee Confidence 445678999999999998876554 222211111 11122333332 22233334 45566677889 Q ss_pred CcceEEeeeCC--CCcEEEEEEeCCceeEEEE-----cCCCc------eEEEEEecC-------------CeeEEEchhh Q lcl|NC_019456. 113 NGYAWIQKSLS--TGEPIALWPLDPNTVSILR-----NTDNN------SYWYRVTSD-------------IYNFTIPIND 166 (435) Q Consensus 113 ~~~~~i~~~~~--~g~~~~l~~l~~~~v~~~~-----~~~~~------~~~~~~~~~-------------~~~~~~~~~~ 166 (435) ..|..++.|.. ..-+.+|..|+|..++..+ ..+|. .-+|.+..+ +....++.+- T Consensus 155 Ri~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dA 234 (524) T protein:vir:72 155 RIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAA 234 (524) T ss_pred EEEEEEEEeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhh Confidence 99888876544 2238899999999886532 11111 112222211 2334566666 Q ss_pred eEEeccC-CCccccccCcHHHHHHHHHHHHHHHHHHHHHH-hhcCC--ceEEEeCCcCCHHHHHHHHHHHHHHhcC---- Q lcl|NC_019456. 167 VIHVKHV-VPSNSWYGVSPIDVLSSSLKFQRSVENFSQNE-MEKKD--KFVLQYDRSISPEKRQAMVNDFLRMVKE---- 238 (435) Q Consensus 167 iih~~~~-~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~-~~n~~--~~~~~~~~~~~~e~~~~~~~~~~~~~~~---- 238 (435) |.|.... .+.++-.-+|-|..+.+.+.....++...-.+ +...| +.+..--+.+....+++-...+...++| T Consensus 235 I~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvY 314 (524) T protein:vir:72 235 VVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVY 314 (524) T ss_pred eeeeeccceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEE Confidence 6665522 12233344677888888887777766655433 23333 2222212333333333333333222222 Q ss_pred ---------CCc-cccc----------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccC----ccc Q lcl|NC_019456. 239 ---------NGG-AVVQ----------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAK----STT 294 (435) Q Consensus 239 ---------~~~-~~vl----------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~----~~~ 294 (435) ..+ ...| +.|.+++.|.-...--++.+. .+....+.++++||.+-|.....+ +.+ T Consensus 315 Da~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~ 393 (524) T protein:vir:72 315 DASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDI-RWFRQALYMALRVPLSRIPQDQQGGVMFDSG 393 (524) T ss_pred eCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHH-HHHHHHHHHHhCCchhhcCCCCCcccccccc Confidence 111 1222 246777777655443343343 456777999999999999433221 122 Q ss_pred HHHHH-HHHHHHHHhHHHHH----HHHHHHHhhcc-----cccccC-cceeeechhhhhccC-------HHHHHHHHHHH Q lcl|NC_019456. 295 NVEHV-THSWTMTLMPIIRQ----YESQFNMKLFT-----PGKRVK-GFYFSFNVNGLLRGD-------TAARTQYYQTL 356 (435) Q Consensus 295 ~~e~~-~~~~~~~i~P~~~~----i~~~l~~~l~~-----~~~~~~-g~~i~fd~~~l~~~d-------~~~~~~~~~~~ 356 (435) ++=.. ..-|...|..+-.. +.+.|..+|+. +.++.. ...|.|++..--... +..+++++..+ T Consensus 394 ~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~ 473 (524) T protein:vir:72 394 TSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMA 473 (524) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHh Confidence 21111 11233333333333 33344444332 223221 123444333211111 12233333322 Q ss_pred Hh-cC-CcCHHHHHHH-hCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCC Q lcl|NC_019456. 357 TR-NG-IFKPNEIREL-EGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGEN 421 (435) Q Consensus 357 ~~-~g-~~t~NE~R~~-~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 (435) -. .| .++.+=+++. |.+.-. + +....+........ .--..|..+-.+. T Consensus 474 dpyvGky~s~~yi~k~ILr~tDe--e-------------i~~~~k~I~~E~k~--~~~~~~~~~~~~f 524 (524) T protein:vir:72 474 EPFIGKYISHRTAMKDILQMTDE--E-------------IEQEAKQIEEESKE--ARFQDPDQEQEDF 524 (524) T ss_pred hhhhcccchhHHHHHHHhccCHH--H-------------HHHHHHHHHHHhhc--CCCCCCchhhhcC Confidence 11 11 2344444432 333210 0 00000000000000 0000111111111 No 237 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=87.21 E-value=0.04 Score=28.23 Aligned_cols=398 Identities=10% Similarity=0.039 Sum_probs=163.0 Q ss_pred Cch-HHHHHhhcccc------------ccccccccccch-hhhh--h-------cc--------ccccCcc----cccHH Q lcl|NC_019456. 1 MSF-MSKVRQFFGVH------------DQANQIVQNPIP-QPLD--M-------AG--------VKLEQAT----FSREH 45 (435) Q Consensus 1 Mg~-~~~~~~~~~~~------------~~~~~~~~~~~~-~~~~--~-------~~--------~~~~~~~----~~~~~ 45 (435) |.| +..++.++-.. ..+..+....+. ..+. . .| ....... .-..+ T Consensus 1 m~f~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR 80 (523) T protein:vir:68 1 MKFNILSLFAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTYR 80 (523) T ss_pred CCCchhhhhhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHHH Confidence 666 22222222111 111111111110 0000 0 00 0000000 01124 Q ss_pred HHhhhHHHHHHHHHHHHHHhhCc-----eeeeeccccc-------ccchHHHhhhccccccCCHHHHHHHHHHHHHhcCC Q lcl|NC_019456. 46 ILESNEYIFSIVTRLSNVLASLP-----LHEYQNYKQM-------DNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGN 113 (435) Q Consensus 46 ~~~~~~~v~~~i~~ia~~ia~~~-----~~~~~~~~~~-------~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~ 113 (435) ....+|.|..||+-|.+.+.-+. +.+.-+..+. ..+.+.++++ --+-...+ ..++..|.+.|. T Consensus 81 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYVDgR 155 (523) T protein:vir:68 81 NLMTNYEVDNAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLN-HLSFQRKG----SDHFRRWYVDSR 155 (523) T ss_pred HHhhccchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhh----hHHHHhheeeeE Confidence 45678999999999998876654 2222121111 1123333332 22233334 455666778899 Q ss_pred cceEEeeeCC--CCcEEEEEEeCCceeEEEE-----cCCCc------eEEEEEecC-------------CeeEEEchhhe Q lcl|NC_019456. 114 GYAWIQKSLS--TGEPIALWPLDPNTVSILR-----NTDNN------SYWYRVTSD-------------IYNFTIPINDV 167 (435) Q Consensus 114 ~~~~i~~~~~--~g~~~~l~~l~~~~v~~~~-----~~~~~------~~~~~~~~~-------------~~~~~~~~~~i 167 (435) .|..++.+.. ..-+.+|..|+|..++..+ ...|. .-+|.+... +....++.+-| T Consensus 156 i~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~g~~vi~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI 235 (523) T protein:vir:68 156 IFFHKIIDPKRPKEGIKELRRLDPRQVQYVREVITTTEAGVKIVKGYKEYFIYDTSHESYACDGRIYEAGTKIKIPKAAI 235 (523) T ss_pred EEEEEEeeCCCccccceeeeeeCCcceeEEEeecCCCCcchhhhhhhhhheeeccccccccccccccCCCcceecchhhe Confidence 9888876543 2238899999999886532 11111 112222211 23455666666 Q ss_pred EEeccC-CCccccccCcHHHHHHHHHHHHHHHHHHHHHH-hhcCC--ceEEEeCCcCCHHHHHHHHHHHHHHhcC----- Q lcl|NC_019456. 168 IHVKHV-VPSNSWYGVSPIDVLSSSLKFQRSVENFSQNE-MEKKD--KFVLQYDRSISPEKRQAMVNDFLRMVKE----- 238 (435) Q Consensus 168 ih~~~~-~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~-~~n~~--~~~~~~~~~~~~e~~~~~~~~~~~~~~~----- 238 (435) .|.... .+.++-.-+|-|..+.+.+.....++...-.+ +...| +.+..--+.+....+++-...+...++| T Consensus 236 ~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYD 315 (523) T protein:vir:68 236 VYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYD 315 (523) T ss_pred eeeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEe Confidence 665522 12233344678888888887777766655433 23333 2222212333333333333333222222 Q ss_pred --------CCc-cccc----------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccc-Cc--ccHH Q lcl|NC_019456. 239 --------NGG-AVVQ----------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQA-KS--TTNV 296 (435) Q Consensus 239 --------~~~-~~vl----------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~-~~--~~~~ 296 (435) ..+ ...| +.|.+++.|.-...--++.+. .+....+.++++||.+-|..... -+ .+++ T Consensus 316 a~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~E 394 (523) T protein:vir:68 316 ATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDV-RWFRNALYMALRIPITRIPSDQGGIQFDAGTS 394 (523) T ss_pred ccCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHH-HHHHHHHHHHhCCcceeecCCCcceecccccc Confidence 112 1222 246777777655443343343 45677799999999999954321 11 2221 Q ss_pred HHH-HHHHHHHHhHHHHH----HHHHHHHhhcc-----cccccC-cceeeechhhhhccC-------HHHHHHHHHHHHh Q lcl|NC_019456. 297 EHV-THSWTMTLMPIIRQ----YESQFNMKLFT-----PGKRVK-GFYFSFNVNGLLRGD-------TAARTQYYQTLTR 358 (435) Q Consensus 297 e~~-~~~~~~~i~P~~~~----i~~~l~~~l~~-----~~~~~~-g~~i~fd~~~l~~~d-------~~~~~~~~~~~~~ 358 (435) =.. ..-|...|..+-.. +.+.|...|+. +.++.. ...|.|++..--... +..+++++..+-. T Consensus 395 ItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dp 474 (523) T protein:vir:68 395 ITRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEP 474 (523) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhh Confidence 111 11233333333333 33344444332 233221 123444433211111 1223333332211 Q ss_pred -cC-CcCHHHHHHH-hCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCC Q lcl|NC_019456. 359 -NG-IFKPNEIREL-EGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGEN 421 (435) Q Consensus 359 -~g-~~t~NE~R~~-~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 (435) .| .++.+=+++. |.+.-. + +....+........ .--..|..+-.+. T Consensus 475 yvGky~s~~yi~k~ILr~tDe--e-------------i~~~~kqI~~E~k~--~~~~~p~~e~~~f 523 (523) T protein:vir:68 475 FIGKYISHRTAMKDILQMSDE--E-------------IEQEAKQIEEESKE--ARFQDPDQEQEDF 523 (523) T ss_pred hhcccchhHHHHHHHhccCHH--H-------------HHHHHHHHHHHhhc--CCCCCCchhhhcC Confidence 11 2344444432 333210 0 00000000000000 0000111111111 No 238 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=85.80 E-value=0.05 Score=27.71 Aligned_cols=397 Identities=11% Similarity=0.052 Sum_probs=162.5 Q ss_pred Cch--HHHHHhhccccc------------cccccccccch-hhh---------hhcccccc-Ccc-----------cccH Q lcl|NC_019456. 1 MSF--MSKVRQFFGVHD------------QANQIVQNPIP-QPL---------DMAGVKLE-QAT-----------FSRE 44 (435) Q Consensus 1 Mg~--~~~~~~~~~~~~------------~~~~~~~~~~~-~~~---------~~~~~~~~-~~~-----------~~~~ 44 (435) |.| ++.++ +|+... .+..+....+. ... ...|.... .+. .-.. T Consensus 1 m~~~~L~~~~-~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~Y 79 (524) T protein:vir:10 1 MKFNVLSLFA-PWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTY 79 (524) T ss_pred CCCchhhHhh-ccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHH Confidence 777 33332 222111 01111111000 000 00111000 000 0112 Q ss_pred HHHhhhHHHHHHHHHHHHHHhhCc-----eeeeecccc-------cccchHHHhhhccccccCCHHHHHHHHHHHHHhcC Q lcl|NC_019456. 45 HILESNEYIFSIVTRLSNVLASLP-----LHEYQNYKQ-------MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSG 112 (435) Q Consensus 45 ~~~~~~~~v~~~i~~ia~~ia~~~-----~~~~~~~~~-------~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G 112 (435) +..+.+|.|..||+.|.+.+.-+. +.+.-++.+ ...+.+.++++ --+-...+ ..++..|.+.| T Consensus 80 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYVDg 154 (524) T protein:vir:10 80 RNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDVLN-HLSFQRKG----SDHFRRWYVDS 154 (524) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhheeee Confidence 445678999999999998876554 222211111 11122333332 22233334 45566677889 Q ss_pred CcceEEeeeCC--CCcEEEEEEeCCceeEEEE-----cCCCc------eEEEEEecC-------------CeeEEEchhh Q lcl|NC_019456. 113 NGYAWIQKSLS--TGEPIALWPLDPNTVSILR-----NTDNN------SYWYRVTSD-------------IYNFTIPIND 166 (435) Q Consensus 113 ~~~~~i~~~~~--~g~~~~l~~l~~~~v~~~~-----~~~~~------~~~~~~~~~-------------~~~~~~~~~~ 166 (435) ..|..++.+.. ..-+.+|..|+|..++..+ ..+|. .-+|.+..+ +....++.+- T Consensus 155 Ri~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dA 234 (524) T protein:vir:10 155 RIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAA 234 (524) T ss_pred EEEEEEEeeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhh Confidence 99888876543 2238899999999886532 11111 112222211 2334566666 Q ss_pred eEEeccC-CCccccccCcHHHHHHHHHHHHHHHHHHHHHH-hhcCC--ceEEEeCCcCCHHHHHHHHHHHHHHhcC---- Q lcl|NC_019456. 167 VIHVKHV-VPSNSWYGVSPIDVLSSSLKFQRSVENFSQNE-MEKKD--KFVLQYDRSISPEKRQAMVNDFLRMVKE---- 238 (435) Q Consensus 167 iih~~~~-~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~-~~n~~--~~~~~~~~~~~~e~~~~~~~~~~~~~~~---- 238 (435) |.|.... .+.++-.-+|-|..+.+.+.....++...-.+ +...| +.+..--+.+....+++-...+...++| T Consensus 235 I~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvY 314 (524) T protein:vir:10 235 IVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVY 314 (524) T ss_pred eeeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEE Confidence 6655522 12233344677888888887777766655433 23333 2222212333333333333333222222 Q ss_pred ---------CCc-cccc----------cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccC----ccc Q lcl|NC_019456. 239 ---------NGG-AVVQ----------EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAK----STT 294 (435) Q Consensus 239 ---------~~~-~~vl----------~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~----~~~ 294 (435) ..+ ...| +.|.+++.|.-...--++.+. .+....+.++++||.+-|.....+ +.+ T Consensus 315 Da~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~ 393 (524) T protein:vir:10 315 DASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDV-RWFRQALYMALRVPLSRIPQDQQGGVMFDSG 393 (524) T ss_pred eCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHH-HHHHHHHHHHhCCchhhcCCCCCcccccccc Confidence 111 1222 246777777655443343343 456777999999999999433221 122 Q ss_pred HHHHH-HHHHHHHHhHHHHH----HHHHHHHhhcc-----cccccC-cceeeechhhhhccC-------HHHHHHHHHHH Q lcl|NC_019456. 295 NVEHV-THSWTMTLMPIIRQ----YESQFNMKLFT-----PGKRVK-GFYFSFNVNGLLRGD-------TAARTQYYQTL 356 (435) Q Consensus 295 ~~e~~-~~~~~~~i~P~~~~----i~~~l~~~l~~-----~~~~~~-g~~i~fd~~~l~~~d-------~~~~~~~~~~~ 356 (435) ++=.. ..-|...|..+-.. +.+.|..+|+. +.++.. ...|.|++..--... +..+++++..+ T Consensus 394 ~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~ 473 (524) T protein:vir:10 394 TSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMA 473 (524) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHh Confidence 21111 11233333333333 33344444332 223221 123444433211111 12233333322 Q ss_pred Hh-cC-CcCHHHHHHH-hCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCC Q lcl|NC_019456. 357 TR-NG-IFKPNEIREL-EGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGEN 421 (435) Q Consensus 357 ~~-~g-~~t~NE~R~~-~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 421 (435) -. .| .++.+=+++. |.+.-. + +....+........ .--..|..+-.+. T Consensus 474 dpyvGky~s~~yi~k~ILr~tDe--e-------------i~~~~k~I~~E~k~--~~~~~~~~~~~~f 524 (524) T protein:vir:10 474 EPFIGKYISHRTAMKDILQMTDE--E-------------IEQEAKQIEEESKE--ARFQDPDQEQEDF 524 (524) T ss_pred hhhhcccchhHHHHHHHhccCHH--H-------------HHHHHHHHHHHhhc--CCCCCCchhhhcC Confidence 11 11 2344444432 333210 0 00000000000000 0000111111111 No 239 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=85.67 E-value=0.051 Score=27.66 Aligned_cols=373 Identities=12% Similarity=0.042 Sum_probs=147.1 Q ss_pred Cc-------------hHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhh----hHHHHHHHHHHHHH Q lcl|NC_019456. 1 MS-------------FMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILES----NEYIFSIVTRLSNV 63 (435) Q Consensus 1 Mg-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~v~~~i~~ia~~ 63 (435) |. -|..++...++...=........+.+ ..........++. .+++...++.++.. T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~--------~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~ 72 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKL--------SGQTDDMYNAYKQRALFYSITSKTLSALSGM 72 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCC--------CCCCHHHHHHHHhhccCCchHHHHHHHHhch Confidence 43 23333333322211000000000000 0001111122222 24555556655555 Q ss_pred HhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeE---- Q lcl|NC_019456. 64 LASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVS---- 139 (435) Q Consensus 64 ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~---- 139 (435) +-+-|..+. . ...+.++ +. =....+-.+|.++++...+.+|.++++|.....+++|.-. .++|..+- T Consensus 73 vf~k~p~~~-----~-p~~l~~~-~~-D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~-~~~~~~Ii~W~~ 143 (452) T protein:vir:94 73 VLDQPPVIT-----H-PDAMSKY-FE-DQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYIS-VYTTENILNWEE 143 (452) T ss_pred hhcCCceec-----c-cHHHHHH-Hh-cccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEE-EechhhhcCccc Confidence 555554431 1 1233333 21 2468889999999999999999999988876555555422 22222110 Q ss_pred ---------------EEEcCC---CceE--EEE--------------EecCCeeEEEchhheEE-----------ec--c Q lcl|NC_019456. 140 ---------------ILRNTD---NNSY--WYR--------------VTSDIYNFTIPINDVIH-----------VK--H 172 (435) Q Consensus 140 ---------------~~~~~~---~~~~--~~~--------------~~~~~~~~~~~~~~iih-----------~~--~ 172 (435) ...+.. +... .|. ...++..... ..++.+ |- + T Consensus 144 ~~~g~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~-~~~~~~~~~~~~l~~IP~v~~~ 222 (452) T protein:vir:94 144 DEDGRLLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWEL-AKTSTIQNVGVTMDYIPFFCIT 222 (452) T ss_pred cccCCeeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeee-ccceeecCCCcccceeEEEEEc Confidence 011111 1000 000 0011110000 001111 11 1 Q ss_pred CCCccccccCcHHHHHHH-HHHHHHHHHHHHHHHhhcC-CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccC-Cc Q lcl|NC_019456. 173 VVPSNSWYGVSPIDVLSS-SLKFQRSVENFSQNEMEKK-DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEA-GW 249 (435) Q Consensus 173 ~~~~~~~~G~s~l~~~~~-~i~~~~~~~~~~~~~~~n~-~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~-g~ 249 (435) ....+...+.+|+..++. .+........+...++.-+ +..++...... + ...-+++.++.+++ |. T Consensus 223 ~~~~~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~--~----------~i~iG~~~~~~lpe~~~ 290 (452) T protein:vir:94 223 PSGLSMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQ--S----------TMHIGSTKAWVIPEVAA 290 (452) T ss_pred CCCCCCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEeecCcCC--C----------ceEecccccccCCCCCC Confidence 112244567888775444 4555555555555554444 33333322111 1 11224556777774 65 Q ss_pred e--eeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH-HH--HHHHHhHHHHHHHHHHHHhhc- Q lcl|NC_019456. 250 K--VDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT-HS--WTMTLMPIIRQYESQFNMKLF- 323 (435) Q Consensus 250 ~--~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~-~~--~~~~i~P~~~~i~~~l~~~l~- 323 (435) + |...+-+...+.... .+....++ +..|. .++-.....+.+ .++.. .+ -+..|.-++.++++.++.-|= T Consensus 291 ~~~yie~~g~~i~~~~~~-l~~le~~m-~~~Ga--~ll~~~~~~~~s-~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~ 365 (452) T protein:vir:94 291 KVGFLEFTGQGLQSLEKA-LSEKQAQL-ASLSA--RLIDNSTRGSEA-TETVKLRYMSETASLKSVTRAVEALLNKAYSC 365 (452) T ss_pred cceEEccCchhHHHHHHH-HHHHHHHH-HHHHH--HhhccCCCcchH-HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 4 444444433322222 22222222 22221 122221211111 12211 11 124445555555555543221 Q ss_pred -ccc-cccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccccc Q lcl|NC_019456. 324 -TPG-KRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAIL 401 (435) Q Consensus 324 -~~~-~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~ 401 (435) ... ....+..|..+.+-.........++++.+++..|.++....++.+..--+.+...+.- .+. . T Consensus 366 ~a~w~g~~~~~~v~~n~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~~e~~----------~i~---~ 432 (452) T protein:vir:94 366 IMDMESMGGTLNIKLNSAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLPPPGESM----------GVI---P 432 (452) T ss_pred HHHHcCCCCceEEEeccccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCccCHH----------HHH---H Confidence 010 0112345666666544443345666677789999999999988883332221111110 000 0 Q ss_pred cccccccccccccccCCCCCC Q lcl|NC_019456. 402 DNKIQTDASVAAPKQEGGENT 422 (435) Q Consensus 402 ~~~~~~~~~~~~~~~~~~~~~ 422 (435) +...+.. .+.+.+..++.+. T Consensus 433 E~~~~~~-~~~~~~~~~~~~~ 452 (452) T protein:vir:94 433 DPPAPEP-SPSNTPPNPSSKA 452 (452) T ss_pred HhhccCc-ccCCCCCCCccCC Confidence 0000000 0011111111111 No 240 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=85.30 E-value=0.054 Score=27.54 Aligned_cols=364 Identities=8% Similarity=-0.009 Sum_probs=150.9 Q ss_pred CchHHHHH---hhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhCceeeeecccc Q lcl|NC_019456. 1 MSFMSKVR---QFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASLPLHEYQNYKQ 77 (435) Q Consensus 1 Mg~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 77 (435) +.-..++. +...+.. ...... .......... ......-..+....-+|+..+.-+-+-|+...-.+.. T Consensus 14 ~~~~~r~~~~~~YY~g~~---~i~~~~---~~~~~~~~~~---~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~~~~~~ 84 (451) T protein:vir:10 14 AARRQEILQAKSYYYNKN---DILKKG---VVVQNRDENP---LRNADNRISHNFHEILVDEKASYMFTYPVLFDIDNNK 84 (451) T ss_pred HHHHHHHHHHHHHhcccC---cccccc---cccccccccc---ccccccccccchHHHHHHhhhhheecccceeecCCcH Confidence 11111111 1111100 000000 0000000000 0000001113445556676666665666654322222 Q ss_pred cccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC-------CCcEEEEEEeCCceeEEEEcCC--Cce Q lcl|NC_019456. 78 MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS-------TGEPIALWPLDPNTVSILRNTD--NNS 148 (435) Q Consensus 78 ~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~-------~g~~~~l~~l~~~~v~~~~~~~--~~~ 148 (435) .. ..+...++ + .........+...+..+|.||.++..+.. .|. ..+..++|..+.+..+.. +.. T Consensus 85 ~~-~~~~~~~~-~----n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~-~~~~~i~p~~~~~vydd~~~~~~ 157 (451) T protein:vir:10 85 EL-NEKVTDVL-G----NEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQT-FKYGVVNTEEIIPIYRNGIEREL 157 (451) T ss_pred HH-HHHHHHHh-c----cCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccc-eeEEEEcccceEEEEcCCCCCce Confidence 11 12222222 1 24556677888999999999998877543 132 347778888877766442 121 Q ss_pred E----EEEEec-C-Ce--------eEEEchhheEEeccCC---------------Cc---------cccccCcHHHHHHH Q lcl|NC_019456. 149 Y----WYRVTS-D-IY--------NFTIPINDVIHVKHVV---------------PS---------NSWYGVSPIDVLSS 190 (435) Q Consensus 149 ~----~~~~~~-~-~~--------~~~~~~~~iih~~~~~---------------~~---------~~~~G~s~l~~~~~ 190 (435) . +|.... . +. ...++.+.+.+++... +. +...|.|-+..+.. T Consensus 158 ~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~~d~e~v~~ 237 (451) T protein:vir:10 158 EAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVEFSNNIKKQSDLSKYKK 237 (451) T ss_pred EEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEEeccCCCCCCchhhHHH Confidence 1 111111 1 10 1234455555543211 00 12336677777666 Q ss_pred HHHHHHHHHHHHHHHhhcCCceEEEeC---CcCCHHHHHHHHHHHHHHhcCCCcccccc-------CCceeeeccCChhh Q lcl|NC_019456. 191 SLKFQRSVENFSQNEMEKKDKFVLQYD---RSISPEKRQAMVNDFLRMVKENGGAVVQE-------AGWKVDRYESKFEP 260 (435) Q Consensus 191 ~i~~~~~~~~~~~~~~~n~~~~~~~~~---~~~~~e~~~~~~~~~~~~~~~~~~~~vl~-------~g~~~~~~~~~~~~ 260 (435) .+.....+..-.++.+......++.+. .....+....+ +.. +++.+. ++++|..-.. .. T Consensus 238 liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~--------~~~-~~i~~~~~~~~~~~~~~~l~~~~--~~ 306 (451) T protein:vir:10 238 ILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKEL--------KRY-KTIKTETDSEGDSGGLKTMQIEI--PT 306 (451) T ss_pred HHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHHH--------hhC-CeEEecCcCCccCCcceEEeecC--CH Confidence 666655433333323322222222222 22222322221 111 222222 3345444333 33 Q ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHH--------------HHHHHHHhHHHHHHHHHHHHhhcccc Q lcl|NC_019456. 261 ADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVT--------------HSWTMTLMPIIRQYESQFNMKLFTPG 326 (435) Q Consensus 261 ~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~--------------~~~~~~i~P~~~~i~~~l~~~l~~~~ 326 (435) ..+....+...+.|...-++|.. .....++. +..+.. ..|...+...++-+...+. .. T Consensus 307 ~~~~~~~~~l~~~I~~~s~~p~~--~~~~~gn~-Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~-----~~ 378 (451) T protein:vir:10 307 EARKIILEILKKQIYESGQGLQQ--DTENFGNA-SGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLG-----VT 378 (451) T ss_pred HHHHHHHHHHHHHHHHHhCcccc--cccccccc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CC Confidence 44667777788899999999853 11111221 112221 1222333333333332221 11 Q ss_pred cccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhcccccccccccc Q lcl|NC_019456. 327 KRVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQ 406 (435) Q Consensus 327 ~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~ 406 (435) . . ..+.+.+......|..+.++.+.++. |+++.--+.+++++-.-+.+.-. ...+........ T Consensus 379 d-~--~~i~i~f~~~~p~n~~e~~~~~~kl~--g~iS~et~~~~~p~v~d~~~e~~------------~~~ee~~~~~~~ 441 (451) T protein:vir:10 379 D-Y--KKIQQTYTRNMMSNDLEDADIATKSV--GIIPTKIILRHHPWVDDVEEAEK------------LYLEEKKIQASK 441 (451) T ss_pred C-c--cceeEEecCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHH------------HHHHHHHHHHHH Confidence 1 1 23444445666788999999999984 78898888888754321111110 000000000000 Q ss_pred ccccccccccCCCCCCC Q lcl|NC_019456. 407 TDASVAAPKQEGGENTN 423 (435) Q Consensus 407 ~~~~~~~~~~~~~~~~~ 423 (435) .+ +.-++-++ T Consensus 442 -~~------~~~~~~~~ 451 (451) T protein:vir:10 442 -VS------DDYNNFTE 451 (451) T ss_pred -HH------hhcCCCCC Confidence 00 00000001 No 241 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=83.94 E-value=0.064 Score=27.11 Aligned_cols=396 Identities=11% Similarity=0.057 Sum_probs=146.8 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhh----HHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESN----EYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) +.-|..++..+++...=........+.+- .-....-.......++.. +++...++.+...+-+-|..+ T Consensus 15 ~~~W~~ird~~~G~~~~r~~g~~YLP~~~---~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~vf~k~p~~----- 86 (501) T protein:vir:95 15 LPLYYLIRDAIAGEPTVKGARTTYLPMPN---AEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQVFMRDPVV----- 86 (501) T ss_pred HHHHHHHHHHhcChHHHHhcccccCcCCC---CCCCcccchHHHHHHhhccccCchHHHHHHHHhhhhhcCCcce----- Confidence 33344444444433211000000010000 000000000112222222 233333333333333322222 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc--------------EEEEEEeCCceeE--- Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE--------------PIALWPLDPNTVS--- 139 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~--------------~~~l~~l~~~~v~--- 139 (435) + ....+..++..---...+-.+|.+.++...+.+|.++++|......+. |. +..+.|..+- T Consensus 87 ~-~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~rPy-~~~~~~~~IinW~ 164 (501) T protein:vir:95 87 K-VPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRIRPT-LYVYSPTEIINWR 164 (501) T ss_pred e-CcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccCCcE-EEEecHhhhcCcc Confidence 0 122344444334446678999999999999999999998864322111 11 2222222110 Q ss_pred -----------------EEEcCC-------------------CceEEEEEecCCee----EEEc---------------- Q lcl|NC_019456. 140 -----------------ILRNTD-------------------NNSYWYRVTSDIYN----FTIP---------------- 163 (435) Q Consensus 140 -----------------~~~~~~-------------------~~~~~~~~~~~~~~----~~~~---------------- 163 (435) .....+ |...+......... ...+ T Consensus 165 ~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 244 (501) T protein:vir:95 165 TTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIPKGNYQQYVVYKPTDAQ 244 (501) T ss_pred eeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcceecCCcccccceeeeeccC Confidence 000011 11000000000000 0000 Q ss_pred --hhh---eEEeccCCCccccccCcHHHHHHH-HHHHHHHHHHHHHHHhhcC-CceEEEeCCcCCHHHHHHHHHHHHHHh Q lcl|NC_019456. 164 --IND---VIHVKHVVPSNSWYGVSPIDVLSS-SLKFQRSVENFSQNEMEKK-DKFVLQYDRSISPEKRQAMVNDFLRMV 236 (435) Q Consensus 164 --~~~---iih~~~~~~~~~~~G~s~l~~~~~-~i~~~~~~~~~~~~~~~n~-~~~~~~~~~~~~~e~~~~~~~~~~~~~ 236 (435) .=. ++++ +....+...+.+|+..++. .+........+...++.-+ +..+++. ++++..+..... ... T Consensus 245 ~~~l~~IPfv~~-~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~G---~~~~~~~~~~~~--~i~ 318 (501) T protein:vir:95 245 GKRLTEIPFMFI-GSENNDSNPDNPNFYDLASLNMAHYRNSADYEESCYIVGQPTPVLIG---LTEEWVTNVLKG--SVN 318 (501) T ss_pred CCcCCeeeEEEE-ecCCCCCCCCccchHHHHHHHHHHHhhhhHHHHHHHHcccceeeeeC---CcccccccCCCC--cee Confidence 000 0111 1112233456677665443 4444444444444444333 3333331 111211100000 011 Q ss_pred cCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHH---HHHHhHHHHH Q lcl|NC_019456. 237 KENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSW---TMTLMPIIRQ 313 (435) Q Consensus 237 ~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~---~~~i~P~~~~ 313 (435) -+++..+.++.|.++.-+..++.-+. .+..+....++.. .| ..++..... + .+.++...-+ ++.|.-++.+ T Consensus 319 ~G~~~~~~lP~~~~~~~ie~~~~~i~-~~~l~~l~~~m~~-~G--a~ll~~~~~-~-~Ta~~~~~~~~~~~S~L~~~a~~ 392 (501) T protein:vir:95 319 FGSRGGIPLPVGADAKLLQASENTML-KEAMDTKERQMVA-LG--AKLVEQKEV-Q-RTATEAELEAASEGSTLSSATKN 392 (501) T ss_pred ecccccccCCCCCceeEEecChhhHH-HHHHHHHHHHHHH-HH--HhhccCCcc-c-hhHHHHHHHHHHHhHHHHHHHHH Confidence 13345667777766655555544443 2223333333332 23 123322211 1 1222211111 3445556666 Q ss_pred HHHHHHHhhcc--ccc--ccCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeeccc Q lcl|NC_019456. 314 YESQFNMKLFT--PGK--RVKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKD 389 (435) Q Consensus 314 i~~~l~~~l~~--~~~--~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n 389 (435) +++.++..|-. ... ...+..|+.+.+-.........++++.+++..|.++....++.+-.--+..+--+. T Consensus 393 le~al~~~l~~~a~w~g~~~~~~~v~i~~df~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~~v~~~~~~~------ 466 (501) T protein:vir:95 393 VSAAFEWALKWAARWVGQADSGVKFELNTDFDIARMTPDERRSLVEEWQKGAITFEEMRTGLRKAGVATEDDSK------ 466 (501) T ss_pred HHHHHHHHHHHHHHHcCCCCCceEEEEecccccccCCHHHHHHHHHHHhCCCCcHHHHHHHHHhCCCCChhHHH------ Confidence 66666543221 111 11234566665544433344456777789999999999998877433332110000 Q ss_pred ccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 390 LYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPE 433 (435) Q Consensus 390 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (435) ..+.+.+..........-.....+..||++ =+++| T Consensus 467 --e~e~i~~~~~~~~~~~~~~~~~~~~~gg~~-------~~~~~ 501 (501) T protein:vir:95 467 --AKEKIAKDTAEAMALATPANVPGDGSGGDN-------VGNSE 501 (501) T ss_pred --HHHHHHhhhcCcccccccCCCCCCCccccc-------ccCCC Confidence 001111111110000000011112223333 12222 No 242 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=80.81 E-value=0.091 Score=26.28 Aligned_cols=404 Identities=12% Similarity=0.036 Sum_probs=139.7 Q ss_pred Cc-----hHHHHHhhccccccccccccccchhhhhhcccc--------ccC---cccccHHHHhhhHHHHHHHHHHHHHH Q lcl|NC_019456. 1 MS-----FMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVK--------LEQ---ATFSREHILESNEYIFSIVTRLSNVL 64 (435) Q Consensus 1 Mg-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~---~~~~~~~~~~~~~~v~~~i~~ia~~i 64 (435) |+ +.+.++..+..-+...+.....+.+..+..-.. ... +...+.. .. .++--.|++.+|..+ T Consensus 1 m~~d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~-~~-dstg~~a~~~LAs~l 78 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQK-MF-DSTAPLALRNFVAAM 78 (549) T ss_pred CCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccc-cc-cchHHHHHHHHHHHH Confidence 43 233332222211111111122222222221110 000 0000000 11 122234555555544 Q ss_pred hh-C-----ce-eeeeccccccc------------chHHHhhh-ccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCC Q lcl|NC_019456. 65 AS-L-----PL-HEYQNYKQMDN------------EPLADLLK-TSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLST 124 (435) Q Consensus 65 a~-~-----~~-~~~~~~~~~~~------------~~l~~~l~-~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~ 124 (435) -. + || ++.-......+ +.++..+. .+- +.+.-+..++.+++++|++.+++..+.. T Consensus 79 ~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~s----nf~~~~~~~~~~L~~~Gta~l~~~~~~~- 153 (549) T protein:vir:10 79 DSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQG----GFVTQMGATYQSIGLFGPGALMIEHDVG- 153 (549) T ss_pred HhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhc----ChHHHHHHHHHHHHhhcceeeEEeecCC- Confidence 32 2 23 22212211111 11111111 122 3445566678899999999999876543 Q ss_pred CcEEEEEEeCCceeEEEEcCCCceEEEEE----------------------------------------ec--------- Q lcl|NC_019456. 125 GEPIALWPLDPNTVSILRNTDNNSYWYRV----------------------------------------TS--------- 155 (435) Q Consensus 125 g~~~~l~~l~~~~v~~~~~~~~~~~~~~~----------------------------------------~~--------- 155 (435) +...+..++..++-+..+..|..-.++. .+ T Consensus 154 -~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~V~pr~~~~~~~~ 232 (549) T protein:vir:10 154 -KGIVYRNVPMQRLWFAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRDPRKL 232 (549) T ss_pred -CeeEEEEEEcCeEEEeeCCCCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEEeecCCCCCcccc Confidence 3444434444444444444443221110 00 Q ss_pred CCee-----E--EEchhheE-----------EeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEe- Q lcl|NC_019456. 156 DIYN-----F--TIPINDVI-----------HVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQY- 216 (435) Q Consensus 156 ~~~~-----~--~~~~~~ii-----------h~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~- 216 (435) ++.. + ....+.|+ -.|.....+..||.||...+.-.+.....+.+............++.+ T Consensus 233 ~~~~~pf~sv~~e~~~~~il~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~ 312 (549) T protein:vir:10 233 DGRNMQFASYWLDEGRDRIVQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLAN 312 (549) T ss_pred ccccCceEEEEEEecCCEeeccCCcccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec Confidence 0000 0 00001111 112222235589999999999988888877776665544443333333 Q ss_pred -CCcCCHHHHHHHHHHHHHHhcCCCccccc--cCCceeeeccCChhhHH-HHHHHHHHHHHHHHHhCCCHHHhCCcccCc Q lcl|NC_019456. 217 -DRSISPEKRQAMVNDFLRMVKENGGAVVQ--EAGWKVDRYESKFEPAD-LSSVEQISRIRIATAFNVPISFLNDDQAKS 292 (435) Q Consensus 217 -~~~~~~e~~~~~~~~~~~~~~~~~~~~vl--~~g~~~~~~~~~~~~~~-~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~ 292 (435) ++.++..... .+.-..... .+...+.++.... +.+ ..+..+.....|-.+|-+....+-...... T Consensus 313 ~~g~~~~~~l~----------pgg~~~~~~~~~~~~~~~pl~~~~-~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~ 381 (549) T protein:vir:10 313 EDGVLDGFDLR----------SGALNWGGLNDKGEEMVKPLLTGK-QAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDM 381 (549) T ss_pred cccccccceec----------cCCccccccCCCCccceeeecccc-chhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCc Confidence 3333322211 111111111 2334577765543 233 334455667789999987764332222222 Q ss_pred ccHH-HHHHH------------HHHHHHhHHHHHHHHHHHHh-hcccc--c-ccCcceeeechh-hhhc----cCHHHHH Q lcl|NC_019456. 293 TTNV-EHVTH------------SWTMTLMPIIRQYESQFNMK-LFTPG--K-RVKGFYFSFNVN-GLLR----GDTAART 350 (435) Q Consensus 293 ~~~~-e~~~~------------~~~~~i~P~~~~i~~~l~~~-l~~~~--~-~~~g~~i~fd~~-~l~~----~d~~~~~ 350 (435) ++.+ .+... +...-+.|++.+....+.+. .|++. + ...+..+...+. .|-+ .+..... T Consensus 382 TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~ 461 (549) T protein:vir:10 382 TATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAIL 461 (549) T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHH Confidence 2221 11111 11234456655444444432 23321 1 112323333322 2322 1122111 Q ss_pred HHHHHHHh---cC-----CcCHHH----HHHHhCCCCCCCcCCceeeeccc-ccchhccccccccccccccccccccccC Q lcl|NC_019456. 351 QYYQTLTR---NG-----IFKPNE----IRELEGQAPIPDEAADHLYISKD-LYPLDKYYDAILDNKIQTDASVAAPKQE 417 (435) Q Consensus 351 ~~~~~~~~---~g-----~~t~NE----~R~~~g~~p~~~~~gd~~~~~~n-~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 417 (435) ..+..... .+ .+..++ +.+.+|.|+ . ++.+.. ...+-.....+.... ......+ . T Consensus 462 ~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~------~-~irs~eev~~~r~~~~~qqq~~---~~~~~a~--~ 529 (549) T protein:vir:10 462 QWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPV------E-AMSTDEELQAQQAAEAQAAQMQ---QMLAAAP--V 529 (549) T ss_pred HHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCc------c-ccCCHHHHHHHHHHHHHHHHHH---HHHHHHH--H Confidence 22221111 11 122223 223334442 0 111110 000000000000000 0000000 0 Q ss_pred CCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 418 GGENTNENGLQSTEPEGS 435 (435) Q Consensus 418 ~~~~~~~~~~~~~~~~~~ 435 (435) ++.-..+=.+..+..... T Consensus 530 a~~~a~~~~~~~ta~~~~ 547 (549) T protein:vir:10 530 AAGAIKDLSDAQTAAQTA 547 (549) T ss_pred HHHHHHhhhhhcCCCccc Confidence 000000111111111111 No 243 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=69.72 E-value=0.22 Score=24.20 Aligned_cols=377 Identities=12% Similarity=0.025 Sum_probs=133.1 Q ss_pred CchHHHHHhhccc-ccccc----------------ccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHH Q lcl|NC_019456. 1 MSFMSKVRQFFGV-HDQAN----------------QIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNV 63 (435) Q Consensus 1 Mg~~~~~~~~~~~-~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 63 (435) -.++.....--.. .++.. ...+...+....|+..... .. .......|..-.+.+ T Consensus 36 ~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~~~WF~l~~~-----d~-~~~~~~~v~~~L~~v--- 106 (547) T protein:vir:10 36 SDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPATKWFELAFR-----DK-ELNSDDECRKWLENA--- 106 (547) T ss_pred cccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCCCcccccccC-----Cc-cccchHHHHHHHHHH--- Confidence 1111110000000 00000 0000000000001111000 00 001111111111111 Q ss_pred HhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEc Q lcl|NC_019456. 64 LASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRN 143 (435) Q Consensus 64 ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~ 143 (435) ++.++..+ .+-| .+.-+..++.+++.+|++.+++..+...+....+..++..++-+..+ T Consensus 107 ----------------e~~i~~~l-~~sn----f~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~pl~~~~v~~d 165 (547) T protein:vir:10 107 ----------------THDVYSAL-QDSN----FNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQSSPIQDSYFEED 165 (547) T ss_pred ----------------HHHHHHHH-HhcC----cHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEEeecceEEEeeC Confidence 11222222 2333 33335666789999999998887654333333444444455544444 Q ss_pred CCCceEEEEE-------------------------------------------e-------------------------- Q lcl|NC_019456. 144 TDNNSYWYRV-------------------------------------------T-------------------------- 154 (435) Q Consensus 144 ~~~~~~~~~~-------------------------------------------~-------------------------- 154 (435) ..|..-.++. . T Consensus 166 ~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~p~~s~~ 245 (547) T protein:vir:10 166 SRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGTVLAPTERPFGKKW 245 (547) T ss_pred CCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCccccceeeccccceeEEE Confidence 4443211000 0 Q ss_pred --cCCeeEE-----EchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHH Q lcl|NC_019456. 155 --SDIYNFT-----IPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKR 225 (435) Q Consensus 155 --~~~~~~~-----~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~ 225 (435) .++.... |..-=.+..|.....+..||.||...+.-.+.....+.+......... +...+.-++.+.+ . T Consensus 246 ~e~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~--~ 323 (547) T protein:vir:10 246 ILKEGAVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAIMVTERGLISD--I 323 (547) T ss_pred EEecCceeeeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeccccccccc--c Confidence 0000000 001112233333334668999999999998888877777655444333 3333332222221 1 Q ss_pred HHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHH-HHH----- Q lcl|NC_019456. 226 QAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNV-EHV----- 299 (435) Q Consensus 226 ~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~-e~~----- 299 (435) + -..|++.+.+..-.++++.....-....+..+.....|-.+|-+....+......| +.+ .+. T Consensus 324 ~----------~~pgg~~~~~~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~T-AtEV~~r~~E~~ 392 (547) T protein:vir:10 324 D----------LGASGLTVVRDMESMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQLQMKDSPAMT-ATEVQVRYELMQ 392 (547) T ss_pred e----------ecCCeeeecCCcccceeeecccchHHHHHHHHHHHHHHHHHhhhhhhhcCCCcccc-HHHHHHHHHHHH Confidence 1 12456666666667777766543333345566667889999987665543333222 221 011 Q ss_pred -------HHHHHHHHhHHHHHHHHHHHHh-hcccc--ccc--Ccceeeechh-hhhcc----CHHHHHHHHHHHH---hc Q lcl|NC_019456. 300 -------THSWTMTLMPIIRQYESQFNMK-LFTPG--KRV--KGFYFSFNVN-GLLRG----DTAARTQYYQTLT---RN 359 (435) Q Consensus 300 -------~~~~~~~i~P~~~~i~~~l~~~-l~~~~--~~~--~g~~i~fd~~-~l~~~----d~~~~~~~~~~~~---~~ 359 (435) ..+...-+.|++.+.-..+.+. .|++- +.. .+..+++... .|-+. +.......+...- +. T Consensus 393 ~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~laq~ 472 (547) T protein:vir:10 393 RLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTAQLAEI 472 (547) T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 1112345566666555555443 33321 111 1222222222 22222 1111112222111 10 Q ss_pred C-----CcCHHHH----HHHhCCCCCCCcCCceeeecccccchhcccc-cccccccc--ccccccccccCC-CCCC-CCC Q lcl|NC_019456. 360 G-----IFKPNEI----RELEGQAPIPDEAADHLYISKDLYPLDKYYD-AILDNKIQ--TDASVAAPKQEG-GENT-NEN 425 (435) Q Consensus 360 g-----~~t~NE~----R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~-~~~~~~~~--~~~~~~~~~~~~-~~~~-~~~ 425 (435) + .+..+++ .+.+|.|+ +.+........+-+.-. .+....+. .++...+-...| ++.. .++ T Consensus 473 ~P~vld~id~d~~~~~~a~~~Gvp~------~~irs~eev~~~r~qr~~~~q~~~qaa~~~~~g~~m~~~~~~~a~~~~~ 546 (547) T protein:vir:10 473 NPEVLDIPDWDEMVRMLGSLLGAPQ------TLMRPKAKVTSIRKNRSQTQQKAEQAAIAEAEGNAMEAQGKGQAALKEN 546 (547) T ss_pred ChhhhhcCCHHHHHHHHHHHhCCCh------hccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccchhcc Confidence 0 1333332 23344432 11111111110100000 00000000 000000000000 0000 000 Q ss_pred C Q lcl|NC_019456. 426 G 426 (435) Q Consensus 426 ~ 426 (435) . T Consensus 547 ~ 547 (547) T protein:vir:10 547 Q 547 (547) T ss_pred C Confidence 0 No 244 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=68.07 E-value=0.24 Score=23.96 Aligned_cols=387 Identities=10% Similarity=0.083 Sum_probs=152.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhh----hHHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILES----NEYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) +.-|+.++...++...... .......+- ..........++. .+++...++.++..+.+-|..+ T Consensus 22 ~~~W~~ird~~~G~~~~~~-r~~yl~~~~-------~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~----- 88 (489) T protein:vir:78 22 APKWQKVRHALAGELVSYL-RNVGLNEPD-------KAYGEARQAEYEAGGIVYNFTRRTLSGMVGSVMRKEPEI----- 88 (489) T ss_pred HHHHHHHHHHhcCcccccc-cCCCCCCCC-------CCCChHHHHHHHhccccCChHHHHHHHHhchhhcCCcce----- Confidence 4556666655554321100 000000000 0000011222222 2344455555554444444433 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCC-----------cEEEEEEeCCceeE------ Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTG-----------EPIALWPLDPNTVS------ 139 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g-----------~~~~l~~l~~~~v~------ 139 (435) + ....+..++..---...+-.+|.+.++...+.+|.++++|......+ +|. +..+.|..+- T Consensus 89 ~-~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy-~~~~~~~~IinW~~~~ 166 (489) T protein:vir:78 89 N-IPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPT-IAFYTTENIVNWRLTR 166 (489) T ss_pred e-ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcE-EEEechhhhcCceeee Confidence 1 12334455544455778899999999999999999999887643221 121 2222222210 Q ss_pred -----------E-----EEcC---CCceE---E---------------EEEecCCeeEEEchhheEEecc---------- Q lcl|NC_019456. 140 -----------I-----LRNT---DNNSY---W---------------YRVTSDIYNFTIPINDVIHVKH---------- 172 (435) Q Consensus 140 -----------~-----~~~~---~~~~~---~---------------~~~~~~~~~~~~~~~~iih~~~---------- 172 (435) + ..+. .+... | |....+|.... ....+.|-.. T Consensus 167 v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~-~~~~~~~~~g~~~l~~IPfv 245 (489) T protein:vir:78 167 VGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQE-DVVEIYPDLGESLRGVIPFT 245 (489) T ss_pred eCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCcccc-eeeEEeccCCCCccCeeeEE Confidence 0 0011 11000 0 00001111000 0001111110 Q ss_pred ---CCCccccccCcHHHHHHH-HHHHHHHHHHHHHHHhhcC-CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccC Q lcl|NC_019456. 173 ---VVPSNSWYGVSPIDVLSS-SLKFQRSVENFSQNEMEKK-DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEA 247 (435) Q Consensus 173 ---~~~~~~~~G~s~l~~~~~-~i~~~~~~~~~~~~~~~n~-~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~ 247 (435) ....+...+.+|+..++. .+........+...++.-+ +..++......+++........ ...-+++..+.|+. T Consensus 246 ~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~~~~~~~~~~~~--~i~~g~~~~~~lp~ 323 (489) T protein:vir:78 246 FIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGENLTPQAFKEANPN--GIKFGSRRGHNLGY 323 (489) T ss_pred EEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccCCcccccccCcc--ceeeCCcccccCCC Confidence 111233456777665443 5555555555555554444 4444443333444333221110 11123445566766 Q ss_pred CceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHH---HHHHhHHHHHHHHHHHHhhcc Q lcl|NC_019456. 248 GWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSW---TMTLMPIIRQYESQFNMKLFT 324 (435) Q Consensus 248 g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~---~~~i~P~~~~i~~~l~~~l~~ 324 (435) +.++.-+..+...+. .+..+....+ .+..| ..++- .++.-+.++...-+ ++.|.-++..+++.++..|-. T Consensus 324 ~~~~~~ie~~~~~~~-r~~l~~le~q-m~~lG--a~l~~---~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~ 396 (489) T protein:vir:78 324 GGSAQLIQAGENNLA-RQNMLDKEQQ-AIQIG--AQLIT---PTQQITAQSARIQRGADTSVMATIARNVSQAYTDALRW 396 (489) T ss_pred CCCcceeccCcchHH-HHHHHHHHHH-HHHHh--hhhcc---CCcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 655544444443332 1211111222 22222 12231 11112223222222 455666777777776655221 Q ss_pred cccc--c-Cc--ceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccc Q lcl|NC_019456. 325 PGKR--V-KG--FYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDA 399 (435) Q Consensus 325 ~~~~--~-~g--~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~ 399 (435) -..+ . .+ ..|..+.+-....-.......+.++++.|.++....++.+-.--+.++- + +...+. T Consensus 397 ~a~w~G~~~~~~~~i~~n~dF~~~~~d~~~~~al~~~~~~G~is~~t~~~~L~~~gv~d~~-~-----------e~~~~e 464 (489) T protein:vir:78 397 VAVMLGKPEDTEVEFRLNMDFFLEPMTAQDRAAWMADINAGLLPATAYYAALRKAGVTDWT-D-----------ADIKDA 464 (489) T ss_pred HHHHcCCCCCCceEEEeecccCcccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcc-H-----------HHHHHH Confidence 1111 1 12 2344444433322223346667778899999999988877443232110 0 111111 Q ss_pred cccccccccccccccccCCCCCCCC Q lcl|NC_019456. 400 ILDNKIQTDASVAAPKQEGGENTNE 424 (435) Q Consensus 400 ~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (435) ..+...+.......+-..+..+.+. T Consensus 465 i~~~~~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 465 VADQPLPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred HhhcCCCcccCCcccCCCCcccccC Confidence 1111111100011111111111111 No 245 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=63.81 E-value=0.31 Score=23.37 Aligned_cols=402 Identities=12% Similarity=0.026 Sum_probs=136.1 Q ss_pred CchHHH-----HHhhccccccccccccccchhhhhhcccc----ccCcccccHHHHhhhHHHHHHHHHHHHHHhhC---- Q lcl|NC_019456. 1 MSFMSK-----VRQFFGVHDQANQIVQNPIPQPLDMAGVK----LEQATFSREHILESNEYIFSIVTRLSNVLASL---- 67 (435) Q Consensus 1 Mg~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~---- 67 (435) |....- ++..+..-+...+....-+....++.-.. .+........... .+.--.|++.+|..+-+. T Consensus 1 ~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~-dst~~~a~~~Las~l~~~ltP~ 79 (522) T protein:vir:94 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPW-QAVGARCLNNLAAKLMLALFPQ 79 (522) T ss_pred CcccchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccc-cccHHHHHHHHHHHHHhhcCCC Confidence 443221 11111110110011111111111110000 0000000000111 223334556665555432 Q ss_pred -ceeeee-cc-------c-ccccc----------hHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcE Q lcl|NC_019456. 68 -PLHEYQ-NY-------K-QMDNE----------PLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEP 127 (435) Q Consensus 68 -~~~~~~-~~-------~-~~~~~----------~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~ 127 (435) ||.-.. .+ . ..... ...+..+.+- +.+.=+..++.+++.+|+++.++..+.. |.+ T Consensus 80 ~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~s----nf~~~~~~~~~~L~~~G~a~l~~~~~~~-~~~ 154 (522) T protein:vir:94 80 SPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETN----SFRVPLFEALKQLIVSGNCLLYIPEPEQ-GTY 154 (522) T ss_pred CcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhc----CcHHHHHHHHHHHHhhCcEeEeeeccCC-Cce Confidence 332111 10 0 00000 1111112233 3455566778899999999988876544 444 Q ss_pred E--EEEEeCCceeEEEEcCCCceEE----------------------------------E-EEecCCeeEE--------- Q lcl|NC_019456. 128 I--ALWPLDPNTVSILRNTDNNSYW----------------------------------Y-RVTSDIYNFT--------- 161 (435) Q Consensus 128 ~--~l~~l~~~~v~~~~~~~~~~~~----------------------------------~-~~~~~~~~~~--------- 161 (435) . ..|||. ++-+..+..|...- . .....+.... T Consensus 155 ~~~~~~pl~--~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~g~~~ 232 (522) T protein:vir:94 155 SPMRMYRLV--SYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDDEYLRYEEVEGIEV 232 (522) T ss_pred eeEEEEEcc--eEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEeeCCceeEEeeccCcee Confidence 3 445554 33333444332210 0 0000000000 Q ss_pred --------EchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHH Q lcl|NC_019456. 162 --------IPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVND 231 (435) Q Consensus 162 --------~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~ 231 (435) |..-=.+..|.....+..||.||...+.-.+.....+.+......... +..++.-++........ T Consensus 233 ~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~----- 307 (522) T protein:vir:94 233 TGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLN----- 307 (522) T ss_pred cccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccchhee----- Confidence 011112233333334668999999999999988888777766555443 33444333444443321 Q ss_pred HHHHhcCCCcccccc--CCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHH-HHHHH------- Q lcl|NC_019456. 232 FLRMVKENGGAVVQE--AGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNV-EHVTH------- 301 (435) Q Consensus 232 ~~~~~~~~~~~~vl~--~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~-e~~~~------- 301 (435) .+..+.++-+ +++...++.....-.-..+..+.....|-++|-+..... ......++.+ ..... T Consensus 308 -----~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~-~~~~r~TAtEV~~r~~E~~~~LG 381 (522) T protein:vir:94 308 -----KAATGEFVAGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNSAVQ-RNAERVTAEEIRYVAGELEATLG 381 (522) T ss_pred -----ccCCceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhcc-CCCccccHHHHHHHHHHHHHHHh Confidence 1222223332 334444444332222234455566777888886542211 1111112221 11111 Q ss_pred -----HHHHHHhHHHHHHHHHHHHh-hcccccccCcceeeechhhhhc----cCHHHHHHHHHHHHhcC------CcCHH Q lcl|NC_019456. 302 -----SWTMTLMPIIRQYESQFNMK-LFTPGKRVKGFYFSFNVNGLLR----GDTAARTQYYQTLTRNG------IFKPN 365 (435) Q Consensus 302 -----~~~~~i~P~~~~i~~~l~~~-l~~~~~~~~g~~i~fd~~~l~~----~d~~~~~~~~~~~~~~g------~~t~N 365 (435) +...-+.|++.+.-..+.+. +|++- ......+++ .+.|.. .+......++..+-+.+ .+..+ T Consensus 382 ~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~-p~~~v~v~~-~s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d 459 (522) T protein:vir:94 382 GVYSVQSQELQLPIVRVLMNQLQSAGMIPDL-PKEAVEPTV-STGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLP 459 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC-CcccEEeeE-ecHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHH Confidence 12334455555544444332 33221 111223333 222221 12222222222221111 01222 Q ss_pred H----HHHHhCCCCCCCcCCceeeecccccchhccccccccccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 366 E----IRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 366 E----~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) + +.+.+|.+|. ..+..+.....+-. .+... ...+... ...++.-+-.-++...++-++ T Consensus 460 ~~~~~~a~~~Gv~~~-----~ivr~~ee~~~~~~---q~~~~--~~~~~~~--~~~~~~~~a~~~~~~~~~~~~ 521 (522) T protein:vir:94 460 TLKLRLLNALGIDTA-----GLLLTQDEKIQRMA---EQSSQ--QAVVQGA--SAAGANMGAAVGQGAGEDMAQ 521 (522) T ss_pred HHHHHHHHHcCCChh-----hccCCHHHHHHHHH---HHHHH--HHHHHHH--HHHHHHhhhhhhcccchhhhc Confidence 2 2233344321 00000000000000 00000 0000000 000111000001111111111 No 246 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=59.04 E-value=0.4 Score=22.76 Aligned_cols=404 Identities=12% Similarity=0.088 Sum_probs=144.4 Q ss_pred Cch--HHHHHhhccccccccccccccchhhhhhcc----ccccCccccc---HHHHhhhHHHHHHHHHHHHHHhhC--c- Q lcl|NC_019456. 1 MSF--MSKVRQFFGVHDQANQIVQNPIPQPLDMAG----VKLEQATFSR---EHILESNEYIFSIVTRLSNVLASL--P- 68 (435) Q Consensus 1 Mg~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~---~~~~~~~~~v~~~i~~ia~~ia~~--~- 68 (435) |.- .++++..+..-+.........+.+..++.- .......... .... -.+.-..|++.+|..+-.. | T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~-~dst~~~a~~~Las~l~~~ltpp 79 (559) T protein:vir:95 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRI-IDSTGTMAARTLASGMMSGITSP 79 (559) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCccccccccc-ccchHHHHHHHHHHHHHHhhcCC Confidence 654 334443332222222222222333332210 0000000000 0001 1223334555565555432 3 Q ss_pred ---e-eeeecccccc------------cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEE Q lcl|NC_019456. 69 ---L-HEYQNYKQMD------------NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWP 132 (435) Q Consensus 69 ---~-~~~~~~~~~~------------~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~ 132 (435) | ++.-.+.... ++.++.. +.+-| .+.-+..++.+++++|++.+++..+.. ++..+.+ T Consensus 80 ~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~-l~~sn----f~~~~~~~~~~L~~~Gta~l~~~~d~~--~~~r~~~ 152 (559) T protein:vir:95 80 ARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDM-FNKSN----LYQSLPQLYGSLGTYSTGAMAVLDDDE--DIIRTMP 152 (559) T ss_pred CCcccccccCCccccchHHHHHHHHHHHHHHHHH-HHhcC----cHHHHHHHHHHHHhhCceeeEeecCCC--ceeEEEE Confidence 2 3221111111 1112222 22333 444456678899999999988876543 3455555 Q ss_pred eCCceeEEEEcCCCceEEEEEe--------------------------cCC--eeEE-----Ec---------------- Q lcl|NC_019456. 133 LDPNTVSILRNTDNNSYWYRVT--------------------------SDI--YNFT-----IP---------------- 163 (435) Q Consensus 133 l~~~~v~~~~~~~~~~~~~~~~--------------------------~~~--~~~~-----~~---------------- 163 (435) ++..++-+..+..|..-.++.. .+. ...+ +| T Consensus 153 ~~l~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~pf 232 (559) T protein:vir:95 153 FPIGSYYLANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPF 232 (559) T ss_pred eecCeEEEeeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEeccccccccccccccceE Confidence 6656665555555432211100 000 0000 00 Q ss_pred --------hh--he-----------EEeccCCCccccccCc-HHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcCC Q lcl|NC_019456. 164 --------IN--DV-----------IHVKHVVPSNSWYGVS-PIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSIS 221 (435) Q Consensus 164 --------~~--~i-----------ih~~~~~~~~~~~G~s-~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~~ 221 (435) .+ .+ +-.|.....+..||.| |...+.-.+.....+.+.............+.++.... T Consensus 233 ~s~~~e~~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~ 312 (559) T protein:vir:95 233 KSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLK 312 (559) T ss_pred EEEEEEecCCCceeeecCCcccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCceecccccc Confidence 00 01 1111111235689999 89988888877777776655544444333333333321 Q ss_pred HHHHHHHHHHHHHHhcCCCccccccC---CceeeeccC-ChhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCcccC-cccH Q lcl|NC_019456. 222 PEKRQAMVNDFLRMVKENGGAVVQEA---GWKVDRYES-KFEPADLSSVEQISRIRIATAFNVPISFL-NDDQAK-STTN 295 (435) Q Consensus 222 ~e~~~~~~~~~~~~~~~~~~~~vl~~---g~~~~~~~~-~~~~~~~~e~~~~~~~~Ia~~fgvP~~~l-g~~~~~-~~~~ 295 (435) ....+ -..|++.+.+. .-.++++.. ++......+..+.....|-++|-+.+.+. +..... -++. T Consensus 313 ~~~~~----------l~pgg~~~~~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAt 382 (559) T protein:vir:95 313 NQRAS----------LLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVE 382 (559) T ss_pred cccee----------eeccceeeeCCCCCcccceeecccccchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHH Confidence 11111 11333332221 123444422 33222223334456788999998866433 221111 1222 Q ss_pred H-----HHH--------HHHHHHHHhHHHHHHHHHHHHh-hcccc-cccCcceeeechhhhh-cc----CH---HHHHHH Q lcl|NC_019456. 296 V-----EHV--------THSWTMTLMPIIRQYESQFNMK-LFTPG-KRVKGFYFSFNVNGLL-RG----DT---AARTQY 352 (435) Q Consensus 296 ~-----e~~--------~~~~~~~i~P~~~~i~~~l~~~-l~~~~-~~~~g~~i~fd~~~l~-~~----d~---~~~~~~ 352 (435) + ++. ..+....+.|++.+.-..+.+. .|++. ....+..+++.+...+ +. +. ...++. T Consensus 383 EV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~ 462 (559) T protein:vir:95 383 AVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNF 462 (559) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHHHHHHHH Confidence 1 110 1122345667776666666554 23321 1112333333333222 21 11 111122 Q ss_pred HHHHHhcC-----CcCHHHHH----HHhCCCCCCCcCCceeeecccccchhcccccccc-----c----ccccccccccc Q lcl|NC_019456. 353 YQTLTRNG-----IFKPNEIR----ELEGQAPIPDEAADHLYISKDLYPLDKYYDAILD-----N----KIQTDASVAAP 414 (435) Q Consensus 353 ~~~~~~~g-----~~t~NE~R----~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~-----~----~~~~~~~~~~~ 414 (435) +..+-+.+ .+..+++- +.+|.|+ +.+...-....+...-..+.. . .....+ ..++ T Consensus 463 ~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~------~~irs~~ev~~~rqqr~~~qq~~q~~~~~~~aa~~~~-~~~~ 535 (559) T protein:vir:95 463 IGQLAQVKPEALDKLNVDQAIDAFADMSGVSP------TVIVPQEQVEQARQQRAQQQQQQQMMAMGMAAAQGVK-TLSE 535 (559) T ss_pred HHHHhccChhhhhcCCHHHHHHHHHHHhCCch------hhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-cccc Confidence 22221111 13334432 3344442 111111111111000000000 0 000000 0000 Q ss_pred ccCCCCC---------CCCCCCCC Q lcl|NC_019456. 415 KQEGGEN---------TNENGLQS 429 (435) Q Consensus 415 ~~~~~~~---------~~~~~~~~ 429 (435) ....+.+ +..-++.. T Consensus 536 ~~~~~~~~l~~~~~~~~~~~~~~~ 559 (559) T protein:vir:95 536 AKTSDPSVLSAMANAVSGQGGQSQ 559 (559) T ss_pred ccCCChhHHHHHHHhhcCccccCC Confidence 0000000 00000011 No 247 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=57.98 E-value=0.42 Score=22.63 Aligned_cols=417 Identities=10% Similarity=0.022 Sum_probs=166.1 Q ss_pred CchHHHHHhhccccccccccccccchh------------hhhh-ccccccCccc--ccHHHHhhhHHHHHHHHHHHHHHh Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQ------------PLDM-AGVKLEQATF--SREHILESNEYIFSIVTRLSNVLA 65 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~-~~~~~~~~~~--~~~~~~~~~~~v~~~i~~ia~~ia 65 (435) |+-=.+- ..+.++.-.-....+-. +.++ .+........ ...+.-...|.-...|+..+.-+ T Consensus 1 m~~~~~q---~~p~~~~fp~~~a~wV~~~D~~RlaaY~ly~d~y~n~~~el~~il~G~dr~~~~~ps~r~~V~~~~~~L- 76 (563) T protein:vir:74 1 MPYNHKQ---YDPAKPFLRGGDDNIVDENDKNRVRAYDLYENIYLNSAETLKLVLRGDDSVPILMPSGRKIVEAVHRFL- 76 (563) T ss_pred CCccccc---cCCCcccccccccccCCHHHHHHHHHHHHHHHhhcCchhhhhhhcCCCceeeeccchHHHHHHHHHHhc- Confidence 5441110 00000000000000000 0000 0000000000 00000111222334455555444 Q ss_pred hCcee--eeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCC--CcEEEEEEeCCceeEEE Q lcl|NC_019456. 66 SLPLH--EYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLST--GEPIALWPLDPNTVSIL 141 (435) Q Consensus 66 ~~~~~--~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~--g~~~~l~~l~~~~v~~~ 141 (435) .-+++ |-...+........+-++..--..........+...+.++-|++...+.+|... |.=..+..++|...... T Consensus 77 g~~~~~~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R~rv~~vDP~~~fp~ 156 (563) T protein:vir:74 77 GVGFDYLVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGERISVDEVDPRQIFLI 156 (563) T ss_pred CCCcEEecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCCceEeecCCceeeec Confidence 33333 322221111112233344455566667788888889999999999999987532 22234555555544333 Q ss_pred EcCCCce---------------------E-----EEEEecCCee------------------E-----EE---------c Q lcl|NC_019456. 142 RNTDNNS---------------------Y-----WYRVTSDIYN------------------F-----TI---------P 163 (435) Q Consensus 142 ~~~~~~~---------------------~-----~~~~~~~~~~------------------~-----~~---------~ 163 (435) .+++... . .|...+.+.. + .+ . T Consensus 157 ~dpd~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~~~~~~~~~~~~~~~~ 236 (563) T protein:vir:74 157 EDGSTVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGAISDEQARRKEQVRSA 236 (563) T ss_pred cCCCCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccccccCccchhhhcccchhhhh Confidence 2222110 0 0000011100 0 00 0 Q ss_pred hh--------h------eEEeccCCCccccccCcHHHHHHHHHHHHHHH-HHHHHHHhhcC-CceEEEeCCcCCHHHHHH Q lcl|NC_019456. 164 IN--------D------VIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSV-ENFSQNEMEKK-DKFVLQYDRSISPEKRQA 227 (435) Q Consensus 164 ~~--------~------iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~-~~~~~~~~~n~-~~~~~~~~~~~~~e~~~~ 227 (435) .. + |.|++...+.+..+|.|-|.-+...+...+.. ....+..--.| +-.++......+. ... T Consensus 237 ~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~~p~d~--~~g 314 (563) T protein:vir:74 237 QHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNASAPVDP--NTG 314 (563) T ss_pred hhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEecccccccc--ccc Confidence 01 1 66777766778899999988887777665432 22222222223 3233332111110 110 Q ss_pred HHHHHHHHhcCCCccccccC---CceeeeccCChhhHHHHHHHHHHH-HHHHHHhCCCHHHhCCcccCc----ccHHHHH Q lcl|NC_019456. 228 MVNDFLRMVKENGGAVVQEA---GWKVDRYESKFEPADLSSVEQISR-IRIATAFNVPISFLNDDQAKS----TTNVEHV 299 (435) Q Consensus 228 ~~~~~~~~~~~~~~~~vl~~---g~~~~~~~~~~~~~~~~e~~~~~~-~~Ia~~fgvP~~~lg~~~~~~----~~~~e~~ 299 (435) ....|. -+.|.++-|++ +..+..++..++-..+..-.++.. +.|+..-++|..-+|-.+.+. ++-.=++ T Consensus 315 ~~~~w~---vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~~~~SGiALeL~L 391 (563) T protein:vir:74 315 ELTDWN---IGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVTSAESGISLELQL 391 (563) T ss_pred cccccc---cCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeecccccccccchhhhhhhh Confidence 011121 23344544542 346777776654433333233333 367888899999999555432 1111011 Q ss_pred HHHH----HHH--HhHHHHHHHHHHHHhhcccccc--------------cCcc--eeeechhhhhccCHHHHHHHHHHHH Q lcl|NC_019456. 300 THSW----TMT--LMPIIRQYESQFNMKLFTPGKR--------------VKGF--YFSFNVNGLLRGDTAARTQYYQTLT 357 (435) Q Consensus 300 ~~~~----~~~--i~P~~~~i~~~l~~~l~~~~~~--------------~~g~--~i~fd~~~l~~~d~~~~~~~~~~~~ 357 (435) .-.. +.. |.--++++-..+.+.||+-.+. ..+. .+..-+.+.+..|.....+....++ T Consensus 392 ~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~~P~d~~~vv~~~~tl~ 471 (563) T protein:vir:74 392 KPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADPMPVNKTQVTQDTLLLQ 471 (563) T ss_pred hHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCCCCccHHHHHHHHHHHH Confidence 0000 000 1222222222333333321111 0111 1233456778889999999999999 Q ss_pred hcCCcCHHHHHHHh---CCCCCCCcCCceeeecccccchhcccc---ccccccccccccccccccCCCCCCCCCCC--CC Q lcl|NC_019456. 358 RNGIFKPNEIRELE---GQAPIPDEAADHLYISKDLYPLDKYYD---AILDNKIQTDASVAAPKQEGGENTNENGL--QS 429 (435) Q Consensus 358 ~~g~~t~NE~R~~~---g~~p~~~~~gd~~~~~~n~~~l~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 429 (435) ++|+++.--+=++| |++- | ..+.-. +-.....+.+ .+.....+.+. ..-.+||-.+.+..+ .| T Consensus 472 ~aGiiSretAv~~L~~~g~~~-p--dae~e~---~~ie~~~i~~~~~a~a~ad~~~~~---~a~~~~g~~~~~~dd~g~p 542 (563) T protein:vir:74 472 QAHLILRKMAVAKLRSIGWEY-P--EVDDQG---NALTDDDIADMLLAEAEADASLGL---SAMDNGGAGEQQFDDQGNP 542 (563) T ss_pred HcCchhHHHHHHHHHhCCCCC-C--cHHHHH---hhcCHHHHHHHHHHHhhccCcccc---eecccCCCCcccccccCCc Confidence 99999998886776 6542 2 111111 1111111111 00000000000 000111111111100 11 Q ss_pred CCCCCC Q lcl|NC_019456. 430 TEPEGS 435 (435) Q Consensus 430 ~~~~~~ 435 (435) .++-|+ T Consensus 543 ~~~~~~ 548 (563) T protein:vir:74 543 IDQFGN 548 (563) T ss_pred hhHcCC Confidence 122222 No 248 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=57.93 E-value=0.42 Score=22.63 Aligned_cols=388 Identities=12% Similarity=0.026 Sum_probs=164.6 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhC------------- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASL------------- 67 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~------------- 67 (435) |+-=.+- .+.. .+.......+-+ .+...--.+..+..--.++-.+..+.+ T Consensus 1 ~~~~~~~---~~~~----~~~~~g~~~~p~----------~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r 63 (527) T protein:vir:10 1 MGQDKRQ---YGST----QQLRAGEANFPN----------AVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQR 63 (527) T ss_pred CCccccc---cCCC----cCcCCccccCcc----------cCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccc Confidence 5542111 0111 111000000000 011000000000000001111111111 Q ss_pred ce---------------eeee-----cccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCC--C Q lcl|NC_019456. 68 PL---------------HEYQ-----NYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLST--G 125 (435) Q Consensus 68 ~~---------------~~~~-----~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~--g 125 (435) || .+.. ......-..++..+..+ .+.....++...+.++-|++...+.+|... | T Consensus 64 ~~~~ps~~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~----e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~ 139 (527) T protein:vir:10 64 PIYVPNGEKLIEAKMRFLGQGLKWEFSKKDAKVDDAIKVLFDR----ENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEG 139 (527) T ss_pred eeeehhhHHhhCCcceeeccCccccccchhHHHHHHHHHHHHH----hhhHHHHHHHHHhhhhhcceeEEEeeccCCCcC Confidence 11 1100 00001112223333333 456667788888889999999999987532 1 Q ss_pred cEEEEEEeCCceeEEEEcCCCceE---EEEE-------------------------ecCC-----eeEE----------- Q lcl|NC_019456. 126 EPIALWPLDPNTVSILRNTDNNSY---WYRV-------------------------TSDI-----YNFT----------- 161 (435) Q Consensus 126 ~~~~l~~l~~~~v~~~~~~~~~~~---~~~~-------------------------~~~~-----~~~~----------- 161 (435) .=..+..+||..+.+..++++..+ ++.. .+.| +... T Consensus 140 ~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w 219 (527) T protein:vir:10 140 SRLSLHEVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKW 219 (527) T ss_pred CCceEeecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeecccc Confidence 224666777666665555533211 0000 0000 0000 Q ss_pred ----------------------------EchhheEEeccCCCccccccCcHHHHHHHHHHHHHH-HHH-HHHHHhhcCCc Q lcl|NC_019456. 162 ----------------------------IPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRS-VEN-FSQNEMEKKDK 211 (435) Q Consensus 162 ----------------------------~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~-~~~-~~~~~~~n~~~ 211 (435) +.-==|+||+..++.+...|.|-|.-+...+..... +.. .....|...+. T Consensus 220 ~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi 299 (527) T protein:vir:10 220 DDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGF 299 (527) T ss_pred ccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCce Confidence 001125677776677888999988866665554432 222 22223333233 Q ss_pred eEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccC Q lcl|NC_019456. 212 FVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAK 291 (435) Q Consensus 212 ~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~ 291 (435) .+++.-... +.+ -+.....-..|.++=|+++.++..++..+.-..+........+.|+..-++|.+-+|..+.+ T Consensus 300 ~~~tg~~~v---d~~---G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s 373 (527) T protein:vir:10 300 YATDSAPPR---DSR---GNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAA 373 (527) T ss_pred eeecccccc---ccc---CCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCC Confidence 232211111 111 00001112345566688999999988877666677778888889999999999999966655 Q ss_pred cccHHHHHHHHHHHHHhHHHH--------------HHHH-HHHH-----hhcccccccCcceeeechhhhhccCHHHHHH Q lcl|NC_019456. 292 STTNVEHVTHSWTMTLMPIIR--------------QYES-QFNM-----KLFTPGKRVKGFYFSFNVNGLLRGDTAARTQ 351 (435) Q Consensus 292 ~~~~~e~~~~~~~~~i~P~~~--------------~i~~-~l~~-----~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~ 351 (435) +--+.-+. .-.+.|++. ++.. ++.+ ..+..........+...+...+..|.+..++ T Consensus 374 ~~~SG~AL----eL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie 449 (527) T protein:vir:10 374 VAESGIAL----DLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKRFN 449 (527) T ss_pred cCcHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHH Confidence 42221111 111222211 1111 0000 0111111111112334445667889999999 Q ss_pred HHHHHHhcCCcCHHHHHHHh----CCCCCCCcCCc--eeeecccccchhccccccccccccccccccccccCCCCCCCCC Q lcl|NC_019456. 352 YYQTLTRNGIFKPNEIRELE----GQAPIPDEAAD--HLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNEN 425 (435) Q Consensus 352 ~~~~~~~~g~~t~NE~R~~~----g~~p~~~~~gd--~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (435) ...+++.+|+++.-=+-++| |... +..+ +..--.-...+. .+++...-...-+++-+ - T Consensus 450 ~v~tL~~aGi~S~~tAv~~L~~~~g~eD---~E~E~~~I~~era~~a~a-----------~a~A~~~~~a~~~~~~g--~ 513 (527) T protein:vir:10 450 QLLQLWEAGLIPAKKLTEELSKIMGFEL---TEEDFKQATEDKKTQGIA-----------QAEAADPFGAQMAAEQG--I 513 (527) T ss_pred HHHHHHHcCchhHHHHHHHHHhccCCCC---hHHHHHHHHHHHHHHhHH-----------hhhhcCchhhhhccccC--C Confidence 99999999999999887777 4222 2222 110000000000 00000000000000000 0 Q ss_pred CCCCCCCCCC Q lcl|NC_019456. 426 GLQSTEPEGS 435 (435) Q Consensus 426 ~~~~~~~~~~ 435 (435) ++...++-|+ T Consensus 514 ~~~~~d~~~~ 523 (527) T protein:vir:10 514 PDEEDDQALN 523 (527) T ss_pred CCCCcccccC Confidence 0111111122 No 249 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=57.77 E-value=0.43 Score=22.61 Aligned_cols=388 Identities=12% Similarity=0.026 Sum_probs=164.6 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhhHHHHHHHHHHHHHHhhC------------- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESNEYIFSIVTRLSNVLASL------------- 67 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~------------- 67 (435) |+-=.+- .+.. .+.......+-+ .+...--.+..+..--.++-.+..+.+ T Consensus 1 ~~~~~~~---~~~~----~~~~~g~~~~p~----------~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r 63 (527) T protein:vir:10 1 MGQDKRQ---YGST----QQLRAGEANFPN----------AVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQR 63 (527) T ss_pred CCccccc---cCCC----cCcCCccccCcc----------cCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccc Confidence 5542111 0111 111000000000 011000000000000001111111111 Q ss_pred ce---------------eeee-----cccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCC--C Q lcl|NC_019456. 68 PL---------------HEYQ-----NYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLST--G 125 (435) Q Consensus 68 ~~---------------~~~~-----~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~--g 125 (435) || .+.. ......-..++..+..+ .+.....++...+.++-|++...+.+|... | T Consensus 64 ~~~~ps~~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~----e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~ 139 (527) T protein:vir:10 64 PIYVPNGEKLIEAKMRFLGQGLKWEFSKKDAKVDDAIRVLFDR----ENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEG 139 (527) T ss_pred eeeehhhHHhhCCcceeeccCccccccchhHHHHHHHHHHHHH----hhhHHHHHHHHHhhhhhcceeEEEeeccCCCcC Confidence 11 1100 00001112223333333 456667788888889999999999987532 1 Q ss_pred cEEEEEEeCCceeEEEEcCCCceE---EEEE-------------------------ecCC-----eeEE----------- Q lcl|NC_019456. 126 EPIALWPLDPNTVSILRNTDNNSY---WYRV-------------------------TSDI-----YNFT----------- 161 (435) Q Consensus 126 ~~~~l~~l~~~~v~~~~~~~~~~~---~~~~-------------------------~~~~-----~~~~----------- 161 (435) .=..+..+||..+.+..++++..+ ++.. .+.| +... T Consensus 140 ~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w 219 (527) T protein:vir:10 140 SRLSLHEVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKW 219 (527) T ss_pred CCceEeecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeecccc Confidence 224666777666665555533211 0000 0000 0000 Q ss_pred ----------------------------EchhheEEeccCCCccccccCcHHHHHHHHHHHHHH-HHH-HHHHHhhcCCc Q lcl|NC_019456. 162 ----------------------------IPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRS-VEN-FSQNEMEKKDK 211 (435) Q Consensus 162 ----------------------------~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~-~~~-~~~~~~~n~~~ 211 (435) +.-==|+||+..++.+...|.|-|.-+...+..... +.. .....|...+. T Consensus 220 ~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi 299 (527) T protein:vir:10 220 DDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGF 299 (527) T ss_pred ccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCce Confidence 001125677776677888999988866665554432 222 22223333233 Q ss_pred eEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccC Q lcl|NC_019456. 212 FVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAK 291 (435) Q Consensus 212 ~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~ 291 (435) .+++.-... +.+ -+.....-..|.++=|+++.++..++..+.-..+........+.|+..-++|.+-+|..+.+ T Consensus 300 ~~~tg~~~v---d~~---G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s 373 (527) T protein:vir:10 300 YATDSAPPR---DSR---GNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAA 373 (527) T ss_pred eeecccccc---ccc---CCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCC Confidence 232211111 111 00001112345566688999999988877666677778888889999999999999966655 Q ss_pred cccHHHHHHHHHHHHHhHHHH--------------HHHH-HHHH-----hhcccccccCcceeeechhhhhccCHHHHHH Q lcl|NC_019456. 292 STTNVEHVTHSWTMTLMPIIR--------------QYES-QFNM-----KLFTPGKRVKGFYFSFNVNGLLRGDTAARTQ 351 (435) Q Consensus 292 ~~~~~e~~~~~~~~~i~P~~~--------------~i~~-~l~~-----~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~ 351 (435) +--+.-+. .-.+.|++. ++.. ++.+ ..+..........+...+...+..|.+..++ T Consensus 374 ~~~SG~AL----eL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie 449 (527) T protein:vir:10 374 VAESGIAL----DLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKRFA 449 (527) T ss_pred cCcHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHH Confidence 42221111 111222211 1111 0000 0111111111112334445667889999999 Q ss_pred HHHHHHhcCCcCHHHHHHHh----CCCCCCCcCCc--eeeecccccchhccccccccccccccccccccccCCCCCCCCC Q lcl|NC_019456. 352 YYQTLTRNGIFKPNEIRELE----GQAPIPDEAAD--HLYISKDLYPLDKYYDAILDNKIQTDASVAAPKQEGGENTNEN 425 (435) Q Consensus 352 ~~~~~~~~g~~t~NE~R~~~----g~~p~~~~~gd--~~~~~~n~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (435) ...+++.+|+++.-=+-++| |... +..+ +..--.-...+..+ +....-+.+.. .++|-.. T Consensus 450 ~v~tL~~aGiiS~etAv~~L~~~~g~eD---~E~E~~~I~~era~~a~a~a-~a~~~~~a~~~-------~~~g~~~--- 515 (527) T protein:vir:10 450 QLLELWEAGLIPAKKLTEELSKIMGFEL---TEEDFRQATEDKKTQGIAQA-EAADPFGAQMA-------AEQGIPD--- 515 (527) T ss_pred HHHHHHHcCchhHHHHHHHHHhccCCCc---hHHHHHHHHHHHHHHhHHhh-hhcCchhhhhc-------cccCCCC--- Confidence 99999999999999887777 4222 2222 11000000000000 00000000000 0011111 Q ss_pred CCCCCCCCCC Q lcl|NC_019456. 426 GLQSTEPEGS 435 (435) Q Consensus 426 ~~~~~~~~~~ 435 (435) ...++-|+ T Consensus 516 --~~~d~~~~ 523 (527) T protein:vir:10 516 --EEDDQALN 523 (527) T ss_pred --CCcccccC Confidence 11111122 No 250 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=54.63 E-value=0.5 Score=22.24 Aligned_cols=389 Identities=10% Similarity=0.053 Sum_probs=153.0 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhh----hHHHHHHHHHHHHHHhhCceeeeeccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILES----NEYIFSIVTRLSNVLASLPLHEYQNYK 76 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~v~~~i~~ia~~ia~~~~~~~~~~~ 76 (435) +.-|+.++...++.... .......+.+- ..........++. .+++...++.++..+.+-|..+ T Consensus 22 ~~~W~~ird~~~G~~~~-~~r~~yl~~~~-------~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~----- 88 (491) T protein:vir:95 22 APKWQKVRHALAGDLVG-YLRNVGLNEPD-------KAYGEARQAEYEAGGIVYNFTRRTLSGMVGSVMRKEPEI----- 88 (491) T ss_pred HHHHHHHHHHhcCcchh-hcccCCCcCCC-------CCCCHHHHHHHHhcccCCChHHHHHHHHhchhhcCCcee----- Confidence 44456555555442110 00111110000 0000011122222 2344455555555554444443 Q ss_pred ccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCC-----------cEEEEEEeCCceeE------ Q lcl|NC_019456. 77 QMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTG-----------EPIALWPLDPNTVS------ 139 (435) Q Consensus 77 ~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g-----------~~~~l~~l~~~~v~------ 139 (435) + ....+..++..---...+-.+|.+.++...+.+|.++++|......+ +|. +..+.|..+- T Consensus 89 ~-~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy-~~~~~~~~IinW~~~~ 166 (491) T protein:vir:95 89 N-IPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLNPT-IAFYTTENIVNWRLTR 166 (491) T ss_pred e-ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCCcE-EEEechhhhcCceeee Confidence 1 12334445544455778899999999999999999999887543322 111 2222222110 Q ss_pred ----------------EEEc---CCCce---EEEEE---------------ecCCeeEEEchhheEEecc---------- Q lcl|NC_019456. 140 ----------------ILRN---TDNNS---YWYRV---------------TSDIYNFTIPINDVIHVKH---------- 172 (435) Q Consensus 140 ----------------~~~~---~~~~~---~~~~~---------------~~~~~~~~~~~~~iih~~~---------- 172 (435) ...+ ..+.. .|+.. ..++... ....+++|-.. T Consensus 167 v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~-~~~~~~~~~~g~~~l~~IPfv 245 (491) T protein:vir:95 167 VGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEGGAQ-EEVVEIYPDLGESLRGVIPFT 245 (491) T ss_pred eCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCCcce-eeeeeeeecCCCcccCeeEEE Confidence 0011 11110 00000 0011100 00111222111 Q ss_pred ---CCCccccccCcHHHHHHH-HHHHHHHHHHHHHHHhhcC-CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccC Q lcl|NC_019456. 173 ---VVPSNSWYGVSPIDVLSS-SLKFQRSVENFSQNEMEKK-DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEA 247 (435) Q Consensus 173 ---~~~~~~~~G~s~l~~~~~-~i~~~~~~~~~~~~~~~n~-~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~ 247 (435) ....+...+.+|+..++. .+........+...++.-+ |..++......+++..+.... ....-+++..+.++. T Consensus 246 ~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~~~~~~~~~~--~~i~~g~~~~~~lP~ 323 (491) T protein:vir:95 246 FIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDNLTPQSFKEANP--NGIKFGSRCGHNLGY 323 (491) T ss_pred EEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCcccCcchhhccCc--ceeEecCcCCcCCCC Confidence 112234457777765443 5555555555554444444 444444433444444332111 011123344566666 Q ss_pred CceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHH---HHHHhHHHHHHHHHHHHhhcc Q lcl|NC_019456. 248 GWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSW---TMTLMPIIRQYESQFNMKLFT 324 (435) Q Consensus 248 g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~---~~~i~P~~~~i~~~l~~~l~~ 324 (435) +.++.-+......+. .+..+....+ ++..|. .++- .++.-+.++...-+ ++.|.-++.++++.++..|=. T Consensus 324 ~~~~~~ie~~~~~~~-~~~l~~~e~q-m~~~Ga--~l~~---~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~ 396 (491) T protein:vir:95 324 GGSAQLIQAGENNLA-RQNMLDKEQQ-AIQIGA--QLIT---PSQQITAESARIQRGADTSVMATIARNVSQAYTDALRW 396 (491) T ss_pred CCccceeecCcchHH-HHHHHHHHHH-HHHHHH--Hhcc---CCcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 655555554433332 1111111111 222221 2221 11112223222211 345566666777666654211 Q ss_pred --cccc---cCcceeeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccchhccccc Q lcl|NC_019456. 325 --PGKR---VKGFYFSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYPLDKYYDA 399 (435) Q Consensus 325 --~~~~---~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~ 399 (435) .... .....|..+.+-....-......++-++++.|.++.-..++.+-.--+.+.-.++. ++.+.+. T Consensus 397 ~a~w~G~~~~~~v~i~~n~dF~~~~~~~~~~~all~~~~~G~is~~t~~~~L~~~~vl~~~~e~~--------~~~ie~~ 468 (491) T protein:vir:95 397 VAMMLGKPEDSEVEFQLNMDFFLQPMTAQDRAAWMADINAGLLPATAYYAALRKAGVTDWTDEDI--------LNAIEDA 468 (491) T ss_pred HHHHcCCCCCCceEEEeecccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCccHHHH--------HHHHHhc Confidence 1111 11223444555433332233567777788899999999888764322221101111 0111110 Q ss_pred cccccccccccccccccCCCCCCCCCCCCCCC Q lcl|NC_019456. 400 ILDNKIQTDASVAAPKQEGGENTNENGLQSTE 431 (435) Q Consensus 400 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (435) ... .....+...+.++... +..+ T Consensus 469 ~~~-----~~~~~~~~~~~~~~~~----~~~~ 491 (491) T protein:vir:95 469 PLP-----SGAVTQVAGEIPQAAQ----QQQE 491 (491) T ss_pred CCC-----CCccccccccchhhhh----hccC Confidence 000 0000111111111111 1111 No 251 >protein:vir:572 Length: 506 # NCBI annotation: unknown # Family: family:all:6660 # MgeID: mge:13 # MgeName: SPBc2 # Cross-refs: genbank:acc:NP_046607;genbank:gi:9630180;genbank:GeneID:1261432 Probab=50.76 E-value=0.6 Score=21.80 Aligned_cols=394 Identities=15% Similarity=0.183 Sum_probs=135.6 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccH----HHHh--hhHHHHHHHHHHHHHHhhCce---ee Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSRE----HILE--SNEYIFSIVTRLSNVLASLPL---HE 71 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~--~~~~v~~~i~~ia~~ia~~~~---~~ 71 (435) =||...+.+-.-... ........+| .- +..+... -.|. ..+.|+... +.|-.+|- ++ T Consensus 30 ~GFi~~~~~NG~v~~----i~~~~L~~~F---~N---PD~~~~~I~~L~~Y~YI~~~~i~QL~----~LI~aLP~L~Y~I 95 (506) T protein:vir:57 30 SGFISNMFSNGIVTE----IEAEQLKNYF---SN---PDEFQEEIEDLAQYFYISTAEIHQLF----ELIEALPTLNYKI 95 (506) T ss_pred HHHHHHhhcCCceee----eeHHHHHhhh---cC---hHHHHHHHHHHHHHhhhhcchHHHHH----HHHHhcCCcceee Confidence 223222221100000 0000011111 00 0000000 0011 112233333 33445553 22 Q ss_pred --eecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCC---------------------CCcEE Q lcl|NC_019456. 72 --YQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLS---------------------TGEPI 128 (435) Q Consensus 72 --~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~---------------------~g~~~ 128 (435) ..+.+.-..| ..+++..-.. .-..++-+.+..++...|.-. -.|-+. .|+.+ T Consensus 96 ~~~~k~K~~~~~--iS~lN~~L~K-v~HK~LTRDLL~Q~A~aGTLv--G~WLG~~k~PY~~iF~~iKYVFP~~R~~G~~V 170 (506) T protein:vir:57 96 DSFNKVKSSDKH--ISLLNKSLHK-VKHKRLTRDLLKQVATAGTLV--GIWLGDAKSPYPFIFDEIKYVFPSFRRNGDWV 170 (506) T ss_pred hhhhhccchhhH--HHHHHHHHHH-HHHHHHHHHHHHHhhccCcee--EeeecCCCCcchhhhhhhhhhccccccCCceE Confidence 2222222222 1222211111 223344444444444444311 111111 11111 Q ss_pred EEEEeCCceeEEEEcCCCce----EEEEEecCCe-----------eEEEchhheEEeccC-CCccccccCcHHHHHHHHH Q lcl|NC_019456. 129 ALWPLDPNTVSILRNTDNNS----YWYRVTSDIY-----------NFTIPINDVIHVKHV-VPSNSWYGVSPIDVLSSSL 192 (435) Q Consensus 129 ~l~~l~~~~v~~~~~~~~~~----~~~~~~~~~~-----------~~~~~~~~iih~~~~-~~~~~~~G~s~l~~~~~~i 192 (435) .. ++-..++...+..... +...+..+.. ...+|.+.-+..|.. ...+.-.|.|...+..-.+ T Consensus 171 ~V--vD~~~F~~~~~~~R~~~~~~LSP~I~~~~Y~~~~~~~~~~R~~~LP~~rT~~~R~~TL~RNQ~LG~~~~T~~L~Dv 248 (506) T protein:vir:57 171 CV--VDMELFTKYKDDQRNELLKSLSPYIKQSDYENFMKDREKYRFKELPQERTFPLRTGTLKRNQGLGTSWVTPGLYDV 248 (506) T ss_pred EE--EehHHhhhhhHHHHHHHHHhhhhhhhhhhhhhHhhhHHhhhhhhcccccchhheeeeecccccccccccchhHHHH Confidence 11 1111111000000000 0000000000 001122222211111 0123334556555554444 Q ss_pred HHHHHHHHHHHHHhhcC--CceEEEeCCcCC-----H-HHHHHHHHHHHHHhcC------CCcccc--ccCCceeeeccC Q lcl|NC_019456. 193 KFQRSVENFSQNEMEKK--DKFVLQYDRSIS-----P-EKRQAMVNDFLRMVKE------NGGAVV--QEAGWKVDRYES 256 (435) Q Consensus 193 ~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~-----~-e~~~~~~~~~~~~~~~------~~~~~v--l~~g~~~~~~~~ 256 (435) ..........++....- ...++++.++-. + .--++.++....+++. ..++.+ +++=++++--.. T Consensus 249 ~HK~KLkD~E~SIA~KII~A~AVL~~~~~~~Ngeyt~~K~~~a~K~Ki~~GVK~ALEK~~KDGv~~vs~PDFA~~~FP~v 328 (506) T protein:vir:57 249 LHKKKLKDVERSIANKIINAVAVLTIGTDKGNGEYTNMKLPKAVKQKIHGGVKTALEKNQKDGVTVVSIPDFADINFPDV 328 (506) T ss_pred HHHHHHHHHHHHHHHHHhhhheeeeeecccCCcccccccchHHHHHHHHHHHHHHHhcccccCeEEEecccccccccccc Confidence 44444333333332221 334555433211 1 1123333444333321 123333 333333333333 Q ss_pred ChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHHHHHHHHHHhHHHHHHHHHHHHhhcc---cccccCcce Q lcl|NC_019456. 257 KFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHVTHSWTMTLMPIIRQYESQFNMKLFT---PGKRVKGFY 333 (435) Q Consensus 257 ~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~~~~~~~~i~P~~~~i~~~l~~~l~~---~~~~~~g~~ 333 (435) +..-++-... +....+|-.++|+..+++++.. ++|+++.-....|...|--+++.||+|+..+|+. +.+....++ T Consensus 329 K~~~LD~~K~-D~I~~DI~~A~GlS~~L~NG~~-GNYAts~LNLD~FYKrIGV~~E~IEqEvY~~L~~lvL~~~~~~NY~ 406 (506) T protein:vir:57 329 KADGLDGAKF-DHINSDIQSAYGLSGSLLNGDG-GNYATSSLNLDTFYKRIGVLMEDIEQEVYQKLFNLVLPAAQKDNYY 406 (506) T ss_pred cccCCCchhh-cccchhhhhhhccchheecCCC-cceeeeechHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCcee Confidence 3322322222 1234579999999999998755 4555554334455566777888999888888754 444444555 Q ss_pred eeechhhhhccCHHHHHHHHHHHHhcCCcCHHHHHH-HhCCCCCC-----------CcCCceeeecccccchhccccccc Q lcl|NC_019456. 334 FSFNVNGLLRGDTAARTQYYQTLTRNGIFKPNEIRE-LEGQAPIP-----------DEAADHLYISKDLYPLDKYYDAIL 401 (435) Q Consensus 334 i~fd~~~l~~~d~~~~~~~~~~~~~~g~~t~NE~R~-~~g~~p~~-----------~~~gd~~~~~~n~~~l~~~~~~~~ 401 (435) +.+|-|- .-..+++.+.+-++-..|| +.--+.. .+|..--. -..-++++.+.+...+ T Consensus 407 ~~Y~KD~--Pl~~~~K~D~LIKL~~~G~-S~K~V~Dnl~GvS~E~Y~E~tlYE~E~LKL~EKI~P~~~s~~~-------- 475 (506) T protein:vir:57 407 MNYDKDK--PLTLKEKMDILIKLNDKGW-SIKHVVDNLAGVSWESYLEQTLYETEELKLQEKIRPYQTSYTF-------- 475 (506) T ss_pred EeeCCCC--ccchhhhhchheeecccCc-cHHHHHHhhhccchHHHHHHHHHHHHHhhHHhhcCccccccee-------- Confidence 5565543 4455667777666666665 5444444 44432100 0000111111111100 Q ss_pred cccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 402 DNKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 402 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) ++.+...|. ++.+++ ++.-+++-+.|. T Consensus 476 -----tGN~vG~P~-~~~~~~-D~Tv~Satsngn 502 (506) T protein:vir:57 476 -----TGNEVGRPN-EGNKNN-DNTVKSATSNGN 502 (506) T ss_pred -----cccccCCCC-CCCCcc-cchhhhcccCCC Confidence 011111121 222211 111222222222 No 252 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=50.14 E-value=0.62 Score=21.73 Aligned_cols=385 Identities=12% Similarity=0.075 Sum_probs=127.5 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhccccccCcccccHHHHhhh---HHHHHHHHHHHHHHhhCceeeeecccc Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMAGVKLEQATFSREHILESN---EYIFSIVTRLSNVLASLPLHEYQNYKQ 77 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~v~~~i~~ia~~ia~~~~~~~~~~~~ 77 (435) +.+|+--.. .....-++...+...+. -.|+..... ...+... +...+- +..-++.+ T Consensus 56 ~~~~dst~~-~a~~~Laa~l~~~ltP~-~~WF~l~~~------d~~~~~~~~~~~~~~~---v~~~l~~v---------- 114 (535) T protein:vir:33 56 TTPWQAVGA-RGLNNLASKLMLALFPM-QSWMKLTIS------EYEAKQLVGDPDGLAK---VDEGLSMV---------- 114 (535) T ss_pred cccccccHH-HHHHHHHHHHHHhhcCC-CcccccccC------hHHHhccccCcchHHH---HHHHHHHH---------- Confidence 111110000 00000000000000000 001111000 0000000 000000 11111111 Q ss_pred cccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCce--------- Q lcl|NC_019456. 78 MDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNNS--------- 148 (435) Q Consensus 78 ~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~~--------- 148 (435) ++.++..+ .+-| .+.=+..++.+++.+|+++.++..+..++.....|||. ++-+..+..|.. T Consensus 115 --e~~~~~~~-~~sn----f~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~--~~~v~~d~~G~vd~i~r~~~~ 185 (535) T protein:vir:33 115 --ERIIMNYI-ESNS----YRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRLS--SYVVQRDAYGNVLQIVTRDQI 185 (535) T ss_pred --HHHHHHHH-HhcC----cHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEcC--eeEEeeCCCCCeeEEEeeEee Confidence 11222222 2333 44445666788889999988887654333334455554 333333332211 Q ss_pred ---------------------------EEE--E-EecCCeeEEE-----------------chhheEEeccCCCcccccc Q lcl|NC_019456. 149 ---------------------------YWY--R-VTSDIYNFTI-----------------PINDVIHVKHVVPSNSWYG 181 (435) Q Consensus 149 ---------------------------~~~--~-~~~~~~~~~~-----------------~~~~iih~~~~~~~~~~~G 181 (435) +|. . ...++....+ ..-=.+..|.....+..|| T Consensus 186 t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YG 265 (535) T protein:vir:33 186 AFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYG 265 (535) T ss_pred cHHHHHHHhhhhhcccccccccccCCeEEEEEEeeCCCCcEEEEEEEeCccccccccccccccCCceeeeeeecCCCccc Confidence 000 0 0011111111 0111233333334466899 Q ss_pred CcHHHHHHHHHHHHHHHHHHHHHHhhcCCc--eEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccc--cCCceeeeccCC Q lcl|NC_019456. 182 VSPIDVLSSSLKFQRSVENFSQNEMEKKDK--FVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQ--EAGWKVDRYESK 257 (435) Q Consensus 182 ~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~--~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl--~~g~~~~~~~~~ 257 (435) .||...+.-.+...+.+.+........... .++.-++........ .+..+.++. .+++...++... T Consensus 266 rgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~~----------~~~~g~~v~g~~~~v~~~~~~~~ 335 (535) T protein:vir:33 266 RSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLT----------KAQTGDFVPGRREDIDFLQLEKQ 335 (535) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcc----------cCCceeeecCCcccceeeecccc Confidence 999999998888888777766655444333 333333333333321 222222332 334455555444 Q ss_pred hhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHH-HH------------HHHHHHHHhHHHHHHHHHHHHh-hc Q lcl|NC_019456. 258 FEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVE-HV------------THSWTMTLMPIIRQYESQFNMK-LF 323 (435) Q Consensus 258 ~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e-~~------------~~~~~~~i~P~~~~i~~~l~~~-l~ 323 (435) +.-.-..+..+.....|-++|-+.. +........++.+= .. ..+...-+.|++.+....+.+. ++ T Consensus 336 ~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~l 414 (535) T protein:vir:33 336 ADFTVAKAVSDQIEARLSYAFMLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQI 414 (535) T ss_pred cchhHHHHHHHHHHHHHHHHHhhhh-cccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 3322344455566778888885432 11111111222210 11 1122344566666655555443 33 Q ss_pred ccccccCcceeeechhhhhc----cCHHHHHHHHHHHHhcC------CcCHHH----HHHHhCCCCCCCcCCceeeecc- Q lcl|NC_019456. 324 TPGKRVKGFYFSFNVNGLLR----GDTAARTQYYQTLTRNG------IFKPNE----IRELEGQAPIPDEAADHLYISK- 388 (435) Q Consensus 324 ~~~~~~~g~~i~fd~~~l~~----~d~~~~~~~~~~~~~~g------~~t~NE----~R~~~g~~p~~~~~gd~~~~~~- 388 (435) ++- ......++| .+.|.. .+......++..+-..+ .+..++ +.+.+|.|+. .++-+. T Consensus 415 P~~-p~~~v~~~y-is~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~------~i~~~~e 486 (535) T protein:vir:33 415 PEL-PKEAVEPTI-STGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTS------GILLTDE 486 (535) T ss_pred CCC-CccceeEEE-ecHHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHh------HhcCCHH Confidence 322 222334444 233321 12222233333222110 122222 2223344421 000000 Q ss_pred cccchhccc---cccccccccccccccccccCCCCCCCCCCCCCCCCCCC Q lcl|NC_019456. 389 DLYPLDKYY---DAILDNKIQTDASVAAPKQEGGENTNENGLQSTEPEGS 435 (435) Q Consensus 389 n~~~l~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (435) ....+-... ...........+.......++-+.. +.--....=+.| T Consensus 487 e~~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~-~~~~~~~g~~~~ 535 (535) T protein:vir:33 487 QKQALMMQDAAQTGVENAAAAGGAGVGALATSSPEAM-QGAAAKAGLNAT 535 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhcchhhcCChhH-HHHHHhccCCCC Confidence 000000000 0000000000000111111110000 000000000111 No 253 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=49.71 E-value=0.63 Score=21.68 Aligned_cols=401 Identities=13% Similarity=0.086 Sum_probs=137.0 Q ss_pred CchH-------HHHHhhccccccccccccccchhhhhhcccc--c-cCcccccH-HHHhhhHHHHHHHHHHHHHHhhC-- Q lcl|NC_019456. 1 MSFM-------SKVRQFFGVHDQANQIVQNPIPQPLDMAGVK--L-EQATFSRE-HILESNEYIFSIVTRLSNVLASL-- 67 (435) Q Consensus 1 Mg~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-~~~~~~~~-~~~~~~~~v~~~i~~ia~~ia~~-- 67 (435) |.=. +.++..+..-+...+.....+....++.-.. + ......+. .... .+.--.|++.+|..+-+. T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~-dst~~~a~~~Laa~l~~~lt 79 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPW-QAVGARGLNNLSAKVMLALF 79 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcccccccccc-cchHHHHHHHHHHHHHHhhc Confidence 3321 1111111100100011111111111111100 0 00000000 0011 223334666666555432 Q ss_pred c---eeeee-cccc-------------cc------cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCC Q lcl|NC_019456. 68 P---LHEYQ-NYKQ-------------MD------NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLST 124 (435) Q Consensus 68 ~---~~~~~-~~~~-------------~~------~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~ 124 (435) | |.-.. .+.. +. ++.++. .+.+-| .+.=+..++.+++.+|++..++..+... T Consensus 80 P~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~-~~~~sn----f~~~~~~~~~~L~~~G~a~ly~~~~~~~ 154 (543) T protein:vir:88 80 PLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMS-YMEANS----YRVTLFELIRQLALAGTALIYLPPPDAS 154 (543) T ss_pred CCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHH-HHHhcC----cHHHHHHHHHHHHhhCceeeeeccCccc Confidence 3 21111 1100 00 111222 222333 4455566788899999999887654322 Q ss_pred C---cEEEEEEeCCceeEEEEcCCCce-----------------------------------EEEEE--ecC-CeeE--- Q lcl|NC_019456. 125 G---EPIALWPLDPNTVSILRNTDNNS-----------------------------------YWYRV--TSD-IYNF--- 160 (435) Q Consensus 125 g---~~~~l~~l~~~~v~~~~~~~~~~-----------------------------------~~~~~--~~~-~~~~--- 160 (435) + .+...|||... -+..+..|.. ++..+ ..+ +... T Consensus 155 ~~~~~~~~~~pl~~y--~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~V~pr~~~~~~~~~~ 232 (543) T protein:vir:88 155 SNSYNPMKLYTLHNH--VVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTHIYIDDESGDFLSYQ 232 (543) T ss_pred cceecceEEeEcceE--EEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEEEEeecCCCcccccc Confidence 1 12334555332 2222222211 01101 111 1100 Q ss_pred -----EE---------chhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHH Q lcl|NC_019456. 161 -----TI---------PINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEK 224 (435) Q Consensus 161 -----~~---------~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~ 224 (435) .+ ..--.+..|.....+..||.||...+.-.+...+.+.+......... +..++.-++...... T Consensus 233 ~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~ 312 (543) T protein:vir:88 233 EIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRR 312 (543) T ss_pred cccCeeeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhh Confidence 01 01112223333334668999999999998888887777666554443 333433343443333 Q ss_pred HHHHHHHHHHHhcCCCccccc--cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHH-HHHHH Q lcl|NC_019456. 225 RQAMVNDFLRMVKENGGAVVQ--EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNV-EHVTH 301 (435) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~vl--~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~-e~~~~ 301 (435) .. .+..+.++. .+++...++...+.-.-..+..+.....|-++|-+.... .......++.+ ..... T Consensus 313 ~~----------~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~-~~~~~r~TAtEV~~r~~ 381 (543) T protein:vir:88 313 LV----------KAQTGDFVAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNSAV-QRSGERVTAEEIRYVAS 381 (543) T ss_pred cc----------cCCCceeecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhc-cCCCCcccHHHHHHHHH Confidence 21 222222332 244455555544333334455566677888888654211 11111122221 01111 Q ss_pred ------------HHHHHHhHHHHHHHHHHHHh-hcccccccCcceeeec--hhhhhcc-CHHHHHHHHHHHHhcC----- Q lcl|NC_019456. 302 ------------SWTMTLMPIIRQYESQFNMK-LFTPGKRVKGFYFSFN--VNGLLRG-DTAARTQYYQTLTRNG----- 360 (435) Q Consensus 302 ------------~~~~~i~P~~~~i~~~l~~~-l~~~~~~~~g~~i~fd--~~~l~~~-d~~~~~~~~~~~~~~g----- 360 (435) +...-+.|++.+.-..+.+. +|++-. ..+..+++- +..+.+. +......++...-..+ T Consensus 382 E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p-~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~p~vl 460 (543) T protein:vir:88 382 ELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLP-QEAVEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQLNGD 460 (543) T ss_pred HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCc-hhceeeeEEecHHHHHHHHHHHHHHHHHHHHHhccchhhh Confidence 11334455555544444332 333211 112234432 1222221 2222222222111110 Q ss_pred -CcCHHHHH----HHhCCCCCCCcCCceeeecccccchhccc--------ccccccccccccccccc-------ccCCCC Q lcl|NC_019456. 361 -IFKPNEIR----ELEGQAPIPDEAADHLYISKDLYPLDKYY--------DAILDNKIQTDASVAAP-------KQEGGE 420 (435) Q Consensus 361 -~~t~NE~R----~~~g~~p~~~~~gd~~~~~~n~~~l~~~~--------~~~~~~~~~~~~~~~~~-------~~~~~~ 420 (435) .+..+++- +.+|.+|. ..+..+.....+.... .....+.+.......++ ...|.. T Consensus 461 d~id~d~~~~~~a~~~Gv~~~-----~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 535 (543) T protein:vir:88 461 PDLNVNNIKLRLANAIGIDTA-----GLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAAQATASPEAMESAMDTAGVQ 535 (543) T ss_pred ccCCHHHHHHHHHHHhCCChh-----hhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhccChHHHHHHhhhcCCC Confidence 12333322 23355431 0011111111110000 00000011100000000 111111 Q ss_pred CCCCCCCCCCCCCCC Q lcl|NC_019456. 421 NTNENGLQSTEPEGS 435 (435) Q Consensus 421 ~~~~~~~~~~~~~~~ 435 (435) .+ +.++ T Consensus 536 ~~---------p~~~ 541 (543) T protein:vir:88 536 PG---------PIAT 541 (543) T ss_pred CC---------CCCC Confidence 11 1222 No 254 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=47.78 E-value=0.69 Score=21.47 Aligned_cols=383 Identities=13% Similarity=0.055 Sum_probs=118.7 Q ss_pred CchHHHHHhhccccccccccccccchhhhh------hccccccCccccc----HHH---HhhhHHHHHHHHHHHHHHhhC Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLD------MAGVKLEQATFSR----EHI---LESNEYIFSIVTRLSNVLASL 67 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~----~~~---~~~~~~v~~~i~~ia~~ia~~ 67 (435) |..=+. +.......+.+........ +++..+. ..|-. ... ....+..++-++. -++.+ T Consensus 45 ~~~~~~-----~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~---~L~~v 115 (535) T protein:vir:94 45 FPKDSD-----NASTDYTTPWQAVGARGLNNLASKLMLALFPM-QTWMKLTISEFEAKQLVAQPAELAKVEE---GLSMV 115 (535) T ss_pred CCCCCC-----ccccccCCcccccHHHHHHHHHHHHHhhhcCC-CCccccccChhhhhccccchhHHHHHHH---HHHHH Confidence 110000 0000000000000000000 0011111 01100 000 0000111111111 11111 Q ss_pred ceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCCc Q lcl|NC_019456. 68 PLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDNN 147 (435) Q Consensus 68 ~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~~ 147 (435) ++.++..+ .+-| .+.=+..++.+++.+|++..++..+...+.....|||.. +-+..+..|. T Consensus 116 ------------e~~~~~~~-~~sn----f~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~--y~v~~d~~G~ 176 (535) T protein:vir:94 116 ------------ERILMNYI-ESNS----YRVTLFETLKQLVVAGNALLYIPEPEGTYNPMKLYRLSS--YVVQRDAFGT 176 (535) T ss_pred ------------HHHHHHHH-HhcC----cHHHHHHHHHHHHhhCcEeEeeccCcCcccceEEEEcCe--EEEeeCCCCC Confidence 11122222 2333 334445567788888999888765543333334555532 2222222221 Q ss_pred eE-----------------------------------E-------------EEEecCCeeE-------EEchhheEEecc Q lcl|NC_019456. 148 SY-----------------------------------W-------------YRVTSDIYNF-------TIPINDVIHVKH 172 (435) Q Consensus 148 ~~-----------------------------------~-------------~~~~~~~~~~-------~~~~~~iih~~~ 172 (435) .. + +.+..+|... .|..--.+..|. T Consensus 177 vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~~~~~~~~~g~~~~P~~~~Rw 256 (535) T protein:vir:94 177 VLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHIYLDEESGEYLKYEEIDGVEVEGTDASYPVDACPYIPVRM 256 (535) T ss_pred eEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEEEEeeCCCCcEEEEEEecCeeeccccccCccccCCceeeee Confidence 10 0 0000111100 011112233333 Q ss_pred CCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcC--CceEEEeCCcCCHHHHHHHHHHHHHHhcCCCcccc--ccCC Q lcl|NC_019456. 173 VVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKK--DKFVLQYDRSISPEKRQAMVNDFLRMVKENGGAVV--QEAG 248 (435) Q Consensus 173 ~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~--~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~v--l~~g 248 (435) ....+..||.||..-+.-.+.....+.+......... +..++.-++.+...... ....+.++ ..++ T Consensus 257 ~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~~----------~~~~g~~v~g~~~~ 326 (535) T protein:vir:94 257 VRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQVRRLT----------KAQTGDFVSGRPED 326 (535) T ss_pred eecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccccchhhcc----------cCCCceeecCCccc Confidence 3334668999999999888888777766555433322 33333333333333221 11112222 2344 Q ss_pred ceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHH-HHHHH------------HHHHHHhHHHHHHH Q lcl|NC_019456. 249 WKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNV-EHVTH------------SWTMTLMPIIRQYE 315 (435) Q Consensus 249 ~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~-e~~~~------------~~~~~i~P~~~~i~ 315 (435) +.+.+++..+.-.-..+..+..+..|.++|-+.. +........++.+ .+... +...-+.|++.+.- T Consensus 327 v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~ 405 (535) T protein:vir:94 327 ISFLQLEKAADFSVARAVSEQIEGRLSYAFMLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLL 405 (535) T ss_pred ceeeecccccchhHHHHHHHHHHHHHHHHHhHhh-hccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 4555555443322334445556778888884321 1111111112221 01111 12344556655544 Q ss_pred HHHHHh-hcccccccCcceeeechhhhhc----cCHHHHHHHHHHHHhcC------CcCHHH----HHHHhCCCCCCCcC Q lcl|NC_019456. 316 SQFNMK-LFTPGKRVKGFYFSFNVNGLLR----GDTAARTQYYQTLTRNG------IFKPNE----IRELEGQAPIPDEA 380 (435) Q Consensus 316 ~~l~~~-l~~~~~~~~g~~i~fd~~~l~~----~d~~~~~~~~~~~~~~g------~~t~NE----~R~~~g~~p~~~~~ 380 (435) ..+.+. +|++- ......+++ +..|-. .+......++..+-+.+ .+..++ +.+.+|.|+. T Consensus 406 ~il~r~g~lP~~-p~~~v~~~~-vs~la~l~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~---- 479 (535) T protein:vir:94 406 KQLQATNQIPEL-PKEAVEPTI-STGMEALGRGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTS---- 479 (535) T ss_pred HHHHhCCCCCCC-ChhhccceE-eehHHHHHHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChh---- Confidence 444333 33321 111122333 222221 22333333333222211 112222 2222333321 Q ss_pred Cceeeecccccchhccccccc------ccccccccc--ccccccCCCCCCCCCCCCCC Q lcl|NC_019456. 381 ADHLYISKDLYPLDKYYDAIL------DNKIQTDAS--VAAPKQEGGENTNENGLQST 430 (435) Q Consensus 381 gd~~~~~~n~~~l~~~~~~~~------~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 430 (435) ..+..+.....+-.....+. ...+...+. ..++... ....+.-|-.|- T Consensus 480 -~i~rs~eev~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~-~~~~~~~g~~~~ 535 (535) T protein:vir:94 480 -GILKTPEEKQQEMAEAAQGTAMQNAAASAGAGAGTMATASPENM-KAAAAQAGMAPN 535 (535) T ss_pred -hhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccChHHH-HHHHHHhccCCC Confidence 00000000000000000000 000000000 0000000 000000011111 No 255 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=41.40 E-value=0.93 Score=20.76 Aligned_cols=381 Identities=10% Similarity=0.064 Sum_probs=116.9 Q ss_pred Cc-hHHHHHhhccccccccccccccchhhhh------hccccccCccccc----H---HHHhhhHHHHHHHHHHHHHHhh Q lcl|NC_019456. 1 MS-FMSKVRQFFGVHDQANQIVQNPIPQPLD------MAGVKLEQATFSR----E---HILESNEYIFSIVTRLSNVLAS 66 (435) Q Consensus 1 Mg-~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~----~---~~~~~~~~v~~~i~~ia~~ia~ 66 (435) +. +|.. - +.......+.+........ .++..+....|-. . ......+...+-++. -++. T Consensus 41 lP~~~~~---~-~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~---~l~~ 113 (515) T protein:vir:70 41 LPYLMNN---K-GDNETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLAT---IFAR 113 (515) T ss_pred cccccCC---C-CCcccccccccchHHHHHHHHHHHHHHhhcCCCCcccccccChhhhhccccchhHHHHHHH---HHHH Confidence 00 0000 0 0000000000000000000 0011110011100 0 000011111111111 1111 Q ss_pred CceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEEcCCC Q lcl|NC_019456. 67 LPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILRNTDN 146 (435) Q Consensus 67 ~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~~~~~ 146 (435) +. +.++..+ .+-| .+.=+..++.+++.+|++.+++.. .++ ...|||... -+..+..| T Consensus 114 ve------------~~~~~~l-~~sn----f~~~~~~~~~~L~~~G~a~l~~d~--~~~--~~~~pl~~y--~v~~d~~G 170 (515) T protein:vir:70 114 VE------------TTAMKAL-EQRQ----FRPAIVEVFKHLIVAGNCLLYKPS--KGA--MSAVPMHHY--VVNRDTNG 170 (515) T ss_pred HH------------HHHHHHH-HhcC----chHHHHHHHHHHHhHCeEEEEEeC--CCC--eEEEEcCeE--EEeeCCCc Confidence 10 1111122 1222 334445566677788888776532 222 345555332 22222222 Q ss_pred ce---------------------------------------------------EEEEEecCCeeE----EE--chhheEE Q lcl|NC_019456. 147 NS---------------------------------------------------YWYRVTSDIYNF----TI--PINDVIH 169 (435) Q Consensus 147 ~~---------------------------------------------------~~~~~~~~~~~~----~~--~~~~iih 169 (435) .. .+++. .++... .+ ..-=.+- T Consensus 171 ~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~~~e-~d~~~~~~es~y~~~e~P~~~ 249 (515) T protein:vir:70 171 DLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQS-ADDIPVGKESRIKSEKLPFIP 249 (515) T ss_pred CeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCCCceEEEEe-cCceeeccccccccccCCcee Confidence 11 11111 111100 00 0011122 Q ss_pred eccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCc--eEEEeCCcCCHHHHHHHHHHHHHHhcCCCccccccC Q lcl|NC_019456. 170 VKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDK--FVLQYDRSISPEKRQAMVNDFLRMVKENGGAVVQEA 247 (435) Q Consensus 170 ~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~--~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~vl~~ 247 (435) .|.....+..||.||..-+.-.+...+.+.+........... .++.-++........ .+..+.++-+. T Consensus 250 ~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l~----------~~~~g~iv~g~ 319 (515) T protein:vir:70 250 LTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHFV----------NSGTGEVITGV 319 (515) T ss_pred eeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhcc----------ccCCceeecCC Confidence 222223456899999999999998888777766655444333 333333333333221 11112223332 Q ss_pred CceeeeccCC-hhhHH-HHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHH-HHHHHHHHHHHhHHHHHHHHHHHHhh-- Q lcl|NC_019456. 248 GWKVDRYESK-FEPAD-LSSVEQISRIRIATAFNVPISFLNDDQAKSTTNV-EHVTHSWTMTLMPIIRQYESQFNMKL-- 322 (435) Q Consensus 248 g~~~~~~~~~-~~~~~-~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~-e~~~~~~~~~i~P~~~~i~~~l~~~l-- 322 (435) .-.+.++... ..+.+ ..+..+.....|.++|-+........+.. ++++ .....-....|-|.+..+.+||-.-| T Consensus 320 ~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rv-TAtEV~~r~~E~~~~LGpv~srL~~Ell~Pli~ 398 (515) T protein:vir:70 320 AEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERV-TAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAM 398 (515) T ss_pred cccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhhccCCccc-cHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH Confidence 2344444322 22233 33445556778888887654333333222 2221 11112223445555555555544433 Q ss_pred ------ccc--ccccCcceeeechhhhhcc-CHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCcCCceeeecccccc- Q lcl|NC_019456. 323 ------FTP--GKRVKGFYFSFNVNGLLRG-DTAARTQYYQTLTRNGIFKPNEIRELEGQAPIPDEAADHLYISKDLYP- 392 (435) Q Consensus 323 ------~~~--~~~~~g~~i~fd~~~l~~~-d~~~~~~~~~~~~~~g~~t~NE~R~~~g~~p~~~~~gd~~~~~~n~~~- 392 (435) +++ .+...-.++.+ +..+.+. +......++. .+..-..-+-++...++.+..=+..++..-+|.+++. T Consensus 399 r~~~~~~p~~P~~~v~~~~vs~-l~~L~r~q~~~~i~~~~q-~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~rs 476 (515) T protein:vir:70 399 WGLQEAGDSFTSELVDPVIVTG-IEALGRMAELDKLANFAQ-YMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKS 476 (515) T ss_pred HHHHhhCCCCChhhcccceehh-HHHHHHHHHHHHHHHHHH-HHHHHhccChhHHhhCCHHHHHHHHHHHhCCCccccCC Confidence 221 11111111111 1222111 1111212222 2211011112233333222111111111222222211 Q ss_pred ---hhccccc---cccccccccccccccccCCCCCCCCC Q lcl|NC_019456. 393 ---LDKYYDA---ILDNKIQTDASVAAPKQEGGENTNEN 425 (435) Q Consensus 393 ---l~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (435) +...-+. +........+.......-.++...+. T Consensus 477 ~eev~~~r~q~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) T protein:vir:70 477 EEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhcccchhhhhccC Confidence 0000000 00000000000000001111111111 No 256 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=33.19 E-value=1.4 Score=19.83 Aligned_cols=392 Identities=14% Similarity=0.114 Sum_probs=131.6 Q ss_pred CchHHHHHhhccccccccccccccchhhhhhc--------cccccCcccccHHHHhhhHHHHHHHHHHHHHHhhC----- Q lcl|NC_019456. 1 MSFMSKVRQFFGVHDQANQIVQNPIPQPLDMA--------GVKLEQATFSREHILESNEYIFSIVTRLSNVLASL----- 67 (435) Q Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~----- 67 (435) |.+-.++....... +.....+.....+. +........ ..... .++--.|++.+|..+... T Consensus 1 m~~~~r~~~L~~~R----~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~--~~~~~-dstg~~a~~~LAa~l~~~ltpp~ 73 (522) T protein:vir:10 1 MKARERYNQLTTAR----QMFLDKAVECSELTLPYLIDDDISSRPNHKS--LTVPW-QSVGAKCCVTLAAKLMLAVLPPQ 73 (522) T ss_pred CchHHHHHHHHHHh----hHHHHHHHHHHHHhhhcccCCCCCCCccccc--ccccc-cchHHHHHHHHHHHHHHhhcCCC Confidence 88866654432111 11111122222111 100000000 00111 123334566666555432 Q ss_pred -ce-eeeecc----cccc--------------cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcE Q lcl|NC_019456. 68 -PL-HEYQNY----KQMD--------------NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEP 127 (435) Q Consensus 68 -~~-~~~~~~----~~~~--------------~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~ 127 (435) || ++.-.+ +... ++.++.. +.+- +.+.=+..++.+++.+|++.+++..+. T Consensus 74 ~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~-l~~s----nf~~~~~~~~~~L~~~G~a~ly~~~~~----- 143 (522) T protein:vir:10 74 TSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDY-IAAS----NDRVAVHQALKHLIVGGNALIFMGKDG----- 143 (522) T ss_pred CccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHH-HHhc----CcHHHHHHHHHHHHhHCceeEEEcCCC----- Confidence 23 221111 1100 1111222 2233 355556777889999999988765432 Q ss_pred EEEEEeCCceeEEEEcCCCceE--------------------------------------EEEE--ecC-CeeEEE--ch Q lcl|NC_019456. 128 IALWPLDPNTVSILRNTDNNSY--------------------------------------WYRV--TSD-IYNFTI--PI 164 (435) Q Consensus 128 ~~l~~l~~~~v~~~~~~~~~~~--------------------------------------~~~~--~~~-~~~~~~--~~ 164 (435) ...|||.. +-+..+..|... +..+ ..+ +....+ .. T Consensus 144 ~~~~pl~~--y~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~ 221 (522) T protein:vir:10 144 LKTFPLTR--YVINRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSSGRWVWHQEAF 221 (522) T ss_pred ceEEEcce--EEEeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccCCceEEEEccC Confidence 12344432 222222222111 0000 000 000000 01 Q ss_pred hh---------------eEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCc--eEEEeCCcCCHHHHHH Q lcl|NC_019456. 165 ND---------------VIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDK--FVLQYDRSISPEKRQA 227 (435) Q Consensus 165 ~~---------------iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~--~~~~~~~~~~~e~~~~ 227 (435) +. .+-.|.....+..||.||...+.-.+.....+.+........... .++.-++........ T Consensus 222 ~~~~~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~l~- 300 (522) T protein:vir:10 222 DKIIPDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPATIA- 300 (522) T ss_pred CccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeecccccccccccc- Confidence 11 122222223456899999999999998888777766655444333 333333333332211 Q ss_pred HHHHHHHHhcCCCccccccC--CceeeeccCChhhHH-HHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHHHH----- Q lcl|NC_019456. 228 MVNDFLRMVKENGGAVVQEA--GWKVDRYESKFEPAD-LSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVEHV----- 299 (435) Q Consensus 228 ~~~~~~~~~~~~~~~~vl~~--g~~~~~~~~~~~~~~-~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e~~----- 299 (435) .+..+.++.+. ++...+++.. .+++ ..+..+..+..|..+|- +....++..-++++. T Consensus 301 ---------~~~~~~~v~g~~~~v~~~~~~~~-~d~~~~~~~i~~~~~ri~~aFl-----~~~~~d~~rvTAtEV~~r~~ 365 (522) T protein:vir:10 301 ---------KAGNGAIVQGRPEDVAVIQVGKT-ADFSTAANMATAIEKRLLEAFL-----VMNVRNAERVTAEEVRLTQL 365 (522) T ss_pred ---------CCCCcceecCCCccceeeccccc-ccchHHHHHHHHHHHHHHHHHh-----hccCCCCCCCCHHHHHHHHH Confidence 22223333332 2333333322 2333 33444556677888873 222222211122111 Q ss_pred ----------HHHHHHHHhHHHHHHHHHHHHh-hccccc--ccCcceeeechhhhhc-cCHHHHHHHHHHHHhc------ Q lcl|NC_019456. 300 ----------THSWTMTLMPIIRQYESQFNMK-LFTPGK--RVKGFYFSFNVNGLLR-GDTAARTQYYQTLTRN------ 359 (435) Q Consensus 300 ----------~~~~~~~i~P~~~~i~~~l~~~-l~~~~~--~~~g~~i~fd~~~l~~-~d~~~~~~~~~~~~~~------ 359 (435) ..+...-+.|++.+....+.+. +|++-- ......+. -++.|-+ .+......++..+-+. T Consensus 366 E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~~v~-~is~Laraq~~~~l~~~~~~i~~~~~p~~~ 444 (522) T protein:vir:10 366 ELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQIPKLPKDIVRPTIVA-GVNALGRGQDRESLTAFVGTIAQTLGPEAL 444 (522) T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcccccccccc-chhHHHHHHHHHHHHHHHHHHHHhhCchhh Confidence 1122344566666655555544 333221 11111111 1222221 1122222222222111 Q ss_pred -CCcCHHH----HHHHhCCCCCCCcCCceeeecccccchhccccccccc---cccccccccccccCCCCCCC-CCCCCCC Q lcl|NC_019456. 360 -GIFKPNE----IRELEGQAPIPDEAADHLYISKDLYPLDKYYDAILDN---KIQTDASVAAPKQEGGENTN-ENGLQST 430 (435) Q Consensus 360 -g~~t~NE----~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 430 (435) -.+..++ +.+.+|.|+. ..+..+..+..+.......... .....+-...+-..+..+.+ -+..++. T Consensus 445 ~~~id~d~~~~~~a~~~Gvp~~-----~ivrt~eev~~~~q~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 519 (522) T protein:vir:10 445 MQYLNPLEAIKRLAAAQGIDVL-----NLVKTEQQLAEEQQAAQQQAAQQSLVDQAGQMTGSPLMDPTKNPQLMDEEQPP 519 (522) T ss_pred hhcCCHHHHHHHHHHHhCCChh-----hhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCccccHHHHHHhCCC Confidence 0122222 2222343321 0000000000000000000000 00000000000000111110 0001111 Q ss_pred CCC Q lcl|NC_019456. 431 EPE 433 (435) Q Consensus 431 ~~~ 433 (435) ..| T Consensus 520 ~~~ 522 (522) T protein:vir:10 520 MEE 522 (522) T ss_pred CCC Confidence 111 No 257 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=26.11 E-value=2 Score=18.96 Aligned_cols=410 Identities=10% Similarity=0.064 Sum_probs=137.7 Q ss_pred Cch---HHHHHhhccccccccccccccchhhhhhcccc--------ccCcccccHHHHhhhHHHHHHHHHHHHHHhhC-- Q lcl|NC_019456. 1 MSF---MSKVRQFFGVHDQANQIVQNPIPQPLDMAGVK--------LEQATFSREHILESNEYIFSIVTRLSNVLASL-- 67 (435) Q Consensus 1 Mg~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~-- 67 (435) |.- .+.+++.+..-+.........+.+..++.-.. ......... .. -.+.--.|++.+|..+... T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~-~~-~dst~~~a~~~LAa~L~~~lt 78 (555) T protein:vir:98 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHN-NI-LDNTGTRALRVLAAGMMAGMT 78 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhccc-cc-ccccHHHHHHHHHHHHHHhhc Confidence 321 11122111111111111111122222211000 000000000 01 1233335666666655432 Q ss_pred c----e-eeeecccccc------------cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEE Q lcl|NC_019456. 68 P----L-HEYQNYKQMD------------NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIAL 130 (435) Q Consensus 68 ~----~-~~~~~~~~~~------------~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l 130 (435) | | ++.-.+.... ++.+... +.+- +.+.-+..++.+++.+|++.+++..+.. ....+ T Consensus 79 pp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~-l~~s----nf~~~~~~~~~~Lv~~G~a~l~~~~d~~--~~~rf 151 (555) T protein:vir:98 79 SPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMI-FAKS----NTYRALHSMYEELGAFGTASSIVLPDFD--AVVYH 151 (555) T ss_pred CCCCcccccccCcccccchHHHHHHHHHHHHHHHHH-HHhc----CcHHHHHHHHHHHHhhCceEEEEecCCC--ceEEE Confidence 2 3 2221111111 1112222 2233 3445556778899999999998876643 34445 Q ss_pred EEeCCceeEEEEcCCCceEEEEEe--------------------------cCC-e-eE---------------------- Q lcl|NC_019456. 131 WPLDPNTVSILRNTDNNSYWYRVT--------------------------SDI-Y-NF---------------------- 160 (435) Q Consensus 131 ~~l~~~~v~~~~~~~~~~~~~~~~--------------------------~~~-~-~~---------------------- 160 (435) .+++..++-+..+..|..-.++.. .+. . .. T Consensus 152 ~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~ 231 (555) T protein:vir:98 152 HSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNM 231 (555) T ss_pred EEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCcccc Confidence 555555555555554432111100 000 0 00 Q ss_pred -----E---------------EchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcC Q lcl|NC_019456. 161 -----T---------------IPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSI 220 (435) Q Consensus 161 -----~---------------~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~ 220 (435) . |..-=.+..|.....+..||.||...+.-.+.....+.+.............+.++... T Consensus 232 p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~ 311 (555) T protein:vir:98 232 AWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSA 311 (555) T ss_pred ceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccc Confidence 0 00011122222222456899999999999888887776665544443333333333222 Q ss_pred CHHHHHHHHHHHHHHhcCCCccc-ccc--CCceeeec-cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccC--ccc Q lcl|NC_019456. 221 SPEKRQAMVNDFLRMVKENGGAV-VQE--AGWKVDRY-ESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAK--STT 294 (435) Q Consensus 221 ~~e~~~~~~~~~~~~~~~~~~~~-vl~--~g~~~~~~-~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~--~~~ 294 (435) ..... . -..|++. +.. .+-.+.++ +..+.-....+..+.....|-++|-+...+.....+. -++ T Consensus 312 ~~~~~--------~--~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TA 381 (555) T protein:vir:98 312 KNQDI--------S--TVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTA 381 (555) T ss_pred ccccc--------e--eccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccH Confidence 11001 0 1122221 111 11122222 2222222234445667788999997664443222222 122 Q ss_pred HH-HHH------------HHHHHHHHhHHHHHHHHHHHHh-hcccc-cccCcceeeechhhhh-cc----CHHHH---HH Q lcl|NC_019456. 295 NV-EHV------------THSWTMTLMPIIRQYESQFNMK-LFTPG-KRVKGFYFSFNVNGLL-RG----DTAAR---TQ 351 (435) Q Consensus 295 ~~-e~~------------~~~~~~~i~P~~~~i~~~l~~~-l~~~~-~~~~g~~i~fd~~~l~-~~----d~~~~---~~ 351 (435) .+ .+. ..+...-+.|++.+.-..+.+. ++++- ....+..+...+...+ +. +.... ++ T Consensus 382 tEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~ 461 (555) T protein:vir:98 382 TEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVG 461 (555) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHH Confidence 21 011 1112344566665555555554 23321 1122333443333322 21 11111 11 Q ss_pred HHHHHHhcC-----CcCHHH----HHHHhCCCCCCCcCCceeeecccccchhcc-cccccccccc-cccc--ccccccCC Q lcl|NC_019456. 352 YYQTLTRNG-----IFKPNE----IRELEGQAPIPDEAADHLYISKDLYPLDKY-YDAILDNKIQ-TDAS--VAAPKQEG 418 (435) Q Consensus 352 ~~~~~~~~g-----~~t~NE----~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~-~~~~~~~~~~-~~~~--~~~~~~~~ 418 (435) .+..+.+.+ .+..++ +.+.+|.|+- .+...-....+-.. .+.+...... ..++ .......+ T Consensus 462 ~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~------~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~ 535 (555) T protein:vir:98 462 NLGAVAGIKPEVLDKFDADRWADTYADMLGIDPE------LIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGS 535 (555) T ss_pred HHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCcc------ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 111111110 122222 2233343320 10000000000000 0000000000 0000 00011111 Q ss_pred CCCCCCCCCCCCCCCCC Q lcl|NC_019456. 419 GENTNENGLQSTEPEGS 435 (435) Q Consensus 419 ~~~~~~~~~~~~~~~~~ 435 (435) .+.+..++.+..-.--+ T Consensus 536 ~~~~~~~~~~~~~~~~~ 552 (555) T protein:vir:98 536 VDTSKQNALTDVTRAFS 552 (555) T ss_pred cccCcchhHHHHHhhhc Confidence 11111111100000000 No 258 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=26.11 E-value=2 Score=18.96 Aligned_cols=410 Identities=10% Similarity=0.064 Sum_probs=137.7 Q ss_pred Cch---HHHHHhhccccccccccccccchhhhhhcccc--------ccCcccccHHHHhhhHHHHHHHHHHHHHHhhC-- Q lcl|NC_019456. 1 MSF---MSKVRQFFGVHDQANQIVQNPIPQPLDMAGVK--------LEQATFSREHILESNEYIFSIVTRLSNVLASL-- 67 (435) Q Consensus 1 Mg~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~-- 67 (435) |.- .+.+++.+..-+.........+.+..++.-.. ......... .. -.+.--.|++.+|..+... T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~-~~-~dst~~~a~~~LAa~L~~~lt 78 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHN-NI-LDNTGTRALRVLAAGMMAGMT 78 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhccc-cc-ccccHHHHHHHHHHHHHHhhc Confidence 321 11122111111111111111122222211000 000000000 01 1233335666666655432 Q ss_pred c----e-eeeecccccc------------cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEE Q lcl|NC_019456. 68 P----L-HEYQNYKQMD------------NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIAL 130 (435) Q Consensus 68 ~----~-~~~~~~~~~~------------~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l 130 (435) | | ++.-.+.... ++.+... +.+- +.+.-+..++.+++.+|++.+++..+.. ....+ T Consensus 79 pp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~-l~~s----nf~~~~~~~~~~Lv~~G~a~l~~~~d~~--~~~rf 151 (555) T protein:vir:10 79 SPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMI-FAKS----NTYRALHSMYEELGAFGTASSIVLPDFD--AVVYH 151 (555) T ss_pred CCCCcccccccCcccccchHHHHHHHHHHHHHHHHH-HHhc----CcHHHHHHHHHHHHhhCceEEEEecCCC--ceEEE Confidence 2 3 2221111111 1112222 2233 3445556778899999999998876643 34445 Q ss_pred EEeCCceeEEEEcCCCceEEEEEe--------------------------cCC-e-eE---------------------- Q lcl|NC_019456. 131 WPLDPNTVSILRNTDNNSYWYRVT--------------------------SDI-Y-NF---------------------- 160 (435) Q Consensus 131 ~~l~~~~v~~~~~~~~~~~~~~~~--------------------------~~~-~-~~---------------------- 160 (435) .+++..++-+..+..|..-.++.. .+. . .. T Consensus 152 ~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~ 231 (555) T protein:vir:10 152 HSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNM 231 (555) T ss_pred EEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCcccc Confidence 555555555555554432111100 000 0 00 Q ss_pred -----E---------------EchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcC Q lcl|NC_019456. 161 -----T---------------IPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSI 220 (435) Q Consensus 161 -----~---------------~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~ 220 (435) . |..-=.+..|.....+..||.||...+.-.+.....+.+.............+.++... T Consensus 232 p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~ 311 (555) T protein:vir:10 232 AWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSA 311 (555) T ss_pred ceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccc Confidence 0 00011122222222456899999999999888887776665544443333333333222 Q ss_pred CHHHHHHHHHHHHHHhcCCCccc-ccc--CCceeeec-cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccC--ccc Q lcl|NC_019456. 221 SPEKRQAMVNDFLRMVKENGGAV-VQE--AGWKVDRY-ESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAK--STT 294 (435) Q Consensus 221 ~~e~~~~~~~~~~~~~~~~~~~~-vl~--~g~~~~~~-~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~--~~~ 294 (435) ..... . -..|++. +.. .+-.+.++ +..+.-....+..+.....|-++|-+...+.....+. -++ T Consensus 312 ~~~~~--------~--~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TA 381 (555) T protein:vir:10 312 KNQDI--------S--TVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTA 381 (555) T ss_pred ccccc--------e--eccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccH Confidence 11001 0 1122221 111 11122222 2222222234445667788999997664443222222 122 Q ss_pred HH-HHH------------HHHHHHHHhHHHHHHHHHHHHh-hcccc-cccCcceeeechhhhh-cc----CHHHH---HH Q lcl|NC_019456. 295 NV-EHV------------THSWTMTLMPIIRQYESQFNMK-LFTPG-KRVKGFYFSFNVNGLL-RG----DTAAR---TQ 351 (435) Q Consensus 295 ~~-e~~------------~~~~~~~i~P~~~~i~~~l~~~-l~~~~-~~~~g~~i~fd~~~l~-~~----d~~~~---~~ 351 (435) .+ .+. ..+...-+.|++.+.-..+.+. ++++- ....+..+...+...+ +. +.... ++ T Consensus 382 tEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~ 461 (555) T protein:vir:10 382 TEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVG 461 (555) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHH Confidence 21 011 1112344566665555555554 23321 1122333443333322 21 11111 11 Q ss_pred HHHHHHhcC-----CcCHHH----HHHHhCCCCCCCcCCceeeecccccchhcc-cccccccccc-cccc--ccccccCC Q lcl|NC_019456. 352 YYQTLTRNG-----IFKPNE----IRELEGQAPIPDEAADHLYISKDLYPLDKY-YDAILDNKIQ-TDAS--VAAPKQEG 418 (435) Q Consensus 352 ~~~~~~~~g-----~~t~NE----~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~-~~~~~~~~~~-~~~~--~~~~~~~~ 418 (435) .+..+.+.+ .+..++ +.+.+|.|+- .+...-....+-.. .+.+...... ..++ .......+ T Consensus 462 ~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~------~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~ 535 (555) T protein:vir:10 462 NLGAVAGIKPEVLDKFDADRWADTYADMLGIDPE------LIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGS 535 (555) T ss_pred HHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCcc------ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 111111110 122222 2233343320 10000000000000 0000000000 0000 00011111 Q ss_pred CCCCCCCCCCCCCCCCC Q lcl|NC_019456. 419 GENTNENGLQSTEPEGS 435 (435) Q Consensus 419 ~~~~~~~~~~~~~~~~~ 435 (435) .+.+..++.+..-.--+ T Consensus 536 ~~~~~~~~~~~~~~~~~ 552 (555) T protein:vir:10 536 VDTSKQNALTDVTRAFS 552 (555) T ss_pred cccCcchhHHHHHhhhc Confidence 11111111100000000 No 259 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=26.11 E-value=2 Score=18.96 Aligned_cols=410 Identities=10% Similarity=0.064 Sum_probs=137.7 Q ss_pred Cch---HHHHHhhccccccccccccccchhhhhhcccc--------ccCcccccHHHHhhhHHHHHHHHHHHHHHhhC-- Q lcl|NC_019456. 1 MSF---MSKVRQFFGVHDQANQIVQNPIPQPLDMAGVK--------LEQATFSREHILESNEYIFSIVTRLSNVLASL-- 67 (435) Q Consensus 1 Mg~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~-- 67 (435) |.- .+.+++.+..-+.........+.+..++.-.. ......... .. -.+.--.|++.+|..+... T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~-~~-~dst~~~a~~~LAa~L~~~lt 78 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHN-NI-LDNTGTRALRVLAAGMMAGMT 78 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhccc-cc-ccccHHHHHHHHHHHHHHhhc Confidence 321 11122111111111111111122222211000 000000000 01 1233335666666655432 Q ss_pred c----e-eeeecccccc------------cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEE Q lcl|NC_019456. 68 P----L-HEYQNYKQMD------------NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIAL 130 (435) Q Consensus 68 ~----~-~~~~~~~~~~------------~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l 130 (435) | | ++.-.+.... ++.+... +.+- +.+.-+..++.+++.+|++.+++..+.. ....+ T Consensus 79 pp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~-l~~s----nf~~~~~~~~~~Lv~~G~a~l~~~~d~~--~~~rf 151 (555) T protein:vir:10 79 SPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMI-FAKS----NTYRALHSMYEELGAFGTASSIVLPDFD--AVVYH 151 (555) T ss_pred CCCCcccccccCcccccchHHHHHHHHHHHHHHHHH-HHhc----CcHHHHHHHHHHHHhhCceEEEEecCCC--ceEEE Confidence 2 3 2221111111 1112222 2233 3445556778899999999998876643 34445 Q ss_pred EEeCCceeEEEEcCCCceEEEEEe--------------------------cCC-e-eE---------------------- Q lcl|NC_019456. 131 WPLDPNTVSILRNTDNNSYWYRVT--------------------------SDI-Y-NF---------------------- 160 (435) Q Consensus 131 ~~l~~~~v~~~~~~~~~~~~~~~~--------------------------~~~-~-~~---------------------- 160 (435) .+++..++-+..+..|..-.++.. .+. . .. T Consensus 152 ~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~ 231 (555) T protein:vir:10 152 HSLTAGEYAIAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNM 231 (555) T ss_pred EEeecceeEEeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCcccc Confidence 555555555555554432111100 000 0 00 Q ss_pred -----E---------------EchhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCceEEEeCCcC Q lcl|NC_019456. 161 -----T---------------IPINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDKFVLQYDRSI 220 (435) Q Consensus 161 -----~---------------~~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~~~~~~~~~~ 220 (435) . |..-=.+..|.....+..||.||...+.-.+.....+.+.............+.++... T Consensus 232 p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~ 311 (555) T protein:vir:10 232 AWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSA 311 (555) T ss_pred ceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccc Confidence 0 00011122222222456899999999999888887776665544443333333333222 Q ss_pred CHHHHHHHHHHHHHHhcCCCccc-ccc--CCceeeec-cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccC--ccc Q lcl|NC_019456. 221 SPEKRQAMVNDFLRMVKENGGAV-VQE--AGWKVDRY-ESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAK--STT 294 (435) Q Consensus 221 ~~e~~~~~~~~~~~~~~~~~~~~-vl~--~g~~~~~~-~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~--~~~ 294 (435) ..... . -..|++. +.. .+-.+.++ +..+.-....+..+.....|-++|-+...+.....+. -++ T Consensus 312 ~~~~~--------~--~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TA 381 (555) T protein:vir:10 312 KNQDI--------S--TVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTA 381 (555) T ss_pred ccccc--------e--eccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccH Confidence 11001 0 1122221 111 11122222 2222222234445667788999997664443222222 122 Q ss_pred HH-HHH------------HHHHHHHHhHHHHHHHHHHHHh-hcccc-cccCcceeeechhhhh-cc----CHHHH---HH Q lcl|NC_019456. 295 NV-EHV------------THSWTMTLMPIIRQYESQFNMK-LFTPG-KRVKGFYFSFNVNGLL-RG----DTAAR---TQ 351 (435) Q Consensus 295 ~~-e~~------------~~~~~~~i~P~~~~i~~~l~~~-l~~~~-~~~~g~~i~fd~~~l~-~~----d~~~~---~~ 351 (435) .+ .+. ..+...-+.|++.+.-..+.+. ++++- ....+..+...+...+ +. +.... ++ T Consensus 382 tEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~ 461 (555) T protein:vir:10 382 TEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVG 461 (555) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHH Confidence 21 011 1112344566665555555554 23321 1122333443333322 21 11111 11 Q ss_pred HHHHHHhcC-----CcCHHH----HHHHhCCCCCCCcCCceeeecccccchhcc-cccccccccc-cccc--ccccccCC Q lcl|NC_019456. 352 YYQTLTRNG-----IFKPNE----IRELEGQAPIPDEAADHLYISKDLYPLDKY-YDAILDNKIQ-TDAS--VAAPKQEG 418 (435) Q Consensus 352 ~~~~~~~~g-----~~t~NE----~R~~~g~~p~~~~~gd~~~~~~n~~~l~~~-~~~~~~~~~~-~~~~--~~~~~~~~ 418 (435) .+..+.+.+ .+..++ +.+.+|.|+- .+...-....+-.. .+.+...... ..++ .......+ T Consensus 462 ~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~------~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~ 535 (555) T protein:vir:10 462 NLGAVAGIKPEVLDKFDADRWADTYADMLGIDPE------LIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGS 535 (555) T ss_pred HHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCcc------ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 111111110 122222 2233343320 10000000000000 0000000000 0000 00011111 Q ss_pred CCCCCCCCCCCCCCCCC Q lcl|NC_019456. 419 GENTNENGLQSTEPEGS 435 (435) Q Consensus 419 ~~~~~~~~~~~~~~~~~ 435 (435) .+.+..++.+..-.--+ T Consensus 536 ~~~~~~~~~~~~~~~~~ 552 (555) T protein:vir:10 536 VDTSKQNALTDVTRAFS 552 (555) T ss_pred cccCcchhHHHHHhhhc Confidence 11111111100000000 No 260 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=21.71 E-value=2.6 Score=18.35 Aligned_cols=399 Identities=11% Similarity=0.051 Sum_probs=126.0 Q ss_pred CchH-----HHHHhhccccccccccccccchhhhhhccccc--cC-cccccHHHHhhhHHHHHHHHHHHHHHhhC----- Q lcl|NC_019456. 1 MSFM-----SKVRQFFGVHDQANQIVQNPIPQPLDMAGVKL--EQ-ATFSREHILESNEYIFSIVTRLSNVLASL----- 67 (435) Q Consensus 1 Mg~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~~----- 67 (435) |-+. +.+...|..-+...+.....+....++.-... .. ....... ... ++--.|++.+|..+... T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~-~~d-stg~~a~~~LAa~l~~~ltpp~ 78 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDDLSSQN-AWQ-DDGASATNFLSNKLSQVLFPAQ 78 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCCccccc-ccc-chHHHHHHHHHHHHHHhhcCCC Confidence 3332 33333322111111111112222222111100 00 0001111 112 23334566666555432 Q ss_pred -cee-eeecccc-------------cc------cchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCc Q lcl|NC_019456. 68 -PLH-EYQNYKQ-------------MD------NEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGE 126 (435) Q Consensus 68 -~~~-~~~~~~~-------------~~------~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~ 126 (435) ||. +.-.+.. +. +..++.. +.+- +.+.=+..++.+++.+|++..++. + . +. T Consensus 79 ~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~-l~~s----nf~~~~~~~~~~L~~~G~a~ly~~-~-~-~~ 150 (517) T protein:vir:10 79 RSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLY-GESL----QFRPAVVEAFKHLIVTGNVMMYHP-D-K-TS 150 (517) T ss_pred CccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHH-HHhc----CcHHHHHHHHHHHHhHCeEEEEEe-C-C-CC Confidence 232 2111100 00 0111111 2233 455556777888999999977643 2 2 33 Q ss_pred EEEEEEeCCceeEEEEcCCCceE--------------------------------------E--EEEecCCeeEEE---- Q lcl|NC_019456. 127 PIALWPLDPNTVSILRNTDNNSY--------------------------------------W--YRVTSDIYNFTI---- 162 (435) Q Consensus 127 ~~~l~~l~~~~v~~~~~~~~~~~--------------------------------------~--~~~~~~~~~~~~---- 162 (435) ....|||... -+..+..|... + +....++....+ T Consensus 151 ~~~~~pl~~y--~v~~d~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~d 228 (517) T protein:vir:10 151 PIQAVPLHHY--CVRRDNNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKDGKYLIRQSAD 228 (517) T ss_pred cEEEEEcCeE--EEeeCCCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEeCCCceEEEEEeC Confidence 4456666432 22222222110 0 000011111000 Q ss_pred ------------chhheEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCc--eEEEeCCcCCHHHHHHH Q lcl|NC_019456. 163 ------------PINDVIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDK--FVLQYDRSISPEKRQAM 228 (435) Q Consensus 163 ------------~~~~iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~--~~~~~~~~~~~e~~~~~ 228 (435) ..-=.+-.|.....+..||.||..-+.-.+.....+.+........... .++.-++........ T Consensus 229 ~~~~~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l~-- 306 (517) T protein:vir:10 229 DVPVGKESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQFV-- 306 (517) T ss_pred ceeeccccccccccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhcc-- Confidence 0111222233323466899999999999988888776666654433333 333323333322211 Q ss_pred HHHHHHHhcCCCccccccCCceeeec--cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHH-HHHHHHHHH Q lcl|NC_019456. 229 VNDFLRMVKENGGAVVQEAGWKVDRY--ESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNV-EHVTHSWTM 305 (435) Q Consensus 229 ~~~~~~~~~~~~~~~vl~~g~~~~~~--~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~-e~~~~~~~~ 305 (435) .+..+.++-+..-++.++ +....-.-..+..+.....|-++|-+.....-..+. .++++ .....-... T Consensus 307 --------~~~~g~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r-vTAtEV~~r~~E~~~ 377 (517) T protein:vir:10 307 --------EGGSGAVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEAMTRRDAER-VTAYEIQRDAMLVEQ 377 (517) T ss_pred --------CCCccccccCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhhccCCcc-ccHHHHHHHHHHHHH Confidence 111122222222233333 333222223445556677888888654322222222 12221 111111223 Q ss_pred HHhHHHHHHHHH------------HHHhhcccccccCcceeeechhhhhc-cCHHHHHHHHHHHHhcCCcCHHH-HHHHh Q lcl|NC_019456. 306 TLMPIIRQYESQ------------FNMKLFTPGKRVKGFYFSFNVNGLLR-GDTAARTQYYQTLTRNGIFKPNE-IRELE 371 (435) Q Consensus 306 ~i~P~~~~i~~~------------l~~~l~~~~~~~~g~~i~fd~~~l~~-~d~~~~~~~~~~~~~~g~~t~NE-~R~~~ 371 (435) .|-|.+..+.++ +-..+ +.... .-.++.. +..+.+ .+......++...-... -..+ +...+ T Consensus 378 ~LGpv~~rl~~Ell~Pli~r~~~~l~~~l-~~~~v-~~~~~s~-la~l~r~~~~~~i~~~~~~i~~~a--~~~~~~~~~i 452 (517) T protein:vir:10 378 SLGGVYSLFATTFQGPLARWFMNGISSIL-TSKNV-SPTILTG-IEALGRMAELDKLGTFNGYVSMTA--QWPEPLQQAI 452 (517) T ss_pred HhhhHHHHHHHHHHHHHHHHHHHHhhhhc-CCCCc-cceeecc-HHHHHHHHHHHHHHHHHHHHHHhh--cCChHHHhcC Confidence 344444443333 32211 11110 1011111 111111 11121222211111110 0011 11112 Q ss_pred CCCCCCCcCCceeeecccccchh-cc---cc-----ccccccccccccccccccCCCCCCCCCCC Q lcl|NC_019456. 372 GQAPIPDEAADHLYISKDLYPLD-KY---YD-----AILDNKIQTDASVAAPKQEGGENTNENGL 427 (435) Q Consensus 372 g~~p~~~~~gd~~~~~~n~~~l~-~~---~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (435) +.+.+-+..++.+-+|.+++--+ .. .. .+.....+............+..+.+-++ T Consensus 453 d~d~~~~~~a~~~Gvp~~~irs~~ev~~~~~~~~~~~~~~~~~~~ag~~~~~~~~~~~~~~~~~~ 517 (517) T protein:vir:10 453 KWPDFTDWVQGQISANFPFFKTQDELNAEAQAQQEQEATKYAAEQAGKAIPDMVKNGQINPQGGQ 517 (517) T ss_pred CHHHHHHHHHHHhCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCCC Confidence 21111111111111221111000 00 00 00000000000000000000111111111 No 261 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=21.69 E-value=2.6 Score=18.34 Aligned_cols=382 Identities=13% Similarity=0.074 Sum_probs=126.5 Q ss_pred Cc-hHHHHHhhccccccccccccccchhhhh------hccccccCccccc----HHHHh-------hhHHHHHHHHHHHH Q lcl|NC_019456. 1 MS-FMSKVRQFFGVHDQANQIVQNPIPQPLD------MAGVKLEQATFSR----EHILE-------SNEYIFSIVTRLSN 62 (435) Q Consensus 1 Mg-~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~----~~~~~-------~~~~v~~~i~~ia~ 62 (435) .. +|.. .. -+.......+.+........ +++..+. .+|-. ...+. ....|..- T Consensus 40 lP~~~~~-~~-~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~-~~WF~l~~~d~~~~~~~~~~~~~~~v~~~------ 110 (535) T protein:vir:15 40 IPSLFPK-ES-DNESTDYTTPWQAVGARGLNNLASKLMLALFPM-QSWMKLTISEYEAKQLVGDPDGLAKVDEG------ 110 (535) T ss_pred cccccCC-CC-CcccccccccccccHHHHHHHHHHHHHHhhcCC-CcccccccChHHHhccCCCcchHHHHHHH------ Confidence 00 0000 00 00000000000000000000 0001111 01100 00000 00111111 Q ss_pred HHhhCceeeeecccccccchHHHhhhccccccCCHHHHHHHHHHHHHhcCCcceEEeeeCCCCcEEEEEEeCCceeEEEE Q lcl|NC_019456. 63 VLASLPLHEYQNYKQMDNEPLADLLKTSPNPNMTAFEFIARLETDRNVSGNGYAWIQKSLSTGEPIALWPLDPNTVSILR 142 (435) Q Consensus 63 ~ia~~~~~~~~~~~~~~~~~l~~~l~~~Pn~~~~~~~f~~~~~~~~~~~G~~~~~i~~~~~~g~~~~l~~l~~~~v~~~~ 142 (435) ++.+ ++.++..+ .+-| .+.=+..++.+++.+|++..++..+..++.....|||.- +-+.. T Consensus 111 -L~~v------------e~~~~~~l-~~sn----f~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~--~~v~~ 170 (535) T protein:vir:15 111 -LSMV------------ERIIMNYI-ESNS----YRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRLSS--YVVQR 170 (535) T ss_pred -HHHH------------HHHHHHHH-HhcC----cHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEcCe--eEEee Confidence 1111 11222222 2333 444456667888889999888766543333344555542 22222 Q ss_pred cCCCce------------------------------------EEEEE---ecCCeeEEE-----------------chhh Q lcl|NC_019456. 143 NTDNNS------------------------------------YWYRV---TSDIYNFTI-----------------PIND 166 (435) Q Consensus 143 ~~~~~~------------------------------------~~~~~---~~~~~~~~~-----------------~~~~ 166 (435) +..|.. +|..+ ..++....+ ..-= T Consensus 171 d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~~~~~~~~~~~~~~P 250 (535) T protein:vir:15 171 DAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMP 250 (535) T ss_pred CCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecCCCcEEEEEEeeCccccccccccccccCC Confidence 222211 00000 011111110 0111 Q ss_pred eEEeccCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHhhcCCc--eEEEeCCcCCHHHHHHHHHHHHHHhcCCCcccc Q lcl|NC_019456. 167 VIHVKHVVPSNSWYGVSPIDVLSSSLKFQRSVENFSQNEMEKKDK--FVLQYDRSISPEKRQAMVNDFLRMVKENGGAVV 244 (435) Q Consensus 167 iih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~n~~~--~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~v 244 (435) .+..|.....+..||.||...+.-.+...+.+.+........... .++.-++........ .+..+.++ T Consensus 251 ~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~----------~~~~g~~v 320 (535) T protein:vir:15 251 YIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLT----------KAQTGDFV 320 (535) T ss_pred ceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccchhcc----------cCCceeee Confidence 233333334466899999999998888888777766655444333 333333333333321 12222233 Q ss_pred c--cCCceeeeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCcccCcccHHH-HH------------HHHHHHHHhH Q lcl|NC_019456. 245 Q--EAGWKVDRYESKFEPADLSSVEQISRIRIATAFNVPISFLNDDQAKSTTNVE-HV------------THSWTMTLMP 309 (435) Q Consensus 245 l--~~g~~~~~~~~~~~~~~~~e~~~~~~~~Ia~~fgvP~~~lg~~~~~~~~~~e-~~------------~~~~~~~i~P 309 (435) . .+++...++...+.-.-..+..+.....|-++|-+.. +........++.+= .. ..+...-+.| T Consensus 321 ~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~P 399 (535) T protein:vir:15 321 PGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLP 399 (535) T ss_pred cCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHH Confidence 2 3344555554443322334455556778888885432 11111111222210 11 1122344566 Q ss_pred HHHHHHHHHHHh-hcccccccCcceeeechhhhhc----cCHHHHHHHHHHHHhcC------CcCHHH----HHHHhCCC Q lcl|NC_019456. 310 IIRQYESQFNMK-LFTPGKRVKGFYFSFNVNGLLR----GDTAARTQYYQTLTRNG------IFKPNE----IRELEGQA 374 (435) Q Consensus 310 ~~~~i~~~l~~~-l~~~~~~~~g~~i~fd~~~l~~----~d~~~~~~~~~~~~~~g------~~t~NE----~R~~~g~~ 374 (435) ++.+....+.+. ++++- ......++| .+.|.. .+......++..+-..+ .+..++ +.+.+|.| T Consensus 400 li~r~~~il~r~g~lP~~-p~~~v~~~y-is~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp 477 (535) T protein:vir:15 400 LVRVLLKQLQATSQIPEL-PKEAVEPTI-STGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGID 477 (535) T ss_pred HHHHHHHHHHhcCCCCCC-CccceeEEE-ecHHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCC Confidence 666655555443 33322 222334444 233321 22222333333222110 122222 22334444 Q ss_pred CCCCcCCceeeecc-cccchhccc-------cccccccccccccccccccCCCCCCCCCCCCCC Q lcl|NC_019456. 375 PIPDEAADHLYISK-DLYPLDKYY-------DAILDNKIQTDASVAAPKQEGGENTNENGLQST 430 (435) Q Consensus 375 p~~~~~gd~~~~~~-n~~~l~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (435) +.. ++.+. ....+-... ......++...+.....+.....-.+.-|.+.+ T Consensus 478 ~~~------i~~~~eev~~~~~q~~~~~~~~~~a~~~g~~~~~~~~~~p~~~~~~~~~~g~~~~ 535 (535) T protein:vir:15 478 TSG------ILLTDEQKQALMMQDAAQTGIENAAATGGAGVGALATSSPEAMQGAAAQAGLDAT 535 (535) T ss_pred hhh------hcCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccchhccChHHHHHHHhccCCCCC Confidence 310 00000 000000000 000000000000000011111111111122222 Done!