Query lcl|NC_020081.1_cdsid_YP_007349202.1 [gene=G380_gp180] [protein=putative portal protein] [protein_id=YP_007349202.1] [location=complement(28441..30099)] Match_columns 552 No_of_seqs 249 out of 1066 Neff 8.9 Searched_HMMs 1612 Date Thu Nov 7 18:27:29 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_25 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_25_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:80644 Length: 551 100.0 6E-119 4E-122 668.8 50.9 539 1-552 5-551 (551) 2 protein:vir:63755 Length: 547 100.0 1E-117 9E-121 661.1 50.9 539 1-552 1-547 (547) 3 protein:vir:96579 Length: 576 100.0 5E-117 3E-120 658.3 48.9 545 1-552 6-575 (576) 4 protein:vir:99312 Length: 563 100.0 1E-116 6E-120 656.6 50.3 548 1-552 1-563 (563) 5 protein:vir:95599 Length: 563 100.0 1E-116 6E-120 656.6 50.3 548 1-552 1-563 (563) 6 protein:vir:80796 Length: 574 100.0 2E-114 2E-117 643.4 48.4 536 1-545 9-574 (574) 7 protein:vir:100691 Length: 535 100.0 9E-95 5.6E-98 536.2 47.7 518 1-546 1-535 (535) 8 protein:vir:93610 Length: 454 100.0 1.2E-81 7.4E-85 464.2 43.8 441 37-552 1-454 (454) 9 protein:vir:1380 Length: 422 # 100.0 1E-81 6.4E-85 464.6 41.6 415 1-492 1-422 (422) 10 protein:vir:4337 Length: 434 # 100.0 2.2E-81 1.4E-84 462.8 41.3 426 24-503 1-434 (434) 11 protein:vir:100249 Length: 431 100.0 3.7E-81 2.3E-84 461.6 41.0 415 1-499 1-431 (431) 12 protein:vir:102080 Length: 429 100.0 2.5E-80 1.6E-83 457.0 44.3 418 24-506 1-429 (429) 13 protein:vir:100150 Length: 437 100.0 8.5E-81 5.3E-84 459.5 41.4 429 32-527 1-437 (437) 14 protein:vir:7853 Length: 518 # 100.0 8.1E-80 5E-83 454.2 46.4 466 24-552 1-503 (518) 15 protein:vir:102727 Length: 945 100.0 9.7E-81 6E-84 459.2 41.1 500 1-552 25-595 (945) 16 protein:vir:107605 Length: 432 100.0 1.6E-80 9.6E-84 458.1 42.1 425 1-506 1-432 (432) 17 protein:vir:102855 Length: 432 100.0 1.6E-80 9.6E-84 458.1 42.1 425 1-506 1-432 (432) 18 protein:vir:105002 Length: 432 100.0 1.6E-80 9.6E-84 458.1 42.1 425 1-506 1-432 (432) 19 protein:vir:101648 Length: 518 100.0 1.1E-79 6.6E-83 453.5 46.4 461 24-552 1-503 (518) 20 protein:vir:105064 Length: 421 100.0 7.1E-80 4.4E-83 454.5 41.8 410 35-520 1-421 (421) 21 protein:vir:102118 Length: 409 100.0 2.4E-79 1.5E-82 451.6 43.4 402 35-493 1-409 (409) 22 protein:vir:6240 Length: 457 # 100.0 2.8E-79 1.7E-82 451.3 43.5 447 1-541 1-457 (457) 23 protein:vir:10362 Length: 432 100.0 3E-79 1.8E-82 451.1 41.8 413 1-522 7-432 (432) 24 protein:vir:1884 Length: 424 # 100.0 4.4E-79 2.7E-82 450.2 42.3 416 8-496 1-424 (424) 25 protein:vir:5737 Length: 419 # 100.0 7.5E-79 4.6E-82 448.9 42.6 411 1-537 1-419 (419) 26 protein:vir:81152 Length: 411 100.0 4.6E-79 2.9E-82 450.1 41.4 404 24-493 1-411 (411) 27 protein:vir:1431 Length: 419 # 100.0 1.3E-78 8.1E-82 447.6 42.4 410 37-521 1-419 (419) 28 protein:vir:97060 Length: 432 100.0 1.3E-78 8.1E-82 447.6 41.9 413 1-522 7-432 (432) 29 protein:vir:483 Length: 413 # 100.0 2.9E-78 1.8E-81 445.6 43.8 407 35-517 1-413 (413) 30 protein:vir:4454 Length: 414 # 100.0 2.4E-78 1.5E-81 446.1 43.1 408 1-518 1-414 (414) 31 protein:vir:189 Length: 424 # 100.0 2.7E-78 1.7E-81 445.8 41.9 417 8-496 1-424 (424) 32 protein:vir:1326 Length: 457 # 100.0 7.2E-78 4.5E-81 443.5 43.5 447 1-530 1-457 (457) 33 protein:vir:81072 Length: 432 100.0 5E-78 3.1E-81 444.4 42.4 413 1-522 7-432 (432) 34 protein:vir:1266 Length: 416 # 100.0 8.6E-78 5.3E-81 443.1 42.8 409 35-512 1-416 (416) 35 protein:vir:101647 Length: 460 100.0 7.9E-78 4.9E-81 443.3 42.4 427 32-518 1-460 (460) 36 protein:vir:79984 Length: 441 100.0 1.5E-77 9.4E-81 441.7 42.8 429 1-526 1-441 (441) 37 protein:vir:9408 Length: 441 # 100.0 1.5E-77 9.4E-81 441.7 42.8 429 1-526 1-441 (441) 38 protein:vir:94666 Length: 723 100.0 1.2E-77 7.5E-81 442.3 41.8 446 50-552 1-478 (723) 39 protein:vir:4509 Length: 424 # 100.0 8.1E-78 5E-81 443.2 40.7 412 17-509 1-424 (424) 40 protein:vir:80333 Length: 419 100.0 7.9E-78 4.9E-81 443.3 40.2 410 37-534 1-419 (419) 41 protein:vir:98396 Length: 441 100.0 3.1E-77 1.9E-80 440.0 42.7 429 1-526 1-441 (441) 42 protein:vir:3868 Length: 417 # 100.0 1.9E-76 1.2E-79 435.7 42.2 409 37-534 1-417 (417) 43 protein:vir:4598 Length: 416 # 100.0 2.8E-76 1.7E-79 434.8 42.2 405 37-526 1-416 (416) 44 protein:vir:81095 Length: 416 100.0 2.8E-76 1.7E-79 434.8 42.2 405 37-526 1-416 (416) 45 protein:vir:81218 Length: 423 100.0 1.1E-75 6.5E-79 431.6 39.9 406 37-507 1-423 (423) 46 protein:vir:93943 Length: 409 100.0 5.9E-75 3.7E-78 427.5 42.8 402 27-512 1-409 (409) 47 protein:vir:94426 Length: 409 100.0 8.9E-75 5.5E-78 426.6 43.2 402 27-512 1-409 (409) 48 protein:vir:8418 Length: 409 # 100.0 5E-75 3.1E-78 427.9 41.2 405 24-518 1-409 (409) 49 protein:vir:96980 Length: 409 100.0 1.2E-74 7.2E-78 425.9 43.1 401 27-512 1-409 (409) 50 protein:vir:2683 Length: 412 # 100.0 1.2E-74 7.5E-78 425.8 43.0 405 24-498 1-412 (412) 51 protein:vir:9702 Length: 406 # 100.0 3.4E-74 2.1E-77 423.3 41.8 401 37-517 1-406 (406) 52 protein:vir:4194 Length: 540 # 100.0 4.6E-74 2.9E-77 422.6 41.7 468 27-552 1-511 (540) 53 protein:vir:960 Length: 413 # 100.0 1.1E-73 6.7E-77 420.6 41.2 402 24-507 1-413 (413) 54 protein:vir:9359 Length: 348 # 100.0 1.1E-72 7E-76 415.0 39.8 341 113-512 1-348 (348) 55 protein:vir:95378 Length: 406 100.0 6.6E-72 4.1E-75 410.8 40.7 398 37-508 1-406 (406) 56 protein:vir:8317 Length: 409 # 100.0 4.5E-72 2.8E-75 411.7 39.1 403 1-473 1-409 (409) 57 protein:vir:4156 Length: 542 # 100.0 6.9E-72 4.3E-75 410.7 40.0 453 19-552 1-479 (542) 58 protein:vir:3153 Length: 467 # 100.0 1.3E-71 8.3E-75 409.1 40.0 410 81-543 1-467 (467) 59 protein:vir:99452 Length: 651 100.0 7.6E-72 4.7E-75 410.5 36.3 479 24-552 1-601 (651) 60 protein:vir:80134 Length: 403 100.0 3E-71 1.9E-74 407.2 38.6 396 24-508 1-403 (403) 61 protein:vir:8100 Length: 466 # 100.0 1.4E-70 8.8E-74 403.5 40.8 441 1-519 1-466 (466) 62 protein:vir:3843 Length: 397 # 100.0 1.1E-68 6.6E-72 393.2 40.0 393 37-525 1-397 (397) 63 protein:vir:104259 Length: 403 100.0 9.4E-69 5.8E-72 393.5 38.3 395 24-503 1-403 (403) 64 protein:vir:6210 Length: 394 # 100.0 1.5E-68 9.6E-72 392.4 37.1 387 1-519 1-394 (394) 65 protein:vir:7407 Length: 392 # 100.0 3.1E-65 1.9E-68 374.3 38.0 371 1-510 3-392 (392) 66 protein:vir:100882 Length: 383 100.0 8E-65 5E-68 372.0 38.9 379 1-505 1-383 (383) 67 protein:vir:1082 Length: 359 # 100.0 5.5E-65 3.4E-68 372.9 37.7 354 37-462 1-359 (359) 68 protein:vir:4854 Length: 386 # 100.0 1.1E-64 7.1E-68 371.1 38.4 382 1-514 1-386 (386) 69 protein:vir:3989 Length: 392 # 100.0 1.5E-64 9.4E-68 370.5 37.7 382 1-510 3-392 (392) 70 protein:vir:1023 Length: 392 # 100.0 1.5E-64 9.4E-68 370.5 37.7 382 1-510 3-392 (392) 71 protein:vir:100187 Length: 385 100.0 2.4E-64 1.5E-67 369.3 38.0 379 24-504 1-385 (385) 72 protein:vir:95965 Length: 385 100.0 3.1E-63 1.9E-66 363.2 36.2 371 37-496 1-385 (385) 73 protein:vir:9507 Length: 395 # 100.0 1.5E-62 9.5E-66 359.5 37.7 383 37-519 1-395 (395) 74 protein:vir:100650 Length: 395 100.0 1.5E-62 9.5E-66 359.5 37.7 383 37-519 1-395 (395) 75 protein:vir:101289 Length: 395 100.0 1.5E-62 9.5E-66 359.5 37.7 383 37-519 1-395 (395) 76 protein:vir:4995 Length: 384 # 100.0 8E-62 4.9E-65 355.6 37.4 369 1-468 1-384 (384) 77 protein:vir:4828 Length: 382 # 100.0 2.9E-61 1.8E-64 352.5 38.4 374 1-509 1-382 (382) 78 protein:vir:79772 Length: 648 100.0 2.7E-60 1.7E-63 347.1 42.9 473 1-552 1-532 (648) 79 protein:vir:78310 Length: 376 100.0 6.6E-61 4.1E-64 350.5 35.4 366 37-483 1-376 (376) 80 protein:vir:4952 Length: 386 # 100.0 5.4E-60 3.3E-63 345.5 39.7 383 1-519 1-386 (386) 81 protein:vir:94002 Length: 378 100.0 4.6E-61 2.9E-64 351.4 33.1 360 37-513 1-378 (378) 82 protein:vir:93867 Length: 378 100.0 9.8E-61 6.1E-64 349.6 33.2 360 37-513 1-378 (378) 83 protein:vir:1661 Length: 378 # 100.0 4.3E-60 2.7E-63 346.1 35.4 360 1-513 1-378 (378) 84 protein:vir:9641 Length: 395 # 100.0 1.3E-59 7.9E-63 343.5 35.0 379 1-499 1-395 (395) 85 protein:vir:858 Length: 378 # 100.0 4.8E-59 3E-62 340.3 33.3 359 24-518 1-378 (378) 86 protein:vir:98643 Length: 395 100.0 1.8E-58 1.1E-61 337.1 35.7 380 1-499 1-395 (395) 87 protein:vir:4089 Length: 395 # 100.0 1.3E-58 7.8E-62 338.0 34.1 380 24-518 1-395 (395) 88 protein:vir:103971 Length: 376 100.0 3.9E-57 2.4E-60 329.8 34.0 354 1-431 1-376 (376) 89 protein:vir:267 Length: 348 # 100.0 5.4E-57 3.3E-60 329.1 33.4 329 27-438 1-348 (348) 90 protein:vir:94869 Length: 378 100.0 1.1E-56 6.7E-60 327.4 32.9 359 24-518 1-378 (378) 91 protein:vir:79207 Length: 351 100.0 1.9E-56 1.2E-59 326.1 33.2 327 27-431 1-351 (351) 92 protein:vir:79150 Length: 368 100.0 8.3E-57 5.1E-60 328.0 28.9 364 1-444 1-368 (368) 93 protein:vir:78191 Length: 351 100.0 7.5E-56 4.6E-59 322.8 33.3 329 1-431 1-351 (351) 94 protein:vir:78641 Length: 278 100.0 1.1E-55 6.8E-59 321.9 33.5 276 113-429 1-278 (278) 95 protein:vir:5691 Length: 344 # 100.0 5.1E-56 3.2E-59 323.7 31.5 334 27-429 1-344 (344) 96 protein:vir:6058 Length: 344 # 100.0 1.6E-55 1E-58 321.0 33.0 330 27-429 1-344 (344) 97 protein:vir:98567 Length: 340 100.0 2.5E-55 1.5E-58 319.9 33.7 328 1-428 1-340 (340) 98 protein:vir:2013 Length: 344 # 100.0 1.9E-55 1.2E-58 320.6 31.0 334 27-432 1-344 (344) 99 protein:vir:100328 Length: 346 100.0 7.8E-55 4.8E-58 317.2 33.8 342 27-434 1-346 (346) 100 protein:vir:3780 Length: 345 # 100.0 8.8E-55 5.5E-58 316.9 32.0 325 27-431 1-345 (345) 101 protein:vir:3743 Length: 345 # 100.0 4.9E-54 3E-57 312.9 33.5 333 8-431 1-345 (345) 102 protein:vir:1150 Length: 350 # 100.0 5E-54 3.1E-57 312.8 32.6 336 27-429 1-350 (350) 103 protein:vir:78749 Length: 337 100.0 6.9E-54 4.3E-57 312.0 31.8 325 27-430 1-337 (337) 104 protein:vir:98853 Length: 219 100.0 2.7E-45 1.7E-48 265.0 22.7 210 199-433 1-219 (219) 105 protein:vir:4698 Length: 251 # 100.0 3.6E-40 2.2E-43 236.8 26.3 249 1-326 1-251 (251) 106 protein:vir:5249 Length: 437 # 99.9 5.6E-24 3.4E-27 148.1 34.2 411 18-527 1-437 (437) 107 protein:vir:94049 Length: 532 99.9 1.9E-20 1.2E-23 128.7 37.6 488 1-546 1-532 (532) 108 protein:vir:107742 Length: 537 99.8 6.1E-20 3.8E-23 125.9 35.9 466 3-537 1-537 (537) 109 protein:vir:99563 Length: 862 99.8 6.2E-17 3.8E-20 109.4 38.3 494 1-552 40-600 (862) 110 protein:vir:108215 Length: 469 99.8 3.5E-17 2.1E-20 110.8 35.8 430 27-531 1-469 (469) 111 protein:vir:80040 Length: 461 99.7 3.7E-17 2.3E-20 110.7 28.4 408 27-514 1-461 (461) 112 protein:vir:104338 Length: 422 99.7 2.4E-17 1.5E-20 111.7 27.0 393 40-519 1-422 (422) 113 protein:vir:79647 Length: 435 99.7 3.4E-17 2.1E-20 110.9 27.5 407 1-515 1-435 (435) 114 protein:vir:79538 Length: 502 99.7 4.4E-17 2.7E-20 110.3 26.6 446 1-526 1-502 (502) 115 protein:vir:107662 Length: 427 99.7 2.1E-16 1.3E-19 106.6 29.3 397 43-519 1-427 (427) 116 protein:vir:96068 Length: 765 99.7 3.8E-15 2.4E-18 99.6 35.9 502 1-552 1-590 (765) 117 protein:vir:1986 Length: 512 # 99.7 2.5E-14 1.6E-17 95.1 37.4 430 1-552 1-452 (512) 118 protein:vir:79511 Length: 448 99.6 1.4E-14 8.5E-18 96.6 33.8 424 27-522 1-448 (448) 119 protein:vir:77981 Length: 448 99.6 5.2E-14 3.2E-17 93.4 32.7 427 1-538 1-448 (448) 120 protein:vir:103860 Length: 528 99.6 4.2E-13 2.6E-16 88.4 37.5 456 1-552 1-489 (528) 121 protein:vir:79063 Length: 491 99.5 2E-12 1.2E-15 84.7 35.8 427 27-552 1-458 (491) 122 protein:vir:99853 Length: 488 99.5 3.8E-12 2.4E-15 83.2 35.1 420 31-552 1-445 (488) 123 protein:vir:79233 Length: 526 99.5 5E-12 3.1E-15 82.5 38.0 450 1-552 1-487 (526) 124 protein:vir:99232 Length: 526 99.5 6.1E-12 3.8E-15 82.1 39.0 450 1-552 1-487 (526) 125 protein:vir:6382 Length: 553 # 99.5 7.5E-13 4.7E-16 87.0 29.7 452 27-545 1-553 (553) 126 protein:vir:107880 Length: 491 99.5 6.1E-12 3.8E-15 82.1 34.6 437 27-552 1-458 (491) 127 protein:vir:101541 Length: 694 99.5 3E-12 1.9E-15 83.7 32.0 506 1-552 41-615 (694) 128 protein:vir:389 Length: 530 # 99.5 4.5E-13 2.8E-16 88.3 27.3 445 15-534 1-530 (530) 129 protein:vir:3420 Length: 533 # 99.5 3.8E-12 2.3E-15 83.2 31.8 452 29-534 1-533 (533) 130 protein:vir:78589 Length: 695 99.5 4.1E-12 2.6E-15 83.0 31.8 497 1-552 52-616 (695) 131 protein:vir:3648 Length: 695 # 99.4 7.4E-12 4.6E-15 81.6 32.0 496 1-552 52-616 (695) 132 protein:vir:96738 Length: 505 99.4 1.3E-12 7.8E-16 85.8 27.5 445 8-526 1-505 (505) 133 protein:vir:106716 Length: 698 99.4 3.4E-12 2.1E-15 83.4 28.4 492 1-552 52-619 (698) 134 protein:vir:95542 Length: 548 99.4 1.4E-12 8.6E-16 85.6 25.9 479 1-552 1-548 (548) 135 protein:vir:95254 Length: 488 99.4 4.2E-11 2.6E-14 77.4 35.8 430 37-530 1-488 (488) 136 protein:vir:105782 Length: 449 99.4 5.3E-12 3.3E-15 82.4 26.4 402 27-518 1-449 (449) 137 protein:vir:10321 Length: 495 99.3 2.7E-11 1.7E-14 78.5 28.8 436 1-529 1-495 (495) 138 protein:vir:78161 Length: 355 99.2 8.9E-11 5.5E-14 75.7 27.4 321 177-544 1-355 (355) 139 protein:vir:98816 Length: 446 99.2 4.4E-10 2.7E-13 71.9 32.3 386 24-466 1-446 (446) 140 protein:vir:5961 Length: 503 # 99.1 1.9E-09 1.2E-12 68.4 28.7 457 3-527 1-503 (503) 141 protein:vir:98444 Length: 434 99.1 3.7E-09 2.3E-12 66.8 29.0 377 67-524 1-434 (434) 142 protein:vir:3964 Length: 453 # 99.0 4.9E-09 3E-12 66.2 27.0 409 37-514 1-453 (453) 143 protein:vir:3609 Length: 452 # 98.9 1.2E-08 7.6E-12 64.0 28.9 397 43-526 1-452 (452) 144 protein:vir:5839 Length: 533 # 98.9 9.9E-09 6.1E-12 64.5 25.7 471 1-552 1-531 (533) 145 protein:vir:2427 Length: 485 # 98.9 9.3E-09 5.8E-12 64.6 25.0 423 51-525 1-485 (485) 146 protein:vir:4223 Length: 486 # 98.9 1E-08 6.4E-12 64.4 25.1 412 51-541 1-486 (486) 147 protein:vir:104082 Length: 485 98.9 1.8E-08 1.1E-11 63.0 25.7 436 24-525 1-485 (485) 148 protein:vir:99088 Length: 629 98.9 5.5E-09 3.4E-12 65.9 22.7 476 24-552 1-564 (629) 149 protein:vir:8654 Length: 629 # 98.9 4.8E-09 3E-12 66.2 22.2 476 24-552 1-564 (629) 150 protein:vir:96839 Length: 474 98.9 1.3E-08 8.3E-12 63.8 24.6 433 1-517 1-474 (474) 151 protein:vir:95806 Length: 440 98.9 2.6E-08 1.6E-11 62.2 29.4 406 65-513 1-440 (440) 152 protein:vir:9871 Length: 429 # 98.8 3.3E-08 2E-11 61.6 29.8 396 55-510 1-429 (429) 153 protein:vir:99522 Length: 470 98.8 3.6E-08 2.2E-11 61.4 27.5 423 37-515 1-470 (470) 154 protein:vir:733 Length: 453 # 98.8 4.8E-08 3E-11 60.7 26.6 395 59-507 1-453 (453) 155 protein:vir:105889 Length: 474 98.8 5.2E-08 3.2E-11 60.5 31.6 419 34-519 1-474 (474) 156 protein:vir:94101 Length: 474 98.8 5.2E-08 3.2E-11 60.5 31.6 419 34-519 1-474 (474) 157 protein:vir:94742 Length: 409 98.8 5.8E-08 3.6E-11 60.3 30.5 356 59-462 1-409 (409) 158 protein:vir:7768 Length: 484 # 98.8 6.1E-08 3.8E-11 60.1 26.3 427 32-538 1-484 (484) 159 protein:vir:102426 Length: 631 98.8 2.5E-08 1.6E-11 62.2 22.6 474 24-552 1-566 (631) 160 protein:vir:99072 Length: 479 98.7 7.3E-08 4.5E-11 59.7 26.5 396 59-539 1-479 (479) 161 protein:vir:80680 Length: 441 98.7 8E-08 5E-11 59.5 25.4 392 52-510 1-441 (441) 162 protein:vir:9306 Length: 511 # 98.7 9E-08 5.6E-11 59.2 30.8 457 17-530 1-511 (511) 163 protein:vir:99916 Length: 504 98.7 9.8E-08 6.1E-11 59.0 28.6 442 17-538 1-504 (504) 164 protein:vir:78537 Length: 480 98.7 1.2E-07 7.4E-11 58.5 28.3 405 71-540 1-480 (480) 165 protein:vir:96240 Length: 511 98.7 1.3E-07 7.9E-11 58.4 31.3 457 17-530 1-511 (511) 166 protein:vir:103951 Length: 511 98.7 1.4E-07 8.7E-11 58.2 31.9 459 17-530 1-511 (511) 167 protein:vir:97171 Length: 512 98.6 1.5E-07 9.2E-11 58.0 32.9 433 37-530 1-512 (512) 168 protein:vir:1634 Length: 409 # 98.6 1.6E-07 9.8E-11 57.9 29.3 334 73-462 1-409 (409) 169 protein:vir:96266 Length: 474 98.6 2E-07 1.2E-10 57.3 30.0 417 27-535 1-474 (474) 170 protein:vir:95899 Length: 474 98.6 2E-07 1.2E-10 57.3 30.0 417 27-535 1-474 (474) 171 protein:vir:95113 Length: 474 98.6 2.1E-07 1.3E-10 57.2 32.6 398 27-530 1-474 (474) 172 protein:vir:99781 Length: 511 98.6 2.7E-07 1.7E-10 56.6 30.2 462 12-530 1-511 (511) 173 protein:vir:9751 Length: 422 # 98.6 2.9E-07 1.8E-10 56.5 26.9 359 59-491 1-422 (422) 174 protein:vir:2341 Length: 488 # 98.6 2.9E-07 1.8E-10 56.4 25.7 431 31-517 1-488 (488) 175 protein:vir:93747 Length: 472 98.5 3.4E-07 2.1E-10 56.0 30.3 395 49-530 1-472 (472) 176 protein:vir:106639 Length: 481 98.5 3.6E-07 2.2E-10 55.9 29.9 435 32-512 1-481 (481) 177 protein:vir:97447 Length: 474 98.5 3.7E-07 2.3E-10 55.8 32.3 423 27-529 1-474 (474) 178 protein:vir:94498 Length: 474 98.5 3.7E-07 2.3E-10 55.8 32.3 423 27-529 1-474 (474) 179 protein:vir:78805 Length: 511 98.5 3.9E-07 2.4E-10 55.7 30.2 459 17-530 1-511 (511) 180 protein:vir:96366 Length: 511 98.5 3.9E-07 2.4E-10 55.7 30.2 459 17-530 1-511 (511) 181 protein:vir:7987 Length: 456 # 98.5 2E-07 1.2E-10 57.3 20.6 394 49-511 1-456 (456) 182 protein:vir:78227 Length: 480 98.5 4.6E-07 2.8E-10 55.4 28.6 406 71-540 1-480 (480) 183 protein:vir:102602 Length: 456 98.5 5.4E-07 3.3E-10 55.0 24.2 399 49-507 1-456 (456) 184 protein:vir:105819 Length: 456 98.5 5.4E-07 3.3E-10 55.0 24.2 399 49-507 1-456 (456) 185 protein:vir:106571 Length: 499 98.5 5.5E-07 3.4E-10 54.9 34.9 408 37-529 1-499 (499) 186 protein:vir:1236 Length: 483 # 98.5 5.7E-07 3.5E-10 54.8 30.9 408 32-530 1-483 (483) 187 protein:vir:78083 Length: 537 98.5 5.9E-07 3.7E-10 54.7 28.7 462 20-532 1-537 (537) 188 protein:vir:105292 Length: 478 98.4 7.7E-07 4.8E-10 54.1 33.5 395 33-519 1-478 (478) 189 protein:vir:94805 Length: 492 98.4 9.1E-07 5.7E-10 53.7 28.0 425 22-530 1-492 (492) 190 protein:vir:5665 Length: 511 # 98.3 1.2E-06 7.5E-10 53.0 27.6 430 26-503 1-511 (511) 191 protein:vir:2732 Length: 501 # 98.3 1.3E-06 7.8E-10 52.9 26.6 445 13-530 1-501 (501) 192 protein:vir:4898 Length: 502 # 98.3 1.4E-06 8.5E-10 52.7 28.2 436 24-534 1-502 (502) 193 protein:vir:9568 Length: 410 # 98.3 1.5E-06 9.3E-10 52.5 27.6 357 36-492 1-410 (410) 194 protein:vir:79043 Length: 479 98.3 1.9E-06 1.2E-09 52.0 24.9 418 1-515 2-479 (479) 195 protein:vir:97336 Length: 492 98.3 1.9E-06 1.2E-09 51.9 28.6 416 22-530 1-492 (492) 196 protein:vir:97900 Length: 639 98.3 2E-06 1.3E-09 51.8 21.3 475 24-552 1-569 (639) 197 protein:vir:107517 Length: 639 98.3 2E-06 1.3E-09 51.8 21.3 475 24-552 1-569 (639) 198 protein:vir:96494 Length: 501 98.2 3E-06 1.9E-09 50.9 29.0 447 13-549 1-501 (501) 199 protein:vir:2500 Length: 501 # 98.2 3.1E-06 1.9E-09 50.8 29.4 427 27-540 1-501 (501) 200 protein:vir:107112 Length: 478 98.1 3.7E-06 2.3E-09 50.4 30.0 395 33-518 1-478 (478) 201 protein:vir:38 Length: 496 # N 98.1 3.8E-06 2.4E-09 50.3 26.8 418 36-515 1-496 (496) 202 protein:vir:106027 Length: 629 98.1 5.2E-06 3.2E-09 49.5 23.5 464 24-552 1-554 (629) 203 protein:vir:103219 Length: 201 98.0 3.7E-07 2.3E-10 55.9 11.5 190 286-517 1-201 (201) 204 protein:vir:8184 Length: 474 # 97.9 1.3E-05 8.1E-09 47.4 27.0 414 28-509 1-474 (474) 205 protein:vir:104500 Length: 537 97.8 1.6E-05 9.8E-09 46.9 31.6 465 22-546 1-537 (537) 206 protein:vir:106491 Length: 646 97.8 1.8E-05 1.1E-08 46.6 27.6 472 1-552 1-557 (646) 207 protein:vir:94546 Length: 506 97.7 2.2E-05 1.4E-08 46.1 27.0 441 24-521 1-506 (506) 208 protein:vir:80959 Length: 499 97.7 2.6E-05 1.6E-08 45.7 25.2 419 35-515 1-499 (499) 209 protein:vir:106999 Length: 564 97.7 2.9E-05 1.8E-08 45.5 29.9 474 1-549 1-564 (564) 210 protein:vir:98265 Length: 524 97.7 3.1E-05 1.9E-08 45.3 27.5 443 1-516 1-524 (524) 211 protein:vir:103177 Length: 533 97.6 3.3E-05 2.1E-08 45.1 29.8 458 1-550 1-533 (533) 212 protein:vir:3028 Length: 500 # 97.6 3.8E-05 2.4E-08 44.8 24.2 424 1-513 1-500 (500) 213 protein:vir:9815 Length: 500 # 97.6 3.8E-05 2.4E-08 44.8 24.2 424 1-513 1-500 (500) 214 protein:vir:101806 Length: 516 97.5 5.1E-05 3.2E-08 44.1 29.2 443 1-516 1-516 (516) 215 protein:vir:101189 Length: 516 97.5 5.1E-05 3.2E-08 44.1 29.2 443 1-516 1-516 (516) 216 protein:vir:108049 Length: 524 97.5 5.9E-05 3.6E-08 43.8 28.6 443 1-516 1-524 (524) 217 protein:vir:100598 Length: 516 97.5 6.1E-05 3.8E-08 43.7 30.0 444 1-516 1-516 (516) 218 protein:vir:106282 Length: 521 97.5 6.4E-05 4E-08 43.6 24.9 445 1-516 1-521 (521) 219 protein:vir:96179 Length: 468 97.3 8.9E-05 5.5E-08 42.8 29.3 399 35-507 1-468 (468) 220 protein:vir:105461 Length: 470 97.3 9.6E-05 5.9E-08 42.6 29.8 397 36-514 1-470 (470) 221 protein:vir:1587 Length: 508 # 97.2 0.00012 7.3E-08 42.1 24.9 429 1-513 1-508 (508) 222 protein:vir:102950 Length: 471 97.2 0.00013 8.4E-08 41.8 28.6 390 59-510 1-471 (471) 223 protein:vir:6596 Length: 521 # 97.2 0.00014 8.6E-08 41.7 29.4 443 1-503 8-521 (521) 224 protein:vir:81017 Length: 521 97.1 0.00016 1E-07 41.4 29.1 441 1-503 8-521 (521) 225 protein:vir:104892 Length: 558 96.9 0.00029 1.8E-07 40.0 30.6 470 1-546 1-558 (558) 226 protein:vir:79703 Length: 505 96.9 0.00029 1.8E-07 40.0 29.5 419 37-523 1-505 (505) 227 protein:vir:9922 Length: 489 # 96.8 0.00033 2E-07 39.7 30.6 415 59-519 1-489 (489) 228 protein:vir:6896 Length: 523 # 96.3 0.0008 4.9E-07 37.6 25.5 444 1-516 1-523 (523) 229 protein:vir:103458 Length: 524 96.0 0.0011 7.1E-07 36.7 27.2 445 1-516 1-524 (524) 230 protein:vir:4073 Length: 279 # 95.7 0.00039 2.4E-07 39.3 9.9 267 113-451 1-279 (279) 231 protein:vir:78907 Length: 518 95.6 0.0017 1.1E-06 35.7 29.2 423 24-512 1-518 (518) 232 protein:vir:4782 Length: 522 # 95.4 0.0022 1.4E-06 35.2 27.1 439 1-528 1-522 (522) 233 protein:vir:7208 Length: 524 # 95.3 0.0024 1.5E-06 34.9 27.8 445 1-516 1-524 (524) 234 protein:vir:102330 Length: 451 95.2 0.0025 1.5E-06 34.9 27.4 384 61-499 1-451 (451) 235 protein:vir:98883 Length: 517 94.1 0.0055 3.4E-06 33.0 32.1 432 37-515 1-517 (517) 236 protein:vir:105154 Length: 525 93.1 0.0089 5.5E-06 31.8 18.1 464 1-540 1-525 (525) 237 protein:vir:94709 Length: 522 81.4 0.086 5.3E-05 26.4 26.2 433 32-518 1-522 (522) 238 protein:vir:101494 Length: 527 52.1 0.56 0.00035 21.9 26.9 419 42-520 1-527 (527) 239 protein:vir:102239 Length: 527 52.0 0.57 0.00035 21.9 27.0 419 42-520 1-527 (527) 240 protein:vir:94572 Length: 535 49.8 0.63 0.00039 21.7 16.6 425 27-506 1-535 (535) 241 protein:vir:101418 Length: 569 48.5 0.67 0.00041 21.5 19.1 468 1-528 12-569 (569) 242 protein:vir:7017 Length: 515 # 39.3 1 0.00063 20.5 22.6 418 22-538 1-515 (515) 243 protein:vir:102668 Length: 547 35.6 1.2 0.00076 20.1 26.1 426 31-518 1-547 (547) 244 protein:vir:3361 Length: 535 # 33.0 1.4 0.00086 19.8 19.4 433 24-525 1-535 (535) 245 protein:vir:97376 Length: 320 30.6 1.6 0.00097 19.5 12.3 305 24-469 1-320 (320) 246 protein:vir:10447 Length: 536 25.6 2 0.0013 18.9 23.4 424 24-507 1-536 (536) 247 protein:vir:2198 Length: 536 # 24.7 2.1 0.0013 18.8 23.7 424 24-507 1-536 (536) 248 protein:vir:103330 Length: 517 23.0 2.4 0.0015 18.5 25.6 417 24-514 1-517 (517) No 1 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=100.00 E-value=5.8e-119 Score=668.80 Aligned_cols=539 Identities=55% Similarity=0.909 Sum_probs=476.2 Q ss_pred CCCCCCCcc--cccchhhc--ccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCch Q lcl|NC_020081. 1 MGLLDGFFK--GRKQQDNI--IDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQN 76 (552) Q Consensus 1 ~~~~~~~~~--~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (552) |||+. ||| +.+++.-+ +++++......++++...+. |++.++.++++.|..+.+.+.++|+.+|+..++.. T Consensus 5 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~~~~~~ 79 (551) T protein:vir:80 5 LGLFE-SIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQIS----KAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQD 79 (551) T ss_pred hhhHH-HhhhccCChhhcccccccccceeeecccccHHHHH----HhhccCcceeecccccceecCcccccCccccChhH Confidence 99999 999 66666555 66666777777777777665 45777889999999999999999999999988887 Q ss_pred HHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCcc Q lcl|NC_020081. 77 LLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTR 156 (552) Q Consensus 77 ~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~ 156 (552) +...|+.+++ ++|+++||.+++++|++++.+.+....+++|.+++++.+.+++.++.++++.+.++|++||...+|+ + T Consensus 80 l~~~~~~~~~-npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~-~ 157 (551) T protein:vir:80 80 LHGVLKKFGG-NIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDIN-R 157 (551) T ss_pred HHHHHHHhhc-CHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCc-c Confidence 7778888886 5788999999999999999999999999999999999999999999999999999999999765555 4 Q ss_pred CCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEccccee Q lcl|NC_020081. 157 DNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMA 236 (552) Q Consensus 157 ~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi 236 (552) +|+.+|+++++.++|++||+|++++|+..|+|++||||+|.+|++..+++|.... ...+|+++..+.....|+++||| T Consensus 158 ~s~~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~~~~--~~~~y~~~~~g~~~~~~~~~eii 235 (551) T protein:vir:80 158 DSFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPD--NGNRFVQVIDQKIVATFNAREMA 235 (551) T ss_pred chHHHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCcccccc--CceEEEEEeCCcEEEEEcccceE Confidence 7999999999999999999999999999999999999999999999999986533 34578888888888899999999 Q ss_pred eecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccc Q lcl|NC_020081. 237 WEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGING 316 (552) Q Consensus 237 ~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~n 316 (552) |+++++..+..+++||+|||.+++.+|..+.++++|+.++|+||++|+|||.++++..++++++++++++|++.++|..| T Consensus 236 H~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~n 315 (551) T protein:vir:80 236 FAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGING 315 (551) T ss_pred EecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccc Confidence 99999888888899999999999999999999999999999999999999999887778999999999999999999999 Q ss_pred cccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHH Q lcl|NC_020081. 317 AWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKG 396 (552) Q Consensus 317 agk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~ 396 (552) +|++|||+++|++|+++++++.|+||+|++++++++||++|||||++||+.++++.++...++.+++|++++.+.|+++| T Consensus 316 ag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~t 395 (551) T protein:vir:80 316 SWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKG 395 (551) T ss_pred cCccccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHH Confidence 99999998889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCC-CCCCCeeeccccccc Q lcl|NC_020081. 397 LEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPD-TEGGDVTLAGVHVQR 475 (552) Q Consensus 397 l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p-~~ggD~~~~~~n~~~ 475 (552) |+||+++||++||++|+++++.+++|+|+..+..++++.++++....+|+||+||+|+++|||| +||||+++.++++++ T Consensus 396 L~P~~~~ie~~ln~~L~~~~~~~~~f~f~~~~~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~ 475 (551) T protein:vir:80 396 LQPLLGFIEDFINKHIVAEFGDKYTFQFVGGDIKSELESVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQR 475 (551) T ss_pred HHHHHHHHHHHHHhhhccccCCceEEEeeccChhhHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCceeeccccccc Confidence 9999999999999999999998999999999988888888877777789999999999999998 799999999999999 Q ss_pred hhhhccccccccccCCCCCccCcccCCC---CCCCCCCCCCCCcccccCCCCccccccccccccccCccccccccccccC Q lcl|NC_020081. 476 LGQIMQQEQVEYQRQMDANQFLAQQTGY---DGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQGGKDDNGNVVNDWEA 552 (552) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (552) ++...+.+..+...++...+...+..+. ++.+.++.+.++ ..++++|++.+++++.++.|++++ |.+.+||++ T Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~ 551 (551) T protein:vir:80 476 IGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDT-TGDIGKDGQRKDKDNANAGKQGMK---GDKPNDWQT 551 (551) T ss_pred ccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCcccc-CCCccccccccCccccchhhhhcC---CCCccccCC Confidence 8887776666555444333322222222 222333333333 467788999999999999999998 799999999 No 2 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=100.00 E-value=1.5e-117 Score=661.09 Aligned_cols=539 Identities=55% Similarity=0.895 Sum_probs=473.9 Q ss_pred CCCCCCCcccccchhhc----ccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCch Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNI----IDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQN 76 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (552) ||++. |+|...-.++- +++++..+....+++...+.| .+.++++++++|..+.+..+++|+++|+..++.. T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k----~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 75 (547) T protein:vir:63 1 MGLFE-SIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISK----AMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQD 75 (547) T ss_pred Cchhh-hhhhhcCCccccccccccccccchhhhhhhHHHHHH----hhcccchhhhchhhheeecccccccCCccCChhH Confidence 99999 99885543333 556666677777777776654 5777899999999999999999999999988777 Q ss_pred HHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCcc Q lcl|NC_020081. 77 LLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTR 156 (552) Q Consensus 77 ~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~ 156 (552) +...|+.++. ++++++||.+++++++++|.+.++...+.+|.+++++.+++.+.++..+++.+.++|++||...+|+ + T Consensus 76 l~~l~~~~~~-npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~-~ 153 (547) T protein:vir:63 76 LHGVLKKFGG-NIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDIN-R 153 (547) T ss_pred HHHHHHHhhc-CHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCc-c Confidence 7777777775 5889999999999999999999999999999999999999999999999999999999999765555 4 Q ss_pred CCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEccccee Q lcl|NC_020081. 157 DNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMA 236 (552) Q Consensus 157 ~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi 236 (552) +|+++|+++++.++|++||+|++++|+..|+|++||||+|.+|++..+++|... ....+|+++.++.....|+++||| T Consensus 154 ~s~~~f~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~--~~~~~y~~~~~~~~~~~~~~~eii 231 (547) T protein:vir:63 154 DSFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIP--DNGNRFVQVIDQKIVATFNAREMA 231 (547) T ss_pred chHHHHHHHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCccccc--cCceEEEEEcCCcEEEEeccccEE Confidence 799999999999999999999999999999999999999999999999988543 345678888888888899999999 Q ss_pred eecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccc Q lcl|NC_020081. 237 WEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGING 316 (552) Q Consensus 237 ~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~n 316 (552) |+++++..+...++||+|||.++..+|..+.++++|+.++|+||++|+|||.++++..++++++++++++|++.++|..| T Consensus 232 h~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~n 311 (547) T protein:vir:63 232 FAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGING 311 (547) T ss_pred EecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCccc Confidence 99999888888889999999999999999999999999999999999999999887778999999999999999999999 Q ss_pred cccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHH Q lcl|NC_020081. 317 AWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKG 396 (552) Q Consensus 317 agk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~ 396 (552) +|++|||+++|++|+++++++.|+||+|++++++++||++|||||++||+.++++.++...++.+++|++++.+.|+++| T Consensus 312 agk~~vl~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~t 391 (547) T protein:vir:63 312 SWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKG 391 (547) T ss_pred ccccccccCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHH Confidence 99999998889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCC-CCCCCeeeccccccc Q lcl|NC_020081. 397 LEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPD-TEGGDVTLAGVHVQR 475 (552) Q Consensus 397 l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p-~~ggD~~~~~~n~~~ 475 (552) |.||+++||++||++|++.++..++|+|+..+..++.+.+++.....+|+||+||+|+++|||| +||||+++.++++.+ T Consensus 392 L~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~ 471 (547) T protein:vir:63 392 LQPLLGFIEDFINKHIVAEFGDKYTFQFVGGDIKSELESVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQR 471 (547) T ss_pred HHHHHHHHHHHHHhhcccccCCceEEEeeccccccHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCceeeccccccc Confidence 9999999999999999998888899999999888888877777777789999999999999998 799999999999999 Q ss_pred hhhhccccccccccCCCCCccCcccCCCCCCC---CCCCCCCCcccccCCCCccccccccccccccCccccccccccccC Q lcl|NC_020081. 476 LGQIMQQEQVEYQRQMDANQFLAQQTGYDGNM---DNVNGKDSFNQNVGKDGQSKQQANTNSTPQGGKDDNGNVVNDWEA 552 (552) Q Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (552) ++...+.+..+...+....+...++.+.+..+ +++.+.+ +..+++.|+..+++++.++.|++++ |.+.+||++ T Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~d~~~~~~~~~~~~~~~~~---~~~~~~~~~ 547 (547) T protein:vir:63 472 IGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKD-TTGDIGKDGQRKDKDNANAGKQGMK---GDKPNDWQT 547 (547) T ss_pred ccccccccCCccccchhhccccccccCCCCCCCCCCCCCCcc-cCCCcCccccccCccccchhhhhcC---CCCccccCC Confidence 88777666555555444443333333333222 3332222 3567788999999999999999999 799999999 No 3 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=100.00 E-value=4.7e-117 Score=658.33 Aligned_cols=545 Identities=46% Similarity=0.801 Sum_probs=466.3 Q ss_pred CCCCCCCcc-cccchhhc--ccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccC-CCCch Q lcl|NC_020081. 1 MGLLDGFFK-GRKQQDNI--IDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSI-HGKQN 76 (552) Q Consensus 1 ~~~~~~~~~-~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 76 (552) -+||. ||| |.+|.+.. +.++++++++++++.+. .+.+.++...+++++.+|+...++.+++|+.+|+. ..... T Consensus 6 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~~~~~~ 82 (576) T protein:vir:96 6 ADIFK-RLRLGRDYEDIIDTVPIDDGLQANIRNIEEK--SKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYMKNSDN 82 (576) T ss_pred HHHHH-HHhccCccccchhhhhcccChhHHHHHhhhh--hhhhccccCCccchhhcceeeeeecCCCccccCcchhhhhh Confidence 46666 888 45555444 56699999999999864 55566778888899999999999999999998873 34456 Q ss_pred HHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCcc Q lcl|NC_020081. 77 LLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTR 156 (552) Q Consensus 77 ~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~ 156 (552) ....|+.++. ++++++||.+++++++++|+++.....+.+|.+++++.+...++++.++++.+.++|++++..++|+ + T Consensus 83 ~~~~l~~~~~-npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~-~ 160 (576) T protein:vir:96 83 LHDVLKQFGN-NPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDID-R 160 (576) T ss_pred hHHHHHHhhc-CHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCc-c Confidence 6778888886 5789999999999999999999999999999999999999999999999999999999998766666 4 Q ss_pred CCHHHHHHHHHHHHHhcCCeeEEEEEC--CCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccc Q lcl|NC_020081. 157 DNFRSFVKKLVRDRLTYDKINFELVYD--KLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKE 234 (552) Q Consensus 157 ~t~~~f~~~~v~d~ll~Gna~~~i~r~--~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~e 234 (552) +|+++||+.++.+++++||+|++++++ ..|+|++||||+|.+|++..+++|..+.. ..+|++..++.....|+++| T Consensus 161 ~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~~~~~--~~~~~~~~~~~~~~~~~~~d 238 (576) T protein:vir:96 161 DSFQSFCRKIVRDTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGKIIKG--GKRFVQVINKKVVASFTSRE 238 (576) T ss_pred ccHHHHHHHHHHHHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCceeee--eeEEEEecCCceEEEecccc Confidence 899999999999999999999999865 46789999999999999999999876543 45678888888889999999 Q ss_pred eeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc Q lcl|NC_020081. 235 MAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGI 314 (552) Q Consensus 235 vi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~ 314 (552) ||||++++..+...++||+|||.+++.+|..+.++++|+.++|+||++|+|||+++++..+++++++++++.|++.++|. T Consensus 239 ii~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~ 318 (576) T protein:vir:96 239 MAMGIRNPRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGI 318 (576) T ss_pred eEEEeecCCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc Confidence 99999998887777899999999999999999999999999999999999999999887889999999999999999999 Q ss_pred cccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccc-cccccccchhHHHHHHHHH Q lcl|NC_020081. 315 NGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATG-HSGNTLNEGSSAEKYRNSK 393 (552) Q Consensus 315 ~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~-~~~~~~~~~n~e~~~~~~~ 393 (552) .|+|++|+|+++|++|+++++++.|+||+|++++++++||++|||||++||+.+.++.|+ ..+++.+|+|++++.+.|+ T Consensus 319 ~nag~~p~vl~~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~ 398 (576) T protein:vir:96 319 NGSWQVPVVMADDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQ 398 (576) T ss_pred cccccceeecCCCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHHHH Confidence 999999888889999999999999999999999999999999999999999999988766 4566789999999999999 Q ss_pred HHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccc Q lcl|NC_020081. 394 DKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHV 473 (552) Q Consensus 394 ~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~ 473 (552) +.||.||+++||++||++|++.++.+++|+|+++|.+++++.+++...+.+|+||+||+|+++||||+||||+++.++++ T Consensus 399 ~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~f~r~d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~~~ 478 (576) T protein:vir:96 399 NKGLQPLLRFIEDLINTHIISEYSDKYVFQFVGGDTKSELDKIKILQEEVKTYKTVNEARKEKGLKPIEGGDVLLDGSFI 478 (576) T ss_pred HHHHHHHHHHHHHHHHhhhchhccCceEEEeccCCHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCCCCcceecccccc Confidence 99999999999999999999999989999999999999999988877777899999999999999999999999999999 Q ss_pred cchhhhccccccccccCCCCCccC--------cccCCCCCCCCCCCCCCC-----cccccCCCCcccccccccccc---- Q lcl|NC_020081. 474 QRLGQIMQQEQVEYQRQMDANQFL--------AQQTGYDGNMDNVNGKDS-----FNQNVGKDGQSKQQANTNSTP---- 536 (552) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~---- 536 (552) ++++........+...+....+.. ...+....+.+++++++. ...++++|++.|.++|++|+| T Consensus 479 ~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 558 (576) T protein:vir:96 479 QSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQQESTEDKVDGRESNDPTKIDSPVGTDGQLKDQDNVKSQEGSNK 558 (576) T ss_pred ccccccccCCCCCCccccccccccccccCCCCCCCCCCCCCCCcccccccccCCCCCCccccccccCCCCcccccccccc Confidence 888765544333333222111100 000111111122222211 224588999999999999999 Q ss_pred -ccCccccccccccccC Q lcl|NC_020081. 537 -QGGKDDNGNVVNDWEA 552 (552) Q Consensus 537 -~~~~~~~~~~~~~~~~ 552 (552) +|+++.+.+|+|||+. T Consensus 559 ~~~~~~~~~~~~~~~~~ 575 (576) T protein:vir:96 559 GQGTKGKGNEKPSDFKN 575 (576) T ss_pred cccccccCCCCcccccC Confidence 7788888899999999 No 4 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=100.00 E-value=9.6e-117 Score=656.62 Aligned_cols=548 Identities=44% Similarity=0.749 Sum_probs=472.0 Q ss_pred CCCCCCCcc-cccchhhc----ccccCcccccccccchhhhhc-cccccccccccccccccccccccCCccccccc-CCC Q lcl|NC_020081. 1 MGLLDGFFK-GRKQQDNI----IDINDDMAVRIKQIEEDAILK-KGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPS-IHG 73 (552) Q Consensus 1 ~~~~~~~~~-~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 73 (552) |.-+=.++| |++|+.|- ++|++++++.++++++...+. .+.|++.++++|+++|+...++.+++|+.+|+ +.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 80 (563) T protein:vir:99 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKN 80 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCC Confidence 554445776 78998854 788999999999999987754 56788999999999999999999999999987 555 Q ss_pred CchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCC Q lcl|NC_020081. 74 KQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDND 153 (552) Q Consensus 74 ~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~p 153 (552) ++.+...|+.++. ++|+++||.+++++++++|++......+++|.+++++.+...++++.+++++++++|.+++...+| T Consensus 81 ~~~l~~~l~~~~~-n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p 159 (563) T protein:vir:99 81 EHNLHDVLKKFGN-NPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDV 159 (563) T ss_pred cccHHHHHHHhhc-chHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCC Confidence 5566788998886 588999999999999999999999999999999999999999999999999999999998765555 Q ss_pred CccCCHHHHHHHHHHHHHhcCCeeEEEE--ECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEc Q lcl|NC_020081. 154 FTRDNFRSFVKKLVRDRLTYDKINFELV--YDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFK 231 (552) Q Consensus 154 n~~~t~~~f~~~~v~d~ll~Gna~~~i~--r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~ 231 (552) + ++|+++|+++++.+++++||+|++++ |+..|+|++||||+|++|++..+.+|..+. ...+|++...+.....|. T Consensus 160 ~-~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~--~~~~y~~~~~g~~~~~~~ 236 (563) T protein:vir:99 160 D-RDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIK--GGKRFVQVVDKRVVASFT 236 (563) T ss_pred C-cchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceec--cceeEEEEeCCceeEEec Confidence 4 47999999999999999999999876 788899999999999999999999987654 345678888888888999 Q ss_pred ccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_020081. 232 AKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMF 311 (552) Q Consensus 232 ~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~ 311 (552) ++|+|||++++..+...++||+|||.+++.+|..+.++++|+.++|+||++|+|||+++++..+++++++++++.|++.+ T Consensus 237 ~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~ 316 (563) T protein:vir:99 237 SRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSL 316 (563) T ss_pred CcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHh Confidence 99999999999888888899999999999999999999999999999999999999999887889999999999999999 Q ss_pred ccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhccccccccccc-ccccccchhHHHHHH Q lcl|NC_020081. 312 SGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGH-SGNTLNEGSSAEKYR 390 (552) Q Consensus 312 ~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~-~~~~~~~~n~e~~~~ 390 (552) +|..|+|++|+|+++|++|+++++++.|+||++++++++++||++|||||++||+.+++++++. .+++.+++|++++.+ T Consensus 317 ~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~ 396 (563) T protein:vir:99 317 SGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQ 396 (563) T ss_pred ccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHH Confidence 9999999998888999999999999999999999999999999999999999999999887654 466779999999999 Q ss_pred HHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeecc Q lcl|NC_020081. 391 NSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAG 470 (552) Q Consensus 391 ~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~ 470 (552) .|++.||.||++.||++||++|+++++.+++|+|+++|.+++++++++...+.+|+||+||+|+++||||+||||+++++ T Consensus 397 ~f~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~f~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~ 476 (563) T protein:vir:99 397 QSQNKGLQPLLRFIEDLVNRHIISEYGDKYTFQFVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDA 476 (563) T ss_pred HHHHHHHHHHHHHHHHHHHhhhchhcccccEEEeccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeecc Confidence 99999999999999999999999999999999999999999999988777777899999999999999999999999999 Q ss_pred ccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCC---CCCCcccccCCCCccccccccccccccCcccc--cc Q lcl|NC_020081. 471 VHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVN---GKDSFNQNVGKDGQSKQQANTNSTPQGGKDDN--GN 545 (552) Q Consensus 471 ~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 545 (552) +++++++........+...+....+........+.+...+. ...+..+..+++++.+++++..|...|+..++ |+ T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 556 (563) T protein:vir:99 477 SFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKGQGRKGE 556 (563) T ss_pred cccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCccccccccCc Confidence 99999887766555554444433332221111111111111 11123456667888888888888877655554 88 Q ss_pred ccccccC Q lcl|NC_020081. 546 VVNDWEA 552 (552) Q Consensus 546 ~~~~~~~ 552 (552) |++||+. T Consensus 557 ~~~~~~~ 563 (563) T protein:vir:99 557 KSSDFKH 563 (563) T ss_pred CcccccC Confidence 9999999 No 5 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=100.00 E-value=9.6e-117 Score=656.62 Aligned_cols=548 Identities=44% Similarity=0.749 Sum_probs=472.0 Q ss_pred CCCCCCCcc-cccchhhc----ccccCcccccccccchhhhhc-cccccccccccccccccccccccCCccccccc-CCC Q lcl|NC_020081. 1 MGLLDGFFK-GRKQQDNI----IDINDDMAVRIKQIEEDAILK-KGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPS-IHG 73 (552) Q Consensus 1 ~~~~~~~~~-~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 73 (552) |.-+=.++| |++|+.|- ++|++++++.++++++...+. .+.|++.++++|+++|+...++.+++|+.+|+ +.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 80 (563) T protein:vir:95 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKN 80 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCC Confidence 554445776 78998854 788999999999999987754 56788999999999999999999999999987 555 Q ss_pred CchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCC Q lcl|NC_020081. 74 KQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDND 153 (552) Q Consensus 74 ~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~p 153 (552) ++.+...|+.++. ++|+++||.+++++++++|++......+++|.+++++.+...++++.+++++++++|.+++...+| T Consensus 81 ~~~l~~~l~~~~~-n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p 159 (563) T protein:vir:95 81 EHNLHDVLKKFGN-NPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDV 159 (563) T ss_pred cccHHHHHHHhhc-chHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCC Confidence 5566788998886 588999999999999999999999999999999999999999999999999999999998765555 Q ss_pred CccCCHHHHHHHHHHHHHhcCCeeEEEE--ECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEc Q lcl|NC_020081. 154 FTRDNFRSFVKKLVRDRLTYDKINFELV--YDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFK 231 (552) Q Consensus 154 n~~~t~~~f~~~~v~d~ll~Gna~~~i~--r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~ 231 (552) + ++|+++|+++++.+++++||+|++++ |+..|+|++||||+|++|++..+.+|..+. ...+|++...+.....|. T Consensus 160 ~-~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~--~~~~y~~~~~g~~~~~~~ 236 (563) T protein:vir:95 160 D-RDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIK--GGKRFVQVVDKRVVASFT 236 (563) T ss_pred C-cchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceec--cceeEEEEeCCceeEEec Confidence 4 47999999999999999999999876 788899999999999999999999987654 345678888888888999 Q ss_pred ccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_020081. 232 AKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMF 311 (552) Q Consensus 232 ~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~ 311 (552) ++|+|||++++..+...++||+|||.+++.+|..+.++++|+.++|+||++|+|||+++++..+++++++++++.|++.+ T Consensus 237 ~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~ 316 (563) T protein:vir:95 237 SRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSL 316 (563) T ss_pred CcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHh Confidence 99999999999888888899999999999999999999999999999999999999999887889999999999999999 Q ss_pred ccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhccccccccccc-ccccccchhHHHHHH Q lcl|NC_020081. 312 SGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGH-SGNTLNEGSSAEKYR 390 (552) Q Consensus 312 ~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~-~~~~~~~~n~e~~~~ 390 (552) +|..|+|++|+|+++|++|+++++++.|+||++++++++++||++|||||++||+.+++++++. .+++.+++|++++.+ T Consensus 317 ~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~ 396 (563) T protein:vir:95 317 SGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQ 396 (563) T ss_pred ccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHH Confidence 9999999998888999999999999999999999999999999999999999999999887654 466779999999999 Q ss_pred HHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeecc Q lcl|NC_020081. 391 NSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAG 470 (552) Q Consensus 391 ~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~ 470 (552) .|++.||.||++.||++||++|+++++.+++|+|+++|.+++++++++...+.+|+||+||+|+++||||+||||+++++ T Consensus 397 ~f~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~f~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~ 476 (563) T protein:vir:95 397 QSQNKGLQPLLRFIEDLVNRHIISEYGDKYTFQFVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDA 476 (563) T ss_pred HHHHHHHHHHHHHHHHHHHhhhchhcccccEEEeccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeecc Confidence 99999999999999999999999999999999999999999999988777777899999999999999999999999999 Q ss_pred ccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCC---CCCCcccccCCCCccccccccccccccCcccc--cc Q lcl|NC_020081. 471 VHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVN---GKDSFNQNVGKDGQSKQQANTNSTPQGGKDDN--GN 545 (552) Q Consensus 471 ~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 545 (552) +++++++........+...+....+........+.+...+. ...+..+..+++++.+++++..|...|+..++ |+ T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 556 (563) T protein:vir:95 477 SFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKGQGRKGE 556 (563) T ss_pred cccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCccccccccCc Confidence 99999887766555554444433332221111111111111 11123456667888888888888877655554 88 Q ss_pred ccccccC Q lcl|NC_020081. 546 VVNDWEA 552 (552) Q Consensus 546 ~~~~~~~ 552 (552) |++||+. T Consensus 557 ~~~~~~~ 563 (563) T protein:vir:95 557 KSSDFKH 563 (563) T ss_pred CcccccC Confidence 9999999 No 6 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=100.00 E-value=2.5e-114 Score=643.39 Aligned_cols=536 Identities=52% Similarity=0.865 Sum_probs=447.9 Q ss_pred CCCCCC----CcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCch Q lcl|NC_020081. 1 MGLLDG----FFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQN 76 (552) Q Consensus 1 ~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (552) +|+-.. ++.-.+|++++-...+..- +.+.-.+ ..+.++...+.+++.+++.+.++++++|+.+|+..++.. T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (574) T protein:vir:80 9 LGIEKSSIEETRNMENYKMHLREIDTNVV-NNEPYSM----ESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRNSQD 83 (574) T ss_pred hccchhhHHHHHhhhhhccccchhhhhhh-hccCCCH----HHHHHhHhhhcccccchhhhhccccccccCcCccCCccc Confidence 222110 1111223222211111111 0011111 124455666779999999999999999999999998888 Q ss_pred HHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCcc Q lcl|NC_020081. 77 LLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTR 156 (552) Q Consensus 77 ~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~ 156 (552) +...|+.++. ++|+++||.+++++|++||+++..++++++|.|++++.+.+.+.++.++.|++..+|..++..++|+. T Consensus 84 ~~~~l~~~~~-~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~- 161 (574) T protein:vir:80 84 LHKTLKKFGN-NIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNR- 161 (574) T ss_pred HHHHHHhhcc-ChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCcc- Confidence 8888998885 58899999999999999999999999999999999999988888889999999999998775555553 Q ss_pred CCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEccccee Q lcl|NC_020081. 157 DNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMA 236 (552) Q Consensus 157 ~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi 236 (552) +|+.+|++.++.+++++||+|++++|+..|+|++||||+|.+|++..+.+|.. ...+.+|+++.++.....|+++||| T Consensus 162 ~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~~--~~~~~~y~~~~~g~~~~~~~~~eii 239 (574) T protein:vir:80 162 DNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGKL--IKNGERFVQVIDNRIVAKFNERELA 239 (574) T ss_pred ccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccc--ccCceEEEEEeCCceEEEEccccEE Confidence 58899999999999999999999999999999999999999999999888743 3445678888899999999999999 Q ss_pred eecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccc Q lcl|NC_020081. 237 WEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGING 316 (552) Q Consensus 237 ~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~n 316 (552) |+++++.++..+++||+|||.+++.+|..++++++|+.++|+||++|+|||+++++..+++++++++++.|++.++|..| T Consensus 240 h~~~~~~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n 319 (574) T protein:vir:80 240 FAVRNPRADIEVGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGING 319 (574) T ss_pred EEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccc Confidence 99999988888899999999999999999999999999999999999999999887778999999999999999999999 Q ss_pred cccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHH Q lcl|NC_020081. 317 AWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKG 396 (552) Q Consensus 317 agk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~ 396 (552) +|++|||+++|++|+++++++.|+||+|++++++++||++|||||++||+.+++|+|++++++.+|+|++++.+.|++.| T Consensus 320 ~g~~~vl~~~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~t 399 (574) T protein:vir:80 320 SWQIPVVSAEDVKFVNMTPSANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKG 399 (574) T ss_pred cccceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHH Confidence 99999998899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccch Q lcl|NC_020081. 397 LEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRL 476 (552) Q Consensus 397 l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~ 476 (552) |+||+++||++||++|+++++.+++|+|+++|..++.+++++.....+|+||+||+|+++||||+||||++++++|++++ T Consensus 400 L~P~~~~ie~~ln~~Ll~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~ 479 (574) T protein:vir:80 400 LQPLLRFIEDTVNTYIVAEFGEKYQFQFRGGDLSAQLDKLKIIEQEGKVFRTVNEIRHDKGLEPIKGGDVILNGVHIQAI 479 (574) T ss_pred HHHHHHHHHHHHHhhhhhhcCCceEEEecccchhhHHHHHHHHHHHhCCccCHHHHHHHhCCCCCCCCCEeeeccceeec Confidence 99999999999999999999999999999999999999888877777899999999999999999999999999999999 Q ss_pred hhhccccccccccCCCCC------------ccCcccCCCCCCCCC---------CCCC-----CCcccccCCCCcccccc Q lcl|NC_020081. 477 GQIMQQEQVEYQRQMDAN------------QFLAQQTGYDGNMDN---------VNGK-----DSFNQNVGKDGQSKQQA 530 (552) Q Consensus 477 ~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~---------~~~~-----~~~~~~~~~~~~~~~~~ 530 (552) +...+....+...+.... .+..+.+...+..+. .+|+ ..+.+.+++|++.|+++ T Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 559 (574) T protein:vir:80 480 GQALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVSFQDEQQGLNGKSKKVNGKVDDNVGKDGQLKSEE 559 (574) T ss_pred ccccccccCCccchhccccccccccCCCCCCCCCCCCCCccccccchhhhhhhhhccchhhhcCCccccccccccccccc Confidence 877655444332221111 111111100000000 0111 11345788999999999 Q ss_pred ccccccccCcccccc Q lcl|NC_020081. 531 NTNSTPQGGKDDNGN 545 (552) Q Consensus 531 ~~~~~~~~~~~~~~~ 545 (552) |++|+++|+++++.+ T Consensus 560 ~~~~~~~~~~~~~~~ 574 (574) T protein:vir:80 560 NTNSTKHGTDGIKKE 574 (574) T ss_pred ccccccccCccccCC Confidence 999999999999988 No 7 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=100.00 E-value=9e-95 Score=536.18 Aligned_cols=518 Identities=30% Similarity=0.456 Sum_probs=380.1 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccc--cccccCC-cccccccCCCCchH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPII--GSMSMNP-DFKEAPSIHGKQNL 77 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~-~~~~~~~~~~~~~~ 77 (552) |.++. -.|.-.- |--+........++-.++.|.+++.+.++... +... ..+..+| +|+.+++..... . T Consensus 1 ~~~~~-~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~g~~~~~~~~~~~-~ 71 (535) T protein:vir:10 1 MAILK-DLRNAFS---LSNKKSTSYIELGDYDKDIVNKAIRPGRASAR----DTVDGIDIADGNVAGQYSVASISDVL-S 71 (535) T ss_pred ChhhH-HHHHHHH---hhhhhhhhhHHHhhhhHHHHHhhhhhhhhhhh----ccccccccccCCcccccccCcccccc-C Confidence 55443 1111110 11112233345567788888888766654432 2222 3456777 577777766554 5 Q ss_pred HHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccC Q lcl|NC_020081. 78 LQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRD 157 (552) Q Consensus 78 ~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~ 157 (552) ++.|++.+.+++++++||.++++.++++|++...+..+.++.+++++.+...+.++.++.|++.++|.. .||++| T Consensus 72 ~~~l~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~-----~PN~~~ 146 (535) T protein:vir:10 72 TKKLLKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYN-----TGSEYY 146 (535) T ss_pred HHHHHHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHh-----CCCCCC Confidence 666777777888999999999999999999999999999999999999988888888899999888764 466777 Q ss_pred CHH----HHHHHHHHHHHhcC-CeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcc Q lcl|NC_020081. 158 NFR----SFVKKLVRDRLTYD-KINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKA 232 (552) Q Consensus 158 t~~----~f~~~~v~d~ll~G-na~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~ 232 (552) +++ +|+++++.++|++| ++|++|+|+..|+|++||||+|.+|++..+.++.. ...+|+++.++.....|++ T Consensus 147 ~~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V~v~~d~~~~~----~~~~~~~~~~~~~~~~~~~ 222 (535) T protein:vir:10 147 EWRDTFPRLLTKIINDMYVQDQINIERIFKNDSNELDHFNAVDASKVVISYSPRSKD----QPRKFEQFVSETKSVKFSE 222 (535) T ss_pred ChhHHHHHHHHHHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCceeEEEEcCcccc----CceEEEEEecCceeEEECc Confidence 654 57788888877765 78999999999999999999999999998877642 3456777778888889999 Q ss_pred cceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC--CCCCHHHHHHHHHHHHHH Q lcl|NC_020081. 233 KEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTG--QEQSNQALTSFRREWTSM 310 (552) Q Consensus 233 ~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~--~~~s~~~~~~~~~~~~~~ 310 (552) +||||+++++..+..+++||+|||.++..+|..+.++++|+.++|+||++|+|||++++. ..++++++++|+++|++. T Consensus 223 ~eiih~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~ 302 (535) T protein:vir:10 223 RNLTFINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQWTSQ 302 (535) T ss_pred ccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHH Confidence 999999998887888889999999999999999999999999999999999999999863 457999999999999999 Q ss_pred hccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhccccccccccccc--ccccchhHHHH Q lcl|NC_020081. 311 FSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSG--NTLNEGSSAEK 388 (552) Q Consensus 311 ~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~--~~~~~~n~e~~ 388 (552) ++|..|+|++|||+++|++|+++++++.|+||+|++++++++||++|||||++||+.+++|+++... ...+.++++++ T Consensus 303 ~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~ 382 (535) T protein:vir:10 303 GSGLGGAWKIPILAAKDAKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAK 382 (535) T ss_pred hcCcccccccccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHHH Confidence 9999999999999988999999999999999999999999999999999999999999998876543 34567789999 Q ss_pred HHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeee Q lcl|NC_020081. 389 YRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTL 468 (552) Q Consensus 389 ~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~ 468 (552) ...|++.||.||++.||++||++|++.++.+++|+|+..+..+..+.+++++...+|+||+||+|+++||||+||||+++ T Consensus 383 ~~~~~~~~L~P~l~~ie~~ln~~Ll~~~~~~~~f~f~~l~~~d~~~r~~~~~~~~~g~lT~NE~R~~~gl~piegGD~~~ 462 (535) T protein:vir:10 383 LESSKDKGLTPLLSFIEQVINDKIMRYVDTDYRFSFTLGDAQDKLQEEQVWKLKLANGYFINEYRKDHGLKTVDGLDVPG 462 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccccCCeEEEEeccccccCHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCccccc Confidence 99999999999999999999999999888889998865444333333444445567899999999999999999999876 Q ss_pred cccc---ccchhhhccccccccccCCCCCccCcccCCC--CCCCCCCCCCCCcccccCCCCccccccccccccccCcccc Q lcl|NC_020081. 469 AGVH---VQRLGQIMQQEQVEYQRQMDANQFLAQQTGY--DGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQGGKDDN 543 (552) Q Consensus 469 ~~~n---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 543 (552) +... +.......+ ...+...+....+....+++. +...+.+-|+++ ++.+..|..++ ++-...++ T Consensus 463 ~~~~~~~~~~~~~~~~-~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~-----~~~~~~~~~~~----~~~~~~~~ 532 (535) T protein:vir:10 463 FIGSAENFINATGFGQ-PNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDD-----PKSPLPKPSES----DDVSNNED 532 (535) T ss_pred cccchhhccccccccc-ccCCCCCCCccccCCccccCcccccccccccCCCC-----CCCCCCcCCCC----Cccccccc Confidence 5332 211111000 000000010111111110000 000000111111 11111111111 11111122 Q ss_pred ccc Q lcl|NC_020081. 544 GNV 546 (552) Q Consensus 544 ~~~ 546 (552) +.+ T Consensus 533 ~~~ 535 (535) T protein:vir:10 533 ADT 535 (535) T ss_pred cCC Confidence 222 No 8 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=100.00 E-value=1.2e-81 Score=464.24 Aligned_cols=441 Identities=14% Similarity=0.136 Sum_probs=304.4 Q ss_pred hccccccccccccccccccccccccCCcccc------cccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKE------APSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPAR 110 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~ 110 (552) |....+. +++.+....++. ..+|.+.+.. ..+..+-..+. ..|...+.|.+||.+++.. T Consensus 1 ~~~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~~~~~~g~~~~g~~v~~----~~al~~~~V~~~v~~Ia~~--------- 65 (454) T protein:vir:93 1 MWNLLRR-TRKNQKSGRDVR-EAGWTSLFQAVAEPFAGAWQQGVKADP----EAVLSFHAVFACISLISQD--------- 65 (454) T ss_pred CCCcccc-Cccccccccccc-chhhhhhhhhhhhhhcchhhcCcccCh----HHhhccHHHHHHHHHHHHh--------- Confidence 2222211 111111111111 1122221100 00111111111 1233345577887765554 Q ss_pred hhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEE Q lcl|NC_020081. 111 NSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHN 190 (552) Q Consensus 111 ~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~ 190 (552) ++++++.+..++.++.. .....+.+ +.++ ..||++||+++||+.++.+++++||+|++|+|+..|+|++ T Consensus 66 --iA~lp~~~~~~~~~g~~---~~~~~~~~----~~L~--~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~ 134 (454) T protein:vir:93 66 --IAKMRLRLMQTDAQGIR---RETRRGDI----ARLC--RRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKE 134 (454) T ss_pred --hccCceEEEEeccCCcc---chhhhHHH----HHHH--hcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEE Confidence 45667777655444321 11222333 2222 3788999999999999999999999999999999999999 Q ss_pred EEEecCceeEEEECCCcccccccceeEEEEEcC----CceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHH Q lcl|NC_020081. 191 FKAVDASTVYVAVDEDGKERKAKDGVRYVQVID----DKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYH 266 (552) Q Consensus 191 L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~----~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~ 266 (552) ||||+|++|++..+++|.++ |..... ......|+++||||+++++ +.++++|+||+..+..+|.++ T Consensus 135 L~~i~~~~v~v~~~~~g~~~-------y~~~~~~~~~~~~~~~~~~~eViH~k~~~---~~~~~~G~sp~~~~~~~i~~~ 204 (454) T protein:vir:93 135 LRILDWNRVEPLVADDGEVF-------YRITPDRNCGITEAVTVPAREVIHDRFNC---FFHPLIGLPPVYAAGLAATQG 204 (454) T ss_pred EEEEcCcceEEEEcCCCcEE-------EEEEeccccccceeEEecCcceEEeccCC---CCCCceeccHHHHHHHHHHHH Confidence 99999999999998888543 222222 2345679999999998754 456789999999999999999 Q ss_pred HHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHH Q lcl|NC_020081. 267 DNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWL 346 (552) Q Consensus 267 ~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~ 346 (552) .++++++.++|+||++|+|||++++ .++++++++++++|++.++| .|+|+++|| ++|++|+++++++.|+||+|++ T Consensus 205 ~~~~~~~~~~f~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g-~n~g~~~vl-~~g~~~~~l~~~~~d~q~le~~ 280 (454) T protein:vir:93 205 HHIQENSTSFFRNGGRPSGVIEIPG--SITEENAKKLKSNWDSGYTG-ENAGKTAIL-SNGAKYNPTTFSPVDSQTVEQL 280 (454) T ss_pred HHHHHHHHHHHhccCCccEEEecCC--CCCHHHHHHHHHHHHHHhcc-cccCCceec-cCCceEEEcccChhHHHHHHHH Confidence 9999999999999999999999876 46899999999999999988 789998766 5799999999999999999999 Q ss_pred HHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecc-- Q lcl|NC_020081. 347 NYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNF-- 424 (552) Q Consensus 347 ~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f-- 424 (552) ++++++||++|||||++||+.+.+ +++|++++.+.|++.||.||++.||++||++|++..+..++|.+ T Consensus 281 ~~~~~~Ia~~fgVPp~~lg~~~~~----------t~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~L~~~~~~~~~f~~~~ 350 (454) T protein:vir:93 281 KMTAEIVCSVFRVPAYKIGVGQPP----------SSDNVEALEQQYYSQCLQTLIESIELLLDEALETGENESTEFDVTT 350 (454) T ss_pred HHHHHHHHHHhCCCHHHcCCCCCC----------cchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCcEEEeechh Confidence 999999999999999999987654 47899999999999999999999999999999987665555543 Q ss_pred -cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCC Q lcl|NC_020081. 425 -VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGY 503 (552) Q Consensus 425 -~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (552) +++|.+++++.+..+ +.+|+||+||+|+++||||+||||+++++.+..+++...+++..+.+......+. T Consensus 351 ll~~D~~~r~~~~~~~--~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------- 421 (454) T protein:vir:93 351 LLRMDSERRMKTLGDA--VKNTLLTPNEARKRENLPPLAGGDALYLQQQNYSLEALSRRDAREDPFASSGKTA------- 421 (454) T ss_pred hhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCeeeeccCccchHhhhccCcccCCCCCCccCC------- Confidence 578888888776654 4579999999999999999999999999999988876654332221111111000 Q ss_pred CCCCCCCCCCCCcccccCCCCccccccccccccccCccccccccccccC Q lcl|NC_020081. 504 DGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQGGKDDNGNVVNDWEA 552 (552) Q Consensus 504 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (552) ....+....+. +++..+.+ .+.+.+. --.+|+. T Consensus 422 --~~~~~~~~~d~----~~~~~e~~--~d~~~~~--------~~~~~~~ 454 (454) T protein:vir:93 422 --SVPQAVAASDG----NKAITETE--HDAVKAM--------FRGILKK 454 (454) T ss_pred --CCCCCCCCCCC----CCCccCCc--cchhhhh--------hhhhhcC Confidence 00000000000 00101100 0111111 1111111 No 9 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=100.00 E-value=1e-81 Score=464.58 Aligned_cols=415 Identities=18% Similarity=0.197 Sum_probs=311.8 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||++++-|..+.= . .+.+.. ......-+......| ..+...+...... T Consensus 1 MG~f~~lf~~~~~---------~--------~~~~~~---------~~~~~~~~~~~~~~~-~~~g~~~~~~v~~----- 48 (422) T protein:vir:13 1 MGFLRGLFNKKNN---------N--------DEKRSN---------YDEDIGIDISDSNFW-EKFGIKLNFSVRG----- 48 (422) T ss_pred CchhhhhhhccCC---------c--------cchhhh---------hhhccccccCcchhh-hhccccCCcccch----- Confidence 9998854432211 0 000000 000000000000001 1111111111111 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) .-|...+++.+||.+++..++ ++++.+..... ..+.|++..+|. ..||++||++ T Consensus 49 --~~al~~~~v~~ci~~ia~~iA-----------~lp~~~~~~~~--------~~~~~~~~~lL~-----~~PN~~~t~~ 102 (422) T protein:vir:13 49 --KRALKENTVYVCTKIRAESIG-----------KLSLKIYKDKE--------EYKEHELYYLLR-----YKPNPLMSSI 102 (422) T ss_pred --hhhhccHHHHHHHHHHHHhhh-----------hCceEEEecCc--------ccccchHHHHHh-----hhcccCCCHH Confidence 112234567888877666654 45565532211 112344554443 3688999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecc Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS 240 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~ 240 (552) +||+.++.+++++||+|++|+|+..|+|++|+||+|++|++..+++|.... ....+|++...++....|.++||||++. T Consensus 103 ~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~~~~-~~~~~y~~~~~~g~~~~~~~~eiih~~~ 181 (422) T protein:vir:13 103 NFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNFLSS-LSKVWYVVTDKNGKEHKLLPDEMLHFIG 181 (422) T ss_pred HHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcceec-cceEEEEEEeCCCeEEEEcccceEEEcC Confidence 999999999999999999999999999999999999999999998886432 3345677777777888999999999986 Q ss_pred cccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccc Q lcl|NC_020081. 241 NPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKI 320 (552) Q Consensus 241 ~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~ 320 (552) ++ +.++++|+||+..+..+|..+.++++++.++|+||++|+|||++++ .+++++++++++.|++.++|.+|+|++ T Consensus 182 ~~---~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~ 256 (422) T protein:vir:13 182 DI---TLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVG--DLDEKAKKIFKKEFESMSNGLENAHSI 256 (422) T ss_pred CC---CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC--CCCHHHHHHHHHHHHHHhcCccccCCc Confidence 53 4567999999999999999999999999999999999999999976 468999999999999999999999998 Q ss_pred eeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHH Q lcl|NC_020081. 321 PVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPL 400 (552) Q Consensus 321 ~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~ 400 (552) +|+ ++|++|+++++++.|+||+|++++++++||++|||||++||..+.+ +++|++++.+.|++.||.|| T Consensus 257 ~vl-~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~----------~~sn~e~~~~~f~~~~l~P~ 325 (422) T protein:vir:13 257 SLL-PFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERA----------TFNNLTEQQKDFYVTTLQSS 325 (422) T ss_pred eec-CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHH Confidence 665 5799999999999999999999999999999999999999976654 47899999999999999999 Q ss_pred HHHHHHHHHhhcCcccc--cceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccc Q lcl|NC_020081. 401 LKFIEDAVNKYIVSQFG--GDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHV 473 (552) Q Consensus 401 ~~~ie~~ln~~L~~~~~--~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~ 473 (552) +++||++||++|+++.+ .+++++| +++|.+++++.++.+ +.+|+||+||+|+++||||+||||++++++|+ T Consensus 326 ~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~--~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~ 403 (422) T protein:vir:13 326 LTVYEQEIQDKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIG--IQGGFIEANEARRRENLPPVEGGDRLLVNGNM 403 (422) T ss_pred HHHHHHHHHHhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcCeeeeccCc Confidence 99999999999998764 3566665 466788887776643 45799999999999999999999999999999 Q ss_pred cchhhhccccccccccCCC Q lcl|NC_020081. 474 QRLGQIMQQEQVEYQRQMD 492 (552) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~ 492 (552) ++++.+.+.+....++++. T Consensus 404 ~~l~~~~~~~~~~g~~~g~ 422 (422) T protein:vir:13 404 IPIEMAGEQYKKGGEKGGK 422 (422) T ss_pred cchhhcccccccCCCcCCC Confidence 9998765432111111111 No 10 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=100.00 E-value=2.2e-81 Score=462.76 Aligned_cols=426 Identities=15% Similarity=0.125 Sum_probs=307.4 Q ss_pred ccccccccchhhhhcc--ccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKK--GKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQ 101 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~ 101 (552) |.....++-....... ......++....+. ...|.. +.-.++..+...+. .-|...+.+.+||.+++.. T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~-~~g~~~~~g~~v~~----~~al~~~~V~~~i~~ia~~ 71 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSSLFGWGGKTIRLTD----GAFWSQ-FLGRESSSGKKVTV----DKAMKLSAVWACVRLISTS 71 (434) T ss_pred CccchhhhhhhcccccchhhhcccccccccCc----hHHHHH-HhcCCccCCceech----hhhhccHHHHHHHHHHHHh Confidence 2222111111100000 00000001100000 000110 11111112111111 1122345567787765555 Q ss_pred HHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEE Q lcl|NC_020081. 102 VSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELV 181 (552) Q Consensus 102 ~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~ 181 (552) + +.+++.+..++.++.. .....|++..+|. .+||++||+++||+.++.+++++||+|++|. T Consensus 72 i-----------a~lp~~~~~~~~~g~~---~~~~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~ 132 (434) T protein:vir:43 72 V-----------AGLPLGVYERKADGSR---VDARSFPLYDVVH-----NSPNDDMTAFQFWQAMVASMLLWGNAYAEIR 132 (434) T ss_pred h-----------hhCceEEEEEcCCCcc---ccccccHHHHHHh-----ccCCCCCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 4 4567777655544321 2223455555443 3689999999999999999999999999998 Q ss_pred ECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHH Q lcl|NC_020081. 182 YDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALN 261 (552) Q Consensus 182 r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~ 261 (552) ++ .|+|++||||+|++|++..+.+|.. +|+++..++..+.|+++||||++.+ +.++++|+||+..++. T Consensus 133 ~~-~G~~~~L~~l~p~~v~~~~~~~g~~-------~y~~~~~~g~~~~~~~~eVih~~~~----~~dg~~G~spi~~~~~ 200 (434) T protein:vir:43 133 RA-AGRPAALDFLLPSRVDLECDENGRL-------KYFYTTKKGARREIERTNMLHIPAF----TLDGRIGLSAIRYGVD 200 (434) T ss_pred eC-CCcEEEEEEEcCcceEEEEcCCCeE-------EEEEEecCceEEEEccccEEEecCc----CCCCccccCHHHHHHH Confidence 87 6999999999999999999888753 4555666677889999999998642 4567999999999999 Q ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHH Q lcl|NC_020081. 262 HLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDME 341 (552) Q Consensus 262 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q 341 (552) +|..+.++++++.++|+||++|+|+|++++ .+++++.+++|+.|++ +.|..|+|+++|+ ++|++|+++++++.|+| T Consensus 201 ~i~~~~~~~~~~~~~f~ng~~~~gil~~~~--~l~~e~~~~~r~~~~~-~~g~~nag~~~vl-~~g~~~~~l~~~~~d~q 276 (434) T protein:vir:43 201 VFGSVMSAEDAANGTFKNGLLPTVAFKVDR--ILQPAQREEFREYVKS-VSGAMNSGRSPVL-EQGITPETIGINPVDAQ 276 (434) T ss_pred HHHHHHHHHHHHHHHHhccCCcceEEecCC--CCCHHHHHHHHHHHHH-hcCccccCCcccc-CCCceEEEccCChhHHH Confidence 999999999999999999999999999876 4689999999999976 6788899998766 46999999999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cce Q lcl|NC_020081. 342 FEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDY 420 (552) Q Consensus 342 ~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~ 420 (552) |+|++++++++||++|||||++||+.+.++ .++++++++.+.|++.||.||+..||++||++|++..+ .++ T Consensus 277 ~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~--------~~~s~~e~~~~~f~~~~L~P~~~~ie~~ln~kL~~~~~~~~~ 348 (434) T protein:vir:43 277 LLETREHGVIEICRWFGVPPWMIGQTDKGS--------NWGTGLEQQMLAFLTFSISSITNQIQQCVNKRLLTAPERIRY 348 (434) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhCCCcCCc--------cccchHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhcCc Confidence 999999999999999999999999877654 34689999999999999999999999999999998654 346 Q ss_pred eecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCc Q lcl|NC_020081. 421 VFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQ 495 (552) Q Consensus 421 ~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~ 495 (552) +|+| +++|.+++++.+..+ +.+|+||+||+|+++||||+||||++++++|+++++.+.+.+..++...+..++ T Consensus 349 ~~~fd~~~llr~d~~~r~~~~~~~--~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~ 426 (434) T protein:vir:43 349 YAEFSLEGFLKADSAGRAAWYSTM--AQNGFMTRNEGRRKENLPELPGGDILTVQSNLVPIDQLGQSNKSQAVRAALMNW 426 (434) T ss_pred eEEEechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCeEeeccCccchhhhhccCCCcchhhhhhcc Confidence 6666 477888888877754 457999999999999999999999999999999998766544443332222222 Q ss_pred cCcccCCC Q lcl|NC_020081. 496 FLAQQTGY 503 (552) Q Consensus 496 ~~~~~~~~ 503 (552) ..+.+|.+ T Consensus 427 ~~~~~~~~ 434 (434) T protein:vir:43 427 FSQPEPQE 434 (434) T ss_pred CCCCCCCC Confidence 21111111 No 11 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=100.00 E-value=3.7e-81 Score=461.55 Aligned_cols=415 Identities=13% Similarity=0.130 Sum_probs=303.4 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhcccccccccccccccccccccccc---CCc---ccccccCCCC Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSM---NPD---FKEAPSIHGK 74 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~---~~~~~~~~~~ 74 (552) ||+++ +||++.-.. ..+.++- +... ...++.....++.-+ +|. |....+..+. T Consensus 1 Mgl~d-~~r~~~~~~--------~~~~~~~--~~~~----------~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~ 59 (431) T protein:vir:10 1 MGLFD-FIRREKQPE--------AQARPHV--EPSF----------QASTPTTSIPGETFEGLDDPRLKEYIRRGELNGG 59 (431) T ss_pred Ccchh-hhhcCcccc--------ccccccc--cccc----------ccccccccccccccccccchHHHHhhccCccCcc Confidence 99999 888754311 1111000 0000 000000000011000 000 0000111111 Q ss_pred chHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCC Q lcl|NC_020081. 75 QNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDF 154 (552) Q Consensus 75 ~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn 154 (552) ..+. .-|...+++.+||.+++..+ +++++.+..++..+ +....|++..+|. .+|| T Consensus 60 ~v~~----~~al~~~~V~~ci~~Ia~~i-----------A~lp~~v~~~~~~~-----~~~~~~~~~~lL~-----~~PN 114 (431) T protein:vir:10 60 TGRE----TRALRNMAVLRCVTLISGTI-----------GMLPMNLISSDDSK-----QVLTDDPAHRLLK-----YKPN 114 (431) T ss_pred eech----hhhhccHHHHHHHHHHHHhh-----------ccCceEEEEecCce-----eeeccchHHHHHh-----hccC Confidence 1111 11223456788877665554 45667665443222 1223355555553 3688 Q ss_pred ccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccc Q lcl|NC_020081. 155 TRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKE 234 (552) Q Consensus 155 ~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~e 234 (552) ++||+++||+.++.+++++||+|++|+|+. |.+++||||+|.+|++..+.+|.. +|++...++....|+++| T Consensus 115 ~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~L~pl~~~~v~~~~~~~~~~-------~y~~~~~~g~~~~~~~~d 186 (431) T protein:vir:10 115 DWQTPMEFKSLMQLRALLDGESMARIVWSG-NRPIRLIPMDRGSAKGRLTSTWQI-------VYDYTTPTGDKIELPARE 186 (431) T ss_pred CCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CceEEEEEEcCceeEEEEcCCCeE-------EEEEEeCCceEEEEchhh Confidence 999999999999999999999999999985 899999999999999988877643 455555566778899999 Q ss_pred eeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc Q lcl|NC_020081. 235 MAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGI 314 (552) Q Consensus 235 vi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~ 314 (552) |||++. + +.++++|+||+..+..+|.++.++++|..++|+||++|+|||++++ .+++++++++++.|++.++|. T Consensus 187 ViHir~-~---~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~ls~e~~~~~~~~~~~~~~g~ 260 (431) T protein:vir:10 187 VFHLRD-L---SIDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVPK--ELSDNAYGRMKASVQENHTGS 260 (431) T ss_pred EEEecC-c---CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCC--CCCHHHHHHHHHHHHHHhcCc Confidence 999863 3 3467999999999999999999999999999999999999999876 469999999999999999999 Q ss_pred cccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHH Q lcl|NC_020081. 315 NGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKD 394 (552) Q Consensus 315 ~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~ 394 (552) +|+|+++|+ ++|++|++++++++|+||+|++++++++||++|||||++||..+.+ +++|+|++.+.|++ T Consensus 261 ~n~g~~~vl-~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~----------t~sn~eq~~~~f~~ 329 (431) T protein:vir:10 261 ENAGSWMLL-EEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTS----------WGSGIEQLAIFFIQ 329 (431) T ss_pred cccCCceec-CCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCC----------ccccHHHHHHHHHH Confidence 999997655 5799999999999999999999999999999999999999976543 47899999999999 Q ss_pred HHhhHHHHHHHHHHHhhcCcccc-cceeecc-----cccChHHHHHHHHHHHH--HhcCCcCHHHHHHHhCCCCCCC--C Q lcl|NC_020081. 395 KGLEPLLKFIEDAVNKYIVSQFG-GDYVFNF-----VGGDAKTEAEIISILES--KAKIGLTINDIRKELGYPDTEG--G 464 (552) Q Consensus 395 ~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f-----~~~d~~~~~~~~~~~~~--~~~g~lT~NE~R~~~gl~p~~g--g 464 (552) .||.||++.||++||++|+++.+ .+++|+| +++|.+++++.++.+.. +.+|+||+||+|+++||||++| | T Consensus 330 ~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~g 409 (431) T protein:vir:10 330 YGLSHWFVSWEQAAARAFLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADDPVA 409 (431) T ss_pred HHHHHHHHHHHHHHHhhccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCCccc Confidence 99999999999999999997543 3566665 57889999988875532 3467899999999999999965 9 Q ss_pred CeeeccccccchhhhccccccccccCCCCCccCcc Q lcl|NC_020081. 465 DVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQ 499 (552) Q Consensus 465 D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (552) |++++|+|....+..++ .+..+ T Consensus 410 D~~~~p~n~~~~~~~~~-------------~p~~~ 431 (431) T protein:vir:10 410 DQLRNPMTQKQKGSGDE-------------PPATT 431 (431) T ss_pred cceecccccccCCCCCC-------------CCCCC Confidence 99999988665432111 00000 No 12 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=100.00 E-value=2.5e-80 Score=456.99 Aligned_cols=418 Identities=18% Similarity=0.173 Sum_probs=308.1 Q ss_pred ccccccccchhhhhccccccccccccccccccccc----cccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGS----MSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRV 99 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~ 99 (552) |..+ +.+.+-.. +........... ..|... .+ .+..... ..|...+.+++||..++ T Consensus 1 M~~~----------~~~f~~~~-r~~~~~~~~~~~~~~~~~~~g~---~~--~~~~v~~----~~al~~~~v~~~i~~ia 60 (429) T protein:vir:10 1 MDSV----------KKFFNFEK-RQTSQVIELNKDDEKLLEWLGI---SP--STISVKG----KNALKVATVFACIKILS 60 (429) T ss_pred Cchh----------hhhhcccc-cCcccccccCCChHHHHHHhcC---CC--Ccceech----hhhhccHHHHHHHHHHH Confidence 2222 11111010 111111111111 111110 00 0001001 12334566788887666 Q ss_pred HHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEE Q lcl|NC_020081. 100 NQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFE 179 (552) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~ 179 (552) ..+ ++++|.+..+..++. .....|++..+|. .+||+.||+++||+.++.+++++||+|++ T Consensus 61 ~~i-----------a~l~~~~~~~~~~~~----~~~~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gnay~~ 120 (429) T protein:vir:10 61 ESV-----------SKLPLKIYQEDEYGI----QRGTKHYLNNLLR-----LRPNPYMSSMNFFGSLEAQKNLYGNSYAN 120 (429) T ss_pred Hhh-----------ccCceEEEEecCCce----eeccccHHHHHHH-----hhccCCCCHHHHHHHHHHHHhhcCCeEEE Confidence 554 456777655543332 1223355555543 26889999999999999999999999999 Q ss_pred EEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHH Q lcl|NC_020081. 180 LVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIA 259 (552) Q Consensus 180 i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~ 259 (552) |+|+..|+|++||||+|++|++..+++|..... ...|+++..++....|+++||||++++. +.++++|+||+..+ T Consensus 121 i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~~--~~~~~~~~~~g~~~~~~~~evih~~~~~---~~~~~~G~s~i~~~ 195 (429) T protein:vir:10 121 IEFDRKGKVQALWPIDASKVTVYIDDVGLLNSK--TKMWYVVNTGGQQRVLKPEEILHFKNGI---TLDGLVGVPTMEYL 195 (429) T ss_pred EEECCCCcEEEEEEEcCceeEEEEcCccccccc--ceEEEEEccCCeEEEEccccEEEecCCC---CCCCcccccHHHHH Confidence 999999999999999999999999987754432 2334555566677889999999998653 55679999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhH Q lcl|NC_020081. 260 LNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKD 339 (552) Q Consensus 260 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d 339 (552) ..+|..+.++++++.++|+||++|+|+|++++ .+++++.+++++.|++.++|..|+|+++|+ ++|++|++++.++.| T Consensus 196 ~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl-~~g~~~~~l~~~~~d 272 (429) T protein:vir:10 196 KSTLENSASADKFINNFYKQGLQVKGLVQYVG--DLNEDAKKVFRENFESMSSGLQNSHRIALM-PVGYQFQPISLNMSD 272 (429) T ss_pred HHHHHHHHHHHHHHHHHHhccCCccEEEEcCC--CCCHHHHHHHHHHHHHHhccccccCceeec-CCCceEEEccCChhH Confidence 99999999999999999999999999999876 468999999999999999999999997665 579999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-- Q lcl|NC_020081. 340 MEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-- 417 (552) Q Consensus 340 ~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-- 417 (552) +||++++++++++||++|||||++||..+.+ +++|++++.+.|++.||.||++.||++||++|+++.+ T Consensus 273 ~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~----------~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~ 342 (429) T protein:vir:10 273 AQFLENTELTIRQIATAFGIKMHQLNDLSKA----------TLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSELD 342 (429) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhcC Confidence 9999999999999999999999999976644 4789999999999999999999999999999997654 Q ss_pred cceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCC Q lcl|NC_020081. 418 GDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMD 492 (552) Q Consensus 418 ~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~ 492 (552) .+++|+| +++|++++++.++.+ ..+|+||+||+|+++||||+||||++++++|+++++..++.+.. +++ T Consensus 343 ~g~~~~fd~~~ll~~d~~~~~~~~~~~--~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~d~~~~~~~k----~g~ 416 (429) T protein:vir:10 343 KGFYSKFNVDAILRADIKTRYEAYRTG--IQGGFLKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLK----GGD 416 (429) T ss_pred CCcEEEeechhhhcCCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcCeeeecccccchhhccccccC----CCC Confidence 4566665 377888888776654 45799999999999999999999999999999999865543221 111 Q ss_pred CCccCcccCCCCCC Q lcl|NC_020081. 493 ANQFLAQQTGYDGN 506 (552) Q Consensus 493 ~~~~~~~~~~~~~~ 506 (552) . ....++++..++ T Consensus 417 ~-~~~~~~~~~e~~ 429 (429) T protein:vir:10 417 T-NGEVSKEGNEGN 429 (429) T ss_pred C-CCCCCCCCCCCC Confidence 0 001111111111 No 13 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=100.00 E-value=8.5e-81 Score=459.55 Aligned_cols=429 Identities=14% Similarity=0.120 Sum_probs=300.2 Q ss_pred chhhhhcccccccccccccccccccccc-ccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 32 EEDAILKKGKNTKSNKPKAYEEPIIGSM-SMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPAR 110 (552) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~ 110 (552) ++....+.+.+-+.........|++..- +....+....+..+..... +-+...+.+.+||.+++..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~~~~~~~~~~~~~g~~v~~----~~al~~~~v~~ci~~Ia~~ia------- 69 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGVPISLTDGSFWSAWGGMGSSSGETVTA----DSALQLSAVWSCVRLIAETIA------- 69 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCCcccCCchhHHHhhcccccCCCceech----HhhhccHHHHHHHHHHHHHHh------- Confidence 1111111110000000011111211100 0000011111111111111 223345567888877666654 Q ss_pred hhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEE Q lcl|NC_020081. 111 NSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHN 190 (552) Q Consensus 111 ~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~ 190 (552) .+++.+..++.++.. .....|++..+|. ..||++||+++||+.++.+++++||+|++|+|+ .|++++ T Consensus 70 ----~lp~~~~~~~~~g~~---~~~~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~-~g~~~~ 136 (437) T protein:vir:10 70 ----TLPLNLYQTKPDGTR---VLAKQHRLYTVIH-----SQPNAENTAAEFWEVIVASMLLWGNGYARKLRS-AGVLIG 136 (437) T ss_pred ----hCceeEEEEcCCCce---eeccccHHHHHhh-----ccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEE Confidence 456666554443321 1223355544442 468999999999999999999999999999999 599999 Q ss_pred EEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHH Q lcl|NC_020081. 191 FKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTE 270 (552) Q Consensus 191 L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~ 270 (552) ||||+|..|++..+.+|.. +|+....++....|+++||||++. + +.++++|+||+.+++.+|..+.+++ T Consensus 137 L~~l~p~~v~i~~~~~g~~-------~y~~~~~~g~~~~~~~~dIih~r~-~---~~d~~~G~spi~~~~~~i~~~~~~~ 205 (437) T protein:vir:10 137 LELMLPQRTTVKRLTSGAL-------QYTYRNVDGTVSTLAEDDVFHVRG-F---SLDGLMGLTPIQYAREVLGNSTAAN 205 (437) T ss_pred EEEEcCcceEEEECCCCeE-------EEEEEecCceEEEEccccEEEecC-c---CCCCcccccHHHHHHHHHHHHHHHH Confidence 9999999999998877643 344444456678899999999864 2 3467999999999999999999999 Q ss_pred HHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHH Q lcl|NC_020081. 271 VFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLI 350 (552) Q Consensus 271 ~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~ 350 (552) +|+.++|+||++|+|||++++ .+++++++++++.|.+.++|..|+|+++|+ ++|++|+++++++.|+||+|++++++ T Consensus 206 ~~~~~~f~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~nag~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~ 282 (437) T protein:vir:10 206 KTSASVFRNGLRPSGVLSTDQ--ILQKEKRAEIRTDLAEQFGGAMQAGKTMVL-EAGMKYQAITMNPGDVQLLETRAFNI 282 (437) T ss_pred HHHHHHHhccCCccEEEEcCC--CCCHHHHHHHHHHHHHHhcCccccCcceec-cCCceEEeccCChhhHHHHHHHHHHH Confidence 999999999999999999875 468999999999999999999999997655 57999999999999999999999999 Q ss_pred HHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecc----- Q lcl|NC_020081. 351 NVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNF----- 424 (552) Q Consensus 351 ~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f----- 424 (552) ++||++|||||++||+.++++ .+++|++++.+.|++.||+||+..||++|+++|+++.+ ..++|+| T Consensus 283 ~~Ia~~fgVPp~~lg~~~~~t--------~~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~kll~~~e~~~~~~~fd~~~l 354 (437) T protein:vir:10 283 EEICRWYRVPPFMVGHSEKST--------SWGTGIEQQTLGFLTFTLRPWLTRIEQAARRSLLRPGERDQFYAEFSVEGL 354 (437) T ss_pred HHHHHHhCCCHHHhCCCCCcc--------cccchHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccCceEEEEechhh Confidence 999999999999999987654 35689999999999999999999999999999997644 3455555 Q ss_pred cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeee-ccccccchhhhccccccccccCCCCCccCcccCCC Q lcl|NC_020081. 425 VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTL-AGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGY 503 (552) Q Consensus 425 ~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~-~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (552) +++|.+++++.++.+ +.+|+||+||+|+++||||++|||.++ ++.++.+++..++....... .....+. T Consensus 355 l~~d~~~r~~~~~~~--~~~G~~T~NE~R~~~gl~pi~gg~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~ 424 (437) T protein:vir:10 355 LRADSAGRAAFYSTM--TQNGLMTRDECRAKENLPPMGGNAAVLTVQSALLPIDKLGEHTTATAA--------QDALKAW 424 (437) T ss_pred hccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCcceEeecCcccchhhccCcCCCcch--------hcccccc Confidence 577888888877654 457999999999999999999888755 67888877654332111100 0000000 Q ss_pred CCCCCCCCCCCCcccccCCCCccc Q lcl|NC_020081. 504 DGNMDNVNGKDSFNQNVGKDGQSK 527 (552) Q Consensus 504 ~~~~~~~~~~~~~~~~~~~~~~~~ 527 (552) +.+..+..+. ++| T Consensus 425 ~~~~~~~~~~-----------~e~ 437 (437) T protein:vir:10 425 LYQEEKTRAT-----------QER 437 (437) T ss_pred CCCCCCCCcc-----------ccC Confidence 1111111110 111 No 14 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=100.00 E-value=8.1e-80 Score=454.20 Aligned_cols=466 Identities=13% Similarity=0.129 Sum_probs=307.5 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVS 103 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~ 103 (552) |-+-.-|+..++.+- .+. ..+...+.+.+......+. .+......+.....|.+||.+++..+ T Consensus 1 ~~~~~~~~~~~p~~~---------~~~--~~~~~~~~~~~~~g~~~~~-----~~~~~~~~~~~~~~V~acV~~IA~~i- 63 (518) T protein:vir:78 1 MLLANGQTLSAPAMA---------ELS--PQMQDSYYYAPAVGMQLER-----QFSLYGGIYKNQPWVRTVIAKRAQAL- 63 (518) T ss_pred CcccCceeeccchhh---------hhh--hhhhhcccccceeceeccc-----ccchhhHHhhhhHHHHHHHHHHHHhh- Confidence 111111111111110 000 1122233333322111111 11222234455677888887766554 Q ss_pred HHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEEC Q lcl|NC_020081. 104 MFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYD 183 (552) Q Consensus 104 ~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~ 183 (552) ++++|.+..++.++... +..+.+..++ .+||++||+++||+.++.+++++||+|++|+|+ T Consensus 64 ----------A~lp~~l~~~~~~~~~~----~~~~~~~~Ll------~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~ 123 (518) T protein:vir:78 64 ----------ARLPVKCMFTSGDTETE----EHDTGYAKLL------ADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN 123 (518) T ss_pred ----------ccCceEEEEEcCCcccc----ccchHHHHHH------hCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEc Confidence 56677776665543221 1223333222 368899999999999999999999999999999 Q ss_pred CCCCEEEEEEecCceeEEEECCCcccccccceeEEEEE-cC--CceEEEEcccceeeecccccCCccCC-cccccHHHHH Q lcl|NC_020081. 184 KLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQV-ID--DKVVAKFKAKEMAWEVSNPRTDLTVG-KYGYPELEIA 259 (552) Q Consensus 184 ~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~-~~--~~~~~~~~~~evi~~~~~~~~~~~~g-~~G~spl~~~ 259 (552) ..|+|++||||+|.+|+|..+.++.. .+|+.. .. +.....|+++||||++... .++ .+|+||+.++ T Consensus 124 ~~G~~~~L~~l~p~~Vtv~~~~~~~~------~~y~~~~~~~~~~~~~~~~~~eIiHir~~~----~dg~~~G~Spi~~~ 193 (518) T protein:vir:78 124 KSGTPEKLMPMHPSRVAIKRNSRTGR------YEYYFQAGAGVGTQLVSFADDEVVPIRFFN----PDGLERGLSLMESL 193 (518) T ss_pred CCCcEEEEEEECCCceEEEEcCCCCE------EEEEEEecCCccceeEEecCCcEEEecCCC----CCcccccccHHHHH Confidence 99999999999999999998865432 223222 22 3356789999999987432 233 5899999999 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhH Q lcl|NC_020081. 260 LNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKD 339 (552) Q Consensus 260 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d 339 (552) ..+|..+.++++|+.++|+||++|+|||++++ .+++++++++++.|++.++|..|+|+++|+ ++|++|+++++++.| T Consensus 194 ~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~--~ls~e~~~~~k~~~~~~~~G~~nag~~~vL-~~G~~~~~l~~~~~d 270 (518) T protein:vir:78 194 KSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK--RLSPEAQQRLREQFDRAHAGSSNTGKTMVV-EEGMEPIPLQLTAVE 270 (518) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCccEEEecCC--CCCHHHHHHHHHHHHHHhcCcccCCceeEc-CCCceEEeccCChhH Confidence 99999999999999999999999999999875 468999999999999999999999997665 579999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccc Q lcl|NC_020081. 340 MEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGD 419 (552) Q Consensus 340 ~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~ 419 (552) +||+|++++++++||++|||||++||+.+++ +++|++++.+.|+++||.||+.+||++||++|++.++.+ T Consensus 271 ~q~le~r~~~~~eIa~afgVPp~~lg~~~~s----------t~sn~e~~~~~f~~~tL~P~~~~ie~eln~~L~~~~~~~ 340 (518) T protein:vir:78 271 MQFIEARQLNREEVCGVYDIAPPIVHILDRA----------TFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRK 340 (518) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhccCCCC----------CchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCc Confidence 9999999999999999999999999987654 478999999999999999999999999999999887776 Q ss_pred eeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCC--CCCeeeccccccchhhhccccccc--cccC Q lcl|NC_020081. 420 YVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTE--GGDVTLAGVHVQRLGQIMQQEQVE--YQRQ 490 (552) Q Consensus 420 ~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~--ggD~~~~~~n~~~~~~~~~~~~~~--~~~~ 490 (552) ++|+| +++|.+++++.+..+ +.+|+||+||+|+++||||++ |||+++++.|+++++......... +... T Consensus 341 ~~~~fd~~~Llr~D~~~r~~~~~~~--~~~G~lT~NE~R~~~gl~pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~ 418 (518) T protein:vir:78 341 NRMKFDIDDVIQPDWEAKSESTQKM--VNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAP 418 (518) T ss_pred ceEEeechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceecccccccccCCCCCCCC Confidence 66666 478888888777654 457999999999999999996 899999999999987543221111 1111 Q ss_pred CCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccccccccccccC-----------------ccccc------c-c Q lcl|NC_020081. 491 MDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQGG-----------------KDDNG------N-V 546 (552) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~------~-~ 546 (552) ..+......+.+.+++...+.-+.++.... .+.-.++.+...+...|. ++..| | - T Consensus 419 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (518) T protein:vir:78 419 KRPASTPVASLDQSPPASVPGLSPTNSDRS-TDSGKTEPRRLMQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQLAEKY 497 (518) T ss_pred CCCCcccccccccCccccCCCCCccccccc-ccccccchhcccCCCCcccccchHHHHHHHHhhcCCcchhhhhhhhhhc Confidence 111000000000000000000000000000 000000001111111110 00000 0 0 Q ss_pred cccccC Q lcl|NC_020081. 547 VNDWEA 552 (552) Q Consensus 547 ~~~~~~ 552 (552) +.|.+. T Consensus 498 ~~~~~~ 503 (518) T protein:vir:78 498 PDDLED 503 (518) T ss_pred chhHHH Confidence 111111 No 15 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=100.00 E-value=9.7e-81 Score=459.24 Aligned_cols=500 Identities=20% Similarity=0.248 Sum_probs=321.1 Q ss_pred CCC---CCCCcccccchhhccc-------ccCc--ccc---cccccchhhhhccccccccccccccccccccccccCCcc Q lcl|NC_020081. 1 MGL---LDGFFKGRKQQDNIID-------INDD--MAV---RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDF 65 (552) Q Consensus 1 ~~~---~~~~~~~~~~~~~~~~-------~~~~--~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 65 (552) -++ .+.--||++|+---+- -+.- .+. ++.|+.++..|....++++.........+. ..++.+ T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f~~s---~es~s~ 101 (945) T protein:vir:10 25 SNIKANVDSLSRGKDYPGFKPLLTYRALAWNSTVVYSIIIFRKNQVLKKEKIIVPYNHQEPPFKFNLFEYS---PESLMY 101 (945) T ss_pred ccchhchhhhhcccCCCCcchhhhhhhhhccceeeeeeeeehhhhHHHhhcccccccccccchhhhhhhcc---Ccccee Confidence 111 2334477777632100 0000 011 111221222222222222211111100000 001111 Q ss_pred cccccCCCCc--hHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCC--hhHHHHHHHHH Q lcl|NC_020081. 66 KEAPSIHGKQ--NLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPN--DHNKKKIKEIE 141 (552) Q Consensus 66 ~~~~~~~~~~--~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~--~~~~~~~~~l~ 141 (552) .+ +...+. .....+++.+...+.+.+||.++++.+ +++++.+..+..++... .+..+..|++. T Consensus 102 vt--sls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sI-----------AsLPlklYrr~edG~~~~~~kk~~~~hpL~ 168 (945) T protein:vir:10 102 LP--SISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKL-----------TSKELEIYKHIEDKHVNYYLKRIRDARNIL 168 (945) T ss_pred cc--cccCccceeeehhhhhhhhccHHHHHHHHHHHhhh-----------ccCceEEEEecccCcccccccccccchHHH Confidence 11 111111 012355677778888899988766654 45666665444333222 12334567777 Q ss_pred HHHHhcCCCCCCCccCCHHH----HHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeE Q lcl|NC_020081. 142 NFIEKTGRIDNDFTRDNFRS----FVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVR 217 (552) Q Consensus 142 ~~l~~~n~~~~pn~~~t~~~----f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~ 217 (552) .+|. +||+.||+++ |+++++.+++++||+|++++|+..|+|++||||+|++|++..+++|..+ ++ T Consensus 169 ~LL~------rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~ii~L~pLdPs~Vti~~ddDG~~~-----y~ 237 (945) T protein:vir:10 169 EFLE------RPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNLVAITPVDGTTIKPILSEDTGIV-----VG 237 (945) T ss_pred HHHh------CCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCCcEE-----EE Confidence 7665 4677888776 7778999999999999999999999999999999999999999888653 35 Q ss_pred EEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCceEEEeCCC---- Q lcl|NC_020081. 218 YVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFA-QGGTTRGLLHIKTG---- 292 (552) Q Consensus 218 y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~-ng~~p~gil~~~~~---- 292 (552) |++..++.....++++|+|||++++..+....+||+||+.+++.+|..+.++++++.++|. ||++|+|||+++++ T Consensus 238 Yv~~idG~~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d 317 (945) T protein:vir:10 238 YVQEVDGAIVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKE 317 (945) T ss_pred EEEecCCceEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCccccc Confidence 6777788888889999999998887766666678999999999999999999999999995 78899999998643 Q ss_pred ----CCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccc Q lcl|NC_020081. 293 ----QEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPN 368 (552) Q Consensus 293 ----~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~ 368 (552) ..+++++++++++.|++.++|. ++|+ |+++++|++|+++++++.|+||++++++++++||++|||||++||+.+ T Consensus 318 ~k~~~~LseEq~erlKe~wee~~sG~-NnG~-piVLdeGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~~e 395 (945) T protein:vir:10 318 GDIYPQLSREQLESIQRQLQAIMMGD-YTQV-PILSGGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQDVGILE 395 (945) T ss_pred cccccccCHHHHHHHHHHHHHHhCCc-cccc-ceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccCC Confidence 4578999999999999999884 5565 456678999999999999999999999999999999999999999865 Q ss_pred cccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccc-ccceeecccccChHHHHHHHHHH-HHHhcCC Q lcl|NC_020081. 369 RGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQF-GGDYVFNFVGGDAKTEAEIISIL-ESKAKIG 446 (552) Q Consensus 369 ~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~-~~~~~~~f~~~d~~~~~~~~~~~-~~~~~g~ 446 (552) . .+++|++++...|+++||+|++++||++||++|++.. +..++|+|+..+..+..+.++++ +...+|+ T Consensus 396 ~----------st~SNiEqq~~~Fv~~tL~Pil~~IEqeLNrkLl~~~eg~~i~fdFd~ldl~D~ksraEal~kli~sGi 465 (945) T protein:vir:10 396 G----------SNKATAEVMASLTKAKGLEPLMATISKGFDEVVSEFRNEKDIKLWFKEDDLEKERDWWNIIQGQLNTGF 465 (945) T ss_pred C----------CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccCceeEEEecchhccCHHHHHHHHHHHHhCCC Confidence 4 3578999999999999999999999999999998654 35678888655443333333333 2445789 Q ss_pred cCHHHHHHHhCCCCCCCCCeeeccc-cccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCc Q lcl|NC_020081. 447 LTINDIRKELGYPDTEGGDVTLAGV-HVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQ 525 (552) Q Consensus 447 lT~NE~R~~~gl~p~~ggD~~~~~~-n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (552) ||+||+|+++||||+||||+++++. ++++.+.....+....+ ....+...+++...+.. .++ +++.. T Consensus 466 LTiNEvRe~lGLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p--~q~aq~~~dqp~~kGGe-----~dE-ns~~p---- 533 (945) T protein:vir:10 466 RSINEARMEKGLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMP--PQLAQAMADQPSQQGGG-----VDE-NSSVP---- 533 (945) T ss_pred cCHHHHHHHhCCCCCCCcceeeeccccccccccccccccCCCC--cccccCCCCCCCCCCCC-----CCC-CCCCC---- Confidence 9999999999999999999999886 45555433221111100 01111111111100000 000 00000 Q ss_pred cccccccccc--------------c------ccCcccc-cc-c--------------cccccC Q lcl|NC_020081. 526 SKQQANTNST--------------P------QGGKDDN-GN-V--------------VNDWEA 552 (552) Q Consensus 526 ~~~~~~~~~~--------------~------~~~~~~~-~~-~--------------~~~~~~ 552 (552) .+.++.... + +-++.|+ |+ | |+.|-. T Consensus 534 -sE~kda~~e~~~~l~~~~~~~a~e~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 595 (945) T protein:vir:10 534 -SEQKNAGLEVLRNLFKSLDANASENLKQVIELTNDDNYLKEKELLTRVLKSVGLDSVSEFIE 595 (945) T ss_pred -CcccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCchhHHHHHHHHHHHHhhhHHHHHHHh Confidence 001111111 0 0111111 11 0 000000 No 16 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=100.00 E-value=1.6e-80 Score=458.12 Aligned_cols=425 Identities=17% Similarity=0.174 Sum_probs=311.8 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||+++ |+++-. +.. ++........+.+...-..|.... ++ +..... T Consensus 1 M~~~~-r~~~~~------------~~~--------------~r~~~~~~~~~~~~~~~~~~~g~~---~~--~~~v~~-- 46 (432) T protein:vir:10 1 MKIVD-SVKKFF------------NFE--------------KRQTSQVIELNKDDEKLLEWLGIS---PS--TISVKG-- 46 (432) T ss_pred CChHH-HHHHhc------------Ccc--------------ccCcccccccCCchHHHHHHhCCC---cC--ccccch-- Confidence 77777 432110 000 000000000000000001111100 00 000000 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) .-+...+.+.+||.+++..+ +++++.+..++.++. .....|++..+|. .+||++||++ T Consensus 47 --~~al~~~~v~~~i~~ia~~i-----------a~lp~~~~~~~~~~~----~~~~~~~l~~lL~-----~~PN~~~t~~ 104 (432) T protein:vir:10 47 --KNALKVATVFACIKILSESV-----------SKLPLKIYQEDEYGI----QRGTKHYLNNLLR-----LRPNPYMSSM 104 (432) T ss_pred --hhhhccHHHHHHHHHHHHhh-----------ccCceEEEEecCCce----eeccccHHHHHHH-----hhccCCCCHH Confidence 12334566788887766654 456676655544332 1223455555553 2688999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecc Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS 240 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~ 240 (552) +||+.++.+++++||+|++|+|+..|+|++||||+|++|++..++++..... ...|+++..++....|+++||||++. T Consensus 105 ~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~--~~~~y~~~~~g~~~~~~~~eiih~r~ 182 (432) T protein:vir:10 105 NFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSK--TKMWYVVNTGGQQRVLKPEEILHFKN 182 (432) T ss_pred HHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccccccc--ceEEEEEecCCeEEEEccccEEEecC Confidence 9999999999999999999999999999999999999999999887755432 23345555667778899999999986 Q ss_pred cccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccc Q lcl|NC_020081. 241 NPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKI 320 (552) Q Consensus 241 ~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~ 320 (552) ++ +.++++|+||+..+..+|..+.++++++.++|+||++|+|||++++ .+++++.+++++.|++.++|..|+|++ T Consensus 183 ~~---~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~ 257 (432) T protein:vir:10 183 GI---TLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG--DLNEDAKKVFRENFESMSSGLQNSHRI 257 (432) T ss_pred CC---CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC--CCCHHHHHHHHHHHHHHhcccccCCcc Confidence 43 4567999999999999999999999999999999999999999876 468999999999999999999999998 Q ss_pred eeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHH Q lcl|NC_020081. 321 PVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPL 400 (552) Q Consensus 321 ~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~ 400 (552) +|+ ++|++|+++++++.|+||++++++++++||++|||||++||..+.+ ++++++++.+.|++.||+|+ T Consensus 258 ~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~----------~~s~~e~~~~~~~~~~l~P~ 326 (432) T protein:vir:10 258 ALM-PVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKA----------TLNNIEQQQQQFYTDTLQAT 326 (432) T ss_pred eec-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHH Confidence 665 5799999999999999999999999999999999999999976654 47899999999999999999 Q ss_pred HHHHHHHHHhhcCcccc--cceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccc Q lcl|NC_020081. 401 LKFIEDAVNKYIVSQFG--GDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHV 473 (552) Q Consensus 401 ~~~ie~~ln~~L~~~~~--~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~ 473 (552) ++.||++||++|++..+ .+++|+| ++.|.+++++.+..+ +.+|++|+||+|+++||||+||||++++++|+ T Consensus 327 ~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~--~~~G~~t~NE~R~~~g~~pi~ggD~~~~~~n~ 404 (432) T protein:vir:10 327 LTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTG--IQGGFLKPNEARSKEDLPPEAGGDRLLVNGNM 404 (432) T ss_pred HHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCeEeecccc Confidence 99999999999997644 4456665 467888888776644 45799999999999999999999999999999 Q ss_pred cchhhhccccccccccCCCCCccCcccCCCCCC Q lcl|NC_020081. 474 QRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGN 506 (552) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (552) ++++.+.+... ++++. ....++++.+++ T Consensus 405 ~~~~~~~~~~~----k~~~~-~~~~~~~~~~~~ 432 (432) T protein:vir:10 405 LPIDMAGQAYL----KGGDT-NGEVSKEGNEGN 432 (432) T ss_pred cchhhcccccc----CCCCC-CCCCCCCCCCCC Confidence 99886654321 11111 111111111111 No 17 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=100.00 E-value=1.6e-80 Score=458.12 Aligned_cols=425 Identities=17% Similarity=0.174 Sum_probs=311.8 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||+++ |+++-. +.. ++........+.+...-..|.... ++ +..... T Consensus 1 M~~~~-r~~~~~------------~~~--------------~r~~~~~~~~~~~~~~~~~~~g~~---~~--~~~v~~-- 46 (432) T protein:vir:10 1 MKIVD-SVKKFF------------NFE--------------KRQTSQVIELNKDDEKLLEWLGIS---PS--TISVKG-- 46 (432) T ss_pred CChHH-HHHHhc------------Ccc--------------ccCcccccccCCchHHHHHHhCCC---cC--ccccch-- Confidence 77777 432110 000 000000000000000001111100 00 000000 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) .-+...+.+.+||.+++..+ +++++.+..++.++. .....|++..+|. .+||++||++ T Consensus 47 --~~al~~~~v~~~i~~ia~~i-----------a~lp~~~~~~~~~~~----~~~~~~~l~~lL~-----~~PN~~~t~~ 104 (432) T protein:vir:10 47 --KNALKVATVFACIKILSESV-----------SKLPLKIYQEDEYGI----QRGTKHYLNNLLR-----LRPNPYMSSM 104 (432) T ss_pred --hhhhccHHHHHHHHHHHHhh-----------ccCceEEEEecCCce----eeccccHHHHHHH-----hhccCCCCHH Confidence 12334566788887766654 456676655544332 1223455555553 2688999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecc Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS 240 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~ 240 (552) +||+.++.+++++||+|++|+|+..|+|++||||+|++|++..++++..... ...|+++..++....|+++||||++. T Consensus 105 ~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~--~~~~y~~~~~g~~~~~~~~eiih~r~ 182 (432) T protein:vir:10 105 NFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSK--TKMWYVVNTGGQQRVLKPEEILHFKN 182 (432) T ss_pred HHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccccccc--ceEEEEEecCCeEEEEccccEEEecC Confidence 9999999999999999999999999999999999999999999887755432 23345555667778899999999986 Q ss_pred cccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccc Q lcl|NC_020081. 241 NPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKI 320 (552) Q Consensus 241 ~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~ 320 (552) ++ +.++++|+||+..+..+|..+.++++++.++|+||++|+|||++++ .+++++.+++++.|++.++|..|+|++ T Consensus 183 ~~---~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~ 257 (432) T protein:vir:10 183 GI---TLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG--DLNEDAKKVFRENFESMSSGLQNSHRI 257 (432) T ss_pred CC---CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC--CCCHHHHHHHHHHHHHHhcccccCCcc Confidence 43 4567999999999999999999999999999999999999999876 468999999999999999999999998 Q ss_pred eeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHH Q lcl|NC_020081. 321 PVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPL 400 (552) Q Consensus 321 ~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~ 400 (552) +|+ ++|++|+++++++.|+||++++++++++||++|||||++||..+.+ ++++++++.+.|++.||+|+ T Consensus 258 ~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~----------~~s~~e~~~~~~~~~~l~P~ 326 (432) T protein:vir:10 258 ALM-PVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKA----------TLNNIEQQQQQFYTDTLQAT 326 (432) T ss_pred eec-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHH Confidence 665 5799999999999999999999999999999999999999976654 47899999999999999999 Q ss_pred HHHHHHHHHhhcCcccc--cceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccc Q lcl|NC_020081. 401 LKFIEDAVNKYIVSQFG--GDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHV 473 (552) Q Consensus 401 ~~~ie~~ln~~L~~~~~--~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~ 473 (552) ++.||++||++|++..+ .+++|+| ++.|.+++++.+..+ +.+|++|+||+|+++||||+||||++++++|+ T Consensus 327 ~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~--~~~G~~t~NE~R~~~g~~pi~ggD~~~~~~n~ 404 (432) T protein:vir:10 327 LTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTG--IQGGFLKPNEARSKEDLPPEAGGDRLLVNGNM 404 (432) T ss_pred HHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCeEeecccc Confidence 99999999999997644 4456665 467888888776644 45799999999999999999999999999999 Q ss_pred cchhhhccccccccccCCCCCccCcccCCCCCC Q lcl|NC_020081. 474 QRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGN 506 (552) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (552) ++++.+.+... ++++. ....++++.+++ T Consensus 405 ~~~~~~~~~~~----k~~~~-~~~~~~~~~~~~ 432 (432) T protein:vir:10 405 LPIDMAGQAYL----KGGDT-NGEVSKEGNEGN 432 (432) T ss_pred cchhhcccccc----CCCCC-CCCCCCCCCCCC Confidence 99886654321 11111 111111111111 No 18 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=100.00 E-value=1.6e-80 Score=458.12 Aligned_cols=425 Identities=17% Similarity=0.174 Sum_probs=311.8 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||+++ |+++-. +.. ++........+.+...-..|.... ++ +..... T Consensus 1 M~~~~-r~~~~~------------~~~--------------~r~~~~~~~~~~~~~~~~~~~g~~---~~--~~~v~~-- 46 (432) T protein:vir:10 1 MKIVD-SVKKFF------------NFE--------------KRQTSQVIELNKDDEKLLEWLGIS---PS--TISVKG-- 46 (432) T ss_pred CChHH-HHHHhc------------Ccc--------------ccCcccccccCCchHHHHHHhCCC---cC--ccccch-- Confidence 77777 432110 000 000000000000000001111100 00 000000 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) .-+...+.+.+||.+++..+ +++++.+..++.++. .....|++..+|. .+||++||++ T Consensus 47 --~~al~~~~v~~~i~~ia~~i-----------a~lp~~~~~~~~~~~----~~~~~~~l~~lL~-----~~PN~~~t~~ 104 (432) T protein:vir:10 47 --KNALKVATVFACIKILSESV-----------SKLPLKIYQEDEYGI----QRGTKHYLNNLLR-----LRPNPYMSSM 104 (432) T ss_pred --hhhhccHHHHHHHHHHHHhh-----------ccCceEEEEecCCce----eeccccHHHHHHH-----hhccCCCCHH Confidence 12334566788887766654 456676655544332 1223455555553 2688999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecc Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS 240 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~ 240 (552) +||+.++.+++++||+|++|+|+..|+|++||||+|++|++..++++..... ...|+++..++....|+++||||++. T Consensus 105 ~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~--~~~~y~~~~~g~~~~~~~~eiih~r~ 182 (432) T protein:vir:10 105 NFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSK--TKMWYVVNTGGQQRVLKPEEILHFKN 182 (432) T ss_pred HHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccccccc--ceEEEEEecCCeEEEEccccEEEecC Confidence 9999999999999999999999999999999999999999999887755432 23345555667778899999999986 Q ss_pred cccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccc Q lcl|NC_020081. 241 NPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKI 320 (552) Q Consensus 241 ~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~ 320 (552) ++ +.++++|+||+..+..+|..+.++++++.++|+||++|+|||++++ .+++++.+++++.|++.++|..|+|++ T Consensus 183 ~~---~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~ 257 (432) T protein:vir:10 183 GI---TLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG--DLNEDAKKVFRENFESMSSGLQNSHRI 257 (432) T ss_pred CC---CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC--CCCHHHHHHHHHHHHHHhcccccCCcc Confidence 43 4567999999999999999999999999999999999999999876 468999999999999999999999998 Q ss_pred eeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHH Q lcl|NC_020081. 321 PVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPL 400 (552) Q Consensus 321 ~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~ 400 (552) +|+ ++|++|+++++++.|+||++++++++++||++|||||++||..+.+ ++++++++.+.|++.||+|+ T Consensus 258 ~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~----------~~s~~e~~~~~~~~~~l~P~ 326 (432) T protein:vir:10 258 ALM-PVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKA----------TLNNIEQQQQQFYTDTLQAT 326 (432) T ss_pred eec-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHH Confidence 665 5799999999999999999999999999999999999999976654 47899999999999999999 Q ss_pred HHHHHHHHHhhcCcccc--cceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccc Q lcl|NC_020081. 401 LKFIEDAVNKYIVSQFG--GDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHV 473 (552) Q Consensus 401 ~~~ie~~ln~~L~~~~~--~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~ 473 (552) ++.||++||++|++..+ .+++|+| ++.|.+++++.+..+ +.+|++|+||+|+++||||+||||++++++|+ T Consensus 327 ~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~--~~~G~~t~NE~R~~~g~~pi~ggD~~~~~~n~ 404 (432) T protein:vir:10 327 LTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTG--IQGGFLKPNEARSKEDLPPEAGGDRLLVNGNM 404 (432) T ss_pred HHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCeEeecccc Confidence 99999999999997644 4456665 467888888776644 45799999999999999999999999999999 Q ss_pred cchhhhccccccccccCCCCCccCcccCCCCCC Q lcl|NC_020081. 474 QRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGN 506 (552) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (552) ++++.+.+... ++++. ....++++.+++ T Consensus 405 ~~~~~~~~~~~----k~~~~-~~~~~~~~~~~~ 432 (432) T protein:vir:10 405 LPIDMAGQAYL----KGGDT-NGEVSKEGNEGN 432 (432) T ss_pred cchhhcccccc----CCCCC-CCCCCCCCCCCC Confidence 99886654321 11111 111111111111 No 19 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=100.00 E-value=1.1e-79 Score=453.54 Aligned_cols=461 Identities=14% Similarity=0.147 Sum_probs=305.0 Q ss_pred ccc-cccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHH Q lcl|NC_020081. 24 MAV-RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQV 102 (552) Q Consensus 24 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~ 102 (552) |-+ ..+.+..+.+.+. . ..+...+.+.+....... + .+......+.....|++||.+++..+ T Consensus 1 ~~~~~~~~~~~p~~~e~-------~-----~~~~~~~~~~~~~~~~~~---~--~~~~~~~~a~~~~~V~acV~~IA~~i 63 (518) T protein:vir:10 1 MLLANGQTLSAPAMAEL-------S-----PQMQDSYYYAPAVGMQLE---R--QFSLYGGIYKNQPWVRTVIAKRAQAL 63 (518) T ss_pred CcccCceeecCchhhhh-------h-----hhhhcccccccccceecc---c--ccchhhHHHhhhHHHHHHHHHHHHhh Confidence 111 1111111111110 0 011122222221110000 0 01112223445567888887766654 Q ss_pred HHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEE Q lcl|NC_020081. 103 SMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVY 182 (552) Q Consensus 103 ~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r 182 (552) +++++.+..++.++.. ....|.+..++ .+||++||+++||+.++.+++++||+|++|+| T Consensus 64 -----------A~lpl~l~~~~~~~~~----~~~~~~~~~Ll------~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r 122 (518) T protein:vir:10 64 -----------ARLPVKCMFTSGDTET----EESDTGYAKLL------ADPCEYLDPFAFWEWVASTLDIYGETYLAIQK 122 (518) T ss_pred -----------ccCceEEEEEcCCCce----eccchHHHHHH------cCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEE Confidence 4567777655544332 11223333322 36889999999999999999999999999999 Q ss_pred CCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcC---CceEEEEcccceeeecccccCCccCC-cccccHHHH Q lcl|NC_020081. 183 DKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVID---DKVVAKFKAKEMAWEVSNPRTDLTVG-KYGYPELEI 258 (552) Q Consensus 183 ~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~---~~~~~~~~~~evi~~~~~~~~~~~~g-~~G~spl~~ 258 (552) +..|+|++||||+|.+|+|..+.++.. .+|+.... +.....|+++||||++.. ..++ ++|+||+.+ T Consensus 123 ~~~G~~~~L~~l~p~~v~v~~~~~~~~------~~y~~~~~~~~~~~~~~~~~~eViHir~~----s~dg~~~G~spi~~ 192 (518) T protein:vir:10 123 NKSGTPEKLMPMHPSRVAIKRNSRTGR------YEYYFQAGAGVGTQLVSFADDEVVPIRFF----NPDGLERGLSLMES 192 (518) T ss_pred CCCCcEEEEEEECCCceEEEEcCCCCE------EEEEEEecCCccceEEEecCCcEEEecCC----CCCcccccccHHHH Confidence 999999999999999999988765432 22333222 335578999999998753 2334 589999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchh Q lcl|NC_020081. 259 ALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSK 338 (552) Q Consensus 259 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~ 338 (552) +..+|..+.++++++.++|+||++|+|||++++ .+++++++++++.|++.++|..|+|+++|+ ++|++|+++++++. T Consensus 193 a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~--~ls~e~~~~~k~~~~~~~~G~~nag~v~vL-~~G~~~~~l~~s~~ 269 (518) T protein:vir:10 193 LKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK--RLSEAAQQRLREQFDRAHSGSSNTGKTMVV-EEGMEPIPLQLTAV 269 (518) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccEEEecCC--CCCHHHHHHHHHHHHHHhcCccccCcceEc-CCCceEEEccCChh Confidence 999999999999999999999999999999876 468999999999999999999999998665 57999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccccc Q lcl|NC_020081. 339 DMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGG 418 (552) Q Consensus 339 d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~ 418 (552) |+||+|++++++++||++|||||++||+.+++ +++|++++.+.|+++||.||+..||++||++|++.++. T Consensus 270 D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~----------t~sn~eq~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~ 339 (518) T protein:vir:10 270 EMQFIEARQLNREEVCGVYDIAPPIVHILDRA----------TFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVR 339 (518) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhccCCCC----------CchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccC Confidence 99999999999999999999999999987765 47899999999999999999999999999999988776 Q ss_pred ceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCC--CCCeeeccccccchhhhcccccc--cccc Q lcl|NC_020081. 419 DYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTE--GGDVTLAGVHVQRLGQIMQQEQV--EYQR 489 (552) Q Consensus 419 ~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~--ggD~~~~~~n~~~~~~~~~~~~~--~~~~ 489 (552) +++|+| +++|.+++++.+..+ +.+|+||+||+|+++||||++ |||+++++.|+++++........ +++. T Consensus 340 ~~~~~fd~~~llr~D~~~r~~~~~~~--~~~G~lT~NE~R~~~Gl~pie~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~ 417 (518) T protein:vir:10 340 KNRMKFDIDDVIQPDWEAKSESTQKM--VNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPA 417 (518) T ss_pred CceEEEechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccceecccccccccCCCCCCC Confidence 667666 578888888877654 447999999999999999995 89999999999988643222111 0011 Q ss_pred CCCCCccCcccCCCCCCCCCCCCCCCcccccC----CCCcccccccccccccc-----------------Cccccc---- Q lcl|NC_020081. 490 QMDANQFLAQQTGYDGNMDNVNGKDSFNQNVG----KDGQSKQQANTNSTPQG-----------------GKDDNG---- 544 (552) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~-----------------~~~~~~---- 544 (552) ...+... +. .+..+.+.+.......+. .++-.++.+..-+..++ +++..| T Consensus 418 ~~~~~~~----~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 492 (518) T protein:vir:10 418 PKRPAST----PV-ASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQ 492 (518) T ss_pred CCCCCcc----cc-ccccccccccCCCCCcccccccccccccchhccccCCCcccccchHHHHHHHHhhcCccchhHhhh Confidence 0000000 00 000000001000000000 00000000001111111 010000 Q ss_pred --c-ccccccC Q lcl|NC_020081. 545 --N-VVNDWEA 552 (552) Q Consensus 545 --~-~~~~~~~ 552 (552) | -+.|.+. T Consensus 493 ~~~~~~~~~~~ 503 (518) T protein:vir:10 493 LAEKYPDDLED 503 (518) T ss_pred hhhhcchhHHH Confidence 0 0111111 No 20 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=100.00 E-value=7.1e-80 Score=454.49 Aligned_cols=410 Identities=17% Similarity=0.192 Sum_probs=302.1 Q ss_pred hhhccccccccccccccccccccccccC---CcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020081. 35 AILKKGKNTKSNKPKAYEEPIIGSMSMN---PDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARN 111 (552) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~ 111 (552) .....+.+++. ++. .....|+ ......++..+...+.+ .|...+.+++||.+++..+ T Consensus 1 m~~~~~~~~~~-~~~------s~~~~w~~~~~~~~~~~~~~g~~vt~~----~al~~~~v~~~i~~Ia~~i--------- 60 (421) T protein:vir:10 1 MFIPQMFEGKK-RSV------SGGGFWEAMLGGVRSSHSKAGVMITPE----TALALSAVRACVTLLAESV--------- 60 (421) T ss_pred CCCcchhcccc-ccc------CcchhhHHHhhhhccCcccCCceechH----HhhccHHHHHHHHHHHHhh--------- Confidence 22222222221 111 1111222 12222333333332222 2334556788887666554 Q ss_pred hccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEE Q lcl|NC_020081. 112 SDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNF 191 (552) Q Consensus 112 ~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L 191 (552) ++++|.+..++.++.. +....|++..+|. .+||++||+++||+.++.+++++||||++|+|+..|+|++| T Consensus 61 --A~lp~~~~~~~~~g~~---~~~~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L 130 (421) T protein:vir:10 61 --AQLPVELYRRDKNGGR---QRATDHPIYDLIH-----SQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKEL 130 (421) T ss_pred --ccCceEEEEEcCCCce---eecccchHHHHHh-----hcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEE Confidence 4567777655544321 1223355554443 36889999999999999999999999999999999999999 Q ss_pred EEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 192 KAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEV 271 (552) Q Consensus 192 ~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~ 271 (552) |||+|.+|++..+++|.. +|+ ....+ ..++++||||++.+ +.++++|+||+..+..+|..+.++++ T Consensus 131 ~~l~~~~v~v~~~~~g~~-------~y~-~~~~g--~~~~~~eiih~~~~----~~d~~~G~spi~~~~~~i~~~~~~~~ 196 (421) T protein:vir:10 131 IPINPKKVIVLKGPDGMP-------YYE-IPEIG--ETLPMRMMHHVKVF----SLDGYIGSSPIQTNADVLGLNLAVEE 196 (421) T ss_pred EEecCceEEEEECCCceE-------EEE-EcCCC--cEEchhhEEEecCc----CCCCcccccHHHHHHHHHHHHHHHHH Confidence 999999999998887743 233 32222 35789999998753 45679999999999999999999999 Q ss_pred HHHHHHhccCCCceEEEeCCCC--CCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHH Q lcl|NC_020081. 272 FNARFFAQGGTTRGLLHIKTGQ--EQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYL 349 (552) Q Consensus 272 ~~~~~f~ng~~p~gil~~~~~~--~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~ 349 (552) ++.++|+||++|+|||++++.. ..++++++++++.|++.++|..|+|+++++ ++|++|+++++++.|+||+|+++++ T Consensus 197 ~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~ 275 (421) T protein:vir:10 197 HASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALL-QEGMSYKQMSQDNEKAQLLQSRQWG 275 (421) T ss_pred HHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccccCcceec-CCCceEEecCCChhHHHHHHHHHHh Confidence 9999999999999999987643 348999999999999999999999997655 5799999999999999999999999 Q ss_pred HHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecc---- Q lcl|NC_020081. 350 INVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNF---- 424 (552) Q Consensus 350 ~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f---- 424 (552) +++||++|||||++||+.+.+ +++|+|++.+.|++.||.|+++.||++||++|++..+ .+++|+| T Consensus 276 ~~~Ia~~fgVPp~~lg~~~~~----------t~sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~v~fd~~~ 345 (421) T protein:vir:10 276 VEEVCRLYKIPPHMVQMLAKA----------TNNNIEHQGLQFVMYTLLAWLKRHEGALQRDLLLPSERRDLYIEFNVSG 345 (421) T ss_pred HHHHHHHhCCCHHHcCCCcCC----------ccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechh Confidence 999999999999999987654 4789999999999999999999999999999997644 3455665 Q ss_pred -cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCC Q lcl|NC_020081. 425 -VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGY 503 (552) Q Consensus 425 -~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (552) +++|.+++++.++.+ +.+|+||+||+|+++|+||+||||++++|+|++.++.....+..+ ...++ T Consensus 346 l~~~d~~~~~~~~~~~--~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~----------~~~~~-- 411 (421) T protein:vir:10 346 LLRGDQKSRYESYALG--RQWGWLSVNDIRRMENLPPIAGGDKYLTPLNMVDSAQIIPGDKKP----------TAQQM-- 411 (421) T ss_pred hhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccccCCCCc----------ccccC-- Confidence 467888888776654 457999999999999999999999999999988765443211110 00000 Q ss_pred CCCCCCCCCCCCccccc Q lcl|NC_020081. 504 DGNMDNVNGKDSFNQNV 520 (552) Q Consensus 504 ~~~~~~~~~~~~~~~~~ 520 (552) ..++...+.+ T Consensus 412 -------~e~d~~~~~~ 421 (421) T protein:vir:10 412 -------AEIDTILSRT 421 (421) T ss_pred -------cccccccccC Confidence 0000000111 No 21 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=100.00 E-value=2.4e-79 Score=451.64 Aligned_cols=402 Identities=18% Similarity=0.181 Sum_probs=305.2 Q ss_pred hhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_020081. 35 AILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDK 114 (552) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~ 114 (552) ++.+...+. +.+..+.+...-..|... .++ +..... .-|...+.+.+||..+++.++ T Consensus 1 m~f~~~~~~---~~~~~~~~~~~~~~~~g~---~~~--~~~v~~----~~al~~~~v~~~i~~ia~~ia----------- 57 (409) T protein:vir:10 1 MLFRKGFKN---QSQEISIDDKKILEWLGI---NPS--ETYVNG----KSCLKQATVFGCIRILSDNIS----------- 57 (409) T ss_pred CcccccccC---cCCCCCCChHHHHHHhcC---CcC--cceech----hhhhccHHHHHHHHHHHHhhh----------- Confidence 333322222 221211111101112111 111 011001 123456667888877666654 Q ss_pred ccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEe Q lcl|NC_020081. 115 GVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAV 194 (552) Q Consensus 115 ~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l 194 (552) ++++.+..+.... .+.+.|++..+|. .+||+.||+++||+.++.+++++||+|++++|+..|++++|||| T Consensus 58 ~lp~~~~~~~~~~-----~~~~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i 127 (409) T protein:vir:10 58 KLPIKIYQKKDGI-----KRVPDHYLEYLLK-----LRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPL 127 (409) T ss_pred hCceEEEEecCCe-----eeccCchHHHHHh-----hccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEE Confidence 4566664332111 1123355544442 36889999999999999999999999999999999999999999 Q ss_pred cCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 195 DASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNA 274 (552) Q Consensus 195 ~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~ 274 (552) +|++|++..+++|.... .....|++....+....|+++||||++.. ..++++|+||+..+..+|..+.++++++. T Consensus 128 ~~~~V~v~~~~~~~~~~-~~~~~y~~~~~~g~~~~~~~~evih~r~~----~~d~~~G~s~i~~~~~~i~~~~~~~~~~~ 202 (409) T protein:vir:10 128 KSDGMKIFVDDTGLLNS-ENNVWYLYTDDLGQRHKFMSDEILHFKGL----TADGLAGLSVIELLNHLIENGKSSETYLN 202 (409) T ss_pred cCCceEEEEcCCccccc-cceEEEEEEeCCceeEEeccccEEEecCc----CCCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 99999999988876543 33456666667777889999999998742 34578999999999999999999999999 Q ss_pred HHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHH Q lcl|NC_020081. 275 RFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVIC 354 (552) Q Consensus 275 ~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia 354 (552) ++|+||++|+|||++++ .+++++.+++++.|++.++|..|+|+++++ ++|++|++++.++.|+||+|++++++++|| T Consensus 203 ~~f~ng~~~~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia 279 (409) T protein:vir:10 203 NFFKNGLQVKGLVQYAG--DLNPEAEEVFKENFERMSSGLKNAHRIAML-PIGYKFEPISQKLVDAQFLENSQLTIRQIA 279 (409) T ss_pred HHHhccCCCcEEEEcCC--CCCHHHHHHHHHHHHHHhccccccCCceec-CCCceEEEccCChhhHHHHHHHHHHHHHHH Confidence 99999999999999875 468999999999999999999999997665 579999999999999999999999999999 Q ss_pred HHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccc--ccceeeccc-----cc Q lcl|NC_020081. 355 SIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQF--GGDYVFNFV-----GG 427 (552) Q Consensus 355 ~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~--~~~~~~~f~-----~~ 427 (552) ++|||||++||+.+.+ +++|++++.+.|++.||.|+++.||++||++|++.. ..+++++|+ +. T Consensus 280 ~~fgVPp~~lg~~~~~----------~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~ 349 (409) T protein:vir:10 280 SVFGVKMHQLNDLDRA----------THSNITEQNREFYIDTLQSILNMYELEINYKLFLISEIKNGFYSKFNVDTILRA 349 (409) T ss_pred HHhCCCHHHcCCCCCC----------ccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCchhccCCcEEEEechhhhcc Confidence 9999999999976543 479999999999999999999999999999998754 345666654 67 Q ss_pred ChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCC Q lcl|NC_020081. 428 DAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDA 493 (552) Q Consensus 428 d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~ 493 (552) |.+++++.+.. .+.+|+||+||+|+++||||+||||++++++|+++++...+.. .++++. T Consensus 350 d~~~~~~~~~~--~~~~G~~T~NE~R~~lgl~p~~ggD~~~~~~n~~~~~~~~~~~----~kgGe~ 409 (409) T protein:vir:10 350 DIKTRYESYKE--AIQNGFKTPNEIRELEEDEPLEGGDVLLINGNMIPVKMAGEQY----SKGGEK 409 (409) T ss_pred CHHHHHHHHHH--HHhCCCcCHHHHHHHhCCCCCCCcCeeeeccCccchhhccccc----cccCCC Confidence 88888776664 3557999999999999999999999999999999987553211 111111 No 22 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=100.00 E-value=2.8e-79 Score=451.27 Aligned_cols=447 Identities=16% Similarity=0.180 Sum_probs=305.0 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||+++ +++++.-+ +... ..++..-....+... . .-..+..+...+.+ T Consensus 1 Mg~~~-~l~~~~~~--------------------~~~~----~~~~~~~~~~~~~~~----~---~~~~~~~g~~v~~~- 47 (457) T protein:vir:62 1 MGFWS-ALFGRGHS--------------------PALD----AAEGRAWEPYDPSIY----N---LGATASSGERVTPH- 47 (457) T ss_pred Cchhh-hhhccccc--------------------cccc----cccccccccchhhhh----h---ccccccCCceechH- Confidence 88888 44332210 0000 000000000011000 0 00112222221111 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) .|...+++.+||.+++..+ +++++.++.+..... .....+.+..++ ..||++||++ T Consensus 48 ---~al~~~~v~~~i~~ia~~i-----------A~lp~~~~~~~~~~~----~~~~~~~~~~ll------~~pn~~~t~~ 103 (457) T protein:vir:62 48 ---DALQVSAVFASVRLLSETI-----------ATLPLSTYSKRGGTR----KEIDTPEWLDFP------NAEPGGMGRI 103 (457) T ss_pred ---HhhccHHHHHHHHHHHHhH-----------hhCceEEEEecCCcc----ccccchHHHHhc------cccCCCCCHH Confidence 1223466788887766654 456777655432211 111222222222 3567889999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCc--eEEEEcccceeee Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDK--VVAKFKAKEMAWE 238 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~--~~~~~~~~evi~~ 238 (552) +||+.++.+++++||+|++|+++ .|++.+||||+|.+|++..+.++..... ..+.|.+...+. ....|+++||||+ T Consensus 104 ~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~-~~~~y~~~~~g~~~~~~~~~~~eiih~ 181 (457) T protein:vir:62 104 DILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVMVDGLRRK-VFEAYDIDADGNEVLLGWFTPRDVLHI 181 (457) T ss_pred HHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEeccCCccce-eEEEEEEccCCceeEEEeeCccceEEe Confidence 99999999999999999999765 6899999999999999987655433221 112233333333 2356899999999 Q ss_pred cccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccc Q lcl|NC_020081. 239 VSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAW 318 (552) Q Consensus 239 ~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nag 318 (552) +.+. ....++|+||+.+++.+|..+.++++++.++|+||++|+|||++++ .+++++++++++.|++.++|.+|+| T Consensus 182 r~~~---~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~ls~e~~~~~~~~~~~~~~G~~nag 256 (457) T protein:vir:62 182 PGMM---LPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPG--TMSEEGLARAREAWRAANSGVDNAH 256 (457) T ss_pred cCCC---CCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCC--CCCHHHHHHHHHHHHHHhcCccccC Confidence 7542 3334899999999999999999999999999999999999999976 4699999999999999999999999 Q ss_pred cceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhh Q lcl|NC_020081. 319 KIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLE 398 (552) Q Consensus 319 k~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~ 398 (552) +++|| ++|++|+++++++.|+||+|++++++++||++|||||++||+.+.+++ .++|++++.+.|++.||. T Consensus 257 ~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--------~~sn~eq~~~~f~~~~l~ 327 (457) T protein:vir:62 257 RVALL-TEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTS--------WGSGLAEQNIAFTMFSLR 327 (457) T ss_pred cceec-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCccc--------ccchHHHHHHHHHHHHHH Confidence 98666 579999999999999999999999999999999999999999887654 347899999999999999 Q ss_pred HHHHHHHHHHHhhcCccccc-ceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCC--Ceeecc Q lcl|NC_020081. 399 PLLKFIEDAVNKYIVSQFGG-DYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGG--DVTLAG 470 (552) Q Consensus 399 P~~~~ie~~ln~~L~~~~~~-~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~gg--D~~~~~ 470 (552) ||+++||++||++|+++.+. .++++| +++|.+++++.+..+ +..|+||+||+|+++||||++|| |++++| T Consensus 328 P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~d~~~r~~~~~~~--~~~G~~T~NE~R~~~gl~pi~~g~~D~~~~~ 405 (457) T protein:vir:62 328 PWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLG--LQNGIYSIDEVRAAEDMTPLPDGLGEKYRVP 405 (457) T ss_pred HHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCcceeeec Confidence 99999999999999987653 344554 577888888877654 45799999999999999999987 999999 Q ss_pred ccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccccccccccccCcc Q lcl|NC_020081. 471 VHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQGGKD 541 (552) Q Consensus 471 ~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 541 (552) +|+.+++.....+..+.....++ ...++..+..++ +...++++++ +++|-.. T Consensus 406 ~n~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~----------~~~~~~d~~~------~~~~~~~ 457 (457) T protein:vir:62 406 LNLGEIGEEPEPEPAPAPPAIDP---PAEEPADDEEPD----------NAEGDPDEGE------TEDDDDA 457 (457) T ss_pred cccccccccccccccCCCccCCC---CccCCCCCCCCC----------CCCCCCcccc------ccccccC Confidence 99998876654443322211111 000000011111 1111111111 1111111 No 23 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=100.00 E-value=3e-79 Score=451.11 Aligned_cols=413 Identities=16% Similarity=0.170 Sum_probs=295.7 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcc------cccccCCCC Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDF------KEAPSIHGK 74 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~ 74 (552) ||+++ |.+.- ... ..+..+.++.++.|.. ....+..+. T Consensus 7 ~~~~~-~~~~~------------------------~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~ 50 (432) T protein:vir:10 7 LGLLG-QLKAM------------------------FVP-----------PDPVDIGGGQTFTPVNATARDLGIIISDTGA 50 (432) T ss_pred cchhh-hhHhh------------------------cCC-----------ccccccccccccccCcchhhhhcccccccCc Confidence 44444 21111 100 0000011111111100 000111111 Q ss_pred chHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCC Q lcl|NC_020081. 75 QNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDF 154 (552) Q Consensus 75 ~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn 154 (552) ..+. .-+...+++.+||.+++..+ ++++|.++.++.++.. ....|++..+|. ..|| T Consensus 51 ~v~~----~~al~~~~V~~~i~~Ia~~i-----------a~lp~~~y~~~~~g~~----~~~~~~l~~lL~-----~~PN 106 (432) T protein:vir:10 51 AVNA----DAIMRLDAVAACVKLVSQAI-----------AAMPLTMYMRTPDGRK----EAVNHPLYTLLL-----DGPN 106 (432) T ss_pred ccch----hhhhcchHHHHHHHHHHHhh-----------hhCceeEEEecCCCcc----cccccHHHHHHH-----hccc Confidence 1111 12334466788887666554 4567777665554321 223355555543 3688 Q ss_pred ccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccc Q lcl|NC_020081. 155 TRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKE 234 (552) Q Consensus 155 ~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~e 234 (552) ++||+++||+.++.+++++||+|++++|+ +|++.+||||+|++|++..+.+|.. .|++...++..+.|+++| T Consensus 107 ~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~~~~v~v~~~~~g~~-------~y~~~~~~g~~~~~~~~~ 178 (432) T protein:vir:10 107 STQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTKGNT-------AYRYRRTDGQMIDIPKQQ 178 (432) T ss_pred ccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEEcCCCcE-------EEEEEecCceEEEEcCcc Confidence 99999999999999999999999999997 5999999999999999999888753 345555567778899999 Q ss_pred eeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc Q lcl|NC_020081. 235 MAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGI 314 (552) Q Consensus 235 vi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~ 314 (552) |||++.+ +.++++|+||+..+..+|..+.++++|+.++|+||++|+|||++++ .+++++++++++.| .|. T Consensus 179 iih~~~~----~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~--~l~~e~~~~~~~~~----~~~ 248 (432) T protein:vir:10 179 IWKIMGY----SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR--FLTDDQYDSFAKKV----SGS 248 (432) T ss_pred EEEecCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC--CCCHHHHHHHHHHH----hhh Confidence 9998643 4567999999999999999999999999999999999999999876 46888888877766 466 Q ss_pred cccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHH Q lcl|NC_020081. 315 NGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKD 394 (552) Q Consensus 315 ~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~ 394 (552) .|+|+++|+ ++|++|++++++++|+||+|++++++++||++|||||++||+.+.+++ .+++|+|++.+.|++ T Consensus 249 ~nag~~~vl-~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~-------~~~sn~e~~~~~f~~ 320 (432) T protein:vir:10 249 VEAGRAPLL-EGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTT-------SWGSGIESQQLGFLS 320 (432) T ss_pred hhCCCceec-CCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcc-------cccchHHHHHHHHHH Confidence 789997655 579999999999999999999999999999999999999999887653 346889999999999 Q ss_pred HHhhHHHHHHHHHHHhhcCcccc-cceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCee- Q lcl|NC_020081. 395 KGLEPLLKFIEDAVNKYIVSQFG-GDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVT- 467 (552) Q Consensus 395 ~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~- 467 (552) +||.||++.||++||++|++..+ .+++|+| +++|.+++++.+..+ +.+|+||+||+|+++||||++|||.+ T Consensus 321 ~tl~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~--~~~G~~T~NE~R~~~glppi~g~~~~~ 398 (432) T protein:vir:10 321 MTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQL--VNNGLMTRDEAREIEGLPKLGGNAAVL 398 (432) T ss_pred HHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHH--HhCCCCCHHHHHHHhCCCCCCCCcceE Confidence 99999999999999999997644 3566666 478888888877654 45799999999999999999987654 Q ss_pred eccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCC Q lcl|NC_020081. 468 LAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGK 522 (552) Q Consensus 468 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (552) .++.++++++...+.... ++....+ ++..++.. + T Consensus 399 ~~~~~~~pl~~~~~~~~~------~~~~~~~-----~~~~~~~~----------~ 432 (432) T protein:vir:10 399 TVQSAMVPLDSIGLQASP------EPASGLG-----NQQQDKVS----------K 432 (432) T ss_pred eecCcccchhhhcccCCC------CCCCCCC-----Cccccccc----------C Confidence 478888887754321100 0000000 00000000 0 No 24 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=100.00 E-value=4.4e-79 Score=450.16 Aligned_cols=416 Identities=11% Similarity=0.100 Sum_probs=301.9 Q ss_pred cccccchhhcccccCcccccccccchhhh-hccccccccccc-cccccccccccccCCcccccccCCCCchHHHHHHHhh Q lcl|NC_020081. 8 FKGRKQQDNIIDINDDMAVRIKQIEEDAI-LKKGKNTKSNKP-KAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWS 85 (552) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a 85 (552) +..-+|-.+| ....-.=.++ +++....+.... .....|+. +.. +..+..... ..| T Consensus 1 ~~~~~~~~~~---------~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~------~~~~~~v~~----~~a 57 (424) T protein:vir:18 1 MEEPKYTIDL---------RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVS----AHG------HLGDSSIND----ERI 57 (424) T ss_pred CCCCcceEee---------cCCCchHHHHHhhhcccccccccccccccccc----ccc------ccccccccH----HHh Confidence 3333443332 2211111222 111101110000 01111221 111 011111111 122 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHH Q lcl|NC_020081. 86 RKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKK 165 (552) Q Consensus 86 ~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~ 165 (552) ...+.+.+||.+++.. ++++++.+...+.++.. +.....|++..+|. ..||+.||+++||+. T Consensus 58 l~~~~v~~cv~~Ia~~-----------iA~lp~~~~~~~~~~~~--~~~~~~~~l~~lL~-----~~PN~~~t~~~f~~~ 119 (424) T protein:vir:18 58 LQISTVWRCVSLISTL-----------TACLPLDVFETDQNDNR--KKVDLSNPLARLLR-----YSPNQYMTAQEFREA 119 (424) T ss_pred hccHHHHHHHHHHHHh-----------hccCceEEEEeecCCce--eeeccccHHHHHHh-----hccCCCCCHHHHHHH Confidence 3345678888766555 45667776544433221 11112355555543 368899999999999 Q ss_pred HHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCC Q lcl|NC_020081. 166 LVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTD 245 (552) Q Consensus 166 ~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~ 245 (552) ++.+++++||+|++|+|+..|+|++||||+|.+|++..+. + ..+|.+.. ++....|+++||||++. + T Consensus 120 ~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~V~v~~~~-~-------~~~y~~~~-~g~~~~~~~~eIih~r~-~--- 186 (424) T protein:vir:18 120 MTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG-K-------KVVYRYQR-DSEYADFSQKEIFHLKG-F--- 186 (424) T ss_pred HHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcC-C-------eEEEEEEe-CCeEEEeccccEEEecC-c--- Confidence 9999999999999999999999999999999999987653 2 22344433 45667899999999874 2 Q ss_pred ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecc Q lcl|NC_020081. 246 LTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITA 325 (552) Q Consensus 246 ~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~ 325 (552) ..++++|+||+..+..+|..+.++++++.++|+||++|+|||+++.. .+++++++++++.|++.++| .|+|+++|| + T Consensus 187 ~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~-~l~~e~~~~~~~~~~~~~~g-~nag~~~vl-~ 263 (424) T protein:vir:18 187 GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAGG-PVKKRLWIL-E 263 (424) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCc-CCCHHHHHHHHHHHHHHhCC-cccCCceec-c Confidence 34679999999999999999999999999999999999999998754 46899999999999987765 689998766 5 Q ss_pred CCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_020081. 326 EDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIE 405 (552) Q Consensus 326 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie 405 (552) +|++|+++++++.|+||+|++++++++||++|||||++||+.+++++ .++|++++.+.|+++||.||++.|| T Consensus 264 ~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~--------~~sn~eq~~~~f~~~tl~P~~~~ie 335 (424) T protein:vir:18 264 AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS--------WGSGIEQQNLGFLQYTLQPYISRWE 335 (424) T ss_pred CCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccc--------ccccHHHHHHHHHHHHHHHHHHHHH Confidence 79999999999999999999999999999999999999999887653 4578999999999999999999999 Q ss_pred HHHHhhcCcccc-cceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhh Q lcl|NC_020081. 406 DAVNKYIVSQFG-GDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQI 479 (552) Q Consensus 406 ~~ln~~L~~~~~-~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~ 479 (552) ++||++|++..+ .+++|+| +++|.+++++.+..+ +.+|+||+||+|+++||||+||||++++++|+++++.+ T Consensus 336 ~~l~~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~--~~~G~~T~NE~R~~~gl~pi~gGD~~~~~~n~~~l~~~ 413 (424) T protein:vir:18 336 NSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAM--GEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDL 413 (424) T ss_pred HHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcCeeeeccCccchHhh Confidence 999999998765 3456665 578888888877754 45799999999999999999999999999999998765 Q ss_pred ccccccccccCCCCCcc Q lcl|NC_020081. 480 MQQEQVEYQRQMDANQF 496 (552) Q Consensus 480 ~~~~~~~~~~~~~~~~~ 496 (552) ..... +.++.+ T Consensus 414 ~~~~~------p~~~ga 424 (424) T protein:vir:18 414 GTNKE------PRNNGA 424 (424) T ss_pred hccCC------CccCCC Confidence 32110 000000 No 25 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=100.00 E-value=7.5e-79 Score=448.90 Aligned_cols=411 Identities=16% Similarity=0.188 Sum_probs=299.8 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) |++++ +|+++..+ .. .....+.+.+ ...+...+...+.+ T Consensus 1 m~~~~-~~~~~~~~--------------------------------~~-~~~~~~~~~~------~~~~~~~g~~v~~~- 39 (419) T protein:vir:57 1 MFIPQ-FWKGRPSE--------------------------------NR-VNWQVVPGGM------RSSSSQAGVIITPE- 39 (419) T ss_pred Ccchh-hhccCCcc--------------------------------cc-cccccccccc------ccccccCCceechH- Confidence 44444 33332221 00 0001111110 01111111111111 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) .+...+.+++||.+++..+ +++++.+..++.++.. +....|++..+|. .+||++||++ T Consensus 40 ---~al~~~~v~~~i~~ia~~i-----------a~lp~~~~~~~~~g~~---~~~~~~~l~~lL~-----~~PN~~~t~~ 97 (419) T protein:vir:57 40 ---TALALSAVRACVTLLAESV-----------AQLPCVLYRRTENGGR---EIAFDHPLHDLIR-----YQPNRKDTAF 97 (419) T ss_pred ---HhhccHHHHHHHHHHHHhh-----------ccCceEEEEEcCCCce---eccccchHHHHHh-----hccccCCCHH Confidence 1223456788887766654 4567776554444321 2223455555543 3688999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecc Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS 240 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~ 240 (552) +||+.++.+++++||+|++|+|+..|+|++||||+|++|++..+.+|.. |+++...+ ..++++||+|++. T Consensus 98 ~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~g~~--------~y~~~~~~--~~~~~~~vih~r~ 167 (419) T protein:vir:57 98 EYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPDGMP--------YYDIPSIG--EILPMRMVHHIKS 167 (419) T ss_pred HHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCCceE--------EEEEcCCc--eEEchhhEEEecC Confidence 9999999999999999999999999999999999999999998887643 23333333 4588999999874 Q ss_pred cccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC--CCCCHHHHHHHHHHHHHHhccccccc Q lcl|NC_020081. 241 NPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTG--QEQSNQALTSFRREWTSMFSGINGAW 318 (552) Q Consensus 241 ~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~--~~~s~~~~~~~~~~~~~~~~G~~nag 318 (552) + +.+++||+||+..+..+|..+.++++++.++|+||++|+|+|++++. ..+++++++++++.|.+.++|..|+| T Consensus 168 ~----~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag 243 (419) T protein:vir:57 168 F----SLDGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTERYGGVRNAF 243 (419) T ss_pred c----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHHhccccccc Confidence 3 35679999999999999999999999999999999999999998753 34689999999999999999999999 Q ss_pred cceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhh Q lcl|NC_020081. 319 KIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLE 398 (552) Q Consensus 319 k~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~ 398 (552) +++++ ++|++|++++++++|+||+|++++++++||++|||||++||..+.+ +++|+|++.+.|++.||. T Consensus 244 ~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~----------t~sn~e~~~~~f~~~~l~ 312 (419) T protein:vir:57 244 SVGML-QEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKS----------TNNNIEHQGLQYVIYTML 312 (419) T ss_pred cceec-CCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------ccccHHHHHHHHHHHHHH Confidence 97655 5799999999999999999999999999999999999999976544 478999999999999999 Q ss_pred HHHHHHHHHHHhhcCcccc-cceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeecccc Q lcl|NC_020081. 399 PLLKFIEDAVNKYIVSQFG-GDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVH 472 (552) Q Consensus 399 P~~~~ie~~ln~~L~~~~~-~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n 472 (552) |+++.||++||++|+++.+ .+++++| +++|.+++++.++.+ +.+|+||+||+|+++||||+||||++++|+| T Consensus 313 P~~~~ie~~l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~--~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n 390 (419) T protein:vir:57 313 AILKRHESAMMRDLLLPSERRDFYIEFNVSSLLRGDQKSRYESYALG--RQWGWLSVNDIRRMENLTPIPGGDKYLTPLN 390 (419) T ss_pred HHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcCeeeeccc Confidence 9999999999999987543 3566665 467888888776653 4579999999999999999999999999999 Q ss_pred ccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccccccccccc Q lcl|NC_020081. 473 VQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQ 537 (552) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 537 (552) ++++..+...... .+++.. +.+..+++++ T Consensus 391 ~~~~~~~~~~~~~------~~~~~~------------------------------~~~~~~~~~~ 419 (419) T protein:vir:57 391 MVDSKALTGIGKA------TPQQLK------------------------------DIEAILCTRN 419 (419) T ss_pred cccccccccccCC------CcccCc------------------------------chhhhhhccC Confidence 8775443321111 000000 0001111111 No 26 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=100.00 E-value=4.6e-79 Score=450.05 Aligned_cols=404 Identities=16% Similarity=0.119 Sum_probs=302.0 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVS 103 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~ 103 (552) |..+.+ +.+.. +.+ ........|... .|... +... . ..+.....+.+||.++++.+ T Consensus 1 MG~~~~------~~~~~-~~~-~~~~~~~~~~~~--~~~g~----~~~~-----~----~~al~~~~V~~~v~~Ia~~i- 56 (411) T protein:vir:81 1 MGWWSR------LTRFF-RPR-NETVDMTNPLLL--QWLGV----DPDT-----P----RNQLSEATYFACLKILSESL- 56 (411) T ss_pred CchHHH------HHhhc-cCc-ccccccchHHHH--HHhcC----cccC-----h----hhhhccHHHHHHHHHHHHhH- Confidence 333211 11110 000 011111111111 11100 0110 0 11122345678877666654 Q ss_pred HHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEEC Q lcl|NC_020081. 104 MFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYD 183 (552) Q Consensus 104 ~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~ 183 (552) +++++.+..++.++.. +...|++..+|. .+||+.||+++||+.++.+++++||+|++|+|+ T Consensus 57 ----------A~lp~~~~~~~~~~~~----~~~~~~l~~lL~-----~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~ 117 (411) T protein:vir:81 57 ----------GKLPLKMYQKTERGIV----KSDREELYNLLK-----LRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS 117 (411) T ss_pred ----------hhCceeEEEecCCcee----eecccHHHHHHh-----hccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec Confidence 4567777655544322 122355554443 368899999999999999999999999999998 Q ss_pred CCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHH Q lcl|NC_020081. 184 KLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHL 263 (552) Q Consensus 184 ~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i 263 (552) .|++.+||||+|+.|++..+++|.........+.+....++....|+++||||++.++ +.++++|+||+.++..+| T Consensus 118 -~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~eiih~k~~~---~~~~~~G~s~~~~~~~~i 193 (411) T protein:vir:81 118 -GPQLQALWILPSQYVTIVVDDRGLLGEKNAIWYRYNDPYDGKMYVFRNDEILHFKTSV---TFDGITGLSVRDVLKHTV 193 (411) T ss_pred -CCceEEEEEECCceEEEEEcCcccccccceEEEEEEecCCceEEEEccccEEEEcCCC---CCCCcccccHHHHHHHHH Confidence 6999999999999999999887755433222222333345667789999999998654 456799999999999999 Q ss_pred HHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHH Q lcl|NC_020081. 264 QYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFE 343 (552) Q Consensus 264 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~ 343 (552) ..+.++++++.++|+||++|+|+|++++ .++++++++++++|.+.++|.+|+|+++++ ++|++|+++++++.|+||+ T Consensus 194 ~~~~~~~~~~~~~f~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl-~~g~~~~~l~~~~~d~q~~ 270 (411) T protein:vir:81 194 DGALESQKFMNNLYKTGLTGKAVLEYTG--DLNQEARDRLVKGFEQFANGSKNAGKIIPV-PLGMKLVPLDIKLTDSQFF 270 (411) T ss_pred HHHHHHHHHHHHHHhccCCCceEEEeCC--CCCHHHHHHHHHHHHHHhcCccccCCceec-CCCceEEEccCCHHHHHHH Confidence 9999999999999999999999999876 468999999999999999999999997554 6799999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc--ccee Q lcl|NC_020081. 344 KWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG--GDYV 421 (552) Q Consensus 344 e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~--~~~~ 421 (552) |++++++++||++|||||++||+.+.+ +++|++++.+.|++.||.|+++.||++||++|++..+ .+++ T Consensus 271 e~~~~~~~~Ia~~fgVPp~~lg~~~~~----------t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~~ 340 (411) T protein:vir:81 271 ELKKYTALQIAAAFGIKPNQINDYEKS----------SYASAEAQNLAFYVDTLLYVLKQYEEEITYKILSNDLISQGHY 340 (411) T ss_pred HHHHHHHHHHHHHhCCCHHHhCCCCCC----------CchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhcCCCcE Confidence 999999999999999999999987654 5799999999999999999999999999999997543 4566 Q ss_pred ecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCC Q lcl|NC_020081. 422 FNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDA 493 (552) Q Consensus 422 ~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~ 493 (552) |+| ++.|.+++++.++.+ +.+|+||+||+|+++||||+||||++++++|+++++.+... ..++++. T Consensus 341 ~~fd~~~ll~~d~~~~~~~~~~~--~~~g~~t~NE~R~~~gl~p~~ggD~~~~~~n~~pl~~~~~~----~~kgGd~ 411 (411) T protein:vir:81 341 FKFNVNVILRADIKTQMDSLSTA--VQNGIMTPNEARDYLDMPADDYGNNLMANGNYIPLSMLGAN----YGKGGDS 411 (411) T ss_pred EEeechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCeeeeccCccchhhhhhh----hccCCCC Confidence 665 467888887766543 45799999999999999999999999999999998765321 1111211 No 27 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=100.00 E-value=1.3e-78 Score=447.56 Aligned_cols=410 Identities=14% Similarity=0.117 Sum_probs=297.2 Q ss_pred hccccccccccccccccccccccccCC-cccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNP-DFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKG 115 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~ 115 (552) |.+..+..+.. ..+.+. ..+|.. .+...++..+...+.+ -|...+.+.+||.++++.+ ++ T Consensus 1 ~~~~r~~~~~~--~~~~~~--~~~~~~~~~g~~~s~~~~~vt~~----~al~~~~v~~~v~~ia~~i-----------A~ 61 (419) T protein:vir:14 1 MFFSRQLLSNL--GQTQMS--AGGWVSALLGSSRSDSGQVVTPA----SALALTVLQNCVTLLAESI-----------AQ 61 (419) T ss_pred Ccccccccccc--cccccC--cchhhHHhhcCCCccCCcccchH----HhhccHHHHHHHHHHHHhh-----------cc Confidence 22211111111 111000 111111 1111222222222221 2334556788887766654 45 Q ss_pred cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEec Q lcl|NC_020081. 116 VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVD 195 (552) Q Consensus 116 ~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~ 195 (552) +++.++.++.++. .....|++..+|. .+||++||+++||+.++.+++++||+|++|+|+.+|+|++||||+ T Consensus 62 lp~~~~~~~~~~~----~~~~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~ 132 (419) T protein:vir:14 62 LPIELYERSGEDR----KPATDHPLYSILK-----YEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLD 132 (419) T ss_pred CceEEEEecCCcc----ccccccHHHHHHH-----hhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEec Confidence 6777755544332 2233456655554 368899999999999999999999999999999999999999999 Q ss_pred CceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 196 ASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNAR 275 (552) Q Consensus 196 p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~ 275 (552) |++|++..+.+|.. +|.+... ..++.++|+|++.+ +.++++|+||+..+..+|..+.++++++.+ T Consensus 133 ~~~v~v~~~~~~~~-------~y~~~~~----~~~~~~~i~h~~~~----~~dg~~G~s~i~~~~~~i~~~~~~~~~~~~ 197 (419) T protein:vir:14 133 NEAVTVMRGSDLKP-------VYRVRGS----DPMPQRLVHHVRWM----SINGYTGLSPVLLHANAIGHAQAIQQYAGK 197 (419) T ss_pred CceEEEEECCCceE-------EEEEccC----cccchhheeEecCc----CCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 99999998887743 2333222 23677888887643 457799999999999999999999999999 Q ss_pred HHhccCCCceEEEeCCCCC--CCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHH Q lcl|NC_020081. 276 FFAQGGTTRGLLHIKTGQE--QSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVI 353 (552) Q Consensus 276 ~f~ng~~p~gil~~~~~~~--~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~I 353 (552) +|+||++|+|+|++++... .++++++++++.|++.++|.+|+|+++++ ++|++|+++++++.|+||+|++++++++| T Consensus 198 ~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~I 276 (419) T protein:vir:14 198 SFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAKFGGSGNAKKVALL-QEGMTFRPLSMTNVDAALIDALRLSALDI 276 (419) T ss_pred HHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceec-CCCceEEEccCChhhHHHHHHHHHHHHHH Confidence 9999999999999976432 36889999999999999999999998766 56999999999999999999999999999 Q ss_pred HHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecc-----ccc Q lcl|NC_020081. 354 CSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNF-----VGG 427 (552) Q Consensus 354 a~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f-----~~~ 427 (552) |++|||||++||..+.+ +++|+|++.+.|+++||.|++++||++||++|+++.+ .+++++| +++ T Consensus 277 a~~fgVpp~~lg~~~~~----------t~s~~E~~~~~f~~~~L~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~r~ 346 (419) T protein:vir:14 277 ARIYKIPAHMVNELERA----------TFSNIEHQSLQFVIYTLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRG 346 (419) T ss_pred HHHhCCCHHHhcCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhcc Confidence 99999999999976544 4789999999999999999999999999999997643 3566665 467 Q ss_pred ChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCC Q lcl|NC_020081. 428 DAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNM 507 (552) Q Consensus 428 d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (552) |.+++++.++.+ +.+|+||+||+|+++|+||+||||++++|+|+++++...+.+..+ ++++ T Consensus 347 d~~~~~~~~~~~--~~~G~~T~NE~R~~~gl~p~~gGD~~~~~~n~~~~~~~~~~~~~~-----------------~~~~ 407 (419) T protein:vir:14 347 DQSSRYAAYAVG--RQWGWLSINDIRRLENMPPVKGGDIYLSPMNMVDASKPQQLPVGK-----------------SEPT 407 (419) T ss_pred CHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcCeeeeccccccccccccccCCC-----------------CCCc Confidence 888888877654 457999999999999999999999999999988766443211110 0000 Q ss_pred CCCCCCCCcccccC Q lcl|NC_020081. 508 DNVNGKDSFNQNVG 521 (552) Q Consensus 508 ~~~~~~~~~~~~~~ 521 (552) .. +.++..+-.+ T Consensus 408 ~~--~~~e~~~~l~ 419 (419) T protein:vir:14 408 KA--AIDEIGRILS 419 (419) T ss_pred cc--cccchhcccC Confidence 00 0000000000 No 28 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=100.00 E-value=1.3e-78 Score=447.58 Aligned_cols=413 Identities=16% Similarity=0.170 Sum_probs=295.3 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCc------ccccccCCCC Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPD------FKEAPSIHGK 74 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~ 74 (552) ||+++ |.+. ....-.+..+.++.++.|. +...++..+. T Consensus 7 ~g~~~-~~~~-----------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 50 (432) T protein:vir:97 7 LGLLG-QLKA-----------------------------------MFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGA 50 (432) T ss_pred Cchhh-hhHh-----------------------------------hcCCccccccccccccccCchhhhhhcccccccCc Confidence 44443 2111 1000000011111111110 0001111121 Q ss_pred chHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCC Q lcl|NC_020081. 75 QNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDF 154 (552) Q Consensus 75 ~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn 154 (552) ..+. ..+...+.+.+||.+++..+ +.+++.++.++.++.. ....|++..+|. .+|| T Consensus 51 ~v~~----~~a~~~~aV~~~v~~Ia~~i-----------a~lp~~~y~~~~~g~~----~~~~~pl~~lL~-----~~PN 106 (432) T protein:vir:97 51 AVNA----DAIMRLDAVAACVKLVSQAV-----------AAMPLMMYMRTPDGRK----EAVNHPLYTLLL-----DGPN 106 (432) T ss_pred ccch----HhhhcchHHHHHHHHHHHhh-----------ccCceEEEEecCCCcc----cccccHHHHHHH-----hccc Confidence 1111 12334466788887666554 5567777665554321 223355555543 3688 Q ss_pred ccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccc Q lcl|NC_020081. 155 TRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKE 234 (552) Q Consensus 155 ~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~e 234 (552) ++||+++||+.++.+++++||+|++++|+ +|++.+||||+|+.|++..+.+|.. +|+....++..+.|+++| T Consensus 107 ~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~p~~v~v~~~~~g~~-------~y~~~~~~g~~~~~~~~~ 178 (432) T protein:vir:97 107 STQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTKGNT-------AYRYRRTDGQMIDIPRQQ 178 (432) T ss_pred ccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEEcCCCcE-------EEEEEecCceEEEEcccc Confidence 99999999999999999999999999997 5999999999999999999888753 455555666778899999 Q ss_pred eeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc Q lcl|NC_020081. 235 MAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGI 314 (552) Q Consensus 235 vi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~ 314 (552) |||++.+ +.++++|+||+..+..+|..+.+++++..++|+||++|+|||++++ .+++++++++++.| .|. T Consensus 179 iih~r~~----~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~--~l~~e~~~~~~~~~----~~~ 248 (432) T protein:vir:97 179 IWKIMGY----SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR--FLTDDQYDSFSKKV----SGS 248 (432) T ss_pred EEEecCc----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCC--CCCHHHHHHHHHHH----hhh Confidence 9998643 4567999999999999999999999999999999999999999876 46888877776655 567 Q ss_pred cccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHH Q lcl|NC_020081. 315 NGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKD 394 (552) Q Consensus 315 ~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~ 394 (552) .|+|+++++ ++|++|+++++++.|+||+|++++++++||++|||||++||+.+.++. .+++|++++.+.|++ T Consensus 249 ~nag~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~-------~~~s~~e~~~~~f~~ 320 (432) T protein:vir:97 249 VEAGRAPLL-EGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTT-------SWGSGIESQQLGFLT 320 (432) T ss_pred hcCCCceec-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccc-------ccchhHHHHHHHHHH Confidence 789997665 579999999999999999999999999999999999999999876652 346889999999999 Q ss_pred HHhhHHHHHHHHHHHhhcCcccc-cceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeee Q lcl|NC_020081. 395 KGLEPLLKFIEDAVNKYIVSQFG-GDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTL 468 (552) Q Consensus 395 ~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~ 468 (552) +||.||++.||++||++|++..+ ..++|+| +++|.+++++.+..+ +.+|+||+||+|+++||||++|||.++ T Consensus 321 ~tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~--~~~G~~T~NE~R~~~glpp~~g~~~~~ 398 (432) T protein:vir:97 321 MTLSPWLRRIEQSIALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQL--VNNGLMTRDEAREIEGLPKLGGNAAVL 398 (432) T ss_pred HHHHHHHHHHHHHHhhhccCccccCceEEEeechhhhccCHHHHHHHHHHH--HhCCCCCHHHHHHHhCCCCCCCCcceE Confidence 99999999999999999997644 3466665 578888888877654 457999999999999999999887654 Q ss_pred -ccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCC Q lcl|NC_020081. 469 -AGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGK 522 (552) Q Consensus 469 -~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (552) ++.++++++.+...... ++..... ++..++. .+ T Consensus 399 ~~~~~~~pl~~~~~~~~~------~~~~~~~-----~~~~~~~----------~~ 432 (432) T protein:vir:97 399 TVQSAMVPLDSIGLQASP------EPASGLG-----NQQQDKV----------SK 432 (432) T ss_pred eecccccchhhhcccCCC------CCCCCCC-----Ccccccc----------cC Confidence 78888887755331100 0000000 0000000 00 No 29 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=100.00 E-value=2.9e-78 Score=445.64 Aligned_cols=407 Identities=14% Similarity=0.171 Sum_probs=301.8 Q ss_pred hhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_020081. 35 AILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDK 114 (552) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~ 114 (552) .+...+.++++.............+.+ ++ ++..+..... ..+...+.+.+||..+++.+ + T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~--~~---~~~~g~~v~~----~~~l~~~~v~~~i~~Ia~~i-----------A 60 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGL--SY---DTYTGKRISS----QRAMRLTAVYSCVRVLAESV-----------G 60 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhc--Cc---ccccCceech----hhhhccHHHHHHHHHHHHhh-----------h Confidence 444455444433221111111111111 11 1112211111 11223456788887766654 4 Q ss_pred ccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEe Q lcl|NC_020081. 115 GVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAV 194 (552) Q Consensus 115 ~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l 194 (552) ++++.+..++.... .....|++..+|. ..||++||+++||+.++.+++++||+|++++|+ .|+|++|||| T Consensus 61 ~~p~~~~~~~~~~~----~~~~~~~~~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l 130 (413) T protein:vir:48 61 MLPCSLYKISGTLK----TRVVDERLHKLVS-----AKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPI 130 (413) T ss_pred hCceEEEEecCCcc----eeecccHHHHHHH-----hhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEE Confidence 55666654443322 1223455655553 368899999999999999999999999999997 6899999999 Q ss_pred cCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 195 DASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNA 274 (552) Q Consensus 195 ~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~ 274 (552) +|++|++..+.++. ..|++...++....|+++||||++.+ ..++++|+||+..+..+|..+.++++++. T Consensus 131 ~~~~v~~~~~~~~~-------~~y~~~~~~g~~~~~~~~evih~~~~----~~d~~~G~s~i~~~~~~i~~~~~~~~~~~ 199 (413) T protein:vir:48 131 DPGCVEPKLNSQWQ-------PVYQVTFPDGSVDVLTQDEIWHVRTL----TLDGLVGLNPIAYAREAISLAAATEEHGA 199 (413) T ss_pred cCceEEEEEcCCce-------EEEEEEecCceEEEEccccEEEecCc----CCCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 99999999887764 34555666677788999999999753 34678999999999999999999999999 Q ss_pred HHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHH Q lcl|NC_020081. 275 RFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVIC 354 (552) Q Consensus 275 ~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia 354 (552) ++|+||++|+|||++++ .+++++.+++++.|++.++|..|+|++++ +++|++|++++.++.|+||+|++++++++|| T Consensus 200 ~~~~ng~~p~gil~~~~--~~~~e~~~~~~~~~~~~~~g~~n~g~~~v-l~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia 276 (413) T protein:vir:48 200 RLFGNGAVTSGVLRTEQ--KLTPDAYERLKKDFEERHTGLGNAHRPMI-LEMGLDWKSMALNAEDSQFLETRKFQLEEIC 276 (413) T ss_pred HHHhccCCcceEEEeCC--CCCHHHHHHHHHHHHHHhcCccccCccee-cCCCceEEeccCChhHHHHHHHHHHHHHHHH Confidence 99999999999999876 45899999999999999999999999755 5679999999999999999999999999999 Q ss_pred HHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecc-----cccC Q lcl|NC_020081. 355 SIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNF-----VGGD 428 (552) Q Consensus 355 ~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f-----~~~d 428 (552) ++|||||++||..+.+ +++|++++.+.|++.||.|+++.||++||++|+++.+ .+++|+| +++| T Consensus 277 ~~fgVPp~~lg~~~~~----------t~~n~e~~~~~f~~~~i~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~d 346 (413) T protein:vir:48 277 RLFRVPLHMVQNTDRA----------TFNNIEELGLGFINYSLVPYLTRIEQRINTGLVRESKQGKFYAKFNAGALLRGD 346 (413) T ss_pred HHhCCCHHHhCCCcCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccccCCeEEEEechhhhccC Confidence 9999999999976543 5789999999999999999999999999999997654 3556665 4667 Q ss_pred hHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCC Q lcl|NC_020081. 429 AKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMD 508 (552) Q Consensus 429 ~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (552) .+++++.++.+ +.+|+||+||+|+++|+||+||||++++++|++++..+..... .+++ T Consensus 347 ~~~~~~~~~~~--~~~g~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~--------------------~~~~ 404 (413) T protein:vir:48 347 MKSRFEAYATG--INWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMTTSPSAGDDNG--------------------KKKE 404 (413) T ss_pred HHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcceeeccccccccccccccCC--------------------CCCC Confidence 78887766543 4579999999999999999999999999999877653321100 0000 Q ss_pred CCCCCCCcc Q lcl|NC_020081. 509 NVNGKDSFN 517 (552) Q Consensus 509 ~~~~~~~~~ 517 (552) ..+..++.. T Consensus 405 ~~~~~~~~~ 413 (413) T protein:vir:48 405 SGDADKTAS 413 (413) T ss_pred CCCccccCC Confidence 000000000 No 30 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=100.00 E-value=2.4e-78 Score=446.10 Aligned_cols=408 Identities=16% Similarity=0.193 Sum_probs=300.3 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||++++-|+.+.= ..+. ........+.. .+ +...+.....+ T Consensus 1 Mg~f~~lf~r~~~--------------------~~~~---------~~~~~~~~~~~------~~---~~~~g~~v~~~- 41 (414) T protein:vir:44 1 MVFFSGLFQRKSD--------------------APVT---------TPAELADAIGL------SY---DTYTGKQISSQ- 41 (414) T ss_pred CchhhhhhccCcc--------------------Cccc---------chhhHhHhhcc------Cc---cccCCceechh- Confidence 8888744443210 0000 00000000000 00 01111111111 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) .+...+.+++||.++++.+ +++++.+..++.+.. .....|++..+|. .+||++||++ T Consensus 42 ---~al~~~~v~~~i~~Ia~~i-----------a~~p~~~~~~~~~~~----~~~~~~~~~~lL~-----~~PN~~~t~~ 98 (414) T protein:vir:44 42 ---RAMRLTAVFSCVRVLAESV-----------GMLPCNLYHLNGSLK----QRATGERLHKLIS-----THPNGYMTPQ 98 (414) T ss_pred ---hhhccHHHHHHHHHHHHHh-----------ccCceEEEEecCCce----eecccchHHHHHH-----hhcccCCCHH Confidence 1223556788887766554 456676655544332 1233455555443 3688999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecc Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS 240 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~ 240 (552) +||+.++.+++++||+|++|+|+ .|+|++||||+|..|++..+.+|.. .|.+...++....|+++||||++. T Consensus 99 ~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~~-------~y~~~~~~g~~~~~~~~evih~~~ 170 (414) T protein:vir:44 99 EFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNSSWEP-------VYQVTFPDGSTDVLSQEDIWHVRT 170 (414) T ss_pred HHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECCCCcE-------EEEEEecCceEEEEccccEEEecC Confidence 99999999999999999999987 6999999999999999988877643 455555666778899999999874 Q ss_pred cccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccc Q lcl|NC_020081. 241 NPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKI 320 (552) Q Consensus 241 ~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~ 320 (552) + +.++++|+||+..+..+|..+.++++++.++|+||++|+|||++++ .++++++++++++|++.++|..|+|++ T Consensus 171 ~----~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~ 244 (414) T protein:vir:44 171 L----TLDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQ--TLSDQAYERLKKDFEERHTGLGNAHRP 244 (414) T ss_pred C----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC--CCCHHHHHHHHHHHHHHhcCccccCcc Confidence 3 3567999999999999999999999999999999999999999876 468999999999999999999999997 Q ss_pred eeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHH Q lcl|NC_020081. 321 PVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPL 400 (552) Q Consensus 321 ~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~ 400 (552) +++ ++|++|+++++++.|+||+|.+++++++||++|||||++||+.+.+ +++|++++.+.|++.||+|+ T Consensus 245 ~vl-~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~----------t~~n~e~~~~~~~~~~l~P~ 313 (414) T protein:vir:44 245 MIL-EMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRA----------TFNNIEELGLGFINYSLVPY 313 (414) T ss_pred eec-CCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHH Confidence 555 6799999999999999999999999999999999999999976543 57999999999999999999 Q ss_pred HHHHHHHHHhhcCccccc-ceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeecccccc Q lcl|NC_020081. 401 LKFIEDAVNKYIVSQFGG-DYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQ 474 (552) Q Consensus 401 ~~~ie~~ln~~L~~~~~~-~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~ 474 (552) ++.||++||++|+++.+. +++++| +++|.+++++.++.+ +.+|+||+||+|+++||||+||||+++++.|+. T Consensus 314 ~~~ie~~ln~~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~--~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~ 391 (414) T protein:vir:44 314 LTRIEQRINTGLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATG--INWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMT 391 (414) T ss_pred HHHHHHHHHhhcCCccccCceEEEEechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcceeccccccc Confidence 999999999999987653 455555 467888887766643 457999999999999999999999999999876 Q ss_pred chhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCccc Q lcl|NC_020081. 475 RLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQ 518 (552) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (552) ....... ....++.+.+.+++ + + T Consensus 392 ~~~~~~~---------------~~~~~~~~~~~d~~-----~-~ 414 (414) T protein:vir:44 392 TKPSDGS---------------KAGKQKDNANADET-----T-S 414 (414) T ss_pred ccCCccc---------------cCCCCCCCCCCCCC-----C-C Confidence 5431110 00000001111110 0 0 No 31 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=100.00 E-value=2.7e-78 Score=445.82 Aligned_cols=417 Identities=12% Similarity=0.110 Sum_probs=300.0 Q ss_pred cccccchhhcccccCcccccccccchhhhhcccccccccccccccccc-ccccccCCcccccccCCCCchHHHHHHHhhc Q lcl|NC_020081. 8 FKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPI-IGSMSMNPDFKEAPSIHGKQNLLQMLKLWSR 86 (552) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~ 86 (552) +..-+|-++|. .+.-.-.++ +.+..+. +...+.... .+..++. ++ ..+...+. .-|. T Consensus 1 ~~~~~~~~~~~---------~~~g~~~~~-~~~f~~~--~~~~~~~~~~~~~~~~~-~~-----~~~~~v~~----~~al 58 (424) T protein:vir:18 1 MEEPKYTIDLR---------TNNGWWARL-KSWFVGG--RLVTPNQGSQTGPVSAH-GY-----LGDSSIND----ERIL 58 (424) T ss_pred CCCCccccccC---------CCCchHHHH-Hhhcccc--ccccccchhhccccccc-cc-----cccccccH----HHhh Confidence 33444433332 111001111 1111111 111111100 0111111 01 01111011 1223 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHH Q lcl|NC_020081. 87 KNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKL 166 (552) Q Consensus 87 ~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~ 166 (552) ..+.+.+||..++..+ +++++.+...+.++.. +.....|++..+|. ..||+.||+++||+.+ T Consensus 59 ~~~~v~~cv~~Ia~~i-----------A~lp~~vy~~~~~~~~--~~~~~~~~l~~lL~-----~~PN~~~t~~~f~~~~ 120 (424) T protein:vir:18 59 QISTVWRCVSLISTLT-----------ACLPLDVFETDQNDNR--KKVDLSNPLARLLR-----YSPNQYMTAQEFREAM 120 (424) T ss_pred ccHHHHHHHHHHHHhh-----------ccCceEEEEeccCCce--eeeccccHHHHHHh-----hccCCCCCHHHHHHHH Confidence 4456788887666554 4566666544433211 11112355555443 3688999999999999 Q ss_pred HHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCc Q lcl|NC_020081. 167 VRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDL 246 (552) Q Consensus 167 v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~ 246 (552) +.+++++||+|++|+|+..|+|++||||+|.+|++..+.+ ..+|.+.. ++....|+++||||++. + + T Consensus 121 ~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~--------~~~y~~~~-~g~~~~~~~~eVihir~-~---~ 187 (424) T protein:vir:18 121 TMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--------KVVYRYQR-DSEYADFSQKEIFHLKG-F---G 187 (424) T ss_pred HHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcCC--------eEEEEEEe-CCeEEEeccccEEEecC-c---C Confidence 9999999999999999999999999999999999876532 23344433 45667899999999864 2 3 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccC Q lcl|NC_020081. 247 TVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAE 326 (552) Q Consensus 247 ~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~ 326 (552) .++++|+||+..+..+|..+.++++|+.++|+||++|+|+|+++.. .+++++++++++.|++.++| .|+|+++|| ++ T Consensus 188 ~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-~l~~e~~~~~~~~~~~~~~~-~nag~~~vl-~~ 264 (424) T protein:vir:18 188 FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAGG-PVKKRLWIL-EA 264 (424) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCc-CCCHHHHHHHHHHHHHHhCC-cccCCceec-cC Confidence 4679999999999999999999999999999999999999999754 46899999999999987654 789997666 57 Q ss_pred CceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_020081. 327 DVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIED 406 (552) Q Consensus 327 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~ 406 (552) |++|+++++++.|+||+|++++++++||++|||||++||+.+++++ .++|++++.+.|+++||.||+++||+ T Consensus 265 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~--------~~sn~eq~~~~f~~~tl~P~~~~ie~ 336 (424) T protein:vir:18 265 GFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS--------WGSGIEQQNLGFLQYTLQPYISRWEN 336 (424) T ss_pred CceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCccc--------ccccHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999877653 45789999999999999999999999 Q ss_pred HHHhhcCcccc-cceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhc Q lcl|NC_020081. 407 AVNKYIVSQFG-GDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIM 480 (552) Q Consensus 407 ~ln~~L~~~~~-~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~ 480 (552) +||++|++..+ .+++|+| +++|.+++++.+..+ +.+|+||+||+|+++||||+||||++++++|+++++.+. T Consensus 337 ~ln~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~--~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~n~~~l~~~~ 414 (424) T protein:vir:18 337 SIQRWLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAM--GESGLRTINEMRRTDNMPPLPGGDVAMRQAQYVPITDLG 414 (424) T ss_pred HHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcCeeeeccCccchhhhh Confidence 99999998765 3456665 467888888877754 457999999999999999999999999999999987654 Q ss_pred cccccccccCCCCCcc Q lcl|NC_020081. 481 QQEQVEYQRQMDANQF 496 (552) Q Consensus 481 ~~~~~~~~~~~~~~~~ 496 (552) .... ..++.+ T Consensus 415 ~~~~------~~~n~a 424 (424) T protein:vir:18 415 TNKE------PRNNGA 424 (424) T ss_pred ccCC------ccccCC Confidence 3210 000000 No 32 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=100.00 E-value=7.2e-78 Score=443.49 Aligned_cols=447 Identities=14% Similarity=0.162 Sum_probs=304.5 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||+++ ++.++.-+. ... ....+.-.+..+... . .-.++..+...+.+ T Consensus 1 Mg~~~-~l~~r~~~~--------------------~~~----~~~~~~~~~~~~~~~--~-----~~~~~~~g~~V~~~- 47 (457) T protein:vir:13 1 MGFWS-ALFGRGHSP--------------------ALD----GIEARAWEPYDPSIY--N-----LGAVAASGETVTPH- 47 (457) T ss_pred Cchhh-hhhcccccc--------------------ccc----ccccccccccchHHH--h-----hcccccCCceechH- Confidence 88777 343322200 000 000011111111100 0 00112222222221 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) .|...+.+.+||.+++..+ +++++.+..++.... .....+. ++.+++ .|++.||++ T Consensus 48 ---~al~~~~V~~~v~~Ia~~i-----------A~lp~~~~~~~~~~~----~~~~~~~---l~~~ln---~~~n~~t~~ 103 (457) T protein:vir:13 48 ---DALQVSAVFASVRLLSETI-----------ATLPLSTYSKRGGSR----KEIVTPE---WLDYPN---AEPGGMGRI 103 (457) T ss_pred ---HhhccHHHHHHHHHHHHhh-----------ccCceEEEEecCCcc----cccccch---HHHhcc---ccCCCCCHH Confidence 1223455788887766654 456666654432221 1112222 334443 334479999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCce--EEEEcccceeee Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKV--VAKFKAKEMAWE 238 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~--~~~~~~~evi~~ 238 (552) +||+.++.+++++||+|++|+++ .|+|++||||+|.+|++..+.++..... ....|.....+.. ...|+++||||+ T Consensus 104 ~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~-~~~~y~~~~~~~~~~~~~~~~~diih~ 181 (457) T protein:vir:13 104 DILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVDGLRRK-VFEAYDIDADGNEVLLGWFTPRDVLHI 181 (457) T ss_pred HHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCCCccce-eEEEEEEecCCceeeEEeeCccceEEe Confidence 99999999999999999999876 5999999999999999987765543221 1122333333332 346899999998 Q ss_pred cccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccc Q lcl|NC_020081. 239 VSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAW 318 (552) Q Consensus 239 ~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nag 318 (552) +.+. ....++|+||+.++..+|..+.++++|+.++|+||++|+|||++++ .+++++++++++.|++.++|.+|+| T Consensus 182 ~~~~---~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~ls~e~~~~~~~~~~~~~~g~~nag 256 (457) T protein:vir:13 182 PGMM---LPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPG--TMSEEGLARAREAWRAANSGVDNAH 256 (457) T ss_pred cCCC---CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCC--CCCHHHHHHHHHHHHHHhcCccccC Confidence 7542 2334899999999999999999999999999999999999999976 5699999999999999999999999 Q ss_pred cceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhh Q lcl|NC_020081. 319 KIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLE 398 (552) Q Consensus 319 k~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~ 398 (552) +++|| ++|++|+++++++.|+||+|++++++++||++|||||++||+.+.+++ .++|++++.+.|+++||. T Consensus 257 ~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--------~~sn~eq~~~~f~~~tl~ 327 (457) T protein:vir:13 257 RVALL-TEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTS--------WGSGLAEQNIAFTMFSLR 327 (457) T ss_pred cceec-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCccc--------ccchHHHHHHHHHHHHHH Confidence 98665 579999999999999999999999999999999999999999887653 357899999999999999 Q ss_pred HHHHHHHHHHHhhcCccccc-ceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCC--Ceeecc Q lcl|NC_020081. 399 PLLKFIEDAVNKYIVSQFGG-DYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGG--DVTLAG 470 (552) Q Consensus 399 P~~~~ie~~ln~~L~~~~~~-~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~gg--D~~~~~ 470 (552) ||+++||++||++|+++.+. .++++| +++|.+++++.+..+ +.+|+||+||+|+++||+|++|| |++++| T Consensus 328 P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~D~~~r~~~~~~~--~~~G~~T~NE~R~~~gl~Pi~~g~~d~~~~~ 405 (457) T protein:vir:13 328 PWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLG--LQNGIYSIDEVRAAEDMTPLPDGLGEKYRVP 405 (457) T ss_pred HHHHHHHHHHHHhhcCccccCceeEEeechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCcccceeec Confidence 99999999999999987654 345554 577888888877654 44799999999999999999987 999999 Q ss_pred ccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcccccc Q lcl|NC_020081. 471 VHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQA 530 (552) Q Consensus 471 ~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (552) +|+.+++.....+.........+.. .++...++..+..+.+.++++++++.. T Consensus 406 ~n~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~g~~d~~~~~~~~~~~~~ 457 (457) T protein:vir:13 406 LNLGEVGEEPEPEPAPAPPAIEPPA--------EEPDEEPEPEGKPDDEGATEEDDEDDA 457 (457) T ss_pred cccccccccccccccCCCCCCCCCc--------cccCCCCCCCCCCccccCCCCcccccC Confidence 9999987655433332221111100 000101111111111111222211111 No 33 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=100.00 E-value=5e-78 Score=444.39 Aligned_cols=413 Identities=16% Similarity=0.167 Sum_probs=294.5 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCC------cccccccCCCC Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNP------DFKEAPSIHGK 74 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~ 74 (552) ||+++ +.+....+..+-......++.| .+.-.++..+. T Consensus 7 mg~f~------------------------------------r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 50 (432) T protein:vir:81 7 LGLFG------------------------------------QLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGA 50 (432) T ss_pred cchhh------------------------------------hhhhhcccccccccccccccccCccchhhhcccccccCc Confidence 44444 1111100000000000000000 00001111111 Q ss_pred chHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCC Q lcl|NC_020081. 75 QNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDF 154 (552) Q Consensus 75 ~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn 154 (552) ..+. ..+...+.+.+||.++++.++ .+++.+..+..++.. ....|++..+|. ..|| T Consensus 51 ~v~~----~~al~~~~V~~~i~~Ia~~ia-----------~lp~~~y~~~~~g~~----~~~~~~l~~lL~-----~~PN 106 (432) T protein:vir:81 51 AVNA----DAIMRLDAVAACVKLVSQAIA-----------AMPLTMYMRTPDGRK----EAVNHPLYTLLL-----DGPN 106 (432) T ss_pred ccch----HhhhccHHHHHHHHHHHHhhh-----------hCceeeEEecCCcce----ecccchHHHHHH-----hccc Confidence 1111 123344567888877666654 456666544443321 123355555553 3688 Q ss_pred ccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccc Q lcl|NC_020081. 155 TRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKE 234 (552) Q Consensus 155 ~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~e 234 (552) ++||+++||+.++.+++++||||++++|+ +|+|++||||+|+.|++..+.+|.. +|..+..++....|+++| T Consensus 107 ~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l~~~~v~v~~~~~g~~-------~y~~~~~~g~~~~~~~~~ 178 (432) T protein:vir:81 107 STQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDPKGNT-------AYRYRRTDGQMIDIPKQQ 178 (432) T ss_pred ccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEECCCCcE-------EEEEEecCceEEEEcccc Confidence 99999999999999999999999999997 5999999999999999999888743 355555566778899999 Q ss_pred eeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc Q lcl|NC_020081. 235 MAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGI 314 (552) Q Consensus 235 vi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~ 314 (552) |||++.+ +.++++|+||+..+..+|..+.++++|+.++|+||++|+|||+++. .++++++++++++| .|. T Consensus 179 iih~r~~----~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~--~l~~e~~~~~~~~~----~~~ 248 (432) T protein:vir:81 179 IWKIMGY----SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR--FLTDDQYDSFAKKV----SGS 248 (432) T ss_pred EEEecCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC--CCCHHHHHHHHHHH----hhh Confidence 9998743 4567999999999999999999999999999999999999999875 46888888877766 466 Q ss_pred cccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHH Q lcl|NC_020081. 315 NGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKD 394 (552) Q Consensus 315 ~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~ 394 (552) .|+|+++++ ++|++|++++++++|+||+|++++++++||++|||||++||+.+.++. .+++|+|++.+.|++ T Consensus 249 ~nag~~~vl-~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~-------~~~sn~eq~~~~f~~ 320 (432) T protein:vir:81 249 VEAGRAPLL-EGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTT-------SWGSGIESQQLGFLT 320 (432) T ss_pred hcCCCceec-CCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccc-------cccchHHHHHHHHHH Confidence 789997655 579999999999999999999999999999999999999999887653 345789999999999 Q ss_pred HHhhHHHHHHHHHHHhhcCcccc-cceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCee- Q lcl|NC_020081. 395 KGLEPLLKFIEDAVNKYIVSQFG-GDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVT- 467 (552) Q Consensus 395 ~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~- 467 (552) .||.||++.||++||++|+++.+ .+++|+| +++|.+++++.+..+ +.+|+||+||+|+++||||++|||.+ T Consensus 321 ~tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~--~~~G~~t~NE~R~~~glpp~~g~~~~~ 398 (432) T protein:vir:81 321 MTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQL--VNNGLMTRDEAREIEGLPKLGGNAAVL 398 (432) T ss_pred HHHHHHHHHHHHHHHhhccCccccCceEEEeechhhhccCHHHHHHHHHHH--HhCCCCCHHHHHHHhCCCCCCCCcceE Confidence 99999999999999999997654 3566666 578899998877754 45799999999999999999987655 Q ss_pred eccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCC Q lcl|NC_020081. 468 LAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGK 522 (552) Q Consensus 468 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (552) .++.++++++......... + .....++..+++ ++ T Consensus 399 ~~~~~~~pl~~~~~~~~~~------~-----~~~~~n~~~~~~----------~~ 432 (432) T protein:vir:81 399 TVQSAMVPLDSIGLQASPE------P-----ASGLGNQQQDKV----------SK 432 (432) T ss_pred eecCcccchhhhccCCCCC------C-----CCCCCCcccccc----------cC Confidence 4688888876543211000 0 000000000000 00 No 34 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=100.00 E-value=8.6e-78 Score=443.09 Aligned_cols=409 Identities=14% Similarity=0.170 Sum_probs=299.4 Q ss_pred hhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_020081. 35 AILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDK 114 (552) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~ 114 (552) ++++.+.++++... ....++... +...|.-.++..+.....+ -+...+.+++||..+++.++ T Consensus 1 m~~~~~f~~~~~~~-~~~~~~~~~--~~~~~~~~~~~~~~~v~~~----~al~~~~v~~~i~~Ia~~ia----------- 62 (416) T protein:vir:12 1 MLLERMFEKRSGSS-DHEDGFNNI--LLNMFGGRKTASGERVSES----NSLVQPDIFACVNVLSDDIA----------- 62 (416) T ss_pred CccchhcccccCcc-ccCccchhH--HHHhhcCcccccCceechh----hhhccHHHHHHHHHHHHhhh----------- Confidence 66666655554432 111111100 0001111122222211111 12233456788877666654 Q ss_pred ccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEe Q lcl|NC_020081. 115 GVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAV 194 (552) Q Consensus 115 ~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l 194 (552) ++++.+..+...+. ...+.|++..+|. .+||+.||+++||+.++.+++++||+|++|+|+..|+|.+|||| T Consensus 63 ~l~~~~~~~~~~~~----~~~~~~~l~~~l~-----~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l 133 (416) T protein:vir:12 63 KLPIHTYKRTDGGI----ERKPEHKSAHAVY-----ARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPL 133 (416) T ss_pred hCceEEEEecCCcc----ccccccHHHHHHH-----hhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEE Confidence 45666543332221 1223345444432 36789999999999999999999999999999999999999999 Q ss_pred cCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 195 DASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNA 274 (552) Q Consensus 195 ~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~ 274 (552) +|.+|++..+.+++. + |+++..++..+.|+++||||++.+ +.++++|+||+.++..++..+.++++++. T Consensus 134 ~~~~v~v~~~~~~~~------~-~~~~~~~g~~~~~~~~eiih~~~~----~~~~~~G~s~i~~~~~~i~~~~~~~~~~~ 202 (416) T protein:vir:12 134 RPDYTNAYVHPTTGM------L-WYQTVLNGKAIELYDYEVLHFKGL----STDGIHGKSPIGVVREHIGAQAAATKYNA 202 (416) T ss_pred CCcceEEEEeCCCcE------E-EEEEecCCeEEEecCccEEEecCc----CCCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 999999988777643 2 334444556778999999998743 34579999999999999999999999999 Q ss_pred HHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHH Q lcl|NC_020081. 275 RFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVIC 354 (552) Q Consensus 275 ~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia 354 (552) ++|+||++|++||++++ .+++++++++++.|+... ++++++++ ++|++|+++++++.|+||+|++++++++|| T Consensus 203 ~~~~ng~~p~~il~~~~--~~~~e~~~~~~~~~~~~~----~~~~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia 275 (416) T protein:vir:12 203 KLYKNEATPRGILKVPA--FLDEKPKENVRKEWKRVN----KVENIAII-DYGLEYQSISMPLQEAQFVESMKFNKAQIS 275 (416) T ss_pred HHHhcCCCCceEEecCC--CCCHHHHHHHHHHHHHHh----cCCCeeec-CCCceEEEccCChhhHHHHHHHHHHHHHHH Confidence 99999999999999876 469999999999998653 46776555 679999999999999999999999999999 Q ss_pred HHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc--cceeecc-----ccc Q lcl|NC_020081. 355 SIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG--GDYVFNF-----VGG 427 (552) Q Consensus 355 ~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~--~~~~~~f-----~~~ 427 (552) ++|||||++||....+ +++|++++.+.|++.||.|+++.||++||++|+++.+ .+++|+| ++. T Consensus 276 ~~fgVPp~~lg~~~~~----------t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~ 345 (416) T protein:vir:12 276 MIYKVPLHKLNELDKA----------TFSNIEHQSIEYVRNTLQPWIVNFEQELNVKLFLDHDQKSGHYVKFNIDSELRG 345 (416) T ss_pred HHhCCCHHHhCCccCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCchhhcCCceEEeechhhhcc Confidence 9999999999976543 5799999999999999999999999999999997654 3455655 466 Q ss_pred ChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCC Q lcl|NC_020081. 428 DAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNM 507 (552) Q Consensus 428 d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (552) |.+++++.+..+ +.+|+||+||+|+++||||+||||++++++|+++++.+..++........ .+.. T Consensus 346 d~~~~~~~~~~~--~~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~n~~~~~~~~~~~~~~~~~~~------------~gge 411 (416) T protein:vir:12 346 DSKTQAEYLKTL--HETGVLNKDEIRELLERNPIENGDKYISSLNYVFLDFLEEYQRLKAGGAM------------KGGD 411 (416) T ss_pred CHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccchhhcccccccc------------CCCC Confidence 888888766654 45799999999999999999999999999999998876554432211000 0000 Q ss_pred CCCCC Q lcl|NC_020081. 508 DNVNG 512 (552) Q Consensus 508 ~~~~~ 512 (552) ++.+| T Consensus 412 ~~~~g 416 (416) T protein:vir:12 412 NKNEG 416 (416) T ss_pred CcCCC Confidence 01111 No 35 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=100.00 E-value=7.9e-78 Score=443.28 Aligned_cols=427 Identities=15% Similarity=0.190 Sum_probs=307.1 Q ss_pred chhhhhcccccccccccccccccccccc-cc-CCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 32 EEDAILKKGKNTKSNKPKAYEEPIIGSM-SM-NPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPA 109 (552) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~ 109 (552) +-..+.+.+.+.. ....+ ....+ .| ++.+... +..+ .......+...+.+.+||.+++..++ T Consensus 1 ~~~~~~~~~~~~~---~~~~~--~~~~~~~~~g~~~~~~-~~~~----~~~~~~~a~~~~~v~~~v~~ia~~iA------ 64 (460) T protein:vir:10 1 MANRIIRALRELT---GLDNK--FNDAFIKYIGQTFTKY-DNNG----KTYLEQGYNINPDVYSCISQMAAKTV------ 64 (460) T ss_pred CchhHHHHHhhhh---ccCCC--chHHHHHhhccccCCC-ccch----hhhhHHHHhcchHHHHHHHHHHHhhh------ Confidence 2222322221111 11111 01010 11 2221111 1111 22334445556677888877666644 Q ss_pred HhhccccceeeeeccccccCChhH--------------HHHHHHH------HHHHHhcCCCCCCCccCCHHHHHHHHHHH Q lcl|NC_020081. 110 RNSDKGVGYEIRLKDPLQEPNDHN--------------KKKIKEI------ENFIEKTGRIDNDFTRDNFRSFVKKLVRD 169 (552) Q Consensus 110 ~~~~~~~~~~i~~k~~~~~~~~~~--------------~~~~~~l------~~~l~~~n~~~~pn~~~t~~~f~~~~v~d 169 (552) ++++.+..++.+....... ....|++ +..... +..+||++||+++||+.++.+ T Consensus 65 -----~lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--L~~~PN~~~t~~~f~~~~~~~ 137 (460) T protein:vir:10 65 -----AVPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAF--PLESPNPTQTWADIYSLYKTY 137 (460) T ss_pred -----hCceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHH--HHhCCCCCCCHHHHHHHHHHH Confidence 4556655444333211100 0000110 111111 224789999999999999999 Q ss_pred HHhcCCeeEEEEECC----CCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCC Q lcl|NC_020081. 170 RLTYDKINFELVYDK----LGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTD 245 (552) Q Consensus 170 ~ll~Gna~~~i~r~~----~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~ 245 (552) ++++||+|++|+|+. .|.|.+||||+|++|++..+.+|..........++....++....|+++||||++++.... T Consensus 138 lll~Gnay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~r~~~~~~ 217 (460) T protein:vir:10 138 MRLNGNCYFYLMSPDDGINAGVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQFIEFNEDEVIHTKYANPNF 217 (460) T ss_pred HhhcCCeEEEEEecCCCccCceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCceeEEecccceEEEecCCCCc Confidence 999999999999964 4789999999999999999999877666656666666778888999999999998765544 Q ss_pred cc--CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceee Q lcl|NC_020081. 246 LT--VGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVI 323 (552) Q Consensus 246 ~~--~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il 323 (552) .. ++++|+||+..++.+|..+.++++++.++|+||+.|++|++.+. .+++++++++++.|++.++|.+|+|++++ T Consensus 218 ~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~~--~l~~e~~~~~~~~~~~~~~g~~n~g~~~v- 294 (460) T protein:vir:10 218 DLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFIHGGST--GLTQPQADSLKQRLTEMDKSPDRLSQIAG- 294 (460) T ss_pred ccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceeeecCC--CCCHHHHHHHHHHHHHHhcCccccCCcee- Confidence 33 45899999999999999999999999999999999999988654 56999999999999999999999999755 Q ss_pred ccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHH Q lcl|NC_020081. 324 TAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKF 403 (552) Q Consensus 324 ~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ 403 (552) +++|++|+++++++.|+||+|++++++++||++|||||++||+.+.++ .+++|++++.+.|++.||.||++. T Consensus 295 l~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--------~~~sn~e~~~~~f~~~~l~P~~~~ 366 (460) T protein:vir:10 295 ASGEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGG--------LNTGNLEEERKRVVTDNIQPDLVI 366 (460) T ss_pred cCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--------CccccHHHHHHHHHHHHHHHHHHH Confidence 567999999999999999999999999999999999999999876653 468999999999999999999999 Q ss_pred HHHHHHhhcCccccc--ceeecccccChH-HHHHHHHHHHHHhcCCcCHHHHHHHhCCCCC--CCCCeeeccccccchhh Q lcl|NC_020081. 404 IEDAVNKYIVSQFGG--DYVFNFVGGDAK-TEAEIISILESKAKIGLTINDIRKELGYPDT--EGGDVTLAGVHVQRLGQ 478 (552) Q Consensus 404 ie~~ln~~L~~~~~~--~~~~~f~~~d~~-~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~--~ggD~~~~~~n~~~~~~ 478 (552) ||++||++|+++.+. +++++|+..... .+.+.......+.+|+||+||+|+++||||+ +|||++++++|+++++. T Consensus 367 ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~~d~~~~~~~~~~g~~T~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~ 446 (460) T protein:vir:10 367 LKQAFDKKFIKRFKGYENAVIEWDISELPEMQTDMVAMASWLNTIPVTPNEIRIAMKYETLNQDGMDIVFMPSNKVRIDD 446 (460) T ss_pred HHHHHHHhhcCcccccCCceEEeecchhhhHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCCeeeecccccchhh Confidence 999999999987653 466666544332 2222223334556799999999999999999 68999999999999875 Q ss_pred hccccccccccCCCCCccCcccCCCCCCCCCCCCCCCccc Q lcl|NC_020081. 479 IMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQ 518 (552) Q Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (552) ..... .++..++ ++ T Consensus 447 ~~~~~-----~~~~~nq---------------------~~ 460 (460) T protein:vir:10 447 VSNNL-----IDSAFNQ---------------------NQ 460 (460) T ss_pred ccccc-----CCCcccC---------------------CC Confidence 43210 0000000 00 No 36 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=100.00 E-value=1.5e-77 Score=441.73 Aligned_cols=429 Identities=15% Similarity=0.160 Sum_probs=291.4 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchh-hhhcccccccccccc-ccccccccccccCCcccccccCCCCchHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEED-AILKKGKNTKSNKPK-AYEEPIIGSMSMNPDFKEAPSIHGKQNLL 78 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (552) |--.+- .-| =++-+.++..++ .+...+.....++.. .+..+....+...+++.. ..+..... T Consensus 1 ~~~~~~----~~~---------~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 64 (441) T protein:vir:79 1 MHWYNT----DCY---------FVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQG---TKLRQYKD 64 (441) T ss_pred CccccC----ccc---------cccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCc---ccccccch Confidence 222110 000 012222222211 122222222122211 111111111111111110 01111000 Q ss_pred HHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCC Q lcl|NC_020081. 79 QMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDN 158 (552) Q Consensus 79 ~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t 158 (552) .-|...+.+.+||.+++..++ .+++.+.. + ++ ..+.|++..+|. ..||++|| T Consensus 65 ----~~al~~~~V~~cv~~Ia~~iA-----------~lp~~~~~-~--~~-----~~~~~~~~~lL~-----~~PN~~~t 116 (441) T protein:vir:79 65 ----IEAIRHSDIFTAVMMIASDLA-----------RMPIRVTV-N--GQ-----INYSDRIVNLLN-----TRPNPMYN 116 (441) T ss_pred ----hhhhccHHHHHHHHHHHHhhc-----------cCceeeec-C--cc-----ccccchHHHHHh-----cccCcCCC Confidence 112234456778877666544 45555532 1 11 122344443332 47899999 Q ss_pred HHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEE---cCCceEEEEcccce Q lcl|NC_020081. 159 FRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQV---IDDKVVAKFKAKEM 235 (552) Q Consensus 159 ~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~---~~~~~~~~~~~~ev 235 (552) +++||+.++.+++++||||++|+|+..|+|++||||+|+.|+|..+.+|..++ +++. ........|+++|| T Consensus 117 ~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~------~~~~~~~~~~~~~~~~~~~dv 190 (441) T protein:vir:79 117 GYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYY------FHQRIDSNGNNIERNVKFEDM 190 (441) T ss_pred HHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEE------EEEEeccCCceeEEEEccccE Confidence 99999999999999999999999999999999999999999999988886432 2222 22334578999999 Q ss_pred eeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccc Q lcl|NC_020081. 236 AWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGIN 315 (552) Q Consensus 236 i~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~ 315 (552) ||++.+ +.++++|+||+.++..+|..+.++++++.++|+||++|+|||++++.. .++++++++|+.|++.++|.. T Consensus 191 ih~k~~----~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~~~e~~e~~r~~~~~~~~G~~ 265 (441) T protein:vir:79 191 LDIKFY----SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVL-DNKKARDRAREEFHKSFSGTK 265 (441) T ss_pred EEeccC----CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCC-CCHHHHHHHHHHHHHHhcCcc Confidence 998753 466799999999999999999999999999999999999999998643 468889999999999999999 Q ss_pred ccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHH Q lcl|NC_020081. 316 GAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDK 395 (552) Q Consensus 316 nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~ 395 (552) |+|+++|+ ++|++|++++++++|+||+|.+++++++||++|||||++||.... + .+.+++...| .+ T Consensus 266 nag~~~vl-~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-----------~-~s~~q~~~~~-~~ 331 (441) T protein:vir:79 266 QAGKVVVL-DESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-----------N-MSITDANLDY-LS 331 (441) T ss_pred ccCcceec-CCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC-----------C-ccHHHHHHHH-HH Confidence 99998655 579999999999999999999999999999999999999986321 1 2345555555 56 Q ss_pred HhhHHHHHHHHHHHhhcCcccccceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCC--eee Q lcl|NC_020081. 396 GLEPLLKFIEDAVNKYIVSQFGGDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGD--VTL 468 (552) Q Consensus 396 ~l~P~~~~ie~~ln~~L~~~~~~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD--~~~ 468 (552) ||.|+++.||++||++|+++.. +++|+| ++.|.+++++.++.+ +.+|+||+||+|+++||||+|||| +++ T Consensus 332 tl~P~~~~ie~eln~kl~~~~~-~~~~~fd~~~llr~D~~~~~~~~~~~--i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~ 408 (441) T protein:vir:79 332 TLKPYITCVCAELNFKFNDEYV-NREFKFDTTEIRVVDEKTQAEIDKIN--IDSGKMNIDEIRQRDGLAPIPGGNGSIHR 408 (441) T ss_pred HHHHHHHHHHHHHhhhcccccc-CceEEeechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCcceEe Confidence 9999999999999999987653 455555 677888888877654 447999999999999999999998 577 Q ss_pred ccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcc Q lcl|NC_020081. 469 AGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQS 526 (552) Q Consensus 469 ~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (552) +++|+++++.++.++..+..... +.. .|+|.++ T Consensus 409 ~~~n~~~~~~~~~~~~~~~~~~~---~~~----------------------kgGe~~e 441 (441) T protein:vir:79 409 VDLNHVNIELVDEYQMNKSRATD---KKL----------------------KGGEENE 441 (441) T ss_pred ecccccccccccccccccccccc---ccc----------------------CCCCCCC Confidence 89999998876543322211100 000 0000000 No 37 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=100.00 E-value=1.5e-77 Score=441.73 Aligned_cols=429 Identities=15% Similarity=0.160 Sum_probs=291.4 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchh-hhhcccccccccccc-ccccccccccccCCcccccccCCCCchHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEED-AILKKGKNTKSNKPK-AYEEPIIGSMSMNPDFKEAPSIHGKQNLL 78 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (552) |--.+- .-| =++-+.++..++ .+...+.....++.. .+..+....+...+++.. ..+..... T Consensus 1 ~~~~~~----~~~---------~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 64 (441) T protein:vir:94 1 MHWYNT----DCY---------FVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQG---TKLRQYKD 64 (441) T ss_pred CccccC----ccc---------cccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCc---ccccccch Confidence 222110 000 012222222211 122222222122211 111111111111111110 01111000 Q ss_pred HHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCC Q lcl|NC_020081. 79 QMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDN 158 (552) Q Consensus 79 ~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t 158 (552) .-|...+.+.+||.+++..++ .+++.+.. + ++ ..+.|++..+|. ..||++|| T Consensus 65 ----~~al~~~~V~~cv~~Ia~~iA-----------~lp~~~~~-~--~~-----~~~~~~~~~lL~-----~~PN~~~t 116 (441) T protein:vir:94 65 ----IEAIRHSDIFTAVMMIASDLA-----------RMPIRVTV-N--GQ-----INYSDRIVNLLN-----TRPNPMYN 116 (441) T ss_pred ----hhhhccHHHHHHHHHHHHhhc-----------cCceeeec-C--cc-----ccccchHHHHHh-----cccCcCCC Confidence 112234456778877666544 45555532 1 11 122344443332 47899999 Q ss_pred HHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEE---cCCceEEEEcccce Q lcl|NC_020081. 159 FRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQV---IDDKVVAKFKAKEM 235 (552) Q Consensus 159 ~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~---~~~~~~~~~~~~ev 235 (552) +++||+.++.+++++||||++|+|+..|+|++||||+|+.|+|..+.+|..++ +++. ........|+++|| T Consensus 117 ~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~------~~~~~~~~~~~~~~~~~~~dv 190 (441) T protein:vir:94 117 GYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYY------FHQRIDSNGNNIERNVKFEDM 190 (441) T ss_pred HHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEE------EEEEeccCCceeEEEEccccE Confidence 99999999999999999999999999999999999999999999988886432 2222 22334578999999 Q ss_pred eeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccc Q lcl|NC_020081. 236 AWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGIN 315 (552) Q Consensus 236 i~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~ 315 (552) ||++.+ +.++++|+||+.++..+|..+.++++++.++|+||++|+|||++++.. .++++++++|+.|++.++|.. T Consensus 191 ih~k~~----~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~~~e~~e~~r~~~~~~~~G~~ 265 (441) T protein:vir:94 191 LDIKFY----SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVL-DNKKARDRAREEFHKSFSGTK 265 (441) T ss_pred EEeccC----CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCC-CCHHHHHHHHHHHHHHhcCcc Confidence 998753 466799999999999999999999999999999999999999998643 468889999999999999999 Q ss_pred ccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHH Q lcl|NC_020081. 316 GAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDK 395 (552) Q Consensus 316 nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~ 395 (552) |+|+++|+ ++|++|++++++++|+||+|.+++++++||++|||||++||.... + .+.+++...| .+ T Consensus 266 nag~~~vl-~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-----------~-~s~~q~~~~~-~~ 331 (441) T protein:vir:94 266 QAGKVVVL-DESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-----------N-MSITDANLDY-LS 331 (441) T ss_pred ccCcceec-CCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC-----------C-ccHHHHHHHH-HH Confidence 99998655 579999999999999999999999999999999999999986321 1 2345555555 56 Q ss_pred HhhHHHHHHHHHHHhhcCcccccceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCC--eee Q lcl|NC_020081. 396 GLEPLLKFIEDAVNKYIVSQFGGDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGD--VTL 468 (552) Q Consensus 396 ~l~P~~~~ie~~ln~~L~~~~~~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD--~~~ 468 (552) ||.|+++.||++||++|+++.. +++|+| ++.|.+++++.++.+ +.+|+||+||+|+++||||+|||| +++ T Consensus 332 tl~P~~~~ie~eln~kl~~~~~-~~~~~fd~~~llr~D~~~~~~~~~~~--i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~ 408 (441) T protein:vir:94 332 TLKPYITCVCAELNFKFNDEYV-NREFKFDTTEIRVVDEKTQAEIDKIN--IDSGKMNIDEIRQRDGLAPIPGGNGSIHR 408 (441) T ss_pred HHHHHHHHHHHHHhhhcccccc-CceEEeechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCcceEe Confidence 9999999999999999987653 455555 677888888877654 447999999999999999999998 577 Q ss_pred ccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcc Q lcl|NC_020081. 469 AGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQS 526 (552) Q Consensus 469 ~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (552) +++|+++++.++.++..+..... +.. .|+|.++ T Consensus 409 ~~~n~~~~~~~~~~~~~~~~~~~---~~~----------------------kgGe~~e 441 (441) T protein:vir:94 409 VDLNHVNIELVDEYQMNKSRATD---KKL----------------------KGGEENE 441 (441) T ss_pred ecccccccccccccccccccccc---ccc----------------------CCCCCCC Confidence 89999998876543322211100 000 0000000 No 38 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=100.00 E-value=1.2e-77 Score=442.29 Aligned_cols=446 Identities=13% Similarity=0.084 Sum_probs=296.9 Q ss_pred cccccc--ccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccc Q lcl|NC_020081. 50 AYEEPI--IGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQ 127 (552) Q Consensus 50 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~ 127 (552) .-+.|. .+..+|.+. +..+. . ...+..++++.+||.+++..+ ++++|.++.. +. T Consensus 1 ~~~~~~~~g~~~~~~~~-----~~~~~--~----~~~~~~~~~V~acV~~Ia~~i-----------A~lpl~l~~~--~~ 56 (723) T protein:vir:94 1 MTTFPSGAGGWNAWSAD-----SVFGN--G----AKGWSNSAVAYRCISMLANNA-----------ASVDLVVRGP--DG 56 (723) T ss_pred CcccccCCCcccccccc-----ccccc--c----HHHHhhhHHHHHHHHHHHHhh-----------ccceeEEEcC--CC Confidence 111111 011111111 01110 0 112235567788887655554 4566665432 22 Q ss_pred cCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECC---CCCEEEEEEecCceeEEEEC Q lcl|NC_020081. 128 EPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDK---LGDLHNFKAVDASTVYVAVD 204 (552) Q Consensus 128 ~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~---~G~~~~L~~l~p~~v~v~~~ 204 (552) + ..+.|++..+|. .+||++||+++||+.++.+++++||+|++|+|+. .|.|.+|+||++..+.+... T Consensus 57 ~-----~~~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~ 126 (723) T protein:vir:94 57 E-----LDELHPLSQLWN-----VMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVAT 126 (723) T ss_pred c-----cchhhHHHHHHh-----hCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeec Confidence 2 123355555543 3689999999999999999999999999999754 58999999999998888877 Q ss_pred CCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Q lcl|NC_020081. 205 EDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTR 284 (552) Q Consensus 205 ~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~ 284 (552) .++........+.|.....++....|+++||||++.+ ++.+++||+|||..++.+|..+.++++|+.++|+||++|+ T Consensus 127 ~~~~~~~~~~~~~y~~~~~~G~~~~~~~~dIiHir~~---~~~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~ 203 (723) T protein:vir:94 127 RAADAVPQAQIIGYVIERTDGVRVPVLADEMLWLRFS---DPYDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPG 203 (723) T ss_pred CCCccceeeeeeEEEEEecCceeEEecccceEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc Confidence 7766555545556666666777889999999999754 3567899999999999999999999999999999999999 Q ss_pred eEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecc---------CCceeeeccCchhHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 285 GLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITA---------EDVKFVNMTQSSKDMEFEKWLNYLINVICS 355 (552) Q Consensus 285 gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~---------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 355 (552) |||+.+ .+++++.+++++.|++.++|..|+|+++||.+ +|++|++++++++|+||+|++++++++||+ T Consensus 204 giL~~~---~l~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~ 280 (723) T protein:vir:94 204 GVVNLG---DMDEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVML 280 (723) T ss_pred eEEEcC---CCCHHHHHHHHHHHHHHhhchhhcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHH Confidence 999974 36899999999999999999999999988753 589999999999999999999999999999 Q ss_pred HhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecc-----cccChH Q lcl|NC_020081. 356 IYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNF-----VGGDAK 430 (552) Q Consensus 356 ~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f-----~~~d~~ 430 (552) +|||||++||. ..+++|.+++.+.|+++||.||++.||++||++|++.++.+++|+| +++|.+ T Consensus 281 afgVPp~~i~~------------~st~sN~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~g~~~~~~f~~~~lLr~D~~ 348 (723) T protein:vir:94 281 AFGIRKDALLG------------GSTYENQAEAKAAVWTETLIPQMEVMASITDLQLLPDIGWTVEWDFNSVPALQEDLE 348 (723) T ss_pred HhCCChhHcCC------------CCCcccHHHHHHHHHHHHHHHHHHHHHHHHhHhhcccccCceEEeecchhhhhcCHH Confidence 99999999963 1357899999999999999999999999999999998888888877 467888 Q ss_pred HHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCC--eeeccccccchhhhccccccccccCCC-CCccC-cccCCCCCC Q lcl|NC_020081. 431 TEAEIISILESKAKIGLTINDIRKELGYPDTEGGD--VTLAGVHVQRLGQIMQQEQVEYQRQMD-ANQFL-AQQTGYDGN 506 (552) Q Consensus 431 ~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD--~~~~~~n~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~~~ 506 (552) ++.+.+..+ +.+|+||+||+|+++||||+|||| +++.|.+......... .+...++. ...+. ....+.... T Consensus 349 ~r~~~~~~~--v~~G~~T~NE~R~~lglpPi~gGd~~~~~~p~~~~~a~~~~~---~p~~~e~~~~~~~~~~~~~~~~p~ 423 (723) T protein:vir:94 349 AQAGRNQGY--LVNDVLMVDEVRATIGLDPLPGGIGQMTLTPYRAQFAPAPAP---APAVEEGAARMLALLERVAADRPL 423 (723) T ss_pred HHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCcccceeccccccccCCCCC---CccchhhhHhhhhhccccccccCc Confidence 888777644 457999999999999999999988 4456653322111000 00000000 00000 000000000 Q ss_pred CC-CCCCCCCcccccCCCCccccccccccccccCccccccc--------cccccC Q lcl|NC_020081. 507 MD-NVNGKDSFNQNVGKDGQSKQQANTNSTPQGGKDDNGNV--------VNDWEA 552 (552) Q Consensus 507 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~ 552 (552) +. ...+..-...+.+.++++-.=+.-.+-=++--..=|.+ -.+|.. T Consensus 424 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (723) T protein:vir:94 424 PELPVRATTVLHHDPGPDPQQTLYERLEALLQPLLVELGRRQAAVTLREFDLLMR 478 (723) T ss_pred CCCCCCCCCCCCCCcccCCchhHHHHHHHHHhhhHHHHHHHHHHHHHHhhchhhc Confidence 00 00000000111111111100000000000000000000 001111 No 39 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=100.00 E-value=8.1e-78 Score=443.22 Aligned_cols=412 Identities=12% Similarity=0.119 Sum_probs=292.5 Q ss_pred cccccCcccccccccc---hhhhhccccccccccccccccccccc-cccCCcccccccCCCCchHHHHHHHhhcchHHHH Q lcl|NC_020081. 17 IIDINDDMAVRIKQIE---EDAILKKGKNTKSNKPKAYEEPIIGS-MSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILN 92 (552) Q Consensus 17 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~ 92 (552) ++ .-...++.- --.+.+.+.++++.. .++.|.... +.+.. .+..+..... ..|...+.+. T Consensus 1 ~~-----~~~~~~~~~~~~~~~~~~~lf~~~~~~--~~~~~~~~~~~~~~~-----~~~~~~~vs~----~~al~~~~v~ 64 (424) T protein:vir:45 1 ML-----YCWWAHWLWPEGGRVLLDALFRSKSLE--NPSTPITGDAVDTDG-----LFRADVYVSP----ETAMKLAAVY 64 (424) T ss_pred Ce-----eEeeeceecCcchhHHHHhhccccCCC--CCccccchhhhhhhc-----cccCCceech----HHhhccHHHH Confidence 00 000001110 001111112222211 111121111 00000 0111111011 1233445678 Q ss_pred HHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHh Q lcl|NC_020081. 93 AIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLT 172 (552) Q Consensus 93 a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll 172 (552) +||.+++..+ +++++.+..+.... .+....|++..+|. ..||++||+++||+.++.++++ T Consensus 65 ~cv~~Ia~~i-----------A~lp~~v~~~~~~~----~~~~~~~~l~~lL~-----~~PN~~~t~~~f~~~~v~~lll 124 (424) T protein:vir:45 65 SCIYVLSSSL-----------AQMPLHVMRRHKGK----VEPARDHPAFYLVH-----DEPNTWQTSYKWRELKQRHILG 124 (424) T ss_pred HHHHHHHHHH-----------hhCceEEEEecCCc----eeecccchHHHHHH-----hhcccCCCHHHHHHHHHHHHhh Confidence 8887766655 45667665443221 11223455555553 3688999999999999999999 Q ss_pred cCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCccc Q lcl|NC_020081. 173 YDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYG 252 (552) Q Consensus 173 ~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G 252 (552) +||+|++|+|+..|+|++||||+|..|++..+. +. +.|.... ......|+++||||++.. +.++++| T Consensus 125 ~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~~~~-~~-------~~y~~~~-~~~~~~~~~~eVih~r~~----~~d~~~G 191 (424) T protein:vir:45 125 WGNGYTWVKRNRRGEVISLDCCMPWETTLMNTG-GR-------YTYGLYN-EYGAFAISPDDMIHIRAL----GNNQKMG 191 (424) T ss_pred cCCeEEEEEEcCCCcEEEEEEecCceEEEEEcC-Ce-------EEEEEEe-cCceEEECcccEEEecCc----CCCCccc Confidence 999999999999999999999999999987543 22 2344333 334567999999998742 3467999 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc-cccccceeeccCCceee Q lcl|NC_020081. 253 YPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGI-NGAWKIPVITAEDVKFV 331 (552) Q Consensus 253 ~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~-~nagk~~il~~~g~~~~ 331 (552) +||+..+..+|..+.++++++.++|+||++|+|||++++. +++++++++++.|++.+.|. +|+|+++|+ ++|++|+ T Consensus 192 ~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~--l~~e~~~~~~~~~~~~~~g~~~n~g~~~vl-~~g~~~~ 268 (424) T protein:vir:45 192 LSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSG--LNKESWGWLKDQWQKASQALRRQENKTMLL-PADLDYK 268 (424) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC--CCHHHHHHHHHHHHHHhccccccCCceeEc-CCCceEE Confidence 9999999999999999999999999999999999999864 58999999999999999986 588997655 5799999 Q ss_pred eccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhh Q lcl|NC_020081. 332 NMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKY 411 (552) Q Consensus 332 ~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~ 411 (552) ++++++.|+||+|++++++++||++|||||++||+.+.+ +++|++++.+.|++.||.||++.||++||++ T Consensus 269 ~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~----------t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~k 338 (424) T protein:vir:45 269 ALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKA----------TFSNISAQAIQFVRYTMMPWVTNWEQELNRR 338 (424) T ss_pred EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 999999999999999999999999999999999987654 4789999999999999999999999999999 Q ss_pred cCcccc--cceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccc Q lcl|NC_020081. 412 IVSQFG--GDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQ 484 (552) Q Consensus 412 L~~~~~--~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~ 484 (552) |++..+ .+++++| +++|.+++++.++.+ +.+|+||+||+|+++||||+||||++++++|+.+... T Consensus 339 Ll~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~--~~~g~~T~NE~R~~~gl~pi~ggD~~~~~~n~~~~~~------ 410 (424) T protein:vir:45 339 LFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFA--ITDGWMSRNEARAFEDMNPVEGLDEMLVSVNAANPAG------ 410 (424) T ss_pred cCChhhhcCCcEEEeechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccc------ Confidence 998643 4566665 477888888877654 4579999999999999999999999999998764210 Q ss_pred cccccCCCCCccCcccCCCCCCCCC Q lcl|NC_020081. 485 VEYQRQMDANQFLAQQTGYDGNMDN 509 (552) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (552) +.. +...+ .+.+++ T Consensus 411 -------~~~-~~~~~---~~~~~~ 424 (424) T protein:vir:45 411 -------DFK-PPKND---EGKTNE 424 (424) T ss_pred -------ccC-CCCCC---CCCCCC Confidence 000 00000 000000 No 40 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=100.00 E-value=7.9e-78 Score=443.30 Aligned_cols=410 Identities=14% Similarity=0.117 Sum_probs=299.2 Q ss_pred hccccccccccccccccccccccccCCcc-cccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDF-KEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKG 115 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~ 115 (552) |...+..++ ....+.|. +..|...+ .-.++..+...+.+ -|...+.+.+||.++++.+ ++ T Consensus 1 m~~~~~~~~--~~~~~~~~--~~~~~~~~~g~~~s~~~~~v~~~----~al~~~~v~~cv~~ia~~i-----------a~ 61 (419) T protein:vir:80 1 MFFSRQLLS--NLGQTQPG--SGGWVSALLGSARSEAGQVVTPA----SALSLTVLQNCVTLLAESI-----------AQ 61 (419) T ss_pred CCccccccc--ccCcCCCC--cchhhHHhhcccccccCcccChH----HhhccHHHHHHHHHHHHhh-----------cc Confidence 222111111 11111221 11222111 11222222222111 2334566788887766654 56 Q ss_pred cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEec Q lcl|NC_020081. 116 VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVD 195 (552) Q Consensus 116 ~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~ 195 (552) +++.++.++.++. ...+.|++..+|. .+||++||+++||+.++.+++++||+|++|+|+..|+|++||||+ T Consensus 62 lp~~~~~~~~~~~----~~~~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~ 132 (419) T protein:vir:80 62 LPVELYERSGDDR----KPATDHPLYSILK-----YEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLD 132 (419) T ss_pred CceEEEEecCCCc----ccccccHHHHHHH-----hhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEec Confidence 6777766554432 2223455555553 368899999999999999999999999999999999999999999 Q ss_pred CceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 196 ASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNAR 275 (552) Q Consensus 196 p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~ 275 (552) |++|++..+.+|.. .|.+.. . ..+++++|+|++.+ +.+++||+||+..+..+|..+.++++++.+ T Consensus 133 ~~~v~i~~~~~~~~-------~y~~~~--~--~~~~~~~i~h~~~~----~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~ 197 (419) T protein:vir:80 133 NEAVTVMKGPDLKP-------MYRVAG--A--DPLPQRLVHHVRWM----SINGYTGLSPVLLHANAIGHAQAIQQYAGK 197 (419) T ss_pred CceEEEEECCCceE-------EEEEcC--c--cccchhheEEecCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 99999998877643 233322 1 24788888887653 456799999999999999999999999999 Q ss_pred HHhccCCCceEEEeCCCC--CCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHH Q lcl|NC_020081. 276 FFAQGGTTRGLLHIKTGQ--EQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVI 353 (552) Q Consensus 276 ~f~ng~~p~gil~~~~~~--~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~I 353 (552) +|+||++|+|+|+++++. ..++++++++++.|++.++|..|+|+++++ ++|++|++++.++.|+||+|++++++++| T Consensus 198 ~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl-~~g~~~~~l~~s~~d~q~~e~~~~~~~~I 276 (419) T protein:vir:80 198 SFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAKFGGSGNAKKVALL-QEGMKFKPLSMTNVDAALIDALRLSALDI 276 (419) T ss_pred HHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceec-CCCceEEeccCChhhHHHHHHHHHHHHHH Confidence 999999999999987543 347889999999999999999999998665 57999999999999999999999999999 Q ss_pred HHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecc-----ccc Q lcl|NC_020081. 354 CSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNF-----VGG 427 (552) Q Consensus 354 a~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f-----~~~ 427 (552) |++|||||++||+.+.+ +++|++++.+.|++.||.|+++.||++|+++|++..+ .+++++| +++ T Consensus 277 a~~fgVPp~llg~~~~~----------t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~~~ 346 (419) T protein:vir:80 277 ARIYKIPAHMVNELERA----------TFSNIEHQSLQFVIYTLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRG 346 (419) T ss_pred HHHhCCCHHHhcCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhcc Confidence 99999999999976544 4789999999999999999999999999999997543 4566665 467 Q ss_pred ChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCC Q lcl|NC_020081. 428 DAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNM 507 (552) Q Consensus 428 d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (552) |.+++++.++.+ +.+|+||+||+|+++|+||+||||++++|+|++.++.+...+..+ ++++ T Consensus 347 d~~~~~~~~~~~--~~~G~~T~NE~R~~~g~~p~~gGD~~~~~~n~~~~~~~~~~~~~~-----------------~~~~ 407 (419) T protein:vir:80 347 DQSSRYAAYAVG--RQWGWLSINDIRRLENMPPVKGGDIYLSPMNMVDASKPQPIPMGK-----------------TEPT 407 (419) T ss_pred CHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccccccCCC-----------------CCch Confidence 888888777654 457999999999999999999999999999987765433211111 0000 Q ss_pred CCCCCCCCcccccCCCCcccccccccc Q lcl|NC_020081. 508 DNVNGKDSFNQNVGKDGQSKQQANTNS 534 (552) Q Consensus 508 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 534 (552) ...-. +-+..-| T Consensus 408 ~~~~~---------------~~~~~l~ 419 (419) T protein:vir:80 408 KAALD---------------EIGRILS 419 (419) T ss_pred hhhHH---------------HHHhhcC Confidence 00000 0001111 No 41 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=100.00 E-value=3.1e-77 Score=440.04 Aligned_cols=429 Identities=15% Similarity=0.164 Sum_probs=292.2 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccc-hhhhhccccccccccccc-cccccccccccCCcccccccCCCCchHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIE-EDAILKKGKNTKSNKPKA-YEEPIIGSMSMNPDFKEAPSIHGKQNLL 78 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (552) |--.+- .-| . ++=+.++.. +..+...+.+...++... ...+...-...-+++.. ..+..... T Consensus 1 ~~~~~~----~~~---~------~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 64 (441) T protein:vir:98 1 MHWYNT----DCY---F------VDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQG---TKLRQYKD 64 (441) T ss_pred CceecC----ccc---e------eccccccchhhhhhccccccccccccccCCCcchHHHHHHhhcccc---cCccccch Confidence 222220 000 0 111111111 111222222221122111 11111111111111110 01111100 Q ss_pred HHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCC Q lcl|NC_020081. 79 QMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDN 158 (552) Q Consensus 79 ~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t 158 (552) .-|...+.+.+||.+++..++ ++++.+.. +++ ..+.|++..+| ...||+.|| T Consensus 65 ----~~al~~~~V~acv~~Ia~~iA-----------~lpl~~~~---~~~-----~~~~~~~~~lL-----~~~PN~~~t 116 (441) T protein:vir:98 65 ----IEAIRHSDIFTAVMMIASDLA-----------RMPIRVTV---NGQ-----INYSDRIVNLL-----NTRPNPMYN 116 (441) T ss_pred ----hhhhccHHHHHHHHHHHHhhc-----------cCceEEec---CCc-----ccccchHHHHH-----hcccccCCC Confidence 112233456778776666554 45555531 111 11234443333 247899999 Q ss_pred HHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEE---cCCceEEEEcccce Q lcl|NC_020081. 159 FRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQV---IDDKVVAKFKAKEM 235 (552) Q Consensus 159 ~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~---~~~~~~~~~~~~ev 235 (552) +++||+.++.+++++||+|++|+|+..|+|++||||+|++|++..+.+|..++ +++. ........|+++|| T Consensus 117 ~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~------~~~~~~~~~~~~~~~~~~~dv 190 (441) T protein:vir:98 117 GYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKLDARGRLYY------FHQRIDSNGNNIERNVKFEDM 190 (441) T ss_pred HHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEECCCCcEEE------EEEEeccCcceeeEEEccccE Confidence 99999999999999999999999999999999999999999999998886532 2222 22334578999999 Q ss_pred eeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccc Q lcl|NC_020081. 236 AWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGIN 315 (552) Q Consensus 236 i~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~ 315 (552) ||++.+ +.++++|+||+.++..+|..+.++++++.++|+||++|+|||++++.. .++++++++++.|++.++|.+ T Consensus 191 iHir~~----~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~-~~~e~~~~~~~~~~~~~~G~~ 265 (441) T protein:vir:98 191 LDIKFY----SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVL-DNKKARDRAREEFHKSFSGTK 265 (441) T ss_pred EEeccC----CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCC-CCHHHHHHHHHHHHHHhcCcc Confidence 998753 456799999999999999999999999999999999999999998653 367889999999999999999 Q ss_pred ccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHH Q lcl|NC_020081. 316 GAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDK 395 (552) Q Consensus 316 nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~ 395 (552) |+|+++|+ ++|++|++++++++|+||+|.+++++++||++|||||++||.... ..+.+++...|+ + T Consensus 266 nag~~~vl-~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~------------~~s~~q~~~~y~-~ 331 (441) T protein:vir:98 266 QAGKVVVL-DESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA------------NMSITDANLDYL-S 331 (441) T ss_pred ccCcceec-CCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC------------CccHHHHHHHHH-H Confidence 99997655 579999999999999999999999999999999999999986321 134566666665 6 Q ss_pred HhhHHHHHHHHHHHhhcCcccccceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCC--eee Q lcl|NC_020081. 396 GLEPLLKFIEDAVNKYIVSQFGGDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGD--VTL 468 (552) Q Consensus 396 ~l~P~~~~ie~~ln~~L~~~~~~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD--~~~ 468 (552) ||+||++.||++||++|+++.. +++|+| +++|.+++++.++.+ +.+|+||+||+|+++||||+|||| +++ T Consensus 332 tl~P~~~~ie~~ln~~L~~~~~-~~~~~fd~~~llr~d~~~~~~~~~~~--~~~G~~T~NE~R~~~gl~pi~gGd~~~~~ 408 (441) T protein:vir:98 332 TLKPYITCVCAELNFKFNDEYV-NREFKFDTTEIRVVDEKTQAEIDKIN--IDSGKMNIDEIRQRDGLAPIPGGNGSIHR 408 (441) T ss_pred HHHHHHHHHHHHHHhhcccccc-CceEEEechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCcceEe Confidence 9999999999999999997653 455565 678888888877654 457999999999999999999998 577 Q ss_pred ccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcc Q lcl|NC_020081. 469 AGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQS 526 (552) Q Consensus 469 ~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (552) +++|+++++.++.++........ .... |+|.++ T Consensus 409 ~~~n~~~~~~~~~~q~~~~~~~~---~~~k----------------------gGe~ne 441 (441) T protein:vir:98 409 VDLNHVNIELVDEYQMNKSRATD---KKLK----------------------GGEENE 441 (441) T ss_pred ecccccccccccccccccccccc---cccC----------------------CCCCCC Confidence 89999998876654433211100 0000 000000 No 42 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=100.00 E-value=1.9e-76 Score=435.75 Aligned_cols=409 Identities=12% Similarity=0.040 Sum_probs=292.4 Q ss_pred hccccccccccccccccccccccccCCcc---cccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDF---KEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSD 113 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~ 113 (552) |+... .. .......|...+ ...|+..+..-. ..|...+.+.+||.++++.++ T Consensus 1 m~~~~-~~---------~~~~~~~~~~~~~~~~~~~~~~g~~~~-----~~Al~~~~V~~cv~~ia~~iA---------- 55 (417) T protein:vir:38 1 MKLFR-GL---------ATEVDPHWADHLLDSGVIPSFRGGYLG-----ISALRNSDVLTAVSIVSGDVS---------- 55 (417) T ss_pred Ccccc-cc---------ccCCCccchhhhcccccccccCCceec-----hhhcccHHHHHHHHHHHHhhc---------- Confidence 44321 00 001111221111 112222222111 123344556888877666554 Q ss_pred cccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCC-CCEEEEE Q lcl|NC_020081. 114 KGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKL-GDLHNFK 192 (552) Q Consensus 114 ~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~-G~~~~L~ 192 (552) .+++.+..+..+... ..|++..+|. .+||++||+++||+.++.+++++||+|++|+|+.. |.|.+|+ T Consensus 56 -~lp~~~~~~~~~~~~------~~~~~~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~ 123 (417) T protein:vir:38 56 -RFPLVITDSSTDEVI------DLANIEYLMN-----TKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFE 123 (417) T ss_pred -cCeeEEEEcCCccee------ccchHHHHHh-----cccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEE Confidence 456666444333221 2244444332 47899999999999999999999999999999864 6799999 Q ss_pred EecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 193 AVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVF 272 (552) Q Consensus 193 ~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~ 272 (552) ||+|++|++...++|.++ +++...++.....++++||||++.. +.++++|+||+.++..+|.++.++++| T Consensus 124 ~l~p~~v~v~~~~~~~~~------y~~~~~~~~~~~~~~~~dviH~r~~----~~d~~~G~s~l~~~~~~i~~~~~~~~~ 193 (417) T protein:vir:38 124 FYAPSQTQVDTSDPDNII------YRFTPYNSSMQKVCGFEDVIHWKFF----SYDTIMGRSPLLSLGDEIGLQESGVST 193 (417) T ss_pred EeCCceEEEEEcCCCeEE------EEEEEcCCcEEEEecCcceEEecCC----CCCCccccCHHHHHHHHHHHHHHHHHH Confidence 999999999887776432 2234455666778999999999742 456799999999999999999999999 Q ss_pred HHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHH Q lcl|NC_020081. 273 NARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINV 352 (552) Q Consensus 273 ~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 352 (552) +.++|+||++|++||+.++ .+++++++++++.|++.++|. |+|+++|+ ++|++|++++++++|+||+|++++++++ T Consensus 194 ~~~~f~ng~~p~~il~~~~--~l~~e~~~~~~~~~~~~~~g~-n~g~~~vl-~~g~~~~~l~~~~~d~q~le~~~~~~~~ 269 (417) T protein:vir:38 194 LQKFFKSGLKGSIIKAKES--RLSAEARQKIREDFERAQAGA-DAGSPIIV-DATMDYQPLEVDTNVLNLINSNNYSTAQ 269 (417) T ss_pred HHHHHhccCCCcEEEEeCC--CCCHHHHHHHHHHHHHHhccc-ccCCceec-cCCceEEEccCCHHHHHHHHHHHhhHHH Confidence 9999999999999999875 468999999999999999884 89997655 5799999999999999999999999999 Q ss_pred HHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecccccChHH Q lcl|NC_020081. 353 ICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNFVGGDAKT 431 (552) Q Consensus 353 Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~d~~~ 431 (552) ||++|||||++||.. .+++|++++.+.|+++||.||++.||++|+++|++..+ .+++|+|+..+. . T Consensus 270 Ia~~fgVPp~~lg~~------------~~~s~~e~~~~~~~~~tl~P~~~~ie~~l~~~Ll~~~~~~~~~~~fd~~~l-~ 336 (417) T protein:vir:38 270 IAKALRVPAYRLAQN------------SPNQSVKQLADDYIRNDLPFYFEPITSEFELKLLDDAQRHQYCIGFDTKSV-N 336 (417) T ss_pred HHHHhCCCHHHhCCC------------CcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhcChhhcccceEEechhhh-h Confidence 999999999999831 25789999999999999999999999999999997644 356778864432 2 Q ss_pred HHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCC--CeeeccccccchhhhccccccccccCCC-CCccCcccCCCCCCCC Q lcl|NC_020081. 432 EAEIISILESKAKIGLTINDIRKELGYPDTEGG--DVTLAGVHVQRLGQIMQQEQVEYQRQMD-ANQFLAQQTGYDGNMD 508 (552) Q Consensus 432 ~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~gg--D~~~~~~n~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 508 (552) +....++...+.+|+||+||+|+++||||+||| |.+++++|+++++....++..+...... .++..+.+ T Consensus 337 ~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~~~~~~~~~~~~kgg~~~~~~~~-------- 408 (417) T protein:vir:38 337 GLPIADVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQKEAYQAEHAAELKGGDTNAKGNQ-------- 408 (417) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccccccccccccccccccccCCCCCCCCCCC-------- Confidence 222333444566899999999999999999987 7899999999998766544322111000 00000000 Q ss_pred CCCCCCCcccccCCCCcccccccccc Q lcl|NC_020081. 509 NVNGKDSFNQNVGKDGQSKQQANTNS 534 (552) Q Consensus 509 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 534 (552) +.++.+ ++ | T Consensus 409 ~~~~~~---~~--------------~ 417 (417) T protein:vir:38 409 NGSGTN---AN--------------S 417 (417) T ss_pred cCCCCc---CC--------------C Confidence 000000 00 0 No 43 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=100.00 E-value=2.8e-76 Score=434.81 Aligned_cols=405 Identities=16% Similarity=0.176 Sum_probs=284.3 Q ss_pred hcccccccccccccccc-ccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEE-PIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKG 115 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~ 115 (552) |.-+.+. +++...... .+...+..-+++. ...+..... .-|...+.+.+||.+++..++ + T Consensus 1 Mg~f~~~-~~r~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~----~~al~~~~v~~cv~~Ia~~iA-----------~ 61 (416) T protein:vir:45 1 MGIFYKN-EKRDLQYNEDDLQMMVQTLPGFQ---GTKLRQYKD----IEAIRHSDIFTAVMMIASDLA-----------R 61 (416) T ss_pred CCccccc-ccccccCCCcchhHHHHHhcccc---ccCccccch----hhhhcchHHHHHHHHHHHhhc-----------c Confidence 3322211 111111110 0000000001100 011111000 111223345778776665544 4 Q ss_pred cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEec Q lcl|NC_020081. 116 VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVD 195 (552) Q Consensus 116 ~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~ 195 (552) +++.+.. +++ ..+.|++..+| ..+||+.||+++||+.++.+++++||+|++|+|+..|+|++||||+ T Consensus 62 ~p~~~~~---~~~-----~~~~~~~~~lL-----~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~ 128 (416) T protein:vir:45 62 MPIRVTV---NGQ-----INYSDRIVNLL-----NTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRK 128 (416) T ss_pred CceEEec---Ccc-----ccccchHHHHH-----hcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEc Confidence 4555432 111 11223343333 2478999999999999999999999999999999999999999999 Q ss_pred CceeEEEECCCcccccccceeEEEEEc---CCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 196 ASTVYVAVDEDGKERKAKDGVRYVQVI---DDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVF 272 (552) Q Consensus 196 p~~v~v~~~~~g~~~~~~~~~~y~~~~---~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~ 272 (552) |++|++..+.+|..++ +++.. .......|+++||||++.+ +.++++|+||+.++..+|..+.+++++ T Consensus 129 ~~~v~v~~~~~g~~~~------~~~~~~~~~~~~~~~~~~~evihir~~----~~d~~~G~s~i~~~~~~i~~~~~~~~~ 198 (416) T protein:vir:45 129 TSEIELKSDARGRLYY------FHQRIDSNGNNIERNVKFEDMLDIKFY----SLDGINGLSLLDTLSRTIESDNNGKDF 198 (416) T ss_pred CceeEEEECCCccEEE------EEEEecCCCceeEEEEccccEEEeccC----CCCCccccCHHHHHHHHHHHHHHHHHH Confidence 9999999988886432 22222 2234568999999998753 456799999999999999999999999 Q ss_pred HHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHH Q lcl|NC_020081. 273 NARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINV 352 (552) Q Consensus 273 ~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 352 (552) +.++|+||++|+|||++++.. .++++++++++.|++.++|..|+|+++|+ ++|++|++++++++|+||+|.+++++++ T Consensus 199 ~~~~f~ng~~~~gil~~~~~~-~~~~~~~~~~~~~~~~~~g~~nag~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 276 (416) T protein:vir:45 199 LNNFLRNGTHAGGILKMKGVL-DNKKARDRAREEFHKSFSGTKQAGKVVVL-DESMTFDQLEVDTEVLKLIRENKSSTRE 276 (416) T ss_pred HHHHHhccCCCcEEEEeCCCC-CCHHHHHHHHHHHHHHhcCccccCceeec-CCCceeEeccCCHHHHHHHHHHHHHHHH Confidence 999999999999999998643 46788999999999999999999997655 5799999999999999999999999999 Q ss_pred HHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecc-----ccc Q lcl|NC_020081. 353 ICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNF-----VGG 427 (552) Q Consensus 353 Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f-----~~~ 427 (552) ||++|||||++||.... ..+.+++... +.+||.|+++.||++||++|++++. +++|+| ++. T Consensus 277 Ia~~fgVPp~~lg~~~~------------~~~~~~~~~~-~~~~l~P~~~~ie~~ln~~l~~~~~-~~~~~f~~~~l~~~ 342 (416) T protein:vir:45 277 IAGVFGIPLHKFGIETA------------NMSITDANLD-YLSTLKPYITCVCAELNFKFNDEYV-NREFKFDTTEIRVV 342 (416) T ss_pred HHHHhCCCHHHcCCCCC------------CccHHHHHHH-HHHHHHHHHHHHHHHHhhhcccccc-CceEEEechhhhcc Confidence 99999999999986321 1234555554 4569999999999999999988653 445554 577 Q ss_pred ChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCC--eeeccccccchhhhccccccccccCCCCCccCcccCCCCC Q lcl|NC_020081. 428 DAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGD--VTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDG 505 (552) Q Consensus 428 d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD--~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (552) |.+++++.++.+ +.+|+||+||+|+++||||+|||| ++++++|+++++.+++++........ .... T Consensus 343 D~~~~~~~~~~~--~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~---~~~k------- 410 (416) T protein:vir:45 343 DEKTQAEIDKIN--IDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATD---KKLK------- 410 (416) T ss_pred CHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccccccCcccccccc---cccC------- Confidence 888888877654 457999999999999999999988 67889999998876543332211100 0000 Q ss_pred CCCCCCCCCCcccccCCCCcc Q lcl|NC_020081. 506 NMDNVNGKDSFNQNVGKDGQS 526 (552) Q Consensus 506 ~~~~~~~~~~~~~~~~~~~~~ 526 (552) |+|.++ T Consensus 411 ---------------gGe~n~ 416 (416) T protein:vir:45 411 ---------------GGEENE 416 (416) T ss_pred ---------------CCCCCC Confidence 000000 No 44 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=100.00 E-value=2.8e-76 Score=434.81 Aligned_cols=405 Identities=16% Similarity=0.176 Sum_probs=284.3 Q ss_pred hcccccccccccccccc-ccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEE-PIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKG 115 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~ 115 (552) |.-+.+. +++...... .+...+..-+++. ...+..... .-|...+.+.+||.+++..++ + T Consensus 1 Mg~f~~~-~~r~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~----~~al~~~~v~~cv~~Ia~~iA-----------~ 61 (416) T protein:vir:81 1 MGIFYKN-EKRDLQYNEDDLQMMVQTLPGFQ---GTKLRQYKD----IEAIRHSDIFTAVMMIASDLA-----------R 61 (416) T ss_pred CCccccc-ccccccCCCcchhHHHHHhcccc---ccCccccch----hhhhcchHHHHHHHHHHHhhc-----------c Confidence 3322211 111111110 0000000001100 011111000 111223345778776665544 4 Q ss_pred cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEec Q lcl|NC_020081. 116 VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVD 195 (552) Q Consensus 116 ~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~ 195 (552) +++.+.. +++ ..+.|++..+| ..+||+.||+++||+.++.+++++||+|++|+|+..|+|++||||+ T Consensus 62 ~p~~~~~---~~~-----~~~~~~~~~lL-----~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~ 128 (416) T protein:vir:81 62 MPIRVTV---NGQ-----INYSDRIVNLL-----NTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRK 128 (416) T ss_pred CceEEec---Ccc-----ccccchHHHHH-----hcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEc Confidence 4555432 111 11223343333 2478999999999999999999999999999999999999999999 Q ss_pred CceeEEEECCCcccccccceeEEEEEc---CCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 196 ASTVYVAVDEDGKERKAKDGVRYVQVI---DDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVF 272 (552) Q Consensus 196 p~~v~v~~~~~g~~~~~~~~~~y~~~~---~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~ 272 (552) |++|++..+.+|..++ +++.. .......|+++||||++.+ +.++++|+||+.++..+|..+.+++++ T Consensus 129 ~~~v~v~~~~~g~~~~------~~~~~~~~~~~~~~~~~~~evihir~~----~~d~~~G~s~i~~~~~~i~~~~~~~~~ 198 (416) T protein:vir:81 129 TSEIELKSDARGRLYY------FHQRIDSNGNNIERNVKFEDMLDIKFY----SLDGINGLSLLDTLSRTIESDNNGKDF 198 (416) T ss_pred CceeEEEECCCccEEE------EEEEecCCCceeEEEEccccEEEeccC----CCCCccccCHHHHHHHHHHHHHHHHHH Confidence 9999999988886432 22222 2234568999999998753 456799999999999999999999999 Q ss_pred HHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHH Q lcl|NC_020081. 273 NARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINV 352 (552) Q Consensus 273 ~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 352 (552) +.++|+||++|+|||++++.. .++++++++++.|++.++|..|+|+++|+ ++|++|++++++++|+||+|.+++++++ T Consensus 199 ~~~~f~ng~~~~gil~~~~~~-~~~~~~~~~~~~~~~~~~g~~nag~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 276 (416) T protein:vir:81 199 LNNFLRNGTHAGGILKMKGVL-DNKKARDRAREEFHKSFSGTKQAGKVVVL-DESMTFDQLEVDTEVLKLIRENKSSTRE 276 (416) T ss_pred HHHHHhccCCCcEEEEeCCCC-CCHHHHHHHHHHHHHHhcCccccCceeec-CCCceeEeccCCHHHHHHHHHHHHHHHH Confidence 999999999999999998643 46788999999999999999999997655 5799999999999999999999999999 Q ss_pred HHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecc-----ccc Q lcl|NC_020081. 353 ICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNF-----VGG 427 (552) Q Consensus 353 Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f-----~~~ 427 (552) ||++|||||++||.... ..+.+++... +.+||.|+++.||++||++|++++. +++|+| ++. T Consensus 277 Ia~~fgVPp~~lg~~~~------------~~~~~~~~~~-~~~~l~P~~~~ie~~ln~~l~~~~~-~~~~~f~~~~l~~~ 342 (416) T protein:vir:81 277 IAGVFGIPLHKFGIETA------------NMSITDANLD-YLSTLKPYITCVCAELNFKFNDEYV-NREFKFDTTEIRVV 342 (416) T ss_pred HHHHhCCCHHHcCCCCC------------CccHHHHHHH-HHHHHHHHHHHHHHHHhhhcccccc-CceEEEechhhhcc Confidence 99999999999986321 1234555554 4569999999999999999988653 445554 577 Q ss_pred ChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCC--eeeccccccchhhhccccccccccCCCCCccCcccCCCCC Q lcl|NC_020081. 428 DAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGD--VTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDG 505 (552) Q Consensus 428 d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD--~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (552) |.+++++.++.+ +.+|+||+||+|+++||||+|||| ++++++|+++++.+++++........ .... T Consensus 343 D~~~~~~~~~~~--~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~---~~~k------- 410 (416) T protein:vir:81 343 DEKTQAEIDKIN--IDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATD---KKLK------- 410 (416) T ss_pred CHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccccccCcccccccc---cccC------- Confidence 888888877654 457999999999999999999988 67889999998876543332211100 0000 Q ss_pred CCCCCCCCCCcccccCCCCcc Q lcl|NC_020081. 506 NMDNVNGKDSFNQNVGKDGQS 526 (552) Q Consensus 506 ~~~~~~~~~~~~~~~~~~~~~ 526 (552) |+|.++ T Consensus 411 ---------------gGe~n~ 416 (416) T protein:vir:81 411 ---------------GGEENE 416 (416) T ss_pred ---------------CCCCCC Confidence 000000 No 45 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=100.00 E-value=1.1e-75 Score=431.64 Aligned_cols=406 Identities=15% Similarity=0.169 Sum_probs=295.4 Q ss_pred hccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGV 116 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~ 116 (552) |+.+.+- ..+....+.+....+ .++.+ ...+..+.. ..+.+.....+.+++||.++++.+ +++ T Consensus 1 Mg~~~~~-~~~~~~~~~~~~~~~-~~~~~-~~~~~~~~~---~~~~~~~~~~~~v~~~i~~ia~~i-----------a~l 63 (423) T protein:vir:81 1 MGFLQKL-GLAPSVVATPEPIEL-VGPIF-ESLKLSTKN---MTVEQIWEDQPHLRTVTTFIARNV-----------ASL 63 (423) T ss_pred CchhHhh-ccccccccCcccccc-ccccc-cccccccch---hhHHHHHHhhhHHHHHHHHHHHhH-----------hhC Confidence 5544331 122222333222211 11111 111111111 123344445566788887766654 456 Q ss_pred ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCC--CCEEEEEEe Q lcl|NC_020081. 117 GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKL--GDLHNFKAV 194 (552) Q Consensus 117 ~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~--G~~~~L~~l 194 (552) ++.+..++.++.. +..+.|++..+|. .||++||+++||+.++.+++++||+|++|.|+.. +.+..|+|+ T Consensus 64 p~~~~~~~~dg~~---~~~~~~~~~~ll~------~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~ 134 (423) T protein:vir:81 64 QLQAFERVEDGGR---ERVREGHLARVCK------LANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPI 134 (423) T ss_pred ceEEEEEecCCce---eeeccchHHHHhh------cCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeec Confidence 7776544433321 2223354444432 5789999999999999999999999999999863 567889999 Q ss_pred cCceeEEEECCCcccccccceeEEEEE---cCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 195 DASTVYVAVDEDGKERKAKDGVRYVQV---IDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEV 271 (552) Q Consensus 195 ~p~~v~v~~~~~g~~~~~~~~~~y~~~---~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~ 271 (552) ++..|++....++.. ..+|... ..++....++++||||++.. .+.+.++|+||+..++.+|..+.++++ T Consensus 135 ~~~~v~~~~~~~~~~-----~~~Y~~~~~~~~~g~~~~~~~~evih~r~~---~~~~~~~G~spi~~~~~~i~~~~~~~~ 206 (423) T protein:vir:81 135 PVSWVQRRAYKDGWG-----SLDYIIIESGDNDGRSVKVPGERVIHRHGY---NPKTMKRGKSPVQSLRDILGEQIEAAI 206 (423) T ss_pred ccceeeeeeccCCCc-----ceEEEEEEecCCCceEEEEcccceEEecCC---CCCCccccccHHHHHHHHHHHHHHHHH Confidence 999998877655421 1233322 23456678999999998742 233446899999999999999999999 Q ss_pred HHHHHHhccCCCceEEEeCCC---CCCCHHHHHHHHHHHHHHhc-cccccccceeeccCCceeeeccCchhHHHHHHHHH Q lcl|NC_020081. 272 FNARFFAQGGTTRGLLHIKTG---QEQSNQALTSFRREWTSMFS-GINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLN 347 (552) Q Consensus 272 ~~~~~f~ng~~p~gil~~~~~---~~~s~~~~~~~~~~~~~~~~-G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~ 347 (552) ++.++|+||++|+|||+++.. ..+++++++++++.|++.++ |..|+|+++|+ ++|++|++++++++|+||+|+++ T Consensus 207 ~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl-~~g~~~~~l~~s~~d~q~~e~~~ 285 (423) T protein:vir:81 207 FRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTLLL-EDGMKAENFHTTSKDEQTVETTK 285 (423) T ss_pred HHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcceec-CCCceEEeccCChhhHHHHHHHH Confidence 999999999999999998642 24789999999999999985 67889997655 57999999999999999999999 Q ss_pred HHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc---cceeecc Q lcl|NC_020081. 348 YLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG---GDYVFNF 424 (552) Q Consensus 348 ~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~---~~~~~~f 424 (552) +++++||++|||||++||+.+.+ +|+|+|++.+.|+++||.|+++.||++|+++|++..+ .+++|+| T Consensus 286 ~~~~eIa~~fgVPp~~lg~~~~~----------t~sn~e~~~~~f~~~~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~f 355 (423) T protein:vir:81 286 LSLQTVAQVYGINPTMVGQLDNA----------NYSNVREFRKALYGDNLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEF 355 (423) T ss_pred hhHHHHHHHhCCCHHHhcCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccCccEEEe Confidence 99999999999999999987654 4789999999999999999999999999999998764 3566666 Q ss_pred -----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcc Q lcl|NC_020081. 425 -----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQ 499 (552) Q Consensus 425 -----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (552) +++|.+++.+.+.++.. ..|+||+||+|+++||||+||||++++|+|+.+.+.... T Consensus 356 d~~~llr~d~~~r~~~~~~~l~-~~G~~T~NE~R~~~gl~p~~gGD~~~~p~n~~~~~~~~~------------------ 416 (423) T protein:vir:81 356 NLEEKLRASFEEAAEIKRAAVG-NVAWMTINEVRAMDNLPSIDGGDDLARPLNTEFGDSEDA------------------ 416 (423) T ss_pred cchhhhccCHHHHHHHHHHHHh-CCCCcCHHHHHHHhCCCCCCCcceeecccccccCccCCC------------------ Confidence 57788888887665322 358999999999999999999999999998776432110 Q ss_pred cCCCCCCC Q lcl|NC_020081. 500 QTGYDGNM 507 (552) Q Consensus 500 ~~~~~~~~ 507 (552) +++..++ T Consensus 417 -~~~~~~t 423 (423) T protein:vir:81 417 -PGEEVET 423 (423) T ss_pred -CCCCCCC Confidence 0001111 No 46 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=100.00 E-value=5.9e-75 Score=427.53 Aligned_cols=402 Identities=13% Similarity=0.109 Sum_probs=290.1 Q ss_pred cccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFC 106 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~ 106 (552) |.+.....++.-.+ ..+....++.+.+.+.+.... +..+- -...+...+.+.+||..++..++ T Consensus 1 ~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~--~~~~v------~~~~~~~~~~V~~ci~~Ia~~ia--- 63 (409) T protein:vir:93 1 MAKENIVTRIKKKL------IDNWIDQSTSKLYDFSPWKNR--SFWGV------INNTLETNETIFSAITKLSNSMA--- 63 (409) T ss_pred CCccchhhhhhhhh------hhhhhccccccccccccccCc--ccccc------chhhhhccHHHHHHHHHHHHhhh--- Confidence 22222222211110 111111222222222221110 11110 01123344567888877666654 Q ss_pred HHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCC Q lcl|NC_020081. 107 TPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLG 186 (552) Q Consensus 107 ~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G 186 (552) .+++.+..+... ..|++..+|. .+||++||+++||+.++.+++++||+|++|+|+..| T Consensus 64 --------~lp~~~~~~~~~---------~~~~~~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G 121 (409) T protein:vir:93 64 --------SLPLKMYEDYKV---------VNTEVSDLLT-----VSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYH 121 (409) T ss_pred --------hCceeEeecccc---------ccchHHHHHh-----hhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCC Confidence 455555433211 1244444332 368899999999999999999999999999999999 Q ss_pred CEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHH Q lcl|NC_020081. 187 DLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYH 266 (552) Q Consensus 187 ~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~ 266 (552) ++++||||+|++|++..+.++. .++|.+...++....|+++||||++.++ +.+++||+||+.++..++..+ T Consensus 122 ~~~~L~~l~~~~v~~~~~~~~~------~~~y~~~~~~g~~~~~~~~eVih~r~~~---~~~~~~G~s~i~~~~~~i~~~ 192 (409) T protein:vir:93 122 QPSKLFLLNPDVVEMLIENQSR------ELYYSIHAATGNKLIVHNMDMLHFKHIV---ASNMVQGISPIDVLKNTTDFD 192 (409) T ss_pred cEEEEEEEcCceeEEEEeCCCc------EEEEEEEcCCceEEEEccccEEEeCCCC---CCCccccccHHHHHHHHHHHH Confidence 9999999999999999887764 3456666666777889999999997543 456799999999999999999 Q ss_pred HHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHH Q lcl|NC_020081. 267 DNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWL 346 (552) Q Consensus 267 ~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~ 346 (552) .++++++ ++.++..++++++.+ ..+++++++++++.|++.+. ++|+++++ ++|++|+++++++.|+||+|++ T Consensus 193 ~~~~~~~--~~~~~~~~~~i~~~~--~~l~~e~~~~~~~~~~~~~~---~~g~~~vl-~~g~~~~~l~~~~~d~q~~e~r 264 (409) T protein:vir:93 193 NAVRTFN--LTEMQKPDSFMLKYG--SNVGKEKRQQVLEDFKQYYE---ENGGILFQ-EPGVEIEPLPKKYVSEDIVASE 264 (409) T ss_pred HHHHHHH--HHhcCCCCceEEecC--CCCCHHHHHHHHHHHHHHhh---cCCCeeec-CCCceEEEcCCChhHHHHHHHH Confidence 9998884 566666666676654 45799999999999998774 56776554 6799999999999999999999 Q ss_pred HHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc--cceeecc Q lcl|NC_020081. 347 NYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG--GDYVFNF 424 (552) Q Consensus 347 ~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~--~~~~~~f 424 (552) ++++++||++|||||++||.... .+++|++++.+.|++.||.|+++.||++||++|+++.+ .+++|+| T Consensus 265 ~~~~~~Ia~~fgVPp~~lg~~~~----------~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~~~~~~f 334 (409) T protein:vir:93 265 NLTRERVANVFQLPSVFLNARSN----------TNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKF 334 (409) T ss_pred HHHHHHHHHHhCCCHHHhCCCCC----------CCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEe Confidence 99999999999999999996543 35799999999999999999999999999999998765 3466665 Q ss_pred -----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcc Q lcl|NC_020081. 425 -----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQ 499 (552) Q Consensus 425 -----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (552) ++.|.+++++.++.+ +.+|++|+||+|+++|+||+||||++++++|+++++.....+.. .++++.+..+ T Consensus 335 d~~~ll~~d~~~~~~~~~~~--~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~--~~gG~~n~~e-- 408 (409) T protein:vir:93 335 NVKSYLRADSATQAEVYFKA--VRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLELRKS--LKGGDKNVNE-- 408 (409) T ss_pred echhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcCeeeecccccccccchhhccc--ccCCCCCcCC-- Confidence 467888888766644 45799999999999999999999999999999998765433221 1111111000 Q ss_pred cCCCCCCCCCCCC Q lcl|NC_020081. 500 QTGYDGNMDNVNG 512 (552) Q Consensus 500 ~~~~~~~~~~~~~ 512 (552) | T Consensus 409 ------------~ 409 (409) T protein:vir:93 409 ------------S 409 (409) T ss_pred ------------C Confidence 0 No 47 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=100.00 E-value=8.9e-75 Score=426.55 Aligned_cols=402 Identities=14% Similarity=0.112 Sum_probs=291.6 Q ss_pred cccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFC 106 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~ 106 (552) |.++.....+-..+ ..+....++...++|.+... .+..+- -++-+...+.+.+||.+++..++ T Consensus 1 ~~~~~~~~~~k~~~------~~~~~~~~~~~~~~~~~~~~--~~~~~v------~~~~a~~~~~v~~~i~~Ia~~ia--- 63 (409) T protein:vir:94 1 MAKENIVTRIKKKL------IDNWIDQSASKLYDFSPWKN--KSFWGV------INNTLETNETIFSAITKLSNSMA--- 63 (409) T ss_pred CcccccchhhhhHH------hhhhhcCCcccccccccccC--cccccc------chhhhhccHHHHHHHHHHHHhhh--- Confidence 44433333321111 11111222222222222111 111110 11123345667888887766654 Q ss_pred HHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCC Q lcl|NC_020081. 107 TPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLG 186 (552) Q Consensus 107 ~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G 186 (552) .+++.+..+... ..|++..+|. .+||++||+++||+.++.+++++||+|++|+|+..| T Consensus 64 --------~lp~~~~~~~~~---------~~~~~~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G 121 (409) T protein:vir:94 64 --------SLPLKMYEDYKV---------VNTEVSDLLT-----VSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYH 121 (409) T ss_pred --------hCceeEeecccc---------cchhHHHHHh-----hhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCC Confidence 445555432211 1244444332 368899999999999999999999999999999999 Q ss_pred CEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHH Q lcl|NC_020081. 187 DLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYH 266 (552) Q Consensus 187 ~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~ 266 (552) +|++||||+|++|++..+.++.. ++|.+...++....|+++||||++.. ++.++++|+||+.++..++..+ T Consensus 122 ~~~~L~~l~~~~v~v~~~~~~~~------~~y~~~~~~g~~~~~~~~dvih~r~~---~~~~~~~G~s~l~~~~~~i~~~ 192 (409) T protein:vir:94 122 QPSKLFLLNPDVVEMLIENQSRE------LYYSIHAATGNKLIVHNMDMLHFKHI---VASNMVQGISPIDVLKNTTDFD 192 (409) T ss_pred cEEEEEEEcCceeEEEEeCCCcE------EEEEEEcCCceEEEEccccEEEecCC---CCCCccccccHHHHHHHHHHHH Confidence 99999999999999998877642 34556566667778999999999753 3567799999999999999999 Q ss_pred HHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHH Q lcl|NC_020081. 267 DNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWL 346 (552) Q Consensus 267 ~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~ 346 (552) .++++++ ++.++..++++++.+ ..+++++++++++.|++.++ ++|++++ +++|++|++++++++|+||+|.+ T Consensus 193 ~~~~~~~--~~~~~~~~~~i~~~~--~~l~~e~~~~~~~~~~~~~~---~~g~~~v-l~~g~~~~~l~~~~~d~q~~e~~ 264 (409) T protein:vir:94 193 NAVRTFN--LTEMQKPDSFMLKYG--SNVGKEKRQQVLEDFKQYYE---ENGGILF-QEPGVEIEPLPKKYVSEDIVASE 264 (409) T ss_pred HHHHHHH--HHhcCCCCeeEEecC--CCCCHHHHHHHHHHHHHHhh---cCCCeee-cCCCceEEEcCCChhHHHHHHHH Confidence 9998885 555555666676654 35799999999999998775 5677654 46799999999999999999999 Q ss_pred HHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc--cceeecc Q lcl|NC_020081. 347 NYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG--GDYVFNF 424 (552) Q Consensus 347 ~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~--~~~~~~f 424 (552) ++++++||++|||||++||.... .+++|++++.+.|++.||.|+++.||++||++|+++.+ .+++|+| T Consensus 265 ~~~~~~Ia~~fgVPp~~lg~~~~----------~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~i~f 334 (409) T protein:vir:94 265 NLTRERVANVFQLPSVFLNARSN----------TNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKF 334 (409) T ss_pred HHHHHHHHHHhCCCHHHhCCCCC----------CCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEe Confidence 99999999999999999996543 35899999999999999999999999999999998765 3466665 Q ss_pred -----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcc Q lcl|NC_020081. 425 -----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQ 499 (552) Q Consensus 425 -----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (552) ++.|.+++++.++.+ +.+|+||+||+|+++|+||+||||++++++|+++++.....+.. .++++.+..+ T Consensus 335 d~~~ll~~d~~~~~~~~~~~--~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~--~kGG~~n~~e-- 408 (409) T protein:vir:94 335 NVKSYLRADSATQAEVYFKA--VRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLELRKS--LKGGDKNVNE-- 408 (409) T ss_pred echhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcCeEeecccccccccchhhccc--ccCCCCCcCC-- Confidence 467888888766643 45799999999999999999999999999999998765432211 1111111000 Q ss_pred cCCCCCCCCCCCC Q lcl|NC_020081. 500 QTGYDGNMDNVNG 512 (552) Q Consensus 500 ~~~~~~~~~~~~~ 512 (552) + T Consensus 409 ------------~ 409 (409) T protein:vir:94 409 ------------S 409 (409) T ss_pred ------------C Confidence 0 No 48 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=100.00 E-value=5e-75 Score=427.95 Aligned_cols=405 Identities=14% Similarity=0.156 Sum_probs=286.4 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVS 103 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~ 103 (552) |+.+. +...+.+..+ ......+ .+......+..+..... ..|...+.+.+||.+++..++ T Consensus 1 Mgl~~---------~~f~~~~~~~---~~~~~~~----~~~~~~~~~~~g~~v~~----~~al~~~~v~~~v~~ia~~iA 60 (409) T protein:vir:84 1 MSLFT---------RIFSGPSEER---TLTKISG----IPSPAEDWAMHGDRPGA----NSAMTLGAFYACVTLLADTVA 60 (409) T ss_pred Cchhh---------hhhcCCCccc---ccccccc----cccccchhhccCcccch----hhhhccHHHHHHHHHHHHhhh Confidence 33321 0000111111 0011100 01000111111111111 122344567888877666654 Q ss_pred HHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEE-E Q lcl|NC_020081. 104 MFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELV-Y 182 (552) Q Consensus 104 ~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~-r 182 (552) +++|.+..++...+ ...|++..+|. .+||++||+++||+.++.+++++||+|++|. + T Consensus 61 -----------~lp~~~~~~~~~~~------~~~~~l~~lL~-----~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~ 118 (409) T protein:vir:84 61 -----------SLSIDAYRKKDNVR------IPVSPAPKLLE-----STPYPGLTWFDWLWMLMESLAVTGNAFGYISAR 118 (409) T ss_pred -----------hCceEEEEecCCcc------cccchHHHHhh-----ccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEE Confidence 45666654433322 23355554442 4789999999999999999999999999986 6 Q ss_pred CCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHH Q lcl|NC_020081. 183 DKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNH 262 (552) Q Consensus 183 ~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~ 262 (552) +..|+|++||||+|++|++....++... .+++.+..++ ..|+++||||++.+. ..+.++|+||+..+..+ T Consensus 119 ~~~g~~~~L~~l~p~~v~v~~~~~~~~~-----~~~~~~~~~g--~~~~~~dvih~~~~~---~~~~~~G~s~i~~~~~~ 188 (409) T protein:vir:84 119 DEANRPTAIMPIHPDCIHVTDAKDEDGD-----WIEPVYRIDG--KVVPNHRIMHIKRYP---VAGCALGMSPIEKAASA 188 (409) T ss_pred CCCCceEEEEEEcCceeEEEEcCCCcce-----EEEEEecCCc--eEEchhhEEEecCCC---CCcccccccHHHHHHHH Confidence 8899999999999999998766544221 1222222233 458899999998653 23347999999999999 Q ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHH Q lcl|NC_020081. 263 LQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEF 342 (552) Q Consensus 263 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~ 342 (552) |..+.++++++.++|+||++|+|||++++ .++++++++++++|.+.+ .|+|++++ +++|++|+++++++.|+|| T Consensus 189 i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~---~n~g~~~v-l~~g~~~~~~~~~~~d~q~ 262 (409) T protein:vir:84 189 IGLGLAAERYGLRWFRDSANPSGILSSDA--DLTPDQVKQTQKQWIQSH---HNRRLPAV-MSAGIKWQSVSITPNESQF 262 (409) T ss_pred HHHHHHHHHHHHHHHhcCCCccEEEecCC--CCCHHHHHHHHHHHHHHh---ccCCCeee-cCCCceEEEccCChhHHHH Confidence 99999999999999999999999999876 468999999999998765 46788655 5679999999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceee Q lcl|NC_020081. 343 EKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVF 422 (552) Q Consensus 343 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~ 422 (552) +|++++++++||++|||||++||+.+.+++ .++|++++.+.|++.||.||++.||++||++|.. +..++| T Consensus 263 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--------~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~L~~--g~~i~f 332 (409) T protein:vir:84 263 LETRSFQRSEIAMWFRIPPHMIGDVEKSTS--------WGTGIEEQGINFVRHTLLPWLRCIEQALDTFLPR--GQFVKF 332 (409) T ss_pred HHHHHHHHHHHHHHhCCCHHHhCCCCCccc--------ccchHHHHHHHHHHHHHHHHHHHHHHHHHHhccC--CCeEEE Confidence 999999999999999999999998876653 3578999999999999999999999999999843 333444 Q ss_pred c---ccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcc Q lcl|NC_020081. 423 N---FVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQ 499 (552) Q Consensus 423 ~---f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (552) + ++++|.+++++.+..+ +.+|+||+||+|+++||||+||||++++++|+++++.+...+..... T Consensus 333 d~~~l~~~d~~~~~~~~~~~--~~~G~~t~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~----------- 399 (409) T protein:vir:84 333 NVDGLMRGDVTARFTAYQMG--LQNGIWSVNEVRAWEDAPPIPEGDIHLQPMNFVPLGYVPPEEPAQEP----------- 399 (409) T ss_pred echhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcceeeecccccccccCCccccCcCC----------- Confidence 3 3577888888766644 45799999999999999999999999999999988764332211100 Q ss_pred cCCCCCCCCCCCCCCCccc Q lcl|NC_020081. 500 QTGYDGNMDNVNGKDSFNQ 518 (552) Q Consensus 500 ~~~~~~~~~~~~~~~~~~~ 518 (552) .+.+..+| +. T Consensus 400 -----~~~~~~~g----n~ 409 (409) T protein:vir:84 400 -----QPNSATEG----NK 409 (409) T ss_pred -----CCCCccCC----CC Confidence 00000000 00 No 49 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=100.00 E-value=1.2e-74 Score=425.91 Aligned_cols=401 Identities=13% Similarity=0.113 Sum_probs=288.3 Q ss_pred cccccchhhhhccccccccc-cccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSN-KPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMF 105 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~ 105 (552) |.+...-.++ |+. -.+.-..++.+.++|.+.. ..+..+- -.+.+...+.+.+||.+++..++ T Consensus 1 ~~~~~~~~~~-------k~~~~~~~~~~~~~~~~~~~~~~--~~~~~~v------~~~~a~~~~~V~~ci~~ia~~ia-- 63 (409) T protein:vir:96 1 MAKENIVTRI-------KKKLIDNWIDQSASKLYDFSPWK--NKSFWGV------INNTLETNETIFSAITKLSNSMA-- 63 (409) T ss_pred Cccccchhhh-------hhHHhhhhhcccccccccccccc--Ccccccc------chhhHhhhHHHHHHHHHHHHhhh-- Confidence 2221111111 111 1111122223322332211 0111111 01123345667889887776654 Q ss_pred HHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCC Q lcl|NC_020081. 106 CTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKL 185 (552) Q Consensus 106 ~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~ 185 (552) .+++.+..+... ..|++..+|. .+||++||+++||+.++.+++++||+|++|+|+.. T Consensus 64 ---------~lp~~~~~~~~~---------~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~ 120 (409) T protein:vir:96 64 ---------SLPLKMYEDYKV---------VNTEVSDLLT-----VSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY 120 (409) T ss_pred ---------hCceEEeecccc---------cchhHHHHHh-----hhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCC Confidence 445555432211 1244444432 36889999999999999999999999999999999 Q ss_pred CCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHH Q lcl|NC_020081. 186 GDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQY 265 (552) Q Consensus 186 G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~ 265 (552) |++++||||+|+.|++..+.++. ..+|.+...++....|+++||||++..+ +.++++|+||+..+..++.. T Consensus 121 G~~~~L~~l~~~~v~v~~~~~~~------~~~y~~~~~~g~~~~~~~~evih~r~~~---~~~~~~G~s~l~~~~~~i~~ 191 (409) T protein:vir:96 121 HQPSKLFLLNPDVVEMLIENQSR------ELYYSIHAATGNKLIVHNMDMLHFKHIV---ASNMVQGISPIDVLKNTTDF 191 (409) T ss_pred CcEEEEEEEcCceeEEEEeCCCc------EEEEEEEcCCceEEEEccccEEEeCCCC---CCCccccccHHHHHHHHHHH Confidence 99999999999999999887664 2456666666777889999999997533 55779999999999999999 Q ss_pred HHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHH Q lcl|NC_020081. 266 HDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKW 345 (552) Q Consensus 266 ~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~ 345 (552) +.+++++. ++.++..++++++.+ ..++++++++++++|++.++ |+|++++ +++|++|+++++++.|+||+|+ T Consensus 192 ~~~~~~~~--~~~~~~~~~~i~~~~--~~l~~e~~~~~~~~~~~~~~---n~g~~~v-l~~g~~~~~l~~~~~d~q~~e~ 263 (409) T protein:vir:96 192 DNAVRTFN--LTEMQKPDSFMLKYG--SNVSTEKRQQVLEDFKQYYE---ENGGILF-QEPGVEIEPLPKKYVSEDIVAS 263 (409) T ss_pred HHHHHHHH--HHhcCCCceeEEecC--CCCCHHHHHHHHHHHHHHhh---cCCCeee-cCCCceEEEcCCChhHHHHHHH Confidence 99988874 444444455565543 46799999999999998875 5677654 4679999999999999999999 Q ss_pred HHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc--cceeec Q lcl|NC_020081. 346 LNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG--GDYVFN 423 (552) Q Consensus 346 ~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~--~~~~~~ 423 (552) +++++++||++|||||++||...++ +++|+|++.+.|+++||.|+++.||++||++|+++.+ .+++|+ T Consensus 264 ~~~~~~~Ia~~fgVPp~~lg~~~~~----------~~s~~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~ 333 (409) T protein:vir:96 264 ENLTRERVANVFQLPSIFLNARSNT----------NFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFK 333 (409) T ss_pred HHHHHHHHHHHhCCCHHHhCCCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEE Confidence 9999999999999999999975543 5799999999999999999999999999999998765 356666 Q ss_pred c-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCc Q lcl|NC_020081. 424 F-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLA 498 (552) Q Consensus 424 f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (552) | ++.|.+++++.+..+ +.+|+||+||+|+++|+||+||||++++++|+++++.....+.. .++++.+..+ T Consensus 334 fd~~~ll~~d~~~~~e~~~~~--~~~G~~T~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~~~--~~gG~~n~~e- 408 (409) T protein:vir:96 334 FNVKSYLRADSATQAEVYFKA--VRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLELRKS--LKGGDKNVNE- 408 (409) T ss_pred eechhhhccCHHHHHHHHHHH--HhCCCCCHHHHHHHhCCCCCCCcceeeecccccccccchhhccc--ccCCCCCcCC- Confidence 5 467888888776643 45799999999999999999999999999999988654332211 1111111000 Q ss_pred ccCCCCCCCCCCCC Q lcl|NC_020081. 499 QQTGYDGNMDNVNG 512 (552) Q Consensus 499 ~~~~~~~~~~~~~~ 512 (552) + T Consensus 409 -------------~ 409 (409) T protein:vir:96 409 -------------S 409 (409) T ss_pred -------------C Confidence 0 No 50 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=100.00 E-value=1.2e-74 Score=425.83 Aligned_cols=405 Identities=14% Similarity=0.115 Sum_probs=288.5 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVS 103 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~ 103 (552) |+-..+......+.+.+... ....+..+...+.+.. ..+..+. -...+...+.+.+||.++++.++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~--~~~~~~v------~~~~a~~~~~v~~~i~~ia~~iA 66 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDN------WIDQSTSKLYDFSPWK--NRSFWGV------INNTLETNETIFSAITKLSNSMA 66 (412) T ss_pred CccchhhhhhhhhhhhHhhh------hhcccccccccccccC--Ccccccc------chhhhhccHHHHHHHHHHHHhHh Confidence 33322211111111110000 0011111111111000 0011110 11223345667888877666654 Q ss_pred HHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEEC Q lcl|NC_020081. 104 MFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYD 183 (552) Q Consensus 104 ~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~ 183 (552) .+++.+..+... ..|++..+|. .+||+.||+++||+.++.+++++||+|++|+|+ T Consensus 67 -----------~lp~~~~~~~~~---------~~~~~~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~ 121 (412) T protein:vir:26 67 -----------SLPLKMYEDYKV---------VNTEVSDLLT-----VSPNNSLSSFDFINQIETIRNEKGNAYVLIERD 121 (412) T ss_pred -----------hCceeEeecccc---------ccchHHHHHH-----hhcccCCCHHHHHHHHHHHHhhcCceEEEEEEC Confidence 455555432211 1244444332 368899999999999999999999999999999 Q ss_pred CCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHH Q lcl|NC_020081. 184 KLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHL 263 (552) Q Consensus 184 ~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i 263 (552) ..|++++|+||+|+.|++..+.++. .++|.+...++....|+++||||++.. ++.+++||+||+.++..++ T Consensus 122 ~~G~~~~L~~l~~~~v~v~~~~~~~------~~~y~~~~~~g~~~~~~~~evih~~~~---~~~~~~~G~s~i~~~~~~i 192 (412) T protein:vir:26 122 IYHQPSKLFLLNPDVVEMLIENQSR------ELYYSIHAATGNKLIVHNMDMLHFKHI---VASNMVQGISPIDVLKNTT 192 (412) T ss_pred CCCcEEEEEEEcCceeEEEEeCCCc------EEEEEEEcCCceEEEEccccEEEeCCC---CCCCCcccccHHHHHHHHH Confidence 9999999999999999999887764 345666667777788999999999753 3456799999999999999 Q ss_pred HHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHH Q lcl|NC_020081. 264 QYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFE 343 (552) Q Consensus 264 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~ 343 (552) ..+.++++++ ++.++..++++++.+ ..+++++++++++.|++.+. ++|++++ +++|++|+++++++.|+||+ T Consensus 193 ~~~~a~~~~~--~~~~~~~~~~i~~~~--~~l~~e~~~~~~~~~~~~~~---~~g~~~v-l~~g~~~~~l~~~~~d~q~~ 264 (412) T protein:vir:26 193 DFDNAVRTFN--LTEMQKPDSFMLKYG--SNVGKEKRQQVLEDFKQYYE---ENGGILF-QEPGVEIEPLPKKYVSEDIV 264 (412) T ss_pred HHHHHHHHHH--HHhcCCCCceEEecC--CCCCHHHHHHHHHHHHHHhh---cCCCeee-cCCCceEEEcCCChhHHHHH Confidence 9999998884 555566666676654 35799999999999998764 5677654 56799999999999999999 Q ss_pred HHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccccc--cee Q lcl|NC_020081. 344 KWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGG--DYV 421 (552) Q Consensus 344 e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~--~~~ 421 (552) |++++++++||++|||||++||.... .+++|++++.+.|++.||.|+++.||++||++|++..+. +++ T Consensus 265 e~~~~~~~~Ia~afgVPp~~lg~~~~----------~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kLl~~~~~~~~~~ 334 (412) T protein:vir:26 265 ASENLTRERVANVFQLPSVFLNARSN----------TNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRY 334 (412) T ss_pred HHHHHHHHHHHHHhCCCHHHhCCCCC----------CCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccCcce Confidence 99999999999999999999996443 368999999999999999999999999999999987653 456 Q ss_pred ecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCcc Q lcl|NC_020081. 422 FNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQF 496 (552) Q Consensus 422 ~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~ 496 (552) |+| ++.|.+++++.++.+ +.+|+||+||+|+++||||+||||+++++.|+++++.....+.. .++++.+.. T Consensus 335 ~~fd~~~l~~~d~~~~~~~~~~~--~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~--~~gG~~n~~ 410 (412) T protein:vir:26 335 FKFNVKSYLRADSATQAEVYFKA--VRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLELRKS--LKGGDKNVN 410 (412) T ss_pred EEeechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcCeeeecccccccccchhhccc--ccCCCCCcC Confidence 665 466888887766644 45799999999999999999999999999999988654332211 111111100 Q ss_pred Cc Q lcl|NC_020081. 497 LA 498 (552) Q Consensus 497 ~~ 498 (552) ++ T Consensus 411 e~ 412 (412) T protein:vir:26 411 ES 412 (412) T ss_pred CC Confidence 00 No 51 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=100.00 E-value=3.4e-74 Score=423.35 Aligned_cols=401 Identities=12% Similarity=0.035 Sum_probs=288.3 Q ss_pred hccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGV 116 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~ 116 (552) |..+.+.+ .....+..++..-+ ...++.. .....|...+.+.+||..++..++ .+ T Consensus 1 m~~f~~~~-~~~~~~~~~~~~~~------~~~~~~~-------~~~~~Al~~~~V~~~i~~Ia~~iA-----------~l 55 (406) T protein:vir:97 1 MSFFQPLG-TSKVSYDDYISSVL------AGDVSQK-------YLGVSALKNSDILTATSIIAGDIA-----------RF 55 (406) T ss_pred CccccccC-CCCCCcchHHHHHh------cCCCCcc-------cccchhhccHHHHHHHHHHHHhhh-----------hC Confidence 44332211 11222222211111 0011110 011123344556788877766654 34 Q ss_pred ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECC-CCCEEEEEEec Q lcl|NC_020081. 117 GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDK-LGDLHNFKAVD 195 (552) Q Consensus 117 ~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~-~G~~~~L~~l~ 195 (552) ++.++.++.. ..+.|++..+|. .+||++||+++||+.++.+++++||+|++|+|+. .|++.+||||+ T Consensus 56 p~~~~~~~g~-------~~~~~~~~~lL~-----~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~ 123 (406) T protein:vir:97 56 PLVKKDVNGD-------IIHDEDINYLLN-----VKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYR 123 (406) T ss_pred eeEEEecCcc-------ccccchHHHHhh-----ccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEEC Confidence 4544433322 122344444442 3789999999999999999999999999999985 68999999999 Q ss_pred CceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 196 ASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNAR 275 (552) Q Consensus 196 p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~ 275 (552) |+.|++..+++|.+. +.+....++....|+++||||++.+ +.++++|+||+.++..+|..+.++++|..+ T Consensus 124 p~~v~v~~~~~~~~~------y~~~~~~~~~~~~~~~~evih~r~~----~~dg~~G~spi~~~~~~i~~~~a~~~~~~~ 193 (406) T protein:vir:97 124 PSETTVEETDNHEIV------YTFTDMLTAKQVKCFAHDVIHWKFF----SHDTILGRSPLLSLGDEIDLQTGGINTLIK 193 (406) T ss_pred CCeeEEEEcCCceEE------EEEEecCCceEEEEccccEEEecCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 999999888776532 1233345566788999999999743 467789999999999999999999999999 Q ss_pred HHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 276 FFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICS 355 (552) Q Consensus 276 ~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 355 (552) +|+||+.|++++..+ ..+++++++++++.|++.++| .|+|+++|+ ++|++|++++++++|+||+|++++++++||+ T Consensus 194 ~f~ng~~~~~i~~~~--~~l~~e~~~~~~~~~~~~~~g-~n~g~~~vl-~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~ 269 (406) T protein:vir:97 194 FFKDGFSSGILTMKG--AQLSGDARQRARQEFEKMREG-SVGGSPLVF-DSTMEYTPLEIDTNVLQLITSNNFSTAQIAK 269 (406) T ss_pred HHhccCCCceEEecC--CCCCHHHHHHHHHHHHHHhcc-cccCceeec-CCCceEEEccCCHHHHHHHHHHHhhHHHHHH Confidence 999999887766643 357999999999999999887 688987655 5799999999999999999999999999999 Q ss_pred HhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeeccc-ccChHHHH Q lcl|NC_020081. 356 IYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNFV-GGDAKTEA 433 (552) Q Consensus 356 ~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f~-~~d~~~~~ 433 (552) +|||||++||.. .++++++++.+.|++.||.||++.||++|+++|++..+ ..++++|+ +.+.+.+. T Consensus 270 afgVPp~~lg~~------------~~~~~~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~~~~~~~ 337 (406) T protein:vir:97 270 ALRVPSYKLGVN------------SPNQSVAQLMEDYVTNDLPFYFDAITSELGLKTLNDKDRRLYHIEFDTRSVTGRNV 337 (406) T ss_pred HhCCCHHHcCCC------------CCcchHHHHHHHHHHHHHHHHHHHHHHHHhhhhcChhhccceeEEEecCccchhhH Confidence 999999999842 23578999999999999999999999999999997654 35677775 44555554 Q ss_pred HHHHHHHHHhcCCcCHHHHHHHhCCCCCCC--CCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCC Q lcl|NC_020081. 434 EIISILESKAKIGLTINDIRKELGYPDTEG--GDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVN 511 (552) Q Consensus 434 ~~~~~~~~~~~g~lT~NE~R~~~gl~p~~g--gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (552) +.+. +.+.+|+||+||+|+++|+||+++ ||++++++|+++++.....+..... ..+....+ T Consensus 338 ~~~~--~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~---------------~~~gg~~~ 400 (406) T protein:vir:97 338 DEIV--KLVNNQILTPNQGLVELGKQKSTDPNMDRYQSSLNYVFLDKKEEYQDKVGI---------------KGKGGEVN 400 (406) T ss_pred HHHH--HHHhCCCcCHHHHHHHhCCCCCCCCCCCeEeeccCccchhccccccccccc---------------ccCCCCCC Confidence 4433 445679999999999999999966 9999999999998765432221110 00000001 Q ss_pred CCCCcc Q lcl|NC_020081. 512 GKDSFN 517 (552) Q Consensus 512 ~~~~~~ 517 (552) ++++.. T Consensus 401 ~~~~~~ 406 (406) T protein:vir:97 401 AEEDKS 406 (406) T ss_pred CCCCCC Confidence 111100 No 52 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=100.00 E-value=4.6e-74 Score=422.62 Aligned_cols=468 Identities=14% Similarity=0.131 Sum_probs=294.5 Q ss_pred cccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFC 106 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~ 106 (552) |+.+-+.=+....+..-++. ..+..+...-.+ .+. .++..+..|++..+.++++++||.++++.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~--~~~------~pp~~~~~La~~~~~n~~v~scI~~ia~~ia--- 66 (540) T protein:vir:41 1 MFNYHLSIKSLEKYRAIKGD---TDSQALKEDRFE--EYV------EPKVHPLVLLSLLQVNPYHASACSIKANDIL--- 66 (540) T ss_pred CCCcccChhhccchhhhhcc---ccccccccCCCC--ccc------cCCCCHHHHHHHHHhcHHHHHHHHHHHHHHh--- Confidence 55544443333322211211 111111100000 111 1223467888888889999999987766654 Q ss_pred HHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCC Q lcl|NC_020081. 107 TPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLG 186 (552) Q Consensus 107 ~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G 186 (552) ++++.+..++. .+.++ .||+.||+++||++++.+++++||+|++++|+..| T Consensus 67 --------~~~~~i~~~~~-------------~~~~~--------lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G 117 (540) T protein:vir:41 67 --------RTGYLIDGDDG-------------GVEEL--------LRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQG 117 (540) T ss_pred --------cCCceEecCcc-------------chhhh--------ccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCC Confidence 45665543221 12222 26788999999999999999999999999999999 Q ss_pred CEEEEEEecCceeEEEECCCccccccccee-EEEE---------EcCCceEEEEcccceeeecccccCCccCCcccccHH Q lcl|NC_020081. 187 DLHNFKAVDASTVYVAVDEDGKERKAKDGV-RYVQ---------VIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPEL 256 (552) Q Consensus 187 ~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~-~y~~---------~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl 256 (552) +|++||||+|.+|+|..+.++......... .|+. ...+.....|+++||||++.+ .+.+++||+||+ T Consensus 118 ~~~~L~~i~~~~V~v~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~---~~~~~~~G~Spi 194 (540) T protein:vir:41 118 EPVRLDYIPAHTVRVHRDGSRYMQTWDGIHVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLP---SPICSYYGVPRY 194 (540) T ss_pred cEEEEEEeCCcceEEeEcCceeEeeecCceeeeeecccccceeeccccccceeecccceEEecCC---CCCCCcccccHH Confidence 999999999999999877655333222111 1111 122334567899999998754 356679999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC----CC----HHHHHHHHHHHHHHhccc-cccccceeec--- Q lcl|NC_020081. 257 EIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQE----QS----NQALTSFRREWTSMFSGI-NGAWKIPVIT--- 324 (552) Q Consensus 257 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~----~s----~~~~~~~~~~~~~~~~G~-~nagk~~il~--- 324 (552) .++..+|..+.++++|+.++|+||++|+|||++++... .+ ...++++++.|...++|. .|+|+++|+. T Consensus 195 ~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~ 274 (540) T protein:vir:41 195 LSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPG 274 (540) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCC Confidence 99999999999999999999999999999999986432 11 123467888888888875 5778876664 Q ss_pred --cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHH Q lcl|NC_020081. 325 --AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLK 402 (552) Q Consensus 325 --~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~ 402 (552) ++|++|++++++++|+||+|++++++++||++|||||++||+.+.++ .+++|++++.+.|+++||.|+++ T Consensus 275 ~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~--------~n~sn~eq~~~~f~~~tL~P~~~ 346 (540) T protein:vir:41 275 GDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGP--------LGGNFAEVARRTYYESVVRPQQE 346 (540) T ss_pred CcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCC--------CCcccHHHHHHHHHHHHHHHHHH Confidence 46899999999999999999999999999999999999999977553 46889999999999999999999 Q ss_pred HHHHHHHhhcCcccccceeecccccChH--HHHHHHHHHHHHhcCCcCHHHHHHHh-CCCCCCCCCeeeccccccchhhh Q lcl|NC_020081. 403 FIEDAVNKYIVSQFGGDYVFNFVGGDAK--TEAEIISILESKAKIGLTINDIRKEL-GYPDTEGGDVTLAGVHVQRLGQI 479 (552) Q Consensus 403 ~ie~~ln~~L~~~~~~~~~~~f~~~d~~--~~~~~~~~~~~~~~g~lT~NE~R~~~-gl~p~~ggD~~~~~~n~~~~~~~ 479 (552) .||++||++|+++.+.+++|+|+..+.. ++++.++ ..+.+|+||+||+|+.+ |++| ++|.++.|.|+...... T Consensus 347 ~ie~~ln~~L~~~~~~~~~i~f~~~~ll~~D~~~~~~--~lv~~G~lT~NE~Re~L~g~e~--gdd~~l~p~n~~~~~~~ 422 (540) T protein:vir:41 347 IVSSVLTDFIQLKLDPGARFVFNEEILMESEFVHNYA--LLVQCGVLTPSEVREKLFGLDG--GPDMFMVPSSIGKSAMK 422 (540) T ss_pred HHHHHHHHhhhhccCCceEEEecchhhcchHHHHHHH--HHHhCCCCCHHHHHHHhCcCcC--CCccccccccccccccc Confidence 9999999999998888888888655442 2222222 34557999999999754 4443 44667788887654433 Q ss_pred ccccccccccCCCCCcc-CcccCCCCCCCCCCCCCCCcccccCCCCccccccccccccc--------cCcc-cc-c---- Q lcl|NC_020081. 480 MQQEQVEYQRQMDANQF-LAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQ--------GGKD-DN-G---- 544 (552) Q Consensus 480 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~-~~-~---- 544 (552) .+....+.......... ...++...+......-.+++++.......+...+...+.|+ |+.+ .+ | T Consensus 423 ~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (540) T protein:vir:41 423 RQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLEDKKKKIDEVLSDFRAEAYENGKKMLSIAGDMGTMSAINRGVSMI 502 (540) T ss_pred ccccccCCCCccccccccchhcccccCccccccccccccccccccccccCCccccchhHHHHHhhhhhhhhhhhcCceec Confidence 22111111000000000 00000000000000000001111111111111111111111 0000 00 0 Q ss_pred -cccccccC Q lcl|NC_020081. 545 -NVVNDWEA 552 (552) Q Consensus 545 -~~~~~~~~ 552 (552) -|+...++ T Consensus 503 ~~~~~~~~~ 511 (540) T protein:vir:41 503 PPKPSNLEA 511 (540) T ss_pred CCCCcchHH Confidence 01111111 No 53 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=100.00 E-value=1.1e-73 Score=420.61 Aligned_cols=402 Identities=14% Similarity=0.132 Sum_probs=288.4 Q ss_pred ccccccccchhhhhccccccccccc--ccccccc-cccc-ccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKP--KAYEEPI-IGSM-SMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRV 99 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~ 99 (552) |..|..+.. ...++...++++... +...... .... ...+.+....... ....+ ...+++.+||..++ T Consensus 1 ~~~~~~~~~-~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~-~~~~~v~~cI~~ia 71 (413) T protein:vir:96 1 MPGVSEIRK-DKNLKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFFKELISD-------GYTKL-SDSPEVRMAVDCIA 71 (413) T ss_pred CCccchhhh-hhcCCccccCCCcchhhhhhccccccccccccchhhHhhhccc-------hhHHH-hhchHHHHHHHHHH Confidence 444443331 111222111111100 0010000 0000 0011111000000 11112 23456788887666 Q ss_pred HHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEE Q lcl|NC_020081. 100 NQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFE 179 (552) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~ 179 (552) +.+ +.++|.+..++.+.+. ...|++..+|. ..||++||+++||+.++.+++++||+|++ T Consensus 72 ~~i-----------a~~~~~~~~~~~~~~~-----~~~~~~~~ll~-----~~PN~~~t~~~f~~~~~~~lll~Gn~~~~ 130 (413) T protein:vir:96 72 DLV-----------SNMTIQLMQNGETGDK-----RIKNDLSRVVD-----IEPNKYLSRKTFIQWLVRSMLLEGNGNAV 130 (413) T ss_pred Hhh-----------ccCceEEEEecCCCcc-----ccccHHHHHHH-----hccccCCCHHHHHHHHHHHHhhcCCeEEE Confidence 554 4567777555443321 12244444332 36889999999999999999999999999 Q ss_pred EEECCCC-CEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHH Q lcl|NC_020081. 180 LVYDKLG-DLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEI 258 (552) Q Consensus 180 i~r~~~G-~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~ 258 (552) ++|+..| .+.+||||+|.+|++..+.+ .++|.+...+. .+.++||||++.++ .+.++++|+||+.+ T Consensus 131 i~r~~~g~~~~~L~~l~~~~v~~~~~~~--------~~~y~~~~~~~---~~~~~evih~k~~~--~~~~~~~G~s~~~~ 197 (413) T protein:vir:96 131 VKPQVSGDKIIGLTPISPYKVTFNVSDD--------DLDYSITFDNK---EYDPSTLLHFVLNP--SIERPFIGTGYKVA 197 (413) T ss_pred EEEcCCCCceEEEEEecCceeEEEEcCC--------eEEEEEeecCc---EEchhhEEEEeccC--CCCCccccccHHHH Confidence 9999877 57899999999999987643 23455555443 57899999998754 34567899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeecc-Cch Q lcl|NC_020081. 259 ALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMT-QSS 337 (552) Q Consensus 259 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~-~~~ 337 (552) +..+|..+.++++++.++|+||++|+|+|++++ .+++++.++++++|++.++|..|+|+++|+..++.++..+. +++ T Consensus 198 ~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~ 275 (413) T protein:vir:96 198 LKDIVGNLKQASVTKKGFMASEYMPNLIVSVDS--DSDELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLTL 275 (413) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC--CCCHHHHHHHHHHHHHHhcCccccCceeeecCCcccccccccCCh Confidence 999999999999999999999999999999876 46899999999999999999999999988877777777764 689 Q ss_pred hHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc Q lcl|NC_020081. 338 KDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG 417 (552) Q Consensus 338 ~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~ 417 (552) +|+||+|++++++++||++|||||++||.. .+.+++...|++.||.||++.||++||++|++. T Consensus 276 ~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~---------------~~~~~~~~~~~~~~l~P~~~~ie~~ln~~ll~~-- 338 (413) T protein:vir:96 276 NDLAINDAVTLDKKTVAGIFGVPAFLLGVG---------------TYNKDEFNNFINTKIMSIAQVIQQTYNKLIVEE-- 338 (413) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHcCCC---------------cchHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC-- Confidence 999999999999999999999999999742 235778889999999999999999999999874 Q ss_pred cceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCC Q lcl|NC_020081. 418 GDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMD 492 (552) Q Consensus 418 ~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~ 492 (552) +++|+| ++.|.+++++.+..+ +.+|+||+||+|+++|+||+||||++++++|+++++.....+..+. T Consensus 339 -~~~~~fd~~~ll~~d~~~~~~~~~~~--~~~G~~t~NE~R~~~g~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~----- 410 (413) T protein:vir:96 339 -DMYFSLNPRSLYNYSLTEMVSAGAQM--TQLNALRRNEFRNWVGMPPDAEMDDLLVLENYLQQKDLVNQKKLIQ----- 410 (413) T ss_pred -CcEEEEechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcceeeecccccchhhcccccCCCC----- Confidence 344444 577888887766543 4579999999999999999999999999999999876544322110 Q ss_pred CCccCcccCCCCCCC Q lcl|NC_020081. 493 ANQFLAQQTGYDGNM 507 (552) Q Consensus 493 ~~~~~~~~~~~~~~~ 507 (552) +++ T Consensus 411 ------------~dt 413 (413) T protein:vir:96 411 ------------DET 413 (413) T ss_pred ------------CCC Confidence 000 No 54 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=100.00 E-value=1.1e-72 Score=415.03 Aligned_cols=341 Identities=13% Similarity=0.135 Sum_probs=270.9 Q ss_pred ccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEE Q lcl|NC_020081. 113 DKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFK 192 (552) Q Consensus 113 ~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~ 192 (552) ++.+++.+..++.. ..|++..+|. .+||++||+++||+.++.+++++||+|++|+|+..|+|++|| T Consensus 1 ia~lp~~~~~~~~~---------~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~ 66 (348) T protein:vir:93 1 MASLPLKMYEDYKV---------VNTEVSDLLT-----VSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLF 66 (348) T ss_pred CcccceEeEecCcC---------cccHHHHHHH-----hCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEE Confidence 67777877543321 2255555553 368899999999999999999999999999999999999999 Q ss_pred EecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 193 AVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVF 272 (552) Q Consensus 193 ~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~ 272 (552) ||+|.+|++..+.++.. .+|.+...++....|+++||||++... +.++++|+||+.++..++..+.+++++ T Consensus 67 ~l~~~~v~~~~~~~~~~------~~y~~~~~~g~~~~~~~~eiih~r~~~---~~~~~~G~s~~~~~~~~i~~~~~~~~~ 137 (348) T protein:vir:93 67 LLNPDVVEMLIENQSRE------LYYSIHAATGNKLIVHNMDMLHFKHIV---ASNMVQGISPIDVLKNTTDFDNAVRTF 137 (348) T ss_pred EEcCCceEEEEeCCCcE------EEEEEEcCCCeEEEEccccEEEecCCC---CCCceeeccHHHHHHHHHHHHHHHHHH Confidence 99999999998877642 456666666777889999999998533 456799999999999999999999988 Q ss_pred HHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHH Q lcl|NC_020081. 273 NARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINV 352 (552) Q Consensus 273 ~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 352 (552) + ++.++..++++++.+ ..++++++++++++|++.+. |+|+++ ++++|++|++++++++|+||+|++++++++ T Consensus 138 ~--~~~~~~~~~~i~~~~--~~l~~e~~~~~~~~~~~~~~---n~~~~~-vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 209 (348) T protein:vir:93 138 N--LTEMQKPDSFMLKYG--SNVSTEKRQQVLEDFKQYYE---ENGGIL-FQEPGVEIEPLPKKYVSEDIVASENLTRER 209 (348) T ss_pred H--HHhcCCCceeEEecC--CCCCHHHHHHHHHHHHHHhh---cCCCee-ecCCCceEEEcCCChhHHHHHHHHHHHHHH Confidence 6 333444455666554 35799999999999999874 567765 456799999999999999999999999999 Q ss_pred HHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc--cceeecc-----c Q lcl|NC_020081. 353 ICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG--GDYVFNF-----V 425 (552) Q Consensus 353 Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~--~~~~~~f-----~ 425 (552) ||++|||||++||.... .+++|++++.+.|++.||.|+++.||++||++|++..+ .+++|+| + T Consensus 210 Ia~~fgVP~~~lg~~~~----------~~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~ 279 (348) T protein:vir:93 210 VANVFQLPSIFLNARSN----------TNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYL 279 (348) T ss_pred HHHHhCCCHHHhCCCCC----------CCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhh Confidence 99999999999986443 36899999999999999999999999999999998754 3455555 4 Q ss_pred ccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCC Q lcl|NC_020081. 426 GGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDG 505 (552) Q Consensus 426 ~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (552) +.|.+++++.+..+ +.+|++|+||+|+++|+||+||||+++++.|+.+++.....+.. .++++.+ T Consensus 280 ~~d~~~~a~~~~~~--~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~--~~gg~~n----------- 344 (348) T protein:vir:93 280 RADSATQAEVYFKA--VRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLELRKS--LKGGDKN----------- 344 (348) T ss_pred ccCHHHHHHHHHHH--HhCCCCCHHHHHHHhCCCCCCCcCeEeecccccccccchhhccc--ccCCCCC----------- Confidence 66778887766543 55799999999999999999999999999999988654332211 0111100 Q ss_pred CCCCCCC Q lcl|NC_020081. 506 NMDNVNG 512 (552) Q Consensus 506 ~~~~~~~ 512 (552) + .++ T Consensus 345 -~--~~~ 348 (348) T protein:vir:93 345 -V--NES 348 (348) T ss_pred -c--CCC Confidence 0 000 No 55 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=100.00 E-value=6.6e-72 Score=410.84 Aligned_cols=398 Identities=14% Similarity=0.073 Sum_probs=287.3 Q ss_pred hccccc--cccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_020081. 37 LKKGKN--TKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDK 114 (552) Q Consensus 37 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~ 114 (552) |+.+.+ ...++......+....+ ++. ..+..+...... -+...+++++||.+++..+ + T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~-~~~----~~~~~~~~~~~~----~~~~~~~v~~~i~~ia~~i-----------a 60 (406) T protein:vir:95 1 MGLFDRWRRTKRKSKIRADTGYVGL-FMS----GEDVSFLVPGYV----RLSDNPEVRMAVHKIADLI-----------S 60 (406) T ss_pred Ccchhhhccccccccccccchhhhh-hcc----CcccCccccCHH----HHhhcHHHHHHHHHHHHhh-----------c Confidence 333211 11111212222111111 111 111121111111 1234567788888766654 4 Q ss_pred ccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCe--eEEEEECCCCCEEEEE Q lcl|NC_020081. 115 GVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKI--NFELVYDKLGDLHNFK 192 (552) Q Consensus 115 ~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna--~~~i~r~~~G~~~~L~ 192 (552) .++|.+.....++... ..+++..+ +..+||+.||+++||+.++.+++++|++ |++++|+..|+|++|| T Consensus 61 ~~~~~~~~~~~~~~~~-----~~~~~~~~-----l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~ 130 (406) T protein:vir:95 61 SMTIYLMQNTEDGDIR-----IRNELSRK-----IDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELV 130 (406) T ss_pred cCceEEEEecCCccee-----ecchHHHH-----HhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEE Confidence 5566665443332211 11222222 2347889999999999999999999765 5567799999999999 Q ss_pred EecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 193 AVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVF 272 (552) Q Consensus 193 ~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~ 272 (552) ||+|.+|++..+.+|. ....++ ..|+++||||+++++ ++.++++|+||+.++..++..+.+++++ T Consensus 131 ~i~~~~v~~~~~~~~~----------~~~~~~---~~~~~~evih~~~~~--~~~~~~~G~s~i~~~~~~i~~~~~~~~~ 195 (406) T protein:vir:95 131 PLTPSKVNFLDTPDGY----------QVLYGG---QTFNYDEVLHFIYNP--DPERPYIGRGYRVVLKDIADNLKQATAT 195 (406) T ss_pred EEcCceeEEEEcCCeE----------EEEecc---EEEchhHEEEeeccC--CCCCCccccCHHHHHHHHHHHHHHHHHH Confidence 9999999998887652 112222 368999999998764 3556789999999999999999999999 Q ss_pred HHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeecc-CchhHHHHHHHHHHHHH Q lcl|NC_020081. 273 NARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMT-QSSKDMEFEKWLNYLIN 351 (552) Q Consensus 273 ~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~-~~~~d~q~~e~~~~~~~ 351 (552) +.++|+||++|+|+|++++ .+++++.++++++|.+.+.|..|+|+++|+..+|.+++++. ++++|+||+|+++++++ T Consensus 196 ~~~~~~ng~~~~~il~~~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~ 273 (406) T protein:vir:95 196 KKSFMSGKYMPSLIVKVDA--ATAELSSEEGRNAVFKKYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKR 273 (406) T ss_pred HHHHHhccCCcceEEEeCC--CCCHHHHHHHHHHHHHHhccccccCCceeecCCCccccccccCChhHHHHHHHHHHHHH Confidence 9999999999999999876 46899999999999999999999999988887778887765 68999999999999999 Q ss_pred HHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecc---cccC Q lcl|NC_020081. 352 VICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNF---VGGD 428 (552) Q Consensus 352 ~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f---~~~d 428 (552) +||++|||||++||.. ++.+++...|++.||.|+++.||++||++|+++.+..+.|.+ ++.| T Consensus 274 ~Ia~~fgVp~~~lg~~---------------~~~~~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~fd~~~l~~~d 338 (406) T protein:vir:95 274 TVAGMFGVPAFLLGIG---------------EFNRDEYNNFINSTILPIAKGIEQELTRKLLISPDLYFKFNPRSLYAYD 338 (406) T ss_pred HHHHHhCCCHHHcCCC---------------CchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcEEEeechhhhcCC Confidence 9999999999999842 245788889999999999999999999999987655445543 5667 Q ss_pred hHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCC Q lcl|NC_020081. 429 AKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMD 508 (552) Q Consensus 429 ~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (552) .+++++.+..+ +.+|+||+||+|+++||||+||||++++++|+++++.+...+.. ++++.+. .+++++ T Consensus 339 ~~~~~~~~~~l--~~~G~~t~NE~R~~~gl~p~~~gd~~~~~~n~~~~~~~~~~~~~---k~g~~~~-------~~~~~~ 406 (406) T protein:vir:95 339 LKELAEVGSNM--YVRGIMEGNEVRDWLGLSPKEGLSELVILENYIPLDKIGDQSKL---KGGDNSG-------ADGQTD 406 (406) T ss_pred HHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcceeeeccCccchhhccccccc---CCCCCCC-------CCCCCC Confidence 88887766654 45799999999999999999999999999999998765432211 1111000 000000 No 56 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=100.00 E-value=4.5e-72 Score=411.74 Aligned_cols=403 Identities=14% Similarity=0.100 Sum_probs=277.4 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCccccc-ccCCCCchHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEA-PSIHGKQNLLQ 79 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 79 (552) ||... +.+|--. +....++-....-+.-.+.++++- ++.....+...+.....+|.. +... ....+...+. T Consensus 1 ~~~~~-~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~g-~~~~~~~~~~~~~t~- 72 (409) T protein:vir:83 1 MGFWS-NLFGIPS---IPDLPNDNGPVDYNPGDPDMVEFR--GPEEEPEARALPWIRPTAWSG-YPESWATPSWGSAQD- 72 (409) T ss_pred Cchhh-hhccccc---CCCcccccccccccCCCCceeecc--CCCcchhhhhccccccccccc-ccccccccCccccch- Confidence 77766 3333211 011111111111111111222111 111111111111111112221 1110 0011111111 Q ss_pred HHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCH Q lcl|NC_020081. 80 MLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNF 159 (552) Q Consensus 80 ~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~ 159 (552) +. +....++.+||.++++.+ +++++.+..... +.+.+.. . +..+||+.||+ T Consensus 73 --~~-~~~~~~v~acV~~Ia~~i-----------A~lpl~~~~~~~----------~~~~~~~---l--l~~~PN~~~t~ 123 (409) T protein:vir:83 73 --KL-RTLIDVAWACIDLNASVL-----------SSMPIYRMRNGR----------IIDSVAW---M--SNPDPEVYTSW 123 (409) T ss_pred --hh-HhhhHHHHHHHHHHHHhh-----------ccCceEEeeCCc----------cccchhh---h--cccCCCCCCCH Confidence 12 223456788887766654 445555542211 1111111 1 23589999999 Q ss_pred HHHHHHHHHHHHhcCCeeEEE-EECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeee Q lcl|NC_020081. 160 RSFVKKLVRDRLTYDKINFEL-VYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWE 238 (552) Q Consensus 160 ~~f~~~~v~d~ll~Gna~~~i-~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~ 238 (552) ++|++.++.++++ ||+|+++ .|+.+|+|++|+||+|++|++..+++|.. .|+. .. .+.++||||+ T Consensus 124 ~~f~~~l~~~lll-Gnay~~~i~r~~~G~~~~L~pl~p~~v~v~~~~~g~~-------~y~~-~~-----~~~~~eiiHi 189 (409) T protein:vir:83 124 QEFAKQLFWDFQL-GEAFVLPMAHGSDGYPIRFRVVPPWLVNVELKKGARR-------EYRI-GG-----LNVTDEILHI 189 (409) T ss_pred HHHHHHHHHHHhh-CCcEEEEEEECCCCcEEEEEEECCcceEEEEcCCceE-------EEEE-cc-----ccCccceEEe Confidence 9999999999887 9999985 58999999999999999999998877643 2222 11 2346899998 Q ss_pred cccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccc Q lcl|NC_020081. 239 VSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAW 318 (552) Q Consensus 239 ~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nag 318 (552) +... +.+++||+||++.++.+|..+.++++|+.++|+||++|+|||++++ .++++++++++++|++.++| |+| T Consensus 190 r~~~---~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga~p~gil~~~~--~ls~e~~~~~~~~~~~~~~~--nag 262 (409) T protein:vir:83 190 RYQG---NTADAHGHGPLESAAPRQVVIGLLQKYVQNLAETGGVPLYWLGVER--RLSETEAVDLMDRWIESRSK--YAG 262 (409) T ss_pred CCCC---CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEeecCC--CCCHHHHHHHHHHHHHhhCC--ccC Confidence 7643 4556899999999999999999999999999999999999999876 56999999999999998766 788 Q ss_pred cceeeccCCcee-eeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHh Q lcl|NC_020081. 319 KIPVITAEDVKF-VNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGL 397 (552) Q Consensus 319 k~~il~~~g~~~-~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l 397 (552) +.++|.+ |+++ ++++++++|+||+|++++++++||++|||||++||+...++ +.+|+|+|++.+.|++.|| T Consensus 263 ~~~il~~-g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~-------~~tysn~eq~~~~f~~~tL 334 (409) T protein:vir:83 263 HPALVTG-GATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPPFLVGLPGATG-------SLTYSNIEQLFSFHDRSSL 334 (409) T ss_pred ccceecC-CcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCcc-------ccccccHHHHHHHHHHHHH Confidence 8777664 6665 67999999999999999999999999999999999866433 3468999999999999999 Q ss_pred hHHHHHHHHHHHhhcCcccccceeec---ccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccc Q lcl|NC_020081. 398 EPLLKFIEDAVNKYIVSQFGGDYVFN---FVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHV 473 (552) Q Consensus 398 ~P~~~~ie~~ln~~L~~~~~~~~~~~---f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~ 473 (552) .||+++||++||++|++. +..++|. ++++|.+++++.++.+ +.+|+||+||+|+++||||++|||.+-.+. + T Consensus 335 ~P~~~~ie~~l~~~Ll~~-~~~~~f~~~~llr~d~~~r~~~~~~~--~~~G~lT~NE~R~~~glpp~~ggd~l~~~g-v 409 (409) T protein:vir:83 335 RPKATAVMAALDRWALPS-PQHLELNRDDYTRPSLVERATAYKIM--IEAGVMEPNEARAMERLHSEAAAVRLSGGG-V 409 (409) T ss_pred HHHHHHHHHHHHHhhCCC-CcEEEeehhhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCcccCCCC-C Confidence 999999999999999975 3344444 3578888888877654 447999999999999999999999873221 1 No 57 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=100.00 E-value=6.9e-72 Score=410.72 Aligned_cols=453 Identities=13% Similarity=0.158 Sum_probs=290.7 Q ss_pred cccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHH Q lcl|NC_020081. 19 DINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITR 98 (552) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~ 98 (552) -.+..++..+.-..++.. .+...++.+ .....+ .+.. ++..+..|.++.+.++++++||.++ T Consensus 1 ~~~~~~~i~s~~~~~~i~----------~~~~~s~~~-~~~~~~-~~~~------pp~~~~~la~l~~~n~~v~scI~~i 62 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIK----------REEVESQAL-GETRFE-EYVE------PKVNPLVLLSLLQVNPYHASACSIK 62 (542) T ss_pred Cccccccccccccchhhh----------hcccccccc-ccccCC-cccc------CCCCHHHHHHHHhhcHHHHHHHHHH Confidence 111222222211111111 111111111 111111 1211 1223567778888889999999887 Q ss_pred HHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeE Q lcl|NC_020081. 99 VNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINF 178 (552) Q Consensus 99 ~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~ 178 (552) ++.++ ++++.+... +.+.+.++ .||+.||+++|++.++.+++++||+|+ T Consensus 63 a~~IA-----------~l~~~~~~~------------~~~~l~~~--------lpN~~~s~~~f~~~~v~~lll~Gnayi 111 (542) T protein:vir:41 63 ANDII-----------RTGYILEGD------------DEGVVDEF--------IRACKPSFEYVLLRALEDLQVFNYCTL 111 (542) T ss_pred HHHHh-----------hCceeeecc------------cchhhhhh--------cCCCCCCHHHHHHHHHHHHhhcCCeEE Confidence 76654 445554211 11222222 267889999999999999999999999 Q ss_pred EEEECCCCCEEEEEEecCceeEEEECCCccccccc-cee-EEEEEc--------CCceEEEEcccceeeecccccCCccC Q lcl|NC_020081. 179 ELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAK-DGV-RYVQVI--------DDKVVAKFKAKEMAWEVSNPRTDLTV 248 (552) Q Consensus 179 ~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~-~~~-~y~~~~--------~~~~~~~~~~~evi~~~~~~~~~~~~ 248 (552) +++|+..|+|.+|+||||++|++..+.++...... ... +|..+. .+.....++++||||++.+ .+.+ T Consensus 112 ~i~rd~~G~~~~L~~l~~~~v~v~~d~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir~~---~~~~ 188 (542) T protein:vir:41 112 EVVRDDRGDPIRFEYIPSHTIRVHKDGSRYRQTWDGVNITHFKDYRYEGEINPETGEDQDSVGANELVFIHIP---SPVC 188 (542) T ss_pred EEEEcCCCcEEEEEEEcCcceEEEEcCCeeEeeecCCcceeEEeecccccccccccccccccCcccEEEecCC---CCCC Confidence 99999999999999999999999887665332211 111 222111 1222345788999998754 3567 Q ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC--------CCCCHHHHHHHHHHHHHHhccc-ccccc Q lcl|NC_020081. 249 GKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTG--------QEQSNQALTSFRREWTSMFSGI-NGAWK 319 (552) Q Consensus 249 g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~--------~~~s~~~~~~~~~~~~~~~~G~-~nagk 319 (552) ++||+|||..++.+|..+.++++|+.++|+||++|+|||++++. ..+++++++++++.|++.+.|. .|+|+ T Consensus 189 ~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk 268 (542) T protein:vir:41 189 SYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHT 268 (542) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCc Confidence 79999999999999999999999999999999999999999753 3568899999999999999886 56777 Q ss_pred ceeec-----cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHH Q lcl|NC_020081. 320 IPVIT-----AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKD 394 (552) Q Consensus 320 ~~il~-----~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~ 394 (552) ++|+. .+|++|+++++++.|++|++++++++++||++|||||++||+.+.++ .+++|+|++.+.|++ T Consensus 269 ~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t--------~n~sn~Eq~~~~f~~ 340 (542) T protein:vir:41 269 PLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGP--------LGGNFAEVTRRTYYE 340 (542) T ss_pred eeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcc--------cccccHHHHHHHHHH Confidence 76653 46899999999999999999999999999999999999999976554 467899999999999 Q ss_pred HHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCC-eeeccccc Q lcl|NC_020081. 395 KGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGD-VTLAGVHV 473 (552) Q Consensus 395 ~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD-~~~~~~n~ 473 (552) +||.|+++.||++||++|+++++.+++|+|+..+.........+...+.+|+||+||+|+.+ +++++|| .++.|.+. T Consensus 341 ~tL~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~~ll~~d~~~~~~~~v~~GilT~NE~Re~L--~g~~pgdd~~l~p~~~ 418 (542) T protein:vir:41 341 SVVRPQQNIISSILTDFFQVKFNPKTRFKFNDETLLESDSVRNCALLVQSGVLTPAEARERL--FGLDGGPDIFMVPSKG 418 (542) T ss_pred HHHHHHHHHHHHHHHhhcccccCCceEEEecchhhcchHHHHHHHHHHhCCCCCHHHHHHhh--CCCCCCCccccccccc Confidence 99999999999999999999988888999876554433222333345567999999999753 3444444 55666665 Q ss_pred cchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccccccccccccCcccc-ccccccccC Q lcl|NC_020081. 474 QRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQGGKDDN-GNVVNDWEA 552 (552) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 552 (552) .......+.... +..+.. +..+.+...+ +.. ..+.++. +...++++.+.+- ||.-+|-.. T Consensus 419 ~~~~~~~~~~n~------~~~~~~-----~~~k~~~k~~-~~~-~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~ 479 (542) T protein:vir:41 419 AAKSVKRQERNY------EKNQIR-----EIRKIYAKYR-PRF-NEIISSK------LSAEEKKKKIDESLAEFRAEAYE 479 (542) T ss_pred cccccccCCcCC------CCCchh-----hhhhcccccC-ccc-ccccccc------ccchhhcccccchhhhhHHhHHh Confidence 432211111000 000000 0000000000 000 0000010 0001111111000 111111111 No 58 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=100.00 E-value=1.3e-71 Score=409.14 Aligned_cols=410 Identities=14% Similarity=0.158 Sum_probs=280.9 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCc----- Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFT----- 155 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~----- 155 (552) ||+++++++++++||.++++.+ +++++.++.+...+. .......++.+.+++..+ .||+ T Consensus 1 l~~l~~~n~~v~~ci~~ia~~i-----------a~~p~~i~~~~~~~~-~~~~~~~~~~~~~~l~~~----~pn~~~~~~ 64 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYV-----------AGFGINIIPHPEAED-PDRDGEQYERVWDFWFGD----DSNWQVGPM 64 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhh-----------hcCCeEEEEccCccc-ccchhhhhhhHHHHhhcc----CCCccccch Confidence 9999999999999998777664 467888876543322 222334555555555433 3444 Q ss_pred ---cCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEE------------ Q lcl|NC_020081. 156 ---RDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQ------------ 220 (552) Q Consensus 156 ---~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~------------ 220 (552) ++|+.+||+.++.+++++||+|++++|+..|+|++||||+|++|++..+..+..........|+. T Consensus 65 ~~~~~t~~~~~~~~~~~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 144 (467) T protein:vir:31 65 ESERATATNVLQTAWTDYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNG 144 (467) T ss_pred hhHhhHHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeeccc Confidence 45778999999999999999999999999999999999999999998776543222211111111 Q ss_pred ----------EcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC Q lcl|NC_020081. 221 ----------VIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIK 290 (552) Q Consensus 221 ----------~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 290 (552) ....+....++++||||++.+ ++.+++||+||+.+++.+|..+.+++.++.++|+||++|+|||.++ T Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~diih~r~~---~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 221 (467) T protein:vir:31 145 DLDPVFVDADDGSTGTSVSNPANELIFKRNH---SPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVK 221 (467) T ss_pred ceeeeeeeeccccccceeEeccccEEEecCC---CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec Confidence 112344567899999999754 3456789999999999999999999999999999999999999987 Q ss_pred CCCCCCHHHHHHHHHHHHHHhc-----------cccccccceeeccCCc-------eeeecc-CchhHHHHHHHHHHHHH Q lcl|NC_020081. 291 TGQEQSNQALTSFRREWTSMFS-----------GINGAWKIPVITAEDV-------KFVNMT-QSSKDMEFEKWLNYLIN 351 (552) Q Consensus 291 ~~~~~s~~~~~~~~~~~~~~~~-----------G~~nagk~~il~~~g~-------~~~~l~-~~~~d~q~~e~~~~~~~ 351 (552) + ..++++++++++++|++.+. |..+++++.++. .|+ ++++++ .+++|+||+++++++++ T Consensus 222 ~-~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~-~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~ 299 (467) T protein:vir:31 222 G-AELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLA-DGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEH 299 (467) T ss_pred C-cCCCHHHHHHHHHHHHhhhcchhhhhhhhhccccccccccccc-CCCcccccceeEEeccccChhhHHHHHHHHHHHH Confidence 5 35799999999999998776 556788865554 454 455554 36789999999999999 Q ss_pred HHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc--cceeecc----- Q lcl|NC_020081. 352 VICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG--GDYVFNF----- 424 (552) Q Consensus 352 ~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~--~~~~~~f----- 424 (552) +||++|||||++||+.+.++ .+++++++.+.|++.||+|+++.||++||++|++... .+++++| T Consensus 300 ~Ia~~fgVpp~~lG~~~~~~---------~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l 370 (467) T protein:vir:31 300 DILKVHDVPPVIAGVVESGA---------FSTDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKP 370 (467) T ss_pred HHHHHhCCCHHHcccCCCCC---------cccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccCCceEEEecchh Confidence 99999999999999876543 2468999999999999999999999999999997543 3344443 Q ss_pred cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCC Q lcl|NC_020081. 425 VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYD 504 (552) Q Consensus 425 ~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (552) ++.|.+++.+.+.. .+.+|++|+||+|+++||||+++++.+ +....... ...+.. +......++ .+ T Consensus 371 ~~~d~~~~~~~~~~--~~~~G~~T~NE~R~~~Gl~pi~d~~~~--~~~~~~~~--~~~~~~-------~~~~~~~~~-~~ 436 (467) T protein:vir:31 371 DTKLQDVEIASQRV--QAMQGLLTVNELRDEFGFEPFPEEHVY--GGETLVAE--VTGGSG-------PGGGIGDQI-EQ 436 (467) T ss_pred hccCHHHHHHHHHH--HHhCCCcCHHHHHHHhCCCCCCccccc--CCcccccc--cccccC-------CCCcccCcC-CC Confidence 45667777665554 355799999999999999999654432 21111100 000000 000000000 00 Q ss_pred CCCCCCCCC-CCcccccCCCCccccccccccccccCcccc Q lcl|NC_020081. 505 GNMDNVNGK-DSFNQNVGKDGQSKQQANTNSTPQGGKDDN 543 (552) Q Consensus 505 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 543 (552) ...++++.. +.-..+..++. =.+.|.+.|- T Consensus 437 ~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~ 467 (467) T protein:vir:31 437 LVEDRADEIIDSYQADLETEQ---------LIEIGANADS 467 (467) T ss_pred CCCCcccchHhhhhhccccch---------hhhhccccCC Confidence 000000000 00001111111 1122222222 No 59 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=100.00 E-value=7.6e-72 Score=410.48 Aligned_cols=479 Identities=14% Similarity=0.158 Sum_probs=314.9 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVS 103 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~ 103 (552) |.-. +...++.+++...+ ..+.--...+....+.....+..++.+. ++..+..|+++++.++++++||.+ T Consensus 1 ~~~~-~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-p~~~~~~L~~~~e~~~~~~~~i~~------ 70 (651) T protein:vir:99 1 MTDT-TGETQETKVHVEGL--GGEADLAKSPNSTQIPDHRIQSHNVGVN-PPYNPDRLAAFLELNETLATGIRK------ 70 (651) T ss_pred CCCc-cceeeeeEEEeecc--cccccccccccccccchhhhcccCCCCC-CCCCHHHHHHHHhcChHHHHHHHH------ Confidence 2222 22334444443211 0000000111222223333344455444 555789999999999999999865 Q ss_pred HHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhc-----CCCCCCCccCCHHHHHHHHHHHHHhcCCeeE Q lcl|NC_020081. 104 MFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKT-----GRIDNDFTRDNFRSFVKKLVRDRLTYDKINF 178 (552) Q Consensus 104 ~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~-----n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~ 178 (552) ++.+++|+||.+..+..... ++....+....+++++.+ ..+...|+.+|+.+|++.++.|++.+||+|+ T Consensus 71 -----~~~~iag~g~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~i 144 (651) T protein:vir:99 71 -----KSRYEVGFGFDLVPAQGVDG-DDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLAL 144 (651) T ss_pred -----HhhhhhccCceeeecccCCC-CccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhh Confidence 44557899999887643222 233344455667776553 2345568889999999999999999999999 Q ss_pred EEEECCCCCEEEEEEecCceeEEEECCCccc--------------------------------ccc--cc---------- Q lcl|NC_020081. 179 ELVYDKLGDLHNFKAVDASTVYVAVDEDGKE--------------------------------RKA--KD---------- 214 (552) Q Consensus 179 ~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~--------------------------------~~~--~~---------- 214 (552) +++++..|+|+.|+++++..+++..+..... +.. .. T Consensus 145 eiIrn~~g~pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~ 224 (651) T protein:vir:99 145 EMLTDIEGRPVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVID 224 (651) T ss_pred hhhhcCccchhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeec Confidence 9999999999999999999887654321100 000 00 Q ss_pred ------------------------ee-EEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHH Q lcl|NC_020081. 215 ------------------------GV-RYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNT 269 (552) Q Consensus 215 ------------------------~~-~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~ 269 (552) .. ..+...+......++++||||++.+ .+.+++||+|||..+..+|.++.++ T Consensus 225 ~~~~~v~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~---~~~~g~~G~spl~~a~~~i~~a~~a 301 (651) T protein:vir:99 225 ESGDEPTIRYREDEESEREPIFVDRETGDVTTGDANGLENRPANELIFIPNP---SILEDDYGVPDWVSAIRTISADEAA 301 (651) T ss_pred cCCcceeEEeccCcceeeeeecccceeeeEEEcCCCceeEecccceEEecCC---CCCCCcccccHHHHHHHHHHHHHHH Confidence 00 0011222334456889999999754 2457799999999999999999999 Q ss_pred HHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecc----------CCceeeeccCch-h Q lcl|NC_020081. 270 EVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITA----------EDVKFVNMTQSS-K 338 (552) Q Consensus 270 ~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~----------~g~~~~~l~~~~-~ 338 (552) ++|+.++|+||++|+|||+++++ .++++++++++++|++.+. |+|+++||.. .|++|+++++++ + T Consensus 302 ~~~~~~~f~NG~~p~gil~~~~~-~ls~e~~~~lr~~~~~~~~---nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~ 377 (651) T protein:vir:99 302 KDYNRDFFDNDTIPRMVIKVTGG-ELSEESKRDLRQMLNGLRE---ESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISE 377 (651) T ss_pred HHHHHHHHhccCCCceEEEecCC-CCCHHHHHHHHHHHHHHhc---cCCceEEeecccccccccccCCceEEEcCcCchh Confidence 99999999999999999999753 5799999999999998653 6889877754 289999999987 5 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc- Q lcl|NC_020081. 339 DMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG- 417 (552) Q Consensus 339 d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~- 417 (552) |+||+|++++++++||++|||||++||+.+.+ +++|++++.+.|+++||+||++.||++||++|++..+ T Consensus 378 D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~----------~~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~ 447 (651) T protein:vir:99 378 EMDFRQFREKNEHEIAKVLEVPPVKIGVTDSA----------NRSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALG 447 (651) T ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHhccCCCC----------CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccc Confidence 99999999999999999999999999987643 5799999999999999999999999999999998653 Q ss_pred ---cceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCC--CCCeeeccccccchhhhcccccccc Q lcl|NC_020081. 418 ---GDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTE--GGDVTLAGVHVQRLGQIMQQEQVEY 487 (552) Q Consensus 418 ---~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~--ggD~~~~~~n~~~~~~~~~~~~~~~ 487 (552) ..++|+| ++.|.+++++.+..+ +.+|+||+||+|+++||||++ +||.++.+.+...++...+ T Consensus 448 ~~~~~i~~ef~~~~llr~D~~~~~e~~~~~--i~~G~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~------ 519 (651) T protein:vir:99 448 VTDWTIEYELRGADQPKQEAQLAEQRVRAM--RLAGVGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAG------ 519 (651) T ss_pred ccCceEEEEeccchhhhccHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcccccccccccccccccccc------ Confidence 2356665 456777777766643 457999999999999999995 4888887766554432211 Q ss_pred ccCCCCCccCcccCCCCCCCCCCCCCCCcc---cccC-CCCccccccccccccc--cCccccccccccc----------- Q lcl|NC_020081. 488 QRQMDANQFLAQQTGYDGNMDNVNGKDSFN---QNVG-KDGQSKQQANTNSTPQ--GGKDDNGNVVNDW----------- 550 (552) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~-~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~----------- 550 (552) +++ ++.... .+..+..++.+.. +... ++.-+.. .-.+|.-. |=+.....-.=.| T Consensus 520 --gge-~~~~~~-----~~~~~~~~~~e~~~~~~~~~~~e~~~~~-~v~ss~~~~~gyd~~~~~l~~~f~~~~~~~~~y~ 590 (651) T protein:vir:99 520 --GGE-TEAVHE-----PPEENKIGEREWDTVKSELTTKDPIEQM-QFSSSNLDEGLYDFGENELYLSFLRDEGQSSLYA 590 (651) T ss_pred --CCC-Cccccc-----Cccccccccchhhhhhhhhcccchhhhh-hHHHHHHHhhcCCCccceEEEEEeecCCCCceee Confidence 110 000000 0000111111000 0001 1111111 00111100 1110001101111 Q ss_pred ---------cC Q lcl|NC_020081. 551 ---------EA 552 (552) Q Consensus 551 ---------~~ 552 (552) ++ T Consensus 591 y~~v~~~~~~~ 601 (651) T protein:vir:99 591 YVDVPASEWSA 601 (651) T ss_pred eeCCCHHHHHH Confidence 11 No 60 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=100.00 E-value=3e-71 Score=407.19 Aligned_cols=396 Identities=14% Similarity=0.115 Sum_probs=281.7 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccC-CcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMN-PDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQV 102 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~ 102 (552) |. ++.+.++|++. .|.. ..++. ..... ...+... ...++.+ +.+++||..+++.+ T Consensus 1 Mg-----------~~~~f~~k~~~-----~~~~-~~~~~~~~~~~--~~~~~~~----~~~~~~~-~~V~~~I~~ia~~i 56 (403) T protein:vir:80 1 MG-----------LFNFFRRKTRS-----EPTN-AISWFLTQEAY--DTLAIPG----YTRLSDN-PEVRMAVHKIAELI 56 (403) T ss_pred Cc-----------ccccccccccc-----cccc-hhhhhcccccc--cccccch----hhhhhhh-HHHHHHHHHHHHhh Confidence 11 11222333221 1111 11111 11000 1111111 1123333 45678887666654 Q ss_pred HHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhc--CCeeEEE Q lcl|NC_020081. 103 SMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTY--DKINFEL 180 (552) Q Consensus 103 ~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~--Gna~~~i 180 (552) +++++.+.....++.. ...+++..+|. .+||+.||+++||+.++.++++. ||||+++ T Consensus 57 -----------A~~p~~~~~~~~~g~~-----~~~~~~~~lL~-----~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~ 115 (403) T protein:vir:80 57 -----------SSMTIHLMQNTDNGDI-----RIKNELSRKID-----INPYSLMTRKAWMYNIVYTMLLDGEGNSVVFP 115 (403) T ss_pred -----------hhCceEEEEecCCcee-----ecCChHHHHHh-----ccCCcCCCHHHHHHHHHHHHhhcCCccEEEEE Confidence 4456665433332211 11233333332 36899999999999999999984 8899999 Q ss_pred EECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHH Q lcl|NC_020081. 181 VYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIAL 260 (552) Q Consensus 181 ~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~ 260 (552) +|+..|+|.+||||+|.+|++..+.+|.. ++|. ...|+++||||+++++ .+.++++|+||+..+. T Consensus 116 ~~~~~g~~~~L~~l~p~~v~~~~~~~g~~------~~y~-------~~~~~~~eiih~~~~~--~~~~~~~G~s~~~~~~ 180 (403) T protein:vir:80 116 KYTTSGLIDELIPLAPSKVSFVDTDTGYQ------IWYQ-------GKAYNYDEVLHFIVNP--DPEKPYMGRGYRVVLK 180 (403) T ss_pred EEcCCCcEEEEEEEcCCeeEEEEcCCceE------EEEe-------ecccchhhEEEEeccC--CCcCccccccHHHHHH Confidence 99999999999999999999998877632 2221 1357899999998754 4556789999999999 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeecc-CchhH Q lcl|NC_020081. 261 NHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMT-QSSKD 339 (552) Q Consensus 261 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~-~~~~d 339 (552) .++....++++++.++|+||++|++||+++.. +++++.++++++|.+.+.|..++|+++++...+.++..+. +++.| T Consensus 181 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~--~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d 258 (403) T protein:vir:80 181 DIVNNLKQATTTKKSFMSGKYMPSLIVKVDAA--TAELSSEEGRNAVFKKYLEASEAGQPWIIPAELLDVEQVKPLSLKD 258 (403) T ss_pred HHHHHHHHHHHHHHHHHhccCCcceEEEeCCC--CChHHHHHHHHHHHHHHhhhhhcCCeeeecccccccceeccCCHHH Confidence 99999999999999999999999999998764 5788889999999999999999999888876666666554 57899 Q ss_pred HHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccc Q lcl|NC_020081. 340 MEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGD 419 (552) Q Consensus 340 ~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~ 419 (552) +||+|.+++++++||++|||||++||..+ ..++....|++.||.|+++.||++|+++|+++.+.. T Consensus 259 ~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~---------------~~~~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~~~ 323 (403) T protein:vir:80 259 LAIHETVELDKRTVAGIFGVPAFLLGVGK---------------YDKDEYNNFINSTILPIAKGIEQELTRKLLISPDLY 323 (403) T ss_pred HHHHHHHHHhHHHHHHHhCCCHHHcCCCC---------------ccHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcE Confidence 99999999999999999999999998532 223455679999999999999999999999875543 Q ss_pred eeec---ccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCcc Q lcl|NC_020081. 420 YVFN---FVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQF 496 (552) Q Consensus 420 ~~~~---f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~ 496 (552) ++|. ++++|.+++++.+..+ +.+|+||+||+|+++||||+||||++++++|+++++...+++.. ++++.+.. T Consensus 324 ~~f~~~~ll~~d~~~~~~~~~~~--~~~Gi~t~NE~R~~~gl~p~~ggd~~~~~~n~~pl~~~~~~~~~---k~ge~~~~ 398 (403) T protein:vir:80 324 FKFNPRSLYAYDLKELAEVGSNM--YVRGLMEGNEVRDWLGLSPKEGLSELVILENYIPLDKIGDQNKL---KGGEKGGA 398 (403) T ss_pred EEeechhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCeEeecccccchhhccchhhc---cCCCCCCC Confidence 4443 3577888888877654 45799999999999999999999999999999998765443221 11111100 Q ss_pred CcccCCCCCCCC Q lcl|NC_020081. 497 LAQQTGYDGNMD 508 (552) Q Consensus 497 ~~~~~~~~~~~~ 508 (552) . +.++ T Consensus 399 ~-------~~~~ 403 (403) T protein:vir:80 399 D-------GQTD 403 (403) T ss_pred C-------CCCC Confidence 0 0000 No 61 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=100.00 E-value=1.4e-70 Score=403.51 Aligned_cols=441 Identities=12% Similarity=0.101 Sum_probs=290.5 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhh--ccccccccccccccccccccccccCCcccccccCCCCchHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAIL--KKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLL 78 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (552) ||+++ |+++..-. ..+...+.... ......-.+..-....|-...+-++ ..++..+..... T Consensus 1 M~~~~-~l~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g----~~~~~~~~~g~~ 63 (466) T protein:vir:81 1 MRLID-RLLSTRGA------------APRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAG----PSTELAPDTFVG 63 (466) T ss_pred CchhH-HHhhccCc------------ccccchhhhhhhhhhhhccccccccccccHHHHHhhcc----ccccccCccccc Confidence 99998 77655421 00110000000 0000000000000000000000000 001111111111 Q ss_pred HHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCC Q lcl|NC_020081. 79 QMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDN 158 (552) Q Consensus 79 ~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t 158 (552) .-...+...+++++||.++++.+ +.+++.+..++.... .+.+.|++..++ .+||+.|| T Consensus 64 -v~~~~a~~~~~v~~~i~~Ia~~i-----------a~lp~~~~~~~~~~~----~~~~~~~~~~L~------~~PN~~~t 121 (466) T protein:vir:81 64 -LATQAYQANGPVFACMLVRQLVF-----------SSVRFRWQRLRDGKP----SDTFGSRDLQIL------ETPWKGGT 121 (466) T ss_pred -cchhhhhccHHHHHHHHHHHHhh-----------ccCceEEEEecCCce----eeccccHHHHHh------hCCCCCCC Confidence 11122334566788887766654 456676654432211 112223333322 36889999 Q ss_pred HHHHHHHHHHHHHhcCCeeEEEEECCC--------CCEEEEEEecCceeEEEECCCcccccccceeEEEEEcC----Cce Q lcl|NC_020081. 159 FRSFVKKLVRDRLTYDKINFELVYDKL--------GDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVID----DKV 226 (552) Q Consensus 159 ~~~f~~~~v~d~ll~Gna~~~i~r~~~--------G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~----~~~ 226 (552) +++||+.++.+++++||||++|+|+.. |.+++|+||+|.+|++..+.++.... .|.+... +.. T Consensus 122 ~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~-----~y~~~~~~~~~~~~ 196 (466) T protein:vir:81 122 TQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWVDVVVEERMVRGGRGELGGGQLGWRKV-----GYLYTEGGRQSGNE 196 (466) T ss_pred HHHHHHHHHHHHHhcCCeEEEEEecCccccccccCcceeEEEEecCcceEEEEcCCCceEE-----EEEEEecCcccccc Confidence 999999999999999999999999765 55899999999999999988875432 2333322 234 Q ss_pred EEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_020081. 227 VAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRRE 306 (552) Q Consensus 227 ~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~ 306 (552) ...|+++||||++.. +++.+++||+||+.++.++|..+.++++++.++|+||++|+|||++++ .+++++++++++. T Consensus 197 ~~~~~~~dviHir~~--~~~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~--~l~~e~~~~~~~~ 272 (466) T protein:vir:81 197 SVGFLAEDVVHFAPI--PDPLASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHNP--MADPAAVKKWADE 272 (466) T ss_pred eeeeccccEEEEcCC--CCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC--CCCHHHHHHHHHH Confidence 568999999999753 245678999999999999999999999999999999999999999875 4689999999999 Q ss_pred HHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHH Q lcl|NC_020081. 307 WTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSA 386 (552) Q Consensus 307 ~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e 386 (552) |++.++|..|+|+++|+ ++|++|++++++++|+||+|++++++++||++|||||++||+.+.. ...+|+|+| T Consensus 273 ~~~~~~g~~n~g~~~vl-~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~-------~~st~sn~e 344 (466) T protein:vir:81 273 VNSKHAGVDNAWKNLNL-YPGADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGL-------AAATYSNYG 344 (466) T ss_pred HHHHhcCccccccceEc-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCC-------CccccccHH Confidence 99999999999997554 6799999999999999999999999999999999999999986532 245689999 Q ss_pred HHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecc-----cccChHHHHHHHHH----HHHHhcCCcCHHHHHHHh Q lcl|NC_020081. 387 EKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNF-----VGGDAKTEAEIISI----LESKAKIGLTINDIRKEL 456 (552) Q Consensus 387 ~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f-----~~~d~~~~~~~~~~----~~~~~~g~lT~NE~R~~~ 456 (552) ++.+.|++.||.||+++||++||++|++..+ ..++|+| +++|.+++++...+ +.....+++|+||+|+ T Consensus 345 q~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~~~~g~t~nE~r~-- 422 (466) T protein:vir:81 345 QARRRLADGTAHPLWQNLSGCIGHVMPDMGPDVRLWYDADDVPFLREDEKDAADIQKVRAETINTLITAGYEPESVVA-- 422 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCcceEEEecchhhhccCHHHHHHHHHHHHHHHHHHHHcCCChhhccc-- Confidence 9999999999999999999999999997543 3456665 57788888876432 2233334469999995 Q ss_pred CCCCCCCCCeee-ccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccc Q lcl|NC_020081. 457 GYPDTEGGDVTL-AGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQN 519 (552) Q Consensus 457 gl~p~~ggD~~~-~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (552) ++++||.++ .+.++.+++.....+... ...+++. .+.++.|.+ T Consensus 423 ---~~~~gd~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~--------~~Gg~~ngn 466 (466) T protein:vir:81 423 ---AVNSGDLRLLKHTGLTSVQLLPPGVSAS---------ASSDTPT--------SGGADDNGN 466 (466) T ss_pred ---cccCCccccccCCCcchhhhcccccccc---------cCCCCcc--------cCCCCcCCC Confidence 566888754 344444433221111100 0000000 000011111 No 62 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=100.00 E-value=1.1e-68 Score=393.24 Aligned_cols=393 Identities=15% Similarity=0.085 Sum_probs=275.6 Q ss_pred hccccccc-cccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_020081. 37 LKKGKNTK-SNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKG 115 (552) Q Consensus 37 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~ 115 (552) |....+.+ +++......+ .|...+.... .+..... ..+...+.+.+||.+++..++. T Consensus 1 M~~f~~~~~~~~~~~~~~~-----~~~~~~~~~~--~~~~v~~----~~al~~~~V~~~v~~ia~~ia~----------- 58 (397) T protein:vir:38 1 MPLLKLNKSHSQGFSLNDP-----DWVNFLTGGE--AQKYVSA----DTALKNSDIFSLIMQLSGDLAM----------- 58 (397) T ss_pred CcchhhhhcccCcccCCch-----hhhhhhcCCc--CCceech----HHhhccHHHHHHHHHHHHHHhh----------- Confidence 44332211 1122121111 1221111110 1111111 1223456678888877766653 Q ss_pred cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEec Q lcl|NC_020081. 116 VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVD 195 (552) Q Consensus 116 ~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~ 195 (552) +++.+ + .+....++ .+||+.||+++||+.++.+++++||||++|+|+..|++++||||+ T Consensus 59 ~p~~~--~-------------~~~~~~l~------~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~ 117 (397) T protein:vir:38 59 VRYTS--E-------------SDRSQSII------SNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLR 117 (397) T ss_pred Ccccc--c-------------ccHHHHHH------hcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEc Confidence 23321 1 11122222 368899999999999999999999999999999999999999999 Q ss_pred CceeEEEECCCcccccccceeEEEEEc---CCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 196 ASTVYVAVDEDGKERKAKDGVRYVQVI---DDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVF 272 (552) Q Consensus 196 p~~v~v~~~~~g~~~~~~~~~~y~~~~---~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~ 272 (552) |++|++..+++|.. .+|.+.. .++....|+++||||++++. ..+.+||+||+.++..+|..+.+++++ T Consensus 118 ~~~v~i~~~~~~~~------~~y~~~~~~~~~~~~~~~~~~eiih~~~~~---~~~~~~G~s~i~~~~~~i~~~~~~~~~ 188 (397) T protein:vir:38 118 PSQVQPMLLQDGSG------LIYNINFDEPAIGYMENVPAADVIHIRLLS---KNGGKTGISPLSALINEQQIKDASNEL 188 (397) T ss_pred CceeEEEEcCCCce------EEEEEEeccccccceeEecCccEEEecCCC---CCCccccccHHHHHHHHHHHHHHHHHH Confidence 99999999888753 2333332 23455789999999998753 334579999999999999999999999 Q ss_pred HHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHH Q lcl|NC_020081. 273 NARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINV 352 (552) Q Consensus 273 ~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 352 (552) +.++|+||++|+|+|++++. +++++.+++++.|+..+++ .|+|+++ ++++|++|++++.++.|+||++++++++++ T Consensus 189 ~~~~f~ng~~~~~il~~~~~--~~~e~~~~~~~~~~~~~~~-~n~~~~~-vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~ 264 (397) T protein:vir:38 189 TLKALKQSVTASAVLTIQKG--GLLDAETRIARSKEISKQI-HNSDGPV-VIDALEDYKPLEVKGNIASLLNQVDWTRDQ 264 (397) T ss_pred HHHHHhccCCccEEEEeCCC--CCHHHHHHHHHHHHHHhcc-cccCCce-ecCCCceEEecCCChhHHHHHHHHHHHHHH Confidence 99999999999999999865 4788899999999887655 7888865 456899999999999999999999999999 Q ss_pred HHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHH Q lcl|NC_020081. 353 ICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTE 432 (552) Q Consensus 353 Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~ 432 (552) ||++|||||++||.... +++|.++ ...|+.+||.|++..||++||++|+++++.++.+. ++.|.+++ T Consensus 265 Ia~afgVp~~~lg~~~~-----------~~~~~e~-~~~~~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~-~~~d~~~~ 331 (397) T protein:vir:38 265 IAKVYGVPDSYLNGQGD-----------QQSSITQ-ISGQYAKSLNRYVQAIVGELNDKLHANISANIRFA-IDAMGDQY 331 (397) T ss_pred HHHHhCCCHHHhCCCCC-----------cccHHHH-HHHHHHHHHHHHHHHHHHHHHHhccChhccccccc-ccCCHHHH Confidence 99999999999986432 2355554 46688899999999999999999999877665544 46788888 Q ss_pred HHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCC Q lcl|NC_020081. 433 AEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNG 512 (552) Q Consensus 433 ~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (552) ++.++.+ +.+|+||+||+|+++|++|++|||.+............. ..++++.+....+ .. T Consensus 332 ~~~~~~~--~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~~~~~~~~------~~~~g~~~~~~~~---------e~-- 392 (397) T protein:vir:38 332 ASTISSS--VKGGTIAGNQARFILQNSGYLAKDLPDPEKEPQQAIQLI------QQEGGENDGNNSD---------ER-- 392 (397) T ss_pred HHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCcccccccccccccccc------ccccCCCCCCCCC---------CC-- Confidence 8776654 457999999999999999999999664433222211110 0111111100000 00 Q ss_pred CCCcccccCCCCc Q lcl|NC_020081. 513 KDSFNQNVGKDGQ 525 (552) Q Consensus 513 ~~~~~~~~~~~~~ 525 (552) +.|++ T Consensus 393 --------~~~~~ 397 (397) T protein:vir:38 393 --------GSDPE 397 (397) T ss_pred --------CCCCC Confidence 00000 No 63 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=100.00 E-value=9.4e-69 Score=393.53 Aligned_cols=395 Identities=13% Similarity=0.107 Sum_probs=273.7 Q ss_pred ccccccccchhhhhcccccccc--ccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKS--NKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQ 101 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~ 101 (552) |. .++|-. .+++ .+-....+|+..+.++.+ .. .-+.+. ...++.+||.++++. T Consensus 1 mg---------~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~------~~--------t~~~~~-~~~~v~~cv~~Ia~~ 55 (403) T protein:vir:10 1 MG---------FKSWIT-EKLNPGQRIIRDMEPVSHRTNRKP------FT--------TGQAYS-KIEILNRTANMVIDS 55 (403) T ss_pred Cc---------chhhhh-hccchhhhhhhcccccccccCCcc------cc--------cHHHHH-HHHHHHHHHHHHHHH Confidence 11 111211 1111 111111122211111111 00 011222 345678888766665 Q ss_pred HHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEE Q lcl|NC_020081. 102 VSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELV 181 (552) Q Consensus 102 ~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~ 181 (552) ++ .+++.+.-+ .....+.+..+.|++..+|. .+||++||+++|++.++.+++++||+|+++. T Consensus 56 ia-----------~~p~~v~~~--~~~~~~~~~~~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~ 117 (403) T protein:vir:10 56 AA-----------ECSYTVGDK--YNIVTYANGVKTKTLDTLLN-----VRPNPFMDISTFRRLVVTDLLFEGCAYIYWD 117 (403) T ss_pred Hh-----------hCceeEeec--ccccccccccccchHHHHHh-----hCCCCCCCHHHHHHHHHHHHhhcCCeEEEEe Confidence 44 445554322 22222222233455544443 3689999999999999999999999998874 Q ss_pred ECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccc-cCCccCCcccccHHHHHH Q lcl|NC_020081. 182 YDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNP-RTDLTVGKYGYPELEIAL 260 (552) Q Consensus 182 r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~-~~~~~~g~~G~spl~~~~ 260 (552) + ..|++|++..|++..+.++.. +++.. .+ ...+.++||+|++.+. ..+..++++|+||+.+++ T Consensus 118 ~------~~l~~l~~~~~~v~~~~~~~~-------~~~~~-~~--~~~~~~~eiih~~~~~~~~~~~~~~~G~s~i~~~~ 181 (403) T protein:vir:10 118 G------TSLYHVPAALMQVEADANKFI-------KKFIF-NN--QINYRVDEIIFIKDNSYVCGTNSQISGQSRVATVI 181 (403) T ss_pred C------ceeEeecCcceEEEEcCCceE-------EEEEe-cC--ceeecccceEEecccccccCCCCCcccccHHHHHH Confidence 3 369999999999987665422 12222 22 2457889999997432 234557899999999999 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccC--chh Q lcl|NC_020081. 261 NHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQ--SSK 338 (552) Q Consensus 261 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~--~~~ 338 (552) .++..+.++++|..++|+||++|+|||++++ .++++++++++++|++.++|..|+|+++|+ ++|++|+++++ ++. T Consensus 182 ~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~--~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl-~~g~~~~~~~~~~~~~ 258 (403) T protein:vir:10 182 DSLEKRSKMLNFKEKFLDNGTVIGLILETDE--ILNKKLRERKQEELQLDYNPSTGQSSVLIL-DGGMKAKPYSQISSFK 258 (403) T ss_pred HHHHHHHHHHHHHHHHHhccCCcceEEEeCC--CCCHHHHHHHHHHHHHHhCCcccCcceeec-CCCceeEEecccCCHH Confidence 9999999999999999999999999999865 569999999999999999999999997655 57999999975 578 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccccc Q lcl|NC_020081. 339 DMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGG 418 (552) Q Consensus 339 d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~ 418 (552) |+||+|++++++++||++|||||++||.. +++|++++.+.|+++||.||++.||++|+++|...+.. T Consensus 259 d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~-------------~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~ 325 (403) T protein:vir:10 259 DLDFKEDIEGFNKSICLAFGVPQVLLDGG-------------NNANIRPNIELFYYMTIIPMLNKLTSSLTFFFGYKITP 325 (403) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHcCCC-------------CCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeee Confidence 99999999999999999999999999742 35789999999999999999999999999998544322 Q ss_pred ce-eecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCC--CCCeeeccccccchhhhccccccccccCCCCCc Q lcl|NC_020081. 419 DY-VFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTE--GGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQ 495 (552) Q Consensus 419 ~~-~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~--ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~ 495 (552) +. .+.+++.|.+++++.++. .+.+|+||+||+|+++|+||++ +||.++.|.|+...... ..+++... T Consensus 326 d~~~~~~l~~D~~~~~~~~~~--~~~~G~lT~NE~R~~~gl~pi~~~~~d~~~~p~n~~~~~~~--------~~~~e~~~ 395 (403) T protein:vir:10 326 NTKEVAALTPDKEAEAKHLTS--LVNNGIITGNEARSELNLEPLDDEQMNKIRIPANVAGSATG--------VSGQEGGR 395 (403) T ss_pred ccchhhhcccCHHHHHHHHHH--HHhCCCcCHHHHHHHhCCCCCCccccccccccccccccccc--------CCCCcCCC Confidence 21 223467788888776664 3457999999999999999994 79999999887642211 11111111 Q ss_pred cCcccCCC Q lcl|NC_020081. 496 FLAQQTGY 503 (552) Q Consensus 496 ~~~~~~~~ 503 (552) +....+|+ T Consensus 396 ~~~~~~g~ 403 (403) T protein:vir:10 396 PKGSTEGD 403 (403) T ss_pred CCCCcCCC Confidence 11111111 No 64 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=100.00 E-value=1.5e-68 Score=392.36 Aligned_cols=387 Identities=14% Similarity=0.070 Sum_probs=273.9 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||+++ ||++...++ .+... .....+... + +..+-..+. T Consensus 1 MGl~~-~~~~~~~~~----------------------------~~~~~-~~~~~~~~~----~------~~~~~~vt~-- 38 (394) T protein:vir:62 1 MGLRD-RFSNYLFKK----------------------------AEKRG-YLDNVLGKS----I------RYSGVYVTD-- 38 (394) T ss_pred Cchhh-hhhhhccCC----------------------------CCchh-hhhhhhhcc----c------ccCccccCh-- Confidence 99998 666443200 00000 000001111 1 011100000 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) .-+...+++++||.++++.++ .+++.++.++.+ +.+.|++..++ .+||+.||++ T Consensus 39 --~~al~~~~v~~~i~~Ia~~iA-----------~lp~~v~~~~g~-------~~~~~~~~~Ll------~~PN~~~t~~ 92 (394) T protein:vir:62 39 --SNILQSSDVYELLQDISNQMV-----------LADIVVEDEFGN-------EIKDDIALQIL------RNPNNYLTQS 92 (394) T ss_pred --hhhhccHHHHHHHHHHHHhhc-----------ccceEEEcCCCc-------ccchhhHHHHh------ccCCCCCCHH Confidence 112345668888887666654 456666543321 12234433332 3678999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecc Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS 240 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~ 240 (552) +||+.++.+++++||+|++|.++..| ++ ..|++..++++.. + .. .+ ...|+++||||++. T Consensus 93 ~f~~~~~~~lll~Gn~~~~i~~~~~~----~~----~~~~~~~~~~~~~--------~-~~-~~--~~~~~~~eiih~r~ 152 (394) T protein:vir:62 93 EFIKLMTNTYLLEGETFPILNGAQIH----LA----SNVFTELDDNLVE--------H-FN-IG--GHEIPPCMIRHVKN 152 (394) T ss_pred HHHHHHHHHHHhcCCeEEEEecceee----cc----ccceEEECCceEE--------E-Ee-eC--CEEechhheEEecC Confidence 99999999999999999999765433 22 3455655554421 1 11 12 25689999999874 Q ss_pred cccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccc Q lcl|NC_020081. 241 NPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKI 320 (552) Q Consensus 241 ~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~ 320 (552) + +.++++|+||+..+..+|..+.++++++.++|+||++|+|+|++++....++++.+++++.|.+.++|..++|++ T Consensus 153 ~----~~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~ 228 (394) T protein:vir:62 153 I----GADHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLESIDEARSV 228 (394) T ss_pred c----CCCCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhccccccCce Confidence 3 346789999999999999999999999999999999999999998877677888899999999999999999998 Q ss_pred eeecc-CCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhH Q lcl|NC_020081. 321 PVITA-EDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEP 399 (552) Q Consensus 321 ~il~~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P 399 (552) +|+.. .++++++++.++.|+||+|++++++++||++|||||++||.. .++|++++.+.|++.||.| T Consensus 229 ~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~-------------~~sn~e~~~~~~~~~~l~P 295 (394) T protein:vir:62 229 KMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTEL-------------IKEDIEKAMMYIHNKAVRP 295 (394) T ss_pred eEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCC-------------CCcCHHHHHHHHHHHHHHH Confidence 77753 456778999999999999999999999999999999999843 3578999999999999999 Q ss_pred HHHHHHHHHHhhcCccc-ccceeecccccC---hHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCC--CCCCeeeccccc Q lcl|NC_020081. 400 LLKFIEDAVNKYIVSQF-GGDYVFNFVGGD---AKTEAEIISILESKAKIGLTINDIRKELGYPDT--EGGDVTLAGVHV 473 (552) Q Consensus 400 ~~~~ie~~ln~~L~~~~-~~~~~~~f~~~d---~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~--~ggD~~~~~~n~ 473 (552) |++.||++|+++|+++. +.+++|+|+..+ ..++++.+. +.+.+|+||+||+|+++||||+ ++||+++++.|+ T Consensus 296 ~~~~ie~~l~~kll~~~~~~~~~~~fd~~~~~~~~~~~~~~~--~~~~~g~~T~NE~R~~~gl~p~~~~~gd~~~~~~n~ 373 (394) T protein:vir:62 296 IMKNFEDHLSLLFYAQNSGKRIKFKINILDFVTYSNKTNIGY--NLVRTAITSPDNVADMLGFPKQNTKESQAIYISNDV 373 (394) T ss_pred HHHHHHHHHhhhhcCccccCceEEEechhhhcCHHHHHHHHH--HHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeccccc Confidence 99999999999998753 356788886443 344444333 3455789999999999999999 789999999998 Q ss_pred cchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccc Q lcl|NC_020081. 474 QRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQN 519 (552) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (552) ++++.....+. ..++++.+ ++ T Consensus 374 ~~~~~~~~~~~--~~kgge~~-----------------------en 394 (394) T protein:vir:62 374 TEIGKKEATDG--SLGGGEEN-----------------------EN 394 (394) T ss_pred ccccccccccc--cCCCCCCC-----------------------CC Confidence 88764322110 00111100 00 No 65 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=100.00 E-value=3.1e-65 Score=374.25 Aligned_cols=371 Identities=13% Similarity=0.069 Sum_probs=257.0 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhcccccccccccccccccccc--ccccCCcccccccCCCCchHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIG--SMSMNPDFKEAPSIHGKQNLL 78 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 78 (552) ||+++ +||+..- ... ...+...+.. ...+...+... .+..... T Consensus 3 m~~~~-~~~~~~~---------~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~---~g~~v~~ 47 (392) T protein:vir:74 3 LPILN-FINQTND---------PPE----------------------AGSVQSYFPDGNDAQIMESLLGD---NNEWVSA 47 (392) T ss_pred chhhh-hhhcccC---------ccc----------------------ccccccccccCchhhhhhhccCC---CCcccch Confidence 66665 3332110 000 0000000000 00000000000 0110000 Q ss_pred HHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCC Q lcl|NC_020081. 79 QMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDN 158 (552) Q Consensus 79 ~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t 158 (552) ..|...+++++||..++..++ ++++.+.-+.. ..+ ..+||+.|| T Consensus 48 ----~~al~~~~v~~~v~~ia~~ia-----------~lp~~~~~~~~---------------~~l------~~~PN~~~t 91 (392) T protein:vir:74 48 ----RAALRNSDLFSIILQLSSDLA-----------IVKINAEKKKN---------------QGI------IDNPSTNAN 91 (392) T ss_pred ----hhhhcchHHHHHHHHHHHhhc-----------cCceeeccchh---------------hhh------hhhcCCCCC Confidence 123345668888887666654 44554431110 112 236889999 Q ss_pred HHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCC---ceEEEEcccce Q lcl|NC_020081. 159 FRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDD---KVVAKFKAKEM 235 (552) Q Consensus 159 ~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~---~~~~~~~~~ev 235 (552) +++||+.++.+++++||+|++++|+..|++++||||+|++|++..+++|... +|.+...+ .....|+++|| T Consensus 92 ~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~------~y~~~~~~~~~~~~~~~~~~ev 165 (392) T protein:vir:74 92 KHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGM------YYNITFDDPKIEPILQAPQSDL 165 (392) T ss_pred HHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceE------EEEEEecCCccceeEEEcCccE Confidence 9999999999999999999999999999999999999999999998877532 34433332 34678999999 Q ss_pred eeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccc Q lcl|NC_020081. 236 AWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGIN 315 (552) Q Consensus 236 i~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~ 315 (552) ||++.+. ....+||+||+.++..+|..+.++++++.++|+||++|+|+|+++++...++++ ++.|.+.+.|.. T Consensus 166 ih~~~~~---~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~----~~~~~~~~~~~~ 238 (392) T protein:vir:74 166 IHMKLLS---IDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKD----KASRSRSFMKRS 238 (392) T ss_pred EEecCCC---CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHH----HHHHHHHHhccc Confidence 9997542 223479999999999999999999999999999999999999998765545433 456667778889 Q ss_pred ccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHH Q lcl|NC_020081. 316 GAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDK 395 (552) Q Consensus 316 nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~ 395 (552) |+|+++|+ ++|++|++++++++|+||+|++++++++||++|||||++||....+ ++.+++.++|+++ T Consensus 239 n~g~~~vl-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~------------~~~~e~~~~~~~~ 305 (392) T protein:vir:74 239 RSGGPVVL-DDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQ------------QSSIQQISGMYAS 305 (392) T ss_pred cCCCeeec-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc------------ccHHHHHHHHHHH Confidence 99997655 5799999999999999999999999999999999999999965322 2445678899999 Q ss_pred HhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHh--------------CCCCC Q lcl|NC_020081. 396 GLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKEL--------------GYPDT 461 (552) Q Consensus 396 ~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~--------------gl~p~ 461 (552) ||.|+++.||++|+++|++.++.++.. +.+.|...+++.+. ..+.+|++|+||+|+++ |+||+ T Consensus 306 ~l~p~~~~ie~~l~~~l~~~~~~~~~~-~~~~d~~~~~~~~~--~l~~~g~~t~near~~~~~~g~~pne~r~~enl~~~ 382 (392) T protein:vir:74 306 ALNRYLRPAISELEYKLSDHISVNMRP-AIDPLGDNYLSTIS--TATRWGALAENQATFVLQEAGYIPKDLPAPENTNKK 382 (392) T ss_pred HHHHHHHHHHHHHHHhccchhcccchh-hhcCCHHHHHHHHH--HHHhCCCcCHHHHHHHHHhCCCCccccchhcCCCCC Confidence 999999999999999999876544332 34567766665444 33456899999999886 33333 Q ss_pred CCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCC Q lcl|NC_020081. 462 EGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNV 510 (552) Q Consensus 462 ~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (552) +|||. +++ .| T Consensus 383 ~~Gd~---------------------------~~p------------~p 392 (392) T protein:vir:74 383 TTGQS---------------------------NEP------------VP 392 (392) T ss_pred CCCCC---------------------------CCC------------CC Confidence 33321 111 11 No 66 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=100.00 E-value=8e-65 Score=371.99 Aligned_cols=379 Identities=13% Similarity=0.046 Sum_probs=270.0 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||+++-....+.-+ +. ...+. ....+.+.+...+ +...+. T Consensus 1 Mg~~~~~~~~k~~~--------------------------------~~--~~~~~-~~~~~~~~~~~~~---~~~v~~-- 40 (383) T protein:vir:10 1 MGLLTPKNFSKRNA--------------------------------KN--MVYPS-NPAFFTTTVGGMQ---LSYVSA-- 40 (383) T ss_pred CCcccccccccccc--------------------------------cc--ccccc-chhhhhhhccCcc---ccccch-- Confidence 88887211111100 00 00000 0000011000000 000000 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) .-+...+.+.+||..+++.++ .+++.+. + +....+| .+||+.||++ T Consensus 41 --~~~l~~~~v~~~i~~ia~~ia-----------~~~~~~~--~-------------~~~~~ll------~~PN~~~t~~ 86 (383) T protein:vir:10 41 --LSALQNTNVYSVINRIASDVS-----------SAHFKTE--N-------------TATLNRL------ESPSSLIGRF 86 (383) T ss_pred --hHhhcchHHHHHHHHHHHhhc-----------cCceeec--c-------------cchhhhh------hCCCCCCCHH Confidence 112234567888887766654 3344332 1 1122233 3688999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecc Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS 240 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~ 240 (552) +||+.++.+++++||+|++++|+ +.+++|+++.+|++..+.++. .+++....++....|+++||||++. T Consensus 87 ~f~~~~~~~l~l~Gn~~~~i~~~----~~~~~p~~~~~v~~~~~~~~~-------~~~~~~~~~~~~~~~~~~evih~r~ 155 (383) T protein:vir:10 87 SFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMGI-------VYTVLESNDRPKMVLRQDQMLHFRL 155 (383) T ss_pred HHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCcceEEEEEcCCce-------EEEEEEcCCceEEEEcccceEEecc Confidence 99999999999999999999875 467899999988887765542 2344555667788999999999874 Q ss_pred cccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccc Q lcl|NC_020081. 241 NPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKI 320 (552) Q Consensus 241 ~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~ 320 (552) . ..+..+++||+||+.++..+|..+.++++++.++|+||++|+|+|++++.. .++++++++++.|++.++| .|+|++ T Consensus 156 ~-~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~-~~~e~~~~~~~~~~~~~~~-~n~~~~ 232 (383) T protein:vir:10 156 M-PDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYL-SDGKDLESAREEFEKANTG-DNSGRL 232 (383) T ss_pred C-CCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCC-CCHHHHHHHHHHHHHHhCc-cccCCc Confidence 3 345667789999999999999999999999999999999999999998653 4788999999999998877 689997 Q ss_pred eeeccCCceeeeccCchhHHHHH-HHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhH Q lcl|NC_020081. 321 PVITAEDVKFVNMTQSSKDMEFE-KWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEP 399 (552) Q Consensus 321 ~il~~~g~~~~~l~~~~~d~q~~-e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P 399 (552) +++ ++|++|++++.++.|+|++ +++++++++||++|||||++||..+.+ +.+++|++++...| .+||.| T Consensus 233 ~vl-~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~--------~~~~sn~eq~~~~~-~~~l~P 302 (383) T protein:vir:10 233 MVL-PDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTST--------ESQHSNIDQIKATY-LANLNS 302 (383) T ss_pred ccc-CCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCC--------CCccccHHHHHHHH-HHHHHH Confidence 655 5799999999999999985 899999999999999999999975432 34678888887655 469999 Q ss_pred HHHHHHHHHHhhcCcccccceeec---ccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccch Q lcl|NC_020081. 400 LLKFIEDAVNKYIVSQFGGDYVFN---FVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRL 476 (552) Q Consensus 400 ~~~~ie~~ln~~L~~~~~~~~~~~---f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~ 476 (552) +++.||++|+++|+.. .++|+ +++.|.+++++.+.. .+.+|+||+||+|+++|++|+++||.+....+..++ T Consensus 303 ~~~~ie~~l~~~l~~~---~~~f~~~~l~~~d~~~~~~~~~~--~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~~~~ 377 (383) T protein:vir:10 303 YVNPIVDELRLKMNAP---DLELDIKDMLDVDDSILINQVSN--LAKSGVLGAEQAQFILTRSGFLPDNLPEFKPLTNET 377 (383) T ss_pred HHHHHHHHHHHhhCCc---eEEeechhhhccCHHHHHHHHHH--HHhCCCcCHHHHHHHhCCCcccCCcccccCCCcccC Confidence 9999999999999854 35554 357788888776654 345799999999999999999999975433221110 Q ss_pred hhhccccccccccCCCCCccCcccCCCCC Q lcl|NC_020081. 477 GQIMQQEQVEYQRQMDANQFLAQQTGYDG 505 (552) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (552) ++++ ++ T Consensus 378 ------------~gGd-----------~e 383 (383) T protein:vir:10 378 ------------KGGD-----------DK 383 (383) T ss_pred ------------CCCC-----------CC Confidence 1111 11 No 67 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=100.00 E-value=5.5e-65 Score=372.87 Aligned_cols=354 Identities=13% Similarity=0.127 Sum_probs=258.2 Q ss_pred hccc--cccccccccccccccccccccCCccccc-ccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_020081. 37 LKKG--KNTKSNKPKAYEEPIIGSMSMNPDFKEA-PSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSD 113 (552) Q Consensus 37 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~ 113 (552) |+-+ .++++. ..+. .|.+..... ....+...+. .-|.....+.+||..+++.++. T Consensus 1 M~~~~~f~~r~~--~~~~-------~~~~~~~~~~~~~~~~~v~~----~~al~~~av~~cv~~ia~~ia~--------- 58 (359) T protein:vir:10 1 MSILNPFERRSS--ITPN-------NYYPFMVQNGSIVPNSLVDA----TEALKNSDLYAVTSLISSDIAG--------- 58 (359) T ss_pred Ccccchhhcccc--CCCC-------cchhhhhccccccCCcccCH----HHhhcchHHHHHHHHHHHhhhc--------- Confidence 3222 111111 0110 011111000 0111111111 1122334568888887776653 Q ss_pred cccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEE Q lcl|NC_020081. 114 KGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKA 193 (552) Q Consensus 114 ~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~ 193 (552) +++. ..+....++ .+||+.||+++||+.++.+++++||+|++|+|+..|+|.+||| T Consensus 59 --~p~~----------------~~~~~~~L~------~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~ 114 (359) T protein:vir:10 59 --TRFI----------------GNQVFTSVL------NNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRL 114 (359) T ss_pred --Cccc----------------cchHHHHHh------hcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEE Confidence 2220 112222222 3688999999999999999999999999999999999999999 Q ss_pred ecCceeEEEECCCcccccccceeEEE-EEcCCceEEEEcccceeeecccc-cCCccCCcccccHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 194 VDASTVYVAVDEDGKERKAKDGVRYV-QVIDDKVVAKFKAKEMAWEVSNP-RTDLTVGKYGYPELEIALNHLQYHDNTEV 271 (552) Q Consensus 194 l~p~~v~v~~~~~g~~~~~~~~~~y~-~~~~~~~~~~~~~~evi~~~~~~-~~~~~~g~~G~spl~~~~~~i~~~~~~~~ 271 (552) |+|++|++..++++ ++|. ....++....|+++||||++... ..++.+|++|+||+.++..+|..+.++++ T Consensus 115 l~~~~v~i~~~~~~--------~~y~~~~~~~~~~~~~~~~evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~ 186 (359) T protein:vir:10 115 IPSNAITIDLTDDT--------LTYEVNQFDDYPSAKYNASEMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANR 186 (359) T ss_pred eCCceEEEEEcCCe--------EEEEEEecCCceEEEEcccceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHH Confidence 99999999776542 2232 23455677889999999998643 34456889999999999999999999999 Q ss_pred HHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHH Q lcl|NC_020081. 272 FNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLIN 351 (552) Q Consensus 272 ~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~ 351 (552) +..++|+||++|+|+|+++.+ .+++++++++++.|++.++ ..|+|+++|+ ++|++|+++++++.|+||+|+++++++ T Consensus 187 ~~~~~f~ng~~~~gil~~~~~-~l~~e~~~~~~~~~~~~~~-~~n~g~~~vl-~~g~~~~~l~~~~~d~q~le~~~~~~~ 263 (359) T protein:vir:10 187 LSLSTLKGALNPTSVVKVPQG-TLSSEAKDSIRKEFEKANG-GNNSGRVMVL-DQSADFSTVSINADVANYLNSMNWGRT 263 (359) T ss_pred HHHHHHhccCCcceEEEeCCC-CCCHHHHHHHHHHHHHHhC-ccccCCceec-CCCcceeeecCCHHHHHHHHHHHHHHH Confidence 999999999999999999754 4699999999999988765 4899997666 579999999999999999999999999 Q ss_pred HHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHH Q lcl|NC_020081. 352 VICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKT 431 (552) Q Consensus 352 ~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~ 431 (552) +||++|||||++||..... ..++++++++...|+..+|.||+..|+.+|++++.... +..++|+ ... T Consensus 264 ~Ia~~fgVPp~~lg~~~~~--------~~~~~~~e~~~~~~l~~~l~p~~~~l~~~l~~~~~~~~--~~~~~~d---~~~ 330 (359) T protein:vir:10 264 QIAKAFGVSDSYLNGTGDQ--------QSSLDQIKDLYVNALNRFIEPLISELRIKCDSSIGVDM--SPITDYS---NSV 330 (359) T ss_pred HHHHHhCCCHHHhCCCCcc--------cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc--hhhhhcC---HHH Confidence 9999999999999865432 34688899999999999999999999988887764332 2333343 222 Q ss_pred HHHHHHHHHHHhcCCcCHHHHHHHhCCCCCC Q lcl|NC_020081. 432 EAEIISILESKAKIGLTINDIRKELGYPDTE 462 (552) Q Consensus 432 ~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ 462 (552) .. ..+.+.+.+|+||+||+|+++|++|+= T Consensus 331 ~~--~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 331 FK--ADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred HH--HHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 21 223345567999999999999999985 No 68 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=100.00 E-value=1.1e-64 Score=371.14 Aligned_cols=382 Identities=14% Similarity=0.077 Sum_probs=267.9 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||+|+. .++..- + .+......+. + ..+.+.. ....+..... T Consensus 1 M~~f~~-~~~~~~--------------------~------------~~~~~~~~~~--~-~~~~~~~-~~~~~~~v~~-- 41 (386) T protein:vir:48 1 MPIFNI-TNLATE--------------------S------------PPISQGGFFD--I-TDPDFLS-TLNGSEWVSA-- 41 (386) T ss_pred Cccccc-cccccc--------------------c------------cccccccccc--c-ccchhcc-cccCCceech-- Confidence 776652 111100 0 0000000000 0 0000000 0011111111 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) ..+...+++.+||.++++.++ ++++.+. +. ....++ .+||+.||++ T Consensus 42 --~~~~~~~~v~~~i~~ia~~ia-----------~~p~~~~--~~-------------~~~~l~------~~pN~~~t~~ 87 (386) T protein:vir:48 42 --ESALRNSDLFSIINQLSNDLA-----------TVKLTAS--RK-------------QLQGII------DNPSNNANRF 87 (386) T ss_pred --hhhhcchHHHHHHHHHHHhhc-----------cCceeec--cc-------------hhHHHh------hcCCCCCCHH Confidence 112245667889887777654 3444442 11 112222 3678999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCC---ceEEEEcccceee Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDD---KVVAKFKAKEMAW 237 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~---~~~~~~~~~evi~ 237 (552) +||+.++.+++++||+|++|+|+..|++++||||+|++|++..+.+|.. .+|.+...+ .....|+++|||| T Consensus 88 ~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~------~~y~~~~~~~~~~~~~~~~~~evih 161 (386) T protein:vir:48 88 NFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDG------IYYNITFDDPRIPPKQHVPQGDVLH 161 (386) T ss_pred HHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCCCce------EEEEEEecCccccceeEecCccEEE Confidence 9999999999999999999999999999999999999999998877643 344444433 3456799999999 Q ss_pred ecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccccc Q lcl|NC_020081. 238 EVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGA 317 (552) Q Consensus 238 ~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~na 317 (552) ++.+. +.+++||+||+..+..+|..+.++++++.++|+||++|+++|++++. +++++.+++++.|... ..|+ T Consensus 162 ~~~~~---~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~--~~~e~~~~~~~~~~~~---~~n~ 233 (386) T protein:vir:48 162 FKLLS---VDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGG--GLLDFKTKLSRSRQAM---KQMQ 233 (386) T ss_pred ecCCC---CCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC--CCHHHHHHHHHHHHHh---hcCC Confidence 98542 33458999999999999999999999999999999999999998764 5788889999988764 4578 Q ss_pred ccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHh Q lcl|NC_020081. 318 WKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGL 397 (552) Q Consensus 318 gk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l 397 (552) |++++ +++|++|++++++++|+||+|++++++++||++|||||++||.. .++++++++.+.|++.|| T Consensus 234 g~~~v-l~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~------------~~~~~~e~~~~~~~~~~l 300 (386) T protein:vir:48 234 GGPLV-LDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQ------------GDQQSSLEMSLDLYNKAV 300 (386) T ss_pred CCcee-cCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCC------------CCcccHHHHHHHHHHHHH Confidence 88655 46799999999999999999999999999999999999999852 236789999999999999 Q ss_pred hHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeec-cccccch Q lcl|NC_020081. 398 EPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLA-GVHVQRL 476 (552) Q Consensus 398 ~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~-~~n~~~~ 476 (552) .|+++.||++|+++|+++++.++...+ +.|...+...++ ..+.+|++|+||+|+.+|++|+++||+... ..+.. T Consensus 301 ~P~~~~ie~~l~~~l~~~~~~~~~~~~-~~d~~~~~~~~~--~l~~~g~~t~nE~r~~lg~~~~~~~~~~~~~~~~~~-- 375 (386) T protein:vir:48 301 SRYLRPFLSELSQKLSCDVDADILPAV-DPTGSNSVSRIN--SMVKSGTLAQNQGLYILQQAEILPKELPEGENPNKT-- 375 (386) T ss_pred HHHHHHHHHHHHHhhcchhhcchhhhh-ccChHHHHHHHH--HHHhCCCcCHHHHHHHhhcCCCCCccchhhcCCCCC-- Confidence 999999999999999987765544333 345444443333 345578999999999999999988775421 11100 Q ss_pred hhhccccccccccCCCCCccCcccCCCCCCCCCCCCCC Q lcl|NC_020081. 477 GQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKD 514 (552) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (552) +.+++++ ++++ T Consensus 376 ----------~~~gGd~-----------------~~~~ 386 (386) T protein:vir:48 376 ----------TLKGGEI-----------------NGED 386 (386) T ss_pred ----------ccCCCCC-----------------CCCC Confidence 0011111 0111 No 69 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=100.00 E-value=1.5e-64 Score=370.47 Aligned_cols=382 Identities=13% Similarity=0.066 Sum_probs=256.9 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhcccccccccccccccccccc--ccccCCcccccccCCCCchHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIG--SMSMNPDFKEAPSIHGKQNLL 78 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 78 (552) ||+++-.+|.++= .....+..++.. ...+.+.+... .+..... T Consensus 3 m~~f~~~~~~~~~--------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~v~~ 47 (392) T protein:vir:39 3 LPILNFINQTNDP--------------------------------PEVGSVQSYFPDGNDAQIMESLLGD---NNEWVSA 47 (392) T ss_pred chhhhhhhccccc--------------------------------ccccccccccccCchhhhhhhhcCC---CCceech Confidence 6665522111100 000000011100 00001111110 0111000 Q ss_pred HHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCC Q lcl|NC_020081. 79 QMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDN 158 (552) Q Consensus 79 ~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t 158 (552) ..+...+++.+||.+++..++ ++++.+. +... .. +..+||+.|| T Consensus 48 ----~~al~~~~v~~~i~~ia~~ia-----------~lp~~~~--~~~~-------------~~------l~~~PN~~~t 91 (392) T protein:vir:39 48 ----RAALRNSDLFSIILQLSSDLA-----------IVKINAE--KKKN-------------QG------IIDNPSTNAN 91 (392) T ss_pred ----HHhhccHHHHHHHHHHHHhhc-----------cCceeec--cchh-------------hh------HhhcCCCCCC Confidence 122234667888887776654 3444432 1110 11 2247889999 Q ss_pred HHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCC---ceEEEEcccce Q lcl|NC_020081. 159 FRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDD---KVVAKFKAKEM 235 (552) Q Consensus 159 ~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~---~~~~~~~~~ev 235 (552) +++||+.++.+++++||+|++++|+..|++++||||+|++|++..+.+|.. .+|.+...+ .....|+++|| T Consensus 92 ~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~------~~y~~~~~~~~~~~~~~~~~~ei 165 (392) T protein:vir:39 92 KHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENG------MYYNITFDDPKIEPILQAPQSDL 165 (392) T ss_pred HHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCce------EEEEEEecCcccceeEEEccccE Confidence 999999999999999999999999999999999999999999999887653 234444332 24578999999 Q ss_pred eeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccc Q lcl|NC_020081. 236 AWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGIN 315 (552) Q Consensus 236 i~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~ 315 (552) ||++++. ....+||+||+.++..+|..+.++++++.++|+||++|+|+|+++++...++++ ++.|.+.+.|.. T Consensus 166 ih~~~~~---~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~----~~~~~~~~~~~~ 238 (392) T protein:vir:39 166 IHMKLLS---IDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKD----KASRSRSFMKRS 238 (392) T ss_pred EEecCCC---CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHH----HHHHHHHHhccc Confidence 9997643 233479999999999999999999999999999999999999998765555443 455666777888 Q ss_pred ccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHH Q lcl|NC_020081. 316 GAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDK 395 (552) Q Consensus 316 nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~ 395 (552) ++|+++|+ ++|++|++++++++|+||++++++++++||++|||||++||....+ ++.+++.++|++. T Consensus 239 ~~g~~~vl-~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~------------~~~~~~~~~f~~~ 305 (392) T protein:vir:39 239 RSGGPVVL-DDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQ------------QSSIQQISGMYAS 305 (392) T ss_pred cCCCeeec-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc------------ccHHHHHHHHHHH Confidence 99997655 5799999999999999999999999999999999999999864322 2446678899999 Q ss_pred HhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHh---CCCCCCCCCeeecccc Q lcl|NC_020081. 396 GLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKEL---GYPDTEGGDVTLAGVH 472 (552) Q Consensus 396 ~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~---gl~p~~ggD~~~~~~n 472 (552) ||.|+++.||++|+++|++.++.++.. +.+.|...+.+.+. ..+..|++|+||+|+++ |+.|.+ ++ ...+ T Consensus 306 ~l~P~~~~ie~~l~~~L~~~~~~d~~~-~~~~d~~~~~~~~~--~l~~~g~~t~nE~r~~l~~~g~~p~e---~r-~~e~ 378 (392) T protein:vir:39 306 ALNRYLRPAISELEYKLSDHISVNMRP-AIDPLGDNYLSTIS--TATRWGALAENQATFVLQEAGYIPKD---LP-APEN 378 (392) T ss_pred HHHHHHHHHHHHHHHhccccccccchh-hhccCHHHHHHHHH--HHHhCCCcCHHHHHHHHHhcCCCccc---cc-hhcC Confidence 999999999999999999876544332 33566666655443 33456899999999877 444321 00 0000 Q ss_pred ccchhhhccccccccccCCCCCccCcccCCCCCCCCCC Q lcl|NC_020081. 473 VQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNV 510 (552) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (552) .++..+++.+++. | T Consensus 379 ------------l~~~~~Gd~~~p~------------p 392 (392) T protein:vir:39 379 ------------TNKKTTGQSNEPV------------P 392 (392) T ss_pred ------------CCCCCCCCCCCCC------------C Confidence 0111111111111 1 No 70 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=100.00 E-value=1.5e-64 Score=370.47 Aligned_cols=382 Identities=13% Similarity=0.066 Sum_probs=256.9 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhcccccccccccccccccccc--ccccCCcccccccCCCCchHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIG--SMSMNPDFKEAPSIHGKQNLL 78 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ 78 (552) ||+++-.+|.++= .....+..++.. ...+.+.+... .+..... T Consensus 3 m~~f~~~~~~~~~--------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~v~~ 47 (392) T protein:vir:10 3 LPILNFINQTNDP--------------------------------PEVGSVQSYFPDGNDAQIMESLLGD---NNEWVSA 47 (392) T ss_pred chhhhhhhccccc--------------------------------ccccccccccccCchhhhhhhhcCC---CCceech Confidence 6665522111100 000000011100 00001111110 0111000 Q ss_pred HHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCC Q lcl|NC_020081. 79 QMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDN 158 (552) Q Consensus 79 ~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t 158 (552) ..+...+++.+||.+++..++ ++++.+. +... .. +..+||+.|| T Consensus 48 ----~~al~~~~v~~~i~~ia~~ia-----------~lp~~~~--~~~~-------------~~------l~~~PN~~~t 91 (392) T protein:vir:10 48 ----RAALRNSDLFSIILQLSSDLA-----------IVKINAE--KKKN-------------QG------IIDNPSTNAN 91 (392) T ss_pred ----HHhhccHHHHHHHHHHHHhhc-----------cCceeec--cchh-------------hh------HhhcCCCCCC Confidence 122234667888887776654 3444432 1110 11 2247889999 Q ss_pred HHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCC---ceEEEEcccce Q lcl|NC_020081. 159 FRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDD---KVVAKFKAKEM 235 (552) Q Consensus 159 ~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~---~~~~~~~~~ev 235 (552) +++||+.++.+++++||+|++++|+..|++++||||+|++|++..+.+|.. .+|.+...+ .....|+++|| T Consensus 92 ~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~------~~y~~~~~~~~~~~~~~~~~~ei 165 (392) T protein:vir:10 92 KHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENG------MYYNITFDDPKIEPILQAPQSDL 165 (392) T ss_pred HHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCce------EEEEEEecCcccceeEEEccccE Confidence 999999999999999999999999999999999999999999999887653 234444332 24578999999 Q ss_pred eeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccc Q lcl|NC_020081. 236 AWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGIN 315 (552) Q Consensus 236 i~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~ 315 (552) ||++++. ....+||+||+.++..+|..+.++++++.++|+||++|+|+|+++++...++++ ++.|.+.+.|.. T Consensus 166 ih~~~~~---~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~----~~~~~~~~~~~~ 238 (392) T protein:vir:10 166 IHMKLLS---IDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKD----KASRSRSFMKRS 238 (392) T ss_pred EEecCCC---CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHH----HHHHHHHHhccc Confidence 9997643 233479999999999999999999999999999999999999998765555443 455666777888 Q ss_pred ccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHH Q lcl|NC_020081. 316 GAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDK 395 (552) Q Consensus 316 nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~ 395 (552) ++|+++|+ ++|++|++++++++|+||++++++++++||++|||||++||....+ ++.+++.++|++. T Consensus 239 ~~g~~~vl-~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~------------~~~~~~~~~f~~~ 305 (392) T protein:vir:10 239 RSGGPVVL-DDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQ------------QSSIQQISGMYAS 305 (392) T ss_pred cCCCeeec-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc------------ccHHHHHHHHHHH Confidence 99997655 5799999999999999999999999999999999999999864322 2446678899999 Q ss_pred HhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHh---CCCCCCCCCeeecccc Q lcl|NC_020081. 396 GLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKEL---GYPDTEGGDVTLAGVH 472 (552) Q Consensus 396 ~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~---gl~p~~ggD~~~~~~n 472 (552) ||.|+++.||++|+++|++.++.++.. +.+.|...+.+.+. ..+..|++|+||+|+++ |+.|.+ ++ ...+ T Consensus 306 ~l~P~~~~ie~~l~~~L~~~~~~d~~~-~~~~d~~~~~~~~~--~l~~~g~~t~nE~r~~l~~~g~~p~e---~r-~~e~ 378 (392) T protein:vir:10 306 ALNRYLRPAISELEYKLSDHISVNMRP-AIDPLGDNYLSTIS--TATRWGALAENQATFVLQEAGYIPKD---LP-APEN 378 (392) T ss_pred HHHHHHHHHHHHHHHhccccccccchh-hhccCHHHHHHHHH--HHHhCCCcCHHHHHHHHHhcCCCccc---cc-hhcC Confidence 999999999999999999876544332 33566666655443 33456899999999877 444321 00 0000 Q ss_pred ccchhhhccccccccccCCCCCccCcccCCCCCCCCCC Q lcl|NC_020081. 473 VQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNV 510 (552) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (552) .++..+++.+++. | T Consensus 379 ------------l~~~~~Gd~~~p~------------p 392 (392) T protein:vir:10 379 ------------TNKKTTGQSNEPV------------P 392 (392) T ss_pred ------------CCCCCCCCCCCCC------------C Confidence 0111111111111 1 No 71 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=100.00 E-value=2.4e-64 Score=369.34 Aligned_cols=379 Identities=12% Similarity=0.041 Sum_probs=266.7 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVS 103 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~ 103 (552) |..+.+.. ..+.+.+.. .... ....+...... ..+..... .-+...+.+++||..++..++ T Consensus 1 Mg~~~~~~--------~~~~~~~~~-~~~~---~~~~~~~~~~~---~~~~~v~~----~~al~~~~v~~~i~~ia~~ia 61 (385) T protein:vir:10 1 MGLLTPRN--------FNKRKAKNM-VYPS---NPAFFTTTVGG---MQLSYVSA----LSALQNTNVYSVINRIASDVA 61 (385) T ss_pred Cccccchh--------ccccccccc-cccc---chhhhhhhccc---cCccccCH----HHhhccHHHHHHHHHHHHHHh Confidence 44332100 001111111 1110 00000100000 01111111 112234567888877666654 Q ss_pred HHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEEC Q lcl|NC_020081. 104 MFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYD 183 (552) Q Consensus 104 ~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~ 183 (552) ++++.+. + |....+| .+||+.||+++||+.++.+++++||+|++++|+ T Consensus 62 -----------~~p~~v~--~-------------~~~~~ll------~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~ 109 (385) T protein:vir:10 62 -----------SAHFKTE--N-------------TATLNRL------ESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ 109 (385) T ss_pred -----------hCceeee--c-------------cchhhhh------hcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC Confidence 3444432 1 1122233 368899999999999999999999999999976 Q ss_pred CCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHH Q lcl|NC_020081. 184 KLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHL 263 (552) Q Consensus 184 ~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i 263 (552) +.+++|+++.+|++..+.++. .+++....++....|+++||||++.+ ..+..++++|+||+.++..+| T Consensus 110 ----~~~~~p~~~~~v~~~~~~~~~-------~~~~~~~~~~~~~~~~~~eiihik~~-~~~~~~~~~G~s~i~~~~~~i 177 (385) T protein:vir:10 110 ----NLEHIPNSDVQINYLPGNMGI-------VYTVLESNDRPQMVLRQDQMLHFRLM-PDPQYRYLIGRSPLESLQNAL 177 (385) T ss_pred ----ceeEeecCCceEEEEEcCCce-------EEEEEEcCCceEEEEccccEEEeccC-CCCcccccccccHHHHHHHHH Confidence 467999999999987765542 23344455667788999999999854 344567789999999999999 Q ss_pred HHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHH Q lcl|NC_020081. 264 QYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFE 343 (552) Q Consensus 264 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~ 343 (552) ..+.++++++.++|+||++|+|+|++++. ..++++++++++.|++.++| .|+|+++++ ++|++|+++++++.|+|++ T Consensus 178 ~~~~~~~~~~~~~~~ng~~~~gil~~~~~-~~~~e~~~~~~~~~~~~~~~-~n~~~~~vl-~~g~~~~~l~~~~~d~~~l 254 (385) T protein:vir:10 178 NLDDKASKSNMSAMENQINPAGKLTISNY-LSDGKDLESAREEFEKANTG-DNSGRLMVL-PDGFDYTQLEMKTDVFKAL 254 (385) T ss_pred HHHHHHHHHHHHHHhccCCcceEEEeCCC-CCCHHHHHHHHHHHHHHhCc-cccCCcccc-CCCceEEecCCChhHHHHH Confidence 99999999999999999999999999864 34788999999999999877 689987655 5799999999999999985 Q ss_pred -HHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceee Q lcl|NC_020081. 344 -KWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVF 422 (552) Q Consensus 344 -e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~ 422 (552) |++++++++||++|||||++||..+.+ ..+++|+|++.. ++.+||.||++.||++|+++|++. .++| T Consensus 255 ~e~~~~~~~~Ia~~fgVp~~~lg~~~~~--------~~~~sn~eq~~~-~~~~~l~P~~~~ie~~l~~~l~~~---~~~f 322 (385) T protein:vir:10 255 ADNSAYSADQISKAFGVPSDILGGGTST--------ESQHSNIDQIKA-TYLANLNSYVNPIVDELRLKMNAP---DLEL 322 (385) T ss_pred HHHHHHHHHHHHHHhCCCHHHcCCccCC--------CcccccHHHHHH-HHHHHHHHHHHHHHHHHHHhhCCc---eEEe Confidence 999999999999999999999975432 346788887655 455799999999999999999864 2444 Q ss_pred c---ccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCC--CeeeccccccchhhhccccccccccCCCCCccC Q lcl|NC_020081. 423 N---FVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGG--DVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFL 497 (552) Q Consensus 423 ~---f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~gg--D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (552) . ++++|.+++++.++.+ +.+|+||+||+|+++|++|+|+| |.+.++.+... +++.++ T Consensus 323 ~~~~ll~~d~~~~~~~~~~~--~~~G~~T~NE~R~~~g~~p~p~~~~~~~~~~~~~~~--------------~g~~~d-- 384 (385) T protein:vir:10 323 DIKDMLDVDDSALINQVSNL--AKSGVLGAEQAQFILTRSGFLPDNLPEFKPLTTQVK--------------GGDEGD-- 384 (385) T ss_pred echhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCccCCCCCccccCcccccC--------------CCCCCC-- Confidence 3 4678888888777653 45799999999999999999754 44444443211 111000 Q ss_pred cccCCCC Q lcl|NC_020081. 498 AQQTGYD 504 (552) Q Consensus 498 ~~~~~~~ 504 (552) | T Consensus 385 ------n 385 (385) T protein:vir:10 385 ------N 385 (385) T ss_pred ------C Confidence 0 No 72 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=100.00 E-value=3.1e-63 Score=363.25 Aligned_cols=371 Identities=12% Similarity=0.066 Sum_probs=260.9 Q ss_pred hccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGV 116 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~ 116 (552) |+.+.+-- ++.++.+... ...+.... ....+...+.+.+||..+++.++ .+ T Consensus 1 Mg~f~~~f-~~~~~~~~~~------~~~~~~~~-----------~~~~a~~~~~v~~~i~~ia~~ia-----------~~ 51 (385) T protein:vir:95 1 MGLFDSVF-KRHSELSWMY------DLEFLQDK-----------SKKAYLKQIALNTVVEMVARTIS-----------QS 51 (385) T ss_pred Cchhhhhh-ccCccccccc------chhhhhcc-----------chhhhhhhHHHHHHHHHHHHHHc-----------cc Confidence 33222211 1111111110 11110000 01123345667888887776654 44 Q ss_pred ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecC Q lcl|NC_020081. 117 GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDA 196 (552) Q Consensus 117 ~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p 196 (552) ++.+..++.. ..|++..+|. .+||+.||+++||+.++.+++++||+|+++.++. +.+..++++.+ T Consensus 52 p~~~~~~~~~---------~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~~-~~~~~~~~~~~ 116 (385) T protein:vir:95 52 EFRVMKNNTK---------EKGTLYYLLN-----VRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDEG-HFFVADDFEKE 116 (385) T ss_pred ceeeeecCcc---------ccchHHHHHh-----cccCcCCCHHHHHHHHHHHHhhcCceEEEEecCC-Ceeeccccccc Confidence 5555433211 2245554442 3789999999999999999999999999887654 44555666655 Q ss_pred ceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 197 STVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARF 276 (552) Q Consensus 197 ~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~ 276 (552) ..+.+... ..+.+..........|+++||||++++. .....+|.||+..+..++..+.++.. T Consensus 117 ~~~~~~~~----------~~~~~~~~~~~~~~~~~~~eiih~~~~~---~~~~~~G~s~~~~~~~~i~~~~~~~~----- 178 (385) T protein:vir:95 117 DELGLYSH----------RFTNVLVNDFEFKRVFTMDDVIYLKYNN---QKLDAFSLGLFEDYGEIFGRMIDLQM----- 178 (385) T ss_pred cccccccc----------cceeeeecccceeeeeccccEEEecCCC---CCcccccchHHHHHHHHHHHHHHHHH----- Confidence 55443221 1112233334455789999999998643 23347899999999999877665432 Q ss_pred HhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccC------chhHHHHHHHHHHHH Q lcl|NC_020081. 277 FAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQ------SSKDMEFEKWLNYLI 350 (552) Q Consensus 277 f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~------~~~d~q~~e~~~~~~ 350 (552) +++.|+|+|++++...+++++.+++++.|++.++|..++++.++++++|++|+++++ ++.|+||+|++++++ T Consensus 179 --~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~ 256 (385) T protein:vir:95 179 --LNNQIRGILKVDATKFYNKEKQKELQAYIDTLFDAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVL 256 (385) T ss_pred --hcCCCceEEEeCCccCCCHHHHHHHHHHHHHHhhhhhhcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHH Confidence 234588999998777789999999999999999998777766677889999999874 678999999999999 Q ss_pred HHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccccc-ceeec-----c Q lcl|NC_020081. 351 NVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGG-DYVFN-----F 424 (552) Q Consensus 351 ~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~-~~~~~-----f 424 (552) ++||++|||||++|+ .+++|++++.+.|++.||+|++..||++||++|+++.+. .++|+ + T Consensus 257 ~~Ia~~fgVpp~~l~--------------~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l 322 (385) T protein:vir:95 257 TDVARMIGVPPSLVL--------------GEMADLEKTIESYLQFCINPLLRKIEAELNSKFFYQDEYLNDDMHIKVVGI 322 (385) T ss_pred HHHHHHhCCCHHHhc--------------CCCcCHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhcccceEEEechhh Confidence 999999999999995 247899999999999999999999999999999986542 23344 3 Q ss_pred cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCC--CCCCeeeccccccchhhhccccccccccCCCCCcc Q lcl|NC_020081. 425 VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDT--EGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQF 496 (552) Q Consensus 425 ~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~--~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~ 496 (552) +++|.+++++.++.+ +.+|+||+||+|+++|+||+ ||||++++++|+++++... +++.+.. T Consensus 323 ~~~D~~~~~~~~~~~--~~~g~lt~NE~R~~~g~~p~~~~~gd~~~~~~n~~~~~~~k---------gge~~~e 385 (385) T protein:vir:95 323 DKRDPLKLSEAIDKL--VASGTFTRNQVRIMTGEEPADDPELDKFIITKNLQSADAFK---------GGESNEE 385 (385) T ss_pred hccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceeccccc---------CCCCCCC Confidence 677888887766643 45799999999999999999 7899999999998875321 1111100 No 73 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=100.00 E-value=1.5e-62 Score=359.48 Aligned_cols=383 Identities=14% Similarity=0.103 Sum_probs=258.5 Q ss_pred hccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGV 116 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~ 116 (552) |+.+.+--++++ .... .+++..+ ...-...+...+++++||..+++.++ .+ T Consensus 1 Mg~f~~lf~~~~-~~~~----~~~~~~~-------------~~v~~~~~~~~~~v~~~i~~Ia~~iA-----------~~ 51 (395) T protein:vir:95 1 MSILEKIFKTRK-DITY----MLDLDMI-------------EDLSQQAYVKRLAIDSCIEFVARAVA-----------QS 51 (395) T ss_pred CchhhhhhccCc-cccc----cccchhc-------------cccchhhhhhhHHHHHHHHHHHHhhc-----------cc Confidence 433322111111 1110 0011000 00011123345678888887776654 34 Q ss_pred ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecC Q lcl|NC_020081. 117 GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDA 196 (552) Q Consensus 117 ~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p 196 (552) ++.+..+. ....+++..+|. .+||+.||+++||+.++.++++.|++|+++.++. .++++++ T Consensus 52 p~~~~~~~---------~~~~~~~~~ll~-----~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~ 112 (395) T protein:vir:95 52 HFKVLEGN---------RIQKNDVYYKLN-----IKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADS 112 (395) T ss_pred eeEeccCC---------ccccchHHHHHH-----hccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCC Confidence 45443221 112244444332 3689999999999999999999999888765542 3667766 Q ss_pred ceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 197 STVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARF 276 (552) Q Consensus 197 ~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~ 276 (552) ..+++....+. ...++..........++++||||++++.. ....||+||+..+..++..+. +. T Consensus 113 ~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~evih~~~~~~---~~~~~G~spi~~~~~~~~~~~-------~~ 175 (395) T protein:vir:95 113 FYREEYALYDD-------IFKDVTVKDYTYQRTFTMQEVIYLKYNNN---KVTHFVESLFEDYGKIFGRMI-------GA 175 (395) T ss_pred ccceeEeecCc-------ceeEEEEcCceeeeeeccccEEEEccCCC---CcccccchHHHHHHHHHHHHH-------HH Confidence 66655433221 12233444444567899999999987643 334789999999988876554 35 Q ss_pred HhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHH-----HHHHHHHHHHH Q lcl|NC_020081. 277 FAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDM-----EFEKWLNYLIN 351 (552) Q Consensus 277 f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~-----q~~e~~~~~~~ 351 (552) |.+|+.|+|+|.+++. .+++++++++++.|++.++|.++.+..++++++|++|+++++++.++ ||+|+++++++ T Consensus 176 ~~~~~~~~gii~~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~ 254 (395) T protein:vir:95 176 QLKNYQIRGILKSASS-AYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIK 254 (395) T ss_pred HHhcCCCceEEEeCCC-CCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHH Confidence 6788899999999764 46899999999999998887643333344477899999999988765 99999999999 Q ss_pred HHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc--cceeecc---cc Q lcl|NC_020081. 352 VICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG--GDYVFNF---VG 426 (552) Q Consensus 352 ~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~--~~~~~~f---~~ 426 (552) +||++|||||++|| .+++|++++.+.|+++||.|++..||++||++|+++.+ .+++|.+ ++ T Consensus 255 ~Ia~~f~VPp~~l~--------------~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~ 320 (395) T protein:vir:95 255 NVALMIGIPPGLIY--------------GETADLEKNTLVFEKFCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNK 320 (395) T ss_pred HHHHHhCCCHHHhc--------------CcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhhcccceecchhhhc Confidence 99999999999996 14688999999999999999999999999999998644 3455554 57 Q ss_pred cChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCC--CeeeccccccchhhhccccccccccCCCCCccCcccCCCC Q lcl|NC_020081. 427 GDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGG--DVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYD 504 (552) Q Consensus 427 ~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~gg--D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (552) .|.+++++.+..+ +.+|+||+||+|+++||||++|| |++++++|+++++.....+..... +... T Consensus 321 ~D~~~~~~~~~~~--~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~-----~~~k------- 386 (395) T protein:vir:95 321 KDPLQYAEAIDKL--VSSGSFTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDE-----NTLK------- 386 (395) T ss_pred cCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccc-----cccC------- Confidence 7888887766643 45799999999999999999876 999999999887644322111100 0000 Q ss_pred CCCCCCCCCCCcccc Q lcl|NC_020081. 505 GNMDNVNGKDSFNQN 519 (552) Q Consensus 505 ~~~~~~~~~~~~~~~ 519 (552) +.+++.+| + T Consensus 387 gg~~~~~g------~ 395 (395) T protein:vir:95 387 GGDEDESG------D 395 (395) T ss_pred CCCCCCCC------C Confidence 00000000 0 No 74 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=100.00 E-value=1.5e-62 Score=359.48 Aligned_cols=383 Identities=14% Similarity=0.103 Sum_probs=258.5 Q ss_pred hccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGV 116 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~ 116 (552) |+.+.+--++++ .... .+++..+ ...-...+...+++++||..+++.++ .+ T Consensus 1 Mg~f~~lf~~~~-~~~~----~~~~~~~-------------~~v~~~~~~~~~~v~~~i~~Ia~~iA-----------~~ 51 (395) T protein:vir:10 1 MSILEKIFKTRK-DITY----MLDLDMI-------------EDLSQQAYVKRLAIDSCIEFVARAVA-----------QS 51 (395) T ss_pred CchhhhhhccCc-cccc----cccchhc-------------cccchhhhhhhHHHHHHHHHHHHhhc-----------cc Confidence 433322111111 1110 0011000 00011123345678888887776654 34 Q ss_pred ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecC Q lcl|NC_020081. 117 GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDA 196 (552) Q Consensus 117 ~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p 196 (552) ++.+..+. ....+++..+|. .+||+.||+++||+.++.++++.|++|+++.++. .++++++ T Consensus 52 p~~~~~~~---------~~~~~~~~~ll~-----~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~ 112 (395) T protein:vir:10 52 HFKVLEGN---------RIQKNDVYYKLN-----IKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADS 112 (395) T ss_pred eeEeccCC---------ccccchHHHHHH-----hccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCC Confidence 45443221 112244444332 3689999999999999999999999888765542 3667766 Q ss_pred ceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 197 STVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARF 276 (552) Q Consensus 197 ~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~ 276 (552) ..+++....+. ...++..........++++||||++++.. ....||+||+..+..++..+. +. T Consensus 113 ~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~evih~~~~~~---~~~~~G~spi~~~~~~~~~~~-------~~ 175 (395) T protein:vir:10 113 FYREEYALYDD-------IFKDVTVKDYTYQRTFTMQEVIYLKYNNN---KVTHFVESLFEDYGKIFGRMI-------GA 175 (395) T ss_pred ccceeEeecCc-------ceeEEEEcCceeeeeeccccEEEEccCCC---CcccccchHHHHHHHHHHHHH-------HH Confidence 66655433221 12233444444567899999999987643 334789999999988876554 35 Q ss_pred HhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHH-----HHHHHHHHHHH Q lcl|NC_020081. 277 FAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDM-----EFEKWLNYLIN 351 (552) Q Consensus 277 f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~-----q~~e~~~~~~~ 351 (552) |.+|+.|+|+|.+++. .+++++++++++.|++.++|.++.+..++++++|++|+++++++.++ ||+|+++++++ T Consensus 176 ~~~~~~~~gii~~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~ 254 (395) T protein:vir:10 176 QLKNYQIRGILKSASS-AYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIK 254 (395) T ss_pred HHhcCCCceEEEeCCC-CCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHH Confidence 6788899999999764 46899999999999998887643333344477899999999988765 99999999999 Q ss_pred HHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc--cceeecc---cc Q lcl|NC_020081. 352 VICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG--GDYVFNF---VG 426 (552) Q Consensus 352 ~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~--~~~~~~f---~~ 426 (552) +||++|||||++|| .+++|++++.+.|+++||.|++..||++||++|+++.+ .+++|.+ ++ T Consensus 255 ~Ia~~f~VPp~~l~--------------~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~ 320 (395) T protein:vir:10 255 NVALMIGIPPGLIY--------------GETADLEKNTLVFEKFCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNK 320 (395) T ss_pred HHHHHhCCCHHHhc--------------CcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhhcccceecchhhhc Confidence 99999999999996 14688999999999999999999999999999998644 3455554 57 Q ss_pred cChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCC--CeeeccccccchhhhccccccccccCCCCCccCcccCCCC Q lcl|NC_020081. 427 GDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGG--DVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYD 504 (552) Q Consensus 427 ~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~gg--D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (552) .|.+++++.+..+ +.+|+||+||+|+++||||++|| |++++++|+++++.....+..... +... T Consensus 321 ~D~~~~~~~~~~~--~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~-----~~~k------- 386 (395) T protein:vir:10 321 KDPLQYAEAIDKL--VSSGSFTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDE-----NTLK------- 386 (395) T ss_pred cCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccc-----cccC------- Confidence 7888887766643 45799999999999999999876 999999999887644322111100 0000 Q ss_pred CCCCCCCCCCCcccc Q lcl|NC_020081. 505 GNMDNVNGKDSFNQN 519 (552) Q Consensus 505 ~~~~~~~~~~~~~~~ 519 (552) +.+++.+| + T Consensus 387 gg~~~~~g------~ 395 (395) T protein:vir:10 387 GGDEDESG------D 395 (395) T ss_pred CCCCCCCC------C Confidence 00000000 0 No 75 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=100.00 E-value=1.5e-62 Score=359.48 Aligned_cols=383 Identities=14% Similarity=0.103 Sum_probs=258.5 Q ss_pred hccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGV 116 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~ 116 (552) |+.+.+--++++ .... .+++..+ ...-...+...+++++||..+++.++ .+ T Consensus 1 Mg~f~~lf~~~~-~~~~----~~~~~~~-------------~~v~~~~~~~~~~v~~~i~~Ia~~iA-----------~~ 51 (395) T protein:vir:10 1 MSILEKIFKTRK-DITY----MLDLDMI-------------EDLSQQAYVKRLAIDSCIEFVARAVA-----------QS 51 (395) T ss_pred CchhhhhhccCc-cccc----cccchhc-------------cccchhhhhhhHHHHHHHHHHHHhhc-----------cc Confidence 433322111111 1110 0011000 00011123345678888887776654 34 Q ss_pred ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecC Q lcl|NC_020081. 117 GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDA 196 (552) Q Consensus 117 ~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p 196 (552) ++.+..+. ....+++..+|. .+||+.||+++||+.++.++++.|++|+++.++. .++++++ T Consensus 52 p~~~~~~~---------~~~~~~~~~ll~-----~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~ 112 (395) T protein:vir:10 52 HFKVLEGN---------RIQKNDVYYKLN-----IKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADS 112 (395) T ss_pred eeEeccCC---------ccccchHHHHHH-----hccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCC Confidence 45443221 112244444332 3689999999999999999999999888765542 3667766 Q ss_pred ceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 197 STVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARF 276 (552) Q Consensus 197 ~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~ 276 (552) ..+++....+. ...++..........++++||||++++.. ....||+||+..+..++..+. +. T Consensus 113 ~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~evih~~~~~~---~~~~~G~spi~~~~~~~~~~~-------~~ 175 (395) T protein:vir:10 113 FYREEYALYDD-------IFKDVTVKDYTYQRTFTMQEVIYLKYNNN---KVTHFVESLFEDYGKIFGRMI-------GA 175 (395) T ss_pred ccceeEeecCc-------ceeEEEEcCceeeeeeccccEEEEccCCC---CcccccchHHHHHHHHHHHHH-------HH Confidence 66655433221 12233444444567899999999987643 334789999999988876554 35 Q ss_pred HhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHH-----HHHHHHHHHHH Q lcl|NC_020081. 277 FAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDM-----EFEKWLNYLIN 351 (552) Q Consensus 277 f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~-----q~~e~~~~~~~ 351 (552) |.+|+.|+|+|.+++. .+++++++++++.|++.++|.++.+..++++++|++|+++++++.++ ||+|+++++++ T Consensus 176 ~~~~~~~~gii~~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~ 254 (395) T protein:vir:10 176 QLKNYQIRGILKSASS-AYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIK 254 (395) T ss_pred HHhcCCCceEEEeCCC-CCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHH Confidence 6788899999999764 46899999999999998887643333344477899999999988765 99999999999 Q ss_pred HHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc--cceeecc---cc Q lcl|NC_020081. 352 VICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG--GDYVFNF---VG 426 (552) Q Consensus 352 ~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~--~~~~~~f---~~ 426 (552) +||++|||||++|| .+++|++++.+.|+++||.|++..||++||++|+++.+ .+++|.+ ++ T Consensus 255 ~Ia~~f~VPp~~l~--------------~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~ 320 (395) T protein:vir:10 255 NVALMIGIPPGLIY--------------GETADLEKNTLVFEKFCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNK 320 (395) T ss_pred HHHHHhCCCHHHhc--------------CcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhhcccceecchhhhc Confidence 99999999999996 14688999999999999999999999999999998644 3455554 57 Q ss_pred cChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCC--CeeeccccccchhhhccccccccccCCCCCccCcccCCCC Q lcl|NC_020081. 427 GDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGG--DVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYD 504 (552) Q Consensus 427 ~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~gg--D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (552) .|.+++++.+..+ +.+|+||+||+|+++||||++|| |++++++|+++++.....+..... +... T Consensus 321 ~D~~~~~~~~~~~--~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~-----~~~k------- 386 (395) T protein:vir:10 321 KDPLQYAEAIDKL--VSSGSFTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDE-----NTLK------- 386 (395) T ss_pred cCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccc-----cccC------- Confidence 7888887766643 45799999999999999999876 999999999887644322111100 0000 Q ss_pred CCCCCCCCCCCcccc Q lcl|NC_020081. 505 GNMDNVNGKDSFNQN 519 (552) Q Consensus 505 ~~~~~~~~~~~~~~~ 519 (552) +.+++.+| + T Consensus 387 gg~~~~~g------~ 395 (395) T protein:vir:10 387 GGDEDESG------D 395 (395) T ss_pred CCCCCCCC------C Confidence 00000000 0 No 76 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=100.00 E-value=8e-62 Score=355.55 Aligned_cols=369 Identities=12% Similarity=0.075 Sum_probs=263.2 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||+|+.++.+.. ... .....+.+ .. .+.+... ...+..... T Consensus 1 Mglf~~~~~~~~--------------------~~~--------------~~~~~~~~-~~-~~~~~~~-~~~~~~v~~-- 41 (384) T protein:vir:49 1 MPIFNITNLATE--------------------SPP--------------SNQDSFFD-IT-DPEFLDA-LNGSEWVSA-- 41 (384) T ss_pred CccccccccCcc--------------------ccc--------------ccchhhcc-cc-chhhccc-ccCCceech-- Confidence 777763222110 000 00000000 00 0111100 001111001 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) +. +...+++.+||..++..++ ++++.+. +... .. +..+||++||++ T Consensus 42 -~~-al~~~~V~~~i~~Ia~~ia-----------~l~~~~~--~~~~-------------~~------l~~~PN~~~t~~ 87 (384) T protein:vir:49 42 -ET-ALKNSDLFSIISQLSNDLA-----------TAKITTS--RKQL-------------QG------IVDNPSNNANRF 87 (384) T ss_pred -hh-hhccHHHHHHHHHHHHHHh-----------hCceeee--cchh-------------hh------hhhccCCCCCHH Confidence 11 2234567888887776654 3444442 1110 11 234688999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCC---ceEEEEcccceee Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDD---KVVAKFKAKEMAW 237 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~---~~~~~~~~~evi~ 237 (552) +|++.++.+++++||+|++++|+..|+|++||||+|++|++..++++.. .+|.....+ +....|+++|||| T Consensus 88 ~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~------~~y~~~~~~~~~~~~~~~~~~eVih 161 (384) T protein:vir:49 88 NFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQNG------LYYNITFDDPRIPPKQHVPQGDILH 161 (384) T ss_pred HHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCCce------EEEEEEecCccccceeEecCccEEE Confidence 9999999999999999999999999999999999999999988776543 234444332 3457899999999 Q ss_pred ecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccccc Q lcl|NC_020081. 238 EVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGA 317 (552) Q Consensus 238 ~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~na 317 (552) ++.+. ..+.++|+||+.++..+|..+.++++++.++|+||++|++||++++.. ++++. ++++.+.+.|..|+ T Consensus 162 ~~~~~---~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~--~~~~~---~~~~~~~~~~~~n~ 233 (384) T protein:vir:49 162 FRLLS---VDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGG--LLDFK---TKQSRSRQAMKQMQ 233 (384) T ss_pred ecCCC---CCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC--ChHHH---HHHHHHHHhcccCC Confidence 98642 334589999999999999999999999999999999999999998754 33333 23455667788899 Q ss_pred ccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHh Q lcl|NC_020081. 318 WKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGL 397 (552) Q Consensus 318 gk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l 397 (552) |++++ +++|++|+++++++.|+||+|++++++++||++|||||++||..... ..+++++++....|++.+| T Consensus 234 ~~~~v-l~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~--------~~~~~~~~~~~~~~i~~~l 304 (384) T protein:vir:49 234 GGPLV-LDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDK--------QSSLEMIYNIYFKAVSRFL 304 (384) T ss_pred cccee-cCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc--------cccHHHHHHHHHHHHHHHH Confidence 99754 46799999999999999999999999999999999999999975332 3467889999999999999 Q ss_pred hHHHHHHHHHHHhhcCccc-----ccceeec-----ccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCe- Q lcl|NC_020081. 398 EPLLKFIEDAVNKYIVSQF-----GGDYVFN-----FVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDV- 466 (552) Q Consensus 398 ~P~~~~ie~~ln~~L~~~~-----~~~~~~~-----f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~- 466 (552) .|++..|+++|+++|.... ...++++ +++.+..++.+.++++.. .|+++ ||+|+.+|++|++|||. T Consensus 305 ~pi~~~i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~--~g~~~-ne~r~~~~~~p~~gGd~~ 381 (384) T protein:vir:49 305 RPFVSELSKKLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQ--AEILP-KDLPEGETDSTLKGGETN 381 (384) T ss_pred HHHHHHHHHHhchhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhh--CCCCC-hhHHHHcCCCCCCCCCCC Confidence 9999999999999875321 1223344 357788888887775433 46665 99999999999999873 Q ss_pred -ee Q lcl|NC_020081. 467 -TL 468 (552) Q Consensus 467 -~~ 468 (552) .+ T Consensus 382 ~~~ 384 (384) T protein:vir:49 382 EQY 384 (384) T ss_pred CCC Confidence 22 No 77 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=100.00 E-value=2.9e-61 Score=352.47 Aligned_cols=374 Identities=13% Similarity=0.046 Sum_probs=255.8 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||+|+..+.. +....... ....+...+. ....+...+. T Consensus 1 Mg~f~~~~~~---------------------------------~~~~~~~~-~~~~~~~~~~------~~~~~~~v~~-- 38 (382) T protein:vir:48 1 MPIFNLATES---------------------------------PPDNQGGF-FDVVDSDFLA------SLKGNEWVSA-- 38 (382) T ss_pred CccccccccC---------------------------------Cccccccc-ccchhhhccc------cccCCcccch-- Confidence 6666521110 00000000 0000110000 0111111111 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) +. +...+++.+||..+++.++ ++++.+..+.. .. +..+||+.||++ T Consensus 39 -~~-~l~~~~v~~~i~~ia~~ia-----------~~~~~~~~~~~---------------~~------L~~~PN~~~t~~ 84 (382) T protein:vir:48 39 -ET-ALRNSDLFSIINQLSNDLA-----------TVKLITSRKKL---------------QG------IVDNPSNNANRF 84 (382) T ss_pred -Hh-hhccHHHHHHHHHHHHhhc-----------cCceeeecchh---------------hh------hhhhcCCCCCHH Confidence 11 1234567888877666654 44554432110 01 224688999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCC---ceEEEEcccceee Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDD---KVVAKFKAKEMAW 237 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~---~~~~~~~~~evi~ 237 (552) +|++.++.+++++||+|++|+|+..|++++||||+|++|+|..+.+|.. ++|.+..++ +....|+++|||| T Consensus 85 ~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~~~------~~y~~~~~~~~~~~~~~~~~~evih 158 (382) T protein:vir:48 85 NFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDG------IYYNITFDDPRIPPKQHVPQNDVLH 158 (382) T ss_pred HHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCCe------EEEEEEecCccccceeEEcCccEEE Confidence 9999999999999999999999999999999999999999998877643 344444433 3457899999999 Q ss_pred ecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccccc Q lcl|NC_020081. 238 EVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGA 317 (552) Q Consensus 238 ~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~na 317 (552) ++.+. ..+.++|+||+.++..+|..+.++++++.++|+||++|+|||++++. +++++.+++++.|.. +..|+ T Consensus 159 ~~~~~---~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~--~~~e~~~~~~~~~~~---~~~n~ 230 (382) T protein:vir:48 159 FRLLS---VDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGG--GLLDFKTKLSRSRQA---MKQMQ 230 (382) T ss_pred ecCCC---CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC--CChHHHHHHHHHHHh---hccCC Confidence 98642 33458999999999999999999999999999999999999999764 578888888888865 44678 Q ss_pred ccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHh Q lcl|NC_020081. 318 WKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGL 397 (552) Q Consensus 318 gk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l 397 (552) |++++ +++|++|++++.++.|+||+|++++++++||++|||||.+||.... +++.+++.+.|++.|| T Consensus 231 g~~~v-l~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~------------~~~~~~~~~~~~~~~l 297 (382) T protein:vir:48 231 GGPLV-LDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGD------------QQSSLEMSSDLYSKAV 297 (382) T ss_pred CCeeE-cCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC------------cccHHHHHHHHHHHHH Confidence 88654 4679999999999999999999999999999999999999986432 2467888899999999 Q ss_pred hHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCC-----CCCCeeecccc Q lcl|NC_020081. 398 EPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDT-----EGGDVTLAGVH 472 (552) Q Consensus 398 ~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~-----~ggD~~~~~~n 472 (552) .|+++.|+++|+++|+++++.+....+. .+..... ..+.....+|++|+||+|+.++..++ ++|+.+. T Consensus 298 ~p~~~~i~~~l~~~l~~~~~~~~~~~~~-~~~~~~~--~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~~~~---- 370 (382) T protein:vir:48 298 SRYLRPFLSELSQKLSCDVDADIFPAVD-PTGSNYI--SRINSLVKTGTLAQNQGLYILQQAEILPKELPNGENPN---- 370 (382) T ss_pred HHHHHHHHHHHHHHhcChhhhhhhhhhc-cchhHHH--HHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhhcCC---- Confidence 9999999999999999887655433332 2222221 12223345689999999998854332 2221110 Q ss_pred ccchhhhccccccccccCCCCCccCcccCCCCCCCCC Q lcl|NC_020081. 473 VQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDN 509 (552) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (552) ++.++++.+ ..+ T Consensus 371 -------------~~~~GGd~~------------~~~ 382 (382) T protein:vir:48 371 -------------STLKGGEED------------GQD 382 (382) T ss_pred -------------CCCCCCCCC------------CCC Confidence 000111100 000 No 78 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=100.00 E-value=2.7e-60 Score=347.13 Aligned_cols=473 Identities=11% Similarity=0.074 Sum_probs=260.7 Q ss_pred CCCCC-C---------CcccccchhhcccccCcc------cccccccchhhhhccccccccccccccccccccccccCCc Q lcl|NC_020081. 1 MGLLD-G---------FFKGRKQQDNIIDINDDM------AVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPD 64 (552) Q Consensus 1 ~~~~~-~---------~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 64 (552) |.-+- | .+|..+--+--+-++-+| .+++..-......+. +.++.-.... -+.+... . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~-d~~~~~~~r~-g~~~~~~-----~ 73 (648) T protein:vir:79 1 MARKVWGRGFWSRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKR-DPKMSLVKRI-GLAIMDG-----G 73 (648) T ss_pred CccchhcchhhhhhhhhccCccccccccccccccccCCCccccCCCCcccccccc-cchhHHHHHh-HHHHHhh-----c Confidence 22110 0 122211111001111111 111111111100000 0000000000 0000000 0 Q ss_pred ccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHH Q lcl|NC_020081. 65 FKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFI 144 (552) Q Consensus 65 ~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l 144 (552) +... ....++..+..|.+....+++++.||.++++.+ ++++|.++.++... ... .....+ T Consensus 74 ~g~~-~~~epp~d~~~l~~l~~~np~V~~aI~iia~~i-----------a~l~~~i~~~~~~~--~~~-----~~~~~l- 133 (648) T protein:vir:79 74 GGGR-DFEEPEFDFNEITSAYNTEGYVRQAVDKYIEMM-----------FKADWDFVSKNPNA--VEY-----IRMRFT- 133 (648) T ss_pred CCcc-ccccCCcCHHHHHHHHhcChHHHHHHHHHHHHH-----------hhCcceEEecCCcc--chh-----hHHHHH- Confidence 0011 112233345667555556777888887766554 45567766554321 111 111111 Q ss_pred HhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCC---------------EEEEEEecCceeEEEECCCccc Q lcl|NC_020081. 145 EKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGD---------------LHNFKAVDASTVYVAVDEDGKE 209 (552) Q Consensus 145 ~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~---------------~~~L~~l~p~~v~v~~~~~g~~ 209 (552) ...||+.+|.++||+.++.+++++||||++|+|+.+|. +.+||||+|.+|++..+++|.. T Consensus 134 -----l~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~ 208 (648) T protein:vir:79 134 -----LMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMI 208 (648) T ss_pred -----hhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcCCCce Confidence 12678899999999999999999999999999998883 5889999999999999888754 Q ss_pred ccccceeEEEE-EcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Q lcl|NC_020081. 210 RKAKDGVRYVQ-VIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLH 288 (552) Q Consensus 210 ~~~~~~~~y~~-~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 288 (552) . .|++ ...+.....|.++||||++... +.+++||+|||.+++.+|..+.++++|..+||.||++|+|||+ T Consensus 209 ~------~Y~y~~~g~~~~~~~~~~dIIHik~~~---~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~ 279 (648) T protein:vir:79 209 K------GWQQEQEGQDKPQKFKPEDIVHIYYKR---EKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVK 279 (648) T ss_pred e------eeEEEecCCceeEEecCccEEEEccCC---CCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEE Confidence 2 2333 3344566789999999998643 4567899999999999999999999999999999999999999 Q ss_pred eCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeecc----CchhHHHHHHHHHHHHHHHHHHhcCCHHHh Q lcl|NC_020081. 289 IKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMT----QSSKDMEFEKWLNYLINVICSIYSIDPSEI 364 (552) Q Consensus 289 ~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~----~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l 364 (552) ++.+.. ..++.+++++.|...+.+.. +.+.++++..++ .+++|+||++++++++++||++|||||++| T Consensus 280 ~~~~~~-~~e~~k~~~e~~~~~~~~~~-------i~gg~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lL 351 (648) T protein:vir:79 280 VGLEQE-GFGAEEGEVDLVRGEVENMD-------VEGGMVTTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSELMM 351 (648) T ss_pred eCCCcc-chHHHHHHHHHHHHhccccc-------ccccccccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHc Confidence 875433 34556677777777665532 222333333332 356899999999999999999999999999 Q ss_pred cccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHh----hcCccc------c--cceeeccc---ccCh Q lcl|NC_020081. 365 NFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNK----YIVSQF------G--GDYVFNFV---GGDA 429 (552) Q Consensus 365 g~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~----~L~~~~------~--~~~~~~f~---~~d~ 429 (552) |+...+ ++++.+++.. ++..++.|++..|+..++. .++.+. . ..++|.|. +.|. T Consensus 352 G~~~~s----------s~stae~~~~-~~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~ 420 (648) T protein:vir:79 352 GRGGTA----------SRSTGDNLSS-DFKDRIKALQKVMATFINEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSK 420 (648) T ss_pred ccCCCc----------cchHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccceEEEeecccchhhH Confidence 986533 4566666555 4566777777666555443 332221 1 12455553 4555 Q ss_pred HHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCee-eccccccchhhhccccccccccCCCCCccCcccCCCCCCCC Q lcl|NC_020081. 430 KTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVT-LAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMD 508 (552) Q Consensus 430 ~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~-~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (552) +++++.+. ..+.+|+||+||+|+++||||+|+|+.. ....+..+...... +...+ +.++ +...... T Consensus 421 ~~~a~~~~--~l~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~----~~~~~--~~~~-----~~~~~~a 487 (648) T protein:vir:79 421 IKLENQAV--FLYEHNAISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQATA----LAALA--PTPA-----GGSSASA 487 (648) T ss_pred HHHHHHHH--HHHhCCCcCHHHHHHHhCCCCCCCCCCccccccccccchhccc----cccCC--CCCC-----CCCCCCc Confidence 55555443 3455799999999999999999987643 22233222211110 00000 0000 0000000 Q ss_pred CCCCCCCcccccCCCCcccccccccccc-ccCccccccccc------cccC Q lcl|NC_020081. 509 NVNGKDSFNQNVGKDGQSKQQANTNSTP-QGGKDDNGNVVN------DWEA 552 (552) Q Consensus 509 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~------~~~~ 552 (552) ...++.... +.+++. ++.+.++ +..+..++-.+. +|.+ T Consensus 488 ~~eg~~~e~-----~~~~~~-~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (648) T protein:vir:79 488 SGDKKKKAT-----DNKTKP-TNQHGTKTSPKKQTNGRHVRYMQEMLLEYT 532 (648) T ss_pred ccccccccc-----CCCCCC-CCCCCcCCCCccccchhhhhhhhhhhhcch Confidence 000110000 000000 0111111 122233333232 2222 No 79 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=100.00 E-value=6.6e-61 Score=350.52 Aligned_cols=366 Identities=14% Similarity=0.065 Sum_probs=245.1 Q ss_pred hccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGV 116 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~ 116 (552) |+.+.+--.++. .. ... .++. .... .. . ..+...+.+.+||..++..++ .+ T Consensus 1 Mg~f~~l~~~~~-~~-~~~---~~~~--~~~~--~~--~-------~~~l~~~~v~~~i~~Ia~~ia-----------~~ 51 (376) T protein:vir:78 1 MGFFSELFKRNK-EI-EWM---WDLD--FLED--KT--T-------KVYLKKMALNTCVKHIARTIA-----------KS 51 (376) T ss_pred CchhhhhhccCC-cc-ccc---cchh--hccc--cc--h-------hhhhhhHHHHHHHHHHHHhhc-----------cc Confidence 332211101111 00 000 0000 0000 00 0 112234557788776665543 44 Q ss_pred ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecC Q lcl|NC_020081. 117 GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDA 196 (552) Q Consensus 117 ~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p 196 (552) ++.+..+ . ....|++..+|. .+||+.||+++||+.++.+++++||+|+++.|+..|.+.+++|+.+ T Consensus 52 p~~~~~~--~-------~~~~~~l~~ll~-----~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~ 117 (376) T protein:vir:78 52 DFRLKNG--E-------TSVRDKLYYKLN-----IRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKE 117 (376) T ss_pred ceeeccc--c-------ccccchHHHHHh-----hccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecc Confidence 5544321 1 112244444432 4789999999999999999999999999999999999999999998 Q ss_pred ceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 197 STVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARF 276 (552) Q Consensus 197 ~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~ 276 (552) ..+..... ..+...+......|+++||||++++.... ..++.+++..+...+. .....+ T Consensus 118 ~~~~~~~~------------~~~~~~~~~~~~~~~~~evih~~~~~~~~---~~~~~~~~~~~~~~~~------~~~~~~ 176 (376) T protein:vir:78 118 FAFFPDVF------------EGVTVKDYRYNRNFSMDDVIFLEYGNERL---SAFTDGMFEDYGELFG------KMIRAQ 176 (376) T ss_pred cceeeeee------------eeeeeecceeeeeeccccEEEeccCCCCc---hhhhhHHHHHHHHHHH------HHHHHH Confidence 77653221 11111222234578999999998654211 1222223233322222 222233 Q ss_pred HhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHH-----HHHHHHHHHHH Q lcl|NC_020081. 277 FAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDM-----EFEKWLNYLIN 351 (552) Q Consensus 277 f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~-----q~~e~~~~~~~ 351 (552) +.+++ +++++.+.....+++++.+++++.|++.++|..+.++.++++++|++|+++++++.|+ ||+|+++++++ T Consensus 177 ~~~~~-~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~ 255 (376) T protein:vir:78 177 MRNFQ-IRGAVNFKMAGVADKDKQTKLQEYIDKVYASFNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMID 255 (376) T ss_pred HhcCC-CceeEEEccCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHH Confidence 33333 3344444444567999999999999999999766655556677899999999888664 99999999999 Q ss_pred HHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeec---ccccC Q lcl|NC_020081. 352 VICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFN---FVGGD 428 (552) Q Consensus 352 ~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~---f~~~d 428 (552) +||++|||||++||. +++|++++.+.|++.||.|+++.||++||++|+++.+....++ +++.| T Consensus 256 ~Ia~~fgVPp~~l~~--------------~~s~~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~~~~~~~~~~ll~~d 321 (376) T protein:vir:78 256 YVASILGIPSSLLHG--------------DMADLSNNMKAYMEYCIDPLTKKLEDELNAKLFTFSEFLAGEHIKIIHKKD 321 (376) T ss_pred HHHHHhCCCHHHhCC--------------CCCCHHHHHHHHHHHHHHHHHHHHHHHHHhhhCCcccceecccchhhcccC Confidence 999999999999962 4688999999999999999999999999999998765433333 36788 Q ss_pred hHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCC--Ceeeccccccchhhhcccc Q lcl|NC_020081. 429 AKTEAEIISILESKAKIGLTINDIRKELGYPDTEGG--DVTLAGVHVQRLGQIMQQE 483 (552) Q Consensus 429 ~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~gg--D~~~~~~n~~~~~~~~~~~ 483 (552) .+++++.+..+ +.+|++|+||+|+++|+||+||| |+++++.|+++++....-. T Consensus 322 ~~~~~~~~~~~--~~~G~~t~NE~R~~lg~~p~~~g~~d~~~~~~n~~~~~~~~e~g 376 (376) T protein:vir:78 322 IIENAEAVDKL--VASGSFNRNEVRELLGAERVDNPELDKYLITKNYQSADEGGEDG 376 (376) T ss_pred HHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCCceeeeccCceehhccccCC Confidence 88888776644 45799999999999999999887 9999999999987432111 No 80 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=100.00 E-value=5.4e-60 Score=345.52 Aligned_cols=383 Identities=13% Similarity=0.047 Sum_probs=257.4 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||+++..++.+ . + ..... ..+.+ +. .+.+.. +...+.....+ T Consensus 1 M~~f~~~~~~~-~--------~------------------------~~~~~-~~~~~-~~-~~~~~~-~~~~~~~v~~~- 42 (386) T protein:vir:49 1 MPIFNITNLAT-E--------S------------------------PPINQ-ESFFD-IA-DSDFLA-SLNSSEWVSAE- 42 (386) T ss_pred CchhhhhccCC-C--------C------------------------cccch-hhhhh-hh-hccccc-cccCCceechh- Confidence 66665333221 0 0 00000 00000 00 000000 00011111111 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) .+...+.+.+||..+++.++ ++++.+..+. ...+ ...||+.||++ T Consensus 43 ---~al~~~~v~~~i~~ia~~ia-----------~~p~~~~~~~---------------~~~l------~~~PN~~~t~~ 87 (386) T protein:vir:49 43 ---NALKNSDLFSIISQLSNDLA-----------TAKITTSRKQ---------------LQGI------VDNPSNNANRF 87 (386) T ss_pred ---hhhccHHHHHHHHHHHHHhh-----------hCceeeccch---------------hhhh------hhccCCCCCHH Confidence 12234567888877766654 3444443211 0112 23688999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEc---CCceEEEEcccceee Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVI---DDKVVAKFKAKEMAW 237 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~---~~~~~~~~~~~evi~ 237 (552) +||+.++.+++++||||++|+|+..|++++||||+|++|+|..+.++... +|.+.. .++..+.|+++|||| T Consensus 88 ~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~~~~~------~y~~~~~~~~~~~~~~~~~~evih 161 (386) T protein:vir:49 88 NFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGL------YYNITFDDPHIAPKQHVPQNDILH 161 (386) T ss_pred HHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceE------EEEEEEcCccccceeEEccccEEE Confidence 99999999999999999999999999999999999999999998876532 333332 234567899999999 Q ss_pred ecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccccc Q lcl|NC_020081. 238 EVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGA 317 (552) Q Consensus 238 ~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~na 317 (552) ++.+. ..++++|+||+.++..+|..+.++++++.++|+||++|+++|++++. .++++.+++++.|... ..|+ T Consensus 162 ~~~~~---~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~--~~~~~~~~~~~~~~~~---~~n~ 233 (386) T protein:vir:49 162 FRLLS---VDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGG--GLLDFKTKVSRSRQAM---KQMQ 233 (386) T ss_pred ecCCC---CCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCC--CChHHHHHHHHHHHHh---ccCC Confidence 97642 33458999999999999999999999999999999999999999864 4777788888888753 4688 Q ss_pred ccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHh Q lcl|NC_020081. 318 WKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGL 397 (552) Q Consensus 318 gk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l 397 (552) |+++++ ++|++|++++.++.|+||+|++++++++||++|||||++||.... ++++.+ +.+.|+..+| T Consensus 234 g~~~vl-~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-----------~~~~~~-~~~~~~~~~i 300 (386) T protein:vir:49 234 GGPLVL-DDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGD-----------QQSSLE-MIYNIYFKSV 300 (386) T ss_pred CCceec-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-----------ccchHH-HHHHHHHHHH Confidence 887554 679999999999999999999999999999999999999985332 233443 4567889999 Q ss_pred hHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchh Q lcl|NC_020081. 398 EPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLG 477 (552) Q Consensus 398 ~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~ 477 (552) .|+++.|+++|+++|++..+.+. ..+++.|...+....+. .+.+|++|+||+|++++..++..++++.. ... T Consensus 301 ~~~l~~i~~~~~~~l~~~~~~~~-~~~~~~d~~~~~~~~~~--l~~~g~~t~nE~r~~l~~~~~~~~~~~~~-~~~---- 372 (386) T protein:vir:49 301 SRYLRPFVSEMSKKLSCEVDVDI-SPAVDPTGSNYISLINS--MVKSGTLAQNQGLYILQQAEILPKELPDG-KNP---- 372 (386) T ss_pred HHHHHHHHHHHHHHhcchhcccc-hhhhccCHHHHHHHHHH--HHhCCCcCHHHHHHHHhhCCCCCCcCcch-hcc---- Confidence 99999999999999986543221 12456677666655443 35578999999999998766543332110 000 Q ss_pred hhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccc Q lcl|NC_020081. 478 QIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQN 519 (552) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (552) .....++++. ..++ T Consensus 373 ------~~~~~~gGd~----------------------~~~~ 386 (386) T protein:vir:49 373 ------NRTSLKGGEI----------------------NEQD 386 (386) T ss_pred ------CCCCCCCCCC----------------------CCCC Confidence 0000000000 0000 No 81 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=100.00 E-value=4.6e-61 Score=351.35 Aligned_cols=360 Identities=14% Similarity=0.122 Sum_probs=239.3 Q ss_pred hccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGV 116 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~ 116 (552) |..+.+-+++..... +.. ...+. .+. -...++.++++++||..+++.++ ++ T Consensus 1 Mg~f~~~~~~~~~~~--~~~-----~~~~~------~~~-----~~~~~~~~~~v~~~v~~IA~~iA-----------~l 51 (378) T protein:vir:94 1 MNLFGKVVSFSRGKL--NND-----TQRVT------AWQ-----NEAVEYTSAFVTNIHNKIANEIT-----------KV 51 (378) T ss_pred CCccccchhcccccc--cCC-----cceee------eec-----cchhHHHHHHHHHHHHHHHhhhh-----------hC Confidence 333222222110000 000 00110 010 01234567788899877666654 55 Q ss_pred ceeeeeccccccC-ChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEEC-CCCCEEEEEEe Q lcl|NC_020081. 117 GYEIRLKDPLQEP-NDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYD-KLGDLHNFKAV 194 (552) Q Consensus 117 ~~~i~~k~~~~~~-~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~-~~G~~~~L~~l 194 (552) ++.+......+.. ........|++..+|. .+||+.||+++||+.++.+++++||+|++++++ ..|+++.++|. T Consensus 52 p~~~~~~~~~~~~~~~~~~~~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~l~p~ 126 (378) T protein:vir:94 52 EFNHVKYKKSDVGSDTLISMAGSDLDEVLN-----WSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLDLLFA 126 (378) T ss_pred ceeeEEEcccCcccccccccccchHHHHHh-----hcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEEEEec Confidence 6654333332221 1112223355655554 368899999999999999999999999997765 45777777653 Q ss_pred cCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 195 DASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNA 274 (552) Q Consensus 195 ~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~ 274 (552) . ....|+++||||++. +.++..|+||++.+..++.. T Consensus 127 ~------------------------------~~~~~~~~diiH~~~-----~~~~~~g~s~l~~~~~~i~~--------- 162 (378) T protein:vir:94 127 D------------------------------DKKEYKPEELVRLTS-----PFYINEDTSILDNALASIQT--------- 162 (378) T ss_pred C------------------------------CeeEeeeeeeEEecC-----cCCccchhHHHHHHHHHHHH--------- Confidence 2 112467889999873 23456799999988887643 Q ss_pred HHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhc---cccccccceeeccCCceeeeccCchhHHHHHHHHHHHHH Q lcl|NC_020081. 275 RFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFS---GINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLIN 351 (552) Q Consensus 275 ~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~---G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~ 351 (552) ++.+ +.|+|+|++++. +++++.+++++.|.+.+. +..++|+++++ ++|++|+++++++.++++ +.++++++ T Consensus 163 -~~~~-~~~~gil~~~~~--l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl-~~g~~~~~l~~~~~~~~~-~~~~~~~~ 236 (378) T protein:vir:94 163 -KLEQ-GKLRGLLKINAF--LDIDNTQEYREKALTTIKNMQEGSSYNGLTPV-DNKTEIVELKKDYSVLNK-DEIDLIKS 236 (378) T ss_pred -HHhc-ccccceeeeCCc--CCHHHHHHHHHHHHHHHHHhhcccccccceec-CCCceEEEccCChhhhhH-HHHHHHHH Confidence 2334 468999998764 466655555555555443 23578886544 679999999999999997 66799999 Q ss_pred HHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccccc----------cee Q lcl|NC_020081. 352 VICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGG----------DYV 421 (552) Q Consensus 352 ~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~----------~~~ 421 (552) +||++|||||++|+ .++.+++.+.|++.||.||+++||++|+++|+++.+. ++. T Consensus 237 ~Ia~~fgVP~~~l~----------------~~~se~~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~ 300 (378) T protein:vir:94 237 ELLTGYFMNENILL----------------GTASQEQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERII 300 (378) T ss_pred HHHHHhCCCHHHhc----------------CChHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhccccccee Confidence 99999999999994 1345788999999999999999999999999985321 123 Q ss_pred ec---ccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCc Q lcl|NC_020081. 422 FN---FVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLA 498 (552) Q Consensus 422 ~~---f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (552) |+ ++++|.+++++.+..+ +.+|+||+||+|+++||||+||||++++++|+++++.+...+..... T Consensus 301 f~~~~l~~~d~~~~~~~~~~~--~~~G~~T~NE~R~~~gl~p~~gGD~~~~~~n~~~~~~~~~~~~~~~~---------- 368 (378) T protein:vir:94 301 VDNQLFKFATLKELIDLYHEN--INGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGSRKD---------- 368 (378) T ss_pred ecchhhhhcCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCeeeecccccccccchhhcCCcCC---------- Confidence 43 3577888888876654 45799999999999999999999999999999998765443221100 Q ss_pred ccCCCCCCCCCCCCC Q lcl|NC_020081. 499 QQTGYDGNMDNVNGK 513 (552) Q Consensus 499 ~~~~~~~~~~~~~~~ 513 (552) ..+++.++++ T Consensus 369 -----~~~~~e~~n~ 378 (378) T protein:vir:94 369 -----VTSTDETNNQ 378 (378) T ss_pred -----CCCCCCCCCC Confidence 0011111111 No 82 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=100.00 E-value=9.8e-61 Score=349.58 Aligned_cols=360 Identities=14% Similarity=0.138 Sum_probs=239.2 Q ss_pred hccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGV 116 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~ 116 (552) |..+.+-+++....... ++. ....+. -...+++.+++.+||.++++.+ +.+ T Consensus 1 Mg~f~~~~~f~~~~~~~--------~~~-----~~~~~~-----~~~~~~~~~~v~~~i~~Ia~~i-----------A~l 51 (378) T protein:vir:93 1 MNLFGKVVSFSRGKLNN--------DTQ-----RVTAWQ-----NEAVEYTSAFVTNIHNKIANEI-----------TKV 51 (378) T ss_pred CccchhhhhhhccccCC--------Ccc-----eeeecc-----cchhHHHHHHHHHHHHHHHhhh-----------hhC Confidence 33332222221100000 000 000000 0123446677888887766665 456 Q ss_pred ceeeeeccccccCChh-HHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECC-CCCEEEEEEe Q lcl|NC_020081. 117 GYEIRLKDPLQEPNDH-NKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDK-LGDLHNFKAV 194 (552) Q Consensus 117 ~~~i~~k~~~~~~~~~-~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~-~G~~~~L~~l 194 (552) ++.+......+...++ .....|++..+|. .+||++||+++||+.++.+++++||+|++++|+. .|++..++|. T Consensus 52 p~~~~~~~~~~~~~~~~~~~~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~l~~~ 126 (378) T protein:vir:93 52 EFNHVKYKKSDVGSDTLISMAGSDLDEVLN-----WSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFA 126 (378) T ss_pred ceeeEEEcccccccccccccccchHHHHHh-----hcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEec Confidence 6655433332222111 1122355555543 2688999999999999999999999999988764 3666666542 Q ss_pred cCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 195 DASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNA 274 (552) Q Consensus 195 ~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~ 274 (552) . ....|+++||||++. +.++..|.|++..+...+. T Consensus 127 ~------------------------------~~~~~~~~diih~r~-----~~~~~~~~s~l~~~~~~i~---------- 161 (378) T protein:vir:93 127 D------------------------------DKKEYKTEELVRLTS-----PFYINEDTSILDNALASIQ---------- 161 (378) T ss_pred C------------------------------CeeEeccceeEEecC-----ccccchhhHHHHHHHHHHH---------- Confidence 1 123467899999862 3455678999888776553 Q ss_pred HHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhc---cccccccceeeccCCceeeeccCchhHHHHHHHHHHHHH Q lcl|NC_020081. 275 RFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFS---GINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLIN 351 (552) Q Consensus 275 ~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~---G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~ 351 (552) .+|.+| .|+|+|++++. +++++.+++++.|.+.+. +..++|+++++ ++|++|++++.++.++|+ +.++++++ T Consensus 162 ~~~~~~-~~~g~l~~~~~--l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~g~~~~~l~~~~~~~~~-~~~~~~~~ 236 (378) T protein:vir:93 162 TKLEQG-KLRGLLKINAF--LDIDNTQEYREKALTTIKNMQEGSSYNGLTPV-DNKTEIVELKKDYSVLNK-DEIDLIKS 236 (378) T ss_pred HHHhcC-cccceeeeCCc--CCHHHHHHHHHHHHHHHHHhhcccccccceEc-CCCceEEEccCChhhhhH-HHHHHHHH Confidence 345555 68999998764 466665655655555442 33577886544 679999999999999997 66789999 Q ss_pred HHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccccc----------cee Q lcl|NC_020081. 352 VICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGG----------DYV 421 (552) Q Consensus 352 ~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~----------~~~ 421 (552) +||++|||||++|+ .++.+++...|++.||.|+++.||++||++|+++.+. ++. T Consensus 237 ~Ia~~fgVPp~~l~----------------g~~~e~~~~~f~~~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~ 300 (378) T protein:vir:93 237 ELLTGYFMNENILL----------------GTATQEQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERII 300 (378) T ss_pred HHHHHhCCCHHHhc----------------CCcHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhccccccee Confidence 99999999999994 1345789999999999999999999999999976431 133 Q ss_pred ec---ccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCc Q lcl|NC_020081. 422 FN---FVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLA 498 (552) Q Consensus 422 ~~---f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (552) |+ ++++|.+++++.+..+ +.+|+||+||+|+++||||+||||++++++|+++++.+...+..+.. T Consensus 301 fd~~~l~~~d~~~~~~~~~~~--~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~---------- 368 (378) T protein:vir:93 301 VDNQLFKFATLKELIDLYHEN--INGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGSRKD---------- 368 (378) T ss_pred eccchhhhcCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCeeeeccccccccchhhhcCccCC---------- Confidence 43 3577888888876654 44799999999999999999999999999999998766543322100 Q ss_pred ccCCCCCCCCCCCCC Q lcl|NC_020081. 499 QQTGYDGNMDNVNGK 513 (552) Q Consensus 499 ~~~~~~~~~~~~~~~ 513 (552) ..+++++++| T Consensus 369 -----~~~~~e~~n~ 378 (378) T protein:vir:93 369 -----VTSTDETNNQ 378 (378) T ss_pred -----CCCCCCCCCC Confidence 0011111111 No 83 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=100.00 E-value=4.3e-60 Score=346.06 Aligned_cols=360 Identities=15% Similarity=0.138 Sum_probs=237.4 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||++.+ .+++.............. +. T Consensus 1 Mg~f~~------------------------------------~~~~~~~~~~~~~~~~~~-------------~~----- 26 (378) T protein:vir:16 1 MNLFGK------------------------------------VVSFSRGKLNNDTQRVTA-------------WQ----- 26 (378) T ss_pred Cccchh------------------------------------hhhhhcccccCCcceeee-------------cc----- Confidence 444441 111110000000000000 00 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChh-HHHHHHHHHHHHHhcCCCCCCCccCCH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDH-NKKKIKEIENFIEKTGRIDNDFTRDNF 159 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~-~~~~~~~l~~~l~~~n~~~~pn~~~t~ 159 (552) -...+++++++++||.+++..++ .++|.+......+...+. .....|++..+|. .+||++||+ T Consensus 27 ~~~~~~~~~~v~~~i~~Ia~~iA-----------~l~~~~~~~~~~~~~~~~~~~~~~~~l~~lL~-----~~PN~~~t~ 90 (378) T protein:vir:16 27 NEAVEYTSAFVTNIHNKIANEIT-----------KVEFNHVKYKKSDVGSDTLISMAGSDLDEVLN-----WSPKGERNS 90 (378) T ss_pred cchhhHHHHHHHHHHHHHHhhhh-----------hCceeEEEEcccccccccccccccchHHHHHh-----hcCCCCCCH Confidence 01134467788899887766654 456654333322221111 1123355555543 268899999 Q ss_pred HHHHHHHHHHHHhcCCeeEEEEECCC-CCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeee Q lcl|NC_020081. 160 RSFVKKLVRDRLTYDKINFELVYDKL-GDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWE 238 (552) Q Consensus 160 ~~f~~~~v~d~ll~Gna~~~i~r~~~-G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~ 238 (552) ++||+.++.+++++||+|++++|+.. |++..++|.. ....|+++||||+ T Consensus 91 ~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~~l~~~~------------------------------~~~~~~~~diih~ 140 (378) T protein:vir:16 91 MDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFAD------------------------------DKKEYKPEELVRL 140 (378) T ss_pred HHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEecC------------------------------CeeEecccceEEe Confidence 99999999999999999999998754 6666555432 1134678999998 Q ss_pred cccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhc---ccc Q lcl|NC_020081. 239 VSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFS---GIN 315 (552) Q Consensus 239 ~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~---G~~ 315 (552) +. +.++..|.||+..+...+. .++. ++.|+|+|++++. +++++.+++++.|++.+. +.. T Consensus 141 r~-----~~~~~~~~s~l~~~~~~i~----------~~~~-~~~~~g~l~~~~~--l~~~~~~~~~~~~~~~~~~~~~~~ 202 (378) T protein:vir:16 141 TS-----PFYINEDTSILDNALASIQ----------TKLE-QGKLRGLLKINAF--LDIDNTQEYREKALTTIKNMQEGS 202 (378) T ss_pred cC-----ccCccchhHHHHHHHHHHH----------HHHh-cCccceeeEeCCc--CCHHHHHHHHHHHHHHHHHhhccc Confidence 73 2345678899888877653 2344 4578999998764 456555555555555442 345 Q ss_pred ccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHH Q lcl|NC_020081. 316 GAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDK 395 (552) Q Consensus 316 nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~ 395 (552) ++|+++++ ++|++|+++++++.++++ +.+++++++||++|||||.+|+ .++.+++.+.|++. T Consensus 203 ~~g~~~vl-~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~----------------g~~~e~~~~~f~~~ 264 (378) T protein:vir:16 203 SYNGLTPV-DNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILL----------------GTASQEQQIYFYNS 264 (378) T ss_pred ccccceEc-CCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhc----------------CCchHHHHHHHHHH Confidence 78886555 579999999999999997 4568999999999999999994 13457899999999 Q ss_pred HhhHHHHHHHHHHHhhcCccccc----------ceeec---ccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCC Q lcl|NC_020081. 396 GLEPLLKFIEDAVNKYIVSQFGG----------DYVFN---FVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTE 462 (552) Q Consensus 396 ~l~P~~~~ie~~ln~~L~~~~~~----------~~~~~---f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ 462 (552) ||.||++.||++|+++|+++.+. +++|+ +++.|.+++++.+..+ +.+|+||+||+|+++||||+| T Consensus 265 tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~--~~~G~~T~NE~R~~~g~~p~~ 342 (378) T protein:vir:16 265 TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHEN--INGPIFTQNQLLVKMGEQPIE 342 (378) T ss_pred HHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCC Confidence 99999999999999999976431 23343 3577888887776543 447999999999999999999 Q ss_pred CCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCC Q lcl|NC_020081. 463 GGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGK 513 (552) Q Consensus 463 ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (552) |||++++++|+++++.+...+...... .+.+.++++ T Consensus 343 ggD~~~~~~n~~~~~~~~~~~~~~~~~---------------~~~~e~~ne 378 (378) T protein:vir:16 343 GGDVYIANLNAVAVKNLSDLQGSRKDV---------------TSTDETNNQ 378 (378) T ss_pred CCCeEeeccccccccchhhhcCccCCC---------------CCCCCCCCC Confidence 999999999999987654432211000 001111111 No 84 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=100.00 E-value=1.3e-59 Score=343.48 Aligned_cols=379 Identities=12% Similarity=0.079 Sum_probs=239.6 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||++| +|+.+.- . .+ + ...+...+. ... T Consensus 1 Mgl~d-~~~~~~~--------~------------~~-----------~---------~~~~~~~~~---~~~-------- 28 (395) T protein:vir:96 1 MGILD-FFSFKKS--------G------------TL-----------S---------DDDSGSTTS---EKL-------- 28 (395) T ss_pred Ccchh-hhcCCCC--------c------------cc-----------c---------ccccccchh---hhc-------- Confidence 99998 5433210 0 00 0 000000000 000 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) ...+...+.+.+||.+++..+ +.+++.+..++.. ....|++..+|. .+||++||++ T Consensus 29 -~~~~l~~~~v~~~i~~Ia~~i-----------a~lp~~v~~~~~~-------~~~~~~~~~lL~-----~~PN~~~t~~ 84 (395) T protein:vir:96 29 -TNVVLKEDALYKCVNYLARII-----------SKSTFRIKAPEKL-------TENQKDWLYWIN-----TKANPNQSAS 84 (395) T ss_pred -chhhhhhHHHHHHHHHHHHhh-----------ccceeEEEeCCcc-------ccccchHHHHHh-----hcCCCCCCHH Confidence 112223456788887766654 4456666543221 112344444432 3789999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecc Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS 240 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~ 240 (552) +||+.++.+++++||+|++++|+..+.+...++.... . . +..+..+..........++++||+|+++ T Consensus 85 ~f~~~l~~~lll~Gna~~~~~~~~~~~~~~~~~~~~~-----~--~------~~~~~~v~~~~~~~~~~~~~~dvih~k~ 151 (395) T protein:vir:96 85 QFWVEVVQKLLVDGETLIFVIPGKGIYVADAFTQDKK-----L--S------GNKFKVSRVQGQTYEKIFTFDQVIYLKN 151 (395) T ss_pred HHHHHHHHHHhhcCceEEEEEcCCceecCCccccccc-----c--c------cceeeeeeeccceeeeEeccCceEEecc Confidence 9999999999999999999999864332222221110 0 0 0011111222222356789999999987 Q ss_pred cccCCcc--CCc-ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccccc Q lcl|NC_020081. 241 NPRTDLT--VGK-YGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGA 317 (552) Q Consensus 241 ~~~~~~~--~g~-~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~na 317 (552) +...... ++. .+.+++..+...+.....+.++..++|.+|+.|.+++.+.+.. .+ +..++.|++.+.+..+. T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~---~~~~~~~~~~~~~~~~~ 226 (395) T protein:vir:96 152 DNSDLMLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQENSDGGR--QP---KSDKDFFKRTIEKIRTE 226 (395) T ss_pred cCCccccccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccccceeeccCchh--hH---HHHHHHHHHHHHHhhcC Confidence 5432111 111 1122333333333333445578899999999999999876532 33 34455555544443322 Q ss_pred ccceeeccCCceeeeccCchhHHHHHHHHHHH------HHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHH Q lcl|NC_020081. 318 WKIPVITAEDVKFVNMTQSSKDMEFEKWLNYL------INVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRN 391 (552) Q Consensus 318 gk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~------~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~ 391 (552) +..++++++|++|+++++++.|+|+++.+++. +++||++|||||++|| .+++|++++.+. T Consensus 227 ~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~--------------~~~sn~e~~~~~ 292 (395) T protein:vir:96 227 SVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH--------------GDIADNQKNYEL 292 (395) T ss_pred CcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc--------------CCCccHHHHHHH Confidence 22235567899999999999999999988775 5899999999999996 246899999999 Q ss_pred HHHHHhhHHHHHHHHHHHhhcCccc--ccceeecc---cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCC--C Q lcl|NC_020081. 392 SKDKGLEPLLKFIEDAVNKYIVSQF--GGDYVFNF---VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEG--G 464 (552) Q Consensus 392 ~~~~~l~P~~~~ie~~ln~~L~~~~--~~~~~~~f---~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~g--g 464 (552) |++.||.||++.||++|+++|+++. ..+++|.| +++|.+++++.+..+ +.+|+||+||+|+++|+||+|| | T Consensus 293 f~~~~L~P~~~~ie~~l~~~Ll~~~e~~~~~~f~~~~l~~~d~~~~~~~~~~~--~~~G~~T~NE~R~~~gl~pi~~~~g 370 (395) T protein:vir:96 293 LLEGPIESLITNIVDGLEYAIFDKSETLEGSFIKVTGLKNYDLFSISSQADKL--ISSGFVFIDEVREEIGLPELPDGLG 370 (395) T ss_pred HHHHHHHHHHHHHHHHHHhhcCChhhhcCceeEeecchhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCC Confidence 9999999999999999999999854 34566665 567888887776654 4479999999999999999976 9 Q ss_pred CeeeccccccchhhhccccccccccCCCCCccCcc Q lcl|NC_020081. 465 DVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQ 499 (552) Q Consensus 465 D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (552) |+++++.|+++++.. +++.+...++ T Consensus 371 D~~~~~~N~~~~~~~----------gge~~~~~~~ 395 (395) T protein:vir:96 371 KVLYMTKNYESVLER----------GGEVDEEVET 395 (395) T ss_pred ceeeecccceechhc----------cCCCCCCCCC Confidence 999999999887531 1111100000 No 85 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=100.00 E-value=4.8e-59 Score=340.32 Aligned_cols=359 Identities=14% Similarity=0.126 Sum_probs=231.4 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHH-HHhhcchHHHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQML-KLWSRKNIILNAIIITRVNQV 102 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L-r~~a~~~~i~~a~i~~~~~~~ 102 (552) |+. +.+-.++. ..+...++...+..+ +..+..++++++||..++..+ T Consensus 1 M~~-------------f~k~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~i 48 (378) T protein:vir:85 1 MNL-------------FGKVVSFS-------------------RGKLNNDTQRVTAWQNEAVEYTSAFVTNIHNKIANEI 48 (378) T ss_pred Cch-------------hhhhhhhh-------------------hcccccCCcceeeeeccchhhhhHHHHHHHHHHHHhH Confidence 222 21111110 000000000011000 123456677888987766665 Q ss_pred HHHHHHHHhhccccceeeeeccccccCCh-hHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEE Q lcl|NC_020081. 103 SMFCTPARNSDKGVGYEIRLKDPLQEPND-HNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELV 181 (552) Q Consensus 103 ~~~~~~~~~~~~~~~~~i~~k~~~~~~~~-~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~ 181 (552) +.+++.+..+..++...+ ......|++..+|. .+||+.||+++||+.++.+++++||||++++ T Consensus 49 -----------A~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i 112 (378) T protein:vir:85 49 -----------TKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLN-----WSYKGEHNSMEFWQKVIKKLLCTRYVDLYPI 112 (378) T ss_pred -----------hhCceeEEEEeccccccccccccccchHHHHHh-----ccCCCCCCHHHHHHHHHHHHhhcCCeEEEEe Confidence 456666544433322211 12234455555553 3688999999999999999999999999864 Q ss_pred -ECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHH Q lcl|NC_020081. 182 -YDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIAL 260 (552) Q Consensus 182 -r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~ 260 (552) ++..|++..+++. .+ ...|.++|+||++... ..++ +.+.+..+. T Consensus 113 ~~~~~g~~~~~~~~----------------------------~~--~~~~~~~dvih~~~~~---~~~~--~~~~~~~a~ 157 (378) T protein:vir:85 113 FDSETGELLDLLFA----------------------------ND--KKEYKPEELVRLVSPF---YINE--DTSILDNAL 157 (378) T ss_pred ecCCCceEEEEEec----------------------------CC--CEEEcccceEEEecCc---Cccc--hhhHHHHHH Confidence 4556655544332 11 1245678999987532 2222 234444433 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHh---ccccccccceeeccCCceeeeccCch Q lcl|NC_020081. 261 NHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMF---SGINGAWKIPVITAEDVKFVNMTQSS 337 (552) Q Consensus 261 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~---~G~~nagk~~il~~~g~~~~~l~~~~ 337 (552) ..+ ..++++ +.|+|+|++++. +++++.+++++.|++.+ .+..++|+++++ ++|++|+++++++ T Consensus 158 ~~~----------~~~~~~-~~~~g~l~~~~~--l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl-~~g~~~~~l~~~~ 223 (378) T protein:vir:85 158 ASI----------QTKLEQ-GKLRGLLKINAF--LDIDNTQEYREKALATIKNMQEGSSYNGLTPV-DNKTEIVELKKDY 223 (378) T ss_pred HHH----------HHHHhc-CCcceEEEeCCc--CCHHHHHHHHHHHHHHHHHhhcccccccceec-CCCceEEeccCCh Confidence 332 234444 578999998764 46766666666665543 244678886554 5799999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc Q lcl|NC_020081. 338 KDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG 417 (552) Q Consensus 338 ~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~ 417 (552) .++++ +.+++++++||++|||||++|+ .++.+++...|++.||.||+++||++||++|+++.+ T Consensus 224 ~~~~~-~~~~~~~~~Ia~~fgVPp~~l~----------------~s~~e~~~~~f~~~tL~P~~~~ie~~l~~kLl~~~e 286 (378) T protein:vir:85 224 SVLNK-DEIELIKSELLTGYFMNENILL----------------GTATQEQQIYFYNSTIIPLLIQLEKELTYKLISTNR 286 (378) T ss_pred hhhhH-HHHHHHHHHHHHHhCCCHHHhc----------------CCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhh Confidence 99996 6789999999999999999994 234578899999999999999999999999997643 Q ss_pred c--c--------eeec---ccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccc Q lcl|NC_020081. 418 G--D--------YVFN---FVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQ 484 (552) Q Consensus 418 ~--~--------~~~~---f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~ 484 (552) . . +.|+ ++++|.+++++.++.+ +..|+||+||+|+++||||+||||++++++|+++++.+...+. T Consensus 287 r~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~--~~~G~~T~NE~R~~lgl~p~~gGD~~~~~~N~~~~~~~~~~~~ 364 (378) T protein:vir:85 287 RRVVKGNLYYERIIVDNQLFKFATLKELIDLYHEN--INGPIFTQNQLLVKMGEQPIEGGDIYIANLNAVAVKNLSDLQG 364 (378) T ss_pred hhhhhhccccceeeecchhhhhcCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCeEeecccccccccchhhcC Confidence 2 1 2233 3577888888877654 4479999999999999999999999999999999886554322 Q ss_pred cccccCCCCCccCcccCCCCCCCCCCCCCCCccc Q lcl|NC_020081. 485 VEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQ 518 (552) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (552) .... ..+++.+ .|+ T Consensus 365 ~~~~---------------~~~~~e~-----~n~ 378 (378) T protein:vir:85 365 SRKD---------------VASTDET-----NNQ 378 (378) T ss_pred ccCC---------------CCCCCCC-----CCC Confidence 1100 0001111 111 No 86 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=100.00 E-value=1.8e-58 Score=337.10 Aligned_cols=380 Identities=14% Similarity=0.087 Sum_probs=246.7 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||+|+ ++..+.- . ..+.... ...+ + .. T Consensus 1 MGlf~-~~~~~~~--------~-----------------------------~~~~~~~---~~~~----~--------~~ 27 (395) T protein:vir:98 1 MGILD-FFSFKKS--------G-----------------------------TLSDDDS---GSTT----S--------EK 27 (395) T ss_pred Ccchh-hhcCCCc--------c-----------------------------ccccccc---chhh----h--------hh Confidence 99988 4422210 0 0000000 0000 0 00 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) ....+...+.+++||..++..++ ++++.+..++... ...|++..+|. .+||++||++ T Consensus 28 ~~~~~~~~~~v~~~I~~ia~~iA-----------~lp~~~~~~~~~~-------~~~~~~~~lL~-----~~PN~~~t~~ 84 (395) T protein:vir:98 28 LTNVVLKEDALYKCVNYLARIIS-----------KSTFRLKTPEKLT-------ENQKDWLYWIN-----TKANPNQSAS 84 (395) T ss_pred cchhhhhhHHHHHHHHHHHHHHh-----------hCceeEEecCCcc-------cccchHHHHHh-----hcCCCCCCHH Confidence 01122244567888877666554 4556554332111 12244444443 3789999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecc Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS 240 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~ 240 (552) +||+.++.+++++||||++++++..+. +++..+.... ..+ ..+.++..........|.++||||+++ T Consensus 85 ~f~~~~~~~lll~Gnayi~~~~~~~~~------~~~~~~~~~~-~~~------~~~~~~~~~~~~~~~~~~~~evih~k~ 151 (395) T protein:vir:98 85 QFWVEVIQKLLVDGETLIFVIPGKGIY------VADSFTQDKK-ISG------SQFKVSRVQGQTYEKTFTFDQVIYLKN 151 (395) T ss_pred HHHHHHHHHHhhcCceEEEEEeCCcee------cCCccccccc-ccC------cccceeeecCceeeeEecCccEEEecC Confidence 999999999999999999999975432 2222222211 001 111122222223356789999999986 Q ss_pred cccCCccCCcccccHHHHHHHHHHHHHHH--HHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccc Q lcl|NC_020081. 241 NPRTDLTVGKYGYPELEIALNHLQYHDNT--EVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAW 318 (552) Q Consensus 241 ~~~~~~~~g~~G~spl~~~~~~i~~~~~~--~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nag 318 (552) +. .+. ..++.+++......+...... ..+..+++.++..+.+++..+.. ..++++.++.++.|++.+.+..+.+ T Consensus 152 ~~-~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (395) T protein:vir:98 152 DN-SDL--MSKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQEN-SDGGRQSKSDKDFFKRTVEKIRTES 227 (395) T ss_pred CC-CCc--cccccchhhhHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc-CCcHHHHHHHHHHHHHHHhhhhcCC Confidence 53 222 234445555555555444333 34456788888888888776543 3356667788888888877644333 Q ss_pred cceeeccCCceeeeccC------chhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHH Q lcl|NC_020081. 319 KIPVITAEDVKFVNMTQ------SSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNS 392 (552) Q Consensus 319 k~~il~~~g~~~~~l~~------~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~ 392 (552) ..++++++|++|+++++ ++.++||++.+++++++||++|||||++|| .+++|++++.+.| T Consensus 228 ~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~--------------~~~sn~e~~~~~f 293 (395) T protein:vir:98 228 VVGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH--------------GDIADNQKNYELL 293 (395) T ss_pred cceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhc--------------CCcccHHHHHHHH Confidence 33455678999999984 467889999999999999999999999996 2478999999999 Q ss_pred HHHHhhHHHHHHHHHHHhhcCcccc--cceeecc---cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCC--CC Q lcl|NC_020081. 393 KDKGLEPLLKFIEDAVNKYIVSQFG--GDYVFNF---VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEG--GD 465 (552) Q Consensus 393 ~~~~l~P~~~~ie~~ln~~L~~~~~--~~~~~~f---~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~g--gD 465 (552) +++||.||+++||++||++|+++.+ .+++|+| +++|.+++++.+..+ +.+|++|+||+|+++|+||++| || T Consensus 294 ~~~tl~P~~~~ie~~l~~kll~~~~~~~g~~f~~~~l~~~d~~~~~~~~~~~--~~~G~~T~NE~R~~~g~~Pi~~~~gD 371 (395) T protein:vir:98 294 LEGPIESLITNIVDGLEYAIFDKSETLQGSFIKVTGLKNYDLFSISNQADKL--ISSGFVFIDEVREEIGLPELPDGLGK 371 (395) T ss_pred HHHHHHHHHHHHHHHHHHhcCChhhhcCcceeeehhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCCc Confidence 9999999999999999999998643 4456664 577888877766643 4579999999999999999976 99 Q ss_pred eeeccccccchhhhccccccccccCCCCCccCcc Q lcl|NC_020081. 466 VTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQ 499 (552) Q Consensus 466 ~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (552) ++++++|+++++.. +++.+...++ T Consensus 372 ~~~~~~n~~~~~~~----------gge~~~~~~~ 395 (395) T protein:vir:98 372 VLYMTKNYESVLER----------GGEVDEEVET 395 (395) T ss_pred eeeecccceecccc----------cCCCCCCCCC Confidence 99999999987521 1111100000 No 87 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=100.00 E-value=1.3e-58 Score=338.00 Aligned_cols=380 Identities=13% Similarity=0.037 Sum_probs=235.0 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVS 103 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~ 103 (552) |.. .+.+.+.. ++..+.... . ....|.. .. ......+...+.+.+||.+++..++ T Consensus 1 Mg~------~~~~~~~~----~~~~~~~~~-~-~~~~~~~----~~---------~~~~~~~l~~~~v~~~v~~Ia~~ia 55 (395) T protein:vir:40 1 MGF------KSWVSGFF----NEEQRTLNL-T-DTVWCSI----PS---------EKLKELSIKKWAIDSCANKIANTLS 55 (395) T ss_pred Cch------HHHHHhhh----ccccccccc-c-cchhhcc----cc---------ccchhhhhhhHHHHHHHHHHHHHHh Confidence 111 11111111 111111100 0 0001100 00 0011122344567888877766654 Q ss_pred HHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEEC Q lcl|NC_020081. 104 MFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYD 183 (552) Q Consensus 104 ~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~ 183 (552) .+++.+..++. . ..+++..+|. .+||+.||+++||+.++.+++++||+|+++.++ T Consensus 56 -----------~~p~~~~~~~~--~-------~~~~~~~lL~-----~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~ 110 (395) T protein:vir:40 56 -----------CAEVLTYEKGE--E-------VRKKNWYMFN-----VEANQNQNATEFWKKAIYKLVYDNEALIFMQDE 110 (395) T ss_pred -----------hCceeeccCCc--c-------ccchHHHHHH-----hcCCCCCCHHHHHHHHHHHHhhcCceEEEEecC Confidence 44555433221 1 1133333332 378999999999999999999999999999876 Q ss_pred CCCCEEEEEEecCceeEEEECCCcccccccceeEEE-EEcCC-ceEEEEcccceeeecccccCCccCCcccccHHHHHHH Q lcl|NC_020081. 184 KLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYV-QVIDD-KVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALN 261 (552) Q Consensus 184 ~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~-~~~~~-~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~ 261 (552) . +++.++..+ ...... ...|. ....+ .....|+++||||++++... ...++.+.+..+.. T Consensus 111 ~------~~~~~~~~~-~~~~~~--------~~~~~~v~~~~~~~~~~~~~~evih~r~~~~~---~~~~~~~l~~~~~~ 172 (395) T protein:vir:40 111 Y------IYVADSFTK-NDKSLY--------ENTYTEVTLKDLTLKKEFKESEVLHLTLNNES---IKSIIDGFYLLYGD 172 (395) T ss_pred c------eeecCCccc-cccccc--------cceeeeeeecCceeeeeeccccEEEeecCCCC---ccccchhHHHHHHH Confidence 4 233332211 111000 01111 12222 22457899999999865321 11233344444443 Q ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc-cccccceeeccCCceeeeccCchhHH Q lcl|NC_020081. 262 HLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGI-NGAWKIPVITAEDVKFVNMTQSSKDM 340 (552) Q Consensus 262 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~-~nagk~~il~~~g~~~~~l~~~~~d~ 340 (552) .+.... ...++.+ .+++++.++....+++++.+++++.|++.+.|. .+++++ +++++|++|+++++++.|+ T Consensus 173 ~~~~~~-----~~~~~~~--~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~vl~~g~~~~~l~~~~~d~ 244 (395) T protein:vir:40 173 LLTAAV-----NKYKKLN--SRKIIVKLKAMFGQTPEAEEKLRLMLSERMKKFLAEGDSA-LPVEDGMEIDELAGDSKIA 244 (395) T ss_pred HHHHHH-----HHHHhcC--CCCceEEEecccCCCHHHHHHHHHHHHHHHHHhhccCCce-eecCCCceEEeccCChhhh Confidence 332222 2223333 455556655556679999999999999999885 456654 5567899999999999999 Q ss_pred HHHHHHHHHH---HHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc Q lcl|NC_020081. 341 EFEKWLNYLI---NVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG 417 (552) Q Consensus 341 q~~e~~~~~~---~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~ 417 (552) ||+|.+++.. ++||++|||||++|| .+++|++++.+.|++.||.||+++||++||++|+++.+ T Consensus 245 q~~e~~~~~~~~~~~Ia~~fgVPp~~l~--------------~~~sn~e~~~~~f~~~~L~P~~~~ie~~l~~kLl~~~~ 310 (395) T protein:vir:40 245 ESRDIKKMIDDVFEMVANSFNIPLGLAK--------------GDTVGLSEQVNSFLMFSINPIAEMFTDEGNRKFYGRDS 310 (395) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhc--------------CCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhh Confidence 9999998874 799999999999996 13688999999999999999999999999999998644 Q ss_pred --cceeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCC--CCeeeccccccchhhhccccccccc Q lcl|NC_020081. 418 --GDYVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEG--GDVTLAGVHVQRLGQIMQQEQVEYQ 488 (552) Q Consensus 418 --~~~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~g--gD~~~~~~n~~~~~~~~~~~~~~~~ 488 (552) .+++|+| +++|.+++++.+..+ +.+|+||+||+|+++|+||++| ||++++++|+++++..... . T Consensus 311 ~~~g~~i~fd~~~ll~~d~~~~~~~~~~~--~~~G~~t~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~~~~-----~ 383 (395) T protein:vir:40 311 VLERTYMKLDTTRIKVQDIQEIASSMDVL--FHIGVNTIDDNLRMIGREPVMSPETQERFVTKNYAPLGENEED-----L 383 (395) T ss_pred hcCCceEEEechhhhccCHHHHHHHHHHH--HhCCCCCHHHHHHHhCCCCCCCCCCceeeeccccccccccccc-----c Confidence 3455554 577888888766643 4579999999999999999955 9999999999887643211 0 Q ss_pred cCCCCCccCcccCCCCCCCCCCCCCCCccc Q lcl|NC_020081. 489 RQMDANQFLAQQTGYDGNMDNVNGKDSFNQ 518 (552) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (552) ++++.+. .+++ + T Consensus 384 kgge~~~---------~~~~---------~ 395 (395) T protein:vir:40 384 KGGDINE---------NKGD---------S 395 (395) T ss_pred CCCCCCC---------CcCC---------C Confidence 1111000 0000 0 No 88 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=100.00 E-value=3.9e-57 Score=329.81 Aligned_cols=354 Identities=13% Similarity=0.161 Sum_probs=239.0 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) |-.-+ +||--.-|.+-+......--|++.....+..-....++..+.++........+ + .+.|...++ .+++. T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~f--g---~p~~v~~~~-~~~~~ 73 (376) T protein:vir:10 1 MPARD-RPRAARRRRHSFIFIHGVLRMSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTF--D---DPTPVMNRA-EILDY 73 (376) T ss_pred CCCCc-cchhhhhhcccchhhcccccchhccCCCcccchhhhhHhhhccCcceeEEEEc--C---CceeccCcc-hhhhh Confidence 54444 44444444333333333333322221111110000111111111111111110 0 011111111 12222 Q ss_pred HHHhh------------------cchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHH Q lcl|NC_020081. 81 LKLWS------------------RKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIEN 142 (552) Q Consensus 81 Lr~~a------------------~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~ 142 (552) +.-+. +.+..+.+|+.. -.+ T Consensus 74 ~~~~~~~~~~~pp~~~~~La~~~~~~~~h~s~l~~------------------------------------------k~n 111 (376) T protein:vir:10 74 VECWSNGEWFEPPVSFAGLAKSFRASTHHSSALFF------------------------------------------KAN 111 (376) T ss_pred hhhhhcCceecCCCCHHHHHHHHhhhHHhhhhHHH------------------------------------------HhH Confidence 22211 111111111111 011 Q ss_pred HHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEc Q lcl|NC_020081. 143 FIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVI 222 (552) Q Consensus 143 ~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~ 222 (552) .|... -.||+.+|.++|++ ++.|++++||+|++++|+..|+|++|+||+|.+|++..+.++ |++.. T Consensus 112 ~l~~~---~~Pnp~lT~~~f~~-~v~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~~~----------~~~~~ 177 (376) T protein:vir:10 112 VLAST---FRPHRWLSRHAFER-WALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFNG----------FVYVN 177 (376) T ss_pred HHHhc---cCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceEEEeeCCe----------EEEEE Confidence 22221 25889999999975 567999999999999999999999999999999998876543 34455 Q ss_pred CCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHH Q lcl|NC_020081. 223 DDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTS 302 (552) Q Consensus 223 ~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~ 302 (552) .++....|.++||||++. +++.+++||+||+.+++.++.+..++..|..+||+||++|+|||.+++ ..++++++++ T Consensus 178 ~~~~~~~~~~~eViHir~---~~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa~pggIl~~~d-~~l~~e~~~~ 253 (376) T protein:vir:10 178 GWQERHEFEPDSVFQLVR---PDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTD-AAQKQDDVDN 253 (376) T ss_pred cCCeEEEEccccEEEecC---CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHH Confidence 566778899999999974 356678999999999999999999999999999999999999999864 4679999999 Q ss_pred HHHHHHHHhccccccccceeec----cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccc Q lcl|NC_020081. 303 FRREWTSMFSGINGAWKIPVIT----AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGN 378 (552) Q Consensus 303 ~~~~~~~~~~G~~nagk~~il~----~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~ 378 (552) ++++|++ +.|..|+++++|+. .+|++|++++.++.|+||+|.+++++++||++|||||.+||+.+.+| T Consensus 254 lr~~~~~-~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t------- 325 (376) T protein:vir:10 254 MRDALKN-AKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNS------- 325 (376) T ss_pred HHHHHHH-hcCccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCC------- Confidence 9999987 68999999976664 35899999999999999999999999999999999999999987654 Q ss_pred cccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHH Q lcl|NC_020081. 379 TLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKT 431 (552) Q Consensus 379 ~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~ 431 (552) .+++|++++.+.|+++||.|+++.|| ++|.+|..+.-......++++|.++ T Consensus 326 -~~~sn~eq~~~~f~~~~L~Pl~~~ie-eln~~L~~~~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 326 -GGFGTPDTAARVFGRNEIRPLQARFA-ELNDWLGEEVVRFDDYEIPPAPVAA 376 (376) T ss_pred -CCcccHHHHHHHHHHHHHHHHHHHHH-HHHhhccccccccChhHhhcccccC Confidence 35789999999999999999999998 4888775432111111235555554 No 89 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=100.00 E-value=5.4e-57 Score=329.08 Aligned_cols=329 Identities=16% Similarity=0.152 Sum_probs=230.5 Q ss_pred cccccchhhhhcccccccccccccc-----cccccccc---c-----cCC--cccccccCCCCchHHHHHHHhhcchHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAY-----EEPIIGSM---S-----MNP--DFKEAPSIHGKQNLLQMLKLWSRKNIIL 91 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~---~-----~~~--~~~~~~~~~~~~~~~~~Lr~~a~~~~i~ 91 (552) |.+|......... ..+..+. ++|+.... + .|. .|++- +..+..|.+..+.+..+ T Consensus 1 ~~~~~~~~~~~~~-----~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ep------p~~~~~La~l~~~n~~h 69 (348) T protein:vir:26 1 MTEQLIHSHTTDG-----TESKSVYSFDPNPEPVDTNSWMTRYCELFYNDFDDYWEP------PISLKGLAEIANANGYH 69 (348) T ss_pred CCccccchhhccc-----cCCceEEEecCCCeeecCcchHHHHHHHHhcCCCccccC------CCCHHHHHHHHhhhhhh Confidence 4433332222111 0000111 11111000 0 000 01110 01122222222223333 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHH Q lcl|NC_020081. 92 NAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRL 171 (552) Q Consensus 92 ~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~l 171 (552) +.|+..+. +.|... -.||+++|.++|++. +.+++ T Consensus 70 ~~~i~~k~------------------------------------------N~l~~~---~~Pn~~~t~~~f~~~-~~d~l 103 (348) T protein:vir:26 70 GSLLKARA------------------------------------------NYVAGR---FMNGGGLPMYKMNSA-CWDYF 103 (348) T ss_pred hhhHhhhh------------------------------------------hHHhhc---ccCCCCCCHHHHHHH-HHHHH Confidence 33322111 122211 258899999999765 57999 Q ss_pred hcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcc Q lcl|NC_020081. 172 TYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKY 251 (552) Q Consensus 172 l~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~ 251 (552) ++||+|++++|+..|+|++|+||++.+|++..+ |. |+++..++....|.++||||++. +++.+++| T Consensus 104 l~Gnay~~~~rn~~G~~~~L~~l~~~~v~~~~d--~~---------~~~~~~~g~~~~f~~~dIiHir~---~~~~~~~~ 169 (348) T protein:vir:26 104 GLGMSAFVKIRSYLKNVIALEPLPMVHMRKRKN--GD---------FVQLLRNNEQKVFKAKDVIFIPQ---YDPQQQIY 169 (348) T ss_pred hcCCeEEEEEEcCCCcEEEEEEecCceeEeeec--Cc---------EEEEEecCeEEEEcCccEEEEcC---CCCCCCcc Confidence 999999999999999999999999999988643 21 22333455677899999999874 35567899 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec----cCC Q lcl|NC_020081. 252 GYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT----AED 327 (552) Q Consensus 252 G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~----~~g 327 (552) |+||+.+++.++.+..++..|.++||+||++|++||.++. ..+++++++++|++|++. .|..|++++.|+. .+| T Consensus 170 Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~-~~ls~e~~~~lk~~~~~~-~G~~n~~~~~vl~~~g~~~G 247 (348) T protein:vir:26 170 GLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYATD-PNLSEADEKALKEKIASS-KGIGNFRSMFVNIPNGKEKG 247 (348) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHHHHHHHHHh-cCcccccceeEEcCCCCccc Confidence 9999999999999999999999999999999999999864 467999999999999986 6788888876553 468 Q ss_pred ceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHH Q lcl|NC_020081. 328 VKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDA 407 (552) Q Consensus 328 ~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ 407 (552) +++++++.++.|+||++.+++++++||++|||||++||+.+.++ .++++++++.+.|+++||.|+++.||++ T Consensus 248 i~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~--------~~~sn~e~~~~~f~~~~l~P~~~~ie~~ 319 (348) T protein:vir:26 248 IQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQG--------ANVPDPLKVSQVYDFYEVIPVCKRFMDA 319 (348) T ss_pred eeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCC--------CccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999876543 3578999999999999999999999999 Q ss_pred HHhhcCcccccceeecccccChHHHHHHHHH Q lcl|NC_020081. 408 VNKYIVSQFGGDYVFNFVGGDAKTEAEIISI 438 (552) Q Consensus 408 ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~ 438 (552) ||++|....+..++|+|+.. .++.+..++ T Consensus 320 ln~~l~~~~~~~~~fdl~~~--~e~~~~~a~ 348 (348) T protein:vir:26 320 VNNDPEIPDNLKLKFNLNPG--VESANGSAV 348 (348) T ss_pred HhhhhCCCCccEEEEecCcc--cccchhhcC Confidence 99998754444445554322 222222222 No 90 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=100.00 E-value=1.1e-56 Score=327.40 Aligned_cols=359 Identities=14% Similarity=0.118 Sum_probs=228.4 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVS 103 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~ 103 (552) |+.+. +-+++.. +....+..+.+ ...+. ..++.++++.+||..+++.++ T Consensus 1 M~if~-------------~~~~~~~------~~~~~~~~~~~----~~~~~--------~~~~~~~~v~~~v~~Ia~~iA 49 (378) T protein:vir:94 1 MNLFG-------------KVVSFSR------GKLNNDTQRVT----AWQNE--------AVEYTSAFVTNIHNKIANEIT 49 (378) T ss_pred CchhH-------------HhHhhhh------cccccCcceee----eeecc--------hhhhhhHHHHHHHHHHHHhHh Confidence 33332 1122110 00000001100 00011 234455678888887777665 Q ss_pred HHHHHHHhhccccceeeeeccccc-cCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEE- Q lcl|NC_020081. 104 MFCTPARNSDKGVGYEIRLKDPLQ-EPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELV- 181 (552) Q Consensus 104 ~~~~~~~~~~~~~~~~i~~k~~~~-~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~- 181 (552) .+++.+......+ ..........|++..+|. .+||+.||+++||+.++.+++++||+|++++ T Consensus 50 -----------~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lLn-----~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~ 113 (378) T protein:vir:94 50 -----------KVEFNHVKYKKSDVGSDTLISMAGSDLDEVLN-----WSSKGERNSMEFWQKVIKKLLTTRYIDLYPIF 113 (378) T ss_pred -----------hCceeeeeecccccccccccccccchHHHHHh-----hcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 3445433222221 111111223355554443 3688999999999999999999999999854 Q ss_pred ECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHH Q lcl|NC_020081. 182 YDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALN 261 (552) Q Consensus 182 r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~ 261 (552) ++..|++..+++.. + ...|+++||+|++... ..+ -+.+++..+.. T Consensus 114 ~~~~g~~~~~~~~~----------------------------~--~~~~~~~dvih~~~~~---~~~--~~~~~~~~~~~ 158 (378) T protein:vir:94 114 DSETGELLDLLFAN----------------------------D--KKEYKPEELVRLTSPF---YIN--EDTSILDNALA 158 (378) T ss_pred eCCCCcEEEEEEec----------------------------C--cEEechhceeeecCcC---Ccc--cchhHHHHHHH Confidence 55667665554421 1 1346789999987422 111 24456665554 Q ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHH----HHHHHHHHHHhccccccccceeeccCCceeeeccCch Q lcl|NC_020081. 262 HLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQAL----TSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSS 337 (552) Q Consensus 262 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~----~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~ 337 (552) .+. ..+++ +.++|+|+++.. +++++. +++++.|++.+.| .++|++++ +++|++|+++++++ T Consensus 159 ~~~----------~~~~~-~~~~g~l~~~~~--l~~~~~~~~~e~~~~~~~~~~~~-~n~~~~~v-l~~g~~~~~l~~~~ 223 (378) T protein:vir:94 159 SIQ----------TKLEQ-GKLRGLLKINAF--LDIDNTQEYREKALATIKNMQEG-SSYNGLTP-VDNKTEIVELKKDY 223 (378) T ss_pred HHH----------HHHhh-CCcccceeeCCc--CCHHHHHHHHHHHHHHHHHhhcc-ccccccee-ccCCceEEEccCCh Confidence 433 23333 478899998764 455544 4455555554444 56777654 46799999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc Q lcl|NC_020081. 338 KDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG 417 (552) Q Consensus 338 ~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~ 417 (552) .++++ +.+++++++||++|||||++|+ .+..+++.+.|+++||.||++.||++||++|+++.+ T Consensus 224 ~~~~~-~~~~~~~~~Ia~~fgvPp~~l~----------------g~~~e~~~~~f~~~tl~P~~~~ie~~l~~~Ll~~~e 286 (378) T protein:vir:94 224 SVLNK-DEIDLIKSELLTGYFMNENILL----------------GTATQEQQIYFYNSTIIPLLIQLEKELTYKLISTNR 286 (378) T ss_pred HHhhH-HHHHHHHHHHHHHhCCCHHHhc----------------CCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhH Confidence 99996 7789999999999999999994 123478889999999999999999999999997533 Q ss_pred c----------ceeec---ccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccc Q lcl|NC_020081. 418 G----------DYVFN---FVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQ 484 (552) Q Consensus 418 ~----------~~~~~---f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~ 484 (552) . ++.|+ +++.|.+++++.+..+ ..+|+||+||+|+++|+||+||||++++++|+++++.....+. T Consensus 287 ~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~--~~~G~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~ 364 (378) T protein:vir:94 287 RRVVKGNLYYERIIVDNQLFKFATLKELIDLYHEN--INGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQG 364 (378) T ss_pred hhhhhhhcccceeEeecchhhhcCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCeeeecccccchhcchhccc Confidence 1 12333 3577888888877754 4479999999999999999999999999999999886655432 Q ss_pred cccccCCCCCccCcccCCCCCCCCCCCCCCCccc Q lcl|NC_020081. 485 VEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQ 518 (552) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (552) ..... .+++.+++ + T Consensus 365 ~~~~~---------------~~~~e~~n-----~ 378 (378) T protein:vir:94 365 NRKDV---------------TSTDETNN-----Q 378 (378) T ss_pred ccCCC---------------CCCCCCCC-----C Confidence 21100 01111111 1 No 91 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=100.00 E-value=1.9e-56 Score=326.05 Aligned_cols=327 Identities=13% Similarity=0.160 Sum_probs=227.4 Q ss_pred cccccchhhhhccccccccccccccccccccccccCCcccccccCCCC-----------------chHHHHHHHhhcchH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGK-----------------QNLLQMLKLWSRKNI 89 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~Lr~~a~~~~ 89 (552) |++.....+-......+...+.++.......++ + -+.|...++ +..+..|.+..+.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~---~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~ 75 (351) T protein:vir:79 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTF--D---DPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRAST 75 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEc--C---CceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhH Confidence 211111111000000000000000000000000 0 000111110 001222222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHH Q lcl|NC_020081. 90 ILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRD 169 (552) Q Consensus 90 i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d 169 (552) -+..|+..+ .+.|... -.||+.||.++|+ .++.| T Consensus 76 ~h~~~l~~k------------------------------------------~n~l~~~---~~Pnp~~t~~~f~-~~v~d 109 (351) T protein:vir:79 76 HHSSALFFK------------------------------------------ANVLAST---FRPHRWLSRHAFE-RWALD 109 (351) T ss_pred hhhhhhhhh------------------------------------------hhHHhhc---ccCCCCCCHHHHH-HHHHH Confidence 222222110 1122221 2588999999996 56789 Q ss_pred HHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCC Q lcl|NC_020081. 170 RLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVG 249 (552) Q Consensus 170 ~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g 249 (552) ++++||+|++++|+..|++++|+||+|.+|++..+.++ |+++..++....|+++||||++. +++.++ T Consensus 110 ~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~~~----------~~~~~~~g~~~~~~~~eIihir~---~~~~~~ 176 (351) T protein:vir:79 110 FLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFSG----------FVYVNGWQERHEFEPDSVFQLVR---PDINQE 176 (351) T ss_pred HHhcCCeEEEEEECCCCCEEEEEEeCCcceeeeecCCe----------EEEEecCceEEEEcCccEEEeCC---CCCCCC Confidence 99999999999999999999999999999998766543 44555667778899999999974 356678 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec----c Q lcl|NC_020081. 250 KYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT----A 325 (552) Q Consensus 250 ~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~----~ 325 (552) +||+||+.+++.++.+..++..|..+||+||++|++||.+++ ..++++++++++++|++ ..|..|++++.|+. . T Consensus 177 ~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~il~~~~-~~ls~e~~~~lk~~~~~-~~G~~N~~~~~v~~~~g~~ 254 (351) T protein:vir:79 177 VYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTD-AAQKQDDVDNMRDALKN-AKGPGNFRNVFMYAPGGKK 254 (351) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHHHHHHHHH-hcCccccCceeEecCCCCc Confidence 999999999999999999999999999999999999999864 46799999999999987 67889999976653 3 Q ss_pred CCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_020081. 326 EDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIE 405 (552) Q Consensus 326 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie 405 (552) +|++|++++.++.|+||++++++++++||++|||||.+||+.+.++ .++++++++.+.|+++||.|+++.|| T Consensus 255 ~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~llGi~~~~t--------~~~~n~e~~~~~f~~~~l~Pl~~~ie 326 (351) T protein:vir:79 255 DGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNS--------GGFGTPDTAARVFGRNEIRPLQARFA 326 (351) T ss_pred cceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCC--------CCcccHHHHHHHHHHHHHHHHHHHHH Confidence 6899999999999999999999999999999999999999987654 35789999999999999999999998 Q ss_pred HHHHhhcCcccccceee---cccccChHH Q lcl|NC_020081. 406 DAVNKYIVSQFGGDYVF---NFVGGDAKT 431 (552) Q Consensus 406 ~~ln~~L~~~~~~~~~~---~f~~~d~~~ 431 (552) + +|.+|..+. ++| .++++|.++ T Consensus 327 ~-ln~~lg~~~---~~F~~~~llr~d~~a 351 (351) T protein:vir:79 327 E-LNDWLGDEV---VTFDDYEIPPAPVAA 351 (351) T ss_pred H-HHhhcCcce---eeeChhhhccccccC Confidence 5 787764331 222 245566555 No 92 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=100.00 E-value=8.3e-57 Score=328.04 Aligned_cols=364 Identities=14% Similarity=0.122 Sum_probs=235.0 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) |.....|.......... +...-.....++ ....+.....+ .+ -+.| +.....+++- T Consensus 1 m~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~---~~~~~~~~~~~--fg---~p~~-~~~~~~~~~~ 56 (368) T protein:vir:79 1 MSRNKTRRAARAASAHV---------------RTANTDAPTEHH---TDRAAQAEVFS--FG---DPVE-VLDRRELLDY 56 (368) T ss_pred CCccccccchhccCccc---------------ccccccCcchhh---ccccCceEEEE--cC---Ccee-ecchhhHHHH Confidence 33332222111110000 000000000000 00000000011 01 0111 1111112221 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) +.-+..+.+ +..-+... .+ +++++.+. . .. ......| +++. +...||+.||.+ T Consensus 57 ~~~~~~~~~-~~~pi~~~--~l---a~~~~~~~----~--------h~---~~~~~~~---n~l~---l~~~Pn~~~t~~ 109 (368) T protein:vir:79 57 VECMRMGQW-YEPPMPWD--GL---ARSFRAAA----H--------HS---SAVYVKR---NILV---STFIPHPLLSRA 109 (368) T ss_pred HHHHhccch-hccCcCHH--HH---HHHHhhcc----c--------cc---hhhhhhc---chhh---hhcCCCcCCCHH Confidence 211111100 00000000 00 11111110 0 00 0000112 2222 224689999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecc Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS 240 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~ 240 (552) +|++ ++.|++++||+|++++|+..|+|++|+||+|.+|++..+.+ .|++...++..+.|+++||||++. T Consensus 110 ~f~~-l~~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~~----------~~~~~~~~~~~~~~~~~dIihir~ 178 (368) T protein:vir:79 110 TFER-LVLDWQVFGNAYLERRENVLGGTIRLDTPLAKYVRRGLDLN----------TYFFVQNWQQPYTFAAGSVFHLQE 178 (368) T ss_pred HHHH-HHHHHhhcCCeEEEEEEcCCCCEEEEEEeCcccceeeccCC----------EEEEEecCCeEEEEccccEEEecC Confidence 9975 78899999999999999999999999999999998765432 233444566778899999999873 Q ss_pred cccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccc Q lcl|NC_020081. 241 NPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKI 320 (552) Q Consensus 241 ~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~ 320 (552) +++.+++||+||+.+++.++.+..+++.|..++|+||++|+|||.+++ ..++++++++++++|++ +.|..|+|++ T Consensus 179 ---~~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~-~~l~~e~~~~lk~~~~~-~~G~~N~g~~ 253 (368) T protein:vir:79 179 ---PDINQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMTD-AAQKQEDVDTLREAMKS-AKGPGNFRNL 253 (368) T ss_pred ---CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC-CCCCHHHHHHHHHHHHH-hcCCcccCce Confidence 356778999999999999999999999999999999999999999864 46799999999999987 6789999998 Q ss_pred eeec----cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHH Q lcl|NC_020081. 321 PVIT----AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKG 396 (552) Q Consensus 321 ~il~----~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~ 396 (552) +|+. .+|++|++++.++.|+||++++++++++||++|||||.+||+.+.+| .+++|++++.+.|+++| T Consensus 254 ~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t--------~~~sn~e~~~~~f~~~~ 325 (368) T protein:vir:79 254 FMYAPNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNT--------GGFGDVEKAAMVFARNE 325 (368) T ss_pred eEecCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCC--------CccccHHHHHHHHHHHH Confidence 7764 36899999999999999999999999999999999999999977554 35789999999999999 Q ss_pred hhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhc Q lcl|NC_020081. 397 LEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAK 444 (552) Q Consensus 397 l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~ 444 (552) |.|+++.|| ++|.+|..+.-......+++.|.+.+++..+. .+ T Consensus 326 l~Pl~~~ie-~ln~~l~~e~~rF~~~~l~~~D~~a~a~~~~r----sa 368 (368) T protein:vir:79 326 VKPLQDRLL-AINDWIGDEVVRFAPYALGGHDQPAAAPGGQR----SA 368 (368) T ss_pred HHHHHHHHH-HHHhccCcceeeechhHhhcccccccCCcccc----cC Confidence 999999998 68887765422112233677787776653221 11 No 93 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=100.00 E-value=7.5e-56 Score=322.81 Aligned_cols=329 Identities=14% Similarity=0.156 Sum_probs=226.8 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccc----------cccccc--------cccC Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYE----------EPIIGS--------MSMN 62 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~--------~~~~ 62 (552) |... ++ ..+-......+...+.++.. +|+... ...+ T Consensus 1 ~~~~-------~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~ 54 (351) T protein:vir:78 1 MSKR-------RS-------------------RAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSN 54 (351) T ss_pred CCCC-------CC-------------------CCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhcc Confidence 2111 11 11100000000000000000 111000 0001 Q ss_pred CcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHH Q lcl|NC_020081. 63 PDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIEN 142 (552) Q Consensus 63 ~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~ 142 (552) ..|+ .|. ..+..|.+..+.+.-+..|+..+ .+ T Consensus 55 ~~~~-~pp-----~~~~~la~~~~~~~~h~~~l~~k------------------------------------------~n 86 (351) T protein:vir:78 55 GEWF-EPP-----VSFAGLAKSFRASTHHSSALFFK------------------------------------------AN 86 (351) T ss_pred Ccee-cCC-----CCHHHHHHHHhhhHhhhhhhhhh------------------------------------------hh Confidence 1110 111 11222222222222222222111 11 Q ss_pred HHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEc Q lcl|NC_020081. 143 FIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVI 222 (552) Q Consensus 143 ~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~ 222 (552) .|... -.||+.||.++|+ .++.++|++||+|++++|+..|+|++|+||++.+|++..+.++ |++.. T Consensus 87 ~l~~~---~~Pn~~~t~~~f~-~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~~----------~~~~~ 152 (351) T protein:vir:78 87 VLAST---FRPHRWLSRHAFE-RWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFSG----------FVYVN 152 (351) T ss_pred HHhhc---ccCCCCCCHHHHH-HHHHHHHhcCCeEEEEEECCCCCEEEEEEecCcceEEeeeCCe----------EEEEe Confidence 22111 2588999999996 4567999999999999999999999999999999998776543 33444 Q ss_pred CCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHH Q lcl|NC_020081. 223 DDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTS 302 (552) Q Consensus 223 ~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~ 302 (552) .++....|+++||||++. +++.+++||+||+..++.++.+..++..|..+||+||++|+|||.+++ ..++++++++ T Consensus 153 ~~~~~~~~~~~eVihir~---~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pggIl~~~~-~~ls~e~~~~ 228 (351) T protein:vir:78 153 GWQERHEFAPDSVFQLVR---PDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTD-AAQKQDDVDN 228 (351) T ss_pred cCCeEEEEccccEEEEcC---CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHH Confidence 566778899999999874 356788999999999999999999999999999999999999999864 4679999999 Q ss_pred HHHHHHHHhccccccccceeec----cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccc Q lcl|NC_020081. 303 FRREWTSMFSGINGAWKIPVIT----AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGN 378 (552) Q Consensus 303 ~~~~~~~~~~G~~nagk~~il~----~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~ 378 (552) ++++|++ ..|..|+|+++|+. .+|++|++++.++.|+||+|++++++++||++|||||.+||+.+.+| T Consensus 229 lr~~~~~-~~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t------- 300 (351) T protein:vir:78 229 MRDALKN-AKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNS------- 300 (351) T ss_pred HHHHHHH-hcCcccccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCC------- Confidence 9999987 68999999987664 35899999999999999999999999999999999999999987654 Q ss_pred cccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHH Q lcl|NC_020081. 379 TLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKT 431 (552) Q Consensus 379 ~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~ 431 (552) .+++|++++.+.|+++||.|+++.||+ ++.+|..+.-......++++|.++ T Consensus 301 -~~~sn~e~~~~~f~~~~l~P~~~~iee-~n~~l~~~~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 301 -GGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLGDEVVRFDDYEIPPAPVAA 351 (351) T ss_pred -CCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcCccceecChhhhccccccC Confidence 357899999999999999999999995 776665432111111235555555 No 94 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=100.00 E-value=1.1e-55 Score=321.91 Aligned_cols=276 Identities=9% Similarity=0.072 Sum_probs=229.1 Q ss_pred ccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEE Q lcl|NC_020081. 113 DKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFK 192 (552) Q Consensus 113 ~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~ 192 (552) ++++++.++.++.. ..+++..+|. .+||+.||+++||+.++.+++++||||++++|+..|++++|| T Consensus 1 ia~l~~~~~~~~~~---------~~~~l~~lL~-----~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~ 66 (278) T protein:vir:78 1 MASLPLKMYEDYKV---------VNTEVSDLLT-----VSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLF 66 (278) T ss_pred CccceeEEEecCcc---------cccHHHHHHH-----hcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEE Confidence 66777777644322 1244444432 368899999999999999999999999999999999999999 Q ss_pred EecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 193 AVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVF 272 (552) Q Consensus 193 ~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~ 272 (552) ||+|++|++..+.+|.. .+|.+...++....|+++||||++.+ ++.++++|+||+.++..++..+.+++++ T Consensus 67 ~l~~~~v~v~~~~~~~~------~~y~~~~~~g~~~~~~~~evih~~~~---~~~~~~~G~s~~~~~~~~i~~~~~~~~~ 137 (278) T protein:vir:78 67 LLNPDVVEMLIENQSRE------LYYSIHAATGNKLIVHNMDMLHFKHI---VASNMVQGISPIDVLKNTTDFDNAVRTF 137 (278) T ss_pred EECCceeEEEEcCCCce------EEEEEEcCCceEEEEccccEEEECCC---CCCCCeeeccHHHHHHHHHHHHHHHHHH Confidence 99999999999887743 45666666677889999999999753 3456789999999999999999999999 Q ss_pred HHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHH Q lcl|NC_020081. 273 NARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINV 352 (552) Q Consensus 273 ~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 352 (552) +...+.+ .|++++..++ .++++++++++++|++.+. ++|+++++ ++|++|+++++++.|++|+|++++++++ T Consensus 138 ~~~~~~~--~~~~i~~~~~--~l~~e~~~~~~~~~~~~~~---~~g~~~vl-~~g~~~~~l~~~~~d~~~~e~~~~~~~~ 209 (278) T protein:vir:78 138 NLTEMQK--PDSFMLKYGS--NVGKEKRQQVLEDFKQYYE---ENGGILFQ-EPGVEIEPLPKKYVSEDIVASENLTRER 209 (278) T ss_pred HHHHhcC--CCcEEEEeCC--CCCHHHHHHHHHHHHHHhc---cCCCceec-CCCceEEEccCChhHHHHHHHHHHHHHH Confidence 7665555 4788888654 5689999999999998764 57887554 6799999999999999999999999999 Q ss_pred HHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc--cceeecccccCh Q lcl|NC_020081. 353 ICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG--GDYVFNFVGGDA 429 (552) Q Consensus 353 Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~--~~~~~~f~~~d~ 429 (552) ||++|||||++||..+. .+++|++++.+.|++.||+|+++.||++||++|+++.+ .+++|+|+-.-. T Consensus 210 Ia~~fgVpp~~lg~~~~----------~~~sn~~~~~~~~~~~~l~P~~~~i~~~ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 210 VANVFQLPSVFLNARSN----------TNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred HHHHhCCCHHHhCCCCC----------CCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhcCCceEEEecccC Confidence 99999999999997654 35899999999999999999999999999999998654 456777653322 No 95 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=100.00 E-value=5.1e-56 Score=323.69 Aligned_cols=334 Identities=15% Similarity=0.191 Sum_probs=223.3 Q ss_pred cccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcc----hHH-HHHHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRK----NII-LNAIIITRVNQ 101 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~----~~i-~~a~i~~~~~~ 101 (552) |++.....+. +..+++....+-...++.+ -+.|...+. .+++.+.-+..+ +++ ..+.. T Consensus 1 ~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~---~p~~v~~~~-~~~~~~~~~~~~~~~~pp~~~~~la------ 63 (344) T protein:vir:56 1 MSKKKGKTPQ-------PAAKTMTASAPKMEAFTFG---EPVPVLDRR-DILDYVECISNGRWYEPPVSFTGLA------ 63 (344) T ss_pred CCCCCCCCCc-------hhhHHhhcCCCceEEEEcC---CceeecCcc-hhhhHHHhhhcCccccCCCCHHHHH------ Confidence 2222211110 0000000000000000000 011111111 112222211111 111 01100 Q ss_pred HHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEE Q lcl|NC_020081. 102 VSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELV 181 (552) Q Consensus 102 ~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~ 181 (552) ++.+.+.. -+-.+..| .+.|... -.||++||..+| ++++.|++++||||++++ T Consensus 64 -----~~~~a~~~-h~s~i~~k-----------------~n~l~~~---~~Pnp~~t~~~f-~~~~~d~ll~Gnay~~~~ 116 (344) T protein:vir:56 64 -----KSLRAAVH-HSSPIYVK-----------------RNILAST---FIPHPWLSQQDF-SRFVLDFLVFGNAFLEKR 116 (344) T ss_pred -----HHHhhhhh-hCccceeh-----------------hhhHHhh---cCCCCCCCHHHH-HHHHHHHHhcCCeEEEEE Confidence 00000000 00001110 1122221 268899999999 678899999999999999 Q ss_pred ECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHH Q lcl|NC_020081. 182 YDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALN 261 (552) Q Consensus 182 r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~ 261 (552) |+..|+|++|+||++.+|++..+.+ .|++...++....|.++||||++. .++.+++||+||+..++. T Consensus 117 rn~~G~~~~L~pl~~~~v~~~~~~~----------~~~~~~~~g~~~~~~~~dIiHir~---~~~~~~~~Gls~~~~a~~ 183 (344) T protein:vir:56 117 YSTTGKVIRLETSPAKYTRRGVEED----------VYWWVPSFNEPTAFAPGSVFHLLE---PDINQELYGLPEYLSALN 183 (344) T ss_pred ECCCCcEEEEEEeCCceeEEeecCC----------EEEEEecCCeEEEEcCccEEEECC---CCCCCCcccccHHHHHHH Confidence 9999999999999999999866543 244556667778899999999874 345678999999999999 Q ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec-----cCCceeeeccCc Q lcl|NC_020081. 262 HLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT-----AEDVKFVNMTQS 336 (552) Q Consensus 262 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~-----~~g~~~~~l~~~ 336 (552) ++.++.+++.|..+||+||++|++||.+++ ..+++++++++|++|++.. | .|+|+.++|. .+|++|++++.+ T Consensus 184 si~l~~~a~~~~~~~f~NGa~pg~Il~~~d-~~ls~e~~~~lk~~~~~~~-g-~~~~r~l~l~~p~g~~~G~~~~pis~~ 260 (344) T protein:vir:56 184 SAWLNESATLFRRKYYENGAHAGYIMYVTD-AVQDRNDIEMLRENMVKSK-G-RNNFKNLFLYAPQGKADGIKIIPLSEV 260 (344) T ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHHHHHHHHHhc-C-CCCccceEEecCCCCccceeEEEcCCC Confidence 999999999999999999999999999864 4679999999999999865 4 3788987775 368999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccc Q lcl|NC_020081. 337 SKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQF 416 (552) Q Consensus 337 ~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~ 416 (552) +.|+||+|++++++++||++|||||++||+.+.++ .+++|++++.+.|+++||.||++.||+ +|.+|..+. T Consensus 261 ~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t--------~~~~n~eq~~~~f~~~tL~Pl~~~ie~-~n~~l~~~~ 331 (344) T protein:vir:56 261 ATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENV--------GSLGDIEKVAKVFVRNELIPLQDRIRE-INGWIGQEV 331 (344) T ss_pred hHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCC--------CccccHHHHHHHHHHHHHHHHHHHHHH-HHhhhcccc Confidence 99999999999999999999999999999977644 357899999999999999999999985 888876442 Q ss_pred ccceeecccccCh Q lcl|NC_020081. 417 GGDYVFNFVGGDA 429 (552) Q Consensus 417 ~~~~~~~f~~~d~ 429 (552) -..-...+...|. T Consensus 332 ~~F~~y~l~~~~~ 344 (344) T protein:vir:56 332 IRFKNYSLDTDNG 344 (344) T ss_pred ccCCCccccccCC Confidence 1111112222222 No 96 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=100.00 E-value=1.6e-55 Score=320.96 Aligned_cols=330 Identities=15% Similarity=0.180 Sum_probs=224.8 Q ss_pred cccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcc----hHH-HHHHH-HHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRK----NII-LNAII-ITRVN 100 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~----~~i-~~a~i-~~~~~ 100 (552) |++..... .......+...+.++....+.+ +.|....+. .++.+.-+..+ +++ ..+.. ..+++ T Consensus 1 m~~~~~~~-~~~~~~~~~~~~~~~~~~~f~~---------p~~v~~~~~-~~~~~~~~~~~~~~~pp~~~~~la~~~~a~ 69 (344) T protein:vir:60 1 MSKKKGKT-LQPAAKKMTASAPKMEAFTFGE---------PVPVLDRRD-ILDYVECISNGRWYEPPISFTGLAKSLRAA 69 (344) T ss_pred CCcccCCC-CCchHHhhcCCcCcEEEEEcCC---------ceeecCCcc-hhHHHHhhhcCccccCCCCHHHHHHHHHhh Confidence 22221111 0000000111111111111111 111111111 11111111111 111 11100 00011 Q ss_pred HHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEE Q lcl|NC_020081. 101 QVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFEL 180 (552) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i 180 (552) ..+ .. .|.. -.+.|..+ -.||+.||..+| +.++.|++++||||+++ T Consensus 70 ~~h-----------~~--~i~~-----------------k~n~l~~~---~~Pn~~~t~~~f-~~~~~d~ll~Gnay~~i 115 (344) T protein:vir:60 70 VHH-----------SS--PIYV-----------------KRNILAST---FIPHPWLSQQDF-SRFVLDFLVFGNAFLEK 115 (344) T ss_pred hhh-----------cc--chhh-----------------hhhHHHhh---ccCCCCCCHHHH-HHHHHHHHhcCCeEEEE Confidence 000 00 0000 01122222 258899999998 67889999999999999 Q ss_pred EECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHH Q lcl|NC_020081. 181 VYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIAL 260 (552) Q Consensus 181 ~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~ 260 (552) +|+..|+|++|+||++.+|++..+.+ .|+++..++....|+++||||++. .++.+++||+||+..++ T Consensus 116 ~rn~~G~~~~L~~l~~~~vr~~~~~~----------~~~~v~~~~~~~~~~~~eIiHir~---~~~~~~~yGlsp~~~a~ 182 (344) T protein:vir:60 116 RYSTTGKVIRLETSPAKYTRRGVEED----------VYWWVPSFNEPTAFAPGSVFHLLE---PDINQELYGLPEYLSAL 182 (344) T ss_pred EECCCCcEEEEEEcCcceEEEeecCC----------eEEEEccCCeEEEEcCccEEEEcC---CCCCCCcccccHHHHHH Confidence 99999999999999999999876543 245566677888999999999874 34567899999999999 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec-----cCCceeeeccC Q lcl|NC_020081. 261 NHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT-----AEDVKFVNMTQ 335 (552) Q Consensus 261 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~-----~~g~~~~~l~~ 335 (552) .++.+..+++.|..++|+||++|++||.+++ ..++++++++++++|++.+ |. ++++.++|. .+|++|++++. T Consensus 183 ~si~l~~~a~~~~~~~f~NG~~pg~il~~~~-~~ls~e~~~~ik~~~~~~~-g~-~~~r~~~l~~p~g~~~g~~~~pis~ 259 (344) T protein:vir:60 183 NSAWLNESATLFRRKYYENGAHAGYIMYVTD-AVQDRNDIEMLRENMVKSK-GR-NNFKNLFLYAPQGKADGIKIIPLSE 259 (344) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEecC-cCCCHHHHHHHHHHHHHhc-CC-CCCcceEEecCCCCccceeEEEcCC Confidence 9999999999999999999999999999864 4689999999999999875 43 678877775 36899999999 Q ss_pred chhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcc Q lcl|NC_020081. 336 SSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQ 415 (552) Q Consensus 336 ~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~ 415 (552) ++.|+||+|++++++++||++|||||++||+.+.+| .+++|++++.+.|+++||.||+++|| +||.+|..+ T Consensus 260 ~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t--------~~~~n~e~~~~~f~~~~L~Pl~~~~e-~ln~~lg~~ 330 (344) T protein:vir:60 260 VATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENV--------GSLGDIEKVAKVFVRNELIPLQDRIR-EINGWLGQE 330 (344) T ss_pred ChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCC--------CccccHHHHHHHHHHHHHHHHHHHHH-HHHHhcCCc Confidence 999999999999999999999999999999887654 35789999999999999999999998 588887532 Q ss_pred cccceeec---ccccCh Q lcl|NC_020081. 416 FGGDYVFN---FVGGDA 429 (552) Q Consensus 416 ~~~~~~~~---f~~~d~ 429 (552) .++|. ++..|. T Consensus 331 ---~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 331 ---VIRFKNYSLDTDNG 344 (344) T ss_pred ---ccccCccccCCCCC Confidence 23333 333333 No 97 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=100.00 E-value=2.5e-55 Score=319.93 Aligned_cols=328 Identities=14% Similarity=0.132 Sum_probs=225.2 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccc--------cccCCcccccccCC Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGS--------MSMNPDFKEAPSIH 72 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~ 72 (552) |.+...|++. .+.............. -+.|+... +..+.+|. .| T Consensus 1 m~~~~~~~~~------------~~~~~~~~~~~~~~~~------------~p~~~~~~~~~~~~~~~~~~~~~~-~p--- 52 (340) T protein:vir:98 1 MSKRKPRKAV------------AMTASAPQKMEAFTFG------------EPVPVLDKRDILDYVECISNGKWY-EP--- 52 (340) T ss_pred CCCCCCCccc------------cccccCccceeEEEcC------------CceeecCcchhhhhhhhhhcCcee-cC--- Confidence 2211111100 0000000000000000 00011000 00011110 00 Q ss_pred CCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCC Q lcl|NC_020081. 73 GKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDN 152 (552) Q Consensus 73 ~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~ 152 (552) +..+..|.++.+.+..+..|+..+. +.|... .. T Consensus 53 --p~~~~~la~l~~a~~~h~s~i~~k~------------------------------------------n~l~~~---~~ 85 (340) T protein:vir:98 53 --PVSFSGLAKSLRSAVHHSSPIYVKR------------------------------------------NVLAST---YI 85 (340) T ss_pred --CCCHHHHHHHHHhccccchhhhhhh------------------------------------------hHHhhc---cC Confidence 1112223333323333333321111 111111 25 Q ss_pred CCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcc Q lcl|NC_020081. 153 DFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKA 232 (552) Q Consensus 153 pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~ 232 (552) ||++||..+|+ +++.|++++||+|++++|+..|+|++|+|+++.+|++..+.+ .|+++..++....|.+ T Consensus 86 Pn~~lt~~~f~-~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~~~~----------~~~~~~~~~~~~~~~~ 154 (340) T protein:vir:98 86 PHPLLSRQDFS-RFALDYLVFGNAFLEQRHSVTGQLIKLLTSPAKYTRRGVDDS----------VFWFVENFTQPHEFAP 154 (340) T ss_pred CCCCCCHHHHH-HHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceEEEcccCc----------EEEEEecCCeEEEEcc Confidence 78999999986 566799999999999999999999999999999998754322 2344445667788999 Q ss_pred cceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_020081. 233 KEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFS 312 (552) Q Consensus 233 ~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~ 312 (552) +||||++. .++.+++||+||+..++.++.+..+++.|..+||+||++|+|||.+++ ..+++++++++|++|++ .. T Consensus 155 ~eViHir~---~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~il~~~~-~~ls~e~~~~lk~~~~~-~~ 229 (340) T protein:vir:98 155 DTVFHLLE---PDINQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMYVTD-PAQSATDVESLRDAMRN-SK 229 (340) T ss_pred ccEEEEcC---CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHHHHHHHHH-hc Confidence 99999974 356678999999999999999999999999999999999999999864 46799999999999987 58 Q ss_pred cccccccceeec----cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHH Q lcl|NC_020081. 313 GINGAWKIPVIT----AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEK 388 (552) Q Consensus 313 G~~nagk~~il~----~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~ 388 (552) |..|++++.|+. .+|++|++++.++.|+||++++++++++||++|||||++||+.+.+| .+++|++++ T Consensus 230 G~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t--------~~~sn~e~~ 301 (340) T protein:vir:98 230 GLGNFKNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENI--------GSLGDVEKV 301 (340) T ss_pred CccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCC--------CccccHHHH Confidence 889999876654 36899999999999999999999999999999999999999987543 357899999 Q ss_pred HHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccC Q lcl|NC_020081. 389 YRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGD 428 (552) Q Consensus 389 ~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d 428 (552) .+.|+++||.|+++.||+ +|.+|..+.-..-.+.+++.| T Consensus 302 ~~~f~~~~l~Pl~~~iee-~n~~L~~e~~rF~~~~l~~~d 340 (340) T protein:vir:98 302 AKVFVRNELSPLQDRFRE-VNDWLGMEVIRFKEYTLDNPE 340 (340) T ss_pred HHHHHHHHHHHHHHHHHH-HHhcccccccccCccccccCC Confidence 999999999999999995 888886542111122345555 No 98 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=100.00 E-value=1.9e-55 Score=320.63 Aligned_cols=334 Identities=16% Similarity=0.181 Sum_probs=225.7 Q ss_pred cccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcc----hHH-HHHHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRK----NII-LNAIIITRVNQ 101 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~----~~i-~~a~i~~~~~~ 101 (552) |++.....+.-. ...+....++.-...+. -+.|...+.. +++.+.-+..+ +++ ..+.. T Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~f~---------~p~~v~~~~~-~~~~~~~~~~~~~~~pp~~~~~la------ 63 (344) T protein:vir:20 1 MSKKKGKTPQPA-AKTMTASGPKMEAFTFG---------EPVPVLDRRD-ILDYVECISNGRWYEPPVSFTGLA------ 63 (344) T ss_pred CCcccCCCCcch-hhhhhccCCceEEEEcC---------CceEecCcch-hhhhhhhhhcCceecCCCCHHHHH------ Confidence 222222111100 00000000000000000 1112111111 12222211111 010 00100 Q ss_pred HHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEE Q lcl|NC_020081. 102 VSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELV 181 (552) Q Consensus 102 ~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~ 181 (552) ++.+.+.. .+-.+..+ .+.|... -.||+.||.++| +.++.|++++||||++++ T Consensus 64 -----~~~~a~~~-h~~~i~~k-----------------~n~l~~~---~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~i~ 116 (344) T protein:vir:20 64 -----KSLRAAVH-HSSPIYVK-----------------RNILAST---FIPHPWLSQQDF-SRFVLDFLVFGNAFLEKR 116 (344) T ss_pred -----HHHhhhhh-hCccceeh-----------------hhhHHHh---ccCCCCCCHHHH-HHHHHHHHhcCCeEEEEE Confidence 00000000 00001110 1122221 258899999998 678899999999999999 Q ss_pred ECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHH Q lcl|NC_020081. 182 YDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALN 261 (552) Q Consensus 182 r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~ 261 (552) |+..|+|++|+||++.+|++..+.+ .|+++..++....|+++||||++. .++.+++||+||+..++. T Consensus 117 rn~~G~~~~L~pl~~~~vr~~~~~~----------~~~~~~~~~~~~~~~~~eIiHir~---~~~~~~~yGls~~~~a~~ 183 (344) T protein:vir:20 117 YSTTGKVIRLETSPAKYTRRGVEED----------VYWWVPSFNEPTAFAPGSVFHLLE---PDINQELYGLPEYLSALN 183 (344) T ss_pred ECCCCcEEEEEEcCCceeEeeecCC----------EEEEEccCCeEEEEcCccEEEeCC---CCCCCCcccccHHHHHHH Confidence 9999999999999999999865543 245566677788999999999874 245678999999999999 Q ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec-----cCCceeeeccCc Q lcl|NC_020081. 262 HLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT-----AEDVKFVNMTQS 336 (552) Q Consensus 262 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~-----~~g~~~~~l~~~ 336 (552) ++.+..+++.|..++|+||++|++||.+++ ..++++++++++++|++.. |. ++|+.++|. .+|++|++++.+ T Consensus 184 si~l~~~a~~~~~~~f~NGa~p~~Il~~~d-~~l~~e~~~~ik~~~~~~~-g~-~n~r~l~l~~p~g~~~gi~~~pis~~ 260 (344) T protein:vir:20 184 SAWLNESATLFRRKYYENGAHAGYIMYVTD-AVQDRNDIEMLRENMVKSK-GR-NNFKNLFLYAPQGKADGIKIIPLSEV 260 (344) T ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEecC-cCCCHHHHHHHHHHHHHhc-CC-CCccceEEecCCCCccceeEEEcCCC Confidence 999999999999999999999999999864 4689999999999999865 43 678877775 368999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccc Q lcl|NC_020081. 337 SKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQF 416 (552) Q Consensus 337 ~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~ 416 (552) +.|+||+|++++++++||++|||||++||+.+.+| .+++|++++.+.|++++|.||++.|| +||.+|..+ T Consensus 261 ~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t--------~~~~n~e~~~~~f~~~~l~P~~~~~e-~in~~lg~~- 330 (344) T protein:vir:20 261 ATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENV--------GSLGDIEKVAKVFVRNELIPLQDRIR-EINGWLGQE- 330 (344) T ss_pred hhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCC--------CccccHHHHHHHHHHHHHHHHHHHHH-HHHHhcCCc- Confidence 99999999999999999999999999999877544 35789999999999999999999998 588777532 Q ss_pred ccceeecccccChHHH Q lcl|NC_020081. 417 GGDYVFNFVGGDAKTE 432 (552) Q Consensus 417 ~~~~~~~f~~~d~~~~ 432 (552) .++|++...+..++ T Consensus 331 --~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 331 --VIRFKNYSLDTDND 344 (344) T ss_pred --ccccCccccccCCC Confidence 34555444444333 No 99 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=100.00 E-value=7.8e-55 Score=317.22 Aligned_cols=342 Identities=16% Similarity=0.226 Sum_probs=227.9 Q ss_pred cccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFC 106 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~ 106 (552) |.++..+..-.. .+.++.+..+-|. .+ .+.|... ...+.+.+.-++.+...+..-+. -..+ + T Consensus 1 m~~~~~~~~~~~-----~~~~~~~~~~~~~----~~---~p~~~~~-~~~~~~~~~~~~~~~~~~~pp~~--~~~l---a 62 (346) T protein:vir:10 1 MKKQLRKNLTQN-----DRLQPQAQTEIFS----FG---DPIPVLD-RADILNYLECSAMYEKWYNPPMS--FDGL---A 62 (346) T ss_pred CCcccCCCCCcc-----cccccccCeEEEe----cC---CcceecC-chhHHHHHHHhhcCCceEecCCC--HHHH---H Confidence 222211110000 0000001111000 00 0111111 11122222222211100000000 0000 0 Q ss_pred HHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCC Q lcl|NC_020081. 107 TPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLG 186 (552) Q Consensus 107 ~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G 186 (552) ++++.+ + . ....-..+.+.+..++ ..||++||.++|++ ++.|++++||+|++++|+..| T Consensus 63 ~l~~~~-~---~----------h~~~i~~k~n~l~~l~------~~Pn~~~t~~~f~~-~~~d~ll~Gnay~~i~r~~~G 121 (346) T protein:vir:10 63 KSLRSS-T---H----------HESAIITKANILLSTC------EVDSRYLSRRDLSS-FVKDYLVFGNAYFEVVRNRLG 121 (346) T ss_pred HHHHhh-h---h----------cchhhhhhhhhHHHHH------hCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCC Confidence 100000 0 0 0000111223333333 35789999999986 567999999999999999999 Q ss_pred CEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHH Q lcl|NC_020081. 187 DLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYH 266 (552) Q Consensus 187 ~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~ 266 (552) +|++|+||+|.+|++..+.++ .+|+....++....|+++||||++.. ++.+++||+||+..++.++.+. T Consensus 122 ~~~~L~pl~~~~v~~~~~~~~--------~~~~~~~~~g~~~~~~~~dIih~r~~---~~~~~~~G~~~~~~a~~si~l~ 190 (346) T protein:vir:10 122 QVQRIESPLAKYVRKGLEAGQ--------FYYVPQRFDHQEHEFAKGSIYHLLEP---DINQDIYGLPQYLSALQSAWLN 190 (346) T ss_pred cEEEEEEecCCceEEEEcCCe--------EEEEEEccCCeEEEEecccEEEecCC---CCCCCeeeccHHHHHHHHHHHH Confidence 999999999999998776654 34566666777889999999998742 4567899999999999999999 Q ss_pred HHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecc----CCceeeeccCchhHHHH Q lcl|NC_020081. 267 DNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITA----EDVKFVNMTQSSKDMEF 342 (552) Q Consensus 267 ~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~----~g~~~~~l~~~~~d~q~ 342 (552) .+++.|..++|+||++|++||.+++ ..++++++++++++|++. .|..|+|++.|+.+ .|+++++++.++.|+|| T Consensus 191 ~~a~~~~~~~~~NG~~~~~il~~~d-~~l~~e~~~~i~~~~~~~-~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~qf 268 (346) T protein:vir:10 191 ESATLFRRKYFLNGAHAGFVFYMSD-ASQKQEDVENIRQQLKQS-KGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDEF 268 (346) T ss_pred HHHHHHHHHHHhccCCCceEEEeCC-CCCCHHHHHHHHHHHHHh-cCccccCceeEecCCCCccceeEEecCCChhHHHH Confidence 9999999999999999999999864 467999999999999886 57789999876654 47899999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceee Q lcl|NC_020081. 343 EKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVF 422 (552) Q Consensus 343 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~ 422 (552) ++.+++++++||++|||||.+||+.+.++ .++++++++.+.|++++|.|++++||+ +|.+|..+ ++ T Consensus 269 ~e~k~~~~~~I~~af~VPp~llG~~~~~~--------~~~s~~e~~~~~f~~~~l~P~~~~iee-~n~~L~~e-----~i 334 (346) T protein:vir:10 269 FNIKNVSRDDVLAAHRVPPQLMGIIPNNT--------GGFGNVADAAEVFFITEIEPLQERLKE-FNQWLGQE-----VI 334 (346) T ss_pred HHHHHHhHHHHHHHhCCCHHHhcccCCCC--------CCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcccc-----ee Confidence 99999999999999999999999987653 357899999999999999999999985 77776543 23 Q ss_pred cccccChHHHHH Q lcl|NC_020081. 423 NFVGGDAKTEAE 434 (552) Q Consensus 423 ~f~~~d~~~~~~ 434 (552) +|+.-+...-.+ T Consensus 335 ~F~~~~ll~~~~ 346 (346) T protein:vir:10 335 KFKPSKLLQRTQ 346 (346) T ss_pred eechhhhcccCC Confidence 442211111111 No 100 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=100.00 E-value=8.8e-55 Score=316.94 Aligned_cols=325 Identities=12% Similarity=0.116 Sum_probs=229.5 Q ss_pred cccccchhhhhcccccccccccccccccccccc---------c-------cCCcccccccCCCCchHHHHHHHhhcchHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSM---------S-------MNPDFKEAPSIHGKQNLLQMLKLWSRKNII 90 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~-------~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i 90 (552) |.++.... +.+....+.......++ + .+..|++-|. .+..|.+..+.+.. T Consensus 1 ~~~~~~~~--------~~~~~~~~~~~~~~f~~~~~~~~~~~~y~~~~~~~~~~~~epp~------~~~~la~l~~~~~~ 66 (345) T protein:vir:37 1 MKTNVKTD--------NKKGIVIAPINDRTFSLNEISASPALDYVGIGFDENYNCYLPPV------NRHALAKLPHQNAQ 66 (345) T ss_pred CCCCcccc--------chhhcccCcceeEEeecCCcccccchhhhhhhhcCCccccCCCC------CHHHHHHHhhcccc Confidence 11111000 00000001111111110 0 0111111111 11222222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHH Q lcl|NC_020081. 91 LNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDR 170 (552) Q Consensus 91 ~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ 170 (552) +..|+ ..| .+.|... -.||+.||.++|++ ++.++ T Consensus 67 h~~~i-------------------------~~k-----------------~n~l~~~---~~Pn~~lt~~~f~~-~~~d~ 100 (345) T protein:vir:37 67 HGGIL-------------------------HSR-----------------ANMVSSL---YEGGKALSRMDMRA-LCLNL 100 (345) T ss_pred cccce-------------------------eee-----------------chHHHhh---ccCCCCCCHHHHHH-HHHHH Confidence 22221 110 1122221 15889999999975 56799 Q ss_pred HhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCc Q lcl|NC_020081. 171 LTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGK 250 (552) Q Consensus 171 ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~ 250 (552) +++||+|++++|+..|+|++|+||++.+|++..+.+... .+++.....++....|+++||||++. .++.+++ T Consensus 101 ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~~~~~-----~~~~~~~~~~g~~~~~~~~dVihir~---~~~~~~~ 172 (345) T protein:vir:37 101 IQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVRKDGGYSY-----LMKKSLYDTAQEIYRYDAKDIIFIKL---YDPMQQV 172 (345) T ss_pred HhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEEeCCeeE-----EEEEeEecCCceEEEEccccEEEecC---CCCCCCc Confidence 999999999999999999999999999999877654321 12334445566778899999999874 2456789 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec----cC Q lcl|NC_020081. 251 YGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT----AE 326 (552) Q Consensus 251 ~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~----~~ 326 (552) ||+||+..++.++.+..++..|..++|+||++|++||.+++ ..+++++++++|++|++ ..|..|.+++.|+. .+ T Consensus 173 ~Gls~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d-~~l~~e~~~~lk~~~~~-~~g~~n~~~~~i~~p~g~~~ 250 (345) T protein:vir:37 173 YGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTD-PDLTEEMEEEIARKISE-SKGVGNFRSMFVNIANGHPD 250 (345) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEecC-CCCCHHHHHHHHHHHHH-hcCcccccceEEEcCCCccc Confidence 99999999999999999999999999999999999999864 46799999999999988 57888888876554 36 Q ss_pred CceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_020081. 327 DVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIED 406 (552) Q Consensus 327 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~ 406 (552) |++|++++.+++|+||++++++++++||++|||||.+||+.+.++ .+++++|++.+.|+++||.|+++.||+ T Consensus 251 G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~--------~~~~~~e~~~~~f~~~~l~P~~~~ie~ 322 (345) T protein:vir:37 251 GLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNT--------GGLGDPLKYREVYHYDEVMPLQEIIAE 322 (345) T ss_pred ceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCC--------CCcccHHHHHHHHHHHHHHHHHHHHHH Confidence 899999999999999999999999999999999999999876543 357899999999999999999999999 Q ss_pred HHHhhcCcccccceeecccccChHH Q lcl|NC_020081. 407 AVNKYIVSQFGGDYVFNFVGGDAKT 431 (552) Q Consensus 407 ~ln~~L~~~~~~~~~~~f~~~d~~~ 431 (552) ++|+.+ +...++.++|+..+... T Consensus 323 ~ln~~~--~~~~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 323 TINQDP--EIKNLLKIKFREQNFAK 345 (345) T ss_pred Hhhhhc--cCCCcceEEecchhhcC Confidence 999743 44555677776555433 No 101 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=100.00 E-value=4.9e-54 Score=312.85 Aligned_cols=333 Identities=13% Similarity=0.132 Sum_probs=228.6 Q ss_pred cccccchhhcccccCcccccccccchhhhhccccccccccccccccccc-ccccc-------CCcccccccCCCCchHHH Q lcl|NC_020081. 8 FKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPII-GSMSM-------NPDFKEAPSIHGKQNLLQ 79 (552) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-------~~~~~~~~~~~~~~~~~~ 79 (552) +++... .........+-.+..... ..+|+. ..+++ +..|++-| ..+. T Consensus 1 ~~~~~~-------~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~y~~~~~~~~~~~~epp------~~~~ 55 (345) T protein:vir:37 1 MKTNVK-------TDNKKGIVIAPINDRTFS------------LSEITASPALDYVGIGFDENYNCYLPP------VNRH 55 (345) T ss_pred CCcccc-------ccchhhhcCCCceEEEee------------cCCcccchhhcccceeeecCCccccCC------CCHH Confidence 000000 000000000000000000 001110 00000 11111111 1133 Q ss_pred HHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCH Q lcl|NC_020081. 80 MLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNF 159 (552) Q Consensus 80 ~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~ 159 (552) .|.++.+.+..+..|+..+. +.|... -.||+.||. T Consensus 56 ~la~~~~~~~~h~~~i~~k~------------------------------------------n~l~~~---~~Pn~~~t~ 90 (345) T protein:vir:37 56 ALAKLPHQNAQHGGILHSRA------------------------------------------NMVSAT---YEGGKALSK 90 (345) T ss_pred HHHHHhhcchhhcchhhhhh------------------------------------------hHHhhc---cCCCCCCCH Confidence 34333333433444432211 111111 158899999 Q ss_pred HHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeec Q lcl|NC_020081. 160 RSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEV 239 (552) Q Consensus 160 ~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~ 239 (552) .+|++ ++.|++++||+|++++|+..|++++|+|++|.+|++..+.+.... ..++.....+....|+++||||++ T Consensus 91 ~~f~~-~v~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr~~~d~~~~~~-----~~~~~~~~~g~~~~~~~~eViHir 164 (345) T protein:vir:37 91 MEMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVHKDGGYSYL-----MKKSLYDTAQEIYRYDAKDIIFIK 164 (345) T ss_pred HHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEecCceeEEeecCCeeEE-----EeeeeeccCceEEEEccccEEEEc Confidence 99975 567999999999999999999999999999999998665432111 122333445677889999999997 Q ss_pred ccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccccccc Q lcl|NC_020081. 240 SNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWK 319 (552) Q Consensus 240 ~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk 319 (552) . .++.+++||+||+..++.++.+..+++.|..++|+||++|++||.++. ..+++++.++++++|++.+ |..|.+. T Consensus 165 ~---~~~~~~~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t~-~~l~~e~~~~lk~~~~~~~-g~~n~~~ 239 (345) T protein:vir:37 165 L---YDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTD-PDLTEEMEEEIARKISESK-GVGNFRS 239 (345) T ss_pred C---CCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCCHHHHHHHHHHHHHhc-CccccCc Confidence 4 345678999999999999999999999999999999999999999864 4679999999999999986 4455555 Q ss_pred ceeec----cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHH Q lcl|NC_020081. 320 IPVIT----AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDK 395 (552) Q Consensus 320 ~~il~----~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~ 395 (552) +.|+. .+|++|++++.++.|+||++++++++++||++|||||.++|+.+.++ .++++++++.+.|++. T Consensus 240 ~~i~~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~liGi~~~~t--------~~~s~~e~~~~~f~~~ 311 (345) T protein:vir:37 240 MFVNIAGGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNT--------GGLGDPLKYREVYHYD 311 (345) T ss_pred eeEecCCCCccceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhccccCCC--------CCcccHHHHHHHHHHH Confidence 44443 35799999999999999999999999999999999999999987644 3478999999999999 Q ss_pred HhhHHHHHHHHHHHhhcCcccccceeecccccChHH Q lcl|NC_020081. 396 GLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKT 431 (552) Q Consensus 396 ~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~ 431 (552) ||.||+++||++||+. ++...+++++|+..+..- T Consensus 312 ~l~P~~~~ie~~ln~~--~e~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 312 EVMPLQEIIAETINQD--PEIKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHHHHhhhh--hccCCcceEEECchhhcC Confidence 9999999999999974 344556777876554432 No 102 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=100.00 E-value=5e-54 Score=312.80 Aligned_cols=336 Identities=15% Similarity=0.177 Sum_probs=224.2 Q ss_pred cccccchhhh-----hccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcch----HHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAI-----LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKN----IILNAIIIT 97 (552) Q Consensus 27 ~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~----~i~~a~i~~ 97 (552) |++....... ......+...++.+....|. .+ .+.|...+. .+++-+.-|..+. ++-... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~---~p~~v~~~~-~~~~y~~~~~~~~~~~pp~~~~~--- 69 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFT----FG---DPMPVLDGR-GILDYLECWPNGRWYEPPLSMEG--- 69 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccccceEEEE----eC---CceeecCcc-hhhHHHHHhhcCccccCCCCHHH--- Confidence 2221110000 00000000000000000010 00 111111111 1222222221111 110000 Q ss_pred HHHHHHHHHHHHHhh-ccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCe Q lcl|NC_020081. 98 RVNQVSMFCTPARNS-DKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKI 176 (552) Q Consensus 98 ~~~~~~~~~~~~~~~-~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna 176 (552) ++ +..+.+ ..+. .+. ...+.|... ..||++||.++|++ ++.|++++||| T Consensus 70 ----la---~~~~~~~~h~~--~l~-----------------~k~n~l~~~---~~Pn~~~t~~~f~~-~v~d~ll~Gna 119 (350) T protein:vir:11 70 ----LA---KSVGSSVYLQS--GLK-----------------FKRNMLAKT---FIPHRLLSRATFEQ-FSLDWLTFGSA 119 (350) T ss_pred ----HH---HHHhhhhhhcc--chh-----------------hhhhhhhhc---ccCCCCCCHHHHHH-HHHHHHhcCCe Confidence 00 000000 0000 000 001222221 26899999999975 67799999999 Q ss_pred eEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHH Q lcl|NC_020081. 177 NFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPEL 256 (552) Q Consensus 177 ~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl 256 (552) |++++|+..|+|++|+||+|.+|++..+.+ .|+++..++....|+++||||++. .++.+++||+||+ T Consensus 120 y~~~~rn~~G~~~~L~~l~~~~vr~~~~~~----------~~~~~~~~~~~~~~~~~eVihir~---~~~~~~~yGls~~ 186 (350) T protein:vir:11 120 YLEQPRSRLGTRMPLQAPLAKYMRRGTDLE----------TFYQVRSWKDEHEFEKGSVIQLRE---ADINQEIYGVPEW 186 (350) T ss_pred EEEEEEcCCCCEEEEEEeCCceeEeeecCC----------eEEEEeeCCeEEEECcccEEEeCC---CCCCCCcccccHH Confidence 999999999999999999999999866433 234445567778999999999874 3456789999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecc----CCceeee Q lcl|NC_020081. 257 EIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITA----EDVKFVN 332 (552) Q Consensus 257 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~----~g~~~~~ 332 (552) .+++.++.+..++..|..++|+||++|+|||.+++ ..++++++++++++|++. .|..|+|+++|+.+ +|++|++ T Consensus 187 ~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~-~~ls~e~~~~l~~~~~~~-~G~~N~~~~~v~~~~g~~~g~~~~p 264 (350) T protein:vir:11 187 FCALQSALLNESATLFRRKYYNNGSHAGFILYMTD-AAQNEEDIDALRTALKTA-KGPGNFRNLFVYAPNGKKEGIQLIP 264 (350) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHHHHHHHHHh-cCccccCceeeecCCCCccceEEEE Confidence 99999999999999999999999999999999974 457999999999999884 78889999866643 5799999 Q ss_pred ccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc Q lcl|NC_020081. 333 MTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI 412 (552) Q Consensus 333 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L 412 (552) ++.++.|+||+|.+++++++||++|||||.+||+.+.++ .++++++++.+.|+++||.|+++.||+ +|.+| T Consensus 265 l~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t--------~~~sn~e~~~~~f~~~~L~P~~~~ie~-ln~~l 335 (350) T protein:vir:11 265 VSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNA--------GGFGSISDAAAVWASLELAPMQTRLQQ-VNEMI 335 (350) T ss_pred cCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCC--------CCcCCHHHHHHHHHHHHHHHHHHHHHH-HHhhc Confidence 999999999999999999999999999999999987654 357899999999999999999999985 88887 Q ss_pred CcccccceeecccccCh Q lcl|NC_020081. 413 VSQFGGDYVFNFVGGDA 429 (552) Q Consensus 413 ~~~~~~~~~~~f~~~d~ 429 (552) ..+... +.+|...+. T Consensus 336 ~~~~~~--F~~~~~~~l 350 (350) T protein:vir:11 336 GEEVVR--FAQFDAPGL 350 (350) T ss_pred Cccccc--cCcccccCC Confidence 654221 112332222 No 103 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=100.00 E-value=6.9e-54 Score=312.03 Aligned_cols=325 Identities=12% Similarity=0.144 Sum_probs=220.5 Q ss_pred cccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFC 106 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~ 106 (552) |.++..+. ....++...+ .++.+ -+.|...++ .+++ +..+..+....+ T Consensus 1 m~~~~~~~---------~~~~~~~~~~----~~~~~---~p~~~~~~~-~~~~--------------~~~~~~~~~~~~- 48 (337) T protein:vir:78 1 MTKRQQQP---------AQAAASSPRP----SVVFS---MPEAIDPTA-WMTD--------------YTGVFYNPYGEY- 48 (337) T ss_pred CCCcccCc---------ccccccCcee----EEEec---CcccccCcc-hhHh--------------hhhhhhccCcce- Confidence 21111110 0011111111 11111 111111111 0111 111100000000 Q ss_pred HHHHhhccccceeeeeccccccCChhHHHHH----HHHHHHHHhcCCCCCCCccCCH----HHHHHHHHHHHHhcCCeeE Q lcl|NC_020081. 107 TPARNSDKGVGYEIRLKDPLQEPNDHNKKKI----KEIENFIEKTGRIDNDFTRDNF----RSFVKKLVRDRLTYDKINF 178 (552) Q Consensus 107 ~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~----~~l~~~l~~~n~~~~pn~~~t~----~~f~~~~v~d~ll~Gna~~ 178 (552) + .. ++ +....++. ......| ...||..++. .+++++++.|++++||||+ T Consensus 49 --~---~p--P~-----------~~~~La~l~~~~~~h~~~L-----~~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~ 105 (337) T protein:vir:78 49 --Y---QP--PI-----------DRKGLAKVARANAHHGAIL-----MARRNMVAGRFTNQRATITAFVHNYLQFGDGGL 105 (337) T ss_pred --e---cC--CC-----------CHHHHHHHhhcchhhhhHH-----HhhhccccccCcCcHHHHHHHHHHHHhhCCeEE Confidence 0 00 00 00000000 0001112 1245555543 4789999999999999999 Q ss_pred EEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHH Q lcl|NC_020081. 179 ELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEI 258 (552) Q Consensus 179 ~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~ 258 (552) +++||..|+|++|+||+|.+|++..+ |. .+|+ ..++....|+++||||++. .++.+++||+||+.+ T Consensus 106 ~~~rn~~G~~~~L~pl~~~~v~~~~d--~~-------~~~~--~~~~~~~~~~~~eIiHik~---~~~~~~~~Gls~~~~ 171 (337) T protein:vir:78 106 LKLRNSFGQVVGLHPLSSVYLRRRED--GC-------FVYL--QQGKPNLIYRPDDVIWLAQ---YDPEQQVYGMPDYLG 171 (337) T ss_pred EEEECCCCcEEEEEEeCCceeEeeeC--Ce-------EEEE--EcCCceEEECCccEEEECC---CCCCCCcccccHHHH Confidence 99999999999999999999987643 32 2232 2345667899999999874 245678999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec----cCCceeeecc Q lcl|NC_020081. 259 ALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT----AEDVKFVNMT 334 (552) Q Consensus 259 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~----~~g~~~~~l~ 334 (552) ++.++.++.+++.|..++|+||++|++||.+++ ..+++++++++|+.|++ +.|..|.+++.|+. .+|++|++++ T Consensus 172 a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~-~~l~~e~~~~lk~~~~~-~~G~~n~~~~~v~~~~g~~~Gi~~~pis 249 (337) T protein:vir:78 172 GLQSALLNQDATLFRRRYFLNGAHMGFIFYATD-PNMDDDTEEEMKEMIAN-SKGVGNFRSMFVNIPDGKPDGIKLIPVG 249 (337) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC-CCCCHHHHHHHHHHHHH-hcCcccccceEEEcCCCCccceeEEEcC Confidence 999999999999999999999999999999864 45799999999999986 67888888876554 3579999999 Q ss_pred CchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCc Q lcl|NC_020081. 335 QSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVS 414 (552) Q Consensus 335 ~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~ 414 (552) .++.|+||++++++++++||++|||||++||+...++ ..+++|+|++.+.|+++||.|++++||+++|.+|++ T Consensus 250 ~~~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~~~~-------~~~~~n~e~~~~~f~~~~L~P~~~~ie~~~n~~ll~ 322 (337) T protein:vir:78 250 DIATKDEFAAIKGITAQDVLTAHRYPPALAGIIPTNG-------GGGLGDPEKYDATYARNEVLPLCELVQDAINSAGLP 322 (337) T ss_pred CChhHHHHHHHHHHhHHHHHHHhCCCHHHcccccCCC-------cCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhcCC Confidence 9999999999999999999999999999999876542 235789999999999999999999999999998886 Q ss_pred ccccceeecccccChH Q lcl|NC_020081. 415 QFGGDYVFNFVGGDAK 430 (552) Q Consensus 415 ~~~~~~~~~f~~~d~~ 430 (552) .. ....|+|...... T Consensus 323 ~~-~~~~f~~~~~~~~ 337 (337) T protein:vir:78 323 RA-LWVTFRETIGAAV 337 (337) T ss_pred hh-hceeccccccccC Confidence 43 2344555333222 No 104 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=100.00 E-value=2.7e-45 Score=264.95 Aligned_cols=210 Identities=12% Similarity=0.208 Sum_probs=168.2 Q ss_pred eEEEECCCcccccccceeEEEEE----cCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 199 VYVAVDEDGKERKAKDGVRYVQV----IDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNA 274 (552) Q Consensus 199 v~v~~~~~g~~~~~~~~~~y~~~----~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~ 274 (552) |++ ..+|.. +|+.. ..++....|.++||+|++.. .+.+++||+|||.+++.+|..+.++++|+. T Consensus 1 ~r~--~~dg~~-------~y~~~~~~~~~~g~~~~~~~~eilH~r~~---~~~~~~~Glspi~~a~~~i~~~~aa~~~~~ 68 (219) T protein:vir:98 1 MRV--CKDGNY-------KYLMKKSLYDTKSEIYEYNKNDVIFIKLY---DPMQQVYGSPDYVGGITSALLNSDATIFRR 68 (219) T ss_pred Cce--eecCeE-------EEEEecceecCCceeEEeccccEEEecCC---CCCCCcceecHHHHHHHHHHHHHHHHHHHH Confidence 332 233422 23222 22356778999999998742 356789999999999999999999999999 Q ss_pred HHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec-----cCCceeeeccCchhHHHHHHHHHHH Q lcl|NC_020081. 275 RFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT-----AEDVKFVNMTQSSKDMEFEKWLNYL 349 (552) Q Consensus 275 ~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~-----~~g~~~~~l~~~~~d~q~~e~~~~~ 349 (552) +||+||++|+|||.+++ ..++++++++++++|++. .|..|++++ +|+ .+|++|++++++++|+||+|+++++ T Consensus 69 ~~f~Ng~~p~gil~~~~-~~l~~e~~~~~~~~~~~~-~g~~n~~~~-~l~~~gg~~~G~~~~~~~~~~~d~qfle~rk~~ 145 (219) T protein:vir:98 69 RYYSNGAHMGFILYSTD-PDMTEEMEDEIAERIRDS-KGVGNFRSM-FVNIAGGHPDGLKVIPIGDTGQKDEFANIKNIS 145 (219) T ss_pred HHHhcCCCCceEEEeCC-CCCCHHHHHHHHHHHHHh-cCcccccce-eEecCCCCccceeEEEccCCHHHHHHHHHHHhh Confidence 99999999999999875 467999999999999885 677777664 453 4589999999999999999999999 Q ss_pred HHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccCh Q lcl|NC_020081. 350 INVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDA 429 (552) Q Consensus 350 ~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~ 429 (552) +++||++|||||++||+.+.++ .+++|++++.+.|+++||.||+++||++||++++...+ .+++|...+. T Consensus 146 ~~eIa~~fgVPp~~lG~~~~~~--------~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~~~~~~--~~~~F~~~~~ 215 (219) T protein:vir:98 146 AQDVLTSHRFPPGLSGIIPVNT--------AGLGDPLKIREAYQADEVLPLQEIIAESINSDYEIKSA--LKVNFKQPEK 215 (219) T ss_pred HHHHHHHhCCCHHHcccccCCC--------CCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCc--cEEeecCccc Confidence 9999999999999999876543 45789999999999999999999999999987654333 4556654433 Q ss_pred HHHH Q lcl|NC_020081. 430 KTEA 433 (552) Q Consensus 430 ~~~~ 433 (552) .+.- T Consensus 216 ~d~~ 219 (219) T protein:vir:98 216 RDKN 219 (219) T ss_pred ccCC Confidence 3322 No 105 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=100.00 E-value=3.6e-40 Score=236.85 Aligned_cols=249 Identities=15% Similarity=0.136 Sum_probs=171.4 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||++..+. ++.- . .....+.......+.+ ........+ T Consensus 1 MglF~~~~-~r~~---------------------------------~--~~~~~~~~~~~~~~~~---~~~~~~~v~--- 38 (251) T protein:vir:46 1 MGIFYKNE-KRDL---------------------------------Q--YNEDDLQMMVQTLPSF---QGTKLRQYK--- 38 (251) T ss_pred CCcccccc-cccc---------------------------------C--CCccchhhhhhhhccc---cCcCcceec--- Confidence 66653111 0000 0 0000000000000000 000111110 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) .+.|...+.+++||.+++..+ +.+++.+..+.. ..+.|++..+|. .+||+.||++ T Consensus 39 -~~~al~~~~v~~~i~~ia~~i-----------A~lp~~~~~~~~--------~~~~~~~~~ll~-----~~Pn~~~t~~ 93 (251) T protein:vir:46 39 -DIEAIRHSDIFTAVMMIASDL-----------ARMPIRVTVNGQ--------INYSDRIVNLLN-----TRPNPMYNGY 93 (251) T ss_pred -hhhhhccHHHHHHHHHHHHhH-----------hhCceEEeeCcc--------ccccchHHHHHh-----ccCCCCCCHH Confidence 112334556788887766665 455665543211 112355555443 4789999999 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEE--cCCceEEEEcccceeee Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQV--IDDKVVAKFKAKEMAWE 238 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~--~~~~~~~~~~~~evi~~ 238 (552) +||+.++.+++++||||++|+|+..|+|++|+||+|++|++..+++|...+ +|... ..++....|+++||||+ T Consensus 94 ~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~-----~~~~~~~~~~g~~~~~~~~diiH~ 168 (251) T protein:vir:46 94 IFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYY-----FHQRIDSNGNNIERNVKFEDMLDI 168 (251) T ss_pred HHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCceEEEEECCCCcEEE-----EEEEeccCCcceeEEECCccEEEe Confidence 999999999999999999999999999999999999999999988876432 12111 23355678999999999 Q ss_pred cccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccc Q lcl|NC_020081. 239 VSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAW 318 (552) Q Consensus 239 ~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nag 318 (552) +.. +.+|++|+||+.+++.+|..+.++++|+.++|+||++|+|+|++++. ..++++++++|+.|++.++|.+|+| T Consensus 169 r~~----~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~~e~~~~~~~~~~~~~~g~~n~g 243 (251) T protein:vir:46 169 KFY----SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGV-LDNKKARDRAREEFPKVLVELNKLG 243 (251) T ss_pred cCc----CCCCeeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCC-CCCHHHHHHHHHHHHHHhcCccccc Confidence 752 35689999999999999999999999999999999999999999864 3467788999999999999999999 Q ss_pred cceeeccC Q lcl|NC_020081. 319 KIPVITAE 326 (552) Q Consensus 319 k~~il~~~ 326 (552) ++++...+ T Consensus 244 ~~~~gm~~ 251 (251) T protein:vir:46 244 KLSYSMNQ 251 (251) T ss_pred ccccccCC Confidence 98765443 No 106 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.91 E-value=5.6e-24 Score=148.07 Aligned_cols=411 Identities=13% Similarity=0.159 Sum_probs=208.6 Q ss_pred ccccCcccccccccchhhhhccccccccc-cccccccccccccccCCcccccccCCCCchHHHHHH-HhhcchHHHHHHH Q lcl|NC_020081. 18 IDINDDMAVRIKQIEEDAILKKGKNTKSN-KPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLK-LWSRKNIILNAII 95 (552) Q Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr-~~a~~~~i~~a~i 95 (552) ...-+.... .++. .+. +.+. ...++| +. ...+..|. .++ .+++.+.+| T Consensus 1 ~~~~D~~~~--------~~~~-----~g~~~~~~---------~~~~~~---~~----~~~~~~l~a~Y~-~~~l~~~~v 50 (437) T protein:vir:52 1 MKFFDGIKS--------LALK-----LGSKQEQT---------YYSPSL---SL----TDDLVQLEALWR-DNWIANKVC 50 (437) T ss_pred CchhhhhHh--------HHhc-----CCCccccc---------eeecCc---cc----cccHHHHHHHHH-hCchhhHHh Confidence 000000000 0000 000 0000 001111 00 11223443 444 455666666 Q ss_pred HHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCC Q lcl|NC_020081. 96 ITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDK 175 (552) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gn 175 (552) .+.++.+.+ .++.+.-.+ .+.+.++.++..+.++++ .+-+...+.+.-++|. T Consensus 51 d~~a~d~~r-----------~~~~i~~~d-------~~~~~~~~~~~~~~~l~~----------~~~l~~a~~~~rl~G~ 102 (437) T protein:vir:52 51 IKRPEDMVR-----------NWREIYSND-------LNSKQLDLFTKFERSLKL----------RETLTKALQWSSLYGS 102 (437) T ss_pred hcchHHhhc-----------CCceEecCC-------CCHHHHHHHHHHHHhhcH----------HHHHHHHHHhcccccc Confidence 655443321 233442211 112344556666666542 2344455556668899 Q ss_pred eeEEEEECC---------CCCEEEEEEecCceeEEEECCC--cccccccceeEEEEEcCCceEEEEcccceeeecccccC Q lcl|NC_020081. 176 INFELVYDK---------LGDLHNFKAVDASTVYVAVDED--GKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRT 244 (552) Q Consensus 176 a~~~i~r~~---------~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~ 244 (552) ||++++++. .|.+..|.++++..|++....+ -.-.....+..| ++..+.....+.++.|||+...+.+ T Consensus 103 a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~~fg~p~~y-~v~~~~~~~~iH~SRii~~~~~~~~ 181 (437) T protein:vir:52 103 VGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSPNFGRYSEY-SILGGSQSITVHHSRLIILNANDAP 181 (437) T ss_pred eEEEEEecCCCcccccccCCceeEEEEechhhccccccccccccccccCcceEE-EEecCCcceeEccceeEEecCccCC Confidence 999999875 3789999999999987533221 111111122333 3444555667899999998755444 Q ss_pred CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC-CCCHHHHHHHHHHHHHHhccccccccceee Q lcl|NC_020081. 245 DLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQ-EQSNQALTSFRREWTSMFSGINGAWKIPVI 323 (552) Q Consensus 245 ~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~~s~~~~~~~~~~~~~~~~G~~nagk~~il 323 (552) .+.+.++|+|+++.+...|.....+......++.+...+ ++++++-. .++....+.+++.++....+ .+.+++.++ T Consensus 182 ~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 258 (437) T protein:vir:52 182 LSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKID--IFKIAGLSDKIAAGMENEVASVISAVQEI-KSATNSLLL 258 (437) T ss_pred CccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCC--ceecchHHHHhcCCcHHHHHHHHHHHHHh-cCCCceEEE Confidence 556678999999999999999999999988877765544 45554311 11211122333333332222 334555455 Q ss_pred ccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHH Q lcl|NC_020081. 324 TAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKF 403 (552) Q Consensus 324 ~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ 403 (552) +.+-+|+.++.+..+.. +...+..++||++++||..+|.-...+++++..+... +.-+......+..|+|++++ T Consensus 259 -d~~~~~e~~~~~~sgl~--~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~---~yyd~i~~~Qe~~l~p~le~ 332 (437) T protein:vir:52 259 -DAENEYDRKELTFTGLK--DLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQ---NYHEAIRRLQETRLRPIFEI 332 (437) T ss_pred -cCCcceEEEecCcCCHH--HHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHH---HHHHHHHHHHHHHHHHHHHH Confidence 45678888777665443 6777889999999999998884333444421111111 12222222233568888888 Q ss_pred HHHHHHhhcCcccccceeecccc---cChHHHHHHH----HHH-HHHhcCCcCHHHHHHHhC----CCCCCCCCeeeccc Q lcl|NC_020081. 404 IEDAVNKYIVSQFGGDYVFNFVG---GDAKTEAEII----SIL-ESKAKIGLTINDIRKELG----YPDTEGGDVTLAGV 471 (552) Q Consensus 404 ie~~ln~~L~~~~~~~~~~~f~~---~d~~~~~~~~----~~~-~~~~~g~lT~NE~R~~~g----l~p~~ggD~~~~~~ 471 (552) +-..|-+..+.....++.|+|.. .+.+++++.. +++ ..+.+|+++++|+|+++. ++.++..|.. T Consensus 333 l~~~i~~~~~g~~~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~~e~r~~L~~~g~~~~i~~~~~~---- 408 (437) T protein:vir:52 333 IDPLICNELFGGLPADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNEYQIANELRESGLFANISAEHIE---- 408 (437) T ss_pred HHHHHHHHhcCCCCCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCCCCccccc---- Confidence 87777666555444567777753 3334444332 122 234478999999999873 2223222110 Q ss_pred cccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccc Q lcl|NC_020081. 472 HVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSK 527 (552) Q Consensus 472 n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (552) ..+..+ .... ..+++++..+. ...++.++ T Consensus 409 ---------------~~~~~~---~~~~---~~~~~~~~~~~------~~~~~~~~ 437 (437) T protein:vir:52 409 ---------------ELKNAD---EFAG---NFEEPEKMEGA------QVQNSEDQ 437 (437) T ss_pred ---------------cccCCC---CCCC---ccCCCCCCCCC------CCCCCCCC Confidence 000000 0000 00011110000 00010100 No 107 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.87 E-value=1.9e-20 Score=128.69 Aligned_cols=488 Identities=13% Similarity=0.040 Sum_probs=227.1 Q ss_pred CCCCCCCcccccchhhcccccCccccc--ccccchhhhhcccccc--ccccccccccccccccccCC-c------ccccc Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVR--IKQIEEDAILKKGKNT--KSNKPKAYEEPIIGSMSMNP-D------FKEAP 69 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~-~------~~~~~ 69 (552) |.--+-.||.+..-..|-+-. .+.++ .++...........+. ..+....+.--+.+...... + ++..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~ 79 (532) T protein:vir:94 1 MADTDPTPRPEITYATLQQAQ-RVDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEA 79 (532) T ss_pred CCCCCCCCCcceehhhhhhHh-hhhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCcccccccccccccc Confidence 777777788777666661100 01111 0111111110011000 00000011000111111111 1 11111 Q ss_pred cCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCC Q lcl|NC_020081. 70 SIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGR 149 (552) Q Consensus 70 ~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~ 149 (552) ...+ -++.+..++. +++.+.+|.+.++.+.+ . ++.+...+.+ +... .....++..+.+++ T Consensus 80 ~~~~---~~~l~a~Y~~-~~l~r~~Vd~~aed~~r---------~--~~~i~~~~~~-~~~~---~~~~~i~~~~~~l~- 139 (532) T protein:vir:94 80 TSWP---GFPTLALLAQ-LPEYRTMHETPADECVR---------A--WGKITCSSKD-ELAA---DKATRITQKLEQYN- 139 (532) T ss_pred cccc---hHHHHHHHHc-CchhhhhhccchHHHhh---------C--CceEeeCCcc-ccch---HHHHHHHHHHHhhh- Confidence 1111 1245556665 45556666655553321 2 3334322221 1112 22333444444432 Q ss_pred CCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEEC-------------------CCCCEEEEEEecCceeEEEECC--Ccc Q lcl|NC_020081. 150 IDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYD-------------------KLGDLHNFKAVDASTVYVAVDE--DGK 208 (552) Q Consensus 150 ~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~-------------------~~G~~~~L~~l~p~~v~v~~~~--~g~ 208 (552) ..+.+..+++...++|.+++++.-. ..|.+.+|.+++|..|++.... +-. T Consensus 140 ---------v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~ 210 (532) T protein:vir:94 140 ---------VRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPT 210 (532) T ss_pred ---------HHHHHHHHHHhhhcccceEEEEEeccCCccccccccccccccccccceeeEEEeechheeccccccccccc Confidence 2234445556667889998887532 2344678999999998764321 100 Q ss_pred cccccceeEEEEEcCCceEEEEcccceeeecccccCC---ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_020081. 209 ERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTD---LTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRG 285 (552) Q Consensus 209 ~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~---~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~g 285 (552) -.....+..| +...+ ..+.++.|||+...+.++ +..+++|.|.++.+...|.....+......+....... T Consensus 211 sp~fg~P~~y-~v~~g---~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~-- 284 (532) T protein:vir:94 211 LPSFYKPDSW-IATSG---KKIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMT-- 284 (532) T ss_pred ccccCCceeE-EEccC---eeeccceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-- Confidence 0111122233 22222 357899999886554333 22346899999999999999999888887766654433 Q ss_pred EEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHh- Q lcl|NC_020081. 286 LLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEI- 364 (552) Q Consensus 286 il~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l- 364 (552) ++++.....++.+..+++.+++.....+-.|.+ +.++..+.-+|+.++.+..+ +.+......+.||++.+||...| T Consensus 285 v~k~~~a~~ls~~~~~~~~~r~~~~~~~~~n~g-~~~id~~~e~~e~~~~~lsg--l~~~l~~~~~~iAaa~~IP~t~Lf 361 (532) T protein:vir:94 285 NLATDMAQLLAPGGAQSLDARLQLFNLYRDNRN-IGALDKGTEEIQQTNTPLSG--LDSLQAQSQEQMAAVSHIPLVKLL 361 (532) T ss_pred eeeechHHhhcchhHHHHHHHHHHHHhhcCCcc-ceEEcCCCceeEEEecccCC--HHHHHHHHHHHHHhHhCCCeeeee Confidence 334332223344455677777765544433443 33454445677777666554 34677888999999999999965 Q ss_pred cccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeeccccc---ChHHHHHHH----H Q lcl|NC_020081. 365 NFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGG---DAKTEAEII----S 437 (552) Q Consensus 365 g~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~---d~~~~~~~~----~ 437 (552) |.. .+++..++. ....+.-...+...+..|.|+++.+-+.|-+..+.....++.|+|... +.+++++.. + T Consensus 362 G~s-p~GlnstGe--~D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~~~~d~~~~f~pL~~~s~kEkAei~~~~a~ 438 (532) T protein:vir:94 362 GIT-PNGLNASSD--GEIRVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQIDPGLAWEWSPLMELDDKELAEVRQLNAS 438 (532) T ss_pred cCC-cccccccch--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCceEEeCCCCCCCHHHHHHHHHHHHH Confidence 643 333322111 011222333333334567899999888887766654455678887644 334444322 2 Q ss_pred HH-HHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCc Q lcl|NC_020081. 438 IL-ESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSF 516 (552) Q Consensus 438 ~~-~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (552) ++ ..+.+|++++||+|++++..|..+.+........ +....... ........++....+. ..+..++.+++. T Consensus 439 a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~--~~~~~~~~---~~~~~~~~~~~~~~~~--~~~~~~~~~~d~ 511 (532) T protein:vir:94 439 TDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERDE--LDDVEEIA---KQLMAAALNPPATAPQ--TPNPQPDSEDDQ 511 (532) T ss_pred HHHHHHhcCCCCHHHHHHHHhcCCccccccccccccc--cccccchh---hhhcccccCCCCCCCC--CCCCCCCCCCCC Confidence 22 2345789999999999999998765432211110 00000000 0000000000000000 000000000000 Q ss_pred ccccCCCCccccccccccccccCccccccc Q lcl|NC_020081. 517 NQNVGKDGQSKQQANTNSTPQGGKDDNGNV 546 (552) Q Consensus 517 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 546 (552) .+.+....++..+..+-. |+. T Consensus 512 -----~~~~~~~~~~~~~~~~~~----~~~ 532 (532) T protein:vir:94 512 -----TDNQPDAQADPAQNDQPV----GNR 532 (532) T ss_pred -----CCCccCCCccccccCCCc----CCC Confidence 000111111111111110 111 No 108 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.85 E-value=6.1e-20 Score=125.93 Aligned_cols=466 Identities=12% Similarity=0.090 Sum_probs=207.2 Q ss_pred CCCCCcccccchhhc-----ccccCcccccccccchhhhhcc--cccc-cccccccc--------ccccccccccCC--- Q lcl|NC_020081. 3 LLDGFFKGRKQQDNI-----IDINDDMAVRIKQIEEDAILKK--GKNT-KSNKPKAY--------EEPIIGSMSMNP--- 63 (552) Q Consensus 3 ~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~--~~~~-~~~~~~~~--------~~~~~~~~~~~~--- 63 (552) .++ ++|++.-| -+ ....-.++.++.-.+.....-. .++. ..-++.+. ..+ .+-++..+ T Consensus 1 ~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~a~d~~~~~~ 77 (537) T protein:vir:10 1 MFK-FWRKKTVE-AVQSSIAERIEPRVGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHP-DMAMDGLDVEG 77 (537) T ss_pred CCC-cccccccc-ccccccccccccccCCCcccchhhHHHHHHhhhhccCCCCCccCccccccccccc-chhccccccch Confidence 333 55554422 11 1111111222110000000000 0000 00000000 000 00011111 Q ss_pred ----------------cccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccc Q lcl|NC_020081. 64 ----------------DFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQ 127 (552) Q Consensus 64 ----------------~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~ 127 (552) .++..+...+ .+.+..++. +++.+.+|.+.++.+.+ .++.+...+.+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~----~~l~a~Y~~-~~l~r~iVd~~A~d~~r-----------~~~~i~~~~~~- 140 (537) T protein:vir:10 78 GTFSAYANPNLSEGLVLWYAQQAFIG----HQMCALIAT-HWLVNKACSQMPRDAMR-----------KGYKIISDDGN- 140 (537) T ss_pred hhhhhhccccccchhhhhccccCCcc----HHHHHHHHh-CchhhhhhhhhhHHhhc-----------CCceeecCCcc- Confidence 0111111211 234444554 45556666655544321 23344322221 Q ss_pred cCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEE---C-------------CCCCEEEE Q lcl|NC_020081. 128 EPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVY---D-------------KLGDLHNF 191 (552) Q Consensus 128 ~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r---~-------------~~G~~~~L 191 (552) +..+ +..+.++..+.+++ .+..|.+. +....++|.++++|.- | ..|.+.+| T Consensus 141 ~~~~---~~~~~l~~~~~~l~---------~~~~l~~a-~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l 207 (537) T protein:vir:10 141 ELDP---KDAKFIDRYDRAFN---------IKKHAIQF-VRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGI 207 (537) T ss_pred cccH---HHHHHHHHHHHHhh---------HHHHHHHH-HHhcccccceEEEEeecCcCCcccccccccccccccceeEE Confidence 1112 22334455555443 12334444 4445567988888753 2 12346788 Q ss_pred EEecCceeEEEE------CCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCC---ccCCcccccHHHHHHHH Q lcl|NC_020081. 192 KAVDASTVYVAV------DEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTD---LTVGKYGYPELEIALNH 262 (552) Q Consensus 192 ~~l~p~~v~v~~------~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~---~~~g~~G~spl~~~~~~ 262 (552) ..|+|..|.+.. +.....+. .+..|.. . ...+.++.|||+.-.+..+ +..+++|+|.++.+... T Consensus 208 ~vidp~~~~~~~~~~~~~dp~sp~fg--~P~~y~v--~---g~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~ 280 (537) T protein:vir:10 208 VQIDPYWCAPLLDAQASSNPVSMHFY--EPTYWLI--N---GKKYHRSHLAIYINDEVVDFLKPSYIYGGVPLPQQIMER 280 (537) T ss_pred EEechhhcccccchhhhccCCccccC--Cceeeee--c---CeEecceeEEEecCCCCchhhhcccCcccccHHHHHHHH Confidence 889988877532 11111111 1223322 1 2457889999876443222 22347899999999999 Q ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCCCCCC-CHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHH Q lcl|NC_020081. 263 LQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQ-SNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDME 341 (552) Q Consensus 263 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~-s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q 341 (552) |.....+.......+...... +++++....+ ++++ +.+.+.....+.+|.+- .++..++-+|+.+..+..++ T Consensus 281 l~~~~~t~~~~~~l~~~~~~~--v~k~~~~~~l~~~~~---~~~r~~~~~~~r~n~g~-~~id~e~e~~e~~~~~lsgl- 353 (537) T protein:vir:10 281 VYAAERTANEGPMLAMTKRQT--VLKVDAAQVLANKQQ---FDETMSWWTATRDNYQV-RVVDKDNEDVVQIDTTLNDL- 353 (537) T ss_pred HHHHHHHHHHHHHHHHhcCCc--eeeechHHhhcCHHH---HHHHHHHHHhhcCCcce-eEecCCCceeEEEeccCCCH- Confidence 999999888888877766554 3444432222 3333 33444333334344443 45555567788776655543 Q ss_pred HHHHHHHHHHHHHHHhcCCHHHh-cccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccce Q lcl|NC_020081. 342 FEKWLNYLINVICSIYSIDPSEI-NFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDY 420 (552) Q Consensus 342 ~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~ 420 (552) .++.....+.||++.|||...| |....+..++..+...+|.. ..... +..|.|.+..+.+.|-+..+.. ..++ T Consensus 354 -~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd---~I~~~-Qe~l~p~l~~l~~ll~~~~~~~-~~~~ 427 (537) T protein:vir:10 354 -DKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHE---ECEST-QDDMRPLIDRHHQLVCRSHLRK-RIRV 427 (537) T ss_pred -HHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHHH---HHHHH-HHHHHHHHHHHHHHHHHhcCCC-Ccce Confidence 4777888999999999999965 64332211111111122221 11111 2348999999988887766543 3345 Q ss_pred eeccccc---ChHHHHHHH----HHH-HHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCC Q lcl|NC_020081. 421 VFNFVGG---DAKTEAEII----SIL-ESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMD 492 (552) Q Consensus 421 ~~~f~~~---d~~~~~~~~----~~~-~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~ 492 (552) .|+|... +.+++++.. +++ ..+.+|++++||+|+.|+.+|..|-+-+............. . .+ T Consensus 428 ~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~----~-----~~ 498 (537) T protein:vir:10 428 KVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDID----V-----DD 498 (537) T ss_pred EEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccccCCCChhhhhccc----C-----Cc Confidence 6666533 444444432 122 23446899999999999998765433221110000000000 0 00 Q ss_pred CCccCcccCCCCCCCCCCCCCCC-cccccCCCCccccccccccccc Q lcl|NC_020081. 493 ANQFLAQQTGYDGNMDNVNGKDS-FNQNVGKDGQSKQQANTNSTPQ 537 (552) Q Consensus 493 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 537 (552) ..++. ......++..+. .....+ +.-.+..++..+++. T Consensus 499 ~~~~~------~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~~~ 537 (537) T protein:vir:10 499 EGKPV------RIIEDQPAPSEMFGATSSG-ESANDPRDSGAAFED 537 (537) T ss_pred cCCcC------CCCCCCCCccccCCCCccc-cccCCCccCccccCC Confidence 00000 000000000000 000001 111112222222222 No 109 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.78 E-value=6.2e-17 Score=109.44 Aligned_cols=494 Identities=11% Similarity=0.052 Sum_probs=202.2 Q ss_pred CCCCCCCcccccchhhcccc---------cCcccccccccchhhh-hcccc-cccccccc-------ccccccccccccC Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDI---------NDDMAVRIKQIEEDAI-LKKGK-NTKSNKPK-------AYEEPIIGSMSMN 62 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~~-------~~~~~~~~~~~~~ 62 (552) .-..-|.|-++.-..=.++. ++.+.+... .+..+ +.... ........ .+......+.+.. T Consensus 40 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s 117 (862) T protein:vir:99 40 ARTRQNWPVQKEKPNPIIRSVKDFPFVEISDSVNAKSV--SGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQS 117 (862) T ss_pred HhhcccCCcccccCCCCCCcccccccccccccccchhh--hhhhhcchhhcchhhhhhhhhhhhcchhhhhhcccccccc Confidence 11111332222211111111 111111100 00000 00000 00000000 0001111111111 Q ss_pred Cccc--------ccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHH Q lcl|NC_020081. 63 PDFK--------EAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNK 134 (552) Q Consensus 63 ~~~~--------~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~ 134 (552) +.+. .-++.. -++.+..|+.+ ++.+.+|.+.++.+.+ .++.+.......+... T Consensus 118 ~y~~~~~~~~~~~~~~f~----gyql~alY~~~-~larkiVd~pAeDatR-----------~g~~I~~~~d~~e~~~--- 178 (862) T protein:vir:99 118 SYAVPEALQDWYLSQGFI----GHQACALIAQH-WLVDKACSLAGEDAIR-----------NGWHLKSLGEGEEIDE--- 178 (862) T ss_pred ccccchhccccccccCcc----cHHHHHHHHhC-chhhhhhhhhhHHHhh-----------CCceEeecCcccccCH--- Confidence 1100 001111 12455666654 5556666655554322 2344432222222222 Q ss_pred HHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEE---C-------------CCCCEEEEEEecCce Q lcl|NC_020081. 135 KKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVY---D-------------KLGDLHNFKAVDAST 198 (552) Q Consensus 135 ~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r---~-------------~~G~~~~L~~l~p~~ 198 (552) ..+..++..+.++++ +..|.+ .+...-++|.+++++.. | ..|.+.+|..|+|.. T Consensus 179 e~~~~ie~~~~rL~v---------~~~l~e-air~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w 248 (862) T protein:vir:99 179 ESLEKFKAIDVEFKV---------KENLIE-FNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYW 248 (862) T ss_pred HHHHHHHHHHHHhhH---------HHHHHH-HHHhcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEechhh Confidence 233445555555432 223444 44444567777766542 2 124567888899887 Q ss_pred eEEEE----CCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCC---ccCCcccccHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 199 VYVAV----DEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTD---LTVGKYGYPELEIALNHLQYHDNTEV 271 (552) Q Consensus 199 v~v~~----~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~---~~~g~~G~spl~~~~~~i~~~~~~~~ 271 (552) +.+.. ..+-.-..+..+..|.. .+ ..+.++.||++.-....+ +...++|+|.++.+...|.....+.. T Consensus 249 ~~p~~v~~~~~Dp~sp~yGkP~~y~I--~g---~~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~ 323 (862) T protein:vir:99 249 MMPMLTAESTADPSSQFFYEPEFWII--SG---QKYHRSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTAN 323 (862) T ss_pred hcccccccccccccccccCCceeeee--cC---eeeccceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHH Confidence 76532 12211111112223322 12 245667776654322111 22336899999999999999999999 Q ss_pred HHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHH Q lcl|NC_020081. 272 FNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLIN 351 (552) Q Consensus 272 ~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~ 351 (552) ....++.+.... +++++....+..+ +.+.+.+.....+.+|.| +.++ +.+-+|+.++.+..+. .+......+ T Consensus 324 saa~Ll~ka~l~--v~ktd~l~~l~~e--d~l~~r~~~~~~~rdN~G-i~li-D~eEe~e~ls~slSGL--~dll~~~~q 395 (862) T protein:vir:99 324 EAPLLAMNKRTT--AIHTDTAKAIANE--DKFIQRLMFWVRYRDNHA-VKVL-GTDETMEQFDTSLADF--DAVIMGQYQ 395 (862) T ss_pred HHHHHHHHhccc--eeechhHhhhccH--HHHHHHHHHHHhccCcce-eEEe-cCCCceeEEecccCCh--HHHHHHHHH Confidence 998888875543 4555443333322 234444444344444444 4444 4567788777665544 366778888 Q ss_pred HHHHHhcCCHHH-hcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeeccccc--- Q lcl|NC_020081. 352 VICSIYSIDPSE-INFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGG--- 427 (552) Q Consensus 352 ~Ia~~fgVPp~~-lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~--- 427 (552) .||++.+||... +|....|...+..+... |.-+.....-+..|+|+++++...+...+..+ .++.|+|... T Consensus 396 ~IAaas~IP~tiLfGqspaGlnATGE~D~~---nYyD~I~s~QE~~L~P~LerL~~li~~~lg~~--~d~~ieFnpL~~~ 470 (862) T protein:vir:99 396 LVASIAKTPATKLLGTAPKGFNSTGEFETI---SYHEELESIQEHVYMPFLQRHYLISRLSLGIQ--HEIDVVMEPVASM 470 (862) T ss_pred HHHhhhCCCceeecccCcccccCchHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--CcceEEeCCCCCC Confidence 999999999995 46442222111111111 22222222224668899999988876665433 3456666443 Q ss_pred ChHHHHHHH----HHH-HHHhcCCcCHHHHHHHh------CCCCCCCCCeeeccccccchhhhccccccccccCCC--CC Q lcl|NC_020081. 428 DAKTEAEII----SIL-ESKAKIGLTINDIRKEL------GYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMD--AN 494 (552) Q Consensus 428 d~~~~~~~~----~~~-~~~~~g~lT~NE~R~~~------gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~--~~ 494 (552) +.+++++.. +++ ..+.+|+++++|+|+++ |++.++..|..-.+.... +.. ....+.+. .. T Consensus 471 sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~--e~~-----~~~e~~g~a~~~ 543 (862) T protein:vir:99 471 TAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASP--ENL-----AAYQKAGAAQET 543 (862) T ss_pred CHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCc--ccc-----cccccCCccccc Confidence 334444432 122 23347899999999976 444444333221111000 000 00000000 00 Q ss_pred ccC-cccCCCCCCCCCCCCCCCcccccCCCCccccccccccccccCccccccccccccC Q lcl|NC_020081. 495 QFL-AQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQGGKDDNGNVVNDWEA 552 (552) Q Consensus 495 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (552) .+. ..+.+......+.++.....-..++-++... .+.+...-+..+.+.....|.+ T Consensus 544 ap~de~~aga~~~~~e~d~~~~p~~~~~~~g~~~~--~t~~~~a~~p~~~~~~~~~~~~ 600 (862) T protein:vir:99 544 ASAKETQAGAAVTTAEGDQPNVQMVPSMKPGQMVG--PEVGITAPMPEDDAPVAGVVAK 600 (862) T ss_pred ccccccccccCCccccCCcccccccCCCCCCCccc--cccccccCCCccccccCccccc Confidence 000 0000000000000000000000000111000 0000000011112222222222 No 110 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=99.77 E-value=3.5e-17 Score=110.83 Aligned_cols=430 Identities=11% Similarity=0.059 Sum_probs=219.9 Q ss_pred cccccchhhhhccccccccccccccccccccccccCCccccc----ccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEA----PSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQV 102 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~ 102 (552) |..+ +.-..++-..+ ...+. ...+....... +..+. +...+..+.+.+.-.-+.+|+..|...| T Consensus 1 ~~~~-----~~~~~p~~~~g--~~~~~----~~~~~~~~~~~~e~~~~lr~-~~~~~ly~~m~e~D~~i~s~l~~rk~av 68 (469) T protein:vir:10 1 MTER-----VKTAAPVSEAG--YVFGS----GVVDGWTVWDPFEQTPELQW-PQSVAVYSRMDNEDSRVTSLLEAISLPI 68 (469) T ss_pred CCCc-----ccCCCCccchh--hhhhc----ccccchhhcccccccccccc-ccchHHHHHHHhhChHHHHHHHHHHHHH Confidence 2111 11000000000 00000 00111011110 11111 1112333444433344566666555443 Q ss_pred HHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCC-------CCCCccCCHHHHHHHHHHHHHhcCC Q lcl|NC_020081. 103 SMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRI-------DNDFTRDNFRSFVKKLVRDRLTYDK 175 (552) Q Consensus 103 ~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~-------~~pn~~~t~~~f~~~~v~d~ll~Gn 175 (552) .+..|.|..-+.+ ++ ..+.+...|..+... ....-+.+|++++..++.+.+.+|. T Consensus 69 -----------~~~~w~v~p~~~~----~e---~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~ 130 (469) T protein:vir:10 69 -----------RSTPWRIRANGAS----DE---VTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGH 130 (469) T ss_pred -----------hcCCceEecCCCC----HH---HHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCc Confidence 3556666432221 11 112222333322110 0112245788888888888889999 Q ss_pred eeEEEEECCC-----CC--EEEEEEecCcee-EEEECCCccccccc-c----eeEEEEEcCCceEEEEcccceeeecccc Q lcl|NC_020081. 176 INFELVYDKL-----GD--LHNFKAVDASTV-YVAVDEDGKERKAK-D----GVRYVQVIDDKVVAKFKAKEMAWEVSNP 242 (552) Q Consensus 176 a~~~i~r~~~-----G~--~~~L~~l~p~~v-~v~~~~~g~~~~~~-~----~~~y~~~~~~~~~~~~~~~evi~~~~~~ 242 (552) ++.++++... |. +..|.+.|+.++ +...++++.+.... . .........+.....+++...|+++++. T Consensus 131 s~~Eivw~~~~~~~dG~~~~~~l~~rp~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~ 210 (469) T protein:vir:10 131 AVFEQVYRPRNQSPDGRFWLRKLAPRPQWTISKFNVAPDGGLESIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNK 210 (469) T ss_pred eeeeeeeecccccCCCceeeeeeeecCcccceeeeeccCCceeeeeecCcccccccccccCCCCccccccCcEEEEEecC Confidence 9999998643 43 667888888766 34444544332110 0 0011112223334567777777777654 Q ss_pred cCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccccccccee Q lcl|NC_020081. 243 RTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPV 322 (552) Q Consensus 243 ~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~i 322 (552) ++ ..+||.|.+..|......-....++...|...-|.|--+.+.+.+ .++++++.+.+.+.+...|. +++ + T Consensus 211 ~~---g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~--a~~~ek~~l~~a~~~~~~g~-~a~---~ 281 (469) T protein:vir:10 211 RP---GQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSA--TDEDEVRKMAALARSVRGGI-NAG---V 281 (469) T ss_pred CC---CCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCC--CCHHHHHHHHHHHHHHhcCC-ceE---E Confidence 32 338999999999999999999999999999999999888887654 37778888888777665443 232 3 Q ss_pred eccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHH Q lcl|NC_020081. 323 ITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLK 402 (552) Q Consensus 323 l~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~ 402 (552) +.+.|++++-+..+-....|.+..++..++|+.+. ||.+-.+ .++..+++..+. ........+.-.++ T Consensus 282 iip~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~i------LG~tlTs-----~~~gGS~a~~~v-h~ev~~d~~~sDa~ 349 (469) T protein:vir:10 282 GLAQGQILELLGVSGNLPDIRRAIEGHDRSIALSG------LAHFLNL-----DGKGGSYALASV-LEDPFTQAVHAYAT 349 (469) T ss_pred EccCCceEEEeecCCCchHHHHHHHHHHHHHHHHH------hcccccc-----cCccchhhHHHH-HHHHHHHHHHHHHH Confidence 34567776666655555678899999999998875 5533211 112233443333 44455668889999 Q ss_pred HHHHHHHhhcCcc-----ccc--c-eeecccccCh--HHHHHHHHHHHHHhcCC-----cCHHHHHHHhCCCCCCCCCee Q lcl|NC_020081. 403 FIEDAVNKYIVSQ-----FGG--D-YVFNFVGGDA--KTEAEIISILESKAKIG-----LTINDIRKELGYPDTEGGDVT 467 (552) Q Consensus 403 ~ie~~ln~~L~~~-----~~~--~-~~~~f~~~d~--~~~~~~~~~~~~~~~g~-----lT~NE~R~~~gl~p~~ggD~~ 467 (552) .|+..||+.|++. ++. . -+|.|...+. +..++..+ .....|. ++.+.+|+.+|+|+-..++.. T Consensus 350 ~i~~tln~~li~~l~~lN~g~~~~~P~~~~~~~e~~~~~~a~~i~--~l~~~G~~~~~~~~~~~~~e~~gip~~~~~~~~ 427 (469) T protein:vir:10 350 SICRIANQHIIEDLVDINFGVDTPAPVLTFDPIGSRQDLTAAAVK--LLYDAGVFDDDPAVKRAIRQRFNLPSELNDTPS 427 (469) T ss_pred HHHHHHHHHHHHHHHHhcCCCCCCccEEEecCCCCcHHHHHHHHH--HHHhcCCccCccccHHHHHHHhCCCCCCCCccc Confidence 9999999887653 221 1 2566654332 22232222 2333565 456789999999977665543 Q ss_pred eccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccccc Q lcl|NC_020081. 468 LAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQAN 531 (552) Q Consensus 468 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 531 (552) +.+.. ..........+..... .+..+.......+....-.+. T Consensus 428 ~~~~~----------~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~l~da 469 (469) T protein:vir:10 428 AEPEE----------PAAVPNQSAAPARTRS------------SGNADARARAPKADQGVLFDA 469 (469) T ss_pred ccchh----------cccCCCCCccccccCC------------CCCcccccccCCChHHhhccC Confidence 32210 0000000000000000 011111111111111111011 No 111 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.72 E-value=3.7e-17 Score=110.69 Aligned_cols=408 Identities=11% Similarity=0.077 Sum_probs=191.2 Q ss_pred cccccchhhhhccccccccccc--ccccccccccc-ccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKP--KAYEEPIIGSM-SMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVS 103 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~ 103 (552) |.+ +.++...+...++-++.. ........... +|+ .|.. +..-.+..|.++-..+++.+.+|.+.++.+. T Consensus 1 ~~~-~~~a~~~~~~~~a~~~~~~~~~~g~~~~~d~~~~~-~~~~-----~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~~ 73 (461) T protein:vir:80 1 MYS-IDKAKQAKIDSKIVNRNDFMVGHGKANSRDKLTRQ-TPGN-----GQKLDLKACENLYASNSIAMNIVDIISEDMV 73 (461) T ss_pred Ccc-chhhhhhhhhhhhhhhhHHHhhcCCcchhhhhhcc-ccCc-----ccccCHHHHHHHHHhCCccchhhccchHHhh Confidence 111 111111111000000000 00000000000 111 1111 1111356665554456666777666555432 Q ss_pred HHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEE- Q lcl|NC_020081. 104 MFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVY- 182 (552) Q Consensus 104 ~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r- 182 (552) .. |+.+...+ ......++.++.+++ ..+-+...+++..++|.+|++|.- T Consensus 74 ---------r~--g~~i~~~~---------~~~~~~~~~~~~~l~----------~~~~l~~~~~~~rl~G~a~i~i~v~ 123 (461) T protein:vir:80 74 ---------RA--GWSLKTDN---------KEMKKNIESKWRKLK----------TKDRFQKLYADKRLYGDGFLSIGVV 123 (461) T ss_pred ---------cC--CeeeecCC---------HHHHHHHHHHHHHhh----------HHHHHHHHHHhhcccccEEEEEEee Confidence 11 34442211 122344556665543 123455566677899999988853 Q ss_pred CCC------------CCEEEEEEe---cCceeEE---EECCCcccccccceeEEEEEc------------CCceEEEEcc Q lcl|NC_020081. 183 DKL------------GDLHNFKAV---DASTVYV---AVDEDGKERKAKDGVRYVQVI------------DDKVVAKFKA 232 (552) Q Consensus 183 ~~~------------G~~~~L~~l---~p~~v~v---~~~~~g~~~~~~~~~~y~~~~------------~~~~~~~~~~ 232 (552) +.. +.+.+|.+| .+..+.+ ..+..+..+ ..+..|.+.. .+.....+.+ T Consensus 124 d~~~~~~~~~~pl~~~~~~~~~~l~~~~~~~i~~~~~~~dp~sp~f--g~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~ 201 (461) T protein:vir:80 124 SSNREQADLSTAIDPKTIKSIPYINTFNTQKVTQLYLNQDMFSEHF--GEVEFFEVNRVSQLGEEILSGTTASTSEQIHR 201 (461) T ss_pred cCCccccCccCCcccccccceeEEEeccccccchhhhcccCcCccc--ccceEEEEeccccccccccccccCccceEEcc Confidence 211 122233333 3333221 111111111 1222333321 2334467889 Q ss_pred cceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_020081. 233 KEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFS 312 (552) Q Consensus 233 ~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~ 312 (552) +.|||+..... .+..+|.|.++.+...|.....+......++.+...+ ++++++-.....+....+.+.++.... T Consensus 202 SRii~~~~~~~---~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~~~~~~ 276 (461) T protein:vir:80 202 SRIIHEQGLRF---EGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFK--VYKTDDIDALNKDDKANLTAMLDFMFR 276 (461) T ss_pred ccEEEecCCCC---CccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCC--ceecchHHhhhchHHHHHHHHHHHhcC Confidence 99998865432 2347899999999999999999998888877765543 456654322333344455666654332 Q ss_pred cccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHh-cccccccccccccccccchhHHHHHHH Q lcl|NC_020081. 313 GINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEI-NFPNRGGATGHSGNTLNEGSSAEKYRN 391 (552) Q Consensus 313 G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~~~~~~~~~~n~e~~~~~ 391 (552) +. .+ ++.+.+-+++.++.+..+ +.+........||++-+||...| |.. .++++...+... +.-+.... T Consensus 277 ---~~-g~-~~~d~~e~~e~~~~~lsg--l~~~l~~~~~~iaa~s~iP~t~L~G~s-~g~~asge~D~~---~yyd~i~~ 345 (461) T protein:vir:80 277 ---TE-AL-AIIKGDEQLTKESTNVSG--MKDLLDYGWDYLAGAVRMPKTVLKGQE-AGTLTGAQYDVM---NYYARVSS 345 (461) T ss_pred ---Cc-eE-EEEcCCcceEEEecCcCC--HHHHHHHHHHHHhhhhcCCeeeeeccc-CCccccchHHHH---HHHHHHHH Confidence 22 23 344556778877776665 44788899999999999999876 544 343322121222 22222333 Q ss_pred HHHHHhhHHHHHHHHHHHhhcCcc------cccceeeccccc---ChHHHHHHH----HHH-HHHhcCCcCHHHHHHHh- Q lcl|NC_020081. 392 SKDKGLEPLLKFIEDAVNKYIVSQ------FGGDYVFNFVGG---DAKTEAEII----SIL-ESKAKIGLTINDIRKEL- 456 (552) Q Consensus 392 ~~~~~l~P~~~~ie~~ln~~L~~~------~~~~~~~~f~~~---d~~~~~~~~----~~~-~~~~~g~lT~NE~R~~~- 456 (552) .-+..++|+++++-..|-+..+.. ...++.++|... +.+++++.. +++ ..+.+|++|++|+|+.+ T Consensus 346 ~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~ 425 (461) T protein:vir:80 346 IQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRF 425 (461) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHH Confidence 333567788877777665443321 113456666544 334444432 222 23347899999999855 Q ss_pred C---CCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCC Q lcl|NC_020081. 457 G---YPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKD 514 (552) Q Consensus 457 g---l~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (552) + ++|.. . +...+ .+..+. .. .......+.++++ T Consensus 426 ~~~~~~~~~--~-------~~~~~-------------~~~~~~-~~--~~~~~~~~e~~~g 461 (461) T protein:vir:80 426 GRFGLENSS--K-------FSGDS-------------AEIDKL-AK--LVYDAYAKKNADG 461 (461) T ss_pred HhcCCCCCc--c-------CCCCC-------------chhhhh-hh--hccccccccCCCC Confidence 2 32211 0 00000 000000 00 0000000011111 No 112 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.71 E-value=2.4e-17 Score=111.71 Aligned_cols=393 Identities=13% Similarity=0.148 Sum_probs=184.4 Q ss_pred ccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcccccee Q lcl|NC_020081. 40 GKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYE 119 (552) Q Consensus 40 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~ 119 (552) ..+.-+ |..-+.+..+.. .++..+... .+..|-++-..+++.+.+|.+.++.+.+ .++. T Consensus 1 ~~~~D~-----~~n~~~gg~~~~-~~~~~~~~~----~~~~l~a~Y~~~~l~~~~Vd~~aed~~r-----------~g~~ 59 (422) T protein:vir:10 1 MVKTDS-----YANIFLGGSDGS-EIYGSLQNQ----APTILASLYADNALVRRIIDTIPETALA-----------AGFH 59 (422) T ss_pred Cccchh-----hHHHHcCCCCCc-cccCccccc----CHHHHHHHHHhChhhHHHHhhhhHHHhc-----------CCcc Confidence 111111 111111111111 111111111 1233333333456666666665554322 1233 Q ss_pred eeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEE-C---------CCCCEE Q lcl|NC_020081. 120 IRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVY-D---------KLGDLH 189 (552) Q Consensus 120 i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r-~---------~~G~~~ 189 (552) |.-. .+..+ ++.-+.+++ ..+-+...+....++|.+++++.. + ..|.+. T Consensus 60 i~~~--------~~~~~---~~~~~~~l~----------~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~ 118 (422) T protein:vir:10 60 IDGI--------DDEPA---FWSRWDDLE----------MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELE 118 (422) T ss_pred ccCC--------CHHHH---HHHHHHHhh----------HHHHHHHHHHhhccccceEEEEEecCCCCccccccccCcee Confidence 3111 11111 222233332 223455556667788999888874 2 356788 Q ss_pred EEEEecCceeEEEECC-CcccccccceeEEEEEcC-CceEEEEcccceeeecccccCC---ccCCcccccHHHH-HHHHH Q lcl|NC_020081. 190 NFKAVDASTVYVAVDE-DGKERKAKDGVRYVQVID-DKVVAKFKAKEMAWEVSNPRTD---LTVGKYGYPELEI-ALNHL 263 (552) Q Consensus 190 ~L~~l~p~~v~v~~~~-~g~~~~~~~~~~y~~~~~-~~~~~~~~~~evi~~~~~~~~~---~~~g~~G~spl~~-~~~~i 263 (552) .|.++++..|++..-. +-.-.....+..|.+... +.....+.++.+||+.-.+.++ ....++|.|+++. +.+.| T Consensus 119 ~l~v~d~~~i~~~~~~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i 198 (422) T protein:vir:10 119 TVRVYDRTQVKVQTREENPRNARFGEPLTYRITTNESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSI 198 (422) T ss_pred eEEeeccccccchhcccCccccccCcceEEEEecCCCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHH Confidence 9999999988764321 111111112334444332 2334677888888774332111 3345789999986 67889 Q ss_pred HHHHHHHHHHHHHHhccCCCceEEEeCCCCC--CCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHH Q lcl|NC_020081. 264 QYHDNTEVFNARFFAQGGTTRGLLHIKTGQE--QSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDME 341 (552) Q Consensus 264 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~--~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q 341 (552) .....+.......|...... ++++++-.. .+......+++++........+.+. .++.+++-+|+.++.+..+ T Consensus 199 ~~~~~~~~~~~~l~~~~~~~--v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~-~~l~~~~e~~e~~~~~lsg-- 273 (422) T protein:vir:10 199 KDYTNCERLATQLLKRKQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQA-IGIDAESEEYSVLNSDIGG-- 273 (422) T ss_pred HHHHHHHHHHHHHHHHhccc--cccchhHHHhcCCccchHHHHHHHHHHHHhcCCccc-eeEecCCcceEEEecccCC-- Confidence 98888888888776655433 345443111 1122223333333333222222333 3555666788887777665 Q ss_pred HHHHHHHHHHHHHHHhcCCHHHh-cccccccccccccc-c-ccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccccc Q lcl|NC_020081. 342 FEKWLNYLINVICSIYSIDPSEI-NFPNRGGATGHSGN-T-LNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGG 418 (552) Q Consensus 342 ~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~~~~~-~-~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~ 418 (552) +.+......++||++.+||...| |.. .+++.+++.. . ..|..++..+ +..|+|.+.++=..| ... . T Consensus 274 l~~~~~~~~~~iaaa~~IP~t~L~G~s-~~Glnatgd~d~~~yyd~i~~~Q----e~~l~p~l~~l~~~i----~~s--~ 342 (422) T protein:vir:10 274 IDAFLDKKFDRIVALSGIHEIILKNKN-VGGVSSSQNTALETFHKLVDRKR----NAELLPILEFLIPFI----VNA--E 342 (422) T ss_pred hHHHHHHHHHHHHhhhCCCeeeeccCC-cccccccchHHHHHHHHHHHHHH----HHHHHHHHHHHHHHh----ccc--C Confidence 45778899999999999999977 443 4444222211 1 1222222222 345566655553332 212 3 Q ss_pred ceeecccc---cChHHHHHHH----HHH-HHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccC Q lcl|NC_020081. 419 DYVFNFVG---GDAKTEAEII----SIL-ESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQ 490 (552) Q Consensus 419 ~~~~~f~~---~d~~~~~~~~----~~~-~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~ 490 (552) ++.++|.. .+.+++++.. +++ ..+.+|+++++|+|+.|--.....| +.+ ...+.+ T Consensus 343 ~~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~---~~~-~~~~~~------------- 405 (422) T protein:vir:10 343 EWSVEFNPLAQESSKDKAEILEKNVNSIAALIAAGAMDIDEARDTLRTIAPEVK---IND-GSVETE------------- 405 (422) T ss_pred CcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHhhhhccccc---CCC-CCCccc------------- Confidence 45566643 3334444432 222 2344689999999998732211110 000 000000 Q ss_pred CCCCccCcccCCCCCCCCCCCCCCCcccc Q lcl|NC_020081. 491 MDANQFLAQQTGYDGNMDNVNGKDSFNQN 519 (552) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (552) ......+ +.+...|+ .+ T Consensus 406 -----~~~~~~~-~~~~~~~~------~d 422 (422) T protein:vir:10 406 -----VTISETS-NDPLEVPT------DD 422 (422) T ss_pred -----cchhhcC-CCCCCCCC------CC Confidence 0000000 00000000 00 No 113 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.71 E-value=3.4e-17 Score=110.90 Aligned_cols=407 Identities=12% Similarity=0.123 Sum_probs=185.6 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQM 80 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (552) ||+|= ++++ ..-.+... +...+..+. +..++.+. .+....+.. T Consensus 1 ~~~~m---~~~~-------------------~~~~~~D~-----------~~~~~~~~~---g~~~~~~~-~~~~~~~~~ 43 (435) T protein:vir:79 1 MGVFM---SDKV-------------------KAITKEDG-----------YNEIFGSKD---GTFRPNAF-YMQRAAFKA 43 (435) T ss_pred CCccc---cccc-------------------ccchhhcc-----------hhhhhcccc---cccccCcc-cCCcCCHHH Confidence 66653 2220 00001111 111111110 00011111 111123445 Q ss_pred HHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 81 LKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 81 Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) |-++-..+++.+.+|.+.++.+.+. ++.+. .. .+..+ ++..+.+++ .. T Consensus 44 l~~~Y~~~~l~~~~Vd~~aed~~r~-----------g~~i~--g~------~~~~~---~~~~~~~l~----------~~ 91 (435) T protein:vir:79 44 LSQFYEEDGMARRIVDVIPEEMVTP-----------GFKVD--GV------KNEKS---FKSRWDELR----------LN 91 (435) T ss_pred HHHHHhcCchhhhhhccchHHhhcC-----------Cceec--CC------ChHHH---HHHHHHHhh----------HH Confidence 5444234466666666655543321 23331 11 11112 333333332 12 Q ss_pred HHHHHHHHHHHhcCCeeEEEEE-C---------CCCCEEEEEEecCceeEEEECC-CcccccccceeEEEEEcC-CceEE Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVY-D---------KLGDLHNFKAVDASTVYVAVDE-DGKERKAKDGVRYVQVID-DKVVA 228 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r-~---------~~G~~~~L~~l~p~~v~v~~~~-~g~~~~~~~~~~y~~~~~-~~~~~ 228 (552) +-+...+....++|.+++++.. + ..|.+.+|.++++..|++..-. +-.-.....+..|.+... +.... T Consensus 92 ~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~~~~dp~sp~fg~P~~y~v~~~~~~~~~ 171 (435) T protein:vir:79 92 AKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQITIHERETNARSVRYGEPKLYKISPGGDIPEF 171 (435) T ss_pred HHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhccchhhccCCcccccCcceEEEEecCCCCCce Confidence 3444555566788888887763 2 3456779999999888753321 100011112233443322 22456 Q ss_pred EEcccceeeecccccCC---ccCCcccccHH-HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC--CCHHHHHH Q lcl|NC_020081. 229 KFKAKEMAWEVSNPRTD---LTVGKYGYPEL-EIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQE--QSNQALTS 302 (552) Q Consensus 229 ~~~~~evi~~~~~~~~~---~~~g~~G~spl-~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~--~s~~~~~~ 302 (552) .+.++.+||+...+.++ ....++|.|++ +.+.+.|.....+......++...... ++++++-.. .+...... T Consensus 172 ~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~--v~~~~~l~~~~~~~~~~~~ 249 (435) T protein:vir:79 172 FVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQA--VWKARDLALMCDDEEGRYA 249 (435) T ss_pred EEcceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc--cccchhHHHhhcCccchHH Confidence 78888888875332221 22457899998 688899999998888888766554433 344432101 11222223 Q ss_pred HHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHh-ccccccccccccccccc Q lcl|NC_020081. 303 FRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEI-NFPNRGGATGHSGNTLN 381 (552) Q Consensus 303 ~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~~~~~~~~ 381 (552) +++.+........+.+. .++.+++-+|+.++.+..+ +.+......+.||++.+||...| |.. .+++.+++.. . T Consensus 250 ~~~r~~~~~~~~~~~~~-~~i~~~~e~~e~~~~~lsg--l~~~~~~~~~~iaaa~~IP~t~L~G~s-~~glnstgd~--d 323 (435) T protein:vir:79 250 ARLRLAQVDDESGVGKA-IGIDATDEEYEVLNSDVSG--VPEFLQEKIDRIVALTGIHEIIIKNKN-TGGVSASQNT--A 323 (435) T ss_pred HHHHHHHHHHhcCCCCc-eeEecCCcceEEEecccCC--HHHHHHHHHHHHHhhhCCCeeeeccCC-ccccccchhH--H Confidence 33333222221122333 3555555678877766654 45778889999999999999776 443 3433222111 0 Q ss_pred chhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccc---cChHHHHHHH----HHH-HHHhcCCcCHHHHH Q lcl|NC_020081. 382 EGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVG---GDAKTEAEII----SIL-ESKAKIGLTINDIR 453 (552) Q Consensus 382 ~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~---~d~~~~~~~~----~~~-~~~~~g~lT~NE~R 453 (552) ..|.-+.....-+..++|.+.++-..+ ... .++.++|.. .+.+++++.. +++ ..+.+|+++++|+| T Consensus 324 ~~~yyd~i~~~Qe~~l~p~l~~l~~li----~~s--~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r 397 (435) T protein:vir:79 324 LETFYKLIDRKRVEDYKPILEFLLPFM----ISE--TEWSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINLKETR 397 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh----hcC--CCCeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHH Confidence 112222222222355677766654433 212 355666643 3344444432 122 23447899999999 Q ss_pred HHh-CCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCC Q lcl|NC_020081. 454 KEL-GYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDS 515 (552) Q Consensus 454 ~~~-gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (552) +.+ ...|.-| +.+.....+ + +.... +.+...+.|++. T Consensus 398 ~~L~~~~~~~~----~~~~~~~~~---------~---~~~d~---------~~~~~~e~g~~~ 435 (435) T protein:vir:79 398 DTLRSICPDLK----IMDNDNIEL---------P---EPEDL---------DPEPGQEGGLNK 435 (435) T ss_pred HHHHHhccccC----CCCcccccC---------C---ccccC---------CCCCCCCCCCCC Confidence 977 3222211 000000000 0 00000 000000111110 No 114 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.70 E-value=4.4e-17 Score=110.28 Aligned_cols=446 Identities=11% Similarity=0.016 Sum_probs=208.1 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccc-cccccccCCccccccc-C-CCCchH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEP-IIGSMSMNPDFKEAPS-I-HGKQNL 77 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-~-~~~~~~ 77 (552) ||++| |.-+-.= -... -+...++- .. .+|.-. ......|++....... + .....+ T Consensus 1 mn~~d-r~i~~~s--------P~~~---~~R~~ar~------~~----~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~l 58 (502) T protein:vir:79 1 MAILD-DVIGVFS--------PGWK---AARLRSRA------VI----QAYEAVKTTRTHKARRENRTADQLSQYGAVSL 58 (502) T ss_pred CchHh-hHHhhcC--------hHHH---HHHHhhHH------HH----hhccccCcccccCCCCCCCChHHHHHHHHHHH Confidence 77776 2111100 0000 00000000 00 011110 0112233322111000 0 000111 Q ss_pred HHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcccc-ceeeeec--cccccCChhHHHHHHHHHH-HHHhcCCCCCC Q lcl|NC_020081. 78 LQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGV-GYEIRLK--DPLQEPNDHNKKKIKEIEN-FIEKTGRIDND 153 (552) Q Consensus 78 ~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~-~~~i~~k--~~~~~~~~~~~~~~~~l~~-~l~~~n~~~~p 153 (552) ..--|.+..++.+...++....+ +.-|. |+.+..+ ..+....++-.+++..+.+ |.+. ... T Consensus 59 r~RaRdl~rNn~~a~~av~~~~~-----------nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~----~D~ 123 (502) T protein:vir:79 59 REQARYLDNNHDLVIGVFDKLEE-----------RVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVS----PEV 123 (502) T ss_pred HHHHHHHHhcChHHHHHHHHHHH-----------hhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcC----cCc Confidence 12224555566666665544332 23333 3433322 2222222222333333322 2222 234 Q ss_pred CccCCHHHHHHHHHHHHHhcCCeeEEEEECCCC-------CEEEEEEecCceeEEE------------ECCCcccccccc Q lcl|NC_020081. 154 FTRDNFRSFVKKLVRDRLTYDKINFELVYDKLG-------DLHNFKAVDASTVYVA------------VDEDGKERKAKD 214 (552) Q Consensus 154 n~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G-------~~~~L~~l~p~~v~v~------------~~~~g~~~~~~~ 214 (552) ...++++.+...+++.++..|.+|+.+++...+ .+..|..|+|++|..- .|..|+ T Consensus 124 ~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr------ 197 (502) T protein:vir:79 124 TGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTPSAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGR------ 197 (502) T ss_pred cccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCCCcccceEEEEecchhcCCCCCCCCeeEeeeEECCCCc------ Confidence 457899999999999999999999999875432 4678999999888532 222222 Q ss_pred eeEEEEEc------CCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Q lcl|NC_020081. 215 GVRYVQVI------DDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLH 288 (552) Q Consensus 215 ~~~y~~~~------~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 288 (552) .+.|.... .......+++++|||+...-+ .....|+|.+..++..+............--+=.+...++|+ T Consensus 198 ~~aY~i~~~hPgd~~~~~~~rvpA~~vlH~f~~~r---~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~ 274 (502) T protein:vir:79 198 PEKYLVYKSRPVSGRQMETKEVDAERMLHLKFVRR---LHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIR 274 (502) T ss_pred eEEEEEeecCCCCCcccceeEechhheEEeecccC---CccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeee Confidence 12233221 123346789999999876543 234679999999988887777666666555555666777777 Q ss_pred eCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHh-ccc Q lcl|NC_020081. 289 IKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEI-NFP 367 (552) Q Consensus 289 ~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~ 367 (552) .+.+.....+... ..-....... ..|.+...+..|.+++....+.....|.+..+...+.||+.+|||-+.| |+. T Consensus 275 ~~~~~~~~~~~~~---~~~~~~~~~l-~pG~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~ 350 (502) T protein:vir:79 275 KGDGQSYEPDGNG---SKENERELTI-QPGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNY 350 (502) T ss_pred cCCCcccccccCC---CCCccccccc-cCCccccccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccc Confidence 5432211111000 0000000011 1343322234577877777665667888999999999999999998887 322 Q ss_pred ccccccccccccccchhHHHHH-----------HHHHHHHhhHHHH-HHHHHHHhhcCc--cc-c--cceeecc-----c Q lcl|NC_020081. 368 NRGGATGHSGNTLNEGSSAEKY-----------RNSKDKGLEPLLK-FIEDAVNKYIVS--QF-G--GDYVFNF-----V 425 (552) Q Consensus 368 ~~~t~~~~~~~~~~~~n~e~~~-----------~~~~~~~l~P~~~-~ie~~ln~~L~~--~~-~--~~~~~~f-----~ 425 (552) +.||+++.... ..++...++|+.+ +++.++-.-.++ .+ . ..+.+.| . T Consensus 351 -----------s~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~ 419 (502) T protein:vir:79 351 -----------NGTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMP 419 (502) T ss_pred -----------cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCcc Confidence 12555554433 3445556666544 355555443332 11 1 1123333 2 Q ss_pred ccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCC-C Q lcl|NC_020081. 426 GGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGY-D 504 (552) Q Consensus 426 ~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 504 (552) ..|+...... ....+.+|..|.-++-++.|.+|-+--+.+. ........ .+.... .++.. . T Consensus 420 ~iDP~Ke~~a--~~~~i~~Gl~t~~~~~a~~G~D~~~v~~q~a-----~e~~~~~~-~Gl~~~----------~~~~~~~ 481 (502) T protein:vir:79 420 WIDPVKEAEA--WKIQIRGGAATESDWVRAGGRNPDDVKRRRK-----AEIDENRK-LDLVFD----------TDPASDK 481 (502) T ss_pred ccChHHHHHH--HHHHHHcCCCCHHHHHHHcCCCHHHHHHHHH-----HHHHHHHH-cCCCCC----------CCCCCCC Confidence 3465544332 2334567999999999999998843211100 00000000 000000 00000 0 Q ss_pred CCCCCCCCCCCcccccCCCCcc Q lcl|NC_020081. 505 GNMDNVNGKDSFNQNVGKDGQS 526 (552) Q Consensus 505 ~~~~~~~~~~~~~~~~~~~~~~ 526 (552) ..++.+..+++++++ +.++++ T Consensus 482 ~~~~~~~~~~e~~~~-~~~~e~ 502 (502) T protein:vir:79 482 GGSSAATKRQEPQHT-DDQSEE 502 (502) T ss_pred CCCCCCCCCCCCCCC-CCCCCC Confidence 000000000000000 000000 No 115 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.69 E-value=2.1e-16 Score=106.59 Aligned_cols=397 Identities=12% Similarity=0.131 Sum_probs=182.7 Q ss_pred cccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeee Q lcl|NC_020081. 43 TKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRL 122 (552) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~ 122 (552) =|....-.|..-+.+. ..+. ..+...... -.+.+..++.+ ++.+.+|.+.++.+.+. ++.|.- T Consensus 1 ~~~~~~d~~~~~~~~~---~~~~-~~~~~~~~~-~~~l~a~Y~~~-~l~~~~Vd~~aed~~r~-----------g~~i~g 63 (427) T protein:vir:10 1 MKIVKHDGYNDIFNGG---ADGS-PKPFFMSDA-SYHVGSFYNDN-ATAKRIVDVIPEEMVTA-----------GFKMSG 63 (427) T ss_pred CCccccchHHHHhhcC---CCCc-ccCccccCc-hHHHHHHHHcC-chhhhhhccchHHhhcC-----------CccccC Confidence 0111111111101110 0010 011111111 12445556654 55555555554443221 233311 Q ss_pred ccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEE----------CCCCCEEEEE Q lcl|NC_020081. 123 KDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVY----------DKLGDLHNFK 192 (552) Q Consensus 123 k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r----------~~~G~~~~L~ 192 (552) +.+.++ ++..+.+++ ..+-+...+...-++|.+++++.- +..|.+.+|. T Consensus 64 --------~~~~~~---~~~~~~~l~----------~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~ 122 (427) T protein:vir:10 64 --------VKDEKE---FKSLWDSYK----------LDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVR 122 (427) T ss_pred --------ccHHHH---HHHHHHHhh----------HHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEE Confidence 111222 333333332 223455556667788999988742 3467889999 Q ss_pred EecCceeEEEECCC-cccccccceeEEEEEcC-CceEEEEcccceeeecccccCC---ccCCcccccHHH-HHHHHHHHH Q lcl|NC_020081. 193 AVDASTVYVAVDED-GKERKAKDGVRYVQVID-DKVVAKFKAKEMAWEVSNPRTD---LTVGKYGYPELE-IALNHLQYH 266 (552) Q Consensus 193 ~l~p~~v~v~~~~~-g~~~~~~~~~~y~~~~~-~~~~~~~~~~evi~~~~~~~~~---~~~g~~G~spl~-~~~~~i~~~ 266 (552) ++++..|++..... -.-.....+..|.+... +.....+.++.+||+.-.+.++ +...++|.|++. .+...|... T Consensus 123 v~d~~~~~~~~~~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~ 202 (427) T protein:vir:10 123 VYDRFAITVEKRVTNARSPRYGEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDY 202 (427) T ss_pred EechhcccccccccCccccccCcceEEEEecCCCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHH Confidence 99998887632211 00001112233433332 2234678888888774332111 234578999996 467888888 Q ss_pred HHHHHHHHHHHhccCCCceEEEeCCCCCC--CHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHH Q lcl|NC_020081. 267 DNTEVFNARFFAQGGTTRGLLHIKTGQEQ--SNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEK 344 (552) Q Consensus 267 ~~~~~~~~~~f~ng~~p~gil~~~~~~~~--s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e 344 (552) ..+.......|...... ++++++-..+ +.+....+++.+........+.+. .++.+++-+|+.++.+..+ +.+ T Consensus 203 ~~~~~~~~~l~~k~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~-~~l~~~~e~~e~~~~~lsg--l~~ 277 (427) T protein:vir:10 203 DYCESLATQILRRKQQA--VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRA-IGIDAETEEYDVLNSDISG--VPE 277 (427) T ss_pred HHHHHHHHHHHHHhccc--cccchhHHHHhcCccchHHHHHHHHHHHHhcCcccc-eeeecCCCceeEEecccCC--hHH Confidence 88888877766654432 3444321110 111112233333332222222333 3455556778777766654 346 Q ss_pred HHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecc Q lcl|NC_020081. 345 WLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNF 424 (552) Q Consensus 345 ~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f 424 (552) ......+.||++.+||...|--...+++.+++.. -..|.-+.....-+..|+|.+.++=..| ... .++.++| T Consensus 278 ~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~--D~~nyyd~i~~~Qe~~l~p~l~~l~~~i----~~s--~~~~~~f 349 (427) T protein:vir:10 278 FLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNT--ALETFYKLVDRKREEDYRPLLEFLLPFI----VDE--EEWSIEF 349 (427) T ss_pred HHHHHHHHHHhhhCCCeeeeccCCccccccchhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHh----hcC--CCcEEEe Confidence 7788899999999999997732333333222111 1112222222222355777766654433 222 3456666 Q ss_pred ccc---ChHHHHHHH----HHH-HHHhcCCcCHHHHHHHh----CCCCCCCCCeeeccccccchhhhccccccccccCCC Q lcl|NC_020081. 425 VGG---DAKTEAEII----SIL-ESKAKIGLTINDIRKEL----GYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMD 492 (552) Q Consensus 425 ~~~---d~~~~~~~~----~~~-~~~~~g~lT~NE~R~~~----gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~ 492 (552) ... +.+++++.. +++ ....+|+++++|+|+.+ +...+.+++.. .. T Consensus 350 ~pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~e~r~~L~~~~~~~~~~~~~~~-------~~---------------- 406 (427) T protein:vir:10 350 EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNI-------NI---------------- 406 (427) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhhhccccCCCCccc-------cc---------------- Confidence 433 334443321 222 23346899999999877 23333222110 00 Q ss_pred CCccCcccCCCCCCCCCCCCCCCcccc Q lcl|NC_020081. 493 ANQFLAQQTGYDGNMDNVNGKDSFNQN 519 (552) Q Consensus 493 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (552) .+++.....+.+.+++..++| T Consensus 407 ------e~~~~~~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 407 ------REPEETTEPEPGLGEKLEDEN 427 (427) T ss_pred ------cccchhcCCCCCCCCCCCCCC Confidence 000000001111111111111 No 116 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.69 E-value=3.8e-15 Score=99.63 Aligned_cols=502 Identities=13% Similarity=0.087 Sum_probs=203.7 Q ss_pred CCCCCCCc-ccccchhhc---------ccccCcccccccccchhhhhccccccccccc-----ccccccc---ccc---- Q lcl|NC_020081. 1 MGLLDGFF-KGRKQQDNI---------IDINDDMAVRIKQIEEDAILKKGKNTKSNKP-----KAYEEPI---IGS---- 58 (552) Q Consensus 1 ~~~~~~~~-~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~---~~~---- 58 (552) |=++.=-| |+++--.-- +.-..+...+++-. .+-.+.+++.+.+ +-+.+|. .+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~ 76 (765) T protein:vir:96 1 MFKLSWIFGRKKDNAACSESAPEKVARIPQHDPLDPMIKLG----KIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYG 76 (765) T ss_pred CceeeeecccccccccccccCchhhhhcCCCCCcccchhHH----HHhhcccccccCCCCCCCCcccCcccceecccccc Confidence 33222111 111110000 11111111111111 0101111111111 0111111 000 Q ss_pred ---------cc--cCCcccccc--cC-CC-CchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeec Q lcl|NC_020081. 59 ---------MS--MNPDFKEAP--SI-HG-KQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLK 123 (552) Q Consensus 59 ---------~~--~~~~~~~~~--~~-~~-~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k 123 (552) +. .++...... .. .+ .-.-.+.+..++. +++.+.+|.+.++...+ .++.|.. T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~alY~~-~~l~rkiVd~pAeDa~R-----------~g~~I~~- 143 (765) T protein:vir:96 77 DGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQACAIISQ-HWLVDKACSMSGEDAAR-----------NGWELKS- 143 (765) T ss_pred ccccchHHHhhhccCccchhhHHHhhhcccCCccHHHHHHHHh-CchhhhhhhcchHHhhc-----------CCceeec- Confidence 00 000000000 00 00 0000234444444 34445555544433221 1233321 Q ss_pred cccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEE---C-------------CCCC Q lcl|NC_020081. 124 DPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVY---D-------------KLGD 187 (552) Q Consensus 124 ~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r---~-------------~~G~ 187 (552) ...+..+ ..+..++..+.+++ ..+-+...+...-++|.+|+++.- | ..|. T Consensus 144 -~~~e~~~---~~~~~l~~~~~rl~----------v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~kg~ 209 (765) T protein:vir:96 144 -DGRKLSD---EQSALIARRDMEFR----------VKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAPGS 209 (765) T ss_pred -CccccCH---HHHHHHHHHHHHhh----------HHHHHHHHHHHhhhceeeEEEEEecccCcchhhccccccccccce Confidence 1122222 23344555555543 234455556667788888877653 2 1235 Q ss_pred EEEEEEecCceeEEEEC----CCcccccccceeEEEEEcCCceEEEEcccceeeecccccCC---ccCCcccccHHHHHH Q lcl|NC_020081. 188 LHNFKAVDASTVYVAVD----EDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTD---LTVGKYGYPELEIAL 260 (552) Q Consensus 188 ~~~L~~l~p~~v~v~~~----~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~---~~~g~~G~spl~~~~ 260 (552) +.+|..++|..+.+... .+-.-.....+..|.. .+ ..+.++.||++.-....+ +..+++|.|.++.+. T Consensus 210 ~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~i--~g---~~IH~SRli~~~g~~lpd~lk~~~~~~G~Svlq~~y 284 (765) T protein:vir:96 210 YKGISQIDPYWAMPQLTAESTADPSAEHFYEPDFWII--SG---KKYHRSHLVVVRGPQPPDILKPTYIFGGIPLTQRIY 284 (765) T ss_pred eeEEEEechhhcccccchhccccccccccCcceeeee--cC---ceeccceEEEecCCCchhhhccccCccCccHHHHHH Confidence 66788888876665321 1110011111222222 11 356778888764332211 223468999999999 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHH Q lcl|NC_020081. 261 NHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDM 340 (552) Q Consensus 261 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~ 340 (552) ..|.....+......++...... +++++....+..+ +.+++.++......+|.| +.++ +.+-+|+.++.+..+ T Consensus 285 d~I~~~~~t~~~~a~Ll~k~~~~--v~k~~~~~~l~~~--~~l~~r~~~~~~~r~n~g-~~~i-d~ee~~e~~s~~lsg- 357 (765) T protein:vir:96 285 ERVYAAERTANEAPLLAMSKRTS--TIHVDVEKAIANE--DAFNARLAFWIANRDNHG-VKVI-GIDETMEQFDTNLSD- 357 (765) T ss_pred HHHHHHHHHHHHHHHHHHHhccc--eeeechHhhhccH--HHHHHHHHHHHHhcCCce-eEEe-cCCcceeEEecccCC- Confidence 99999999998888887766543 4555443222222 234444444444433433 3344 456788887776665 Q ss_pred HHHHHHHHHHHHHHHHhcCCHHHh-ccccccccccc-ccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccccc Q lcl|NC_020081. 341 EFEKWLNYLINVICSIYSIDPSEI-NFPNRGGATGH-SGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGG 418 (552) Q Consensus 341 q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~~-~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~ 418 (552) +.+......+.||++.+||...| |... +++..+ .+.. .|.-+.....-+..|+|.++++-..|-.. ..... T Consensus 358 -l~d~l~~~~~~iAaas~IP~t~LfGqsp-~GlnATGe~D~---~nYyD~I~s~Qe~~l~p~le~L~~li~~s--~~i~~ 430 (765) T protein:vir:96 358 -FDSVIMNQYQLVAAIAKTPATKLLGTSP-KGFNATGEHET---ISYHEELESIQEHIFDPLLERHYLLLAKS--ESIDV 430 (765) T ss_pred -HHHHHHHHHHHHHhhhCCCeeeeccCCc-ccccCcchHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCCCC Confidence 44777888999999999999666 5432 222111 1111 11222222222355677776666555432 12234 Q ss_pred ceeeccccc---ChHHHHHHH----HHH-HHHhcCCcCHHHHHHHhCCCCCCCC------Ce----eeccccccchhhhc Q lcl|NC_020081. 419 DYVFNFVGG---DAKTEAEII----SIL-ESKAKIGLTINDIRKELGYPDTEGG------DV----TLAGVHVQRLGQIM 480 (552) Q Consensus 419 ~~~~~f~~~---d~~~~~~~~----~~~-~~~~~g~lT~NE~R~~~gl~p~~gg------D~----~~~~~n~~~~~~~~ 480 (552) ++.++|... +.+++++.. +++ ..+.+|+++++|+|+.+..+|.-|. +. .+.+.+........ T Consensus 431 d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe~~~~~~~~~ 510 (765) T protein:vir:96 431 QLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPENLAELEKAG 510 (765) T ss_pred cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCccccccccCCCccccccccCCC Confidence 566776543 344444432 222 2344789999999999876554221 11 11111111111111 Q ss_pred cccccccccCCCC-CccCcccCCCCCCCCCCCCCC----CcccccCCCCccccccc--cccccc-cCccccccccccccC Q lcl|NC_020081. 481 QQEQVEYQRQMDA-NQFLAQQTGYDGNMDNVNGKD----SFNQNVGKDGQSKQQAN--TNSTPQ-GGKDDNGNVVNDWEA 552 (552) Q Consensus 481 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~--~~~~~~-~~~~~~~~~~~~~~~ 552 (552) ............. .++.......+.....|.+.. ...+..+....++..++ ....++ .+....-++.++|.. T Consensus 511 ~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~~~~~p~~~~p~~~~~~~~~~~~~~~~~~~~~a~ 590 (765) T protein:vir:96 511 AQSAKAKGEAERAEAQAGAVEGAGDPVPAAPRGTKPLAKAAEEGAGEAATPPSRPNPRAELRNLLSDLLSKLEALDDAQA 590 (765) T ss_pred cccccccCccccccCCCCccCCCCcccccCCcccCCccccccccCccccCccccccccccchhcccchhhhhhccccccc Confidence 0000000000000 000000000000101111100 00011111111111111 111111 111222346666655 No 117 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=99.66 E-value=2.5e-14 Score=95.13 Aligned_cols=430 Identities=12% Similarity=0.079 Sum_probs=219.8 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCC-CchHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHG-KQNLLQ 79 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 79 (552) |..+=+. .|+ -+..+.+. +++... +. +....+..-|+.-- +..+.. T Consensus 1 m~~~~d~-~g~------------------p~~~~~~~---------~~~~~~--~~---~~~~~~~~~~~~gltp~~l~~ 47 (512) T protein:vir:19 1 MGRILDI-SGQ------------------PFDFDDEM---------QSRSDE--LA---MVMKRTQEHPSSGVTPNRAAQ 47 (512) T ss_pred CcceeCC-CCC------------------cccccccc---------ccccch--hc---ccchhhccccccCCCHHHHHH Confidence 2221110 111 11111111 111110 00 01111112222222 223345 Q ss_pred HHHHhhcchHHHHHHHHH-----HHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCC Q lcl|NC_020081. 80 MLKLWSRKNIILNAIIIT-----RVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDF 154 (552) Q Consensus 80 ~Lr~~a~~~~i~~a~i~~-----~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn 154 (552) .||... ..++.+.|-.. +-..+.-.+...+..+.+..|.|..-.. .+..+++....+..+|... T Consensus 48 iL~~a~-~gd~~~~~~L~~dm~~~D~hi~s~l~~Rk~av~~~~w~I~p~~~---~~~~~~~~a~~v~~~l~~~------- 116 (512) T protein:vir:19 48 MLRDAE-RGDLTAQADLAFDMEEKDTHLFSELSKRRLAIQALEWRIAPARD---ASAQEKKDADMLNEYLHDA------- 116 (512) T ss_pred HHHHhh-CCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhCCCceEecCCC---CCHHHHHHHHHHHHHHhcC------- Confidence 565443 33444433221 1222333333334445566677653221 1233444445555555421 Q ss_pred ccCCHHHHHHHHHHHHHhcCCeeEEEEECC---CCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEc Q lcl|NC_020081. 155 TRDNFRSFVKKLVRDRLTYDKINFELVYDK---LGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFK 231 (552) Q Consensus 155 ~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~---~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~ 231 (552) ..+.+++..++ +.+.+|-++++|++.. ...|..|.++|+..+....+..+.+ ++.. .......++ T Consensus 117 --~~f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~~f~~~~~~~~~l-------r~~~--~~~~G~~l~ 184 (512) T protein:vir:19 117 --AWFEDALFDAG-DAILKGYSMQEIEWGWLGKMRVPVALHHRDPALFCANPDNLNEL-------RLRD--ASYHGLELQ 184 (512) T ss_pred --CCHHHHHHHHH-hhhhhcceeeeeEeeeeCCceeeeeeeeeccccceeccCCCcEE-------EecC--CCCCceeec Confidence 13666777666 5788999999999843 3457889999998877643322222 1211 222234566 Q ss_pred ccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_020081. 232 AKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMF 311 (552) Q Consensus 232 ~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~ 311 (552) +...|+++++.++ ..+||.+.+..|.-....-....++...|...-|.|--+.+++.+ .++++++++.+.+.+.. T Consensus 185 ~~k~i~~~~~~~~---g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~--a~~~ek~~L~~al~~~~ 259 (512) T protein:vir:19 185 PFGWFMHRAKSRT---GYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPTG--STNREKATLMQAVMDIG 259 (512) T ss_pred CCceEEEeccCCC---CCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCCC--CCHHHHHHHHHHHHHHh Confidence 6666655554332 347999999999999999999999999999999999888777654 36778888888887753 Q ss_pred ccccccccceeeccCCceeeeccC-chhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHH Q lcl|NC_020081. 312 SGINGAWKIPVITAEDVKFVNMTQ-SSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYR 390 (552) Q Consensus 312 ~G~~nagk~~il~~~g~~~~~l~~-~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~ 390 (552) + +++ ++.+.|++++-+.. +.....|.+..++..++|+.+. ||.+-.+ ..+++.+++..+. .. T Consensus 260 ~---~a~---~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~i------LGqtlTs----~~g~~Gs~a~~~v-h~ 322 (512) T protein:vir:19 260 R---RAG---GIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAI------LGGTLTT----EAGDKGARSLGEV-HD 322 (512) T ss_pred h---CcE---EEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHH------hhhhhcc----cccccchhhHHHH-HH Confidence 2 222 22234554443332 2333558888999999999873 5543222 1122333454333 34 Q ss_pred HHHHHHhhHHHHHHHHHHHhhcCccc-----cc------ceeecccccChHHHHHHHHHHHHHhcCC-cCHHHHHHHhCC Q lcl|NC_020081. 391 NSKDKGLEPLLKFIEDAVNKYIVSQF-----GG------DYVFNFVGGDAKTEAEIISILESKAKIG-LTINDIRKELGY 458 (552) Q Consensus 391 ~~~~~~l~P~~~~ie~~ln~~L~~~~-----~~------~~~~~f~~~d~~~~~~~~~~~~~~~~g~-lT~NE~R~~~gl 458 (552) ......+...++.|+..||+.|+... +. .-+|.|...+.++.....+.+.....|+ ++..++|+.+|+ T Consensus 323 ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~~e~eDl~~~a~~~~~l~~G~~i~~~~i~e~~Gi 402 (512) T protein:vir:19 323 EVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFDTSEAGDITALSDAIPKLAAGMRIPVSWIQEKLHI 402 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCChhhHHHHHHHHHHHhcCCCCCHHHHHHHhCC Confidence 45677788999999999999887532 21 1256676666665555555444444564 799999999999 Q ss_pred CCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcccccccccccccc Q lcl|NC_020081. 459 PDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQG 538 (552) Q Consensus 459 ~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 538 (552) |....++..+.+.-. .+ .......... . .+.... .+..+. T Consensus 403 p~~~~~e~~~~~~~~-----------~~-~~~~~~~~~~------~-------~~~~~~----~~~~d~----------- 442 (512) T protein:vir:19 403 PQPVGDEAVFTIQPV-----------VP-DNGSQKEAAL------S-------AEDIPQ----EDDIDR----------- 442 (512) T ss_pred CCCCCccccccCCCc-----------cc-cccccccccc------c-------ccCCCc----hhhHhH----------- Confidence 755444433221000 00 0000000000 0 000000 000000 Q ss_pred CccccccccccccC Q lcl|NC_020081. 539 GKDDNGNVVNDWEA 552 (552) Q Consensus 539 ~~~~~~~~~~~~~~ 552 (552) -+....||+. T Consensus 443 ----~~~~~~~~~~ 452 (512) T protein:vir:19 443 ----MGVSPEDWQR 452 (512) T ss_pred ----HhhhHHHHHH Confidence 0011112222 No 118 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=99.64 E-value=1.4e-14 Score=96.59 Aligned_cols=424 Identities=11% Similarity=0.024 Sum_probs=203.3 Q ss_pred cccccchhhhhccccccccc-cccccc------cccccccccCCcc--cccccCCCCchHHHHHHHhhcchHHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSN-KPKAYE------EPIIGSMSMNPDF--KEAPSIHGKQNLLQMLKLWSRKNIILNAIIIT 97 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~-~~~~~~------~~~~~~~~~~~~~--~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~ 97 (552) |.+...+..=... .... .+.+.+ .++... .+.... ..++.... ....+..+.+.+ -.-+.+|+.. T Consensus 1 m~k~~~k~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~~~~~iLr~-~~~~~ly~~m~~-D~hi~s~l~~ 74 (448) T protein:vir:79 1 MAKRGRKPKELVP---GPGSIDPSDVPKLEGASVPVMST-SYDVVVDREFDELLQG-KDGLLVYHKMLS-DGTVKNALNY 74 (448) T ss_pred CCCCCCCCccccC---cccccccccchhhhhhhhhhccc-ccccccccchhHhhcc-ccchHHHHHHhh-ChHHHHHHHH Confidence 2111111100000 0000 000000 000100 000000 00111111 112233344433 2334555555 Q ss_pred HHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCee Q lcl|NC_020081. 98 RVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKIN 177 (552) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~ 177 (552) |... +.+..|.|... ..+..+.+....+..+|..+... ....+|.+++..++ +.+++|.++ T Consensus 75 Rk~a-----------v~~~~w~v~p~----~~~~~~~~~ae~v~~~l~~~~~~---~~~~~f~~~~~~~l-da~~~G~s~ 135 (448) T protein:vir:79 75 IFGR-----------IRSAKWYVEPA----STDPEDIAIAAFIHAQLGIDDAS---VGKYPFGRLFAIYE-NAYIYGMAA 135 (448) T ss_pred HHHH-----------HhcCCceEecC----CCCHHHHHHHHHHHHHhhhhhhh---hccCCHHHHHHHHH-Hhhhhccee Confidence 4443 34566777432 22334444445566666543211 12346667776665 578999999 Q ss_pred EEEEEC--CCCC--EEEEEEecCceeE-EEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCccc Q lcl|NC_020081. 178 FELVYD--KLGD--LHNFKAVDASTVY-VAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYG 252 (552) Q Consensus 178 ~~i~r~--~~G~--~~~L~~l~p~~v~-v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G 252 (552) +++++. .+|. +..|.+.++.++. +..+.++++........+.....+.....++..-++|+.+ .+ ...+|| T Consensus 136 ~Eivw~~~~~g~~~~~~l~~r~~~~~~~f~~~~d~~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~-~~---~g~p~g 211 (448) T protein:vir:79 136 GEIVLTLGADGKLILDKIVPIHPFNIDEVLYDEEGGPKALKLSGEVKGGSQFVSGLEIPIWKTVVFLH-ND---DGSFTG 211 (448) T ss_pred EEEEeeecCCCceecccccccCCccccceeeecCCceEEeecCCcccccccCCCccccccceEEEEec-Cc---cCCccc Confidence 999985 3564 4467777776442 2333444332211100000001111223446666777643 22 224799 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeee Q lcl|NC_020081. 253 YPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVN 332 (552) Q Consensus 253 ~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~ 332 (552) .+.+..|.-....-....++...|...-|.|--|.+++.+...++++++.+.+...+...|. +++ ++.+.|++++- T Consensus 212 ~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~-~a~---~iiP~~~~ie~ 287 (448) T protein:vir:79 212 QSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKP-RHG---IILPDDWKFDT 287 (448) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHhcCC-ceE---EEecCCceEEE Confidence 99999999999999999999999999999998888887666556667777777666544443 333 23345666555 Q ss_pred ccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc Q lcl|NC_020081. 333 MTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI 412 (552) Q Consensus 333 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L 412 (552) +.....-..+.+..++..++|+.+. ||.+-.+ ..++.+++...........+.+.-.++.|+..||+.| T Consensus 288 ~ea~~~~~~~~~~i~~~d~~Isk~i------LGqtlTs-----~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~l 356 (448) T protein:vir:79 288 VDLKSAMPDAIPYLTYHDAGIARAL------GIDFNTV-----QLNMGVQAINIGEFVSLTQQTIISLQREFASAVNLYL 356 (448) T ss_pred EecCCCcccHHHHHHHHHHHHHHHH------hhhhhcc-----ccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5444444456678888899988765 4543221 1111223333333334456677888999999999887 Q ss_pred Cccc-----c--cce-eecccccChHHHHHHHHHHHHHhc-CCcCHHHHHHHhCCCC-CCCCCeeeccccccchhhhccc Q lcl|NC_020081. 413 VSQF-----G--GDY-VFNFVGGDAKTEAEIISILESKAK-IGLTINDIRKELGYPD-TEGGDVTLAGVHVQRLGQIMQQ 482 (552) Q Consensus 413 ~~~~-----~--~~~-~~~f~~~d~~~~~~~~~~~~~~~~-g~lT~NE~R~~~gl~p-~~ggD~~~~~~n~~~~~~~~~~ 482 (552) ++.. + ..+ +|.|...+.++....++.+..... +-..-+-+|+.+|+|. .++. ....+. T Consensus 357 i~~l~~lNfg~~~~~P~~~f~~~e~~Dl~~~a~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~-~~~a~~----------- 424 (448) T protein:vir:79 357 IPKLVLPNWPSATRFPRLTFEMEERNDFSAAANLMGMLINAVKDSEDIPTELKALIDALPSK-MRRALG----------- 424 (448) T ss_pred HHHHHHhcCCCcCCCcEEEecCCChHHHHHHHHHhhhhhccchhhHHHHHHhhcCCCCCCCc-cccccC----------- Confidence 7532 2 122 566765555554444444333322 2222334677778763 3321 111000 Q ss_pred cccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCC Q lcl|NC_020081. 483 EQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGK 522 (552) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (552) ..........+++. +.. .-.+ .-. T Consensus 425 -------~~~~~~~~~~~~~~-~~~--~~~~------~~~ 448 (448) T protein:vir:79 425 -------VVDEVREAVRQPAD-SRY--LYTR------RRR 448 (448) T ss_pred -------CCCcccccccCCcc-ccc--hhhc------ccC Confidence 00000000000000 000 0000 000 No 119 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=99.60 E-value=5.2e-14 Score=93.43 Aligned_cols=427 Identities=11% Similarity=0.043 Sum_probs=205.5 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccc------cccccccccCCcc--cccccCC Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYE------EPIIGSMSMNPDF--KEAPSIH 72 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~--~~~~~~~ 72 (552) |-.+++.|+ ..+-+. .+. .+.+.+ .++.... +.... ..++... T Consensus 1 m~kk~~k~~------------------------~~~~~~--~~~--~~~~~~~~~~~~~~~~~~~-~~g~~~~~~~~iLr 51 (448) T protein:vir:77 1 MAKRGRKPK------------------------ELVPGP--GSI--DPSDVPKLEGASVPVMSTS-YDVVVDREFDELLQ 51 (448) T ss_pred CCCCCCCCc------------------------ccCCcc--ccc--chhhhhhhccchhhhcccc-cccccccchhHhhc Confidence 222221110 000000 000 000000 0001000 00000 0011111 Q ss_pred CCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCC Q lcl|NC_020081. 73 GKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDN 152 (552) Q Consensus 73 ~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~ 152 (552) . ....+..+.+.+. .-+.+|+..|... +.+..|.|.... .+..+.+....+..+|..+.. T Consensus 52 ~-~~~~~ly~~m~~D-~hi~s~l~~Rk~a-----------v~~~~w~v~p~~----~~~~d~~~ae~v~~~l~~~~~--- 111 (448) T protein:vir:77 52 G-KDGLLVYHKMLSD-GTVKNALNYIFGR-----------IRSAKWYVEPAS----TDPEDIAIAAFIHAQLGIDDA--- 111 (448) T ss_pred c-ccchHHHHHHhhC-hHHHHHHHHHHHH-----------HhcCCceEecCC----CCHHHHHHHHHHHHHhhchhh--- Confidence 1 1112333444332 3345555554443 345566664321 223333344445555543211 Q ss_pred CCccCCHHHHHHHHHHHHHhcCCeeEEEEEC--CCCC--EEEEEEecCceeE-EEECCCcccccccceeEEEEEcCCceE Q lcl|NC_020081. 153 DFTRDNFRSFVKKLVRDRLTYDKINFELVYD--KLGD--LHNFKAVDASTVY-VAVDEDGKERKAKDGVRYVQVIDDKVV 227 (552) Q Consensus 153 pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~--~~G~--~~~L~~l~p~~v~-v~~~~~g~~~~~~~~~~y~~~~~~~~~ 227 (552) .....+|.+++..++ +.+.+|-+++|+++. .+|. +..|.+.++.+++ ...+.++++........+.....+... T Consensus 112 ~~~~~~f~~~i~~~l-da~~~G~s~~Eivw~~~~dg~~~~~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~~~~~~~~~~~ 190 (448) T protein:vir:77 112 SVGKYPFGRLFAIYE-NAYIYGMAAGEIVLTLGADGKLILDKIVPIHPFNIDEVLYDEEGGPKALKLSGEVKGGSQFVNG 190 (448) T ss_pred hhccCCHHHHHHHHH-HhhhhcceeEEEEEeecCCCceeeccccccCCCccceeeeecCCceEEEecCCcccccccCCCc Confidence 122346778888775 789999999999985 3565 4467777776542 233444433221111001111111122 Q ss_pred EEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_020081. 228 AKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREW 307 (552) Q Consensus 228 ~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~ 307 (552) ..++..-++|+++. + ...++|.+.+..|.-....-....++...|...-|.|--+.+++.+...++++++.+.+.. T Consensus 191 ~~lP~~~~i~~~~~-~---~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av 266 (448) T protein:vir:77 191 LEIPIWKTVVFLHN-D---DGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIV 266 (448) T ss_pred cccccceEEEEecC-C---cCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHH Confidence 34566667776432 2 2347999999999998888899999999999999999888888766555666777777766 Q ss_pred HHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHH Q lcl|NC_020081. 308 TSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAE 387 (552) Q Consensus 308 ~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~ 387 (552) .+...|. +++ ++.+.|++++-+........+.+..++..++|+.+. ||.+-.+ ..++..++...+ T Consensus 267 ~~i~~g~-~a~---~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~~Isk~i------LGqtlTs-----~~~~g~~~~~~~ 331 (448) T protein:vir:77 267 KNFVQKP-RHG---IILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARAL------GIDFNTV-----QLNMGVQAVNIG 331 (448) T ss_pred HHHhcCC-ceE---EEecCCceEEEEecCCCccCHHHHHHHHHHHHHHHH------hcccccc-----ccccchhhhhhh Confidence 6543343 333 223456665544433333446678888889998875 4432211 111222333433 Q ss_pred HHHHHHHHHhhHHHHHHHHHHHhhcCccc-----c--cce-eecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCC Q lcl|NC_020081. 388 KYRNSKDKGLEPLLKFIEDAVNKYIVSQF-----G--GDY-VFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYP 459 (552) Q Consensus 388 ~~~~~~~~~l~P~~~~ie~~ln~~L~~~~-----~--~~~-~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~ 459 (552) .........+.-.++.|+..||+.|++.. + ..+ +|.|...+.++....++.+... .+-+|+.+|+| T Consensus 332 ~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lNfg~~~~~P~~~f~~~e~eDl~~~a~~~~~l------~~~~~~~~~ip 405 (448) T protein:vir:77 332 EFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPGATRFPRLTFEMEERNDFSAAANLMGML------INAVKDSEDIP 405 (448) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCCChhhHHHHHHHhHHH------HHHHHHHhcCC Confidence 33345567778899999999999887532 2 122 5677766666655544444332 25689999997 Q ss_pred CCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcccccccccccccc Q lcl|NC_020081. 460 DTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQG 538 (552) Q Consensus 460 p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 538 (552) .-.++.... .. ..+....+..+...+. ....-++ ....+-+-. T Consensus 406 ~~~~~~~~~-----------~~---------~~~~~~~~~~~~~~~~--~~~~~~~--------------~~~~~r~~~ 448 (448) T protein:vir:77 406 TELKALIDA-----------LP---------SKMRRALGVVDEVREA--VRQPADS--------------RYLYTRRRR 448 (448) T ss_pred ccCCcCCCC-----------Cc---------hhcccccCCCCCCCch--hhcchhh--------------HHHHhhhcC Confidence 422111000 00 0000000000000000 0000000 000000001 No 120 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=99.59 E-value=4.2e-13 Score=88.41 Aligned_cols=456 Identities=10% Similarity=0.046 Sum_probs=214.8 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCc-hHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQ-NLLQ 79 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 79 (552) |..+= +...+.+..+.++ ++++.. +.+ ....+..-|+..-++ .... T Consensus 1 ~~~~~-------------------d~~g~p~~~~~~~---------~~~~~~--~~~---~~~~~~~~~~~gltp~~l~~ 47 (528) T protein:vir:10 1 MAAIV-------------------DIYGNPLRTQQLR---------KQQTAH--LAG---LAKEFANHPAKGLTPAKLAH 47 (528) T ss_pred CCeeE-------------------CCCCCcccccccc---------chhhhh--hhh---hhhhhcccCCCCCCHHHHHH Confidence 21111 1111111111111 111110 000 011112223322222 2334 Q ss_pred HHHHhhcchHHHHHHHHH-----HHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCC Q lcl|NC_020081. 80 MLKLWSRKNIILNAIIIT-----RVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDF 154 (552) Q Consensus 80 ~Lr~~a~~~~i~~a~i~~-----~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn 154 (552) .|+... ..++.+.+-.. +-..++-.+...+..+.+..|.|.....+ +..+++....+..+|... T Consensus 48 il~~a~-~gd~~~~~~L~~~m~e~D~~i~s~l~~Rk~av~~~~w~I~p~~~~---~~~~~~~a~~v~~~l~~~------- 116 (528) T protein:vir:10 48 ILIEAE-QGHLQAQAELFMDMEERDAHLFAEMSKRKRAVLGLDWTIEPPRNA---SAAEKADAEYLHELLLDL------- 116 (528) T ss_pred HHHhhh-CCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhcCCceEecCCCC---CHHHHHHHHHHHHHHhCC------- Confidence 554433 33333332211 12223333334444455667777543222 122333444455555321 Q ss_pred ccCCHHHHHHHHHHHHHhcCCeeEEEEECCCC---CEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEc Q lcl|NC_020081. 155 TRDNFRSFVKKLVRDRLTYDKINFELVYDKLG---DLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFK 231 (552) Q Consensus 155 ~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G---~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~ 231 (552) ..+.+++..++ +.+++|.++++++|...| .|..|.++|+.++.+.. +++.. +...........++ T Consensus 117 --~~f~~~i~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~~f~~~~--~~~~~-------l~~~~~~~~g~~l~ 184 (528) T protein:vir:10 117 --EGIEDLMLDCM-DGVGHGYSAIELDWSLQGREWLPQAFDHRPQSWFQLNP--DDQDE-------LRLRDNSIAGEVLQ 184 (528) T ss_pred --ccHHHHHHHHH-hhhhhcceeEEEEEeecCCceeEEEeeeecccceeecc--CCCcE-------EeccCCCCCceeec Confidence 12555555554 567799999999986543 46789999998776533 23221 11111222234566 Q ss_pred ccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_020081. 232 AKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMF 311 (552) Q Consensus 232 ~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~ 311 (552) +...+++++..+ ...+||.+.+..|.-....-....++...|...-|.|--+.+++.+ .++++++++.+.+.+.. T Consensus 185 ~~k~iv~~~~~~---~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~--a~~~ek~~L~~al~~i~ 259 (528) T protein:vir:10 185 PFGWIMHKPRSR---SGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPPG--TPDEEKVTLLRAVTGLG 259 (528) T ss_pred CCCeEEEeecCC---CCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCC--CCHHHHHHHHHHHHHHh Confidence 666555554332 2337899999999999999999999999999999999888887654 37888888888887653 Q ss_pred ccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHH Q lcl|NC_020081. 312 SGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRN 391 (552) Q Consensus 312 ~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~ 391 (552) ++ ++ ..|..+..+++...+.. .-..|.+..++..++|+.+. ||.+-.+. +..+...+++-.+.. .. T Consensus 260 ~~---~~-~iiP~~~~ie~~ea~~~-~~~~f~~li~~~d~~Isk~i------LGqtlTs~--~~~g~~gS~Alg~vh-~~ 325 (528) T protein:vir:10 260 HA---AA-GIIPESMSIDFQEASKG-SAEPFMAMMRWCDDSMSKAI------LGGTLTSQ--TSESGGGAYALGQVH-NE 325 (528) T ss_pred hC---cE-EEecCCceeEEeecCCC-ChhHHHHHHHHHHHHHHHHH------hhhhhhcc--ccccccchhhhHHHH-HH Confidence 32 22 12333334455544332 23457888899999998875 44322110 011112233333333 33 Q ss_pred HHHHHhhHHHHHHHHHHHhhcCcc-----ccc------ceeecccccChHHHHHHHHHHHHH-hcCC-cCHHHHHHHhCC Q lcl|NC_020081. 392 SKDKGLEPLLKFIEDAVNKYIVSQ-----FGG------DYVFNFVGGDAKTEAEIISILESK-AKIG-LTINDIRKELGY 458 (552) Q Consensus 392 ~~~~~l~P~~~~ie~~ln~~L~~~-----~~~------~~~~~f~~~d~~~~~~~~~~~~~~-~~g~-lT~NE~R~~~gl 458 (552) .....+.-.++.|+..||+.|+.. ++. .-+|.|.....++....++..+.. ..|+ ++..++|+.+|+ T Consensus 326 v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gi 405 (528) T protein:vir:10 326 VRHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLKDRADLAAMATSLPPLVKLGVQVPVNWVQEQLGI 405 (528) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCCcccHHHHHHHHHHHHhCCCCCCHHHHHHHhCC Confidence 456677888999999999877542 111 125566555544433333333322 3465 799999999999 Q ss_pred CCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccccccc----- Q lcl|NC_020081. 459 PDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTN----- 533 (552) Q Consensus 459 ~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 533 (552) |.-..|+.++.+....+ +.+.... ++..... . ...........+.-++...... T Consensus 406 p~p~~~e~~~~~~~~~~-----------------~~~~~~~-~~~~~~~--~-~~~~~~~~~~~~~~d~~~~~~~~~~~~ 464 (528) T protein:vir:10 406 PLPANGEAVLGDQAGAG-----------------IAQLSRR-PGPRIAA--L-AQVIGPRYRDQEALDQVLASLPAQDMQ 464 (528) T ss_pred CCCCCCcccccCCCccc-----------------ccccCcc-ccccccc--c-cccccccccccchHHHHHHHHHHHHHH Confidence 87666655543211000 0000000 0000000 0 0000000000000000000000 Q ss_pred -ccc----c-cCccccccccccccC Q lcl|NC_020081. 534 -STP----Q-GGKDDNGNVVNDWEA 552 (552) Q Consensus 534 -~~~----~-~~~~~~~~~~~~~~~ 552 (552) ... + -.--++++...|++. T Consensus 465 ~~~~~~l~~i~~~l~~~~s~ee~~~ 489 (528) T protein:vir:10 465 NQADSLVAPLLDVISRGGSEAELLG 489 (528) T ss_pred HHHHHHHHHHHHHHHhcCCHHHHHH Confidence 000 0 000111222222222 No 121 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=99.53 E-value=2e-12 Score=84.73 Aligned_cols=427 Identities=11% Similarity=0.075 Sum_probs=202.1 Q ss_pred cccccchhhhhccccccccccccccccccccccccCC-cccccccCCCCchHHHHHHHhhcc---------hHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNP-DFKEAPSIHGKQNLLQMLKLWSRK---------NIILNAIII 96 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~Lr~~a~~---------~~i~~a~i~ 96 (552) |..++ ....-.-.... ...++......+.. .+...++..-.+.....|+..... ..-+.+|+. T Consensus 1 ~~~~i-----~~~~g~~~~~~--~~~~~~~~~ia~~~~~~~~~~~~~~~p~~~~il~~~~~~~~~y~~m~~D~~i~s~l~ 73 (491) T protein:vir:79 1 MSKGL-----WVSPTEFVKFG--EPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVYRELRADAHVGGCVR 73 (491) T ss_pred CCCee-----eCCCCCccccc--ccchhHHHHHhhhccccccccccccCcchhHHHhhccCCHHHHHHHhhChHHHHHHH Confidence 22221 10000000000 00000000000000 000000101011111233322221 122334443 Q ss_pred HHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCe Q lcl|NC_020081. 97 TRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKI 176 (552) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna 176 (552) .|. ..+.+..|.|...+.+ + +....+.+++.. ..+.+++..++ +.+.+|.+ T Consensus 74 ~Rk-----------~av~~~~w~i~~~~~~----~---~~a~~i~e~l~~----------~~~~~~i~~~l-da~~~G~s 124 (491) T protein:vir:79 74 RRK-----------AAVKALEWGLDRGKAK----S---RVAKSIADVFAD----------LDLSRIATEML-DAVLYGYQ 124 (491) T ss_pred HHH-----------HHHhCCCcEEecCCCC----H---HHHHHHHHHHhc----------CCHHHHHHHHH-Hhhhhcce Confidence 333 3344566776543221 1 122344455543 24667777665 67889999 Q ss_pred eEEEEECCCC---CEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccc Q lcl|NC_020081. 177 NFELVYDKLG---DLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGY 253 (552) Q Consensus 177 ~~~i~r~~~G---~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~ 253 (552) ++++++...| .|..|.++|+.++.+. .++++ .+....+......+++...|++++..++ ..+||. T Consensus 125 ~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d--~~~~l-------~l~~~~~~~~g~~lp~~k~i~~~~~~~~---g~p~g~ 192 (491) T protein:vir:79 125 PMEITWGKVGNYIVPIDVVGKPADWFVYD--PENQL-------RFRSKEHWVQGEELPARKFLVPRQEATY---LNPYGF 192 (491) T ss_pred eEEEEEeecCCeeeEEeeeeecccceeec--cCCce-------EEeecCCCCCceeecCCCeEEEEecCCC---CCcccc Confidence 9999986543 3678999999888753 34432 2332222233456677777766654332 238999 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeec Q lcl|NC_020081. 254 PELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNM 333 (552) Q Consensus 254 spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l 333 (552) |.+..|......-....++...|...-|.|--+.+++.+ .++++++++.+.+.+... +++ ..|..+..+++... T Consensus 193 gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~--a~~~ek~~l~~al~~~~~---~a~-~viP~~~~ie~~ea 266 (491) T protein:vir:79 193 PDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRS--ASDAETNLLLDRLEDMVQ---DAV-AVIPDDSSIEIKEA 266 (491) T ss_pred hhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCC--CCHHHHHHHHHHHHHHhc---CeE-EEecCCceeEEEec Confidence 999999999999999999999999999999888887654 377778888877776532 222 22333334455544 Q ss_pred cCc-hhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc Q lcl|NC_020081. 334 TQS-SKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI 412 (552) Q Consensus 334 ~~~-~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L 412 (552) +.. ..-..|.+..++..++|+.+. ||.+-.+. ++.+++..+.... .....+.-.++.|+..||+ | T Consensus 267 ~~~~g~~~~y~~li~~~d~~Isk~i------LGqtlTt~------~~gs~a~~~vh~~-v~~~i~~~D~~~i~~tln~-l 332 (491) T protein:vir:79 267 AGKSGSADVYERLLHFCRGEVSIAL------LGQNQTTE------ATSTRASAQAGLE-VTDDIRDGDKAIVVEAMNM-L 332 (491) T ss_pred cCCCCChhHHHHHHHHHHHHHHHHH------hhhhhccC------cccchhhHHHHHH-HHHHHHHHHHHHHHHHHHH-H Confidence 332 222347788888889988865 55432221 1234454444433 4566778889999999986 5 Q ss_pred Ccc-----cc--cceeecccccChHHHHHHHHHHHHH-hcCC-cCHHHHHHHhCCCCCCCCCeeeccccccchhhhcccc Q lcl|NC_020081. 413 VSQ-----FG--GDYVFNFVGGDAKTEAEIISILESK-AKIG-LTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQE 483 (552) Q Consensus 413 ~~~-----~~--~~~~~~f~~~d~~~~~~~~~~~~~~-~~g~-lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~ 483 (552) +.. +. ..++|.|....... ...++..+.. ..|+ ++..++|+.+|+|+-+.+...+.... T Consensus 333 i~~l~~~N~~~~~~p~f~~~e~ee~~-~~~a~~~~~L~~~G~~i~~~~~~e~~Gip~~~~~e~~~~~~~----------- 400 (491) T protein:vir:79 333 IRWICDLNFDGAARPVFDMWEQEQVD-EIQAGRDEKLTRAGARFTPAYFKRAYNLQDGDLDERPLPVSA----------- 400 (491) T ss_pred HHHHHHhcCCCCCcceEeecCcCchh-HHHHHHHHHHHhCCCccCHHHHHHHhCCCCCCCCccccCcCc----------- Confidence 432 11 23455554332211 1122333222 3365 79999999999987655543321100 Q ss_pred ccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccccc---cccc----ccc-CccccccccccccC Q lcl|NC_020081. 484 QVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQAN---TNST----PQG-GKDDNGNVVNDWEA 552 (552) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~----~~~-~~~~~~~~~~~~~~ 552 (552) +... ........ ..++ . ...|...+.... ..+. ++= .--.+++...|+.. T Consensus 401 --~~~~-~~~~~~~~------~~~~--~--------~~~d~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~s~~e~~~ 458 (491) T protein:vir:79 401 --VDAV-GAASFAEF------EAPD--Q--------DALDAALNALSARDLNADAQALVAPLLKRIANGASADELLG 458 (491) T ss_pred --cccc-cccccccc------CCCC--C--------cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHH Confidence 0000 00000000 0000 0 000000000000 0000 000 00000111111111 No 122 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=99.50 E-value=3.8e-12 Score=83.19 Aligned_cols=420 Identities=10% Similarity=0.037 Sum_probs=204.0 Q ss_pred cchhhhhccccccccccccccccccccccccCCcccccccCCCCchH--------HHHHHHhhcchHHHHHHHHHHHHHH Q lcl|NC_020081. 31 IEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNL--------LQMLKLWSRKNIILNAIIITRVNQV 102 (552) Q Consensus 31 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~Lr~~a~~~~i~~a~i~~~~~~~ 102 (552) +.+..+...+ +.. .+..+....|.. ....+.+.+ .+..+.+.+. .-+.+|+..|... T Consensus 1 v~~~~l~~e~---------at~---~~~~d~~~~~~~-~l~~~~~~il~~a~~g~~~~y~~l~~D-~~i~s~l~~rk~a- 65 (488) T protein:vir:99 1 MEKPALGREI---------ATS---GDGRDITRPFIS-GLQVPNDSILQRRGGNDLRVYEEILSD-AQVKTVWGQRQLA- 65 (488) T ss_pred CCccchhHHH---------HHH---HhhhhhhccccC-CCCCCChHHHHhhccCCHHHHHHHhhC-hHHHHHHHHHHHH- Confidence 2222222111 100 000000000100 001111111 1222223222 2234444444333 Q ss_pred HHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEE Q lcl|NC_020081. 103 SMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVY 182 (552) Q Consensus 103 ~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r 182 (552) +.+..|.|...+. +..+++....+..+|.. ..+.+++..++ +.+++|.+++++++ T Consensus 66 ----------v~~~~w~i~p~~~----~~~~~~~ae~v~~~l~~----------~~~~~~l~~~l-da~~~G~s~~Ei~w 120 (488) T protein:vir:99 66 ----------VVSREWKVEAGGD----RPIDQAAAEHLEQQLQR----------VGWDRVTSKML-FGVFYGYAVSELIY 120 (488) T ss_pred ----------HhcCCceEEcCCC----ChHHHHHHHHHHHHHhC----------CCHHHHHHHHH-hhhhhcceeEEEEE Confidence 3456677753321 23334444445555532 35677888777 57889999999998 Q ss_pred CCCC---CEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEccc-ceeeecccccCCccCCcccccHHHH Q lcl|NC_020081. 183 DKLG---DLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAK-EMAWEVSNPRTDLTVGKYGYPELEI 258 (552) Q Consensus 183 ~~~G---~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~-evi~~~~~~~~~~~~g~~G~spl~~ 258 (552) ...| .|..|.++|+.++.+ +.+++.. +....+......++.. ..+++++..+ ...+||.|.+.. T Consensus 121 ~~~~g~~~~~~l~~r~~~~f~~--d~~~~l~-------~~~~~~~~~g~~lp~~~~~i~~~~~~~---~g~p~g~gLl~~ 188 (488) T protein:vir:99 121 GRDDRYITLEAIKVRNRRRFRY--DQDGGLR-------LLTPNNMFEGEPCPAPYFWHFSTGADN---DDEPYGLGLAHW 188 (488) T ss_pred eecCCeeeEeeeeeecccceee--cCCCceE-------EeccCCCCCccccccCceEEEEeecCC---CCCcccchHHHH Confidence 6543 367899999987764 3444332 1111111122344322 3333333322 234899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchh Q lcl|NC_020081. 259 ALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSK 338 (552) Q Consensus 259 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~ 338 (552) |......-....++...|...-|.|--+.+++. ...++++++++.+.+.+..+ +++ ..|..+..+++...+.. . T Consensus 189 ~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~-~~a~~~ek~~l~~av~~~~~---~~~-~viP~~~~ie~~ea~~~-~ 262 (488) T protein:vir:99 189 LYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDD-KTATPEDKAKLLAALHAIQT---DSA-IIMPAGMQAELLEAGRS-G 262 (488) T ss_pred HHHHHHHHHhhHHHHHHHHHHcCCceeeeecCC-CCCCHHHHHHHHHHHHHHhc---CcE-EEecCCceeEEeecCCC-C Confidence 999998999999999999999999987777653 23467777888777766532 221 12333333445444332 2 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccc-- Q lcl|NC_020081. 339 DMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQF-- 416 (552) Q Consensus 339 d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~-- 416 (552) ...|.+..++..++|+.+ +||.+-.+ ..++.+++..+... ......+...++.|+..||+.|+..+ T Consensus 263 ~~~~~~li~~~d~~Isk~------iLGqtlts-----~~~~Gs~a~~~vh~-~v~~d~~~aDa~~i~~tln~~li~~l~~ 330 (488) T protein:vir:99 263 TADYKTLHDTMDATIAKV------GLGQVAST-----QGTPGRLGNDDLQA-DVRLDLVKADADLICESFNLGPARWLTE 330 (488) T ss_pred hHHHHHHHHHHHHHHHHH------Hhhhhhcc-----cccccchhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 245888889999999987 35543221 11222345444444 34577888999999999998776431 Q ss_pred ---cc-c-eeecccccChHHHHHHHHHHHHHhc--CC-cCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccc Q lcl|NC_020081. 417 ---GG-D-YVFNFVGGDAKTEAEIISILESKAK--IG-LTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQ 488 (552) Q Consensus 417 ---~~-~-~~~~f~~~d~~~~~~~~~~~~~~~~--g~-lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~ 488 (552) +. . -.|.|...+.++..+.++....... |. ++..++|+.+|+|+-..++....+.- .. T Consensus 331 ~N~~~~~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gip~~~~~~~~~~~~~--------------~~ 396 (488) T protein:vir:99 331 WNFPGAQPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYGVEVESTQAEATAPTP--------------ST 396 (488) T ss_pred hCcCCcCCceeEecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCCCCcccccccccCCC--------------cc Confidence 11 1 2455555555544444444443332 43 78899999999997654433221100 00 Q ss_pred cCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcccccccc--cccccc-CccccccccccccC Q lcl|NC_020081. 489 RQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANT--NSTPQG-GKDDNGNVVNDWEA 552 (552) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~-~~~~~~~~~~~~~~ 552 (552) ..+ ....+....+...+ .-.++-... ...++= .--.+++...|+.+ T Consensus 397 ~~~--------------~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~i~~~l~~a~s~ee~~~ 445 (488) T protein:vir:99 397 EFA--------------EGDQPSDPAAAMAP----QLAEAMQPVVGNWTTQLRTLIEQASSLEDLRE 445 (488) T ss_pred cCC--------------CCCCCCCchHHHHH----HHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHH Confidence 000 00000000000000 000000000 000000 00001111112222 No 123 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=99.50 E-value=5e-12 Score=82.55 Aligned_cols=450 Identities=9% Similarity=0.041 Sum_probs=214.7 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCC-CCchHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIH-GKQNLLQ 79 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 79 (552) |..+=+. .|+-. ....++ ++ ..+... +....+..-|+.- .+..+.. T Consensus 1 ~~~~~d~-~g~p~------------------~~~~~~---------~~---~~~~~~--~~~~~~~~~~~~gltp~~l~~ 47 (526) T protein:vir:79 1 MAQIVDV-YGNPI------------------RPQQLR---------EP---QTSRLA--GLAKEFAQHPAKGLTPAKLAR 47 (526) T ss_pred CCeeeCC-CCCcc------------------Cccccc---------hh---hhhhhh--hhhhhcccCCCCCcCHHHHHH Confidence 2111100 01100 000000 00 001010 0011122222222 1223445 Q ss_pred HHHHhhcchHHHHHHHHH-----HHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCC Q lcl|NC_020081. 80 MLKLWSRKNIILNAIIIT-----RVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDF 154 (552) Q Consensus 80 ~Lr~~a~~~~i~~a~i~~-----~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn 154 (552) .||... ..++.+.|-.. +-..+.-.+...+..+.+..|.|..-..+ +..+++....+..+|... T Consensus 48 il~~a~-~gd~~~~~~L~edm~e~D~~i~s~l~~Rk~av~~~~w~I~p~~~~---~~~~~~~a~~v~~~l~~~------- 116 (526) T protein:vir:79 48 ILVEAE-QGNLQAQAELFMDMEERDAHLFAEMSKRKRAILGLDWAVEPPRNA---SAAEKADADYLHELLLDL------- 116 (526) T ss_pred HHHHhh-CCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhCCCceEecCCCC---ChHHHHHHHHHHHHHhcc------- Confidence 666543 33433333221 12223333333444455666776543222 122334444555555321 Q ss_pred ccCCHHHHHHHHHHHHHhcCCeeEEEEECCCC---CEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEc Q lcl|NC_020081. 155 TRDNFRSFVKKLVRDRLTYDKINFELVYDKLG---DLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFK 231 (552) Q Consensus 155 ~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G---~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~ 231 (552) ..+.+++..++ +.+.+|-++.+|++...| .+..|.+.++.++.+..+....+ ++. .+......++ T Consensus 117 --~~~~~~i~~~l-dA~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~F~~~~~~~~~l-------~~~--~~~~~g~~l~ 184 (526) T protein:vir:79 117 --EGLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAFHHRPQSWFQLNPEDQNEL-------RLR--DNSPAGEALQ 184 (526) T ss_pred --cCHHHHHHHHH-hhhhhcceeEEEEEeecCCceeEEEeeeecccceEeccCCCcEE-------Eec--CCCCCceeec Confidence 13566776666 477899999999987643 47789999998777543332221 111 1222234566 Q ss_pred ccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_020081. 232 AKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMF 311 (552) Q Consensus 232 ~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~ 311 (552) +...|.+++..+ ...+||.+.+..|.-....-....++...|...-|.|--+.+++.+ .++++++++.+.+.+.. T Consensus 185 ~~k~iv~~~~~~---~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~~--a~~~ek~~L~~av~~i~ 259 (526) T protein:vir:79 185 PFGWIIHRPRAR---SGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPG--TADEEKATLLRAVTGLG 259 (526) T ss_pred CCceEEEeecCC---cCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCC--CCHHHHHHHHHHHHHHh Confidence 666555554332 2347999999999998888888999999999999999888887654 37778888888877653 Q ss_pred ccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHH Q lcl|NC_020081. 312 SGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRN 391 (552) Q Consensus 312 ~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~ 391 (552) + +++ ..|..+..+++...+. ..-..|.+..++..++|+.+. ||.+-.+.. ..+...+++..+... . T Consensus 260 ~---da~-~iiP~~~~ie~~ea~~-~~~~~f~~li~~~d~~Isk~i------LGqtlTs~~--~~g~~gS~a~g~vh~-~ 325 (526) T protein:vir:79 260 H---AAA-GIIPETMAIDFQQAAQ-GSSEPFLAMMRQSEDAISKAV------LGGTLTSTT--SQSGGGAFALGQVHN-E 325 (526) T ss_pred c---CcE-EEecCCceeEEeecCC-CCHHHHHHHHHHHHHHHHHHH------hhhhhcccc--ccCcchhhhhHHHHH-H Confidence 3 222 1233333344544432 233458888899999998874 453321100 011123344433333 3 Q ss_pred HHHHHhhHHHHHHHHHHHhhcCcc-----cc---c---ceeecccccChHHHHHHHHHHHHH-hcCC-cCHHHHHHHhCC Q lcl|NC_020081. 392 SKDKGLEPLLKFIEDAVNKYIVSQ-----FG---G---DYVFNFVGGDAKTEAEIISILESK-AKIG-LTINDIRKELGY 458 (552) Q Consensus 392 ~~~~~l~P~~~~ie~~ln~~L~~~-----~~---~---~~~~~f~~~d~~~~~~~~~~~~~~-~~g~-lT~NE~R~~~gl 458 (552) .....+.-.++.|+..||+.|+.. ++ . .-+|.|...+.++....++.+... ..|+ ++..++|+.+|+ T Consensus 326 v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gi 405 (526) T protein:vir:79 326 VRHDILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGI 405 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCcCCHHHHHHHhCC Confidence 456677889999999999877642 11 1 124566544444433333333322 3465 799999999999 Q ss_pred CCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcccccccccccc-- Q lcl|NC_020081. 459 PDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTP-- 536 (552) Q Consensus 459 ~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 536 (552) |....++.++.+..- +.......... .......... ..+..+.-+..-... T Consensus 406 p~~~~~e~~l~~~~~----------------~~~~~~~~~~~---~~~~~~~~~~--------~~~~~~~~d~~l~~~~~ 458 (526) T protein:vir:79 406 PQPAKNEPVLRPAAQ----------------PAILSRQHGQR---VAALATIVGP--------RYGDQQALDKALADLPA 458 (526) T ss_pred CCCCCchhhccccCC----------------ccccccccccc---cccccccccc--------cCchhhHHHHHHHHHHH Confidence 765555444322110 00000000000 0000000000 000000000000000 Q ss_pred ------------c-cCccccccccccccC Q lcl|NC_020081. 537 ------------Q-GGKDDNGNVVNDWEA 552 (552) Q Consensus 537 ------------~-~~~~~~~~~~~~~~~ 552 (552) + ..-...++...|+.. T Consensus 459 ~~~~~~~~~~~~~i~~~~~~~~s~ee~~~ 487 (526) T protein:vir:79 459 KDMQNQANDLLAPLLDAVNRGDSETELLG 487 (526) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCHHHHHH Confidence 0 000011111222222 No 124 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=99.49 E-value=6.1e-12 Score=82.08 Aligned_cols=450 Identities=9% Similarity=0.032 Sum_probs=215.8 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCc-hHHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQ-NLLQ 79 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 79 (552) |..+= +..++-+..+.+ .++++.. .. +....+...|+..-++ .+.. T Consensus 1 ~~~~~-------------------d~~g~p~~~~~~---------~~~~~~~---~~--~~~~~~~~~~~~gltp~~l~~ 47 (526) T protein:vir:99 1 MAQIV-------------------DVYGNPIRTQQL---------REPQTSR---LA--GLAKEFAQHPAKGLTPAKLAR 47 (526) T ss_pred CCeeE-------------------CCCCCccccccc---------cchhhhh---hh--hhhhhhcccCcCCCCHHHHHH Confidence 11111 111111111111 1111111 00 0111111222222222 3345 Q ss_pred HHHHhhcchHHHHHHHHH-----HHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCC Q lcl|NC_020081. 80 MLKLWSRKNIILNAIIIT-----RVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDF 154 (552) Q Consensus 80 ~Lr~~a~~~~i~~a~i~~-----~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn 154 (552) .||.... .++.+.|-.. +-..++-.+...+..+.+..|.|..-..+ +..+++....+..+|... T Consensus 48 iLr~a~~-gd~~~~~~L~e~m~e~D~~i~s~l~~Rk~av~~~~w~I~p~~~~---~~~~~~~a~~v~~~l~~~------- 116 (526) T protein:vir:99 48 ILVEAEQ-GNLQAQAELFMDMEERDAHLFAEMSKRKRAILGLDWAVEPPRNA---SAAEKADADYLHELLLDL------- 116 (526) T ss_pred HHHhhhC-CCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhCCCceEecCCCC---CHHHHHHHHHHHHHHhcc------- Confidence 5554433 2333332221 12223333333344455666766543222 122334444555555321 Q ss_pred ccCCHHHHHHHHHHHHHhcCCeeEEEEECCCC---CEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEc Q lcl|NC_020081. 155 TRDNFRSFVKKLVRDRLTYDKINFELVYDKLG---DLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFK 231 (552) Q Consensus 155 ~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G---~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~ 231 (552) ..+.+++..++ +.+.+|-++.++++...| .|..|.+.++.++.+..+....+ ++. .+......++ T Consensus 117 --~~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~f~~~~~~~~~l-------~~~--~~~~~g~~l~ 184 (526) T protein:vir:99 117 --EGLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAFHHRPQSWFQLNPEDQNEL-------RLR--DNSPAGEALQ 184 (526) T ss_pred --cCHHHHHHHHH-HhhhhcceeEEEEEeecCCceeEEEeeeecccceeeccCCCcEE-------Eec--CCCCCceeec Confidence 13666777666 578899999999986643 47789999998887643332221 111 1222234556 Q ss_pred ccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_020081. 232 AKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMF 311 (552) Q Consensus 232 ~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~ 311 (552) +...|.+++..+ ...+||.+.+..|.-....-....++...|...-|.|--+.+++.+ .++++++++.+.+.+.. T Consensus 185 ~~k~i~~~~~~~---~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~--a~~~ek~~L~~av~~i~ 259 (526) T protein:vir:99 185 PFGWIIHRPRAR---SGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPG--TADEEKATLLRAVTGLG 259 (526) T ss_pred CCCeEEEeecCC---cCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCC--CCHHHHHHHHHHHHHHh Confidence 665555544332 2347999999999999888888999999999999999888887654 37788888888887653 Q ss_pred ccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHH Q lcl|NC_020081. 312 SGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRN 391 (552) Q Consensus 312 ~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~ 391 (552) + +++ ..|..+..+++...+. ..-..|.+..++..++|+.+. ||.+-.+.. ..+...+++..+... . T Consensus 260 ~---d~~-~iiP~~~~ie~~ea~~-~~~~~f~~li~~~d~~Isk~i------LGqtlTs~~--~~g~~gS~a~g~vh~-~ 325 (526) T protein:vir:99 260 H---AAA-GIIPETMAIDFQQAAQ-GSSEPFLAMMRQSEDAISKAV------LGGTLTSTT--SQSGGGAFALGQVHN-E 325 (526) T ss_pred h---CcE-EEecCCceeEEeecCC-CCHHHHHHHHHHHHHHHHHHH------hhhhhcccc--ccCcchhhhHHHHHH-H Confidence 3 222 1233333344544432 233457888899999998875 443321100 011122344333333 3 Q ss_pred HHHHHhhHHHHHHHHHHHhhcCcc-----cc------cceeecccccChHHHHHHHHHHHHH-hcCC-cCHHHHHHHhCC Q lcl|NC_020081. 392 SKDKGLEPLLKFIEDAVNKYIVSQ-----FG------GDYVFNFVGGDAKTEAEIISILESK-AKIG-LTINDIRKELGY 458 (552) Q Consensus 392 ~~~~~l~P~~~~ie~~ln~~L~~~-----~~------~~~~~~f~~~d~~~~~~~~~~~~~~-~~g~-lT~NE~R~~~gl 458 (552) .....+.-.++.|+..||+.|+.. ++ ..-+|.|...+.++....++..+.. ..|+ ++..++|+.+|+ T Consensus 326 v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~Gi 405 (526) T protein:vir:99 326 VRHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGI 405 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCccCHHHHHHHhCC Confidence 456677788999999999877642 11 1124666554444433333333323 3465 799999999999 Q ss_pred CCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccccccccc--- Q lcl|NC_020081. 459 PDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNST--- 535 (552) Q Consensus 459 ~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 535 (552) |.-..++..+.+..-. ....... + ........ .... ..+.....+..-.. T Consensus 406 p~~~~~e~~l~~~~~~----------------~~~~~~~----~----~~~~~~~~-~~~~--~~~~~~~~d~~l~~~~~ 458 (526) T protein:vir:99 406 PQPAKNEPVLRSAAQP----------------AILSRQH----G----QRVAALAT-IVGP--RYGDQQALDKALADLPA 458 (526) T ss_pred CCCCCcccccCCCCCC----------------ccccccc----c----cccccccc-cccc--cCcchhhHHHHHHHHHH Confidence 7765555544221000 0000000 0 00000000 0000 00000000000000 Q ss_pred ----cc--------cCccccccccccccC Q lcl|NC_020081. 536 ----PQ--------GGKDDNGNVVNDWEA 552 (552) Q Consensus 536 ----~~--------~~~~~~~~~~~~~~~ 552 (552) .. -.-..+++...|+.. T Consensus 459 ~~~~~~~~~~l~~i~~~l~~~~s~ee~~~ 487 (526) T protein:vir:99 459 KDMQNQANDLLAPLLEAVNRGDSETELLG 487 (526) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCHHHHHH Confidence 00 000011111222222 No 125 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.48 E-value=7.5e-13 Score=87.04 Aligned_cols=452 Identities=10% Similarity=0.032 Sum_probs=202.4 Q ss_pred cccccchhhhhcccccc--cccc-ccccc--ccccccc-ccCCcccccccCCCCch-------HHHHHHHhhcchHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNT--KSNK-PKAYE--EPIIGSM-SMNPDFKEAPSIHGKQN-------LLQMLKLWSRKNIILNA 93 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~--~~~~-~~~~~--~~~~~~~-~~~~~~~~~~~~~~~~~-------~~~~Lr~~a~~~~i~~a 93 (552) |++......-..+.... +... ...|. .....++ .|.|. ..+.... +...-|.+..|+.+... T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~-----~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~ 75 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPS-----LRSPDALINPLKRIADARGRDMADNDGFTNG 75 (553) T ss_pred CcchhhhhhcccccccchhhhhhhcccccccccCCCcccccccC-----CCChHHHHHHHHHHHHHHHHHHHhcChHHHH Confidence 33222222111110000 0000 00110 0011111 22221 1111111 11222345556665555 Q ss_pred HHHHHHHHHHHHHHHHHhhccccceeeeeccccc---cCChhHHHH-HHHHHHHHH----hcCCCCCCCccCCHHHHHHH Q lcl|NC_020081. 94 IIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQ---EPNDHNKKK-IKEIENFIE----KTGRIDNDFTRDNFRSFVKK 165 (552) Q Consensus 94 ~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~---~~~~~~~~~-~~~l~~~l~----~~n~~~~pn~~~t~~~f~~~ 165 (552) ++... ..+.-|.|+.++.+-... ..+.+...+ .+.++..+. .++..-...-.++++.+... T Consensus 76 av~~~-----------~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l 144 (553) T protein:vir:63 76 AVGYQ-----------RDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRL 144 (553) T ss_pred HHHHH-----------HHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHH Confidence 55432 233345577665542211 112222221 122222222 22211123356789999999 Q ss_pred HHHHHHhcCCeeEEEEECCC-C--CEEEEEEecCceeEEEECC-Ccc------cccc-cceeEEEEEcC--C-------- Q lcl|NC_020081. 166 LVRDRLTYDKINFELVYDKL-G--DLHNFKAVDASTVYVAVDE-DGK------ERKA-KDGVRYVQVID--D-------- 224 (552) Q Consensus 166 ~v~d~ll~Gna~~~i~r~~~-G--~~~~L~~l~p~~v~v~~~~-~g~------~~~~-~~~~~y~~~~~--~-------- 224 (552) +++.++..|.+|+.+++... | .+..|..|+|+++..-.+. +|. .+.. ...+.|..... + T Consensus 145 ~~r~~~~dGE~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~ 224 (553) T protein:vir:63 145 GVVGYVKTGEVLATAEWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPD 224 (553) T ss_pred HHHHHHhCCceEEEeeeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCcccccccc Confidence 99999999999999886543 3 2467889999888543222 111 1100 01122322111 0 Q ss_pred c-------eEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCH Q lcl|NC_020081. 225 K-------VVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSN 297 (552) Q Consensus 225 ~-------~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~ 297 (552) . ....+++.+|||+...-+ ....-|+|.+..++..+......+.....--+=.+...++|+-+.+ ++ T Consensus 225 ~~~~~r~~~~~~v~a~~vlH~f~~~r---~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~---~~ 298 (553) T protein:vir:63 225 MYKWKFVQQSKPWGRRQVIHILEPRE---PDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELP---PE 298 (553) T ss_pred ccceeeeccccccChhHheecccccC---CCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCC---hh Confidence 0 122467889999876433 3346899999988888777766666555444445566667764321 12 Q ss_pred HHHHHHHH----------------HHHHHhccc----cccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHh Q lcl|NC_020081. 298 QALTSFRR----------------EWTSMFSGI----NGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIY 357 (552) Q Consensus 298 ~~~~~~~~----------------~~~~~~~G~----~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~f 357 (552) ...+.+.. .....+.|. -+.|.++.| ..|.+++.+..+-....|.+..+...+.||+.+ T Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L-~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaagl 377 (553) T protein:vir:63 299 FIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHL-FPGTKLNLKPMGTPGGVGSEFEASLNRHLASAF 377 (553) T ss_pred hhhhhcccccccccccccccccccccccccccccceeecCceeeec-CCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhc Confidence 11111110 000111110 124454433 457777777766556788899999999999999 Q ss_pred cCCHHHh-cccccccccccccccccchhHH-----------HHHHHHHHHHhhHHHHH-HHHHHHhhcC--cccccc--- Q lcl|NC_020081. 358 SIDPSEI-NFPNRGGATGHSGNTLNEGSSA-----------EKYRNSKDKGLEPLLKF-IEDAVNKYIV--SQFGGD--- 419 (552) Q Consensus 358 gVPp~~l-g~~~~~t~~~~~~~~~~~~n~e-----------~~~~~~~~~~l~P~~~~-ie~~ln~~L~--~~~~~~--- 419 (552) |||-+.| |+.. ..||+++. ..+..|+...++|+.+. +++++-...+ +..... T Consensus 378 Gi~Ye~lt~D~s----------~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~ 447 (553) T protein:vir:63 378 GMSYEEFTRDFS----------KANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLF 447 (553) T ss_pred CCCHHHHhhhcc----------cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhh Confidence 9999877 5433 24566553 33444556666775444 4555543222 211111 Q ss_pred ---------eeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhcccccc Q lcl|NC_020081. 420 ---------YVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQV 485 (552) Q Consensus 420 ---------~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~ 485 (552) ..++| ...|+...... ....+.+|..|.-|+-++.|.+|-+--+.+.. ....... .+. T Consensus 448 ~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A--~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~-----e~~~~~~-~Gl 519 (553) T protein:vir:63 448 YQPLMKEALSKCEWIGASQGQIDQLKETQA--AVMRIDAGLSTYEREIARLGGDFRKSFAQRAR-----EDALLKK-YGL 519 (553) T ss_pred cchhhhhhhhceeeecCCccccChHHHHHH--HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHH-----HHHHHHH-cCC Confidence 11223 23465544332 23345579999999988889987531110000 0000000 000 Q ss_pred ccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccccccccccccCcccccc Q lcl|NC_020081. 486 EYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQGGKDDNGN 545 (552) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (552) ... ..+..+ ...+.+ ...++.+...+.++.+. || T Consensus 520 ~~~--~~~~~~--~~~~~~------~~~~~~~~~~~~~~~~~----------------~e 553 (553) T protein:vir:63 520 TFN--LSAKRS--LGDGRD------AATGIAEDPAAAQTSQQ----------------GE 553 (553) T ss_pred CCC--CCCccc--cCCCcc------cCCCCCCCCCCCCcccc----------------cC Confidence 000 000000 000000 00000000000111111 11 No 126 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=99.48 E-value=6.1e-12 Score=82.06 Aligned_cols=437 Identities=11% Similarity=0.094 Sum_probs=204.8 Q ss_pred cccccchhhhhccccccccccccccccccccccccCCcccccccCC-CCchHHHHHHHhhcchHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIH-GKQNLLQMLKLWSRKNIILNAIIITRVNQVSMF 105 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~ 105 (552) |..++--..- ..+.....+++ -.+.+.-+ ..+...|+.. .++..-..||.......++..+ . +-..++-. T Consensus 1 m~~~i~~~~g-~p~~~~~~~~~--~~~~ia~~----~~~~~~~~~~~~~~~~~~iLr~~~~~~~~y~~m-~-~D~~i~s~ 71 (491) T protein:vir:10 1 MSKGLWVSPT-EFVTFGEPDKS--LSSQIATR----ARSIDFFALGMYLPNPDPVLKALGKDIRVYREL-R-ADAHVGGC 71 (491) T ss_pred CCCceeCCCC-CccCcccCChH--HHHHHHhh----hcccccccccCCccchHHHHHhcCCCHHHHHHH-h-hChHHHHH Confidence 2222210000 00000000000 01111100 0011111111 1222333444332211111111 1 11112222 Q ss_pred HHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCC Q lcl|NC_020081. 106 CTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKL 185 (552) Q Consensus 106 ~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~ 185 (552) +...+..+.+..|.|...+.+ .+....+.++|.+ ..+.+++..++ +.+++|.++.+++|... T Consensus 72 l~~Rk~av~~~~w~i~~~~~~-------~~~~e~v~e~l~~----------~~~~~~l~~~l-da~~~G~s~~Ei~w~~~ 133 (491) T protein:vir:10 72 VRRRKAAVKALEWGLDRGKAK-------SRVAKSIADVFAD----------LDLSRIVTEML-DAVLYGYQPMEITWGKV 133 (491) T ss_pred HHHHHHHHhCCCcEEecCCCC-------HHHHHHHHHHHhc----------CCHHHHHHHHH-HhhhhcceeEEEEEeec Confidence 222333344566776432221 1223445555543 24667887776 67889999999998664 Q ss_pred C---CEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHH Q lcl|NC_020081. 186 G---DLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNH 262 (552) Q Consensus 186 G---~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~ 262 (552) | .|..|.++|+.++.+ +.++++ .|...........+++...|++++..++ ..+||.+.+..|... T Consensus 134 ~g~~~~~~l~~r~~~~f~~--d~~~~l-------~~~~~~~~~~g~~l~~~k~i~~~~~~~~---~~p~g~gLl~~~~w~ 201 (491) T protein:vir:10 134 GNYIVPIDVVGKPADWFVY--DPENQL-------RFRSKDHWMQGEELPARKFLVPRQEATY---LNPYGFPDLSMCFWP 201 (491) T ss_pred CCeeEEEEeeeecccceee--ccCCce-------EEecCCCCCCcceecCCCEEEEEecCCC---CCcccchhHHHHHHH Confidence 3 367899999988765 334433 2222222233455677766766654332 238999999999999 Q ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhH-HH Q lcl|NC_020081. 263 LQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKD-ME 341 (552) Q Consensus 263 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d-~q 341 (552) ...-....++...|...-|.|--+.+++.+ .++++++++.+.+.+... +++ ..|..+..++++..+.+... .. T Consensus 202 ~~fK~~~~~~w~~f~E~yG~P~~igky~~~--a~~~ek~~l~~al~~~~~---~a~-~viP~~~~ie~~ea~~~~g~~~~ 275 (491) T protein:vir:10 202 TTFKKGGLKFWVQFTEKYGSPMLVGKHPRS--ASDGEKNLLLDCLEDMVQ---DAV-AVVPDDSSIEIKEAAGKTGSADV 275 (491) T ss_pred HHHHHHHHHHHHHHHHHcCCCeEEEecCCC--CCHHHHHHHHHHHHHHhc---CcE-EEecCCceeEEEecCCCCCChhH Confidence 999999999999999999999888887654 377888888888777532 222 12333333444444433322 34 Q ss_pred HHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcc-----c Q lcl|NC_020081. 342 FEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQ-----F 416 (552) Q Consensus 342 ~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~-----~ 416 (552) |.+..++..++|+.+. ||.+-.+. ++.+++..+.... .....+.-.++.|+..||+ |+.. + T Consensus 276 y~~li~~~d~~Isk~i------LGqtlTt~------~~gs~a~~~vh~~-v~~di~~~D~~~i~~tln~-li~~l~~~N~ 341 (491) T protein:vir:10 276 YERLLHFCRGEVSIAL------LGQNQTTE------ATSTRASAQAGLE-VTDDIRDGDKAVVSEAMNM-LIRWICDLNF 341 (491) T ss_pred HHHHHHHHHHHHHHHH------hhhhcccC------cccchhHHHHHHH-HHHHHHHHHHHHHHHHHHH-HHHHHHHhcC Confidence 7788889999988873 55432221 1234444444333 4566777888899999986 5432 1 Q ss_pred c--cceeecccccCh--HHHHHHHHHHHHHhcCC-cCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCC Q lcl|NC_020081. 417 G--GDYVFNFVGGDA--KTEAEIISILESKAKIG-LTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQM 491 (552) Q Consensus 417 ~--~~~~~~f~~~d~--~~~~~~~~~~~~~~~g~-lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~ 491 (552) + ...+|.|..... +.+++.... ....|+ ++..++|+.+|+|+-+.+...+... T Consensus 342 ~~~~~p~f~~~~~~e~~~~~a~~~~~--L~~~G~~i~~~~i~e~~Gip~~~~~~~~~~~~-------------------- 399 (491) T protein:vir:10 342 DGADRPVFDMWEQEQVDEIQAGRDQK--LTQAGARFTPAYFKRAYNLQDGDLDERPLPVS-------------------- 399 (491) T ss_pred CCCCcceEEecCcCchhHHHHHHHHH--HHhCCCcCCHHHHHHHhCCCCCCcCccccccC-------------------- Confidence 1 224566654332 223333332 233465 7999999999998754443221000 Q ss_pred CCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccccc-cccc----ccc-CccccccccccccC Q lcl|NC_020081. 492 DANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQAN-TNST----PQG-GKDDNGNVVNDWEA 552 (552) Q Consensus 492 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~----~~~-~~~~~~~~~~~~~~ 552 (552) .+................ +..... .+.. ..++ ..+. ++= .--.+++...|+.. T Consensus 400 ~~~~~~~~~~~~~~~~~~----~~~d~~--~~~~--~~~~~~~~~~~~~~~i~~~l~~~~s~~e~~~ 458 (491) T protein:vir:10 400 AVDTVGAASFAEFEAPDQ----DALDAA--LNTL--SARDLNADAQALVAPLLKRIANGASADELLG 458 (491) T ss_pred CCCCcccccccccCCCCC----CchHHH--HHHH--HHHHHHHHHHHHHHHHHHHHHhcCCHHHHHH Confidence 000000000000000000 000000 0000 0000 0000 000 00000111111111 No 127 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=99.46 E-value=3e-12 Score=83.72 Aligned_cols=506 Identities=11% Similarity=0.058 Sum_probs=211.6 Q ss_pred CCCCCCCcccccchhhc-ccccCcccccccccchhhhhccccccccccccccccccccccccCCccccc------ccCCC Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNI-IDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEA------PSIHG 73 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~ 73 (552) |++-.- -||-.-+.-. ...+-..+..-.+..+-...-....+++ ++ .-.++++...... ..+.+ T Consensus 41 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~----~~~~~~~~~~~~~l~~~~~~~F~G 111 (694) T protein:vir:10 41 VPADFA-RRGALNALDAAPVAEPSPSLRLARQFEVDVSNYTPRERR----AA----SYALDFNGTSMDALSFVTSSGFPG 111 (694) T ss_pred ccCCcc-ccccchhhcccccCCCCcchhhhhhccccccCCCccccc----hh----hhhhccCcccccchhhhhccCcch Confidence 444331 1222222222 1111111111111111111100011111 11 1122223222222 23333 Q ss_pred CchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcc----ccceeeeeccccccCChhHHHHHHHHHHHHHhcCC Q lcl|NC_020081. 74 KQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDK----GVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGR 149 (552) Q Consensus 74 ~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~----~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~ 149 (552) ++ .|..+|-.+. ++.|+.+.+....+-+....+.+. ..|+.+ .....+. .+..+++.|+.-++++++ T Consensus 112 y~----~la~laQ~~e-yr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~--~~~~~~~--~d~dqi~~L~~e~erl~V 182 (694) T protein:vir:10 112 FP----TLVLLAQLPE-YRAMHEVLADECIRTWGEAIGGTKEKADTSGLAA--GGNAAST--SDGDQLKQINDEIERLRI 182 (694) T ss_pred HH----HHHHHhhccc-hhhHHHHHHHHhhcccceeccccchhhhhhcccc--ccccccc--ccHHHHHHHHHHHHHHHH Confidence 33 3444554444 355556555554332211110000 001111 0111111 122455666776666543 Q ss_pred CCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEEC-----------------CCCCEEEEEEecCceeEEEEC----CCcc Q lcl|NC_020081. 150 IDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYD-----------------KLGDLHNFKAVDASTVYVAVD----EDGK 208 (552) Q Consensus 150 ~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~-----------------~~G~~~~L~~l~p~~v~v~~~----~~g~ 208 (552) +..|.+.+.++. +||-+.+++.-+ ..|.+.+|..|+|..|++... +-+- T Consensus 183 ---------~~~l~eaik~aR-lfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~sp 252 (694) T protein:vir:10 183 ---------RDAVRTTVIHDQ-AFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVAD 252 (694) T ss_pred ---------HHHHHHHHHhhc-cccceEEEEEeecCccccccccccccccccCcceeeeEeecccccccchhhhccchhh Confidence 223555555544 555555444321 235677899999999987432 1111 Q ss_pred cccccceeEEEEEcCCceEEEEcccceeeecccccCCc---cCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_020081. 209 ERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDL---TVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRG 285 (552) Q Consensus 209 ~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~---~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~g 285 (552) .+ ..+.+|... + ..+.++.++.++-.+.++. ...+.|+|..+.+...|..+..+......+...- ...+ T Consensus 253 df--gkP~~y~V~--G---~~IH~SRL~~f~g~plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~-~v~~ 324 (694) T protein:vir:10 253 DF--YKPSTWWMI--G---TEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQF-SVSG 324 (694) T ss_pred cc--CCCceEEEe--c---eEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhh-hhHH Confidence 11 122233332 1 1244444444433322221 1236799999999999999888877777666542 2232 Q ss_pred EEEeCCCCCC-CHHHH-HHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHH Q lcl|NC_020081. 286 LLHIKTGQEQ-SNQAL-TSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSE 363 (552) Q Consensus 286 il~~~~~~~~-s~~~~-~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~ 363 (552) + ++.-...+ ...+. -..|-++-+++.+ |.| +.++....-+|.+.+.+..... +......+.||.+-+||... T Consensus 325 l-k~dla~~L~~g~~~~l~~R~eli~~~Rs--n~G-~~llDk~~Eefeq~stslSGLd--dVi~qf~q~VAgaa~IPltk 398 (694) T protein:vir:10 325 I-LMDLAQALMPGANVDLSMRAELINRYRD--NRN-ILFLDKATEEFFQFNTPLSGLD--ALQAQAQEQMSAVSHIPLIK 398 (694) T ss_pred H-HHHHHHhhcChhHHHHHHHHHHHHHhcC--ccc-eEEEecCCcceEEEecccCCHH--HHHHHHHHHHHhhhcCchhh Confidence 2 11100001 11111 1223344455654 333 2345434567777765554433 56667789999999999987 Q ss_pred hcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHH----- Q lcl|NC_020081. 364 INFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISI----- 438 (552) Q Consensus 364 lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~----- 438 (552) |--...+++..++. ..-.|.-+.....-+.-|+|.++++-+.|-+..+.....++.|+|......+..+++++ T Consensus 399 LfGqSPkGlNATGE--~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp~i~~~fnPL~qmtd~EkAeI~~k~A 476 (694) T protein:vir:10 399 LLGITPTGLNASSE--GEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAESRYKQA 476 (694) T ss_pred hhccCcccccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhhhh Confidence 74344444422221 11223334444444678899999888888777776666677788764433333333222 Q ss_pred ---HHHHhcCCcCHHHHHHHhCCCCCC-------CCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCC Q lcl|NC_020081. 439 ---LESKAKIGLTINDIRKELGYPDTE-------GGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMD 508 (552) Q Consensus 439 ---~~~~~~g~lT~NE~R~~~gl~p~~-------ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (552) ...+..|+++++|+|.++.-+|-- -.|.+-.+.....-+.....+... .+++-..+-+..+|...++. T Consensus 477 ~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~g~~~~~~ 554 (694) T protein:vir:10 477 QSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLA--EGGDTGAPGGARAGATAPPT 554 (694) T ss_pred HHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcc--cccccCCCCcccccccCCCc Confidence 122346899999999998776531 123332222111101111111110 01111111111111111111 Q ss_pred CCCCCCCcc--cccCCCCcccc--------ccccc-------cccccCccccccccccccC Q lcl|NC_020081. 509 NVNGKDSFN--QNVGKDGQSKQ--------QANTN-------STPQGGKDDNGNVVNDWEA 552 (552) Q Consensus 509 ~~~~~~~~~--~~~~~~~~~~~--------~~~~~-------~~~~~~~~~~~~~~~~~~~ 552 (552) ..+-.-..+ ...+.+...+. .+--- =.=-|++.+.||++.+--. T Consensus 555 v~~~~~~~~~~~ag~~~~~~~~ag~v~~~~g~vLl~kr~~g~W~lPgG~vE~gEt~~~a~~ 615 (694) T protein:vir:10 555 VANVNANVNPREAGAQDAAMRAAGAVYVVDGKVLLMKRPAGDWGLPAGKVEGNETPEEAAR 615 (694) T ss_pred ccccccccCccccCCCCccceeeEEEEEeCCEEEEEEecCCCccCCccccCCCCCHHHHHH Confidence 110000000 00000100000 00000 0001455555555543211 No 128 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.46 E-value=4.5e-13 Score=88.26 Aligned_cols=445 Identities=10% Similarity=0.040 Sum_probs=200.3 Q ss_pred hhcccccCcccccccccchhhhhcccccccccccccccc---cccccc-ccCCcccccccCCCCc-------hHHHHHHH Q lcl|NC_020081. 15 DNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEE---PIIGSM-SMNPDFKEAPSIHGKQ-------NLLQMLKL 83 (552) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~-~~~~~~~~~~~~~~~~-------~~~~~Lr~ 83 (552) +++ ..++..--+........|.+ ....++ .|.|. ..+... .+...-|. T Consensus 1 ~~~----------------~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~-----~~s~~~~i~~~~~~lr~RaRd 59 (530) T protein:vir:38 1 MKI----------------PSLVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPP-----SESADAALLPNYSRGNARADD 59 (530) T ss_pred Ccc----------------ceeecCccccchHHHhhhhcccCCCCCcccccccC-----CCCHHHHHHHHHHHHHHHHHH Confidence 111 11111100000011111111 001111 12221 111111 11122345 Q ss_pred hhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeecccc--ccCChhHHHH-HHHHHHHHH----hcCCCCCCCcc Q lcl|NC_020081. 84 WSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPL--QEPNDHNKKK-IKEIENFIE----KTGRIDNDFTR 156 (552) Q Consensus 84 ~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~--~~~~~~~~~~-~~~l~~~l~----~~n~~~~pn~~ 156 (552) +..|+.+...++...++ +.-|.|+.++.+-.. ...+.+..++ .+.++..+. .++..-..... T Consensus 60 l~rNn~~a~~av~~~~~-----------nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~ 128 (530) T protein:vir:38 60 LVRNNGYAANAVQLHQD-----------HIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERK 128 (530) T ss_pred HHhcChHHHHHHHHHHH-----------HhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeecc Confidence 55566666665544332 333556665543110 0111121111 122333222 11111112345 Q ss_pred CCHHHHHHHHHHHHHhcCCeeEEEEECCC-C--CEEEEEEecCceeEEEEC-CCcc------cccc-cceeEEEEEcC-- Q lcl|NC_020081. 157 DNFRSFVKKLVRDRLTYDKINFELVYDKL-G--DLHNFKAVDASTVYVAVD-EDGK------ERKA-KDGVRYVQVID-- 223 (552) Q Consensus 157 ~t~~~f~~~~v~d~ll~Gna~~~i~r~~~-G--~~~~L~~l~p~~v~v~~~-~~g~------~~~~-~~~~~y~~~~~-- 223 (552) +|++++.+.+++.++..|.+|+.+++... | .+..|..|+|++|....+ .+|. .+.. ...+.|..... T Consensus 129 ~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~ 208 (530) T protein:vir:38 129 RTFTMMIREGVAMHAFNGELCVQATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGY 208 (530) T ss_pred CCHHHHHHHHHHHHhhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccC Confidence 78999999999999999999999987654 3 257889999988753221 1121 1000 01122333211 Q ss_pred -Cc---------eEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC Q lcl|NC_020081. 224 -DK---------VVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQ 293 (552) Q Consensus 224 -~~---------~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~ 293 (552) +. ....+++.+|||+...-+ .....|+|.+..++..+......+.....--+=.+...++|+-+.+. T Consensus 209 ~~~~~~~~~~~~~~~~v~a~~vlH~f~~~r---~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~ 285 (530) T protein:vir:38 209 PGWMAQNWTYIPRELPGGRPSFIHVFEPME---DGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDT 285 (530) T ss_pred CCccccccceeeeeeccChhHeEeeccccC---CCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCc Confidence 11 123356679999876433 33478999999988887777666665554445555566666643211 Q ss_pred C---------CCHHHHHHHHHHHHHH---hcc---ccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020081. 294 E---------QSNQALTSFRREWTSM---FSG---INGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYS 358 (552) Q Consensus 294 ~---------~s~~~~~~~~~~~~~~---~~G---~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fg 358 (552) . ..++....+.....+. +.+ .-..|.++. +..|.+++.+..+-....|.+..+...+.||+.+| T Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~-L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglG 364 (530) T protein:vir:38 286 QSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPH-LLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLG 364 (530) T ss_pred cccccccccCCcccccccccccchhhhhcccccceeccCceeee-cCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcC Confidence 0 0000011111111110 000 012444443 44578888777765667888999999999999999 Q ss_pred CCHHHh-cccccccccccccccccchhHH-----------HHHHHHHHHHhhHHHH-HHHHHHHhhcCcc-------ccc Q lcl|NC_020081. 359 IDPSEI-NFPNRGGATGHSGNTLNEGSSA-----------EKYRNSKDKGLEPLLK-FIEDAVNKYIVSQ-------FGG 418 (552) Q Consensus 359 VPp~~l-g~~~~~t~~~~~~~~~~~~n~e-----------~~~~~~~~~~l~P~~~-~ie~~ln~~L~~~-------~~~ 418 (552) ||-+.| |+... .||+++. ..+..+....++|+.. ++++++..-.++- +.. T Consensus 365 i~ye~lt~D~s~----------~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~ 434 (530) T protein:vir:38 365 VSYEQLSRNYSQ----------MSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQE 434 (530) T ss_pred CCHHHHhccccc----------ccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchh Confidence 999877 54332 3566443 3344444545556444 4566555543321 100 Q ss_pred ---c-eeecc-----cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhcccccccccc Q lcl|NC_020081. 419 ---D-YVFNF-----VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQR 489 (552) Q Consensus 419 ---~-~~~~f-----~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~ 489 (552) . ...+| ...|+....... ...+.+|..|.-++-++.|.+|-+--+.+.. ....... ...... T Consensus 435 ~~~a~~~~~w~~p~~~~iDP~Ke~~a~--~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~-----e~~~~~~-~Gl~~~- 505 (530) T protein:vir:38 435 ARTAWGNANWIGSGRMAIDGLKEVQEA--VMLIEAGLSTYEKECAKRGDDYQEIFAQQVR-----ESMERRA-AGLNPP- 505 (530) T ss_pred hHHhhhceeeecCCccccChHHHHHHH--HHHHHcCCCCHHHHHHHcCCCHHHHHHHHHH-----HHHHHHH-cCCCCC- Confidence 0 12233 334665444332 2345579999999988899987431110000 0000000 000000 Q ss_pred CCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcccccccccc Q lcl|NC_020081. 490 QMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNS 534 (552) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 534 (552) .... .....+...+++++. ++ .++| T Consensus 506 -~~~~--~~~~~~~~~~~~~~~-d~----------------~~~a 530 (530) T protein:vir:38 506 -AWAA--AAFEAGVKKSNEEEQ-DG----------------ARAA 530 (530) T ss_pred -CCcc--cccCCCCCCCCCCCC-CC----------------CCCC Confidence 0000 000000001110000 00 0000 No 129 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.45 E-value=3.8e-12 Score=83.22 Aligned_cols=452 Identities=10% Similarity=0.044 Sum_probs=206.0 Q ss_pred cccchhhhhcccccccccc-cccccc---cccccc-ccCCccccccc--CCCCchHHHHHHHhhcchHHHHHHHHHHHHH Q lcl|NC_020081. 29 KQIEEDAILKKGKNTKSNK-PKAYEE---PIIGSM-SMNPDFKEAPS--IHGKQNLLQMLKLWSRKNIILNAIIITRVNQ 101 (552) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~-~~~~~~---~~~~~~-~~~~~~~~~~~--~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~ 101 (552) =+....+....+...+..+ ..+|.. ....++ +|.|.....-. ......+...-|.+..|+.+...++....+ T Consensus 1 ~~~p~~~~~~~~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~- 79 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQD- 79 (533) T ss_pred CCCchhhhhhcccccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHH- Confidence 0000000000111111111 011111 011111 22221110000 000011112224455566666665544332 Q ss_pred HHHHHHHHHhhccccceeeeeccc------cccCChhHHHHHHHH-HHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcC Q lcl|NC_020081. 102 VSMFCTPARNSDKGVGYEIRLKDP------LQEPNDHNKKKIKEI-ENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYD 174 (552) Q Consensus 102 ~~~~~~~~~~~~~~~~~~i~~k~~------~~~~~~~~~~~~~~l-~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~G 174 (552) +.-|.|+.++.+-. +.+...+-.++++.+ ..|.+.++........++++++...+++.++..| T Consensus 80 ----------nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dG 149 (533) T protein:vir:34 80 ----------HIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNG 149 (533) T ss_pred ----------HhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCC Confidence 33345666654311 111111112223222 2222222211123456789999999999999999 Q ss_pred CeeEEEEECCC-C--CEEEEEEecCceeEEEEC-CCcc-cc----c-c-cceeEEEEEcC--Cc----------eEEEEc Q lcl|NC_020081. 175 KINFELVYDKL-G--DLHNFKAVDASTVYVAVD-EDGK-ER----K-A-KDGVRYVQVID--DK----------VVAKFK 231 (552) Q Consensus 175 na~~~i~r~~~-G--~~~~L~~l~p~~v~v~~~-~~g~-~~----~-~-~~~~~y~~~~~--~~----------~~~~~~ 231 (552) .+|+.+.+... | .+..|..|+|+++..-.+ .+|. +. . . ...+.|..... .+ .....+ T Consensus 150 E~f~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~ 229 (533) T protein:vir:34 150 ELFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGG 229 (533) T ss_pred ceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeeccC Confidence 99999987654 2 256888999988854221 1111 10 0 0 01223333211 11 122356 Q ss_pred ccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC---------CCHHHHHH Q lcl|NC_020081. 232 AKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQE---------QSNQALTS 302 (552) Q Consensus 232 ~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~---------~s~~~~~~ 302 (552) +.+|||+....+ .....|+|.+..++..+............--+=.+...++|+-+.+.. ...+..+. T Consensus 230 a~~VlH~f~~~r---~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~ 306 (533) T protein:vir:34 230 RASFIHVFEPVE---DGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRER 306 (533) T ss_pred hhHeeeeccccC---CCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCccccccc Confidence 788999876433 334689999999888887777666655555555566667776432110 01111111 Q ss_pred HHH---HHHHHhccc---cccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHh-ccccccccccc Q lcl|NC_020081. 303 FRR---EWTSMFSGI---NGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEI-NFPNRGGATGH 375 (552) Q Consensus 303 ~~~---~~~~~~~G~---~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~~ 375 (552) +.. .-...+.+. -+.|.++. +..|.+++.+..+-....|.+..+...+.||+.+|||-+.| |+... T Consensus 307 ~~~~~~~~~~~~~~~~~~l~pG~i~~-L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~------ 379 (533) T protein:vir:34 307 LTGWIGEIAAYYAAAPVRLGGAKVPH-LMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQ------ 379 (533) T ss_pred ccccchhhhhccCcceeeccCceeee-cCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhccc------ Confidence 111 111111111 12455543 44578888777766667888999999999999999999877 54332 Q ss_pred ccccccchhH-----------HHHHHHHHHHHhhHHHHH-HHHHHHhhcCc--c-----ccc----ceeecc-----ccc Q lcl|NC_020081. 376 SGNTLNEGSS-----------AEKYRNSKDKGLEPLLKF-IEDAVNKYIVS--Q-----FGG----DYVFNF-----VGG 427 (552) Q Consensus 376 ~~~~~~~~n~-----------e~~~~~~~~~~l~P~~~~-ie~~ln~~L~~--~-----~~~----~~~~~f-----~~~ 427 (552) .||+++ +..+..++...++|+.+. +++++-...++ . +.. -..+.| ... T Consensus 380 ----~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~i 455 (533) T protein:vir:34 380 ----MSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAI 455 (533) T ss_pred ----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCcccc Confidence 355654 334444556666776554 55555443332 1 100 012333 334 Q ss_pred ChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCC Q lcl|NC_020081. 428 DAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNM 507 (552) Q Consensus 428 d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (552) |+....... ...+.+|..|.-|+-++.|.+|-+--+.+.. ........ +... +..+ T Consensus 456 DP~Ke~~a~--~~~i~~G~~s~~~~~a~~G~D~~ev~~q~a~-----e~~~~~~~-gl~~-----~~~~----------- 511 (533) T protein:vir:34 456 DGLKEVQEA--VMLIEAGLSTYEKECAKRGDDYQEIFAQQVR-----ETMERRAA-GLKP-----PAWA----------- 511 (533) T ss_pred ChHHHHHHH--HHHHHcCCCCHHHHHHHcCCCHHHHHHHHHH-----HHHHHHhc-CCCC-----CCCC----------- Confidence 665444332 3345579999999988899987432111000 00000000 0000 0000 Q ss_pred CCCCCCCCcccccCCCCcccccccccc Q lcl|NC_020081. 508 DNVNGKDSFNQNVGKDGQSKQQANTNS 534 (552) Q Consensus 508 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 534 (552) ...+.+....+.++..+++++| T Consensus 512 -----~~~~~s~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 512 -----AAAFESGLRQSTEEEKSDSRAA 533 (533) T ss_pred -----CcCccCCCCCCCCCCcccCCCC Confidence 0000011111111111122222 No 130 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=99.45 E-value=4.1e-12 Score=82.99 Aligned_cols=497 Identities=11% Similarity=0.065 Sum_probs=213.0 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccc------cccCCCC Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKE------APSIHGK 74 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~ 74 (552) +|.++.-|-.+- +.+..-.+.-+.........+++-.+ -.++++..... ...+.++ T Consensus 52 ~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~l~~~~~~~F~Gy 113 (695) T protein:vir:78 52 LNALDAAPVAEP----------SPSLRLARQFEVDVSNYTPRERRAAS--------YALDFNGTSMDALSFVTSSGFPGF 113 (695) T ss_pred ccccccccccCC----------CcccccceeceeccccCCccccchhh--------hhhcccccccccchhhhccCcchH Confidence 333332222211 11111111111111111111111111 12233332222 2233344 Q ss_pred chHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcc----ccceeeeeccccccCChhHHHHHHHHHHHHHhcCCC Q lcl|NC_020081. 75 QNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDK----GVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRI 150 (552) Q Consensus 75 ~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~----~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~ 150 (552) + .|..+|-.+. ++.|+.+.+....+-+....+.+. ..|+.+ .....+. .+..+++.|+.-++++++ T Consensus 114 ~----~la~laQ~~e-yr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~--~~~~~~~--~d~dqi~~L~~e~erL~V- 183 (695) T protein:vir:78 114 P----TLVLLAQLPE-YRAMHEVLADECIRTWGEAIGGTKEKADTSGLAA--GGNAAST--SDGDQLKQINDEIERLRI- 183 (695) T ss_pred H----HHHHHhhccc-hhhHHHHHHHHhhcccceeccccchhhhhhcccc--ccccccc--ccHHHHHHHHHHHHHHHH- Confidence 3 4444554444 355556655554332211110000 001111 0111111 122455666776666543 Q ss_pred CCCCccCCHHHHHHHHHHHHHhcCCeeEEEEEC-----------------CCCCEEEEEEecCceeEEEEC----CCccc Q lcl|NC_020081. 151 DNDFTRDNFRSFVKKLVRDRLTYDKINFELVYD-----------------KLGDLHNFKAVDASTVYVAVD----EDGKE 209 (552) Q Consensus 151 ~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~-----------------~~G~~~~L~~l~p~~v~v~~~----~~g~~ 209 (552) +..|.+.+.++. +||-+.+++.-. ..|.+.+|..|+|..|++... +-+-. T Consensus 184 --------~~~l~eaik~aR-lfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spd 254 (695) T protein:vir:78 184 --------RDAVRTTVIHDQ-AFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADD 254 (695) T ss_pred --------HHHHHHHHHhhc-cccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhhccchhhc Confidence 223555555544 555555444221 235677899999999987432 11111 Q ss_pred ccccceeEEEEEcCCceEEEEcccceeeecccccCCc---cCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Q lcl|NC_020081. 210 RKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDL---TVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGL 286 (552) Q Consensus 210 ~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~---~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi 286 (552) + ..+.+|... + ..+.++.++.++-.+.++. ...+.|+|..+.+...|..+..+......+...- ...++ T Consensus 255 f--gkP~~y~V~--G---~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~-~v~~l 326 (695) T protein:vir:78 255 F--YKPSTWWMI--G---TEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQF-SVSGI 326 (695) T ss_pred c--CCCceEEEe--c---eEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhh-hhHHH Confidence 1 122233332 1 1244444444433332221 1236799999999999999888877777666542 22322 Q ss_pred EEeCCCCCC-CHHHH-HHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHh Q lcl|NC_020081. 287 LHIKTGQEQ-SNQAL-TSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEI 364 (552) Q Consensus 287 l~~~~~~~~-s~~~~-~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l 364 (552) ++.-...+ ...+. -..|-++-+++.+ |.| +.++....-+|.+.+.+..... +......+.||.+-+||...| T Consensus 327 -k~dla~~L~~g~~~~l~~R~eli~~~Rs--n~G-~~llDk~~Eefeq~stslSGLd--dVi~qf~q~VAgaa~IPltkL 400 (695) T protein:vir:78 327 -LMDLAQALMPGANVDLSMRAELINRYRD--NRN-ILFLDKATEEFFQFNTPLSGLD--ALQAQAQEQMSAVSHIPLIKL 400 (695) T ss_pred -HHHHHHhhcChhHHHHHHHHHHHHHhcC--ccc-eEEEecCCcceEEEecccCCHH--HHHHHHHHHHHhhhcCchhhh Confidence 11100001 11111 1223344455654 333 3345434567777765554433 566677899999999999877 Q ss_pred cccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHH------ Q lcl|NC_020081. 365 NFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISI------ 438 (552) Q Consensus 365 g~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~------ 438 (552) --...+++..++. ..-.|.-+.....-+.-|+|.++++-+.|-+..+.....++.|+|......+..+++++ T Consensus 401 fGqSPkGlNATGE--~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idpdi~~~fnPL~qmtd~EkAeI~~k~A~ 478 (695) T protein:vir:78 401 LGITPTGLNASSE--GEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAESRYKQAQ 478 (695) T ss_pred hccCCccccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhhhhH Confidence 4344444422221 11223334444444678899999888888777776666677788764433333333222 Q ss_pred --HHHHhcCCcCHHHHHHHhCCCCCC-------CCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCC Q lcl|NC_020081. 439 --LESKAKIGLTINDIRKELGYPDTE-------GGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDN 509 (552) Q Consensus 439 --~~~~~~g~lT~NE~R~~~gl~p~~-------ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (552) ...+..|+++++|+|.++.-+|-- -.|.+-.+.....-+.....+.. ..+++-..+-+..+|...++.. T Consensus 479 ~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~g~~~~~~~ 556 (695) T protein:vir:78 479 SDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRL--AEGGDTGAPGGARAGATAPPTV 556 (695) T ss_pred HHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCc--ccccccCCCCCCCCCCCCCCce Confidence 122346899999999998776531 12333222211110111111111 1111111122222222222221 Q ss_pred CCCC--CCcccccCCCCcccc--------cccccc-------ccccCccccccccccccC Q lcl|NC_020081. 510 VNGK--DSFNQNVGKDGQSKQ--------QANTNS-------TPQGGKDDNGNVVNDWEA 552 (552) Q Consensus 510 ~~~~--~~~~~~~~~~~~~~~--------~~~~~~-------~~~~~~~~~~~~~~~~~~ 552 (552) .+-. -......+.+...+. .+---. .=-|++.+.||++.+--. T Consensus 557 ~~~~~~~~~~~ag~~~~~~~aag~v~~~~g~vLl~kr~~g~W~lPgG~vE~gEt~~~aa~ 616 (695) T protein:vir:78 557 ANVNANVKPREAGAQDAAMRAAGAVYVVDGKVLLMKRPAGDWGLPAGKVEGNETPEEAAR 616 (695) T ss_pred eeeeccccccccCCCCcccceeEEEEEeCCEEEEEEecCCCccCCccccCCCCCHHHHHH Confidence 1100 000000001111000 000000 001455555555543211 No 131 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=99.43 E-value=7.4e-12 Score=81.61 Aligned_cols=496 Identities=11% Similarity=0.073 Sum_probs=211.8 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccc------cccCCCC Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKE------APSIHGK 74 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~ 74 (552) +|.++.-|-- +-+.+..-.+.-+.........+++-.+ -.++++..... ...+.++ T Consensus 52 ~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~l~~~~~~~F~Gy 113 (695) T protein:vir:36 52 LNALDAAPVV----------EPSPSLRLARQFEVDVSNYTPRERRAAS--------YALDFNGTSMDALSFVTSSGFPGF 113 (695) T ss_pred cccccccccc----------CCCcccccceeceecccccCccccchhh--------hhhcccccccccchhhhccCcchH Confidence 4444432211 1112222112111112211111221111 12233332222 2233344 Q ss_pred chHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcc----ccceeeeeccccccCChhHHHHHHHHHHHHHhcCCC Q lcl|NC_020081. 75 QNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDK----GVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRI 150 (552) Q Consensus 75 ~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~----~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~ 150 (552) + .|..+|-.+. ++.|+.+.+....+-+....+.+. ..|+.+ .....+. .+..+++.|+.-++++++ T Consensus 114 ~----~la~laQ~~e-yr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~--~~~~~~~--~d~dqik~L~~e~erL~V- 183 (695) T protein:vir:36 114 P----TLVLLAQLPE-YRAMHEVLADECIRTWGEAIGGTKEKADTSGLAA--GGNAAST--SDGDQLKQINDEIERLRI- 183 (695) T ss_pred H----HHHHHhhccc-hhhHHHHHHHHhhcccceecccchhhhhhccccc--ccccccc--CchHHHHHHHHHHHHHHH- Confidence 3 3444554444 355556655554332211110000 011111 0011111 122356667777666543 Q ss_pred CCCCccCCHHHHHHHHHHHHHhcCCeeEEEEEC-----------------CCCCEEEEEEecCceeEEEEC----CCccc Q lcl|NC_020081. 151 DNDFTRDNFRSFVKKLVRDRLTYDKINFELVYD-----------------KLGDLHNFKAVDASTVYVAVD----EDGKE 209 (552) Q Consensus 151 ~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~-----------------~~G~~~~L~~l~p~~v~v~~~----~~g~~ 209 (552) +..|.+.+.++. +||-+.+++.-. ..|.+.+|..|+|..|++... +-+-. T Consensus 184 --------~~~l~eaik~aR-lfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spd 254 (695) T protein:vir:36 184 --------RDAVRTTVIHDQ-AFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADD 254 (695) T ss_pred --------HHHHHHHHHhhc-cccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhhccchhhc Confidence 223555555544 555555444221 235677899999999987432 11111 Q ss_pred ccccceeEEEEEcCCceEEEEcccceeeecccccCCc---cCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Q lcl|NC_020081. 210 RKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDL---TVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGL 286 (552) Q Consensus 210 ~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~---~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi 286 (552) + ..+.+|... + ..+.++.++.++-.+.++. ...+.|+|..+.+...|..+..+......+...- ...++ T Consensus 255 f--gkP~~y~V~--G---~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~-~v~~l 326 (695) T protein:vir:36 255 F--YKPSTWWMI--G---TEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQF-SVSGI 326 (695) T ss_pred c--CCCceEEEe--c---eEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhh-hHHHH Confidence 1 122233332 1 1244444444433332221 1236799999999999998888777777666432 22222 Q ss_pred EEeCCCCCC---CHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHH Q lcl|NC_020081. 287 LHIKTGQEQ---SNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSE 363 (552) Q Consensus 287 l~~~~~~~~---s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~ 363 (552) ++.-...+ ...++ ..|-++-+++.+ |.| +.++....-+|.+.+.+..... +......+.||.+-+||... T Consensus 327 -k~dla~aL~~g~~~~l-~~R~eli~~~Rs--n~G-~~llDk~~Eefeq~stslSGLd--dVi~qf~q~VAgaa~IPltk 399 (695) T protein:vir:36 327 -LMDLAQALMPGANVDL-SMRAELINRYRD--NRN-ILFLDKATEEFFQFNTPLSGLD--ALQAQAQEQMSAVSHIPLIK 399 (695) T ss_pred -HHHHHHhhcChhHHHH-HHHHHHHHHhcC--ccc-eEEEecCCcceEEEecccCCHH--HHHHHHHHHHHhhhcCchhh Confidence 11100000 11111 223344455654 333 2345434567777765554433 56667789999999999987 Q ss_pred hcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHH----- Q lcl|NC_020081. 364 INFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISI----- 438 (552) Q Consensus 364 lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~----- 438 (552) |--...+++..++. ..-.|.-+.....-+.-|+|.++++-+.|-+..+.....++.|+|......+..+++++ T Consensus 400 LfGqSPkGlNATGE--~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idpdi~~~fnPL~qmtd~EkAeI~~k~A 477 (695) T protein:vir:36 400 LLGITPTGLNASSE--GEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAESRYKQA 477 (695) T ss_pred hhccCcccccccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhhhh Confidence 74344444422221 11223334444444678899999888888777776666677788764433333333322 Q ss_pred ---HHHHhcCCcCHHHHHHHhCCCCCC-------CCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCC Q lcl|NC_020081. 439 ---LESKAKIGLTINDIRKELGYPDTE-------GGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMD 508 (552) Q Consensus 439 ---~~~~~~g~lT~NE~R~~~gl~p~~-------ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (552) ...+..|+++++|+|.++.-+|-- -.|.+-.+.....-+.....+... .+++-..+-+...|...++. T Consensus 478 ~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~g~~~~~~ 555 (695) T protein:vir:36 478 QSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLA--EGGDTGAPGGARAGATAPPT 555 (695) T ss_pred HHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcc--cccccCCCCcccccccCCCc Confidence 122346899999999998776531 123332222111101111111100 00111111111111111111 Q ss_pred CCCCCCCcc--cccCCCCcccc--------ccccc-------cccccCccccccccccccC Q lcl|NC_020081. 509 NVNGKDSFN--QNVGKDGQSKQ--------QANTN-------STPQGGKDDNGNVVNDWEA 552 (552) Q Consensus 509 ~~~~~~~~~--~~~~~~~~~~~--------~~~~~-------~~~~~~~~~~~~~~~~~~~ 552 (552) ..+-.-..+ ...+.+...+. .+--- =.=-|++.+.||++.+--. T Consensus 556 v~~~~~~~~~~~ag~~~~~~~aag~v~~~~g~vLl~kr~~g~W~lPgG~vE~gEt~~~aa~ 616 (695) T protein:vir:36 556 VANVNANVNPREAGAQDAAMRAAGAVYVVDGKVLLMKRPAGDWGLPAGKVEGNETPEEAAR 616 (695) T ss_pred ccccccccCccccCCCCccceeeEEEEEeCCEEEEEEecCCCccCCccccCCCCCHHHHHH Confidence 110000000 00000100000 00000 0001455555555543211 No 132 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.43 E-value=1.3e-12 Score=85.83 Aligned_cols=445 Identities=11% Similarity=0.040 Sum_probs=208.8 Q ss_pred cccccchhhcccccCcccccccccchhhhhcccccc---ccccccccccc-cccccccCCcccccccC-CCCch------ Q lcl|NC_020081. 8 FKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNT---KSNKPKAYEEP-IIGSMSMNPDFKEAPSI-HGKQN------ 76 (552) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~~~~~------ 76 (552) +.---+..|++. +-++ ...... ......+|.-. ...+++ ++...|+. +.... T Consensus 1 ~~r~~~~~~~~d--r~i~------------~~~~~~~~~~~~~~~~y~aa~~~r~~~---~w~~~~~~~s~~~~i~~~~~ 63 (505) T protein:vir:96 1 MKRAEKKPSLAQ--RMVN------------WAWYRYVEPQKNAARAFEAARRDRLGK---AWLRRASRLSADEEIYADLA 63 (505) T ss_pred CCCCccccchhh--cccc------------hhhhhhHHHHHHhhhhcccccCCCccc---cccCCCCCCChHHHHHHHHH Confidence 111122222220 0000 000000 00001122110 011111 11111211 11111 Q ss_pred -HHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeee--ccccccCChhHHHHHHHHHHHH-HhcCCCCC Q lcl|NC_020081. 77 -LLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRL--KDPLQEPNDHNKKKIKEIENFI-EKTGRIDN 152 (552) Q Consensus 77 -~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~--k~~~~~~~~~~~~~~~~l~~~l-~~~n~~~~ 152 (552) +..--|.+..|+.+...++...++.+ ++..|+.++. .......+++-.+++..+.+.. +.++ .. T Consensus 64 ~lr~RaRdL~rNn~~a~~av~~~~~nv----------VG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~--~D 131 (505) T protein:vir:96 64 SLVQRAREQSINNPYAKRFYQLLKNNV----------IGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGN--CD 131 (505) T ss_pred HHHHHHHHHHhcChHHHHHHHHHHHHh----------cCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcC--cc Confidence 11222455666666666655433322 2213444433 3333344444444444333322 1111 12 Q ss_pred CCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCC-CEEEEEEecCceeEEEEC---CCcc------cccc-cceeEEEEE Q lcl|NC_020081. 153 DFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLG-DLHNFKAVDASTVYVAVD---EDGK------ERKA-KDGVRYVQV 221 (552) Q Consensus 153 pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G-~~~~L~~l~p~~v~v~~~---~~g~------~~~~-~~~~~y~~~ 221 (552) ....++++++...+++.++..|.+|+.+++...+ .+..|..|+|++|..-.+ .+|. .+.. ...+.|... T Consensus 132 ~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~ 211 (505) T protein:vir:96 132 VTGRYHFVTLLHLWMETLARDGEVLVREHRGYPNKWGYALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLL 211 (505) T ss_pred eeccCCHHHHHHHHHHHHhhCCceEEEEeecCCCCcceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEe Confidence 3356789999999999999999999988765433 467889999998853221 1111 1111 011223322 Q ss_pred c------------CCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEe Q lcl|NC_020081. 222 I------------DDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHI 289 (552) Q Consensus 222 ~------------~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~ 289 (552) . .......+++.+|||+...-+ ....-|+|.+..++..+............-.+=.+...++|+. T Consensus 212 ~~hPgd~~~~~~~~~~~~~rvpa~~vlH~f~~~r---~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~ 288 (505) T protein:vir:96 212 VNHPGDNSYCYHYAGQTYERVPADEIIHTFVPWR---PHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQ 288 (505) T ss_pred ecCCCccccccccccccccccCHhHhhhhhcccC---CccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeec Confidence 1 112234578899999876543 3346899999988888777766666555555555666677775 Q ss_pred CCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHh-cccc Q lcl|NC_020081. 290 KTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEI-NFPN 368 (552) Q Consensus 290 ~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~ 368 (552) +.+.. .+...+. ....... -..|.++.+ ..|.+++.+..+-....|.+..+...+.||+.+|||-+.| |+.. T Consensus 289 ~~~~~-~~~~~~~----~~~~~~~-l~pG~i~~L-~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s 361 (505) T protein:vir:96 289 DPEAY-DQPPEDD----QGEIVEE-VEAGTYQLL-PYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLE 361 (505) T ss_pred CCccC-CCccccc----cCccccc-cCCceeeec-CCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc Confidence 43221 1110000 0011111 124555444 4578888877776668889999999999999999998877 5433 Q ss_pred cccccccccccccchhHHH-----------HHHHHHHHHhhHHHH-HHHHHHHhhcCc--ccccc--eeeccc-----cc Q lcl|NC_020081. 369 RGGATGHSGNTLNEGSSAE-----------KYRNSKDKGLEPLLK-FIEDAVNKYIVS--QFGGD--YVFNFV-----GG 427 (552) Q Consensus 369 ~~t~~~~~~~~~~~~n~e~-----------~~~~~~~~~l~P~~~-~ie~~ln~~L~~--~~~~~--~~~~f~-----~~ 427 (552) . .||+++.+ .+..|+...++|+.+ +++.++-...++ ....+ ..+.|. .. T Consensus 362 ~----------~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~i 431 (505) T protein:vir:96 362 G----------VNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWV 431 (505) T ss_pred c----------ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCcccc Confidence 2 34554433 344455667777555 466665544432 21111 233442 34 Q ss_pred ChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCC Q lcl|NC_020081. 428 DAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNM 507 (552) Q Consensus 428 d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (552) |+...... ....+.+|..|+-|+-++.|.+|-+--+.+.. ....... .+. ....+... .... T Consensus 432 DP~Ke~~a--~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~-----e~~~~~~-~Gl------~~~~~~~~----~~~~ 493 (505) T protein:vir:96 432 DPAKDSKA--HSESIKNRTRSRSSIIRAAGDDPEDVFDEIAW-----EEQLMRD-KGV------NPTPPEQE----SKDA 493 (505) T ss_pred ChHHHHHH--HHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHH-----HHHHHHH-cCC------CCCCCCCC----CCCC Confidence 65544332 23345579999999888899987431111000 0000000 000 00000000 0000 Q ss_pred CCCCCCCCcccccCCCCcc Q lcl|NC_020081. 508 DNVNGKDSFNQNVGKDGQS 526 (552) Q Consensus 508 ~~~~~~~~~~~~~~~~~~~ 526 (552) +..+++. + ..| + T Consensus 494 --~~~~~~~--~-~~d--~ 505 (505) T protein:vir:96 494 --TTDEEDD--S-ASD--D 505 (505) T ss_pred --CCCCCCC--C-CCC--C Confidence 0000000 0 000 0 No 133 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=99.40 E-value=3.4e-12 Score=83.43 Aligned_cols=492 Identities=11% Similarity=0.059 Sum_probs=204.7 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccc------cccCCCC Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKE------APSIHGK 74 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~ 74 (552) +|.++.-|-.+- +.+..-.+.-+.........+++-.+ -.++++..... ...+.++ T Consensus 52 ~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~l~~~~~~~F~Gy 113 (698) T protein:vir:10 52 LNALDAAPVAEP----------SPSLRLARQFEVDVSNYTPRERRAAS--------YALDFNGTSMDALSFVTSSGFPGF 113 (698) T ss_pred ccccccccccCC----------CccccccccceeccccCCccccchhh--------hhhcccccccccchhhhccCcchH Confidence 333332222211 11111111111111111111111111 12232332222 2233344 Q ss_pred chHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcc----ccceeeeeccccccCChhHHHHHHHHHHHHHhcCCC Q lcl|NC_020081. 75 QNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDK----GVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRI 150 (552) Q Consensus 75 ~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~----~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~ 150 (552) + .|..+|-.+. ++.|+.+.++...+-+....+.+. ..|+.+ .....+. .+..+++.|+.-++++++ T Consensus 114 ~----~la~laQ~~e-yr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~--~~~~~~~--~d~dqi~~L~~e~erl~V- 183 (698) T protein:vir:10 114 P----TLVLLAQLPE-YRAMHEVLADECIRTWGEAIGGTKEKADTSGLAA--GGNAAST--SDGDQLKQINDEIERLRI- 183 (698) T ss_pred H----HHHHHhhccc-hhhHHHHHHHHhhcccceeccccchhhhhhcccc--ccccccc--ccHHHHHHHHHHHHHHHH- Confidence 3 3444554444 355556655554332211110000 001111 0111111 122456667777766653 Q ss_pred CCCCccCCHHHHHHHHHHHHHhcCCeeEEEEEC-----------------CCCCEEEEEEecCceeEEEEC----CCccc Q lcl|NC_020081. 151 DNDFTRDNFRSFVKKLVRDRLTYDKINFELVYD-----------------KLGDLHNFKAVDASTVYVAVD----EDGKE 209 (552) Q Consensus 151 ~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~-----------------~~G~~~~L~~l~p~~v~v~~~----~~g~~ 209 (552) +..+.+.+.++. +||-+.+++.-+ ..|.+.+|..|+|..|++... +-+-. T Consensus 184 --------~~~l~eai~~aR-lfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~~~n~~dP~spd 254 (698) T protein:vir:10 184 --------RDAVRTTVIHDQ-AFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADD 254 (698) T ss_pred --------HHHHHHHHHhcc-cccceEEEEEeecCccccccccccccccccCccceeeeeecccccccchhhhccchhhc Confidence 223555555555 455554444311 235677899999999987431 11111 Q ss_pred ccccceeEEEEEcCCceEEEEcccceeeecccccCCcc---CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Q lcl|NC_020081. 210 RKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLT---VGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGL 286 (552) Q Consensus 210 ~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~---~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi 286 (552) ++ .+.+|... +. .+.++.++.++-.+-++.. ..+.|+|.++.+...|..+..+......+...- ...++ T Consensus 255 fg--kP~~y~V~--G~---~IH~SRL~~~vg~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~-~~~~l 326 (698) T protein:vir:10 255 FY--KPSTWWMI--GS---EVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQF-SVSGI 326 (698) T ss_pred cC--CCceEEEe--cc---eecceeEEEecCCCchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHHh-hHHHH Confidence 11 22233332 21 2445554444333322222 225699999999999998888777776666432 22222 Q ss_pred EEeCCCCCCCHHHHHHH--HHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHh Q lcl|NC_020081. 287 LHIKTGQEQSNQALTSF--RREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEI 364 (552) Q Consensus 287 l~~~~~~~~s~~~~~~~--~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l 364 (552) +..-...++......+ |-++-+++.+ |.|- .++....-+|.+.+.+.... .+......+.||.+-+||...| T Consensus 327 -~~dla~aL~~g~~~~l~~R~eli~~~Rs--n~G~-~llDk~~Eefeq~st~lSGL--ddVi~qf~q~VAgaa~IPltkL 400 (698) T protein:vir:10 327 -LMDLAQALTPGANVDLSMRAELINRYRD--NRNI-LFLDKATEEFFQFNTPLSGL--DALQAQAQEQMSAVSHIPLIKL 400 (698) T ss_pred -HHHHHHhcCChhhHHHHHHHHHHHHhcC--ccce-EEEecCCcceEEEecCcCCH--HHHHHHHHHHHHhhhcCchhhh Confidence 1100000111111112 3344455554 3332 34543457777776555443 3666778899999999999877 Q ss_pred cccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHH------ Q lcl|NC_020081. 365 NFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISI------ 438 (552) Q Consensus 365 g~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~------ 438 (552) --...++|..++. ..-.|.-+.....-+.-|+|.++++-+.|-+..+......+.|+|......+..+++++ T Consensus 401 fGqSPkGlNATGE--~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp~i~~~fnPL~qmtd~EkAeI~~k~A~ 478 (698) T protein:vir:10 401 LGITPTGLNASSE--GEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAEARYKQAQ 478 (698) T ss_pred hccCCcccCccch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhhhhH Confidence 4344444422211 11223333444444677889999988888777776666677778764433333333322 Q ss_pred --HHHHhcCCcCHHHHHHHhCCCCCCC-------CCeeeccc-cccchhhhccccccccccCCCCCccCcccCCCCCCCC Q lcl|NC_020081. 439 --LESKAKIGLTINDIRKELGYPDTEG-------GDVTLAGV-HVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMD 508 (552) Q Consensus 439 --~~~~~~g~lT~NE~R~~~gl~p~~g-------gD~~~~~~-n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (552) ...+..|+++++|+|.++.-+|--+ -|.+..|. +.+.......+...+....+.+..+.+.-+|..-++. T Consensus 479 ~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 558 (698) T protein:vir:10 479 SDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGAPADDDIDGVLTYVQRMAEGGDTGAPTAPGGARAGATAPPA 558 (698) T ss_pred HHHHHHHhcCCCHHHHHHHHhccCCCccccccCCcccCCCCCCCcchHHHhhhcCCcCCCCcccccccccccCCCCCCcc Confidence 1123468999999999987654311 12221111 1111110000000000000111111111111111111 Q ss_pred CCCCCCCcccccCCCCcccccccccccc-------c-----------------cCccccccccccccC Q lcl|NC_020081. 509 NVNGKDSFNQNVGKDGQSKQQANTNSTP-------Q-----------------GGKDDNGNVVNDWEA 552 (552) Q Consensus 509 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~-----------------~~~~~~~~~~~~~~~ 552 (552) ..+-+ +...+.+-+. +..+-. . |++.+.||.+-+--. T Consensus 559 ~~~~~------~~~~~~~~~~-~~~~~~a~giv~~~g~~vLL~~r~~g~W~lPgG~ie~GEt~~~aa~ 619 (698) T protein:vir:10 559 AANVN------ANANPREAGA-QDAAMRAAGIVFRAGDKVLLMKRPAGDWGLPAGKVEDGETPEEAAR 619 (698) T ss_pred ccccc------CCCCccccCc-ccceeeEEEEEEEcCCeEEEEEecCCCcccCccccCCCCCHHHHHH Confidence 10000 0000000000 000000 0 122222222211111 No 134 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.40 E-value=1.4e-12 Score=85.59 Aligned_cols=479 Identities=9% Similarity=-0.003 Sum_probs=207.1 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhcccccccccccccccccc-ccccccCCcccccccCCCCch--- Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPI-IGSMSMNPDFKEAPSIHGKQN--- 76 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~--- 76 (552) ||++| |.-+-. .-.. .-+...++- ...+|.--- .....|++. ..+.... T Consensus 1 Mn~iD-r~i~~~--------sP~~---a~~R~~ar~----------~~~~y~aa~~~r~~~~~~~-----~~s~~~~i~~ 53 (548) T protein:vir:95 1 MNLID-RLLEPL--------APEL---VARRLAARE----------AIQAYEAARPGRTHKAKRQ-----PLGADTSLQK 53 (548) T ss_pred CchHH-hHhhhc--------chHH---HHHHHHhHH----------HhccccccCccccccccCC-----CCChHHHHHH Confidence 77777 221110 0000 000000000 001111100 111122221 1111111 Q ss_pred ----HHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhcc--ccceeeeeccccccCChhHHHHHHHHHH-HHHhcCC Q lcl|NC_020081. 77 ----LLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDK--GVGYEIRLKDPLQEPNDHNKKKIKEIEN-FIEKTGR 149 (552) Q Consensus 77 ----~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~--~~~~~i~~k~~~~~~~~~~~~~~~~l~~-~l~~~n~ 149 (552) +...-|.+..|+.+...++....+. .+. +.++.-+....+.+...+-.+.++.+.+ |-..+ T Consensus 54 ~~~~lr~RaRdL~rNn~~a~~av~~~~~n----------vVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~-- 121 (548) T protein:vir:95 54 SAVSMREQCRKLDEDHDLVTGLLDRLEER----------VVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSP-- 121 (548) T ss_pred HHHHHHHHHHHHHhcChHHHHHHHHHHHh----------ccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCc-- Confidence 1122245556666666655443322 222 2233333222222211121223333222 22222 Q ss_pred CCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCC-------CCEEEEEEecCceeEEEECCCcccc-c-----c-cce Q lcl|NC_020081. 150 IDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKL-------GDLHNFKAVDASTVYVAVDEDGKER-K-----A-KDG 215 (552) Q Consensus 150 ~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~-------G~~~~L~~l~p~~v~v~~~~~g~~~-~-----~-~~~ 215 (552) .....+|++.+...+++.++..|.+++.+++... ..+..|..|+|++|..-.+..+... . . ... T Consensus 122 --D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp 199 (548) T protein:vir:95 122 --ETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTFATSVPFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRK 199 (548) T ss_pred --cccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccCCcccceEEEEechhhcCCCCCCCCCceeeeeEECCCCce Confidence 2345678999999999999999999999887542 2367899999998853222222110 0 0 011 Q ss_pred eEEEEEc----------CCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_020081. 216 VRYVQVI----------DDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRG 285 (552) Q Consensus 216 ~~y~~~~----------~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~g 285 (552) +.|.... .......+++++|||+....+ .....|+|.+..++..+......+.....--+=.+...+ T Consensus 200 ~aY~i~~~hPgd~~~~~~~~~~~rvpA~~VlHif~~~r---~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~ 276 (548) T protein:vir:95 200 RAYHLLKDHPGNLQTLGGSLAVKRVEAERIIHIAYRKR---IGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAM 276 (548) T ss_pred EEEEEeecCCCcccccccccceeeechhHheecccccC---CccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhhee Confidence 2222221 112345689999999875433 334679999999888877777666655555555566677 Q ss_pred EEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHh- Q lcl|NC_020081. 286 LLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEI- 364 (552) Q Consensus 286 il~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l- 364 (552) +|+.+.+.....+. ....-.. .... ..|.+.-.+..|.+++.+..+.....|.+..+...+.||+.+|||-+.| T Consensus 277 fi~~~~~~~~~~~~---~~~~~~~-~~~~-~pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~lt 351 (548) T protein:vir:95 277 YIKKGNPDSYTVEP---GKDRKNR-TIPI-APGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVS 351 (548) T ss_pred eeecCCCccccCCC---Ccccccc-cccc-cCCccccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHh Confidence 77754321111100 0000000 0001 1233211234567777777665567888999999999999999998887 Q ss_pred cccccccccccccccccchhHHHHH-----------HHHHHHHhhHHHH-HHHHHHHhhcC--ccc-c--cceeeccc-- Q lcl|NC_020081. 365 NFPNRGGATGHSGNTLNEGSSAEKY-----------RNSKDKGLEPLLK-FIEDAVNKYIV--SQF-G--GDYVFNFV-- 425 (552) Q Consensus 365 g~~~~~t~~~~~~~~~~~~n~e~~~-----------~~~~~~~l~P~~~-~ie~~ln~~L~--~~~-~--~~~~~~f~-- 425 (552) |+. +.||+++.+.. ..|+...++|+.. +++.++-.-.+ +.+ . ..+.++|. T Consensus 352 gD~-----------s~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P 420 (548) T protein:vir:95 352 RAY-----------DGTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGP 420 (548) T ss_pred ccc-----------chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecC Confidence 322 13566554443 3445566667444 45555544333 211 1 12334443 Q ss_pred ---ccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCe------eecccccc----chhhhccccccccccCCC Q lcl|NC_020081. 426 ---GGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDV------TLAGVHVQ----RLGQIMQQEQVEYQRQMD 492 (552) Q Consensus 426 ---~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~------~~~~~n~~----~~~~~~~~~~~~~~~~~~ 492 (552) ..|+...+... ...+.+|..|.-|+-++.|.+|-+--+. .+.-+++. +...... .+.+ T Consensus 421 ~~~~iDP~Kea~A~--~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~-------~~~~ 491 (548) T protein:vir:95 421 VMPWINPMHEANAW--ELLVKAGFADEAEVARARGRDPRELKKSRETEIKANRAAGLVFSSDAYHQLVK-------SGMD 491 (548) T ss_pred CccccChHHHHHHH--HHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCcccccccc-------cccC Confidence 34665444332 2345578999988888899887531100 00001100 0000000 0000 Q ss_pred CCcc-CcccCCCCCCCCCCCCCCCcccccCCCCccccccccccccccCccccccccccccC Q lcl|NC_020081. 493 ANQF-LAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQGGKDDNGNVVNDWEA 552 (552) Q Consensus 493 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (552) +..+ .....+.+-+..-++.++.-|..-+ .-..-+-+=++.++.|+- +|.-.| -.. T Consensus 492 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~--~~~~~~-~~~ 548 (548) T protein:vir:95 492 PVEAVQKVYLGVGKMLTADEARELVNRYGA-GLPVPGPDFPNESNNGGA--DGQPSN-PDP 548 (548) T ss_pred CCCchhhhccccccccccchhHHhhccCCC-CCcCCCCCCCcccccCCC--CCCCCC-CCC Confidence 0000 0000000000000011111110000 000001111111111111 011000 000 No 135 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=99.38 E-value=4.2e-11 Score=77.45 Aligned_cols=430 Identities=15% Similarity=0.153 Sum_probs=190.7 Q ss_pred hcccccccccccccccccccccccc---CCcccccc--cCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSM---NPDFKEAP--SIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARN 111 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~--~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~ 111 (552) |-.....-++-+-.....| ++... ...+...+ ..+ .+...+..+.+-+ -.-+.+|+..|...| T Consensus 1 ~~~~~~~~~gl~p~rl~~i-~~~~~~~~~~~~~~~~~~~Lr-~~~~~~ly~~m~~-D~hi~s~l~~Rk~av--------- 68 (488) T protein:vir:95 1 MADITETQESLPPFRMGEV-GSLGLKVKNGRIYEEPRQALR-FPESIKTFQLMMR-DPAVAASVNIIKMFV--------- 68 (488) T ss_pred CCCccccCCCCCHHHHHHH-HHHhhccccchhhccchhhhc-ccchHHHHHHHhh-ChHHHHHHHHHHHHH--------- Confidence 1111111111110000001 00000 11111110 011 1112233344433 233556665555443 Q ss_pred hccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCC------ Q lcl|NC_020081. 112 SDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKL------ 185 (552) Q Consensus 112 ~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~------ 185 (552) .+..|.|...+... ...++.+....+..++... ..++.+++..++ +.+.+|-+++++++... T Consensus 69 --~~~~w~v~p~~~~~-~d~~~~~~a~~v~~~l~~~--------~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~ 136 (488) T protein:vir:95 69 --RKVNWRFVPPKGKE-QDPKMLERADFFNSLMDDM--------EHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGK 136 (488) T ss_pred --hcCCceEecCCCCc-hhHHHHHHHHHHHHHHhcc--------CccHHHHHHHHH-Hhhcccceeeeeeeecccccccc Confidence 34566664322111 1111222233344444321 135667777776 67899999999998542 Q ss_pred -------CC--EEEEEEecCcee-EEEECCCccccccc-ceeEEEE-------EcCCceEEEEcccceeeecccccCCcc Q lcl|NC_020081. 186 -------GD--LHNFKAVDASTV-YVAVDEDGKERKAK-DGVRYVQ-------VIDDKVVAKFKAKEMAWEVSNPRTDLT 247 (552) Q Consensus 186 -------G~--~~~L~~l~p~~v-~v~~~~~g~~~~~~-~~~~y~~-------~~~~~~~~~~~~~evi~~~~~~~~~~~ 247 (552) |. +..|.+.|+.+. .+..+.+++..... ....... .........+++...|+|++..+ . T Consensus 137 ~~~~~~dg~~~~~~i~~Rpq~~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~~---~ 213 (488) T protein:vir:95 137 YQSKFDDGLIGWAKLPIRNQSTLDKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDDE---Y 213 (488) T ss_pred ccccccCCeeeeeeeeecCcccccceeeccCCCceeecccccccccccccccccccccccccccccceEEEeecCC---C Confidence 32 455666666433 23334444332110 0000000 00111223466666666665443 2 Q ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC--CCCHHHHHHHHHHHHHHhc---ccccccccee Q lcl|NC_020081. 248 VGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQ--EQSNQALTSFRREWTSMFS---GINGAWKIPV 322 (552) Q Consensus 248 ~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~--~~s~~~~~~~~~~~~~~~~---G~~nagk~~i 322 (552) ..+||.+.+..|......-....++...|...-+.|--++..+.+. ..++++.+.+.+...+... +...+| + T Consensus 214 g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag---~ 290 (488) T protein:vir:95 214 GNPEGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAG---L 290 (488) T ss_pred CccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhh---e Confidence 3389999999999888888888888888888755554455543221 1234444444444443322 111122 2 Q ss_pred eccCCce--e-------eeccCc-hhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHH Q lcl|NC_020081. 323 ITAEDVK--F-------VNMTQS-SKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNS 392 (552) Q Consensus 323 l~~~g~~--~-------~~l~~~-~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~ 392 (552) +.+.|+. + +.++.. ..-..|.+..++..++|+.+. ||.+-.++ .+++.+++.. +..... T Consensus 291 iiP~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~i------LGqtLT~~----~~~~Gs~Al~-~vh~ev 359 (488) T protein:vir:95 291 IWPRYIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAF------MSDVLAMG----QSKYGSFSLA-DSKTSL 359 (488) T ss_pred eeccccccccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHH------hccccccc----cCcchhhhHH-HHHHHH Confidence 2233443 2 222222 223346677788888888865 55432221 1122334433 334445 Q ss_pred HHHHhhHHHHHHHHHHHhhcCccc-----cc--c-eeecccccChHHHHHHHHHHHHH-hcCCcC-----HHHHHHHhCC Q lcl|NC_020081. 393 KDKGLEPLLKFIEDAVNKYIVSQF-----GG--D-YVFNFVGGDAKTEAEIISILESK-AKIGLT-----INDIRKELGY 458 (552) Q Consensus 393 ~~~~l~P~~~~ie~~ln~~L~~~~-----~~--~-~~~~f~~~d~~~~~~~~~~~~~~-~~g~lT-----~NE~R~~~gl 458 (552) ....+.-.++.|++.||+.|++.. +. . -+|.|.....++....++.++.. ..|..- .+.+|+.+|+ T Consensus 360 ~~~i~~aDa~~i~~tln~~li~~l~~~Nfg~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gi 439 (488) T protein:vir:95 360 LAMSVDILLKQIKNVINRDLVAQTYALNMWDDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGL 439 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCC Confidence 567788899999999999877532 21 1 25677655555544444444433 346543 3679999999 Q ss_pred CCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcccccc Q lcl|NC_020081. 459 PDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQA 530 (552) Q Consensus 459 ~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (552) |+-+.+.....+.--.. ....+..... ++ ......+..++....+- .++ T Consensus 440 p~~~~~e~~~~~~~~~~-----------~~~~~~~~~~----~~-~~~~~~~~~~~~~~a~~-------~~~ 488 (488) T protein:vir:95 440 PPADESQPVSEKLSPNS-----------QSRSGDGYKT----AG-EGTAKTPSAKDPSTANK-------ANK 488 (488) T ss_pred CCCCCCccccccCCCCC-----------CCCCCcccCC----Cc-ccCCcccccccchhhhh-------ccC Confidence 97655443322210000 0000000000 00 00000011111110000 000 No 136 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=99.35 E-value=5.3e-12 Score=82.40 Aligned_cols=402 Identities=11% Similarity=0.031 Sum_probs=170.4 Q ss_pred cccccchhhhhccccccc-cccccccccccccc-----cccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTK-SNKPKAYEEPIIGS-----MSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVN 100 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~ 100 (552) |++|..- .+.-.+.... .+.--.+.-+..+- -.|+... -+..-.+++|-++-+.+++.+.||...++ T Consensus 1 ~~~~~~~-~~~~~~~~~~~~~~rd~l~~~~~glg~~r~~~~~~~g------~~~~~~~~~l~~~Yr~~~ia~~iVd~~~d 73 (449) T protein:vir:10 1 MTDKLTL-AVNHALNDARMARARMGLMVPTMGLDNKRHSAWCEYG------FPELVTYENLYSLYRRGGIAHGAVEKLVG 73 (449) T ss_pred CchhhHH-HHhhhcchhHHHHHHHHHHHHHhcCCcccchhhhhcC------CcccCCHHHHHHHHhcCchhHHHHHhhhh Confidence 3333110 0000000000 00000000011100 0111110 11223456676666677888888876655 Q ss_pred HHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEE Q lcl|NC_020081. 101 QVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFEL 180 (552) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i 180 (552) .. |. +. +..+ ...+.+..+.. ..++..+..++.. .-|..+.+.+-+ ..++|-+++++ T Consensus 74 ~~-~~--------~~-~~i~--~g~~~~~~~~~----~~~e~~~~~l~~~------~~~~~l~ea~~~-~rl~Gga~i~i 130 (449) T protein:vir:10 74 KC-WQ--------TN-PEII--EGDDADDSEDE----TSWEKKSKQVFTN------RLWRSFAEADRR-RLVGRYAGILL 130 (449) T ss_pred hh-hh--------cC-cccc--cCccccchhhh----HHHHHHHHHHHHH------HHHHHHHHHHHh-hhccCcEEEEE Confidence 32 11 00 0011 00011111111 1112222211100 012223333333 44677776665 Q ss_pred E-ECC---------CCCEEEEEEecCceeEEEE---CCCcccccccceeEEEEEc--CC--ceEEEEcccceeeeccccc Q lcl|NC_020081. 181 V-YDK---------LGDLHNFKAVDASTVYVAV---DEDGKERKAKDGVRYVQVI--DD--KVVAKFKAKEMAWEVSNPR 243 (552) Q Consensus 181 ~-r~~---------~G~~~~L~~l~p~~v~v~~---~~~g~~~~~~~~~~y~~~~--~~--~~~~~~~~~evi~~~~~~~ 243 (552) . ++. .|.+..|.|+....+++.. ++....+ ..+.+|.... .+ .....+.++.|+++.-. T Consensus 131 ~v~d~~~l~~Pl~~~~~i~~i~v~~~~~i~~~~~~~dp~sp~y--g~P~~y~v~~~~~g~~~~~~~iH~SRl~~~~~~-- 206 (449) T protein:vir:10 131 HIRDEKDWNLPATKGRGLQKVSVSWAGSLKVAEWDTGINSKTY--GQPKLWKYTERLPNGSSRRVDIHPDRVFILGDY-- 206 (449) T ss_pred EecCCCCCCcccccCcceeeEEeeccccCChhhhhcCCCCCCC--CCceEEEEeeeccCCCccceeeccceeEeecCC-- Confidence 4 332 2357778887766555431 1111111 2233333221 11 22345667777655321 Q ss_pred CCccCCcccccHHHHHHHHHHHHHHHH-HHHHHHHhccCC-----------CceEEEeCCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_020081. 244 TDLTVGKYGYPELEIALNHLQYHDNTE-VFNARFFAQGGT-----------TRGLLHIKTGQEQSNQALTSFRREWTSMF 311 (552) Q Consensus 244 ~~~~~g~~G~spl~~~~~~i~~~~~~~-~~~~~~f~ng~~-----------p~gil~~~~~~~~s~~~~~~~~~~~~~~~ 311 (552) ..-|.|.++.+.+.+-....+. .+...+++|-.+ ..++.... + ...++..+++.+...... T Consensus 207 -----~~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~-~-~~~e~~~~~~~~~~~~~~ 279 (449) T protein:vir:10 207 -----SEDAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLY-G-VSIDELQDKFNEVAGEIN 279 (449) T ss_pred -----CCCChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHh-h-CCchHHHHHHHHHHHHHh Confidence 1337788888876553333322 222233332111 11111111 1 112233345555554444 Q ss_pred ccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHh-cccccccccccccccccchhHHHHHH Q lcl|NC_020081. 312 SGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEI-NFPNRGGATGHSGNTLNEGSSAEKYR 390 (552) Q Consensus 312 ~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~~~~~~~~~~n~e~~~~ 390 (552) +|.+ . .++. .+-+|+.+..++.+.. +......++||++-+||...| |.. .+++..+ +...+|......+ T Consensus 280 ~~~~---~-~~i~-~~~d~~~~~~~~sgl~--d~l~~~~q~iaaa~~IP~t~L~Gqs-p~glnst-~D~~nyyd~i~~~- 349 (449) T protein:vir:10 280 RGND---V-LMTT-QGATVTPLVTSVADPT--ATYNVNLQTAAAGVDIPTRILIGNQ-QAERSST-EDQKYFNARCQSR- 349 (449) T ss_pred ccch---h-eeec-CCcceEEEecccCChh--HHHHHHHHHHHHHhCCCeeeeeccC-ccccccc-hhHHHHHHHHHHH- Confidence 4432 1 2343 4556777776666543 566678888999999999776 544 4555433 2333333322222 Q ss_pred HHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccC---hHHHHHHHH----HHHH-HhcC---CcCHHHHHHHhCCC Q lcl|NC_020081. 391 NSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGD---AKTEAEIIS----ILES-KAKI---GLTINDIRKELGYP 459 (552) Q Consensus 391 ~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d---~~~~~~~~~----~~~~-~~~g---~lT~NE~R~~~gl~ 459 (552) +.-|+|.++.+-+.|-+.-+.....++.|+|.... .+++++... +++. +.+| +++++|+|+.+|++ T Consensus 350 ---Q~~l~p~le~l~~~l~~s~~g~~~~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~~~~~EiR~~~~~~ 426 (449) T protein:vir:10 350 ---RVDLSFEIEDFCDKLIELKIIDAVAKKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPAFSREEIRTAAGYD 426 (449) T ss_pred ---HHhhhHHHHHHHHHHHHhhcCCCCCceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCCcCHHHHHHHhccc Confidence 23478988888777755444332345777775443 344444322 2221 2234 78999999999998 Q ss_pred CCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCccc Q lcl|NC_020081. 460 DTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQ 518 (552) Q Consensus 460 p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (552) |..+.+. +...++..++..+.. . T Consensus 427 ~~~~~~~-----------------------------~~e~~de~~~~~d~~-------a 449 (449) T protein:vir:10 427 NDDEEPL-----------------------------GEEDGDEEDKATDSA-------A 449 (449) T ss_pred CCCCCCC-----------------------------CCCCCccccccCCcC-------C Confidence 8542110 000000000000000 0 No 137 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.33 E-value=2.7e-11 Score=78.53 Aligned_cols=436 Identities=13% Similarity=0.076 Sum_probs=192.2 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCch---- Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQN---- 76 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 76 (552) ||+++. .| ..-. .... ......+|.- .+. ..+....+..++... T Consensus 1 m~~~~~-----~~-----------~a~~-----~~~~------~~~~~~~y~a--a~~---~~~~~~~~~~s~d~~~~~~ 48 (495) T protein:vir:10 1 MNMTPS-----GY-----------QSLA-----SGLL------VPVGASAYEG--ASG---GHRWQDIGDYGPDTAVASG 48 (495) T ss_pred CCcccc-----cc-----------cccc-----hhhh------hHHHhhhhhc--ccc---CcccCCCCCCChhHHHHHH Confidence 555551 11 1100 0000 0000011210 000 000111111111111 Q ss_pred ---HHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHH-HHHHhcCCCCC Q lcl|NC_020081. 77 ---LLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIE-NFIEKTGRIDN 152 (552) Q Consensus 77 ---~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~-~~l~~~n~~~~ 152 (552) +...-|.+..|+.+...++...+ .+.-|.|+..+.+..+. +-.+++..+. .|-+.+ . T Consensus 49 ~~~lr~RaRdl~rNn~~a~~av~~~~-----------~~vVG~Gi~p~~~~~~~----~~~~~ie~~w~~wa~~~----D 109 (495) T protein:vir:10 49 IQTLRARSHHNVRNNPWATNAVATWV-----------AAAVGNGLTPRWRMKEQ----ELRQELQELWGDWVNEA----D 109 (495) T ss_pred HHHHHHHHHHHHhcChHHHHHHHHHH-----------HhhcCCCcccccCCchH----HHHHHHHHHHHHhhcCc----c Confidence 11222445556666555554333 33345566655543322 2223333322 222222 2 Q ss_pred CCccCCHHHHHHHHHHHHHhcCCeeEEEEECC--CC--CEEEEEEecCceeEEEEC----CCcc------cccc-cceeE Q lcl|NC_020081. 153 DFTRDNFRSFVKKLVRDRLTYDKINFELVYDK--LG--DLHNFKAVDASTVYVAVD----EDGK------ERKA-KDGVR 217 (552) Q Consensus 153 pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~--~G--~~~~L~~l~p~~v~v~~~----~~g~------~~~~-~~~~~ 217 (552) ....++++.+...+++.++..|.+|+.+++.. .| .+..|..|+|++|..-.+ .+|. .+.. ..... T Consensus 110 ~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~va 189 (495) T protein:vir:10 110 FDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQIIEPDMLASDIPDETLPSGGYVKGGIRFSNGGKRKA 189 (495) T ss_pred cccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEEEEechhhcCCCCCCCCCCCCCEEEeceEECCCCceEE Confidence 34567999999999999999999999887643 33 467899999999853111 1111 1100 01122 Q ss_pred EEEE--cCC--------ceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE Q lcl|NC_020081. 218 YVQV--IDD--------KVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLL 287 (552) Q Consensus 218 y~~~--~~~--------~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil 287 (552) |... -.+ .....+++++|||+. ..+ .....|+|.+..+. .+......+.....--+=.+...++| T Consensus 190 Y~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f-~~r---~gQ~RGis~la~i~-~l~~l~~y~dael~~a~i~A~~~~fi 264 (495) T protein:vir:10 190 YCFYRNHPAESSLIGDPVDTVWIKAEHVLHVT-VLT---VRSDAGAPWFQLLL-RLNELDQYEDAELVRKKTAALFAAFI 264 (495) T ss_pred EEEeecCCCcccccccccceeeechhheEecc-ccC---CCcccCcchhHHHH-HHHHhhHHHHHHHHHHHHhhhheeee Confidence 2221 111 134568899999984 332 23467998665433 34444444443333333345556666 Q ss_pred EeCCCCCCCHHHHHH-HHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHh-c Q lcl|NC_020081. 288 HIKTGQEQSNQALTS-FRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEI-N 365 (552) Q Consensus 288 ~~~~~~~~s~~~~~~-~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g 365 (552) +-+.+.....+.... -...-.....+ -+.|.++. +..|.+++.+..+..-..|.+..+...+.||+.+|||-+.| | T Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-l~pG~i~~-L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltg 342 (495) T protein:vir:10 265 QEATADSTGGPTIGQPKRSKGGKRITG-LNPGTLQY-LQPGQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTG 342 (495) T ss_pred ecCCCccccccccCccccccCccccee-cCCceeee-cCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhc Confidence 643221110000000 00000000111 12445543 45578888777765567888999999999999999999877 5 Q ss_pred ccccccccccccccccchhHHHHHHH------------HHHHHhhHHHH-HHHHHHHhhcC--cccccc----eeecc-- Q lcl|NC_020081. 366 FPNRGGATGHSGNTLNEGSSAEKYRN------------SKDKGLEPLLK-FIEDAVNKYIV--SQFGGD----YVFNF-- 424 (552) Q Consensus 366 ~~~~~t~~~~~~~~~~~~n~e~~~~~------------~~~~~l~P~~~-~ie~~ln~~L~--~~~~~~----~~~~f-- 424 (552) +... .||+++.+.... ++.+.++|+.+ +++.++-.-.+ +.+-.. ..++| T Consensus 343 D~s~----------~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~ 412 (495) T protein:vir:10 343 DLRG----------VNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRT 412 (495) T ss_pred cccc----------ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhcccccc Confidence 5433 345544333322 33444555444 35545443222 222111 12333 Q ss_pred ---cccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccC Q lcl|NC_020081. 425 ---VGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQT 501 (552) Q Consensus 425 ---~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (552) ...|+...... ....+.+|.+|+-|+-++.|++|-+--+.+-. ........ +.. -..++.....+ T Consensus 413 p~~~~vDP~Ke~~A--~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~-----e~~~~~~~-Gl~--~~~~p~~~~~~-- 480 (495) T protein:vir:10 413 PRWEEVDPLKKHLA--DLGDVRAGFAPISDKQAERGYDMEELFDMISD-----ANQLIDEY-DLR--LDSDPRYVNGS-- 480 (495) T ss_pred CCccccChHHHHHH--HHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHH-----HHHHHHHc-CCC--CCCCCCcCCCc-- Confidence 23455544332 23345579999999988899987431110000 00000000 000 00000000000 Q ss_pred CCCCCCCCCCCCCCcccccCCCCccccc Q lcl|NC_020081. 502 GYDGNMDNVNGKDSFNQNVGKDGQSKQQ 529 (552) Q Consensus 502 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (552) +... ....+..+.++ T Consensus 481 ~~~~-------------~~~~~~~~~~e 495 (495) T protein:vir:10 481 GAEQ-------------KSVMEAALNNE 495 (495) T ss_pred cCCC-------------CCCCCCCCCCC Confidence 0000 00000000000 No 138 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=99.24 E-value=8.9e-11 Score=75.69 Aligned_cols=321 Identities=14% Similarity=0.053 Sum_probs=161.5 Q ss_pred eEEEEECCCC---CEEEEEEecCceeE-EEECCCcccccccceeEEEEEc-CCceEEEEcccceeeecccccCCccCCcc Q lcl|NC_020081. 177 NFELVYDKLG---DLHNFKAVDASTVY-VAVDEDGKERKAKDGVRYVQVI-DDKVVAKFKAKEMAWEVSNPRTDLTVGKY 251 (552) Q Consensus 177 ~~~i~r~~~G---~~~~L~~l~p~~v~-v~~~~~g~~~~~~~~~~y~~~~-~~~~~~~~~~~evi~~~~~~~~~~~~g~~ 251 (552) +.||+|...+ .|..|.+.|+.++. ...++++++. ...+.. .+.....+++...|+|++..++ ..+| T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~~~l~------~~~~~~~~g~~~~~lp~~kfi~~~~~~~~---g~p~ 71 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPDGGLV------AIEQWGVFGKATVRIPVDRLVVFVNEREG---ANWL 71 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccCCcee------EEEecCCCCCCcceeccCCEEEEEeCCCC---CCcc Confidence 8899986544 36778889988664 4455555432 222332 3334456777777777765432 3389 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCC-----------HHHHHHHHHHHHHHhccccccccc Q lcl|NC_020081. 252 GYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQS-----------NQALTSFRREWTSMFSGINGAWKI 320 (552) Q Consensus 252 G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s-----------~~~~~~~~~~~~~~~~G~~nagk~ 320 (552) |.+.+..|.-....-....++...|...-+.|--+++.+.+...+ .+.++.+.........|. .+ T Consensus 72 G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~-~a--- 147 (355) T protein:vir:78 72 GQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGE-AA--- 147 (355) T ss_pred chhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCc-ce--- Confidence 999999999999999999999999999875554455555432221 122233333333322232 22 Q ss_pred eeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHH Q lcl|NC_020081. 321 PVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPL 400 (552) Q Consensus 321 ~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~ 400 (552) .++.+.|++++-+........|.+..++..++|+.++ ||.+-.+. .+++..+++-.+. ........+.-. T Consensus 148 ~~iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~i------LGqtlTs~---~~~~gGS~Alg~v-h~~v~~~~~~aD 217 (355) T protein:vir:78 148 GGYIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAV------LAHFLTLG---GDKSTGSYALGDT-FASFFTGSLNAV 217 (355) T ss_pred eEeecCCceEEEeecCCCcccHHHHHHHHHHHHHHHH------hhhhhccc---cCCccchhhHHHH-HHHHHHHHHHHH Confidence 2334567776666555555667788889999998876 34321110 0111223443333 345566778889 Q ss_pred HHHHHHHHHhhcCcc-----ccc--c-eeecccccChHHHHHHHHHHH-HHhcCCcCH-----HHHHHHhCCCCCCCCCe Q lcl|NC_020081. 401 LKFIEDAVNKYIVSQ-----FGG--D-YVFNFVGGDAKTEAEIISILE-SKAKIGLTI-----NDIRKELGYPDTEGGDV 466 (552) Q Consensus 401 ~~~ie~~ln~~L~~~-----~~~--~-~~~~f~~~d~~~~~~~~~~~~-~~~~g~lT~-----NE~R~~~gl~p~~ggD~ 466 (552) ++.|++.||+.|+.. ++. . -+|.|...+..+. ..++..+ ....|++.+ +.+|+.+|+|.-+.++. T Consensus 218 ~~~i~~~ln~~li~~l~~lN~~~~~~~P~~~~~~~~~~~~-~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~p~~~~~ 296 (355) T protein:vir:78 218 MKHIADVTQQHVVEDLVDQNWGPEEPAPRLVPAQLGKEQP-VTAEAIRALVECGAFTADPELEKDLRARYGLPAPAERDD 296 (355) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCcChhHH-HHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCCc Confidence 999999999877653 221 1 2455643332222 2223322 333466543 45899999986555544 Q ss_pred eeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccccccccccc----cCccc Q lcl|NC_020081. 467 TLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQ----GGKDD 542 (552) Q Consensus 467 ~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~ 542 (552) ...+..- .....+.....++ ....++.+.......+++.+. ..-.+.-. .-..+ T Consensus 297 ~~~~~~~----------------~~~~~~~~~~~~~-----~~~~~~~~a~~~~a~~~~~~~-~~~~~~~~~~~~~~~~~ 354 (355) T protein:vir:78 297 GADAAAA----------------KAAGRRRAKRLPG-----QRQGAALPSRSPRADPPRRRG-PLRRRPRHPAHRRCAPD 354 (355) T ss_pred ccCCccc----------------cccccccccccCC-----ccccccccccCCCCCChhhhH-HHHHHhhccccCCCCCC Confidence 3322110 0000000000000 000011111111111111111 10111100 01111 Q ss_pred cc Q lcl|NC_020081. 543 NG 544 (552) Q Consensus 543 ~~ 544 (552) | T Consensus 355 -~ 355 (355) T protein:vir:78 355 -G 355 (355) T ss_pred -C Confidence 1 No 139 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=99.23 E-value=4.4e-10 Score=71.88 Aligned_cols=386 Identities=12% Similarity=0.103 Sum_probs=190.0 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCccc-cccc--CCCCchHHH---HHHHhhcchHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFK-EAPS--IHGKQNLLQ---MLKLWSRKNIILNAIIIT 97 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~--~~~~~~~~~---~Lr~~a~~~~i~~a~i~~ 97 (552) |+--.++-..+.+...+ +.+. ..+....+|. +.+. ..+- .+++ ..+.+-+.-.-+.+|... T Consensus 1 ~~~~~~~~p~~~~~~~~---------~~~~---~~~~~~~g~~~~D~~lr~~gg-~~~~~~~l~~~m~e~D~~v~s~l~~ 67 (446) T protein:vir:98 1 MNMEVRNAPTPAIRRRT---------IYAM---EHLGLATSYLSEDGGYKRAGK-PTYQQLSAWDEAAQTEPIIAQGLDS 67 (446) T ss_pred CcccccCCCchhhhhhh---------hhcc---ccchhhcccCCcchHhhhcCC-ChHHHHHHHHHHHhcchHHHHHHHH Confidence 11111111111111111 0000 0000011110 0110 0010 1122 223333333445555555 Q ss_pred HHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCee Q lcl|NC_020081. 98 RVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKIN 177 (552) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~ 177 (552) |...| .++.|.|... +++....+.++|..+. +++....+.+.+.+|.++ T Consensus 68 Rk~av-----------~~~~w~V~p~---------~~~~a~~v~~~l~~~~-----------~~~~~~~~ldai~~G~s~ 116 (446) T protein:vir:98 68 IALSV-----------LNKVGPYQHG---------DKRIKKFIDDQLRNRA-----------KTWISHCVKSIMTYGFSL 116 (446) T ss_pred HHHHh-----------hcCCceecCc---------cHHHHHHHHHHHhhcC-----------chhHHHHHHHHHhhCcee Confidence 54443 3455666421 1223344666665321 245555678999999999 Q ss_pred EEEEECCC-C--CEE----EEEEecCceeEEEECCCcccccccc-ee------------------EEEEEcCCceEEEEc Q lcl|NC_020081. 178 FELVYDKL-G--DLH----NFKAVDASTVYVAVDEDGKERKAKD-GV------------------RYVQVIDDKVVAKFK 231 (552) Q Consensus 178 ~~i~r~~~-G--~~~----~L~~l~p~~v~v~~~~~g~~~~~~~-~~------------------~y~~~~~~~~~~~~~ 231 (552) .|+++... | .+. .+....|..++...+.++....... +. ........+....++ T Consensus 117 ~Eivw~~~~g~~~p~~~~d~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP 196 (446) T protein:vir:98 117 SEQIYAHGARDNMPATVLDDIVNYHPLQVMLIANDNGRIVDGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLP 196 (446) T ss_pred eeEEEeecccccccchhhccccccccccceeeeccCCccccccccchhhcccccccCcccchhhhhhhhcccCccccccc Confidence 99998542 2 111 1222223223333333332211100 00 000001112234567 Q ss_pred ccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCC----CHH---HHHHHH Q lcl|NC_020081. 232 AKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQ----SNQ---ALTSFR 304 (552) Q Consensus 232 ~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~----s~~---~~~~~~ 304 (552) ....+++++..++ ..+||.|.+..|.-....-....++...|...-|.|--+.+++.+... +++ +-+... T Consensus 197 ~~kfi~~~~~~~~---~~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~ 273 (446) T protein:vir:98 197 SHKRLFINYNTKG---NNPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIA 273 (446) T ss_pred ccceEEEEecCCC---CCccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHH Confidence 7787887765543 338999999999999999999999999999999999999998755421 111 111222 Q ss_pred HHHHHHhccc-cccccc-ee-eccCCceeeeccCchh-HHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccc Q lcl|NC_020081. 305 REWTSMFSGI-NGAWKI-PV-ITAEDVKFVNMTQSSK-DMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTL 380 (552) Q Consensus 305 ~~~~~~~~G~-~nagk~-~i-l~~~g~~~~~l~~~~~-d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~ 380 (552) +.+.+++... .+++.+ +. ..++|++++-++.... -..|.+..++..++|+.+....---+|... +... T Consensus 274 ~~L~~av~~~~~da~~ii~~~~~P~g~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~--------~~~G 345 (446) T protein:vir:98 274 EQAEDALRRLSTDSGLVLTQLSKEQPVQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRE--------TTFG 345 (446) T ss_pred HHHHHHHHhccccceeeeecccCCCCceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccc--------cccc Confidence 3333333322 222221 11 2366777765543322 234888889999999998755433333211 1112 Q ss_pred cchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccc-----ccc--------eeecccccChHHHHHHHHHHHHH-hcCC Q lcl|NC_020081. 381 NEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQF-----GGD--------YVFNFVGGDAKTEAEIISILESK-AKIG 446 (552) Q Consensus 381 ~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~-----~~~--------~~~~f~~~d~~~~~~~~~~~~~~-~~g~ 446 (552) +++-.+... ......+.-.++.|++.||+.|+... +.. -+++|...+.++....++..+.. ..|. T Consensus 346 S~ala~vh~-~V~~d~~~aDa~~i~~tln~~Li~~l~~lNf~~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~ 424 (446) T protein:vir:98 346 TGRASEIQL-ELFDGKINSIFDTVIHAFTEQVIGNLIRLNFDPALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGF 424 (446) T ss_pred hhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccceeccCChhhHHHHHHHHHHHHhCCc Confidence 233333333 34556678899999999999876431 110 12344444555544444444433 3466 Q ss_pred cCH---HHHHHHhCCCCCCCCCe Q lcl|NC_020081. 447 LTI---NDIRKELGYPDTEGGDV 466 (552) Q Consensus 447 lT~---NE~R~~~gl~p~~ggD~ 466 (552) +++ +.+|+.+|+|+-.. |. T Consensus 425 ~~p~~~~~ire~~giP~~~~-~~ 446 (446) T protein:vir:98 425 LVDGDKDHIRSITGLPDAIS-ST 446 (446) T ss_pred cccccHHHHHHHhCcCCCCC-CC Confidence 554 55999999976421 22 No 140 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.11 E-value=1.9e-09 Score=68.44 Aligned_cols=457 Identities=11% Similarity=0.096 Sum_probs=179.4 Q ss_pred CCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHH Q lcl|NC_020081. 3 LLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLK 82 (552) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr 82 (552) .-+=.|+++.|-..+..+.........+.....|.+.+...+..+-+.......+.-.............+......... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~ 80 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKT 80 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccccccccccc Confidence 22335677777666643333333333333333343332221100000000000010000000000000000000000000 Q ss_pred --HhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 83 --LWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 83 --~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) +++ ....+.|+...++- .-+-++.+.. .+ .+....+..|+. | .+. T Consensus 81 ~~ri~--~n~~~~ivd~~~~y-----------l~g~~~~~~~--~d-------~~~~~~l~~~~~--------n---~~~ 127 (503) T protein:vir:59 81 NNRTS--HAWHKLFVDQKTQY-----------LVGEPVTFTS--DN-------KTLLEYVNELAD--------D---DFD 127 (503) T ss_pred cceee--cchHHHHHHHHHhh-----------hhcCCeeecc--Cc-------HHHHHHHHHHHh--------c---CHH Confidence 000 11122233222211 1122222211 11 111122333332 1 345 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCc-----eEEEEcccce Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDK-----VVAKFKAKEM 235 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~-----~~~~~~~~ev 235 (552) .....+..+.+.+|.+|..+-++.+|++. +..++|..+.++.++.... .....++|+...... ....+++..+ T Consensus 128 ~~~~~~~~~~~~~G~~~~~v~~d~dg~~~-i~~~~p~~~~~i~d~~~~~-~~~~~ir~~~~~~~~~~~~~~~evy~~~~i 205 (503) T protein:vir:59 128 DILNETVKNMSNKGIEYWHPFVDEEGEFD-YVIFPAEEMIVVYKDNTRR-DILFALRYYSYKGIMGEETQKAELYTDTHV 205 (503) T ss_pred HHHHHHHHHHhhCCeEEEEEeecCCCceE-EEEEccceeEEEEeCCCCC-ceEEEEEEEEEecCCCceEEEEEEEeCCcE Confidence 56777889999999999999999998864 8889999998887653210 001122222221110 1112222222 Q ss_pred eeeccc------------------------------c-cCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Q lcl|NC_020081. 236 AWEVSN------------------------------P-RTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTR 284 (552) Q Consensus 236 i~~~~~------------------------------~-~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~ 284 (552) .++... + ...-.....|.|-++.+...++....+.....+.+...+.|- T Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~ 285 (503) T protein:vir:59 206 YYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIV 285 (503) T ss_pred EEEEEcCCcccccccccccccccceeecceeccCCccceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCe Confidence 221100 0 000011245888787777777776666666666666677776 Q ss_pred eEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHh Q lcl|NC_020081. 285 GLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEI 364 (552) Q Consensus 285 gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l 364 (552) .++. + ....+ ...+...+.. +++ +..+++.+.+.+........+....+.+.+.|...-++|..-. T Consensus 286 ~v~~--g-~~~~~--~~~~~~~~~~--------~~~-~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~ 351 (503) T protein:vir:59 286 YVLK--N-YDGEN--PKEFTANLRY--------HSV-IKVSGDGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSP 351 (503) T ss_pred eEee--c-CCccc--cchhhhhhhc--------ccc-eeccCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCc Confidence 5554 3 21121 1122222211 122 2333333444444444445556777777788877777764322 Q ss_pred cccccccccccccccccc--h---hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc--cceeecccccChHHHHHHHH Q lcl|NC_020081. 365 NFPNRGGATGHSGNTLNE--G---SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG--GDYVFNFVGGDAKTEAEIIS 437 (552) Q Consensus 365 g~~~~~t~~~~~~~~~~~--~---n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~--~~~~~~f~~~d~~~~~~~~~ 437 (552) +... ++ .++....+ . ......+..+...|+-+++.|...++..-...+. ..+.+.|.+.-+.+..+.++ T Consensus 352 ~~~~-~~---~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~~~ 427 (503) T protein:vir:59 352 ETIG-GG---ATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRIQNDSEIVQ 427 (503) T ss_pred cccc-cc---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCCCCCHHHHHH Confidence 1100 11 11111000 0 0112233344555555555555444432222211 34677787777777777776 Q ss_pred HHHH-HhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCc Q lcl|NC_020081. 438 ILES-KAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSF 516 (552) Q Consensus 438 ~~~~-~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (552) ++.+ +.+|+|+...+.++++.-+-+ + ..+..+...+.... +.. .......++.+ +..+ +..... T Consensus 428 ~~~kl~~~GiiS~et~l~~l~~v~d~--~--------~E~~ri~~E~~~~~-~~~--~~~~~~~~~~~-~~~~-~~~~~~ 492 (503) T protein:vir:59 428 SLVQGVTGGIMSKETAVARNPFVQDP--E--------EELARIEEEMNQYA-EMQ--GNLLDDEGGDD-DLEE-DDPNAG 492 (503) T ss_pred HHHHHHhCCCCchHHHHHhCCCCCCH--H--------HHHHHHHHHHHHHH-hhh--ccccCccCCCC-CCCc-CCCCCC Confidence 5544 446889998888877542210 0 11111111111000 000 00000000000 0000 000000 Q ss_pred ccccCCCCccc Q lcl|NC_020081. 517 NQNVGKDGQSK 527 (552) Q Consensus 517 ~~~~~~~~~~~ 527 (552) ....++.++.- T Consensus 493 ~~~~~~~g~~~ 503 (503) T protein:vir:59 493 AAESGGAGQVS 503 (503) T ss_pred cccCCCCCCcC Confidence 00011111110 No 141 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.05 E-value=3.7e-09 Score=66.80 Aligned_cols=377 Identities=12% Similarity=-0.010 Sum_probs=163.4 Q ss_pred ccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHh Q lcl|NC_020081. 67 EAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEK 146 (552) Q Consensus 67 ~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~ 146 (552) ..|.... .-+..+++.+..++. +-|+.+.++. ....||. ..+.+ ....+.+++.. T Consensus 1 ~l~~~~~--~~~~~~~~~~v~n~~-~~ivd~~~~~-----------l~~~gf~--~~d~~---------~~~~~~~i~~~ 55 (434) T protein:vir:98 1 MLPKNAE--QAFLDFQRKARTNFC-GLIANASVHR-----------LLALGVT--GPDGE---------PDTRASRWWQA 55 (434) T ss_pred CCCCCcc--HHHHHhhhhhhccch-HHHHHHHHhh-----------hccCcee--cCCCc---------hHHHHHHHHHh Confidence 1111111 122233333323332 3333322221 1122332 22211 11223444433 Q ss_pred cCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCE------EEEEEecCceeEEEECCCcccccccceeEEEE Q lcl|NC_020081. 147 TGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDL------HNFKAVDASTVYVAVDEDGKERKAKDGVRYVQ 220 (552) Q Consensus 147 ~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~------~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~ 220 (552) | .+......+..+.+++|.+|+.+.++..|.. ..+.+++|..+.++.++..+... ..++|+. T Consensus 56 -------N---~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~--~ai~~~~ 123 (434) T protein:vir:98 56 -------N---RLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPL--VGLKVWH 123 (434) T ss_pred -------c---ChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceE--EEEEEEE Confidence 1 2345666788999999999999988765432 23778899999888875432211 1122211 Q ss_pred Ec-CCceE-EEEccc-------------------------------------c--eeeecccccCCccCCcccccHHHHH Q lcl|NC_020081. 221 VI-DDKVV-AKFKAK-------------------------------------E--MAWEVSNPRTDLTVGKYGYPELEIA 259 (552) Q Consensus 221 ~~-~~~~~-~~~~~~-------------------------------------e--vi~~~~~~~~~~~~g~~G~spl~~~ 259 (552) .. ++... ..+..+ . +++++.++. .+ ..|.|-++.+ T Consensus 124 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~---~~-~~g~sd~e~v 199 (434) T protein:vir:98 124 NDIDGFGYARVFFDDTSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPD---LG-EDPEPEFAGV 199 (434) T ss_pred eccCCceEEEEEEeCcEEEEEEeeccccccccccccceecccccccccCCCCccceEEeccCCC---cC-cCCcchhhhH Confidence 11 11100 000000 0 222222221 11 3588888888 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHH--HHHHHHHHHHHHhccccccccceeeccCCceeeeccCch Q lcl|NC_020081. 260 LNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQ--ALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSS 337 (552) Q Consensus 260 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~--~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~ 337 (552) ...++....+.........-.+.|.-+|. + ....+. ........++. +.. ..+++.++.+++.++.++.... T Consensus 200 i~liDa~~~~~s~~~~~~~~~a~p~~~i~--G-~~~~~~~~~~~~~~~~~~~-~~~--~~~~i~~~~~~~~~~~q~~~~~ 273 (434) T protein:vir:98 200 LDIQDRVNLGILNRMAASRFSGFRQKWIK--G-HKFAKRTDPATGMTVVDQP-FVP--SPSAVWASEGENTQFGQLDATD 273 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHhcchhhhhc--C-CCcccccccccccchhhhh-hhc--cccccccCCCCCceEEEecCcc Confidence 88888877776665555555556654443 1 111111 00111122221 211 1233444455678877765433 Q ss_pred hHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----- Q lcl|NC_020081. 338 KDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI----- 412 (552) Q Consensus 338 ~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L----- 412 (552) . -.|++.++..+..|+..=++|++.+|.... +.++ ..+..+...+...+ .-..+.+...|.+.+ T Consensus 274 ~-~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~----n~Sg-----~Al~~~~~~l~~k~-~~k~~~f~~~l~~~~rl~~~ 342 (434) T protein:vir:98 274 L-SGFLKEHASDVRDMLTISQTPTYLYATDLV----NISA-----DTIGALDILHVAKV-REHIASFSEGLESVLALAAA 342 (434) T ss_pred h-HHHHHHHHHHHHHHhcccCCCHHHhccccC----ChHH-----HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Confidence 3 337788888999999999999999973211 1111 11111111111111 111122222222111 Q ss_pred -C--cccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhcccccccccc Q lcl|NC_020081. 413 -V--SQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQR 489 (552) Q Consensus 413 -~--~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~ 489 (552) . +.....+.+.|.+..+.+.++.++++.+.....++..-+++++|+++-+ +..+.+....+..... T Consensus 343 ~~g~~~~~~~~~v~w~~~~~~s~~~~ada~~kl~~~g~~~e~~~~~lg~~~~e----------~~r~~~e~~~~~~~~~- 411 (434) T protein:vir:98 343 QAGVPEDYTEAEVRWANPAHVTMAVKADAATKLKSIGYPLDVIAEELDESPAR----------VRRIVAGAASQALLAA- 411 (434) T ss_pred hcCCChhheeeeEEecCCCCCCHHHHHHHHHHHHhcCCcHHHHHHhCCCCHHH----------HHHHHHHHHHHHHHHH- Confidence 1 1111245667777777777777776655544346777788888876521 0011100000000000 Q ss_pred CCCCCccCcccCCCCCCCCCCCCCCCcccccCCCC Q lcl|NC_020081. 490 QMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDG 524 (552) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (552) ....+.+.+..... ..+..++|| T Consensus 412 ------~~~~~~~~~~~g~~------~~~~~~~dg 434 (434) T protein:vir:98 412 ------SLLPAPGAPSAGNV------PDSGGAVDG 434 (434) T ss_pred ------hhhccCCCCCCCCC------CcccCCCCC Confidence 00000000000000 111112222 No 142 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.01 E-value=4.9e-09 Score=66.16 Aligned_cols=409 Identities=12% Similarity=0.060 Sum_probs=165.9 Q ss_pred hccccccccccccccccccccccccCCcccccccCCCCchHH--------------HHHHHhhcc-hHHHHHH--HHHHH Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLL--------------QMLKLWSRK-NIILNAI--IITRV 99 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~Lr~~a~~-~~i~~a~--i~~~~ 99 (552) || +-.|-.|.-........... ..++++-++ ..++..- ...+. T Consensus 1 ~~--------------------~~~~~~~~~p~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~ 60 (453) T protein:vir:39 1 MK--------------------YKPPKLMTFPKDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPTKDLWKP 60 (453) T ss_pred Ce--------------------ecCCcceEcCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCCccccCc Confidence 11 00011111111111111100 111111111 0010000 00000 Q ss_pred -HH-HHHHHHHHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCC Q lcl|NC_020081. 100 -NQ-VSMFCTPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDK 175 (552) Q Consensus 100 -~~-~~~~~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gn 175 (552) .. +.-+++.+....++ +|-.+.+.. . +......+.+++.. | .+......+..+.+.+|. T Consensus 61 ~~ki~~n~~~~ivd~~~~~l~g~~~~~~~-----~--d~~~~~~l~~i~~~-------N---~~~~~~~~~~~~~~~~G~ 123 (453) T protein:vir:39 61 DNRLTVNFTKYIVDTFTGYFNGIPVKKSH-----S--DKETLSKLQEFDNL-------N---DMEDEESELAKMACIYGR 123 (453) T ss_pred cceeecchHHHHHHHHhhhhcccCceecc-----C--ChHHHHHHHHHHHh-------c---ChhHHHHHHHHHHhhcCe Confidence 00 00111111111111 111111110 0 11122334444432 1 344567778899999999 Q ss_pred eeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCce-EEEEcccceeeecc------------cc Q lcl|NC_020081. 176 INFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKV-VAKFKAKEMAWEVS------------NP 242 (552) Q Consensus 176 a~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~-~~~~~~~evi~~~~------------~~ 242 (552) +|..+.++.+|.+ .+..++|..+.++.++..... ....++|+...+... ...++++.+.++.. |+ T Consensus 124 ~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~-~~~~ir~~~~~~~~~~~~~yt~~~i~~~~~~~~~~~~~~~~~~~ 201 (453) T protein:vir:39 124 AFELLYQNEETQT-NVIYNTPENMFMVYDDTIKQE-PLFAVRYGYDDDYKLYGEVYTKETTYALNGTMGFYNMTEQAPNP 201 (453) T ss_pred EEEEEEecCCCce-EEEEEcccceEEEecCCCCCe-EEEEEEEEEeCCeEEEEEEEeCCeEEEEEecCCceeeecccccC Confidence 9999999999875 467789999988876543210 001111211111100 11112222111110 10 Q ss_pred cC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccccc Q lcl|NC_020081. 243 RT-----DLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGA 317 (552) Q Consensus 243 ~~-----~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~na 317 (552) -. .-.+...|.|-++.+...++....+..-..+.+...+.|-.++. + ...+++....++.. ..-...+. T Consensus 202 ~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~--g-~~~~~~~~~~~~~~---~~~~~~~~ 275 (453) T protein:vir:39 202 FDDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFL--G-AAVEEEDLKNIRSN---RVINYYGE 275 (453) T ss_pred CCceeEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeee--c-CCCCchhhhhhhhc---ceeeecCC Confidence 00 00112468888887777776666666655556666667765554 3 23344444433321 11011100 Q ss_pred ccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccch-----hHHHHHHHH Q lcl|NC_020081. 318 WKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEG-----SSAEKYRNS 392 (552) Q Consensus 318 gk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~-----n~e~~~~~~ 392 (552) .-.+.+.++..++.......+....+.+.+.|+..-++|..-.+ .+++.++....+. ......+.. T Consensus 276 ----~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-----~~gn~Sg~Al~~~~~~l~~ka~~~~~~ 346 (453) T protein:vir:39 276 ----SSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDE-----SFGSSSGVSLAYKLQAMSNLALSFQRK 346 (453) T ss_pred ----CCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc-----cccCChHHHHHHHHHHHHHHHHHHHHH Confidence 01122333444444445566777888899999999888853221 1222222221111 111223344 Q ss_pred HHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeecccc Q lcl|NC_020081. 393 KDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVH 472 (552) Q Consensus 393 ~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n 472 (552) +..+|+.+++.+...++..-....-.++.+.|.+..+.+.++.++++... .|+++.--+.++++.-+-+. T Consensus 347 ~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl-~g~is~et~l~~l~~v~D~~--------- 416 (453) T protein:vir:39 347 FQSSLNSRYKLYCELSTNVSNKEAWKDIEYTFTRNEPKDIKEQAETANIL-MGITSQETALSVISVIPDVQ--------- 416 (453) T ss_pred HHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCcCHHHHHHHHHHH-hccCChHHHHHhCCCCCCHH--------- Confidence 45566666665554443321111123567778777777777777666554 58899888888776422110 Q ss_pred ccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCC Q lcl|NC_020081. 473 VQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKD 514 (552) Q Consensus 473 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (552) ..+..+.+.+.... . ..........+.++..+.. +++ T Consensus 417 -~E~~ri~~E~~~~~-~--~~~~~~~~~~~~~~~~~~~-~~e 453 (453) T protein:vir:39 417 -AEMEKIKKEEASTA-I--FDKDKQPSEKGTDTVVPET-NEE 453 (453) T ss_pred -HHHHHHHHHHHHHH-H--HHHhccCCCCCCCCCCCCc-CCC Confidence 11111111110000 0 0000011111111111111 111 No 143 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=98.94 E-value=1.2e-08 Score=63.97 Aligned_cols=397 Identities=12% Similarity=0.065 Sum_probs=163.4 Q ss_pred cccccccccccccccccccCCcccccccCCCCchHHHHHHHhh-----------------cch-HHHH--------HHHH Q lcl|NC_020081. 43 TKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWS-----------------RKN-IILN--------AIII 96 (552) Q Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a-----------------~~~-~i~~--------a~i~ 96 (552) -|-+. |+++.-+.-..- +.+.|.++. ++. .|+. +... T Consensus 1 ~~~~~---------------~~~~~~~~~~~~--~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~k 63 (452) T protein:vir:36 1 MKYKP---------------PKLMTFSKDEPI--TVEVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPAKDSWKPDNR 63 (452) T ss_pred CcccC---------------ceeEEcCCccCC--CHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCccccccCccce Confidence 01111 111111110000 112222211 110 0000 0000 Q ss_pred HHHHHHHHHHHHHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcC Q lcl|NC_020081. 97 TRVNQVSMFCTPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYD 174 (552) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~G 174 (552) +. .-+++.+....++ +|-.+.+.. . +......+.+++.. | .+......+..+.+.+| T Consensus 64 i~----~n~~~~ivd~~~~~l~g~~~~~~~-----~--d~~~~~~l~~~~~~-------n---~~~~~~~~~~~~~~~~G 122 (452) T protein:vir:36 64 LA----VNFTKYIVDTFTGYFNGIPVKKSH-----S--DKEILTKLQEFDNL-------N---DMEDEESELAKMACIYG 122 (452) T ss_pred ee----cchHHHHHHHHhhhhcccCceeec-----C--ChhHHHHHHHHHhh-------c---ChhHHHHHHHHHHHhcC Confidence 00 0011111111111 111111111 1 11112334444432 1 24456677889999999 Q ss_pred CeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCce-EEEEcccceeeecc------------c Q lcl|NC_020081. 175 KINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKV-VAKFKAKEMAWEVS------------N 241 (552) Q Consensus 175 na~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~-~~~~~~~evi~~~~------------~ 241 (552) .+|..+.++.+|.+. +..++|..+.++.++...... .-.++|+....... ...++.+.++++.. | T Consensus 123 ~~~~~v~~d~~g~~~-i~~~~p~~~~~v~d~~~~~~~-~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~ 200 (452) T protein:vir:36 123 RAFEFLYQDEDTQTN-VVYNSPENMFMVYDDTVKQEP-LFAVRYGVDEDKKLQGEVYTLLETIKISGENDEISFGEGTYN 200 (452) T ss_pred eEEEEEEecCCCeeE-EEEEcccceEEEEcCCCCCce-EEEEEEEEecCceEEEEEEecCeEEEEEEcCCceEEecceec Confidence 999999999888864 778899999888776432110 01112222111111 11122222211100 0 Q ss_pred cc-----CCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccc Q lcl|NC_020081. 242 PR-----TDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGING 316 (552) Q Consensus 242 ~~-----~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~n 316 (552) +. ..-.+...|.|-++.....++....+..-..+.+...+.|-.++. + ...+++....++. T Consensus 201 ~~g~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~--g-~~~~~~~~~~~~~----------- 266 (452) T protein:vir:36 201 PYPDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFL--G-AAVEEEDLKNIRS----------- 266 (452) T ss_pred cCCcccEEEecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEee--c-CCcCchhhhhhhh----------- Confidence 00 000112367787877777777666666666666666667755553 3 2334433332221 Q ss_pred cccceeecc----CCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccc--h---hHHH Q lcl|NC_020081. 317 AWKIPVITA----EDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNE--G---SSAE 387 (552) Q Consensus 317 agk~~il~~----~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~--~---n~e~ 387 (552) +++..+.. .+.++.-+........+....+.+.+.|+..-++|..-.+ .+++.++....+ . .-.. T Consensus 267 -~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-----~~gn~Sg~Al~~~~~~l~~k~~ 340 (452) T protein:vir:36 267 -NRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDE-----SFGSSSGVSLAYKLQAMSNLAL 340 (452) T ss_pred -cceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccccCcc-----cccCCcHHHHHHHHHHHHHHHH Confidence 01112211 1223333444445666778888999999999999853221 122222221111 1 1112 Q ss_pred HHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCee Q lcl|NC_020081. 388 KYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVT 467 (552) Q Consensus 388 ~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~ 467 (552) ..+..+..+|+.+++.|...++.+-......++.+.|.+.-+.+.++.++++.+. .|+++.--+.++++.-+-+ + T Consensus 341 ~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~a~~~~k~-~g~iS~et~~~~~~~~~d~--~-- 415 (452) T protein:vir:36 341 SFQRKFQSSLNSRYKLFCELSTNVSNKDSWKDIEYTFTRNEPKDIKEQAETANIL-MGITSQETALSVISVIPDV--Q-- 415 (452) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCcCHHHHHHHHHHH-hccCChHHHHHhCCCCCCH--H-- Confidence 2334445556555555554443321111113466778777676777766666554 5889987777777642211 0 Q ss_pred eccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcc Q lcl|NC_020081. 468 LAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQS 526 (552) Q Consensus 468 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (552) ..+..+.+.+...... ..+.....++.++..+.++ ++ T Consensus 416 ------~E~~ri~~E~~~~~~~---~~~~~~~~~~~~~~~~~~~-------------~e 452 (452) T protein:vir:36 416 ------AEMEKIKKEEASTAIF---DKDKQPSEKGTDTVVSETN-------------EE 452 (452) T ss_pred ------HHHHHHHHHHHHHHHH---HhhccCCCCcccccCcccc-------------CC Confidence 1111111111000000 0000000001011000000 00 No 144 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=98.92 E-value=9.9e-09 Score=64.47 Aligned_cols=471 Identities=11% Similarity=0.085 Sum_probs=192.3 Q ss_pred CCCCC--CCcccccchhhcccccCcccccccccchhhhhcccccccccc---ccccccccccccccCCcccccccCCCCc Q lcl|NC_020081. 1 MGLLD--GFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNK---PKAYEEPIIGSMSMNPDFKEAPSIHGKQ 75 (552) Q Consensus 1 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (552) |--+- .+...+.|+.+|. +.+-.+ +.++.-.+. ++....|+.-. ....+|+. ....+.. T Consensus 1 ~~~~~~w~~~de~~~~~~~~---~~~~~~-----------~~p~~~dG~s~i~~~~~~~~~~~-~~~~~~~g-g~~~n~~ 64 (533) T protein:vir:58 1 MPSLEKYKKLNEAVNFTNFL---SPMYGM-----------GAPHGAGGSSMIPINMYHPFATA-GYASRFYG-GIEFNRF 64 (533) T ss_pred CCCcchhhhhhHHHHHHHhh---chhhcc-----------cCccCCCCCccccCCCCcchhhh-hhhhhhhc-cccccHH Confidence 32222 0122222222221 111111 011111111 11111111111 11222221 2233334 Q ss_pred hHHHHHHHhhc-chHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCC Q lcl|NC_020081. 76 NLLQMLKLWSR-KNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDF 154 (552) Q Consensus 76 ~~~~~Lr~~a~-~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn 154 (552) .+.+..|.++. .+.+-.|+-.|..+++ .+....-+.++-+.+.+ .+..-+.++ ++.++. T Consensus 65 eLI~~YR~ma~~~pEVd~AideIvneai-------v~d~~~~pV~v~l~~~e--~s~~iK~kI------~~lldf----- 124 (533) T protein:vir:58 65 FLYDMYDRMDYTDPLISTVLDIIADECT-------IPNENGNIVDVVTKDIE--LAKAILSYL------DYVINI----- 124 (533) T ss_pred HHHHHHHHhhccCcchhhHHHhhhceee-------EecCCCceeEeeccccc--ccHHHHHHH------HHHhcc----- Confidence 45566677775 4566666554433332 12223334455444433 333222222 222221 Q ss_pred ccCCHHHHHHHHHHHHHhcCCeeEEEEE-CCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEccc Q lcl|NC_020081. 155 TRDNFRSFVKKLVRDRLTYDKINFELVY-DKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAK 233 (552) Q Consensus 155 ~~~t~~~f~~~~v~d~ll~Gna~~~i~r-~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~ 233 (552) ..+.+ .+++.+++.|+.|..++- +..+.+.+|.+|||..|+.+....+...+......|.....+......+.+ T Consensus 125 -~~~~~----~~fR~WYVDGriy~Hkiik~~k~GI~elr~lDPr~i~~vr~~~t~~eyyvy~~~~~~~~s~~~~~kI~~d 199 (533) T protein:vir:58 125 -EKNAY----PIIRNMIKYGDMFLHILEKGSDGTIEKFQVVSPYIFSKRYNPETDTWYYVITDVYRNVVSGYFNEDIPEE 199 (533) T ss_pred -hhhhh----HHHHhhhhcceeEEEeccCCcccchhhheecCCeeeEEEEeeccceEEEeecccccccccCccccccchh Confidence 12333 456778888999999874 356678999999999999887654332111111111111233334567788 Q ss_pred ceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHH---HHHHH Q lcl|NC_020081. 234 EMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRR---EWTSM 310 (552) Q Consensus 234 evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~---~~~~~ 310 (552) .|+|+.... ...++.+++|-|..+...+.....++....-|=-.-+.=+-|..+.-+......+-+=++. .++.. T Consensus 200 aI~y~~SGl--~d~~~~~iisyLhkAiKp~NQLkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNk 277 (533) T protein:vir:58 200 DVIHFSHKI--DTNFFPYGRSYLESARAIWNQLRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRD 277 (533) T ss_pred heeeeeecc--ccCCCCceehhhhHHHHHHHHHHHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccc Confidence 899886553 3445688999999998888877777776654433334334455554332222211111121 11111 Q ss_pred hccccccccc----------eeec---------cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhccccccc Q lcl|NC_020081. 311 FSGINGAWKI----------PVIT---------AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGG 371 (552) Q Consensus 311 ~~G~~nagk~----------~il~---------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t 371 (552) +-=....|.+ -++. +.|.++..|... . +.-++-..+..+.+.++++||.+.|+... + T Consensus 278 lvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpGg-~-lgemeDV~YF~kkLy~ALnVP~sRl~~e~--~ 353 (533) T protein:vir:58 278 YWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQGS-K-VDLAEDVEYMLNRLISALKVPKAFIGYEG--D 353 (533) T ss_pred eEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeeecCCC-C-CCcHHHHHHHHHHHHHHhCCCeeecCCCC--C Confidence 1111112221 0110 123455555432 2 44456667899999999999999997543 2 Q ss_pred ccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccc---ccceeecccccChHHHHHHHHHHHHH------ Q lcl|NC_020081. 372 ATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQF---GGDYVFNFVGGDAKTEAEIISILESK------ 442 (552) Q Consensus 372 ~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~---~~~~~~~f~~~d~~~~~~~~~~~~~~------ 442 (552) | | .++--.....=+...|.-+..++.+.|...|.... .+++.+.|.+...-+...-.+++..+ T Consensus 354 f-g-------r~~eItRDEiKF~KFI~rLR~rF~~ll~~qLilk~iit~eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~ 425 (533) T protein:vir:58 354 V-N-------AKNTLATQDIKFNNTIKRIQGFFVEELERMVRMNKEFADQDFRLVMNRSNSIVEGERFAVIEQRIGIAER 425 (533) T ss_pred C-c-------cchhhhHHHHHHHHHHHHHHHHHHHHHhcccccccCcchhheeeeeeccchHHHHHHHHHHHHHHHHHHH Confidence 2 1 12222112222456677788888888887775321 13455666555432211111111111 Q ss_pred hcCCcCHHHHHH-HhCC----------------CCC-----CCCCeeeccccccchhhhccccccccccCCCCCccCccc Q lcl|NC_020081. 443 AKIGLTINDIRK-ELGY----------------PDT-----EGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQ 500 (552) Q Consensus 443 ~~g~lT~NE~R~-~~gl----------------~p~-----~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (552) ..+++.-.=||+ .|.+ .++ +++++ .|.-+.+ .-+++.+..... T Consensus 426 ~dpyvgk~yi~k~ILr~tdei~~q~e~ie~E~~~~~~~~~~~~~e~--~~~~~~~-------------~~~~p~~~~~~~ 490 (533) T protein:vir:58 426 LKGWVREDWIYSNILQIPYDLKPQEEVAEAAGGGGLFDTGGFGEET--TPADFLG-------------ERGSPIESPRGR 490 (533) T ss_pred hcchhhHHHHHHHHhcCChhhhHHHHHHHHhhcCCCCCCCCccccc--CCcccCc-------------cccCcccCCCCh Confidence 012221111221 1222 111 01110 0000000 000000000000 Q ss_pred CCCCCCCCCCCCCCCcccccCCCCccccccccccccccCccccccccccccC Q lcl|NC_020081. 501 TGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQGGKDDNGNVVNDWEA 552 (552) Q Consensus 501 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (552) ...+..+ .+...-....+.+...++-.+.+.+.+++ -.|-. T Consensus 491 ~~~~~~~-~~~~~~~~~~~~~~a~~~~~~~~g~~~~~----------~~~p~ 531 (533) T protein:vir:58 491 TEFDFGT-EGGEELGGELNLGGAFEEFEEETGGGEEE----------LPFPE 531 (533) T ss_pred hhHhccc-CCcccccccccccccchhhhhhcCCcccC----------CCCCC Confidence 0000000 00000001112222222211111111111 11111 No 145 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=98.91 E-value=9.3e-09 Score=64.61 Aligned_cols=423 Identities=9% Similarity=0.030 Sum_probs=154.8 Q ss_pred cccccccccccCCccc--cc--ccCCCCchHHHHHHHhhcchH-HHHHHH----HHHHH-HHHHHHHHHHhhcc----cc Q lcl|NC_020081. 51 YEEPIIGSMSMNPDFK--EA--PSIHGKQNLLQMLKLWSRKNI-ILNAII----ITRVN-QVSMFCTPARNSDK----GV 116 (552) Q Consensus 51 ~~~~~~~~~~~~~~~~--~~--~~~~~~~~~~~~Lr~~a~~~~-i~~a~i----~~~~~-~~~~~~~~~~~~~~----~~ 116 (552) -+.||.+-..-...-. .. ........-...|..+-++.- +...-+ ..+.. .+.-+++.+....+ +. T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~ 80 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAFEDQNQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAERQAVE 80 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhhhhccC Confidence 2223222110000000 00 000000000112222222111 100000 00000 01112222222211 22 Q ss_pred ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCE-------E Q lcl|NC_020081. 117 GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDL-------H 189 (552) Q Consensus 117 ~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~-------~ 189 (552) ||.+ .+. ......+.+++.. | .+..+...+..+.+++|.+|+.+.++..+.. . T Consensus 81 g~~~--~~~--------~~~~~~l~~i~~~-------N---~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~ 140 (485) T protein:vir:24 81 GFRL--GDA--------DEADEELWQWWQA-------N---NLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVP 140 (485) T ss_pred ceec--CCC--------chhHHHHHHHHHh-------c---ChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcc Confidence 3321 110 1111234444432 1 2445677889999999999999988766532 2 Q ss_pred EEEEecCceeEEEECCCcccccccceeEEEEEcCCc---eEEEEcccc-------------------------eeeeccc Q lcl|NC_020081. 190 NFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDK---VVAKFKAKE-------------------------MAWEVSN 241 (552) Q Consensus 190 ~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~---~~~~~~~~e-------------------------vi~~~~~ 241 (552) .+.+++|..+.++.++..+.... .+.++....+. ....|..+. |++++.+ T Consensus 141 ~i~~~~p~~~~~i~D~~~~~~~~--~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~ 218 (485) T protein:vir:24 141 LIRVEPPTRMYAEIDPRIGRPAK--AIRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNR 218 (485) T ss_pred eEEEeccceeEEEeeCCcCceeE--EEEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCcccEEEeccC Confidence 57889999998887654322111 01110000000 001111111 1333222 Q ss_pred ccCCccCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHH--HHHHHHHHHHhccccccc Q lcl|NC_020081. 242 PRTDLTVGKYGYPELEI-ALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQAL--TSFRREWTSMFSGINGAW 318 (552) Q Consensus 242 ~~~~~~~g~~G~spl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~--~~~~~~~~~~~~G~~nag 318 (552) + ...+++|.|.+.- +...++....+..-........+.|.-+|. + ....+... ..-...|+. ..+ T Consensus 219 ~---~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~--G-~~~~~~~~~~~~~~~~~~~------~~~ 286 (485) T protein:vir:24 219 T---RLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIF--G-IKPEEIGVDPETGQTLFDA------YLA 286 (485) T ss_pred c---ccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhc--c-CCccccccccccccchhhh------ccc Confidence 1 2345688887653 344444444333333333333345544443 2 11111000 001112221 122 Q ss_pred cceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccch-----hHHHHHHHHH Q lcl|NC_020081. 319 KIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEG-----SSAEKYRNSK 393 (552) Q Consensus 319 k~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~-----n~e~~~~~~~ 393 (552) ++.++.+++.++.++.....+ .+++..+..+..++..=++|++.+|.......+| ....+. .-.+..+..+ T Consensus 287 ~i~~~~~~~~~~~q~~~~~~e-~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg---~Al~~~~~~l~~ka~~~~~~f 362 (485) T protein:vir:24 287 RILAFEDAEGKIQQFSAAELA-NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASA---EAIRAAESRLIKKVERKNAIF 362 (485) T ss_pred ceeccCCCCceEEeecccchH-HHHHHHHHHHHHHhcccCCCHHHhccccCcchHH---HHHHHHHHHHHHHHHHHHHHH Confidence 334444567777766543332 3677777888888888899999997432111111 000000 0011222223 Q ss_pred HHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHH-Hhc--CCcCHHHHHHHhCCCCCCCCCeeecc Q lcl|NC_020081. 394 DKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILES-KAK--IGLTINDIRKELGYPDTEGGDVTLAG 470 (552) Q Consensus 394 ~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~-~~~--g~lT~NE~R~~~gl~p~~ggD~~~~~ 470 (552) ...|+-+++.+...++..-.+.....+.+.|.+..+.+..+.+..+.+ +.+ |+++..-+++++|+.+-+- T Consensus 363 ~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~------- 435 (485) T protein:vir:24 363 GGAWEEAMRLAYRLMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAER------- 435 (485) T ss_pred HHHHHHHHHHHHHHhcCCCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHH------- Confidence 334443333332221111011111345677765555555554443332 233 4678777788888754210 Q ss_pred ccccchhhhccccccccc--cCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCc Q lcl|NC_020081. 471 VHVQRLGQIMQQEQVEYQ--RQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQ 525 (552) Q Consensus 471 ~n~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (552) ..+......+..... ...-.+ .....++.+..+..++++... .+.|+- T Consensus 436 ---~e~~~~~ee~~~~~~~~~~~~~~-~~~~~~~~~~~~e~~~~~~~~---~~~~~a 485 (485) T protein:vir:24 436 ---EEMRRWDEEEAAMGLGLLGTMVD-ADPTVPGSPNPTPAPKPQPAI---EGGDSA 485 (485) T ss_pred ---HHHHHHHHHHhhhhhhHHHhhcc-cCCCCCCCCCCCCCCCCccCC---CCCCCC Confidence 001111111000000 000000 000000000000000000000 011110 No 146 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=98.90 E-value=1e-08 Score=64.38 Aligned_cols=412 Identities=8% Similarity=0.004 Sum_probs=149.8 Q ss_pred cccccccccccCCcccccccCCCCchH---H--------------HHHHHhhcchH-HHHHHHHH----HH-HHHHHHHH Q lcl|NC_020081. 51 YEEPIIGSMSMNPDFKEAPSIHGKQNL---L--------------QMLKLWSRKNI-ILNAIIIT----RV-NQVSMFCT 107 (552) Q Consensus 51 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~--------------~~Lr~~a~~~~-i~~a~i~~----~~-~~~~~~~~ 107 (552) -+.|+ +...-.... . ..|..+-++.- +...-+.+ +. ..+.-+++ T Consensus 1 ~~~~~-------------~~~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~ 67 (486) T protein:vir:42 1 MTAPL-------------PGMEEIEDPAVVREEMISAFEDASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPR 67 (486) T ss_pred CCCCC-------------CCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHH Confidence 11111 111111111 1 11111111110 00000000 00 00111122 Q ss_pred HHHhhcc----ccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEEC Q lcl|NC_020081. 108 PARNSDK----GVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYD 183 (552) Q Consensus 108 ~~~~~~~----~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~ 183 (552) .+....+ ..||.+ . +. ......+.+++.. | .+......+..+++++|.+|+.+.++ T Consensus 68 ~iVd~~~~~l~~~g~~~--~--~~------~~~~~~~~~i~~~-------N---~~d~~~~~~~~~a~~~G~ay~~v~~~ 127 (486) T protein:vir:42 68 LYVDSVAERQAVEGFRL--G--DA------DEADEELWQWWQA-------N---NLDIEAPLGYTDAYVHGRSFITISKP 127 (486) T ss_pred HHHHHHHhhhcccceec--C--CC------chhHHHHHHHHHh-------c---ChhHHHHHHHHHHhhcCceEEEEecC Confidence 2211111 122211 0 00 0111223344332 1 23345667889999999999999887 Q ss_pred CCCC-------EEEEEEecCceeEEEECCCcccccccceeEEEEEcCCce---EEEEcccce------------------ Q lcl|NC_020081. 184 KLGD-------LHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKV---VAKFKAKEM------------------ 235 (552) Q Consensus 184 ~~G~-------~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~---~~~~~~~ev------------------ 235 (552) ..|. ...+.+++|..+.++.++....... .++|+....+.. ...|.++.+ T Consensus 128 e~~~~~~~~~~~~~i~~~~p~~~~~i~d~~~~~~~~--~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h 205 (486) T protein:vir:42 128 DPQLDLGWDQNVPIIRVEPPTRMHAEIDPRINRVSK--AIRVAYDKEGNEIQAATLYTPMETIGWFRADGEWAEWFNVPH 205 (486) T ss_pred CcccccccCCCeeEEEEecccceEEEEeCCCCCeEE--EEEEEEecCCCeEEEEEEEcCCcEEEEEecCCcEEeecceec Confidence 6443 2357788999998887743221111 111111111111 011222211 Q ss_pred -------eeecccccCCccCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHH--HHHHH Q lcl|NC_020081. 236 -------AWEVSNPRTDLTVGKYGYPELEI-ALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQAL--TSFRR 305 (552) Q Consensus 236 -------i~~~~~~~~~~~~g~~G~spl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~--~~~~~ 305 (552) ++++.+ ....+.+|.|-++- +...++....+..-......-.+.|..+|. + ....+... .+-.. T Consensus 206 ~~g~vPvv~~~n~---~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~--G-~~~~~~~~~~~~~~~ 279 (486) T protein:vir:42 206 GLGVVPVVPLPNR---TRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIF--G-IKPEEIGVDSETGQT 279 (486) T ss_pred CCCCceEEEeccc---cccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhh--c-CCccccccccccccc Confidence 222211 12244688886652 333333333332222222333334443332 1 11111000 01111 Q ss_pred HHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchh- Q lcl|NC_020081. 306 EWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGS- 384 (552) Q Consensus 306 ~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n- 384 (552) .|.. ..+++.++...++++.++..... -.+++..+..+..+|..=++|++.+|.......+ +....+.. T Consensus 280 ~~~~------~~~~~~~~~~~~~~~~q~~~~~~-e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~S---g~Al~~~~~ 349 (486) T protein:vir:42 280 LFDA------YLARILAFEDAEGKIQQFSAAEL-ANFTNALDQIAKQVAAYTGLPPQYLSTAADNPAS---AEAIRAAES 349 (486) T ss_pred hhhh------hhchhcccCCCCceEEeecccCH-HHHHHHHHHHHHHHhcccCCCHHHhccccCchhH---HHHHHHHHH Confidence 1221 12333344445677766654332 2377888888888999999999998743221111 11111110 Q ss_pred ----HHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHH-hc--CCcCHHHHHHHhC Q lcl|NC_020081. 385 ----SAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESK-AK--IGLTINDIRKELG 457 (552) Q Consensus 385 ----~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~-~~--g~lT~NE~R~~~g 457 (552) .-+..+..+...|.-+++.+....+..-.+.....+.+.|.+..+.+.++.++++.+. .+ |+++..-+++++| T Consensus 350 ~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg 429 (486) T protein:vir:42 350 RLIKKVERKNLMFGGAWEEAMRIAYRIMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMG 429 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCC Confidence 0112222333444444443322222111111113456677666555555555544333 22 6788777788888 Q ss_pred CCCCCCCCeeeccccccchhhhccccccccccC-CCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcccccccccccc Q lcl|NC_020081. 458 YPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQ-MDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTP 536 (552) Q Consensus 458 l~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 536 (552) +-+-+- ..+..+...+....... ..-.......++....+..+.++ ..+.. T Consensus 430 ~~~d~~----------~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------~~~~~ 481 (486) T protein:vir:42 430 YSVKER----------EEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQ------------------PAIES 481 (486) T ss_pred CChhHH----------HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCC------------------cccCC Confidence 743210 00111000000000000 00000000000000000000000 11111 Q ss_pred ccCcc Q lcl|NC_020081. 537 QGGKD 541 (552) Q Consensus 537 ~~~~~ 541 (552) +|+.+ T Consensus 482 ~~~~~ 486 (486) T protein:vir:42 482 SGGDA 486 (486) T ss_pred CCCCC Confidence 11111 No 147 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=98.88 E-value=1.8e-08 Score=63.01 Aligned_cols=436 Identities=9% Similarity=0.039 Sum_probs=154.6 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCccccc-ccC-CCCchHHHHHHHhhc-chHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEA-PSI-HGKQNLLQMLKLWSR-KNIILNAIIITRVN 100 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~Lr~~a~-~~~i~~a~i~~~~~ 100 (552) |..-+......-.-..+....-..- ... ..++...-.|+.. ... .-.......++++.. .++ ..-|+.+.+. T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~-~~~---~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~-~~~ivd~~~~ 75 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAF-EDS---TQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGY-PRLYVDSIAE 75 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHH-HHH---HHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCc-HHHHHHHHHh Confidence 2222111111111000000000000 000 0000000001100 000 000000111121100 011 1122221111 Q ss_pred HHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEE Q lcl|NC_020081. 101 QVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFEL 180 (552) Q Consensus 101 ~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i 180 (552) .....||.+ .+ . + .....+.+++.. | .+..+...+..+.+++|.||..+ T Consensus 76 -----------~l~~~g~~~--~~--~---~---~~~~~~~~i~~~-------N---~~d~~~~~~~~~a~i~G~ay~~v 124 (485) T protein:vir:10 76 -----------RQAVEGFRF--GD--A---D---EADEELWQWWQA-------N---NLDIEAPLGYTDAYVHGRSYITI 124 (485) T ss_pred -----------hhcccceec--CC--C---c---hhHHHHHHHHHh-------c---CHhHHHHHHHHHHhhcCceEEEE Confidence 111223321 11 1 0 112334455432 1 34567778999999999999999 Q ss_pred EECCCCC-------EEEEEEecCceeEEEECCCcccccccceeEEEEEcCCce---EEEEcccce--------------- Q lcl|NC_020081. 181 VYDKLGD-------LHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKV---VAKFKAKEM--------------- 235 (552) Q Consensus 181 ~r~~~G~-------~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~---~~~~~~~ev--------------- 235 (552) .++..+. ...+.+++|..+.++.++..+.... .+.++....++. ...|..+.+ T Consensus 125 ~~~e~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~--~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~ 202 (485) T protein:vir:10 125 SRPDPQIDLGWDPNTPIIRVEPPTRMYAEIDPRIGRVSK--AIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFN 202 (485) T ss_pred eeCCcccccccCCCeeEEEEEccceeEEEEcCCCCceeE--EEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEecc Confidence 8876542 2357888999998887754332111 111111111111 111222211 Q ss_pred ----------eeecccccCCccCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHH--HH Q lcl|NC_020081. 236 ----------AWEVSNPRTDLTVGKYGYPELEI-ALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQAL--TS 302 (552) Q Consensus 236 ----------i~~~~~~~~~~~~g~~G~spl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~--~~ 302 (552) +++..+. ...+.+|.|.++- +...++....+..-......-.+.|.-+|. + ....+... .. T Consensus 203 ~~~~~g~vPvv~~~n~~---~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~--G-~~~~~~~~~~~~ 276 (485) T protein:vir:10 203 NPHGLGVVPVVPIPNRT---RLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIF--G-IKPEEIGVDPET 276 (485) T ss_pred ccCCCCcccEEEecccc---ccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHh--c-CCcccccccccc Confidence 2222221 2234688886642 333333333332222222222334443332 1 11111000 00 Q ss_pred HHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccc Q lcl|NC_020081. 303 FRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNE 382 (552) Q Consensus 303 ~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~ 382 (552) -...|+.. .+++..+.+++.++.++....- --|++..+..+.+|+.+=++|++.+|....+..+ +....+ T Consensus 277 ~~~~~~~~------~~~i~~~~~~d~k~~q~~~~~~-~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~S---g~Al~~ 346 (485) T protein:vir:10 277 GQTLFDAY------LARILAFEDAEGKIQQFSAAEL-ANFTNALDQIAKQVAAYTGLPPQYLSTAADNPAS---AEAIRA 346 (485) T ss_pred cchhhhhc------ccceeccCCCCceEEeecccch-HHHHHHHHHHHHHHhcccCCCHHHhccccCchhH---HHHHHH Confidence 01112211 2233344445677776654332 2377888888999999999999998743211110 000000 Q ss_pred h-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHH-hc--CCcCHHHHHH Q lcl|NC_020081. 383 G-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESK-AK--IGLTINDIRK 454 (552) Q Consensus 383 ~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~-~~--g~lT~NE~R~ 454 (552) . .-.+..+..+...|+.+++.+....+..-.......+.+.|.+..+.+.++.++++.+. .+ |+++..-+++ T Consensus 347 ~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~ 426 (485) T protein:vir:10 347 AESRLIKKVERKNSIFGGAWEEAMRLAYRMMKGGDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARK 426 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHH Confidence 0 00011222223333333332221111100001112456677666666666665554433 33 4788888899 Q ss_pred HhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCc Q lcl|NC_020081. 455 ELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQ 525 (552) Q Consensus 455 ~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (552) ++|+.+-+- ..+......+....... -+......++.++.++.....+....++++++. T Consensus 427 ~lg~~~~~~----------~~~~~~~ee~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 427 DMGYSIAER----------EEMRRWDEEEAAMGLGL--IGTMVDPNPTVPGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred hCCCCHhHH----------HHHHHHHHHHHHHHHHH--HHHhhccCCCCCCCCCccccccCcCCCCCCCCC Confidence 988854210 00110001000000000 000000000000000000000001111111111 No 148 >protein:vir:99088 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655692;genbank:gi:109521770;genbank:GeneID:4157810 Probab=98.87 E-value=5.5e-09 Score=65.87 Aligned_cols=476 Identities=11% Similarity=0.073 Sum_probs=208.0 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcch--HHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKN--IILNAIIITRVNQ 101 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~--~i~~a~i~~~~~~ 101 (552) |-..+--+...+.. ....+..+..-+.++++.. |+-..+.+.-..+..-=+-+.|...- .=++..+.=+.++ T Consensus 1 ma~~~lr~~rrpk~-~p~~~r~~al~aas~~i~~-----p~~~~~ks~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s 74 (629) T protein:vir:99 1 MAPTSLRIVRRPKS-EPVSTRQRALVAASQPVEN-----PGKAFRKAMGSSTRTDWQDDAWKAYDAVGELRYYVGWRSSS 74 (629) T ss_pred CCccceeeeecCCC-CChhhhhhhhhhhhhcccc-----cchhhhhhcCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhh Confidence 33222222222222 1111222233344444321 11111111111111001112222111 0112222223333 Q ss_pred HHHHHHHHHhhccccceeeeecccc-----ccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCe Q lcl|NC_020081. 102 VSMFCTPARNSDKGVGYEIRLKDPL-----QEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKI 176 (552) Q Consensus 102 ~~~~~~~~~~~~~~~~~~i~~k~~~-----~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna 176 (552) ++++ ++| ....|++ ..+. ++......+.++.+.+. -..+-..++++.+..++-+-|.+ T Consensus 75 ~Sr~-rL~----------as~idpDtg~ptg~i~-e~~~~~~~v~~~v~~i~-----gG~lgqa~lLkr~~~~ltV~GE~ 137 (629) T protein:vir:99 75 ASRV-RLI----------ASAIDPDTGLPTGSID-EDDRVGARVQQIVNQIA-----GGALGQAQLIKRVVEQLTVAGET 137 (629) T ss_pred hcee-eeE----------eeeecCCCCCCccccC-CCchhHHHHHHHHHhhc-----CChhhHHHHHHHHHhheecccce Confidence 3322 111 1111211 1111 11111123344444432 12345678999999999999999 Q ss_pred eEEEEECCC------CCEEEE-EEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCC Q lcl|NC_020081. 177 NFELVYDKL------GDLHNF-KAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVG 249 (552) Q Consensus 177 ~~~i~r~~~------G~~~~L-~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g 249 (552) |+.+.--.. |.++.- +.+-++-|.- ..+| .-.....+.+.......|++.-. .++++... T Consensus 138 wiv~~~~~~~~~d~~~~~~~eW~~vt~~ei~~--~~~~---------~~i~lP~g~~~e~~~~~d~l~Ri--W~P~Prr~ 204 (629) T protein:vir:99 138 WVAILFTDKSRLDSNGNPVPEWLALTPEEVRA--SEKK---------TIIELPTGDKHEFRDGLDGMFRV--WNPRARRA 204 (629) T ss_pred EEEEeecCCCccCCCCcchhhheeechHHhhh--ccCc---------eeEEcCCCCccceeCCCceEEEe--eCCCcccc Confidence 999884322 333333 3333333331 1111 11233334444555556666333 33455566 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCC--------------------HHHHHHHHHHHH- Q lcl|NC_020081. 250 KYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQS--------------------NQALTSFRREWT- 308 (552) Q Consensus 250 ~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s--------------------~~~~~~~~~~~~- 308 (552) .+--||+.+++..+.-...+.+...+..+.-.+-.|||.+|....+. .-+.+.|.+.|- T Consensus 205 ~e~DSpvra~l~~l~Ei~~lt~~i~aaakSRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q 284 (629) T protein:vir:99 205 REPDSPVRANLDSLKEIVRTTKTIANASKSRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQ 284 (629) T ss_pred cCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCceeEeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHH Confidence 78889999999999888888888777777777777887775432220 012334444443 Q ss_pred ---HHhcccc-ccccceeecc------CCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHH-hccccccccccccc Q lcl|NC_020081. 309 ---SMFSGIN-GAWKIPVITA------EDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSE-INFPNRGGATGHSG 377 (552) Q Consensus 309 ---~~~~G~~-nagk~~il~~------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~t~~~~~~ 377 (552) ..+..-+ .+--+||+.. ++++...+.. .-+.--+.+++..+..||....|||.. ||+..+++-|+... T Consensus 285 ~a~tAi~De~S~aA~vPiia~~P~E~i~~i~hlkf~~-ei~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWq 363 (629) T protein:vir:99 285 VAQTAYDDEDSMAALIPMFAAAPGELIKNVTHLKFDN-QVTEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQ 363 (629) T ss_pred HHhhhhcCCCCccceeeeeEeechHHhcCeeEEeecC-chhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEE Confidence 3333222 2445676643 2344444433 334445789999999999999999965 47755554443321 Q ss_pred ccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcc----cc---cceeecccccC----hHHHHHHHHHHHHHhcCC Q lcl|NC_020081. 378 NTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQ----FG---GDYVFNFVGGD----AKTEAEIISILESKAKIG 446 (552) Q Consensus 378 ~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~----~~---~~~~~~f~~~d----~~~~~~~~~~~~~~~~g~ 446 (552) -...-++-.|.|.+..|+++|++.+|.. .| .+|.+.|+-.. +.-..+.. +....|. T Consensus 364 ----------I~dedvrlHI~P~l~~ic~AlT~~~Lrp~Le~eGiDp~kYvvW~DaS~Lt~dPd~~deA~---~a~drGA 430 (629) T protein:vir:99 364 ----------IGDEDVRLHILPPVEMLCEAITNQVLRTVLMREGIDPNAYVVWHDASQLTVDPDKTDEAR---DAFDRGA 430 (629) T ss_pred ----------ecccceeeecchhHHHHHHHHHhhHHHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHH---HHHHcCC Confidence 1111245679999999999999887742 22 35777786443 22222322 3345699 Q ss_pred cCHHHHHHHhCCCCCCCCCe-------------eecccccc-chhhhccccccc-cccCCCCCccCcccCCCCCCCCCCC Q lcl|NC_020081. 447 LTINDIRKELGYPDTEGGDV-------------TLAGVHVQ-RLGQIMQQEQVE-YQRQMDANQFLAQQTGYDGNMDNVN 511 (552) Q Consensus 447 lT~NE~R~~~gl~p~~ggD~-------------~~~~~n~~-~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (552) ||-...|+.+|+.--.|-|. +-..-.+. .+.......... ...+....++...+.+.++.++.++ T Consensus 431 It~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~dE~sga~~ 510 (629) T protein:vir:99 431 ITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPNLLPTLAVLIPELADVEFPTPTVALPPAEEQDGDEEASGASR 510 (629) T ss_pred ccHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcchhhhhhhhhhhhcccccCccCCCCCccccCCCcccccCCCc Confidence 99999999999965333221 00000100 000000000000 0011111111111112222222222 Q ss_pred CCCCcccccCCCCccccccccccc----------------cccCccccccccccccC Q lcl|NC_020081. 512 GKDSFNQNVGKDGQSKQQANTNST----------------PQGGKDDNGNVVNDWEA 552 (552) Q Consensus 512 ~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~ 552 (552) +.+...++ +..+++...+..- ..|+..-.-+....|.. T Consensus 511 ~~ep~te~---d~~~~~a~~aa~~~~~~a~V~llv~RALelAGkR~r~r~~~ar~r~ 564 (629) T protein:vir:99 511 REEPDTED---DAGTDDSDQASLDSRETAMVEALVFRALELAGKRSRTRSLPYELRQ 564 (629) T ss_pred CCCCCCCC---CCcccccCCCCCCCcHHHHHHHHHHHHHHhcCCcCCChhhHHHHhc Confidence 22221111 1111111111110 01111100000111111 No 149 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=98.87 E-value=4.8e-09 Score=66.17 Aligned_cols=476 Identities=11% Similarity=0.073 Sum_probs=208.3 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcch--HHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKN--IILNAIIITRVNQ 101 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~--~i~~a~i~~~~~~ 101 (552) |-..+--+...+.. ....+..+..-+.++++.. |+-..+.+.-..+..-=+-+.|...- .=++..+.=+.++ T Consensus 1 ma~~~lr~~rrpk~-~p~~~r~~al~aas~~i~~-----p~~~~~ks~~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s 74 (629) T protein:vir:86 1 MAPTSLRIVRRPKS-EPVSTRQRALVAASQPVEN-----PGKAFRKAMGSSTRTDWQEDAWKAYDAVGELRYYVGWRSSS 74 (629) T ss_pred CCccceeeeecCCC-CChhhhhhhhhhhhhcccc-----ccchhhhhcCCCchhhhhHHHHHHHHhhhhHHHHhhhhhhh Confidence 33222222222222 1111222233344444321 11111111111111101112222111 0112222223333 Q ss_pred HHHHHHHHHhhccccceeeeecccc-----ccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCe Q lcl|NC_020081. 102 VSMFCTPARNSDKGVGYEIRLKDPL-----QEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKI 176 (552) Q Consensus 102 ~~~~~~~~~~~~~~~~~~i~~k~~~-----~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna 176 (552) ++++ ++ .....|++ ..+. ++......+.++.+.+. -..+-..++++.+..++-+-|.+ T Consensus 75 ~Sr~-rL----------~as~idpDtg~ptg~i~-e~~~~~~~v~~~v~~i~-----gG~lgqa~lLkr~~~~ltV~GE~ 137 (629) T protein:vir:86 75 ASRV-RL----------IASAIDPDTGLPTGSID-EDDRVGARVQQIVNQIA-----GGALGQAQLIKRVVEQLTVAGET 137 (629) T ss_pred hcee-ee----------EeeeecCCCCCCccccC-CCchhHHHHHHHHHhhc-----CChhhHHHHHHHHHhheecccce Confidence 3322 11 11111211 1111 11111123344444432 12345678999999999999999 Q ss_pred eEEEEECCC------CCEEEE-EEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCccCC Q lcl|NC_020081. 177 NFELVYDKL------GDLHNF-KAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVG 249 (552) Q Consensus 177 ~~~i~r~~~------G~~~~L-~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g 249 (552) |+.+.--.. |.++.- +.+-++-|.- ..+| .-.....+.+.......+++.-.. ++++... T Consensus 138 wiv~~~~~~~~~d~~~~~~~eW~~vt~~ei~~--~~~~---------~~i~lP~g~~~e~~~~~d~l~RiW--~P~Prr~ 204 (629) T protein:vir:86 138 WVAILFTDKSRLDSNGNPVPEWLALTPEEVRA--SEKK---------TIIELPTGDKHEFRDGLDGMFRVW--NPRARRA 204 (629) T ss_pred EEEEeecCCCccCCCCcchhhheeechHHhhh--ccCc---------eeeEcCCCCcceeeCCCceEEEee--CCCcccc Confidence 999884322 333333 3333333321 1111 112333444445555666663333 3455566 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCC--------------------HHHHHHHHHHHH- Q lcl|NC_020081. 250 KYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQS--------------------NQALTSFRREWT- 308 (552) Q Consensus 250 ~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s--------------------~~~~~~~~~~~~- 308 (552) .+--||+.+++..+.-...+.+...+..+.-.+-.|||.+|....+. .-+.+.|.+.|- T Consensus 205 ~e~DSpvra~l~~l~Ei~~lt~~i~aaakSRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q 284 (629) T protein:vir:86 205 REPDSPVRANLDSLKEIVRTTKTIANASKSRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQ 284 (629) T ss_pred cCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCceeeeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHH Confidence 78889999999999888888887777777777777787775432220 012334444443 Q ss_pred ---HHhcccc-ccccceeecc------CCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHH-hccccccccccccc Q lcl|NC_020081. 309 ---SMFSGIN-GAWKIPVITA------EDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSE-INFPNRGGATGHSG 377 (552) Q Consensus 309 ---~~~~G~~-nagk~~il~~------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~t~~~~~~ 377 (552) ..+..-+ .+--+||+.. ++++...+.. .-+.--+.+++..+..||....|||.. ||+..+++-|+... T Consensus 285 ~a~tAi~De~S~aA~vPiia~~P~E~i~~i~hlkf~~-ei~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWq 363 (629) T protein:vir:86 285 VAQTAYDDEDSMAALIPMFAAAPGELIKNVTHLKFDN-QVTEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQ 363 (629) T ss_pred HHhhhhcCCCCccceeeeeEeechHHhcCeeEEeecC-chhHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEE Confidence 3333222 2445676643 2344444433 334445789999999999999999965 47755554443321 Q ss_pred ccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcc----cc---cceeecccccC----hHHHHHHHHHHHHHhcCC Q lcl|NC_020081. 378 NTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQ----FG---GDYVFNFVGGD----AKTEAEIISILESKAKIG 446 (552) Q Consensus 378 ~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~----~~---~~~~~~f~~~d----~~~~~~~~~~~~~~~~g~ 446 (552) -...-++-.|.|.+..|+++|++.+|.. .| .+|.+.|+-.. +.-..+.. +....|. T Consensus 364 ----------I~dedvrlHI~P~l~~ic~AlT~~~Lrp~Le~eGiDp~kYvvW~DaS~Lt~dPd~~deA~---~a~drGA 430 (629) T protein:vir:86 364 ----------IGDEDVRLHILPPVEMLCEAITNQVLRTVLMREGIDPNAYVVWHDASQLTVDPDKTDEAR---DAFDRGA 430 (629) T ss_pred ----------ecccceeeecchHHHHHHHHHHhhHHHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHH---HHHHcCC Confidence 1111245679999999999999887742 22 35777786443 22223323 3345699 Q ss_pred cCHHHHHHHhCCCCCCCCCe-------------eecccccc-chhhhccccccc-cccCCCCCccCcccCCCCCCCCCCC Q lcl|NC_020081. 447 LTINDIRKELGYPDTEGGDV-------------TLAGVHVQ-RLGQIMQQEQVE-YQRQMDANQFLAQQTGYDGNMDNVN 511 (552) Q Consensus 447 lT~NE~R~~~gl~p~~ggD~-------------~~~~~n~~-~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (552) ||-...|+.+|+.--.|-|. +-..-.+. .+.......... ...+....++...+.+.++.++.++ T Consensus 431 It~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~dE~sga~~ 510 (629) T protein:vir:86 431 ITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPNLLPTLAVLIPELADVEFPTPTVALPPAEEQDGDEEASGASR 510 (629) T ss_pred cCHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcchhhhhhhhhhhhcccccCccCCCCCccccCCCcccccCCCc Confidence 99999999999965333221 00000100 000000000000 0011101111111112222222222 Q ss_pred CCCCcccccCCCCccccccccccc----------------cccCccccccccccccC Q lcl|NC_020081. 512 GKDSFNQNVGKDGQSKQQANTNST----------------PQGGKDDNGNVVNDWEA 552 (552) Q Consensus 512 ~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~ 552 (552) +.+...++ +..+++...+..- ..|+..-.-+....|.. T Consensus 511 ~~ep~te~---d~~~~~a~~aa~~~~~~a~V~llv~RALelAGkR~r~r~~~a~~r~ 564 (629) T protein:vir:86 511 REEPDTED---DAGTDDSDQASLDSRETAMVEALVFRALELAGKRSRTRSLPYELRQ 564 (629) T ss_pred CCCCCCCC---CCcccccCCCCCCCcHHHHHHHHHHHHHHhcCCcCCChhhHHHHhc Confidence 22221111 1111111111110 01111100000111111 No 150 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=98.87 E-value=1.3e-08 Score=63.76 Aligned_cols=433 Identities=12% Similarity=0.068 Sum_probs=186.3 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCc-ccccccCC-CCchHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPD-FKEAPSIH-GKQNLL 78 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~-~~~~~~ 78 (552) |--++ +|-++.+-.++++.-+..........+..|.+.-.. ..+ -+.......+. ..- .+...... ...... T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~-~~~-~~~~~~Yy~g~---~~i~~~~~~~~~~~~~~~~ 74 (474) T protein:vir:96 1 MIVIF-WPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPK-IDD-ITVGERYYNHD---PDVLRLAPKLDNKGEIDPL 74 (474) T ss_pred Ceeec-cCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHH-HHH-HHHHHHHhccC---Ccchhccchhccccccccc Confidence 77776 777777777776444444443333333333221000 000 00000000000 000 00000000 000000 Q ss_pred HHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCC Q lcl|NC_020081. 79 QMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDN 158 (552) Q Consensus 79 ~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t 158 (552) ..-.+++ ....+.|+...+.- . +|-.+.+.-.+ .+....+..|+.. . T Consensus 75 ~~~~ki~--~n~~~~Ivd~~~~~-----------l--~g~p~~~~~~d-------~~~~~~l~~~~~n-----------~ 121 (474) T protein:vir:96 75 KPDWRMF--TNYHQNLVDQKVAY-----------A--VANPVTFSSDD-------DKSLKTIQEVLNH-----------K 121 (474) T ss_pred ccchhcc--cchHHHHHHhhhhh-----------h--cccCceeecCc-------hHHHHHHHHHHhc-----------C Confidence 0000011 01112232222111 1 12222222111 1223445555421 2 Q ss_pred HHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCC--cccccccceeEEEEEcCCceEEEEccccee Q lcl|NC_020081. 159 FRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDED--GKERKAKDGVRYVQVIDDKVVAKFKAKEMA 236 (552) Q Consensus 159 ~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~y~~~~~~~~~~~~~~~evi 236 (552) .......+..+++.+|.+|..+.++..|++. +..++|..+.++.++. +... ..++|+..........+..+.+. T Consensus 122 ~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~-i~~~~p~~~~~v~d~~~~~~~~---~~vr~~~~~~~~~~~~yt~~~v~ 197 (474) T protein:vir:96 122 WDDKLVDILTAASNKGIEWLQPYIDENGEFK-TFRVPAEQAIPIWTNKERDTLK---AFIRYYRLDGAERVEYWTDSDVT 197 (474) T ss_pred HHHHHHHHHHHHHhcCeeEEEEEecCCCceE-EEEEcccceEEEEcCCCCCceE---EEEEEEeecCceEEEEEeCCeEE Confidence 2334556778899999999999999888865 8889999999887653 2211 12233322222111222222221 Q ss_pred eecc--------------------------cccC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_020081. 237 WEVS--------------------------NPRT-----DLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRG 285 (552) Q Consensus 237 ~~~~--------------------------~~~~-----~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~g 285 (552) +... |+-. .-.+...|.|-++.....++....+.....+.+...+.|-. T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l 277 (474) T protein:vir:96 198 YYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFIPFKNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIY 277 (474) T ss_pred EEEecCCceeeccccccccccccccccccccCCCceeEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcccee Confidence 1110 0000 00012457888888888887777777767777777777755 Q ss_pred EEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhc Q lcl|NC_020081. 286 LLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEIN 365 (552) Q Consensus 286 il~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg 365 (552) ++. +. ...+ ...+...+ ...++..+.++|.+++.++.......+....+.+.+.|+..-++|..-.+ T Consensus 278 v~~--g~-~~~~--~~~~~~~~--------~~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~ 344 (474) T protein:vir:96 278 ILK--GY-EGQD--LDEFMRNL--------KYYKAINVDGDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQD 344 (474) T ss_pred eee--cC-Cccc--ccchhhhh--------hcCceEEecCCCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCcccccc Confidence 543 32 2111 11222211 12334445455556666565555566778889999999999999864321 Q ss_pred ccccccccccccccccch--h---HHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecccccChHHHHHHHHHH Q lcl|NC_020081. 366 FPNRGGATGHSGNTLNEG--S---SAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNFVGGDAKTEAEIISIL 439 (552) Q Consensus 366 ~~~~~t~~~~~~~~~~~~--n---~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~d~~~~~~~~~~~ 439 (552) -. .+ +.++....+. . .....+..+...|+.+++.|...+.. ..+ ..+.+.|.+..+.+..+.++++ T Consensus 345 ~~-~~---n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~----~~~~~~i~i~f~~~~p~~~~e~~~~~ 416 (474) T protein:vir:96 345 KF-GN---SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKL----NIKVQDVEITFNFNVMVNELEQSQIG 416 (474) T ss_pred cc-cc---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CcccceeeEEeccCCCcCHHHHHHHH Confidence 10 01 1111111110 0 11122223444444444444332211 111 3456778777777777777765 Q ss_pred HHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcc Q lcl|NC_020081. 440 ESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFN 517 (552) Q Consensus 440 ~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 517 (552) .. +|++|...++++++. +++-+ ..+..+...+ ........ +... ....+.++ +.++ ++ T Consensus 417 ~~--ag~iS~et~~~~~~~--v~d~~--------~E~~ri~~E~-~e~~~~~~---~~~~-~~~~~~~d--~~~e-~~ 474 (474) T protein:vir:96 417 VQ--SQYLSKETVVTNHPW--VDDPV--------AELERIEQDN-IDFNKQLP---PLEG-DANGRAQD--NESE-TN 474 (474) T ss_pred Hh--cCCCchHHHHHhCCC--CCCHH--------HHHHHHHHHH-HHHHhccc---cccc-ccccccCC--Cccc-CC Confidence 43 689999888888754 22111 1111111110 00000000 0000 00001110 1111 11 No 151 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.86 E-value=2.6e-08 Score=62.18 Aligned_cols=406 Identities=11% Similarity=0.039 Sum_probs=173.2 Q ss_pred ccccccCCCCchHHHHHHHhhcc--hHHHHH---HHHHHHH--HHHHHHHHHHhhccc--cceeeeeccccccCChhHHH Q lcl|NC_020081. 65 FKEAPSIHGKQNLLQMLKLWSRK--NIILNA---IIITRVN--QVSMFCTPARNSDKG--VGYEIRLKDPLQEPNDHNKK 135 (552) Q Consensus 65 ~~~~~~~~~~~~~~~~Lr~~a~~--~~i~~a---~i~~~~~--~~~~~~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~ 135 (552) +-......-.+ -...|.++-++ ..+... ....+.+ -+.-+++.+....++ +|-.+.+.-.+. .... T Consensus 1 ~~~~~~~~~~~-r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~----~~~~ 75 (440) T protein:vir:95 1 MLAAFLGSQKQ-RLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVMEG----GSAD 75 (440) T ss_pred ChhhHHHHHHH-HHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheeccCceEeeCCC----ccHH Confidence 10000000011 12233333221 111100 0000000 011122222222222 121222211111 1122 Q ss_pred HHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccce Q lcl|NC_020081. 136 KIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDG 215 (552) Q Consensus 136 ~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~ 215 (552) ....+.+++... ........+..+.+++|.+|..+.++.+|+|. +..++|..+.++.++.+.... .-. T Consensus 76 ~~~~l~~~~~~n----------~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~-i~~~~p~~~~~~~d~~~~~~~-~~~ 143 (440) T protein:vir:95 76 QLSTIKDIEWQN----------DINALNSDLAFDASVYGRAYEYHFRDKDKVDR-VVLISPLEMFVIRDLTVEQNI-IAA 143 (440) T ss_pred HHHHHHHHHHhc----------CHhHHHHHHHHHHhhcCeEEEEEEecCCCceE-EEEEcccceEEEEcCCCCCce-EEE Confidence 333444544331 34456677889999999999999999998864 777899999998876542111 112 Q ss_pred eEEEEEcCCceEEEEcccceeeecc---------------cc-----cCCccCCcccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 216 VRYVQVIDDKVVAKFKAKEMAWEVS---------------NP-----RTDLTVGKYGYPELEIALNHLQYHDNTEVFNAR 275 (552) Q Consensus 216 ~~y~~~~~~~~~~~~~~~evi~~~~---------------~~-----~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~ 275 (552) ++|+..........++.+.++++.. |+ -..-.+...|.|-++.+...++....+.....+ T Consensus 144 i~~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~lida~~~~~s~~~~ 223 (440) T protein:vir:95 144 VHLPIYADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWWNNRFRMGDYESEISLIDAYDAGQSDTAN 223 (440) T ss_pred EEEEEecCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEeeCCCCCCCchhhhHHHHHHHHHHHHHHHH Confidence 2232222222222233333322210 00 000011235778787777777777766666666 Q ss_pred HHhccCCCceEEEeC-CCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHH Q lcl|NC_020081. 276 FFAQGGTTRGLLHIK-TGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVIC 354 (552) Q Consensus 276 ~f~ng~~p~gil~~~-~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia 354 (552) .....+.|-.+++-. .....+++....++..-.-... .... ....+++.+...++.......+....+.+.+.|+ T Consensus 224 ~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~--~~~~--~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~ 299 (440) T protein:vir:95 224 YMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLK--TGIS--TTGQQTTADASYIYKQYDVNGTEAYKNRLANDIH 299 (440) T ss_pred HHHHhhcceeeeecccccCCCCccchhhhhhccceecc--cccc--cccCCCCcceeEEeecCCHHHHHHHHHHHHHHHH Confidence 666666776555421 1122344444444432111110 0000 0111233344444444455667788889999999 Q ss_pred HHhcCCHHHhcccccccccccccccccchh---HHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecccccChH Q lcl|NC_020081. 355 SIYSIDPSEINFPNRGGATGHSGNTLNEGS---SAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNFVGGDAK 430 (552) Q Consensus 355 ~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n---~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~d~~ 430 (552) ..-++|..-.+... ++.+|.... ..++. .-...+..+...|..+++.|...++..-....+ .++.+.|.+..+. T Consensus 300 ~~s~~p~~~~~~~~-~n~Sg~Al~-~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~i~f~~~~p~ 377 (440) T protein:vir:95 300 RFSRIPNLDDDRFN-STSSGIALL-YKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPVIEANKLTFTFHPNIPQ 377 (440) T ss_pred HHhCCccccccccc-ccchHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccceEEeCCCCCC Confidence 99999975443211 111111000 00011 112223344555555555555444432222222 3467778777777 Q ss_pred HHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCC Q lcl|NC_020081. 431 TEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNV 510 (552) Q Consensus 431 ~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (552) +.++.++++.+. .|+++.--+.++++.-..+ .....+...+.... .+.....+..++...++ T Consensus 378 ~~~~~ad~~~kl-~g~iS~et~~~~l~~~d~~-----------~E~~ri~~E~~~~~------~~~~~~~~~~~~~~~~~ 439 (440) T protein:vir:95 378 DVWTEIKAYIEA-GGEISQETLMENASFTDYK-----------TEHSRILKQGGSSD------LEIGQIVGDADVGQADT 439 (440) T ss_pred CHHHHHHHHHHH-hccCcHHHHHHhCCCCCcH-----------HHHHHHHHHHHHhh------hhHHhhccCCCCCCcCC Confidence 777777766554 5888876666666431100 11111111111000 00000000000000000 Q ss_pred CCC Q lcl|NC_020081. 511 NGK 513 (552) Q Consensus 511 ~~~ 513 (552) + T Consensus 440 --e 440 (440) T protein:vir:95 440 --E 440 (440) T ss_pred --C Confidence 0 No 152 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=98.83 E-value=3.3e-08 Score=61.63 Aligned_cols=396 Identities=11% Similarity=0.028 Sum_probs=161.0 Q ss_pred cccccccCCcccccccCCCCchHHHHHHHhhcc-hHHHHH--HHHHHHH-H-HHHHHHHHHhhccc--cceeeeeccccc Q lcl|NC_020081. 55 IIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRK-NIILNA--IIITRVN-Q-VSMFCTPARNSDKG--VGYEIRLKDPLQ 127 (552) Q Consensus 55 ~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~-~~i~~a--~i~~~~~-~-~~~~~~~~~~~~~~--~~~~i~~k~~~~ 127 (552) +...+ =..+... ......-...|+++-++ ..|+.. -...+.+ . +.-+++.+....++ +|-.+.+.-. T Consensus 1 l~~~~--l~~~i~~--~~~~~~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~-- 74 (429) T protein:vir:98 1 MTKDL--LSELIQK--HRSFNLSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYFIGVPVQTSHE-- 74 (429) T ss_pred CCHHH--HHHHHHH--HHHHHHHHHHHHHHhccccccccccccccCCCcceeecchHHHHHHHHhhhhcccCceeecC-- Confidence 00000 0000000 00000001112222111 001000 0000000 0 00011111111111 1111111111 Q ss_pred cCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCc Q lcl|NC_020081. 128 EPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDG 207 (552) Q Consensus 128 ~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g 207 (552) +......+..++.. | .+......+..+.+.+|.+|..+.++.+|+| .+..++|..+.++.++.. T Consensus 75 -----~~~~~~~l~~~~~~-------n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~~~~~~p~~~~~v~dd~~ 138 (429) T protein:vir:98 75 -----NKQVSNYLELLDGY-------N---DQDDNNAELSKICSIYGHGYELVFNDENAEA-GITYLTPLEAFIVYDDSI 138 (429) T ss_pred -----ChHHHHHHHHHHhh-------c---CHhHHHHHHHHHHhhcCeEEEEEEecCCCcE-EEEEEcccceEEEEeCCC Confidence 11111234444332 1 2345677788999999999999999999986 477899999988876543 Q ss_pred ccccccceeEEEEEcCCceEEEEcccceeeec-------------ccc-----cCCccCCcccccHHHHHHHHHHHHHHH Q lcl|NC_020081. 208 KERKAKDGVRYVQVIDDKVVAKFKAKEMAWEV-------------SNP-----RTDLTVGKYGYPELEIALNHLQYHDNT 269 (552) Q Consensus 208 ~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~-------------~~~-----~~~~~~g~~G~spl~~~~~~i~~~~~~ 269 (552) ... ....++|+...+......+...+.++.. -|+ -..-.+...|.|-++.+...++....+ T Consensus 139 ~~~-~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liD~~d~~ 217 (429) T protein:vir:98 139 RQK-PLFAVRYFYNKGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMIEYVENEERQSLLASVVTLINAFNKA 217 (429) T ss_pred CCc-eEEEEEEEEecCceEEEEEEeCceEEEEEecCCceEecccccccCCccceEEecCCCCCCCcHHHHHHHHHHHHHH Confidence 211 1111222222221111111111111110 000 000012347888888888888877777 Q ss_pred HHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecc---CCceeeeccCchhHHHHHHHH Q lcl|NC_020081. 270 EVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITA---EDVKFVNMTQSSKDMEFEKWL 346 (552) Q Consensus 270 ~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~---~g~~~~~l~~~~~d~q~~e~~ 346 (552) ..-..+.+...+.|-.+++ + ...+++....++. +++..+.. ++.+...+........+.... T Consensus 218 ~s~~~~~~~~~~~p~~~i~--g-~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 282 (429) T protein:vir:98 218 ISEKANDVEYFADAYLKIL--G-AELDDETLKSLRD------------TRIINLKDTDAQQLTVEFLQKPDADATQEHLL 282 (429) T ss_pred HHHHHHHHHHhcCceeeee--c-CCCCcchhhhHhh------------CceeeccCCCCCCcceeEEeecCCHHHHHHHH Confidence 7766666777777766654 3 2334433222211 11112211 223344444444445566778 Q ss_pred HHHHHHHHHHhcCCHHHhcccccccccccccccccch--h---HHHHHHHHHHHHhhHHHHHHHHHHHhhcCccccccee Q lcl|NC_020081. 347 NYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEG--S---SAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYV 421 (552) Q Consensus 347 ~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~--n---~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~ 421 (552) +.+.+.|+..-++|..-.+ .+++.++....+. . -....+..+...|.-+++.+...++..-....-.++. T Consensus 283 ~~l~~~i~~~s~~p~~~~~-----~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~ 357 (429) T protein:vir:98 283 DRLENLIFRTAMVANISDE-----SFGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWIGIK 357 (429) T ss_pred HHHHHHHHHHhCccccCcc-----ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccce Confidence 8999999999999853221 1222221111111 0 1111223333444444444443333221111112466 Q ss_pred ecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccC Q lcl|NC_020081. 422 FNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQT 501 (552) Q Consensus 422 ~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (552) +.|.+..+.+..+.++++.+. .|+++..-+.++++.-+-+ + ..+..+...+.... +..+..-. T Consensus 358 v~f~~~~p~~~~~~a~~~~kl-~g~is~et~~~~l~~v~d~--~--------~E~~ri~~E~~~~~----~~~~~~~~-- 420 (429) T protein:vir:98 358 YKFTRNLPANLLEESQIAGNL-AGIVSEETQVGVLSIVENP--Q--------KEIERKNSDKSTLI----SRQAGGLN-- 420 (429) T ss_pred EEeCCCCCcCHHHHHHHHHHH-hccCchHHHHHhCCCCCCH--H--------HHHHHHHHHHHHHH----HHHHhhhc-- Confidence 778777777777666665554 5888887777777652211 0 11111111110000 00000000 Q ss_pred CCCCCCCCC Q lcl|NC_020081. 502 GYDGNMDNV 510 (552) Q Consensus 502 ~~~~~~~~~ 510 (552) +++++++.+ T Consensus 421 ~~~~~~~~~ 429 (429) T protein:vir:98 421 GQNTTTILE 429 (429) T ss_pred CCCCCCCCC Confidence 000111110 No 153 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=98.82 E-value=3.6e-08 Score=61.39 Aligned_cols=423 Identities=11% Similarity=0.032 Sum_probs=172.6 Q ss_pred hccccccccccccccccccccccccCCcccc------cccCCCCchHHHHHHHhhcc-hHHHHH-HHHHHHH-H-HHHHH Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKE------APSIHGKQNLLQMLKLWSRK-NIILNA-IIITRVN-Q-VSMFC 106 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~Lr~~a~~-~~i~~a-~i~~~~~-~-~~~~~ 106 (552) |+-+ ..++........+.+.-..+..-.. ....... ...+.|+++-++ ..|+.. -...+.+ . +.-++ T Consensus 1 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~-~~~~~l~~Yy~g~~~i~~~~~~~~~~~~ki~~n~~ 77 (470) T protein:vir:99 1 MKDI--NYGRDKVTGNSSFIFPKGEKLTSNELLGFIAYNETVLK-PRYRENMKLYLGKHKILTAPEKETGADNRIVVNSA 77 (470) T ss_pred Cccc--cCCcccccCCceEEeCCCCCcCHHHHHHHHHHHHHhhH-HHHHHHHHHhccccccccCcccccCCcceeecchH Confidence 3222 1111111111111111000000000 0000000 000111111110 000000 0000000 0 00011 Q ss_pred HHHHhhcc----ccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEE Q lcl|NC_020081. 107 TPARNSDK----GVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVY 182 (552) Q Consensus 107 ~~~~~~~~----~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r 182 (552) +.+....+ |-++.+...+ +.+....+.+++.. ..+......+..+.+.+|.+|..+.+ T Consensus 78 ~~Ivd~~~~~l~g~p~~~~~~~--------d~~~~~~l~~~~~~----------n~~~~~~~~~~~~~~~~G~~~~~v~~ 139 (470) T protein:vir:99 78 KYVVDVYNGYFCGIEPKLALLN--------DSSKIDEIARWNRQ----------ENFFDTINEISKQCDIFGRSIASIYQ 139 (470) T ss_pred HHHHHHHhhhhccCCeeEeeCC--------chhHHHHHHHHHHh----------cCHhHHHHHHHHHHHhcCeeEEEEEe Confidence 11111111 1111121111 11122334444432 14556778899999999999999999 Q ss_pred CCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceE----EEEcccceeeec--------------cccc- Q lcl|NC_020081. 183 DKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVV----AKFKAKEMAWEV--------------SNPR- 243 (552) Q Consensus 183 ~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~----~~~~~~evi~~~--------------~~~~- 243 (552) +.+|++ .+..++|..+.++.++.+.... .-.++|+....+... ..+..+.+.++. .|+. T Consensus 140 d~dg~~-~i~~~~p~~~~~i~d~~~~~~~-~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 217 (470) T protein:vir:99 140 GEDARP-HLMYSSPNHAFIIYDDTVQRQP-LAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAINPYG 217 (470) T ss_pred CCCCeE-EEEEEccceeEEEEcCCCCcce-EEEEEEEEEecCCeeEEEEEEEecCeEEEEEecccccccccccccccCCC Confidence 989886 4788999999988876543211 111222221111110 111111111110 0100 Q ss_pred --C--CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccccccc Q lcl|NC_020081. 244 --T--DLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWK 319 (552) Q Consensus 244 --~--~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk 319 (552) + .-.+...|.|-++.+...++....+.......+...+.|-.++. +.....++.-+ ....+.. . + T Consensus 218 ~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~--g~~~~~~~~g~-~~~~~~~-------~-~ 286 (470) T protein:vir:99 218 LVPAVEFFENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMI--GFKLPEDDEGN-PKFDFKN-------N-R 286 (470) T ss_pred ccceEeecCCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cCCcccccccc-hhhhhhh-------c-c Confidence 0 01122468888888888888777777777777777777766654 32111211111 1112211 1 1 Q ss_pred ceee----ccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccch-----hHHHHHH Q lcl|NC_020081. 320 IPVI----TAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEG-----SSAEKYR 390 (552) Q Consensus 320 ~~il----~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~-----n~e~~~~ 390 (552) ...+ .+++.++..++.......+....+.+.+.|+..-++|+...+-.. ++ .++....+. ...+..+ T Consensus 287 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n---~Sg~Ai~~~~~~l~~k~~~~~ 362 (470) T protein:vir:99 287 VLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFA-GN---SSGVALQYKLFAMKNKADSKE 362 (470) T ss_pred eeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccc-cC---chHHHHHHHHHHHHHHHHHHH Confidence 1111 123445555555555566677888999999999999975432211 11 111111000 0112222 Q ss_pred HHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeec Q lcl|NC_020081. 391 NSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLA 469 (552) Q Consensus 391 ~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~ 469 (552) ..+..+|+-+++.+...+...-....+ ..+.+.|.+..+.+..+.++++... .|+++...++++++.-. + + T Consensus 363 ~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl-~giis~et~l~~l~~vd-~--~---- 434 (470) T protein:vir:99 363 RKFDKSLMQLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNAKNA-EGIVSKKTQLGMIPDIE-P--D---- 434 (470) T ss_pred HHHHHHHHHHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHHHHH-hccCCHHHHHHhCCCCC-H--H---- Confidence 344555555555555444433222222 3467778777777777777766554 48899877777764311 0 0 Q ss_pred cccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCC Q lcl|NC_020081. 470 GVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDS 515 (552) Q Consensus 470 ~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (552) ..+..+.+.+...... ........+.....++++++ T Consensus 435 ----~E~eri~~E~~~~~~~------~~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 435 ----AEMKQIAKEKADAIKQ------TQQLSMPIDILKRDNNAEEE 470 (470) T ss_pred ----HHHHHHHHHHHHHHHH------HHhhcCCCCcCCCCCCccCC Confidence 0111111111000000 00000001111111111111 No 154 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=98.79 E-value=4.8e-08 Score=60.71 Aligned_cols=395 Identities=10% Similarity=0.045 Sum_probs=166.3 Q ss_pred cccCCc-ccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhc------------------------ Q lcl|NC_020081. 59 MSMNPD-FKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSD------------------------ 113 (552) Q Consensus 59 ~~~~~~-~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~------------------------ 113 (552) |..-|. +...|.- ...+.+.|..+ +......+.++.+..++.. T Consensus 1 ~~~~~~~~~~~~~~--~~~~~~~i~~~----------i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~ 68 (453) T protein:vir:73 1 MNLKPIKLMTYSRD--EEITDKVVNDF----------MKKHQEEVERYEYLGNMYKGIMEISSQKAKDSWKPDNRLTNNF 68 (453) T ss_pred Cccccceeeecccc--ccCCHHHHHHH----------HHHHHHHHHHHHHHHHHhccccchhcCCCCCccCccceeecch Confidence 111110 0111100 00011112111 1111111111111111111 Q ss_pred --------ccc--ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEEC Q lcl|NC_020081. 114 --------KGV--GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYD 183 (552) Q Consensus 114 --------~~~--~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~ 183 (552) +++ |-.+.+.. .+....+.+.+++.. -.+......+..+.+.+|.+|..+.++ T Consensus 69 ~~~ivd~~~~~l~g~~~~~~~-------~d~~~~~~l~~~~~~----------n~~~~~~~~~~~~~~~~G~~~~~v~~d 131 (453) T protein:vir:73 69 AKYIVDTFVGYFNGIPIKKTH-------DDKSVLEAMQLFDNL----------NDMEDEESELAKIACVYGRAYELMYQN 131 (453) T ss_pred HHHHHHHhhhhhcccCceeec-------CChHHHHHHHHHHHh----------cChhHHHHHHHHHHHhcCeEEEEEEeC Confidence 110 11111110 011122334444432 123446667889999999999999999 Q ss_pred CCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCce-EEEEcccceeeeccc--------ccCC--------- Q lcl|NC_020081. 184 KLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKV-VAKFKAKEMAWEVSN--------PRTD--------- 245 (552) Q Consensus 184 ~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~-~~~~~~~evi~~~~~--------~~~~--------- 245 (552) .+|.+. +..++|..+.++.++..... ..-.++|+...++.. ...++.+.++++... ...+ T Consensus 132 ~~~~~~-i~~~~p~~~~~v~dd~~~~~-~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~ 209 (453) T protein:vir:73 132 ESTESE-VIYCSPLNVFMVYDDSIKQK-PLFAVYYGFDEEGNLSGTVYTLLETISITGKAGEVKFGESTYNVYSDLPIVE 209 (453) T ss_pred CCCceE-EEEEcccceEEEEeCCCCce-eEEEEEEEEecCceEEEEEEeCCeEEEEEecCCceEEccceeccCCceeEEE Confidence 998864 77789999988876543221 111222322222221 112233332222110 0000 Q ss_pred ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecc Q lcl|NC_020081. 246 LTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITA 325 (552) Q Consensus 246 ~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~ 325 (552) -.+...|.|-++.+...++....+..-..+.....+.|..++. + ...+++....++..-. ........+.. ...+ T Consensus 210 ~~n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~--g-~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~ 284 (453) T protein:vir:73 210 YNFNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFL--G-AEVDEEDAKNIKDNRL-INFFDKNSNGQ-GTNA 284 (453) T ss_pred ecCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeee--c-CCCCchhhhccccccc-ccccccccccc-cccc Confidence 0112367788887777777766666656665666666765554 2 3334444444433210 00001111111 1122 Q ss_pred CCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccch-----hHHHHHHHHHHHHhhHH Q lcl|NC_020081. 326 EDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEG-----SSAEKYRNSKDKGLEPL 400 (552) Q Consensus 326 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~ 400 (552) .+.++.-++....+..+....+.+.+.|+..-++|.. +.. .+++.++....+. .-.+..+..+...|..+ T Consensus 285 ~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~--~~~---~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~ 359 (453) T protein:vir:73 285 AKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANI--SDE---NFGNSSGVALAYKLQAMSNLALSFQRKFQSALNRR 359 (453) T ss_pred cCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCccc--Ccc---cccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3445555555555666777888999999999888852 221 1222222111111 11122233445555555 Q ss_pred HHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhc Q lcl|NC_020081. 401 LKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIM 480 (552) Q Consensus 401 ~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~ 480 (552) ++.+...++..-....-.++.+.|.+..+.+..+.++++.+. .|+++..-+.+++++-+-+. ..+..+. T Consensus 360 ~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k~-~giis~et~~~~~~~~~d~~----------~E~~ri~ 428 (453) T protein:vir:73 360 YSLWSSLSTNASNKDAWKDIEYTFTRNEPKDIKEQAETANIL-KGITSEETALSVISVIPDVQ----------AEMEKIK 428 (453) T ss_pred HHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHHH-hccCcHHHHHHhCCCCCCHH----------HHHHHHH Confidence 555544333221111123567778777676677766665554 38888877777775522110 0111111 Q ss_pred cccccccccCCCCCccCcccCCCCCCC Q lcl|NC_020081. 481 QQEQVEYQRQMDANQFLAQQTGYDGNM 507 (552) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (552) +.+......+...+ . ....+..++- T Consensus 429 ~E~~~~~~~~~~~~-~-~~~~~~~~~~ 453 (453) T protein:vir:73 429 KKKLLQLSLTRTSN-L-VRMKQMRGNL 453 (453) T ss_pred HHHHHHHHHHHhcc-C-CcchhhhcCC Confidence 11100000000000 0 0000000111 No 155 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=98.78 E-value=5.2e-08 Score=60.53 Aligned_cols=419 Identities=13% Similarity=0.060 Sum_probs=173.9 Q ss_pred hhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHH----------------- Q lcl|NC_020081. 34 DAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIII----------------- 96 (552) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~----------------- 96 (552) -.++|.+..-+.. ..+......+ -... ........+..+.+-.....+..... T Consensus 1 ~~~~~~~~~~~~~---~~~~e~i~~~------i~~~-~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 70 (474) T protein:vir:10 1 MTLYKLIDDIEAQ---GILPKHIEAL------IESH-KDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRR 70 (474) T ss_pred CchHHHHhhcccc---CCCHHHHHHH------HHHh-hhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccc Confidence 1111111000000 0000000000 0000 00000000000111000000000000 Q ss_pred --HHH--HHHHHHHHHHHhhcccc--ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHH Q lcl|NC_020081. 97 --TRV--NQVSMFCTPARNSDKGV--GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDR 170 (552) Q Consensus 97 --~~~--~~~~~~~~~~~~~~~~~--~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ 170 (552) .++ ..+.-+++.+....+++ |-.+.+...+... .+.+-...+.+|+.. ..+......+..+. T Consensus 71 ~~~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~--~~e~~~~~l~~~~~~----------n~~~~~~~~~~~~~ 138 (474) T protein:vir:10 71 LDVSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAE--KNEKLKKFITNFAIR----------NSVDDEDSEIGKMA 138 (474) T ss_pred cccCcccccccchHHHHHHhHhhheeccceeEeeCCCCc--chHHHHHHHHHHHhh----------cCHhHHHHHHHHHH Confidence 000 00011222222222221 2122221111111 111122334444432 13455777888999 Q ss_pred HhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcC--Cce----EEEEcccceeeecc---- Q lcl|NC_020081. 171 LTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVID--DKV----VAKFKAKEMAWEVS---- 240 (552) Q Consensus 171 ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~--~~~----~~~~~~~evi~~~~---- 240 (552) +.+|.||..+.++.+|++ .+..++|..+.++.++.+.... .++|+.... +.. ...++...+.++.. T Consensus 139 ~~~G~a~~~~~~d~~~~~-~~~~i~p~~~~~v~d~~~~~~~---~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~ 214 (474) T protein:vir:10 139 AICGYGARLAYIDTNGDI-RIKNIDPYNVIFVGDNILEPTY---SLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGID 214 (474) T ss_pred hhcCeEEEEEEeCCCCee-EEEEEcccceEEEEcCCCceEE---EEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCC Confidence 999999999989988875 5788999999888876654321 222222111 110 01111111111110 Q ss_pred ---------cccC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_020081. 241 ---------NPRT-----DLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRRE 306 (552) Q Consensus 241 ---------~~~~-----~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~ 306 (552) |+.. .-.+...|.|-++.+...++....+..-..+.+...+.|-.+++ + ...+++....++ T Consensus 215 ~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~--g-~~~~~~~~~~~~-- 289 (474) T protein:vir:10 215 ALQEVGRYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR--G-MGMSEEMIQETQ-- 289 (474) T ss_pred cccccccccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--c-CCCCchhhhhhh-- Confidence 1000 00122467777777777777666666655555555556654443 3 233443322221 Q ss_pred HHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccch--- Q lcl|NC_020081. 307 WTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEG--- 383 (552) Q Consensus 307 ~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~--- 383 (552) ..+ ...+.+++.+++-+.....+..+....+.+.+.|...-++|..-.+-.. + +.++....+. T Consensus 290 ---------~~~-~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~---n~Sg~Al~~~~~~ 355 (474) T protein:vir:10 290 ---------KSG-AFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFN-G---NVPIIGMKLKLMA 355 (474) T ss_pred ---------hcc-eeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCccccccccc-c---cchHHHHHHHHHH Confidence 122 2344455666666665555667788889999999999999875433111 1 1111111111 Q ss_pred --hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCc--ccc-cceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCC Q lcl|NC_020081. 384 --SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVS--QFG-GDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGY 458 (552) Q Consensus 384 --n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~--~~~-~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl 458 (552) ..-...+..+..+|+-+++.|...++.+-.. +.. .++.+.|.+.-+.+..+.++++... .|++|..-+.+++++ T Consensus 356 l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl-~g~iS~et~~~~l~~ 434 (474) T protein:vir:10 356 LENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL-KGQVSERTRLGQSQL 434 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH-hccCchHHHHHhCCC Confidence 1122333455666666666666555543211 112 3467778777777777777766554 588998888888765 Q ss_pred CCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccc Q lcl|NC_020081. 459 PDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQN 519 (552) Q Consensus 459 ~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (552) -+ +-+ ..+..+...+. +... +......+ +...++++..++ T Consensus 435 v~--d~~--------~E~eri~~E~~-e~~~-----~~~~~~~~-----~~~~~~~~~~s~ 474 (474) T protein:vir:10 435 VD--DVD--------YELDEMEKESL-EFND-----KLPDIDEG-----DANDKSQNNQSE 474 (474) T ss_pred CC--CHH--------HHHHHHHHHHH-HHHh-----hcccccCC-----CcCCCCccccCC Confidence 22 100 11111111110 0000 00000000 000000000000 No 156 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=98.78 E-value=5.2e-08 Score=60.53 Aligned_cols=419 Identities=13% Similarity=0.060 Sum_probs=173.9 Q ss_pred hhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHH----------------- Q lcl|NC_020081. 34 DAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIII----------------- 96 (552) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~----------------- 96 (552) -.++|.+..-+.. ..+......+ -... ........+..+.+-.....+..... T Consensus 1 ~~~~~~~~~~~~~---~~~~e~i~~~------i~~~-~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 70 (474) T protein:vir:94 1 MTLYKLIDDIEAQ---GILPKHIEAL------IESH-KDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRR 70 (474) T ss_pred CchHHHHhhcccc---CCCHHHHHHH------HHHh-hhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccc Confidence 1111111000000 0000000000 0000 00000000000111000000000000 Q ss_pred --HHH--HHHHHHHHHHHhhcccc--ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHH Q lcl|NC_020081. 97 --TRV--NQVSMFCTPARNSDKGV--GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDR 170 (552) Q Consensus 97 --~~~--~~~~~~~~~~~~~~~~~--~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ 170 (552) .++ ..+.-+++.+....+++ |-.+.+...+... .+.+-...+.+|+.. ..+......+..+. T Consensus 71 ~~~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~--~~e~~~~~l~~~~~~----------n~~~~~~~~~~~~~ 138 (474) T protein:vir:94 71 LDVSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAE--KNEKLKKFITNFAIR----------NSVDDEDSEIGKMA 138 (474) T ss_pred cccCcccccccchHHHHHHhHhhheeccceeEeeCCCCc--chHHHHHHHHHHHhh----------cCHhHHHHHHHHHH Confidence 000 00011222222222221 2122221111111 111122334444432 13455777888999 Q ss_pred HhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcC--Cce----EEEEcccceeeecc---- Q lcl|NC_020081. 171 LTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVID--DKV----VAKFKAKEMAWEVS---- 240 (552) Q Consensus 171 ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~--~~~----~~~~~~~evi~~~~---- 240 (552) +.+|.||..+.++.+|++ .+..++|..+.++.++.+.... .++|+.... +.. ...++...+.++.. T Consensus 139 ~~~G~a~~~~~~d~~~~~-~~~~i~p~~~~~v~d~~~~~~~---~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~ 214 (474) T protein:vir:94 139 AICGYGARLAYIDTNGDI-RIKNIDPYNVIFVGDNILEPTY---SLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGID 214 (474) T ss_pred hhcCeEEEEEEeCCCCee-EEEEEcccceEEEEcCCCceEE---EEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCC Confidence 999999999989988875 5788999999888876654321 222222111 110 01111111111110 Q ss_pred ---------cccC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_020081. 241 ---------NPRT-----DLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRRE 306 (552) Q Consensus 241 ---------~~~~-----~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~ 306 (552) |+.. .-.+...|.|-++.+...++....+..-..+.+...+.|-.+++ + ...+++....++ T Consensus 215 ~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~--g-~~~~~~~~~~~~-- 289 (474) T protein:vir:94 215 ALQEVGRYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLR--G-MGMSEEMIQETQ-- 289 (474) T ss_pred cccccccccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--c-CCCCchhhhhhh-- Confidence 1000 00122467777777777777666666655555555556654443 3 233443322221 Q ss_pred HHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccch--- Q lcl|NC_020081. 307 WTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEG--- 383 (552) Q Consensus 307 ~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~--- 383 (552) ..+ ...+.+++.+++-+.....+..+....+.+.+.|...-++|..-.+-.. + +.++....+. T Consensus 290 ---------~~~-~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~---n~Sg~Al~~~~~~ 355 (474) T protein:vir:94 290 ---------KSG-AFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFN-G---NVPIIGMKLKLMA 355 (474) T ss_pred ---------hcc-eeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCccccccccc-c---cchHHHHHHHHHH Confidence 122 2344455666666665555667788889999999999999875433111 1 1111111111 Q ss_pred --hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCc--ccc-cceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCC Q lcl|NC_020081. 384 --SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVS--QFG-GDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGY 458 (552) Q Consensus 384 --n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~--~~~-~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl 458 (552) ..-...+..+..+|+-+++.|...++.+-.. +.. .++.+.|.+.-+.+..+.++++... .|++|..-+.+++++ T Consensus 356 l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl-~g~iS~et~~~~l~~ 434 (474) T protein:vir:94 356 LENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL-KGQVSERTRLGQSQL 434 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH-hccCchHHHHHhCCC Confidence 1122333455666666666666555543211 112 3467778777777777777766554 588998888888765 Q ss_pred CCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccc Q lcl|NC_020081. 459 PDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQN 519 (552) Q Consensus 459 ~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (552) -+ +-+ ..+..+...+. +... +......+ +...++++..++ T Consensus 435 v~--d~~--------~E~eri~~E~~-e~~~-----~~~~~~~~-----~~~~~~~~~~s~ 474 (474) T protein:vir:94 435 VD--DVD--------YELDEMEKESL-EFND-----KLPDIDEG-----DANDKSQNNQSE 474 (474) T ss_pred CC--CHH--------HHHHHHHHHHH-HHHh-----hcccccCC-----CcCCCCccccCC Confidence 22 100 11111111110 0000 00000000 000000000000 No 157 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=98.76 E-value=5.8e-08 Score=60.26 Aligned_cols=356 Identities=12% Similarity=0.079 Sum_probs=147.5 Q ss_pred cccCC-cccccccCCCCchHHHHHHHhhcch-HHHHHHHHH------HHHHHHHHHHHHHhhccc----cceeeeecccc Q lcl|NC_020081. 59 MSMNP-DFKEAPSIHGKQNLLQMLKLWSRKN-IILNAIIIT------RVNQVSMFCTPARNSDKG----VGYEIRLKDPL 126 (552) Q Consensus 59 ~~~~~-~~~~~~~~~~~~~~~~~Lr~~a~~~-~i~~a~i~~------~~~~~~~~~~~~~~~~~~----~~~~i~~k~~~ 126 (552) |.... .+-.+. +.....-...+..+-++. .+...-+.+ ....+.-.++.+..+.+. -||.. . T Consensus 1 ~~~~~i~~L~~~-~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~----~- 74 (409) T protein:vir:94 1 MTEKGIGYLRFK-LSVHKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFEN----D- 74 (409) T ss_pred CCHHHHHHHHHH-HHHHhHHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcccCcccC----C- Confidence 10000 000000 000000011112222111 110000000 001111122222222211 22210 0 Q ss_pred ccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCC Q lcl|NC_020081. 127 QEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDED 206 (552) Q Consensus 127 ~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~ 206 (552) + ..+.+++... .+......+..+.+++|.+|+.+..+.+|.| .+.+++|..+.++.|+. T Consensus 75 ----d------~~l~~i~~~N----------~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~-~i~~~sp~~~~~i~D~~ 133 (409) T protein:vir:94 75 ----D------FTVNEIFEEN----------NPDIFFDSAVLSSLIASCSFTYISKGENDAV-RLQVIEAVNATGIIDPI 133 (409) T ss_pred ----c------hHHHHHHHhc----------ChhHHHHHHHHHHHHhcceeEEEecCCCCce-EEEEeccceEEEEEecC Confidence 0 1244444321 2334566788899999999999999989986 68899999999888764 Q ss_pred cccccccceeEEEEEcCCc-e--EEEEcccce----------------------eeecccccCCccCCcccccHH----H Q lcl|NC_020081. 207 GKERKAKDGVRYVQVIDDK-V--VAKFKAKEM----------------------AWEVSNPRTDLTVGKYGYPEL----E 257 (552) Q Consensus 207 g~~~~~~~~~~y~~~~~~~-~--~~~~~~~ev----------------------i~~~~~~~~~~~~g~~G~spl----~ 257 (552) .+.+.. .+++......+ . ...+.++++ +++..++ ...+.+|.|.| . T Consensus 134 ~~~~~~--a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~f~n~~---~~~~~~G~s~I~e~v~ 208 (409) T protein:vir:94 134 TGLLTE--GYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGHPLLVPIIHRP---DAVRPFGRSRITRSGM 208 (409) T ss_pred CCceee--eEEEEEecCCCceEEEEEEecCcEEEEEecCceeEeeeCCCCCcceEEecccc---ccccccCccccchhHH Confidence 332211 11111110011 0 011222222 2332222 12457888855 3 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCch Q lcl|NC_020081. 258 IALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSS 337 (552) Q Consensus 258 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~ 337 (552) .+.+.+.....-......||. .|.-++. | ...+.+..+.++......+.-.++. .++++++.++.... T Consensus 209 ~l~da~~r~~~~~~~~~e~~a---~pqr~i~--G-~d~d~~~~~~~~~~~~~i~~~~~d~------dg~~~~v~q~~~~~ 276 (409) T protein:vir:94 209 YWQSNAKRTLERADVTAEFYS---FPQKYVT--G-LSDDAEPMETWKATVSSMLQFTKDE------DGDKPTLGQFTQPS 276 (409) T ss_pred HHHHHHHHHHHHHHHHHHHhc---ChhheeE--e-cCCCCcccchhhhhHHHhhcCCCCC------CCCCceEEecCCCC Confidence 444444444433344445554 4544443 1 1112222333443333222111110 12346666554433 Q ss_pred hHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchh-----HHHHHHHHHHHHhhHHHHHHHHHHHhhc Q lcl|NC_020081. 338 KDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGS-----SAEKYRNSKDKGLEPLLKFIEDAVNKYI 412 (552) Q Consensus 338 ~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n-----~e~~~~~~~~~~l~P~~~~ie~~ln~~L 412 (552) - ..|++..+..+..+|+.-++|++.+|.......+ +....++- ..+..+..+...++-+++..-...+..- T Consensus 277 l-~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsS---a~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~ 352 (409) T protein:vir:94 277 M-SPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSS---VEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAP 352 (409) T ss_pred h-hHHHHHHHHHHHHHhhhcCCCHHHhccccCchhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Confidence 2 2488999999999999999999999864321110 00000000 0001111111222222221111111000 Q ss_pred -Ccccccceeeccc---ccChHHHHHHHHHHH-HHhcC--CcCHHHHHHHhCCCCCC Q lcl|NC_020081. 413 -VSQFGGDYVFNFV---GGDAKTEAEIISILE-SKAKI--GLTINDIRKELGYPDTE 462 (552) Q Consensus 413 -~~~~~~~~~~~f~---~~d~~~~~~~~~~~~-~~~~g--~lT~NE~R~~~gl~p~~ 462 (552) .+.....+.+.|. ..+..+.++.+..+. ...+| .+.-+-+++++|+..-+ T Consensus 353 ~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 353 YLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred ccccccccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 0011123455564 334444555544433 33344 45568899999997654 No 158 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=98.76 E-value=6.1e-08 Score=60.12 Aligned_cols=427 Identities=11% Similarity=0.044 Sum_probs=152.0 Q ss_pred chhhhh--cccccc-----ccccccccccccccccccCCccccccc-CC-CCchHHHHHHHhh-cchHHHHHHHHHHHHH Q lcl|NC_020081. 32 EEDAIL--KKGKNT-----KSNKPKAYEEPIIGSMSMNPDFKEAPS-IH-GKQNLLQMLKLWS-RKNIILNAIIITRVNQ 101 (552) Q Consensus 32 ~~~~~~--~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~Lr~~a-~~~~i~~a~i~~~~~~ 101 (552) ++..+. ..+... ....-... ..+....-.|+.... .. -.......++++. ..++ .+.|+.+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~----~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~-~~~ivd~~~-- 73 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTER----TQDLGDNTAYYESERRPDAVGVTVPQQMQKLLAHVGY-PRLYIDAIA-- 73 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHH----HHHHHHHHHHHhccccchhcccccchhHHhhhhhcCc-HHHHHHHHH-- Confidence 111111 011000 00000000 000000001110000 00 0000001111111 0011 111111111 Q ss_pred HHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEE Q lcl|NC_020081. 102 VSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELV 181 (552) Q Consensus 102 ~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~ 181 (552) ......||.+ .+ .....+.+.+++... .+......+..+.+++|.+|..+. T Consensus 74 ---------~~l~~~g~~~--~~--------~~~~~~~l~~i~~~N----------~~d~~~~~~~~~a~~~G~a~~~v~ 124 (484) T protein:vir:77 74 ---------ARQELEGFRL--GG--------ADKADEQLWDWWQAN----------DLDIESTLGHTDSLVHGRSYITIS 124 (484) T ss_pred ---------hhhccCceec--CC--------cchhHHHHHHHHHhc----------CHhHHHHHHHHHHhhcCceEEEEe Confidence 1111223332 11 011223355555431 234567788999999999999999 Q ss_pred ECCCCCE-------EEEEEecCceeEEEECCCcccccccceeEEEEEcCCceE---EEEcccc----------------- Q lcl|NC_020081. 182 YDKLGDL-------HNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVV---AKFKAKE----------------- 234 (552) Q Consensus 182 r~~~G~~-------~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~---~~~~~~e----------------- 234 (552) ++..|.+ ..|.+++|..+.++.++..+.... .++|+....++.. ..|.++. T Consensus 125 ~~~~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~--a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~ 202 (484) T protein:vir:77 125 KPDPNIDPGVDPEVPIIRVEPPTNLYAQIDPRTRQVMR--AIRAIEDEEGNEVIGATLYLPNNTVIWNREDGQWVQVANV 202 (484) T ss_pred cCCCCcccccccccceEEEeccceeEEEecCCCCceEE--EEEEEEeecCCcEEEEEEEecCeEEEEEecCCceEeeccc Confidence 9888754 247888999998887653221111 1111111111100 0111111 Q ss_pred --------eeeecccccCCccCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHH--HH Q lcl|NC_020081. 235 --------MAWEVSNPRTDLTVGKYGYPELEI-ALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALT--SF 303 (552) Q Consensus 235 --------vi~~~~~~~~~~~~g~~G~spl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~--~~ 303 (552) |++++.+. ...+++|.|.+.- +...++....+..-........+.|.-+|. + ...++...+ .- T Consensus 203 ~~~~g~vPvv~f~N~~---~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~--G-~~~~~~~~~~~~~ 276 (484) T protein:vir:77 203 AHNLEMVPVIPIPNRT---RLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLF--G-VKGEELGVDPETG 276 (484) T ss_pred cCCCCCcceEEecccc---ccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHh--C-CCcchhccccccc Confidence 13332111 2334678886642 333333333332222222222234443332 1 111111111 00 Q ss_pred HHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccch Q lcl|NC_020081. 304 RREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEG 383 (552) Q Consensus 304 ~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~ 383 (552) ...|+. ..+++..+.+++.++.++....-+ -|++..+..+..|+.+-++|++.+|....... ++....+. T Consensus 277 ~~~~~~------~~~~~~~~~~~~~~~~q~~~~~~e-~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~---Sg~Al~~~ 346 (484) T protein:vir:77 277 QTLFDA------YLARILAFEDHESKAQQFSAAELR-NFVDALDALDRKAAAYTGLPPYYLSFSSENPA---SAEAIRSS 346 (484) T ss_pred chhhhh------hhhhhcccCCCCceeEeecCCChH-HHHHHHHHHHHHHhcccCCCHHHhccccCcch---HHHHHHHH Confidence 111221 123444455567888776654432 37788888899999999999999975321111 11111111 Q ss_pred hHH-----HHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHH-Hhc--CCcCHHHHHHH Q lcl|NC_020081. 384 SSA-----EKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILES-KAK--IGLTINDIRKE 455 (552) Q Consensus 384 n~e-----~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~-~~~--g~lT~NE~R~~ 455 (552) ... +..+..+...|+-+++.+....+..-.+.....+.+.|.+..+.+.++.+..+.+ +.+ |+++..-++++ T Consensus 347 ~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~ 426 (484) T protein:vir:77 347 ESRLVKTVERKNKIFGGAWEQAMRVAYKVMNGGDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARID 426 (484) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhc Confidence 100 1111222222322222222211111011111235566655544455544443333 333 47888888888 Q ss_pred hCCCCCCCCCeeeccccccchhhhccccccccccCCCCC-ccCcccCCCCCCCCCCCCCCCcccccCCCCcccccccccc Q lcl|NC_020081. 456 LGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDAN-QFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNS 534 (552) Q Consensus 456 ~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 534 (552) +|+-+-+- ..+......+........... ....++.+....++.+.+..+ .+ + T Consensus 427 l~~~~~~~----------~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~--------------~ 480 (484) T protein:vir:77 427 MGYSITER----------EEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDNPETPEPQPN--PA--------------E 480 (484) T ss_pred CCCChhHH----------HHHHHHHHHHHHHHHHHHhhhccccccCCCCCCCCCcccccCC--Cc--------------c Confidence 88844321 001011110000000000000 000000000000010110000 00 0 Q ss_pred cccc Q lcl|NC_020081. 535 TPQG 538 (552) Q Consensus 535 ~~~~ 538 (552) ...| T Consensus 481 ~~~~ 484 (484) T protein:vir:77 481 EAAA 484 (484) T ss_pred ccCC Confidence 0001 No 159 >protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741 Probab=98.75 E-value=2.5e-08 Score=62.23 Aligned_cols=474 Identities=14% Similarity=0.105 Sum_probs=205.6 Q ss_pred ccccccccchhhhhccccccccccccccccccc---cccccCCcccccccCCCCchHHHHHHHhhcchH--HHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPII---GSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNI--ILNAIIITR 98 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~--i~~a~i~~~ 98 (552) |.+..+-..--+ -|-......+.-.+.++++- ..+.+.-+-.-+ ..++ -+.|...-. =++..+.=+ T Consensus 1 ~~a~~~lr~~rr-pkg~~~a~~r~L~aAs~~~~dpg~~~~~~~g~~~~---~~WQ-----~eAW~~~d~v~Elry~vgW~ 71 (631) T protein:vir:10 1 MAATQSLRLVRR-PKGGRPAPSRALTAASQPLPDPSQVFSKSTGISRN---SDWQ-----TDAWEAVDLVGELRYYVGWR 71 (631) T ss_pred CCcccceeeeec-CCCCCccchhhhhhhhccccchhhhhhhhcCCccc---chhh-----HHHHHHHHhhhhHHHHhhhh Confidence 111110000000 00001111122223333331 111111000000 0111 112211100 012222223 Q ss_pred HHHHHHHHHHHHhhccccceeeeecccc-----ccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhc Q lcl|NC_020081. 99 VNQVSMFCTPARNSDKGVGYEIRLKDPL-----QEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTY 173 (552) Q Consensus 99 ~~~~~~~~~~~~~~~~~~~~~i~~k~~~-----~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~ 173 (552) .++++++ ++ .....|++ ..+.+ +......+.++.+... ...+.-.++++.+..++-+- T Consensus 72 ~~s~sr~-rL----------~as~idpDtg~ptg~iee-~~~~~~~v~~~~~~i~-----gG~lgQ~~llkrl~~~ltV~ 134 (631) T protein:vir:10 72 ASSCSRC-RL----------VASELDENTGLPTGGISE-DNTEGERVREIVSKIA-----DGTLGQAALTKRVVECLTVP 134 (631) T ss_pred hhhhcee-ee----------EeeeeccCCCCCcccccc-CCchhHHHHHHHHhcC-----CCcchHHHHHHHHHhheecc Confidence 3333322 11 11111211 11111 1111122334444322 23466788999999999999 Q ss_pred CCeeEEEE-ECCCC---------C-EEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccc Q lcl|NC_020081. 174 DKINFELV-YDKLG---------D-LHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNP 242 (552) Q Consensus 174 Gna~~~i~-r~~~G---------~-~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~ 242 (552) |.+|+.+. |...| + ..+++++....|......+|.. +. ...+.+-......|+++-.. T Consensus 135 GE~wiv~l~~p~~~~~~~pd~~~r~~~~W~~vt~~ei~~~~~g~g~~--------v~-lp~g~~h~~~~~~D~l~RiW-- 203 (631) T protein:vir:10 135 GELWIVILTRPVKGAPAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTN--------IV-LPTGEEHEFVKGTDIIFRVW-- 203 (631) T ss_pred cceEEEEEeccCcCCCCCcccccccccceeeccHHHHhcccCcccce--------ee-cCCCCccceecCCceEEEee-- Confidence 99999875 22221 1 2345556555554332222221 11 12222223333445554333 Q ss_pred cCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCC-------------------CHHHHHHH Q lcl|NC_020081. 243 RTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQ-------------------SNQALTSF 303 (552) Q Consensus 243 ~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~-------------------s~~~~~~~ 303 (552) ++++....+--||+.+++..+.-...+.+...+..+.-.+-.|||.+|....+ .+-+...| T Consensus 204 ~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aaakSRl~gnGvlflP~els~P~~~~~~~~~~g~~v~~~~g~pa~~~l 283 (631) T protein:vir:10 204 IPKPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQL 283 (631) T ss_pred CCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCceeEeccccccCCCCCCCCCcCCccCCccccchhHHHH Confidence 34555567888999999999998888888888887777777888888754332 11244444 Q ss_pred HHHHH----HHhccc-cccccceeecc------CCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHH-hccccccc Q lcl|NC_020081. 304 RREWT----SMFSGI-NGAWKIPVITA------EDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSE-INFPNRGG 371 (552) Q Consensus 304 ~~~~~----~~~~G~-~nagk~~il~~------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~t 371 (552) .+.+- ..+..- ..+--+||+.. ++++...+.. .-+.--+.+++..+..||....|||.. ||+..+++ T Consensus 284 ~~~l~q~a~tai~De~S~aA~vPii~~~p~E~i~~i~hlkf~~-ei~e~aiktR~daI~RlA~glDi~pE~LLGlGsd~N 362 (631) T protein:vir:10 284 TDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDN-EITEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTN 362 (631) T ss_pred HHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCeeEEeecC-chhHHHHhhHHHHHHHHHhccCCchhhheeccCCcc Confidence 44433 222221 12344666643 2344444433 334445789999999999999999965 47754554 Q ss_pred ccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcc----cc---cceeecccccC----hHHHHHHHHHHH Q lcl|NC_020081. 372 ATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQ----FG---GDYVFNFVGGD----AKTEAEIISILE 440 (552) Q Consensus 372 ~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~----~~---~~~~~~f~~~d----~~~~~~~~~~~~ 440 (552) -|+... -...-++-.|.|.+..|+++|++.+|.. .| .+|.+.|+-.. +.-..+.. + T Consensus 363 HWsAWq----------I~dedVrlHI~P~l~lic~AlT~q~Lrp~Le~eGvDp~kYvvW~DaS~Lt~dPdr~deA~---q 429 (631) T protein:vir:10 363 HWSAWQ----------ISDEDVQLHIAPVMEIFCQALTDQILRVTLAREGIDPSKYVVWYDPSQLTIDPDKSDEAK---F 429 (631) T ss_pred ceEEEE----------ecccceeeecchHHHHHHHHHHhhHHHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHH---H Confidence 443321 1111245679999999999999887742 22 35778886443 22223323 3 Q ss_pred HHhcCCcCHHHHHHHhCCCCCCCCC------------------eeeccccccchhhhccccccccccCCCCCccCcccCC Q lcl|NC_020081. 441 SKAKIGLTINDIRKELGYPDTEGGD------------------VTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTG 502 (552) Q Consensus 441 ~~~~g~lT~NE~R~~~gl~p~~ggD------------------~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (552) ....|.||-...|+.+|+.--.+-| .-+.| ++.++....-+... ...++... +.++++ T Consensus 430 a~drGAIt~eAlrk~lGf~eDd~yd~~t~e~~~~~a~~av~~dpaLip-~lApl~~~~~~~v~-~P~~~a~~-~~g~ed- 505 (631) T protein:vir:10 430 AYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIP-MLAPLIAGVLKQIE-FPQQQAID-SGGNED- 505 (631) T ss_pred HHHcCCcCHHHHHHHhcCchhcccCcCchHHHHHHHHHHhhcccCcch-hhHHHHHHHhhhcc-CCCCCCCC-CCCCCc- Confidence 3456999999999999996543322 11111 11221110000000 00000000 000000 Q ss_pred CCCCCCCCCCCCCcccccCCCCcccccccccccc---------ccCc--cccccccccccC Q lcl|NC_020081. 503 YDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTP---------QGGK--DDNGNVVNDWEA 552 (552) Q Consensus 503 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~--~~~~~~~~~~~~ 552 (552) .+.+...+++.+.+.++...+......-.....+ .|+. .-.++....|-+ T Consensus 506 ~~~~~~~~~g~~epdt~d~~p~~~~a~~~~~iv~llv~RALelAGkRl~~r~r~~~ar~~~ 566 (631) T protein:vir:10 506 TSDADDLDDGEQEPDTEDDDDGTQKAGLETGIVDLMVDRALELVGKRRRGRDRETLARLSG 566 (631) T ss_pred cccccccccCCCCCCCCCCCCccccccchHHHHHHHHHHHHHhhcchhcCCcccchhHHhc Confidence 0000011111111111100000000000111111 1222 111222233333 No 160 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=98.74 E-value=7.3e-08 Score=59.71 Aligned_cols=396 Identities=9% Similarity=0.022 Sum_probs=151.8 Q ss_pred cccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHH--------------------------------- Q lcl|NC_020081. 59 MSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMF--------------------------------- 105 (552) Q Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~--------------------------------- 105 (552) |-..| .-... .+.++.+ +...++........++ T Consensus 1 ~~~~p----~~~l~-----~~~~~~~-----~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ 66 (479) T protein:vir:99 1 MIDLP----DEDLS-----SEGLAKY-----LETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSR 66 (479) T ss_pred CccCC----cccCC-----hhHHHHH-----HHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhh Confidence 11111 00010 0111110 0111111111121111 Q ss_pred ---HHHHHhhccc----cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeE Q lcl|NC_020081. 106 ---CTPARNSDKG----VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINF 178 (552) Q Consensus 106 ---~~~~~~~~~~----~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~ 178 (552) ++.+....+. .|| ...+ . .....+.+++.. | .+......+..+.+++|.+|. T Consensus 67 ~n~~~~iVd~~~~~l~~~gf--~~~d--~-------~~~~~~~~i~~~-------N---~~d~~~~~~~~~a~~~G~af~ 125 (479) T protein:vir:99 67 KPWMGLMVNSFAQQLIVDGY--RKTG--T-------NENAKGWDTWRL-------N---QMDKQQFWLNRAVLTFGYAFI 125 (479) T ss_pred cCcHHHHHHHHHhhcccccc--cCCC--c-------hhhHHHHHHHHh-------c---ChhHHHHHHHHHHhhcCceEE Confidence 2222111111 111 1110 0 011122333322 1 123455668889999999999 Q ss_pred EEEE-----CCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcc--------------------- Q lcl|NC_020081. 179 ELVY-----DKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKA--------------------- 232 (552) Q Consensus 179 ~i~r-----~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~--------------------- 232 (552) .+.+ +..|.+ .+..++|..+.++.++...... ..+++..........+.. T Consensus 126 ~v~~~~~~~d~~g~~-~i~~~~p~~~~~iydd~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h 201 (479) T protein:vir:99 126 KVTSGISPLDGTTVA-RIKCIDPRDAFAIWEDPYWDEW---PKYLLERQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSH 201 (479) T ss_pred EEecCCCCcCCCCce-EEEEechhheEEEecCCcccce---eeEEEeecCceeEEEEecceEEEEEecCCceeecccccc Confidence 8864 344544 4778899998887654322110 111111111111111110 Q ss_pred --cc--eeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHH Q lcl|NC_020081. 233 --KE--MAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWT 308 (552) Q Consensus 233 --~e--vi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~ 308 (552) .. |++++.++.. +.+|.|-++.+...++....+..-....+.-.+.|..+|. +. ...+..... ...|. T Consensus 202 ~~g~vPvv~f~n~~~~----~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~--G~-~~~~~~~~~-~~~~~ 273 (479) T protein:vir:99 202 DYGHIPFVRYVNVMDL----RGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWAT--GL-MLPEGANAD-QEKMR 273 (479) T ss_pred CCCCcceEEeecCCCc----CcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhc--CC-Ccccccccc-hhccc Confidence 11 2333332221 2468998888887777777666655555555566654443 21 111111000 11121 Q ss_pred HHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccch----- Q lcl|NC_020081. 309 SMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEG----- 383 (552) Q Consensus 309 ~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~----- 383 (552) . ..+++..+.++++++..+.... -..+++..+..+..|+.+=++|++.+|...+++ +....+. T Consensus 274 ~------~~~~i~~~~~~~~~~~q~~~~~-~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~S-----g~Al~~~~~~l~ 341 (479) T protein:vir:99 274 F------AQESMLISQNEKASFGAIPAAP-LDGLLNAYKESLLEFLALAQLPPHIAGQIVNVA-----ADALAAGTRQTM 341 (479) T ss_pred c------ccccceeecCCCceEEEecccc-hHHHHHHHHHHHHHHhccCCCCHHHcccccchH-----HHHHHHHHHHHH Confidence 1 1223444556778877665322 234667778888889988899999998643321 1111111 Q ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHH-HHhcCCcCHHHHHHHh-CCCCC Q lcl|NC_020081. 384 SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILE-SKAKIGLTINDIRKEL-GYPDT 461 (552) Q Consensus 384 n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~-~~~~g~lT~NE~R~~~-gl~p~ 461 (552) ...+..+..+..+|.-+++.+....+.. .+.....+.+.|-...+.+..+.++.+. .+.+|+++...+.+++ |+.+- T Consensus 342 ~ka~~~~~~f~~al~~~~~l~~~~~~~~-~~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~~l~gv~~~ 420 (479) T protein:vir:99 342 QKLFEKQATWKASHNQTMRLVNKIEGRT-EEATDLDFTITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWDMIPNLDQS 420 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCC-ccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHH Confidence 1111112222333333333332211110 0111123455554444444455554443 3446788887777766 66532 Q ss_pred CCCCeeeccccccchhhhcccc-----ccc-cccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccccccccc Q lcl|NC_020081. 462 EGGDVTLAGVHVQRLGQIMQQE-----QVE-YQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNST 535 (552) Q Consensus 462 ~ggD~~~~~~n~~~~~~~~~~~-----~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 535 (552) + + ..+......+ ... ...+..+.... ++....++..+. .+...+ .+.=- T Consensus 421 ~---~-------e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~--~~~~~~-------~~~~~ 475 (479) T protein:vir:99 421 T---V-------NGWKEIYDREGDFGKYMRKLQNGPDPAEQR------GGPNGATNMQQA--NNKTGE-------PASLN 475 (479) T ss_pred H---H-------HHHHHHHHHHHHHHHHHHHHhcccCccccc------CCCCCCCCCCCC--CCCCcc-------hhccC Confidence 1 0 0000000000 000 00000000000 000000000000 000000 00001 Q ss_pred cccC Q lcl|NC_020081. 536 PQGG 539 (552) Q Consensus 536 ~~~~ 539 (552) +.|+ T Consensus 476 ~~~~ 479 (479) T protein:vir:99 476 KSGA 479 (479) T ss_pred CCCC Confidence 1111 No 161 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=98.73 E-value=8e-08 Score=59.49 Aligned_cols=392 Identities=10% Similarity=0.042 Sum_probs=148.0 Q ss_pred ccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHH-HH----HHHHHH-HHHHHHHHHhhcc----ccceeee Q lcl|NC_020081. 52 EEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNA-II----ITRVNQ-VSMFCTPARNSDK----GVGYEIR 121 (552) Q Consensus 52 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a-~i----~~~~~~-~~~~~~~~~~~~~----~~~~~i~ 121 (552) -.+. ...|-...... ......-...|..+-++.-.+.. .. ..+... +.-+++.+....+ ..||.. T Consensus 1 ~~~~--~~~~i~~l~~~--~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~g~~~- 75 (441) T protein:vir:80 1 MNSD--ELALIEGMYDR--IQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLDWLGWTN- 75 (441) T ss_pred CCcc--HHHHHHHHHHH--HHHHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhccccccC- Confidence 0000 00000000000 00000001222222221110000 00 000000 1111222221111 111110 Q ss_pred eccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEE Q lcl|NC_020081. 122 LKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYV 201 (552) Q Consensus 122 ~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v 201 (552) . +. ..+..++.. -.+......+..+.+++|.+|..+.++.+|.+ .+.+++|..|.+ T Consensus 76 -------~---d~---~~l~~i~~~----------n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~ 131 (441) T protein:vir:80 76 -------G---DG---YGLDGVYAA----------NRLATASCDVHLDALIFGLSFVAIIPHGDGTV-SVRPQSPKNCTG 131 (441) T ss_pred -------C---Ch---HHHHHHHHh----------cCHHHHHHHHHHHHhhcCeeEEEEEeCCCCce-EEEEEccceEEE Confidence 0 01 123333321 13556777888999999999999999999987 588999999998 Q ss_pred EECCCcccccccceeEEEEEcCCc-eEEEEcccc--------------------------eeeecccccCCccCCccccc Q lcl|NC_020081. 202 AVDEDGKERKAKDGVRYVQVIDDK-VVAKFKAKE--------------------------MAWEVSNPRTDLTVGKYGYP 254 (552) Q Consensus 202 ~~~~~g~~~~~~~~~~y~~~~~~~-~~~~~~~~e--------------------------vi~~~~~~~~~~~~g~~G~s 254 (552) +.++........ ...|+...+.. ....+..+. |+++..+. ...+++|.| T Consensus 132 i~d~~~~~~~~~-~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~---~~~~~~G~s 207 (441) T protein:vir:80 132 KFSADGSRLDAG-LVVQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRR---RTSRIDGRS 207 (441) T ss_pred EEeCCCCceeEE-EEEEEEecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeeccc---cCCccCCcc Confidence 877543321110 01111111110 011111111 12222111 223467888 Q ss_pred HHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec----cCCce Q lcl|NC_020081. 255 ELEI-ALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT----AEDVK 329 (552) Q Consensus 255 pl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~----~~g~~ 329 (552) .+.- +...++....+..-......-.+.|-.+|. + ..+++...+. |+... +++..+. +.+++ T Consensus 208 ~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~--G-~~~~~~~~~~----~~~~~------~~i~~~~~~~~~~~~~ 274 (441) T protein:vir:80 208 EITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT--G-VSADEFSQPG----WVLSM------ASVWAVDKDDDGDTPN 274 (441) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeee--c-CCccccccch----hhhcc------cccccCCCCCCCCcce Confidence 6532 333444333333333333333445544443 2 2223322111 21111 1111111 12345 Q ss_pred eeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccch---hHHHHHHHHHHHHhhHHHHHHHH Q lcl|NC_020081. 330 FVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEG---SSAEKYRNSKDKGLEPLLKFIED 406 (552) Q Consensus 330 ~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~---n~e~~~~~~~~~~l~P~~~~ie~ 406 (552) +..+..... -.|++..+..+..|+..-++|++.+|.......+|.... ..+. ..-+..+..+...|+-.++.+.. T Consensus 275 ~~~~~~~~~-~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~-~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~ 352 (441) T protein:vir:80 275 VGSFPVNSP-TPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALA-AEESRLVKRAERRQTSFGQGWLSVGFLAAK 352 (441) T ss_pred eEecCccch-HHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 544443222 236788888899999999999999985432211110000 0000 00111112222233333332322 Q ss_pred HHHhhcCcc-cccceeecccccChHHHHHHHHHHHH-HhcCCc--CHHHHHHHhCCCCCCCCCeeeccccccchhhhccc Q lcl|NC_020081. 407 AVNKYIVSQ-FGGDYVFNFVGGDAKTEAEIISILES-KAKIGL--TINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQ 482 (552) Q Consensus 407 ~ln~~L~~~-~~~~~~~~f~~~d~~~~~~~~~~~~~-~~~g~l--T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~ 482 (552) .++...-.. ....+.+.|.+..+.+.++.++++.+ ..+|.+ +..-+++.+|+.+-+- ..+.... T Consensus 353 ~~~~~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~e~----------~~~~~e~-- 420 (441) T protein:vir:80 353 ALDSRVDEADFFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEMLGLDDVQV----------EAVMRHR-- 420 (441) T ss_pred HhcCCCcccccceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHHHH----------HHHHHHH-- Confidence 222211111 11345677877777777776665443 345554 4445677777654220 0000000 Q ss_pred cccccccCCCCCccCcccCCCCCCCCCC Q lcl|NC_020081. 483 EQVEYQRQMDANQFLAQQTGYDGNMDNV 510 (552) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (552) .++. ++...........++.. T Consensus 421 --~e~~-----~~~~~~~~~~~~~~~~~ 441 (441) T protein:vir:80 421 --AESS-----DPLAVLAGAISRQTNEV 441 (441) T ss_pred --HHHH-----HHHHHHhhhhhcccccC Confidence 0000 00000000001111111 No 162 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=98.71 E-value=9e-08 Score=59.20 Aligned_cols=457 Identities=12% Similarity=0.088 Sum_probs=174.9 Q ss_pred ccc---ccCcccccccccchhhhhccccccccccccccccccccccccCCc----ccccccCCCCchHHHHHHHhhcc-h Q lcl|NC_020081. 17 IID---INDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPD----FKEAPSIHGKQNLLQMLKLWSRK-N 88 (552) Q Consensus 17 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~Lr~~a~~-~ 88 (552) ++. ++...+.. +..+..+ ++..+.......... ....++. +-..... ....-...|.++-.+ . T Consensus 1 ~~~~~~~~~~~~~~--~~~~~~~----~~~~n~~~~~~~~e~--~~~~~~~~i~~~i~~~~~-~~~~r~~~l~~Yy~g~~ 71 (511) T protein:vir:93 1 MLKVNEFETDTDLR--GNINYLF----NDEANVVYTYDGTES--DLLQNVNEVSKYIEHHMD-YQRPRLKVLSDYYEGKT 71 (511) T ss_pred Cccccchhhhhhhh--hhhhhhh----hhhhCCcccccchhh--hhhccHHHHHHHHHHHHH-hhHHHHHHHHHHhcccC Confidence 110 01111110 0001111 000000000000000 0000000 0000000 000001111111110 0 Q ss_pred HHH-HH---HHHHHHH--HHHHHHHHHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 89 IIL-NA---IIITRVN--QVSMFCTPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 89 ~i~-~a---~i~~~~~--~~~~~~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) .++ +. -...+.+ .+.-+++.+.....+ +|-.+++... +......+.+|+.. -.+. T Consensus 72 ~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~-------d~~~~~~l~~~~~~----------n~~~ 134 (511) T protein:vir:93 72 KNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDD-------DKDVLEVIEAFNDL----------NDVE 134 (511) T ss_pred ccccccCcCcccccCcceeecchHHHHHHHHhhhhcccCeeeccC-------ChHHHHHHHHHHhh----------cCHh Confidence 010 00 0000000 000111222221111 1112222111 11122334444432 1345 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEc--CCc------eEEEEcc Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVI--DDK------VVAKFKA 232 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~--~~~------~~~~~~~ 232 (552) .....+..+++++|.+|..+.++.+|++. +..++|..+.++.++..... ..-.++|+... .+. ....+++ T Consensus 135 ~~~~~~~~~~~~~G~ay~~vy~de~~~~~-i~~~~p~~~~~vydd~~~~~-~~~~vr~~~~~~~~~~~~~~~~~~~iyt~ 212 (511) T protein:vir:93 135 SHNRSLGLDLSIYGKAYELMIRNQDDETR-LYKSDAMSTFVIYDNTIERN-SIAGVRYLRTKPIDKTDEDEVFTVDLFTS 212 (511) T ss_pred HHHHHHHHHHHhcCeeEEEEEeCCCCceE-EEEEccceeEEEEcCCCCCc-eEEEEEEEEeeeccccccceEEEEEEEeC Confidence 57778889999999999999999988754 78899999998887643211 11122222211 100 0112333 Q ss_pred cceeeecccc-------------cCC---------ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC Q lcl|NC_020081. 233 KEMAWEVSNP-------------RTD---------LTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIK 290 (552) Q Consensus 233 ~evi~~~~~~-------------~~~---------~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 290 (552) +.+.+++..- ..+ -.+...|.|-++.+...++....+..-..+.+...+.|-.++. T Consensus 213 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~-- 290 (511) T protein:vir:93 213 HGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIK-- 290 (511) T ss_pred CcEEEEEecCCCccccccccccccccCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeee-- Confidence 3332221100 000 0012367888888888888777766666666666666655543 Q ss_pred CCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccc Q lcl|NC_020081. 291 TGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRG 370 (552) Q Consensus 291 ~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 370 (552) +....+.+.....+....-............+-..++.++..++.......+....+.+.+.|+..-++|..-.+-.. + T Consensus 291 G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~-~ 369 (511) T protein:vir:93 291 GNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-G 369 (511) T ss_pred cCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-c Confidence 433333333332222110000000000000111234555555555555666778888999999999999875443211 1 Q ss_pred cccccccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcc--cc-cceeecccccChHHHHHHHHHHHHH Q lcl|NC_020081. 371 GATGHSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQ--FG-GDYVFNFVGGDAKTEAEIISILESK 442 (552) Q Consensus 371 t~~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~--~~-~~~~~~f~~~d~~~~~~~~~~~~~~ 442 (552) + .++....+. .-....+..+..+|+-+++.|...++.+--.. .+ ..+.+.|.+..+.+.++.++++... T Consensus 370 n---~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl 446 (511) T protein:vir:93 370 T---QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS 446 (511) T ss_pred c---chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHH Confidence 1 111111111 11122333455566666655555444322111 11 2467778777777777766666554 Q ss_pred hcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCC Q lcl|NC_020081. 443 AKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGK 522 (552) Q Consensus 443 ~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (552) .|+++.--+++++++-+-+ + ..+..+...+... ...... ....++ ...+. ...+.++ T Consensus 447 -~g~iS~et~~~~l~~v~d~--~--------~E~~ri~~E~~~~-~~~~~~--~~~~~~----~~~~~-----~~~~~~~ 503 (511) T protein:vir:93 447 -GGKISQTTLMSLFSFFQDP--E--------LEVKKIEEDEKES-IKKAQK--GIYKDP----RDIND-----DEQDDDT 503 (511) T ss_pred -hccCchHHHHHhCCCCCCH--H--------HHHHHHHHHHHHH-HHHHhh--hcccCC----CCCCC-----CCCCCcc Confidence 5889987788777542211 0 1111111111000 000000 000000 00000 0000000 Q ss_pred CCcccccc Q lcl|NC_020081. 523 DGQSKQQA 530 (552) Q Consensus 523 ~~~~~~~~ 530 (552) +...+.++ T Consensus 504 ~~~~~~~~ 511 (511) T protein:vir:93 504 KDTVDKKE 511 (511) T ss_pred cccccccC Confidence 00111111 No 163 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.70 E-value=9.8e-08 Score=59.01 Aligned_cols=442 Identities=9% Similarity=-0.027 Sum_probs=159.1 Q ss_pred cccccCcccccccccch-----hhhhccccccccccccccccccccccccCCccccc-ccC-CCCchHHHHHHHhh-cch Q lcl|NC_020081. 17 IIDINDDMAVRIKQIEE-----DAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEA-PSI-HGKQNLLQMLKLWS-RKN 88 (552) Q Consensus 17 ~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~Lr~~a-~~~ 88 (552) +-+|++..+..+-.+.. ..++..+-+.-..+..-+ ...-.|+.. ... .-.......++++. -.+ T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~~~~~r~--------~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n 72 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQLVDRTPRN--------LLRASFYDGKYAIRQIGNLIPPEYLRTATVLG 72 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHHHHHhHHH--------HHHHHHHhccccchhccccccHHHHHHhhccC Confidence 44455555444433311 111111111111111000 000011110 000 00000111122110 011 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHH Q lcl|NC_020081. 89 IILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVR 168 (552) Q Consensus 89 ~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~ 168 (552) +. +-|+ ..+......-||.+ .+ .+ .....+.+++... .+......+.. T Consensus 73 ~~-~~iV-----------d~~a~rl~~~Gf~~--~d--~~------~~~~~l~~i~~~N----------~ld~~~~~~~~ 120 (504) T protein:vir:99 73 WS-AKAV-----------DTLARRCNLESFVW--PD--GD------YGSIGGPDVWDEN----------FFATKANNAMV 120 (504) T ss_pred cH-HHHH-----------HHHHhhhccceeeC--CC--CC------hhhHHHHHHHHhc----------ChhhHHHHHHH Confidence 10 1111 11111122234432 11 11 1123355555432 23345667889 Q ss_pred HHHhcCCeeEEEEECCCCCEE-EEEEecCceeEEEECCCcccccccceeEEEEEcCCc-eE--EEEcccce--------- Q lcl|NC_020081. 169 DRLTYDKINFELVYDKLGDLH-NFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDK-VV--AKFKAKEM--------- 235 (552) Q Consensus 169 d~ll~Gna~~~i~r~~~G~~~-~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~-~~--~~~~~~ev--------- 235 (552) +.++||.+|+.|..+.+|++. .+.+++|..+.++.|+..+.... .+.|+....++ .. ..|.++.+ T Consensus 121 ~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~~~~iyD~~~~~~~~--a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~ 198 (504) T protein:vir:99 121 SSLIHGPAFLINTEGGAGEPDSLIHVKSAMQATGEWNSRRNAMDS--LLSITSRDAEGHPTGIALYEDGVTVTADMDDDG 198 (504) T ss_pred HHHhhCceeEEEecCCCCCceeEEEEeccceeEEEEeCCCCceeE--EEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCc Confidence 999999999999998888764 56789999998887754322111 11111111111 11 12222222 Q ss_pred ---------------eeecccccCCccCCcccccHH----HHHHHHHHHHHHHHHHHHHHHhccCCCceEEE-eCCCCCC Q lcl|NC_020081. 236 ---------------AWEVSNPRTDLTVGKYGYPEL----EIALNHLQYHDNTEVFNARFFAQGGTTRGLLH-IKTGQEQ 295 (552) Q Consensus 236 ---------------i~~~~~~~~~~~~g~~G~spl----~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~-~~~~~~~ 295 (552) +++..+. ...+++|.|.+ ..+.+.+.....-......||. .|.-+|. +...... T Consensus 199 ~~~~~~~~~~~gvPvV~~~n~~---~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a---~p~r~i~G~~~~~~~ 272 (504) T protein:vir:99 199 DWHADVRTHKLGVPVEVLPYKP---REDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYS---FPQLILLGADAKNFR 272 (504) T ss_pred eeeeccccCCCCcceEEecccc---cCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhc---chhhhhccCCccccc Confidence 2222111 12346787754 3344444333333333444443 3332221 1100000 Q ss_pred --CHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhccccccccc Q lcl|NC_020081. 296 --SNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGAT 373 (552) Q Consensus 296 --s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~ 373 (552) +......++......+ .........+....+.++.++....-+ .|++..+..+..|+..-++|++.+|+...+.. T Consensus 273 ~~d~~~~~~~~~~~~~i~-~~~~~~~~~~~~~~~~~~~q~~~~~l~-~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~- 349 (504) T protein:vir:99 273 NKDGSMKPAWQIALARVF-ALPDDEDEPDAARARADVKQFPASSPQ-PHIEMLEQIAMMFSGETSIPVESLGFSNRANP- 349 (504) T ss_pred cccccccchhhhhhhhhh-cCCCccccccccCccceeeecCCCChH-HHHHHHHHHHHHHHhhhCCCHHHhcccccccc- Confidence 0011122333222221 111111111222234666655443322 47899999999999999999999997643210 Q ss_pred ccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhh------cC------cccccceeecccccChHHHHHHHHHHHH Q lcl|NC_020081. 374 GHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKY------IV------SQFGGDYVFNFVGGDAKTEAEIISILES 441 (552) Q Consensus 374 ~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~------L~------~~~~~~~~~~f~~~d~~~~~~~~~~~~~ 441 (552) +++ ..+..+...+.. ...-..+.+...|.+. +. +.....+.+.|....+.+.++.+.++.+ T Consensus 350 -sSa-----~Ai~~~~~~L~~-ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~K 422 (504) T protein:vir:99 350 -TSA-----DAYIASREDLIA-EAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAK 422 (504) T ss_pred -cHH-----HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceeEecCCCccCHHHHHHHHHH Confidence 010 111111111111 1111222222222221 11 1111234556766666666666655443 Q ss_pred H-hcCC--cCH-HHHHHHhCCCCCCCC---CeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCC Q lcl|NC_020081. 442 K-AKIG--LTI-NDIRKELGYPDTEGG---DVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKD 514 (552) Q Consensus 442 ~-~~g~--lT~-NE~R~~~gl~p~~gg---D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (552) . .+|. +.. .-+.+++|+.|-+-- +..-..-....++.+.. ..+.+... +...+.+.++. T Consensus 423 l~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~---------~~~~~~~~-----~~~~~~~~~e~ 488 (504) T protein:vir:99 423 MLGAGPEWLKETEVGLELLGLTPQQAKRALAERRRASSVSIIEALNR---------RQQEAATA-----GEDQDQGAGEP 488 (504) T ss_pred HHhhccccccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhc---------ccCCCCCC-----CCCCCcCCCCC Confidence 3 3343 222 334566777543100 00000000000000000 00000000 00001111110 Q ss_pred CcccccCCCCcccccccccccccc Q lcl|NC_020081. 515 SFNQNVGKDGQSKQQANTNSTPQG 538 (552) Q Consensus 515 ~~~~~~~~~~~~~~~~~~~~~~~~ 538 (552) ...+..+.++.. +..| T Consensus 489 a~~~~~~~~~~p--------~~~~ 504 (504) T protein:vir:99 489 PANEPPAALGRP--------TLVG 504 (504) T ss_pred CCCCCCccCCCc--------ccCC Confidence 000110111110 1111 No 164 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=98.68 E-value=1.2e-07 Score=58.55 Aligned_cols=405 Identities=13% Similarity=0.083 Sum_probs=146.5 Q ss_pred CCCCchHHH--------------HHHHhhcch-HHHHHHHH----HHH-HHHHHHHHHHHhhcc----ccceeeeecccc Q lcl|NC_020081. 71 IHGKQNLLQ--------------MLKLWSRKN-IILNAIII----TRV-NQVSMFCTPARNSDK----GVGYEIRLKDPL 126 (552) Q Consensus 71 ~~~~~~~~~--------------~Lr~~a~~~-~i~~a~i~----~~~-~~~~~~~~~~~~~~~----~~~~~i~~k~~~ 126 (552) +.......+ .|.++-++- .+...-+. .+. ..+.-+++.+....+ ..||.+ .+ T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~--~~-- 76 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI--SE-- 76 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhhccCceec--CC-- Confidence 111111111 111221111 00000000 000 001111222221111 122211 11 Q ss_pred ccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEE------CCCCCEEEEEEecCceeE Q lcl|NC_020081. 127 QEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVY------DKLGDLHNFKAVDASTVY 200 (552) Q Consensus 127 ~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r------~~~G~~~~L~~l~p~~v~ 200 (552) +......+.+++.. | .+......+..+.+++|.+|..+.+ +.+|.+ .+.+++|..|. T Consensus 77 ------d~~~~~~l~~i~~~-------N---~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~-~i~~~~p~~~~ 139 (480) T protein:vir:78 77 ------DSEGLEELWNWWQA-------N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMY 139 (480) T ss_pred ------CchhHHHHHHHHHh-------c---CHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCee-EEEEEcccceE Confidence 11122334444432 1 2345667788999999999998875 345554 47889999999 Q ss_pred EEECCCcccccccceeEEEEEcCCce----EEEEcccce-----------------------------eeecccccCCcc Q lcl|NC_020081. 201 VAVDEDGKERKAKDGVRYVQVIDDKV----VAKFKAKEM-----------------------------AWEVSNPRTDLT 247 (552) Q Consensus 201 v~~~~~g~~~~~~~~~~y~~~~~~~~----~~~~~~~ev-----------------------------i~~~~~~~~~~~ 247 (552) ++.++....... ..++|+...++.. ...+.++.+ +++..+. .. T Consensus 140 ~i~D~~~~~~~~-~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~---~~ 215 (480) T protein:vir:78 140 AELDPRNTRRVT-RAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDP---RL 215 (480) T ss_pred EEEcCCCccceE-EEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeeccc---cc Confidence 888754211100 1122211111110 111122222 2222111 22 Q ss_pred CCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccC Q lcl|NC_020081. 248 VGKYGYPELEI-ALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAE 326 (552) Q Consensus 248 ~g~~G~spl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~ 326 (552) .+.+|.|-++- +...++....+..-......-.+.|.-+|. + ...++...+.-...|... .+++..+.++ T Consensus 216 ~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G-~~~~~~~~~~~~~~~~~~------~~~~~~~~~~ 286 (480) T protein:vir:78 216 GNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--G-VTTDELTNDGENTTLDIY------YGRILTLASE 286 (480) T ss_pred CCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhh--C-CCccccccccccchhhhh------hhhhccCCCC Confidence 34678886653 344444444333333333333344543332 2 211111111111112211 1233344455 Q ss_pred CceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccch--h---HHHHHHHHHHHHhhHHH Q lcl|NC_020081. 327 DVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEG--S---SAEKYRNSKDKGLEPLL 401 (552) Q Consensus 327 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~--n---~e~~~~~~~~~~l~P~~ 401 (552) ++++..+.....+ -|++..+..+..|+.+=++|++.+|....... ++....+. . .-+..+..+...|+-++ T Consensus 287 ~~~~~~~~~~~~~-~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~---Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~ 362 (480) T protein:vir:78 287 AAKISEFKAAELR-NFAEEMEVFRKEAASITGLPPQYLSSSSENPA---SAEAIIATDSRIVKMAERKGRIFGGAWERAM 362 (480) T ss_pred CceEEecCccCHH-HHHHHHHHHHHHHhcccCCCHHHhccccCchh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6777766554332 37788889999999999999999974221100 11011100 0 00111111122222222 Q ss_pred HHHHHHHHhhcCcccccceeecccccChHHHHHHHHHH-HHHhc--CCcCHHHHHHHhCCCCCCCCCeeeccccccchhh Q lcl|NC_020081. 402 KFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISIL-ESKAK--IGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQ 478 (552) Q Consensus 402 ~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~-~~~~~--g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~ 478 (552) +.+....... .......+.+.|.+..+.+..+.+..+ +.+.+ +.++..-+++++|+.+-+- ..+.. T Consensus 363 rl~~~~~~~~-~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~----------~e~~~ 431 (480) T protein:vir:78 363 RIAMQIMGRE-VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR----------EQMRD 431 (480) T ss_pred HHHHHHcCCC-ccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHH----------HHHHH Confidence 2222111100 011112345566544444444444332 22333 3577767788888854310 00000 Q ss_pred hcccccc---ccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccccccccccccCc Q lcl|NC_020081. 479 IMQQEQV---EYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQGGK 540 (552) Q Consensus 479 ~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 540 (552) ....+.. ..-......++... .++...++.++..+ ........++. T Consensus 432 ~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~-----------~~~~~~~~~~~ 480 (480) T protein:vir:78 432 WDKQETEDMIDTLYSTTKAQADAT-----PKPTVTETKTETQT-----------SPSGFNRTKTR 480 (480) T ss_pred HHHHHHHHHHHHhhccccCCCccc-----cCCCCCCCCCccCC-----------CcccCCCcCCC Confidence 0000000 00000000000000 00000000000000 00111111111 No 165 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=98.67 E-value=1.3e-07 Score=58.40 Aligned_cols=457 Identities=13% Similarity=0.092 Sum_probs=173.8 Q ss_pred ccc---ccCcccccccccchhhhhccccccccccccccccccccccccCCc----ccccccCCCCchHHHHHHHhhcc-h Q lcl|NC_020081. 17 IID---INDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPD----FKEAPSIHGKQNLLQMLKLWSRK-N 88 (552) Q Consensus 17 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~Lr~~a~~-~ 88 (552) ++. ++...+.. +..+..+ ++..+...........-. .++. +-...... ...-.+.|+++-.+ . T Consensus 1 ~~~~~~~~~~~~~~--~~~~~~~----~~~~n~~~~~~~~e~~~~--~~~~~i~~~i~~~~~~-~~~r~~~l~~Yy~g~~ 71 (511) T protein:vir:96 1 MLKVNEFETDTDLR--GNINYLF----NDEANVVYTYDGTESDLL--QNVNEVSKYIEHHMDY-QRPRLKVLSDYYEGKT 71 (511) T ss_pred Cccccchhhhhhhh--hhhhhhh----hhhhCCccccchhhhhhh--ccHHHHHHHHHHHHHh-hHHHHHHHHHHhcccC Confidence 110 01111110 0011111 000100000000000000 0000 00000000 00001111111110 0 Q ss_pred HHHHH----HHHHHHH-H-HHHHHHHHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 89 IILNA----IIITRVN-Q-VSMFCTPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 89 ~i~~a----~i~~~~~-~-~~~~~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) .++.. -...+.+ . +.-+++.+....++ +|-.+.+... +......+.+++.. -.+. T Consensus 72 ~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~-------~~~~~~~l~~~~~~----------n~~~ 134 (511) T protein:vir:96 72 KNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDD-------DKDVLEAIEAFNDL----------NDVE 134 (511) T ss_pred ccccccCcCcccccCcceeecchHHHHHHHHHhhhccCCceeecC-------chHHHHHHHHHHhh----------cCHH Confidence 11000 0000000 0 00111222221111 1111111111 11122334444432 1344 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEc--CC---c---eEEEEcc Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVI--DD---K---VVAKFKA 232 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~--~~---~---~~~~~~~ 232 (552) .....+..+++++|.+|..+-++.+|.+ .+.+++|..+.++.++..... ..-.++|+... .+ . ....+++ T Consensus 135 ~~~~~~~~~~~i~G~a~~~vy~ded~~~-~i~~~~p~~~~~vydd~~~~~-~~~~vr~~~~~~~d~~~~~~~~~~~iyt~ 212 (511) T protein:vir:96 135 SHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERN-SIAGVRYLRTKPIDKTDEDEVFTVDLFTS 212 (511) T ss_pred HHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCc-eEEEEEEEEeeeccccccceEEEEEEEeC Confidence 5677788999999999999999988875 578899999998877543211 11122332211 00 0 0112333 Q ss_pred cceeeecccc----------------------cCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC Q lcl|NC_020081. 233 KEMAWEVSNP----------------------RTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIK 290 (552) Q Consensus 233 ~evi~~~~~~----------------------~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 290 (552) +.+.++...- -..-.+...|.|-++.+...++....+..-..+.+...+.|-.++. T Consensus 213 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~-- 290 (511) T protein:vir:96 213 HGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIK-- 290 (511) T ss_pred CcEEEEEecCCCcccccccccccccccCCceeeEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeee-- Confidence 3332221100 0000011368888888888888777766666666666666655544 Q ss_pred CCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccc Q lcl|NC_020081. 291 TGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRG 370 (552) Q Consensus 291 ~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 370 (552) +....+..+....++.-.-.......+.....-...+.++.-++.......+....+.+.+.|...-++|..-.+-.. + T Consensus 291 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-~ 369 (511) T protein:vir:96 291 GNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-G 369 (511) T ss_pred cCccCCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-c Confidence 322233333222221100000000000000111233555555555555667788889999999999999985443211 1 Q ss_pred cccccccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcc--cc-cceeecccccChHHHHHHHHHHHHH Q lcl|NC_020081. 371 GATGHSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQ--FG-GDYVFNFVGGDAKTEAEIISILESK 442 (552) Q Consensus 371 t~~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~--~~-~~~~~~f~~~d~~~~~~~~~~~~~~ 442 (552) + .++....+. ......+..+..+|+-+++.|...+..+--.. .+ ..+.+.|.+.-+.+.++.++++... T Consensus 370 n---~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl 446 (511) T protein:vir:96 370 T---QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS 446 (511) T ss_pred c---chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHH Confidence 1 111111111 11122234445555555555555444332111 11 2467778777677666666665544 Q ss_pred hcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCC Q lcl|NC_020081. 443 AKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGK 522 (552) Q Consensus 443 ~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (552) .|++|.-.+.+++++-+-+ + ..+..+...+......... ....++. +.+.+.+ +..+ T Consensus 447 -~G~iS~et~l~~l~~v~D~--~--------~E~~ri~~E~~~~~~~~~~---~~~~~~~-~~~~~~~--------~~~~ 503 (511) T protein:vir:96 447 -GGKISQTTLMSLFSFFQDP--E--------LEVKKIEEDEKESIKKAQK---GIYKDPR-DINDDEQ--------DDDT 503 (511) T ss_pred -hccCChHHHHHhCCCCCCH--H--------HHHHHHHHHHHHHHHHHhh---ccccCCC-CCCCCCC--------CCcc Confidence 5889998888877642211 0 1111111111000000000 0000000 0000000 0000 Q ss_pred CCcccccc Q lcl|NC_020081. 523 DGQSKQQA 530 (552) Q Consensus 523 ~~~~~~~~ 530 (552) ++..+.++ T Consensus 504 ~~~~~~~~ 511 (511) T protein:vir:96 504 KDTVDKKE 511 (511) T ss_pred cccccccC Confidence 00000000 No 166 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=98.65 E-value=1.4e-07 Score=58.16 Aligned_cols=459 Identities=13% Similarity=0.082 Sum_probs=174.6 Q ss_pred ccc---ccCcccccccccchhhhhccccccccccccccccccccccccCC--cccccccCCCCchHHHHHHHhhcc-hHH Q lcl|NC_020081. 17 IID---INDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNP--DFKEAPSIHGKQNLLQMLKLWSRK-NII 90 (552) Q Consensus 17 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~Lr~~a~~-~~i 90 (552) ++. ++...+.. +..+..+. +..+..-............+.- .+-........ .-.+.|.++-.+ ..+ T Consensus 1 ~~~~~~~~~~~~~~--~~~~~~~~----~~~n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~-~r~~~l~~Yy~g~~~i 73 (511) T protein:vir:10 1 MLKVNEFETDTDLR--GNINYLFN----DEANVVYTYDGTESDLLQNVNEVSKCIEHHMDYQR-PRLKVLSDYYEGKTKN 73 (511) T ss_pred Cccccchhhhhhhh--hhhhhhhh----hhhcCCccCchhhhhcccCHHHHHHHHHHHHHhhH-HHHHHHHHHhcccCcc Confidence 110 01111110 11111110 0110000000000000000000 00000000000 001111111111 000 Q ss_pred HH----HHHHHHHHH--HHHHHHHHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHH Q lcl|NC_020081. 91 LN----AIIITRVNQ--VSMFCTPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSF 162 (552) Q Consensus 91 ~~----a~i~~~~~~--~~~~~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f 162 (552) +. .....+.+. +.-+++.+....++ +|-.+.+... +......+.+|+... .+... T Consensus 74 ~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~-------d~~~~~~l~~~~~~n----------~~~~~ 136 (511) T protein:vir:10 74 LVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDD-------DKDVLEAIEAFNDLN----------DVESH 136 (511) T ss_pred ccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecC-------chHHHHHHHHHHhhc----------CHHHH Confidence 00 000000000 00111222221111 1111111111 111223344444321 24456 Q ss_pred HHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEc--CC---c---eEEEEcccc Q lcl|NC_020081. 163 VKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVI--DD---K---VVAKFKAKE 234 (552) Q Consensus 163 ~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~--~~---~---~~~~~~~~e 234 (552) ...+..+++++|.+|..+.++.+|++ .+..++|..+.++.++..... ..-.++|+... .+ . ....++++. T Consensus 137 ~~~~~~~~~i~G~ay~~vy~dedg~~-~i~~~~p~~~~~vydd~~~~~-~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~ 214 (511) T protein:vir:10 137 NRSLGLDLSIYGKAYEIMIRNQDDET-RLYKSDAMSTFVIYDNTIERN-SIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG 214 (511) T ss_pred HHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCc-eEEEEEEEEeeecccCccceEEEEEEEeCCc Confidence 66788899999999999999998875 578899999998887653211 11122232211 00 0 011233333 Q ss_pred eeeeccc-----------------c-----cCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC Q lcl|NC_020081. 235 MAWEVSN-----------------P-----RTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTG 292 (552) Q Consensus 235 vi~~~~~-----------------~-----~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ 292 (552) +.++... + -..-.+...|.|-++.+...++....+..-..+.+...+.|-.++. +. T Consensus 215 i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~--g~ 292 (511) T protein:vir:10 215 VYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIK--GN 292 (511) T ss_pred EEEEEecCCCcccccccccccccccCcceeEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeee--cc Confidence 3222110 0 0000011357888888888887777666666666666666655543 33 Q ss_pred CCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc Q lcl|NC_020081. 293 QEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGA 372 (552) Q Consensus 293 ~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~ 372 (552) ...+.++....++.-.-............+-...+.+++.++.......+....+.+.+.|+..-++|..-.+-.. ++ T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n- 370 (511) T protein:vir:10 293 LNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GT- 370 (511) T ss_pred ccCCchhhccchhccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-cc- Confidence 3333333333222111001110111111112234566666666666677778889999999999999875432111 11 Q ss_pred cccccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcc--cc-cceeecccccChHHHHHHHHHHHHHhc Q lcl|NC_020081. 373 TGHSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQ--FG-GDYVFNFVGGDAKTEAEIISILESKAK 444 (552) Q Consensus 373 ~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~--~~-~~~~~~f~~~d~~~~~~~~~~~~~~~~ 444 (552) .++....+. ......+..+..+|+-+++.|...+...--.. .+ ..+.+.|.+.-+.+.++.++++... . T Consensus 371 --~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl-~ 447 (511) T protein:vir:10 371 --QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS-G 447 (511) T ss_pred --chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHH-h Confidence 111111111 11223334445555555555555444332111 11 2467778777777777777666554 4 Q ss_pred CCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCC Q lcl|NC_020081. 445 IGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDG 524 (552) Q Consensus 445 g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (552) |+++.--+.+++++-+ .-+ ..+..+...+... .+.... ....++. +.+.+.+ +...++ T Consensus 448 G~iS~et~~~~l~~v~--d~~--------~E~~ri~~E~~~~-~~~~~~--~~~~~~~-~~~~~~~--------~~~~~~ 505 (511) T protein:vir:10 448 GKISQTTLMSLFSFFQ--DPE--------LEVKKIEEDEKES-IKKAQK--GIYKDPR-DINDDEQ--------DDDTKD 505 (511) T ss_pred ccCcHHHHHHhCCCCC--CHH--------HHHHHHHHHHHHH-HHHHhh--hcccCCC-CCCCCCC--------CCcccC Confidence 8899877777775421 100 1111111111000 000000 0000000 0000000 000000 Q ss_pred cccccc Q lcl|NC_020081. 525 QSKQQA 530 (552) Q Consensus 525 ~~~~~~ 530 (552) ..+.++ T Consensus 506 ~~~~~~ 511 (511) T protein:vir:10 506 TVDKKE 511 (511) T ss_pred cccccC Confidence 110111 No 167 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=98.65 E-value=1.5e-07 Score=58.03 Aligned_cols=433 Identities=13% Similarity=0.092 Sum_probs=172.4 Q ss_pred hccccccccccccccccccccccccCCcccccccCCCC---chHHHHHHHhhcchHHHHHHHHHH-HHHHHHHHHHHHh- Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGK---QNLLQMLKLWSRKNIILNAIIITR-VNQVSMFCTPARN- 111 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~Lr~~a~~~~i~~a~i~~~-~~~~~~~~~~~~~- 111 (552) |.+ -..+. ..+....|..|.-.+-.... ..+-+.+ ......+..++... .....++.++.++ T Consensus 1 ~~~---~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~---~~~~~~i~~~i~~~~~~~~~r~~~l~~YY 67 (512) T protein:vir:97 1 MLK---ANEFE-------TDTDLRENRNYLFNDEANVVYTYDGTESDL---LQNINEVSKYIEHHMDYQRPRLKVLSDYY 67 (512) T ss_pred Ccc---ceecc-------CceeeeeCceeeeccccccccccCchhhhh---hhhHHHHHHHHHHHHHhhHHHHHHHHHHh Confidence 111 11110 00001111111100000000 0000000 00001111111110 0001111111111 Q ss_pred ---------------------------------hccc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCcc Q lcl|NC_020081. 112 ---------------------------------SDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTR 156 (552) Q Consensus 112 ---------------------------------~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~ 156 (552) ..++ +|-.+.+... +......+.+|+.. T Consensus 68 ~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~-------d~~~~~~l~~~~~~---------- 130 (512) T protein:vir:97 68 EGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDD-------DKDVLEAIEAFNDL---------- 130 (512) T ss_pred cccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeccC-------ChHHHHHHHHHHhh---------- Confidence 1111 1111111110 11112233444322 Q ss_pred CCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEc--CCc------eEE Q lcl|NC_020081. 157 DNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVI--DDK------VVA 228 (552) Q Consensus 157 ~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~--~~~------~~~ 228 (552) -.+......+..+.+++|.+|..+.++.+|++ .+..++|..+.++.++..... ....++|+... .+. ... T Consensus 131 n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~-~i~~~~p~~~~~iyd~~~~~~-~~~~vr~~~~~~~~~~~~~~~~~~~ 208 (512) T protein:vir:97 131 NDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERN-SIAGVRYLRTKPIDKTDEDEVFTVD 208 (512) T ss_pred cCHHHHHHHHHHHHHhcCeEEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCc-eEEEEEEEEeeeccccccceEEEEE Confidence 13445667788999999999999999998875 478899999999887653211 11122232211 100 011 Q ss_pred EEcccceeeecc-----------------cc-----cCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Q lcl|NC_020081. 229 KFKAKEMAWEVS-----------------NP-----RTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGL 286 (552) Q Consensus 229 ~~~~~evi~~~~-----------------~~-----~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi 286 (552) .++.+.+.++.. |+ -..-.+...|.|-++.+...++....+..-..+.+...+.|-.+ T Consensus 209 vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv 288 (512) T protein:vir:97 209 LFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLL 288 (512) T ss_pred EEeCCcEEEEEecCCCcccccccccccccccCcccceEeecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceee Confidence 233333332211 00 00000123688888888888887777766666666666666655 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHHHhccccccccce-eeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhc Q lcl|NC_020081. 287 LHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIP-VITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEIN 365 (552) Q Consensus 287 l~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~-il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg 365 (552) +. +....+.+.....+....-........+..+ +-.++|.+++.+........+....+.+.+.|+..-++|..-.+ T Consensus 289 ~~--G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~ 366 (512) T protein:vir:97 289 IK--GNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDD 366 (512) T ss_pred ee--cCccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCcc Confidence 54 3222333333333222111111111111111 11234566666665556666778888999999999999886543 Q ss_pred ccccccccccccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCc--ccc-cceeecccccChHHHHHHHH Q lcl|NC_020081. 366 FPNRGGATGHSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVS--QFG-GDYVFNFVGGDAKTEAEIIS 437 (552) Q Consensus 366 ~~~~~t~~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~--~~~-~~~~~~f~~~d~~~~~~~~~ 437 (552) ... ++ .++....+. ......+..+..+|+-+++.|...++..--. ..+ ..+.+.|.+.-+.+.++.++ T Consensus 367 ~~~-gn---~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~ 442 (512) T protein:vir:97 367 NFS-GT---QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELK 442 (512) T ss_pred ccc-cc---chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHH Confidence 221 11 111111111 1112223334444544444444444322111 111 24677787777777777666 Q ss_pred HHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcc Q lcl|NC_020081. 438 ILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFN 517 (552) Q Consensus 438 ~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 517 (552) ++... .|+++.--+.+++++-+-+ + ..+..+...+.... +.... ....+ ........ T Consensus 443 ~~~kl-~giiS~et~~~~l~~v~d~--~--------~E~eri~~E~~~~~-~~~~~--~~~~~----~~~~~~~~----- 499 (512) T protein:vir:97 443 AYIDS-GGKISQTTLMSLFSFFQDP--E--------LEVKKIEEDEKESI-KKAQK--GIYKD----PRDINDDE----- 499 (512) T ss_pred HHHHH-hccCchHHHHHhCCCCCCH--H--------HHHHHHHHHHHHHH-HHHhh--cccCC----CCCCCCCC----- Confidence 66554 4889987777777542210 0 11111111110000 00000 00000 00000000 Q ss_pred cccCCCCcccccc Q lcl|NC_020081. 518 QNVGKDGQSKQQA 530 (552) Q Consensus 518 ~~~~~~~~~~~~~ 530 (552) .+..++...+.++ T Consensus 500 ~~~~~~~~~~~~~ 512 (512) T protein:vir:97 500 QDDDTKDTVDKKE 512 (512) T ss_pred CCCCccccccccC Confidence 0000000000000 No 168 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=98.64 E-value=1.6e-07 Score=57.88 Aligned_cols=334 Identities=12% Similarity=0.085 Sum_probs=144.3 Q ss_pred CCchHHHHH--------------HHhhcch-HHHH---HHH-HH--HHHHHHHHHHHHHhhccc----cceeeeeccccc Q lcl|NC_020081. 73 GKQNLLQML--------------KLWSRKN-IILN---AII-IT--RVNQVSMFCTPARNSDKG----VGYEIRLKDPLQ 127 (552) Q Consensus 73 ~~~~~~~~L--------------r~~a~~~-~i~~---a~i-~~--~~~~~~~~~~~~~~~~~~----~~~~i~~k~~~~ 127 (552) ........| ..+-++. .+-. ++- .. ....+.-.++.+..+.+. -||.. T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~------- 73 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKRRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFEN------- 73 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhcccccccC------- Confidence 111121111 1121110 0000 000 00 011111122222222211 22210 Q ss_pred cCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCc Q lcl|NC_020081. 128 EPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDG 207 (552) Q Consensus 128 ~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g 207 (552) .+ ..+.+++.. | .+......+..+.+++|.+|+.|..+..|.| .+.+++|..+.++.|+.. T Consensus 74 --~d------~~l~~i~~~-------N---~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~-~i~~~sP~~~~~i~D~~~ 134 (409) T protein:vir:16 74 --DD------FTVNEIFEE-------N---NPDIFFDSTVLSALIASCSFTYISKGENDAV-RLQVIEATNATGIIDPIT 134 (409) T ss_pred --cc------hHHHHHHHh-------c---ChhHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEeeccc Confidence 01 124444432 1 2344666788899999999999999888875 688899999988876643 Q ss_pred ccccccceeEEEEE-cCCceE--EEEcccc----------------------eeeecccccCCccCCcccccHH----HH Q lcl|NC_020081. 208 KERKAKDGVRYVQV-IDDKVV--AKFKAKE----------------------MAWEVSNPRTDLTVGKYGYPEL----EI 258 (552) Q Consensus 208 ~~~~~~~~~~y~~~-~~~~~~--~~~~~~e----------------------vi~~~~~~~~~~~~g~~G~spl----~~ 258 (552) +.... .+.+... ..+... ..+.+++ ++++..++ ...+.+|.|.| .. T Consensus 135 ~~~~~--a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~---~~~~~~G~seI~~~v~~ 209 (409) T protein:vir:16 135 GLLTE--GYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGNPLLVPIIHRP---DAVRPFGRSRITRSGMY 209 (409) T ss_pred cccee--eeEEEEecCCCceEEEEEEecCcEEEEEecCccccceecCCCCcceEEecccc---cccccCCccccchhHHH Confidence 32211 1111111 111110 1112222 23332222 12356888754 44 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceee----ccCCceeeecc Q lcl|NC_020081. 259 ALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVI----TAEDVKFVNMT 334 (552) Q Consensus 259 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il----~~~g~~~~~l~ 334 (552) +.+.+.....-......||. .|.-++. | ...+....+.++..... +..+ .+++.++.++. T Consensus 210 l~da~~r~~~~~~~~~e~~a---~pqr~i~--G-~d~d~~~~~~~~~~~~~----------i~~~~~d~~g~~~~v~q~~ 273 (409) T protein:vir:16 210 WQSNAKRTLERADVTAEFYS---FPQKYVT--G-LSDDAEPMETWKATVSS----------MLQFTKDEDGDKPTLGQFT 273 (409) T ss_pred HHHHHHHHHHHHHHHHHHhc---ChhheeE--e-cCCCCCccchhhhhhhH----------hhccCCCCCCCCceEEecC Confidence 45555544444445555654 4544443 1 11111222233322221 2222 12346666665 Q ss_pred CchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHH----------HHHHhhHHHHHH Q lcl|NC_020081. 335 QSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNS----------KDKGLEPLLKFI 404 (552) Q Consensus 335 ~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~----------~~~~l~P~~~~i 404 (552) ...-+ .|++..+.....+|+.-++|++.+|.......+ ...+..+...+ +...++-+++.+ T Consensus 274 ~~~l~-~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsS--------a~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla 344 (409) T protein:vir:16 274 QPSMS-PFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSS--------VEAIKASHENLRLAGRKAQRSLGAGLLNVAYLA 344 (409) T ss_pred CCChh-HHHHHHHHHHHHHhhhcCCCHHHcccccCchhH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44332 589999999999999999999999864321100 01111111111 111111111111 Q ss_pred HHHHHhh-cCcccccceeeccc---ccChHHHHHHHHHHHHH-hcC--CcCHHHHHHHhCCCCCC Q lcl|NC_020081. 405 EDAVNKY-IVSQFGGDYVFNFV---GGDAKTEAEIISILESK-AKI--GLTINDIRKELGYPDTE 462 (552) Q Consensus 405 e~~ln~~-L~~~~~~~~~~~f~---~~d~~~~~~~~~~~~~~-~~g--~lT~NE~R~~~gl~p~~ 462 (552) ....+.. -.+.....+.+.|. ..+..+.++.+..+.+. .+| .+.-+-+++++|+..-+ T Consensus 345 ~~~~~~~~~~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 345 ACLRDDVPYLREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred HHHhcCCCccchhhccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 1111100 00011123445554 22333444444443333 333 33457779999997654 No 169 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=98.61 E-value=2e-07 Score=57.33 Aligned_cols=417 Identities=12% Similarity=0.083 Sum_probs=163.8 Q ss_pred cccccchhhhhccccccccccccccccccccccccCC----c----ccccccCCCCchHHHHHHHhhcch-HHHHHHHH- Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNP----D----FKEAPSIHGKQNLLQMLKLWSRKN-IILNAIII- 96 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~----~~~~~~~~~~~~~~~~Lr~~a~~~-~i~~a~i~- 96 (552) |+.. |.-.+ .+....+ +..++.-.- . +..... . .-.-...|+++-++. .|...-.. T Consensus 1 ~~~~-----~~~~~--~~~~~~~-----~~~~~~~~~~~~~~~i~~~i~~~~-~-~~~~~~~l~~Yy~g~~~i~~~~~~~ 66 (474) T protein:vir:96 1 MINI-----IRMPW--DKPYGEE-----VVEQMKPKVETQEEMIIRLINNHK-Q-KLKDINVGQKYYDKDNDINYQAYKQ 66 (474) T ss_pred Cccc-----ccCCC--CCCCCcc-----hhhhccccccchHHHHHHHHHHHH-H-HHHHHHHHHHHhcccCccccccchh Confidence 1111 11111 1111110 111100000 0 000000 0 000011111111110 11100000 Q ss_pred --------HHHH-H-HHHHHHHHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHH Q lcl|NC_020081. 97 --------TRVN-Q-VSMFCTPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVK 164 (552) Q Consensus 97 --------~~~~-~-~~~~~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~ 164 (552) .+.+ . +.-+++.+....++ +|-.+.+.. .+.+..+.+..|+. | .+..... T Consensus 67 ~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~-------~~~~~~~~l~~~~~--------n---~~~~~~~ 128 (474) T protein:vir:96 67 DLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAH-------DDDKVLDVIHQVLD--------T---RWDNKLI 128 (474) T ss_pred hhcccccccccccccccchHHHHHHhhhhhhcccCceecc-------CChHHHHHHHHHHh--------c---cHHHHHH Confidence 0000 0 01111222221111 111112111 11112233444432 1 2445666 Q ss_pred HHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCC--cccccccceeEEEEEcCCceEEEEcccceeeecc-- Q lcl|NC_020081. 165 KLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDED--GKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS-- 240 (552) Q Consensus 165 ~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~-- 240 (552) .+..+++.+|.+|..+-++.+|.+ .+..++|..+.++.++. +... ..++|+..........+....+.++.. T Consensus 129 ~l~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~---a~ir~~~~~~~~~~~vy~~~~i~~~~~~~ 204 (474) T protein:vir:96 129 DILTAASNKGIDWLQVYINEDGEL-KLFRVPAEQAIPIWTDKEREQLN---AFIRIFTFNGETKVEYWTAETVTYYVYEN 204 (474) T ss_pred HHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEEcCCCCCceE---EEEEEEeecCeeEEEEEeCCeEEEEEEcC Confidence 788999999999999999988875 57779999998887643 2211 122222222111222233333332211 Q ss_pred --------------------cccC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCC Q lcl|NC_020081. 241 --------------------NPRT-----DLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQ 295 (552) Q Consensus 241 --------------------~~~~-----~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~ 295 (552) |+.. .-.+...|.|-++.....++....+..-..+.+...+.|-.++. + ... T Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--g-~~~ 281 (474) T protein:vir:96 205 GGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILR--G-YEG 281 (474) T ss_pred CceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--C-CCc Confidence 0000 00012457777777777777766655555555566666654443 3 222 Q ss_pred CHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhccccccccc-c Q lcl|NC_020081. 296 SNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGAT-G 374 (552) Q Consensus 296 s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~-~ 374 (552) + ....+...+. ..++ +..+++.+...+.....+..+....+.+.+.|...-++|..-.. +++ + T Consensus 282 ~--~~~~~~~~~~--------~~~~-i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-----~~~~n 345 (474) T protein:vir:96 282 E--DLSEFMEGLK--------YYKA-INVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTD-----KFGSA 345 (474) T ss_pred c--cccchhhhhh--------ccce-eeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccc-----ccccc Confidence 2 1222222221 1222 22333444444455555667778888999999999999864321 111 1 Q ss_pred cccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCH Q lcl|NC_020081. 375 HSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTI 449 (552) Q Consensus 375 ~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~ 449 (552) .++....+. ......+..+...|+.+++.|...++.. .....+.+.|.+..+.+..+.++++.. +|++|. T Consensus 346 ~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~---~d~~~i~i~f~~~~p~~~~e~a~~~~~--~giiS~ 420 (474) T protein:vir:96 346 TSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIK---LDAKEIEITFNFNVMVNDLEQSQIGAQ--SQYLSK 420 (474) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC---cccceeeEEecCCCccCHHHHHHHHHH--cCCCCh Confidence 111111111 1112233344445555554444322211 111346677887777777777776544 589999 Q ss_pred HHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccc Q lcl|NC_020081. 450 NDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQ 529 (552) Q Consensus 450 NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (552) -.+++++++-.-+ + ..+..+...+... .+... ......++...+.+.+++++ . T Consensus 421 et~~~~lp~v~D~--~--------~E~eri~~E~~~~-~~~~~--~~~~~~~~~~~~~~~~~~~e-------------~- 473 (474) T protein:vir:96 421 ETLVRHHPWVDDP--K--------AELERLDEEQLEL-NKQLP--NLDDGGADGAQQQQQSENNQ-------------S- 473 (474) T ss_pred HHHHHhCCCCCCH--H--------HHHHHHHHHHHHH-Hhhcc--ccccccCCCCCCcCCCCccc-------------c- Confidence 8888887542211 0 1111111111000 00000 00000000000000000000 0 Q ss_pred cccccc Q lcl|NC_020081. 530 ANTNST 535 (552) Q Consensus 530 ~~~~~~ 535 (552) + T Consensus 474 -----~ 474 (474) T protein:vir:96 474 -----K 474 (474) T ss_pred -----C Confidence 0 No 170 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=98.61 E-value=2e-07 Score=57.33 Aligned_cols=417 Identities=12% Similarity=0.083 Sum_probs=163.8 Q ss_pred cccccchhhhhccccccccccccccccccccccccCC----c----ccccccCCCCchHHHHHHHhhcch-HHHHHHHH- Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNP----D----FKEAPSIHGKQNLLQMLKLWSRKN-IILNAIII- 96 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~----~~~~~~~~~~~~~~~~Lr~~a~~~-~i~~a~i~- 96 (552) |+.. |.-.+ .+....+ +..++.-.- . +..... . .-.-...|+++-++. .|...-.. T Consensus 1 ~~~~-----~~~~~--~~~~~~~-----~~~~~~~~~~~~~~~i~~~i~~~~-~-~~~~~~~l~~Yy~g~~~i~~~~~~~ 66 (474) T protein:vir:95 1 MINI-----IRMPW--DKPYGEE-----VVEQMKPKVETQEEMIIRLINNHK-Q-KLKDINVGQKYYDKDNDINYQAYKQ 66 (474) T ss_pred Cccc-----ccCCC--CCCCCcc-----hhhhccccccchHHHHHHHHHHHH-H-HHHHHHHHHHHhcccCccccccchh Confidence 1111 11111 1111110 111100000 0 000000 0 000011111111110 11100000 Q ss_pred --------HHHH-H-HHHHHHHHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHH Q lcl|NC_020081. 97 --------TRVN-Q-VSMFCTPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVK 164 (552) Q Consensus 97 --------~~~~-~-~~~~~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~ 164 (552) .+.+ . +.-+++.+....++ +|-.+.+.. .+.+..+.+..|+. | .+..... T Consensus 67 ~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~-------~~~~~~~~l~~~~~--------n---~~~~~~~ 128 (474) T protein:vir:95 67 DLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAH-------DDDKVLDVIHQVLD--------T---RWDNKLI 128 (474) T ss_pred hhcccccccccccccccchHHHHHHhhhhhhcccCceecc-------CChHHHHHHHHHHh--------c---cHHHHHH Confidence 0000 0 01111222221111 111112111 11112233444432 1 2445666 Q ss_pred HHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCC--cccccccceeEEEEEcCCceEEEEcccceeeecc-- Q lcl|NC_020081. 165 KLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDED--GKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS-- 240 (552) Q Consensus 165 ~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~-- 240 (552) .+..+++.+|.+|..+-++.+|.+ .+..++|..+.++.++. +... ..++|+..........+....+.++.. T Consensus 129 ~l~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~---a~ir~~~~~~~~~~~vy~~~~i~~~~~~~ 204 (474) T protein:vir:95 129 DILTAASNKGIDWLQVYINEDGEL-KLFRVPAEQAIPIWTDKEREQLN---AFIRIFTFNGETKVEYWTAETVTYYVYEN 204 (474) T ss_pred HHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEEcCCCCCceE---EEEEEEeecCeeEEEEEeCCeEEEEEEcC Confidence 788999999999999999988875 57779999998887643 2211 122222222111222233333332211 Q ss_pred --------------------cccC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCC Q lcl|NC_020081. 241 --------------------NPRT-----DLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQ 295 (552) Q Consensus 241 --------------------~~~~-----~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~ 295 (552) |+.. .-.+...|.|-++.....++....+..-..+.+...+.|-.++. + ... T Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--g-~~~ 281 (474) T protein:vir:95 205 GGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILR--G-YEG 281 (474) T ss_pred CceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--C-CCc Confidence 0000 00012457777777777777766655555555566666654443 3 222 Q ss_pred CHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhccccccccc-c Q lcl|NC_020081. 296 SNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGAT-G 374 (552) Q Consensus 296 s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~-~ 374 (552) + ....+...+. ..++ +..+++.+...+.....+..+....+.+.+.|...-++|..-.. +++ + T Consensus 282 ~--~~~~~~~~~~--------~~~~-i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-----~~~~n 345 (474) T protein:vir:95 282 E--DLSEFMEGLK--------YYKA-INVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTD-----KFGSA 345 (474) T ss_pred c--cccchhhhhh--------ccce-eeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccc-----ccccc Confidence 2 1222222221 1222 22333444444455555667778888999999999999864321 111 1 Q ss_pred cccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCH Q lcl|NC_020081. 375 HSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTI 449 (552) Q Consensus 375 ~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~ 449 (552) .++....+. ......+..+...|+.+++.|...++.. .....+.+.|.+..+.+..+.++++.. +|++|. T Consensus 346 ~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~---~d~~~i~i~f~~~~p~~~~e~a~~~~~--~giiS~ 420 (474) T protein:vir:95 346 TSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIK---LDAKEIEITFNFNVMVNDLEQSQIGAQ--SQYLSK 420 (474) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC---cccceeeEEecCCCccCHHHHHHHHHH--cCCCCh Confidence 111111111 1112233344445555554444322211 111346677887777777777776544 589999 Q ss_pred HHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccc Q lcl|NC_020081. 450 NDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQ 529 (552) Q Consensus 450 NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (552) -.+++++++-.-+ + ..+..+...+... .+... ......++...+.+.+++++ . T Consensus 421 et~~~~lp~v~D~--~--------~E~eri~~E~~~~-~~~~~--~~~~~~~~~~~~~~~~~~~e-------------~- 473 (474) T protein:vir:95 421 ETLVRHHPWVDDP--K--------AELERLDEEQLEL-NKQLP--NLDDGGADGAQQQQQSENNQ-------------S- 473 (474) T ss_pred HHHHHhCCCCCCH--H--------HHHHHHHHHHHHH-Hhhcc--ccccccCCCCCCcCCCCccc-------------c- Confidence 8888887542211 0 1111111111000 00000 00000000000000000000 0 Q ss_pred cccccc Q lcl|NC_020081. 530 ANTNST 535 (552) Q Consensus 530 ~~~~~~ 535 (552) + T Consensus 474 -----~ 474 (474) T protein:vir:95 474 -----K 474 (474) T ss_pred -----C Confidence 0 No 171 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=98.60 E-value=2.1e-07 Score=57.17 Aligned_cols=398 Identities=13% Similarity=0.097 Sum_probs=164.7 Q ss_pred cccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhc-chHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSR-KNIILNAIIITRVNQVSMF 105 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~-~~~i~~a~i~~~~~~~~~~ 105 (552) |+ ..+ +-...++... ..++.|..-.+ ....+..++......+.++ T Consensus 1 ~~-----~~~-----~~~~~~~~~~------------------------~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~ 46 (474) T protein:vir:95 1 MF-----NII-----RMPWDKPYGE------------------------EVVEQLKPQFETQEEMIIRLIDDHRKQLDKI 46 (474) T ss_pred Cc-----cee-----ecCCCCchhh------------------------HHHHhhhhccCChHHHHHHHHHHHHHHHHHH Confidence 00 000 0000000000 01111111110 0011111111111111111 Q ss_pred HH---------------------------------------HHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHH Q lcl|NC_020081. 106 CT---------------------------------------PARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFI 144 (552) Q Consensus 106 ~~---------------------------------------~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l 144 (552) .. .+....++ +|-.+.+.- .+......+..|+ T Consensus 47 ~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~-------~d~~~~~~l~~~~ 119 (474) T protein:vir:95 47 TVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSC-------EDESVLKIIHDVL 119 (474) T ss_pred HHHHHHhcccCchhccccccccccccccccccceeccchHHHHHHHHHhhhccCCceecc-------CchHHHHHHHHHH Confidence 11 11111111 111111110 1111122233333 Q ss_pred HhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCC--cccccccceeEEEEEc Q lcl|NC_020081. 145 EKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDED--GKERKAKDGVRYVQVI 222 (552) Q Consensus 145 ~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~y~~~~ 222 (552) . | .+......+..+.+.+|.+|..+.++.+|++ .+..++|..+.++.++. +... ..++|+... T Consensus 120 ~--------n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~---~~i~~~~~~ 184 (474) T protein:vir:95 120 D--------T---RWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPAEQAIPIWVDKEREELK---SFIRYYKFN 184 (474) T ss_pred h--------c---cHHHHHHHHHHHHhhcCcEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceE---EEEEEEEEc Confidence 2 1 2344566678899999999999989988885 47788999998877643 2211 123333333 Q ss_pred CCceEEEEcccceeeecc----------------------cccC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 223 DDKVVAKFKAKEMAWEVS----------------------NPRT-----DLTVGKYGYPELEIALNHLQYHDNTEVFNAR 275 (552) Q Consensus 223 ~~~~~~~~~~~evi~~~~----------------------~~~~-----~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~ 275 (552) .......++.+.+.+.+. |+-. .-.+...|.|-++-+...++....+..-..+ T Consensus 185 ~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~ 264 (474) T protein:vir:95 185 NEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSLIDAIDKRLSDAQN 264 (474) T ss_pred CeeEEEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeecCCCCCCCcHHHHHHHHHHHHHHHHHHHH Confidence 333333333333332211 1000 0011246788888777777776666666666 Q ss_pred HHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 276 FFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICS 355 (552) Q Consensus 276 ~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 355 (552) .+...+.|-.++. + ...++ .+.+.... ..++ ++..+++.+.+.++.......+....+.+.+.|+. T Consensus 265 ~~~~~~~p~lv~~--g-~~~~~--~~~~~~~~--------~~~~-~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~ 330 (474) T protein:vir:95 265 MFDESVELIYILK--G-YEGQD--LEEFMRGL--------KYYK-AINVDGDGGVETIQVEVPVSSTKEYIDLMRAYIME 330 (474) T ss_pred HHHHhcCceeeee--c-CCccc--chhhhhhh--------hccc-eeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHH Confidence 6666667755543 3 22222 11222211 1222 23333444444455555666777888999999999 Q ss_pred HhcCCHHHhcccccccccccccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChH Q lcl|NC_020081. 356 IYSIDPSEINFPNRGGATGHSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAK 430 (552) Q Consensus 356 ~fgVPp~~lg~~~~~t~~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~ 430 (552) .-++|..-.+-. .++.++....+. ..-...+..+...|+.+++.|...+... .....+.+.|.+..+. T Consensus 331 ~s~~p~~~~~~~----~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~---~d~~~i~v~f~~~~p~ 403 (474) T protein:vir:95 331 FGQGVDFQTDKF----GSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNLK---MDVKDIEISFNFNRMM 403 (474) T ss_pred HhCCcccccccc----cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC---cccceeeEEeccCCCc Confidence 999986322111 011111111000 0112222344445555555554433221 1123466778877777 Q ss_pred HHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCC Q lcl|NC_020081. 431 TEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNV 510 (552) Q Consensus 431 ~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (552) +.++.++++.. +|+||...+.+++++-+-+ + ..+..+...+.. .............+ +...+.. T Consensus 404 d~~e~a~~~~~--~g~iS~et~i~~l~~v~d~--~--------~E~~ri~~E~~~-~~~~~~~~~~~~~d---~~~~~~~ 467 (474) T protein:vir:95 404 NDAEQSQIIAQ--SQYLSRETLVKSSPLVDDY--K--------AELERIEQEQME-YNKQLPNLDDGGAD---GAQQQER 467 (474) T ss_pred CHHHHHHHHHh--cCCCchHHHHHhCCCCCCH--H--------HHHHHHHHHHHH-HHhcccccccccCC---CCcCCCC Confidence 77777776554 5899988888777542211 0 011111111100 00000000000000 0000000 Q ss_pred CCCCCcccccCCCCcccccc Q lcl|NC_020081. 511 NGKDSFNQNVGKDGQSKQQA 530 (552) Q Consensus 511 ~~~~~~~~~~~~~~~~~~~~ 530 (552) .+.+++ + T Consensus 468 ~~~~~~-------------~ 474 (474) T protein:vir:95 468 SNDKES-------------E 474 (474) T ss_pred CccCCC-------------C Confidence 000000 0 No 172 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=98.57 E-value=2.7e-07 Score=56.62 Aligned_cols=462 Identities=13% Similarity=0.084 Sum_probs=167.9 Q ss_pred cchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCC--cccccccCCCCchHHHHHHHhhcc-h Q lcl|NC_020081. 12 KQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNP--DFKEAPSIHGKQNLLQMLKLWSRK-N 88 (552) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~Lr~~a~~-~ 88 (552) --.-|=+|.-.+.. +..+..+ ++..+..-............+.- .+-.+-... ...-.+.|.++-.+ . T Consensus 1 ~~~~~~~~~~~~~~----~~~~~~~----~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~-~~~r~~~l~~Yy~g~~ 71 (511) T protein:vir:99 1 MLKVNEFETDTDLR----GNINYLF----NDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDY-QRPRLKVLSDYYEGKT 71 (511) T ss_pred Cccccchhhhhhhh----hhhhhhh----hhhhCCccccchhhhhhhccHHHHHHHHHHHHHh-hHHHHHHHHHHhcccC Confidence 00000011111111 0001111 00000000000000000000000 000000000 00001112221110 0 Q ss_pred HHH-HH---HHHHHHH--HHHHHHHHHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHH Q lcl|NC_020081. 89 IIL-NA---IIITRVN--QVSMFCTPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFR 160 (552) Q Consensus 89 ~i~-~a---~i~~~~~--~~~~~~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~ 160 (552) .++ .. -...+.+ .+.-+++.+....++ +|-.+.+... +......+.+|+... .+. T Consensus 72 ~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~-------d~~~~~~l~~~~~~n----------~~~ 134 (511) T protein:vir:99 72 KNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDD-------DKDVLEAIEAFNDLN----------DVE 134 (511) T ss_pred ccccccCcccccccCcceeecchHHHHHHHHHhhhcccCceeecC-------chHHHHHHHHHHhhc----------CHh Confidence 110 00 0000000 000111122221111 1111221111 111223444554331 244 Q ss_pred HHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEc--CC---c---eEEEEcc Q lcl|NC_020081. 161 SFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVI--DD---K---VVAKFKA 232 (552) Q Consensus 161 ~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~--~~---~---~~~~~~~ 232 (552) .....+..+++++|.+|..+.++.+|++ .+..++|..+.++.++..... ....++|+... .+ . ....+++ T Consensus 135 ~~~~~~~~~~~i~G~a~~~vy~ded~~~-~i~~~~p~~~~~vyd~~~~~~-~~~~vr~~~~~~~~~~~~~~~~~~~vyt~ 212 (511) T protein:vir:99 135 SHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERN-SIAGVRYLRTKPIDKTDEDEVFTVDLFTS 212 (511) T ss_pred HHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCc-eEEEEEEEEeeecccCccceEEEEEEEeC Confidence 5667788999999999999999988875 578899999998887543111 11122222211 10 0 1112333 Q ss_pred cceeeecccc----------------------cCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC Q lcl|NC_020081. 233 KEMAWEVSNP----------------------RTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIK 290 (552) Q Consensus 233 ~evi~~~~~~----------------------~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~ 290 (552) +.+.+++..- -..-.+...|.|.++.+...++....+..-..+.+...+.|-.++. T Consensus 213 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~-- 290 (511) T protein:vir:99 213 HGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIK-- 290 (511) T ss_pred CcEEEEEecCCccccccccccccccCCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhc-- Confidence 3333221100 0000011367787877777777666665555555555555544433 Q ss_pred CCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccc Q lcl|NC_020081. 291 TGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRG 370 (552) Q Consensus 291 ~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 370 (552) +....+.......++.-.-......-.....+-..++.+++.++.......+....+.+.+.|+..-++|..-.+-.. + T Consensus 291 G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-g 369 (511) T protein:vir:99 291 GNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-G 369 (511) T ss_pred cCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-c Confidence 322233333322222100000000000000112234556665665555667778889999999999999985443211 1 Q ss_pred cccccccccccchh-----HHHHHHHHHHHHhhHHHHHHHHHHHhhcCcc--cc-cceeecccccChHHHHHHHHHHHHH Q lcl|NC_020081. 371 GATGHSGNTLNEGS-----SAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQ--FG-GDYVFNFVGGDAKTEAEIISILESK 442 (552) Q Consensus 371 t~~~~~~~~~~~~n-----~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~--~~-~~~~~~f~~~d~~~~~~~~~~~~~~ 442 (552) + .++....+.. .....+..+..+|.-+++.|...+...--.. .. ..+.+.|.+.-+.+.++.++++... T Consensus 370 n---~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl 446 (511) T protein:vir:99 370 T---QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDS 446 (511) T ss_pred c---chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHH Confidence 1 1111111110 1111223333444444444443333221111 11 2456778777677777777666554 Q ss_pred hcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCC Q lcl|NC_020081. 443 AKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGK 522 (552) Q Consensus 443 ~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (552) .|++|.--+.++++.- ++-+ ..+..+...+... .+..... ...++...++.. .+..+ T Consensus 447 -~GiiS~et~l~~l~~v--~D~~--------~E~~ri~~E~~~~-~~~~~~~--~~~~~~~~~~~~---------~~~~~ 503 (511) T protein:vir:99 447 -GGKISQTTLMSLFSFF--QDPE--------LEVKKIEEDEKES-IKKAQKN--MYQDPRNINDDE---------QDDST 503 (511) T ss_pred -hccCCHHHHHHhCCCC--CCHH--------HHHHHHHHHHHHH-HHHHhhc--ccccCCCCCCCC---------CCCCC Confidence 4889987788876432 1100 1111111111000 0000000 000000000000 00000 Q ss_pred CCcccccc Q lcl|NC_020081. 523 DGQSKQQA 530 (552) Q Consensus 523 ~~~~~~~~ 530 (552) +.+.++++ T Consensus 504 ~~~~d~~e 511 (511) T protein:vir:99 504 KDSIDKKE 511 (511) T ss_pred cCcccccC Confidence 11111111 No 173 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=98.56 E-value=2.9e-07 Score=56.45 Aligned_cols=359 Identities=11% Similarity=0.054 Sum_probs=144.6 Q ss_pred cccCCcccccccCCCCchHHHHHHHhhcch-HHH-------HHHHHHHHHHHHHHHHHHHhhc----cccceeeeecccc Q lcl|NC_020081. 59 MSMNPDFKEAPSIHGKQNLLQMLKLWSRKN-IIL-------NAIIITRVNQVSMFCTPARNSD----KGVGYEIRLKDPL 126 (552) Q Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~-~i~-------~a~i~~~~~~~~~~~~~~~~~~----~~~~~~i~~k~~~ 126 (552) |...-.-.-.........-...+..|-++. .+. ...-. ....+.-.++.+..+. ...||.. ++ T Consensus 1 m~~~~i~~L~~~~~~~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~-~~~~v~nw~~~~Vd~~a~rl~~~Gf~~----~d 75 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTGVDKRYRYYAMDDRDDTRSIVMPNNVRE-MYRSVLEWTAKGVDSLADRIIFREFTN----DD 75 (422) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhhcCccccHHHHH-HHHhhcchhHHHHHHHHhccccceeeC----Cc Confidence 111000000000000000011122222211 000 00000 0011111222222222 1123211 11 Q ss_pred ccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECC-CCCEEEEEEecCceeEEEECC Q lcl|NC_020081. 127 QEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDK-LGDLHNFKAVDASTVYVAVDE 205 (552) Q Consensus 127 ~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~-~G~~~~L~~l~p~~v~v~~~~ 205 (552) ..+.+++.. | .+......+..+.|++|.+|+.|.++. .|.| .+.+++|..+.++.|+ T Consensus 76 -----------~~l~~~w~~-------N---~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p-~i~~~sp~~~~~i~D~ 133 (422) T protein:vir:97 76 -----------FNAWEIFKA-------N---NPDIFFDTAIQSALIASCCFVYIMPGAEDGLP-KMQVIEASKATGILDP 133 (422) T ss_pred -----------hhHHHHHHh-------c---ChHHHHHHHHHHHHHhcceeEEEeeCCCCCee-EEEEechhhEEEEEeC Confidence 113344432 1 233455577889999999999999875 5665 5888999999988876 Q ss_pred CcccccccceeE-EEEEcCCceE--EEEcccc---------------------eeeecccccCCccCCcccccHH-HHHH Q lcl|NC_020081. 206 DGKERKAKDGVR-YVQVIDDKVV--AKFKAKE---------------------MAWEVSNPRTDLTVGKYGYPEL-EIAL 260 (552) Q Consensus 206 ~g~~~~~~~~~~-y~~~~~~~~~--~~~~~~e---------------------vi~~~~~~~~~~~~g~~G~spl-~~~~ 260 (552) ..+.... .+. |.....+... ..++... |+++..++ ...+.+|.|.| +.++ T Consensus 134 ~~~~~~~--a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~---~~~~~~G~s~I~e~v~ 208 (422) T protein:vir:97 134 TTFLLTE--GYAILESDSNGNPTLEAYFTDKDIWYYPKKGKPYNIKNPTGHPLLVPIIHRP---DAVRPFGRSRITKAGM 208 (422) T ss_pred CCCccee--eEEEEEecCCCcEEEEEEEcCceEEEEcCCCccccccCCCCCcceEEecccC---CCccccCccccchhHH Confidence 4332211 111 1111111111 1111111 23333222 12346888865 3343 Q ss_pred HHHHHHH---HHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec----cCCceeeec Q lcl|NC_020081. 261 NHLQYHD---NTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT----AEDVKFVNM 333 (552) Q Consensus 261 ~~i~~~~---~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~----~~g~~~~~l 333 (552) ..++... .-......|| +.|.-.+. +...+....+.++.... ++..+. ++++++.++ T Consensus 209 ~l~da~~r~~~~~~~~~e~~---a~pqr~i~---G~d~d~~~~~~~~~~~~----------~i~~~~~de~~~~~~v~q~ 272 (422) T protein:vir:97 209 YHQKAAKRTLERAEVTAEFY---SFPQKYVL---GMDPDAKPMEKWRATVS----------TLLEISKDEDGDKPTVGQF 272 (422) T ss_pred HHHHHHHHHHHHHHHHHHHh---cchhhhhc---ccCcccccCchhhhhhh----------hhhccCCCCCCCcceeeec Confidence 3333333 3333344444 34443332 12212222223333222 222221 234666655 Q ss_pred cCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHH----------HHHHHhhHHHHH Q lcl|NC_020081. 334 TQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRN----------SKDKGLEPLLKF 403 (552) Q Consensus 334 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~----------~~~~~l~P~~~~ 403 (552) ....-+ .|++..+.....|+..-++|++.+|.......+ + ..+..+... .+...++-+++. T Consensus 273 ~~~~l~-~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsS---a-----~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rl 343 (422) T protein:vir:97 273 TTASMA-PFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSS---V-----ESIKAAHENLRAAGRKAQRSFSSGFLNVAYI 343 (422) T ss_pred CCCChh-HHHHHHHHHHHHHhcccCCCHHHhccccCchhH---H-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 544332 488999999999999999999999864321110 0 111111111 111122222221 Q ss_pred HHHHHHhhcC--cccccceeeccc---ccChHHHHHHHHHHHH-Hhc--CCcCHHHHHHHhCCCCCCCCCeeeccccccc Q lcl|NC_020081. 404 IEDAVNKYIV--SQFGGDYVFNFV---GGDAKTEAEIISILES-KAK--IGLTINDIRKELGYPDTEGGDVTLAGVHVQR 475 (552) Q Consensus 404 ie~~ln~~L~--~~~~~~~~~~f~---~~d~~~~~~~~~~~~~-~~~--g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~ 475 (552) +....+. .- +....++.+.|. ..+..+.++.+..+.+ ..+ |.++..-+++++|+...+. -. .. T Consensus 344 a~~~~~~-~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~~---~~-----~~ 414 (422) T protein:vir:97 344 AVCLRDE-FPYLRNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGVKGADK---PI-----PA 414 (422) T ss_pred HHHHhcC-CcccchhhccceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCCCchhH---HH-----HH Confidence 1111110 00 001123455564 3455555554443332 223 5678888999999954311 00 00 Q ss_pred hhhhccccccccccCC Q lcl|NC_020081. 476 LGQIMQQEQVEYQRQM 491 (552) Q Consensus 476 ~~~~~~~~~~~~~~~~ 491 (552) +.... + ++ T Consensus 415 ~~~~~----~----d~ 422 (422) T protein:vir:97 415 ITEVT----T----DG 422 (422) T ss_pred HHhhh----c----cC Confidence 00000 0 00 No 174 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.56 E-value=2.9e-07 Score=56.43 Aligned_cols=431 Identities=10% Similarity=0.045 Sum_probs=148.1 Q ss_pred cchhhhhcccccccccccccccccc--ccccccCCcccccc-cC--CCCchHHHHHHHh-hcchHHHHHHHHHHHHHHHH Q lcl|NC_020081. 31 IEEDAILKKGKNTKSNKPKAYEEPI--IGSMSMNPDFKEAP-SI--HGKQNLLQMLKLW-SRKNIILNAIIITRVNQVSM 104 (552) Q Consensus 31 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~-~~--~~~~~~~~~Lr~~-a~~~~i~~a~i~~~~~~~~~ 104 (552) +..+.-+.. .. -.++-..+-. ..+...--.|+... .. .+.. ....+++. ...++ .+.|+.+.++ T Consensus 1 ~~~~~~~d~---~~-~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~-~~~~~~~~~~~~n~-~~~ivd~~a~---- 70 (488) T protein:vir:23 1 MAETESIDP---EK-LRDQLLDAFENKQNELKSSKAYYDAERRPDAIGLA-VPLDMRKYLAHVGY-PRTYVDAIAE---- 70 (488) T ss_pred CCcccCCCH---HH-HHHHHHHHHHHHHHHHHHHHHHHhcccchhhcCcc-cchhhhhhhhhcch-HHHHHHHHHH---- Confidence 111111110 00 0000000000 00000000111000 00 0000 00111111 00111 1111111111 Q ss_pred HHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECC Q lcl|NC_020081. 105 FCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDK 184 (552) Q Consensus 105 ~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~ 184 (552) .....||.+-...........+......+.+++... .+......+..+.+++|.+|+.+.++. T Consensus 71 -------~l~~~Gf~~~~~~~~~~~~~~d~~~~~~l~~i~~~N----------~~~~~~~~~~~~a~i~G~a~~~v~~~~ 133 (488) T protein:vir:23 71 -------RQELEGFRIPSANGEEPESGGENDPASELWDWWQAN----------NLDIEATLGHTDALIYGTAYITISMPD 133 (488) T ss_pred -------hhhccceeccCCcccccccccchhHHHHHHHHHHhc----------ChhHHHHHHHHHHhhcCceEEEEecCC Confidence 011123332111111111111222234455555431 345567778899999999999887643 Q ss_pred --------CCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceE---EEEcccce------------------ Q lcl|NC_020081. 185 --------LGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVV---AKFKAKEM------------------ 235 (552) Q Consensus 185 --------~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~---~~~~~~ev------------------ 235 (552) .|.+ .+.+++|..+.++.++..+... ..++|+...+++.. ..|.++.+ T Consensus 134 ~~~~~~~~~~~~-~i~~~~p~~~~~~~d~~~~~~~--~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h 210 (488) T protein:vir:23 134 PEVDFDVDPEVP-LIRVEPPTALYAEVDPRTRKVL--YAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPH 210 (488) T ss_pred cccccCCCCCcc-eEEEeccceeEEEEecCCCceE--EEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEecccccc Confidence 2222 4678889888887764322111 11222222222111 12222222 Q ss_pred -------eeecccccCCccCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHH--HHHH Q lcl|NC_020081. 236 -------AWEVSNPRTDLTVGKYGYPELEI-ALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALT--SFRR 305 (552) Q Consensus 236 -------i~~~~~~~~~~~~g~~G~spl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~--~~~~ 305 (552) ++++.++ ...+++|.|-|+- +...++....+.........-.+.|..+|. + ...++...+ .-.. T Consensus 211 ~~g~vPvv~f~n~~---~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G-~~~~~~~~~~~~~~~ 284 (488) T protein:vir:23 211 GLEMVPVIPISNRT---RLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIF--G-AKPEELGINAETGQR 284 (488) T ss_pred CCCCcceEEecccc---ccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHh--C-CCcccccccccccch Confidence 2222211 1234688886642 233333333222222222222223433332 1 111111100 0111 Q ss_pred HHHHHhccccccccceeec-cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccch- Q lcl|NC_020081. 306 EWTSMFSGINGAWKIPVIT-AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEG- 383 (552) Q Consensus 306 ~~~~~~~G~~nagk~~il~-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~- 383 (552) .|+.. .+++.++. +++.++.++..... -.+++..+..+..|+..=++|++.+|....+..++ ....+. T Consensus 285 ~~~~~------~~~v~~~~~g~~~~~~q~~~~~~-~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg---~Al~~~~ 354 (488) T protein:vir:23 285 MFDAY------MARILAFEGGEGAHAEQFSAAEL-RNFVDALDALDRKAASYSGLPPQYLSSSSDNPASA---EAIKAAE 354 (488) T ss_pred hhhhh------hhhhccCCCCCCceeEecCCCCh-HHHHHHHHHHHHHHhcccCCCHHHhccccCcchHH---HHHHHHH Confidence 22221 12232332 23466766554332 34778888889999999999999987532211100 000000 Q ss_pred ----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHH-Hhc--CCcCHHHHHHHh Q lcl|NC_020081. 384 ----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILES-KAK--IGLTINDIRKEL 456 (552) Q Consensus 384 ----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~-~~~--g~lT~NE~R~~~ 456 (552) .-.+..+..+...|.-+++.+...++..-.+.....+.+.|....+.+..+.+..+.+ +.+ |+++..-+++++ T Consensus 355 ~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l 434 (488) T protein:vir:23 355 SRLVKKVERKNKIFGGAWEQAMRLAYKMVKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDM 434 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhC Confidence 0011111222223333333332211111001111245566765555555554443333 333 468888888998 Q ss_pred CCCCCCCCCeeeccccccchhhhccccccccccC---CCCCccCcccCCCCCCCCCCCCCCCcc Q lcl|NC_020081. 457 GYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQ---MDANQFLAQQTGYDGNMDNVNGKDSFN 517 (552) Q Consensus 457 gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~ 517 (552) |+-+.+- ..+....+.+....... .........+++..+..+.++.++.-+ T Consensus 435 ~~~~d~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a 488 (488) T protein:vir:23 435 GYTIVER----------EQMRQWLEQDQKQGLGLIGSLYGASTPEGKPGEAPVGEPPAPEPDAA 488 (488) T ss_pred CCCchHH----------HHHHHHHHHHHHHHHHHHHHHhccCCCcccCCCCCCCCCCCCCCCCC Confidence 8743210 00000000000000000 000000000111111111111111111 No 175 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=98.53 E-value=3.4e-07 Score=56.01 Aligned_cols=395 Identities=12% Similarity=0.120 Sum_probs=161.0 Q ss_pred cccccccccccccCCcccccccCCCC--chHHHHHHHhhcchHHHHHHHHHHHHHHHHH--------------------- Q lcl|NC_020081. 49 KAYEEPIIGSMSMNPDFKEAPSIHGK--QNLLQMLKLWSRKNIILNAIIITRVNQVSMF--------------------- 105 (552) Q Consensus 49 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~--------------------- 105 (552) ..+..++...+ +..-..... +.+.+.|..+ +........++ T Consensus 1 ~~~~~~~~~~~------~~~~~~~~~~~~~~~~~i~~~----------i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~ 64 (472) T protein:vir:93 1 MYPSQPTQTEI------FDAIVRTNNKPETLEEMIVRY----------IKQHLEKLPEISIGQEYYEQRPDIVKEPKPVD 64 (472) T ss_pred CCCCCCcchhh------hhceeeecCchhhHHHHHHHH----------HHHHHHHHHHHHHHHHHhccccccccccchhh Confidence 11111111100 000000000 0111111111 11111111111 Q ss_pred ------------------HHHHHhhcccc--ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHH Q lcl|NC_020081. 106 ------------------CTPARNSDKGV--GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKK 165 (552) Q Consensus 106 ------------------~~~~~~~~~~~--~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~ 165 (552) ++.+....++. |-.+.+.. . +......+..|+. | ........ T Consensus 65 ~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~~~~-----~--d~~~~~~l~~~~~--------n---~~~~~~~~ 126 (472) T protein:vir:93 65 ATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH-----T--DDEVVKRIDEVLG--------N---RFDDKLHS 126 (472) T ss_pred ccccccccccccccccchHHHHHHHHhhhhcccCeeecc-----C--ChHHHHHHHHHHh--------c---cHHHHHHH Confidence 11111111111 11111111 1 1112223333332 1 23345566 Q ss_pred HHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCC--cccccccceeEEEEEcCCceEEEEcccceeeec---- Q lcl|NC_020081. 166 LVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDED--GKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEV---- 239 (552) Q Consensus 166 ~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~---- 239 (552) +..+.+.+|.+|..+..+.+|++. +..++|..+.++.++. +... -.++|+..........+....+.++. T Consensus 127 ~~~~~~~~G~~~~~v~~d~d~~~~-i~~~~p~~~~~i~d~~~~~~~~---~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (472) T protein:vir:93 127 VLTGASNKGIEWLHPYLDEEGEFK-LFRVPAEQGIPIWTDKEHEELE---AFIRMYKLENETKVEYWDKVTVNYYVYENG 202 (472) T ss_pred HHHHHhhcCeEEEEEEECCCCceE-EEEEcccceEEEEcCCCCCceE---EEEEEEEeecceeEEEEecCeEEEEEEecC Confidence 788999999999999999888864 7789999999887642 2211 12222222222222222222211111 Q ss_pred ------------------ccccC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCC Q lcl|NC_020081. 240 ------------------SNPRT-----DLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQS 296 (552) Q Consensus 240 ------------------~~~~~-----~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s 296 (552) .|+.. .-.+..+|.|-++.+...++....+.....+.+...+.|-.++. +. . T Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~--g~-~-- 277 (472) T protein:vir:93 203 SLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT--NY-D-- 277 (472) T ss_pred eeeecccccccccccccccCCCCCcceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEee--cC-C-- Confidence 01000 00012468888888888887777666666666777777765554 32 1 Q ss_pred HHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccc Q lcl|NC_020081. 297 NQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHS 376 (552) Q Consensus 297 ~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~ 376 (552) .+....+...+. .++++.+ +++.+...+........+....+.+.+.|+..-++|..-.+... + +.+ T Consensus 278 ~~~~~~~~~~~~--------~~~~~~~-~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~---n~S 344 (472) T protein:vir:93 278 DQELPEFKRLLR--------YYGAIKV-SDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG-S---APS 344 (472) T ss_pred cccchhhHHHHh--------hcccccc-CCCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccc-c---Cch Confidence 111222222221 1112222 23334444444455677888899999999999999865443211 0 111 Q ss_pred cccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHH Q lcl|NC_020081. 377 GNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTIND 451 (552) Q Consensus 377 ~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE 451 (552) +....+. .--...+..+...|+-+++.|...++.. ....++.+.|.+..+.+.++.++++.+. .|+++.-- T Consensus 345 g~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~---~~~~~i~v~f~~~~p~~~~~~~~~~~k~-~giis~et 420 (472) T protein:vir:93 345 GVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK---GEHKDVDISFNYNKVANTELQVQTAQQS-MGIVSHET 420 (472) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC---cccceeeEEeCCCCCCCHHHHHHHHHHH-hccCchHH Confidence 1000000 0011222333344444444444333221 1123566777777776666666665554 58899877 Q ss_pred HHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcccccc Q lcl|NC_020081. 452 IRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQA 530 (552) Q Consensus 452 ~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (552) +.+++++-+-+ + ..+..+...+. ......... . ...++.....+ ++.++.++ T Consensus 421 ~l~~l~~~~d~--~--------~E~~ri~~E~~-~~~~~~~~~----~----~~~~d~~~~~~--------~~~~~~~e 472 (472) T protein:vir:93 421 VLENHPFVEDL--Q--------AELERIEQEQM-EYNKQLPNL----D----DGGADGAQQQE--------RSNNKESE 472 (472) T ss_pred HHHhCCCCCCH--H--------HHHHHHHHHHH-HHHHhccCc----C----cccCCCCCCCC--------CCCcccCC Confidence 77777542210 0 01111111100 000000000 0 00000000000 00000000 No 176 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=98.53 E-value=3.6e-07 Score=55.92 Aligned_cols=435 Identities=14% Similarity=0.128 Sum_probs=163.0 Q ss_pred chhhhhccccccccc--cccccccc-cccccccCCcccccc-c--CCCCchHHHHHHHhhcc--hHHHHHHHHHHH---- Q lcl|NC_020081. 32 EEDAILKKGKNTKSN--KPKAYEEP-IIGSMSMNPDFKEAP-S--IHGKQNLLQMLKLWSRK--NIILNAIIITRV---- 99 (552) Q Consensus 32 ~~~~~~~~~~~~~~~--~~~~~~~~-~~~~~~~~~~~~~~~-~--~~~~~~~~~~Lr~~a~~--~~i~~a~i~~~~---- 99 (552) +.-.+|..+. ++.- ..+.+-.+ +...+ .+..-... . ......-.+.|.++-++ ..+...-..... T Consensus 1 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~ 77 (481) T protein:vir:10 1 MTVYTINNIN-TKFSPLANDDFVVSDLAELL--KEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDK 77 (481) T ss_pred CeeEeeehhc-hhcccccCceeeeecchhhc--CHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCcccccccccc Confidence 1111221110 0000 00000000 00000 00000000 0 00000001111111111 011000000000 Q ss_pred -H-H-HHHHHHHHHhhcccc--ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcC Q lcl|NC_020081. 100 -N-Q-VSMFCTPARNSDKGV--GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYD 174 (552) Q Consensus 100 -~-~-~~~~~~~~~~~~~~~--~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~G 174 (552) + . +.-+++.+....+++ |-.+.+.. .+......+.+++.. ..+..+...+..+.+++| T Consensus 78 ~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~-------~d~~~~~~l~~~~~~----------n~~~~~~~~~~~~~~~~G 140 (481) T protein:vir:10 78 ADHRAVHNYAKYVSRFIVGYLTGNPITITH-------QDNQTNDKIIELNDL----------NDADEVNSDLALNLSIYG 140 (481) T ss_pred ccceeecchHHHHHHHHHhhhccCCceEec-------CChhHHHHHHHHHHh----------cChhHHHHHHHHHHHhcC Confidence 0 0 000111111111110 10111110 111122334444432 134567788999999999 Q ss_pred CeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCC-c----eEEEEcccceeeecc--------- Q lcl|NC_020081. 175 KINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDD-K----VVAKFKAKEMAWEVS--------- 240 (552) Q Consensus 175 na~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~-~----~~~~~~~~evi~~~~--------- 240 (552) .+|+.+.++.+|++ .+..++|..+.++.++.+.... ...++|+..... . ....+..+.+.++.. T Consensus 141 ~~~~~~~~d~dg~~-~i~~~~p~~~~~v~d~~~~~~~-~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~ 218 (481) T protein:vir:10 141 RAYEIVYRDFEDRD-TFKVLDPKSTFVVYDQTLDKKV-VAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVE 218 (481) T ss_pred eEEEEEEeCCCCeE-EEEEEcccceEEEEcCCCCCce-EEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeecc Confidence 99999999999986 4788999999888776532110 011222211111 0 011222232222211 Q ss_pred ---c-----ccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_020081. 241 ---N-----PRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFS 312 (552) Q Consensus 241 ---~-----~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~ 312 (552) | |-..-.+..+|.|-++.+...++.......-....+...+.|-.++. +....+++....++..- ... T Consensus 219 ~~~~~~g~vPvv~~~n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~--g~~~~~~~~~~~~~~~~--~~~ 294 (481) T protein:vir:10 219 EVEHYYNDVPIIEYLNDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAII--GNVDLDSEDAKAFRDAN--MIH 294 (481) T ss_pred cccccCCceeEEEeecCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEee--cCcCCCccchhhhhhcc--cee Confidence 0 00000112467787777666666555554444444555556655543 33333443333333210 000 Q ss_pred cccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccc--hh---HHH Q lcl|NC_020081. 313 GINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNE--GS---SAE 387 (552) Q Consensus 313 G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~--~n---~e~ 387 (552) ...+.. ....+++.++.-+........+.+..+.+.+.|+..-++|....+... ++.+| ....+ .. .-. T Consensus 295 ~~~~~~--~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg---~Al~~~~~~l~~k~~ 368 (481) T protein:vir:10 295 LEPGTN--ANGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFS-GVQSG---ESMKYKLFGLEQVRA 368 (481) T ss_pred cccccc--ccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-cccHH---HHHHHHHHHHHHHHH Confidence 000100 111223344444444445566778889999999999999986655321 11111 00000 00 011 Q ss_pred HHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCe Q lcl|NC_020081. 388 KYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDV 466 (552) Q Consensus 388 ~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~ 466 (552) ..+..+...|.-+++.+...++..-....+ ..+.+.|.+..+.+.++.++++... .|+++.-.+.+++++-. +-+ T Consensus 369 ~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl-~g~is~et~~~~l~~i~--d~~- 444 (481) T protein:vir:10 369 IKERLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFTPNLPKSMMESINAFNAL-SGGVSESTRLSLLDFID--NPK- 444 (481) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCccccceeeEEeCCCCCcCHHHHHHHHHHH-hccCChHHHHHhCCCCC--CHH- Confidence 111222333333333333332222111222 2456778777777777777766554 48888877777775421 100 Q ss_pred eeccccccchhhhccccccccccCCCCCccCcccCC-CCCCCCCCCC Q lcl|NC_020081. 467 TLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTG-YDGNMDNVNG 512 (552) Q Consensus 467 ~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 512 (552) ..+..+...+.. ..+ ...+....+.. ..++.|..+| T Consensus 445 -------~E~~ri~~E~~~-~~~--~~~~~~~~~~~~~~~~~dd~~g 481 (481) T protein:vir:10 445 -------EELEKMQEEEAQ-REK--QADKRGYGEAFENHLNVDDSNG 481 (481) T ss_pred -------HHHHHHHHHHHH-HHh--hhhhccCCccCCCCCCCCCCCC Confidence 111111111100 000 00000000000 0111111111 No 177 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=98.52 E-value=3.7e-07 Score=55.85 Aligned_cols=423 Identities=12% Similarity=0.097 Sum_probs=162.1 Q ss_pred cccccchhhhhccccccccccccccccccccccccCCccccccc--CCCCchHHHHHHHhhcc-hHHHHHHHHH------ Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPS--IHGKQNLLQMLKLWSRK-NIILNAIIIT------ 97 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~Lr~~a~~-~~i~~a~i~~------ 97 (552) |++....+ + .|...++.- +.+..........-.+.. ...+..-...+.++-++ ..|+...... T Consensus 1 ~~~~~~~~-----~--~~~~~~~~~-~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~ 72 (474) T protein:vir:97 1 MFNIIRMP-----W--DKPYGEEVV-EQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNI 72 (474) T ss_pred Cccccccc-----C--CCchhhHHH-HhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhcccccc Confidence 11111100 0 000000000 000000000000000000 00000000111111111 1111100000 Q ss_pred ---HHH-H-HHHHHHHHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHH Q lcl|NC_020081. 98 ---RVN-Q-VSMFCTPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDR 170 (552) Q Consensus 98 ---~~~-~-~~~~~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ 170 (552) ..+ . +.-+++.+....++ +|-.+.+.- . +....+.+..|+. | .+......+..+. T Consensus 73 ~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~-----~--d~~~~~~l~~~~~--------n---~~~~~~~e~~~~~ 134 (474) T protein:vir:97 73 DYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSC-----E--DENVLKVIHDVLD--------T---RWDNKLIDILTAT 134 (474) T ss_pred ccccCcceeecchHHHHHHHHHhhhhcCCceecc-----C--cHHHHHHHHHHHh--------c---cHHHHHHHHHHHH Confidence 000 0 00011111111111 111111110 1 1111222333321 1 2345566778999 Q ss_pred HhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCC--cccccccceeEEEEEcCCceEEEEcccceeeecc-------- Q lcl|NC_020081. 171 LTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDED--GKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS-------- 240 (552) Q Consensus 171 ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~-------- 240 (552) +.+|.+|..+.++.+|++ .+..++|..+.++.++. +.... .++|+..........++.+.+.+.+. T Consensus 135 ~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~---~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~ 210 (474) T protein:vir:97 135 SNKGIDWLQVYINENGEM-KLFRVPAEQAIPIWVDKEREELKS---FIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPD 210 (474) T ss_pred hhcCceEEEEEecCCCee-EEEEEcccceEEEEcCCCCCceEE---EEEEEEecCeEEEEEEeCCeEEEEEEcCCccccc Confidence 999999999999998875 47789999999887653 22211 22322222221222222222222110 Q ss_pred --------------ccc-----CCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHH Q lcl|NC_020081. 241 --------------NPR-----TDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALT 301 (552) Q Consensus 241 --------------~~~-----~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~ 301 (552) |+. ..-.+..+|.|-++.+...++....+.....+.+...+.|-.++. + ...++ .+ T Consensus 211 ~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~--g-~~~~~--~~ 285 (474) T protein:vir:97 211 YYYGANHVQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILK--G-YEGED--LE 285 (474) T ss_pred cccCcCcccccccccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--c-CCccc--ch Confidence 000 000112478888888878777777666666666666666665553 3 22222 12 Q ss_pred HHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccc-cccccc Q lcl|NC_020081. 302 SFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATG-HSGNTL 380 (552) Q Consensus 302 ~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~-~~~~~~ 380 (552) .+.... ...++ +...++.+...+........+....+.+.+.|...-++|..-.+ ++++ .++... T Consensus 286 ~~~~~~--------~~~~~-i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-----~~~~n~Sg~Al 351 (474) T protein:vir:97 286 EFMRGL--------KYYKA-INVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTD-----KFGSAPSGIAL 351 (474) T ss_pred hhhhhh--------hccce-eeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcc-----ccccccHHHHH Confidence 222211 11222 33333444444444445566677788888999999888853221 1111 111110 Q ss_pred cch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHH Q lcl|NC_020081. 381 NEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKE 455 (552) Q Consensus 381 ~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~ 455 (552) .+. .........+...|+.+++.|...++.. .....+.+.|.+..+.+.++.++++.. +|+++.--++++ T Consensus 352 ~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~---~d~~~i~v~f~~~~p~~~~e~a~~~~~--~g~iS~et~l~~ 426 (474) T protein:vir:97 352 KFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLK---TDVKDIEISFNFNRMMNDAEQSQIIAQ--SQYLSRETLVKS 426 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC---cccceeeEEeccCcccCHHHHHHHHHH--cCCCCHHHHHHh Confidence 000 0112223344555555555554433221 112346677877777777777666544 589999888887 Q ss_pred hCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccc Q lcl|NC_020081. 456 LGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQ 529 (552) Q Consensus 456 ~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (552) ++.- ++-+ ..+..+...+.. ..+ ..+.... ...+ .++ ....+++.+++ T Consensus 427 l~~v--~D~~--------~E~eri~~E~~~-~~~---~~~~~~~-----~~~~--~~~-----~~~~~~~~~~e 474 (474) T protein:vir:97 427 SPLV--DDYK--------AELERIEQEQME-YNK---QLPNLDD-----GGAD--GAQ-----QQEGSNNKESE 474 (474) T ss_pred CCCC--CCHH--------HHHHHHHHHHHH-HHh---hccccCC-----CCCC--Ccc-----cCCCCcccccC Confidence 7542 1100 111111111100 000 0000000 0000 000 00000011111 No 178 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=98.52 E-value=3.7e-07 Score=55.85 Aligned_cols=423 Identities=12% Similarity=0.097 Sum_probs=162.1 Q ss_pred cccccchhhhhccccccccccccccccccccccccCCccccccc--CCCCchHHHHHHHhhcc-hHHHHHHHHH------ Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPS--IHGKQNLLQMLKLWSRK-NIILNAIIIT------ 97 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~Lr~~a~~-~~i~~a~i~~------ 97 (552) |++....+ + .|...++.- +.+..........-.+.. ...+..-...+.++-++ ..|+...... T Consensus 1 ~~~~~~~~-----~--~~~~~~~~~-~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~ 72 (474) T protein:vir:94 1 MFNIIRMP-----W--DKPYGEEVV-EQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNI 72 (474) T ss_pred Cccccccc-----C--CCchhhHHH-HhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhcccccc Confidence 11111100 0 000000000 000000000000000000 00000000111111111 1111100000 Q ss_pred ---HHH-H-HHHHHHHHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHH Q lcl|NC_020081. 98 ---RVN-Q-VSMFCTPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDR 170 (552) Q Consensus 98 ---~~~-~-~~~~~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ 170 (552) ..+ . +.-+++.+....++ +|-.+.+.- . +....+.+..|+. | .+......+..+. T Consensus 73 ~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~-----~--d~~~~~~l~~~~~--------n---~~~~~~~e~~~~~ 134 (474) T protein:vir:94 73 DYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSC-----E--DENVLKVIHDVLD--------T---RWDNKLIDILTAT 134 (474) T ss_pred ccccCcceeecchHHHHHHHHHhhhhcCCceecc-----C--cHHHHHHHHHHHh--------c---cHHHHHHHHHHHH Confidence 000 0 00011111111111 111111110 1 1111222333321 1 2345566778999 Q ss_pred HhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCC--cccccccceeEEEEEcCCceEEEEcccceeeecc-------- Q lcl|NC_020081. 171 LTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDED--GKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS-------- 240 (552) Q Consensus 171 ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~-------- 240 (552) +.+|.+|..+.++.+|++ .+..++|..+.++.++. +.... .++|+..........++.+.+.+.+. T Consensus 135 ~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~---~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~ 210 (474) T protein:vir:94 135 SNKGIDWLQVYINENGEM-KLFRVPAEQAIPIWVDKEREELKS---FIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPD 210 (474) T ss_pred hhcCceEEEEEecCCCee-EEEEEcccceEEEEcCCCCCceEE---EEEEEEecCeEEEEEEeCCeEEEEEEcCCccccc Confidence 999999999999998875 47789999999887653 22211 22322222221222222222222110 Q ss_pred --------------ccc-----CCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHH Q lcl|NC_020081. 241 --------------NPR-----TDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALT 301 (552) Q Consensus 241 --------------~~~-----~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~ 301 (552) |+. ..-.+..+|.|-++.+...++....+.....+.+...+.|-.++. + ...++ .+ T Consensus 211 ~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~--g-~~~~~--~~ 285 (474) T protein:vir:94 211 YYYGANHVQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILK--G-YEGED--LE 285 (474) T ss_pred cccCcCcccccccccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--c-CCccc--ch Confidence 000 000112478888888878777777666666666666666665553 3 22222 12 Q ss_pred HHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccc-cccccc Q lcl|NC_020081. 302 SFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATG-HSGNTL 380 (552) Q Consensus 302 ~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~-~~~~~~ 380 (552) .+.... ...++ +...++.+...+........+....+.+.+.|...-++|..-.+ ++++ .++... T Consensus 286 ~~~~~~--------~~~~~-i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-----~~~~n~Sg~Al 351 (474) T protein:vir:94 286 EFMRGL--------KYYKA-INVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTD-----KFGSAPSGIAL 351 (474) T ss_pred hhhhhh--------hccce-eeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcc-----ccccccHHHHH Confidence 222211 11222 33333444444444445566677788888999999888853221 1111 111110 Q ss_pred cch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHH Q lcl|NC_020081. 381 NEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKE 455 (552) Q Consensus 381 ~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~ 455 (552) .+. .........+...|+.+++.|...++.. .....+.+.|.+..+.+.++.++++.. +|+++.--++++ T Consensus 352 ~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~---~d~~~i~v~f~~~~p~~~~e~a~~~~~--~g~iS~et~l~~ 426 (474) T protein:vir:94 352 KFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLK---TDVKDIEISFNFNRMMNDAEQSQIIAQ--SQYLSRETLVKS 426 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC---cccceeeEEeccCcccCHHHHHHHHHH--cCCCCHHHHHHh Confidence 000 0112223344555555555554433221 112346677877777777777666544 589999888887 Q ss_pred hCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccc Q lcl|NC_020081. 456 LGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQ 529 (552) Q Consensus 456 ~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 529 (552) ++.- ++-+ ..+..+...+.. ..+ ..+.... ...+ .++ ....+++.+++ T Consensus 427 l~~v--~D~~--------~E~eri~~E~~~-~~~---~~~~~~~-----~~~~--~~~-----~~~~~~~~~~e 474 (474) T protein:vir:94 427 SPLV--DDYK--------AELERIEQEQME-YNK---QLPNLDD-----GGAD--GAQ-----QQEGSNNKESE 474 (474) T ss_pred CCCC--CCHH--------HHHHHHHHHHHH-HHh---hccccCC-----CCCC--Ccc-----cCCCCcccccC Confidence 7542 1100 111111111100 000 0000000 0000 000 00000011111 No 179 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=98.51 E-value=3.9e-07 Score=55.71 Aligned_cols=459 Identities=12% Similarity=0.082 Sum_probs=169.5 Q ss_pred ccc---ccCcccccccccchhhhhccccccccccccccccccccccccCC--cccccccCCCCchHHHHHHHhhcc-hHH Q lcl|NC_020081. 17 IID---INDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNP--DFKEAPSIHGKQNLLQMLKLWSRK-NII 90 (552) Q Consensus 17 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~Lr~~a~~-~~i 90 (552) ++. ++...+.. |..+..+.+ ..+..-..............- .+-...... ...-++.|+++-.+ ..+ T Consensus 1 ~~~~~~~~~~~~~~--~~~~~~~~~----~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~-~~~r~~~l~~Yy~g~~~i 73 (511) T protein:vir:78 1 MLKVNEFETDTDLR--GNINYLFND----EANVVYTYDGTESDLLQNVNEVSKYIEHHMDY-QRPRLKVLSDYYEGKTKN 73 (511) T ss_pred Cccccchhhhhhhh--hhhhhhhhh----hhCCcccccchhhhhhcCHHHHHHHHHHHHHh-hhHHHHHHHHHhhccCcc Confidence 210 11111110 111111100 000000000000000000000 000000000 00001111111110 011 Q ss_pred H-HHH---HHHHHH-H-HHHHHHHHHhhcccc--ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHH Q lcl|NC_020081. 91 L-NAI---IITRVN-Q-VSMFCTPARNSDKGV--GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSF 162 (552) Q Consensus 91 ~-~a~---i~~~~~-~-~~~~~~~~~~~~~~~--~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f 162 (552) + +.. ...+.+ . +.-+++.+....+++ |-.+.+... +......+.+++... ....+ T Consensus 74 l~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~-------d~~~~~~l~~~~~~n----------~~~~~ 136 (511) T protein:vir:78 74 LVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDD-------DKDVLEAIEAFNDLN----------DVESH 136 (511) T ss_pred ccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecC-------chHHHHHHHHHHhhc----------ChhHH Confidence 0 000 000000 0 001112222222111 111111111 111223344554331 23456 Q ss_pred HHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEc--CCc------eEEEEcccc Q lcl|NC_020081. 163 VKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVI--DDK------VVAKFKAKE 234 (552) Q Consensus 163 ~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~--~~~------~~~~~~~~e 234 (552) ...+..+++++|.+|..+-++.+|++ .+..++|..+.++.++..... ..-.++|+... .+. ....++++. T Consensus 137 ~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~p~~~~~v~dd~~~~~-~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~ 214 (511) T protein:vir:78 137 NRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFIIYDNTVERN-SIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG 214 (511) T ss_pred HHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCc-eEEEEEEEEeeeccccccceEEEEEEEeCCc Confidence 67788899999999999999998875 578899999998887643211 11122232211 100 111233333 Q ss_pred eeeeccccc-------------CCc---------cCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC Q lcl|NC_020081. 235 MAWEVSNPR-------------TDL---------TVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTG 292 (552) Q Consensus 235 vi~~~~~~~-------------~~~---------~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ 292 (552) +.++...-. .++ .+...|.|-++.+...++....+..-..+.+...+.|-.++. +. T Consensus 215 i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~--G~ 292 (511) T protein:vir:78 215 VYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIK--GN 292 (511) T ss_pred EEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhhee--cC Confidence 332211100 000 012357888887777777766665555555555556654443 32 Q ss_pred CCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc Q lcl|NC_020081. 293 QEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGA 372 (552) Q Consensus 293 ~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~ 372 (552) ...+.++....+....-......-......-...+.+.+.++.......+....+.+.+.|+..-++|..-.+... + T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~-- 369 (511) T protein:vir:78 293 LNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-G-- 369 (511) T ss_pred ccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-c-- Confidence 2233333322221110000000000000001122344444555555666778888999999999999976443221 1 Q ss_pred cccccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCc--ccc-cceeecccccChHHHHHHHHHHHHHhc Q lcl|NC_020081. 373 TGHSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVS--QFG-GDYVFNFVGGDAKTEAEIISILESKAK 444 (552) Q Consensus 373 ~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~--~~~-~~~~~~f~~~d~~~~~~~~~~~~~~~~ 444 (552) +.++....+. ......+..+...|.-+++.|...+...--. ..+ ..+.+.|.+.-+.+..+.++++... . T Consensus 370 -n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl-~ 447 (511) T protein:vir:78 370 -TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS-G 447 (511) T ss_pred -ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHH-h Confidence 1111111111 1122233344555555555555444332111 111 2467778877777777777666554 4 Q ss_pred CCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCC Q lcl|NC_020081. 445 IGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDG 524 (552) Q Consensus 445 g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (552) |+++..-+.+++++ +++-+ ..+..+...+... .+.... .... +.........++..++ +. T Consensus 448 G~iS~et~l~~l~~--v~d~~--------~El~ri~~E~~~~-~~~~~~--~~~~----~~~~~~~~~~~~~~~~---~~ 507 (511) T protein:vir:78 448 GKISQTTLMSLFSF--FQDPE--------LEVKKIEEDEKES-IKKAQK--GIYK----DPRDINDDEQDDDTKD---TV 507 (511) T ss_pred ccCChHHHHHhCCC--CCCHH--------HHHHHHHHHHHHH-HHHHhh--cccc----CCCCCCCCCCCCCccC---cc Confidence 88998777777643 21100 1111111111000 000000 0000 0000000000000000 00 Q ss_pred cccccc Q lcl|NC_020081. 525 QSKQQA 530 (552) Q Consensus 525 ~~~~~~ 530 (552) .+ ++ T Consensus 508 ~e--~~ 511 (511) T protein:vir:78 508 DK--KE 511 (511) T ss_pred cc--cC Confidence 00 00 No 180 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=98.51 E-value=3.9e-07 Score=55.71 Aligned_cols=459 Identities=12% Similarity=0.082 Sum_probs=169.5 Q ss_pred ccc---ccCcccccccccchhhhhccccccccccccccccccccccccCC--cccccccCCCCchHHHHHHHhhcc-hHH Q lcl|NC_020081. 17 IID---INDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNP--DFKEAPSIHGKQNLLQMLKLWSRK-NII 90 (552) Q Consensus 17 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~Lr~~a~~-~~i 90 (552) ++. ++...+.. |..+..+.+ ..+..-..............- .+-...... ...-++.|+++-.+ ..+ T Consensus 1 ~~~~~~~~~~~~~~--~~~~~~~~~----~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~-~~~r~~~l~~Yy~g~~~i 73 (511) T protein:vir:96 1 MLKVNEFETDTDLR--GNINYLFND----EANVVYTYDGTESDLLQNVNEVSKYIEHHMDY-QRPRLKVLSDYYEGKTKN 73 (511) T ss_pred Cccccchhhhhhhh--hhhhhhhhh----hhCCcccccchhhhhhcCHHHHHHHHHHHHHh-hhHHHHHHHHHhhccCcc Confidence 210 11111110 111111100 000000000000000000000 000000000 00001111111110 011 Q ss_pred H-HHH---HHHHHH-H-HHHHHHHHHhhcccc--ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHH Q lcl|NC_020081. 91 L-NAI---IITRVN-Q-VSMFCTPARNSDKGV--GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSF 162 (552) Q Consensus 91 ~-~a~---i~~~~~-~-~~~~~~~~~~~~~~~--~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f 162 (552) + +.. ...+.+ . +.-+++.+....+++ |-.+.+... +......+.+++... ....+ T Consensus 74 l~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~-------d~~~~~~l~~~~~~n----------~~~~~ 136 (511) T protein:vir:96 74 LVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDD-------DKDVLEAIEAFNDLN----------DVESH 136 (511) T ss_pred ccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecC-------chHHHHHHHHHHhhc----------ChhHH Confidence 0 000 000000 0 001112222222111 111111111 111223344554331 23456 Q ss_pred HHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEc--CCc------eEEEEcccc Q lcl|NC_020081. 163 VKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVI--DDK------VVAKFKAKE 234 (552) Q Consensus 163 ~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~--~~~------~~~~~~~~e 234 (552) ...+..+++++|.+|..+-++.+|++ .+..++|..+.++.++..... ..-.++|+... .+. ....++++. T Consensus 137 ~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~p~~~~~v~dd~~~~~-~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~ 214 (511) T protein:vir:96 137 NRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFIIYDNTVERN-SIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG 214 (511) T ss_pred HHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCc-eEEEEEEEEeeeccccccceEEEEEEEeCCc Confidence 67788899999999999999998875 578899999998887643211 11122232211 100 111233333 Q ss_pred eeeeccccc-------------CCc---------cCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC Q lcl|NC_020081. 235 MAWEVSNPR-------------TDL---------TVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTG 292 (552) Q Consensus 235 vi~~~~~~~-------------~~~---------~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ 292 (552) +.++...-. .++ .+...|.|-++.+...++....+..-..+.+...+.|-.++. +. T Consensus 215 i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~--G~ 292 (511) T protein:vir:96 215 VYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIK--GN 292 (511) T ss_pred EEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhhee--cC Confidence 332211100 000 012357888887777777766665555555555556654443 32 Q ss_pred CCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc Q lcl|NC_020081. 293 QEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGA 372 (552) Q Consensus 293 ~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~ 372 (552) ...+.++....+....-......-......-...+.+.+.++.......+....+.+.+.|+..-++|..-.+... + T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~-- 369 (511) T protein:vir:96 293 LNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-G-- 369 (511) T ss_pred ccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-c-- Confidence 2233333322221110000000000000001122344444555555666778888999999999999976443221 1 Q ss_pred cccccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCc--ccc-cceeecccccChHHHHHHHHHHHHHhc Q lcl|NC_020081. 373 TGHSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVS--QFG-GDYVFNFVGGDAKTEAEIISILESKAK 444 (552) Q Consensus 373 ~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~--~~~-~~~~~~f~~~d~~~~~~~~~~~~~~~~ 444 (552) +.++....+. ......+..+...|.-+++.|...+...--. ..+ ..+.+.|.+.-+.+..+.++++... . T Consensus 370 -n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl-~ 447 (511) T protein:vir:96 370 -TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS-G 447 (511) T ss_pred -ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHH-h Confidence 1111111111 1122233344555555555555444332111 111 2467778877777777777666554 4 Q ss_pred CCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCC Q lcl|NC_020081. 445 IGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDG 524 (552) Q Consensus 445 g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (552) |+++..-+.+++++ +++-+ ..+..+...+... .+.... .... +.........++..++ +. T Consensus 448 G~iS~et~l~~l~~--v~d~~--------~El~ri~~E~~~~-~~~~~~--~~~~----~~~~~~~~~~~~~~~~---~~ 507 (511) T protein:vir:96 448 GKISQTTLMSLFSF--FQDPE--------LEVKKIEEDEKES-IKKAQK--GIYK----DPRDINDDEQDDDTKD---TV 507 (511) T ss_pred ccCChHHHHHhCCC--CCCHH--------HHHHHHHHHHHHH-HHHHhh--cccc----CCCCCCCCCCCCCccC---cc Confidence 88998777777643 21100 1111111111000 000000 0000 0000000000000000 00 Q ss_pred cccccc Q lcl|NC_020081. 525 QSKQQA 530 (552) Q Consensus 525 ~~~~~~ 530 (552) .+ ++ T Consensus 508 ~e--~~ 511 (511) T protein:vir:96 508 DK--KE 511 (511) T ss_pred cc--cC Confidence 00 00 No 181 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.49 E-value=2e-07 Score=57.32 Aligned_cols=394 Identities=13% Similarity=0.062 Sum_probs=149.9 Q ss_pred cccccc--cccccccCCcccccccCCCCchHHHHHHHhhcch-HHHHHHH----HHHHH---HHHHHHHHHHhhc----c Q lcl|NC_020081. 49 KAYEEP--IIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKN-IILNAII----ITRVN---QVSMFCTPARNSD----K 114 (552) Q Consensus 49 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~-~i~~a~i----~~~~~---~~~~~~~~~~~~~----~ 114 (552) --+.+| +...+ . . .......-.+.|+.|-++. .+...-+ ..+.. .+.-+++.+.... . T Consensus 1 ~~~~t~~~~~~~l------~-~-~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~ 72 (456) T protein:vir:79 1 MTASTPAEWLPVL------T-K-RIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) T ss_pred CCCCCHHHHHHHH------H-H-HHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhc Confidence 000000 00000 0 0 0000000012222332221 1111000 00000 0111122222221 2 Q ss_pred ccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEe Q lcl|NC_020081. 115 GVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAV 194 (552) Q Consensus 115 ~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l 194 (552) +-|+.+. ..+.. .....+.+++.. | .+..+...+..+.+++|.+|..+-++.+|.+ .+..+ T Consensus 73 ~~g~~~~--~~~d~------~~~~~~~~~~~~-------n---~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~-~i~~~ 133 (456) T protein:vir:79 73 PNGITVG--GSADS------DLALRARRIWRD-------N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITAD 133 (456) T ss_pred cCCeecC--CCCCc------cHHHHHHHHHHh-------c---ChhHHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEe Confidence 2233321 11110 011223344432 1 2335666788999999999999888989987 48889 Q ss_pred cCceeEEEECCCcccccccceeEEEEEcCCce-----------EE----------------E------Ecccceeeeccc Q lcl|NC_020081. 195 DASTVYVAVDEDGKERKAKDGVRYVQVIDDKV-----------VA----------------K------FKAKEMAWEVSN 241 (552) Q Consensus 195 ~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~-----------~~----------------~------~~~~evi~~~~~ 241 (552) +|..+.++.++...... ...++|+...++.. .. . ....++-|.... T Consensus 134 ~p~~~~~i~d~~~~~~~-~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (456) T protein:vir:79 134 SPETMVVSVDPLQPWRI-RSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSP 212 (456) T ss_pred ccceeEEEEcCCCCCce-EEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCc Confidence 99999888765322110 01111111111110 00 0 000011110000 Q ss_pred ccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC-CCCCCHHHHHH--HHHHHHHHhccccccc Q lcl|NC_020081. 242 PRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKT-GQEQSNQALTS--FRREWTSMFSGINGAW 318 (552) Q Consensus 242 ~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~~~~s~~~~~~--~~~~~~~~~~G~~nag 318 (552) +..-......|+|-++.....++....+..-........+.|..++.-.. +....++.-+. ....|... .+ T Consensus 213 ~pvv~~~N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~------~~ 286 (456) T protein:vir:79 213 PPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAA------PG 286 (456) T ss_pred eeEEEecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhh------cc Confidence 00001112457777777766665554443322222222233332221000 00000111011 11222221 12 Q ss_pred cceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHH-------- Q lcl|NC_020081. 319 KIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYR-------- 390 (552) Q Consensus 319 k~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~-------- 390 (552) .+ +..+++.++.++...+. -.+.+..+..+..|+..-++|++.+|.... +.+ ...++.... T Consensus 287 ~~-~~~~~~~~~~q~~~~~~-~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~----N~S-----g~Al~~~~~~l~~k~~~ 355 (456) T protein:vir:79 287 AL-WELPPGVDIWESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSA----NQS-----AEGAHNIEKGFLFKCED 355 (456) T ss_pred cc-ccCCCCcceeeecccCh-HHHHHHHHHHHHHHHhhcCCChhHhccccc----CcH-----HHHHHHHHHHHHHHHHH Confidence 22 33456788776654332 337888999999999999999999974211 111 111111111 Q ss_pred --HHHHHHhhHHHHHHHHHHHhhcCcc-cccceeecccccChHHHHHHHHHHHH-HhcCCcCHHHHHHHhCCCCCCCCCe Q lcl|NC_020081. 391 --NSKDKGLEPLLKFIEDAVNKYIVSQ-FGGDYVFNFVGGDAKTEAEIISILES-KAKIGLTINDIRKELGYPDTEGGDV 466 (552) Q Consensus 391 --~~~~~~l~P~~~~ie~~ln~~L~~~-~~~~~~~~f~~~d~~~~~~~~~~~~~-~~~g~lT~NE~R~~~gl~p~~ggD~ 466 (552) ..+...|+-+++.+.. +... ....+.+.|.+..+.+.++.++++.+ ...|+++..-+++.+|+.+-+ + T Consensus 356 ~~~~f~~~l~~~~~l~~~-----~~g~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~~---i 427 (456) T protein:vir:79 356 RLSIAKIGLEAILVKALQ-----IEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQ---I 427 (456) T ss_pred HHHHHHHHHHHHHHHHHH-----hcCCCccccceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHHH---H Confidence 1222222222222211 1111 11245666766666666666665443 345788877777888886521 0 Q ss_pred eeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCC Q lcl|NC_020081. 467 TLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVN 511 (552) Q Consensus 467 ~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (552) . -........+..+. ..+....++++... T Consensus 428 ~-----~~e~~r~~~e~~~~-----------~~~~~~~~~~~~~~ 456 (456) T protein:vir:79 428 K-----QDDLDRAREQITLF-----------AGNPVQRPQEDGSR 456 (456) T ss_pred H-----HHHHHHHHHHHHHH-----------hhhHhhcCCCCCCC Confidence 0 00000000000000 00000000000000 No 182 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=98.49 E-value=4.6e-07 Score=55.35 Aligned_cols=406 Identities=13% Similarity=0.090 Sum_probs=144.5 Q ss_pred CCCCchHHH--------------HHHHhhcchH-HHHHHHHH----HH-HHHHHHHHHHHhhcc----ccceeeeecccc Q lcl|NC_020081. 71 IHGKQNLLQ--------------MLKLWSRKNI-ILNAIIIT----RV-NQVSMFCTPARNSDK----GVGYEIRLKDPL 126 (552) Q Consensus 71 ~~~~~~~~~--------------~Lr~~a~~~~-i~~a~i~~----~~-~~~~~~~~~~~~~~~----~~~~~i~~k~~~ 126 (552) +.+.....+ .|..+-++-- +...-+.+ +. ..+.-+++.+..... ..||.+ .+ T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~--~~-- 76 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI--SE-- 76 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhccCceec--CC-- Confidence 111111111 1122211100 00000000 00 001111222221111 122211 11 Q ss_pred ccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEE------CCCCCEEEEEEecCceeE Q lcl|NC_020081. 127 QEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVY------DKLGDLHNFKAVDASTVY 200 (552) Q Consensus 127 ~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r------~~~G~~~~L~~l~p~~v~ 200 (552) +....+.+.+++.. | .+......+..+.+++|.+|..+-+ +..|.+ .+.+++|..|. T Consensus 77 ------d~~~~~~l~~i~~~-------N---~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~-~i~~~~p~~~~ 139 (480) T protein:vir:78 77 ------DSEGLEELWNWWQA-------N---DLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIP-LIRVESPLYMY 139 (480) T ss_pred ------CchhHHHHHHHHHh-------c---CHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCee-EEEEEcccceE Confidence 11122334455432 1 2345677788999999999988876 335554 47789999999 Q ss_pred EEECCC--cccccccceeEEEEEcCCc-e---EEEEcccce-----------------------------eeecccccCC Q lcl|NC_020081. 201 VAVDED--GKERKAKDGVRYVQVIDDK-V---VAKFKAKEM-----------------------------AWEVSNPRTD 245 (552) Q Consensus 201 v~~~~~--g~~~~~~~~~~y~~~~~~~-~---~~~~~~~ev-----------------------------i~~~~~~~~~ 245 (552) ++.++. +.... .++|+...++. . ...+.++.+ ++++.++ T Consensus 140 ~~~D~~~~~~~~~---~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~--- 213 (480) T protein:vir:78 140 AELDPRNTRRVTR---AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDP--- 213 (480) T ss_pred EEEcCCCccceEE---EEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeeccc--- Confidence 888753 22211 11111111110 0 011111111 2222111 Q ss_pred ccCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec Q lcl|NC_020081. 246 LTVGKYGYPELEI-ALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT 324 (552) Q Consensus 246 ~~~g~~G~spl~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~ 324 (552) ...+++|.|-++- +...++....+..-......-.+.|.-+|. + ....+...+.-...|.. . .+++..+. T Consensus 214 ~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~--G-~~~~~~~~~~~~~~~~~-~-----~~~~~~~~ 284 (480) T protein:vir:78 214 RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--G-VTTDELTNDGENTTLDI-Y-----YGRILTLA 284 (480) T ss_pred ccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhh--c-CCccccccccccchhhh-h-----hhhhccCC Confidence 2234688887653 444444444333333333333345544442 2 21121111111111221 1 12223344 Q ss_pred cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccch--hH---HHHHHHHHHHHhhH Q lcl|NC_020081. 325 AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEG--SS---AEKYRNSKDKGLEP 399 (552) Q Consensus 325 ~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~--n~---e~~~~~~~~~~l~P 399 (552) ++++++..+..... -.+++..+..+..|+..=++|+..+|.......+ +....+. .. -+..+..+...|.- T Consensus 285 ~~~~~~~~~~~~~~-~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~S---g~Alk~~~~~l~~ka~~~~~~f~~~l~~ 360 (480) T protein:vir:78 285 SEAAKISEFKAAEL-RNFAEEMEVFRKEAASITGLPPQYLSSSSENPAS---AEAIIATDSRIVKMAERKGRIFGGAWER 360 (480) T ss_pred CCCceEEecCccCH-HHHHHHHHHHHHHHhcccCCChHHhccccCcchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55677776665332 2367778888899999999999999753211110 0000000 00 00111111122222 Q ss_pred HHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHH-HHHHhc--CCcCHHHHHHHhCCCCCCCCCeeeccccccch Q lcl|NC_020081. 400 LLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISI-LESKAK--IGLTINDIRKELGYPDTEGGDVTLAGVHVQRL 476 (552) Q Consensus 400 ~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~-~~~~~~--g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~ 476 (552) +++.+....... .......+.+.|....+.+..+.+.. .+.+.+ |.++..-+++.+|+.+-+- ..+ T Consensus 361 ~~~l~~~~~g~~-~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~----------~~~ 429 (480) T protein:vir:78 361 AMRIAMQIMGRE-VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR----------EQM 429 (480) T ss_pred HHHHHHHHcCCC-ccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHH----------HHH Confidence 222221111100 00111234556644433333333322 222333 3577777788888754210 001 Q ss_pred hhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCccccccccccccccCc Q lcl|NC_020081. 477 GQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQGGK 540 (552) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 540 (552) ......+... ..........+. + +..++...++ +..+.++..++....++. T Consensus 430 ~~~~~e~~~~-~~~~~~~~~~~~--~-~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 430 RDWDKQETED-MIDTLYSTTKAQ--A-DATPKPTVTE---------TKTETQTSPSGFNRTKTR 480 (480) T ss_pred HHHHHHHHHH-HHHHhhcccccc--C-CCCCCCCCCC---------CCCccccccCCCCcccCC Confidence 1100000000 000000000000 0 0000000000 000001111111111111 No 183 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.47 E-value=5.4e-07 Score=54.96 Aligned_cols=399 Identities=14% Similarity=0.071 Sum_probs=153.7 Q ss_pred cccccc--cccccccCCcccccccCCCCchHHHHHHHhhcch-HHHHHHHH----HHH--H-HHHHHHHHHHhhcc---- Q lcl|NC_020081. 49 KAYEEP--IIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKN-IILNAIII----TRV--N-QVSMFCTPARNSDK---- 114 (552) Q Consensus 49 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~-~i~~a~i~----~~~--~-~~~~~~~~~~~~~~---- 114 (552) -.+.+| +...+ .. -......-.+.|..+-++. .+...-.. .+. . .+.-+++.+..... T Consensus 1 ~~~~t~~~~~~~l-------~~-~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~ 72 (456) T protein:vir:10 1 MTASTPAEWLPVL-------TK-RIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) T ss_pred CCCCCHHHHHHHH-------HH-HHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhc Confidence 000000 00000 00 0000000112333333322 11100000 000 0 11112333332222 Q ss_pred ccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEe Q lcl|NC_020081. 115 GVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAV 194 (552) Q Consensus 115 ~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l 194 (552) +-||.+ ...+.. .....+.+++.. | ....+...+..+.+++|.+|..+.++..|.+. +..+ T Consensus 73 ~~~~~~--~~~~d~------~~~~~~~~i~~~-------N---~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~-i~~~ 133 (456) T protein:vir:10 73 PNGITV--GGSADS------DLALRARRIWRD-------N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTAT-ITAD 133 (456) T ss_pred cCCeec--CCCCCc------chHHHHHHHHHh-------c---ChhhHHHHHHHHHhhcCeeEEEEeeCCCCceE-EEEE Confidence 223332 111110 011223444432 1 23345667889999999999999898888764 6788 Q ss_pred cCceeEEEECCCcccccccceeEEEEEcCCceE---------------------------EEEcccce-----eeeccc- Q lcl|NC_020081. 195 DASTVYVAVDEDGKERKAKDGVRYVQVIDDKVV---------------------------AKFKAKEM-----AWEVSN- 241 (552) Q Consensus 195 ~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~---------------------------~~~~~~ev-----i~~~~~- 241 (552) +|..+.++.++....... ..++|+...++... ........ .|+... T Consensus 134 ~p~~~~~i~d~~~~~~~~-~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (456) T protein:vir:10 134 SPETMVVSVDPLQPWRIR-AAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSP 212 (456) T ss_pred ccceeEEEEcCCCCcceE-EEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCc Confidence 999998887754322111 11112211111110 00000000 011000 Q ss_pred ccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC-CCCCCCHHHHHH--HHHHHHHHhccccccc Q lcl|NC_020081. 242 PRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIK-TGQEQSNQALTS--FRREWTSMFSGINGAW 318 (552) Q Consensus 242 ~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~s~~~~~~--~~~~~~~~~~G~~nag 318 (552) +.........|+|.++.....++....+..-........+.|.-++.-. .+....++.-.. ....|... .+ T Consensus 213 ~pvv~~~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~------~~ 286 (456) T protein:vir:10 213 PPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAA------PG 286 (456) T ss_pred eeEEEecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhh------cc Confidence 0000111246888888877777766655443333333333333333210 000000111111 11223221 12 Q ss_pred cceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchh-----HHHHHHHHH Q lcl|NC_020081. 319 KIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGS-----SAEKYRNSK 393 (552) Q Consensus 319 k~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n-----~e~~~~~~~ 393 (552) ++ ...+.+.++..+..... -.|++..+.++.+|++.-++|++.+|.... +.++....+.. --+..+..+ T Consensus 287 ~~-~~~~~~~~~~q~~~~~~-~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~----N~Sg~Ai~~~~~~l~~k~~~~~~~f 360 (456) T protein:vir:10 287 AL-WELPPGVDIWESQANDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSA----NQSAEGAHNIEKGFLFKCEDRLSIA 360 (456) T ss_pred cc-ccCCCCcceEEecccCh-hHHHHHHHHHHHHHHhccCCChHHhccccc----ChHHHHHHHHHHHHHHHHHHHHHHH Confidence 22 23356788776654322 347888999999999999999999974211 11111111100 001111111 Q ss_pred HHHhhHHHHHHHHHHHhhcCcc-cccceeecccccChHHHHHHHHHHHH-HhcCCcCHHHHHHHhCCCCCCCCCeeeccc Q lcl|NC_020081. 394 DKGLEPLLKFIEDAVNKYIVSQ-FGGDYVFNFVGGDAKTEAEIISILES-KAKIGLTINDIRKELGYPDTEGGDVTLAGV 471 (552) Q Consensus 394 ~~~l~P~~~~ie~~ln~~L~~~-~~~~~~~~f~~~d~~~~~~~~~~~~~-~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~ 471 (552) ...|+-+++.+.. +... ....+.+.|....+.+.++.++++.+ ...|+++..-+++++|+.+-+ +. T Consensus 361 ~~~l~~~~rl~~~-----~~g~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~---i~---- 428 (456) T protein:vir:10 361 KIGLEAILVKALQ-----IEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQ---IK---- 428 (456) T ss_pred HHHHHHHHHHHHH-----hcCCCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHH---HH---- Confidence 2222222221110 1111 11245666766666666666665443 345788887788888875421 00 Q ss_pred cccchhhhccccccccccCCCCCccCcccCCCCCCC Q lcl|NC_020081. 472 HVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNM 507 (552) Q Consensus 472 n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (552) -..+.....+... ...++... +..+++- T Consensus 429 -~~e~er~~~e~~~------~~~~~~~~-~~~~~~~ 456 (456) T protein:vir:10 429 -QDDLDRAREQITL------FAGNPVQR-PQEDGSR 456 (456) T ss_pred -HHHHHHHHHHHHH------Hhhhhhhc-CCCCCCC Confidence 0001111111000 00000000 0000000 No 184 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.47 E-value=5.4e-07 Score=54.96 Aligned_cols=399 Identities=14% Similarity=0.071 Sum_probs=153.7 Q ss_pred cccccc--cccccccCCcccccccCCCCchHHHHHHHhhcch-HHHHHHHH----HHH--H-HHHHHHHHHHhhcc---- Q lcl|NC_020081. 49 KAYEEP--IIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKN-IILNAIII----TRV--N-QVSMFCTPARNSDK---- 114 (552) Q Consensus 49 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~-~i~~a~i~----~~~--~-~~~~~~~~~~~~~~---- 114 (552) -.+.+| +...+ .. -......-.+.|..+-++. .+...-.. .+. . .+.-+++.+..... T Consensus 1 ~~~~t~~~~~~~l-------~~-~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~ 72 (456) T protein:vir:10 1 MTASTPAEWLPVL-------TK-RIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) T ss_pred CCCCCHHHHHHHH-------HH-HHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhc Confidence 000000 00000 00 0000000112333333322 11100000 000 0 11112333332222 Q ss_pred ccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEe Q lcl|NC_020081. 115 GVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAV 194 (552) Q Consensus 115 ~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l 194 (552) +-||.+ ...+.. .....+.+++.. | ....+...+..+.+++|.+|..+.++..|.+. +..+ T Consensus 73 ~~~~~~--~~~~d~------~~~~~~~~i~~~-------N---~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~-i~~~ 133 (456) T protein:vir:10 73 PNGITV--GGSADS------DLALRARRIWRD-------N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTAT-ITAD 133 (456) T ss_pred cCCeec--CCCCCc------chHHHHHHHHHh-------c---ChhhHHHHHHHHHhhcCeeEEEEeeCCCCceE-EEEE Confidence 223332 111110 011223444432 1 23345667889999999999999898888764 6788 Q ss_pred cCceeEEEECCCcccccccceeEEEEEcCCceE---------------------------EEEcccce-----eeeccc- Q lcl|NC_020081. 195 DASTVYVAVDEDGKERKAKDGVRYVQVIDDKVV---------------------------AKFKAKEM-----AWEVSN- 241 (552) Q Consensus 195 ~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~---------------------------~~~~~~ev-----i~~~~~- 241 (552) +|..+.++.++....... ..++|+...++... ........ .|+... T Consensus 134 ~p~~~~~i~d~~~~~~~~-~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (456) T protein:vir:10 134 SPETMVVSVDPLQPWRIR-AAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSP 212 (456) T ss_pred ccceeEEEEcCCCCcceE-EEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCc Confidence 999998887754322111 11112211111110 00000000 011000 Q ss_pred ccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC-CCCCCCHHHHHH--HHHHHHHHhccccccc Q lcl|NC_020081. 242 PRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIK-TGQEQSNQALTS--FRREWTSMFSGINGAW 318 (552) Q Consensus 242 ~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~s~~~~~~--~~~~~~~~~~G~~nag 318 (552) +.........|+|.++.....++....+..-........+.|.-++.-. .+....++.-.. ....|... .+ T Consensus 213 ~pvv~~~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~------~~ 286 (456) T protein:vir:10 213 PPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAA------PG 286 (456) T ss_pred eeEEEecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhh------cc Confidence 0000111246888888877777766655443333333333333333210 000000111111 11223221 12 Q ss_pred cceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchh-----HHHHHHHHH Q lcl|NC_020081. 319 KIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGS-----SAEKYRNSK 393 (552) Q Consensus 319 k~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n-----~e~~~~~~~ 393 (552) ++ ...+.+.++..+..... -.|++..+.++.+|++.-++|++.+|.... +.++....+.. --+..+..+ T Consensus 287 ~~-~~~~~~~~~~q~~~~~~-~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~----N~Sg~Ai~~~~~~l~~k~~~~~~~f 360 (456) T protein:vir:10 287 AL-WELPPGVDIWESQANDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSA----NQSAEGAHNIEKGFLFKCEDRLSIA 360 (456) T ss_pred cc-ccCCCCcceEEecccCh-hHHHHHHHHHHHHHHhccCCChHHhccccc----ChHHHHHHHHHHHHHHHHHHHHHHH Confidence 22 23356788776654322 347888999999999999999999974211 11111111100 001111111 Q ss_pred HHHhhHHHHHHHHHHHhhcCcc-cccceeecccccChHHHHHHHHHHHH-HhcCCcCHHHHHHHhCCCCCCCCCeeeccc Q lcl|NC_020081. 394 DKGLEPLLKFIEDAVNKYIVSQ-FGGDYVFNFVGGDAKTEAEIISILES-KAKIGLTINDIRKELGYPDTEGGDVTLAGV 471 (552) Q Consensus 394 ~~~l~P~~~~ie~~ln~~L~~~-~~~~~~~~f~~~d~~~~~~~~~~~~~-~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~ 471 (552) ...|+-+++.+.. +... ....+.+.|....+.+.++.++++.+ ...|+++..-+++++|+.+-+ +. T Consensus 361 ~~~l~~~~rl~~~-----~~g~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~---i~---- 428 (456) T protein:vir:10 361 KIGLEAILVKALQ-----IEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQ---IK---- 428 (456) T ss_pred HHHHHHHHHHHHH-----hcCCCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHH---HH---- Confidence 2222222221110 1111 11245666766666666666665443 345788887788888875421 00 Q ss_pred cccchhhhccccccccccCCCCCccCcccCCCCCCC Q lcl|NC_020081. 472 HVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNM 507 (552) Q Consensus 472 n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (552) -..+.....+... ...++... +..+++- T Consensus 429 -~~e~er~~~e~~~------~~~~~~~~-~~~~~~~ 456 (456) T protein:vir:10 429 -QDDLDRAREQITL------FAGNPVQR-PQEDGSR 456 (456) T ss_pred -HHHHHHHHHHHHH------Hhhhhhhc-CCCCCCC Confidence 0001111111000 00000000 0000000 No 185 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=98.46 E-value=5.5e-07 Score=54.88 Aligned_cols=408 Identities=11% Similarity=0.025 Sum_probs=164.7 Q ss_pred hccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhc--- Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSD--- 113 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~--- 113 (552) |. +.......+.+... ...++..++.-....+.++-++.++.. T Consensus 1 ~~--------------------------------~~~~~~~~~~~~~~--~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~ 46 (499) T protein:vir:10 1 MA--------------------------------VVIDKDLLDDVNEP--NIEAINYAIRELQNRKKRLDKLSDYYNGKQ 46 (499) T ss_pred Cc--------------------------------cchhhhHHhhhhcC--CHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 00 00011111111100 011112222211112211111111111 Q ss_pred -----------------------------cc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHH Q lcl|NC_020081. 114 -----------------------------KG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSF 162 (552) Q Consensus 114 -----------------------------~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f 162 (552) ++ +|-.+.+.. .+......+.+++.. -.+..+ T Consensus 47 ~i~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~-------~~~~~~~~l~~~~~~----------n~~~~~ 109 (499) T protein:vir:10 47 EIEKHEFDNATVEAANVMVNHAKYITDMNVGFMTGNPVKYVA-------EKGKNIDDILEVFNQ----------IDIHKH 109 (499) T ss_pred chhcCCcCcCCCCcceeecchHHHHHHHHhhhhcccCceeec-------CChhHHHHHHHHHhh----------cCHhHH Confidence 11 111111111 111122334444432 134456 Q ss_pred HHHHHHHHHhcCCeeEEEEECCCCCE----------------EEEEEecCceeEEEECCCcccccccceeEEEEEcC--C Q lcl|NC_020081. 163 VKKLVRDRLTYDKINFELVYDKLGDL----------------HNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVID--D 224 (552) Q Consensus 163 ~~~~v~d~ll~Gna~~~i~r~~~G~~----------------~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~--~ 224 (552) ...+..+.+.+|.+|.++..+.+|.+ ..+..++|..+.++.++.+.... ...++|+...+ + T Consensus 110 ~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~-~~~i~~~~~~~~~~ 188 (499) T protein:vir:10 110 DIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDP-LFAVFTQEKKDLEG 188 (499) T ss_pred HHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEEEEEcccceEEEecCCCCcce-EEEEEEEEEeecCC Confidence 77888999999999999988887753 34677888888777665432111 11222222211 1 Q ss_pred c----eEEEEcccceeeecc-----------------cccCC-----ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020081. 225 K----VVAKFKAKEMAWEVS-----------------NPRTD-----LTVGKYGYPELEIALNHLQYHDNTEVFNARFFA 278 (552) Q Consensus 225 ~----~~~~~~~~evi~~~~-----------------~~~~~-----~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ 278 (552) . ....++++.+.++.. |+... -.+...|.|-++.+...++....+.....+.+. T Consensus 189 ~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~ 268 (499) T protein:vir:10 189 NTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYDGENLFGAVPIIEFRNNEERQGDFEQLISLIDAYNLLQTDRISDKE 268 (499) T ss_pred CceEEEEEEEeCCeEEEEEecCCccccCcceecccccCCCCccceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHH Confidence 1 111223333222210 00000 001235777777777777776666666666666 Q ss_pred ccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceee-ccCCceeeeccCchhHHHHHHHHHHHHHHHHHHh Q lcl|NC_020081. 279 QGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVI-TAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIY 357 (552) Q Consensus 279 ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il-~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~f 357 (552) ..+.|-.++. +. ..++.. .....+ ..+++..+ .+++.+++.+........+....+.+.+.|...- T Consensus 269 ~~~~~~lv~~--G~-~~~~~~--~~~~~~--------~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s 335 (499) T protein:vir:10 269 AFVDALLVTF--GF-GLGDDK--DDIQRL--------KRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKIS 335 (499) T ss_pred HhcCceeeee--cC-cccccc--chhhhh--------hhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHh Confidence 6667765554 32 222211 011111 11122222 2345555555555555666777888889998888 Q ss_pred cCCHHHhcccccccccccccccccc--h---hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecccccChHH Q lcl|NC_020081. 358 SIDPSEINFPNRGGATGHSGNTLNE--G---SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNFVGGDAKT 431 (552) Q Consensus 358 gVPp~~lg~~~~~t~~~~~~~~~~~--~---n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~d~~~ 431 (552) ++|..-.+.. .++.++....+ . .-....+..+..+|.-+++.+...++..- ...+ ..+.+.|.+.-+.+ T Consensus 336 ~~p~~~~~~~----~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~-~~~d~~~i~i~f~~~~p~n 410 (499) T protein:vir:10 336 YVPNMNDEKF----MGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKG-ANDDASGCKISLVANIPSN 410 (499) T ss_pred CcccCCchhh----cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CccccccceEEeCCCCCCC Confidence 8874221100 00111111111 0 11222334444555555555554443221 1111 24567777776777 Q ss_pred HHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhcccccc------ccccCCCCCccCcccCCCCC Q lcl|NC_020081. 432 EAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQV------EYQRQMDANQFLAQQTGYDG 505 (552) Q Consensus 432 ~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~ 505 (552) ..+.++.+.+. .|+++.--++++++.-+ +-+ ..+..+.+.+.. .......+...... +... T Consensus 411 ~~e~~~~~~kl-~g~iS~et~~~~l~~v~--d~~--------~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~--~~~~ 477 (499) T protein:vir:10 411 LSDVVNNVKNA-DGIIPRKYTYSWLPDVD--NPQ--------DVIDEMNQQDAETIKKNQEALRGQDPDRLELE--DKQD 477 (499) T ss_pred HHHHHHHHHHH-hccCChHHHHHhCCCCC--CHH--------HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCC--CCCc Confidence 77777766554 68899988888765421 101 011111111000 00000000000000 0000 Q ss_pred CCCCCCCCCCcccccCCCCccccc Q lcl|NC_020081. 506 NMDNVNGKDSFNQNVGKDGQSKQQ 529 (552) Q Consensus 506 ~~~~~~~~~~~~~~~~~~~~~~~~ 529 (552) .+..+++++ .+...++++-+.. T Consensus 478 ~~~~~~~~~--~~~~~~~~~~~~~ 499 (499) T protein:vir:10 478 DSSENDKEA--GSNHNQSHRTRAV 499 (499) T ss_pred ccCCCCCCC--ccccccCCCCCCC Confidence 000010000 0111112222211 No 186 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=98.46 E-value=5.7e-07 Score=54.83 Aligned_cols=408 Identities=12% Similarity=0.115 Sum_probs=167.8 Q ss_pred chhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHH------ Q lcl|NC_020081. 32 EEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMF------ 105 (552) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~------ 105 (552) +-..+++ ++--..+..|+.-. -++--|+..... ..+.+.|+.+. .-....+.++ T Consensus 1 ~~~~~~~------~~~~~~~~~~~~~~-~~~~~~~~~~~~---e~~~~~i~~~i----------~~~~~~~~r~~~l~~Y 60 (483) T protein:vir:12 1 MAQALIK------GGNILYPSQPTQTE-IFDAIVRTNNKP---ETLEEMIVRYI----------KQHLEKLPEISIGQEY 60 (483) T ss_pred Cccchhc------CCceeecCcchhhh-hhhcccccCCch---hhHHHHHHHHH----------HHHHHHHHHHHHHHHH Confidence 2222221 12111111121111 011111111111 11122222211 1111111111 Q ss_pred ---------------------------------HHHHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCCC Q lcl|NC_020081. 106 ---------------------------------CTPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRI 150 (552) Q Consensus 106 ---------------------------------~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~ 150 (552) ++.+....++ +|-.+.+.. .+......+..|+. T Consensus 61 Y~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~~-------~d~~~~~~l~~~~~----- 128 (483) T protein:vir:12 61 YEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH-------TDDEVVKRIDEVLG----- 128 (483) T ss_pred hccccccccccccccccccccccccccccccchHHHHHHHHhhhhcccCceecc-------CChHHHHHHHHHHh----- Confidence 1111111111 111111111 11112233334332 Q ss_pred CCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCC--cccccccceeEEEEEcCCceEE Q lcl|NC_020081. 151 DNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDED--GKERKAKDGVRYVQVIDDKVVA 228 (552) Q Consensus 151 ~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~y~~~~~~~~~~ 228 (552) | ........+..+.+.+|.+|..+-.|.+|+|. +..++|..+.++.++. +... -.++|+......... T Consensus 129 ---n---~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~-i~~~~p~~~~~v~d~~~~~~~~---~~ir~~~~~~~~~~~ 198 (483) T protein:vir:12 129 ---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK-LFRVPAEQGIPIWTDKEHEELE---AFIRMYKLENETKVE 198 (483) T ss_pred ---c---cHHHHHHHHHHHHhhCCeEEEEEEEcCCCceE-EEEEcccceEEEEcCCCCCceE---EEEEEEEeecceEEE Confidence 1 23345566788999999999999999998864 8889999998887643 2211 122333222222222 Q ss_pred EEcccceeeecc----------------------cccC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_020081. 229 KFKAKEMAWEVS----------------------NPRT-----DLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGG 281 (552) Q Consensus 229 ~~~~~evi~~~~----------------------~~~~-----~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~ 281 (552) .+.+..+.++.. |+-. .-.+...|.|-++.+...++....+..-..+.+...+ T Consensus 199 ~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~ 278 (483) T protein:vir:12 199 YWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 278 (483) T ss_pred EEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhc Confidence 222222222110 0000 0001235778888777777777766666666666667 Q ss_pred CCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCH Q lcl|NC_020081. 282 TTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDP 361 (552) Q Consensus 282 ~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp 361 (552) .|..++. +. . .+....++..+. .+++ +..+++.+...+.....+..+....+.+.+.|+..-++|. T Consensus 279 ~~~lv~~--g~-~--~~~~~~~~~~~~--------~~~~-~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~ 344 (483) T protein:vir:12 279 ELTYVLT--NY-D--DQELPEFKRLLR--------YYGA-IKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVD 344 (483) T ss_pred Cceeeee--cC-C--cccchhHHHhhh--------hccc-cccCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCC Confidence 7765553 32 1 112222222221 1111 2222333444444445556777888899999999999986 Q ss_pred HHhcccccccccccccccccc--h---hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHH Q lcl|NC_020081. 362 SEINFPNRGGATGHSGNTLNE--G---SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEII 436 (552) Q Consensus 362 ~~lg~~~~~t~~~~~~~~~~~--~---n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~ 436 (552) .-.+-.. + +.++....+ . ..-...+..+...|+-+++.|...++.+ ....++.+.|.+..+.+..+.+ T Consensus 345 ~~~~~~~-~---n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~~---~~~~~i~v~f~~~~p~~~~~~a 417 (483) T protein:vir:12 345 FSSDKFG-S---APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK---GEHKDVDISFNYNKVANTELQV 417 (483) T ss_pred CCccccc-c---CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---CccceeeEEeCCCCCCCHHHHH Confidence 4332110 0 111111100 0 1112233344455555555554433321 1123566778777777777777 Q ss_pred HHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCc Q lcl|NC_020081. 437 SILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSF 516 (552) Q Consensus 437 ~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (552) +++... .|++|..-+.++++.-+-+ + ..+..+.+.+.. ....... ... .+.+... T Consensus 418 ~~~~kl-~GiiS~et~~~~~~~v~d~--~--------~E~~ri~~E~~~-~~~~~~~---~~~-~~~d~~~--------- 472 (483) T protein:vir:12 418 QTAQQS-MGIVSHETVLENHPFVEDL--Q--------AELERIEQEQME-YNKQLPN---LDD-GGADGAQ--------- 472 (483) T ss_pred HHHHHH-hccCchHHHHHhCCCCCCH--H--------HHHHHHHHHHHH-HHhhccc---ccc-cccCCcc--------- Confidence 666554 5899988888877652210 0 111111111100 0000000 000 0000000 Q ss_pred ccccCCCCcccccc Q lcl|NC_020081. 517 NQNVGKDGQSKQQA 530 (552) Q Consensus 517 ~~~~~~~~~~~~~~ 530 (552) ... ++.+++++ T Consensus 473 --~~~-~~~~~e~e 483 (483) T protein:vir:12 473 --QQE-RSNNKESE 483 (483) T ss_pred --cCC-CCCcccCC Confidence 000 00011111 No 187 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=98.45 E-value=5.9e-07 Score=54.72 Aligned_cols=462 Identities=11% Similarity=0.043 Sum_probs=170.3 Q ss_pred ccCcccccccccchhhhhccccc-----cccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHH Q lcl|NC_020081. 20 INDDMAVRIKQIEEDAILKKGKN-----TKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAI 94 (552) Q Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~ 94 (552) .+...-.|..-..++.+...+.+ .+.+-... .+...+.-.. ..++........ .....+.-+ ++=|+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~YY~g~h~I--l~r~~~~~~~~~-~~~~d~~~~-nnki~~nf 75 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHIKWAHIG-ENYYNQENDI--EKSRIFYMNDKG-QLREDNYAS-NVKISHGF 75 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcccchh--hhcccccccccc-ccccccccc-ccccccch Confidence 11111111111112222111100 00000000 0000000000 000000000000 000000000 00000000 Q ss_pred HHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcC Q lcl|NC_020081. 95 IITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYD 174 (552) Q Consensus 95 i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~G 174 (552) ....++...-| - +|-.+.+.-.+. ...+....+..++. ..+......+..++..+| T Consensus 76 ~k~Ivd~~~~y-------l--~G~Pv~~~~~d~----~~~e~~~~l~~~~~-----------~~~~~~~~el~~~~s~~G 131 (537) T protein:vir:78 76 FTELVDQLAQY-------L--LSNGVEVKVKDE----DNTQLDEILQEYFD-----------EDFQATIDTLVTNASKKG 131 (537) T ss_pred HHHHHHHHhhh-------h--cccCceeecCcc----hhHHHHHHHHHHhh-----------ccHHHHHHHHHHHHhhcC Confidence 01111111111 0 122222221111 11112222333321 123345667788999999 Q ss_pred CeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCC-----c---eEEEEcccceeeeccccc--- Q lcl|NC_020081. 175 KINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDD-----K---VVAKFKAKEMAWEVSNPR--- 243 (552) Q Consensus 175 na~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~-----~---~~~~~~~~evi~~~~~~~--- 243 (552) .+|.++-++.+|.+. +..++|..+.++.++.+.... ....++...... . ....++++.+.+.+.... T Consensus 132 ~ay~~~y~de~~~~~-~~~i~p~~~~pv~d~~~~~~~-~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~ 209 (537) T protein:vir:78 132 FEGIFARTTSEGKLK-FQTVDGLTLIPVFDDYGVLKM-IIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVS 209 (537) T ss_pred eeEEEeeecCCCceE-EEEEccceeEEEEcCCCCcee-EEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCccc Confidence 999999999998764 788999999988877654321 111111000000 0 112233333332211000 Q ss_pred -------------------------------------CCc---------cCCcccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 244 -------------------------------------TDL---------TVGKYGYPELEIALNHLQYHDNTEVFNARFF 277 (552) Q Consensus 244 -------------------------------------~~~---------~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f 277 (552) .++ ....+|.|-++.+...++....+....++.+ T Consensus 210 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~ 289 (537) T protein:vir:78 210 TTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNL 289 (537) T ss_pred ccccccccccccccceeeeccccccccccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHH Confidence 000 0113577888888888887777777777777 Q ss_pred hccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHh Q lcl|NC_020081. 278 AQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIY 357 (552) Q Consensus 278 ~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~f 357 (552) ...+.|-.+ +.+. ...+ ...++..+.. . ++..+.+++.++.-+.....+.......+.+.+.|...- T Consensus 290 ~~~~~~ilv--i~g~-~~~~--~~~~~~~l~~-------~-~~i~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s 356 (537) T protein:vir:78 290 QDFSEAIYV--VKGF-SGDS--TDKLRQNIKA-------K-KMIGVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSG 356 (537) T ss_pred HHhcCceee--eecC-CCcc--chhHHHHHhh-------c-CceeecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhc Confidence 665555444 4342 2221 1123332221 1 122333333333333444444444566677777777654 Q ss_pred cCCHHHhcccccccccccccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecccccChHH Q lcl|NC_020081. 358 SIDPSEINFPNRGGATGHSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNFVGGDAKT 431 (552) Q Consensus 358 gVPp~~lg~~~~~t~~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~d~~~ 431 (552) .+|.. ... .+++.++....+. .-....+..+...|+-+++.|...++.+-...++ ..+.+.|.+.-+.+ T Consensus 357 ~~~~~--~~~---~~gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n 431 (537) T protein:vir:78 357 MGFNS--TAV---GDGNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGEYDSNDICFEIEPHVLAN 431 (537) T ss_pred CCCCC--ccc---cccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccceeeEEeccCCCCC Confidence 44431 211 1122222111111 1122334445556666666666555443222222 34677787777777 Q ss_pred HHHHHHHHHH-HhcCCcCHHHHHHHhCCCCCCCCCeeec---cccccchhhhccccccccccCCCCCccC---cccCCCC Q lcl|NC_020081. 432 EAEIISILES-KAKIGLTINDIRKELGYPDTEGGDVTLA---GVHVQRLGQIMQQEQVEYQRQMDANQFL---AQQTGYD 504 (552) Q Consensus 432 ~~~~~~~~~~-~~~g~lT~NE~R~~~gl~p~~ggD~~~~---~~n~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~ 504 (552) ..+.++++.. ...|++|..-+.+.+++-.-+.-..... .............+..+........++. .+..+.+ T Consensus 432 ~~e~a~~~~~l~~~giiS~eT~l~~~p~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (537) T protein:vir:78 432 ELDIATTRKTEAETEALKIGNIMTVAPRIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQ 511 (537) T ss_pred HHHHHHHHHHHHhcCcchHHHHHHhCCCCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCC Confidence 7777666554 3468899888887765422110000000 0000000000000000000000000000 0001111 Q ss_pred CCCCCCCCCCCcccccCCCCcccccccc Q lcl|NC_020081. 505 GNMDNVNGKDSFNQNVGKDGQSKQQANT 532 (552) Q Consensus 505 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (552) ++.+..++.++++-...+|++. ...| T Consensus 512 ~~~d~~~~~~~~~~~~~~~~~~--~~~~ 537 (537) T protein:vir:78 512 PPVDPNQPVADPNVVPPTDPNA--VPQT 537 (537) T ss_pred CCCCccCCCCCCCCCCCCCCcc--CCCC Confidence 1112222222222222233221 1111 No 188 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=98.41 E-value=7.7e-07 Score=54.10 Aligned_cols=395 Identities=11% Similarity=0.054 Sum_probs=160.6 Q ss_pred hhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhc-chHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020081. 33 EDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSR-KNIILNAIIITRVNQVSMFCTPARN 111 (552) Q Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~-~~~i~~a~i~~~~~~~~~~~~~~~~ 111 (552) =..|+..+.+.... -.++.++.-++ ...++..++......+.++.+..++ T Consensus 1 ~~~~~~~~~~~~~~-----------------------------e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~y 51 (478) T protein:vir:10 1 MISINWPWDKPYHE-----------------------------QVVEQIKPKYETQEEMILRLVREHKENIDNITMGERY 51 (478) T ss_pred CccccCCCCchhHH-----------------------------HHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01111111111000 00011111000 0112222222222222222111111 Q ss_pred h---------------------------------------cccc--ceeeeeccccccCChhHHHHHHHHHHHHHhcCCC Q lcl|NC_020081. 112 S---------------------------------------DKGV--GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRI 150 (552) Q Consensus 112 ~---------------------------------------~~~~--~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~ 150 (552) . .+++ |-.+.+.- .+.+....+.+++. T Consensus 52 Y~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~-------~~d~~~~~l~~~~~----- 119 (478) T protein:vir:10 52 YNHHPDILDAPPKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGV-------DNDKALKQIQHTLN----- 119 (478) T ss_pred hcCCCchhccccccccccccccccccceeccchHHHHHHHHHhhhccCCeeeec-------CChHHHHHHHHHHh----- Confidence 1 1111 11111110 01112223444432 Q ss_pred CCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCC--cccccccceeEEEEEcCCceEE Q lcl|NC_020081. 151 DNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDED--GKERKAKDGVRYVQVIDDKVVA 228 (552) Q Consensus 151 ~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~y~~~~~~~~~~ 228 (552) | .+.+....+..+.+.+|.+|+.+..+.+|++ .+..++|..+.++.++. +... -.++|+......... T Consensus 120 ---n---~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~-~~~~~~p~~~~~i~d~~~~~~~~---~~v~~~~~~~~~~~~ 189 (478) T protein:vir:10 120 ---H---KWDDKLVDILTAASNKGIEWVQPYVDEEGEF-KTFRVPAEQAVPIWTNKERDELQ---AFIRVYELDGAERVE 189 (478) T ss_pred ---c---CHHHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcccceEEEEcCCCCCceE---EEEEEEEecCceEEE Confidence 1 3445666788999999999999999988886 47789999999887643 2211 122222222222222 Q ss_pred EEcccceeeeccc----------------------ccCCc---------cCCcccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 229 KFKAKEMAWEVSN----------------------PRTDL---------TVGKYGYPELEIALNHLQYHDNTEVFNARFF 277 (552) Q Consensus 229 ~~~~~evi~~~~~----------------------~~~~~---------~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f 277 (552) .+.++.+.+.+.. ...+. .+..+|.|-++.+...++....+..-..+.+ T Consensus 190 ~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~ 269 (478) T protein:vir:10 190 YWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTF 269 (478) T ss_pred EEeCCeEEEEEEcCCeeeccccccccccccceecccccccCCccceEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHH Confidence 2223222221110 00000 1234688888877777777776666666666 Q ss_pred hccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec-cCCceeeeccCchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 278 AQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT-AEDVKFVNMTQSSKDMEFEKWLNYLINVICSI 356 (552) Q Consensus 278 ~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~ 356 (552) ...+.|-.++. + ...++ .......+. .+++..+. .+|.++.-+........+....+.+.+.|... T Consensus 270 ~~~~~p~~~~~--g-~~~~~--~~~~~~~~~--------~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 336 (478) T protein:vir:10 270 DESVELIYILK--G-YEGED--MKDFMHNLK--------YYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEF 336 (478) T ss_pred HHhhCceeeee--c-CCccc--cchhhhhhh--------hcceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHH Confidence 66666754443 3 22221 111111111 11222221 22333333444445566778888999999999 Q ss_pred hcCCHHHhcccccccccc-cccccccch--h---HHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecccccCh Q lcl|NC_020081. 357 YSIDPSEINFPNRGGATG-HSGNTLNEG--S---SAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNFVGGDA 429 (552) Q Consensus 357 fgVPp~~lg~~~~~t~~~-~~~~~~~~~--n---~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~d~ 429 (552) -++|..-.+- +++ .++....+. . .....+..+..+|+-+++.|...+. ...+ ..+.+.|.+.-+ T Consensus 337 s~~p~~~~~~-----~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g----~~~~~~~i~i~f~~~~p 407 (478) T protein:vir:10 337 GQGVDFQQDK-----FGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYR----LDVKVQDIEITFNFNVM 407 (478) T ss_pred hCccccCccc-----cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----CCcccccceEEecCCCC Confidence 9988643211 111 111110000 0 0112223333444444444333221 1111 345677777766 Q ss_pred HHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCC Q lcl|NC_020081. 430 KTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDN 509 (552) Q Consensus 430 ~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (552) .+.++.++++... +|+++...+++++++-.-+ + ..+..+...+. +.... ......+ ...+. T Consensus 408 ~d~~e~a~~~~kl-~g~iS~et~~~~l~~v~D~--~--------~E~~ri~~E~~-~~~~~---~~~~~~~----~~~~~ 468 (478) T protein:vir:10 408 VNELENSQIAMNS-TGLLSKETILSNHAWVEDP--V--------AEMERIEQENI-ELNQQ---LPDIEEG----LNGEQ 468 (478) T ss_pred CCHHHHHHHHHHH-hCCCChHHHHHhCCCCCCH--H--------HHHHHHHHHHH-HHHhh---ccccccc----cCCCC Confidence 6667767666554 6889988888888652211 1 11111111110 00000 0000000 00000 Q ss_pred CCCCCCcccc Q lcl|NC_020081. 510 VNGKDSFNQN 519 (552) Q Consensus 510 ~~~~~~~~~~ 519 (552) ....++.+.+ T Consensus 469 ~~~~~~~~~~ 478 (478) T protein:vir:10 469 QRQSENNQPE 478 (478) T ss_pred CCCCCCCCCC Confidence 0000000000 No 189 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=98.39 E-value=9.1e-07 Score=53.70 Aligned_cols=425 Identities=12% Similarity=0.093 Sum_probs=166.5 Q ss_pred CcccccccccchhhhhccccccccccccccccccccccccCCccccccc-CCCCchHHHHHHHhhcch------------ Q lcl|NC_020081. 22 DDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPS-IHGKQNLLQMLKLWSRKN------------ 88 (552) Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~Lr~~a~~~------------ 88 (552) -..-.-++|+-++.| |++-- . .|+..-. +..+..-.. ..-...+.+.|+.+.... T Consensus 1 ~~~~~~~~~~~~~~~-------~~~~~-~--~~~~~~~--~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~ 68 (492) T protein:vir:94 1 MQFIQLISQVAQALI-------KGGNI-L--YPSQPTQ--TEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQE 68 (492) T ss_pred ChHHHHHHHHHHHHh-------cCCce-e--ecCccch--hhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000011222222222 11111 0 1111000 000000000 000111222222221100 Q ss_pred ------HHHHHH---------HHHHHH-H-HHHHHHHHHhhcccc--ceeeeeccccccCChhHHHHHHHHHHHHHhcCC Q lcl|NC_020081. 89 ------IILNAI---------IITRVN-Q-VSMFCTPARNSDKGV--GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGR 149 (552) Q Consensus 89 ------~i~~a~---------i~~~~~-~-~~~~~~~~~~~~~~~--~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~ 149 (552) .|...- ...+.+ . +.-+++.+....++. |-.+.+.. .+......+..|+. T Consensus 69 YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~~~-------~d~~~~~~l~~~~~---- 137 (492) T protein:vir:94 69 YYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH-------TDDEVVKRIDEVLG---- 137 (492) T ss_pred HhccccccccccccccccccccccccccccccchHHHHHHHHHhhhcccCceecc-------CchHHHHHHHHHHh---- Confidence 000000 000000 0 011112222211111 11111111 11122233444432 Q ss_pred CCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCC--cccccccceeEEEEEcCCceE Q lcl|NC_020081. 150 IDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDED--GKERKAKDGVRYVQVIDDKVV 227 (552) Q Consensus 150 ~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~y~~~~~~~~~ 227 (552) | ........+..+.+.+|.+|..+-.+.+|+| .+..++|..+.++.++. +... -.++|+........ T Consensus 138 ----n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~~~~~~p~~~~~v~d~~~~~~~~---a~ir~~~~~~~~~~ 206 (492) T protein:vir:94 138 ----N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELE---AFIRMYKLENETKV 206 (492) T ss_pred ----c---cHHHHHHHHHHHHhhCCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceE---EEEEEEeeccceeE Confidence 1 2344566788899999999999999988885 47789999998887642 2211 12223222222222 Q ss_pred EEEcccceeeecc----------------------cccCC-c----cCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_020081. 228 AKFKAKEMAWEVS----------------------NPRTD-L----TVGKYGYPELEIALNHLQYHDNTEVFNARFFAQG 280 (552) Q Consensus 228 ~~~~~~evi~~~~----------------------~~~~~-~----~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng 280 (552) ..++...+.++.. |+-.. | .....|.|-++.+...++....+..-..+.+... T Consensus 207 ~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~ 286 (492) T protein:vir:94 207 EYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDS 286 (492) T ss_pred EEEecCeEEEEEEecCeeeeccccccccccccccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 2222222222211 10000 0 0113578888888888887777777667677777 Q ss_pred CCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_020081. 281 GTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSID 360 (552) Q Consensus 281 ~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP 360 (552) +.|-.++. +. +.+....++..+.. +++ +..+++.+...+........+....+.+.+.|+..-++| T Consensus 287 ~~p~lv~~--g~---~~~~~~~~~~~~~~--------~~~-~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p 352 (492) T protein:vir:94 287 NELTYVLK--NY---DDQELPEFKRLLRY--------YGA-IKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAV 352 (492) T ss_pred cCceeeee--cC---CcccchhhHHHHhh--------ccc-eecCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCc Confidence 77765553 32 11222223332211 111 222333333334444444566777888899999999988 Q ss_pred HHHhcccccccccc-cccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHH Q lcl|NC_020081. 361 PSEINFPNRGGATG-HSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAE 434 (552) Q Consensus 361 p~~lg~~~~~t~~~-~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~ 434 (552) ..-.+. +++ .++....+. .........+...|+-+++.+...++.+ ....++.+.|.+.-+.+.++ T Consensus 353 ~~~~~~-----~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~---~~~~~i~v~f~~~~p~~~~e 424 (492) T protein:vir:94 353 DFSSDK-----FGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK---GEHKDVDISFNYNKVANTEL 424 (492) T ss_pred CCCccc-----cccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---cccceeeEEecCCCCCCHHH Confidence 633221 111 111111100 0111222233344444444444333221 11235677787777777777 Q ss_pred HHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCC Q lcl|NC_020081. 435 IISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKD 514 (552) Q Consensus 435 ~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (552) .++++... .|+++..-++++++.-+-+ + ..+..+...+.. ..+.... .....++ +++. .++. T Consensus 425 ~~~~~~kl-~giiS~et~~~~l~~v~d~--~--------~E~eri~~E~~~-~~~~~~~--~~~~~~~-~~~~--~~~~- 486 (492) T protein:vir:94 425 QVQTAQQS-MGIVSHETVLENHPFVEDL--Q--------AELERIEQEQME-YNKQLPN--LDDGGAD-SAQQ--QERS- 486 (492) T ss_pred HHHHHHHH-hccCchHHHHHhCCCCCCH--H--------HHHHHHHHHHHH-HHhhccc--cccccCC-CCcc--ccCC- Confidence 77666554 4889988888877652211 0 111111111100 0000000 0000000 0000 0000 Q ss_pred CcccccCCCCcccccc Q lcl|NC_020081. 515 SFNQNVGKDGQSKQQA 530 (552) Q Consensus 515 ~~~~~~~~~~~~~~~~ 530 (552) .+++++ T Consensus 487 ----------~~~e~e 492 (492) T protein:vir:94 487 ----------NNKESE 492 (492) T ss_pred ----------ccccCC Confidence 000000 No 190 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=98.34 E-value=1.2e-06 Score=53.04 Aligned_cols=430 Identities=12% Similarity=0.116 Sum_probs=192.8 Q ss_pred ccccccchhhhhc----------cccccccccccccccccccccccCCcccccccCCCC---chHHHHHHHhhcchHHHH Q lcl|NC_020081. 26 VRIKQIEEDAILK----------KGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGK---QNLLQMLKLWSRKNIILN 92 (552) Q Consensus 26 ~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~Lr~~a~~~~i~~ 92 (552) .++-+...+...+ ..++.-.++...-+...... ..+..|....-..+. ..+.+..|.++..+.+-. T Consensus 1 ~~~w~~~de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~-~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~ 79 (511) T protein:vir:56 1 MKFWTKEEEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQ-LGHAIIPSDAQSEGTIPVKELIKSYRALAEYHEVDD 79 (511) T ss_pred CCCccchhhhhhhhhccCCcccccCCCCCCCceEEecccccce-ecceeccccccccCccchHHHHHHHHHHhhccchhh Confidence 1111111111111 01111111111100000000 011111111111222 356677788888888877 Q ss_pred HHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHhcCCCCCCCccCCHHHHHHHHHHHHH Q lcl|NC_020081. 93 AIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEKTGRIDNDFTRDNFRSFVKKLVRDRL 171 (552) Q Consensus 93 a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~l 171 (552) |+-.+..+++ .+....-+..+-+.+.+ .+..-+++|.. +..++..++. ....++ +++.+. T Consensus 80 Av~eIvne~i-------v~d~~~~pV~l~ld~~~--~s~~iK~kI~eeF~~Il~ll~F------~~~~~~----~fR~WY 140 (511) T protein:vir:56 80 AIQEIVDEAI-------VYENDKEVVWLNLDNTD--FSENIKAKINEEFDRVVSLLQM------RKHGYK----WFRKWY 140 (511) T ss_pred HHHHhhccee-------EecCCCceEEEEecccC--cchHHHHHHHHHHHHHHHHhcc------chhhhH----HHhhhh Confidence 7766554443 12234444555554333 44444555543 4444444332 123444 456777 Q ss_pred hcCCeeEEEEECCCCCEEEEEEecCceeEEEEC-----CCcccccccceeEEEEEcCC--------------ceEEEEcc Q lcl|NC_020081. 172 TYDKINFELVYDKLGDLHNFKAVDASTVYVAVD-----EDGKERKAKDGVRYVQVIDD--------------KVVAKFKA 232 (552) Q Consensus 172 l~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~-----~~g~~~~~~~~~~y~~~~~~--------------~~~~~~~~ 232 (552) +.|..|+.++-|..-.+.+|.+|||..|+.++. .+|..... ....|+.+... .....++. T Consensus 141 VDgRi~fHkiid~k~GI~eLr~lDPr~i~~vr~i~~~~~~~~~v~~-~~~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~ 219 (511) T protein:vir:56 141 VDSRIYFHKILDKDNNIIELRPLNPMKMELVREIQKETIDGVEVVK-GTLEYYVYKQSDYKMPSWMSATNRAQTSFRIPK 219 (511) T ss_pred hcceEEEEEEeccccceeehhhcCcccchhhhhhhccccccccccc-ceeeeeEecCCCcccCcccccccccccceeech Confidence 889999999888776799999999999987543 22221111 11222222221 13356777 Q ss_pred cceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhc Q lcl|NC_020081. 233 KEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFS 312 (552) Q Consensus 233 ~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~ 312 (552) +.|+|...+......+..+.+|-|..|...+.....++....-|=-.-+.-+-|..+..+......+-+=++.-+. .+. T Consensus 220 daI~y~hSGL~d~~~~~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~-k~k 298 (511) T protein:vir:56 220 DAIVFAHSGLMRGCADDPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQ-NVK 298 (511) T ss_pred hheeeecccceeccCCCCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHH-hcC Confidence 8887765554333456678899999999998888887776654444444445555555443333322222222221 111 Q ss_pred ----------cccccccce-eec---------cCCceeeecc--CchhHHHHHHHHHHHHHHHHHHhcCCHHHhccccc- Q lcl|NC_020081. 313 ----------GINGAWKIP-VIT---------AEDVKFVNMT--QSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNR- 369 (552) Q Consensus 313 ----------G~~nagk~~-il~---------~~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~- 369 (552) ...+..+.. ++. +.|.++..|. .+.-+|+- ..+....+.++++||.+.|+..+. T Consensus 299 NklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kKLy~aLnVP~SRl~~e~q~ 375 (511) T protein:vir:56 299 NRVVYDTQTGQVKNTTNAMSMLEDYYLPRREGSKGTEVSTLPGGQSLGDIED---VLYFNRKLYKAMRIPTSRAASEDQT 375 (511) T ss_pred ceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHH---HHHHHHHHHHHhCCCcccccCCCCc Confidence 011111110 110 0133444432 33344443 458889999999999999975432 Q ss_pred ccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----C-----cccc-----cceeecccccChHH---- Q lcl|NC_020081. 370 GGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI----V-----SQFG-----GDYVFNFVGGDAKT---- 431 (552) Q Consensus 370 ~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~-----~~~~~~f~~~d~~~---- 431 (552) ++|....+ ..-+.-|-. +...|.-+..++...|...| + ++.+ ..+.+.|.+...-+ T Consensus 376 ~~f~~Gr~--~EItRDEiK----F~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe 449 (511) T protein:vir:56 376 GGINFGQG--AEITRDELK----FTKFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKE 449 (511) T ss_pred cccccccc--hhhhHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHH Confidence 33321111 111112222 23344445555555444332 2 2222 34667776665322 Q ss_pred ------HHHHHHHHHHHhcCCcCHHHHHHH-hCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCC Q lcl|NC_020081. 432 ------EAEIISILESKAKIGLTINDIRKE-LGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGY 503 (552) Q Consensus 432 ------~~~~~~~~~~~~~g~lT~NE~R~~-~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (552) |...++.+..++.-.+|.+=+|+. |.+--.+ +...+.+...+...+.-+.. +++- T Consensus 450 ~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tDee----------i~~~~k~I~~E~k~~~~~~~-------e~~f 511 (511) T protein:vir:56 450 LEILNSRMNAMRDIQDYAGKYYSHKYIQKNILRLSDDQ----------ITAMQSEIDEEETNPRFQQD-------DQGF 511 (511) T ss_pred HHHHHHHHHHHHHhcchhccccchHHHHHHHhccCHHH----------HHHHHHHHHHhhcCCCCCCc-------ccCC Confidence 222223333333334576666653 3332110 00111111111110000000 0000 No 191 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=98.33 E-value=1.3e-06 Score=52.92 Aligned_cols=445 Identities=14% Similarity=0.100 Sum_probs=171.8 Q ss_pred chhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHH---------- Q lcl|NC_020081. 13 QQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLK---------- 82 (552) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr---------- 82 (552) .|..|--.++- +..+++ ..+-.++.. .|.....-.......+.|+ T Consensus 1 ~~~~~~~~~~~---------~~~~~~-----~~~~~~~~~-----------~~~~~~~~~~~~~~~~~l~~~i~~~~~~~ 55 (501) T protein:vir:27 1 MEQTLFTDSTG---------QDLVLN-----LRFHRESRI-----------RYRADNLEELMVNNWELLKNFINHHKLRQ 55 (501) T ss_pred CCceeEEeccc---------hhhhhh-----cccChhHHH-----------hhccccccccccccHHHHHHHHHHHHHHH Confidence 11111100000 000000 000000000 0000000000000111111 Q ss_pred --------Hhhcc--hHHHHHH---HHHHHH--HHHHHHHHHHhhcccc--ceeeeeccccccCChhHHHHHHHHHHHHH Q lcl|NC_020081. 83 --------LWSRK--NIILNAI---IITRVN--QVSMFCTPARNSDKGV--GYEIRLKDPLQEPNDHNKKKIKEIENFIE 145 (552) Q Consensus 83 --------~~a~~--~~i~~a~---i~~~~~--~~~~~~~~~~~~~~~~--~~~i~~k~~~~~~~~~~~~~~~~l~~~l~ 145 (552) ++-++ ..+...- ...+.+ .+.-+++.+....++. |-.+.+...+... .......+.+++. T Consensus 56 ~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~---~~~~~~~l~~~~~ 132 (501) T protein:vir:27 56 APRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNDN---NSQNDDTIKRIGR 132 (501) T ss_pred HHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHHHHHHhhhhcccCeeEecCCccc---hHHHHHHHHHHHH Confidence 11111 0110000 000000 0011111112111111 1111111111100 0111122333332 Q ss_pred hcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCc-ccccccceeEEEEEcC- Q lcl|NC_020081. 146 KTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDG-KERKAKDGVRYVQVID- 223 (552) Q Consensus 146 ~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g-~~~~~~~~~~y~~~~~- 223 (552) . -.+..+...+..+++.+|.+|..+.++..|+|. +..++|..+.++.++.. +.. ...++|+.... T Consensus 133 ~----------n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~-i~~~~p~~~~~v~d~~~~~~~--~~~ir~~~~~~~ 199 (501) T protein:vir:27 133 I----------NDIDSHNRTLIRDLSQTGRAYEVIYRNEYDETR-IKRLNPLETFVIYDNSLEDNS--IAAVRYYNRGTL 199 (501) T ss_pred h----------cChhHHHHHHHHHHhhCCeEEEEEEeCCCCceE-EEEEccceeEEEecCCCCCce--EEEEEEEEeeec Confidence 2 134567778899999999999999999888754 77899999988876542 111 11222222111 Q ss_pred -Cc--eEEEEcccceeeecc-----------cc-----cCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Q lcl|NC_020081. 224 -DK--VVAKFKAKEMAWEVS-----------NP-----RTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTR 284 (552) Q Consensus 224 -~~--~~~~~~~~evi~~~~-----------~~-----~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~ 284 (552) +. ....++.+.+.++.. |+ -..-.+...|.|.++.+...++....+..-..+.+...+.|- T Consensus 200 ~~~~~~~~vyt~~~v~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~ 279 (501) T protein:vir:27 200 QNAKDVVEIYTNEHIYTLDASDDFNEISVTTHAFGTVPITEFLNNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAI 279 (501) T ss_pred CCcEEEEEEEeCCeEEEEEeCCceeeccccccCCCcccEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCce Confidence 11 011122222211100 10 000011247888888888888877777776666666666665 Q ss_pred eEEEeCCCCC-CCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHH Q lcl|NC_020081. 285 GLLHIKTGQE-QSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSE 363 (552) Q Consensus 285 gil~~~~~~~-~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~ 363 (552) .++. +... ...+....++... .-.....+ .+.....++++..++....+..+....+.+.+.|+..-++|..- T Consensus 280 ~v~~--g~~~~~~~~~~~~~~~~~---~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~ 353 (501) T protein:vir:27 280 LAIY--GDLALPKGMQASDMKRTR---LMQLKPPK-SADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMS 353 (501) T ss_pred eeee--cCccCCcccchhhhhhcC---ceeecccc-cccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccC Confidence 5544 3211 1222222222110 00001111 01112344566556655555667788889999999999999754 Q ss_pred hcccccccccccccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcC-cccc-cceeecccccChHHHHHHH Q lcl|NC_020081. 364 INFPNRGGATGHSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIV-SQFG-GDYVFNFVGGDAKTEAEII 436 (552) Q Consensus 364 lg~~~~~t~~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~-~~~~-~~~~~~f~~~d~~~~~~~~ 436 (552) .+-.. + +.++....+. ......+..+...|+-+++.+...++..-- ..++ ..+.+.|.+.-+.+.++.+ T Consensus 354 ~~~~~-~---n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~a 429 (501) T protein:vir:27 354 DTNFS-G---NTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQV 429 (501) T ss_pred ccccc-c---CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHH Confidence 43211 1 1111111110 111222344455555555555544433211 1122 2467788777777777777 Q ss_pred HHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCc Q lcl|NC_020081. 437 SILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSF 516 (552) Q Consensus 437 ~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (552) +++.+. .|+++..-+.+++++-. +-+ ..+..+...+. +...... +.+.++.... +.+ . T Consensus 430 d~~~kl-~g~iS~et~l~~l~~v~--D~~--------~E~eri~~E~~-e~~~~~~-------~~~~~~~~~~--~~d-~ 487 (501) T protein:vir:27 430 SILTGL-GGQVSQETALSLSGLVE--SPN--------EELDKINKEVS-EIDFKGY-------SNDFNEHVGK--YTD-E 487 (501) T ss_pred HHHHHH-hccCcHHHHHHhCCCCC--CHH--------HHHHHHHHHHH-hhhHhhh-------cCcccccccc--ccC-C Confidence 766554 58899877777764421 100 01111111110 0000000 0000000000 000 0 Q ss_pred ccccCCCCcccccc Q lcl|NC_020081. 517 NQNVGKDGQSKQQA 530 (552) Q Consensus 517 ~~~~~~~~~~~~~~ 530 (552) ..+...|..++..+ T Consensus 488 ~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 488 VKETHTDDFERAYE 501 (501) T ss_pred CCCCccccccccCC Confidence 01111222222222 No 192 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=98.32 E-value=1.4e-06 Score=52.74 Aligned_cols=436 Identities=13% Similarity=0.070 Sum_probs=170.3 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhc-chHHHHHHHHHHHH-H Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSR-KNIILNAIIITRVN-Q 101 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~-~~~i~~a~i~~~~~-~ 101 (552) |--...-++....-..+. ..+-.++... |. .+.+.+... ....+..+|..... . T Consensus 1 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~-----------~~-----------~~~~~~~~~~~~~~i~~~i~~h~~~~ 56 (502) T protein:vir:48 1 MMEQTLFTDSTGQDLVLN--LRFHRESRIR-----------YR-----------ADNLEELMVNNWELLKNFINHHKLRQ 56 (502) T ss_pred CceeEEEEecchhHHHhh--cccChhHHhh-----------hc-----------ccchhhhccccHHHHHHHHHHHHHHH Confidence 000000000000000000 0000001100 00 000111000 01112222221110 0 Q ss_pred HHH----------------------------------HHHHHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHHH Q lcl|NC_020081. 102 VSM----------------------------------FCTPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIE 145 (552) Q Consensus 102 ~~~----------------------------------~~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~ 145 (552) ..+ +++.+....++ +|-.+.....+.+. .......+.+++. T Consensus 57 ~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~---~~~~~~~l~~~~~ 133 (502) T protein:vir:48 57 APRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNED---NSQNDDAIKRIGR 133 (502) T ss_pred HHHHHHHHHHhcCCCccccccccccccccccceeecchHHHHHHHHhhhhcccCeeEecCCccc---hhHHHHHHHHHHh Confidence 111 11111111111 11111111111000 0001111222222 Q ss_pred hcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEE-cCC Q lcl|NC_020081. 146 KTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQV-IDD 224 (552) Q Consensus 146 ~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~-~~~ 224 (552) . -.+......+..+++.+|.+|+.+.++.+|.+ .+..++|..+.++.++..... ..-.++|+.. ... T Consensus 134 ~----------N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~-~i~~~~p~~~~~vydd~~~~~-~~~~ir~~~~~~~~ 201 (502) T protein:vir:48 134 I----------NDIDTHNRNLIRDLSQTGRAYEVIYRSEYDET-RIKRLSPLETFVIYDNSLEDN-SIAAVRYYNRGTLQ 201 (502) T ss_pred h----------cCHhHHHHHHHHHHhhcCeEEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCc-eEEEEEEEEEeecC Confidence 1 13445777889999999999999999998875 477899999988876532110 0111222211 111 Q ss_pred c---eEEEEcccceeeecc-----------cccC-----CccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_020081. 225 K---VVAKFKAKEMAWEVS-----------NPRT-----DLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRG 285 (552) Q Consensus 225 ~---~~~~~~~~evi~~~~-----------~~~~-----~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~g 285 (552) . ....++.+.+.++.. |+.. .-.+...|.|.++.+...++....+..-..+.+...+.|-. T Consensus 202 ~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l 281 (502) T protein:vir:48 202 NAKDVVEIYTNQHIYTLDASDSFNEISVTPHAFGTVPITEFLNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAIL 281 (502) T ss_pred CcEEEEEEEeCCeEEEEEeCCceeeccceecCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 1 111222222221110 1000 00112478888888888888777777777777777777765 Q ss_pred EEEeCCCCCC-CHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHh Q lcl|NC_020081. 286 LLHIKTGQEQ-SNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEI 364 (552) Q Consensus 286 il~~~~~~~~-s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l 364 (552) ++. +.... .++....++... ........+ .--.+.+.++..++.......+....+.+.+.|+..-++|+... T Consensus 282 v~~--g~~~~~~~~~~~~~~~~~-~~~~~~~~~---~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~ 355 (502) T protein:vir:48 282 AIY--GDLALPQGMQASDMKRTR-LMQLKPPKS---ADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSD 355 (502) T ss_pred eee--cCcccccccchhhhhhcc-eeecccccc---ccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCc Confidence 554 32211 222222222111 000000000 00112345555555554555566778899999999999997554 Q ss_pred cccccccccccccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcC-cccc-cceeecccccChHHHHHHHH Q lcl|NC_020081. 365 NFPNRGGATGHSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIV-SQFG-GDYVFNFVGGDAKTEAEIIS 437 (552) Q Consensus 365 g~~~~~t~~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~-~~~~-~~~~~~f~~~d~~~~~~~~~ 437 (552) +-.. + +.++....+. .-....+..+...|+-+++.+...++..-- ..++ ..+.+.|.+..+.+..+.++ T Consensus 356 ~~~~-~---n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~ 431 (502) T protein:vir:48 356 NHFS-G---NASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNLPKSLYEQVS 431 (502) T ss_pred cccc-c---CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHH Confidence 3211 1 1111111111 111223344555555555555555443211 1222 24677787777777777777 Q ss_pred HHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcc Q lcl|NC_020081. 438 ILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFN 517 (552) Q Consensus 438 ~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 517 (552) ++.+. .|+++..-+.+++++-.-+ + ..+..+...+.. ...........+ ...+ +.+... T Consensus 432 ~~~kl-~g~iS~et~l~~l~~v~D~--~--------~E~~ri~~E~~~---~~~~~~~~~~~~----~~~~---~~d~~~ 490 (502) T protein:vir:48 432 ILNDL-GGQVSQETALSLSGLVENP--T--------EELDKINEESSK---IDFKGYPSYFYD----NVGK---YTDEVK 490 (502) T ss_pred HHHHH-hccCcHHHHHHhCCCCCCH--H--------HHHHHHHHHHHh---hhhhcccccccc----cccc---cCCCcc Confidence 66554 5889988888877542110 0 111111111000 000000000000 0000 000000 Q ss_pred cccCCCCcccccccccc Q lcl|NC_020081. 518 QNVGKDGQSKQQANTNS 534 (552) Q Consensus 518 ~~~~~~~~~~~~~~~~~ 534 (552) +...++.++.-. T Consensus 491 -----e~~~~~~~~~~~ 502 (502) T protein:vir:48 491 -----ETHTDDFERVYE 502 (502) T ss_pred -----CCCCcCcCCCCC Confidence 000001011100 No 193 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=98.30 E-value=1.5e-06 Score=52.51 Aligned_cols=357 Identities=9% Similarity=0.014 Sum_probs=147.3 Q ss_pred hhccccccccccccccccccccccccCCccccc-ccC-CCCchHHHHHHHhh--cchHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020081. 36 ILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEA-PSI-HGKQNLLQMLKLWS--RKNIILNAIIITRVNQVSMFCTPARN 111 (552) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~Lr~~a--~~~~i~~a~i~~~~~~~~~~~~~~~~ 111 (552) +. .+..+.. ..-.|+.. ... .-.....+.++... -.++. .-++.+.+ . T Consensus 1 l~--~~~~r~~--------------~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~-~~~Vds~a-----------~ 52 (410) T protein:vir:95 1 MN--LYQSRVN--------------LRYKHYAMQHYEAPTGITIPAHIRAKYQAVLGWA-AKGVDSLA-----------D 52 (410) T ss_pred CC--cchhhHH--------------HHHHHhcCCCCccccchhccHHHHhHHHhhcchh-HHHHHHhH-----------h Confidence 10 0111100 00011110 000 00001112222110 01111 12222211 1 Q ss_pred hccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEE Q lcl|NC_020081. 112 SDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNF 191 (552) Q Consensus 112 ~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L 191 (552) ....-||. ..+ ..+.+++.. | .+......+..+.|++|.+|+.|..+.+|.+ .+ T Consensus 53 rl~~~Gf~----~~d-----------~~l~~i~~~-------N---~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~-~i 106 (410) T protein:vir:95 53 RLIFRAFA----NDD-----------FNVTEIFDR-------N---NPDIFFDSAILSALIGSCSFVYISKGEDDEV-RL 106 (410) T ss_pred hhcccccc----CCC-----------chHHHHHhh-------c---ChHHHHHHHHHHHHHhCceeEEEecCCCCce-EE Confidence 11222332 111 124455432 1 2345666788999999999999999888876 58 Q ss_pred EEecCceeEEEECCCcccccccceeEEEEEcCCce---EEEEcccce---------------------eeecccccCCcc Q lcl|NC_020081. 192 KAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKV---VAKFKAKEM---------------------AWEVSNPRTDLT 247 (552) Q Consensus 192 ~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~---~~~~~~~ev---------------------i~~~~~~~~~~~ 247 (552) .+++|..+.++.|+..+.+.. .+.+.....++. ...|.++.+ +++..++ .. T Consensus 107 ~~~sP~~~~~i~Dp~~~~~~~--al~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~---~l 181 (410) T protein:vir:95 107 QVIESSNATGVIDPITGLLVE--GYAVLARDDYNRPTLEAYFEPNATHFIPKDGEPYSVTNETGIPLLVPVIHRP---DA 181 (410) T ss_pred EEEcccceEEEEeCCCCceEE--EEEEEEecCCCeEEEEEEEeCCcEEEEeeCCccccccCCCCCcceEEecccc---cC Confidence 899999999988764332211 111111111111 112222222 2222221 12 Q ss_pred CCcccccH----HHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceee Q lcl|NC_020081. 248 VGKYGYPE----LEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVI 323 (552) Q Consensus 248 ~g~~G~sp----l~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il 323 (552) .+.+|.|. +..+.+.+.....-......||. .|.-.+. +...+....+.++... +++..+ T Consensus 182 ~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a---~pqr~i~---G~d~d~~~~~~~~~~~----------~~i~~~ 245 (410) T protein:vir:95 182 VRPFGRSRITRAGMYYQKYAKRTLERADITAEFYS---WPQKYIL---GLDPDAEPMEKWKATV----------SSLLTI 245 (410) T ss_pred CccCCccccchhHHHHHHHHHHHHHHHHHHHHHhc---chhheee---ccCCCCCcCchhhhhh----------hhheec Confidence 34678774 44444544444444444555653 4543432 1221222222232222 122222 Q ss_pred c----cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHH---H Q lcl|NC_020081. 324 T----AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDK---G 396 (552) Q Consensus 324 ~----~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~---~ 396 (552) . +.+.++.++....-+ .|++..+.....||..-++|++.+|.......++ ..+..+...+... . T Consensus 246 ~~~~~~~~~~v~q~~~~~l~-~~~~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa--------~Al~a~~~~L~~ka~~k 316 (410) T protein:vir:95 246 SSSDKGVKPSVGQFTTASMS-PFTEQLRTAAAGFAGEMGLTLDDLGFVSDNPSSV--------EAIKASHENLRLAGRKA 316 (410) T ss_pred cCCCCCCcceEEecCCCChH-HHHHHHHHHHHHHhhhcCCCHHHhccccCchhHH--------HHHHHHHHHHHHHHHHH Confidence 1 123566655443332 4889999999999999999999998533211000 0111111111111 1 Q ss_pred hhHHHHHHHHHHHhhc--Cccc------ccceeeccc---ccChHHHHHHHHHHHH-Hhc--CCcCHHHHHHHhCCCCCC Q lcl|NC_020081. 397 LEPLLKFIEDAVNKYI--VSQF------GGDYVFNFV---GGDAKTEAEIISILES-KAK--IGLTINDIRKELGYPDTE 462 (552) Q Consensus 397 l~P~~~~ie~~ln~~L--~~~~------~~~~~~~f~---~~d~~~~~~~~~~~~~-~~~--g~lT~NE~R~~~gl~p~~ 462 (552) -+-+-..+++.+-..+ .... .....+.|- ..+..+.++.+..+.+ ..+ |+++..-+++++|+.+-+ T Consensus 317 ~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~~ 396 (410) T protein:vir:95 317 QRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKLNQALPGYINAETIRDLTGIAGDM 396 (410) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEeeecCCcchhhHHHHHHHHHHHHHhccCCccHHHHHHhcCCChHH Confidence 1111111222111111 1111 112333343 3344444444444322 223 666777789999996531 Q ss_pred CCCeeeccccccchhhhccccccccccCCC Q lcl|NC_020081. 463 GGDVTLAGVHVQRLGQIMQQEQVEYQRQMD 492 (552) Q Consensus 463 ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~ 492 (552) +. ..+.+.+.+. ++ T Consensus 397 ---~~---------~~~~~e~~~~----g~ 410 (410) T protein:vir:95 397 ---SA---------KPVVSEGGSN----GE 410 (410) T ss_pred ---HH---------HHHHHHHHhC----CC Confidence 00 0000000000 00 No 194 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=98.26 E-value=1.9e-06 Score=51.97 Aligned_cols=418 Identities=13% Similarity=0.087 Sum_probs=164.7 Q ss_pred CC-------CCCCCcccccchhhcccccCcccccccccchhhhhcccccccccccccccc---ccccc--cccCCc-ccc Q lcl|NC_020081. 1 MG-------LLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEE---PIIGS--MSMNPD-FKE 67 (552) Q Consensus 1 ~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~--~~~~~~-~~~ 67 (552) ++ .+.=..++. +.....+...+.+.+.- ..+-.+-+.|-. .+..+ ..+... ... T Consensus 2 ~~~~~~~~~~~~~~~~~~------------~~~~~~~~i~~~~~~~~-~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~ 68 (479) T protein:vir:79 2 LNIYISETDLIKVQLKKE------------STINLVKVIEHYILKHR-PEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVD 68 (479) T ss_pred CCceecccceEeeccccC------------ChhHHHHHHHHHHhhhh-HHHHHHHHHHhccCCccccccccccccccccc Confidence 11 111111111 11111111111111100 000000000000 00000 000000 000 Q ss_pred cccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhc Q lcl|NC_020081. 68 APSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKT 147 (552) Q Consensus 68 ~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~ 147 (552) ....... ++. ....+.|+...+.-+ -+-+..+ .-. +.. ....+..|+. T Consensus 69 ~~~~~~~--------ki~--~~~~~~Ivd~~~~~l-----------~g~p~~~--~~~-----~~~--~~~~~~~~~~-- 116 (479) T protein:vir:79 69 DFTKVNN--------KAI--NNYHKLLVDQKVGYS-----------VGNPIVF--NAD-----DDN--LTKLLNDLLG-- 116 (479) T ss_pred ccccCcc--------eee--cchHHHHHHHHHhhh-----------hcCCcee--ccC-----CHH--HHHHHHHHHh-- Confidence 0000000 000 111122222211111 1112112 111 111 1122333321 Q ss_pred CCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEc--CCc Q lcl|NC_020081. 148 GRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVI--DDK 225 (552) Q Consensus 148 n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~--~~~ 225 (552) | .+......++.+.+.+|.+|..+..+..|++. +..++|..+.++.++.+... ....++|+... .+. T Consensus 117 ------n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~v~d~~~~~~-~~~~ir~y~~~~~~~~ 185 (479) T protein:vir:79 117 ------E---EFDDTITELYLNASNKGVEWLHPYINRKGEFK-YVIIPAEEAIPIWDSKRQRE-LVAFIRFYYIEDIDGN 185 (479) T ss_pred ------c---CHHHHHHHHHHHHHhcCeEEEEEEeCCCCceE-EEEEccceeEEEEeCCCCCc-eEEEEEEEEEeecCCc Confidence 1 34556677889999999999999999888865 88899999988876543211 01112222111 111 Q ss_pred ---eEEEEcccceeeecccc---------------------------cCC---------ccCCcccccHHHHHHHHHHHH Q lcl|NC_020081. 226 ---VVAKFKAKEMAWEVSNP---------------------------RTD---------LTVGKYGYPELEIALNHLQYH 266 (552) Q Consensus 226 ---~~~~~~~~evi~~~~~~---------------------------~~~---------~~~g~~G~spl~~~~~~i~~~ 266 (552) ....+..+.+.+++..- ..+ -....+|.|-++.+...++.. T Consensus 186 ~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~sd~~~v~~liDa~ 265 (479) T protein:vir:79 186 KIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKNNEKCVSDLTFYKSLIDIY 265 (479) T ss_pred eEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCcccEEEecCCCCCCcchhhhHHHHHHH Confidence 01112222222211100 000 001235788888888777777 Q ss_pred HHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHH Q lcl|NC_020081. 267 DNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWL 346 (552) Q Consensus 267 ~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~ 346 (552) ..+.....+.+...+.|-.++. +. ...+ .+.+...+ ..+++ +..+++.+++-+........+.... T Consensus 266 d~~~S~~~~~~~~~~~~~~v~~--g~-~~~~--~~~~~~~~--------~~~~~-i~~~~~~~~~~l~~~~~~~~~~~~~ 331 (479) T protein:vir:79 266 DNNISTLADNLDEIQEVIYVLK--EY-PGTS--LQEFIDNI--------RYYKS-IKVDGGGGVDKLEINIPVEAKKELL 331 (479) T ss_pred HHHHHHHHHHHHHhhCceeeee--cC-Cccc--cccchhhh--------hhccc-eecCCCCcceEEeccCCHHHHHHHH Confidence 7666666666777777765554 32 1111 11111111 11222 2223344444444444556677888 Q ss_pred HHHHHHHHHHhcCCHHHhcccccccccccccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cce Q lcl|NC_020081. 347 NYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDY 420 (552) Q Consensus 347 ~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~ 420 (552) +.+.+.|+..-++|..-.+. +++.++....+. ......+..+...|+-+++.+...++..-....+ ..+ T Consensus 332 ~~l~~~i~~~s~~p~~~~~~-----~gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i 406 (479) T protein:vir:79 332 DRLEKNIIIFGQGVNPESQN-----TGDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNKSYDYKTV 406 (479) T ss_pred HHHHHHHHHHhCcccccccc-----ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccc Confidence 88999999999988653321 111111111100 0122233344555555555555444432211222 345 Q ss_pred eecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCccc Q lcl|NC_020081. 421 VFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQ 500 (552) Q Consensus 421 ~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (552) .+.|.+..+.+..+.++++.+. .|+|+...+.++++.- + |. -..+..+...+.... +. ..... T Consensus 407 ~i~f~~~~p~~~~~~a~~~~kl-~g~iS~et~l~~l~~v--~--d~------~~E~~ri~~E~~~~~--~~-~~~~~--- 469 (479) T protein:vir:79 407 QITFNHSMIINEAEKIDMAAKS-TGIVSDETIVSNHPWV--E--DV------NDELERLKKQEDTQK--EY-DDLIP--- 469 (479) T ss_pred eEEeCCCCCcCHHHHHHHHHHH-hccCcHHHHHHhCCCC--C--CH------HHHHHHHHHHHHHHH--HH-HhccC--- Confidence 6777777666667666665554 5889988888777542 1 10 011111111110000 00 00000 Q ss_pred CCCCCCCCCCCCCCC Q lcl|NC_020081. 501 TGYDGNMDNVNGKDS 515 (552) Q Consensus 501 ~~~~~~~~~~~~~~~ 515 (552) +...+..+++ T Consensus 470 -----~~~~~~~~e~ 479 (479) T protein:vir:79 470 -----NNQDGVIDET 479 (479) T ss_pred -----cccCCCcCcC Confidence 0000000111 No 195 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=98.26 E-value=1.9e-06 Score=51.89 Aligned_cols=416 Identities=13% Similarity=0.122 Sum_probs=166.5 Q ss_pred CcccccccccchhhhhccccccccccccccccccccccccCCcccccccCC-CCchHHHHHHHhhcchHHHHHHHHHHHH Q lcl|NC_020081. 22 DDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIH-GKQNLLQMLKLWSRKNIILNAIIITRVN 100 (552) Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~Lr~~a~~~~i~~a~i~~~~~ 100 (552) -..-.-++|+-++.| + ++ ...+ |+..-. +..+..-.... -...+.+.|+.+ +.-... T Consensus 1 ~~~~~~~~~~~~~~~-~------~~-~~~~--~~~~~~--~~~~~~~~~~~~~~~~~~~~i~~~----------i~~~~~ 58 (492) T protein:vir:97 1 MQFIQLISQVAQALI-K------GG-NILY--PSQPTQ--TEIFDAIVRTNNKPETLEEMIVRY----------IKQHLE 58 (492) T ss_pred ChHHHHHHHHHHHHh-c------CC-ceee--ccchhh--hhHhhhcccCCCchhhHHHHHHHH----------HHHHHH Confidence 000011222222222 1 11 0000 010000 00000000000 001111222221 111111 Q ss_pred HHHHH---------------------------------------HHHHHhhccc--cceeeeeccccccCChhHHHHHHH Q lcl|NC_020081. 101 QVSMF---------------------------------------CTPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKE 139 (552) Q Consensus 101 ~~~~~---------------------------------------~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~ 139 (552) .+.++ ++.+....++ +|-.+.+... +...... T Consensus 59 ~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~-------d~~~~~~ 131 (492) T protein:vir:97 59 KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHT-------DDEVVKR 131 (492) T ss_pred HHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHHhhhhcccCceeccC-------chHHHHH Confidence 11111 1111111111 1111111111 1112223 Q ss_pred HHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCC--cccccccceeE Q lcl|NC_020081. 140 IENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDED--GKERKAKDGVR 217 (552) Q Consensus 140 l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~ 217 (552) +..|+. | ........+..+++.+|.+|..+.++.+|++ .+..++|..+.++.++. +... -.++ T Consensus 132 l~~~~~--------n---~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~-~~~~~~p~~~~~i~d~~~~~~~~---~~vr 196 (492) T protein:vir:97 132 IDEVLG--------N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELE---AFIR 196 (492) T ss_pred HHHHHh--------c---cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceE---EEEE Confidence 333332 1 1334555678899999999999999988885 47889999999887643 2221 1222 Q ss_pred EEEEcCCceEEEEcccceeeecc----------------------cccCC-----ccCCcccccHHHHHHHHHHHHHHHH Q lcl|NC_020081. 218 YVQVIDDKVVAKFKAKEMAWEVS----------------------NPRTD-----LTVGKYGYPELEIALNHLQYHDNTE 270 (552) Q Consensus 218 y~~~~~~~~~~~~~~~evi~~~~----------------------~~~~~-----~~~g~~G~spl~~~~~~i~~~~~~~ 270 (552) |+..........+.+..+.++.. |+-.. -.....|.|-++.+...++....+. T Consensus 197 ~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~ 276 (492) T protein:vir:97 197 MYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRL 276 (492) T ss_pred EEeeccceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCCCCCCchHhHHHHHHHHHHHH Confidence 32222222222222222222210 10000 0011358888888888887777766 Q ss_pred HHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHH Q lcl|NC_020081. 271 VFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLI 350 (552) Q Consensus 271 ~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~ 350 (552) .-..+.+...+.|-.++. +. +.+....++..+.. .++ +..+++.+...+........+....+.+. T Consensus 277 S~~~~~~~~~~~~~l~~~--g~---~~~~~~~~~~~~~~--------~~~-~~~~~~~~~~~l~~~~~~~~~~~~~~~L~ 342 (492) T protein:vir:97 277 SDLSNTFKDSNELTYVLK--NY---DDQELPEFKRLLRY--------YGA-IKVSDNGGVDTIQVEVPVENSKKYLDELY 342 (492) T ss_pred HHHHHHHHHhccceeeee--cC---CcccchhHHHHHhh--------ccc-eecCCCCcceeEeccCCHHHHHHHHHHHH Confidence 666666777677765553 32 11122223222211 111 22233334444444455667778889999 Q ss_pred HHHHHHhcCCHHHhcccccccccccccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeeccc Q lcl|NC_020081. 351 NVICSIYSIDPSEINFPNRGGATGHSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFV 425 (552) Q Consensus 351 ~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~ 425 (552) +.|+..-++|..-.+-.. + +.++....+. ......+..+...|+.+++.|...++. ......+.+.|. T Consensus 343 ~~I~~~s~~p~~~~~~~~-~---n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~---~~~~~~i~v~f~ 415 (492) T protein:vir:97 343 QKIMLFGQAVDFSSDKFG-S---APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI---KGEHKDVDISFN 415 (492) T ss_pred HHHHHHhCCCCCCccccc-c---CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CcccceeeEEec Confidence 999999999864332110 0 1111111000 011222233444555555544443322 111234667777 Q ss_pred ccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCC Q lcl|NC_020081. 426 GGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDG 505 (552) Q Consensus 426 ~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 505 (552) +.-+.+.++.++++.+. .|++|..-+.+++++-+-+ + ..+..+.+.+.. .... .+.. .. . T Consensus 416 ~~~p~~~~e~a~~~~kl-~G~iS~et~l~~l~~v~d~--~--------~Eleri~~E~~~-~~~~---~~~~-~~----~ 475 (492) T protein:vir:97 416 YNKVANTELQVQTAQQS-MGIVSHETVLENHPFVEDL--Q--------AELERIEQEQTE-YNKQ---LPNL-DD----G 475 (492) T ss_pred CCCCCCHHHHHHHHHHH-hccCchHHHHHhCCCCCCH--H--------HHHHHHHHHHHH-HHHh---hhcc-cc----C Confidence 77776677666665554 5889988887777652211 0 111111111100 0000 0000 00 0 Q ss_pred CCCCCCCCCCcccccCCCCcccccc Q lcl|NC_020081. 506 NMDNVNGKDSFNQNVGKDGQSKQQA 530 (552) Q Consensus 506 ~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (552) .. +.+++. + ++.++.++ T Consensus 476 ~~--~~~~~~--~----~~~~~~~e 492 (492) T protein:vir:97 476 GA--DSAQQQ--E----RSNNKESE 492 (492) T ss_pred CC--CCCccc--c----cccccccC Confidence 00 000000 0 00000000 No 196 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=98.25 E-value=2e-06 Score=51.78 Aligned_cols=475 Identities=12% Similarity=0.096 Sum_probs=191.7 Q ss_pred ccccccccchhhhhccccccccc-cccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHH---HHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSN-KPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNII---LNAIIITRV 99 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i---~~a~i~~~~ 99 (552) |-+.+-.+...+..... ...+ .--+.++++- +|+..-+.+.-+....-=+-+.|.. +++ ++..+.=+. T Consensus 1 ma~~~lr~~rrpk~~p~--~~rr~~ltaAsq~~~-----~p~~~~kt~~~~~ar~~WQ~eAW~~-~d~v~Elry~vgW~~ 72 (639) T protein:vir:97 1 MAATSLRVVRRPKGSAP--AARRRSLTAASQLIT-----DPQKQMKTSLMGTARNEWQSEAWDF-SESIGELSYYVSWRA 72 (639) T ss_pred CCccceeeeecCCCCCc--chhhHHHhhhhhccC-----Ccccchhhhccccchhhhhhhhhhh-hhhhhhHHHHhhhhh Confidence 22221111111111100 0000 1111222221 1111111110000000001111110 011 122222223 Q ss_pred HHHHHHHHHHHhhccccceeeeecccc-----ccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcC Q lcl|NC_020081. 100 NQVSMFCTPARNSDKGVGYEIRLKDPL-----QEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYD 174 (552) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~i~~k~~~-----~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~G 174 (552) ++++++ ++ .....|++ +++..++.-..+.+.+..+... .. .+-..++++.+..++-+-| T Consensus 73 ~s~sr~-rL----------~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~ia--gG---~lGqa~llkr~~~~ltV~G 136 (639) T protein:vir:97 73 NSCSRT-TL----------IPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIA--DG---PLGQAALIKRAVECMTVVG 136 (639) T ss_pred hhhcee-ee----------EeeeeccccCCCCCccccccccCcchHHHHHHhhc--Cc---cchHHHHHHHHHhheeccc Confidence 333322 11 11111211 1122222223334444544432 22 2445679999999999999 Q ss_pred CeeEEEE-ECCCC------CEEEEEEe-cCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCc Q lcl|NC_020081. 175 KINFELV-YDKLG------DLHNFKAV-DASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDL 246 (552) Q Consensus 175 na~~~i~-r~~~G------~~~~L~~l-~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~ 246 (552) .+|+.++ |...+ .+.+-|++ -..-|. ...|.. .-....+|.........+++.-. .++++ T Consensus 137 E~wi~~l~r~~k~~~~~~~~~~~~W~vvs~~Ei~---~~~~~~-------~~i~lPdG~~he~~~~~d~l~Rv--W~P~p 204 (639) T protein:vir:97 137 EVWIAVLIRQEKDPVTGLAAPRARWYAVTREEIK---SKAGET-------AEISLPDGKTHEFNRDLDSLVRI--WNPRP 204 (639) T ss_pred ceEEEEEEecCccccCcccccccceeeeeHHHhc---ccCCCe-------eEeecCCCCCccccCCCceEEEE--eCCCc Confidence 9998765 33333 23444443 222222 111111 00111122222112223444322 23455 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCH-----------------------HHHHHH Q lcl|NC_020081. 247 TVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSN-----------------------QALTSF 303 (552) Q Consensus 247 ~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~-----------------------~~~~~~ 303 (552) ....+--||+.+++..+.-...+.+...+..+.-.+-.|||.+|....+.. -+.+.| T Consensus 205 rr~~e~dSpvra~l~~l~Ei~~~t~~i~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l 284 (639) T protein:vir:97 205 RKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQL 284 (639) T ss_pred ccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHH Confidence 556778899999999998888888877777777777777877753221110 112333 Q ss_pred HHHHH----HHhcccc-ccccceeeccCC----ceeeeccCch-hHHHHHHHHHHHHHHHHHHhcCCHHHh-cccccccc Q lcl|NC_020081. 304 RREWT----SMFSGIN-GAWKIPVITAED----VKFVNMTQSS-KDMEFEKWLNYLINVICSIYSIDPSEI-NFPNRGGA 372 (552) Q Consensus 304 ~~~~~----~~~~G~~-nagk~~il~~~g----~~~~~l~~~~-~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~ 372 (552) ...|- ..+..-+ .+--+||+...- -+++.+.+.. -+.--+.+++..+..||....|||..| |+. +++- T Consensus 285 ~~~l~qaa~tai~De~S~aA~vPiia~~p~E~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~-d~NH 363 (639) T protein:vir:97 285 ATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMS-KGNH 363 (639) T ss_pred HHHHHHHHHhhhcCCCCccceeeeeEeechHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecc-cccc Confidence 33332 2232211 244567664321 2344444433 334457899999999999999999764 763 3443 Q ss_pred cccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcc----cc---cceeecccccChHHHHHH-HHHHHHHhc Q lcl|NC_020081. 373 TGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQ----FG---GDYVFNFVGGDAKTEAEI-ISILESKAK 444 (552) Q Consensus 373 ~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~----~~---~~~~~~f~~~d~~~~~~~-~~~~~~~~~ 444 (552) |.... -...-++..|.|.+..|+++|++.+|.. .| .+|.+.|+-.......+. .++.+.... T Consensus 364 WsAWq----------I~dedvrlHI~P~l~~icdAlT~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~dPd~~deA~qa~dr 433 (639) T protein:vir:97 364 WSAWA----------IGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDR 433 (639) T ss_pred eEEEE----------ecccceeeecchhHHHHHHHHHhhHHHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHc Confidence 33221 1112345679999999999999887742 22 357788864432211111 122334456 Q ss_pred CCcCHHHHHHHhCCCCCCCCCeeecc-------------ccccchhhhccccccccccCCCCCccCcccCC-CCCCCCCC Q lcl|NC_020081. 445 IGLTINDIRKELGYPDTEGGDVTLAG-------------VHVQRLGQIMQQEQVEYQRQMDANQFLAQQTG-YDGNMDNV 510 (552) Q Consensus 445 g~lT~NE~R~~~gl~p~~ggD~~~~~-------------~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 510 (552) |.||-.-.|+.+|+.--.|=|+.-.+ -.+.+. ......+.....+ +..++.+ ...+.+.+ T Consensus 434 GAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~~~V~~~P~li~~---~apl~~P~lq~~e---~ptp~~a~~~a~~~~~ 507 (639) T protein:vir:97 434 GAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAM---YAPLLSSQLAGIE---FPQPANAIESTREDEE 507 (639) T ss_pred CCccHHHHHHHhccccccCCCCCCcHHHHHHHHHHhcCCcchhhh---hhhccCccceecc---cCCCCCCCCCCCCCCC Confidence 99999999999999654332211110 011000 0000000000000 0000000 00000101 Q ss_pred CCCCCcccccCCCCccccccccccccccCcc----------------cccc-----ccccccC Q lcl|NC_020081. 511 NGKDSFNQNVGKDGQSKQQANTNSTPQGGKD----------------DNGN-----VVNDWEA 552 (552) Q Consensus 511 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~-----~~~~~~~ 552 (552) +.+++.+.+ +..++.++...+-++..-... ..|. ...+-.+ T Consensus 508 ~de~~ga~~-~~ePdte~~~~~~~a~~~~~~a~~v~a~~llv~RALelAGkRr~~~~~r~~~a 569 (639) T protein:vir:97 508 DDEDSGARQ-QREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRRFKVNDAALKT 569 (639) T ss_pred cccccCCCC-CcCCCcccccCCccccCcCchhHHHHHHHHHHHHHHHhhcccccCCCChhhHH Confidence 000000000 011111111111111000000 0000 0000000 No 197 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=98.25 E-value=2e-06 Score=51.78 Aligned_cols=475 Identities=12% Similarity=0.096 Sum_probs=191.7 Q ss_pred ccccccccchhhhhccccccccc-cccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHH---HHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSN-KPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNII---LNAIIITRV 99 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i---~~a~i~~~~ 99 (552) |-+.+-.+...+..... ...+ .--+.++++- +|+..-+.+.-+....-=+-+.|.. +++ ++..+.=+. T Consensus 1 ma~~~lr~~rrpk~~p~--~~rr~~ltaAsq~~~-----~p~~~~kt~~~~~ar~~WQ~eAW~~-~d~v~Elry~vgW~~ 72 (639) T protein:vir:10 1 MAATSLRVVRRPKGSAP--AARRRSLTAASQLIT-----DPQKQMKTSLMGTARNEWQSEAWDF-SESIGELSYYVSWRA 72 (639) T ss_pred CCccceeeeecCCCCCc--chhhHHHhhhhhccC-----Ccccchhhhccccchhhhhhhhhhh-hhhhhhHHHHhhhhh Confidence 22221111111111100 0000 1111222221 1111111110000000001111110 011 122222223 Q ss_pred HHHHHHHHHHHhhccccceeeeecccc-----ccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcC Q lcl|NC_020081. 100 NQVSMFCTPARNSDKGVGYEIRLKDPL-----QEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYD 174 (552) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~i~~k~~~-----~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~G 174 (552) ++++++ ++ .....|++ +++..++.-..+.+.+..+... .. .+-..++++.+..++-+-| T Consensus 73 ~s~sr~-rL----------~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~ia--gG---~lGqa~llkr~~~~ltV~G 136 (639) T protein:vir:10 73 NSCSRT-TL----------IPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIA--DG---PLGQAALIKRAVECMTVVG 136 (639) T ss_pred hhhcee-ee----------EeeeeccccCCCCCccccccccCcchHHHHHHhhc--Cc---cchHHHHHHHHHhheeccc Confidence 333322 11 11111211 1122222223334444544432 22 2445679999999999999 Q ss_pred CeeEEEE-ECCCC------CEEEEEEe-cCceeEEEECCCcccccccceeEEEEEcCCceEEEEcccceeeecccccCCc Q lcl|NC_020081. 175 KINFELV-YDKLG------DLHNFKAV-DASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDL 246 (552) Q Consensus 175 na~~~i~-r~~~G------~~~~L~~l-~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~ 246 (552) .+|+.++ |...+ .+.+-|++ -..-|. ...|.. .-....+|.........+++.-. .++++ T Consensus 137 E~wi~~l~r~~k~~~~~~~~~~~~W~vvs~~Ei~---~~~~~~-------~~i~lPdG~~he~~~~~d~l~Rv--W~P~p 204 (639) T protein:vir:10 137 EVWIAVLIRQEKDPVTGLAAPRARWYAVTREEIK---SKAGET-------AEISLPDGKTHEFNRDLDSLVRI--WNPRP 204 (639) T ss_pred ceEEEEEEecCccccCcccccccceeeeeHHHhc---ccCCCe-------eEeecCCCCCccccCCCceEEEE--eCCCc Confidence 9998765 33333 23444443 222222 111111 00111122222112223444322 23455 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCH-----------------------HHHHHH Q lcl|NC_020081. 247 TVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSN-----------------------QALTSF 303 (552) Q Consensus 247 ~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~-----------------------~~~~~~ 303 (552) ....+--||+.+++..+.-...+.+...+..+.-.+-.|||.+|....+.. -+.+.| T Consensus 205 rr~~e~dSpvra~l~~l~Ei~~~t~~i~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l 284 (639) T protein:vir:10 205 RKASQATSPVRACLETLREIERTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQL 284 (639) T ss_pred ccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHH Confidence 556778899999999998888888877777777777777877753221110 112333 Q ss_pred HHHHH----HHhcccc-ccccceeeccCC----ceeeeccCch-hHHHHHHHHHHHHHHHHHHhcCCHHHh-cccccccc Q lcl|NC_020081. 304 RREWT----SMFSGIN-GAWKIPVITAED----VKFVNMTQSS-KDMEFEKWLNYLINVICSIYSIDPSEI-NFPNRGGA 372 (552) Q Consensus 304 ~~~~~----~~~~G~~-nagk~~il~~~g----~~~~~l~~~~-~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~ 372 (552) ...|- ..+..-+ .+--+||+...- -+++.+.+.. -+.--+.+++..+..||....|||..| |+. +++- T Consensus 285 ~~~l~qaa~tai~De~S~aA~vPiia~~p~E~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~-d~NH 363 (639) T protein:vir:10 285 ATMIYQASVAAMEDENSQAAYIPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMS-KGNH 363 (639) T ss_pred HHHHHHHHHhhhcCCCCccceeeeeEeechHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecc-cccc Confidence 33332 2232211 244567664321 2344444433 334457899999999999999999764 763 3443 Q ss_pred cccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcc----cc---cceeecccccChHHHHHH-HHHHHHHhc Q lcl|NC_020081. 373 TGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQ----FG---GDYVFNFVGGDAKTEAEI-ISILESKAK 444 (552) Q Consensus 373 ~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~----~~---~~~~~~f~~~d~~~~~~~-~~~~~~~~~ 444 (552) |.... -...-++..|.|.+..|+++|++.+|.. .| .+|.+.|+-.......+. .++.+.... T Consensus 364 WsAWq----------I~dedvrlHI~P~l~~icdAlT~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~dPd~~deA~qa~dr 433 (639) T protein:vir:10 364 WSAWA----------IGDEDVQLHIKPVMDLICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSDEAVEAHDR 433 (639) T ss_pred eEEEE----------ecccceeeecchhHHHHHHHHHhhHHHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHHHHHc Confidence 33221 1112345679999999999999887742 22 357788864432211111 122334456 Q ss_pred CCcCHHHHHHHhCCCCCCCCCeeecc-------------ccccchhhhccccccccccCCCCCccCcccCC-CCCCCCCC Q lcl|NC_020081. 445 IGLTINDIRKELGYPDTEGGDVTLAG-------------VHVQRLGQIMQQEQVEYQRQMDANQFLAQQTG-YDGNMDNV 510 (552) Q Consensus 445 g~lT~NE~R~~~gl~p~~ggD~~~~~-------------~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 510 (552) |.||-.-.|+.+|+.--.|=|+.-.+ -.+.+. ......+.....+ +..++.+ ...+.+.+ T Consensus 434 GAIt~eAlR~~lG~~edd~yd~~t~e~~~~~A~~~V~~~P~li~~---~apl~~P~lq~~e---~ptp~~a~~~a~~~~~ 507 (639) T protein:vir:10 434 GAITSAALRRLLNVGEDSGYDLTTLDGCREFAADVVTKNPELIAM---YAPLLSSQLAGIE---FPQPANAIESTREDEE 507 (639) T ss_pred CCccHHHHHHHhccccccCCCCCCcHHHHHHHHHHhcCCcchhhh---hhhccCccceecc---cCCCCCCCCCCCCCCC Confidence 99999999999999654332211110 011000 0000000000000 0000000 00000101 Q ss_pred CCCCCcccccCCCCccccccccccccccCcc----------------cccc-----ccccccC Q lcl|NC_020081. 511 NGKDSFNQNVGKDGQSKQQANTNSTPQGGKD----------------DNGN-----VVNDWEA 552 (552) Q Consensus 511 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~-----~~~~~~~ 552 (552) +.+++.+.+ +..++.++...+-++..-... ..|. ...+-.+ T Consensus 508 ~de~~ga~~-~~ePdte~~~~~~~a~~~~~~a~~v~a~~llv~RALelAGkRr~~~~~r~~~a 569 (639) T protein:vir:10 508 DDEDSGARQ-QREPQTEDERSTEEAASLNDRAAYLVAERLLVNRALDLAGKRRFKVNDAALKT 569 (639) T ss_pred cccccCCCC-CcCCCcccccCCccccCcCchhHHHHHHHHHHHHHHHhhcccccCCCChhhHH Confidence 000000000 011111111111111000000 0000 0000000 No 198 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=98.18 E-value=3e-06 Score=50.86 Aligned_cols=447 Identities=14% Similarity=0.096 Sum_probs=169.3 Q ss_pred chhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCc--------------hHH Q lcl|NC_020081. 13 QQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQ--------------NLL 78 (552) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~ 78 (552) .|+.|--.+ ... +..+ +.+..+.+...+... .+ +.-.....+ .-. T Consensus 1 ~~~~~~~~~---~~~-----~~~~-------~~~~~~~~~~~~~~~-----~~-~~~~~~~~~~i~~~i~~~~~~~~~r~ 59 (501) T protein:vir:96 1 MEQTLFTDS---TGQ-----ERVL-------NLRFHRESRIRYRAD-----NL-EELMVNNWELLKNFINHHKLRQAPRI 59 (501) T ss_pred Cceeeeeec---ccc-----eecc-------ccccchhHHhhhccc-----cc-ccccCChHHHHHHHHHHHHHHHHHHH Confidence 111110000 000 0000 000000000000000 00 000000000 001 Q ss_pred HHHHHhhcc--hHHHHHHHH---HHHH--HHHHHHHHHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCC Q lcl|NC_020081. 79 QMLKLWSRK--NIILNAIII---TRVN--QVSMFCTPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGR 149 (552) Q Consensus 79 ~~Lr~~a~~--~~i~~a~i~---~~~~--~~~~~~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~ 149 (552) +.|+++-++ ..++..-.. .+.+ .+.-+++.+....++ +|-.+.+.-.+.. ........+.+++.. T Consensus 60 ~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~~~---~~~~~~~~l~~~~~~--- 133 (501) T protein:vir:96 60 QELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDND---DNSQNDDAIKRIGRI--- 133 (501) T ss_pred HHHHHHhcCCCCcccCccccCccccccceeecchHHHHHHHHhhhhcccCeeEeeCCcc---chhHHHHHHHHHHHh--- Confidence 111112111 111100000 0000 001111222221111 1111111111100 001111223333322 Q ss_pred CCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCC--cccccccceeEEEEEcC--Cc Q lcl|NC_020081. 150 IDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDED--GKERKAKDGVRYVQVID--DK 225 (552) Q Consensus 150 ~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~y~~~~~--~~ 225 (552) -.+......+..+.+.+|.+|..+.++.+|.+ .+..++|..+.++.++. +.... .++|+.... +. T Consensus 134 -------n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~-~i~~~~p~~~~~v~d~~~~~~~~~---~v~~~~~~~~~~~ 202 (501) T protein:vir:96 134 -------NDLDSLNRTLIRDLSQTGRAYEVIYRSEYDET-RIKRLSPLETFVIYDNSLEDNSIA---AVRYYNRGTLQSA 202 (501) T ss_pred -------cCHHHHHHHHHHHHhhcCeEEEEEEEcCCCce-EEEEEccceeEEEEcCCCCCceEE---EEEEEEeecCCCc Confidence 13455777889999999999999999998875 47889999999888764 22211 122221111 11 Q ss_pred --eEEEEcccceeeecc-----------cc-----cCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE Q lcl|NC_020081. 226 --VVAKFKAKEMAWEVS-----------NP-----RTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLL 287 (552) Q Consensus 226 --~~~~~~~~evi~~~~-----------~~-----~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil 287 (552) ....++++.+.++.. |+ -..-.+...|.|.++.+...++....+..-..+.+...+.|-.++ T Consensus 203 ~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i 282 (501) T protein:vir:96 203 KDVVEIYTDEHIYTLDASDDFNEISVTTHAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAI 282 (501) T ss_pred EEEEEEEcCCcEEEEeeCCCceeccccccCCCccceEEecCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeee Confidence 011122222211100 00 000011246888888888888777777766666666667776555 Q ss_pred EeCCCCCC-CHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcc Q lcl|NC_020081. 288 HIKTGQEQ-SNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINF 366 (552) Q Consensus 288 ~~~~~~~~-s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~ 366 (552) . +.... ..+....++.. ..-.....+ .+-....+.+..-++....+..+....+.+.+.|+..-++|..-.+. T Consensus 283 ~--G~~~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~ 356 (501) T protein:vir:96 283 Y--GDLALPKGMQASDMKRT---RLMQLKPPK-SADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTN 356 (501) T ss_pred e--cccccCcccchhhhhhc---Ceeeecccc-cccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCccc Confidence 4 32111 11112222111 000011000 01112234455555555555667788889999999999999755542 Q ss_pred cccccccccccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcC-cccc-cceeecccccChHHHHHHHHHH Q lcl|NC_020081. 367 PNRGGATGHSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIV-SQFG-GDYVFNFVGGDAKTEAEIISIL 439 (552) Q Consensus 367 ~~~~t~~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~-~~~~-~~~~~~f~~~d~~~~~~~~~~~ 439 (552) .. ++ .++....+. .-....+..+..+|+-+++.+...++..-- ...+ ..+.+.|.+.-+.+.++.++++ T Consensus 357 ~~-~n---~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~ 432 (501) T protein:vir:96 357 FS-GN---TSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSIL 432 (501) T ss_pred cc-cc---chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHH Confidence 21 11 111111110 111222344555555555555444433211 1111 2466778777777777777666 Q ss_pred HHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCc-cc Q lcl|NC_020081. 440 ESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSF-NQ 518 (552) Q Consensus 440 ~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 518 (552) .+. .|+++..-+.+++++-.-+ + ..+..+...+ .+........+.. ...++.+. .. T Consensus 433 ~kl-~g~iS~et~~~~l~~v~D~--~--------~E~~ri~~E~-~~~~~~~~~~~~~-----------~~~~~~~~~~~ 489 (501) T protein:vir:96 433 TGL-GGQVSQETALSLSGLVESP--N--------EELDKINKEM-SEIDFKGYSNDFN-----------EHVGKYTDEVK 489 (501) T ss_pred HHH-hccCchHHHHHhCCCCCCH--H--------HHHHHHHHHH-HHhhccccccchh-----------hcccccCCcCC Confidence 554 4889987777776542110 0 0111111110 0000000000000 00000000 00 Q ss_pred ccCCCCccccccccccccccCcccccccccc Q lcl|NC_020081. 519 NVGKDGQSKQQANTNSTPQGGKDDNGNVVND 549 (552) Q Consensus 519 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 549 (552) +...| .||++-+ T Consensus 490 e~~~d-------------------~~e~~~~ 501 (501) T protein:vir:96 490 ETHTD-------------------DFEREYE 501 (501) T ss_pred CCCCC-------------------ccccccC Confidence 01111 1111111 No 199 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=98.17 E-value=3.1e-06 Score=50.77 Aligned_cols=427 Identities=11% Similarity=0.063 Sum_probs=145.5 Q ss_pred cccccchhhhhccccccccccccccccccccccccCCccc-c---cccCCCCchHHHHHHHhhcch-HH---HHHH-HHH Q lcl|NC_020081. 27 RIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFK-E---APSIHGKQNLLQMLKLWSRKN-II---LNAI-IIT 97 (552) Q Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~---~~~~~~~~~~~~~Lr~~a~~~-~i---~~a~-i~~ 97 (552) |. .+-.-|..++ ++ ....|-... ....-.. . .........-+..|..+-++- .+ .... -.. T Consensus 1 ~~--~~~~~~~~~~--~~-----~~~~p~~~~-~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~ 70 (501) T protein:vir:25 1 MT--VPVDVIADAP--AA-----DVEFPEDSM-SREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGASDEV 70 (501) T ss_pred Cc--ccchhhhccC--cc-----cccCCcccC-ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCChhh Confidence 00 0001111110 00 000010000 0000000 0 000000000011222221110 00 0000 000 Q ss_pred HH---HHHHHHHHHHHhhcc----ccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHH Q lcl|NC_020081. 98 RV---NQVSMFCTPARNSDK----GVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDR 170 (552) Q Consensus 98 ~~---~~~~~~~~~~~~~~~----~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ 170 (552) +. ..+.-+++.+....+ ..||.+ .+ +. ....+..++.. | .+......+..+. T Consensus 71 ~~~~~~~v~n~~~~ivd~~a~~l~~~gf~~--~d--~~-------~~~~l~~i~~~-------N---~~d~~~~~~~~~a 129 (501) T protein:vir:25 71 KELAKLSVKNVLSLVRDSFAQNLSVVGYRN--AL--AK-------ENDPAWEMWQR-------N---RMDARQAEVHRPA 129 (501) T ss_pred hhhHhhhhcChHHHHHHHHHhhhcccceec--CC--cc-------chHHHHHHHHh-------c---ChhHHHHHHHHHH Confidence 00 011112222222221 122211 11 11 11223344332 1 2334556788899 Q ss_pred HhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCC------------ceEEEEcccc---- Q lcl|NC_020081. 171 LTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDD------------KVVAKFKAKE---- 234 (552) Q Consensus 171 ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~------------~~~~~~~~~e---- 234 (552) +++|.+|+.+.++..|. .+..++|..+.++.++..........++|+..... .....+...+ T Consensus 130 ~i~G~ay~~v~~de~~~--~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~ 207 (501) T protein:vir:25 130 LTYGASYVTVTPTDEGP--VFRTRSPRQILAVYADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLG 207 (501) T ss_pred hhcCceEEEEecCCCCC--eEEEeccccEEEEEecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceeee Confidence 99999999999988874 46678898888765332211111111222111110 0011111000 Q ss_pred ----------------------------------eeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_020081. 235 ----------------------------------MAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQG 280 (552) Q Consensus 235 ----------------------------------vi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng 280 (552) ++++. .......+|.|-++.+...++....+.........-. T Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~----N~~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~ 283 (501) T protein:vir:25 208 DAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFV----NGRDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFG 283 (501) T ss_pred eccccccccccccccccccccccccccCCccceeeEecc----CccccCccccchhhhhHHHHHHHHHHHHHHHHHHHhh Confidence 11111 1111124588877666655555555444443344333 Q ss_pred CCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHH-HHHHHHHHHHHHHHHhcC Q lcl|NC_020081. 281 GTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDME-FEKWLNYLINVICSIYSI 359 (552) Q Consensus 281 ~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgV 359 (552) +.|.-++. + ...++ .+. |+. . .+++.++.+++.++.++.. .+++ |++..+..+..|+..-++ T Consensus 284 a~p~~~i~--G-~~~~~--~~~----~~~-~-----~~~i~~~~~~~~~~~q~~~--~~~~~~~~~l~~~i~~i~~~s~~ 346 (501) T protein:vir:25 284 ANPQRVIS--G-WTGSK--AEV----LKA-S-----ALRVWTFEDPEVKAQAFPP--ASVEPYNLILEEMLQHVAMVAQI 346 (501) T ss_pred ccHHHHHh--C-CCCCc--cch----hhh-c-----ccceeccCCCCceEEEecc--cChHHHHHHHHHHHHHHHhhcCC Confidence 44533332 2 22222 222 211 1 2334444455677765543 3334 889999999999999999 Q ss_pred CHHHhcccccccccccccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHH Q lcl|NC_020081. 360 DPSEINFPNRGGATGHSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAE 434 (552) Q Consensus 360 Pp~~lg~~~~~t~~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~ 434 (552) |++.+|..... .++....+. +..+..+..+...|+-+++.+....+.. .......+.+.|....+.+.++ T Consensus 347 P~~~~~~~~~N----~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~-~~~~~~~i~v~w~~~~~~s~~~ 421 (501) T protein:vir:25 347 SPAQVTGKMIN----VSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDP-DTAADSGAEVLWRDTEARSFGA 421 (501) T ss_pred ChhhhccccCC----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-ccccceeeeEEecCCCCCCHHH Confidence 99998743211 111111110 0001111112222222222221111100 0001123556666666666666 Q ss_pred HHHHHHHHhcCCcCHHHHHHH-hCCCCCCCCCeeeccccccchhhhccccccccc-cCCCCCccCcccCCCCCCCCCCCC Q lcl|NC_020081. 435 IISILESKAKIGLTINDIRKE-LGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQ-RQMDANQFLAQQTGYDGNMDNVNG 512 (552) Q Consensus 435 ~~~~~~~~~~g~lT~NE~R~~-~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 512 (552) .++++.+.....++.-.+..+ .|+.+-+ + ..+......+..... ...-..++.+..+...+....++. T Consensus 422 ~ada~~kl~~~gis~et~~~~~~g~~~~~---i-------e~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (501) T protein:vir:25 422 VVDGITKLASAGIPIEHLLSMVPGMTQQT---I-------QAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALN 491 (501) T ss_pred HHHHHHHHHhcCCCHHHHHHHcCCCCHHH---H-------HHHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCCCccccc Confidence 666655443333555444433 4665411 0 000000000000000 000000000000000000000000 Q ss_pred CCCcccccCCCCccccccccccccccCc Q lcl|NC_020081. 513 KDSFNQNVGKDGQSKQQANTNSTPQGGK 540 (552) Q Consensus 513 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 540 (552) ++ +..+.+ +. T Consensus 492 ~~---~~~~~~---------------g~ 501 (501) T protein:vir:25 492 EG---GVNGNG---------------GA 501 (501) T ss_pred cc---cCCCCC---------------CC Confidence 00 000000 00 No 200 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=98.14 E-value=3.7e-06 Score=50.35 Aligned_cols=395 Identities=11% Similarity=0.062 Sum_probs=156.0 Q ss_pred hhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhc-chHHHHHHHHHHHHHHHHHH----- Q lcl|NC_020081. 33 EDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSR-KNIILNAIIITRVNQVSMFC----- 106 (552) Q Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~-~~~i~~a~i~~~~~~~~~~~----- 106 (552) =.-|++.+.+.... . .++.++.-++ ....+..++......+.++- T Consensus 1 ~~~~~~~~~~~~~~-----------~------------------~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Y 51 (478) T protein:vir:10 1 MISINWPWDKPYHE-----------Q------------------VVEQIKPKYETQEEMILRLVREHKENIDNITMGERY 51 (478) T ss_pred CccccccCCchhhh-----------H------------------HHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00011111111100 0 0011111000 00111222222111111111 Q ss_pred ----------------------------------HHHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCCC Q lcl|NC_020081. 107 ----------------------------------TPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRI 150 (552) Q Consensus 107 ----------------------------------~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~ 150 (552) +.+....++ +|-.+.+... +......+..++. T Consensus 52 y~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~-------~~~~~~~l~~~~~----- 119 (478) T protein:vir:10 52 YNHHPDILDAPFKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVD-------NDKALKQIQHTLN----- 119 (478) T ss_pred hcccccccccchhhhcccccccccccceeccchHHHHHHHHhhhhcccCceeecC-------ChHHHHHHHHHHh----- Confidence 111111111 1111111110 1112223333332 Q ss_pred CCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCC--cccccccceeEEEEEcCCceEE Q lcl|NC_020081. 151 DNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDED--GKERKAKDGVRYVQVIDDKVVA 228 (552) Q Consensus 151 ~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~y~~~~~~~~~~ 228 (552) ..+......+..+.+.+|.+|..+-.+.+|++ .+..++|..+.++.++. +.... .++|+...+..... T Consensus 120 ------n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~~~~~~p~~~~~v~d~~~~~~~~~---~ir~~~~~~~~~~~ 189 (478) T protein:vir:10 120 ------HKWDDKLVDILTAASNKGIEWVQPYVDEEGEF-KTFRVPAEQAVPIWTNKERDELQA---FIRVYELDGAERVE 189 (478) T ss_pred ------ccHHHHHHHHHHHHhhCCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEE---EEEEEeeeCceEEE Confidence 13445666778999999999999989988875 47789999998877542 22211 22333322222222 Q ss_pred EEcccceeeecc--------------------------ccc-----CCccCCcccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 229 KFKAKEMAWEVS--------------------------NPR-----TDLTVGKYGYPELEIALNHLQYHDNTEVFNARFF 277 (552) Q Consensus 229 ~~~~~evi~~~~--------------------------~~~-----~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f 277 (552) .+.++.+.+.+. |+- ..-.+...|.|-++.+...++....+..-..+.+ T Consensus 190 ~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~ 269 (478) T protein:vir:10 190 YWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTF 269 (478) T ss_pred EEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcceEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHH Confidence 233333322111 000 0000123577888777777777666655555555 Q ss_pred hccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec-cCCceeeeccCchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 278 AQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT-AEDVKFVNMTQSSKDMEFEKWLNYLINVICSI 356 (552) Q Consensus 278 ~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~ 356 (552) ...+.|-.++. + ...++ ...+...+.. . +...+. .+|.++..+........+.+..+.+.+.|... T Consensus 270 ~~~~~~~~~~~--g-~~~~~--~~~~~~~~~~-------~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~ 336 (478) T protein:vir:10 270 DESVELIYILK--G-YEGED--MKDFMHNLKY-------Y-KAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEF 336 (478) T ss_pred HHhhCcceeee--c-CCccc--ccchhhhhhh-------C-ceeEecCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHH Confidence 55556654443 3 22121 1112222111 1 122231 23333333444455667778888999999999 Q ss_pred hcCCHHHhcccccccccc-cccccccch--h---HHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecccccCh Q lcl|NC_020081. 357 YSIDPSEINFPNRGGATG-HSGNTLNEG--S---SAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNFVGGDA 429 (552) Q Consensus 357 fgVPp~~lg~~~~~t~~~-~~~~~~~~~--n---~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~d~ 429 (552) -++|..-.+ ++++ .++....+. . --......+..+|+-+++.|...++ ...+ ..+.+.|.+.-+ T Consensus 337 s~~p~~~~~-----~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~----~~~d~~~i~i~f~~~~p 407 (478) T protein:vir:10 337 GQGVDFQQD-----KFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYR----LDVRVQDIEITFNFNVM 407 (478) T ss_pred hCCcCcCcc-----ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC----CCcccccceEEeCCCCC Confidence 999853221 1111 111110000 0 0111222333344444333333221 1222 346677877767 Q ss_pred HHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCC Q lcl|NC_020081. 430 KTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDN 509 (552) Q Consensus 430 ~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (552) .+..+.++++... .|++|.--+.++++.- +.-+ ..+..+...+.. .... ......+. .+.+. T Consensus 408 ~~~~e~~~~~~~~-~g~iS~et~i~~~~~v--~d~~--------~E~~ri~~E~~~-~~~~-----~~~~~~~~-~d~~~ 469 (478) T protein:vir:10 408 VNELENSQIAMNS-TGLLSKETILGNHSWV--QDPV--------AEMERIEQENIE-LNQQ-----LPDIEEGL-NDEQQ 469 (478) T ss_pred CCHHHHHHHHHHH-hCCCChHHHHHhCCCC--CCHH--------HHHHHHHHHHHH-HHHh-----ccccCCCC-ccccc Confidence 6666666665543 6888877777766541 1100 111111111100 0000 00000000 00000 Q ss_pred CCCCCCccc Q lcl|NC_020081. 510 VNGKDSFNQ 518 (552) Q Consensus 510 ~~~~~~~~~ 518 (552) +.+++.... T Consensus 470 ~~~~d~~~e 478 (478) T protein:vir:10 470 RQSEDNQSE 478 (478) T ss_pred ccCcCCCCC Confidence 000000000 No 201 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.14 E-value=3.8e-06 Score=50.28 Aligned_cols=418 Identities=11% Similarity=0.076 Sum_probs=165.0 Q ss_pred hhcccccc-ccccccccccccccccccCCcccccccCCCCchHH---HHHHHhhcc-hHHHHHHHHHHH------HHH-H Q lcl|NC_020081. 36 ILKKGKNT-KSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLL---QMLKLWSRK-NIILNAIIITRV------NQV-S 103 (552) Q Consensus 36 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~Lr~~a~~-~~i~~a~i~~~~------~~~-~ 103 (552) .+..+.+. |..-.+........ .....+-+.-++... ...+++-++ ..++........ ..+ . T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~ 74 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALK------DVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSM 74 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhH------HHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeec Confidence 22222111 11111110000000 000011111111111 111222111 111110000000 000 0 Q ss_pred HHHHHHHhh----ccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEE Q lcl|NC_020081. 104 MFCTPARNS----DKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFE 179 (552) Q Consensus 104 ~~~~~~~~~----~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~ 179 (552) .+++.+... .-+-+..+.. .+.+....+.+++.. -.+..-+..++.+.+.+|.+|+. T Consensus 75 n~~k~i~~~~a~~l~~~p~~i~~---------~d~~~~e~l~~~~~~----------n~f~~~~~~~~~~a~~~G~~~~~ 135 (496) T protein:vir:38 75 NLPKVTAKYMSKLLFNEKVKINI---------DDKAAEEFVLNVLKT----------NGFTKNMERYIEYGEAMGGFVIK 135 (496) T ss_pred chHHHHHHHHhhhhhCCcceEee---------CChHHHHHHHHHHhc----------cCHHHHHHHHHHHHhhhCcEEEE Confidence 011111111 1111111211 112233334444432 13556677788899999999999 Q ss_pred EEECCCCCEEEEEEecCceeEEEECCCcccccc-------cceeEEEE----Ec-CCceE---------------EEEcc Q lcl|NC_020081. 180 LVYDKLGDLHNFKAVDASTVYVAVDEDGKERKA-------KDGVRYVQ----VI-DDKVV---------------AKFKA 232 (552) Q Consensus 180 i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~-------~~~~~y~~----~~-~~~~~---------------~~~~~ 232 (552) +..|.+|.+ .+..++|..+.++..+.+.+... ..+.+|.. .. ++... ..++. T Consensus 136 ~~~D~~~~~-~i~~v~~~~~~P~~~~~~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~ 214 (496) T protein:vir:38 136 VYHDGNKNV-KVSFATADCMYPLSNDSENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSL 214 (496) T ss_pred EEEcCCCcE-EEEEEcccceEEEEecCCcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccc Confidence 999988875 47888999888766655543110 01111110 00 00000 00000 Q ss_pred ---------c------ceeeecc--cccCC--ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC Q lcl|NC_020081. 233 ---------K------EMAWEVS--NPRTD--LTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQ 293 (552) Q Consensus 233 ---------~------evi~~~~--~~~~~--~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~ 293 (552) . +...|.+ ++..+ ....++|+|.+.-+...++....+..-..+-|..| .+..++ +... T Consensus 215 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~-~~~i~v--~~~~ 291 (496) T protein:vir:38 215 TLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLG-KKKVLV--PSSF 291 (496) T ss_pred cccccccccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhc-ccceec--chHH Confidence 0 1111111 11111 22347899999999998888877766666666653 333222 1100 Q ss_pred -----CCCHHHHHHHHHHHHHHhccccccccceee--ccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcc Q lcl|NC_020081. 294 -----EQSNQALTSFRREWTSMFSGINGAWKIPVI--TAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINF 366 (552) Q Consensus 294 -----~~s~~~~~~~~~~~~~~~~G~~nagk~~il--~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~ 366 (552) ..+.+.. ..| .......+.... .+++..++.+........+.+..+...++|+...|+||..+|. T Consensus 292 l~~~~~~~g~~~----~~~----~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~ 363 (496) T protein:vir:38 292 VKTAVNLDGSTT----QYF----DSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTF 363 (496) T ss_pred hhccCCCCCccc----cCC----CCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCC Confidence 0000000 000 000000000000 1122345666666666778888999999999999999999987 Q ss_pred cccccccccccccc---cchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcC-c---ccccceeecccccChHHHHHHHH-H Q lcl|NC_020081. 367 PNRGGATGHSGNTL---NEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIV-S---QFGGDYVFNFVGGDAKTEAEIIS-I 438 (552) Q Consensus 367 ~~~~t~~~~~~~~~---~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~-~---~~~~~~~~~f~~~d~~~~~~~~~-~ 438 (552) ...+..++....+. .++.. ......++.+|..+++.+-+..+.... . .....+.+.|.+.-+.+..+.++ . T Consensus 364 ~~~g~~tAtei~~~~~~l~~~~-~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~ 442 (496) T protein:vir:38 364 DENGLKTATEVVSEKSETYQTK-NSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRY 442 (496) T ss_pred CccccchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHH Confidence 54332111110000 11111 112333455566665555543332211 1 11234667776655554444333 3 Q ss_pred HHHHhcCCcCHHHHHHHh-CCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCC Q lcl|NC_020081. 439 LESKAKIGLTINDIRKEL-GYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDS 515 (552) Q Consensus 439 ~~~~~~g~lT~NE~R~~~-gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (552) .+.+.+|+||.-.++..+ |.... ... ..+..+.. +.... -+...... ..|+++ T Consensus 443 ~~~~~~GiiS~et~l~~~~~~~d~-ea~--------~el~ri~~----E~~~~-~~~~d~~~----------~~~~~e 496 (496) T protein:vir:38 443 TNAKNQGMIPLKIALQRAWNITEA-EAD--------EWAEMLAK----EKQAE-MPNNDMNG----------IFGEEE 496 (496) T ss_pred HHHHhcCCCCHHHHHHhcCCCChH-HHH--------HHHHHHHH----hhhcc-CccccccC----------CCCCCC Confidence 344457899988887543 33211 000 00111111 00000 00000000 001110 No 202 >protein:vir:106027 Length: 629 # NCBI annotation: gp9 # Family: family:all:2798 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654906;genbank:gi:109392362;genbank:GeneID:4157055 Probab=98.08 E-value=5.2e-06 Score=49.54 Aligned_cols=464 Identities=13% Similarity=0.098 Sum_probs=194.8 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHH-------HHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNII-------LNAIII 96 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i-------~~a~i~ 96 (552) |-+.+--+..-+. ..+ . .+.-.+.++|.. |+-....+..+.. .-..|....+- ++..+. T Consensus 1 ma~~~lrv~rrpk--~~p-~-~r~l~aasqp~~------P~~~~~~~~~g~~----~~~~WQ~eAW~~~d~VgElryyvg 66 (629) T protein:vir:10 1 MAASTLRVSRRPK--GSP-A-RRSLTAASQPME------PGRTPSRQVAGTV----VRTSWQNEAWECMDLVGELRYYVG 66 (629) T ss_pred CCccceeEEecCC--Ccc-c-eeeeccccCCCC------cchhhchhhhhhh----hhhhhhHHHHHHHHhhhhHHHHhh Confidence 2111111111100 000 0 111223333321 1111111111110 00011111111 112222 Q ss_pred HHHHHHHHHHHHHHhhccccceeeeecccc-----ccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHH Q lcl|NC_020081. 97 TRVNQVSMFCTPARNSDKGVGYEIRLKDPL-----QEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRL 171 (552) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~-----~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~l 171 (552) =+.++++++. +.....|++ +.+.+.+. ....+.+.++.+. .. .+-..++++.+..++- T Consensus 67 W~~ss~Sr~r-----------L~as~idpDtg~ptg~i~ed~p-~~~~v~~~v~~ia--gG---~lGqaqLlkr~~~~lt 129 (629) T protein:vir:10 67 WRASSCSRVE-----------LIASELDPDTGKPTGGIRDDDP-DGLRFLEIVKTMA--GG---PLGQAQLQKRAAECLT 129 (629) T ss_pred hhhhhheeee-----------EEEeeecCCCCCCccccccCch-hHHHHHHHHHHhc--Cc---cchHHHHHHHHHhhee Confidence 2233332211 111111211 11111111 1122334444432 12 2445678999999999 Q ss_pred hcCCeeEEEEECCC----CCEEEEEE-ecCceeEEEECCCcccccccceeEEEEEcCCce-EEEEcccceeeecccccCC Q lcl|NC_020081. 172 TYDKINFELVYDKL----GDLHNFKA-VDASTVYVAVDEDGKERKAKDGVRYVQVIDDKV-VAKFKAKEMAWEVSNPRTD 245 (552) Q Consensus 172 l~Gna~~~i~r~~~----G~~~~L~~-l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~-~~~~~~~evi~~~~~~~~~ 245 (552) +-|..|+.|+--.. |-+..-|+ |...-|. .+|. +.. .....++. -......|+++-..+ ++ T Consensus 130 V~GE~~i~il~~~~~~pd~~~r~~W~vVt~~Ei~----~kg~------g~~-~i~lpdg~~he~~~~~D~l~RvW~--P~ 196 (629) T protein:vir:10 130 VPGEHRICLLDQGDKNPDGSVRHNWYVVTNDEVK----NKGA------GKT-DIELPDGTIHEYSKGRDVMFRVWN--PR 196 (629) T ss_pred ccCceEEEEeecCCCCCCcccccceeeecHHHhc----cccC------cee-EEEcCCCceeeeeCCCeeEEEeeC--CC Confidence 99999999884333 33443333 3333322 1111 111 12222333 333345566544433 34 Q ss_pred ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCC--------------------HHHHHHHHH Q lcl|NC_020081. 246 LTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQS--------------------NQALTSFRR 305 (552) Q Consensus 246 ~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s--------------------~~~~~~~~~ 305 (552) +.....--||+.+++..+.-...+.+...+..+.-.+-.|||.+|....+. .-+.+.|.. T Consensus 197 Prr~~e~DSpvra~l~~lrEi~r~tk~i~~aakSRL~gnGvlflP~e~slp~~~ap~~~~~Pg~~~p~~~g~aa~d~l~~ 276 (629) T protein:vir:10 197 PRRAKEPDSPVRACLDSLREIIRTTKKIRNASKSRLIGNGVVFLPQELSLPRATAPVADNQPGAPVPIVDGVAAADELSN 276 (629) T ss_pred cccccCCcchhHHHHHHHHHHHHhhhHhHHHHHhHHhhCceeEeccCcccccccCCCCCCCCcccccccCCCcchHHHHH Confidence 555567789999999999888888887777777777777787765322211 012333433 Q ss_pred HHH----HHhcccc-ccccceeeccC-C---ceeeeccCch-hHHHHHHHHHHHHHHHHHHhcCCHHHh-cccccccccc Q lcl|NC_020081. 306 EWT----SMFSGIN-GAWKIPVITAE-D---VKFVNMTQSS-KDMEFEKWLNYLINVICSIYSIDPSEI-NFPNRGGATG 374 (552) Q Consensus 306 ~~~----~~~~G~~-nagk~~il~~~-g---~~~~~l~~~~-~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~ 374 (552) .|- ..+..-+ .+--+||+... | -+++.+.+.. -+.--+.+++..+..+|....|||..| |+..+++-|+ T Consensus 277 ~l~q~a~aAi~De~S~aA~vPiia~vP~E~l~~ikhLkf~~eite~~iktR~daI~RlAmglDispErLLGlGsd~NHWs 356 (629) T protein:vir:10 277 LLFQTAAAAVDDEDSQAALIPLLATVPGEHLQKIFHLKIGNEITEVEIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWS 356 (629) T ss_pred HHHHHHHhhhcCCCCccceeeeEEeechHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCChhheeeccCCcccee Confidence 332 2232211 24446666421 1 2344444433 334456899999999999999999764 7755555444 Q ss_pred cccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcc----cc---cceeeccccc----ChHHHHHHHHHHHHHh Q lcl|NC_020081. 375 HSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQ----FG---GDYVFNFVGG----DAKTEAEIISILESKA 443 (552) Q Consensus 375 ~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~----~~---~~~~~~f~~~----d~~~~~~~~~~~~~~~ 443 (552) ...= ...-++-.|.|.+..|+++|++.+|.. .+ .+|.+.|+-. |+.-..+..+ ... T Consensus 357 AWqI----------~dedvrlHI~P~l~~ic~Ait~~~Lrp~L~~eGiDp~~Yvvw~DaS~Lt~dPd~~deA~~---a~d 423 (629) T protein:vir:10 357 AWQI----------GDEDVQLHIKPVMEVLCAAIYREVLVATLRAEGIDPDRYVLWYDASGLTVDPDKTDEATA---AKE 423 (629) T ss_pred eEEe----------cccceeeecchHHHHHHHHHHhHHHHHHHHHhCCCHHHhEeeecCcccccCCCCcHHHHH---HHH Confidence 3211 111245668999999999999887742 22 3577777533 3332333333 344 Q ss_pred cCCcCHHHHHHHhCCCCCCCCCe-------------eeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCC Q lcl|NC_020081. 444 KIGLTINDIRKELGYPDTEGGDV-------------TLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNV 510 (552) Q Consensus 444 ~g~lT~NE~R~~~gl~p~~ggD~-------------~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (552) .|.||-...|+.+|+.--.+=|. ...+-.+++.-... ... .- ++.. ...+ +....+.+.. T Consensus 424 rGaIt~eAlRr~lG~~~dd~y~~~t~~~~q~~A~~~v~~~P~Li~~~apl---l~~-~l-~~i~-~P~p-~~a~~~~~~~ 496 (629) T protein:vir:10 424 QGAITHEAYRRYLGLADEDGYDLETLEGAQAWARDAIVADPSLIKVLAPL---LTD-EL-AEID-WPEP-PAALPPGEDD 496 (629) T ss_pred cCCccHHHHHHHhccccccCCCcCCcHHHHHHHHHHhcCCCchhhhhhhh---cCC-cc-cccc-ccCC-CCcCCCCCcc Confidence 69999999999999965433221 11111111100000 000 00 0000 0000 0001111111 Q ss_pred CCCCCcccccCCCCc-cccccccccccccC-------------cccccc---ccccccC Q lcl|NC_020081. 511 NGKDSFNQNVGKDGQ-SKQQANTNSTPQGG-------------KDDNGN---VVNDWEA 552 (552) Q Consensus 511 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~-------------~~~~~~---~~~~~~~ 552 (552) +.+++...+ +.+++ +++.+...+-.... =+..|. .++|-.. T Consensus 497 ~~~~E~~~~-~~e~~~e~dA~~a~~~~~~aa~~~A~rllv~RALelAGkRl~~~rdR~~ 554 (629) T protein:vir:10 497 QADEEQDTT-GSEPSTEDDAEAAARISSVADMVLAERLLTVRALGLAGKRRVNTNDRAQ 554 (629) T ss_pred cCccccCCC-CCCcCCCcchhhcccCCchhhHHHHHHHHHHHHHHHccccccCCCchhh Confidence 111111111 11111 11111110000000 000111 1122111 No 203 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=97.96 E-value=3.7e-07 Score=55.86 Aligned_cols=190 Identities=16% Similarity=0.153 Sum_probs=85.0 Q ss_pred EEEeCCCCC-CCHHHHHHHHHHHH--HHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHH Q lcl|NC_020081. 286 LLHIKTGQE-QSNQALTSFRREWT--SMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPS 362 (552) Q Consensus 286 il~~~~~~~-~s~~~~~~~~~~~~--~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~ 362 (552) |+++++-.. ++.. -..+++++. ..+.|..+ ..++.+++-+|..++.+... +.+......+.||++-|||.. T Consensus 1 V~k~~~l~~~~~~~-~~~~~~r~~~~~~~~~~~~---~~~ld~~~e~~e~~~~~lsG--l~d~l~~~~~~iaa~s~iP~t 74 (201) T protein:vir:10 1 MWKAKGLADLCDDS-DGAARLRLAQVDNNSGVGQ---AIGIDADSEEYNVLNSDIGG--IDTFLSQKFDRIVALSGIHEI 74 (201) T ss_pred CccchHHHHHhcCC-hHHHHHHHHHHHHhhhhhh---hheeecCCcceeeeecCcCC--hHHHHHHHHHHHHhHhcCchh Confidence 555443111 1110 112333332 33444322 23455555566666555543 335667888999999999998 Q ss_pred HhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeeccccc---ChHHHHHHHH-- Q lcl|NC_020081. 363 EINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGG---DAKTEAEIIS-- 437 (552) Q Consensus 363 ~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~---d~~~~~~~~~-- 437 (552) .|--...+++.+++... ..|.-+.....-+..|+|.++++-..+ .. ..++.|+|... +.+++++... T Consensus 75 ~LfG~sp~Glnatge~d--~~nyyd~i~~~Qe~~l~p~le~l~~~~----~~--~~~~~~~f~pL~~~s~kekAei~~~~ 146 (201) T protein:vir:10 75 ILKGKNVGGVSASQNTA--LETFYGYVDRKRKAELLPLLEFLLPFI----VT--EQEWSVEFNPLSQVSDKDKSEILEKN 146 (201) T ss_pred hhcCCCCccccccchhH--HHHHHHHHHHHHHHHHHHHHHHHHHhh----cC--CCCceEeeCCCCCCCHHHHHHHHHHH Confidence 87444555554333211 112222333333456777777654422 22 23566666443 3334443322 Q ss_pred --HH-HHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCC Q lcl|NC_020081. 438 --IL-ESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKD 514 (552) Q Consensus 438 --~~-~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (552) ++ ..+.+|+++++|+|+.|--.+..++ .+-..+. ....... ..+ +++.+.. T Consensus 147 a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~----~~~~~~~---------------~~~~~~e----~~d-p~~~~~~-- 200 (201) T protein:vir:10 147 VNSVAALIAAGIIDADEARDTLRAISTEVK----IGEGSIQ---------------TEVVINE----SED-PLDVSAN-- 200 (201) T ss_pred HHHHHHHHHcCCCCHHHHHHHHHhcCCcCC----CCCCCCC---------------ccccccc----cCC-CCCCCCC-- Confidence 22 2234689999999998755443221 0000000 0000000 000 0000000 Q ss_pred Ccc Q lcl|NC_020081. 515 SFN 517 (552) Q Consensus 515 ~~~ 517 (552) + T Consensus 201 --~ 201 (201) T protein:vir:10 201 --N 201 (201) T ss_pred --C Confidence 0 No 204 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=97.88 E-value=1.3e-05 Score=47.36 Aligned_cols=414 Identities=8% Similarity=-0.028 Sum_probs=147.9 Q ss_pred ccccchhhhhccccccccccccccccccc----c---ccccCCccccc--ccCCCCchHHHHHHHhhc-chHHHHHHHHH Q lcl|NC_020081. 28 IKQIEEDAILKKGKNTKSNKPKAYEEPII----G---SMSMNPDFKEA--PSIHGKQNLLQMLKLWSR-KNIILNAIIIT 97 (552) Q Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~---~~~~~~~~~~~--~~~~~~~~~~~~Lr~~a~-~~~i~~a~i~~ 97 (552) +-++.-.+|.+ + ..-...--+... . +....-.|+.. +...-.......+|.+.- .++ .+-|+. T Consensus 1 ~~~~~~~~~~g-l----~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw-~~~~Vd- 73 (474) T protein:vir:81 1 MIQQQTVRIPS-L----SNDENALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGW-TGKAVD- 73 (474) T ss_pred CcCCCcCcCCC-C----ChhHHHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcCh-HHHHHH- Confidence 11111111110 0 000000000000 0 00000011100 000000011122222210 111 111221 Q ss_pred HHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCee Q lcl|NC_020081. 98 RVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKIN 177 (552) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~ 177 (552) .+......-||.+ .+ .... ...+.+++... .+......+..+.|+||.+| T Consensus 74 ----------~~a~rl~~~Gf~~--~d--~~~~------~~~l~~iw~~N----------~ld~~~~~~~~~al~~G~sf 123 (474) T protein:vir:81 74 ----------ALARRCNLEGFVW--PD--GDLD------SLGGTEVVDDN----------HLLSEIDSAIVAAMQHGPAF 123 (474) T ss_pred ----------HHHhhhcccceEC--CC--CCcc------chHHHHHHHhc----------ChhHHHHHHHHHHHhhCcee Confidence 1122222334432 11 1111 12344555432 22345667788999999999 Q ss_pred EEEEECCCCCE-EEEEEecCceeEEEECCCcccccccceeEEEEEcCCceE--EEEcccc-------------------- Q lcl|NC_020081. 178 FELVYDKLGDL-HNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVV--AKFKAKE-------------------- 234 (552) Q Consensus 178 ~~i~r~~~G~~-~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~--~~~~~~e-------------------- 234 (552) +.|.++.+|.+ ..+.+++|..+.++.|+..+....-.. ++....++... ..|.++. T Consensus 124 ~~V~~~~d~~~~~~i~~~sp~~~~~~~D~~~~~~~~al~-~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~ 202 (474) T protein:vir:81 124 LINTVGEDDEPEALIHVKDASEATGEWNRRRRGLNNLLS-IIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEH 202 (474) T ss_pred EEEecCCCCCceeEEEEeccceEEEEEeCCCCcceeeeE-EEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCC Confidence 99998777764 457789999998877654332111000 01011111100 1111111 Q ss_pred -----eeeecccccCCccCCcccccHH----HHHHHHHHHHHHHHHHHHHHHhccCCCceEEE-eCCCCCC--CHHHHHH Q lcl|NC_020081. 235 -----MAWEVSNPRTDLTVGKYGYPEL----EIALNHLQYHDNTEVFNARFFAQGGTTRGLLH-IKTGQEQ--SNQALTS 302 (552) Q Consensus 235 -----vi~~~~~~~~~~~~g~~G~spl----~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~-~~~~~~~--s~~~~~~ 302 (552) |+++..+++ ..+.+|.|.| ..+.+.+.....-......||. .|.-.+. +...... +...... T Consensus 203 ~~gvPvV~~~n~~~---~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a---~pqr~i~G~~~~~~~d~d~~~~~~ 276 (474) T protein:vir:81 203 VYGVPAQVLPYKPA---PKRPFGQSRITKPMMGLQDAGVRELARREGHMDVFS---YPEFWLLGADESALKNADGTIKSV 276 (474) T ss_pred CCCcceEEeccccc---ccCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhc---chhheeecCChhhcccccccccch Confidence 233332222 2345788755 3344444433333344445553 4443332 2110000 0111223 Q ss_pred HHHHHHHHhccccc-cccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhccccccc-ccccccccc Q lcl|NC_020081. 303 FRREWTSMFSGING-AWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGG-ATGHSGNTL 380 (552) Q Consensus 303 ~~~~~~~~~~G~~n-agk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t-~~~~~~~~~ 380 (552) ++..+...+.-..+ .+. +....++++-++....- .-|++..+..+..||+.-++|++.||+...++ .++ ... T Consensus 277 ~~~~~~~i~~~~~d~d~~--~~~~~~~~~~q~~~a~l-~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~Sa---eAi 350 (474) T protein:vir:81 277 WEARLGRIKGLPDDADAD--IPQLARADVKQFPAASP-DAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSA---ESY 350 (474) T ss_pred hhhhHHHHhcCCCccccc--ccccccccccccCCCCh-hHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHH---HHH Confidence 44444333211111 122 22223456665554332 23889999999999999999999999643111 100 000 Q ss_pred cchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc--Cc-----cc---ccceeecccccChHHHHHHHHHHHHH-hcC--Cc Q lcl|NC_020081. 381 NEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI--VS-----QF---GGDYVFNFVGGDAKTEAEIISILESK-AKI--GL 447 (552) Q Consensus 381 ~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L--~~-----~~---~~~~~~~f~~~d~~~~~~~~~~~~~~-~~g--~l 447 (552) ...++....-.+..-+-+-..|++.+-..+ .. +. ...+.+.|-..+..+.++.+..+.+. .+| +. T Consensus 351 --~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~ 428 (474) T protein:vir:81 351 --DASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLA 428 (474) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCC Confidence 011111111111111111111222221111 01 11 12344455444444555555444333 333 33 Q ss_pred CHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCC Q lcl|NC_020081. 448 TINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDN 509 (552) Q Consensus 448 T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (552) +-.=+++++|+.+-+- ..+......+.....-.. .. ..+.++.+.. T Consensus 429 ~~~~~~~~lg~t~~~i----------~~~~~~~~~~~~~~~~~~-----l~-~~~~~~~~aq 474 (474) T protein:vir:81 429 ETEVGLELIGLTPQQA----------RRAMADKRRVQGRGTLQA-----LI-DRSNNGATAQ 474 (474) T ss_pred cHHHHHhhcCCCHHHH----------HHHHHHHHHHhHHHHHHH-----HH-hcCCCCCCCC Confidence 4455677888865310 000000000000000000 00 0000000000 No 205 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=97.84 E-value=1.6e-05 Score=46.91 Aligned_cols=465 Identities=14% Similarity=0.132 Sum_probs=192.6 Q ss_pred Ccccccccccchhhhhcccccccccccccc---ccccccccccCCcc-cccccCCCCchHHHHHHHhhcchHHHHHHHHH Q lcl|NC_020081. 22 DDMAVRIKQIEEDAILKKGKNTKSNKPKAY---EEPIIGSMSMNPDF-KEAPSIHGKQNLLQMLKLWSRKNIILNAIIIT 97 (552) Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~ 97 (552) -.-.++...+++.... ++..|..+... +.++.+..- .+.| -..+.......+.+..|.++..+.+-.|+-.+ T Consensus 1 ~~~~lfg~~i~~~~~~---~~~~s~~~~~~~dg~~~~~~~~~-~g~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eI 76 (537) T protein:vir:10 1 MAQQLFGFSLQRAKKV---PKGPSFVQKDSLDGSQPIVGGGY-FGYSVDFDGTIRNDHELITRYREMVLNPECDSAVDDV 76 (537) T ss_pred Cccccccceeeccccc---ccCCcccCCCcccccceeecccc-cccccccccccchHHHHHHHHHHHhhccchhhHHHHh Confidence 1112223333332221 11222211111 112222211 1111 12334444455667778888888887777665 Q ss_pred HHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCe Q lcl|NC_020081. 98 RVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKI 176 (552) Q Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna 176 (552) ..+++ .+.....+..+-+.+.+ .+..-+++|.. +..+++.++. + ...++ +++.+.+.|.. T Consensus 77 Vneai-------v~d~~~~pV~i~Ld~~~--~s~~iK~kI~eEF~~Il~ll~F-~-----~~~~e----~fR~WYVDgRi 137 (537) T protein:vir:10 77 VNETI-------CGNFDDVPISIDLHNLK--QSEKIKKLIRSEFDEILRLLDF-D-----NRAYE----IFRRWYVDGRL 137 (537) T ss_pred hccee-------EecCCCceEEEEecccc--cchHHHHHHHHHHHHHHHHhcc-c-----hhhhH----HHhhheeeeEE Confidence 44433 22233444455555433 23344444433 4444444332 1 23444 45677788999 Q ss_pred eEEEEECC---CCCEEEEEEecCceeEEEECC-----Ccccc------cccceeEEEEEcC------CceEEEEccccee Q lcl|NC_020081. 177 NFELVYDK---LGDLHNFKAVDASTVYVAVDE-----DGKER------KAKDGVRYVQVID------DKVVAKFKAKEMA 236 (552) Q Consensus 177 ~~~i~r~~---~G~~~~L~~l~p~~v~v~~~~-----~g~~~------~~~~~~~y~~~~~------~~~~~~~~~~evi 236 (552) |+.++-|. ..-+.+|.+|||..|+.++.. ++... .......|+.+.. ......++.+-|. T Consensus 138 ~fhKiid~k~pk~GI~ELr~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI~~dAI~ 217 (537) T protein:vir:10 138 FFHKVIDPKKPRQGLVELRYVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKIAPDSIA 217 (537) T ss_pred EEEEEEeCCCccccceeeeeeCCccceeeEeecccCCccceEEecceeeeecccceeeeccccccccCCCceeccHhhee Confidence 99998653 335899999999999765531 11110 0000011111111 1222334443333 Q ss_pred eecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcc--- Q lcl|NC_020081. 237 WEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSG--- 313 (552) Q Consensus 237 ~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G--- 313 (552) +..-. ....++.+.+|-|..|...+.....++....-|=-.-+.=+-|..+..+......+-+=++.-+ ..|.. T Consensus 218 y~hSG--l~d~n~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM-~k~KNklV 294 (537) T protein:vir:10 218 YCHSG--IQDLNKNMVLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVM-GRYRNKLV 294 (537) T ss_pred eeccc--ceeCCCCeeeeeehhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHH-HhccceEE Confidence 32211 1123346788999999999888888777665444444444555555544333332222222222 11210 Q ss_pred -------ccccccc-eeec---------cCCceeeecc--CchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccc Q lcl|NC_020081. 314 -------INGAWKI-PVIT---------AEDVKFVNMT--QSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATG 374 (552) Q Consensus 314 -------~~nagk~-~il~---------~~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~ 374 (552) ..+..+. .++. +.|.++..|. .+.-+|+- ..+..+.+.++++||.+.|+.. ++|.. T Consensus 295 YDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnlgem~D---V~YF~kKLy~aLnVP~SRl~~e--~~f~~ 369 (537) T protein:vir:10 295 YDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELED---VKYFQKKLYKALNVPSSRLETE--TTFNI 369 (537) T ss_pred EeccCceecccchhhhhhhhhcccccCCCcccceeeccccCCcChHHH---HHHHHHHHHHHhCCCccccCCC--Ccccc Confidence 0000010 0110 0133444332 33344443 4588899999999999999643 22211 Q ss_pred cccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----C-----cccc-----cceeecccccChHH--------- Q lcl|NC_020081. 375 HSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI----V-----SQFG-----GDYVFNFVGGDAKT--------- 431 (552) Q Consensus 375 ~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~-----~~~~~~f~~~d~~~--------- 431 (552) .. +..-+.-|-. +...|.-+..++...|...| + ++.+ ..+.+.|.+...-+ T Consensus 370 Gr--~~EItRDEiK----F~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~ 443 (537) T protein:vir:10 370 GR--AAEITRDEVK----FQKFIARLRKRFSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRN 443 (537) T ss_pred cc--cchhhHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHH Confidence 11 1111112222 23344445555555444332 2 2222 34667776655322 Q ss_pred -HHHHHHHHHHHhcCCcCHHHHHHH-hCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCC Q lcl|NC_020081. 432 -EAEIISILESKAKIGLTINDIRKE-LGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDN 509 (552) Q Consensus 432 -~~~~~~~~~~~~~g~lT~NE~R~~-~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (552) |...++.+..++.-.++.+=+|+. |.+--.+ +...+.+...+... ..-.++. ..+.+..+. T Consensus 444 ~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDee----------I~~~~k~I~~E~k~-~~~~~p~---~~~~~~~~~--- 506 (537) T protein:vir:10 444 ERMNEVAQMDPYVGKYFSANYIRTKVLKQTESE----------IKEIDKEIKQEIAD-GVIMDPQ---AMQAMEMGI--- 506 (537) T ss_pred HHHHHHHHhhhhhhcccchHHHHHHHhccCHHH----------HHHHHHHHHHHhhC-CCCCCcc---cccccccCC--- Confidence 222333333333334577666653 3332110 00011111110000 0001111 111110000 Q ss_pred CCCCCCcccccCCCCccccccccccccccCccccccc Q lcl|NC_020081. 510 VNGKDSFNQNVGKDGQSKQQANTNSTPQGGKDDNGNV 546 (552) Q Consensus 510 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 546 (552) +.++...+.+.+++. ..+++.+-..-.+||- T Consensus 507 --~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:10 507 --GDEEPVPEGGEEPQT----DPNSAVSPADQKRGEL 537 (537) T ss_pred --CCcccCCCCCCCccc----CCccCCCCCCccCCCC Confidence 111111111111111 1112222222223333 No 206 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=97.81 E-value=1.8e-05 Score=46.64 Aligned_cols=472 Identities=12% Similarity=0.015 Sum_probs=196.5 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccCCccccccc--CCCCchHH Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPS--IHGKQNLL 78 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 78 (552) |.++ ||+|.-- .++.-. .+..+.-.+.+++..-.. ..+++... ...++ T Consensus 1 ~~~~--rPk~~p~------------~p~~~~----------~arrr~LtaAsa~l~~~~---~~~~kt~~~~~~~WQ--- 50 (646) T protein:vir:10 1 MALL--KPKSAPP------------EPFGAE----------VARRIALAGATAQVDLGA---SSSWKTWKFGNKDWQ--- 50 (646) T ss_pred Cccc--CCCCCCC------------Cccccc----------ccchhhhhhccccccCCC---cceeecCCCcchhhh--- Confidence 3333 2222111 000000 000111111111111000 00111100 01111 Q ss_pred HHHHHhhcchH--HHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCcc Q lcl|NC_020081. 79 QMLKLWSRKNI--ILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTR 156 (552) Q Consensus 79 ~~Lr~~a~~~~--i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~ 156 (552) -+.|...-. =++..+.=+.++++++ ++|. ...|..+.++. ....+.+..+...+.- .. T Consensus 51 --~eAW~~~d~vpELry~vgW~~~a~SR~-rL~a----------seiddtG~~tg--~v~~~~v~~iv~~~~G-----g~ 110 (646) T protein:vir:10 51 --TEGWRLYDIIPEHHFLAGRIGDSVAQA-RLYV----------TEVDDTGEETG--EVQDERIKRLAAVPLG-----TG 110 (646) T ss_pred --HHHHHHHhhhhhHhhHhhhhhhhhcee-eeee----------eeecCCCCCcC--ccchHHHHHHhhhhcc-----ch Confidence 111211100 0222223333333322 1111 11122222221 1122333444333220 11 Q ss_pred CCHHHHHHHHHHHHHhcCCeeEEEE----ECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEcCCceEEEEcc Q lcl|NC_020081. 157 DNFRSFVKKLVRDRLTYDKINFELV----YDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVIDDKVVAKFKA 232 (552) Q Consensus 157 ~t~~~f~~~~v~d~ll~Gna~~~i~----r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~~~~~~~~~ 232 (552) .--.++++.+..++-+-|.+|+... ...+++ ...+++-.+.|.. .|..+. ++-.....+........ T Consensus 111 ~gQ~qlLkr~~~~ltV~GE~wiv~~~~~~~~~~~~-~~W~vvt~~Ev~~----tg~~~~----i~~p~~~~g~~~v~~~~ 181 (646) T protein:vir:10 111 SQRDDNLRLAGLDLAVGGECWIVGEGAATSPEAAE-GSWFVVTGSAISR----TGDEIA----VRRPQQRGGSKLVLVDG 181 (646) T ss_pred hhHHHHHHHHHhheecccceEEeeccccCCCCCCc-cceeeecHHHhcc----CCCeee----eecCccCCCCCcceecC Confidence 2245799999999999999998641 112221 1233444444421 121110 11111112334455566 Q ss_pred cceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCC-----CHHHHHHHHHHH Q lcl|NC_020081. 233 KEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQ-----SNQALTSFRREW 307 (552) Q Consensus 233 ~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~-----s~~~~~~~~~~~ 307 (552) .+++.-. .++++....+--||+.+++..+.-...+.+...+..+.-.+-.|||.+|.+... .+-....|...| T Consensus 182 ~d~lvRi--W~P~Prr~~epDSpvra~l~~l~Ei~~lt~~I~aaakSRL~GnGvLfvP~e~s~p~~~~~~a~~~~l~~~l 259 (646) T protein:vir:10 182 QDILIRC--WRPHPNDTDQADSFTRSAIVPLREIELLTKREFAELDSRLTGAGIMFLPEGVDFPRGEEDPAGLAGFMAYL 259 (646) T ss_pred CceEEEE--ecCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHHHHhcCceeeeccccccCCCCCCCcchhHHHHHH Confidence 6765432 345566667888999999999998888888888888877778888888643221 111223333333 Q ss_pred ----HHHhcccc-ccccceeeccC---Cc----eeeeccCc-hhHHHHHHHHHHHHHHHHHHhcCCHHHh-ccccccccc Q lcl|NC_020081. 308 ----TSMFSGIN-GAWKIPVITAE---DV----KFVNMTQS-SKDMEFEKWLNYLINVICSIYSIDPSEI-NFPNRGGAT 373 (552) Q Consensus 308 ----~~~~~G~~-nagk~~il~~~---g~----~~~~l~~~-~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~ 373 (552) ...+..-+ .+--+||+... -+ +++.+... .-+.--+.+++..+..||....|||..| |+. +++.| T Consensus 260 ~qaa~tAi~De~S~aA~vPiia~~P~E~i~~~~~ik~l~f~~eite~aiktR~daI~RlA~glDIppE~LLGlg-d~NHW 338 (646) T protein:vir:10 260 QRAAAASMADQSRASAMVPIMATIPNEMMEHLDKIKPLTFWSELSAEITPMKDKAIARLASSAEIPGEVLTGIG-DANHW 338 (646) T ss_pred HHHHHhhhcCCCCccceeeeEEeeChHHHhhhhcceeeccCchhhHHHhhhHHHHHHHHHhccCCchhheeecc-cccee Confidence 22232221 24456776432 11 23333333 2334457899999999999999999764 766 35544 Q ss_pred ccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcc----cc----cceeecccccChHHHHHH-HHHHHHHhc Q lcl|NC_020081. 374 GHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQ----FG----GDYVFNFVGGDAKTEAEI-ISILESKAK 444 (552) Q Consensus 374 ~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~----~~----~~~~~~f~~~d~~~~~~~-~~~~~~~~~ 444 (552) +... -...-++ -|.|.+..|+++|++.+|.. .| .+|.+.|+-.......+. .++.+.... T Consensus 339 tAWq----------I~de~vr-HI~P~l~~ic~AlT~~~Lrp~Le~eGi~dp~kyvvW~DaS~Lt~~pd~~deA~qa~dr 407 (646) T protein:vir:10 339 TAWL----------ISDEGIR-WIRGYLGLIADALTRGFLRRALESMGVTNPERYAFAFDTSTLASKPNRLDEAIQLHER 407 (646) T ss_pred eeee----------eccccch-hhhhHHHHHHHHHHhhHHHHHHHHcCCCChhHeEEeecCcccccCCCCcHHHHHHHHc Confidence 4321 0111234 49999999999999887742 22 357888864432211111 122334456 Q ss_pred CCcCHHHHHHHhCCCCCCCCCe--eec---------ccccc--chhhhccccccccccCCCCCccCcccCCCCCCCCCCC Q lcl|NC_020081. 445 IGLTINDIRKELGYPDTEGGDV--TLA---------GVHVQ--RLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVN 511 (552) Q Consensus 445 g~lT~NE~R~~~gl~p~~ggD~--~~~---------~~n~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (552) |.||-...|+.+|+.--.+-+. ..+ +-+++ |.-+.. ...+ ..+.....+..-+. .+++.+.++ T Consensus 408 GAIt~eAlrk~~Gf~~dd~pt~~E~~~~~~~~~v~~~P~Lil~P~~qa~--~~~P-~~~~~~lpp~~~~~-~dg~~~~~e 483 (646) T protein:vir:10 408 NLIKDEEVVKAGAFSVDQMPTVQERAVQILLGLVKTQPDLILDPAIQAA--LGLP-AVQSVGLPPTAAQR-TDGDLDDDE 483 (646) T ss_pred CCccHHHHHHHhcccccccCChHHHHHHHHHHHhcCCccccccchhhcc--ccCC-CcCccccCCccccc-ccCCCCChh Confidence 9999999999999954322111 000 00000 100000 0000 00000000000000 011111111 Q ss_pred CCCCcccccCCCCccccccccccccccCcccccc------------------------------------ccccccC Q lcl|NC_020081. 512 GKDSFNQNVGKDGQSKQQANTNSTPQGGKDDNGN------------------------------------VVNDWEA 552 (552) Q Consensus 512 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------------------------~~~~~~~ 552 (552) ..+..+.+-..+.++-+++.+ -+..-+..+- ...|+.+ T Consensus 484 ~~g~~~~~E~~~~pda~~~~a---~~~~~~~r~~~~~~~~~~~~~p~a~~~aav~l~v~RAL~lAG~Rlrt~~~~~a 557 (646) T protein:vir:10 484 SEGAPNGGEAPDQPDADEARA---ITAALDRRIALAARPVLALPSPEAVFNASAKLMILRALELAGGRLTTPAERRG 557 (646) T ss_pred hcCCCCCCccCCCCCCCcccc---ccccccccchhhhhhhhccccchhHHHHHHHHHHHHHHHhccccccCchhhhH Confidence 111111111111111111111 1111111110 0111111 No 207 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=97.75 E-value=2.2e-05 Score=46.08 Aligned_cols=441 Identities=13% Similarity=0.125 Sum_probs=157.4 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCc--ccccccCCCCchHHHHHHHhhcc--hHHHHHHHH--- Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPD--FKEAPSIHGKQNLLQMLKLWSRK--NIILNAIII--- 96 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~Lr~~a~~--~~i~~a~i~--- 96 (552) |....+.--.+.+ .+++.+.. +..+.- +-.+.... ...-.+.|.++-++ ..+...... T Consensus 1 ~~~~~~~~~~~~~-------------~~~~~~~~-l~~~~i~~li~~~~~~-~~~r~~~l~~YY~g~~~~i~~~~~~~~~ 65 (506) T protein:vir:94 1 MDYDLTEHKQANL-------------IYQESLEN-LTPNKIMKFITHHFNY-QRPRLEMLDDYYQGYNLKILDKQSRRHE 65 (506) T ss_pred CCcchhhhhccee-------------ecccchhc-CCHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCcccccccccccc Confidence 1111100000000 00000000 000000 00000000 00001112221111 011000000 Q ss_pred -HHHH-H-HHHHHHHHHhhccc--cceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHH Q lcl|NC_020081. 97 -TRVN-Q-VSMFCTPARNSDKG--VGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRL 171 (552) Q Consensus 97 -~~~~-~-~~~~~~~~~~~~~~--~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~l 171 (552) .+.+ . +.-+++.+....++ +|-.+.+.-.+ ......+.+++.. | .+......+..+.+ T Consensus 66 ~~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~~~~~d-------~~~~~~l~~~~~~-------N---~~~~~~~~~~~~~~ 128 (506) T protein:vir:94 66 DGKADHRATHSFAKYIADFQTSYSVGNPINVKLPD-------DGSNSGFDTFNKA-------N---DVDAENYDLFLDMS 128 (506) T ss_pred ccCCcceeecchHHHHHHHhhhhhcccCceeecCc-------chHHHHHHHHHhc-------c---CHhHHHHHHHHHHH Confidence 0000 0 00111111111111 11111111111 1112234444432 1 23445667888999 Q ss_pred hcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccceeEEEEEc-----------------CCceEEEEcccc Q lcl|NC_020081. 172 TYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDGVRYVQVI-----------------DDKVVAKFKAKE 234 (552) Q Consensus 172 l~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~-----------------~~~~~~~~~~~e 234 (552) .+|.+|..+.++.+|++ .+..++|..+.++.++..... ....++|+... .......+.... T Consensus 129 ~~G~a~~~v~~ded~~~-~i~~~~p~~~~~v~dd~~~~~-~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~ 206 (506) T protein:vir:94 129 RYGRAYEYVYRGEDNEE-HLAKLDPLDTFVIYSTDVDPK-PIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTP 206 (506) T ss_pred hcCeEEEEEEecCCCee-EEEEEcccceEEEecCCCCCc-eEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEecccc Confidence 99999999999988875 477799999988876533110 01112221110 111111111110 Q ss_pred ----e----eeeccc-ccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC--------------- Q lcl|NC_020081. 235 ----M----AWEVSN-PRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIK--------------- 290 (552) Q Consensus 235 ----v----i~~~~~-~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--------------- 290 (552) + -|..-. |-..-.+...|.|.++.....++....+..-..+.+...+.|-.+|+-. T Consensus 207 ~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 286 (506) T protein:vir:94 207 IMGKMQVDTTKPITTFPVVEFKNSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTID 286 (506) T ss_pred CccceeccccccCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhcccccc Confidence 0 010000 0000001124666666666555555444333333332222222222100 Q ss_pred -----CCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhc Q lcl|NC_020081. 291 -----TGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEIN 365 (552) Q Consensus 291 -----~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg 365 (552) +.........+.++..-....-.....+. +...+.+.+++-++.......+....+.+.+.|...-++|..-.+ T Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~ 365 (506) T protein:vir:94 287 PNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMT-VNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDE 365 (506) T ss_pred ccccccccccccchhHHHhhhhhcCeeeeccccc-ccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccc Confidence 00000111111111110100001111111 111123445555666666777788889999999999999974321 Q ss_pred ccccccccccccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcC-cccc-cceeecccccChHHHHHHHHH Q lcl|NC_020081. 366 FPNRGGATGHSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIV-SQFG-GDYVFNFVGGDAKTEAEIISI 438 (552) Q Consensus 366 ~~~~~t~~~~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~-~~~~-~~~~~~f~~~d~~~~~~~~~~ 438 (552) -. .++.++....+. .-....+..+...|+.+++.|...++..=- ...+ ..+.+.|.+.-+.+..+.+++ T Consensus 366 ~~----~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~ 441 (506) T protein:vir:94 366 NF----ASNSSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKA 441 (506) T ss_pred cc----cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHH Confidence 10 011111111111 112333445556666666665555442100 1111 245677877777777777776 Q ss_pred HHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCccc Q lcl|NC_020081. 439 LESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQ 518 (552) Q Consensus 439 ~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (552) +.+. .|+||...+++++++-+-+ + ..+..+...+... .+.. + .....+.++.++. ..+.++. T Consensus 442 ~~kl-~g~iS~et~~~~lp~v~d~--~--------~E~~ri~~E~~~~-~~~~--~--~~~~~~~~~~~~~--~~~~~~~ 503 (506) T protein:vir:94 442 LVQA-GATLPQKYLYQQLPGVTNP--Q--------DIVDMMKEQSANG-DYSF--D--QNGVISNDGQTNT--TATQTDE 503 (506) T ss_pred HHHH-hccCChHHHHHhCCCCCCH--H--------HHHHHHHHHHHHH-hhcc--h--hhcCCCcccCccc--ccccccc Confidence 6554 5899998888887542211 0 1111111111100 0000 0 0000000111100 0111111 Q ss_pred ccC Q lcl|NC_020081. 519 NVG 521 (552) Q Consensus 519 ~~~ 521 (552) .+. T Consensus 504 e~~ 506 (506) T protein:vir:94 504 EVR 506 (506) T ss_pred CCC Confidence 111 No 208 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=97.71 E-value=2.6e-05 Score=45.70 Aligned_cols=419 Identities=11% Similarity=0.069 Sum_probs=162.3 Q ss_pred hhhccccccccccccccccccccccccCCcccccccCCCCchHHHHH---HHhhcch--HHHHHH---------HHHH-H Q lcl|NC_020081. 35 AILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQML---KLWSRKN--IILNAI---------IITR-V 99 (552) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L---r~~a~~~--~i~~a~---------i~~~-~ 99 (552) ++-+....=|..-.+... .. ...+.. ..+.+..++...+.+ ++|-.+. .++..- .... . T Consensus 1 m~~~~~~~~~~~~~~~~~---~~--~~~~~~-~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~ 74 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGL---LK--SLKDVT-DHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSM 74 (499) T ss_pred ChhHHHHHHHHHHHHhcc---cc--chhhhh-cCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeec Confidence 111110001111111000 00 000000 001111111111111 1111111 110000 0000 0 Q ss_pred HHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEE Q lcl|NC_020081. 100 NQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFE 179 (552) Q Consensus 100 ~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~ 179 (552) +-....|...+...-+-+..+.. .+......+.+++.. ..+..-+..++.+.+.+|.+|+. T Consensus 75 n~~~~iv~~~a~~l~~ep~~i~~---------~d~~~~e~l~~~~~~----------n~f~~~~~~~~~~a~~~G~~~~~ 135 (499) T protein:vir:80 75 NLPKVTAKYMSKLLFNEKVKINI---------DDETAEEFVLNVLKT----------NGFTKNMERYIEYGEAMGGFVIK 135 (499) T ss_pred chHHHHHHHHHHhhhCCcceEee---------CCHHHHHHHHHHHhh----------ccHHHHHHHHHHHHhhcCcEEEE Confidence 00000011111111111111111 111222233333321 13555667778889999999999 Q ss_pred EEECCCCCEEEEEEecCceeEEEECCCccccccc-------ceeEEEEE----cCCce--E-----------------EE Q lcl|NC_020081. 180 LVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAK-------DGVRYVQV----IDDKV--V-----------------AK 229 (552) Q Consensus 180 i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~-------~~~~y~~~----~~~~~--~-----------------~~ 229 (552) +..|.+|++. +..++|..+.++..+.|.+.... ...+|... ..+.. . .. T Consensus 136 ~~~D~~~~~~-i~~v~a~~~~Pi~~d~~~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~ 214 (499) T protein:vir:80 136 VYHDGNKNVK-VSFATADCMYPLSNDSENVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGK 214 (499) T ss_pred EEECCCCcEE-EEEEcCCceEEEEecCCCeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcc Confidence 9999888754 78899999888665555431110 00111000 00000 0 00 Q ss_pred Ecccc------------------eeeecccccCC-ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE--- Q lcl|NC_020081. 230 FKAKE------------------MAWEVSNPRTD-LTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLL--- 287 (552) Q Consensus 230 ~~~~e------------------vi~~~~~~~~~-~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil--- 287 (552) ++..+ .+|++.+.... ....++|+|.+.-+...++....+..-..+-|..|. ...++ T Consensus 215 v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~-~~i~v~~~ 293 (499) T protein:vir:80 215 VSLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGK-KKVLVPSS 293 (499) T ss_pred cchhhhccCcCCceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcc-cceecchh Confidence 00000 11222211111 123468999999999999888877777777777543 33222 Q ss_pred --EeCCCCCCCHHHHHHHHHHHHHHhccccccccc-eeec-cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHH Q lcl|NC_020081. 288 --HIKTGQEQSNQALTSFRREWTSMFSGINGAWKI-PVIT-AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSE 363 (552) Q Consensus 288 --~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~-~il~-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~ 363 (552) ....+..... . ..| .......+. .... +++-.++.+.....+-.+.+..+...++|....|++|.. T Consensus 294 ~l~~~~~~~g~~--~----~~~----~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~ 363 (499) T protein:vir:80 294 FVKTAVNLDGST--T----QYF----DSTDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGT 363 (499) T ss_pred hhhccCCCCCCc--c----cCC----CcccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhh Confidence 1110000000 0 000 000000110 0111 122346666777777778888999999999999999999 Q ss_pred hcccccccccccccccc---cchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcC-cc---cccceeecccccChHHHHHHH Q lcl|NC_020081. 364 INFPNRGGATGHSGNTL---NEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIV-SQ---FGGDYVFNFVGGDAKTEAEII 436 (552) Q Consensus 364 lg~~~~~t~~~~~~~~~---~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~-~~---~~~~~~~~f~~~d~~~~~~~~ 436 (552) +|....+..++....+. .+.+ ....+..++.+|..++..|-...+...+ .. ....+.+.|.+.-+.+..+.. T Consensus 364 fg~~~~g~~TAtei~s~~~~l~~~-~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~ 442 (499) T protein:vir:80 364 FTFDENGLKTATEVVSEKSETYQT-KNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTI 442 (499) T ss_pred cCCCcccchhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHH Confidence 98754332111110000 0111 1112333344555555554433322111 11 123466777665444443333 Q ss_pred -HHHHHHhcCCcCHHHHHHHh-CCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCC Q lcl|NC_020081. 437 -SILESKAKIGLTINDIRKEL-GYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKD 514 (552) Q Consensus 437 -~~~~~~~~g~lT~NE~R~~~-gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (552) ...+.+.+|+|+.-.++..+ |.+- +. +.+..+....+.... -+.+... ...|++ T Consensus 443 ~~~~~~~~~Gi~S~et~l~~~~~~~d-~e------------a~~el~~i~~E~~~~-~~~~d~~----------g~~ge~ 498 (499) T protein:vir:80 443 NRYTTAKNQGMIPLKIALQRAWNITE-AE------------ADEWAEMLAKEKQAE-IPNNDMT----------GIFGEE 498 (499) T ss_pred HHHHHHHHcCCCCHHHHHhhcCCCCh-HH------------HHHHHHHHHHHhhcC-CCCCCcc----------ccCCCC Confidence 33445557999988887543 4321 00 000001000000000 0000000 001111 Q ss_pred C Q lcl|NC_020081. 515 S 515 (552) Q Consensus 515 ~ 515 (552) + T Consensus 499 e 499 (499) T protein:vir:80 499 E 499 (499) T ss_pred C Confidence 1 No 209 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=97.69 E-value=2.9e-05 Score=45.50 Aligned_cols=474 Identities=13% Similarity=0.104 Sum_probs=184.8 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhcccccccccccccccccccccc-ccCCccccc---ccCCCCch Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSM-SMNPDFKEA---PSIHGKQN 76 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~---~~~~~~~~ 76 (552) |-.| +...+++... .++.|-.+....+++.--. .+.+.|-.- ........ T Consensus 1 m~~l----------------------fgf~i~~~~~----~~~~S~vpp~~~~~~~~i~~g~~g~~v~~~g~~~~~n~~e 54 (564) T protein:vir:10 1 MSQL----------------------FGFLINEKEG----QKGQSPVPPNDEASVSTVAGGYFGTYVDTSGGQNSRNEYE 54 (564) T ss_pred Ccch----------------------hcceeeeecc----CCCCCcccCCcCCChhhhhccccceeeecccccchhhHHH Confidence 2222 2222222111 0111111111111111000 001111000 01122334 Q ss_pred HHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHhcCCCCCCCc Q lcl|NC_020081. 77 LLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEKTGRIDNDFT 155 (552) Q Consensus 77 ~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~~n~~~~pn~ 155 (552) +.+..|.++..+.+-.|+-.+..+++ .+....-+.++-+.+.+ .+..-+++|.. +..+++.++. T Consensus 55 LI~~YR~ma~~pEVd~Av~eIVneaI-------v~d~~~~pV~vdL~~~~--~s~siK~kI~eEF~~Il~ll~F------ 119 (564) T protein:vir:10 55 LIRRYRDMSLHPEVDSAIDEIVNEFV-------VNDGDDKPVEVDLQNLE--IGSGVKKKIRDEFNRILRMMNF------ 119 (564) T ss_pred HHHHHHHHhhccchhhHHHHhhccee-------EecCCCceEEEEecccC--cchHHHHHHHHHHHHHHHHhcc------ Confidence 55666778888887777766554432 22233444566554443 45555555543 4444444332 Q ss_pred cCCHHHHHHHHHHHHHhcCCeeEEEEECC---CCCEEEEEEecCceeEEEECC------Cccccccc--------ceeEE Q lcl|NC_020081. 156 RDNFRSFVKKLVRDRLTYDKINFELVYDK---LGDLHNFKAVDASTVYVAVDE------DGKERKAK--------DGVRY 218 (552) Q Consensus 156 ~~t~~~f~~~~v~d~ll~Gna~~~i~r~~---~G~~~~L~~l~p~~v~v~~~~------~g~~~~~~--------~~~~y 218 (552) ..+.++ +++.+.+.|..|+.++-|. ..-+.+|.+|||..|+.++.. .+...... ....| T Consensus 120 ~~~~~e----~fR~WYVDgRi~fHkiid~~~pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Ey 195 (564) T protein:vir:10 120 NVNAHE----IIRNWYVDGRSHYHKVIDLDNPKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEY 195 (564) T ss_pred chhhhH----HHhhhhhcceEEEEEEeeCCChhhhhhhhhhhcccceeeeeeeccccccccceeeeeeeeeccccccccc Confidence 123444 4567778899999987652 223999999999987765521 11111000 00112 Q ss_pred EEEc-CC----------------ceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_020081. 219 VQVI-DD----------------KVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGG 281 (552) Q Consensus 219 ~~~~-~~----------------~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~ 281 (552) +.+. .+ +....++.+-|.|...+. ...++..=+|-|..|...+.....++....-|=-.-+ T Consensus 196 y~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~daI~y~hSGL--~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRA 273 (564) T protein:vir:10 196 YIYNPKGFAGNIPMVTGSMDWSNQEGIKIASDAIAQSTSGL--MDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRA 273 (564) T ss_pred eeeccccccCcccccccccccccccceeechhhcceecccc--eeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhcc Confidence 2222 11 112456666666654332 2233445567788888888877777776654433334 Q ss_pred CCceEEEeCCCCCCCHHHHHHHHHHHHHHhcc----------ccccccc-eeec---------cCCceeeec--cCchhH Q lcl|NC_020081. 282 TTRGLLHIKTGQEQSNQALTSFRREWTSMFSG----------INGAWKI-PVIT---------AEDVKFVNM--TQSSKD 339 (552) Q Consensus 282 ~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G----------~~nagk~-~il~---------~~g~~~~~l--~~~~~d 339 (552) .=+-|..+..+......+-+=++.-+ ..|.. ..+..+. -++. +.|.++..| +.+.-+ T Consensus 274 PeRRvFYIDVGnLPk~KAeqYlr~iM-~k~KNklVYDa~TGevrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLge 352 (564) T protein:vir:10 274 PERRIFYIDVGNLPKVKAEQYLRDVM-SRYRNKLVYDGQTGEIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGE 352 (564) T ss_pred ccceEEEEecCCCCchhHHHHHHHHH-HhcCceEEEeccCceecccchhhhhHhhhcccccCCCcccceeeccccCCcch Confidence 44555555444332322222222222 11210 0000000 0110 013344444 333344 Q ss_pred HHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----C-- Q lcl|NC_020081. 340 MEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI----V-- 413 (552) Q Consensus 340 ~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L----~-- 413 (552) |+- ..+..+.+.++++||.+.|+.. .++|.-. .++--.....=+...|.-+..++...|...| + T Consensus 353 m~D---V~YF~kKLY~aLnVP~SRl~~e-~~~f~~G------r~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLK 422 (564) T protein:vir:10 353 LKD---VEYFKKKLYNSLNLPPSRLTDD-NKAFNLG------KSTEILRDELKFTKFIGRLRKRFAQLFHDILKTQLILK 422 (564) T ss_pred HHH---HHHHHHHHHHHhCCCcccccCC-Cceeecc------cccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Confidence 443 4588899999999999999753 2222211 1111111122223344455555555444332 2 Q ss_pred ---cccc-----cceeecccccChH----------HHHHHHHHHHHHhcCCcCHHHHHHH-hCCCCCCCCCeeecccccc Q lcl|NC_020081. 414 ---SQFG-----GDYVFNFVGGDAK----------TEAEIISILESKAKIGLTINDIRKE-LGYPDTEGGDVTLAGVHVQ 474 (552) Q Consensus 414 ---~~~~-----~~~~~~f~~~d~~----------~~~~~~~~~~~~~~g~lT~NE~R~~-~gl~p~~ggD~~~~~~n~~ 474 (552) ++.+ ..+.+.|.+...- .|...++.+..++.-.+|.+=+|+. |.+--.+ +. T Consensus 423 giit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDee----------i~ 492 (564) T protein:vir:10 423 GIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKILMQTENE----------FK 492 (564) T ss_pred cCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHH----------HH Confidence 2222 3466777655432 2223333333333334566666542 3332110 00 Q ss_pred chhhhccccccccccCCCCCccC----cccCCCCCCCCCCCCCCCcccccCCCCccccccccccccccCcccccccccc Q lcl|NC_020081. 475 RLGQIMQQEQVEYQRQMDANQFL----AQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQGGKDDNGNVVND 549 (552) Q Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 549 (552) ..+.+...+.. ...-.+|.+.. .+.++....+....-.++. ..++..+ ...++++.-.+.+++++++- T Consensus 493 ~~~kqI~~E~k-~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~----~~~~~~~--~~~~a~~~~~~~~~~~~~~~ 564 (564) T protein:vir:10 493 EIDKQMKSDIE-SGLAIDPIQVNMLDDMEKQNQAFAPELQAAQDDL----AAEREIK--KLNSAPKPPPSQQSKSQSNK 564 (564) T ss_pred HHHHHHHHHhh-cCCCCCchhhhcCCCccCCCCcCCcchhhhcccc----ccccChh--hhccCCCCCCCCCCcCcCCC Confidence 00000000000 00000000000 0000000000000000000 0000000 00000111111111111111 No 210 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=97.67 E-value=3.1e-05 Score=45.31 Aligned_cols=443 Identities=14% Similarity=0.137 Sum_probs=188.7 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhh-----hccccccccccccccccccccc---cccCCc----cc-c Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAI-----LKKGKNTKSNKPKAYEEPIIGS---MSMNPD----FK-E 67 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~---~~~~~~----~~-~ 67 (552) ||.+. .+ |++..=+.....-....++.+ +-..++.-.++. .+.++ ..++.. |- . T Consensus 1 ~~~~~-~~-------~~l~~~~~~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~-----~i~~~~~~~~~~g~~~~~y~~~ 67 (524) T protein:vir:98 1 MNFLG-FG-------NVLSFFKNFAREDEIELEQQLKNDTGSVAPPKNNDGAY-----EIETDLNNQKYAGVFQQFYSGQ 67 (524) T ss_pred CCCcc-hh-------hHHHHhhhhhhhhhhhHhhhhcCCcccccCCCCCCCce-----eecCCCCcceecceeeeecccc Confidence 55443 21 111000000000000000000 000011111110 01111 111111 11 1 Q ss_pred cccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHh Q lcl|NC_020081. 68 APSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEK 146 (552) Q Consensus 68 ~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~ 146 (552) .+.+.+...+.+..|.++..+.+-.|+-.+..+++ .+....-+..+-+.+. +.+..-+++|.. +..++.. T Consensus 68 e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaI-------v~~~~~~pV~l~L~~~--~~s~~iK~kI~eeF~~Il~l 138 (524) T protein:vir:98 68 DPAIQNKEQLINTYRGIMSYPEVENAVSEIIDDAI-------VNEQGKDIITMDLAKT--NFSKAIQDKIVEEFDNVLNI 138 (524) T ss_pred ccccchHHHHHHHHHHHhhccchhhHHHhhhccee-------EecCCCceEEEEeccc--ccchHHHHHHHHHHHHHHHH Confidence 22233344456677788888887777766544432 2233444455555433 344444555543 4444444 Q ss_pred cCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCC--EEEEEEecCceeEEEEC-----CCcccccccceeEEE Q lcl|NC_020081. 147 TGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGD--LHNFKAVDASTVYVAVD-----EDGKERKAKDGVRYV 219 (552) Q Consensus 147 ~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~--~~~L~~l~p~~v~v~~~-----~~g~~~~~~~~~~y~ 219 (552) ++. ....++ +++.+.+.|..|+.++-+.+.. +.+|.+|||..|+.++. .+++........-|+ T Consensus 139 l~F------~~~~~~----~fR~WYVDgRi~fhkiid~~~~kGI~ELr~lDPr~i~~vr~~~~~~~~~~~~v~~~~~e~f 208 (524) T protein:vir:98 139 YDF------DNMGAR----LFRDWYVDSRIYFHKIMHKDESKGIRELRQLDPRCMELIRESITETLDGGVKVFRGYREFF 208 (524) T ss_pred hcc------chhhhH----HHhhhhhcceeEEEEEEcCCCCcceeeeeeeCCccceeeeeccccccccchhhccceeeee Confidence 332 123444 4567778899999999654433 99999999999977641 122211111111122 Q ss_pred EEcC-------------CceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Q lcl|NC_020081. 220 QVID-------------DKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGL 286 (552) Q Consensus 220 ~~~~-------------~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi 286 (552) .+.. -+....++.+.|+|...+.-.. ..+ . +|-|..|...+.....++....-|=-.-+.-+-| T Consensus 209 ~Y~~~~~~~~~~g~~~~~~~~ikI~~dAIvy~hSGL~d~-~~~-i-isyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRv 285 (524) T protein:vir:98 209 VYSAPKAGYTYNGQIYQANQKIKIPRSAIVYAHSGLEDC-SNN-I-IGYLHRAVKPANQLRLLEDAMVIYRITRAPERRV 285 (524) T ss_pred eeccCCCccccccceecCCCceeechhheeeeccCcccC-CCC-e-eeehhHhhHhHHhhHHHHhhHHHHhhhccccceE Confidence 2110 1122456777788765554322 222 2 5888888888888887777665444444444556 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHHHhc---------c-ccccccc-eeec---------cCCceeeecc--CchhHHHHHH Q lcl|NC_020081. 287 LHIKTGQEQSNQALTSFRREWTSMFS---------G-INGAWKI-PVIT---------AEDVKFVNMT--QSSKDMEFEK 344 (552) Q Consensus 287 l~~~~~~~~s~~~~~~~~~~~~~~~~---------G-~~nagk~-~il~---------~~g~~~~~l~--~~~~d~q~~e 344 (552) ..+..+......+-+=++.-+ ..|. | ..+..+. .++. +.|.++..|. .+.-+|+- T Consensus 286 FYIDvGnlPk~KAeqYl~~im-~k~kNklvYDa~TGevrddrk~msMlEDyWLpRReGgrgTEItTLpggqnlgem~D-- 362 (524) T protein:vir:98 286 FYIDVGQMGGNKATQYVNNIA-QGLKNRVVYDARTGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGGQNFSDMDD-- 362 (524) T ss_pred EEEecCCCCchhHHHHHHHHH-HhcCceeEeeccCceeeccccccchhhhhcccccCCCCccceeeccccCCcChHHH-- Confidence 555544333333322233222 2222 1 1111111 1111 1134444443 33344443 Q ss_pred HHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHh----hcC-----cc Q lcl|NC_020081. 345 WLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNK----YIV-----SQ 415 (552) Q Consensus 345 ~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~----~L~-----~~ 415 (552) ..+..+.+.++++||.+.|+..+ ++|.-..+ ..-+.-|-.. ...|.-+..++...|.. .|+ ++ T Consensus 363 -V~YF~kkLy~aLnVP~sRl~~~~-~~f~~Gr~--~EItRDEiKF----~KFI~rLR~rFs~lf~~~L~~qLilKgiit~ 434 (524) T protein:vir:98 363 -IKWFNRKLYEALRVPLSRMPRDD-GGMQIGGG--GEITRDELKF----SKFIRTLQIQFSPVLSDPLKTNLIAKKIITE 434 (524) T ss_pred -HHHHHHHHHHHhCCCceeccCCC-Cccccccc--cchhHHHHHH----HHHHHHHHHHHHHHHHHHHHHhhhhhcCCCH Confidence 45888999999999999997542 33321111 1111222222 33344455555444443 322 22 Q ss_pred cc-----cceeecccccChHH----------HHHHHHHHHHHhcCCcCHHHHHHH-hCCCCCCCCCeeeccccccchhhh Q lcl|NC_020081. 416 FG-----GDYVFNFVGGDAKT----------EAEIISILESKAKIGLTINDIRKE-LGYPDTEGGDVTLAGVHVQRLGQI 479 (552) Q Consensus 416 ~~-----~~~~~~f~~~d~~~----------~~~~~~~~~~~~~g~lT~NE~R~~-~gl~p~~ggD~~~~~~n~~~~~~~ 479 (552) .+ ..+.+.|.+...-+ |...++.+..+..-+++.+=+|+. |.+--.+ +...+.+ T Consensus 435 eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDee----------i~~~~k~ 504 (524) T protein:vir:98 435 DEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKYIMKEILRMSDED----------IDEQAKL 504 (524) T ss_pred HHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHHHhccCHHH----------HHHHHHH Confidence 22 23667776655322 222223333333335677766653 3332110 0001111 Q ss_pred ccccccccccCCCCCccCcccCCCCCCCCCCCCCCCc Q lcl|NC_020081. 480 MQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSF 516 (552) Q Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (552) .+.+ .+.+--.+++.+. ++. T Consensus 505 I~~E----~k~~~~~~p~~e~-------------~~f 524 (524) T protein:vir:98 505 IEEE----SKEERFKNPEAEE-------------ENF 524 (524) T ss_pred HHHH----HhCCCCcCCcccc-------------ccC Confidence 1100 0001111111111 111 No 211 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=97.65 E-value=3.3e-05 Score=45.14 Aligned_cols=458 Identities=12% Similarity=0.100 Sum_probs=188.3 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhcccccccccccccc---ccccccccccCCcc-cccccCCCCch Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAY---EEPIIGSMSMNPDF-KEAPSIHGKQN 76 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~-~~~~~~~~~~~ 76 (552) |-.+= ..++....- ..+..|..+... +.++.... .++.| --.+....... T Consensus 1 m~~lf----------------------g~~i~~~~~---~~~~~s~~~~~~~dg~~~i~~~~-~~~~~~~~e~~~~~~~e 54 (533) T protein:vir:10 1 MSQLF----------------------GFSLERAKK---APKGPSFVQKDNLDGSQPVSGGG-YYGYTVDFDGQVRNEYQ 54 (533) T ss_pred Ccccc----------------------ccccccccc---cccCCCCCCCCcccccceeeccc-ccceeeecccccchHHH Confidence 22221 222211100 111122211111 11222221 12222 22334444555 Q ss_pred HHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHhcCCCCCCCc Q lcl|NC_020081. 77 LLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEKTGRIDNDFT 155 (552) Q Consensus 77 ~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~~n~~~~pn~ 155 (552) +.+..|.++..+.+-.|+-.+..+++ .+.....+..+-+.+.+ .+..-+++|.. +..+++.++. T Consensus 55 LI~~YR~ma~~pEvd~Av~eIVneai-------v~d~~~~pV~i~Ld~~~--~s~~iK~kI~eEF~~Il~ll~F------ 119 (533) T protein:vir:10 55 LISRYREMVLQPECDSAVDDIVNETI-------CGNFDDVPVSVELSNLK--VSDKIKKLIREEFGEILRLLDF------ 119 (533) T ss_pred HHHHHHHHhhccchhhHHHHhhccee-------eecCCCceEEEEecccc--cchHHHHHHHHHHHHHHHHhcc------ Confidence 66777888888888777766544433 22233444555554432 44444445543 4444444332 Q ss_pred cCCHHHHHHHHHHHHHhcCCeeEEEEECC---CCCEEEEEEecCceeEEEECC-----Cccc------ccccceeEEEEE Q lcl|NC_020081. 156 RDNFRSFVKKLVRDRLTYDKINFELVYDK---LGDLHNFKAVDASTVYVAVDE-----DGKE------RKAKDGVRYVQV 221 (552) Q Consensus 156 ~~t~~~f~~~~v~d~ll~Gna~~~i~r~~---~G~~~~L~~l~p~~v~v~~~~-----~g~~------~~~~~~~~y~~~ 221 (552) ....++ +++.+.+.|..|+.++-|. ...+.+|.+|||..|+.++.- ++.. .......-|+.+ T Consensus 120 ~~~~~e----~fR~WYVDgRi~fHkiid~~~pk~GI~ELr~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Y 195 (533) T protein:vir:10 120 ENRSYE----IFRRWYVDGRLFYHKVIDPDNPQGGLIELRYIDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLY 195 (533) T ss_pred chhhhH----HHhhhhhcceEEEEEEecCCCccccceeeeeccccceeeeeeeeccCCCccceeecchhhhccceeeeee Confidence 123444 4567778899999988653 346999999999999875421 2211 011111112222 Q ss_pred cCC------ceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCC Q lcl|NC_020081. 222 IDD------KVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQ 295 (552) Q Consensus 222 ~~~------~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~ 295 (552) ... .....++.+-|.+..-+. ...++.+=+|-|..|...+.....++....-|=-.-+.=+-|..+..+... T Consensus 196 np~g~~~~~~~~vkI~~dAI~y~hSGl--~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLP 273 (533) T protein:vir:10 196 DPKGLKNSTTQGLKIAPDSICYVHSGI--MDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLP 273 (533) T ss_pred ccccccccCCCceecchhheeeeeccc--eeCCCCceeccchHhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCC Confidence 211 222345554444432222 233444556888888888888877777665443334444555555544332 Q ss_pred CHHHHHHHHHHHHHHhcc----------ccccccc-eeec---------cCCceeeecc--CchhHHHHHHHHHHHHHHH Q lcl|NC_020081. 296 SNQALTSFRREWTSMFSG----------INGAWKI-PVIT---------AEDVKFVNMT--QSSKDMEFEKWLNYLINVI 353 (552) Q Consensus 296 s~~~~~~~~~~~~~~~~G----------~~nagk~-~il~---------~~g~~~~~l~--~~~~d~q~~e~~~~~~~~I 353 (552) ...+-+=++.-+ ..|.. ..+..+. .++. +.|.++..|. .+.-+|+- ..+..+.+ T Consensus 274 k~KAeqYlr~iM-~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~D---V~YF~kKL 349 (533) T protein:vir:10 274 KNKAEQYLREVM-GRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEITTLPGGQNLGELED---VKYFQKKL 349 (533) T ss_pred chhHHHHHHHHH-HhccceEEEeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcChHHH---HHHHHHHH Confidence 322222222222 11210 0000010 0110 0133444432 33344443 45888999 Q ss_pred HHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----C-----cccc-----cc Q lcl|NC_020081. 354 CSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI----V-----SQFG-----GD 419 (552) Q Consensus 354 a~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~-----~~ 419 (552) .++++||.+.|+.. ++|....+ ..-+.-|-. +...|.-+..++...|...| + ++.+ .. T Consensus 350 Y~aLnVP~SRl~~e--~~f~~Gr~--~EItRDEiK----F~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~ 421 (533) T protein:vir:10 350 YKSLNVPGSRLETE--TTFNVGRA--AEITRDEVK----FQKFVARLRKRFSELFTDLLKTQLVLKGVISIEEWDQMKEH 421 (533) T ss_pred HHHhCCCccccCCC--Cccccccc--chhhHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhc Confidence 99999999999643 23221111 111111222 23344445555555444332 2 2222 34 Q ss_pred eeecccccChH----------HHHHHHHHHHHHhcCCcCHHHHHHH-hCCCCCCCCCeeeccccccchhhhccccccccc Q lcl|NC_020081. 420 YVFNFVGGDAK----------TEAEIISILESKAKIGLTINDIRKE-LGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQ 488 (552) Q Consensus 420 ~~~~f~~~d~~----------~~~~~~~~~~~~~~g~lT~NE~R~~-~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~ 488 (552) +.+.|.+...- .|...++.+..++.-.+|.+=+|+. |.+--.+ +...+.+...+.. .. T Consensus 422 I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDee----------i~~~~kqI~~E~k-~~ 490 (533) T protein:vir:10 422 IQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVE----------MKEIDKQIESEME-SG 490 (533) T ss_pred ceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHH----------HHHHHHHHHHHHh-CC Confidence 66777665432 2223333333333335677777653 3432110 0000111000000 00 Q ss_pred cCCCCCccCcccCCCCCCCCCCCCCCCcc---cccCCCCccccccccccccccCccccccccccc Q lcl|NC_020081. 489 RQMDANQFLAQQTGYDGNMDNVNGKDSFN---QNVGKDGQSKQQANTNSTPQGGKDDNGNVVNDW 550 (552) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (552) .-.++...... ..+..+ |+..+... .+.+.++++.. ...| T Consensus 491 ~~~~p~~~~~~---~~~~~~-~~~~~~~~~~~~~~~~~~~~~~------------------~~~~ 533 (533) T protein:vir:10 491 IIADPAAEMDP---AMAAGD-PDAGGAPAEEVAPEGPDPSDER------------------KAEF 533 (533) T ss_pred CCCCCcchhhH---HhcCCC-CCcCCcccccCCCCCCCcchhh------------------ccCC Confidence 00011000000 000000 00000000 11111111100 0111 No 212 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=97.61 E-value=3.8e-05 Score=44.80 Aligned_cols=424 Identities=12% Similarity=0.111 Sum_probs=174.0 Q ss_pred CCCCCCCcc----cccch---hhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCC Q lcl|NC_020081. 1 MGLLDGFFK----GRKQQ---DNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHG 73 (552) Q Consensus 1 ~~~~~~~~~----~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (552) ||+++ |.+ +-.+. +||..+.+...........+++-. + .+-|. +... ...|... .+ T Consensus 1 m~~~~-~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~-~-------~~~Y~----g~~~-~~~~~~~---~~ 63 (500) T protein:vir:30 1 MGVIQ-KIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITT-N-------LKYYK----SDWD-SVLYLNT---DG 63 (500) T ss_pred CchHH-HHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHH-H-------HHHhc----CCCC-CcccccC---CC Confidence 98887 332 22211 333333332222222222222210 0 00110 0000 0000000 00 Q ss_pred CchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCC Q lcl|NC_020081. 74 KQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDND 153 (552) Q Consensus 74 ~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~p 153 (552) .... ++.. .--+-..++.. .+.+ ..+-.-.+... +.+....+.+++.. T Consensus 64 ~~~~----~~~~-slnl~~~i~~~----~A~l-------v~~e~~~i~~~---------d~~~~~~l~~il~~------- 111 (500) T protein:vir:30 64 ETKK----RDLN-HLPIARTAAKK----IASL-------VFNEQAEIKVD---------DDAANEFISETLKN------- 111 (500) T ss_pred Cccc----Ccee-ecchHHHHHHH----Hhhh-------hcCCcceEecC---------ChHHHHHHHHHHhh------- Confidence 0000 0000 00011111111 1111 11111112111 11222334444432 Q ss_pred CccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccc-----------cceeEEEE-- Q lcl|NC_020081. 154 FTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKA-----------KDGVRYVQ-- 220 (552) Q Consensus 154 n~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~-----------~~~~~y~~-- 220 (552) -.+...++..+.+.+..|.+++.+..+. |.| .+.++++..+.++..+.++.... ....+|.. T Consensus 112 ---n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~-~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE 186 (500) T protein:vir:30 112 ---DRFNKNFERYLESCLALGGLAMRPYVDG-DKV-RVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIE 186 (500) T ss_pred ---ccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEE Confidence 1355667778888999999999988874 443 47778888887754332222100 01111110 Q ss_pred ---EcCCc-eE---EEEc--------------------ccce----------eeecccccCC-ccCCcccccHHHHHHHH Q lcl|NC_020081. 221 ---VIDDK-VV---AKFK--------------------AKEM----------AWEVSNPRTD-LTVGKYGYPELEIALNH 262 (552) Q Consensus 221 ---~~~~~-~~---~~~~--------------------~~ev----------i~~~~~~~~~-~~~g~~G~spl~~~~~~ 262 (552) ..++. .. ..|. +.++ .|++.+.... ....++|+|.+..+... T Consensus 187 ~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~l 266 (500) T protein:vir:30 187 FHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTT 266 (500) T ss_pred EEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHH Confidence 00010 00 0010 0010 1222111111 12347899999999999 Q ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCCCC-----CC-CHHHHHHHHHHHH---HHhccccccccceeeccCCceeeec Q lcl|NC_020081. 263 LQYHDNTEVFNARFFAQGGTTRGLLHIKTGQ-----EQ-SNQALTSFRREWT---SMFSGINGAWKIPVITAEDVKFVNM 333 (552) Q Consensus 263 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-----~~-s~~~~~~~~~~~~---~~~~G~~nagk~~il~~~g~~~~~l 333 (552) +............-|..|.. . |.++... .. +.+... ...|. ..|... ..-.+++..++.+ T Consensus 267 id~lD~~~s~~~~e~~~g~~-~--i~v~~~~l~~~~~~~~g~~~~--~~~~d~~~~~~~~~------~~~~~~~~~i~~~ 335 (500) T protein:vir:30 267 IDFINTTYDEFMWEVKMGQR-R--VAVPESLTALTVRTTDGDVVP--RPRFESDQNVYIRM------GGRDLDSSAIQDL 335 (500) T ss_pred HHHHHHHHHHHHHHHHhCcc-e--eeechHHhcccCCCCCccccC--CcccCCCcceEEEc------CCCCCcCcceeEe Confidence 99888887777777776543 3 2222111 00 000000 00010 011110 0111233456677 Q ss_pred cCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccc---cccccchhHHHHHHHHHHHHhhHHHHHHHHHHHh Q lcl|NC_020081. 334 TQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHS---GNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNK 410 (552) Q Consensus 334 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~---~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~ 410 (552) +....+-++.+..+...++||...|+++..+|+...+.-++.. ..+..+.+.. ..+..++.+|.-++..|-+..+. T Consensus 336 ~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~-~~~~~~~~al~~lv~~il~~~~~ 414 (500) T protein:vir:30 336 TTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRN-SIVALVEQSLKELVISIFEIAKA 414 (500) T ss_pred ccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH Confidence 7777888999999999999999999999999875443211111 0011122221 23344456666666666543332 Q ss_pred -hcCc---ccccceeecccccChHHH-HHHHHHHHHHhcCCcCHHHHHHHh-CCCCCCCCCeeeccccccchhhhccccc Q lcl|NC_020081. 411 -YIVS---QFGGDYVFNFVGGDAKTE-AEIISILESKAKIGLTINDIRKEL-GYPDTEGGDVTLAGVHVQRLGQIMQQEQ 484 (552) Q Consensus 411 -~L~~---~~~~~~~~~f~~~d~~~~-~~~~~~~~~~~~g~lT~NE~R~~~-gl~p~~ggD~~~~~~n~~~~~~~~~~~~ 484 (552) .+.. .....+.+.|..+-+.++ ++.....+.+.+|+|+.-+++.++ |++.-+ +.+..+... T Consensus 415 ~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~eee-------------a~~~l~~i~ 481 (500) T protein:vir:30 415 YDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVTEEK-------------AQEIAAEIN 481 (500) T ss_pred HhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCCHHH-------------HHHHHHHHH Confidence 1211 112345566755433333 333344455667999999988544 543210 001001000 Q ss_pred cccccCCCCCccCcccCCCCCCCCCCCCC Q lcl|NC_020081. 485 VEYQRQMDANQFLAQQTGYDGNMDNVNGK 513 (552) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (552) .+ .....+++ .+....-|+ T Consensus 482 ~E--~~~~~~~~--------~~~~~~~g~ 500 (500) T protein:vir:30 482 TG--IVDEINQQ--------RTDTHLYGE 500 (500) T ss_pred Hh--ccccCCCC--------CccccccCC Confidence 00 00000000 000111111 No 213 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=97.61 E-value=3.8e-05 Score=44.80 Aligned_cols=424 Identities=12% Similarity=0.111 Sum_probs=174.0 Q ss_pred CCCCCCCcc----cccch---hhcccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCC Q lcl|NC_020081. 1 MGLLDGFFK----GRKQQ---DNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHG 73 (552) Q Consensus 1 ~~~~~~~~~----~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (552) ||+++ |.+ +-.+. +||..+.+...........+++-. + .+-|. +... ...|... .+ T Consensus 1 m~~~~-~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~-~-------~~~Y~----g~~~-~~~~~~~---~~ 63 (500) T protein:vir:98 1 MGVIQ-KIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITT-N-------LKYYK----SDWD-SVLYLNT---DG 63 (500) T ss_pred CchHH-HHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHH-H-------HHHhc----CCCC-CcccccC---CC Confidence 98887 332 22211 333333332222222222222210 0 00110 0000 0000000 00 Q ss_pred CchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCC Q lcl|NC_020081. 74 KQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDND 153 (552) Q Consensus 74 ~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~p 153 (552) .... ++.. .--+-..++.. .+.+ ..+-.-.+... +.+....+.+++.. T Consensus 64 ~~~~----~~~~-slnl~~~i~~~----~A~l-------v~~e~~~i~~~---------d~~~~~~l~~il~~------- 111 (500) T protein:vir:98 64 ETKK----RDLN-HLPIARTAAKK----IASL-------VFNEQAEIKVD---------DDAANEFISETLKN------- 111 (500) T ss_pred Cccc----Ccee-ecchHHHHHHH----Hhhh-------hcCCcceEecC---------ChHHHHHHHHHHhh------- Confidence 0000 0000 00011111111 1111 11111112111 11222334444432 Q ss_pred CccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccc-----------cceeEEEE-- Q lcl|NC_020081. 154 FTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKA-----------KDGVRYVQ-- 220 (552) Q Consensus 154 n~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~-----------~~~~~y~~-- 220 (552) -.+...++..+.+.+..|.+++.+..+. |.| .+.++++..+.++..+.++.... ....+|.. T Consensus 112 ---n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~-~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE 186 (500) T protein:vir:98 112 ---DRFNKNFERYLESCLALGGLAMRPYVDG-DKV-RVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIE 186 (500) T ss_pred ---ccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEE Confidence 1355667778888999999999988874 443 47778888887754332222100 01111110 Q ss_pred ---EcCCc-eE---EEEc--------------------ccce----------eeecccccCC-ccCCcccccHHHHHHHH Q lcl|NC_020081. 221 ---VIDDK-VV---AKFK--------------------AKEM----------AWEVSNPRTD-LTVGKYGYPELEIALNH 262 (552) Q Consensus 221 ---~~~~~-~~---~~~~--------------------~~ev----------i~~~~~~~~~-~~~g~~G~spl~~~~~~ 262 (552) ..++. .. ..|. +.++ .|++.+.... ....++|+|.+..+... T Consensus 187 ~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~l 266 (500) T protein:vir:98 187 FHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTT 266 (500) T ss_pred EEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHH Confidence 00010 00 0010 0010 1222111111 12347899999999999 Q ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCCCC-----CC-CHHHHHHHHHHHH---HHhccccccccceeeccCCceeeec Q lcl|NC_020081. 263 LQYHDNTEVFNARFFAQGGTTRGLLHIKTGQ-----EQ-SNQALTSFRREWT---SMFSGINGAWKIPVITAEDVKFVNM 333 (552) Q Consensus 263 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-----~~-s~~~~~~~~~~~~---~~~~G~~nagk~~il~~~g~~~~~l 333 (552) +............-|..|.. . |.++... .. +.+... ...|. ..|... ..-.+++..++.+ T Consensus 267 id~lD~~~s~~~~e~~~g~~-~--i~v~~~~l~~~~~~~~g~~~~--~~~~d~~~~~~~~~------~~~~~~~~~i~~~ 335 (500) T protein:vir:98 267 IDFINTTYDEFMWEVKMGQR-R--VAVPESLTALTVRTTDGDVVP--RPRFESDQNVYIRM------GGRDLDSSAIQDL 335 (500) T ss_pred HHHHHHHHHHHHHHHHhCcc-e--eeechHHhcccCCCCCccccC--CcccCCCcceEEEc------CCCCCcCcceeEe Confidence 99888887777777776543 3 2222111 00 000000 00010 011110 0111233456677 Q ss_pred cCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccc---cccccchhHHHHHHHHHHHHhhHHHHHHHHHHHh Q lcl|NC_020081. 334 TQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHS---GNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNK 410 (552) Q Consensus 334 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~---~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~ 410 (552) +....+-++.+..+...++||...|+++..+|+...+.-++.. ..+..+.+.. ..+..++.+|.-++..|-+..+. T Consensus 336 ~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~-~~~~~~~~al~~lv~~il~~~~~ 414 (500) T protein:vir:98 336 TTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRN-SIVALVEQSLKELVISIFEIAKA 414 (500) T ss_pred ccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH Confidence 7777888999999999999999999999999875443211111 0011122221 23344456666666666543332 Q ss_pred -hcCc---ccccceeecccccChHHH-HHHHHHHHHHhcCCcCHHHHHHHh-CCCCCCCCCeeeccccccchhhhccccc Q lcl|NC_020081. 411 -YIVS---QFGGDYVFNFVGGDAKTE-AEIISILESKAKIGLTINDIRKEL-GYPDTEGGDVTLAGVHVQRLGQIMQQEQ 484 (552) Q Consensus 411 -~L~~---~~~~~~~~~f~~~d~~~~-~~~~~~~~~~~~g~lT~NE~R~~~-gl~p~~ggD~~~~~~n~~~~~~~~~~~~ 484 (552) .+.. .....+.+.|..+-+.++ ++.....+.+.+|+|+.-+++.++ |++.-+ +.+..+... T Consensus 415 ~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~eee-------------a~~~l~~i~ 481 (500) T protein:vir:98 415 YDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVTEEK-------------AQEIAAEIN 481 (500) T ss_pred HhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCCHHH-------------HHHHHHHHH Confidence 1211 112345566755433333 333344455667999999988544 543210 001001000 Q ss_pred cccccCCCCCccCcccCCCCCCCCCCCCC Q lcl|NC_020081. 485 VEYQRQMDANQFLAQQTGYDGNMDNVNGK 513 (552) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (552) .+ .....+++ .+....-|+ T Consensus 482 ~E--~~~~~~~~--------~~~~~~~g~ 500 (500) T protein:vir:98 482 TG--IVDEINQQ--------RTDTHLYGE 500 (500) T ss_pred Hh--ccccCCCC--------CccccccCC Confidence 00 00000000 000111111 No 214 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=97.52 E-value=5.1e-05 Score=44.09 Aligned_cols=443 Identities=12% Similarity=0.101 Sum_probs=185.7 Q ss_pred CCCCC--CCcccccchhhcccccCcccccccccch-hhhhccccccccccccccccccccccccCC----cccccccCCC Q lcl|NC_020081. 1 MGLLD--GFFKGRKQQDNIIDINDDMAVRIKQIEE-DAILKKGKNTKSNKPKAYEEPIIGSMSMNP----DFKEAPSIHG 73 (552) Q Consensus 1 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~ 73 (552) ||.+. |+...++- +. .++... ...+-..++.-.++.....++. ...++. .+...+...+ T Consensus 1 ~~~~~lf~f~~~~d~--------~~----~~~~~~~~~~s~~~p~~~dGa~~i~~~~~--~~~~~g~~~~~~~~~~~~~~ 66 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQ--------NE----YDERLKLGHESIATPKKDDGATEIETREG--EATYNAVMQQFFGIDNNISG 66 (516) T ss_pred CCchHhcccccchhh--------hH----HhhhhcCCcCcccCCCCCCCceeeecCCC--cccccceeeeeeccccccch Confidence 32221 11111100 00 000000 0000011111111111111100 011121 1222334444 Q ss_pred CchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHhcCCCCC Q lcl|NC_020081. 74 KQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEKTGRIDN 152 (552) Q Consensus 74 ~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~~n~~~~ 152 (552) ...+.+..|.++..+.+-.|+-.+..+++ .+....-+..+-+.+.+ .+..-+++|.. +..++..++. T Consensus 67 ~~eLI~~YR~ma~~pEvd~Av~eIVneai-------v~d~~~~pV~l~L~~~~--~s~~ik~kI~eeF~~Il~ll~F--- 134 (516) T protein:vir:10 67 TKDLINTYRQLINNPEVERAVANIVNEAI-------VYERGHKVVSLDLDDTD--FGSNVKEKILEEFDEVCRLLDA--- 134 (516) T ss_pred HHHHHHHHHHHhhccchhhHHHHhhccee-------EecCCCceEEEEecccC--cchHHHHHHHHHHHHHHHHhcc--- Confidence 55566777888888887777766544443 12234444555554433 44444555543 4444444332 Q ss_pred CCccCCHHHHHHHHHHHHHhcCCeeEEEEEC-CCCCEEEEEEecCceeEEEEC-----CCcccccccceeEEEEE----- Q lcl|NC_020081. 153 DFTRDNFRSFVKKLVRDRLTYDKINFELVYD-KLGDLHNFKAVDASTVYVAVD-----EDGKERKAKDGVRYVQV----- 221 (552) Q Consensus 153 pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~-~~G~~~~L~~l~p~~v~v~~~-----~~g~~~~~~~~~~y~~~----- 221 (552) ....++ +++.+.+.|..|+.++-+ ....+.+|.+|||..|+.++. .+|.........+|++. T Consensus 135 ---~~~~~~----~fR~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~ 207 (516) T protein:vir:10 135 ---SRKLDT----LFRRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEG 207 (516) T ss_pred ---chhhhH----HHhhhhhcceEEEEEEecCccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccc Confidence 123444 456777889999996654 355699999999999977543 22322111111112111 Q ss_pred --cCC-----ceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC Q lcl|NC_020081. 222 --IDD-----KVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQE 294 (552) Q Consensus 222 --~~~-----~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~ 294 (552) ..+ .....++.+-|.|...+. .+...+. =+|-|..|...+.....++....-|=-.-+.=+-|..+..+.. T Consensus 208 ~~~~g~~~~~~~~ikI~~dAI~y~hSGL-~d~~~~~-i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnl 285 (516) T protein:vir:10 208 YSYNGRIFEPNTRIKIPRSAVVYASSGL-MDCSDRG-IIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNM 285 (516) T ss_pred cccccceeCCCcceeechhheeeecccc-eeCCCCc-eeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCC Confidence 111 122334444444432221 1222333 3788888888888877777766544333444455555544433 Q ss_pred CCHHHHHHHHHHHHHHhcc----------ccccccce-eec---------cCCceeeec--cCchhHHHHHHHHHHHHHH Q lcl|NC_020081. 295 QSNQALTSFRREWTSMFSG----------INGAWKIP-VIT---------AEDVKFVNM--TQSSKDMEFEKWLNYLINV 352 (552) Q Consensus 295 ~s~~~~~~~~~~~~~~~~G----------~~nagk~~-il~---------~~g~~~~~l--~~~~~d~q~~e~~~~~~~~ 352 (552) ....+-+=++.-+. .|.. ..+..+.. ++. +.|.++..| +.+.-+|+- ..+..+. T Consensus 286 Pk~KAeqYl~~im~-k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kk 361 (516) T protein:vir:10 286 NNRKATEYVNGIMQ-SLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDD---VRWFNKK 361 (516) T ss_pred CchhHHHHHHHHHH-hcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHH---HHHHHHH Confidence 23222222222221 1110 00111100 110 013344433 233344443 4588899 Q ss_pred HHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----C-----cccc-----c Q lcl|NC_020081. 353 ICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI----V-----SQFG-----G 418 (552) Q Consensus 353 Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~-----~ 418 (552) +.++++||.+.|+.....++.+.. ++--..-..=+...|.-+..++...|...| + ++.+ . T Consensus 362 Ly~aLnVP~sRl~~e~~~~~~~Gr------~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~ 435 (516) T protein:vir:10 362 LYEALRIPLSRIPRDDGGMVIGGQ------DTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQIN 435 (516) T ss_pred HHHHhCCCcccccCCCCceeeccc------cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhh Confidence 999999999999754433221111 111111112223344445555554444332 2 2222 3 Q ss_pred ceeecccccChH----------HHHHHHHHHHHHhcCCcCHHHHHHH-hCCCCCCCCCeeeccccccchhhhcccccccc Q lcl|NC_020081. 419 DYVFNFVGGDAK----------TEAEIISILESKAKIGLTINDIRKE-LGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEY 487 (552) Q Consensus 419 ~~~~~f~~~d~~----------~~~~~~~~~~~~~~g~lT~NE~R~~-~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~ 487 (552) .+.+.|.+...- .|...++.+..+...+++.+=+|+. |.+.-.+ +...+.+...+. T Consensus 436 ~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee----------i~~e~k~I~~E~--- 502 (516) T protein:vir:10 436 NIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQ----------IAQEEKQIEQEA--- 502 (516) T ss_pred cceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhh----------HHHHHHHHHHhh--- Confidence 466777665432 2333333444444457787777753 4443211 000011110000 Q ss_pred ccCCCCCccCcccCCCCCCCCCCCCCCCc Q lcl|NC_020081. 488 QRQMDANQFLAQQTGYDGNMDNVNGKDSF 516 (552) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (552) +.+- -.+|+.+++. T Consensus 503 -~~~~--------------~~~p~~~~~f 516 (516) T protein:vir:10 503 -GIKR--------------FQNPENEDDF 516 (516) T ss_pred -hCCC--------------CCCCCccccC Confidence 0000 0011111111 No 215 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=97.52 E-value=5.1e-05 Score=44.09 Aligned_cols=443 Identities=12% Similarity=0.101 Sum_probs=185.7 Q ss_pred CCCCC--CCcccccchhhcccccCcccccccccch-hhhhccccccccccccccccccccccccCC----cccccccCCC Q lcl|NC_020081. 1 MGLLD--GFFKGRKQQDNIIDINDDMAVRIKQIEE-DAILKKGKNTKSNKPKAYEEPIIGSMSMNP----DFKEAPSIHG 73 (552) Q Consensus 1 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~ 73 (552) ||.+. |+...++- +. .++... ...+-..++.-.++.....++. ...++. .+...+...+ T Consensus 1 ~~~~~lf~f~~~~d~--------~~----~~~~~~~~~~s~~~p~~~dGa~~i~~~~~--~~~~~g~~~~~~~~~~~~~~ 66 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQ--------NE----YDERLKLGHESIATPKKDDGATEIETREG--EATYNAVMQQFFGIDNNISG 66 (516) T ss_pred CCchHhcccccchhh--------hH----HhhhhcCCcCcccCCCCCCCceeeecCCC--cccccceeeeeeccccccch Confidence 32221 11111100 00 000000 0000011111111111111100 011121 1222334444 Q ss_pred CchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHhcCCCCC Q lcl|NC_020081. 74 KQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEKTGRIDN 152 (552) Q Consensus 74 ~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~~n~~~~ 152 (552) ...+.+..|.++..+.+-.|+-.+..+++ .+....-+..+-+.+.+ .+..-+++|.. +..++..++. T Consensus 67 ~~eLI~~YR~ma~~pEvd~Av~eIVneai-------v~d~~~~pV~l~L~~~~--~s~~ik~kI~eeF~~Il~ll~F--- 134 (516) T protein:vir:10 67 TKDLINTYRQLINNPEVERAVANIVNEAI-------VYERGHKVVSLDLDDTD--FGSNVKEKILEEFDEVCRLLDA--- 134 (516) T ss_pred HHHHHHHHHHHhhccchhhHHHHhhccee-------EecCCCceEEEEecccC--cchHHHHHHHHHHHHHHHHhcc--- Confidence 55566777888888887777766544443 12234444555554433 44444555543 4444444332 Q ss_pred CCccCCHHHHHHHHHHHHHhcCCeeEEEEEC-CCCCEEEEEEecCceeEEEEC-----CCcccccccceeEEEEE----- Q lcl|NC_020081. 153 DFTRDNFRSFVKKLVRDRLTYDKINFELVYD-KLGDLHNFKAVDASTVYVAVD-----EDGKERKAKDGVRYVQV----- 221 (552) Q Consensus 153 pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~-~~G~~~~L~~l~p~~v~v~~~-----~~g~~~~~~~~~~y~~~----- 221 (552) ....++ +++.+.+.|..|+.++-+ ....+.+|.+|||..|+.++. .+|.........+|++. T Consensus 135 ---~~~~~~----~fR~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~ 207 (516) T protein:vir:10 135 ---SRKLDT----LFRRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEG 207 (516) T ss_pred ---chhhhH----HHhhhhhcceEEEEEEecCccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccc Confidence 123444 456777889999996654 355699999999999977543 22322111111112111 Q ss_pred --cCC-----ceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC Q lcl|NC_020081. 222 --IDD-----KVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQE 294 (552) Q Consensus 222 --~~~-----~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~ 294 (552) ..+ .....++.+-|.|...+. .+...+. =+|-|..|...+.....++....-|=-.-+.=+-|..+..+.. T Consensus 208 ~~~~g~~~~~~~~ikI~~dAI~y~hSGL-~d~~~~~-i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnl 285 (516) T protein:vir:10 208 YSYNGRIFEPNTRIKIPRSAVVYASSGL-MDCSDRG-IIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNM 285 (516) T ss_pred cccccceeCCCcceeechhheeeecccc-eeCCCCc-eeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCC Confidence 111 122334444444432221 1222333 3788888888888877777766544333444455555544433 Q ss_pred CCHHHHHHHHHHHHHHhcc----------ccccccce-eec---------cCCceeeec--cCchhHHHHHHHHHHHHHH Q lcl|NC_020081. 295 QSNQALTSFRREWTSMFSG----------INGAWKIP-VIT---------AEDVKFVNM--TQSSKDMEFEKWLNYLINV 352 (552) Q Consensus 295 ~s~~~~~~~~~~~~~~~~G----------~~nagk~~-il~---------~~g~~~~~l--~~~~~d~q~~e~~~~~~~~ 352 (552) ....+-+=++.-+. .|.. ..+..+.. ++. +.|.++..| +.+.-+|+- ..+..+. T Consensus 286 Pk~KAeqYl~~im~-k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kk 361 (516) T protein:vir:10 286 NNRKATEYVNGIMQ-SLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDD---VRWFNKK 361 (516) T ss_pred CchhHHHHHHHHHH-hcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHH---HHHHHHH Confidence 23222222222221 1110 00111100 110 013344433 233344443 4588899 Q ss_pred HHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----C-----cccc-----c Q lcl|NC_020081. 353 ICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI----V-----SQFG-----G 418 (552) Q Consensus 353 Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~-----~ 418 (552) +.++++||.+.|+.....++.+.. ++--..-..=+...|.-+..++...|...| + ++.+ . T Consensus 362 Ly~aLnVP~sRl~~e~~~~~~~Gr------~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~ 435 (516) T protein:vir:10 362 LYEALRIPLSRIPRDDGGMVIGGQ------DTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQIN 435 (516) T ss_pred HHHHhCCCcccccCCCCceeeccc------cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhh Confidence 999999999999754433221111 111111112223344445555554444332 2 2222 3 Q ss_pred ceeecccccChH----------HHHHHHHHHHHHhcCCcCHHHHHHH-hCCCCCCCCCeeeccccccchhhhcccccccc Q lcl|NC_020081. 419 DYVFNFVGGDAK----------TEAEIISILESKAKIGLTINDIRKE-LGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEY 487 (552) Q Consensus 419 ~~~~~f~~~d~~----------~~~~~~~~~~~~~~g~lT~NE~R~~-~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~ 487 (552) .+.+.|.+...- .|...++.+..+...+++.+=+|+. |.+.-.+ +...+.+...+. T Consensus 436 ~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee----------i~~e~k~I~~E~--- 502 (516) T protein:vir:10 436 NIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQ----------IAQEEKQIEQEA--- 502 (516) T ss_pred cceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhh----------HHHHHHHHHHhh--- Confidence 466777665432 2333333444444457787777753 4443211 000011110000 Q ss_pred ccCCCCCccCcccCCCCCCCCCCCCCCCc Q lcl|NC_020081. 488 QRQMDANQFLAQQTGYDGNMDNVNGKDSF 516 (552) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (552) +.+- -.+|+.+++. T Consensus 503 -~~~~--------------~~~p~~~~~f 516 (516) T protein:vir:10 503 -GIKR--------------FQNPENEDDF 516 (516) T ss_pred -hCCC--------------CCCCCccccC Confidence 0000 0011111111 No 216 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=97.48 E-value=5.9e-05 Score=43.78 Aligned_cols=443 Identities=13% Similarity=0.119 Sum_probs=189.5 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhcccc--cccccccccccc---cccccc------ccCCccc--c Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGK--NTKSNKPKAYEE---PIIGSM------SMNPDFK--E 67 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~---~~~~~~------~~~~~~~--~ 67 (552) |.-++ |++++=+.. ....+...+... ++.|-.+....+ .+..++ .+.++|+ . T Consensus 1 ~~~~~----------~~~~lf~f~-----~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~ 65 (524) T protein:vir:10 1 MANFN----------TILSFLKPW-----ANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSN 65 (524) T ss_pred CCchh----------hHHHHhhhh-----hcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcc Confidence 22111 111110000 000000000000 011111100000 011111 1111221 2 Q ss_pred cccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHh Q lcl|NC_020081. 68 APSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEK 146 (552) Q Consensus 68 ~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~ 146 (552) .+.......+.+..|.++..+.+-.|+-.+..+++ .+.....+..+-+.+.+ .+..-+++|.. +..++.. T Consensus 66 e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai-------v~d~~~~pV~l~Ld~~~--~s~siK~kI~eeF~~Il~l 136 (524) T protein:vir:10 66 EPEVKNTRELIDTYRNLMNNYEVDNAVQEIVSDAI-------VYEDDKEVVALNLDGTD--FSQSIKDKILAEFSEVLNL 136 (524) T ss_pred cchhhhHHHHHHHHHHHhhccchhhHHHHhhccee-------EecCCCceEEEEecccC--cchHHHHHHHHHHHHHHHH Confidence 33444444556677888888887777766544443 12234444555554433 44554555543 4444444 Q ss_pred cCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECC---CCCEEEEEEecCceeEEEEC----CCcccccccceeEEE Q lcl|NC_020081. 147 TGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDK---LGDLHNFKAVDASTVYVAVD----EDGKERKAKDGVRYV 219 (552) Q Consensus 147 ~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~---~G~~~~L~~l~p~~v~v~~~----~~g~~~~~~~~~~y~ 219 (552) ++. ....++ +++.+.+.|..|+.++-|. ...+.+|.+|||..|+.++. .+++.........|+ T Consensus 137 l~F------~~~~~~----~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f 206 (524) T protein:vir:10 137 LNF------QRKGTD----HFQRWYVDSRIFFHKIINPKKMKDGVQELRRLDPRQVQYIREIVTRMEDGVKIVDGYREFF 206 (524) T ss_pred hcc------chhhhH----HHhhheeeceEEEEEEeeCCCccccceeeeeeCCccceeeeeecccCcccchhhcchhhhe Confidence 332 123344 4567778899999988653 34699999999999976432 222222222212233 Q ss_pred EEc-------------CCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Q lcl|NC_020081. 220 QVI-------------DDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGL 286 (552) Q Consensus 220 ~~~-------------~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi 286 (552) .+. ..+....++.+.|+|...+. .+.++..=+|-|..|...+.....++....-|=-.-+.=+-| T Consensus 207 ~Y~~~~~~~~~~~~~~~~~~~ikI~~dAIvy~~SGL--~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRv 284 (524) T protein:vir:10 207 VYDTGHESYCADGRIYSAGTKVKIPRAAVVYAHSGL--LDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRV 284 (524) T ss_pred eecCCCcccccCcceecCCcceecchhheeeeccCc--ccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceE Confidence 322 12233456777788764433 233444556778888888888777777665443333444555 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHHHhc---------c-ccccccce-eec---------cCCceeeecc--CchhHHHHHH Q lcl|NC_020081. 287 LHIKTGQEQSNQALTSFRREWTSMFS---------G-INGAWKIP-VIT---------AEDVKFVNMT--QSSKDMEFEK 344 (552) Q Consensus 287 l~~~~~~~~s~~~~~~~~~~~~~~~~---------G-~~nagk~~-il~---------~~g~~~~~l~--~~~~d~q~~e 344 (552) ..+..+......+-+=++.-+. .|. | ..+..+.. ++. +.|.++..|. .+.-+|+- T Consensus 285 FYIDVGnlPk~KAeqYl~~im~-k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-- 361 (524) T protein:vir:10 285 FYIDTGNMPSRKAAAQMQHIMN-TMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGMSDMDD-- 361 (524) T ss_pred EEEecCCCCchhHHHHHHHHHH-hcCceeEEeccCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHH-- Confidence 5554443323222222222221 111 0 01111110 110 1133444432 33344443 Q ss_pred HHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----C-----cc Q lcl|NC_020081. 345 WLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI----V-----SQ 415 (552) Q Consensus 345 ~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~ 415 (552) ..+..+.+.++++||.+.|+....++|....++-. +.-|-. +...|.-+..++...|...| + ++ T Consensus 362 -V~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EI--tRDEiK----F~KFI~rLR~rFs~lf~~~L~~qLilKgiit~ 434 (524) T protein:vir:10 362 -VLYFRTALYRALRIPESRIPSESNSGVMFDAGTAI--TRDELK----FAKWIRQLQNKFEEIFLDPLKTNLILKKIITE 434 (524) T ss_pred -HHHHHHHHHHHhCCCchhccCCCCccccccccchh--hHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCH Confidence 45888999999999999997554444432221111 112222 23344445555554444332 2 22 Q ss_pred cc-----cceeecccccChHH----------HHHHHHHHHHHhcCCcCHHHHHHH-hCCCCCCCCCeeeccccccchhhh Q lcl|NC_020081. 416 FG-----GDYVFNFVGGDAKT----------EAEIISILESKAKIGLTINDIRKE-LGYPDTEGGDVTLAGVHVQRLGQI 479 (552) Q Consensus 416 ~~-----~~~~~~f~~~d~~~----------~~~~~~~~~~~~~g~lT~NE~R~~-~gl~p~~ggD~~~~~~n~~~~~~~ 479 (552) .+ ..+.+.|.+...-+ |...++.+..++.-.++.+=+|+. |.+--.+ +...+.+ T Consensus 435 eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee----------i~~~~k~ 504 (524) T protein:vir:10 435 DEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYISHQTAMKDFLQMTDEE----------INQEAKQ 504 (524) T ss_pred HHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHH----------HHHHHHH Confidence 22 34667776655322 222233333333334566666653 3332110 0000111 Q ss_pred ccccccccccCCCCCccCcccCCCCCCCCCCCCCCCc Q lcl|NC_020081. 480 MQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSF 516 (552) Q Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (552) ...+ .+.+--.++..+. ++. T Consensus 505 I~~E----~k~~~~~~~~~~~-------------~~f 524 (524) T protein:vir:10 505 IEEE----SKEARFQNPDEEE-------------EDF 524 (524) T ss_pred HHHH----hhcCCCCCCChhh-------------hcC Confidence 1100 0001111111110 111 No 217 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=97.47 E-value=6.1e-05 Score=43.71 Aligned_cols=444 Identities=12% Similarity=0.099 Sum_probs=185.6 Q ss_pred CCCCC--CCcccccchhhcccccCcccccccccch-hhhhccccccccccccccccccc--cccccCCcccccccCCCCc Q lcl|NC_020081. 1 MGLLD--GFFKGRKQQDNIIDINDDMAVRIKQIEE-DAILKKGKNTKSNKPKAYEEPII--GSMSMNPDFKEAPSIHGKQ 75 (552) Q Consensus 1 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~ 75 (552) ||.+. |+....+-.. .++... ...+-..++.-.++...-..+.. .....+..+-..+.+.... T Consensus 1 ~~~~~lf~f~~~~d~~~------------~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~ 68 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNE------------YDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTK 68 (516) T ss_pred CCchHhcccccchhhHH------------HHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHH Confidence 33221 2221111100 000000 00010111111111111111000 0011112222344555556 Q ss_pred hHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHhcCCCCCCC Q lcl|NC_020081. 76 NLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEKTGRIDNDF 154 (552) Q Consensus 76 ~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~~n~~~~pn 154 (552) .+.+..|.++..+.+-.|+-.+..+++. +.....+..+-+.+.+ .+..-+++|.. +..++..++. T Consensus 69 ~LI~~YR~ma~~pEvd~Av~eIvneaiv-------~d~~~~pV~l~l~~~e--~s~sik~kI~eeF~~Il~ll~F----- 134 (516) T protein:vir:10 69 DLINTYRQLTNNPEVERAVANIVNEAVV-------YEKGHKVVSLDLDDTE--FSSSIKDKILEEFDEICRLLDA----- 134 (516) T ss_pred HHHHHHHHhhhccchhHHHHHhhcceeE-------ecCCCceEEEEecccc--cchHHHHHHHHHHHHHHHHhcc----- Confidence 6777788899888888887666554431 2233344455444432 44444455543 4444444332 Q ss_pred ccCCHHHHHHHHHHHHHhcCCeeEEEEEC-CCCCEEEEEEecCceeEEEEC-----CCcccccccceeEEEEEc------ Q lcl|NC_020081. 155 TRDNFRSFVKKLVRDRLTYDKINFELVYD-KLGDLHNFKAVDASTVYVAVD-----EDGKERKAKDGVRYVQVI------ 222 (552) Q Consensus 155 ~~~t~~~f~~~~v~d~ll~Gna~~~i~r~-~~G~~~~L~~l~p~~v~v~~~-----~~g~~~~~~~~~~y~~~~------ 222 (552) ....++ +++.+.+.|..|+.++-+ ....+.+|.+|||..|+.++. .+|.........+|++.. T Consensus 135 -~~~~~~----~fR~WYVDgRi~fhKiid~~k~GI~elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~ 209 (516) T protein:vir:10 135 -SRKLDT----LFRRWYIDSRIFFHKIMPNPKEGIVELRRLDPRHVEYYREIVTSDVGGTSVVKGYREFFVYTTGNEGYA 209 (516) T ss_pred -chhhhH----HHHhhhhcceEEEEEEecCcccceeeeeeeCCcceeeEEeeecccCcchhhhhceeeeeeeecCcccee Confidence 123444 456777889999996654 355699999999999977543 222211111111122211 Q ss_pred -CC-----ceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCC Q lcl|NC_020081. 223 -DD-----KVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQS 296 (552) Q Consensus 223 -~~-----~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s 296 (552) .+ .....++.+-|.+..-+. ...++..=+|-|..|...+.....++....-|=-.-+.-+-|..+..+.... T Consensus 210 ~~g~~~~~~~~ikI~~daI~y~hSGl--~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk 287 (516) T protein:vir:10 210 YNGRLFEPNTRIKIPRSAIVYAHSGL--QDCSDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPN 287 (516) T ss_pred ccccccCCCCceecchhheeeeecCc--ccCCCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCc Confidence 11 112333333333332222 1222222267788888888877777776654433344445555554443333 Q ss_pred HHHHHHHHHHHHHHhcc----------ccccccce-eec---------cCCceeeecc--CchhHHHHHHHHHHHHHHHH Q lcl|NC_020081. 297 NQALTSFRREWTSMFSG----------INGAWKIP-VIT---------AEDVKFVNMT--QSSKDMEFEKWLNYLINVIC 354 (552) Q Consensus 297 ~~~~~~~~~~~~~~~~G----------~~nagk~~-il~---------~~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia 354 (552) ..+-+=++.-+. .|.. ..+..+.. ++. +.|.++..|. .+.-+|+- ..+..+.+. T Consensus 288 ~KAeqYl~~iM~-k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kkLy 363 (516) T protein:vir:10 288 RKATEYVNGIMQ-SLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGEMDD---VRWFNKKLY 363 (516) T ss_pred hhHHHHHHHHHH-hcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHH---HHHHHHHHH Confidence 222222222221 1110 00111100 110 0133444432 33344443 458889999 Q ss_pred HHhcCCHHHhcccccccccccccccccchhH-HHHHHHHHHHHhhHHHHH----HHHHHHhhcC-----cccc-----cc Q lcl|NC_020081. 355 SIYSIDPSEINFPNRGGATGHSGNTLNEGSS-AEKYRNSKDKGLEPLLKF----IEDAVNKYIV-----SQFG-----GD 419 (552) Q Consensus 355 ~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~-e~~~~~~~~~~l~P~~~~----ie~~ln~~L~-----~~~~-----~~ 419 (552) ++++||.+.|+.....++.+..+ +.-+- |-.. ...|.-+..+ +-+.|...|+ ++.+ .. T Consensus 364 ~aLnVP~SRl~~e~~~~~~~Gr~---~EItRDEiKF----~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~ 436 (516) T protein:vir:10 364 EALRIPLSRMPRDDGGMVIGGQD---MAITRDELDF----RKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINN 436 (516) T ss_pred HHhCCCcccccCCCCceeecccc---chhhHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhc Confidence 99999999997544332211111 11111 2222 2333444443 3444444332 2222 24 Q ss_pred eeecccccChH----------HHHHHHHHHHHHhcCCcCHHHHHHH-hCCCCCCCCCeeeccccccchhhhccccccccc Q lcl|NC_020081. 420 YVFNFVGGDAK----------TEAEIISILESKAKIGLTINDIRKE-LGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQ 488 (552) Q Consensus 420 ~~~~f~~~d~~----------~~~~~~~~~~~~~~g~lT~NE~R~~-~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~ 488 (552) +.+.|.+...- .|...++.+..+...+++.+=+|+. |.+.-.+ +...+.+...+ . T Consensus 437 I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~tDee----------i~~~~k~I~~E----~ 502 (516) T protein:vir:10 437 IKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNILQMTDEQ----------IAQEEKQIEKE----A 502 (516) T ss_pred ceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhH----------HHHHHHHHHHh----h Confidence 66777665432 2333333444444457787777753 4443211 00001111000 0 Q ss_pred cCCCCCccCcccCCCCCCCCCCCCCCCc Q lcl|NC_020081. 489 RQMDANQFLAQQTGYDGNMDNVNGKDSF 516 (552) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (552) +.+- -.+|+.+++. T Consensus 503 ~~~~--------------~~~p~~e~~f 516 (516) T protein:vir:10 503 NVKR--------------FQNPENEDDF 516 (516) T ss_pred hCCC--------------CCCCCccccC Confidence 0000 0001111111 No 218 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=97.45 E-value=6.4e-05 Score=43.57 Aligned_cols=445 Identities=13% Similarity=0.088 Sum_probs=183.8 Q ss_pred CCCCC----CCcccccchhhcccccCcccccccccchhhh-hcccccccccccccc---ccccccccccCCcccccccCC Q lcl|NC_020081. 1 MGLLD----GFFKGRKQQDNIIDINDDMAVRIKQIEEDAI-LKKGKNTKSNKPKAY---EEPIIGSMSMNPDFKEAPSIH 72 (552) Q Consensus 1 ~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~ 72 (552) |+..- |+.-.++- ...+++.++.. +-..++.-.+..... ..++......+..+...+-.. T Consensus 1 m~~~~l~lf~f~~k~~e------------~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~ 68 (521) T protein:vir:10 1 MNPIFLKLLQPWMKDDE------------KRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQ 68 (521) T ss_pred CCcchhHHhhhhhhhhh------------hHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccc Confidence 33210 00000000 00000000000 000000111110000 000000000011111222233 Q ss_pred CCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHhcCCCC Q lcl|NC_020081. 73 GKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEKTGRID 151 (552) Q Consensus 73 ~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~~n~~~ 151 (552) +...+.+..|.++..+.+-.|+-.+..+++ .+....-+..+-+.+.+. ++.-+++|.. +..++..++. T Consensus 69 n~~eLI~~YR~ma~~pEvd~Av~eIvneai-------v~d~~~~pV~i~Ld~~~~--s~~iK~kI~eeF~~Il~ll~F-- 137 (521) T protein:vir:10 69 NTKDLINQYRSLSKYHEVDNAIDEIINDAI-------VQEDNRDTVYLDLDKTDW--NESVKEMVREEFRTILKLLKF-- 137 (521) T ss_pred hHHHHHHHHHHHhhccchhhHHHhhhcceE-------EecCCCceEEEEecCccc--chHHHHHHHHHHHHHHHHhcc-- Confidence 444556677788888887777665544433 122334445555544433 3333444433 4444443332 Q ss_pred CCCccCCHHHHHHHHHHHHHhcCCeeEEEEECC---CCCEEEEEEecCceeEEEEC----CCcccccccceeEEEEEcC- Q lcl|NC_020081. 152 NDFTRDNFRSFVKKLVRDRLTYDKINFELVYDK---LGDLHNFKAVDASTVYVAVD----EDGKERKAKDGVRYVQVID- 223 (552) Q Consensus 152 ~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~---~G~~~~L~~l~p~~v~v~~~----~~g~~~~~~~~~~y~~~~~- 223 (552) ....++ +++.+.+.|..|+.++-|. ...+.+|.+|||..|+.++. ..++........-|+.+.. T Consensus 138 ----~~~~~~----~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~ 209 (521) T protein:vir:10 138 ----EREGKR----HFRRWYVDSRIYFHKMIDPARPKDGIKELRLLDPRNVEYYRVNLKSNENGNDVYKGVKEFFTYGAT 209 (521) T ss_pred ----chhhhH----HHhhheeeeeEEEEEEeeCCCccccceeeeeeCCcceeeeeeecCCCCCcchhhccceeeeeeccC Confidence 123344 4567778899999987653 34599999999999966442 1222211111111222211 Q ss_pred ----------CceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC Q lcl|NC_020081. 224 ----------DKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQ 293 (552) Q Consensus 224 ----------~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~ 293 (552) .+....++.+-|.|...+. ...++.+.+|-|..|...+.....++....-|=-.-+.=+-|..+..+. T Consensus 210 ~~~~~~~~g~~~~~vkI~~daI~y~hSGL--~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGn 287 (521) T protein:vir:10 210 EDNRYNISGNSNNLVQIPIDAIVYSHSGK--VDIDGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGT 287 (521) T ss_pred CCceecCCCCCCcceeechhheeeecccc--eeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCC Confidence 1122345565555543222 2345678899999999998888877776654443344445555555443 Q ss_pred CCCHHHHHHHHHHHHHHhc----------cccccccce-eec---------cCCceeeecc--CchhHHHHHHHHHHHHH Q lcl|NC_020081. 294 EQSNQALTSFRREWTSMFS----------GINGAWKIP-VIT---------AEDVKFVNMT--QSSKDMEFEKWLNYLIN 351 (552) Q Consensus 294 ~~s~~~~~~~~~~~~~~~~----------G~~nagk~~-il~---------~~g~~~~~l~--~~~~d~q~~e~~~~~~~ 351 (552) .....+-+=++.-+. .+. ...+..+.. ++. +.|.++..|. .+.-+|+- ..+..+ T Consensus 288 lpk~KAeqYl~~iM~-k~kNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEI~TLpggqnlgem~D---V~YF~k 363 (521) T protein:vir:10 288 MPNKKATQHLNNVMQ-GLKNRVVYDSSTGKVKNSSNNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGEMDD---VRWFNR 363 (521) T ss_pred CCchhHHHHHHHHHH-hcCceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHH---HHHHHH Confidence 333322222222221 111 011111110 110 1133444432 33344443 458889 Q ss_pred HHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----C-----cccc----- Q lcl|NC_020081. 352 VICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI----V-----SQFG----- 417 (552) Q Consensus 352 ~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~----- 417 (552) .+.++++||.+.|+.. .++|.-..++ .-+.-|-.. ...|.-+..++...|...| + ++.+ T Consensus 364 kLy~aLnVP~sRl~~e-~~~f~~Gr~~--EItRDEikF----~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~ 436 (521) T protein:vir:10 364 KLYESMKIPLSRLPQE-GAGVTFGAGN--DITRDELQF----TKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQA 436 (521) T ss_pred HHHHHhCCCccccCCC-CCceeccccc--chhHHHHHH----HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHh Confidence 9999999999999653 2222111111 111122222 3344445555554444332 2 2222 Q ss_pred cceeecccccChHHHH----------HHHHHHHH--HhcCCcCHHHHHH-HhCCCCCCCCCeeeccccccchhhhccccc Q lcl|NC_020081. 418 GDYVFNFVGGDAKTEA----------EIISILES--KAKIGLTINDIRK-ELGYPDTEGGDVTLAGVHVQRLGQIMQQEQ 484 (552) Q Consensus 418 ~~~~~~f~~~d~~~~~----------~~~~~~~~--~~~g~lT~NE~R~-~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~ 484 (552) ..+.+.|.+...-++. ..++.+.. +..-+++.+=+|+ .|.+.-.+ +...+.+...+ T Consensus 437 ~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~dyi~k~ILr~tDee----------ik~~~k~I~~E- 505 (521) T protein:vir:10 437 ENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHEYVMKNILRMSDED----------IKTEREKIDGE- 505 (521) T ss_pred hcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchHHHHHHHhcCCHhH----------HHHHHHHHHHh- Confidence 3466777666533222 22222211 2223566666665 33432110 00111111100 Q ss_pred cccccCCCCCccCcccCCCCCCCCCCCCCCCc Q lcl|NC_020081. 485 VEYQRQMDANQFLAQQTGYDGNMDNVNGKDSF 516 (552) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (552) .+.+--.++..+ .++. T Consensus 506 ---~~~~~~~~p~~e-------------~~df 521 (521) T protein:vir:10 506 ---LKDSVYKNPEDP-------------MEEF 521 (521) T ss_pred ---hhCCCCCCCcch-------------hhcC Confidence 000000001100 0111 No 219 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=97.35 E-value=8.9e-05 Score=42.79 Aligned_cols=399 Identities=12% Similarity=0.057 Sum_probs=156.2 Q ss_pred hhhccccccccccccccccccccccccCCcccccccCCCCchH--------------HHHHHHhhcch-HHHHHHHH--- Q lcl|NC_020081. 35 AILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNL--------------LQMLKLWSRKN-IILNAIII--- 96 (552) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~Lr~~a~~~-~i~~a~i~--- 96 (552) ++-.+....|-..+ +-+. ...+........ ...++++-++. .++..-.. T Consensus 1 ~~~~~~~~~~~~~~----~~~~---------~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~ 67 (468) T protein:vir:96 1 MIDIFWPNEKPYHE----RVVE---------QIKPQYETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNV 67 (468) T ss_pred CccccCCcCceeeh----heee---------cccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccc Confidence 11111111111100 0000 000111111111 11111111110 01000000 Q ss_pred ------HHH-HH-HHHHHHHHHhhcccc--ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHH Q lcl|NC_020081. 97 ------TRV-NQ-VSMFCTPARNSDKGV--GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKL 166 (552) Q Consensus 97 ------~~~-~~-~~~~~~~~~~~~~~~--~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~ 166 (552) .++ .. +.-+++.+.....++ |-.+.+.- .+......+.+++. ..+......+ T Consensus 68 ~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~-------~d~~~~~~l~~~~~-----------n~~~~~~~~~ 129 (468) T protein:vir:96 68 KGEIDPFKPDWRMYTNYHQNLVDQKVAYAVANPVTYGT-------EDEKSLKTIQEVLN-----------HKWDDKLVDI 129 (468) T ss_pred cccccccccccccccchHHHHHHHHHhhhccCCceecc-------CChHHHHHHHHHHh-----------cCHHHHHHHH Confidence 000 00 011111111111111 11111110 11112233444432 1234556678 Q ss_pred HHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCC--cccccccceeEEEEEcCCceEEEEcccceeeecc---- Q lcl|NC_020081. 167 VRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDED--GKERKAKDGVRYVQVIDDKVVAKFKAKEMAWEVS---- 240 (552) Q Consensus 167 v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~y~~~~~~~~~~~~~~~evi~~~~---- 240 (552) ..+.+.+|.+|..+.++.+|.+ .+..++|..+.++.++. +.... .++|+..........+....+.+.+. T Consensus 130 ~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v~~~~~~~~~~~---~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~ 205 (468) T protein:vir:96 130 LTAASNKGVEWIQPYVDEQGEF-KTFRVPAEQAIPIWTNKERDELKA---FIRLYELDGGERVEYWTANDVTFYELKDGQ 205 (468) T ss_pred HHHHhhcCeEEEEEEEcCCCce-EEEEEcccceEEEEcCCCCCceEE---EEEEEEecCceEEEEEeCCeEEEEEEcCCc Confidence 8999999999999989888875 47788999988876532 22111 12222221111112222222221110 Q ss_pred ----------------------ccc---CC--ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC Q lcl|NC_020081. 241 ----------------------NPR---TD--LTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQ 293 (552) Q Consensus 241 ----------------------~~~---~~--~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~ 293 (552) |+. +. -.+...|.|-++.+...++....+..-..+.+...+.|-.++. +. T Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--g~- 282 (468) T protein:vir:96 206 LIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLK--GY- 282 (468) T ss_pred eeecccccccccccceeeccccccCCcccEEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--cC- Confidence 000 00 0012457787777777777766666666666666677755543 32 Q ss_pred CCCHHHHHHHHHHHHHHhccccccccceeec-cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc Q lcl|NC_020081. 294 EQSNQALTSFRREWTSMFSGINGAWKIPVIT-AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGA 372 (552) Q Consensus 294 ~~s~~~~~~~~~~~~~~~~G~~nagk~~il~-~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~ 372 (552) ..++ .+.+...+. .+++..+. .++.+.+.++.......+....+.+.+.|...-++|..-.. ++ T Consensus 283 ~~~~--~~~~~~~~~--------~~~~i~~~~d~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-----~~ 347 (468) T protein:vir:96 283 EGED--LEEFMYNLK--------YYKAINVDGDGSGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQD-----KF 347 (468) T ss_pred Cccc--cchhhhhhh--------cCceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccccc-----cc Confidence 2222 112222111 12222332 22333333444444566677788899999999998863211 11 Q ss_pred cc-cccccccch-----hHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecccccChHHHHHHHHHHHHHhcC Q lcl|NC_020081. 373 TG-HSGNTLNEG-----SSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNFVGGDAKTEAEIISILESKAKI 445 (552) Q Consensus 373 ~~-~~~~~~~~~-----n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~d~~~~~~~~~~~~~~~~g 445 (552) ++ .++....+. ......+..+...|+-+++.|...+.. ..+ ..+.+.|.+..+.+..+.++++.. .| T Consensus 348 ~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~----~~d~~~i~i~f~~~~p~d~~e~a~~~~~--~g 421 (468) T protein:vir:96 348 GNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKL----SIKVQDVEITFNFNVMVNELEQSQIGVN--SQ 421 (468) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CcccceeeEEecCCCCcCHHHHHHHHHh--cC Confidence 11 111110000 001222333344444444444332211 111 245667777777777776666544 58 Q ss_pred CcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCC Q lcl|NC_020081. 446 GLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNM 507 (552) Q Consensus 446 ~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (552) +||.-.+.++++.- + |. -..+..+...+......+ +... +.+.++++ T Consensus 422 ~iS~et~i~~l~~v--~--D~------~~E~~ri~~E~~~~~~~~----~~~~-~~~~~~~~ 468 (468) T protein:vir:96 422 YLSKETVVTNHPWV--D--DP------VAEMERIDQEELALPSIE----EGLN-GKENNEPT 468 (468) T ss_pred CCchHHHHHhCCCC--C--CH------HHHHHHHHHHHHHHHHHh----hccC-CCCCCCCC Confidence 99988788776431 1 10 011111111111000000 0000 00111111 No 220 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=97.32 E-value=9.6e-05 Score=42.62 Aligned_cols=397 Identities=11% Similarity=0.027 Sum_probs=160.7 Q ss_pred hhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcc-hHHHHHHHHH-------------HH-- Q lcl|NC_020081. 36 ILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRK-NIILNAIIIT-------------RV-- 99 (552) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~-~~i~~a~i~~-------------~~-- 99 (552) +... ....+ -..+...... .-.....|+++-++ ..|+..-... ++ T Consensus 1 ~~~~---------------~~~~~--i~~~~~~~~~--~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ 61 (470) T protein:vir:10 1 MELD---------------ALKKL--IQNTSTSRND--LINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADN 61 (470) T ss_pred CchH---------------HHHHH--HHHHHHHHHH--HHHHHHHHHHHhccccchhccccchhcccccccccccccCCc Confidence 0000 00000 0000000000 00001111111111 0111000000 00 Q ss_pred HHHHHHHHHHHhhcccc--ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCee Q lcl|NC_020081. 100 NQVSMFCTPARNSDKGV--GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKIN 177 (552) Q Consensus 100 ~~~~~~~~~~~~~~~~~--~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~ 177 (552) --+.-+++.+....+++ |-.+.+...+ .+..+.+.+++.. .+...+..+..+++.+|.+| T Consensus 62 ki~~n~~k~Iv~~~~~yl~G~p~~~~~~d-------~~~~~~l~~~~~~-----------~~~~~~~~l~~~~~~~G~a~ 123 (470) T protein:vir:10 62 RIPSNFYQLLVDQEAGYVASVFPDIDVGK-------DADNKKIIDVLGD-----------DRALTLNGLLVDSSNAGRAW 123 (470) T ss_pred ccccchHHHHHHhhhhheeccceeeecCc-------hHHHHHHHHHHhh-----------hHHHHHHHHHHHHhhcCeeE Confidence 00001111222211221 2122221111 1122334444421 23344556778999999999 Q ss_pred EEEEECCCCCEEEEEEecCceeEEEECCC--cccccccceeEEEEEcCC--ce----EEEEcccceeeeccccc------ Q lcl|NC_020081. 178 FELVYDKLGDLHNFKAVDASTVYVAVDED--GKERKAKDGVRYVQVIDD--KV----VAKFKAKEMAWEVSNPR------ 243 (552) Q Consensus 178 ~~i~r~~~G~~~~L~~l~p~~v~v~~~~~--g~~~~~~~~~~y~~~~~~--~~----~~~~~~~evi~~~~~~~------ 243 (552) ..+.++..|++ .+..++|..+.++.++. +... ..++|+..... .. ...++...+.|.+..-. T Consensus 124 ~~~y~d~~~~~-~~~~~~p~~~~~v~d~~~~~~~~---a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~ 199 (470) T protein:vir:10 124 LHYWIDEDGNF-RYGIIQPDQITPIYATTLDNKLL---GILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIE 199 (470) T ss_pred EEEEecCCCce-EEEEEcccceEEEEcCCCCCceE---EEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceecc Confidence 99999999886 47789999999887653 2221 12233322211 11 11222233222211000 Q ss_pred ----------------------CCc---------cCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC Q lcl|NC_020081. 244 ----------------------TDL---------TVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTG 292 (552) Q Consensus 244 ----------------------~~~---------~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ 292 (552) .+. .+..+|.|-++.....++....+..-..+.+...+.|-.+|. +. T Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~--g~ 277 (470) T protein:vir:10 200 PYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSKNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLT--NY 277 (470) T ss_pred ccccccccccccccccccccccccCCCeeeEEEeecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeee--cC Confidence 000 011357788887777777777766666666666666665554 32 Q ss_pred CCCCHHHHHHHHHHHHHHhccccccccceeec----cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccc Q lcl|NC_020081. 293 QEQSNQALTSFRREWTSMFSGINGAWKIPVIT----AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPN 368 (552) Q Consensus 293 ~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~----~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~ 368 (552) ..++ ...+...+.. .+. ..+. +.+.+++-++.......+....+.+.+.|...-++|.. ... T Consensus 278 -~~~~--~~~~~~~~~~-------~~~-i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~--~~~- 343 (470) T protein:vir:10 278 -GGAD--LHQFMNDLRK-------YKS-IKINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDP--ANF- 343 (470) T ss_pred -Cccc--cchhhhhhhh-------cCe-EeccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCC--Ccc- Confidence 1121 1122232221 111 1121 12333333344444455677788899999998888853 211 Q ss_pred cccccccccccccc--hhH---HHHHHHHHHHHhhHHHHHHHHHHHhhcCcccc-cceeecccccChHHHHHHHHHHHHH Q lcl|NC_020081. 369 RGGATGHSGNTLNE--GSS---AEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFG-GDYVFNFVGGDAKTEAEIISILESK 442 (552) Q Consensus 369 ~~t~~~~~~~~~~~--~n~---e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~d~~~~~~~~~~~~~~ 442 (552) .+++.++....+ ... ....+..+..+|+-+++.|...++.. ..+ ..+.+.|.+.-+.+..+.++++... T Consensus 344 --~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~---~~d~~~i~i~f~~~~p~d~~e~~~~~~~~ 418 (470) T protein:vir:10 344 --ESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS---DADKRHISQHWTRTKVEDSLTKAQIVSTV 418 (470) T ss_pred --ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---CcccceeeEEeccCCCCCHHHHHHHHHHH Confidence 122222211111 111 12233344455555555554444321 111 3466778777777777777766554 Q ss_pred hcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCC Q lcl|NC_020081. 443 AKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKD 514 (552) Q Consensus 443 ~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (552) .|+||.--+++++++ +++-+ ..+..+...+..... ..++.. +.+...+++++ T Consensus 419 -~g~iS~et~l~~~p~--v~D~~--------~E~eri~~E~~e~~~--------~~~~~~-~~~~~~~dde~ 470 (470) T protein:vir:10 419 -ANYSSKEAVAKANPI--VDDWQ--------QELKDLAKDKEENDP--------YSNQAD-ELNGKGVNDEQ 470 (470) T ss_pred -hccCcHHHHHHhCCC--CCCHH--------HHHHHHHHHHHHHHH--------hhcccc-ccCCCCCCCCC Confidence 588998877777654 22100 011111111100000 000000 00000000000 No 221 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=97.24 E-value=0.00012 Score=42.12 Aligned_cols=429 Identities=15% Similarity=0.170 Sum_probs=173.6 Q ss_pred CCCCCC---Ccccccchhh----cccccCcccccccccchhhhhccccccccccccccccccccccccCCcccccccCCC Q lcl|NC_020081. 1 MGLLDG---FFKGRKQQDN----IIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHG 73 (552) Q Consensus 1 ~~~~~~---~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (552) ||++++ -||+.-+..+ |-.+.+...........++|.. + .+-|. +.. +.+...+. .+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~-~-------~~~y~----g~~---~~~~~~~~-~~ 64 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQT-D-------LDYYS----DKL---QYIHYQAS-DG 64 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHH-H-------HHHhc----CCC---cccccccC-CC Confidence 998662 2344333322 2111111111111111111110 0 00010 000 01111111 01 Q ss_pred CchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCC Q lcl|NC_020081. 74 KQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDND 153 (552) Q Consensus 74 ~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~p 153 (552) .... .++ .. .-+-+.++.. .+.+ ..+-...+...+. ......|.+++.. T Consensus 65 ~~~~--~~~-~s--ln~~~~i~~~----~A~l-------v~~e~~~i~v~~~--------~~~~e~l~~il~~------- 113 (508) T protein:vir:15 65 IKKK--RLK-NT--INMAKTAARR----IASV-------VFNEKAEIHVKDN--------NEADKFLNDVLED------- 113 (508) T ss_pred Cccc--cce-ee--cchHHHHHHH----HHhh-------hhCCCceEEeCCc--------hHHHHHHHHHHHh------- Confidence 1000 000 00 0001111111 1111 1111112222111 1112234444432 Q ss_pred CccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccc-----------cceeEEEE-- Q lcl|NC_020081. 154 FTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKA-----------KDGVRYVQ-- 220 (552) Q Consensus 154 n~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~-----------~~~~~y~~-- 220 (552) -.+..-++..+.+.+..|.+++.+..|.. . ..+.++++..+.++..+.|++... ....+|.. T Consensus 114 ---n~f~~~~~~~~e~a~a~G~~~~k~~~d~~-~-~~i~~v~ad~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE 188 (508) T protein:vir:15 114 ---NDFKNKFEEALEKGVALGGFAMRPYIDGN-H-IKIAWVRADQFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLE 188 (508) T ss_pred ---ccHHHHHHHHHHHHhhcCceEEEEEEeCC-e-eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEE Confidence 12445566778889999999999888754 3 457778888877754444432100 00111110 Q ss_pred ---EcCC-ceE---EEEcc----------------------cce----------eeecccccCC--ccCCcccccHHHHH Q lcl|NC_020081. 221 ---VIDD-KVV---AKFKA----------------------KEM----------AWEVSNPRTD--LTVGKYGYPELEIA 259 (552) Q Consensus 221 ---~~~~-~~~---~~~~~----------------------~ev----------i~~~~~~~~~--~~~g~~G~spl~~~ 259 (552) ..++ ... ..|.. .++ +|++. +..+ ....++|+|.+.-+ T Consensus 189 ~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~~p~f~y~~~-~~~N~~~~~splG~S~~~~~ 267 (508) T protein:vir:15 189 FHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQRPLFAYFKT-PGANNINIESPLGLGVVDNA 267 (508) T ss_pred EEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCcceEecCCCcceeEEecC-CccccccCCCCcCCchHhhh Confidence 0000 000 00100 011 12221 1111 11347899999999 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC-CCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchh Q lcl|NC_020081. 260 LNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQ-EQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSK 338 (552) Q Consensus 260 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~ 338 (552) ...+............-|+.| .+..++ +... ..+++....+... ...|.+. ..-...|..++.++.... T Consensus 268 ~~lid~lD~~~s~~~~e~~~~-~~~i~v--~~~~l~~d~~~~~~~~~~-~~~~~~~------~~~~~~~~~i~~~~~~ir 337 (508) T protein:vir:15 268 KHVLDDINDTHDQFIWEIRLG-QKHIAV--QPGMLRFDDEHKPTFDTE-QNVYVGV------LSDDNNGLGVKDMTTPIR 337 (508) T ss_pred HHHHHHHHHHHHHHHHHHHhc-ccceee--chHHhcCCCCCccccCCC-CeeEEec------cCCCCCCCceeEeecccC Confidence 999998888877777777644 444333 2111 0011110001000 0111110 001123445677777778 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCHHHhccccccccccccc---ccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhh-cCc Q lcl|NC_020081. 339 DMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSG---NTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKY-IVS 414 (552) Q Consensus 339 d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~---~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~-L~~ 414 (552) +-++.+..+...+.|....|++|..+|+...+..++... .+..+.+. ...+..++.+|..++..|-...+.. +.. T Consensus 338 ~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~-~~~~~~~~~al~~lv~~il~l~~~~~~~~ 416 (508) T protein:vir:15 338 TVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTR-SSYLTMVEKAIDELCQSIFELANAGALFD 416 (508) T ss_pred hHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 888999999999999999999999998754432211110 01111111 1233345556666655554443321 111 Q ss_pred -----------ccccceeecccccChHHHH-HHHHHHHHHhcCCcCHHHHHHHh-CCCCCCCCCeeeccccccchhhhcc Q lcl|NC_020081. 415 -----------QFGGDYVFNFVGGDAKTEA-EIISILESKAKIGLTINDIRKEL-GYPDTEGGDVTLAGVHVQRLGQIMQ 481 (552) Q Consensus 415 -----------~~~~~~~~~f~~~d~~~~~-~~~~~~~~~~~g~lT~NE~R~~~-gl~p~~ggD~~~~~~n~~~~~~~~~ 481 (552) .....+.|.|..+-+.++. +.....+.+.+|+|+.-+++... |+.. + + +....+ T Consensus 417 ~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s~e~~i~~~~g~~d-e--e----------a~~el~ 483 (508) T protein:vir:15 417 DGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALSKQTFLQRNYGMTD-E--Q----------AAEELA 483 (508) T ss_pred ccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCh-H--H----------HHHHHH Confidence 0112345666655444333 33344455667899999987653 4432 1 0 000000 Q ss_pred ccccccccCCCCCccCcccCCCCCCCCCCCCC Q lcl|NC_020081. 482 QEQVEYQRQMDANQFLAQQTGYDGNMDNVNGK 513 (552) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (552) ....+. .... ...+..+..+-.+|+ T Consensus 484 ri~~E~---~~~~----~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 484 KIQSEA---PTDT----FEGGRSAILNGGDGE 508 (508) T ss_pred HHHHhc---cccC----ccccccccCCCCCCC Confidence 000000 0000 000001111111111 No 222 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=97.19 E-value=0.00013 Score=41.80 Aligned_cols=390 Identities=11% Similarity=-0.005 Sum_probs=154.4 Q ss_pred cccCCc------ccccccCCCCchHHHHHHHhhcch-HHHHHHHH-----------H----HH--H--HHHHHHHHHHhh Q lcl|NC_020081. 59 MSMNPD------FKEAPSIHGKQNLLQMLKLWSRKN-IILNAIII-----------T----RV--N--QVSMFCTPARNS 112 (552) Q Consensus 59 ~~~~~~------~~~~~~~~~~~~~~~~Lr~~a~~~-~i~~a~i~-----------~----~~--~--~~~~~~~~~~~~ 112 (552) |....- +....+. ...-...++++-++. .|+..-.. . +. + -+.-+++.+... T Consensus 1 ~~~e~~~~~i~~~~~~~~~--~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~ 78 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGK--FVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQ 78 (471) T ss_pred CCHHHHHHHHHHHHHHHHH--HHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHh Confidence 100000 0000000 000011111111110 00000000 0 00 0 000011111111 Q ss_pred cccc--ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECC-CCCEE Q lcl|NC_020081. 113 DKGV--GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDK-LGDLH 189 (552) Q Consensus 113 ~~~~--~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~-~G~~~ 189 (552) ..++ |-.+.+.-. +.+....+..++. | .+......+..+.+.+|.+|..+.++. +|++ T Consensus 79 ~~~yl~G~p~~~~~~-------~~~~~~~l~~~~~--------n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~- 139 (471) T protein:vir:10 79 KKAYALTYPPTFDVD-------DKKVNDMIVDVLG--------D---DYERISKQLCVNAGNAGIAWLHVWKDASDNSF- 139 (471) T ss_pred hhhhhcccCceeccC-------ChHHHHHHHHHHh--------c---CHHHHHHHHHHHHhhCCeEEEEEEeeCCCCee- Confidence 1111 111111111 1112223333331 1 234456667889999999999999885 5664 Q ss_pred EEEEecCceeEEEECCCcccccccceeEEEEEcC--Cce----EEEEcccceeeecccc--------------------- Q lcl|NC_020081. 190 NFKAVDASTVYVAVDEDGKERKAKDGVRYVQVID--DKV----VAKFKAKEMAWEVSNP--------------------- 242 (552) Q Consensus 190 ~L~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~--~~~----~~~~~~~evi~~~~~~--------------------- 242 (552) .+..++|..+.++.++.+... ....++|+.... +.. ...+..+.+.|++... T Consensus 140 ~~~~~~p~~~~~i~d~~~~~~-~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (471) T protein:vir:10 140 RYACVDSKEVIPIYSKSLDKK-SIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNG 218 (471) T ss_pred EEEEEcccceEEEEcCCCCCc-eEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccc Confidence 578899999988887643210 111223322211 110 1112222222221100 Q ss_pred -------cCCcc---------CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_020081. 243 -------RTDLT---------VGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRRE 306 (552) Q Consensus 243 -------~~~~~---------~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~ 306 (552) ..+.. +...|.|-++.....++....+..-..+.+...+.|-.+++ +. ...+ .+.+... T Consensus 219 ~~~~~~~~~~~~g~iPvv~~~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~--g~-~~~~--~~~~~~~ 293 (471) T protein:vir:10 219 DRSSDNSFKHDFGLVPFIPFKNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLT--NY-GGQD--KQEFLED 293 (471) T ss_pred cccccccccCCCCceeEEEeccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeee--cC-Cccc--cchhHHH Confidence 00000 11347777777777777666666556666666666644443 32 1111 1122222 Q ss_pred HHHHhccccccccceeec----cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccc Q lcl|NC_020081. 307 WTSMFSGINGAWKIPVIT----AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNE 382 (552) Q Consensus 307 ~~~~~~G~~nagk~~il~----~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~ 382 (552) +.. . +...+. +.+.+++-++.......+....+.+.+.|...-++|..-.. .+++.++....+ T Consensus 294 ~~~-------~-~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~-----~~gn~Sg~Alk~ 360 (471) T protein:vir:10 294 LKR-------Y-KMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETD-----KLGNSSGVALKF 360 (471) T ss_pred hhc-------C-CeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcc-----cccCccHHHHHH Confidence 211 1 111221 12223333344444456678888999999999998854221 121221111111 Q ss_pred h--hH---HHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhC Q lcl|NC_020081. 383 G--SS---AEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELG 457 (552) Q Consensus 383 ~--n~---e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~g 457 (552) . .. -...+..+...|+-+++.|...+.. .....+.+.|.+..+.+..+.++++... .|+||.--+.++++ T Consensus 361 ~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~----~d~~~i~i~f~~~~p~n~~e~~~~~~kl-~g~iS~et~~~~~p 435 (471) T protein:vir:10 361 LYSLLELKAGNMETQFRSGYATLVKMILKHLGL----SDKLKIKQTWTRNSINNDTEMAQVVSTL-ATITSRENVAKSNP 435 (471) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----CCCceeEEEeCCCCCCCHHHHHHHHHHH-hccCchHHHHHhCC Confidence 1 10 1222333344444444444433321 1123567778777777777777766654 58899888877764 Q ss_pred CCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCC Q lcl|NC_020081. 458 YPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNV 510 (552) Q Consensus 458 l~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (552) . +++-+ ..+..+...+..... ......+..+++... T Consensus 436 ~--v~D~~--------~E~eri~~E~~~~~~-------~~~~~~~~~~~~e~~ 471 (471) T protein:vir:10 436 I--VEDWQ--------DELRLQKAEQEGRSE-------KLYDMEEVEHESEVE 471 (471) T ss_pred C--CCCHH--------HHHHHHHHHHHHHHh-------cccccCCCCCccccC Confidence 4 21100 111111111110000 000000000000000 No 223 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=97.19 E-value=0.00014 Score=41.75 Aligned_cols=443 Identities=11% Similarity=0.115 Sum_probs=191.1 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccccccccccccC---Ccc-cccccCCCCch Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMN---PDF-KEAPSIHGKQN 76 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~-~~~~~~~~~~~ 76 (552) |-.+. ++-...|+..+.... .+-..++.-.++.......-.....++ +.| -..+...+... T Consensus 8 ~~~~~-~~d~~~~~e~~~~~~--------------~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~e 72 (521) T protein:vir:65 8 LARWA-DFDNDKYEEQIKDKA--------------ESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQ 72 (521) T ss_pred hhhcc-CchhhHHHhhhccCC--------------CcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHH Confidence 22222 222222222221000 000111112222211111000000111 111 12333444455 Q ss_pred HHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHhcCCCCCCCc Q lcl|NC_020081. 77 LLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEKTGRIDNDFT 155 (552) Q Consensus 77 ~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~~n~~~~pn~ 155 (552) +.+..|.++..+.+-.|+-.+..+++ .+....-+..+-+.+. +.+..-+++|.. +..++..++. T Consensus 73 LI~~YR~ma~~pEvd~Av~eIVneai-------v~d~~~~pV~l~L~~~--~~s~~iK~kI~eeF~~Il~ll~F------ 137 (521) T protein:vir:65 73 LVNTYRGLMNNHEVENAVQNIVNDAI-------VFEEGHEVVSLNLEAT--GFSESVKERIHEEFKDLLNTIQF------ 137 (521) T ss_pred HHHHHHHHhhccchhhHHHHhhccee-------EecCCCceEEEEeccc--ccchHHHHHHHHHHHHHHHHhcc------ Confidence 66777888888888777766544443 1223344455555433 344554555543 4444444332 Q ss_pred cCCHHHHHHHHHHHHHhcCCeeEEEEECC--CCCEEEEEEecCceeEEEEC-----CCcccccccceeEEEEEc------ Q lcl|NC_020081. 156 RDNFRSFVKKLVRDRLTYDKINFELVYDK--LGDLHNFKAVDASTVYVAVD-----EDGKERKAKDGVRYVQVI------ 222 (552) Q Consensus 156 ~~t~~~f~~~~v~d~ll~Gna~~~i~r~~--~G~~~~L~~l~p~~v~v~~~-----~~g~~~~~~~~~~y~~~~------ 222 (552) ....++ +++.+.+.|..|+.++-|. ...+.+|.+|||..|+.++. ..|.........+|+... T Consensus 138 ~~~~~~----~fR~WYVDgRi~fhkiid~~pk~GI~ELr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~ 213 (521) T protein:vir:65 138 DRRGQD----MFRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYC 213 (521) T ss_pred chhhhH----HHhhhhhcceeEEEEEEcCCccccceeeeeeCCcceeeeeeecccccCCcceecceeeeeeeecCCccee Confidence 123444 4567778899999999553 35699999999999987543 112111111111121111 Q ss_pred ------CCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCC Q lcl|NC_020081. 223 ------DDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQS 296 (552) Q Consensus 223 ------~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s 296 (552) .......++.+-|.+..-+. ...++..=+|-|..|...+.....++....-|=-.-+.=+-|..+..+.... T Consensus 214 ~~g~~~~~~~~vkI~~dAI~y~hSGl--~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk 291 (521) T protein:vir:65 214 AGGQVFSPNSRVKIPRSAITYAHSGL--MDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNN 291 (521) T ss_pred ccceeecCCcceeechhheeeeeccc--eeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCc Confidence 11122334444444432222 2334445578888998888888877776654444444445566555443333 Q ss_pred HHHHHHHHHHHHHHhcc----------ccccccc-eeec---------cCCceeeecc--CchhHHHHHHHHHHHHHHHH Q lcl|NC_020081. 297 NQALTSFRREWTSMFSG----------INGAWKI-PVIT---------AEDVKFVNMT--QSSKDMEFEKWLNYLINVIC 354 (552) Q Consensus 297 ~~~~~~~~~~~~~~~~G----------~~nagk~-~il~---------~~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia 354 (552) ..+-+=++.-+ ..|.. ..+..+. .++. +.|.++..|. .+.-+|+- ..+..+.+. T Consensus 292 ~KAeqYl~~im-~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kkLy 367 (521) T protein:vir:65 292 RKAAQHMNSVA-QSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDD---IRYFNRKLY 367 (521) T ss_pred hhHHHHHHHHH-HhcCceeEeecccccccccccccchhhhhcccccCCCCccceeecccCCCcChHHH---HHHHHHHHH Confidence 33322222222 22221 1111111 1111 1134444442 33444444 458889999 Q ss_pred HHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHh----hcC-----cccc-----cce Q lcl|NC_020081. 355 SIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNK----YIV-----SQFG-----GDY 420 (552) Q Consensus 355 ~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~----~L~-----~~~~-----~~~ 420 (552) ++++||.+.|+....+.+....++-. +.-|-.. ...|.-+..++...|.. .|+ ++.+ ..+ T Consensus 368 ~aLnVP~sRl~~e~~~~~~~gr~~EI--tRDEiKF----~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I 441 (521) T protein:vir:65 368 EALRVPLSRSNLSDANMVIGGDGSEI--TRDELEF----SKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDREINNI 441 (521) T ss_pred HHhCCCceeccCCCCcceeccccchh--hHHHHHH----HHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcc Confidence 99999999997655544432221111 1122222 33344455555444443 322 2222 236 Q ss_pred eecccccChHH----------HHHHHHHHHHHhcCCcCHHHHHHH-hCCCCCCCCCeeeccccccchhhhcccccccccc Q lcl|NC_020081. 421 VFNFVGGDAKT----------EAEIISILESKAKIGLTINDIRKE-LGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQR 489 (552) Q Consensus 421 ~~~f~~~d~~~----------~~~~~~~~~~~~~g~lT~NE~R~~-~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~ 489 (552) .+.|.+...-+ |...++.+..++.-.+|.+=+|+. |.+--.+ +...+.+.+. ..+ T Consensus 442 ~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~k~ILr~tDee----------i~~~~k~I~~----E~~ 507 (521) T protein:vir:65 442 KVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTDDQ----------MDTEKKQIEE----EAN 507 (521) T ss_pred eEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHH----------HHHHHHHHHH----hhh Confidence 67776655322 222333333333335577766653 3332110 0000111110 001 Q ss_pred CCCCCccCcccCCC Q lcl|NC_020081. 490 QMDANQFLAQQTGY 503 (552) Q Consensus 490 ~~~~~~~~~~~~~~ 503 (552) .+--.++..+.++- T Consensus 508 ~~~~~~p~~~~~~f 521 (521) T protein:vir:65 508 DPRFKQTPDEIEDF 521 (521) T ss_pred CCCCCCCcccccCC Confidence 11111111111111 No 224 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=97.12 E-value=0.00016 Score=41.37 Aligned_cols=441 Identities=11% Similarity=0.122 Sum_probs=189.0 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhcccccccccccccccc---c--cccccccCCccc-ccccCCCC Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEE---P--IIGSMSMNPDFK-EAPSIHGK 74 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~--~~~~~~~~~~~~-~~~~~~~~ 74 (552) |-.+. +|-.+.++..|. ....+-..++.-.+....... | ..+.+ .+.|. ..+...+. T Consensus 8 ~~~~~-~~~~~~~~~~~~--------------~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~--~~~~~~~e~~~~~~ 70 (521) T protein:vir:81 8 LARWA-DFDNDKYEEQIK--------------DKAESIAAPKNNDGATEVEINDNLPASAWNSL--TQQFYSTDQKISTT 70 (521) T ss_pred hHhhc-CchhhhHHhhhc--------------cCccccccCCCCCCceEecccCCCcceeecce--eeeecccccchhhH Confidence 22222 111111211110 000011111111221111000 0 01111 11111 23334444 Q ss_pred chHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHhcCCCCCC Q lcl|NC_020081. 75 QNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEKTGRIDND 153 (552) Q Consensus 75 ~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~~n~~~~p 153 (552) ..+.+..|.++..+.+-.|+-.+..+++ .+.....+..+-+.+. +.+..-+++|.. +..++..++. T Consensus 71 ~eLI~~YR~ma~~pEvd~Av~eIVneai-------v~d~~~~pV~l~L~~~--~~s~~iK~kI~eeF~~Il~ll~F---- 137 (521) T protein:vir:81 71 KQLVNTYRGLMNNHEVENAVQNIVNDAI-------VFEEGHEVVSLNLEAT--GFSESVKERIHEEFKDLLNTIQF---- 137 (521) T ss_pred HHHHHHHHHHhhccchhhHHHHhhccee-------EecCCCceEEEEeccc--ccchHHHHHHHHHHHHHHHHhcc---- Confidence 5566777888888888777766544443 1223444455555433 345554555543 4444444332 Q ss_pred CccCCHHHHHHHHHHHHHhcCCeeEEEEECC--CCCEEEEEEecCceeEEEEC-----CCcccccccceeEEEEEc---- Q lcl|NC_020081. 154 FTRDNFRSFVKKLVRDRLTYDKINFELVYDK--LGDLHNFKAVDASTVYVAVD-----EDGKERKAKDGVRYVQVI---- 222 (552) Q Consensus 154 n~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~--~G~~~~L~~l~p~~v~v~~~-----~~g~~~~~~~~~~y~~~~---- 222 (552) ....++ +++.+.+.|..|+.++-|. ...+.+|.+|||..|+.++. ..|.........+|+... T Consensus 138 --~~~~~~----~fR~WYVDgRi~fhkiid~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~ 211 (521) T protein:vir:81 138 --DRRGQD----MFRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSS 211 (521) T ss_pred --chhhhH----HHhhhhhcceEEEEEEEcCCccccceeeeeeCCcceeeeeeecccccCccceecceeeeeeeecCCcc Confidence 123444 4567778899999999553 35699999999999987543 112111111111121111 Q ss_pred --------CCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC Q lcl|NC_020081. 223 --------DDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQE 294 (552) Q Consensus 223 --------~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~ 294 (552) .......++.+-|.+..-+. ...++..=+|-|..|...+.....++....-|=-.-+.=+-|..+..+.. T Consensus 212 ~~~~g~~~~~~~~vkI~~dAI~y~hSGl--~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnl 289 (521) T protein:vir:81 212 YCAGGQVFSPNSRVKIPRSAITYAHSGL--MDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNM 289 (521) T ss_pred ccccceeecCCcceeechhheeeeeccc--eeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCC Confidence 11122334444444332222 23334455688888888888888777766544444444455665554433 Q ss_pred CCHHHHHHHHHHHHHHhcc----------ccccccc-eeec---------cCCceeeecc--CchhHHHHHHHHHHHHHH Q lcl|NC_020081. 295 QSNQALTSFRREWTSMFSG----------INGAWKI-PVIT---------AEDVKFVNMT--QSSKDMEFEKWLNYLINV 352 (552) Q Consensus 295 ~s~~~~~~~~~~~~~~~~G----------~~nagk~-~il~---------~~g~~~~~l~--~~~~d~q~~e~~~~~~~~ 352 (552) ....+-+=++.-+ ..|.. ..+..+. .++. +.|.++..|. .+.-+|+- ..+..+. T Consensus 290 pk~KAeqYl~~im-~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V~YF~kk 365 (521) T protein:vir:81 290 NNRKAAQHMNSVA-QSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDD---IRYFNRK 365 (521) T ss_pred CchhHHHHHHHHH-HhcCceeEeecccccccccccccchhhhhcccccCCCcccceeecccCCCCChHHH---HHHHHHH Confidence 3333322222222 22221 1111111 1111 1134444442 33444444 4588899 Q ss_pred HHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHh----hcC-----cccc-----c Q lcl|NC_020081. 353 ICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNK----YIV-----SQFG-----G 418 (552) Q Consensus 353 Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~----~L~-----~~~~-----~ 418 (552) +.++++||.+.|+....+++....++-. +.-|-.. ...|.-+..++...|.. .|+ ++.+ . T Consensus 366 Ly~aLnVP~sRl~~e~~~~~~~Gr~~EI--tRDEiKF----~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~ 439 (521) T protein:vir:81 366 LYEALRVPLSRSNLSDANMVIGGDGSEI--TRDELEF----SKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDREIN 439 (521) T ss_pred HHHHhCCccccccCCCCcceeccccchh--hHHHHHH----HHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhh Confidence 9999999999996554444332221111 1122222 33344445555444443 322 2222 2 Q ss_pred ceeecccccChHH----------HHHHHHHHHHHhcCCcCHHHHHHH-hCCCCCCCCCeeeccccccchhhhcccccccc Q lcl|NC_020081. 419 DYVFNFVGGDAKT----------EAEIISILESKAKIGLTINDIRKE-LGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEY 487 (552) Q Consensus 419 ~~~~~f~~~d~~~----------~~~~~~~~~~~~~g~lT~NE~R~~-~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~ 487 (552) .+.+.|.+...-+ |...++.+..++.-.++.+=+|+. |.+--.+ +...+.+.+. . T Consensus 440 ~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDee----------i~~~~k~I~~----E 505 (521) T protein:vir:81 440 NIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTDDQ----------MDTEKKQIEE----E 505 (521) T ss_pred cceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHH----------HHHHHHHHHH----H Confidence 3667776655322 222233333333334566666653 3332110 0000111110 0 Q ss_pred ccCCCCCccCcccCCC Q lcl|NC_020081. 488 QRQMDANQFLAQQTGY 503 (552) Q Consensus 488 ~~~~~~~~~~~~~~~~ 503 (552) .+.+--.++..+.++- T Consensus 506 ~~~~~~~~p~~~~~~f 521 (521) T protein:vir:81 506 ANDPRFKQTPDEIEDF 521 (521) T ss_pred hhCCCCCCCcccccCC Confidence 0111111111111111 No 225 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=96.87 E-value=0.00029 Score=39.99 Aligned_cols=470 Identities=14% Similarity=0.120 Sum_probs=183.7 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhccccccccccccccc---cccccccccCCcc-cccccCCCCch Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYE---EPIIGSMSMNPDF-KEAPSIHGKQN 76 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~-~~~~~~~~~~~ 76 (552) |-.| +...+++.. +...+..|..+.... .++... .+.+.| -..+...+... T Consensus 1 m~~l----------------------fgf~~~~~~--~~~~~~~s~~~p~~ddg~~~~~~~-g~~~~~~~~~~~~~~~~e 55 (558) T protein:vir:10 1 MAKL----------------------FGFSIEETQ--KKSTSIISPVPKNNEDGVDNFISS-GFYGQYVDIEGAYRSEYD 55 (558) T ss_pred Ccch----------------------hcchhhhhh--hhccCCccccCCCccccccceecc-ceeeeeecccchhhhHHH Confidence 2222 222221110 000011111111111 122111 112222 22334444455 Q ss_pred HHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHhcCCCCCCCc Q lcl|NC_020081. 77 LLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEKTGRIDNDFT 155 (552) Q Consensus 77 ~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~~n~~~~pn~ 155 (552) +.+..|.++..+.+-.|+-.+..+++ .+.....+..+-+.+.+. +..-+++|.. +..+++.++. T Consensus 56 LI~~YR~ma~~pEvd~Av~eIVneai-------v~d~~~~pV~i~Ld~~~~--s~~iK~kI~eEF~~Il~ll~F------ 120 (558) T protein:vir:10 56 LIRRYREMALHPEADGAIEDVVNEAI-------VSDLYDSPVEVELSNLNA--SNTLKKKIREEFRYIKEMMDF------ 120 (558) T ss_pred HHHHHHHHhhccchhhHHHHhhccee-------EecCCCceEEEEecccCc--chHHHHHHHHHHHHHHHHhcc------ Confidence 66777888888888777766544433 223344455565554443 3334445433 4444444332 Q ss_pred cCCHHHHHHHHHHHHHhcCCeeEEEEECC---CCCEEEEEEecCceeEEEECC----------------Cccccccccee Q lcl|NC_020081. 156 RDNFRSFVKKLVRDRLTYDKINFELVYDK---LGDLHNFKAVDASTVYVAVDE----------------DGKERKAKDGV 216 (552) Q Consensus 156 ~~t~~~f~~~~v~d~ll~Gna~~~i~r~~---~G~~~~L~~l~p~~v~v~~~~----------------~g~~~~~~~~~ 216 (552) ....++ +++.+.+.|..|+.++-|. ...+.+|.+|||..|+.++.- .+.... +... T Consensus 121 ~~~~~e----~fR~WYVDgRiyfHKiid~k~pk~GI~ELr~lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~-~~~~ 195 (558) T protein:vir:10 121 DKKSHE----IFRNWYVDGRVFYLKVIDTKNPQEGIQDLRYIDPLKIKFIRQEKRKPGNQDPAIRVRSEQDVVPN-PEFE 195 (558) T ss_pred chhhhH----HHhhheeeeEEEEEEEEeCCCccccceeeeeeCcccceeeeeeccccccccceeeeecccceeec-ccee Confidence 123344 4567778899999998653 335899999999999765442 111111 1111 Q ss_pred EEEEEcCC-------------ceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCC Q lcl|NC_020081. 217 RYVQVIDD-------------KVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTT 283 (552) Q Consensus 217 ~y~~~~~~-------------~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p 283 (552) .|+.+..+ +....++.+-|.+..... ...++.+=+|-|..|...+.....++....-|=-.-+.= T Consensus 196 eyy~Y~~~~~~~~~~~~~~~~~~~vkI~~dAI~y~hSGL--~d~~~~~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPE 273 (558) T protein:vir:10 196 EFYIYTPKVQHPTGMVGQMGGKNSIKIAKDSITMCTSGL--VDRNKNRVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPE 273 (558) T ss_pred EeeeecCCcccccccceeecCCCceeechhheeeecccc--eecCCCeeeecchHhhHhHHhhHHHHhhHHHHhhhcccc Confidence 12222111 111233333333222111 112334456778888888887777777655443333444 Q ss_pred ceEEEeCCCCCCCHHHHHHHHHHHHHHhcc----------ccccccc-eeec---------cCCceeeec--cCchhHHH Q lcl|NC_020081. 284 RGLLHIKTGQEQSNQALTSFRREWTSMFSG----------INGAWKI-PVIT---------AEDVKFVNM--TQSSKDME 341 (552) Q Consensus 284 ~gil~~~~~~~~s~~~~~~~~~~~~~~~~G----------~~nagk~-~il~---------~~g~~~~~l--~~~~~d~q 341 (552) +-|..+..+......+-+=++.-+ ..|.. ..+..+. .++. +.|.++..| +.+.-+|+ T Consensus 274 RRvFYIDVGnLPk~KAeqYlr~iM-~k~KNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnLgem~ 352 (558) T protein:vir:10 274 RRIFYIDVGNLPKVKAEQYLKEVM-SRYRNKLVYDANTGEVRDDRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGELS 352 (558) T ss_pred ceEEEEecCCCCchhHHHHHHHHH-HhccceEEEeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcchHH Confidence 555555444332322222222222 11210 0000000 0110 013344443 23334444 Q ss_pred HHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----C---- Q lcl|NC_020081. 342 FEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI----V---- 413 (552) Q Consensus 342 ~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L----~---- 413 (552) - ..+..+.+.++++||.+.|+.. ++|....+. --+.-|-. +...|.-+..++...|...| + T Consensus 353 D---V~YF~kKLy~aLnVP~SRl~~e--~~f~~Gr~~--EItRDEiK----F~KFI~RLR~rFs~lF~~~Lk~qLilKgi 421 (558) T protein:vir:10 353 D---VDYFQKKLYRALGVPESRIAAE--GGFNLGRSS--EILRDELK----FAKFVGRLRKRFAAMFNDMLKTQLVLKNI 421 (558) T ss_pred H---HHHHHHHHHHHhCCCccccCCC--Ccccccccc--hhhHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhhhccC Confidence 3 4588899999999999999743 233221111 11111222 23344445555555444332 2 Q ss_pred -cccc-----cceeecccccChH----------HHHHHHHHHHHHhcCCcCHHHHHH-HhCCCCCCCCCeeeccccccch Q lcl|NC_020081. 414 -SQFG-----GDYVFNFVGGDAK----------TEAEIISILESKAKIGLTINDIRK-ELGYPDTEGGDVTLAGVHVQRL 476 (552) Q Consensus 414 -~~~~-----~~~~~~f~~~d~~----------~~~~~~~~~~~~~~g~lT~NE~R~-~~gl~p~~ggD~~~~~~n~~~~ 476 (552) ++.+ ..+.+.|.+...- .|...++.+..++.-.+|.+=+|+ .|.+--.+ +... T Consensus 422 it~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDee----------I~~~ 491 (558) T protein:vir:10 422 VTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDME----------IEEI 491 (558) T ss_pred CCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHH----------HHHH Confidence 2222 3466777665432 223333333333333557776665 33432110 0000 Q ss_pred hhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcccccccccccc----ccCccccccc Q lcl|NC_020081. 477 GQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTP----QGGKDDNGNV 546 (552) Q Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~ 546 (552) +.+...+.. ...-.++++ .++- .+.+-...+. ....+.+..+.+.+- ...+++ -+.+.++.|- T Consensus 492 ~kqI~~E~k-~~~~~~p~~---~~~~-~~~~~~~~~~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 558 (558) T protein:vir:10 492 DTQIEDEIQ-KGIIPDPSQ---IDPI-TGEPLPQEGD-PAMEGMGEQPVDPDL-EAQAQAVDAQYSKDTKKAEL 558 (558) T ss_pred HHHHHHHHh-CCCCCCccc---cChh-hccccCccCC-chhccCCCCCccccc-ccchhhhhhhhhhhhhhhcC Confidence 000000000 000000000 0000 0000000000 000000000000000 000000 0111111111 No 226 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=96.86 E-value=0.00029 Score=39.97 Aligned_cols=419 Identities=11% Similarity=0.090 Sum_probs=170.2 Q ss_pred hcccccccccccc-ccccccccccccCCcccccccCCCCchHHHHH---HHh-hcchHHH-HHH-------HHHHH-HHH Q lcl|NC_020081. 37 LKKGKNTKSNKPK-AYEEPIIGSMSMNPDFKEAPSIHGKQNLLQML---KLW-SRKNIIL-NAI-------IITRV-NQV 102 (552) Q Consensus 37 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L---r~~-a~~~~i~-~a~-------i~~~~-~~~ 102 (552) |+-+.+-|....+ .+...+.... ..-...|.+..++...+.+ ++| .-....+ ..- ..... +-. T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~---~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~ 77 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSL---GQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVT 77 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhh---hhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchH Confidence 4333332322111 1100000000 0001112222222111111 111 1111111 000 00000 000 Q ss_pred HHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEE Q lcl|NC_020081. 103 SMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVY 182 (552) Q Consensus 103 ~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r 182 (552) ...|+..+....+-+..+.. .+.+....+.+++.. ..+...++..+.+.+..|.+++.+.. T Consensus 78 ~~i~~~~A~ll~~e~~~i~~---------~d~~~~e~l~~i~~~----------n~f~~~~~~~~e~a~a~G~~~~k~~~ 138 (505) T protein:vir:79 78 KLASAKLASLIFNEQCQVTV---------SDETANDFLDDVFQQ----------NDFYTTFEEKLEEWIALGSGCVRPYV 138 (505) T ss_pred HHHHHHHHhhhcCCCceeec---------CChHHHHHHHHHHHh----------ccHHHHHHHHHHHHhhcCCeEEEEEE Confidence 11111111111111111111 112222334444432 13455667788899999999999888 Q ss_pred CCCCCEEEEEEecCceeEEEECCCcccccc-----------cceeEEEEE-----cCCceE---------------EEEc Q lcl|NC_020081. 183 DKLGDLHNFKAVDASTVYVAVDEDGKERKA-----------KDGVRYVQV-----IDDKVV---------------AKFK 231 (552) Q Consensus 183 ~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~-----------~~~~~y~~~-----~~~~~~---------------~~~~ 231 (552) |. |. ..+.++++..+.++..+.++.... ....+|... .++... ..++ T Consensus 139 D~-~~-~~i~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~ 216 (505) T protein:vir:79 139 DS-GK-IKLAWATADQVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVP 216 (505) T ss_pred eC-Cc-eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccc Confidence 74 44 357778888877754333332110 000011000 000000 0000 Q ss_pred ----------ccc----------eeeecccccCCc--cCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCC----ce Q lcl|NC_020081. 232 ----------AKE----------MAWEVSNPRTDL--TVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTT----RG 285 (552) Q Consensus 232 ----------~~e----------vi~~~~~~~~~~--~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p----~g 285 (552) ..+ ..|++ ++..+. ...++|+|.+.-+...++.......-...-|..|... .. T Consensus 217 l~~~~~~~~l~~~~~~~g~~~p~f~~~~-~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~ 295 (505) T protein:vir:79 217 LNSLEQYEGLEPQVKITGLKHPLFAFYR-NKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAE 295 (505) T ss_pred hhhcccccccCcceeecCCCcceEEEec-CCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechH Confidence 011 12222 121221 1346899999999999988888777777777766443 11 Q ss_pred EEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhc Q lcl|NC_020081. 286 LLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEIN 365 (552) Q Consensus 286 il~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg 365 (552) +|...+.......... .. .+.+.....+.....+++..++.++....+.++.+..+...++|+...|+++..+| T Consensus 296 ~l~~~~~~~~~~~~~~--~~----~fd~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~ 369 (505) T protein:vir:79 296 WLKTGSSYGGQASETH--PP----MFDPDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFT 369 (505) T ss_pred HhcccCCCCccccccc--cc----CCCccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcC Confidence 1211110000000000 00 01010011111111223445777788888889999999999999999999999998 Q ss_pred cccccccccccc---ccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCccc----------ccceeecccccChHHH Q lcl|NC_020081. 366 FPNRGGATGHSG---NTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQF----------GGDYVFNFVGGDAKTE 432 (552) Q Consensus 366 ~~~~~t~~~~~~---~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~----------~~~~~~~f~~~d~~~~ 432 (552) +...+..++... .+..+.+.. ..+..++.+|..++..|-.......+... ...+.|.|..+-+.++ T Consensus 370 ~~~~~~~TAtei~s~~~~l~~t~~-~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~ 448 (505) T protein:vir:79 370 TSPSGIQTATEVVTNNSQTYQTRS-SYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQ 448 (505) T ss_pred CCccccchHHHHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCH Confidence 755432211111 111222222 23334566666666666554433222111 1235566765544443 Q ss_pred H-HHHHHHHHHhcCCcCHHHHHHHh-CCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCC Q lcl|NC_020081. 433 A-EIISILESKAKIGLTINDIRKEL-GYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNV 510 (552) Q Consensus 433 ~-~~~~~~~~~~~g~lT~NE~R~~~-gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (552) . +.....+.+.+|+|+.-+++... |+.. .+.- ..+..+...+ . ...+...+-+ T Consensus 449 ~~~~~~~~~~v~~Gi~s~e~~l~~~~~~~e---eea~------~el~ri~~E~-------~-~~~p~~~~~g-------- 503 (505) T protein:vir:79 449 ESKRAADLQAVQAQVMPKKQFLMRNYGLDE---EEAD------EWLAQIDAEN-------S-TAEPEFNQFG-------- 503 (505) T ss_pred HHHHHHHHHHHHcCCCCHHHHHHhcCCCCh---HHHH------HHHHHHHHhc-------c-ccCCCchhcc-------- Confidence 3 33344455667899998887654 4322 1000 0011110000 0 0000000000 Q ss_pred CCCCCcccccCCC Q lcl|NC_020081. 511 NGKDSFNQNVGKD 523 (552) Q Consensus 511 ~~~~~~~~~~~~~ 523 (552) +| T Consensus 504 -----------g~ 505 (505) T protein:vir:79 504 -----------GD 505 (505) T ss_pred -----------CC Confidence 00 No 227 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=96.80 E-value=0.00033 Score=39.68 Aligned_cols=415 Identities=14% Similarity=0.063 Sum_probs=154.8 Q ss_pred cccCCccc-ccccCCCCchH--------------HHHHHHhhcch-HHHHH---HHHHHHH--HHHHHHHHHHhhcccc- Q lcl|NC_020081. 59 MSMNPDFK-EAPSIHGKQNL--------------LQMLKLWSRKN-IILNA---IIITRVN--QVSMFCTPARNSDKGV- 116 (552) Q Consensus 59 ~~~~~~~~-~~~~~~~~~~~--------------~~~Lr~~a~~~-~i~~a---~i~~~~~--~~~~~~~~~~~~~~~~- 116 (552) +.-+-.+. +.+.......+ .+.++++-++. .+... ....+.+ .+.-+++.+....+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~~~l 80 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITVFEQGYM 80 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccCCcceeecchHHHHHHHHhhhh Confidence 11111111 11111111111 11222221110 01100 0000000 0011122222222211 Q ss_pred -ceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEE----CCCCCEEEE Q lcl|NC_020081. 117 -GYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVY----DKLGDLHNF 191 (552) Q Consensus 117 -~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r----~~~G~~~~L 191 (552) |-.+.+... +......+..++.. | .+..+...+..+.+++|.+|..+.. +..|+ ..+ T Consensus 81 ~g~~~~~~~~-------d~~~~~~l~~~~~~-------n---~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~-~~i 142 (489) T protein:vir:99 81 LGVPVEYKNE-------NKDLQAAIDLMSVR-------N---NEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTE-VKL 142 (489) T ss_pred ccCCceeecC-------ChhHHHHHHHHHhh-------c---ChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcc-eEE Confidence 111111111 11112223334332 1 2345667788999999999987764 34444 458 Q ss_pred EEecCceeEEEECCCcccccccceeEEEEEcCC--c---eEEEEcccceeeecc---------------cccCC-c---- Q lcl|NC_020081. 192 KAVDASTVYVAVDEDGKERKAKDGVRYVQVIDD--K---VVAKFKAKEMAWEVS---------------NPRTD-L---- 246 (552) Q Consensus 192 ~~l~p~~v~v~~~~~g~~~~~~~~~~y~~~~~~--~---~~~~~~~~evi~~~~---------------~~~~~-~---- 246 (552) ..++|..+.++.++..... ..-.++|+....+ . ....+.++.+.++.. |+... | T Consensus 143 ~~~~p~~~~~v~dd~~~~~-~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 221 (489) T protein:vir:99 143 YQLPAEQTFVIYDDTYQRN-SLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPVNEY 221 (489) T ss_pred EEEcccceEEEEcCCCCCc-eEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeEEEe Confidence 8899999988876543210 0011222211111 1 111222222222111 00000 0 Q ss_pred cCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcc------ccccccc Q lcl|NC_020081. 247 TVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSG------INGAWKI 320 (552) Q Consensus 247 ~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G------~~nagk~ 320 (552) .+...|.|.++.+...++....+..-..+.+...+.|-.++. +. .........+...+.....+ ....+++ T Consensus 222 ~n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~--g~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 298 (489) T protein:vir:99 222 ANNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIA--GN-AYTGADENDYLDDGRLNPNGRLAISIGFKKAQV 298 (489) T ss_pred ecCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhc--cC-Ccccccchhhhhhccccccccccccccccccee Confidence 011356677766666666555544444444444444443332 21 11221112222222211110 1112232 Q ss_pred eeecc------CCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHh-cccccccccccccccccc--h---hHHHH Q lcl|NC_020081. 321 PVITA------EDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEI-NFPNRGGATGHSGNTLNE--G---SSAEK 388 (552) Q Consensus 321 ~il~~------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~~~~~~~~~--~---n~e~~ 388 (552) ..+.+ .+.+.+.+.....+..+....+.+.+.|...-++|..-. +.. ++ .++....+ . ..... T Consensus 299 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~--~n---~Sg~Al~~~~~~l~~k~~~ 373 (489) T protein:vir:99 299 LILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKFS--GV---QSGESMKYKLMASDNYREK 373 (489) T ss_pred eeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCccccccccc--cc---chHHHHHHHHHHHHHHHHH Confidence 22211 122333444444455556777888999999999986322 111 11 11111111 0 01223 Q ss_pred HHHHHHHHhhHHHHHHHHHHHhhcCc---ccc-cceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCC Q lcl|NC_020081. 389 YRNSKDKGLEPLLKFIEDAVNKYIVS---QFG-GDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGG 464 (552) Q Consensus 389 ~~~~~~~~l~P~~~~ie~~ln~~L~~---~~~-~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~gg 464 (552) .+..+...|+-+++.|...++..-.. ... .++.+.|.+..+.+.++.++++.+. .|+|+.-.+.++++. +..- T Consensus 374 k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl-~giis~et~~~~l~~--v~~~ 450 (489) T protein:vir:99 374 QERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNL-YGIVSDQTIFEILNT--VTGV 450 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHH-hccCCHHHHHHhcCC--CCch Confidence 33455566666666665555432111 111 2466778777777777777766554 488998888877633 1111 Q ss_pred CeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccc Q lcl|NC_020081. 465 DVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQN 519 (552) Q Consensus 465 D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (552) |.- ..+..+...+. .... ..+..... +.++...++.+ ++ T Consensus 451 d~~------~E~~ri~~E~~---~~~~-~~~~~~~~-~~~~~~~~~~~-----~p 489 (489) T protein:vir:99 451 DAE------AELKRLKEEAD---KKQS-LPEPRLVG-DASGQEEPTAE-----KP 489 (489) T ss_pred hHH------HHHHHHHHHHH---HHhc-cccccccC-CCCCCcCCCCC-----CC Confidence 100 00111111000 0000 00000000 00000000000 00 No 228 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=96.27 E-value=0.0008 Score=37.58 Aligned_cols=444 Identities=13% Similarity=0.118 Sum_probs=181.4 Q ss_pred CCC----CCCCcccccchhhcccccCcccccccccchhhhhccc-ccccccccccccc---ccccccccCCc-cc-cccc Q lcl|NC_020081. 1 MGL----LDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKG-KNTKSNKPKAYEE---PIIGSMSMNPD-FK-EAPS 70 (552) Q Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~---~~~~~~~~~~~-~~-~~~~ 70 (552) |+. +=|+.-.++- ...++..+..+.... ++.-.++...... +.+.-....+. |. -.+. T Consensus 1 m~f~~~~lf~f~~~~de------------~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~ 68 (523) T protein:vir:68 1 MKFNILSLFAPWAKMDE------------RDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPG 68 (523) T ss_pred CCCchhhhhhhhhhhhh------------hhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccc Confidence 322 0011100000 000000011110000 0001111000000 00000000111 11 1233 Q ss_pred CCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHhcCC Q lcl|NC_020081. 71 IHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEKTGR 149 (552) Q Consensus 71 ~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~~n~ 149 (552) ..+...+.+..|.++..+.+-.|+-.+..+++ .+....-+..+-+.+.+ .+..-+++|.. +..++..++. T Consensus 69 ~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai-------v~d~~~~pV~i~Ld~~~--~s~~iK~kI~eeF~~Il~ll~F 139 (523) T protein:vir:68 69 LKSTRELIDTYRNLMTNYEVDNAVSEIVSDAI-------VYEDDTEVVSINLDNTK--FSPNIKSMMLDEFNEVLNHLSF 139 (523) T ss_pred cchHHHHHHHHHHHhhccchhhHHHHhhccee-------eecCCCceEEEEecccc--cchHHHHHHHHHHHHHHHHhcc Confidence 34444556677888888888777766554443 12233344455554433 44444555543 4444443332 Q ss_pred CCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECC---CCCEEEEEEecCceeEEEEC-----CCcccccccceeEEEEE Q lcl|NC_020081. 150 IDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDK---LGDLHNFKAVDASTVYVAVD-----EDGKERKAKDGVRYVQV 221 (552) Q Consensus 150 ~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~---~G~~~~L~~l~p~~v~v~~~-----~~g~~~~~~~~~~y~~~ 221 (552) ....++ +++.+.+.|..|+.++-|. ...+.+|.+|||..|+.++. +.|..... ...-|+.+ T Consensus 140 ------~~~~~~----~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~g~~vi~-~~~e~f~Y 208 (523) T protein:vir:68 140 ------QRKGSD----HFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREVITTTEAGVKIVK-GYKEYFIY 208 (523) T ss_pred ------chhhhH----HHHhheeeeEEEEEEEeeCCCccccceeeeeeCCcceeEEEeecCCCCcchhhhh-hhhhheee Confidence 123444 4567778899999998653 33589999999999976432 22221111 11111111 Q ss_pred cCC-------------ceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Q lcl|NC_020081. 222 IDD-------------KVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLH 288 (552) Q Consensus 222 ~~~-------------~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 288 (552) ... +....++.+-|+|...+ ..+.++..=+|-|..|...+.....++....-|=-.-+.-+-|.. T Consensus 209 ~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSG--L~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFY 286 (523) T protein:vir:68 209 DTSHESYACDGRIYEAGTKIKIPKAAIVYAHSG--LVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWY 286 (523) T ss_pred ccccccccccccccCCCcceecchhheeeeecc--ceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEE Confidence 111 12233444444443322 223334455678888888888777777766544333444455555 Q ss_pred eCCCCCCCHHHHHHHHHHHHHHhc---------c-ccccccce-eec---------cCCceeeecc--CchhHHHHHHHH Q lcl|NC_020081. 289 IKTGQEQSNQALTSFRREWTSMFS---------G-INGAWKIP-VIT---------AEDVKFVNMT--QSSKDMEFEKWL 346 (552) Q Consensus 289 ~~~~~~~s~~~~~~~~~~~~~~~~---------G-~~nagk~~-il~---------~~g~~~~~l~--~~~~d~q~~e~~ 346 (552) +..+......+-+=++.-+. .+. | ..+..+.. ++. +.|.++..|. .+.-+|+- . T Consensus 287 IDvGnlPk~KAeqYl~~im~-k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V 362 (523) T protein:vir:68 287 VDTGNMPSRKAAEHMQHVMN-TMKNRIAYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMED---V 362 (523) T ss_pred EecCCCCchhHHHHHHHHHH-hhcceeEEeccCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHH---H Confidence 55443333322222222221 111 0 01111110 110 1133444442 33444443 4 Q ss_pred HHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----C-----cccc Q lcl|NC_020081. 347 NYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI----V-----SQFG 417 (552) Q Consensus 347 ~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~ 417 (552) .+....+.++++||.+.|.. +.++|.... ++--.....=+...|.-+..++...|...| + ++.+ T Consensus 363 ~YF~kkLy~aLnVP~sRl~~-~~~~f~~Gr------~~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~ee 435 (523) T protein:vir:68 363 RWFRNALYMALRIPITRIPS-DQGGIQFDA------GTSITRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDE 435 (523) T ss_pred HHHHHHHHHHhCCcceeecC-CCcceeccc------ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHH Confidence 58889999999999999953 223222111 111111122223344455555555544333 2 2222 Q ss_pred -----cceeecccccChHH----------HHHHHHHHHHHhcCCcCHHHHHHH-hCCCCCCCCCeeeccccccchhhhcc Q lcl|NC_020081. 418 -----GDYVFNFVGGDAKT----------EAEIISILESKAKIGLTINDIRKE-LGYPDTEGGDVTLAGVHVQRLGQIMQ 481 (552) Q Consensus 418 -----~~~~~~f~~~d~~~----------~~~~~~~~~~~~~g~lT~NE~R~~-~gl~p~~ggD~~~~~~n~~~~~~~~~ 481 (552) ..+.+.|.+...-+ |...++.+..++.-.++.+=+|+. |.+--.+ +...+.+.. T Consensus 436 w~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee----------i~~~~kqI~ 505 (523) T protein:vir:68 436 WNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKYISHRTAMKDILQMSDEE----------IEQEAKQIE 505 (523) T ss_pred HHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHH----------HHHHHHHHH Confidence 34667776655322 222233333333334566666653 3332110 000011111 Q ss_pred ccccccccCCCCCccCcccCCCCCCCCCCCCCCCc Q lcl|NC_020081. 482 QEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSF 516 (552) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (552) . ..+.+--.++...++ +. T Consensus 506 ~----E~k~~~~~~p~~e~~-------------~f 523 (523) T protein:vir:68 506 E----ESKEARFQDPDQEQE-------------DF 523 (523) T ss_pred H----HhhcCCCCCCchhhh-------------cC Confidence 0 001111111111111 11 No 229 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=96.00 E-value=0.0011 Score=36.71 Aligned_cols=445 Identities=12% Similarity=0.109 Sum_probs=184.5 Q ss_pred CCC--CCCCcccccchhhcccccCcccccccccchh-hhhccccccccccccccccccccccccCCc-------cc-ccc Q lcl|NC_020081. 1 MGL--LDGFFKGRKQQDNIIDINDDMAVRIKQIEED-AILKKGKNTKSNKPKAYEEPIIGSMSMNPD-------FK-EAP 69 (552) Q Consensus 1 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~-~~~ 69 (552) |+. |. .|. .+ .+.-....++..+. ..+-..++.-.++..... ......++. |. ..+ T Consensus 1 m~~~~L~-~~~--~w-------~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~---~~~~~a~~~~g~~~~~~g~~e~ 67 (524) T protein:vir:10 1 MKFNVLS-LFA--PW-------AKMDERNFKDQEKEDLVSITAPKLDDGAREFEV---SSNEAASPYNAAFQTIFGSYEP 67 (524) T ss_pred CCCchhh-Hhh--cc-------ccCcchhhhhhhccCCccccCccCCCCceeeee---cccccccccceeeeehhccccc Confidence 433 10 000 00 00000011111111 112222222222111110 111111111 11 222 Q ss_pred cCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHhcC Q lcl|NC_020081. 70 SIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEKTG 148 (552) Q Consensus 70 ~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~~n 148 (552) ...+...+.+..|.++..+.+-.|+-.+..+++ .+....-+..+-+.+.+ .++.-+++|.. +..++..++ T Consensus 68 ~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai-------v~d~~~~pV~l~L~~~~--~s~~iK~kI~eeF~~Il~ll~ 138 (524) T protein:vir:10 68 GMKTTRELIDTYRNLMNNYEVDNAVSEIVSDAI-------VYEDDTEVVALNLDKSK--FSPKIKNMMLDEFNDVLNHLS 138 (524) T ss_pred ccchHHHHHHHHHHHhhccchhhHHHHhhccee-------EecCCCceEEEEecCcC--cchHHHHHHHHHHHHHHHHhc Confidence 334455566777888888888777766554443 12234445555554433 44444445533 444444433 Q ss_pred CCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECC---CCCEEEEEEecCceeEEEEC----CCcccccccceeEEEEE Q lcl|NC_020081. 149 RIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDK---LGDLHNFKAVDASTVYVAVD----EDGKERKAKDGVRYVQV 221 (552) Q Consensus 149 ~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~---~G~~~~L~~l~p~~v~v~~~----~~g~~~~~~~~~~y~~~ 221 (552) . ....++ +++.+.+.|..|+.++-|. ...+.+|.+|||..|+.++. .+++........-|+.+ T Consensus 139 F------~~~~~~----~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y 208 (524) T protein:vir:10 139 F------QRKGSD----HFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIY 208 (524) T ss_pred c------chhhhH----HHhhheeeeEEEEEEEeeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheee Confidence 2 123344 4567778899999998653 33589999999999976432 12221111111112221 Q ss_pred cCC-------------ceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Q lcl|NC_020081. 222 IDD-------------KVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLH 288 (552) Q Consensus 222 ~~~-------------~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 288 (552) ..+ +....++.+-|.|...+ ..+.++..=+|-|..|...+.....++....-|=-.-+.-+-|.. T Consensus 209 ~~~~~~y~~~g~~~~~~~~ikI~~dAI~y~hSG--L~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFY 286 (524) T protein:vir:10 209 DTAHESYACDGRMYEAGTKIKIPKAAIVYAHSG--LVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWY 286 (524) T ss_pred ccCccccccCccccCCCcceecchhheeeeecc--ceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEE Confidence 111 12233444444443222 223334455677888888888777777765544333444455555 Q ss_pred eCCCCCCCHHHHHHHHHHHHHHhcc----------ccccccce-eec---------cCCceeeecc--CchhHHHHHHHH Q lcl|NC_020081. 289 IKTGQEQSNQALTSFRREWTSMFSG----------INGAWKIP-VIT---------AEDVKFVNMT--QSSKDMEFEKWL 346 (552) Q Consensus 289 ~~~~~~~s~~~~~~~~~~~~~~~~G----------~~nagk~~-il~---------~~g~~~~~l~--~~~~d~q~~e~~ 346 (552) +..+......+-+=++.-+. .|.. ..+..+.. ++. +.|.++..|. .+.-+|+- . T Consensus 287 IDvGnlPk~KAeqYl~~im~-k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V 362 (524) T protein:vir:10 287 VDTGNMPARKAAEHMQHVMN-TMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMED---V 362 (524) T ss_pred EecCCCCchhHHHHHHHHHH-hcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHH---H Confidence 55443333222222222221 1110 00111100 110 0133444432 33444443 4 Q ss_pred HHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----C-----cccc Q lcl|NC_020081. 347 NYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI----V-----SQFG 417 (552) Q Consensus 347 ~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~ 417 (552) .+..+.+.++++||.+.|..-..+++....++ .-+.-|-. +...|.-+..++...|...| + ++.+ T Consensus 363 ~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~--EItRDEik----F~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~ee 436 (524) T protein:vir:10 363 RWFRQALYMALRVPLSRIPQDQQGGVMFDSGT--SITRDELT----FAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDE 436 (524) T ss_pred HHHHHHHHHHhCCchhhcCCCCCccccccccc--hhhHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHH Confidence 58889999999999999943222333221111 11112222 23344455555555544332 2 2222 Q ss_pred -----cceeecccccChHH----------HHHHHHHHHHHhcCCcCHHHHHHH-hCCCCCCCCCeeeccccccchhhhcc Q lcl|NC_020081. 418 -----GDYVFNFVGGDAKT----------EAEIISILESKAKIGLTINDIRKE-LGYPDTEGGDVTLAGVHVQRLGQIMQ 481 (552) Q Consensus 418 -----~~~~~~f~~~d~~~----------~~~~~~~~~~~~~g~lT~NE~R~~-~gl~p~~ggD~~~~~~n~~~~~~~~~ 481 (552) ..+.+.|.+...-+ |...++.+..++.-.++.+=+|+. |.+--.+ +...+.+.. T Consensus 437 w~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee----------i~~~~k~I~ 506 (524) T protein:vir:10 437 WNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEE----------IEQEAKQIE 506 (524) T ss_pred HHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHH----------HHHHHHHHH Confidence 34667776665332 222233333333334566666653 3332110 000011110 Q ss_pred ccccccccCCCCCccCcccCCCCCCCCCCCCCCCc Q lcl|NC_020081. 482 QEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSF 516 (552) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (552) . ..+.+--.++...++ +. T Consensus 507 ~----E~k~~~~~~~~~~~~-------------~f 524 (524) T protein:vir:10 507 E----ESKEARFQDPDQEQE-------------DF 524 (524) T ss_pred H----HhhcCCCCCCchhhh-------------cC Confidence 0 001111111111111 11 No 230 >protein:vir:4073 Length: 279 # NCBI annotation: minor structural protein # Family: family:all:11744 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043552;genbank:gi:9628686;genbank:GeneID:1261159 Probab=95.70 E-value=0.00039 Score=39.25 Aligned_cols=267 Identities=14% Similarity=0.099 Sum_probs=118.2 Q ss_pred ccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEE-- Q lcl|NC_020081. 113 DKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHN-- 190 (552) Q Consensus 113 ~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~-- 190 (552) -.-+.+.-|..|.. ....-.+.| |..-++.+++--.-.++|.- .....++.. T Consensus 1 ~~~~~~~~~~~~~~------------------~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~ 54 (279) T protein:vir:40 1 MSLFNLSRRAEDVS------------------FSTFTVQDP----TTDLLLGKLLGLVSYFDNVD----YSEASKLEDLF 54 (279) T ss_pred Ccccccchhhcccc------------------eeeeeecCc----chhHHHHHHHHHHHHhhccc----chhhhhhhhhh Confidence 00011111111100 000111111 11112222222222233321 111111111 Q ss_pred EEEecCceeEEEECCCcccccccc-----eeEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHH Q lcl|NC_020081. 191 FKAVDASTVYVAVDEDGKERKAKD-----GVRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQY 265 (552) Q Consensus 191 L~~l~p~~v~v~~~~~g~~~~~~~-----~~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~ 265 (552) .|.|....|.-+ +.|+..+... +.--++........+++-.|+ ++..|| .||.-+- .....+++ T Consensus 55 ~~~~~~~~~~~~--~~~~~~~~~~~~~~d~fn~~vr~~~~~~vtVP~~Dv-~IieNP-------lv~v~~e-e~~kM~~l 123 (279) T protein:vir:40 55 YWALQGKEVYRV--WYGGFKYYAQRVNADQFNIVVREPNRREVTIRTNDY-EMLLNP-------FYGANPQ-RFGVMFGM 123 (279) T ss_pred hhhhccceeehh--hhhhHHHHHhhcCcchhhhheecCCcceeEeecchh-hhhhcc-------hheeccc-hhhHHHHH Confidence 223333322211 2222111000 000011111222233333343 233333 3554432 22333333 Q ss_pred HHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHH Q lcl|NC_020081. 266 HDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKW 345 (552) Q Consensus 266 ~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~ 345 (552) + ......-+.+.+..+++|+.+.+. ..++..++.+..++++..++++-+++..+ +.|-+++++..+=.-.- .+- T Consensus 124 a---~nai~~KLD~~~qIk~fIKTd~d~-glee~kekaR~rIk~mlalAk~~nGityi-d~~ddItQL~kDYStsl-k~d 197 (279) T protein:vir:40 124 A---SNGIGRRLDSQAQIKIYWKTKVSS-GLKEVWDRIRERLTQQQQLAREFNGVSVI-GSDDDIKQIQPDYSGSL-QND 197 (279) T ss_pred H---HhhhhhhhcccceeeeEEecCcch-hHHHHHHHHHHHHHHHHHHHHhcCCeeee-cCCceeEeecccccccc-HHH Confidence 3 222333347778888999876542 24677788888898888888764555555 46889998875533222 344 Q ss_pred HHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCcccccceeeccc Q lcl|NC_020081. 346 LNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFV 425 (552) Q Consensus 346 ~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~ 425 (552) ..+.+++.++.||||..+|- .+..|.+..+|+..++.|+++++|-.|.. .+.+...|. T Consensus 198 ie~lkS~l~Sq~GinekIL~----------------GsAtE~q~iAyy~rtVePILkQyek~liY------~~E~fv~y~ 255 (279) T protein:vir:40 198 ANLAIEIALSEYGMPRELLY----------------GQSNEVTIIAFAIQKVLPLLKQHDKNIIF------NQENFVAYI 255 (279) T ss_pred HHHHHHHHHhhcCCchhhcc----------------ccCchhhhhhHHHhhHHHHHHHhcccccc------hhhhhhhhh Confidence 56888999999999998872 35568899999999999999998764432 111111111 Q ss_pred -----ccChHHHHHHHHHHHHHhcCCcCHHH Q lcl|NC_020081. 426 -----GGDAKTEAEIISILESKAKIGLTIND 451 (552) Q Consensus 426 -----~~d~~~~~~~~~~~~~~~~g~lT~NE 451 (552) ++...+...+... .-+. |+ T Consensus 256 ttta~gg~~~s~~~~~~~-~~~~------~~ 279 (279) T protein:vir:40 256 STTAKGGAIESKSSKRDS-EPVG------ND 279 (279) T ss_pred eecccCcccccccccccC-CCCC------CC Confidence 1111111111100 0000 11 No 231 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=95.63 E-value=0.0017 Score=35.73 Aligned_cols=423 Identities=13% Similarity=0.085 Sum_probs=165.7 Q ss_pred ccccccccchhhhhccccccc-cc----ccccccccccccc-ccCCc-----ccccccCCCCchHHHHHHHhhcchHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTK-SN----KPKAYEEPIIGSM-SMNPD-----FKEAPSIHGKQNLLQMLKLWSRKNIILN 92 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~-~~----~~~~~~~~~~~~~-~~~~~-----~~~~~~~~~~~~~~~~Lr~~a~~~~i~~ 92 (552) |. +-+.++..| |.+-+.. +. .-..+..-..+.. .|... |..++.. .... -+.+.. .+-+ T Consensus 1 ~~--~~~~~~~~i-~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~----~~~~-~~~~~~--~l~~ 70 (518) T protein:vir:78 1 MG--VWSVMTRFI-KGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYV----PTVH-DKLMNS--GTGN 70 (518) T ss_pred Cc--chhhHHHHH-HHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCC----Cccc-cccccC--ChHH Confidence 32 223334333 3322111 00 0000000000000 00000 1111110 0000 011110 1111 Q ss_pred HHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHh Q lcl|NC_020081. 93 AIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLT 172 (552) Q Consensus 93 a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll 172 (552) .|+. ..+....+-...|.....+.. . +......+.+++.. ..+..-+...+.+.+. T Consensus 71 ~i~~-----------~~A~ll~~e~~~i~v~~~~~~-d--~e~~~~~l~~il~~----------n~f~~~~~~~~e~a~a 126 (518) T protein:vir:78 71 EIVV-----------VAAEYISGKPLSIDVTGVNGS-K--DENLTKQLKEALRI----------DNFDSKSVKIVELAGG 126 (518) T ss_pred HHHH-----------HHHHhhcCCCceEEecCcccc-C--cHHHHHHHHHHHHh----------ccHHHHHHHHHHHhhc Confidence 1111 111111222222222221111 1 11112234444432 1344556667888889 Q ss_pred cCCeeEEEEECCCCCEEEEEEecCceeEEEECCCccccc---------ccceeEEE------------------------ Q lcl|NC_020081. 173 YDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERK---------AKDGVRYV------------------------ 219 (552) Q Consensus 173 ~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~---------~~~~~~y~------------------------ 219 (552) .|.+++.+..+ +|++ .+.++++..+.+... +|.... ......|. T Consensus 127 ~G~~~~k~~~d-~~~~-~i~~v~ad~~~P~~~-~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n 203 (518) T protein:vir:78 127 SGVSAVKINIL-NGRP-SISVHSSSQFWIDFK-NNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTY 203 (518) T ss_pred cCceEEEEEEE-CCee-EEEEEcCCeeEEEee-cCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEE Confidence 99999888776 4564 577788888877542 332110 00010111 Q ss_pred -EEcC-CceEEE-------------Ecccc------------e--eeecccccCCc--cCCcccccHHHHHHHHHHHHHH Q lcl|NC_020081. 220 -QVID-DKVVAK-------------FKAKE------------M--AWEVSNPRTDL--TVGKYGYPELEIALNHLQYHDN 268 (552) Q Consensus 220 -~~~~-~~~~~~-------------~~~~e------------v--i~~~~~~~~~~--~~g~~G~spl~~~~~~i~~~~~ 268 (552) .+.. .+.... ...++ . +.++.++..+. ...++|+|.+.-+...+..... T Consensus 204 ~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~ 283 (518) T protein:vir:78 204 SVIKIDGDKTTPISAERLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDY 283 (518) T ss_pred EEeeecCcccccccccccccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHH Confidence 1000 000000 00000 1 12222222121 2346899999999999998888 Q ss_pred HHHHHHHHHhccCCCceEEEeCCCC-----C-CCHHHHHHHHHHHHHHhccccccccceeeccCCc----eeeeccCchh Q lcl|NC_020081. 269 TEVFNARFFAQGGTTRGLLHIKTGQ-----E-QSNQALTSFRREWTSMFSGINGAWKIPVITAEDV----KFVNMTQSSK 338 (552) Q Consensus 269 ~~~~~~~~f~ng~~p~gil~~~~~~-----~-~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~----~~~~l~~~~~ 338 (552) .......-|+.| .+..++ +... . ......-.+... .+.|...+ ....+|. .++.++.... T Consensus 284 ~~s~~~~e~~~g-~~~i~v--~~~~l~~~~~~~~~~~~~~fd~~-~~~y~~i~------~~~~~~~~~~~~i~~~~~~Ir 353 (518) T protein:vir:78 284 FFTVYMREGEKT-KTKIAA--SERMFRKKVNKSTDKEEWSMNVD-EDYFMQFK------GTLDAGAKLNDMIQFMQGDFR 353 (518) T ss_pred HHHHHHHHHHhC-Cceeee--chhHhccCCCCCCCccccccCCC-CceEEEec------CcCCCCCccccceeeeecccC Confidence 887777778764 444333 2110 0 000000000000 01111100 0011122 2666777778 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCHHHhccccccccccccc---ccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcCc- Q lcl|NC_020081. 339 DMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSG---NTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIVS- 414 (552) Q Consensus 339 d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~---~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~- 414 (552) +.++.+..+...+.|....|++|..+|... +.-++... .+..+++. ...+..++.+|.-++..|...+...... T Consensus 354 ~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~-~~~TATei~s~~~~~~~t~-~~~~~~~e~al~~l~~~i~~l~~~~~~~~ 431 (518) T protein:vir:78 354 DGSYRETMEYFAQKAVSKSGYNPATFNLGN-REVKATEIWSLQDATVRKI-EKKKRLIQNVYEQMLWDFLYLLTGGTNNK 431 (518) T ss_pred hHHHHHHHHHHHHHHHHhhCCChhhcCccc-ccccHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCcc Confidence 889999999999999999999999887532 11111110 00111111 2223344555555555554444322111 Q ss_pred -----ccccceeecccccChHHHHHHHHHHH-HHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccc Q lcl|NC_020081. 415 -----QFGGDYVFNFVGGDAKTEAEIISILE-SKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQ 488 (552) Q Consensus 415 -----~~~~~~~~~f~~~d~~~~~~~~~~~~-~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~ 488 (552) .....+.|.|..+-+.++.+..+.+. .+.+|+|++.++-+++... ..+.+.. ..+..+...+... T Consensus 432 ~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~aGimS~e~~i~~~~~~-~~deea~------~e~~ri~~E~~~~-- 502 (518) T protein:vir:78 432 EKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNSALAMSVEEKVKLIHPK-WEDEEIQ------AEVKRIYLENAIG-- 502 (518) T ss_pred ccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHhCCC-CCHHHHH------HHHHHHHHHhccc-- Confidence 11123566676665556555555544 3457899998865554221 1110000 0011111110000 Q ss_pred cCCCCCccCcccCCCCCCCCCCCC Q lcl|NC_020081. 489 RQMDANQFLAQQTGYDGNMDNVNG 512 (552) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~~ 512 (552) ..+++....+ .+... | T Consensus 503 ~~~~p~~~~g----~~~~~----g 518 (518) T protein:vir:78 503 EVPDPEAIGG----METKG----G 518 (518) T ss_pred CCCCCccccC----CCCCC----C Confidence 0000100000 00000 0 No 232 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=95.37 E-value=0.0022 Score=35.16 Aligned_cols=439 Identities=11% Similarity=0.060 Sum_probs=167.7 Q ss_pred CCCCCCCcccccchhhcccccCcccccc-cccchhhhhccccccccccccccccccccccccCCcccc-cccCC---CCc Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRI-KQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKE-APSIH---GKQ 75 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~---~~~ 75 (552) ||+++ |.++-..+.. -. +..+. .++....-+. . -+..+ .+...+-.++. .+... ... T Consensus 1 m~~~~-~~k~~~~k~~-~~----~~~~~~~~i~~~~~i~-----~--~~~~~-----~~i~~~~~~y~g~~~~~~~~~~~ 62 (522) T protein:vir:47 1 MSLFQ-KVKDFFSRGR-YY----MQTSNLNSILEHPKIA-----V--TQEEY-----DRIKRNLVYYQSKWDDVQYKNTD 62 (522) T ss_pred CchHH-HHHHHHHHHH-HH----hhcccchhccccCCCC-----C--CHHHH-----HHHHHHHHHhcCCcccccccccC Confidence 77776 3332211110 00 00000 0000000000 0 00000 00000000000 00000 000 Q ss_pred hHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCc Q lcl|NC_020081. 76 NLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFT 155 (552) Q Consensus 76 ~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~ 155 (552) .... -++.. ..-+-..++. ..+....+-...+.. + +.+....+.+++.. T Consensus 63 ~~~~-~~~~~-slnl~~~i~~-----------~~A~lv~~e~~~i~v-------~--d~~~~~~l~~~l~~--------- 111 (522) T protein:vir:47 63 GDIK-SRPMN-HLPIARTASK-----------KIASLVYNEQATITT-------K--NEILQKFLDDMLTN--------- 111 (522) T ss_pred cchh-cccce-ecchHHHHHH-----------HHhhhhcCCcceeec-------C--ChHHHHHHHHHHhh--------- Confidence 0000 00000 0001111111 111111111111111 1 12222334444432 Q ss_pred cCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEEC-CCcccc----c------ccceeEEEE---- Q lcl|NC_020081. 156 RDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVD-EDGKER----K------AKDGVRYVQ---- 220 (552) Q Consensus 156 ~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~-~~g~~~----~------~~~~~~y~~---- 220 (552) ..+...++..+...+..|.+++.+.++. |. ..+-++++..+.++.. ..|... . .....+|.+ T Consensus 112 -n~f~~~~~~~~e~a~a~G~~a~k~~~d~-~~-~~i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~h 188 (522) T protein:vir:47 112 -DRFNKNFERYLESCLALGGLAMRPYIDG-DK-VRVAFIQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFH 188 (522) T ss_pred -cchHHHHHHHHHHhhccCCEEEEEEEcC-Cc-eEEEEEcCCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEe Confidence 1355667778888888999999888874 43 4566777777766432 222110 0 001111110 Q ss_pred -------------------------EcCC-----ceEEEEc--------ccc----------eeeecccccCC-ccCCcc Q lcl|NC_020081. 221 -------------------------VIDD-----KVVAKFK--------AKE----------MAWEVSNPRTD-LTVGKY 251 (552) Q Consensus 221 -------------------------~~~~-----~~~~~~~--------~~e----------vi~~~~~~~~~-~~~g~~ 251 (552) +... +....+. +.+ .+|++.+.... ....++ T Consensus 189 e~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~spl 268 (522) T protein:vir:47 189 EWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPL 268 (522) T ss_pred eecccccccccccccCCceEEEEEEeecCCCcccCccccccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCc Confidence 0000 0000000 011 11222221111 113478 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCC----ceEEEeCCCCCCCH-HHHHHHHHHHHHHhccccccccceeeccC Q lcl|NC_020081. 252 GYPELEIALNHLQYHDNTEVFNARFFAQGGTT----RGLLHIKTGQEQSN-QALTSFRREWTSMFSGINGAWKIPVITAE 326 (552) Q Consensus 252 G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p----~gil~~~~~~~~s~-~~~~~~~~~~~~~~~G~~nagk~~il~~~ 326 (552) |+|.+..+...+......-.....-|+-|... ..+|....+....+ .....+ ..-...|.+.+. -.++ T Consensus 269 G~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~f-d~~~~~f~~~~~------~~~~ 341 (522) T protein:vir:47 269 GLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQRQYQRPDGTIDFRPRF-DVEQNVYMQIGG------SSMD 341 (522) T ss_pred CCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHHhccCCCCCCccccccccc-CcccceEeecCC------CCCC Confidence 99999999999988887777777777766542 11122211100000 000000 000111222110 0122 Q ss_pred CceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhccccccccccccc---ccccchhHHHHHHHHHHHHhhHHHHH Q lcl|NC_020081. 327 DVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSG---NTLNEGSSAEKYRNSKDKGLEPLLKF 403 (552) Q Consensus 327 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~---~~~~~~n~e~~~~~~~~~~l~P~~~~ 403 (552) +-.++.++....+.++.+..+...+.|+...|+++..+|+...+.-++... .+..+.+ ....+..++.+|..++.. T Consensus 342 ~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s~~~~~~~t-~~~~~~~~~~al~~lv~~ 420 (522) T protein:vir:47 342 AGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVSENSDTYQM-RSSIVALVEQSIKELCVS 420 (522) T ss_pred CCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Confidence 334666777778889999999999999999999998888654322111100 0001111 122444556666666666 Q ss_pred HHHHHHhh-cCc-c--cccceeecccccChHHH-HHHHHHHHHHhcCCcCHHHHHHHh-CCCCCCCCCeeeccccccchh Q lcl|NC_020081. 404 IEDAVNKY-IVS-Q--FGGDYVFNFVGGDAKTE-AEIISILESKAKIGLTINDIRKEL-GYPDTEGGDVTLAGVHVQRLG 477 (552) Q Consensus 404 ie~~ln~~-L~~-~--~~~~~~~~f~~~d~~~~-~~~~~~~~~~~~g~lT~NE~R~~~-gl~p~~ggD~~~~~~n~~~~~ 477 (552) |.+..+.. ++. . ....+.+.|..+-+.++ ++.....+.+.+|+|++-+++..+ |+..-+ .. ..+. T Consensus 421 i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG~~s~e~~i~~~~g~~eee-a~--------~el~ 491 (522) T protein:vir:47 421 MCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAGFSTKKRAIGKTLNISGVE-AE--------KELN 491 (522) T ss_pred HHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCChHH-HH--------HHHH Confidence 65444321 211 1 12335566665433333 333344455667999999987653 442210 00 0011 Q ss_pred hhccccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcccc Q lcl|NC_020081. 478 QIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQ 528 (552) Q Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (552) .+.. +... +.+. ... -.+...++. ..+++++ T Consensus 492 ri~~----E~~~-~~~~--~~~----~~~~~~~~~---------~~~d~~~ 522 (522) T protein:vir:47 492 AINS----ELLP-MNDA--ELA----IYGMHDQNE---------EKADDKG 522 (522) T ss_pred HHHH----hhcc-CCCC--CCC----CCCCCCccc---------ccCCCCC Confidence 1111 0000 0000 000 000000000 0001111 No 233 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=95.26 E-value=0.0024 Score=34.93 Aligned_cols=445 Identities=12% Similarity=0.107 Sum_probs=184.4 Q ss_pred CCC--CCCCcccccchhhcccccCcccccccccchh-hhhccccccccccccccccccccccccCCc-------cc-ccc Q lcl|NC_020081. 1 MGL--LDGFFKGRKQQDNIIDINDDMAVRIKQIEED-AILKKGKNTKSNKPKAYEEPIIGSMSMNPD-------FK-EAP 69 (552) Q Consensus 1 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~-~~~ 69 (552) |+. |. .|. .+ .+.-....++..+. ..+-..++.-.++..... ......++. |. ..+ T Consensus 1 m~~~~L~-~~~--~w-------~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~---~~~~~a~~~~g~~~~~~g~~e~ 67 (524) T protein:vir:72 1 MKFNVLS-LFA--PW-------AKMDERNFKDQEKEDLVSITAPKLDDGAREFEV---SSNEAASPYNAAFQTIFGSYEP 67 (524) T ss_pred CCCchhh-Hhh--cc-------ccCcchhhhhhhccCCccccCccCCCCceeeee---cccccccccceeeeehhccccc Confidence 433 10 000 00 00000011111111 112222222222111110 111111111 11 222 Q ss_pred cCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeeccccccCChhHHHHHHH-HHHHHHhcC Q lcl|NC_020081. 70 SIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKE-IENFIEKTG 148 (552) Q Consensus 70 ~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~-l~~~l~~~n 148 (552) ...+...+.+..|.++..+.+-.|+-.+..+++ .+....-+..+-+.+.+ .++.-+++|.. +..++..++ T Consensus 68 ~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai-------v~d~~~~pV~l~L~~~~--~s~~iK~kI~eeF~~Il~ll~ 138 (524) T protein:vir:72 68 GMKTTRELIDTYRNLMNNYEVDNAVSEIVSDAI-------VYEDDTEVVALNLDKSK--FSPKIKNMMLDEFSDVLNHLS 138 (524) T ss_pred ccchHHHHHHHHHHHhhccchhhHHHHhhccee-------EecCCCceEEEEecCcC--cchHHHHHHHHHHHHHHHHhc Confidence 334455566777888888888777766554443 12234445555554433 44444445533 444444433 Q ss_pred CCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECC---CCCEEEEEEecCceeEEEEC----CCcccccccceeEEEEE Q lcl|NC_020081. 149 RIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDK---LGDLHNFKAVDASTVYVAVD----EDGKERKAKDGVRYVQV 221 (552) Q Consensus 149 ~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~---~G~~~~L~~l~p~~v~v~~~----~~g~~~~~~~~~~y~~~ 221 (552) . ....++ +++.+.+.|..|+.++-|. ...+.+|.+|||..|+.++. .+++........-|+.+ T Consensus 139 F------~~~~~~----~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y 208 (524) T protein:vir:72 139 F------QRKGSD----HFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIY 208 (524) T ss_pred c------chhhhH----HHhhheeeeEEEEEEEEeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheee Confidence 2 123344 4567778899999998653 33589999999999976432 12221111111112221 Q ss_pred cCC-------------ceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Q lcl|NC_020081. 222 IDD-------------KVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLH 288 (552) Q Consensus 222 ~~~-------------~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 288 (552) ..+ +....++.+-|.|...+ ..+.++..=+|-|..|...+.....++....-|=-.-+.-+-|.. T Consensus 209 ~~~~~~y~~~g~~~~~~~~ikI~~dAI~y~hSG--L~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFY 286 (524) T protein:vir:72 209 DTAHESYACDGRMYEAGTKIKIPKAAVVYAHSG--LVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWY 286 (524) T ss_pred ccCccccccCccccCCCcceecchhheeeeecc--ceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEE Confidence 111 12233444444443322 223334455677888888888777777765544333444455555 Q ss_pred eCCCCCCCHHHHHHHHHHHHHHhcc----------ccccccce-eec---------cCCceeeecc--CchhHHHHHHHH Q lcl|NC_020081. 289 IKTGQEQSNQALTSFRREWTSMFSG----------INGAWKIP-VIT---------AEDVKFVNMT--QSSKDMEFEKWL 346 (552) Q Consensus 289 ~~~~~~~s~~~~~~~~~~~~~~~~G----------~~nagk~~-il~---------~~g~~~~~l~--~~~~d~q~~e~~ 346 (552) +..+......+-+=++.-+. .|.. ..+..+.. ++. +.|.++..|. .+.-+|+- . T Consensus 287 IDvGnlPk~KAeqYl~~im~-k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D---V 362 (524) T protein:vir:72 287 VDTGNMPARKAAEHMQHVMN-TMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMED---I 362 (524) T ss_pred EecCCCCchhHHHHHHHHHH-hcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHH---H Confidence 55443333222222222221 1110 00111100 110 0133444432 33444443 4 Q ss_pred HHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc----C-----cccc Q lcl|NC_020081. 347 NYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI----V-----SQFG 417 (552) Q Consensus 347 ~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L----~-----~~~~ 417 (552) .+..+.+.++++||.+.|..-..+++....++ .-+.-|-. +...|.-+..++...|...| + ++.+ T Consensus 363 ~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~--EItRDEik----F~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~ee 436 (524) T protein:vir:72 363 RWFRQALYMALRVPLSRIPQDQQGGVMFDSGT--SITRDELT----FAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDE 436 (524) T ss_pred HHHHHHHHHHhCCchhhcCCCCCccccccccc--hhhHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHH Confidence 58889999999999999943222333221111 11112222 23344455555555544332 2 2222 Q ss_pred -----cceeecccccChHH----------HHHHHHHHHHHhcCCcCHHHHHHH-hCCCCCCCCCeeeccccccchhhhcc Q lcl|NC_020081. 418 -----GDYVFNFVGGDAKT----------EAEIISILESKAKIGLTINDIRKE-LGYPDTEGGDVTLAGVHVQRLGQIMQ 481 (552) Q Consensus 418 -----~~~~~~f~~~d~~~----------~~~~~~~~~~~~~g~lT~NE~R~~-~gl~p~~ggD~~~~~~n~~~~~~~~~ 481 (552) ..+.+.|.+...-+ |...++.+..++.-.++.+=+|+. |.+--.+ +...+.+.. T Consensus 437 w~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDee----------i~~~~k~I~ 506 (524) T protein:vir:72 437 WNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEE----------IEQEAKQIE 506 (524) T ss_pred HHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHH----------HHHHHHHHH Confidence 34667776665322 222233333333334566666653 3332110 000011111 Q ss_pred ccccccccCCCCCccCcccCCCCCCCCCCCCCCCc Q lcl|NC_020081. 482 QEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSF 516 (552) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 516 (552) . ..+.+--.++....+ +. T Consensus 507 ~----E~k~~~~~~~~~~~~-------------~f 524 (524) T protein:vir:72 507 E----ESKEARFQDPDQEQE-------------DF 524 (524) T ss_pred H----HhhcCCCCCCchhhh-------------cC Confidence 0 001111111111110 01 No 234 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=95.25 E-value=0.0025 Score=34.89 Aligned_cols=384 Identities=12% Similarity=0.035 Sum_probs=149.0 Q ss_pred cCCccccccc--CCCCchHHHHHHHhhcc-hHHHHH-HH--------HHHH-HHH-HHHHHHHHhhcccc--ceeeeecc Q lcl|NC_020081. 61 MNPDFKEAPS--IHGKQNLLQMLKLWSRK-NIILNA-II--------ITRV-NQV-SMFCTPARNSDKGV--GYEIRLKD 124 (552) Q Consensus 61 ~~~~~~~~~~--~~~~~~~~~~Lr~~a~~-~~i~~a-~i--------~~~~-~~~-~~~~~~~~~~~~~~--~~~i~~k~ 124 (552) -+...-.+.. ......-...++++-++ ..|+.. .. .... ..+ .-+++.+....++. |-.+.+.. T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~~ 80 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLFDI 80 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheecccceeec Confidence 0000000000 00000001111111111 111100 00 0000 000 11222222222221 11111111 Q ss_pred ccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCC--------CCEEEEEEecC Q lcl|NC_020081. 125 PLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKL--------GDLHNFKAVDA 196 (552) Q Consensus 125 ~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~--------G~~~~L~~l~p 196 (552) .+ +......+..++. | .+......+..+.+.+|.||..+.++.. |. ..+..++| T Consensus 81 ~~------~~~~~~~~~~~~~--------n---~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~-~~~~~i~p 142 (451) T protein:vir:10 81 DN------NKELNEKVTDVLG--------N---EFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQT-FKYGVVNT 142 (451) T ss_pred CC------cHHHHHHHHHHhc--------c---CHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccc-eeEEEEcc Confidence 11 0111122233221 1 3445667788999999999999888764 32 44777899 Q ss_pred ceeEEEECCCcccccccceeEEEEEcCC-c---------eEEEEcccceeeecc----------------ccc-----CC Q lcl|NC_020081. 197 STVYVAVDEDGKERKAKDGVRYVQVIDD-K---------VVAKFKAKEMAWEVS----------------NPR-----TD 245 (552) Q Consensus 197 ~~v~v~~~~~g~~~~~~~~~~y~~~~~~-~---------~~~~~~~~evi~~~~----------------~~~-----~~ 245 (552) ..+.++.++.-.. .....++|+....+ . ....++.+.+.+.+. |+- .. T Consensus 143 ~~~~~vydd~~~~-~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 221 (451) T protein:vir:10 143 EEIIPIYRNGIER-ELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVE 221 (451) T ss_pred cceEEEEcCCCCC-ceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEE Confidence 8888876543110 01112222221111 0 011222222222110 000 00 Q ss_pred ccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC-CCCHHHHHHHHHHHHHHhccccccccceeec Q lcl|NC_020081. 246 LTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQ-EQSNQALTSFRREWTSMFSGINGAWKIPVIT 324 (552) Q Consensus 246 ~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~~s~~~~~~~~~~~~~~~~G~~nagk~~il~ 324 (552) -.....|.|-++.+...++....+..-..+.+...+.|-.++ ++.. ..+.+. ...+.. . ++.++. T Consensus 222 ~~nn~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~--~g~~~~~~~~~----~~~~~~-------~-~~i~~~ 287 (451) T protein:vir:10 222 FSNNIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYIL--ENFGGEDTSEF----LKELKR-------Y-KTIKTE 287 (451) T ss_pred eccCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeee--ecCCcccchhh----HHHHhh-------C-CeEEec Confidence 001134667777777766666665555555555555564444 3321 112222 222211 1 111221 Q ss_pred ------cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchh--H---HHHHHHHH Q lcl|NC_020081. 325 ------AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGS--S---AEKYRNSK 393 (552) Q Consensus 325 ------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n--~---e~~~~~~~ 393 (552) +++++|. +.......+....+.+.+.|...-++|.. .. ..+++.++....+.. . -...+..+ T Consensus 288 ~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~I~~~s~~p~~--~~---~~~gn~Sg~Alk~~~~~l~~k~~~k~~~f 360 (451) T protein:vir:10 288 TDSEGDSGGLKTM--QIEIPTEARKIILEILKKQIYESGQGLQQ--DT---ENFGNASGVALKFFYRKLELKSGLLETEF 360 (451) T ss_pred CcCCccCCcceEE--eecCCHHHHHHHHHHHHHHHHHHhCcccc--cc---cccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 1235554 33334555677888999999999999852 21 112222221111110 0 11122223 Q ss_pred HHHhhHHHHHHHHHHHhhcCcccccceeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccc Q lcl|NC_020081. 394 DKGLEPLLKFIEDAVNKYIVSQFGGDYVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHV 473 (552) Q Consensus 394 ~~~l~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~ 473 (552) ..+|+-+++.|...++. ..-..+.+.|.+..+.+..+.++++... .|+++.--+.++++.-.-+ +. T Consensus 361 ~~~l~~~~~li~~~~~~----~d~~~i~i~f~~~~p~n~~e~~~~~~kl-~g~iS~et~~~~~p~v~d~--~~------- 426 (451) T protein:vir:10 361 RTSFDKLIKAILYFLGV----TDYKKIQQTYTRNMMSNDLEDADIATKS-VGIIPTKIILRHHPWVDDV--EE------- 426 (451) T ss_pred HHHHHHHHHHHHHHhCC----CCccceeEEecCCCCCCHHHHHHHHHHH-hccCchHHHHHhCCCCCCH--HH------- Confidence 33444444444333221 1123466778887777777777776655 4888887777777542211 00 Q ss_pred cchhhhccccccccccCCCCCccCcc Q lcl|NC_020081. 474 QRLGQIMQQEQVEYQRQMDANQFLAQ 499 (552) Q Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (552) .+......+........+.-+...+ T Consensus 427 -e~~~~~ee~~~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 427 -AEKLYLEEKKIQASKVSDDYNNFTE 451 (451) T ss_pred -HHHHHHHHHHHHHHHHHhhcCCCCC Confidence 0001101000000000000000000 No 235 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=94.08 E-value=0.0055 Score=32.99 Aligned_cols=432 Identities=10% Similarity=0.065 Sum_probs=162.9 Q ss_pred hccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHh---hcc--hHHHHHHH--------HHHHHHHH Q lcl|NC_020081. 37 LKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLW---SRK--NIILNAII--------ITRVNQVS 103 (552) Q Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~---a~~--~~i~~a~i--------~~~~~~~~ 103 (552) |+-+.+-|....+....-+.. +...- ...+.+..++...+.+..| -++ +++...-+ ....+-.. T Consensus 1 m~~~~~ik~~~~~~~~~~~~~--~~~~~-~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~ 77 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQ--TLKSI-NDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRK 77 (517) T ss_pred CchHHHHHHHHHHHHHHhccc--chhHh-hcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHH Confidence 332222222211110000000 00000 0011112121111111111 000 00100000 00000000 Q ss_pred HHHHHHHhhccccceeeeeccccccCCh--hHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEE Q lcl|NC_020081. 104 MFCTPARNSDKGVGYEIRLKDPLQEPND--HNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELV 181 (552) Q Consensus 104 ~~~~~~~~~~~~~~~~i~~k~~~~~~~~--~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~ 181 (552) ..|+..+.....-...|...+.+..... ........+.+++.. ..+...++..+.+.+..|.+++-+. T Consensus 78 ~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~----------n~f~~~~~~~~e~a~a~G~~a~k~~ 147 (517) T protein:vir:98 78 LSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQH----------NKFIKNLSDYLEPTFALGGLTVRPY 147 (517) T ss_pred HHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHh----------ccHHHHHHHHHHHHhhhCCEEEEEE Confidence 0111111111111112222222111110 111112223333321 1345566667888888999999988 Q ss_pred ECCCCCEEEEEEecCceeEEEECCCcccccc-----------cceeEEEEEc----C------CceE------------- Q lcl|NC_020081. 182 YDKLGDLHNFKAVDASTVYVAVDEDGKERKA-----------KDGVRYVQVI----D------DKVV------------- 227 (552) Q Consensus 182 r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~-----------~~~~~y~~~~----~------~~~~------------- 227 (552) +|. |. +.+.++++..+.+...+.+++... ....+|.+.- . +..+ T Consensus 148 ~d~-~~-~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~ 225 (517) T protein:vir:98 148 VDN-GE-IEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGE 225 (517) T ss_pred EeC-Ce-eEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCcc Confidence 874 33 347778888877643322221100 0111121100 0 0000 Q ss_pred --EEEc--------cccee----------eecccccCCcc--CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_020081. 228 --AKFK--------AKEMA----------WEVSNPRTDLT--VGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRG 285 (552) Q Consensus 228 --~~~~--------~~evi----------~~~~~~~~~~~--~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~g 285 (552) ..++ +.++. |++ ++..+.. ..++|+|.+..+...+......-.....-|.-|.+ + T Consensus 226 lG~~v~L~~~~e~l~~~~~~~g~~~Plf~y~~-~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~-~- 302 (517) T protein:vir:98 226 IGKRIPLEELYEGMQEKTYIQGLSRPLFNYLK-PSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQR-T- 302 (517) T ss_pred ccccccccccccCCCcceeECCCCcceEEEec-CCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCc-c- Confidence 0000 11111 221 2222211 34789999999999998888777777777776654 2 Q ss_pred EEEeCCCCCC-CHHHH-HHHHHHHH---HHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_020081. 286 LLHIKTGQEQ-SNQAL-TSFRREWT---SMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSID 360 (552) Q Consensus 286 il~~~~~~~~-s~~~~-~~~~~~~~---~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP 360 (552) |.++...-- +.+.- ......|+ ..|.+.. ...++-.++.++....+.++.+..+...+.|+...|++ T Consensus 303 -i~vp~~~l~~~~~~~g~~~~~~~d~~~~~y~~~~-------~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls 374 (517) T protein:vir:98 303 -VFVSDVMLRTVPDESGMPPPQVFDPDVNVYKSIR-------MGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLS 374 (517) T ss_pred -eecChhhhccccCCCCcccCCCCCcccceeeecc-------CCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCC Confidence 323221100 00000 00000000 1111110 01122346667777788899999999999999999999 Q ss_pred HHHhccccccccccccc---ccccchhHHHHHHHHHHHHhhHHHHHHHHHHHh-hcCcc---cccceeecccccChHHHH Q lcl|NC_020081. 361 PSEINFPNRGGATGHSG---NTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNK-YIVSQ---FGGDYVFNFVGGDAKTEA 433 (552) Q Consensus 361 p~~lg~~~~~t~~~~~~---~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~-~L~~~---~~~~~~~~f~~~d~~~~~ 433 (552) |..+|+...+.-++... .+..+.+.. ..+..++.+|.-++..+...... .++.. ....+.+.|..+-+.++. T Consensus 375 ~~t~~~~~~~~kTATEi~s~~~~~~~t~~-~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~ 453 (517) T protein:vir:98 375 VGTFSFDGRSMKTATEIVSENDLTYRTRN-DHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRS 453 (517) T ss_pred cccccccccccccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHH Confidence 99999765433211110 011111111 12223344444444443322221 12221 123356667655444433 Q ss_pred HHH-HHHHHHhcCCcCHHHHHHHh-CCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCCCCCCCCCCC Q lcl|NC_020081. 434 EII-SILESKAKIGLTINDIRKEL-GYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNVN 511 (552) Q Consensus 434 ~~~-~~~~~~~~g~lT~NE~R~~~-gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (552) +.. ...+.+.+|+|++-+++.++ |+..-+ .+. .+..+. .+ ....+ +..... ...+... T Consensus 454 ~~~~~~~~~v~aG~ms~~~~i~~~~g~~eee-A~~--------e~~~i~----~E-~~~~~---~~~~~~---~~~~~~~ 513 (517) T protein:vir:98 454 ALLRFYGQAKTFGFIPTVEAIQRIFKVPKKT-AEQ--------WLEEIR----KD-QIELD---PVTISQ---RAQKRMF 513 (517) T ss_pred HHHHHHHHHHhcCCCCHHHHHHHhCCCChHH-HHH--------HHHHHH----Hh-ccccC---CCCccc---cccCCCC Confidence 333 33455667999999986654 653211 000 000000 00 00000 000000 0000001 Q ss_pred CCCC Q lcl|NC_020081. 512 GKDS 515 (552) Q Consensus 512 ~~~~ 515 (552) |+++ T Consensus 514 gd~e 517 (517) T protein:vir:98 514 GDEE 517 (517) T ss_pred CCCC Confidence 1111 No 236 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=93.07 E-value=0.0089 Score=31.83 Aligned_cols=464 Identities=14% Similarity=0.118 Sum_probs=170.3 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhh---hcccccccccc---ccccccccccccccCCcccccccCCCC Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAI---LKKGKNTKSNK---PKAYEEPIIGSMSMNPDFKEAPSIHGK 74 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (552) |- |-.|.+-...-+ ...-++.+.. +..+.++-++- ..+....+.+.+-.|. ... T Consensus 1 ~~----~~~~~~~~~~t~--------~k~~~~~e~~~~~~n~~~~~y~ty~~~~~~f~~gfv~~~~~ng--------~i~ 60 (525) T protein:vir:10 1 MT----RTKGSKNKSTTI--------EKQSLQIEQLQEHINELERQYNTYDDVVDAFIDGFVMDLCNNG--------KIK 60 (525) T ss_pred CC----CCcCCcccccch--------hhhhhhHHHHHHHHhhhhhhcchhhhHHHHHHHHHHHHhhcCC--------cee Confidence 21 222221111111 1011111111 11122222211 1111122222222221 123 Q ss_pred chHHHHHHHhhcchHHHH-HHHHHH------HHHHHHHHHHHHh--hccccceeeeeccccccCChhHHHHHHHHHHHHH Q lcl|NC_020081. 75 QNLLQMLKLWSRKNIILN-AIIITR------VNQVSMFCTPARN--SDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIE 145 (552) Q Consensus 75 ~~~~~~Lr~~a~~~~i~~-a~i~~~------~~~~~~~~~~~~~--~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~ 145 (552) +...++|+.|..+++-.. .|+... ...+. +++.- +..-+.+.|..-..++. .+ +-+.-+...|+ T Consensus 61 ~v~~~~l~~~f~npd~~~~~i~~l~~y~yi~~~~v~---ql~~li~~lp~l~y~i~~~~~~k~-~~---~~~s~~n~~l~ 133 (525) T protein:vir:10 61 TVNLDTLQLWFNNPDKYINNIVNLLTYYYIIDGNVF---QLYDLIFSLPPLDYQIKVLKRDKD-YK---EDLSTINLYLE 133 (525) T ss_pred eeeHHHHHhhhcChHHHHHHHHHHHHHhhhhcchHH---HHHHHHHhcCCcceeehhhhhccc-hh---hHHHHHHHHHH Confidence 334678888887766332 222111 00111 11111 11122222221111111 11 11111222222 Q ss_pred hcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCC-C-----EEEEEEecC------ceeEEEEC--------- Q lcl|NC_020081. 146 KTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLG-D-----LHNFKAVDA------STVYVAVD--------- 204 (552) Q Consensus 146 ~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G-~-----~~~L~~l~p------~~v~v~~~--------- 204 (552) .-- -.-++-+-++.++...|.-. -+|=++- . ..+|.++-| ..|.|+.- T Consensus 134 k~i---------~hk~ltrdll~q~a~~gtli--g~wlg~~~~py~~vf~~~kyvfp~~r~~g~~v~vid~~~f~~~~~~ 202 (525) T protein:vir:10 134 KKI---------QHKQLTRDLLVQLAHSGTLI--GTWLGSKREPYFNVFNNLKYVFPYGRAKGKMVAVIDLQWFDEMSEL 202 (525) T ss_pred HhH---------HHHHHHHHHHHHhhccCcee--EeeecCCCCcchhhhhhhhhhccccccCCceEEEEehHHhhhhhHH Confidence 100 00011111122222222211 1110000 0 011111111 11222110 Q ss_pred CCccccccccee----E---EEEEc----CCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 205 EDGKERKAKDGV----R---YVQVI----DDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFN 273 (552) Q Consensus 205 ~~g~~~~~~~~~----~---y~~~~----~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~ 273 (552) +....+....++ . |..+. +.-....++.+.++|.+.|-... +...|.|....+...|.........- T Consensus 203 ~r~~~~~~lsp~i~~~~y~~~~~~~~~~~~~~r~i~LP~e~t~~lr~~tl~r--nqrlG~s~vtp~l~dI~hk~klrd~E 280 (525) T protein:vir:10 203 ERKLTFENLSPLITENKYKKWKEYNGENEDALRYIMLPISKTLVARIHTLSR--NQRLGIPYGTQTLFDIQHKQKLRDLE 280 (525) T ss_pred HHHHHHHhhchhhhhhhhhHHhhcccccchhheeeecccceeEEeeeccccc--CcccCcchhhhHHHHHHHHHHHHHHH Confidence 000000000000 0 11011 11134567788888888764322 33457777777776666665555555 Q ss_pred HHHHhccCCCceEEEeCCCCC----CCHHHHHHHHHHHHHHhc-cccc-cccceeeccCC--ceeeeccC--chhHHHHH Q lcl|NC_020081. 274 ARFFAQGGTTRGLLHIKTGQE----QSNQALTSFRREWTSMFS-GING-AWKIPVITAED--VKFVNMTQ--SSKDMEFE 343 (552) Q Consensus 274 ~~~f~ng~~p~gil~~~~~~~----~s~~~~~~~~~~~~~~~~-G~~n-agk~~il~~~g--~~~~~l~~--~~~d~q~~ 343 (552) .+....=..|-.+|++.+... +-+.+..++-+..+.+.. |.+. +|-+.|-.+.= ++|-.+.. .--|-+ T Consensus 281 qsIA~kii~a~avLk~gg~~gn~mk~p~~~kqkil~gVk~aleK~~kdK~Gi~vi~~Pdfa~~efp~ik~~~~glDg~-- 358 (525) T protein:vir:10 281 QSIADKIIKAMAVLKFRGKDDNDSKVKESAKRKVLAGVKRALEKGVKDKNGIACIAMPDFATFEFPEIKNGDKTLDPK-- 358 (525) T ss_pred HHHHHHhhhhheeeeeccccCccccCchHHHHHHHHHHHHHHhcccccccCeEEEeccceeecccccccCcccCCCch-- Confidence 555555556777888854322 223233333333333332 2222 23322222322 22222211 111111 Q ss_pred HHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc---Cc-ccccc Q lcl|NC_020081. 344 KWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI---VS-QFGGD 419 (552) Q Consensus 344 e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L---~~-~~~~~ 419 (552) --+...++|..|+|++..+++ +...||+++.-....||.. |.-+++.||+..+..| ++ +.+.. T Consensus 359 -K~d~I~~DI~~A~GlS~sL~n-----------GdggNyAtaslnld~fykk-igVm~e~Iee~y~kL~d~Vl~~~k~~n 425 (525) T protein:vir:10 359 -KYDSIDNDITNATGISQVLTN-----------GTKGNYASAKLNLDVFYKK-IGVMLEIIEEIYNQLIDIILGEEKGCN 425 (525) T ss_pred -hhhhhhhhhhhhhccceeeec-----------CCCCceeeeeeeHHHHHHH-HHHHHHHHHHHHHHHHhhhcCcccCcc Confidence 223567899999999999884 2345778777777777764 6778888886655432 33 33456 Q ss_pred eeecccccChHHHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcc Q lcl|NC_020081. 420 YVFNFVGGDAKTEAEIISILESKAKIGLTINDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQ 499 (552) Q Consensus 420 ~~~~f~~~d~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (552) |.|.++..++.++.++..++.....-.++.--+....|+.-- .++-..++..-.........+ .... T Consensus 426 yifnydkd~pi~~kkk~d~LIkL~d~g~s~k~vldl~gis~e----~y~E~s~yEtE~lkl~EKi~p----p~~~----- 492 (525) T protein:vir:10 426 YIFQYNKDTPIEREKKLDTLIKLEAQGYSAKYVLDILGISSE----EYFEESIYEIEKLKLREKIMP----PLNT----- 492 (525) T ss_pred eEEecCCCchhhhhhhhhhhhhhhccchhhhhhhhhhccCcc----hHHHHHHHHHHHHHHhhhccc----cccc----- Confidence 889999988888888777665444323333333334443211 111111111000000000000 0000 Q ss_pred cCCCCCCCCCCCCCCCcccccCCCCccccccccccccccCc Q lcl|NC_020081. 500 QTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQGGK 540 (552) Q Consensus 500 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 540 (552) ..-.|++...-......+++..++|-.+++.+- T Consensus 493 --------~v~SGk~~n~iG~P~~dd~~~~dati~s~~~~~ 525 (525) T protein:vir:10 493 --------NVLSGKDGNDIGSPKLDDSDSSDATIESKERGV 525 (525) T ss_pred --------eeeeccccccccCCccCCCcchhhhhhhhhcCC Confidence 001111111111111112222333333333222 No 237 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=81.40 E-value=0.086 Score=26.42 Aligned_cols=433 Identities=10% Similarity=0.056 Sum_probs=158.0 Q ss_pred chhhhhccccccccccccccccccccccccCCcc-----cccccCCCCchHHHHHH-HhhcchHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 32 EEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDF-----KEAPSIHGKQNLLQMLK-LWSRKNIILNAIIITRVNQVSMF 105 (552) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~Lr-~~a~~~~i~~a~i~~~~~~~~~~ 105 (552) +.++.--...+.+.+ +..--..+-.|-+.. +..|+..+.......-+ .-...+....++ ++.+..++.. T Consensus 1 ~~~~~~~~~~~~~~r----~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~-~~Las~l~~~ 75 (522) T protein:vir:94 1 MAEREGFAAEGAKAV----YDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAVGARCL-NNLAAKLMLA 75 (522) T ss_pred CcccchhhHHHHHHH----HHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHH-HHHHHHHHhh Confidence 222111111111111 111001111222111 12233222221111000 000011111222 2222222211 Q ss_pred HHHHHhhccccceeeeecccc---------ccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCe Q lcl|NC_020081. 106 CTPARNSDKGVGYEIRLKDPL---------QEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKI 176 (552) Q Consensus 106 ~~~~~~~~~~~~~~i~~k~~~---------~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna 176 (552) .. .. +-++++.-.+ .......++....+++.++.-. ..-+++.-+..++.|+.++||+ T Consensus 76 lt------P~-~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~------~~snf~~~~~~~~~~L~~~G~a 142 (522) T protein:vir:94 76 LF------PQ-SPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYM------ETNSFRVPLFEALKQLIVSGNC 142 (522) T ss_pred cC------CC-CcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHH------HhcCcHHHHHHHHHHHHhhCcE Confidence 11 11 1234432211 1111111222233444443221 2345666777788899999999 Q ss_pred eEEEEECCCCCEEEEEEecCceeEEEECCCcccccccce----------------------------eEEEEEcCCceEE Q lcl|NC_020081. 177 NFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKDG----------------------------VRYVQVIDDKVVA 228 (552) Q Consensus 177 ~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~----------------------------~~y~~~~~~~~~~ 228 (552) +.++..+..|.+..+..+|-..+.+..|..|++-..... ++........... T Consensus 143 ~l~~~~~~~~~~~~~~~~pl~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~ 222 (522) T protein:vir:94 143 LLYIPEPEQGTYSPMRMYRLVSYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDDEYL 222 (522) T ss_pred eEeeeccCCCceeeEEEEEcceEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEeeCCcee Confidence 999988877776655555556677777777754111000 0000011111000 Q ss_pred EE-cccceeee------------cccccCCccCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC Q lcl|NC_020081. 229 KF-KAKEMAWE------------VSNPRTDLTVG-KYGYPELEIALNHLQYHDNTEVFNARFFAQGGTTRGLLHIKTGQE 294 (552) Q Consensus 229 ~~-~~~evi~~------------~~~~~~~~~~g-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~ 294 (552) .+ ..++..+. ...+++...+| .||.||.+-++..+.......+.......-...|..++. .+.. T Consensus 223 ~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~--~~g~ 300 (522) T protein:vir:94 223 RYEEVEGIEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVN--PNGI 300 (522) T ss_pred EEeeccCceecccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec--cccc Confidence 00 00000000 00123333333 799999999999999999999988888888888885553 2222 Q ss_pred CCHHHHHHHHHHHHHHhccccccccceeec--cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc Q lcl|NC_020081. 295 QSNQALTSFRREWTSMFSGINGAWKIPVIT--AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGA 372 (552) Q Consensus 295 ~s~~~~~~~~~~~~~~~~G~~nagk~~il~--~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~ 372 (552) ..... + ..|..+. ++. ++++...++...++-.-..+..+..+..|..+|-+.. ++..+.. T Consensus 301 ~~~~~-------~---~~~~~g~----~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~~-- 362 (522) T protein:vir:94 301 TQPRR-------L---NKAATGE----FVAGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNS--AVQRNAE-- 362 (522) T ss_pred ccchh-------e---eccCCce----eecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh--hccCCCc-- Confidence 22211 1 1222221 232 2345555655433322234666788889999996652 2211111 Q ss_pred cccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHh-------------hcCcccc-cceeecccccCh-HHHHHHHH Q lcl|NC_020081. 373 TGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNK-------------YIVSQFG-GDYVFNFVGGDA-KTEAEIIS 437 (552) Q Consensus 373 ~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~-------------~L~~~~~-~~~~~~f~~~d~-~~~~~~~~ 437 (552) ..+ +.--..+..-....|.|.+.+++.+|=. .++++.- ..+.++|..+-. ..+....+ T Consensus 363 ------r~T-AtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~v~~~s~La~~qr~~~~~ 435 (522) T protein:vir:94 363 ------RVT-AEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTGLEALGRGQDLE 435 (522) T ss_pred ------ccc-HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcccEEeeEecHHHHHHHHHHHH Confidence 111 1111222223345566666666555532 2233322 234555543211 11111111 Q ss_pred ----HHHHHhc---C----CcCH----HHHHHHhCCCCCCCCCeeeccccccchhhhccccccccccCCCCCccCcccCC Q lcl|NC_020081. 438 ----ILESKAK---I----GLTI----NDIRKELGYPDTEGGDVTLAGVHVQRLGQIMQQEQVEYQRQMDANQFLAQQTG 502 (552) Q Consensus 438 ----~~~~~~~---g----~lT~----NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (552) .+..+.. - -+.. +++.+.+|.+|. .++..+- .+.+..+++......+...+ .... + T Consensus 436 ~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~---~ivr~~e---e~~~~~~q~~~~~~~~~~~~-~~~~--~ 506 (522) T protein:vir:94 436 KLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTA---GLLLTQD---EKIQRMAEQSSQQAVVQGAS-AAGA--N 506 (522) T ss_pred HHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChh---hccCCHH---HHHHHHHHHHHHHHHHHHHH-HHHH--H Confidence 1111100 0 0112 222334455331 0111000 01111111100000000000 0000 0 Q ss_pred CCCCCCCCCCCCCccc Q lcl|NC_020081. 503 YDGNMDNVNGKDSFNQ 518 (552) Q Consensus 503 ~~~~~~~~~~~~~~~~ 518 (552) ..-......+++-... T Consensus 507 ~~a~~~~~~~~~~~~~ 522 (522) T protein:vir:94 507 MGAAVGQGAGEDMAQA 522 (522) T ss_pred hhhhhhcccchhhhcC Confidence 0000000000110000 No 238 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=52.09 E-value=0.56 Score=21.95 Aligned_cols=419 Identities=11% Similarity=0.035 Sum_probs=168.7 Q ss_pred ccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHH----HH-HHHHHHHH---HHHHHHHHhhc Q lcl|NC_020081. 42 NTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILN----AI-IITRVNQV---SMFCTPARNSD 113 (552) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~----a~-i~~~~~~~---~~~~~~~~~~~ 113 (552) -.-+++|-..+.||-+.-...|.. -++.--.-|+.+-.--.+.+ .. +..+.... ..+..++...+ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~-------v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~ 73 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNA-------VTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKL 73 (527) T ss_pred CCccccccCCCcCcCCccccCccc-------CCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHh Confidence 123344555555553222222211 11111112222111000000 00 00000000 00000000000 Q ss_pred cccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECC---CCCEEE Q lcl|NC_020081. 114 KGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDK---LGDLHN 190 (552) Q Consensus 114 ~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~---~G~~~~ 190 (552) -+..-.+.....+...+....+-...+..|.++. .+.......-++.++.|.+.+.+++|. .|.=+. T Consensus 74 ~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e----------~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~ 143 (527) T protein:vir:10 74 IEAKMRFLGQGLKWEFSKKDAKVDDAIKVLFDRE----------NWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLS 143 (527) T ss_pred hCCcceeeccCccccccchhHHHHHHHHHHHHHh----------hhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCce Confidence 0111111112222222223333334455555543 344566677888999999999999984 334467 Q ss_pred EEEecCceeEEEECCCcccccccceeEEE------------------------E------EcCCceEEEE---------- Q lcl|NC_020081. 191 FKAVDASTVYVAVDEDGKERKAKDGVRYV------------------------Q------VIDDKVVAKF---------- 230 (552) Q Consensus 191 L~~l~p~~v~v~~~~~g~~~~~~~~~~y~------------------------~------~~~~~~~~~~---------- 230 (552) +..+||..+.++.++++..+.. +++.+ . ...|..-++. T Consensus 144 v~~~DP~~~f~~ed~d~~~~v~--~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d 221 (527) T protein:vir:10 144 LHEVDPSTYFPYEDPRYPGQVL--GVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDD 221 (527) T ss_pred EeecCcceeeeeecCCCCCcee--eEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeecccccc Confidence 8889988888877765533211 11000 0 0000000000 Q ss_pred --------------cccc-------------eeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCC Q lcl|NC_020081. 231 --------------KAKE-------------MAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTT 283 (552) Q Consensus 231 --------------~~~e-------------vi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p 283 (552) ...+ |+|+.-.+ .....+|.|-|+-+...+.....+..-......-++.| T Consensus 222 ~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p---~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~P 298 (527) T protein:vir:10 222 RPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHP---IMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLG 298 (527) T ss_pred ccccccchhhhhhhcCceeeecccCCCCccceEeecCCC---ccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCc Confidence 0000 23442222 23447899988876666665554444444444446677 Q ss_pred ceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHH Q lcl|NC_020081. 284 RGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSE 363 (552) Q Consensus 284 ~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~ 363 (552) -.++. + ...-+ .+-+.....-| ....+=++++.++..+...+.=..|...+..+.+.|+..-++|.+- T Consensus 299 i~~~t--g-~~~vd-----~~G~~~~~~Vg----PG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA 366 (527) T protein:vir:10 299 FYATD--S-APPRD-----SRGNMVPWTIS----PLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIA 366 (527) T ss_pred eeeec--c-ccccc-----ccCCcCccccC----CceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeee Confidence 55553 2 21111 01111110111 1112225677888888876666668888999999999999999999 Q ss_pred hcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHH-------HH--------hhc-------Cccccc--c Q lcl|NC_020081. 364 INFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDA-------VN--------KYI-------VSQFGG--D 419 (552) Q Consensus 364 lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~-------ln--------~~L-------~~~~~~--~ 419 (552) +|..+.+.. .+.. .++-.|.|++.+-+.. +. ++| +...+. . T Consensus 367 ~G~vD~s~~-------~SG~--------ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~ 431 (527) T protein:vir:10 367 VGVVDAAVA-------ESGI--------ALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLT 431 (527) T ss_pred eccccCCcC-------cHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccc Confidence 996654321 0111 1222334443321111 10 010 111111 2 Q ss_pred eeecccccChHHHHHHHH-HHHHHhcCCcCHHHHHHHhCCCC-CCCCCeeeccc----cccchhhhccccccccccCCCC Q lcl|NC_020081. 420 YVFNFVGGDAKTEAEIIS-ILESKAKIGLTINDIRKELGYPD-TEGGDVTLAGV----HVQRLGQIMQQEQVEYQRQMDA 493 (552) Q Consensus 420 ~~~~f~~~d~~~~~~~~~-~~~~~~~g~lT~NE~R~~~gl~p-~~ggD~~~~~~----n~~~~~~~~~~~~~~~~~~~~~ 493 (552) +.+.|-..-+.++.+..+ ..+.+.+|+++.-=+-++|+--+ ++....-+... ..+-+..+.+..... T Consensus 432 v~ivf~p~lP~D~~avie~v~tL~~aGi~S~~tAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~A~~~~~------- 504 (527) T protein:vir:10 432 VTITFRDPKPVNSEKRFNQLLQLWEAGLIPAKKLTEELSKIMGFELTEEDFKQATEDKKTQGIAQAEAADPFG------- 504 (527) T ss_pred eEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCChHHHHHHHHHHHHHHhHHhhhhcCchh------- Confidence 334443333445554443 44567789999877777661110 22111111000 000001111000000 Q ss_pred CccCcccCCCCCCCCCCCCCCCccccc Q lcl|NC_020081. 494 NQFLAQQTGYDGNMDNVNGKDSFNQNV 520 (552) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (552) .+ .....|.+.......+..-+ . T Consensus 505 a~-~~~~~g~~~~~~d~~~~~~~---~ 527 (527) T protein:vir:10 505 AQ-MAAEQGIPDEEDDQALNGQP---L 527 (527) T ss_pred hh-hccccCCCCCCcccccCCCC---C Confidence 00 00001111100000010000 0 No 239 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=52.03 E-value=0.57 Score=21.94 Aligned_cols=419 Identities=12% Similarity=0.039 Sum_probs=168.7 Q ss_pred ccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHH----HH-HHHHHHHH---HHHHHHHHhhc Q lcl|NC_020081. 42 NTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILN----AI-IITRVNQV---SMFCTPARNSD 113 (552) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~----a~-i~~~~~~~---~~~~~~~~~~~ 113 (552) -.-+++|-..+.||-+.-...|.. -++.--.-|+.+-.--.+.+ .. +..+.... ..+..++...+ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~-------v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~ 73 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNA-------VTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKL 73 (527) T ss_pred CCccccccCCCcCcCCccccCccc-------CCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHh Confidence 123344545555553222222211 11111112222111000000 00 00000000 00000000000 Q ss_pred cccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECC---CCCEEE Q lcl|NC_020081. 114 KGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDK---LGDLHN 190 (552) Q Consensus 114 ~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~---~G~~~~ 190 (552) -+..-.+.....+...+....+-...+..|.++. .+.......-++.++.|.+.+.+++|. .|.=+. T Consensus 74 ~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e----------~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~ 143 (527) T protein:vir:10 74 IEAKMRFLGQGLKWEFSKKDAKVDDAIRVLFDRE----------NWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLS 143 (527) T ss_pred hCCcceeeccCccccccchhHHHHHHHHHHHHHh----------hhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCce Confidence 0111111112222222223333334455555543 344566677888999999999999984 334467 Q ss_pred EEEecCceeEEEECCCcccccccceeEEE------------------------E------EcCCceEEEE---------- Q lcl|NC_020081. 191 FKAVDASTVYVAVDEDGKERKAKDGVRYV------------------------Q------VIDDKVVAKF---------- 230 (552) Q Consensus 191 L~~l~p~~v~v~~~~~g~~~~~~~~~~y~------------------------~------~~~~~~~~~~---------- 230 (552) +..+||..+.++.++++..+.. ++..+ . ...|..-++. T Consensus 144 v~~~DP~~~f~~ed~d~~~~v~--~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d 221 (527) T protein:vir:10 144 LHEVDPSTYFPYEDPRYPGQVL--GVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDD 221 (527) T ss_pred EeecCcceeeeeecCCCCCcee--eEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeecccccc Confidence 8889988888877765533211 11000 0 0000000000 Q ss_pred --------------cccc-------------eeeecccccCCccCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCC Q lcl|NC_020081. 231 --------------KAKE-------------MAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDNTEVFNARFFAQGGTT 283 (552) Q Consensus 231 --------------~~~e-------------vi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p 283 (552) ...+ |+|+.-.+ .....+|.|-|+-+...+.....+..-......-++.| T Consensus 222 ~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p---~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~P 298 (527) T protein:vir:10 222 RPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHP---IMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLG 298 (527) T ss_pred ccccccchhhhhhhcCceeeecccCCCCccceEeecCCC---ccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCc Confidence 0000 23442222 23447899988876666665554444444444446677 Q ss_pred ceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCCHHH Q lcl|NC_020081. 284 RGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSIDPSE 363 (552) Q Consensus 284 ~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~ 363 (552) -.++. + ...-+ .+-+.....-| ....+=++++.++..+...+.=..|...+..+.+.|+..-++|.+- T Consensus 299 i~~~t--g-~~~vd-----~~G~~~~~~Vg----PG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA 366 (527) T protein:vir:10 299 FYATD--S-APPRD-----SRGNMVPWTIS----PLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIA 366 (527) T ss_pred eeeec--c-ccccc-----ccCCcCccccC----CceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeee Confidence 55553 2 21111 01111110111 1112225677888888876666668888999999999999999999 Q ss_pred hcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHH-------HH--------hhc-------Cccccc--c Q lcl|NC_020081. 364 INFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDA-------VN--------KYI-------VSQFGG--D 419 (552) Q Consensus 364 lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~-------ln--------~~L-------~~~~~~--~ 419 (552) +|..+.+.. .+.. .++-.|.|++.+-+.. +. ++| +...+. . T Consensus 367 ~G~vD~s~~-------~SG~--------ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~ 431 (527) T protein:vir:10 367 VGVVDAAVA-------ESGI--------ALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLT 431 (527) T ss_pred eccccCCcC-------cHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccc Confidence 996654321 0111 1222334443321111 10 010 111111 2 Q ss_pred eeecccccChHHHHHHHH-HHHHHhcCCcCHHHHHHHhCCCC-CCCCCeeeccc----cccchhhhccccccccccCCCC Q lcl|NC_020081. 420 YVFNFVGGDAKTEAEIIS-ILESKAKIGLTINDIRKELGYPD-TEGGDVTLAGV----HVQRLGQIMQQEQVEYQRQMDA 493 (552) Q Consensus 420 ~~~~f~~~d~~~~~~~~~-~~~~~~~g~lT~NE~R~~~gl~p-~~ggD~~~~~~----n~~~~~~~~~~~~~~~~~~~~~ 493 (552) +.+.|-..-+.++.+..+ ....+.+|+++.-=+-++|+--+ ++....-+... ..+-+..+.+..... T Consensus 432 v~ivf~p~lP~D~~avie~v~tL~~aGiiS~etAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~a~~~~~------- 504 (527) T protein:vir:10 432 VTITFRDPKPVNNEKRFAQLLELWEAGLIPAKKLTEELSKIMGFELTEEDFRQATEDKKTQGIAQAEAADPFG------- 504 (527) T ss_pred eEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCchHHHHHHHHHHHHHHhHHhhhhcCchh------- Confidence 334443333445554443 44566789999877777661110 22111111100 000001111000000 Q ss_pred CccCcccCCCCCCCCCCCCCCCccccc Q lcl|NC_020081. 494 NQFLAQQTGYDGNMDNVNGKDSFNQNV 520 (552) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (552) .+ .....|.+.......+..-+ . T Consensus 505 a~-~~~~~g~~~~~~d~~~~~~~---~ 527 (527) T protein:vir:10 505 AQ-MAAEQGIPDEEDDQALNGQP---L 527 (527) T ss_pred hh-hccccCCCCCCcccccCCCC---C Confidence 00 00001111100000010000 0 No 240 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=49.82 E-value=0.63 Score=21.69 Aligned_cols=425 Identities=10% Similarity=0.071 Sum_probs=147.6 Q ss_pred cccccchhhhhcc-----ccccccccc---------cccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHH Q lcl|NC_020081. 27 RIKQIEEDAILKK-----GKNTKSNKP---------KAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILN 92 (552) Q Consensus 27 ~~~~~~~~~~~~~-----~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~ 92 (552) |..+..++.+.+. +..-++.++ -.|..|..+...... ... .+.+. ..+.... T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~---------~~~----~~~~~-~dst~~~ 66 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDN---------AST----DYTTP-WQAVGAR 66 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCc---------ccc----ccCCc-ccccHHH Confidence 3333323322221 111122221 112222111111110 000 00011 1223333 Q ss_pred HHHHHHHHHHHHHHHHHHhhccccceeeee--ccccc-cCChhHHHHHHHHHHHHHhcC-CCCCCCccCCHHHHHHHHHH Q lcl|NC_020081. 93 AIIITRVNQVSMFCTPARNSDKGVGYEIRL--KDPLQ-EPNDHNKKKIKEIENFIEKTG-RIDNDFTRDNFRSFVKKLVR 168 (552) Q Consensus 93 a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~--k~~~~-~~~~~~~~~~~~l~~~l~~~n-~~~~pn~~~t~~~f~~~~v~ 168 (552) +|. +.+..++....+ ..+ ++++ .+... +... +..+...+..||...- ....-...-+++.-+..++. T Consensus 67 a~~-~Laa~l~~~ltP------~~~-WF~l~~~d~~~~~~~~-~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~ 137 (535) T protein:vir:94 67 GLN-NLASKLMLALFP------MQT-WMKLTISEFEAKQLVA-QPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLK 137 (535) T ss_pred HHH-HHHHHHHhhhcC------CCC-ccccccChhhhhcccc-chhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHH Confidence 333 223332222212 112 2333 22111 1111 1111122333332210 00000123456666777788 Q ss_pred HHHhcCCeeEEEEECC-CCCEEEEEEecCceeEEEECCCccccccc---------------------------ce--eEE Q lcl|NC_020081. 169 DRLTYDKINFELVYDK-LGDLHNFKAVDASTVYVAVDEDGKERKAK---------------------------DG--VRY 218 (552) Q Consensus 169 d~ll~Gna~~~i~r~~-~G~~~~L~~l~p~~v~v~~~~~g~~~~~~---------------------------~~--~~y 218 (552) |+.++||+.+++..+. .+.....||| ..+.+..+..|++-... .. ++. T Consensus 138 ~L~~~G~a~l~~~~~~~~~~~f~~~pl--~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~ 215 (535) T protein:vir:94 138 QLVVAGNALLYIPEPEGTYNPMKLYRL--SSYVVQRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYT 215 (535) T ss_pred HHHhhCcEeEeeccCcCcccceEEEEc--CeEEEeeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEE Confidence 9999999999987653 2223344444 55666666666531100 00 000 Q ss_pred EEEcC--CceEEE-Ecccceeee------------cccccCCccCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_020081. 219 VQVID--DKVVAK-FKAKEMAWE------------VSNPRTDLTVG-KYGYPELEIALNHLQYHDNTEVFNARFFAQGGT 282 (552) Q Consensus 219 ~~~~~--~~~~~~-~~~~evi~~------------~~~~~~~~~~g-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~ 282 (552) ....+ +..... +..++..+. ...+++...+| .||.||.+-++..+.......+.......-... T Consensus 216 ~v~~~~~~~~~~~~~e~~g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~ 295 (535) T protein:vir:94 216 HIYLDEESGEYLKYEEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAK 295 (535) T ss_pred EEEeeCCCCcEEEEEEecCeeeccccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 00001 111100 001110000 00123333333 799999999999888888887777666655556 Q ss_pred CceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec--cCCceeeeccCchhHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_020081. 283 TRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT--AEDVKFVNMTQSSKDMEFEKWLNYLINVICSIYSID 360 (552) Q Consensus 283 p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~--~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP 360 (552) |..++. + +...... .+ ..+..+. ++. .+++...++...++-....+..+..+..|..+|-+. T Consensus 296 ~~~lv~-p-~g~~~~~-------~~---~~~~~g~----~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~ 359 (535) T protein:vir:94 296 VIGLVN-P-AGITQVR-------RL---TKAQTGD----FVSGRPEDISFLQLEKAADFSVARAVSEQIEGRLSYAFMLN 359 (535) T ss_pred CCcccc-c-ccccchh-------hc---ccCCCce----eecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhHh Confidence 654443 2 1111211 11 1111111 222 244556666654433334466678889999999432 Q ss_pred HHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHh-------------hcCcccc-cceeecccc Q lcl|NC_020081. 361 PSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNK-------------YIVSQFG-GDYVFNFVG 426 (552) Q Consensus 361 p~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~-------------~L~~~~~-~~~~~~f~~ 426 (552) . +...+. ...+ +.=-..+..-....|.|.+.+++.+|=. .++++.- ..+..+|.. T Consensus 360 ~--~~~~d~--------~rvT-AtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs 428 (535) T protein:vir:94 360 S--AVQRTG--------ERVT-AEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQIPELPKEAVEPTIST 428 (535) T ss_pred h--hccCCC--------CCcc-HHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEee Confidence 1 111111 0111 1111222233345566666666555532 2333221 123333322 Q ss_pred c-ChHHHHHHHHHHHHHh---cC--------CcCHHH----HHHHhCCCCCCCCCeeeccccccchhh-hc-ccc---cc Q lcl|NC_020081. 427 G-DAKTEAEIISILESKA---KI--------GLTIND----IRKELGYPDTEGGDVTLAGVHVQRLGQ-IM-QQE---QV 485 (552) Q Consensus 427 ~-d~~~~~~~~~~~~~~~---~g--------~lT~NE----~R~~~gl~p~~ggD~~~~~~n~~~~~~-~~-~~~---~~ 485 (552) + ....+....+.+..+. +. .+..++ +-+.+|.|+. .++..+--...+-+ .. +++ .. T Consensus 429 ~la~l~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~---~i~rs~eev~~~~~q~~~~~~~~~~~ 505 (535) T protein:vir:94 429 GMEALGRGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTS---GILKTPEEKQQEMAEAAQGTAMQNAA 505 (535) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChh---hhcCCHHHHHHHHHHHHHHHHHHHHH Confidence 1 1112222111111110 00 122222 2233455421 01111111110000 00 000 00 Q ss_pred ccccC-------CCCCcc--CcccCCCCCC Q lcl|NC_020081. 486 EYQRQ-------MDANQF--LAQQTGYDGN 506 (552) Q Consensus 486 ~~~~~-------~~~~~~--~~~~~~~~~~ 506 (552) ....+ ..+... ..++-|..++ T Consensus 506 ~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:94 506 ASAGAGAGTMATASPENMKAAAAQAGMAPN 535 (535) T ss_pred HHHHHhhhcccccChHHHHHHHHHhccCCC Confidence 00000 000000 0011111111 No 241 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=48.49 E-value=0.67 Score=21.54 Aligned_cols=468 Identities=12% Similarity=0.087 Sum_probs=155.9 Q ss_pred CCCCCCCcccccchhhcccccCcccccccccchhhhhcccccccccccccccccccccc----------ccCCc--cccc Q lcl|NC_020081. 1 MGLLDGFFKGRKQQDNIIDINDDMAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSM----------SMNPD--FKEA 68 (552) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~--~~~~ 68 (552) --++-|-|...-..+|++- +.=--...++..- .+++.+.+......+.. .-+.+ +... T Consensus 12 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~---------s~~g~p~~~~~~~~~~~~~~t~~~D~~~~g~~~~~~~~ 81 (569) T protein:vir:10 12 RKALAGVFKDNGERDNILL-SALAVHGGSGYLF---------SRAGAPVQLSGFLGGKPGDSGMAGDGLVDGSRFIFDEV 81 (569) T ss_pred HHHHhhhhhcCCccchhhh-hhheeecCcceEE---------eecCcchhhhhhhccCccccchhhhhHHHHHHHHhhhc Confidence 1223333333333444420 0000011111110 12222211111111110 00000 0000 Q ss_pred ccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHhhccccceeeeec-cccccCChhHHHHHHHHHHHHHhc Q lcl|NC_020081. 69 PSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVSMFCTPARNSDKGVGYEIRLK-DPLQEPNDHNKKKIKEIENFIEKT 147 (552) Q Consensus 69 ~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k-~~~~~~~~~~~~~~~~l~~~l~~~ 147 (552) ..-+.+......+-.++..++| .+...+.+.+.. .-....|--+.|..+ .....--+..++..+++.+-|..+ T Consensus 82 ~~pr~R~qiY~~~eeM~~~p~I-a~AlniHVtaAL-----ggde~TGd~vfI~p~~~~~~a~~daakai~~el~~dl~~~ 155 (569) T protein:vir:10 82 QLPEDRLQRYPLLEEMAVYSTI-ATALNIHITHAL-----SFDKKTGQTFSIVPVHNGNDSDYDAAQALCGELMNDIGRT 155 (569) T ss_pred cCchhHHHHHHHHHHHhcCchh-hhhhhhhhheee-----cccccccceEEEEeecCCCCCcchHHHHHHHHHHHHHHHH Confidence 0112222222333344433333 333233222210 001112222333221 111111112223344444433222 Q ss_pred CCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEECCCCCEEEEE---EecCceeEEEECCCcccccc------------ Q lcl|NC_020081. 148 GRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYDKLGDLHNFK---AVDASTVYVAVDEDGKERKA------------ 212 (552) Q Consensus 148 n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~~~G~~~~L~---~l~p~~v~v~~~~~g~~~~~------------ 212 (552) .-.-...+.++...+|.+|+.|--+..--|+.|+ +..|+-|++.. .|..... T Consensus 156 -----------iNr~~~~lA~~~~aFGdsYaRiY~~~~~GV~dl~~s~yt~PsfIqpFE--~g~~tvGF~~~~~~~~~~t 222 (569) T protein:vir:10 156 -----------INKEVAGWAFIMSVFGVAYVRPYAKEGIGITSFECSYYTLPSFIKEFE--VSGNLAGFSGDYLKDASGK 222 (569) T ss_pred -----------HHHHhhHHHHHHHhhhhhheeeeccCCceeEEEEecccccccccchhh--hcCceEEeecccCCccccc Confidence 1124556788999999999998754433344443 33444444321 1111000 Q ss_pred -----------cceeEEEEEc------CCceEEEEcccceeeecccccCCccCCcccccHHHHHHHHHHHHHH------H Q lcl|NC_020081. 213 -----------KDGVRYVQVI------DDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNHLQYHDN------T 269 (552) Q Consensus 213 -----------~~~~~y~~~~------~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~i~~~~~------~ 269 (552) ..-++++... .+.....+..++.=+ .|. .-..||-|-++.+.+....... + T Consensus 223 i~~l~p~qm~rmKmPrm~~i~q~~~v~~g~~~~~L~~d~~~~---~Pi---~psn~GgSFL~~ae~pf~~l~~Al~sL~~ 296 (569) T protein:vir:10 223 MVFADPWAIIPMKIPYWRPKSNLMPVHTGHKAYSLLDNPEER---TPI---ETQNYGTSLLEYAYEPYMNLRSAIRSLKA 296 (569) T ss_pred eeeechhhhhhhcccceeeccccchhhhhhhheeeccccccc---ccc---cchhhhhHHHHHHHhHHHHHHHHHHhccc Confidence 0001111000 011111121111111 110 1124788888777654433332 2 Q ss_pred HHHHHHHHhccCCCceEEEeCCCCCCCHHH-----------HHHHHHHHHHHhcccccc-----ccceeeccCCceee-- Q lcl|NC_020081. 270 EVFNARFFAQGGTTRGLLHIKTGQEQSNQA-----------LTSFRREWTSMFSGINGA-----WKIPVITAEDVKFV-- 331 (552) Q Consensus 270 ~~~~~~~f~ng~~p~gil~~~~~~~~s~~~-----------~~~~~~~~~~~~~G~~na-----gk~~il~~~g~~~~-- 331 (552) ++++.+. ...+|.+... .+++.+ +++-++.+++...|.+.- |-+|+. +++--.. T Consensus 297 qri~dSv------~~~~Itlnm~-gM~p~qr~~y~r~lt~~LKr~~d~ie~a~~gg~~~~~~~~H~LPv~-gekq~~~tv 368 (569) T protein:vir:10 297 TRFNASK------IDRIIGLAMN-SLDPVKAADYSRTITQTLKRAADLMERRARGANNMPTVTNTLLPIM-GDGKGQMTI 368 (569) T ss_pred hhhHHHH------HhHHhhcccc-CCCHHHHhHHHHHHHHHHHHHHHHHHHHhccCccccccceeeeeee-cCccccccc Confidence 2333222 2223333211 223333 456677777777765542 223333 2222111 Q ss_pred eccCchhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHH-HHHHHHhhHHHHHHHH-HHH Q lcl|NC_020081. 332 NMTQSSKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYR-NSKDKGLEPLLKFIED-AVN 409 (552) Q Consensus 332 ~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~-~~~~~~l~P~~~~ie~-~ln 409 (552) .+..-+.+.-=+|-.-+..+.+|.++|+++.|||..+-=+ +|.+....-...+..+++ ..+++.+.-++..+-+ .+. T Consensus 369 Dt~~~~A~~~gIEdvM~~~R~LagaLGlD~SMlGwAD~Ls-GGLGeGG~frtSaQaa~RS~~iRqa~~e~in~iidiH~~ 447 (569) T protein:vir:10 369 DTQTIQADINGIEDILTYMRQLAAALGLDYTLLGWADQMS-GGLGEGGFLRTAIQAAMRASWIQQGVEEFIQRAIDIHLA 447 (569) T ss_pred cccccccCcccHHHHHHHHHHHHhhhccchhHhhHHHHhc-ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 2223334444456666888999999999999999866422 222333333344433333 3344444333332211 111 Q ss_pred hh---cCcccccceeecccccChHHHHHH----------HHHHHHHh-----cCCcCHHHHHHHhCCCCCCCCCeeeccc Q lcl|NC_020081. 410 KY---IVSQFGGDYVFNFVGGDAKTEAEI----------ISILESKA-----KIGLTINDIRKELGYPDTEGGDVTLAGV 471 (552) Q Consensus 410 ~~---L~~~~~~~~~~~f~~~d~~~~~~~----------~~~~~~~~-----~g~lT~NE~R~~~gl~p~~ggD~~~~~~ 471 (552) .| .++..+..|.++|.......+.|. +.++.+.. ++.+-.||.-..+=+..+=+.|. T Consensus 448 fKYgevf~~~drP~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~Q~la~l~e~n~Lg~de~~m~y~l~d~~~~De----- 522 (569) T protein:vir:10 448 FKYGKVYPEGDRPYKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDAFKRYLFSDVLEIDE----- 522 (569) T ss_pred hhcCcccCCCCcceEEEeccchHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccHHHHHHHHHHHhhcch----- Confidence 11 123334558888876554332222 11111111 11222222111100000001110 Q ss_pred cccchhhhccccccccccCCCCCccCcccCCCCCCCCCC-CCCCCcccccCCCCcccc Q lcl|NC_020081. 472 HVQRLGQIMQQEQVEYQRQMDANQFLAQQTGYDGNMDNV-NGKDSFNQNVGKDGQSKQ 528 (552) Q Consensus 472 n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 528 (552) ++. +.+..... .++.+. +-. .+.--..| ..-..-...+.+++++.+ T Consensus 523 ~~~--e~l~ae~~---akp~DE-e~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 569 (569) T protein:vir:10 523 KIS--EALVNELK---AKSEDD-DHL-----MDSIIKTPPQELAQILESVFKEGNDND 569 (569) T ss_pred hHH--HHHHhhcC---CCcchh-HHH-----HHHHhcCChHHHHHHHHHHhhccCCCC Confidence 000 00000000 000000 000 00000000 000000011112221111 No 242 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=39.32 E-value=1 Score=20.53 Aligned_cols=418 Identities=14% Similarity=0.103 Sum_probs=149.0 Q ss_pred Ccccccccccchhhhhcccccccccccc---------ccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHH Q lcl|NC_020081. 22 DDMAVRIKQIEEDAILKKGKNTKSNKPK---------AYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILN 92 (552) Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~ 92 (552) -.--.+...++++.+.+-+..-++.++. .+..|.. +...+.... ..+. ..++... T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~--------~~~~~~~~~-------~~~~-~dstg~~ 64 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYL--------MNNKGDNET-------SQNG-WQGVGAQ 64 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccc--------cCCCCCccc-------cccc-ccchHHH Confidence 0000111222222222222222222221 1122211 000110000 1011 1223233 Q ss_pred HHHHHHHHHHHHHHHHHHhhccccce-eeeeccccc-cCC--hhHHHHH----HHHHHHHHhcCCCCCCCccCCHHHHHH Q lcl|NC_020081. 93 AIIITRVNQVSMFCTPARNSDKGVGY-EIRLKDPLQ-EPN--DHNKKKI----KEIENFIEKTGRIDNDFTRDNFRSFVK 164 (552) Q Consensus 93 a~i~~~~~~~~~~~~~~~~~~~~~~~-~i~~k~~~~-~~~--~~~~~~~----~~l~~~l~~~n~~~~pn~~~t~~~f~~ 164 (552) ++ ++.+..++.... -.+.+| .+.+.+... ..+ +.+...+ ..+++.+..- ...-+++.-+. T Consensus 65 a~-~~LAa~l~~~lt-----pp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~------l~~snf~~~~~ 132 (515) T protein:vir:70 65 AT-NHLANKLAQVLF-----PAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKA------LEQRQFRPAIV 132 (515) T ss_pred HH-HHHHHHHHHhhc-----CCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHH------HHhcCchHHHH Confidence 33 222322221111 111121 222222211 111 1122221 1222322221 12235666777 Q ss_pred HHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCccccccc------------------------------- Q lcl|NC_020081. 165 KLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAK------------------------------- 213 (552) Q Consensus 165 ~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~------------------------------- 213 (552) .++.|+..+||+.+++. ..+. ...||| ..+.+..|..|++-... T Consensus 133 ~~~~~L~~~G~a~l~~d--~~~~-~~~~pl--~~y~v~~d~~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~ 207 (515) T protein:vir:70 133 EVFKHLIVAGNCLLYKP--SKGA-MSAVPM--HHYVVNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDD 207 (515) T ss_pred HHHHHHHhHCeEEEEEe--CCCC-eEEEEc--CeEEEeeCCCcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCC Confidence 78889999999998873 3332 445666 34555555555421100 Q ss_pred ------------ce-eEEEEEcCCceEEE---EcccceeeecccccCCccC-CcccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 214 ------------DG-VRYVQVIDDKVVAK---FKAKEMAWEVSNPRTDLTV-GKYGYPELEIALNHLQYHDNTEVFNARF 276 (552) Q Consensus 214 ------------~~-~~y~~~~~~~~~~~---~~~~evi~~~~~~~~~~~~-g~~G~spl~~~~~~i~~~~~~~~~~~~~ 276 (552) .+ ..+++..++..... ++.++.=|+ .+++...+ ..||.||.+-++..+.......+..... T Consensus 208 ~v~i~~~v~~~~~~~~~~~~e~d~~~~~~es~y~~~e~P~~--~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~ 285 (515) T protein:vir:70 208 NVKLYTHAQYAGEGFWKINQSADDIPVGKESRIKSEKLPFI--PLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARG 285 (515) T ss_pred ceEEEEEEEecCCCceEEEEecCceeeccccccccccCCce--eeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHH Confidence 00 01111111111100 000111111 12344433 3799999999999999999888888887 Q ss_pred HhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecc--CCceeeeccCchhHHH-HHHHHHHHHHHH Q lcl|NC_020081. 277 FAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITA--EDVKFVNMTQSSKDME-FEKWLNYLINVI 353 (552) Q Consensus 277 f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~--~g~~~~~l~~~~~d~q-~~e~~~~~~~~I 353 (552) ..-...|..++. .+...... .+ ..|..+ .++.+ +++...+++.. .|.+ ..+..+..+..| T Consensus 286 ~~~a~~p~~lv~--~~g~~~~~-------~l---~~~~~g----~iv~g~~~~v~~~~~~~~-~d~~~~~~~i~~~~~rI 348 (515) T protein:vir:70 286 AALMADIKYLIR--PGSQTDVD-------HF---VNSGTG----EVITGVAEDIHIVQLGKY-ADLTPISAVLEVYTRRI 348 (515) T ss_pred HHHhcCCCeeeC--cccccchh-------hc---cccCCc----eeecCCcccceeeecCcc-cchhHHHHHHHHHHHHH Confidence 777777765553 22222211 11 122111 12322 33444444432 2333 335567888999 Q ss_pred HHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhc--------Ccccc-cceeecc Q lcl|NC_020081. 354 CSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYI--------VSQFG-GDYVFNF 424 (552) Q Consensus 354 a~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L--------~~~~~-~~~~~~f 424 (552) ..+|-+........++=| +.--..+..-....|.|.+.++.++|=.-| +++.- ..+...+ T Consensus 349 ~~af~~~~l~~rd~~rvT-----------AtEV~~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~~~~p~~P~~~v~~~~ 417 (515) T protein:vir:70 349 GVIFMMETMTRRDAERVT-----------AVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVI 417 (515) T ss_pred HHHHhhhhhhccCCcccc-----------HHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhCCCCChhhcccce Confidence 999977654433222111 111222333345567788888777764433 22211 1122222 Q ss_pred ccc-ChHHHHHHHHHHHH---Hhc----------CCcCHHH----HHHHhCCCCCCCCCeeeccc-cccchhhh-ccccc Q lcl|NC_020081. 425 VGG-DAKTEAEIISILES---KAK----------IGLTIND----IRKELGYPDTEGGDVTLAGV-HVQRLGQI-MQQEQ 484 (552) Q Consensus 425 ~~~-d~~~~~~~~~~~~~---~~~----------g~lT~NE----~R~~~gl~p~~ggD~~~~~~-n~~~~~~~-~~~~~ 484 (552) ..+ ....+....+.+.. ..+ -.+..++ +....|.|+- ++.+- .+..+-+. .+.+. T Consensus 418 vs~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~-----~~rs~eev~~~r~q~~~~~~ 492 (515) T protein:vir:70 418 VTGIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELP-----FLKSEEEMQQEMAQQAQAQQ 492 (515) T ss_pred ehhHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHHHHhCCCcc-----ccCCHHHHHHHHHHHHHHHH Confidence 111 22222222211110 111 0122222 2222222210 11110 00000000 00000 Q ss_pred cccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCcccccccccccccc Q lcl|NC_020081. 485 VEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQSKQQANTNSTPQG 538 (552) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 538 (552) . +....+-+..... . ..+..+ |+ T Consensus 493 ~---------~~~~~~~~~a~~~--~------~~~~~~--------------~~ 515 (515) T protein:vir:70 493 E---------AMLNEGVAKAVPG--V------IQQEMK--------------EG 515 (515) T ss_pred H---------HHHHHhhhhhccc--c------hhhhhc--------------cC Confidence 0 0000000000000 0 000000 00 No 243 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=35.59 E-value=1.2 Score=20.11 Aligned_cols=426 Identities=13% Similarity=0.067 Sum_probs=159.5 Q ss_pred cchhhhhcccccccccccc---------ccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHH Q lcl|NC_020081. 31 IEEDAILKKGKNTKSNKPK---------AYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQ 101 (552) Q Consensus 31 ~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~ 101 (552) ++.+.+.+.+...++.++. .|..|...++ ....+..+.. .......+ ..+....+|.+ .+.. T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~------~~~~~~~~~~-~~~~~~~i-~dst~~~a~~~-Las~ 71 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDF------FSDLRSEGSI-NWNQNREV-FDSTAGDGLET-LSSS 71 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccccc------ccCCCCCccc-cccccccc-ccchHHHHHHH-HHHH Confidence 3333333333333333221 1111111110 0000000000 00001111 12222333332 2222 Q ss_pred HHHHHHHHHhhccccce-eeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEE Q lcl|NC_020081. 102 VSMFCTPARNSDKGVGY-EIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFEL 180 (552) Q Consensus 102 ~~~~~~~~~~~~~~~~~-~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i 180 (552) ++.... -.+.+| .+...+.+.......+.....+++.+..-. ..-+++.-+..++.|+.++||+.+++ T Consensus 72 L~~~lt-----Pp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l------~~snf~~~~~~~~~~L~~~G~a~l~~ 140 (547) T protein:vir:10 72 LHGSLT-----SPATKWFELAFRDKELNSDDECRKWLENATHDVYSAL------QDSNFNLEANETYIDLCGYGNAIMVE 140 (547) T ss_pred HHHhhc-----CCCCcccccccCCccccchHHHHHHHHHHHHHHHHHH------HhcCcHHHHHHHHHHHHhHCcEeEEe Confidence 221111 112222 222223222222222333344444443322 12345666777789999999999999 Q ss_pred EECCC-CCEEEEEEecCceeEEEECCCccccccc-----------------------------cee----EE--E--EEc Q lcl|NC_020081. 181 VYDKL-GDLHNFKAVDASTVYVAVDEDGKERKAK-----------------------------DGV----RY--V--QVI 222 (552) Q Consensus 181 ~r~~~-G~~~~L~~l~p~~v~v~~~~~g~~~~~~-----------------------------~~~----~y--~--~~~ 222 (552) ..+.+ +....+..++...+.+..+..|++-... .+. .+ + .+. T Consensus 141 ~~d~~~~~~~r~~~~pl~~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~ 220 (547) T protein:vir:10 141 EEDEDEEGSVVFQSSPIQDSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFT 220 (547) T ss_pred ccCCCCCCceeEEEeecceEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEee Confidence 87642 3345566677778888787777642110 000 00 0 000 Q ss_pred C-Cc---------------e--EEEEcccc---eeee-------cccccCCccCC-cccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 223 D-DK---------------V--VAKFKAKE---MAWE-------VSNPRTDLTVG-KYGYPELEIALNHLQYHDNTEVFN 273 (552) Q Consensus 223 ~-~~---------------~--~~~~~~~e---vi~~-------~~~~~~~~~~g-~~G~spl~~~~~~i~~~~~~~~~~ 273 (552) . .. . ...+..++ ++.. ...+++...+| .||.||.+.++..+.......+.. T Consensus 221 ~~~~~~~~~~~~~~~~~~~p~~s~~~e~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~ 300 (547) T protein:vir:10 221 RYDKKQNRNAGTVLAPTERPFGKKWILKEGAVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELV 300 (547) T ss_pred ccCCCCCccccceeeccccceeEEEEEecCceeeeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHH Confidence 0 00 0 00001111 1000 01123343333 799999999999999998888888 Q ss_pred HHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeeccCCceeeeccCchhHHHHHHHHHHHHHHH Q lcl|NC_020081. 274 ARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITAEDVKFVNMTQSSKDMEFEKWLNYLINVI 353 (552) Q Consensus 274 ~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~~g~~~~~l~~~~~d~q~~e~~~~~~~~I 353 (552) .....-...|..++. .+....+ ++ ...| . .++.+..-.++++...++-....+..+.....| T Consensus 301 l~~~~~~~~pp~~v~--~~g~~~~---------~~-~~pg-----g-~~~~~~~~~v~pl~~~~~~~~~~~~i~~~~~rI 362 (547) T protein:vir:10 301 LRSSEKVIDPAIMVT--ERGLISD---------ID-LGAS-----G-LTVVRDMESMKPFESRARFDVSSIQLTDLRSAV 362 (547) T ss_pred HHHHHHHhcCceecc--ccccccc---------ce-ecCC-----e-eeecCCcccceeeecccchHHHHHHHHHHHHHH Confidence 777777777775543 2211111 11 1111 2 233344445566655443233346678889999 Q ss_pred HHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHH------------h-hcCcccc--- Q lcl|NC_020081. 354 CSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVN------------K-YIVSQFG--- 417 (552) Q Consensus 354 a~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln------------~-~L~~~~~--- 417 (552) ..+|-++...+-...+-| +.--.....-....|.|...+++.+|- + .++++.. T Consensus 363 ~~af~~d~~~~~~~~~~T-----------AtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l 431 (547) T protein:vir:10 363 RRIYYVDQLQMKDSPAMT-----------ATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKL 431 (547) T ss_pred HHHhhhhhhhcCCCcccc-----------HHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhh Confidence 999987665443222111 111112222233444555554444432 2 2333221 Q ss_pred -----cceeecccccChHHHHHHHH-------HHHHHh--cC-------CcCHHHH----HHHhCCCCCCCCCeeecccc Q lcl|NC_020081. 418 -----GDYVFNFVGGDAKTEAEIIS-------ILESKA--KI-------GLTINDI----RKELGYPDTEGGDVTLAGVH 472 (552) Q Consensus 418 -----~~~~~~f~~~d~~~~~~~~~-------~~~~~~--~g-------~lT~NE~----R~~~gl~p~~ggD~~~~~~n 472 (552) ..+.+++ ..+..++.+.. .+..+. .+ .+..+++ -..+|.|+ +.+...-. T Consensus 432 ~~~~~~~~~v~~--is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~----~~irs~ee 505 (547) T protein:vir:10 432 LESGKAAMDIVY--TGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQ----TLMRPKAK 505 (547) T ss_pred hccCcceEEEEe--ccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCCh----hccCCHHH Confidence 1223333 23333332211 111110 01 1233333 33455543 11211111 Q ss_pred ccchhhh--ccccccc-cccCCCCCccCcccCCCCCCCCCCCCCCCccc Q lcl|NC_020081. 473 VQRLGQI--MQQEQVE-YQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQ 518 (552) Q Consensus 473 ~~~~~~~--~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (552) ...+-+. .+++... +.......+.... -|.....-+ .|+ T Consensus 506 v~~~r~qr~~~~q~~~qaa~~~~~g~~m~~-~~~~~a~~~------~~~ 547 (547) T protein:vir:10 506 VTSIRKNRSQTQQKAEQAAIAEAEGNAMEA-QGKGQAALK------ENQ 547 (547) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hcCcccchh------ccC Confidence 1111000 0000000 0000000000000 000000000 000 No 244 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=32.95 E-value=1.4 Score=19.80 Aligned_cols=433 Identities=11% Similarity=0.061 Sum_probs=154.7 Q ss_pred cccccc-ccchhhhhccccccccccc---------cccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHH Q lcl|NC_020081. 24 MAVRIK-QIEEDAILKKGKNTKSNKP---------KAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNA 93 (552) Q Consensus 24 ~~~~~~-~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a 93 (552) |....+ +.-++.+-+-+..-++.++ -.|..|..+... +.... ..+.++- .+....+ T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~---------~~~~~----~~~~~~~-dst~~~a 66 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKE---------SDNES----TDYTTPW-QAVGARG 66 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCC---------CCccc----ccccccc-cccHHHH Confidence 211110 0011111111111111111 111222111100 00000 0000111 2222233 Q ss_pred HHHHHHHHHHHHHHHHHhhccccceeeeeccccc---c--CChh----HHHHHHHHHHHHHhcCCCCCCCccCCHHHHHH Q lcl|NC_020081. 94 IIITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQ---E--PNDH----NKKKIKEIENFIEKTGRIDNDFTRDNFRSFVK 164 (552) Q Consensus 94 ~i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~---~--~~~~----~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~ 164 (552) + ++.+..++..+.+ ..+ ++++.-.+. + ..+. .+.-...+++.+..-. ..-+++.-+. T Consensus 67 ~-~~Laa~l~~~ltP------~~~-WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~------~~snf~~~~~ 132 (535) T protein:vir:33 67 L-NNLASKLMLALFP------MQS-WMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYI------ESNSYRVTLF 132 (535) T ss_pred H-HHHHHHHHHhhcC------CCc-ccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHH------HhcCcHHHHH Confidence 3 3333333222222 112 333321111 0 0111 1111223333333221 2235666777 Q ss_pred HHHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccc------------------------------ Q lcl|NC_020081. 165 KLVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKD------------------------------ 214 (552) Q Consensus 165 ~~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~------------------------------ 214 (552) .+..|++++||+.+++..+.. ....+..++-..+.+..+..|++-.... T Consensus 133 ~~~~~L~~~G~a~l~~~~~~~-~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~ 211 (535) T protein:vir:33 133 ECLKQLIVAGNALLYLPEPEG-SYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMV 211 (535) T ss_pred HHHHHHHhhCceeEEeecCCC-CceeeEEEEcCeeEEeeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCC Confidence 788899999999999887643 3344444455666666776665311100 Q ss_pred eeEEEEEcC--CceEEEEc--ccceee-------e----cccccCCccCC-cccccHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020081. 215 GVRYVQVID--DKVVAKFK--AKEMAW-------E----VSNPRTDLTVG-KYGYPELEIALNHLQYHDNTEVFNARFFA 278 (552) Q Consensus 215 ~~~y~~~~~--~~~~~~~~--~~evi~-------~----~~~~~~~~~~g-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ 278 (552) .++...... ++....+. .+..++ + ...+++...+| .||.||.+-++..+.......+....... T Consensus 212 ~v~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~ 291 (535) T protein:vir:33 212 DVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSM 291 (535) T ss_pred eEEEEEEeeCCCCcEEEEEEEeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 001111111 11111110 000000 0 00123333333 79999999999999999999888888888 Q ss_pred ccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceee--ccCCceeeeccCchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 279 QGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVI--TAEDVKFVNMTQSSKDMEFEKWLNYLINVICSI 356 (552) Q Consensus 279 ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il--~~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~ 356 (552) -...|..++. .+...... +...|..+. ++ ..+++...++...++-.-..+..+..+..|..+ T Consensus 292 ~~~~p~~lv~--~~g~~~~~----------~~~~~~~g~----~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~a 355 (535) T protein:vir:33 292 ISAKVIGLVN--PAGITQPR----------RLTKAQTGD----FVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYA 355 (535) T ss_pred HHhcCceeec--cccccchh----------hcccCCcee----eecCCcccceeeecccccchhHHHHHHHHHHHHHHHH Confidence 8888876543 22221211 112222222 22 234566666665554334456677888999999 Q ss_pred hcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHh-------------hcCccc-ccceee Q lcl|NC_020081. 357 YSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNK-------------YIVSQF-GGDYVF 422 (552) Q Consensus 357 fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~-------------~L~~~~-~~~~~~ 422 (552) |-+.. +...+. ...+ +.=-..+..-....|.|.+.+++++|=. .++++. +..+.+ T Consensus 356 f~~~~--~~~~~~--------~r~T-AtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~ 424 (535) T protein:vir:33 356 FMLNS--AVQRTG--------ERVT-AEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEP 424 (535) T ss_pred Hhhhh--cccCCC--------cccc-HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeE Confidence 95441 211111 0111 1111222223344455555555544432 233332 234556 Q ss_pred cccccChH-HHH-------HHHHHHHHHh---cC-CcCH----HHHHHHhCCCCCCCCCeeeccccccchhh-----hcc Q lcl|NC_020081. 423 NFVGGDAK-TEA-------EIISILESKA---KI-GLTI----NDIRKELGYPDTEGGDVTLAGVHVQRLGQ-----IMQ 481 (552) Q Consensus 423 ~f~~~d~~-~~~-------~~~~~~~~~~---~g-~lT~----NE~R~~~gl~p~~ggD~~~~~~n~~~~~~-----~~~ 481 (552) +|..+-.. .+. .++..+..+. .. .+.. +++...+|.|+. .++...-..+.+-+ +.. T Consensus 425 ~yis~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~---~i~~~~ee~~~~~~q~~~~~~~ 501 (535) T protein:vir:33 425 TISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTS---GILLTDEQKQALMMQDAAQTGV 501 (535) T ss_pred EEecHHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHh---HhcCCHHHHHHHHHHHHHHHHH Confidence 66433111 111 1111111110 00 1222 222334555431 00100000000000 000 Q ss_pred ccccccccCCCCCccCcccCCCCCCCCCCCCCCCcccccCCCCc Q lcl|NC_020081. 482 QEQVEYQRQMDANQFLAQQTGYDGNMDNVNGKDSFNQNVGKDGQ 525 (552) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (552) .+...+..++-...+ ...+..-..-....|-+.. T Consensus 502 ~~~~~~~g~~~~~~~----------~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 502 ENAAAAGGAGVGALA----------TSSPEAMQGAAAKAGLNAT 535 (535) T ss_pred HHHHHhhhhhhcchh----------hcCChhHHHHHHhccCCCC Confidence 000000000000000 0000000000001111100 No 245 >protein:vir:97376 Length: 320 # NCBI annotation: putative portal protein # Family: family:all:11744 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762589;genbank:gi:115304290;genbank:GeneID:5130579 Probab=30.62 E-value=1.6 Score=19.52 Aligned_cols=305 Identities=13% Similarity=0.129 Sum_probs=114.3 Q ss_pred ccccccccchhhhhccccccccccccccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHHHHHHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPKAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAIIITRVNQVS 103 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~i~~~~~~~~ 103 (552) |..+- .+++..-.....+.+..+.... -+.| ..+... | .+.+-.+..++ T Consensus 1 ~~~~~------------~~~~~~~~~~~~~~~~~~~~~~---~~~~-~~~~~~-------~--------~~~~~~~~~~~ 49 (320) T protein:vir:97 1 MGIFN------------FKKRETLTPELKESIIRQVTIE---DESP-FTGTTD-------F--------NVRNEVAESIA 49 (320) T ss_pred CCccc------------cccccccChhHHhhhhheeeec---cCCC-cccccc-------c--------chhhHHHHHHH Confidence 00000 0000000000001111100000 0000 000000 0 00000111122 Q ss_pred HHHHHHHhhccccceeeeeccccccCChhHHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHHHHHHHHhcCCeeEEEEEC Q lcl|NC_020081. 104 MFCTPARNSDKGVGYEIRLKDPLQEPNDHNKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKKLVRDRLTYDKINFELVYD 183 (552) Q Consensus 104 ~~~~~~~~~~~~~~~~i~~k~~~~~~~~~~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~~v~d~ll~Gna~~~i~r~ 183 (552) .+.-.|..+- +.| ..+..+| .|++.++.+.|..-..|++.-.. T Consensus 50 ~~~~~~~~~~----------------------------~~~--~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~ 92 (320) T protein:vir:97 50 TYLGAYKTSA----------------------------KRL--SLLTNNP-------SFLRRLVKHALHNKTTYVYKSPT 92 (320) T ss_pred HHhhhhcccc----------------------------cee--eeeeCCH-------HHHHHHHHHhhcccceEEeeCCc Confidence 2221111110 000 1122222 59999999999988888876432 Q ss_pred CCCCEEEEEEecCceeEEEECCCcccccccce-eEEEEEcCCceEEEEcccceeeecccccCCccCCcccccHHHHHHHH Q lcl|NC_020081. 184 KLGDLHNFKAVDASTVYVAVDEDGKERKAKDG-VRYVQVIDDKVVAKFKAKEMAWEVSNPRTDLTVGKYGYPELEIALNH 262 (552) Q Consensus 184 ~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~~-~~y~~~~~~~~~~~~~~~evi~~~~~~~~~~~~g~~G~spl~~~~~~ 262 (552) -|.++ -++-++.=.+..-. +.-+.+ ..-+ ....+|..-+|| .+..+|+-+-+.. .. T Consensus 93 -~~~~~----~~~~~~~~~~~~~~--~~~~D~FN~~V-----~mtvpfyD~~IL----------dnpl~gv~tqe~g-kM 149 (320) T protein:vir:97 93 -YGWLI----TDSMTIEGLRARLT--FTLPDPFNSAV-----TMTVPFYDVGII----------DSPLVEVDTEEAN-KM 149 (320) T ss_pred -cceee----ecceeeeeeeeeEE--EecCcccceeE-----EEEeeeechhhh----------hhhhcccChHHhh-HH Confidence 23221 12212111110000 000000 0001 011222222222 1226777765322 22 Q ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccc-ccccceeeccCCceeeeccCc----- Q lcl|NC_020081. 263 LQYHDNTEVFNARFFAQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGIN-GAWKIPVITAEDVKFVNMTQS----- 336 (552) Q Consensus 263 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~-nagk~~il~~~g~~~~~l~~~----- 336 (552) +++ +....-+-+.|.+.....++.+.+..+ ++-.++.+..+.++..-++ =+|-.++ +.|-+++++..+ T Consensus 150 ~g~---a~~~v~kkL~~~~~IKafi~Tdid~GL-ee~kD~~~~kIk~mq~~A~~~nG~T~i--~~~dDI~Qi~pDYS~sn 223 (320) T protein:vir:97 150 LEA---AYSAVMKKLHNTGAIKAFISSDIDVGL-EKMKEESDSKIKAMLATAELLSGYTYI--QRGDDVTQMMPDYTTSN 223 (320) T ss_pred HHH---HhhhhhhhccccceeEEEEecccchhH-HHHHHHHHHHHHHHHHHHHHhcCcccc--cCCcceeeecccccccc Confidence 222 333344556667777777765433111 3333344333333222111 1222222 345666665432 Q ss_pred hhHHHHHHHHHHHHHHHHHHhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHH---HHHHHhhcC Q lcl|NC_020081. 337 SKDMEFEKWLNYLINVICSIYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFI---EDAVNKYIV 413 (552) Q Consensus 337 ~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~i---e~~ln~~L~ 413 (552) ..|. ...++..+.-|+||..+|- .+..+.+..+|+.+.+.|+++++ |-.|..++ T Consensus 224 ~~D~------~l~~t~alS~y~m~~~IL~----------------GsAte~~~Iaf~~~~V~PLL~Q~~~~Ek~Lvy~m- 280 (320) T protein:vir:97 224 VTDF------AAMRTFAASQLSVSDKILD----------------GSATDGEKVAVMFRFVEPILEQFREYEPSLIYAM- 280 (320) T ss_pred hhHH------HHHHHHHHhhcCCchhhcc----------------ccCCcceeeehhhHhHHHHHHHhhhcCcceeeee- Confidence 2233 3456778888999998872 12345667789999999999997 44443332 Q ss_pred cccccceeecccc--cChHHHHHHHHHHHHHhcCC---cCHHHHHHHhCCCCCCCCCeeec Q lcl|NC_020081. 414 SQFGGDYVFNFVG--GDAKTEAEIISILESKAKIG---LTINDIRKELGYPDTEGGDVTLA 469 (552) Q Consensus 414 ~~~~~~~~~~f~~--~d~~~~~~~~~~~~~~~~g~---lT~NE~R~~~gl~p~~ggD~~~~ 469 (552) ....++.|.. +...+ .++.|| ..|||- .|||+--+ T Consensus 281 ---~~E~FVs~mtTGG~l~S---------~~~~~~~~~~~~~~~---------~~~~~~~~ 320 (320) T protein:vir:97 281 ---RDEFFVSFMTTGGMLNS---------NRVDGWGKEKAPNES---------KGGDVGDV 320 (320) T ss_pred ---ccceeeeeeecCceeec---------ccccccccccCCccc---------cCCcccCC Confidence 2223344432 11110 111232 223331 24543221 No 246 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=25.57 E-value=2 Score=18.88 Aligned_cols=424 Identities=12% Similarity=0.103 Sum_probs=147.9 Q ss_pred ccccccccchhhhhccccccccccc---------cccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKP---------KAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAI 94 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~ 94 (552) |...-...-.+.+.+-+..-++.++ -.|..|..+..+.+++- ..+.+. ..+....+| T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~-------------~~~~~~-~dst~~~a~ 66 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS-------------TDYQTP-WQAVGARGL 66 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc-------------cccccc-ccccHHHHH Confidence 2221111111122222222222221 11222221111111100 001111 122223333 Q ss_pred HHHHHHHHHHHHHHHHhhccccceeeeeccccccC-----ChhHH----HHHHHHHHHHHhcCCCCCCCccCCHHHHHHH Q lcl|NC_020081. 95 IITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEP-----NDHNK----KKIKEIENFIEKTGRIDNDFTRDNFRSFVKK 165 (552) Q Consensus 95 i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~-----~~~~~----~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~ 165 (552) . +.+..++....+ ..+ ++++.-.+... .+.+. +-...+++.++.-. ..-+++.-+.. T Consensus 67 ~-~Laa~l~~~ltP------~~~-WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l------~~snf~~~~~~ 132 (536) T protein:vir:10 67 N-NLASKLMLALFP------MQT-WMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYI------ESNSYRVTLFE 132 (536) T ss_pred H-HHHHHHHhhhcC------CCc-ccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHH------HhcCcHHHHHH Confidence 3 222222221111 112 34433222111 01111 11222233332211 22346666777 Q ss_pred HHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCcccccccc------------------------------e Q lcl|NC_020081. 166 LVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAKD------------------------------G 215 (552) Q Consensus 166 ~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~~------------------------------~ 215 (552) ++.|+.++||+..++..+..+.+..+..+|-..+.+..|.+|++-.... . T Consensus 133 ~~~~L~~~G~a~ly~~e~~~~~~~~~~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~ 212 (536) T protein:vir:10 133 ALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETID 212 (536) T ss_pred HHHHHHhHCcEeEEEeeCCCCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceE Confidence 8889999999999998766554444444444566666666665311100 0 Q ss_pred eE-------------EEEEcCCceEEEEc----ccceeeecccccCCccCC-cccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 216 VR-------------YVQVIDDKVVAKFK----AKEMAWEVSNPRTDLTVG-KYGYPELEIALNHLQYHDNTEVFNARFF 277 (552) Q Consensus 216 ~~-------------y~~~~~~~~~~~~~----~~evi~~~~~~~~~~~~g-~~G~spl~~~~~~i~~~~~~~~~~~~~f 277 (552) ++ +++..++....... -++.=|+ .+++...+| .||.||.+-+...+.......+...... T Consensus 213 v~~~V~~~~~~~~~~~~~e~~g~~v~~~~g~~~f~~~P~i--~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~ 290 (536) T protein:vir:10 213 VYTHIYLDEASGEYLRYEEVEGMEVQGSDGTYPKEACPYI--PIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMS 290 (536) T ss_pred EEEEEEEecCCCcEEEEEeecCccccccccccccccCCce--eeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 00 01111111100000 0011011 123333333 7999999999988888888877777655 Q ss_pred hccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec--cCCceeeeccCchhHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 278 AQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT--AEDVKFVNMTQSSKDMEFEKWLNYLINVICS 355 (552) Q Consensus 278 ~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~--~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 355 (552) .-...|..++. +.+ ..... .+ ..+..+. ++. .+++...++...++-.-..+..+..+..|.. T Consensus 291 ~~a~~~~~lv~-p~g-~~~~~-------~~---~~~~~g~----~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~ 354 (536) T protein:vir:10 291 MISSKVIGLVN-PAG-ITQPR-------RL---TKAQTGD----FVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSF 354 (536) T ss_pred HHHhcCCcccC-ccc-ccchh-------hh---ccCCCcc----eecCCcccceeeeccccccchHHHHHHHHHHHHHHH Confidence 55555554443 221 11111 11 1121111 221 2345555655444433344666788899999 Q ss_pred HhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHh-------------hcCccccc-cee Q lcl|NC_020081. 356 IYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNK-------------YIVSQFGG-DYV 421 (552) Q Consensus 356 ~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~-------------~L~~~~~~-~~~ 421 (552) +|-+.. +...+. ...+ +.--..+..-....|.|.+.+++.+|=. .++++.-. .+. T Consensus 355 af~~~~--l~~~~~--------~r~T-AtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~ 423 (536) T protein:vir:10 355 AFMLNS--AVQRTG--------ERVT-AEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVE 423 (536) T ss_pred HHhhhh--cccCCC--------CCcc-HHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhcc Confidence 995542 111111 0111 1111122222334455555555444432 23332221 223 Q ss_pred eccccc-ChHHHHHHHH-------HHHHHhc---C-CcCHHHHH----HHhCCCCCCCCCeeeccccccchh-----hhc Q lcl|NC_020081. 422 FNFVGG-DAKTEAEIIS-------ILESKAK---I-GLTINDIR----KELGYPDTEGGDVTLAGVHVQRLG-----QIM 480 (552) Q Consensus 422 ~~f~~~-d~~~~~~~~~-------~~~~~~~---g-~lT~NE~R----~~~gl~p~~ggD~~~~~~n~~~~~-----~~~ 480 (552) .++..+ ....+....+ .+....- . .+..+++- +.+|.+|.. .+..+--.+.+- ++. T Consensus 424 ~~~vs~l~~l~r~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~---~irt~eev~~~r~q~~~~~~ 500 (536) T protein:vir:10 424 PTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSG---ILLTEEQKQQKMAQQSMQMG 500 (536) T ss_pred ceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCCchh---hcCCHHHHHHHHHHHHHHHH Confidence 333211 1222222211 1111100 1 12223322 234553321 111100000000 000 Q ss_pred ccccccccc---------CCCCCccCcccCCCCCCC Q lcl|NC_020081. 481 QQEQVEYQR---------QMDANQFLAQQTGYDGNM 507 (552) Q Consensus 481 ~~~~~~~~~---------~~~~~~~~~~~~~~~~~~ 507 (552) ..+.+.+.. .+..-+...++.|..+.- T Consensus 501 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:10 501 MDNGAAALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) T ss_pred HHHHHHHHHHHHHHHHhcCchhHHhhhhccccCCCC Confidence 000000000 000000000011110000 No 247 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=24.71 E-value=2.1 Score=18.77 Aligned_cols=424 Identities=12% Similarity=0.107 Sum_probs=147.9 Q ss_pred ccccccccchhhhhccccccccccc---------cccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKP---------KAYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAI 94 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~ 94 (552) |...-...-.+.+.+-+..-++.++ -.|..|..+..+.+.+- ..+.+. ..+....+| T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~-------------~~~~~~-~dst~~~a~ 66 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNAS-------------TDYQTP-WQAVGARGL 66 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccc-------------cccccc-ccccHHHHH Confidence 2221111111122222222222221 11222221111111100 001111 122333333 Q ss_pred HHHHHHHHHHHHHHHHhhccccceeeeeccccccC-----ChhHH----HHHHHHHHHHHhcCCCCCCCccCCHHHHHHH Q lcl|NC_020081. 95 IITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQEP-----NDHNK----KKIKEIENFIEKTGRIDNDFTRDNFRSFVKK 165 (552) Q Consensus 95 i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~~~-----~~~~~----~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~ 165 (552) . +.+..++....+ ..+ ++++.-.+... .+.+. +-...+++.++.-. ..-+++.-+.. T Consensus 67 ~-~Laa~l~~~ltP------~~~-WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l------~~snf~~~~~~ 132 (536) T protein:vir:21 67 N-NLASKLMLALFP------MQT-WMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYI------ESNSYRVTLFE 132 (536) T ss_pred H-HHHHHHHHhhcC------CCc-ccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHH------HhcCcHHHHHH Confidence 3 222222221111 112 34433222111 11111 11222233332211 22346666777 Q ss_pred HHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCccccccc----------------------------c--e Q lcl|NC_020081. 166 LVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAK----------------------------D--G 215 (552) Q Consensus 166 ~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~----------------------------~--~ 215 (552) ++.|+.++||+..++..+..+.+..+..+|-..+.+..|.+|++-... . . T Consensus 133 ~~~~L~~~G~a~ly~~e~~~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~ 212 (536) T protein:vir:21 133 ALKQLVVAGNVLLYLPEPEGSNYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETID 212 (536) T ss_pred HHHHHHhHCcEeEEEeeCCCCceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhccccccccccccee Confidence 888999999999999876655444444444456666666666531110 0 0 Q ss_pred eEEEEEc--CCceEEEEc-cc--------------ceeeecccccCCccCC-cccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 216 VRYVQVI--DDKVVAKFK-AK--------------EMAWEVSNPRTDLTVG-KYGYPELEIALNHLQYHDNTEVFNARFF 277 (552) Q Consensus 216 ~~y~~~~--~~~~~~~~~-~~--------------evi~~~~~~~~~~~~g-~~G~spl~~~~~~i~~~~~~~~~~~~~f 277 (552) ++..+.. ++.....+. .+ +.=|+ .+++...+| .||.||.+-+...+.......+...... T Consensus 213 v~~~v~~~~~~~~~~~~~e~~g~~v~~~~g~~~f~~~P~i--~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~ 290 (536) T protein:vir:21 213 VYTHIYLDEDSGEYLRYEEVEGMEVQGSDGTYPKEACPYI--PIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMS 290 (536) T ss_pred EEEEEEEecCCCcEEEEeccCCeeeccccCccccccCCee--eeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 0001111 111110000 00 10011 123333333 7999999999988888888877777655 Q ss_pred hccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeec--cCCceeeeccCchhHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 278 AQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVIT--AEDVKFVNMTQSSKDMEFEKWLNYLINVICS 355 (552) Q Consensus 278 ~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~--~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 355 (552) .-...|..++. +.+ ..... .+ ..+..+. ++. .+++...++...++-.-..+..+..+..|.. T Consensus 291 ~~a~~~~~lv~-p~g-~~~~~-------~~---~~~~~g~----~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~ 354 (536) T protein:vir:21 291 MISSKVIGLVN-PAG-ITQPR-------RL---TKAQTGD----FVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSF 354 (536) T ss_pred HHHhcCCcccC-ccc-ccchh-------hh---ccCCCcc----eecCCcccceeeeccccccchHHHHHHHHHHHHHHH Confidence 55555554443 221 11111 11 1121111 221 2345555655444433344666788899999 Q ss_pred HhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHh-------------hcCccccc-cee Q lcl|NC_020081. 356 IYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNK-------------YIVSQFGG-DYV 421 (552) Q Consensus 356 ~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~-------------~L~~~~~~-~~~ 421 (552) +|-+.. +...+. ...+ +.--..+..-....|.|.+.+++.+|=. .++++... .+. T Consensus 355 af~~~~--l~~~~~--------~r~T-AtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~ 423 (536) T protein:vir:21 355 AFMLNS--AVQRTG--------ERVT-AEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVE 423 (536) T ss_pred HHhhhh--cccCCC--------CCcc-HHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhcc Confidence 995442 111111 0111 1111122222334455555555444432 23332221 223 Q ss_pred eccccc-ChHHHHHHHH-------HHHHHhc---C-CcCHHHH----HHHhCCCCCCCCCeeeccccccchh-----hhc Q lcl|NC_020081. 422 FNFVGG-DAKTEAEIIS-------ILESKAK---I-GLTINDI----RKELGYPDTEGGDVTLAGVHVQRLG-----QIM 480 (552) Q Consensus 422 ~~f~~~-d~~~~~~~~~-------~~~~~~~---g-~lT~NE~----R~~~gl~p~~ggD~~~~~~n~~~~~-----~~~ 480 (552) .++..+ ....+....+ .+....- . .+..+++ -+.+|.+|.. .+..+--.+.+- ++. T Consensus 424 ~~~vs~l~~l~r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~---~irt~eev~~~r~q~~~~~~ 500 (536) T protein:vir:21 424 PTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSG---ILLTEEQKQQKMAQQSMQMG 500 (536) T ss_pred ceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCChhh---hcCCHHHHHHHHHHHHHHHH Confidence 333211 1222222211 1111100 1 1222222 2234553321 111100000000 000 Q ss_pred ccccccccc---------CCCCCccCcccCCCCCCC Q lcl|NC_020081. 481 QQEQVEYQR---------QMDANQFLAQQTGYDGNM 507 (552) Q Consensus 481 ~~~~~~~~~---------~~~~~~~~~~~~~~~~~~ 507 (552) ..+.+.+.. .+..-+...++.|..+.- T Consensus 501 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:21 501 MDNGAAALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) T ss_pred HHHHHHHHHHHHHHHHhcChhhHHhhhhccccCCCC Confidence 000000000 000000000011110000 No 248 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=23.03 E-value=2.4 Score=18.54 Aligned_cols=417 Identities=12% Similarity=0.050 Sum_probs=148.5 Q ss_pred ccccccccchhhhhcccccccccccc---------ccccccccccccCCcccccccCCCCchHHHHHHHhhcchHHHHHH Q lcl|NC_020081. 24 MAVRIKQIEEDAILKKGKNTKSNKPK---------AYEEPIIGSMSMNPDFKEAPSIHGKQNLLQMLKLWSRKNIILNAI 94 (552) Q Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Lr~~a~~~~i~~a~ 94 (552) |+ |..-.+.+.+.+-+..-++.++. .+..|..+..... . .... .. ..++...++ T Consensus 1 ~~-~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~-----~---~~~~-------~~-~dstg~~a~ 63 (517) T protein:vir:10 1 MD-MRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVND-----D---LSSQ-------NA-WQDDGASAT 63 (517) T ss_pred Cc-ccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCC-----C---cccc-------cc-ccchHHHHH Confidence 22 22222233333222222322221 1222221110000 0 0000 01 122222333 Q ss_pred HHHHHHHHHHHHHHHHhhccccceeeeeccccc-----cCChh----HHHHHHHHHHHHHhcCCCCCCCccCCHHHHHHH Q lcl|NC_020081. 95 IITRVNQVSMFCTPARNSDKGVGYEIRLKDPLQ-----EPNDH----NKKKIKEIENFIEKTGRIDNDFTRDNFRSFVKK 165 (552) Q Consensus 95 i~~~~~~~~~~~~~~~~~~~~~~~~i~~k~~~~-----~~~~~----~~~~~~~l~~~l~~~n~~~~pn~~~t~~~f~~~ 165 (552) ++.+..++.... -.+.+ ++++.-.+. ..... -+.....+++.+..-. ..-+++.-+.. T Consensus 64 -~~LAa~l~~~lt-----pp~~~-WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l------~~snf~~~~~~ 130 (517) T protein:vir:10 64 -NFLSNKLSQVLF-----PAQRS-FFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYG------ESLQFRPAVVE 130 (517) T ss_pred -HHHHHHHHHhhc-----CCCCc-cccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHH------HhcCcHHHHHH Confidence 222222221111 11112 233221110 11111 1122233333332211 23356677778 Q ss_pred HHHHHHhcCCeeEEEEECCCCCEEEEEEecCceeEEEECCCccccccc-------------------------------- Q lcl|NC_020081. 166 LVRDRLTYDKINFELVYDKLGDLHNFKAVDASTVYVAVDEDGKERKAK-------------------------------- 213 (552) Q Consensus 166 ~v~d~ll~Gna~~~i~r~~~G~~~~L~~l~p~~v~v~~~~~g~~~~~~-------------------------------- 213 (552) +..|+..+||+.+++. ..+.....||| ..+.+..|..|++.... T Consensus 131 ~~~~L~~~G~a~ly~~--~~~~~~~~~pl--~~y~v~~d~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 206 (517) T protein:vir:10 131 AFKHLIVTGNVMMYHP--DKTSPIQAVPL--HHYCVRRDNNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDN 206 (517) T ss_pred HHHHHHhHCeEEEEEe--CCCCcEEEEEc--CeEEEeeCCCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCc Confidence 8889999999987752 33344566776 34555566665431100 Q ss_pred ------------ceeEEEEEcCCceEE---EEcccceeeecccccCCccCC-cccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 214 ------------DGVRYVQVIDDKVVA---KFKAKEMAWEVSNPRTDLTVG-KYGYPELEIALNHLQYHDNTEVFNARFF 277 (552) Q Consensus 214 ------------~~~~y~~~~~~~~~~---~~~~~evi~~~~~~~~~~~~g-~~G~spl~~~~~~i~~~~~~~~~~~~~f 277 (552) ....+++..++.... .+.-++.=|+ -+++...+| .||.||.+-++..+.......+...... T Consensus 207 v~v~~~v~~~~~~~~~~~~~~d~~~~~~~s~y~~~e~P~~--~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~ 284 (517) T protein:vir:10 207 VKLYTHAKRTKDGKYLIRQSADDVPVGKESTVTEDKSPFL--ILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGM 284 (517) T ss_pred eEEEEEEEEeCCCceEEEEEeCceeeccccccccccCCee--eeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHH Confidence 000111111111100 0000111111 123333333 7999999999999999888888887777 Q ss_pred hccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecc--CCceeeeccCchhHHHHHHHHHHHHHHHHH Q lcl|NC_020081. 278 AQGGTTRGLLHIKTGQEQSNQALTSFRREWTSMFSGINGAWKIPVITA--EDVKFVNMTQSSKDMEFEKWLNYLINVICS 355 (552) Q Consensus 278 ~ng~~p~gil~~~~~~~~s~~~~~~~~~~~~~~~~G~~nagk~~il~~--~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 355 (552) .-...|..++. .+...... .+ ..|..+. ++.+ +++...++...++=....+..+..+..|.. T Consensus 285 ~~a~~~~~lv~--~~~~~~~~-------~l---~~~~~g~----~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~ 348 (517) T protein:vir:10 285 ALMADVKYLVK--PGSYTDIN-------QF---VEGGSGA----VLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGR 348 (517) T ss_pred HHhccCCcccC--cccccchh-------hc---cCCCccc----cccCCcccceeeecccccchhHHHHHHHHHHHHHHH Confidence 77777765543 22211211 11 1221111 2211 334444444333322234666788899999 Q ss_pred HhcCCHHHhcccccccccccccccccchhHHHHHHHHHHHHhhHHHHHHHHHHHhhcC-------ccc--ccceeecccc Q lcl|NC_020081. 356 IYSIDPSEINFPNRGGATGHSGNTLNEGSSAEKYRNSKDKGLEPLLKFIEDAVNKYIV-------SQF--GGDYVFNFVG 426 (552) Q Consensus 356 ~fgVPp~~lg~~~~~t~~~~~~~~~~~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~-------~~~--~~~~~~~f~~ 426 (552) +|-+.....-..+ ..+ +.=-..+..-....|.|.+.++.++|=.-|+ ... ...+..++.. T Consensus 349 af~~~~l~~~~~~----------rvT-AtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l~~~~v~~~~~s 417 (517) T protein:vir:10 349 VFMMEAMTRRDAE----------RVT-AYEIQRDAMLVEQSLGGVYSLFATTFQGPLARWFMNGISSILTSKNVSPTILT 417 (517) T ss_pred HHhhhhhhccCCc----------ccc-HHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhcCCCCccceeec Confidence 9965542221111 111 1112223333445677777776666421111 100 1223333322 Q ss_pred c-ChHHHHHHHHHHHHH---hcC----------CcCH----HHHHHHhCCCCCCCCCeeeccccccchhh-----hcccc Q lcl|NC_020081. 427 G-DAKTEAEIISILESK---AKI----------GLTI----NDIRKELGYPDTEGGDVTLAGVHVQRLGQ-----IMQQE 483 (552) Q Consensus 427 ~-d~~~~~~~~~~~~~~---~~g----------~lT~----NE~R~~~gl~p~~ggD~~~~~~n~~~~~~-----~~~~~ 483 (552) + ....+....+.+... ... .+.. +++.+.+|.|+- .+...--...... ..... T Consensus 418 ~la~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~~----~irs~~ev~~~~~~~~~~~~~~~ 493 (517) T protein:vir:10 418 GIEALGRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQISANFP----FFKTQDELNAEAQAQQEQEATKY 493 (517) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCChh----hcCCHHHHHHHHHHHHHHHHHHH Confidence 2 112222111111100 000 1122 222334455431 1111100000000 00000 Q ss_pred ccccccCCCCCccCcccCCCCCCCCCCCCCC Q lcl|NC_020081. 484 QVEYQRQMDANQFLAQQTGYDGNMDNVNGKD 514 (552) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (552) .+++..+.-...... +..+++|.. T Consensus 494 ~~~~ag~~~~~~~~~-------~~~~~~~~~ 517 (517) T protein:vir:10 494 AAEQAGKAIPDMVKN-------GQINPQGGQ 517 (517) T ss_pred HHHHHHHHHHHHHhC-------CCCCCCCCC Confidence 000000000000000 001111110 Done!