Query lcl|NC_019527.1_cdsid_YP_007007739.1 [gene=F394_gp50] [protein=putative portal protein] [protein_id=YP_007007739.1] [location=21374..22924] Match_columns 516 No_of_seqs 156 out of 254 Neff 7.4 Searched_HMMs 1612 Date Thu Nov 7 18:09:27 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_50 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_50_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:106716 Length: 698 100.0 1E-139 8E-143 782.2 40.0 505 4-516 1-551 (698) 2 protein:vir:101541 Length: 694 100.0 1E-139 6E-143 782.7 37.7 505 4-516 1-556 (694) 3 protein:vir:78589 Length: 695 100.0 5E-139 3E-142 778.7 40.8 505 4-516 1-557 (695) 4 protein:vir:3648 Length: 695 # 100.0 6E-139 4E-142 778.2 39.6 510 4-516 1-557 (695) 5 protein:vir:107742 Length: 537 100.0 6E-137 4E-140 767.3 46.8 505 1-516 1-533 (537) 6 protein:vir:96068 Length: 765 100.0 3E-136 2E-139 763.6 43.7 494 1-516 5-555 (765) 7 protein:vir:94049 Length: 532 100.0 9E-133 6E-136 744.4 45.6 489 22-516 1-508 (532) 8 protein:vir:99563 Length: 862 100.0 1E-132 7E-136 744.1 45.8 503 1-516 1-553 (862) 9 protein:vir:5249 Length: 437 # 100.0 5E-119 3E-122 669.0 42.5 428 70-516 1-435 (437) 10 protein:vir:107662 Length: 427 100.0 6E-119 4E-122 668.8 40.3 417 59-512 1-427 (427) 11 protein:vir:104338 Length: 422 100.0 2E-117 1E-120 660.6 40.1 412 61-514 1-422 (422) 12 protein:vir:79647 Length: 435 100.0 9E-116 6E-119 651.2 41.3 423 60-515 1-435 (435) 13 protein:vir:80040 Length: 461 100.0 3E-106 2E-109 599.2 40.6 428 39-504 1-461 (461) 14 protein:vir:105782 Length: 449 100.0 5E-98 3E-101 554.0 36.0 423 42-504 1-449 (449) 15 protein:vir:103219 Length: 201 100.0 3.8E-58 2.4E-61 335.4 19.9 200 299-509 1-201 (201) 16 protein:vir:79772 Length: 648 100.0 1.9E-28 1.2E-31 172.5 35.3 461 4-516 1-508 (648) 17 protein:vir:7853 Length: 518 # 99.9 2.6E-26 1.6E-29 160.8 30.7 401 61-516 1-443 (518) 18 protein:vir:102118 Length: 409 99.9 4.9E-26 3E-29 159.4 29.4 392 42-515 1-409 (409) 19 protein:vir:105002 Length: 432 99.9 3.7E-25 2.3E-28 154.6 33.7 410 36-516 1-429 (432) 20 protein:vir:107605 Length: 432 99.9 3.7E-25 2.3E-28 154.6 33.7 410 36-516 1-429 (432) 21 protein:vir:102855 Length: 432 99.9 3.7E-25 2.3E-28 154.6 33.7 410 36-516 1-429 (432) 22 protein:vir:8418 Length: 409 # 99.9 2E-25 1.3E-28 156.0 30.6 395 42-515 1-409 (409) 23 protein:vir:101648 Length: 518 99.9 7.4E-25 4.6E-28 152.9 32.2 403 61-516 1-443 (518) 24 protein:vir:95378 Length: 406 99.9 2.7E-25 1.7E-28 155.3 29.4 400 42-515 1-406 (406) 25 protein:vir:1380 Length: 422 # 99.9 1.1E-24 6.8E-28 151.9 31.9 403 36-514 1-422 (422) 26 protein:vir:102080 Length: 429 99.9 2.9E-24 1.8E-27 149.7 34.0 408 42-516 1-426 (429) 27 protein:vir:80644 Length: 551 99.9 1.2E-23 7.2E-27 146.3 36.2 453 1-516 5-518 (551) 28 protein:vir:483 Length: 413 # 99.9 2.1E-24 1.3E-27 150.4 30.6 396 33-516 1-411 (413) 29 protein:vir:4454 Length: 414 # 99.9 4.8E-24 3E-27 148.4 30.8 396 32-516 1-413 (414) 30 protein:vir:63755 Length: 547 99.9 9.4E-23 5.8E-26 141.4 35.9 452 1-516 1-523 (547) 31 protein:vir:1266 Length: 416 # 99.9 3.9E-23 2.4E-26 143.4 31.7 399 33-515 1-416 (416) 32 protein:vir:93610 Length: 454 99.9 5.2E-23 3.2E-26 142.8 31.6 408 38-516 1-437 (454) 33 protein:vir:1326 Length: 457 # 99.9 6.2E-23 3.9E-26 142.3 31.9 414 36-516 1-457 (457) 34 protein:vir:5737 Length: 419 # 99.9 6E-23 3.7E-26 142.4 31.4 396 33-516 1-414 (419) 35 protein:vir:94666 Length: 723 99.9 4.4E-23 2.7E-26 143.1 30.5 393 73-516 1-445 (723) 36 protein:vir:6240 Length: 457 # 99.9 8.3E-23 5.1E-26 141.7 31.3 412 36-516 1-451 (457) 37 protein:vir:100691 Length: 535 99.9 1.7E-22 1.1E-25 139.9 32.7 444 32-516 1-528 (535) 38 protein:vir:10362 Length: 432 99.9 8.9E-23 5.5E-26 141.5 31.0 405 26-516 1-431 (432) 39 protein:vir:81072 Length: 432 99.9 8E-23 5E-26 141.7 30.7 406 26-516 1-431 (432) 40 protein:vir:960 Length: 413 # 99.9 3.6E-23 2.2E-26 143.6 28.4 392 59-515 1-413 (413) 41 protein:vir:4337 Length: 434 # 99.9 8.7E-23 5.4E-26 141.5 29.5 408 38-513 1-434 (434) 42 protein:vir:97060 Length: 432 99.9 1.6E-22 9.7E-26 140.1 30.9 406 26-516 1-431 (432) 43 protein:vir:8100 Length: 466 # 99.9 8.9E-23 5.5E-26 141.5 29.4 444 1-514 3-466 (466) 44 protein:vir:3153 Length: 467 # 99.9 1.9E-22 1.2E-25 139.7 31.0 370 104-516 1-463 (467) 45 protein:vir:100150 Length: 437 99.9 6.4E-23 4E-26 142.3 28.0 407 42-515 1-437 (437) 46 protein:vir:81152 Length: 411 99.9 5.3E-23 3.3E-26 142.7 27.4 395 32-515 1-411 (411) 47 protein:vir:93943 Length: 409 99.9 3.5E-22 2.2E-25 138.2 31.7 387 62-512 1-409 (409) 48 protein:vir:94426 Length: 409 99.9 5.7E-22 3.6E-25 137.0 32.0 387 62-512 1-409 (409) 49 protein:vir:1431 Length: 419 # 99.9 7.4E-22 4.6E-25 136.4 31.9 395 51-516 1-413 (419) 50 protein:vir:80796 Length: 574 99.9 1.3E-21 8E-25 135.1 33.0 459 1-516 1-521 (574) 51 protein:vir:80333 Length: 419 99.9 5.8E-22 3.6E-25 137.0 31.1 398 52-516 1-413 (419) 52 protein:vir:80134 Length: 403 99.9 4.1E-22 2.5E-25 137.8 29.9 393 32-515 1-403 (403) 53 protein:vir:96980 Length: 409 99.9 6.8E-22 4.2E-25 136.6 31.1 389 33-512 1-409 (409) 54 protein:vir:4598 Length: 416 # 99.9 3.1E-22 1.9E-25 138.5 29.1 390 60-516 1-415 (416) 55 protein:vir:81095 Length: 416 99.9 3.1E-22 1.9E-25 138.5 29.1 390 60-516 1-415 (416) 56 protein:vir:81218 Length: 423 99.9 4.1E-22 2.5E-25 137.8 29.6 397 36-516 1-423 (423) 57 protein:vir:2683 Length: 412 # 99.9 1.1E-21 6.9E-25 135.5 31.7 392 42-512 1-412 (412) 58 protein:vir:95599 Length: 563 99.9 3.5E-21 2.2E-24 132.7 34.0 463 1-516 1-527 (563) 59 protein:vir:99312 Length: 563 99.9 3.5E-21 2.2E-24 132.7 34.0 463 1-516 1-527 (563) 60 protein:vir:3843 Length: 397 # 99.9 3.2E-22 2E-25 138.4 28.2 384 32-515 1-397 (397) 61 protein:vir:96579 Length: 576 99.9 5.6E-21 3.5E-24 131.6 34.4 463 1-516 1-529 (576) 62 protein:vir:105064 Length: 421 99.9 2E-21 1.2E-24 134.1 31.0 398 52-516 1-421 (421) 63 protein:vir:100249 Length: 431 99.9 1.8E-21 1.1E-24 134.3 30.1 410 32-516 1-431 (431) 64 protein:vir:189 Length: 424 # 99.9 1.8E-21 1.1E-24 134.4 29.6 402 33-516 1-424 (424) 65 protein:vir:102727 Length: 945 99.9 1.9E-21 1.2E-24 134.2 29.7 411 1-516 77-538 (945) 66 protein:vir:100882 Length: 383 99.9 4.3E-22 2.7E-25 137.7 26.1 369 42-508 1-383 (383) 67 protein:vir:8317 Length: 409 # 99.9 1.3E-21 7.8E-25 135.2 27.9 387 32-494 1-409 (409) 68 protein:vir:3868 Length: 417 # 99.9 3E-21 1.9E-24 133.1 29.6 391 32-516 1-411 (417) 69 protein:vir:1884 Length: 424 # 99.8 5E-21 3.1E-24 131.9 30.3 403 26-516 1-424 (424) 70 protein:vir:79984 Length: 441 99.8 3.4E-21 2.1E-24 132.8 29.3 407 51-516 1-440 (441) 71 protein:vir:9408 Length: 441 # 99.8 3.4E-21 2.1E-24 132.8 29.3 407 51-516 1-440 (441) 72 protein:vir:4509 Length: 424 # 99.8 1E-20 6.3E-24 130.2 31.6 404 31-515 1-424 (424) 73 protein:vir:100187 Length: 385 99.8 1.4E-21 8.6E-25 134.9 26.8 372 32-510 1-385 (385) 74 protein:vir:9702 Length: 406 # 99.8 4.9E-21 3.1E-24 131.9 28.2 389 50-514 1-406 (406) 75 protein:vir:98396 Length: 441 99.8 1.1E-20 6.9E-24 130.0 29.7 406 51-516 1-440 (441) 76 protein:vir:9507 Length: 395 # 99.8 1.2E-20 7.4E-24 129.8 28.6 377 74-512 1-395 (395) 77 protein:vir:100650 Length: 395 99.8 1.2E-20 7.4E-24 129.8 28.6 377 74-512 1-395 (395) 78 protein:vir:101289 Length: 395 99.8 1.2E-20 7.4E-24 129.8 28.6 377 74-512 1-395 (395) 79 protein:vir:104259 Length: 403 99.8 1.5E-20 9.2E-24 129.3 28.6 378 48-513 1-403 (403) 80 protein:vir:6210 Length: 394 # 99.8 3.8E-20 2.3E-23 127.1 29.5 381 42-514 1-394 (394) 81 protein:vir:101647 Length: 460 99.8 1.1E-19 6.6E-23 124.6 31.2 411 38-510 1-460 (460) 82 protein:vir:9359 Length: 348 # 99.8 2.8E-20 1.7E-23 127.8 27.6 329 125-512 1-348 (348) 83 protein:vir:4194 Length: 540 # 99.8 3.6E-19 2.2E-22 121.7 33.3 409 43-516 1-460 (540) 84 protein:vir:4952 Length: 386 # 99.8 3E-20 1.9E-23 127.6 27.4 375 32-511 1-386 (386) 85 protein:vir:7407 Length: 392 # 99.8 2E-19 1.2E-22 123.1 31.0 376 34-511 1-392 (392) 86 protein:vir:4156 Length: 542 # 99.8 1E-18 6.2E-22 119.3 32.9 405 43-516 1-463 (542) 87 protein:vir:1023 Length: 392 # 99.8 1.1E-18 7E-22 119.0 29.7 377 34-511 1-392 (392) 88 protein:vir:3989 Length: 392 # 99.8 1.1E-18 7E-22 119.0 29.7 377 34-511 1-392 (392) 89 protein:vir:94002 Length: 378 99.8 2.9E-19 1.8E-22 122.2 26.3 348 74-514 1-378 (378) 90 protein:vir:1661 Length: 378 # 99.8 2.1E-19 1.3E-22 123.0 25.3 350 74-516 1-376 (378) 91 protein:vir:4854 Length: 386 # 99.8 9E-19 5.6E-22 119.5 27.9 373 32-511 1-386 (386) 92 protein:vir:4828 Length: 382 # 99.8 1.4E-18 8.5E-22 118.5 28.1 369 42-511 1-382 (382) 93 protein:vir:95965 Length: 385 99.8 2.1E-18 1.3E-21 117.5 27.5 367 74-510 1-385 (385) 94 protein:vir:93867 Length: 378 99.7 5.9E-19 3.6E-22 120.5 23.0 350 74-516 1-376 (378) 95 protein:vir:99452 Length: 651 99.7 2.9E-17 1.8E-20 111.2 32.0 435 1-516 1-542 (651) 96 protein:vir:9641 Length: 395 # 99.7 1.6E-17 1E-20 112.6 29.1 373 74-516 1-395 (395) 97 protein:vir:4995 Length: 384 # 99.7 4.4E-18 2.7E-21 115.7 23.6 366 42-511 1-384 (384) 98 protein:vir:78310 Length: 376 99.7 2.9E-17 1.8E-20 111.3 26.9 362 74-515 1-376 (376) 99 protein:vir:94869 Length: 378 99.7 1.3E-17 8.1E-21 113.1 23.9 352 74-514 1-378 (378) 100 protein:vir:4089 Length: 395 # 99.7 2.1E-16 1.3E-19 106.6 28.8 377 60-516 1-395 (395) 101 protein:vir:1082 Length: 359 # 99.7 2.8E-17 1.7E-20 111.3 24.0 347 42-478 1-359 (359) 102 protein:vir:7987 Length: 456 # 99.7 4.6E-17 2.8E-20 110.2 22.8 407 59-516 1-455 (456) 103 protein:vir:858 Length: 378 # 99.7 1.5E-16 9.4E-20 107.3 25.4 346 74-513 1-378 (378) 104 protein:vir:98643 Length: 395 99.7 1.6E-15 1E-18 101.7 30.5 376 74-516 1-395 (395) 105 protein:vir:105819 Length: 456 99.6 1.6E-16 9.8E-20 107.2 24.0 406 59-516 1-455 (456) 106 protein:vir:102602 Length: 456 99.6 1.6E-16 9.8E-20 107.2 24.0 406 59-516 1-455 (456) 107 protein:vir:98444 Length: 434 99.6 8.8E-15 5.4E-18 97.6 27.4 374 94-515 1-434 (434) 108 protein:vir:97447 Length: 474 99.5 1.8E-14 1.1E-17 95.9 25.1 434 1-516 1-473 (474) 109 protein:vir:94498 Length: 474 99.5 1.8E-14 1.1E-17 95.9 25.1 434 1-516 1-473 (474) 110 protein:vir:107880 Length: 491 99.5 1.5E-12 9.3E-16 85.4 33.1 408 13-516 1-419 (491) 111 protein:vir:99072 Length: 479 99.5 1E-13 6.2E-17 91.9 26.2 413 1-516 1-477 (479) 112 protein:vir:96738 Length: 505 99.5 1.3E-13 7.9E-17 91.3 25.2 439 13-514 1-505 (505) 113 protein:vir:10321 Length: 495 99.5 1.1E-13 6.9E-17 91.6 24.8 425 42-514 1-495 (495) 114 protein:vir:95113 Length: 474 99.5 2E-13 1.3E-16 90.2 25.7 429 1-516 1-473 (474) 115 protein:vir:3420 Length: 533 # 99.5 1.4E-13 8.7E-17 91.0 23.8 436 57-516 1-531 (533) 116 protein:vir:389 Length: 530 # 99.4 2.9E-13 1.8E-16 89.3 25.3 433 43-516 1-530 (530) 117 protein:vir:105889 Length: 474 99.4 6.9E-13 4.3E-16 87.2 27.3 415 36-516 1-473 (474) 118 protein:vir:94101 Length: 474 99.4 6.9E-13 4.3E-16 87.2 27.3 415 36-516 1-473 (474) 119 protein:vir:79538 Length: 502 99.4 1.1E-13 6.8E-17 91.6 22.8 429 32-514 1-502 (502) 120 protein:vir:5839 Length: 533 # 99.4 5E-13 3.1E-16 88.0 26.0 434 1-516 1-522 (533) 121 protein:vir:95806 Length: 440 99.4 1.3E-12 8.4E-16 85.7 27.8 399 58-514 1-440 (440) 122 protein:vir:79703 Length: 505 99.4 4.1E-13 2.6E-16 88.5 24.8 439 1-513 3-505 (505) 123 protein:vir:99916 Length: 504 99.4 7.6E-12 4.7E-15 81.5 31.6 450 42-515 1-504 (504) 124 protein:vir:105292 Length: 478 99.4 1.2E-12 7.2E-16 86.0 26.5 404 59-516 1-475 (478) 125 protein:vir:93747 Length: 472 99.4 8.9E-13 5.5E-16 86.6 25.2 420 34-516 1-471 (472) 126 protein:vir:102950 Length: 471 99.4 2.3E-12 1.4E-15 84.4 26.7 387 73-514 1-471 (471) 127 protein:vir:96179 Length: 468 99.4 8.1E-12 5E-15 81.4 29.6 409 49-512 1-468 (468) 128 protein:vir:78641 Length: 278 99.4 6.5E-13 4E-16 87.4 23.6 267 125-438 1-278 (278) 129 protein:vir:99522 Length: 470 99.4 4.2E-12 2.6E-15 83.0 28.0 419 39-514 1-470 (470) 130 protein:vir:79063 Length: 491 99.4 4.9E-11 3E-14 77.1 32.7 405 13-516 1-419 (491) 131 protein:vir:2500 Length: 501 # 99.4 1.1E-11 6.5E-15 80.8 29.0 428 43-516 1-501 (501) 132 protein:vir:107112 Length: 478 99.4 1.2E-11 7.5E-15 80.4 28.7 406 48-514 1-478 (478) 133 protein:vir:96839 Length: 474 99.3 3.2E-11 2E-14 78.1 29.8 410 48-516 1-473 (474) 134 protein:vir:6382 Length: 553 # 99.3 2.6E-12 1.6E-15 84.1 23.8 444 42-513 1-553 (553) 135 protein:vir:78227 Length: 480 99.3 6.7E-12 4.2E-15 81.8 25.6 406 33-516 1-477 (480) 136 protein:vir:5961 Length: 503 # 99.3 3.2E-11 2E-14 78.1 29.2 404 59-516 1-501 (503) 137 protein:vir:80959 Length: 499 99.3 5.3E-12 3.3E-15 82.4 24.3 432 1-515 1-499 (499) 138 protein:vir:79043 Length: 479 99.3 2.2E-11 1.4E-14 79.0 27.5 408 52-516 1-479 (479) 139 protein:vir:9751 Length: 422 # 99.3 9.9E-12 6.2E-15 80.9 25.4 379 34-499 1-422 (422) 140 protein:vir:96266 Length: 474 99.3 1.4E-11 8.8E-15 80.0 26.2 408 42-516 1-471 (474) 141 protein:vir:95899 Length: 474 99.3 1.4E-11 8.8E-15 80.0 26.2 408 42-516 1-471 (474) 142 protein:vir:106571 Length: 499 99.3 4E-11 2.5E-14 77.6 28.0 433 30-516 1-488 (499) 143 protein:vir:94805 Length: 492 99.3 3.4E-11 2.1E-14 78.0 27.5 427 33-516 1-490 (492) 144 protein:vir:1236 Length: 483 # 99.3 4.4E-11 2.7E-14 77.4 27.1 414 42-516 1-482 (483) 145 protein:vir:7768 Length: 484 # 99.3 1.4E-10 8.7E-14 74.6 29.7 419 13-515 1-484 (484) 146 protein:vir:1587 Length: 508 # 99.3 4E-11 2.5E-14 77.6 26.5 432 1-516 3-508 (508) 147 protein:vir:3609 Length: 452 # 99.3 5E-11 3.1E-14 77.0 27.1 401 51-514 1-452 (452) 148 protein:vir:9815 Length: 500 # 99.3 7.5E-12 4.7E-15 81.6 22.4 431 1-514 3-500 (500) 149 protein:vir:3028 Length: 500 # 99.3 7.5E-12 4.7E-15 81.6 22.4 431 1-514 3-500 (500) 150 protein:vir:106639 Length: 481 99.3 4.3E-11 2.7E-14 77.4 26.1 412 50-515 1-481 (481) 151 protein:vir:104500 Length: 537 99.3 5.8E-11 3.6E-14 76.7 26.8 435 34-515 1-537 (537) 152 protein:vir:108215 Length: 469 99.2 3.3E-10 2E-13 72.6 38.1 415 50-516 1-453 (469) 153 protein:vir:95542 Length: 548 99.2 1.9E-12 1.2E-15 84.8 18.3 434 32-516 1-536 (548) 154 protein:vir:104082 Length: 485 99.2 1.1E-10 6.9E-14 75.1 27.9 419 21-516 1-485 (485) 155 protein:vir:38 Length: 496 # N 99.2 1.5E-11 9.4E-15 79.9 23.0 433 34-515 1-496 (496) 156 protein:vir:78537 Length: 480 99.2 5.2E-11 3.3E-14 76.9 25.7 409 32-516 1-472 (480) 157 protein:vir:9871 Length: 429 # 99.2 7.3E-11 4.5E-14 76.2 26.4 393 63-514 1-429 (429) 158 protein:vir:96494 Length: 501 99.2 4E-10 2.5E-13 72.1 30.1 430 34-515 1-501 (501) 159 protein:vir:97171 Length: 512 99.2 1.2E-10 7.3E-14 75.0 27.0 435 38-515 1-512 (512) 160 protein:vir:78907 Length: 518 99.2 1.6E-10 1E-13 74.2 27.6 440 32-516 1-516 (518) 161 protein:vir:3964 Length: 453 # 99.2 3.2E-11 2E-14 78.1 23.5 406 6-514 1-453 (453) 162 protein:vir:8184 Length: 474 # 99.2 3.2E-10 2E-13 72.6 28.9 424 50-516 1-472 (474) 163 protein:vir:6596 Length: 521 # 99.2 1.9E-10 1.2E-13 73.9 27.2 450 1-516 1-521 (521) 164 protein:vir:97336 Length: 492 99.2 1.5E-10 9E-14 74.5 26.4 428 33-516 1-491 (492) 165 protein:vir:102330 Length: 451 99.2 1.4E-10 8.8E-14 74.6 26.3 394 63-506 1-451 (451) 166 protein:vir:78083 Length: 537 99.2 5E-10 3.1E-13 71.6 29.2 412 57-516 1-532 (537) 167 protein:vir:2732 Length: 501 # 99.2 1.3E-10 7.8E-14 74.8 25.3 444 28-514 1-501 (501) 168 protein:vir:79150 Length: 368 99.2 9.3E-12 5.8E-15 81.1 18.9 336 42-451 1-368 (368) 169 protein:vir:9306 Length: 511 # 99.2 2.1E-10 1.3E-13 73.7 26.2 442 29-515 1-511 (511) 170 protein:vir:4898 Length: 502 # 99.2 1.3E-10 8.2E-14 74.7 24.8 409 59-516 1-498 (502) 171 protein:vir:81017 Length: 521 99.2 3.9E-10 2.4E-13 72.2 27.3 450 1-516 1-521 (521) 172 protein:vir:2427 Length: 485 # 99.2 6.5E-10 4E-13 70.9 28.0 422 21-516 1-482 (485) 173 protein:vir:94742 Length: 409 99.2 5.6E-10 3.4E-13 71.3 27.6 368 34-484 1-409 (409) 174 protein:vir:4223 Length: 486 # 99.2 1.1E-09 6.7E-13 69.8 29.1 419 21-515 1-486 (486) 175 protein:vir:100598 Length: 516 99.1 1E-09 6.4E-13 69.9 28.5 439 1-513 6-516 (516) 176 protein:vir:1634 Length: 409 # 99.1 6.7E-10 4.2E-13 70.9 27.1 360 34-484 1-409 (409) 177 protein:vir:99781 Length: 511 99.1 5.3E-10 3.3E-13 71.4 26.5 438 29-516 1-506 (511) 178 protein:vir:103860 Length: 528 99.1 1.6E-09 1E-12 68.8 34.9 420 1-516 1-445 (528) 179 protein:vir:267 Length: 348 # 99.1 3E-10 1.8E-13 72.8 24.8 328 34-446 1-348 (348) 180 protein:vir:98883 Length: 517 99.1 1.8E-10 1.1E-13 74.0 23.6 450 32-515 1-517 (517) 181 protein:vir:9568 Length: 410 # 99.1 1.1E-09 6.7E-13 69.8 27.7 363 73-500 1-410 (410) 182 protein:vir:96240 Length: 511 99.1 7.6E-10 4.7E-13 70.6 26.7 442 29-515 1-511 (511) 183 protein:vir:105461 Length: 470 99.1 3.9E-10 2.4E-13 72.2 24.8 398 62-514 1-470 (470) 184 protein:vir:106282 Length: 521 99.1 7.9E-10 4.9E-13 70.5 26.4 438 1-514 8-521 (521) 185 protein:vir:78191 Length: 351 99.1 1.9E-10 1.2E-13 73.9 22.9 335 4-444 1-351 (351) 186 protein:vir:98265 Length: 524 99.1 2E-09 1.2E-12 68.3 28.0 441 1-514 12-524 (524) 187 protein:vir:103177 Length: 533 99.1 2.4E-09 1.5E-12 67.8 30.0 432 35-515 1-533 (533) 188 protein:vir:104892 Length: 558 99.1 1.1E-09 6.7E-13 69.7 26.5 433 35-516 1-541 (558) 189 protein:vir:103971 Length: 376 99.1 4.1E-10 2.5E-13 72.0 24.1 343 13-444 1-376 (376) 190 protein:vir:733 Length: 453 # 99.1 5.1E-10 3.1E-13 71.5 24.4 416 42-513 1-453 (453) 191 protein:vir:2341 Length: 488 # 99.1 1.3E-09 7.8E-13 69.4 26.0 415 63-516 1-486 (488) 192 protein:vir:4782 Length: 522 # 99.0 5E-10 3.1E-13 71.6 23.1 442 32-515 1-522 (522) 193 protein:vir:106999 Length: 564 99.0 7.7E-10 4.8E-13 70.6 24.0 436 35-516 1-562 (564) 194 protein:vir:101189 Length: 516 99.0 4.9E-09 3E-12 66.2 28.0 439 1-513 6-516 (516) 195 protein:vir:101806 Length: 516 99.0 4.9E-09 3E-12 66.2 28.0 439 1-513 6-516 (516) 196 protein:vir:79207 Length: 351 99.0 9.4E-10 5.8E-13 70.1 23.3 335 4-444 1-351 (351) 197 protein:vir:103951 Length: 511 99.0 6.2E-09 3.8E-12 65.6 28.8 436 29-515 1-511 (511) 198 protein:vir:3780 Length: 345 # 99.0 1.6E-09 9.8E-13 68.8 23.8 325 32-437 1-345 (345) 199 protein:vir:99232 Length: 526 99.0 7.2E-09 4.4E-12 65.2 34.9 419 1-516 1-448 (526) 200 protein:vir:7208 Length: 524 # 99.0 4.5E-09 2.8E-12 66.3 26.1 445 1-514 8-524 (524) 201 protein:vir:103458 Length: 524 99.0 4.7E-09 2.9E-12 66.2 26.2 445 1-516 8-524 (524) 202 protein:vir:3743 Length: 345 # 99.0 2.7E-09 1.7E-12 67.6 24.3 327 32-437 1-345 (345) 203 protein:vir:9922 Length: 489 # 99.0 1.9E-09 1.2E-12 68.4 23.5 427 13-516 1-488 (489) 204 protein:vir:80680 Length: 441 99.0 2.5E-09 1.5E-12 67.8 23.9 390 63-513 1-441 (441) 205 protein:vir:99853 Length: 488 99.0 1E-08 6.4E-12 64.4 34.4 398 25-516 1-410 (488) 206 protein:vir:98567 Length: 340 98.9 4.9E-09 3.1E-12 66.1 24.7 325 3-441 1-340 (340) 207 protein:vir:78805 Length: 511 98.9 4.6E-09 2.8E-12 66.3 24.5 442 1-515 1-511 (511) 208 protein:vir:96366 Length: 511 98.9 4.6E-09 2.8E-12 66.3 24.5 442 1-515 1-511 (511) 209 protein:vir:108049 Length: 524 98.9 8.3E-09 5.1E-12 64.9 25.5 442 1-514 10-524 (524) 210 protein:vir:2013 Length: 344 # 98.9 6.6E-09 4.1E-12 65.4 24.1 328 4-440 1-344 (344) 211 protein:vir:100328 Length: 346 98.9 4.2E-09 2.6E-12 66.5 22.8 331 4-440 1-346 (346) 212 protein:vir:6896 Length: 523 # 98.9 1.7E-08 1.1E-11 63.2 25.8 447 1-516 8-523 (523) 213 protein:vir:79233 Length: 526 98.9 2.5E-08 1.5E-11 62.3 34.6 419 21-516 1-448 (526) 214 protein:vir:6058 Length: 344 # 98.8 1.5E-08 9.5E-12 63.4 24.2 328 3-440 1-344 (344) 215 protein:vir:94546 Length: 506 98.8 3.4E-08 2.1E-11 61.5 26.7 420 49-516 1-503 (506) 216 protein:vir:5665 Length: 511 # 98.8 3.7E-08 2.3E-11 61.3 27.4 444 2-510 1-511 (511) 217 protein:vir:5691 Length: 344 # 98.8 4.8E-08 3E-11 60.7 24.8 327 4-441 1-344 (344) 218 protein:vir:4698 Length: 251 # 98.8 1.2E-09 7.2E-13 69.6 15.5 239 42-340 1-251 (251) 219 protein:vir:78749 Length: 337 98.7 5E-08 3.1E-11 60.6 24.0 320 6-438 1-337 (337) 220 protein:vir:1150 Length: 350 # 98.7 7.8E-08 4.8E-11 59.6 25.7 333 4-438 1-350 (350) 221 protein:vir:78161 Length: 355 98.5 3.2E-07 2E-10 56.2 27.6 305 175-516 1-337 (355) 222 protein:vir:95254 Length: 488 98.5 5.7E-07 3.5E-10 54.8 34.3 428 48-516 1-480 (488) 223 protein:vir:1986 Length: 512 # 98.4 8.4E-07 5.2E-10 53.9 33.8 413 42-516 1-440 (512) 224 protein:vir:98853 Length: 219 98.3 2.1E-07 1.3E-10 57.2 16.5 208 191-442 1-219 (219) 225 protein:vir:98816 Length: 446 98.3 2E-06 1.2E-09 51.9 32.5 396 40-490 1-446 (446) 226 protein:vir:101494 Length: 527 98.2 2.1E-06 1.3E-09 51.7 28.6 432 42-516 1-523 (527) 227 protein:vir:102239 Length: 527 98.2 2.2E-06 1.4E-09 51.6 28.7 432 42-516 1-523 (527) 228 protein:vir:101418 Length: 569 98.0 9.2E-06 5.7E-09 48.2 20.9 486 1-516 1-568 (569) 229 protein:vir:79511 Length: 448 97.9 9.6E-06 6E-09 48.1 37.1 417 42-516 1-441 (448) 230 protein:vir:77981 Length: 448 97.6 3.3E-05 2E-08 45.2 35.5 419 42-516 1-439 (448) 231 protein:vir:7430 Length: 563 # 97.4 8.5E-05 5.3E-08 42.9 28.4 439 42-516 1-537 (563) 232 protein:vir:97265 Length: 513 94.9 0.0032 2E-06 34.3 27.5 417 59-515 1-513 (513) 233 protein:vir:94956 Length: 452 94.1 0.0054 3.4E-06 33.0 28.0 395 59-516 1-450 (452) 234 protein:vir:95149 Length: 501 94.0 0.0056 3.5E-06 32.9 28.3 414 59-516 1-497 (501) 235 protein:vir:80453 Length: 535 90.4 0.021 1.3E-05 29.8 31.3 441 2-516 1-518 (535) 236 protein:vir:78393 Length: 489 66.4 0.27 0.00017 23.7 26.5 419 38-515 1-489 (489) 237 protein:vir:102426 Length: 631 48.3 0.67 0.00042 21.5 24.2 425 17-516 1-524 (631) 238 protein:vir:80211 Length: 514 39.4 1 0.00063 20.5 20.5 420 34-512 1-514 (514) 239 protein:vir:3361 Length: 535 # 28.3 1.8 0.0011 19.2 22.0 442 21-516 1-535 (535) 240 protein:vir:4073 Length: 279 # 27.5 1.8 0.0011 19.1 7.3 245 188-480 1-279 (279) 241 protein:vir:94709 Length: 522 23.5 2.3 0.0014 18.6 22.4 420 63-516 1-517 (522) 242 protein:vir:8654 Length: 629 # 20.2 2.8 0.0017 18.1 25.2 419 12-516 1-525 (629) No 1 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=100.00 E-value=1.2e-139 Score=782.16 Aligned_cols=505 Identities=35% Similarity=0.586 Sum_probs=403.3 Q ss_pred chhhhhhhhcccccccccC--CCc---------------CCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCC Q lcl|NC_019527. 4 FDRKKFKREVADKLADAAR--AEE---------------QEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAG 66 (516) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~--~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~ 66 (516) ..|++.|||.+-+---..+ ++. ++-|..+-.+.+ .+.+.. ++-+-|++. +-++.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~------~~~~~~~~~-~~~~~~ 72 (698) T protein:vir:10 1 MSRRNAKKRTQLAHTGRRPEVAKAAALAAAATIATATAAQPVPADMGRRGA-LNALDA------APVAEPSPS-LRLARQ 72 (698) T ss_pred CCccchhhhhhhhhcCCCcchhhhhhhhhhhhhhhhccccccchhhccccc-cccccc------ccccCCCcc-cccccc Confidence 5666666654432211111 110 111111111111 111110 111112211 111100 Q ss_pred c-----cchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccc-- Q lcl|NC_019527. 67 T-----TPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTK-- 139 (516) Q Consensus 67 ~-----~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~-- 139 (516) . ....+.++....++.+....+..+.|+..++|+|||+|++++|+|++|++|++++++|||+|+++.+..+++ T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~ 152 (698) T protein:vir:10 73 FEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKAD 152 (698) T ss_pred ceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcccceeccccchhhh Confidence 0 000111122222233334455556788999999999999999999999999999999999999986553332 Q ss_pred -----------hhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCC--cccCcccccccccccceeeE Q lcl|NC_019527. 140 -----------AKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGAD--VSVPLILDPRTIKKGSLTGF 206 (516) Q Consensus 140 -----------~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~--~~~Pl~ld~~~I~~g~l~~l 206 (516) ...+.+++++|++++++|+||++|+++++|+|||||++++|.|++++ +++||.+++.+|+||+|+|| T Consensus 153 ~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGL 232 (698) T protein:vir:10 153 TSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGL 232 (698) T ss_pred hhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEEeecCccccccccccccccccCccceee Confidence 22344789999999999999999999999999999999999998855 88999888889999999999 Q ss_pred EeecceeeccccccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 207 SNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQ 286 (516) Q Consensus 207 ~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~ 286 (516) +|+|||||+|..++..||++|+||+|++|+|.|++||+||+++|.++++|+++|++|+|||+|++|++|++|.+|++++. T Consensus 233 ~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G~~IH~SRL~~~vg~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~ 312 (698) T protein:vir:10 233 RVVEPYWVTPNNYNSINPVADDFYKPSTWWMIGSEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQ 312 (698) T ss_pred eeecccccccchhhhccchhhccCCCceEEEecceecceeEEEecCCCchhhhcchhccCCccHHHHHHHHHHHHHHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCHHHHHHHHHHHHHhh Q lcl|NC_019527. 287 SVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGLADLQSQSQEHMCSV 366 (516) Q Consensus 287 ~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl~d~~~~~~~~iaaa 366 (516) ++++|++++++.++++++++.|+++...++.+|++++++++||+|++++|+++|+|++++++||||++++++|+++||++ T Consensus 313 ~v~~Li~~~~~~~l~~dla~aL~~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~st~lSGLddVi~qf~q~VAga 392 (698) T protein:vir:10 313 SVSDIVKQFSVSGILMDLAQALTPGANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTPLSGLDALQAQAQEQMSAV 392 (698) T ss_pred hHHHHHHHhhHHHHHHHHHHhcCChhhHHHHHHHHHHHHhcCccceEEEecCCcceEEEecCcCCHHHHHHHHHHHHHhh Confidence 99999999999999999999999888778999999999999999999999777999999999999999999999999999 Q ss_pred hcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHH Q lcl|NC_019527. 367 SKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEI 446 (516) Q Consensus 367 s~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei 446 (516) ++||+|+||||||+|||||||+|++||||+|+++|++.|+|+|++|+++|++|+||.+|++|+|+|+|||+||++|+||| T Consensus 393 a~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp~i~~~fnPL~qmtd~EkAeI 472 (698) T protein:vir:10 393 SHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAEA 472 (698) T ss_pred hcCchhhhhccCCcccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCC-CChhhhccccccchhcC------CCCCCCCCCC--CCCCCCC Q lcl|NC_019527. 447 RFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDN-IDGDLEIVQPEMFDDDG------ADPYMPDPDV--LPGEEGS 516 (516) Q Consensus 447 ~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~-~d~~~e~~~~e~~~~e~------~~~~~~~~~~--~~~~e~t 516 (516) ++++|+++++|++.|+|+++|+|++|+.+++|+|.+ +|.+.+...+++.+.++ .+..+++.++ .|++..- T Consensus 473 ~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 551 (698) T protein:vir:10 473 RYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGAPADDDIDGVLTYVQRMAEGGDTGAPTAPGGARA 551 (698) T ss_pred HhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccccCCcccCCCCCCCcchHHHhhhcCCcCCCCcccccccccccC Confidence 999999999999999999999999999999999977 55433321222111111 1111111111 1111111 No 2 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=100.00 E-value=9.5e-140 Score=782.75 Aligned_cols=505 Identities=35% Similarity=0.568 Sum_probs=404.2 Q ss_pred chhhhhhhhccccc--ccccCCCc----------CCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchh Q lcl|NC_019527. 4 FDRKKFKREVADKL--ADAARAEE----------QEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAV 71 (516) Q Consensus 4 ~~~~~~~~~~~~~~--~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~ 71 (516) ..|++.|||.+-+- +-|+.++. +..+.++++..+...++.... .++-+-|+ |. ......+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~--~~---~~~~~~~ 72 (694) T protein:vir:10 1 MSRRNAKKRTQLARTGRRPEVAKAAALAAAATIATAAAQPVPADFARRGALNALD---AAPVAEPS--PS---LRLARQF 72 (694) T ss_pred CCccchhhHHHHhhcCCCcchhhhhhhhhhhhhhhcCCCcccCCccccccchhhc---ccccCCCC--cc---hhhhhhc Confidence 55666665544322 11111111 122222222222222221111 11111111 10 0112224 Q ss_pred ccccccc---------chhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccc--- Q lcl|NC_019527. 72 AMDSLCG---------PTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTK--- 139 (516) Q Consensus 72 a~ds~~~---------~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~--- 139 (516) .+|.... ..+.+....+..+.|+..++|+|||+|++++|+|++|++|++++++|||+|+++.+..+++ T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~ 152 (694) T protein:vir:10 73 EVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADT 152 (694) T ss_pred cccccCCCccccchhhhhhccCcccccchhhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcccceeccccchhhhh Confidence 4554432 1222333444455688899999999999999999999999999999999999986553332 Q ss_pred ----------hhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCC--cccCcccccccccccceeeEE Q lcl|NC_019527. 140 ----------AKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGAD--VSVPLILDPRTIKKGSLTGFS 207 (516) Q Consensus 140 ----------~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~--~~~Pl~ld~~~I~~g~l~~l~ 207 (516) ...+.+++++|++++++|+||++|+++++|+|||||++++|+|++++ +++||.+++.+|+||+||||+ T Consensus 153 ~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGl~ 232 (694) T protein:vir:10 153 SGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLR 232 (694) T ss_pred hcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeecCccccccccccccccccCcceeeeE Confidence 22345789999999999999999999999999999999999998855 889998888899999999999 Q ss_pred eecceeeccccccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 208 NIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQS 287 (516) Q Consensus 208 v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~ 287 (516) |+||||++|..++..||++|+||+|++|+|.|++||+||+++|.++++|+++|++|+|||+|++|.++++|.+|++++.+ T Consensus 233 ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G~~IH~SRL~~f~g~plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~ 312 (694) T protein:vir:10 233 VVEPYWVTPNNYNSINPVADDFYKPSTWWMIGTEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQS 312 (694) T ss_pred eecccccccchhhhccchhhccCCCceEEEeceEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCHHHHHHHHHHHHHhhh Q lcl|NC_019527. 288 VSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGLADLQSQSQEHMCSVS 367 (516) Q Consensus 288 ~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl~d~~~~~~~~iaaas 367 (516) +++|++++++.++++++++.|.++...++.+|++++++++||+|++++|+++|+|++++++||||++++++|+++||+++ T Consensus 313 v~~Li~~~~v~~lk~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~stslSGLddVi~qf~q~VAgaa 392 (694) T protein:vir:10 313 VSDIVKQFSVSGILMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTPLSGLDALQAQAQEQMSAVS 392 (694) T ss_pred HHHHHHhhhhHHHHHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEecCCcceEEEecccCCHHHHHHHHHHHHHhhh Confidence 99999999999999999999998888889999999999999999999997779999999999999999999999999999 Q ss_pred cCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHH Q lcl|NC_019527. 368 KIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIR 447 (516) Q Consensus 368 ~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~ 447 (516) +||+|+||||||+|||||||+|++||||+|+++|++.|+|+|++|+++|++|+||.+|++|+|+|+|||+||++|+|||+ T Consensus 393 ~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp~i~~~fnPL~qmtd~EkAeI~ 472 (694) T protein:vir:10 393 HIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAESR 472 (694) T ss_pred cCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCC-CChhhhccccccchhc------CCCCCCCC--------CCCCCC Q lcl|NC_019527. 448 FNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDN-IDGDLEIVQPEMFDDD------GADPYMPD--------PDVLPG 512 (516) Q Consensus 448 ~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~-~d~~~e~~~~e~~~~e------~~~~~~~~--------~~~~~~ 512 (516) +|+|+++++|++.|+|+++|+|++|+.+++++|.+ +|.+.+.-.+.+.+.. +....+++ ++.... T Consensus 473 ~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 552 (694) T protein:vir:10 473 YKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAP 552 (694) T ss_pred hhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCcccccccCC Confidence 99999999999999999999999999999999976 5533321111111000 00111111 111111 Q ss_pred CCCC Q lcl|NC_019527. 513 EEGS 516 (516) Q Consensus 513 ~e~t 516 (516) ...+ T Consensus 553 ~~v~ 556 (694) T protein:vir:10 553 PTVA 556 (694) T ss_pred Cccc Confidence 1111 No 3 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=100.00 E-value=5.2e-139 Score=778.71 Aligned_cols=505 Identities=35% Similarity=0.566 Sum_probs=401.5 Q ss_pred chhhhhhhhcccccccccC--CCc---------------CCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCC Q lcl|NC_019527. 4 FDRKKFKREVADKLADAAR--AEE---------------QEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAG 66 (516) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~--~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~ 66 (516) ..|++.|||.+-+---..+ ++. ++-|..+-.+.+ .+.+.. ++-+-|++. +-++.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~------~~~~~~~~~-~~~~~~ 72 (695) T protein:vir:78 1 MSRRNAKKRTQLAHTGRRPEVAKAAALAAAATIATATAAQPVPADMGRRGA-LNALDA------APVAEPSPS-LRLARQ 72 (695) T ss_pred CCccchhhhhhhhhcCCCcchhhhhhhhhhhhhhhhccccccchhhccccc-cccccc------ccccCCCcc-ccccee Confidence 5666666654432211111 110 111111111111 111110 111112211 111100 Q ss_pred c-----cchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccc-- Q lcl|NC_019527. 67 T-----TPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTK-- 139 (516) Q Consensus 67 ~-----~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~-- 139 (516) . ....+.++....++.+....+..+.|+..++|+|||+|++++|+|++|++|++++++|||+|+++.+..+++ T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~ 152 (695) T protein:vir:78 73 FEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKAD 152 (695) T ss_pred ceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcccceeccccchhhh Confidence 0 000111122222233334455556788999999999999999999999999999999999999986553332 Q ss_pred -----------hhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCC--cccCcccccccccccceeeE Q lcl|NC_019527. 140 -----------AKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGAD--VSVPLILDPRTIKKGSLTGF 206 (516) Q Consensus 140 -----------~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~--~~~Pl~ld~~~I~~g~l~~l 206 (516) ...+.+++++|++++++|+||++|+++++|+|||||++++|+|++++ +++||.+++.+|+||+|||| T Consensus 153 ~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl 232 (695) T protein:vir:78 153 TSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGL 232 (695) T ss_pred hhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCccccccccccccccccCcceeee Confidence 22345789999999999999999999999999999999999998855 88999888889999999999 Q ss_pred EeecceeeccccccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 207 SNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQ 286 (516) Q Consensus 207 ~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~ 286 (516) +|+||||++|..++..||++|+||+|++|+|.|++||+||+++|.++++|+++|++|+|||+|++|.++++|.+|++++. T Consensus 233 ~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~ 312 (695) T protein:vir:78 233 RVVEPYWVTPNNYNSINPVADDFYKPSTWWMIGTEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQ 312 (695) T ss_pred EeecccccccchhhhccchhhccCCCceEEEeceEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCHHHHHHHHHHHHHhh Q lcl|NC_019527. 287 SVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGLADLQSQSQEHMCSV 366 (516) Q Consensus 287 ~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl~d~~~~~~~~iaaa 366 (516) ++++|++++++.++++++++.|.++...++.+|++++++++||+|++++|+++|+|++++++||||++++++|+++||++ T Consensus 313 ~v~~Li~~~~v~~lk~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~stslSGLddVi~qf~q~VAga 392 (695) T protein:vir:78 313 SVSDIVKQFSVSGILMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTPLSGLDALQAQAQEQMSAV 392 (695) T ss_pred HHHHHHHhhhhHHHHHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEecCCcceEEEecccCCHHHHHHHHHHHHHhh Confidence 99999999999999999999999888888999999999999999999999777999999999999999999999999999 Q ss_pred hcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHH Q lcl|NC_019527. 367 SKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEI 446 (516) Q Consensus 367 s~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei 446 (516) ++||+|+||||||+|||||||+|++||||+|+++|++.|+|+|++|+++|++|+||.+|++|+|+|+|||+||++|+||| T Consensus 393 a~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idpdi~~~fnPL~qmtd~EkAeI 472 (695) T protein:vir:78 393 SHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAES 472 (695) T ss_pred hcCchhhhhccCCccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCC-CChhhhccccccchhc------CCCCCCCCC--------CCCC Q lcl|NC_019527. 447 RFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDN-IDGDLEIVQPEMFDDD------GADPYMPDP--------DVLP 511 (516) Q Consensus 447 ~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~-~d~~~e~~~~e~~~~e------~~~~~~~~~--------~~~~ 511 (516) ++|+|+++++|++.|+|+++|+|++|+.+++|+|.+ +|.+.+.-.+.+.+.. +....+++. +.+. T Consensus 473 ~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 552 (695) T protein:vir:78 473 RYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATA 552 (695) T ss_pred HhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCCCCCCCCC Confidence 999999999999999999999999999999999976 5533321111111000 011111111 1100 Q ss_pred CCCCC Q lcl|NC_019527. 512 GEEGS 516 (516) Q Consensus 512 ~~e~t 516 (516) ....+ T Consensus 553 ~~~~~ 557 (695) T protein:vir:78 553 PPTVA 557 (695) T ss_pred CCcee Confidence 00000 No 4 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=100.00 E-value=6.3e-139 Score=778.24 Aligned_cols=510 Identities=35% Similarity=0.560 Sum_probs=402.6 Q ss_pred chhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcC-------CCcc-----ccccCCCCCCCccCCCc---- Q lcl|NC_019527. 4 FDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRA-------SDAA-----TKWAPPQLMPGVVPAGT---- 67 (516) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~-----~~~~~~~~~~gv~~~~~---- 67 (516) ..|++.|||.+-+---. .|++ .+..+.+..+........++ ..+. ++-.-|++. +-++... T Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 77 (695) T protein:vir:36 1 MSRRNAKKRAQLARTGR-RPEV-AKAATLAAAATIATAAAAQPVPADFARRGALNALDAAPVVEPSPS-LRLARQFEVDV 77 (695) T ss_pred CCccchhhHHHHhhcCC-Ccch-hhhhhhhhhhhhhhhccccccchhhhhcccccccccccccCCCcc-cccceeceecc Confidence 56777776655432111 1111 11111111111111111111 0110 011112110 1111000 Q ss_pred -cchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccc------- Q lcl|NC_019527. 68 -TPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTK------- 139 (516) Q Consensus 68 -~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~------- 139 (516) ...-+.++....++.+....+..+.|+..++|+|||+|++++|+|++|+++++++++|||+|+++.+..+++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~ 157 (695) T protein:vir:36 78 SNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLA 157 (695) T ss_pred cccCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcccceecccchhhhhhcccc Confidence 000111112222223334455556788999999999999999999999999999999999999876543332 Q ss_pred ------hhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCC--cccCcccccccccccceeeEEeecc Q lcl|NC_019527. 140 ------AKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGAD--VSVPLILDPRTIKKGSLTGFSNIEP 211 (516) Q Consensus 140 ------~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~--~~~Pl~ld~~~I~~g~l~~l~v~d~ 211 (516) ...+.+++++|+.++++|+||++|+++++|+|||||++++|+|++++ +++||.+++.+|+||+||||+|+|| T Consensus 158 ~~~~~~~~~d~dqik~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp 237 (695) T protein:vir:36 158 AGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEP 237 (695) T ss_pred ccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecc Confidence 22345789999999999999999999999999999999999998855 8899988888999999999999999 Q ss_pred eeeccccccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 212 MWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDL 291 (516) Q Consensus 212 ~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~L 291 (516) ||++|..++..||++|+||+|++|+|.|++||+||+++|.++++|+++|++|+|||+|++|.++++|.+|++++.++++| T Consensus 238 ~~vtP~~~n~~dP~spdfgkP~~y~V~G~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~L 317 (695) T protein:vir:36 238 YWVTPNNYNSINPVADDFYKPSTWWMIGTEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDI 317 (695) T ss_pred cccccchhhhccchhhccCCCceEEEeceEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCHHHHHHHHHHHHHhhhcCCc Q lcl|NC_019527. 292 VDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGLADLQSQSQEHMCSVSKIPA 371 (516) Q Consensus 292 l~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl~d~~~~~~~~iaaas~IP~ 371 (516) ++++++.++++++++.|.++...++.+|++++++++||+|++++|+++|+|++++++||||++++++|+++||++++||+ T Consensus 318 i~~~~v~~lk~dla~aL~~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~stslSGLddVi~qf~q~VAgaa~IPl 397 (695) T protein:vir:36 318 VKQFSVSGILMDLAQALMPGANVDLSMRAELINRYRDNRNILFLDKATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPL 397 (695) T ss_pred HHhhhHHHHHHHHHHhhcChhHHHHHHHHHHHHHhcCccceEEEecCCcceEEEecccCCHHHHHHHHHHHHHhhhcCch Confidence 99999999999999999988888899999999999999999999977799999999999999999999999999999999 Q ss_pred eeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHH Q lcl|NC_019527. 372 IKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKA 451 (516) Q Consensus 372 t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a 451 (516) |+||||||+|||||||+|++||||+|+++|++.|+|+|++|+++|++|+||.+|++|+|+|+|||+||++|+|||++|+| T Consensus 398 tkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idpdi~~~fnPL~qmtd~EkAeI~~k~A 477 (695) T protein:vir:36 398 IKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAESRYKQA 477 (695) T ss_pred hhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHcCCCCHHHHHHHHHhhhccCCCC-CChhhhccccccchhc------CCCCCCCC--------CCCCCCCCCC Q lcl|NC_019527. 452 QEAQIYITNSVIDPSEARQQLSDDPDSGWDN-IDGDLEIVQPEMFDDD------GADPYMPD--------PDVLPGEEGS 516 (516) Q Consensus 452 ~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~-~d~~~e~~~~e~~~~e------~~~~~~~~--------~~~~~~~e~t 516 (516) +++++|++.|+|+++|+|++|+.+++|+|.+ +|.+.+.-.+.+.+.. +....+++ ++.......+ T Consensus 478 ~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~ 557 (695) T protein:vir:36 478 QSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVA 557 (695) T ss_pred HHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCcccccccCCCccc Confidence 9999999999999999999999999999976 5533321111111000 01111111 1111000000 No 5 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=100.00 E-value=6.4e-137 Score=767.26 Aligned_cols=505 Identities=22% Similarity=0.302 Sum_probs=403.3 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) ||||+|||.-.-++..+++...|+. ..+... ....+.....++......+.+|...-....++.+++|||||+.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~ 77 (537) T protein:vir:10 1 MFKFWRKKTVEAVQSSIAERIEPRV--GIFGAG-DDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEG 77 (537) T ss_pred CCCcccccccccccccccccccccc--CCCccc-chhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccch Confidence 9999999854322222222222211 111100 1122222223333323334333211111236678899999986433 Q ss_pred hhhcc-----cccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHH Q lcl|NC_019527. 81 YQFLN-----SAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACE 155 (516) Q Consensus 81 ~~~~~-----~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~ 155 (516) ..... ...+...++..++|+|||+|++|++||++|+|||+||+||||+||+|++.++++. +.+.+++|+++++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~~~--~~~~~~~l~~~~~ 155 (537) T protein:vir:10 78 GTFSAYANPNLSEGLVLWYAQQAFIGHQMCALIATHWLVNKACSQMPRDAMRKGYKIISDDGNEL--DPKDAKFIDRYDR 155 (537) T ss_pred hhhhhhccccccchhhhhccccCCccHHHHHHHHhCchhhhhhhhhhHHhhcCCceeecCCcccc--cHHHHHHHHHHHH Confidence 22211 1122234566778999999999999999999999999999999999999875543 3456789999999 Q ss_pred hcChhHHHHHHHHhcccceeeEEEEEecCCC---cccCcccccccccccceeeEEeecceeeccc--ccccccccccccc Q lcl|NC_019527. 156 YYGVMGIIQKAAEHDCFFGRGQISINIKGAD---VSVPLILDPRTIKKGSLTGFSNIEPMWTSPS--AYNALDPTAPDFY 230 (516) Q Consensus 156 ~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~---~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~--~~~~~dp~s~~yg 230 (516) +|+++++|+++++|+|||||++++|.+++.+ +++||++ ++|++|+|++|+|+||+|++|. .++..||++|+|| T Consensus 156 ~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~--~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg 233 (537) T protein:vir:10 156 AFNIKKHAIQFVRKGRIFGIRIALFKVDSPDPYYYEKPFNI--DGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFY 233 (537) T ss_pred HhhHHHHHHHHHHhcccccceEEEEeecCcCCccccccccc--ccccccceeEEEEechhhcccccchhhhccCCccccC Confidence 9999999999999999999999999986544 7888865 5899999999999999999994 5678899999999 Q ss_pred CcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcC Q lcl|NC_019527. 231 KPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNG 310 (516) Q Consensus 231 ~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~ 310 (516) +|++|+|+|++|||||||||+|+++|++++++|+|||+|+||++|++|++|+++++++++|++++++.|+|+++...|++ T Consensus 234 ~P~~y~v~g~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~ 313 (537) T protein:vir:10 234 EPTYWLINGKKYHRSHLAIYINDEVVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLAN 313 (537) T ss_pred CceeeeecCeEecceeEEEecCCCCchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998888865 Q ss_pred ccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHH Q lcl|NC_019527. 311 GEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEI 390 (516) Q Consensus 311 ~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~ 390 (516) ++++.+|+++++++++|+|++++|+++|+|++++++|+|+++++++++++||++++||+|+|||+||+||||||++|+ T Consensus 314 --~~~~~~r~~~~~~~r~n~g~~~id~e~e~~e~~~~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~ 391 (537) T protein:vir:10 314 --KQQFDETMSWWTATRDNYQVRVVDKDNEDVVQIDTTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEE 391 (537) T ss_pred --HHHHHHHHHHHHhhcCCcceeEecCCCceeEEEeccCCCHHHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHH Confidence 467999999999999999999999988999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHH Q lcl|NC_019527. 391 RSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQ 470 (516) Q Consensus 391 ~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~ 470 (516) ++||++|+++|+ .|+|+|++|+++|+++++|. +.+|+|+|+|||++|++||||+++++|+++++|+++|+|+++|+|+ T Consensus 392 ~~yyd~I~~~Qe-~l~p~l~~l~~ll~~~~~~~-~~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~ 469 (537) T protein:vir:10 392 ASYHEECESTQD-DMRPLIDRHHQLVCRSHLRK-RIRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNE 469 (537) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCC-CcceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHH Confidence 999999999998 59999999999999999997 6699999999999999999999999999999999999999999999 Q ss_pred HHHhhhccCCCCCChhhhcccccc--chhc----CCCCCCCCCCC--CCCCCC----------C Q lcl|NC_019527. 471 QLSDDPDSGWDNIDGDLEIVQPEM--FDDD----GADPYMPDPDV--LPGEEG----------S 516 (516) Q Consensus 471 ~l~~~~~~~~~~~d~~~e~~~~e~--~~~e----~~~~~~~~~~~--~~~~e~----------t 516 (516) .|+.++++++.++...+...+.+. .+++ +..+..+++++ .++.++ + T Consensus 470 ~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 533 (537) T protein:vir:10 470 YLRMDPTLGFTSITPAMRPTDAEDIDVDDEGKPVRIIEDQPAPSEMFGATSSGESANDPRDSGA 533 (537) T ss_pred HHhccCccccccccCCCChhhhhcccCCccCCcCCCCCCCCCccccCCCCccccccCCCccCcc Confidence 999999988888743321111111 1111 11111111111 111111 1 No 6 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=100.00 E-value=2.9e-136 Score=763.61 Aligned_cols=494 Identities=25% Similarity=0.381 Sum_probs=404.7 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCC----ChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCc-cchhcccc Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKAR----KLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGT-TPAVAMDS 75 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~-~~~~a~ds 75 (516) -|-|-||| -..+|.+.+|++|.+-|+ .++.+..........++++. +.+.|++-.. ...||||| T Consensus 5 ~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~a~ds 73 (765) T protein:vir:96 5 SWIFGRKK----DNAACSESAPEKVARIPQHDPLDPMIKLGKIRGWNVEPEKAP-------VIRSVKDFLEPGLSVAMDS 73 (765) T ss_pred eeeccccc----ccccccccCchhhhhcCCCCCcccchhHHHHhhcccccccCC-------CCCCCCcccCcccceeccc Confidence 56677765 567899999999965554 33344443334333333221 1233333232 22589999 Q ss_pred cccch--hhhcccccC----------CcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhh Q lcl|NC_019527. 76 LCGPT--YQFLNSAAG----------GLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEM 143 (516) Q Consensus 76 ~~~~~--~~~~~~~~~----------~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~ 143 (516) +.... .... ++.+ ...++..+.|+|||||++|++||++|+|||+||+||||+||+|++.+++. + T Consensus 74 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~f~gyql~alY~~~~l~rkiVd~pAeDa~R~g~~I~~~~~e~---~ 149 (765) T protein:vir:96 74 AYGDGPTPAAK-AAAGGQNPYVVPTMLQDWYNSQGFIGYQACAIISQHWLVDKACSMSGEDAARNGWELKSDGRKL---S 149 (765) T ss_pred cccccccchHH-HhhhccCccchhhHHHhhhcccCCccHHHHHHHHhCchhhhhhhcchHHhhcCCceeecCcccc---C Confidence 85321 1111 1111 11345567899999999999999999999999999999999999865433 3 Q ss_pred HHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCC---CcccCcccccccccccceeeEEeecceeeccc--c Q lcl|NC_019527. 144 ASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGA---DVSVPLILDPRTIKKGSLTGFSNIEPMWTSPS--A 218 (516) Q Consensus 144 ~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~---~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~--~ 218 (516) .+++++|++++++|+++++|+++++|+|||||+++++++++. .+++||+ +++|++|+|++|+++||+|++|. . T Consensus 150 ~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~--~~~I~kg~~kgl~vldp~~~~~~~v~ 227 (765) T protein:vir:96 150 DEQSALIARRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFN--PDGIAPGSYKGISQIDPYWAMPQLTA 227 (765) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccCcchhhcccc--ccccccceeeEEEEechhhcccccch Confidence 456889999999999999999999999999999999998654 4678875 45899999999999999999995 4 Q ss_pred ccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_019527. 219 YNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT 298 (516) Q Consensus 219 ~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~ 298 (516) ++..||++|+||+|++|+|+|++|||||||+|.|.++|++++++++|||+|+||++|++|++|+++++++++|++++++. T Consensus 228 e~~~Dp~sp~fg~P~~y~i~g~~IH~SRli~~~g~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~ 307 (765) T protein:vir:96 228 ESTADPSAEHFYEPDFWIISGKKYHRSHLVVVRGPQPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTS 307 (765) T ss_pred hccccccccccCcceeeeecCceeccceEEEecCCCchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 67889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCHHHHHHHHHHHHHhhhcCCceeeeccc Q lcl|NC_019527. 299 FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGLADLQSQSQEHMCSVSKIPAIKLTGIS 378 (516) Q Consensus 299 v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~s 378 (516) |+|+++...++. ++++.+|+++++++++|+|++++|++ |+|++++++||||++++++++++||++++||+|+|||+| T Consensus 308 v~k~~~~~~l~~--~~~l~~r~~~~~~~r~n~g~~~id~e-e~~e~~s~~lsgl~d~l~~~~~~iAaas~IP~t~LfGqs 384 (765) T protein:vir:96 308 TIHVDVEKAIAN--EDAFNARLAFWIANRDNHGVKVIGID-ETMEQFDTNLSDFDSVIMNQYQLVAAIAKTPATKLLGTS 384 (765) T ss_pred eeeechHhhhcc--HHHHHHHHHHHHHhcCCceeEEecCC-cceeEEecccCCHHHHHHHHHHHHHhhhCCCeeeeccCC Confidence 999998888764 46799999999999999999999986 899999999999999999999999999999999999999 Q ss_pred cccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 379 PSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYI 458 (516) Q Consensus 379 p~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~ 458 (516) |+|||||||+|++|||++|+++||+.|+|+|++|+++|+++ |.++++|+|+|+|||++|++||||+++++|+++++|+ T Consensus 385 p~GlnATGe~D~~nYyD~I~s~Qe~~l~p~le~L~~li~~s--~~i~~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~ 462 (765) T protein:vir:96 385 PKGFNATGEHETISYHEELESIQEHIFDPLLERHYLLLAKS--ESIDVQLEIVWNPVDSTTSQQQAELNNKKAATDEIYI 462 (765) T ss_pred cccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cCCCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999988 5678899999999999999999999999999999999 Q ss_pred HcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCCC-CCCC-CCCCCCCC----------------------- Q lcl|NC_019527. 459 TNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGAD-PYMP-DPDVLPGE----------------------- 513 (516) Q Consensus 459 ~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~~-~~~~-~~~~~~~~----------------------- 513 (516) ++|+|+++|+|++|+.+++++|+.++.+....++.+.+++..+ ...+ +....+++ T Consensus 463 ~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe~~~~~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~ 542 (765) T protein:vir:96 463 NSGVVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPENLAELEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAPR 542 (765) T ss_pred hcCCCCHHHHHHHHhccccCCCCCCCccccccccCCCccccccccCCCcccccccCccccccCCCCccCCCCcccccCCc Confidence 9999999999999999999999988654432222221111110 0000 00000000 Q ss_pred ----------CCC Q lcl|NC_019527. 514 ----------EGS 516 (516) Q Consensus 514 ----------e~t 516 (516) +++ T Consensus 543 ~~~p~~~~~~~~~ 555 (765) T protein:vir:96 543 GTKPLAKAAEEGA 555 (765) T ss_pred ccCCccccccccC Confidence 000 No 7 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=100.00 E-value=9.3e-133 Score=744.42 Aligned_cols=489 Identities=36% Similarity=0.578 Sum_probs=400.5 Q ss_pred CCCcCCCCCChhhhHHHHhHHh--hcCCC-----ccccccC-CCCCCCccCCCccchhcccccccchhhhcccccCCccc Q lcl|NC_019527. 22 RAEEQEKARKLAMRRAVMKSME--RRASD-----AATKWAP-PQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYA 93 (516) Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~-----~~~~~~~-~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~ 93 (516) -+...+.||+.--.+.|..+.. .++.. +...++- |...--+..++..+.+|||++.++.. .....+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~----~~~~~~~~ 76 (532) T protein:vir:94 1 MADTDPTPRPEITYATLQQAQRVDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGR----NGRNALSF 76 (532) T ss_pred CCCCCCCCCcceehhhhhhHhhhhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCccc----cccccccc Confidence 1111233332222222222111 11000 0000000 10000112355566789987654332 22234457 Q ss_pred ccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccc Q lcl|NC_019527. 94 ADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFF 173 (516) Q Consensus 94 ~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rly 173 (516) +..++|+||++|++|++||++|+||++||+||||+|++|++.++++.+ .+.+++|+.++++|++|++|+++++|+||| T Consensus 77 ~~~~~~~~~~l~a~Y~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~~~--~~~~~~i~~~~~~l~v~~~l~~a~~~~rly 154 (532) T protein:vir:94 77 VEATSWPGFPTLALLAQLPEYRTMHETPADECVRAWGKITCSSKDELA--ADKATRITQKLEQYNVRTLVRTVVIHDQAY 154 (532) T ss_pred ccccccchHHHHHHHHcCchhhhhhccchHHHhhCCceEeeCCccccc--hHHHHHHHHHHHhhhHHHHHHHHHHhhhcc Confidence 788899999999999999999999999999999999999987665543 456788999999999999999999999999 Q ss_pred eeeEEEEEecCCC----cccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe-eeEeccceEE Q lcl|NC_019527. 174 GRGQISINIKGAD----VSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL-GREMHASRLL 248 (516) Q Consensus 174 G~a~i~i~i~~~~----~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~-g~~iH~SRli 248 (516) |||+|+|+|++.+ ++.|+++++.+|++|+|++|+|+||+||+|..++..||++|+||+|++|+|. |++||||||| T Consensus 155 G~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v~~g~~iH~SRli 234 (532) T protein:vir:94 155 GGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFYKPDSWIATSGKKIHSSRIH 234 (532) T ss_pred cceEEEEEeccCCccccccccccccccccccceeeEEEeechheecccccccccccccccCCceeEEEccCeeeccceEE Confidence 9999999997654 5668889999999999999999999999999998899999999999999986 7899999999 Q ss_pred EecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcC Q lcl|NC_019527. 249 TIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQS 328 (516) Q Consensus 249 ~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~s 328 (516) ||.|+++|++++++++|||+|++|++|++|++|+++++++++|++++++.|+|+++++.++.++.+++.+|+++++++++ T Consensus 235 ~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~a~~ls~~~~~~~~~r~~~~~~~~~ 314 (532) T protein:vir:94 235 TVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMTNLATDMAQLLAPGGAQSLDARLQLFNLYRD 314 (532) T ss_pred EecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcchhHHHHHHHHHHHHhhcC Confidence 99999999999999999999999999999999999999999999999999999999999998888899999999999999 Q ss_pred CcceEEEecCCcceeEEecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 329 NLGLAVMDFDSEDIVQVNTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSP 408 (516) Q Consensus 329 n~g~~~id~~~e~~e~~~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~ 408 (516) |+|++++++++|+|++++++|+||++++++++++||++++||+|+|||+||+|||||||+|+++||++|+++||+.++|+ T Consensus 315 n~g~~~id~~~e~~e~~~~~lsgl~~~l~~~~~~iAaa~~IP~t~LfG~sp~GlnstGe~D~~~yyd~I~s~Qe~~l~p~ 394 (532) T protein:vir:94 315 NRNIGALDKGTEEIQQTNTPLSGLDSLQAQSQEQMAAVSHIPLVKLLGITPNGLNASSDGEIRVWYDFIAGYQATNLTPL 394 (532) T ss_pred CccceEEcCCCceeEEEecccCCHHHHHHHHHHHHHhHhCCCeeeeecCCcccccccchHHHHHHHHHHHHHHHHHHHHH Confidence 99999999888999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhh Q lcl|NC_019527. 409 LDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLE 488 (516) Q Consensus 409 l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e 488 (516) |++|+++|+++++|.++++|+|+|+|||++|++|+||+++++|+++++|+++|+|+++|+|+.|+.++++++.++....+ T Consensus 395 le~l~~~l~~s~~g~~~~d~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~ 474 (532) T protein:vir:94 395 MEWIIDLIQLSEYGQIDPGLAWEWSPLMELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERD 474 (532) T ss_pred HHHHHHHHHHHhcCCCCCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcCCcccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999988876532221 Q ss_pred ccc-cccchh----cCCCCCCCC-CCCCCCCCCC Q lcl|NC_019527. 489 IVQ-PEMFDD----DGADPYMPD-PDVLPGEEGS 516 (516) Q Consensus 489 ~~~-~e~~~~----e~~~~~~~~-~~~~~~~e~t 516 (516) ..+ .+...+ ...+++.+. .+.+|.++.. T Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (532) T protein:vir:94 475 ELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSE 508 (532) T ss_pred ccccccchhhhhcccccCCCCCCCCCCCCCCCCC Confidence 111 111111 111122221 1222222222 No 8 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=100.00 E-value=1.1e-132 Score=744.07 Aligned_cols=503 Identities=22% Similarity=0.309 Sum_probs=398.3 Q ss_pred CCcchhhhhhhhcccccccccCCCcC-----CCCC----ChhhhHHHHhHHhhcCCCccccccCCC------CCCCccCC Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQ-----EKAR----KLAMRRAVMKSMERRASDAATKWAPPQ------LMPGVVPA 65 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~gv~~~ 65 (516) || +|.|+|.+-...+.+++.||+ +.|+ .+..++. ..+.++++.+..-+.+.. -..++.+- T Consensus 1 ~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (862) T protein:vir:99 1 MF---KKLFKRFLIRQEQSPEPTPVEASIADEIPRHDPLDPLARTR--QNWPVQKEKPNPIIRSVKDFPFVEISDSVNAK 75 (862) T ss_pred Ch---hHHHHHHhhhhhcCCCCccchhhhhhhccccCccchHHhhc--ccCCcccccCCCCCCcccccccccccccccch Confidence 65 466666666666777777774 2332 2222221 234444443322111111 11122221 Q ss_pred Cc-----------------cchhcccccccchhhhcccc--------cCCcccccccCcccHHHHHHHHhCchhhhhhhh Q lcl|NC_019527. 66 GT-----------------TPAVAMDSLCGPTYQFLNSA--------AGGLYAADIQPFPGYQNLAALATRPEYRAFAST 120 (516) Q Consensus 66 ~~-----------------~~~~a~ds~~~~~~~~~~~~--------~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~ 120 (516) .. .+.++|||..+...+..... .....++.+++|+|||+|++|++||++|+|||+ T Consensus 76 ~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gyql~alY~~~~larkiVd~ 155 (862) T protein:vir:99 76 SVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGHQACALIAQHWLVDKACSL 155 (862) T ss_pred hhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccccccchhccccccccCcccHHHHHHHHhCchhhhhhhh Confidence 11 12334444333222221111 122456677899999999999999999999999 Q ss_pred hhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCC---cccCccccccc Q lcl|NC_019527. 121 LSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGAD---VSVPLILDPRT 197 (516) Q Consensus 121 ~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~---~~~Pl~ld~~~ 197 (516) ||+||||+||+|++.++++..+ .+++++|++++++|+++++|+++++|+|||||+++++++++.+ +++||++ ++ T Consensus 156 pAeDatR~g~~I~~~~d~~e~~-~e~~~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~~~D~~~LsqPLn~--e~ 232 (862) T protein:vir:99 156 AGEDAIRNGWHLKSLGEGEEID-EESLEKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNP--DG 232 (862) T ss_pred hhHHHhhCCceEeecCcccccC-HHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEecCcCchhhhcCcCc--cc Confidence 9999999999999987655433 4668999999999999999999999999999999998886554 6788765 58 Q ss_pred ccccceeeEEeecceeeccc--cccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHH Q lcl|NC_019527. 198 IKKGSLTGFSNIEPMWTSPS--AYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQ 275 (516) Q Consensus 198 I~~g~l~~l~v~d~~~v~p~--~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~ 275 (516) |.+|+|++|+||||+|++|. .++..||++|+||+|++|+|+|++||+||||+|.|+++|++++++|+|||+|+||++| T Consensus 233 I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGkP~~y~I~g~~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~iy 312 (862) T protein:vir:99 233 ITPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYEPEFWIISGQKYHRSHLIIARGPQPADILKPTYIFGGIPLVQRIY 312 (862) T ss_pred ccccceeEEEEechhhhcccccccccccccccccCCceeeeecCeeeccceeEEecCCCchhhhhccCCccCccHHHHHH Confidence 99999999999999999984 4688999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCHHHH Q lcl|NC_019527. 276 PYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGLADL 355 (516) Q Consensus 276 ~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl~d~ 355 (516) ++|++|+++++++++|++++++.|+|+++++.+.. +.++.+|+++++++++|+|++++|++ |+|++++++||||+++ T Consensus 313 d~L~~~d~t~~saa~Ll~ka~l~v~ktd~l~~l~~--ed~l~~r~~~~~~~rdN~Gi~liD~e-Ee~e~ls~slSGL~dl 389 (862) T protein:vir:99 313 ERVYAAERTANEAPLLAMNKRTTAIHTDTAKAIAN--EDKFIQRLMFWVRYRDNHAVKVLGTD-ETMEQFDTSLADFDAV 389 (862) T ss_pred HHHHHHHHHHHHHHHHHHHhccceeechhHhhhcc--HHHHHHHHHHHHhccCcceeEEecCC-CceeEEecccCChHHH Confidence 99999999999999999999999999998888765 46799999999999999999999986 8999999999999999 Q ss_pred HHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCC Q lcl|NC_019527. 356 QSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSL 435 (516) Q Consensus 356 ~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL 435 (516) +++++++||++++||+|+|||+||+|||||||+|++|||++|+++|++.|+|+|++|+.+++++ +| +|++|+|+|+|| T Consensus 390 l~~~~q~IAaas~IP~tiLfGqspaGlnATGE~D~~nYyD~I~s~QE~~L~P~LerL~~li~~~-lg-~~~d~~ieFnpL 467 (862) T protein:vir:99 390 IMGQYQLVASIAKTPATKLLGTAPKGFNSTGEFETISYHEELESIQEHVYMPFLQRHYLISRLS-LG-IQHEIDVVMEPV 467 (862) T ss_pred HHHHHHHHHhhhCCCceeecccCcccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cC-CCCcceEEeCCC Confidence 9999999999999999999999999999999999999999999999999999999999887665 45 678999999999 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh-hccccccchhcCC-CCCCCCCCCCCC- Q lcl|NC_019527. 436 WQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL-EIVQPEMFDDDGA-DPYMPDPDVLPG- 512 (516) Q Consensus 436 ~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~-e~~~~e~~~~e~~-~~~~~~~~~~~~- 512 (516) |++|++|+||+++++|+++++|+++|+|+++|+|++|+.++.++|++++.+. |......++...+ +.++......|. T Consensus 468 ~~~sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~~~ap~d 547 (862) T protein:vir:99 468 ASMTAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQKAGAAQETASAK 547 (862) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcccccccccCCccccccccc Confidence 9999999999999999999999999999999999999999999999986443 3211111111000 000000000000 Q ss_pred --CCCC Q lcl|NC_019527. 513 --EEGS 516 (516) Q Consensus 513 --~e~t 516 (516) ..++ T Consensus 548 e~~aga 553 (862) T protein:vir:99 548 ETQAGA 553 (862) T ss_pred cccccc Confidence 0000 No 9 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=100.00 E-value=5.2e-119 Score=669.04 Aligned_cols=428 Identities=16% Similarity=0.180 Sum_probs=360.5 Q ss_pred hhcccccccchhhhcccccCC-cccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHH Q lcl|NC_019527. 70 AVAMDSLCGPTYQFLNSAAGG-LYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIK 148 (516) Q Consensus 70 ~~a~ds~~~~~~~~~~~~~~~-~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~ 148 (516) .-.+||..+...+..+..... +.++....+.+|++|++|++||++|+|||+||+||||+||+|++.+.+ .++++ T Consensus 1 ~~~~D~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d~~-----~~~~~ 75 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSKQEQTYYSPSLSLTDDLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYSNDLN-----SKQLD 75 (437) T ss_pred CchhhhhHhHHhcCCCccccceeecCccccccHHHHHHHHHhCchhhHHhhcchHHhhcCCceEecCCCC-----HHHHH Confidence 224566544322211111111 112223457889999999999999999999999999999999985432 34578 Q ss_pred HHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeecccccccccccccc Q lcl|NC_019527. 149 ELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPD 228 (516) Q Consensus 149 ~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~ 228 (516) +|++++++|++|++|+++++|+||||+|+|+++++++++++||.. +|++++|+|+++|+++|+.++..||++|+ T Consensus 76 ~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~------~~~~~~~~v~~~~~v~~~~~~~~dp~s~~ 149 (437) T protein:vir:52 76 LFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKP------TERLKRLIILPKWKISPTGTKDDDVLSPN 149 (437) T ss_pred HHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCccccccc------CCceeEEEEechhhccccccccccccccc Confidence 899999999999999999999999999999999999999999853 47899999999999999999999999999 Q ss_pred ccCcceeEEeee----EeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeec- Q lcl|NC_019527. 229 FYKPSTWWVLGR----EMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTN- 303 (516) Q Consensus 229 yg~P~~y~v~g~----~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~- 303 (516) ||+|++|+|++. +|||||||||+++++| ++.++|||+|+||++|++|++|+++++++++|++++++.|+|++ T Consensus 150 fg~p~~y~v~~~~~~~~iH~SRii~~~~~~~~---~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~ 226 (437) T protein:vir:52 150 FGRYSEYSILGGSQSITVHHSRLIILNANDAP---LSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKIDIFKIAG 226 (437) T ss_pred cCcceEEEEecCCcceeEccceeEEecCccCC---CccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCCceecch Confidence 999999999874 8999999999999998 46799999999999999999999999999999999999999995 Q ss_pred chhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCHHHHHHHHHHHHHhhhcCCceeeecccccccc Q lcl|NC_019527. 304 MAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLN 383 (516) Q Consensus 304 ~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln 383 (516) +++.++.+.++.+.+|++.++.+++|.|++++|++ |+|++++++|+||++++++++++||++++||+|+|||+||+|| T Consensus 227 l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~-~~~e~~~~~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Gl- 304 (437) T protein:vir:52 227 LSDKIAAGMENEVASVISAVQEIKSATNSLLLDAE-NEYDRKELTFTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGL- 304 (437) T ss_pred HHHHhcCCcHHHHHHHHHHHHHhcCCCceEEEcCC-cceEEEecCcCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccc- Confidence 77888888788899999999999999999999986 8999999999999999999999999999999999999999999 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCC Q lcl|NC_019527. 384 ASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVI 463 (516) Q Consensus 384 atge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi 463 (516) |||++|+++||++|+++||+.++|+|++|+++|+++++|.+|++|+|+|+|||++|++|+||+++++|+++++++++|++ T Consensus 305 asge~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i 384 (437) T protein:vir:52 305 ASGDEDIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGGLPADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVL 384 (437) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCC Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CHHHHHHHHHhhhccCCCCCChh-hhccccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 464 DPSEARQQLSDDPDSGWDNIDGD-LEIVQPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 464 ~~~e~r~~l~~~~~~~~~~~d~~-~e~~~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) +++|+|+.|+.. +.|+.++++ ++..+...+..++..++...++ ++..++. T Consensus 385 ~~~e~r~~L~~~--g~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 435 (437) T protein:vir:52 385 NEYQIANELRES--GLFANISAEHIEELKNADEFAGNFEEPEKMEG-AQVQNSE 435 (437) T ss_pred CHHHHHHHHHhc--CCCCCCCccccccccCCCCCCCccCCCCCCCC-CCCCCCC Confidence 999999999875 456777533 2222211111111100000000 1111111 No 10 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=100.00 E-value=5.7e-119 Score=668.83 Aligned_cols=417 Identities=16% Similarity=0.179 Sum_probs=352.1 Q ss_pred CCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeecccc Q lcl|NC_019527. 59 MPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRT 138 (516) Q Consensus 59 ~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~ 138 (516) ++- +..||..+. .++...+.....+.+|.||++|++|++||++|+|||+||+||||+||+|++.++ T Consensus 1 ~~~---------~~~d~~~~~----~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~~- 66 (427) T protein:vir:10 1 MKI---------VKHDGYNDI----FNGGADGSPKPFFMSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFKMSGVKD- 66 (427) T ss_pred CCc---------cccchHHHH----hhcCCCCcccCccccCchHHHHHHHHcCchhhhhhccchHHhhcCCccccCccH- Confidence 222 333443221 112222223344678999999999999999999999999999999999987532 Q ss_pred chhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCC-cccCcccccccccccceeeEEeecceeeccc Q lcl|NC_019527. 139 KAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGAD-VSVPLILDPRTIKKGSLTGFSNIEPMWTSPS 217 (516) Q Consensus 139 ~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~-~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~ 217 (516) .++|+++|++|++|++|++|++|+||||||+|+++|++.+ +++| .+.+|+|++|+|+|+||++|. T Consensus 67 --------~~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p------~~~~g~l~~l~v~d~~~~~~~ 132 (427) T protein:vir:10 67 --------EKEFKSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQ------AKPGAKLEGVRVYDRFAITVE 132 (427) T ss_pred --------HHHHHHHHHHhhHHHHHHHHHHhccccceeEEEEEecCCCccccc------cCCCcceeEEEEechhccccc Confidence 1468999999999999999999999999999999998765 3444 457899999999999999987 Q ss_pred cccccccccccccCcceeEEee------eEeccceEEEecCCcchhhhhhccCCCCchHH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 218 AYNALDPTAPDFYKPSTWWVLG------REMHASRLLTIITRPLPDMLKPAYNFSGISMS-QLAQPYVENWLRTRQSVSD 290 (516) Q Consensus 218 ~~~~~dp~s~~yg~P~~y~v~g------~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~l-e~~~~~l~~~~~~~~~~~~ 290 (516) .. +.||++|+||+|++|+|++ ++|||||||||.|+++|++++++++|||.|+| |++|++|++|+++++++++ T Consensus 133 ~~-~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~ 211 (427) T protein:vir:10 133 KR-VTNARSPRYGEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDYDYCESLATQ 211 (427) T ss_pred cc-ccCccccccCcceEEEEecCCCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHHH Confidence 54 5799999999999999986 58999999999999999999999999999987 5789999999999999999 Q ss_pred HHHHhCCceeee-cchhhhcCcc-HHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCHHHHHHHHHHHHHhhhc Q lcl|NC_019527. 291 LVDKFSRTFLKT-NMAQVLNGGE-GGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGLADLQSQSQEHMCSVSK 368 (516) Q Consensus 291 Ll~~~~~~v~k~-~~~~~l~~~~-~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl~d~~~~~~~~iaaas~ 368 (516) |++++++.|+|+ +++++++.++ ..++.+|++.+++.+++++.+++++++|+|++++++||||++++++++++||++++ T Consensus 212 l~~k~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl~~~~~~~~~~iaaa~~ 291 (427) T protein:vir:10 212 ILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNSDISGVPEFLSSKMDRIVSLSG 291 (427) T ss_pred HHHHhccccccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCCceeEEecccCChHHHHHHHHHHHHhhhC Confidence 999999999999 5667776654 45788999999999999999999998899999999999999999999999999999 Q ss_pred CCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHH Q lcl|NC_019527. 369 IPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRF 448 (516) Q Consensus 369 IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~ 448 (516) ||+|+|||+||+|||||||+|++|||++|+++||+.++|+|++|+++|+++ ++|+|+|+|||++|++|+||+++ T Consensus 292 IP~t~L~G~sp~Glnstgd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~~s------~~~~~~f~pL~~~s~kEkaei~~ 365 (427) T protein:vir:10 292 IHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVDE------EEWSIEFEPLSVPSKKEESEITK 365 (427) T ss_pred CCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC------CCcEEEeCCCCCCCHHHHHHHHH Confidence 999999999999999999999999999999999999999999999999876 48999999999999999999999 Q ss_pred HHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCCCCCCCCCCCCCC Q lcl|NC_019527. 449 NKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGADPYMPDPDVLPG 512 (516) Q Consensus 449 ~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~~~~~~~~~~~~~ 512 (516) ++|+++++|+++|+++++|+|+.|+... ++.++....+...++.+++++.+|+.++...-.+ T Consensus 366 ~~a~a~~~~~~~gvi~~~e~r~~L~~~~--~~~~~~~~~~~~~e~~~~~~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 366 NNVESVTKAITEQIIDLEEARDTLRSIA--PEFKLKDGNNINIREPEETTEPEPGLGEKLEDEN 427 (427) T ss_pred HHHHHHHHHHhcCCCCHHHHHHHHHhhh--ccccCCCCccccccccchhcCCCCCCCCCCCCCC Confidence 9999999999999999999999997642 3344322222333344444444444333322222 No 11 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=100.00 E-value=1.8e-117 Score=660.63 Aligned_cols=412 Identities=16% Similarity=0.180 Sum_probs=344.5 Q ss_pred CccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccch Q lcl|NC_019527. 61 GVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKA 140 (516) Q Consensus 61 gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~ 140 (516) =++.+|.+++++ ++...+.......+...|++|++|++||++|+|||+||+||||+||+|++.++. T Consensus 1 ~~~~D~~~n~~~------------gg~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~~~~~-- 66 (422) T protein:vir:10 1 MVKTDSYANIFL------------GGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGIDDE-- 66 (422) T ss_pred CccchhhHHHHc------------CCCCCccccCcccccCHHHHHHHHHhChhhHHHHhhhhHHHhcCCccccCCCHH-- Confidence 112233333222 222222222333456679999999999999999999999999999999875431 Q ss_pred hhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEe-cCCCcccCcccccccccccceeeEEeecceeeccccc Q lcl|NC_019527. 141 KEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINI-KGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAY 219 (516) Q Consensus 141 ~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i-~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~ 219 (516) ++++.+|++|++|++|++|++|+||||||+|++.+ +++++++||+. +|+|++|+|+|||||+|..+ T Consensus 67 -------~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~------~g~~~~l~v~d~~~i~~~~~ 133 (422) T protein:vir:10 67 -------PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVRE------GAELETVRVYDRTQVKVQTR 133 (422) T ss_pred -------HHHHHHHHHhhHHHHHHHHHHhhccccceEEEEEecCCCCccccccc------cCceeeEEeeccccccchhc Confidence 35888999999999999999999999999999998 56789999864 47899999999999999765 Q ss_pred cccccccccccCcceeEEee------eEeccceEEEecCCcchhhhhhccCCCCchHHHH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 220 NALDPTAPDFYKPSTWWVLG------REMHASRLLTIITRPLPDMLKPAYNFSGISMSQL-AQPYVENWLRTRQSVSDLV 292 (516) Q Consensus 220 ~~~dp~s~~yg~P~~y~v~g------~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~-~~~~l~~~~~~~~~~~~Ll 292 (516) + .||++|+||+|++|+|.+ ++|||||||+|.|+++|+++++.++|||.|+|++ ||++|++|+++++++++|+ T Consensus 134 ~-~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~ 212 (422) T protein:vir:10 134 E-ENPRNARFGEPLTYRITTNESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLL 212 (422) T ss_pred c-cCccccccCcceEEEEecCCCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 4 699999999999999976 5899999999999999999999999999999975 8999999999999999999 Q ss_pred HHhCCceeeec-chhhhcCcc-HHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCHHHHHHHHHHHHHhhhcCC Q lcl|NC_019527. 293 DKFSRTFLKTN-MAQVLNGGE-GGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGLADLQSQSQEHMCSVSKIP 370 (516) Q Consensus 293 ~~~~~~v~k~~-~~~~l~~~~-~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl~d~~~~~~~~iaaas~IP 370 (516) +++++.|+|++ +++++..++ ..++.+|++.+.+.++|++.+++++++|+|++++++||||++++++++++||++++|| T Consensus 213 ~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl~~~~~~~~~~iaaa~~IP 292 (422) T protein:vir:10 213 KRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVALSGIH 292 (422) T ss_pred HHhccccccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEecccCChHHHHHHHHHHHHhhhCCC Confidence 99999999997 466665543 4568899999999999999999999889999999999999999999999999999999 Q ss_pred ceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_019527. 371 AIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNK 450 (516) Q Consensus 371 ~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~ 450 (516) +|+|||+||+||||||++|++|||++|+++||+.++|+|++|+++|+++ .+|+|+|+|||++|++||||+++++ T Consensus 293 ~t~L~G~s~~Glnatgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~~s------~~~~~~f~pL~~~sekekaei~~~~ 366 (422) T protein:vir:10 293 EIILKNKNVGGVSSSQNTALETFHKLVDRKRNAELLPILEFLIPFIVNA------EEWSVEFNPLAQESSKDKAEILEKN 366 (422) T ss_pred eeeeccCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc------CCcEEEeCCCCCCCHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999875 5899999999999999999999999 Q ss_pred HHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 451 AQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 451 a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~~~~~~~~~~~~~~e 514 (516) |+++++|+++|+++++|+|+.|+...+ ..++...+.. +++++.+..++| .+.|..+ T Consensus 367 a~a~~~~~~~g~i~~~e~r~~L~~~~~--~~~~~~~~~~--~~~~~~~~~~~~----~~~~~~d 422 (422) T protein:vir:10 367 VNSIAALIAAGAMDIDEARDTLRTIAP--EVKINDGSVE--TEVTISETSNDP----LEVPTDD 422 (422) T ss_pred HHHHHHHHhcCCCCHHHHHHHhhhhcc--cccCCCCCCc--cccchhhcCCCC----CCCCCCC Confidence 999999999999999999999976532 2222222211 112222211111 1222222 No 12 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=100.00 E-value=9.3e-116 Score=651.23 Aligned_cols=423 Identities=14% Similarity=0.142 Sum_probs=348.5 Q ss_pred CCcc-CCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeecccc Q lcl|NC_019527. 60 PGVV-PAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRT 138 (516) Q Consensus 60 ~gv~-~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~ 138 (516) =|+. .....+..++||..+...+-.+... ..++..+.+.+++++++|++||++|+|||+||+||||+||+|++.++. T Consensus 1 ~~~~m~~~~~~~~~~D~~~~~~~~~~g~~~--~~~~~~~~~~~~~l~~~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~~~ 78 (435) T protein:vir:79 1 MGVFMSDKVKAITKEDGYNEIFGSKDGTFR--PNAFYMQRAAFKALSQFYEEDGMARRIVDVIPEEMVTPGFKVDGVKNE 78 (435) T ss_pred CCcccccccccchhhcchhhhhcccccccc--cCcccCCcCCHHHHHHHHhcCchhhhhhccchHHhhcCCceecCCChH Confidence 1321 1112233455665432111001100 112233456667889999999999999999999999999999875421 Q ss_pred chhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEe-cCCCcccCcccccccccccceeeEEeecceeeccc Q lcl|NC_019527. 139 KAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINI-KGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPS 217 (516) Q Consensus 139 ~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i-~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~ 217 (516) ++|++++++|++|++|+++++|+|+||||+|++.+ +++++++||++ +|+|++|+|+||+||+|. T Consensus 79 ---------~~~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~------~g~i~~i~v~d~~~i~~~ 143 (435) T protein:vir:79 79 ---------KSFKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKP------GAQLEDIRVYDRYQITIH 143 (435) T ss_pred ---------HHHHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCCCccccccc------CCceeeEEeechhhccch Confidence 45899999999999999999999999999999997 56789999864 378999999999999987 Q ss_pred cccccccccccccCcceeEEe------eeEeccceEEEecCCcchhhhhhccCCCCchHH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 218 AYNALDPTAPDFYKPSTWWVL------GREMHASRLLTIITRPLPDMLKPAYNFSGISMS-QLAQPYVENWLRTRQSVSD 290 (516) Q Consensus 218 ~~~~~dp~s~~yg~P~~y~v~------g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~l-e~~~~~l~~~~~~~~~~~~ 290 (516) .++ .||++|+||+|++|+|+ +++|||||||||+|+++|++++++++|||.|+| |++|++|++|+++++++++ T Consensus 144 ~~~-~dp~sp~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~ 222 (435) T protein:vir:79 144 ERE-TNARSVRYGEPKLYKISPGGDIPEFFVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQ 222 (435) T ss_pred hhc-cCCcccccCcceEEEEecCCCCCceEEcceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHH Confidence 654 79999999999999997 468999999999999999999999999999976 7999999999999999999 Q ss_pred HHHHhCCceeee-cchhhhcCcc-HHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCHHHHHHHHHHHHHhhhc Q lcl|NC_019527. 291 LVDKFSRTFLKT-NMAQVLNGGE-GGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGLADLQSQSQEHMCSVSK 368 (516) Q Consensus 291 Ll~~~~~~v~k~-~~~~~l~~~~-~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl~d~~~~~~~~iaaas~ 368 (516) |++++++.|+|+ ++++.++.+. ...+.+|++.++..+++++.+++++++|+|++++++|+||++++++++++||++++ T Consensus 223 l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~e~~e~~~~~lsgl~~~~~~~~~~iaaa~~ 302 (435) T protein:vir:79 223 LLRRKQQAVWKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDATDEEYEVLNSDVSGVPEFLQEKIDRIVALTG 302 (435) T ss_pred HHHHhcCccccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEecCCcceEEEecccCCHHHHHHHHHHHHHhhhC Confidence 999999999999 5677776654 45688899999999999999999998899999999999999999999999999999 Q ss_pred CCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHH Q lcl|NC_019527. 369 IPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRF 448 (516) Q Consensus 369 IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~ 448 (516) ||+|+|||+||+|||||||+|+++||++|+++||..++|+|++|+++++++ ++|+|+|+|||++|++|+||+++ T Consensus 303 IP~t~L~G~s~~glnstgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~li~~s------~d~~~~f~pL~~~sekEkAei~~ 376 (435) T protein:vir:79 303 IHEIIIKNKNTGGVSASQNTALETFYKLIDRKRVEDYKPILEFLLPFMISE------TEWSIEFEPLSVPSDKDKAEIMA 376 (435) T ss_pred CCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC------CCCeEEeCCCCCCCHHHHHHHHH Confidence 999999999999999999999999999999999999999999999999876 69999999999999999999999 Q ss_pred HHHHHHHHHHHcCCCCHHHHHHHHHh-hhccCCCCCChhhhccccccchhcCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 449 NKAQEAQIYITNSVIDPSEARQQLSD-DPDSGWDNIDGDLEIVQPEMFDDDGADPYMPDPDVLPGEEG 515 (516) Q Consensus 449 ~~a~a~~~~~~~gvi~~~e~r~~l~~-~~~~~~~~~d~~~e~~~~e~~~~e~~~~~~~~~~~~~~~e~ 515 (516) ++|+++++|+++|+|+++|+|+.|+. .+.+++.+.+.+. .++.++.+| ++..+.|++. T Consensus 377 ~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~------~~~~~d~~~---~~~~e~g~~~ 435 (435) T protein:vir:79 377 KNVESVVKLKAEQAINLKETRDTLRSICPDLKIMDNDNIE------LPEPEDLDP---EPGQEGGLNK 435 (435) T ss_pred HHHHHHHHHHhcCCCCHHHHHHHHHHhccccCCCCccccc------CCccccCCC---CCCCCCCCCC Confidence 99999999999999999999999975 4566655422111 111111111 1111122222 No 13 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=100.00 E-value=2.8e-106 Score=599.23 Aligned_cols=428 Identities=18% Similarity=0.234 Sum_probs=330.1 Q ss_pred HhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhccc-ccCCcccccccCcccHHHH-HHHHhCchhhh Q lcl|NC_019527. 39 MKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNS-AAGGLYAADIQPFPGYQNL-AALATRPEYRA 116 (516) Q Consensus 39 ~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~-~~~~~~~~~~~~f~gy~ll-~~y~~~~i~r~ 116 (516) ....+..+...+ .+ ..++....++ .. +.++. ......++.+..+.+|+.| ++|++||++|+ T Consensus 1 ~~~~~~a~~~~~---------~~-~a~~~~~~~~---~~----g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~ 63 (461) T protein:vir:80 1 MYSIDKAKQAKI---------DS-KIVNRNDFMV---GH----GKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMN 63 (461) T ss_pred Cccchhhhhhhh---------hh-hhhhhhHHHh---hc----CCcchhhhhhccccCcccccCHHHHHHHHHhCCccch Confidence 000000000000 00 0001111111 00 01110 1111223344567789887 88899999999 Q ss_pred hhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCccc---Cccc Q lcl|NC_019527. 117 FASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSV---PLIL 193 (516) Q Consensus 117 iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~---Pl~l 193 (516) |||+||+||||+|++|++.++ +..+.|++++++|++|++|+++++|+|+||+|+|+|.+++.+... +.+| T Consensus 64 iVd~~a~d~~r~g~~i~~~~~-------~~~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl 136 (461) T protein:vir:80 64 IVDIISEDMVRAGWSLKTDNK-------EMKKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAI 136 (461) T ss_pred hhccchHHhhcCCeeeecCCH-------HHHHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCc Confidence 999999999999999988643 235678999999999999999999999999999999986654322 2233 Q ss_pred ccccccccceeeEEeecceeeccccccccccccccccCcceeEEee-----------------eEeccceEEEecCCcch Q lcl|NC_019527. 194 DPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG-----------------REMHASRLLTIITRPLP 256 (516) Q Consensus 194 d~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g-----------------~~iH~SRli~~~~~~~p 256 (516) ++.+++ +|++|.+++++.+++ .++..||++|+||+|++|+|.+ ++||+||||||.+.++| T Consensus 137 ~~~~~~--~~~~l~~~~~~~i~~-~~~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~~~ 213 (461) T protein:vir:80 137 DPKTIK--SIPYINTFNTQKVTQ-LYLNQDMFSEHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLRFE 213 (461) T ss_pred cccccc--ceeEEEeccccccch-hhhcccCcCcccccceEEEEeccccccccccccccCccceEEccccEEEecCCCCC Confidence 333332 355566666666664 4567899999999999999964 68999999999999988 Q ss_pred hhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEe Q lcl|NC_019527. 257 DMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMD 336 (516) Q Consensus 257 ~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id 336 (516) +. +||+|++|++|++|++|++++.++++|+++++++++|++....+.+.+..++.++++ .+++|+|+++++ T Consensus 214 ~~------~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~~---~~~~~~g~~~~d 284 (461) T protein:vir:80 214 GE------TKGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDKANLTAMLD---FMFRTEALAIIK 284 (461) T ss_pred cc------ccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHHHHHHHHHH---HhcCCceEEEEc Confidence 64 689999999999999999999999999999999999999888887776666666654 567899999998 Q ss_pred cCCcceeEEecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 337 FDSEDIVQVNTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVI 416 (516) Q Consensus 337 ~~~e~~e~~~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l 416 (516) ++ |+|++++++|+|+++++++++++||++++||+|+|||+|| |.||||++|+++||++|+++||+.++|+|++|+++| T Consensus 285 ~~-e~~e~~~~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~-g~~asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i 362 (461) T protein:vir:80 285 GD-EQLTKESTNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEA-GTLTGAQYDVMNYYARVSSIQENRLRPQLEYLTRLL 362 (461) T ss_pred CC-cceEEEecCcCCHHHHHHHHHHHHhhhhcCCeeeeecccC-CccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 75 8999999999999999999999999999999999999999 777999999999999999999999999999999999 Q ss_pred HHHhCCCc---C---CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHh----hhccCCCCCChh Q lcl|NC_019527. 417 QLSKWGEI---D---DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSD----DPDSGWDNIDGD 486 (516) Q Consensus 417 ~~s~~g~~---~---~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~----~~~~~~~~~d~~ 486 (516) +++.+|.. + .+|+|+|+|||++|+||+||+++++|+++++|+++|+|+++|+|+.|+. ++.+++++++.+ T Consensus 363 ~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~~~~~~~~~~ 442 (461) T protein:vir:80 363 MWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENSSKFSGDSAE 442 (461) T ss_pred HHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCCccCCCCCch Confidence 99998732 2 3899999999999999999999999999999999999999999999864 356677777655 Q ss_pred hhccccccchh-cCCCCCC Q lcl|NC_019527. 487 LEIVQPEMFDD-DGADPYM 504 (516) Q Consensus 487 ~e~~~~e~~~~-e~~~~~~ 504 (516) .+....+..+. .++.+.| T Consensus 443 ~~~~~~~~~~~~~~e~~~g 461 (461) T protein:vir:80 443 IDKLAKLVYDAYAKKNADG 461 (461) T ss_pred hhhhhhhccccccccCCCC Confidence 43322211111 1111111 No 14 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=100.00 E-value=5e-98 Score=554.04 Aligned_cols=423 Identities=12% Similarity=0.032 Sum_probs=307.4 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCccc-HHHHHHHHhCchhhhhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPG-YQNLAALATRPEYRAFAST 120 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~g-y~ll~~y~~~~i~r~iVd~ 120 (516) |..+...++- + ..+..+...+.||+.+...+..+.-...+..+.+..-.+ .+|+++|++|||+|+||++ T Consensus 1 ~~~~~~~~~~---------~-~~~~~~~~~~rd~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Yr~~~ia~~iVd~ 70 (449) T protein:vir:10 1 MTDKLTLAVN---------H-ALNDARMARARMGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLYRRGGIAHGAVEK 70 (449) T ss_pred CchhhHHHHh---------h-hcchhHHHHHHHHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHHhcCchhHHHHHh Confidence 1111110000 0 001112234556654443332221111111111111123 3688999999999999999 Q ss_pred hhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhc---ChhHHHHHHHHhcccceeeEEEEEe-cCCCcccCcccccc Q lcl|NC_019527. 121 LSTELTREGIEITSKDRTKAKEMASKIKELEEACEYY---GVMGIIQKAAEHDCFFGRGQISINI-KGADVSVPLILDPR 196 (516) Q Consensus 121 ~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l---~~~~~l~ea~~~~rlyG~a~i~i~i-~~~~~~~Pl~ld~~ 196 (516) |+|+|+|.|..|...++.+. ....++|+..+++| ++|++|+++++|+|+||||+|++.| +++++++|+. T Consensus 71 ~~d~~~~~~~~i~~g~~~~~---~~~~~~~e~~~~~l~~~~~~~~l~ea~~~~rl~Gga~i~i~v~d~~~l~~Pl~---- 143 (449) T protein:vir:10 71 LVGKCWQTNPEIIEGDDADD---SEDETSWEKKSKQVFTNRLWRSFAEADRRRLVGRYAGILLHIRDEKDWNLPAT---- 143 (449) T ss_pred hhhhhhhcCcccccCccccc---hhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcEEEEEEecCCCCCCcccc---- Confidence 99999999998864443332 22334555555554 6899999999999999999999998 5678898874 Q ss_pred cccccceeeEEeecceeeccccccccccccccccCcceeEEee---------eEeccceEEEecCCcchhhhhhccCCCC Q lcl|NC_019527. 197 TIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG---------REMHASRLLTIITRPLPDMLKPAYNFSG 267 (516) Q Consensus 197 ~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g---------~~iH~SRli~~~~~~~p~~~k~~~~~~G 267 (516) ..++|++|.|+|+++++|..+ +.||++|+||+|++|+|++ .+||||||++|++.++| | T Consensus 144 --~~~~i~~i~v~~~~~i~~~~~-~~dp~sp~yg~P~~y~v~~~~~g~~~~~~~iH~SRl~~~~~~~~~----------g 210 (449) T protein:vir:10 144 --KGRGLQKVSVSWAGSLKVAEW-DTGINSKTYGQPKLWKYTERLPNGSSRRVDIHPDRVFILGDYSED----------A 210 (449) T ss_pred --cCcceeeEEeeccccCChhhh-hcCCCCCCCCCceEEEEeeeccCCCccceeeccceeEeecCCCCC----------C Confidence 235899999999999999765 5799999999999999874 37999999999887754 8 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhCC-----cee-ee---cchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecC Q lcl|NC_019527. 268 ISMSQLAQPYVENWLRTRQSVSDLVDKFSR-----TFL-KT---NMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFD 338 (516) Q Consensus 268 ~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~-----~v~-k~---~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~ 338 (516) +|+|+++|+.|.+++++..++++.+.+... .+. ++ ++...+..+.++...+....+..+.++.+.++++.+ T Consensus 211 ~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~i~~~ 290 (449) T protein:vir:10 211 IGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLYGVSIDELQDKFNEVAGEINRGNDVLMTTQG 290 (449) T ss_pred hhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHhhCCchHHHHHHHHHHHHHhccchheeecCC Confidence 999999999999999999998876554321 111 12 234444444332222222334344456666777764 Q ss_pred CcceeEEecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 339 SEDIVQVNTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQL 418 (516) Q Consensus 339 ~e~~e~~~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~ 418 (516) ++|++++++|+|++++++++++.|||+++||+|+|||+||+||||| +|++|||++|+++|+ .|+|.|++|+++|++ T Consensus 291 -~d~~~~~~~~sgl~d~l~~~~q~iaaa~~IP~t~L~Gqsp~glnst--~D~~nyyd~i~~~Q~-~l~p~le~l~~~l~~ 366 (449) T protein:vir:10 291 -ATVTPLVTSVADPTATYNVNLQTAAAGVDIPTRILIGNQQAERSST--EDQKYFNARCQSRRV-DLSFEIEDFCDKLIE 366 (449) T ss_pred -cceEEEecccCChhHHHHHHHHHHHHHhCCCeeeeeccCccccccc--hhHHHHHHHHHHHHH-hhhHHHHHHHHHHHH Confidence 7899999999999999999999999999999999999999999976 489999999999997 599999999999999 Q ss_pred HhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcC---CCCHHHHHHHHHhhhccCCCCCChhhhccccccc Q lcl|NC_019527. 419 SKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNS---VIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMF 495 (516) Q Consensus 419 s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g---vi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~ 495 (516) +.+|.+|++|+|+|+|||+||+|||||+++++|+++++++++| +++++|+|+.++.++..+. ....+++++ T Consensus 367 s~~g~~~~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~~~~~EiR~~~~~~~~~~~------~~~~e~~de 440 (449) T protein:vir:10 367 LKIIDAVAKKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPAFSREEIRTAAGYDNDDEE------PLGEEDGDE 440 (449) T ss_pred hhcCCCCCceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCCcCHHHHHHHhcccCCCCC------CCCCCCCcc Confidence 9999999999999999999999999999999999999999887 9999999999855443322 222122222 Q ss_pred hhcCCCCCC Q lcl|NC_019527. 496 DDDGADPYM 504 (516) Q Consensus 496 ~~e~~~~~~ 504 (516) .+++.++.+ T Consensus 441 ~~~~~d~~a 449 (449) T protein:vir:10 441 EDKATDSAA 449 (449) T ss_pred ccccCCcCC Confidence 222233322 No 15 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=100.00 E-value=3.8e-58 Score=335.37 Aligned_cols=200 Identities=23% Similarity=0.293 Sum_probs=171.6 Q ss_pred eeee-cchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCHHHHHHHHHHHHHhhhcCCceeeecc Q lcl|NC_019527. 299 FLKT-NMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGLADLQSQSQEHMCSVSKIPAIKLTGI 377 (516) Q Consensus 299 v~k~-~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~ 377 (516) |+|+ +++++++. ++.++.+|++++.+++++++++++|+++|+|++++++||||++++++++++|||+++||+|||||+ T Consensus 1 V~k~~~l~~~~~~-~~~~~~~r~~~~~~~~~~~~~~~ld~~~e~~e~~~~~lsGl~d~l~~~~~~iaa~s~iP~t~LfG~ 79 (201) T protein:vir:10 1 MWKAKGLADLCDD-SDGAARLRLAQVDNNSGVGQAIGIDADSEEYNVLNSDIGGIDTFLSQKFDRIVALSGIHEIILKGK 79 (201) T ss_pred CccchHHHHHhcC-ChHHHHHHHHHHHHhhhhhhhheeecCCcceeeeecCcCChHHHHHHHHHHHHhHhcCchhhhcCC Confidence 8997 56666655 466899999999999999999999998899999999999999999999999999999999999999 Q ss_pred ccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 378 SPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIY 457 (516) Q Consensus 378 sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~ 457 (516) ||+||||||++|++|||++|+++|++.|+|+|++|+++++ ++++|+|+|+|||++|++||||+++++|+++++| T Consensus 80 sp~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~~~~------~~~~~~~~f~pL~~~s~kekAei~~~~a~a~~~~ 153 (201) T protein:vir:10 80 NVGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLPFIV------TEQEWSVEFNPLSQVSDKDKSEILEKNVNSVAAL 153 (201) T ss_pred CCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhc------CCCCceEeeCCCCCCCHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999998654 5789999999999999999999999999999999 Q ss_pred HHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCCCCCCCCCCC Q lcl|NC_019527. 458 ITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGADPYMPDPDV 509 (516) Q Consensus 458 ~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~~~~~~~~~~ 509 (516) +++|+++++|+|++|+.++++++-+.+ ..++++...++.+|+...+++ T Consensus 154 ~~~g~i~~~e~r~~L~~~~~~~~~~~~----~~~~~~~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 154 IAAGIIDADEARDTLRAISTEVKIGEG----SIQTEVVINESEDPLDVSANN 201 (201) T ss_pred HHcCCCCHHHHHHHHHhcCCcCCCCCC----CCCccccccccCCCCCCCCCC Confidence 999999999999999999988774421 112222222333332222222 No 16 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=99.96 E-value=1.9e-28 Score=172.53 Aligned_cols=461 Identities=14% Similarity=0.114 Sum_probs=235.9 Q ss_pred chhhhhhhhcccccccccCCCcCCC-CCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhh Q lcl|NC_019527. 4 FDRKKFKREVADKLADAARAEEQEK-ARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQ 82 (516) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~ 82 (516) .-||-|.|+.=...+-....+...+ |.. |-.+|. -. .+|..+|+. .+......-|......-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~-~~-------~~p~~~~~~--~~~~~~~~~d~~~~~~~r 64 (648) T protein:vir:79 1 MARKVWGRGFWSRISLMWRDEDDDKEPLV------LEESMQ-LG-------EAPGAMPKG--GGGGGSAKRDPKMSLVKR 64 (648) T ss_pred CccchhcchhhhhhhhhccCccccccccc------cccccc-cC-------CCccccCCC--CcccccccccchhHHHHH Confidence 4455555443332222222111111 111 111110 00 111111111 010001111111111111 Q ss_pred ----hcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhc- Q lcl|NC_019527. 83 ----FLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYY- 157 (516) Q Consensus 83 ----~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l- 157 (516) ...+..++..+.+ .++.-.++..+|..++.++++|+++++++.+.+|.+...++...+... .+ ..+.+. T Consensus 65 ~g~~~~~~~~g~~~~~e-pp~d~~~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~~~~~~~--~~---~ll~rPn 138 (648) T protein:vir:79 65 IGLAIMDGGGGGRDFEE-PEFDFNEITSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPNAVEYIR--MR---FTLMAEA 138 (648) T ss_pred hHHHHHhhcCCcccccc-CCcCHHHHHHHHhcChHHHHHHHHHHHHHhhCcceEEecCCccchhhH--HH---HHhhccC Confidence 1111112211111 123233556778899999999999999999999999876654332211 11 112222 Q ss_pred ---ChhHHHHHHHHhcccceeeEEEEEecCCCcccCccccc-ccccccceeeEEeecceeeccccccccccccccccCcc Q lcl|NC_019527. 158 ---GVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDP-RTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPS 233 (516) Q Consensus 158 ---~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~-~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~ 233 (516) ...+.+...+..-.+||.+|+.+..++ +...|+.+.+ .......++++.+++|..++... .+||.+. T Consensus 139 ~~~t~~~f~~~l~~~lll~GNAYveiiRd~-~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~--------d~~g~~~ 209 (648) T protein:vir:79 139 TQIPTNQLFIEIAEDLVKYCNVVIAKSRAK-DALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKR--------DKFGMIK 209 (648) T ss_pred CCCCHHHHHHHHHHHHHhcCCeEEEEEecC-CCccchhhhhhhhccccceeeeEeecCceeEEEE--------cCCCcee Confidence 233334444555568999998876543 2222332221 11122345677788877766422 2467777 Q ss_pred eeEEe--e----eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce--eeecch Q lcl|NC_019527. 234 TWWVL--G----REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF--LKTNMA 305 (516) Q Consensus 234 ~y~v~--g----~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v--~k~~~~ 305 (516) .|.+. + ..++++.||||.... +....+|+|.++.+.+.|.....+....+.++.+...+. ++++.. T Consensus 210 ~Y~y~~~g~~~~~~~~~~dIIHik~~~------~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~ 283 (648) T protein:vir:79 210 GWQQEQEGQDKPQKFKPEDIVHIYYKR------EKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLE 283 (648) T ss_pred eeEEEecCCceeEEecCccEEEEccCC------CCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC Confidence 76543 1 357899999997432 234567999999999999999999999999999877643 333211 Q ss_pred hhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCC------HHHHHHHHHHHHHhhhcCCceeeecccc Q lcl|NC_019527. 306 QVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSG------LADLQSQSQEHMCSVSKIPAIKLTGISP 379 (516) Q Consensus 306 ~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsg------l~d~~~~~~~~iaaas~IP~t~L~G~sp 379 (516) . ...+...+.++.+.....| +.+. +...+.+.+.++..+ +.+...+..++||.+.+||-.+ +|... T Consensus 284 ~----~~~e~~k~~~e~~~~~~~~--~~i~-gg~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~l-LG~~~ 355 (648) T protein:vir:79 284 Q----EGFGAEEGEVDLVRGEVEN--MDVE-GGMVTTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSELM-MGRGG 355 (648) T ss_pred c----cchHHHHHHHHHHHHhccc--cccc-ccccccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhH-cccCC Confidence 1 1112222223333333222 2222 223345555544322 2334566778999999999865 48766 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-Hh---CCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_019527. 380 SGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQL-SK---WGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQ 455 (516) Q Consensus 380 ~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~-s~---~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~ 455 (516) ++..++++....+|++.|...|....+.+...+...+.+ .. +...+..+.|.|++|...+++.+++. +. T Consensus 356 ~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~-------~~ 428 (648) T protein:vir:79 356 TASRSTGDNLSSDFKDRIKALQKVMATFINEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKLENQ-------AV 428 (648) T ss_pred CccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccceEEEeecccchhhHHHHHHH-------HH Confidence 666677888888999999998876665555555444332 12 22223457889999988877766544 45 Q ss_pred HHHHcCCCCHHHHHHHHHhhhcc-CCCC--C-----Chhhhccccc---cchhcCC--------CCCCCCCCCCCCCCCC Q lcl|NC_019527. 456 IYITNSVIDPSEARQQLSDDPDS-GWDN--I-----DGDLEIVQPE---MFDDDGA--------DPYMPDPDVLPGEEGS 516 (516) Q Consensus 456 ~~~~~gvi~~~e~r~~l~~~~~~-~~~~--~-----d~~~e~~~~e---~~~~e~~--------~~~~~~~~~~~~~e~t 516 (516) .++++|++|++|+|+.++..+.. +... + ....+..++. .+..++. .....+.+...+.+++ T Consensus 429 ~l~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~eg~~~e~~~~~~~~~~~g~ 508 (648) T protein:vir:79 429 FLYEHNAISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQATALAALAPTPAGGSSASASGDKKKKATDNKTKPTNQHGT 508 (648) T ss_pred HHHhCCCcCHHHHHHHhCCCCCCCCCCccccccccccchhccccccCCCCCCCCCCCCccccccccccCCCCCCCCCCCc Confidence 68999999999999998654421 1100 0 0000000000 0000000 0000001111111111 No 17 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=99.93 E-value=2.6e-26 Score=160.82 Aligned_cols=401 Identities=13% Similarity=0.135 Sum_probs=217.1 Q ss_pred CccCCCccchhcccccccchhhhcccccCCcccccccCcccHH-------HHHHHHhCchhhhhhhhhhHHHhhCCCeee Q lcl|NC_019527. 61 GVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQ-------NLAALATRPEYRAFASTLSTELTREGIEIT 133 (516) Q Consensus 61 gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~-------ll~~y~~~~i~r~iVd~~aed~~r~~~~i~ 133 (516) =..++|+..+--..+-..+. ...... +.++.|++ ....|.+++.+.+||+.+++++-+..+.+. T Consensus 1 ~~~~~~~~~~~p~~~~~~~~---~~~~~~------~~~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~ 71 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSPQ---MQDSYY------YAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCM 71 (518) T ss_pred CcccCceeeccchhhhhhhh---hhhccc------ccceeceecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEE Confidence 11223321110000000000 000000 00111111 124577899999999999999999888884 Q ss_pred eccccch-hhhHHHHHHHHHHHHhcChhHHHHHHHHhcc-cceeeEEEEEecCCCcccCcccccccccccceeeEEeecc Q lcl|NC_019527. 134 SKDRTKA-KEMASKIKELEEACEYYGVMGIIQKAAEHDC-FFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEP 211 (516) Q Consensus 134 ~~~~~~~-~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~r-lyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~ 211 (516) -.+.+.. +.....+..|...-....-...|.+.+.... ++|.+++++.-+ ..|.+.+|.+++| T Consensus 72 ~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~---------------~~G~~~~L~~l~p 136 (518) T protein:vir:78 72 FTSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN---------------KSGTPEKLMPMHP 136 (518) T ss_pred EEcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEc---------------CCCcEEEEEEECC Confidence 4332221 1111222333333333445556666666554 679988887442 2245667888988 Q ss_pred eeeccccccccccccccccCcceeEEe--------eeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHH Q lcl|NC_019527. 212 MWTSPSAYNALDPTAPDFYKPSTWWVL--------GREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLR 283 (516) Q Consensus 212 ~~v~p~~~~~~dp~s~~yg~P~~y~v~--------g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~ 283 (516) .+|++..... +-...|++. ...++++.||||.+.- +.....|+|.++.+.+.|..... T Consensus 137 ~~Vtv~~~~~--------~~~~~y~~~~~~~~~~~~~~~~~~eIiHir~~~------~dg~~~G~Spi~~~~~~i~~~~a 202 (518) T protein:vir:78 137 SRVAIKRNSR--------TGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFN------PDGLERGLSLMESLKSTIFSEDS 202 (518) T ss_pred CceEEEEcCC--------CCEEEEEEEecCCccceeEEecCCcEEEecCCC------CCcccccccHHHHHHHHHHHHHH Confidence 8887533211 111223332 1357889999997532 11223599999999999999999 Q ss_pred HHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCC--HHHHHHHH Q lcl|NC_019527. 284 TRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSG--LADLQSQS 359 (516) Q Consensus 284 ~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsg--l~d~~~~~ 359 (516) +......++.+.... +++++ ..++....+++.++++......+|.|..++..++.+|+.++.+..+ +-+..... T Consensus 203 a~~~~~~~f~Ng~~p~gvl~~~--~~ls~e~~~~~k~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~ 280 (518) T protein:vir:78 203 SRNATAAMWKNAGRPNLVLRHE--KRLSPEAQQRLREQFDRAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLN 280 (518) T ss_pred HHHHHHHHHhcCCCccEEEecC--CCCCHHHHHHHHHHHHHHhcCcccCCceeEcCCCceEEeccCChhHHHHHHHHHHH Confidence 999999999887765 44443 4454444445555555554444565544433445788888766543 44556677 Q ss_pred HHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEe--CCCCC Q lcl|NC_019527. 360 QEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKF--KSLWQ 437 (516) Q Consensus 360 ~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f--~pL~~ 437 (516) .+.||.+.+||..+| |....+-.++.+.....||.. .|.|.+..|-..|-+..+-.......|+| ..|.. T Consensus 281 ~~eIa~afgVPp~~l-g~~~~st~sn~e~~~~~f~~~-------tL~P~~~~ie~eln~~L~~~~~~~~~~~fd~~~Llr 352 (518) T protein:vir:78 281 REEVCGVYDIAPPIV-HILDRATFSNISAQMRAFYRD-------TMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQ 352 (518) T ss_pred HHHHHHHhCCCHHHh-ccCCCCCchhHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcccccCcceEEeechhhhc Confidence 789999999998766 655444444445555555543 47788888877665543322223444555 47888 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhcc--C---------CCCCChhhhc-cccc--cchhcCCCCC Q lcl|NC_019527. 438 TSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDS--G---------WDNIDGDLEI-VQPE--MFDDDGADPY 503 (516) Q Consensus 438 ~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~--~---------~~~~d~~~e~-~~~e--~~~~e~~~~~ 503 (516) .|.++++ +++..++++|++|++|+|+.++..+.. + +.+++..... ...+ ....+..+.+ T Consensus 353 ~D~~~r~-------~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~ 425 (518) T protein:vir:78 353 PDWEAKS-------ESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTP 425 (518) T ss_pred cCHHHHH-------HHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceecccccccccCCCCCCCCCCCCccc Confidence 8887764 455778999999999999998643321 1 0111100000 0000 0000000000 Q ss_pred CCC-----CCCCCCCCCC Q lcl|NC_019527. 504 MPD-----PDVLPGEEGS 516 (516) Q Consensus 504 ~~~-----~~~~~~~e~t 516 (516) .++ +...++.+.+ T Consensus 426 ~~~~~~~~~~~~~~~~~~ 443 (518) T protein:vir:78 426 VASLDQSPPASVPGLSPT 443 (518) T ss_pred ccccccCccccCCCCCcc Confidence 000 0001111111 No 18 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=99.93 E-value=4.9e-26 Score=159.36 Aligned_cols=392 Identities=13% Similarity=0.109 Sum_probs=225.9 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTL 121 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~ 121 (516) |-++.. |.+.. ..+..+.. ....+.+....+ .+.. -..+.+++.++++|+.+ T Consensus 1 m~f~~~-----~~~~~-----------~~~~~~~~--~~~~~~g~~~~~-------~~v~---~~~al~~~~v~~~i~~i 52 (409) T protein:vir:10 1 MLFRKG-----FKNQS-----------QEISIDDK--KILEWLGINPSE-------TYVN---GKSCLKQATVFGCIRIL 52 (409) T ss_pred Cccccc-----ccCcC-----------CCCCCChH--HHHHHhcCCcCc-------ceec---hhhhhccHHHHHHHHHH Confidence 222211 11110 00111110 000111111111 0111 12345688899999999 Q ss_pred hHHHhhCCCeeeeccccchhhh-HHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCccccccccc Q lcl|NC_019527. 122 STELTREGIEITSKDRTKAKEM-ASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIK 199 (516) Q Consensus 122 aed~~r~~~~i~~~~~~~~~~~-~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~ 199 (516) |+++.+..+.+.-..++..+.. ......|...-.....+..|.+.+.+. .++|.|++++..++ T Consensus 53 a~~ia~lp~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~--------------- 117 (409) T protein:vir:10 53 SDNISKLPIKIYQKKDGIKRVPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKK--------------- 117 (409) T ss_pred HHhhhhCceEEEEecCCeeeccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC--------------- Confidence 9999999888743322221111 111222333444445566676666654 66899998875432 Q ss_pred ccceeeEEeecceeeccccccccccccccccCcceeEEe-----eeEeccceEEEecCCcchhhhhhccCCCCchHHHHH Q lcl|NC_019527. 200 KGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL-----GREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLA 274 (516) Q Consensus 200 ~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~-----g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~ 274 (516) .|.+.+|.++.|.+|++...+ +... .+.....|.+. ...++++.|||+.+..+ ...+|.|.++.+ T Consensus 118 ~G~~~~L~~i~~~~V~v~~~~--~~~~-~~~~~~~y~~~~~~g~~~~~~~~evih~r~~~~-------d~~~G~s~i~~~ 187 (409) T protein:vir:10 118 NGEIKGLYPLKSDGMKIFVDD--TGLL-NSENNVWYLYTDDLGQRHKFMSDEILHFKGLTA-------DGLAGLSVIELL 187 (409) T ss_pred CCcEEEEEEEcCCceEEEEcC--Cccc-cccceEEEEEEeCCceeEEeccccEEEecCcCC-------CCcccccHHHHH Confidence 244667889998888764321 2211 11111233332 24689999999975432 245799999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEecccCC Q lcl|NC_019527. 275 QPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVNTPLSG 351 (516) Q Consensus 275 ~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~~~lsg 351 (516) .+.|.....+......++.+.... +++++ ..++.+..+++.++++.......|.+ +++++ ++.+|++++.+..+ T Consensus 188 ~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~--~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~~~~~d 264 (409) T protein:vir:10 188 NHLIENGKSSETYLNNFFKNGLQVKGLVQYA--GDLNPEAEEVFKENFERMSSGLKNAHRIAMLP-IGYKFEPISQKLVD 264 (409) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCcEEEEcC--CCCCHHHHHHHHHHHHHHhccccccCCceecC-CCceEEEccCChhh Confidence 999999999999999999886654 44443 34554444556666665554444555 45555 45789998877654 Q ss_pred H--HHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cCCc Q lcl|NC_019527. 352 L--ADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE--IDDA 427 (516) Q Consensus 352 l--~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~--~~~d 427 (516) . -+...+..++||.+.+||..+| |...++-.++.+.....||. ..|.|.++.|-..|-+..+.. ...+ T Consensus 265 ~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~~e~~~~~f~~-------~~l~P~~~~ie~~ln~kL~~~~~~~~~ 336 (409) T protein:vir:10 265 AQFLENSQLTIRQIASVFGVKMHQL-NDLDRATHSNITEQNREFYI-------DTLQSILNMYELEINYKLFLISEIKNG 336 (409) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCCccccHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcCchhccCC Confidence 3 4567778899999999999766 55555666666766666664 347888888877776655432 2345 Q ss_pred ceEE--eCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccc-cchhcCCCCCC Q lcl|NC_019527. 428 ITFK--FKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPE-MFDDDGADPYM 504 (516) Q Consensus 428 ~~~~--f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e-~~~~e~~~~~~ 504 (516) +.|+ ++.|...|.+++++ ++.+++++|++|++|+|+.++. ++++.-.+...+. ....+. + T Consensus 337 ~~~~fd~~~ll~~d~~~~~~-------~~~~~~~~G~~T~NE~R~~lgl------~p~~ggD~~~~~~n~~~~~~----~ 399 (409) T protein:vir:10 337 FYSKFNVDTILRADIKTRYE-------SYKEAIQNGFKTPNEIRELEED------EPLEGGDVLLINGNMIPVKM----A 399 (409) T ss_pred cEEEEechhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCC------CCCCCcCeeeeccCccchhh----c Confidence 5555 55787888887654 5566899999999999999844 4442211111110 000000 0 Q ss_pred CCCCCCCCCCC Q lcl|NC_019527. 505 PDPDVLPGEEG 515 (516) Q Consensus 505 ~~~~~~~~~e~ 515 (516) ......|+|+ T Consensus 400 -~~~~~kgGe~ 409 (409) T protein:vir:10 400 -GEQYSKGGEK 409 (409) T ss_pred -cccccccCCC Confidence 1112233334 No 19 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=99.93 E-value=3.7e-25 Score=154.55 Aligned_cols=410 Identities=13% Similarity=0.085 Sum_probs=225.5 Q ss_pred HHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhh Q lcl|NC_019527. 36 RAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYR 115 (516) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r 115 (516) =.++..+..- -. |..-..++.+ .+.-+. .....+.+....+.. -......+++.+. T Consensus 1 M~~~~r~~~~-~~----~~~r~~~~~~-------~~~~~~--~~~~~~~g~~~~~~~----------v~~~~al~~~~v~ 56 (432) T protein:vir:10 1 MKIVDSVKKF-FN----FEKRQTSQVI-------ELNKDD--EKLLEWLGISPSTIS----------VKGKNALKVATVF 56 (432) T ss_pred CChHHHHHHh-cC----ccccCccccc-------ccCCch--HHHHHHhCCCcCccc----------cchhhhhccHHHH Confidence 1122222100 00 1111101111 010000 000111111111110 0112345688899 Q ss_pred hhhhhhhHHHhhCCCeeeeccccchh--hhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcc Q lcl|NC_019527. 116 AFASTLSTELTREGIEITSKDRTKAK--EMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLI 192 (516) Q Consensus 116 ~iVd~~aed~~r~~~~i~~~~~~~~~--~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ 192 (516) ++|+.+|+++-+..+.+.-.+++... ........|...-....-+..|.+.+.+. .++|.+++++.-+. T Consensus 57 ~~i~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-------- 128 (432) T protein:vir:10 57 ACIKILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR-------- 128 (432) T ss_pred HHHHHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC-------- Confidence 99999999999999987544332211 11112222333444444556666666655 66888888875421 Q ss_pred cccccccccceeeEEeecceeeccccccccccccccccCcc-eeEEe--e--eEeccceEEEecCCcchhhhhhccCCCC Q lcl|NC_019527. 193 LDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPS-TWWVL--G--REMHASRLLTIITRPLPDMLKPAYNFSG 267 (516) Q Consensus 193 ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~-~y~v~--g--~~iH~SRli~~~~~~~p~~~k~~~~~~G 267 (516) .|.+.+|.+++|.+|++... ....-++... +|.+. | ..++++.||||.... +...+.| T Consensus 129 -------~G~~~~L~~i~~~~v~v~~d----~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~------~~~~~~G 191 (432) T protein:vir:10 129 -------KGKVQALWPIDASKVTVYID----DVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGI------TLDGLVG 191 (432) T ss_pred -------CCcEEEEEEEcCceeEEEEc----CcccccccceEEEEEecCCeEEEEccccEEEecCCC------CCCCccc Confidence 24566788999988875321 1111111122 33332 2 468999999996432 3345779 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEe Q lcl|NC_019527. 268 ISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVN 346 (516) Q Consensus 268 ~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~ 346 (516) .|.++.+.+.|.....+......++.+....-........++.+..+++.++++.......|.+ +.+++. +-+|+.++ T Consensus 192 ~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~-g~~~~~l~ 270 (432) T protein:vir:10 192 VPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPV-GYQFQPIS 270 (432) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecCC-CceEEEcc Confidence 9999999999999999999999998887654333333334544444455666555444444554 455554 57899888 Q ss_pred cccCCH--HHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC- Q lcl|NC_019527. 347 TPLSGL--ADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE- 423 (516) Q Consensus 347 ~~lsgl--~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~- 423 (516) .+..+. -+......++||.+.+||..+| |...+|-.++.+.....|| +..|+|.+..|-..|-+..+.. T Consensus 271 ~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~l-g~~~~~~~s~~e~~~~~~~-------~~~l~P~~~~ie~~ln~kLl~~~ 342 (432) T protein:vir:10 271 LNMSDAQFLENTELTIRQIATAFGIKMHQL-NDLSKATLNNIEQQQQQFY-------TDTLQATLTMYEQEMTYKLFLDS 342 (432) T ss_pred CChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCcccHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhcChh Confidence 765554 3556677899999999998766 5444454555555555554 3467898888888776655432 Q ss_pred -cCCcceEE--eCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc--c--cch Q lcl|NC_019527. 424 -IDDAITFK--FKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP--E--MFD 496 (516) Q Consensus 424 -~~~d~~~~--f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~--e--~~~ 496 (516) ...++.|+ ++.|...|.+++++ ++..++++|++|++|+|+.++..+ ++...+...+ - ... T Consensus 343 ~~~~g~~~~fd~~~l~~~d~~~~~~-------~~~~~~~~G~~t~NE~R~~~g~~p------i~ggD~~~~~~n~~~~~~ 409 (432) T protein:vir:10 343 ELDKGFYSKFNVDAILRADIKTRYE-------AYRTGIQGGFLKPNEARSKEDLPP------EAGGDRLLVNGNMLPIDM 409 (432) T ss_pred hcCCCcEEEeechhhhcCCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCCCC------CCCCCeEeecccccchhh Confidence 23445555 45788888888765 556689999999999999985443 3211110000 0 000 Q ss_pred hcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 497 DDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 497 ~e~~~~~~~~~~~~~~~e~t 516 (516) ..+....+++.+...+.++. T Consensus 410 ~~~~~~k~~~~~~~~~~~~~ 429 (432) T protein:vir:10 410 AGQAYLKGGDTNGEVSKEGN 429 (432) T ss_pred ccccccCCCCCCCCCCCCCC Confidence 00111112222222222222 No 20 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=99.93 E-value=3.7e-25 Score=154.55 Aligned_cols=410 Identities=13% Similarity=0.085 Sum_probs=225.5 Q ss_pred HHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhh Q lcl|NC_019527. 36 RAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYR 115 (516) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r 115 (516) =.++..+..- -. |..-..++.+ .+.-+. .....+.+....+.. -......+++.+. T Consensus 1 M~~~~r~~~~-~~----~~~r~~~~~~-------~~~~~~--~~~~~~~g~~~~~~~----------v~~~~al~~~~v~ 56 (432) T protein:vir:10 1 MKIVDSVKKF-FN----FEKRQTSQVI-------ELNKDD--EKLLEWLGISPSTIS----------VKGKNALKVATVF 56 (432) T ss_pred CChHHHHHHh-cC----ccccCccccc-------ccCCch--HHHHHHhCCCcCccc----------cchhhhhccHHHH Confidence 1122222100 00 1111101111 010000 000111111111110 0112345688899 Q ss_pred hhhhhhhHHHhhCCCeeeeccccchh--hhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcc Q lcl|NC_019527. 116 AFASTLSTELTREGIEITSKDRTKAK--EMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLI 192 (516) Q Consensus 116 ~iVd~~aed~~r~~~~i~~~~~~~~~--~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ 192 (516) ++|+.+|+++-+..+.+.-.+++... ........|...-....-+..|.+.+.+. .++|.+++++.-+. T Consensus 57 ~~i~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-------- 128 (432) T protein:vir:10 57 ACIKILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR-------- 128 (432) T ss_pred HHHHHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC-------- Confidence 99999999999999987544332211 11112222333444444556666666655 66888888875421 Q ss_pred cccccccccceeeEEeecceeeccccccccccccccccCcc-eeEEe--e--eEeccceEEEecCCcchhhhhhccCCCC Q lcl|NC_019527. 193 LDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPS-TWWVL--G--REMHASRLLTIITRPLPDMLKPAYNFSG 267 (516) Q Consensus 193 ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~-~y~v~--g--~~iH~SRli~~~~~~~p~~~k~~~~~~G 267 (516) .|.+.+|.+++|.+|++... ....-++... +|.+. | ..++++.||||.... +...+.| T Consensus 129 -------~G~~~~L~~i~~~~v~v~~d----~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~------~~~~~~G 191 (432) T protein:vir:10 129 -------KGKVQALWPIDASKVTVYID----DVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGI------TLDGLVG 191 (432) T ss_pred -------CCcEEEEEEEcCceeEEEEc----CcccccccceEEEEEecCCeEEEEccccEEEecCCC------CCCCccc Confidence 24566788999988875321 1111111122 33332 2 468999999996432 3345779 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEe Q lcl|NC_019527. 268 ISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVN 346 (516) Q Consensus 268 ~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~ 346 (516) .|.++.+.+.|.....+......++.+....-........++.+..+++.++++.......|.+ +.+++. +-+|+.++ T Consensus 192 ~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~-g~~~~~l~ 270 (432) T protein:vir:10 192 VPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPV-GYQFQPIS 270 (432) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecCC-CceEEEcc Confidence 9999999999999999999999998887654333333334544444455666555444444554 455554 57899888 Q ss_pred cccCCH--HHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC- Q lcl|NC_019527. 347 TPLSGL--ADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE- 423 (516) Q Consensus 347 ~~lsgl--~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~- 423 (516) .+..+. -+......++||.+.+||..+| |...+|-.++.+.....|| +..|+|.+..|-..|-+..+.. T Consensus 271 ~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~l-g~~~~~~~s~~e~~~~~~~-------~~~l~P~~~~ie~~ln~kLl~~~ 342 (432) T protein:vir:10 271 LNMSDAQFLENTELTIRQIATAFGIKMHQL-NDLSKATLNNIEQQQQQFY-------TDTLQATLTMYEQEMTYKLFLDS 342 (432) T ss_pred CChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCcccHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhcChh Confidence 765554 3556677899999999998766 5444454555555555554 3467898888888776655432 Q ss_pred -cCCcceEE--eCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc--c--cch Q lcl|NC_019527. 424 -IDDAITFK--FKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP--E--MFD 496 (516) Q Consensus 424 -~~~d~~~~--f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~--e--~~~ 496 (516) ...++.|+ ++.|...|.+++++ ++..++++|++|++|+|+.++..+ ++...+...+ - ... T Consensus 343 ~~~~g~~~~fd~~~l~~~d~~~~~~-------~~~~~~~~G~~t~NE~R~~~g~~p------i~ggD~~~~~~n~~~~~~ 409 (432) T protein:vir:10 343 ELDKGFYSKFNVDAILRADIKTRYE-------AYRTGIQGGFLKPNEARSKEDLPP------EAGGDRLLVNGNMLPIDM 409 (432) T ss_pred hcCCCcEEEeechhhhcCCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCCCC------CCCCCeEeecccccchhh Confidence 23445555 45788888888765 556689999999999999985443 3211110000 0 000 Q ss_pred hcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 497 DDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 497 ~e~~~~~~~~~~~~~~~e~t 516 (516) ..+....+++.+...+.++. T Consensus 410 ~~~~~~k~~~~~~~~~~~~~ 429 (432) T protein:vir:10 410 AGQAYLKGGDTNGEVSKEGN 429 (432) T ss_pred ccccccCCCCCCCCCCCCCC Confidence 00111112222222222222 No 21 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=99.93 E-value=3.7e-25 Score=154.55 Aligned_cols=410 Identities=13% Similarity=0.085 Sum_probs=225.5 Q ss_pred HHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhh Q lcl|NC_019527. 36 RAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYR 115 (516) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r 115 (516) =.++..+..- -. |..-..++.+ .+.-+. .....+.+....+.. -......+++.+. T Consensus 1 M~~~~r~~~~-~~----~~~r~~~~~~-------~~~~~~--~~~~~~~g~~~~~~~----------v~~~~al~~~~v~ 56 (432) T protein:vir:10 1 MKIVDSVKKF-FN----FEKRQTSQVI-------ELNKDD--EKLLEWLGISPSTIS----------VKGKNALKVATVF 56 (432) T ss_pred CChHHHHHHh-cC----ccccCccccc-------ccCCch--HHHHHHhCCCcCccc----------cchhhhhccHHHH Confidence 1122222100 00 1111101111 010000 000111111111110 0112345688899 Q ss_pred hhhhhhhHHHhhCCCeeeeccccchh--hhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcc Q lcl|NC_019527. 116 AFASTLSTELTREGIEITSKDRTKAK--EMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLI 192 (516) Q Consensus 116 ~iVd~~aed~~r~~~~i~~~~~~~~~--~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ 192 (516) ++|+.+|+++-+..+.+.-.+++... ........|...-....-+..|.+.+.+. .++|.+++++.-+. T Consensus 57 ~~i~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-------- 128 (432) T protein:vir:10 57 ACIKILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR-------- 128 (432) T ss_pred HHHHHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC-------- Confidence 99999999999999987544332211 11112222333444444556666666655 66888888875421 Q ss_pred cccccccccceeeEEeecceeeccccccccccccccccCcc-eeEEe--e--eEeccceEEEecCCcchhhhhhccCCCC Q lcl|NC_019527. 193 LDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPS-TWWVL--G--REMHASRLLTIITRPLPDMLKPAYNFSG 267 (516) Q Consensus 193 ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~-~y~v~--g--~~iH~SRli~~~~~~~p~~~k~~~~~~G 267 (516) .|.+.+|.+++|.+|++... ....-++... +|.+. | ..++++.||||.... +...+.| T Consensus 129 -------~G~~~~L~~i~~~~v~v~~d----~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~------~~~~~~G 191 (432) T protein:vir:10 129 -------KGKVQALWPIDASKVTVYID----DVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGI------TLDGLVG 191 (432) T ss_pred -------CCcEEEEEEEcCceeEEEEc----CcccccccceEEEEEecCCeEEEEccccEEEecCCC------CCCCccc Confidence 24566788999988875321 1111111122 33332 2 468999999996432 3345779 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEe Q lcl|NC_019527. 268 ISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVN 346 (516) Q Consensus 268 ~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~ 346 (516) .|.++.+.+.|.....+......++.+....-........++.+..+++.++++.......|.+ +.+++. +-+|+.++ T Consensus 192 ~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~-g~~~~~l~ 270 (432) T protein:vir:10 192 VPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPV-GYQFQPIS 270 (432) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecCC-CceEEEcc Confidence 9999999999999999999999998887654333333334544444455666555444444554 455554 57899888 Q ss_pred cccCCH--HHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC- Q lcl|NC_019527. 347 TPLSGL--ADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE- 423 (516) Q Consensus 347 ~~lsgl--~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~- 423 (516) .+..+. -+......++||.+.+||..+| |...+|-.++.+.....|| +..|+|.+..|-..|-+..+.. T Consensus 271 ~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~l-g~~~~~~~s~~e~~~~~~~-------~~~l~P~~~~ie~~ln~kLl~~~ 342 (432) T protein:vir:10 271 LNMSDAQFLENTELTIRQIATAFGIKMHQL-NDLSKATLNNIEQQQQQFY-------TDTLQATLTMYEQEMTYKLFLDS 342 (432) T ss_pred CChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCcccHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhcChh Confidence 765554 3556677899999999998766 5444454555555555554 3467898888888776655432 Q ss_pred -cCCcceEE--eCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc--c--cch Q lcl|NC_019527. 424 -IDDAITFK--FKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP--E--MFD 496 (516) Q Consensus 424 -~~~d~~~~--f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~--e--~~~ 496 (516) ...++.|+ ++.|...|.+++++ ++..++++|++|++|+|+.++..+ ++...+...+ - ... T Consensus 343 ~~~~g~~~~fd~~~l~~~d~~~~~~-------~~~~~~~~G~~t~NE~R~~~g~~p------i~ggD~~~~~~n~~~~~~ 409 (432) T protein:vir:10 343 ELDKGFYSKFNVDAILRADIKTRYE-------AYRTGIQGGFLKPNEARSKEDLPP------EAGGDRLLVNGNMLPIDM 409 (432) T ss_pred hcCCCcEEEeechhhhcCCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCCCC------CCCCCeEeecccccchhh Confidence 23445555 45788888888765 556689999999999999985443 3211110000 0 000 Q ss_pred hcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 497 DDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 497 ~e~~~~~~~~~~~~~~~e~t 516 (516) ..+....+++.+...+.++. T Consensus 410 ~~~~~~k~~~~~~~~~~~~~ 429 (432) T protein:vir:10 410 AGQAYLKGGDTNGEVSKEGN 429 (432) T ss_pred ccccccCCCCCCCCCCCCCC Confidence 00111112222222222222 No 22 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=99.92 E-value=2e-25 Score=155.96 Aligned_cols=395 Identities=13% Similarity=0.052 Sum_probs=214.7 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTL 121 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~ 121 (516) |.. -.+ .|.++.... .....+... ..+..... .+ . +-....+.+++.+.++|+.+ T Consensus 1 Mgl-~~~---~f~~~~~~~---------~~~~~~~~~---~~~~~~~~---~g---~---~v~~~~al~~~~v~~~v~~i 55 (409) T protein:vir:84 1 MSL-FTR---IFSGPSEER---------TLTKISGIP---SPAEDWAM---HG---D---RPGANSAMTLGAFYACVTLL 55 (409) T ss_pred Cch-hhh---hhcCCCccc---------ccccccccc---cccchhhc---cC---c---ccchhhhhccHHHHHHHHHH Confidence 210 000 011111000 000000000 00000000 00 0 11122344677899999999 Q ss_pred hHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccc Q lcl|NC_019527. 122 STELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKK 200 (516) Q Consensus 122 aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~ 200 (516) |+++-+..+.+.-.++............|...-....-+..|.+.+.+. .++|.++++|..++. . T Consensus 56 a~~iA~lp~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~--------------~ 121 (409) T protein:vir:84 56 ADTVASLSIDAYRKKDNVRIPVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDE--------------A 121 (409) T ss_pred HHhhhhCceEEEEecCCcccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECC--------------C Confidence 9999998888754433322222222233333444445566666666655 567998888765332 2 Q ss_pred cceeeEEeecceeeccccccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHH Q lcl|NC_019527. 201 GSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVEN 280 (516) Q Consensus 201 g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~ 280 (516) |.+.+|.+++|.+|...... |.....| -..|.+.|+.++++.|||+..... ...+.|+|.++.+.+.|.. T Consensus 122 g~~~~L~~l~p~~v~v~~~~--~~~~~~~--~~~~~~~g~~~~~~dvih~~~~~~------~~~~~G~s~i~~~~~~i~~ 191 (409) T protein:vir:84 122 NRPTAIMPIHPDCIHVTDAK--DEDGDWI--EPVYRIDGKVVPNHRIMHIKRYPV------AGCALGMSPIEKAASAIGL 191 (409) T ss_pred CceEEEEEEcCceeEEEEcC--CCcceEE--EEEecCCceEEchhhEEEecCCCC------CcccccccHHHHHHHHHHH Confidence 44567888888887653322 2222111 124567788999999999986532 2335799999999999999 Q ss_pred HHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCC--HHHHH Q lcl|NC_019527. 281 WLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSG--LADLQ 356 (516) Q Consensus 281 ~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsg--l~d~~ 356 (516) ...+......++.+.... +++++ ..++. ++..+..+.+.....|.|..++..++.+|+.++.+..+ +-+.. T Consensus 192 ~~~~~~~~~~~f~ng~~p~gil~~~--~~l~~---e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~ 266 (409) T protein:vir:84 192 GLAAERYGLRWFRDSANPSGILSSD--ADLTP---DQVKQTQKQWIQSHHNRRLPAVMSAGIKWQSVSITPNESQFLETR 266 (409) T ss_pred HHHHHHHHHHHHhcCCCccEEEecC--CCCCH---HHHHHHHHHHHHHhccCCCeeecCCCceEEEccCChhHHHHHHHH Confidence 999999999988886654 33433 23333 33344444455555566654444455789988866543 34456 Q ss_pred HHHHHHHHhhhcCCceeeecccccc-cc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCc--ceEEe Q lcl|NC_019527. 357 SQSQEHMCSVSKIPAIKLTGISPSG-LN-ASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDA--ITFKF 432 (516) Q Consensus 357 ~~~~~~iaaas~IP~t~L~G~sp~G-ln-atge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d--~~~~f 432 (516) ....++||.+.+||..+| |....+ .. +.-+....+|+. ..|.|.++.|...+-+. ++.+ ++|.+ T Consensus 267 ~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~sn~e~~~~~f~~-------~~l~P~~~~ie~~l~~~----L~~g~~i~fd~ 334 (409) T protein:vir:84 267 SFQRSEIAMWFRIPPHMI-GDVEKSTSWGTGIEEQGINFVR-------HTLLPWLRCIEQALDTF----LPRGQFVKFNV 334 (409) T ss_pred HHHHHHHHHHhCCCHHHh-CCCCCcccccchHHHHHHHHHH-------HHHHHHHHHHHHHHHHh----ccCCCeEEEec Confidence 677889999999998765 543222 21 222333344442 34778777776666542 2334 45666 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc-ccc---hhc-CCCCCCCCC Q lcl|NC_019527. 433 KSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP-EMF---DDD-GADPYMPDP 507 (516) Q Consensus 433 ~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~-e~~---~~e-~~~~~~~~~ 507 (516) +.|...|.++++ +++.+++++|++|++|+|+.++. +++++..+...+ ... +.. ..+...+++ T Consensus 335 ~~l~~~d~~~~~-------~~~~~~~~~G~~t~NE~R~~~g~------~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~ 401 (409) T protein:vir:84 335 DGLMRGDVTARF-------TAYQMGLQNGIWSVNEVRAWEDA------PPIPEGDIHLQPMNFVPLGYVPPEEPAQEPQP 401 (409) T ss_pred hhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhCC------CCCCCcceeeecccccccccCCccccCcCCCC Confidence 688888887765 45567899999999999999844 444221111100 000 000 000011111 Q ss_pred CCCCCCCC Q lcl|NC_019527. 508 DVLPGEEG 515 (516) Q Consensus 508 ~~~~~~e~ 515 (516) +...++.. T Consensus 402 ~~~~~gn~ 409 (409) T protein:vir:84 402 NSATEGNK 409 (409) T ss_pred CCccCCCC Confidence 11111111 No 23 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=99.92 E-value=7.4e-25 Score=152.87 Aligned_cols=403 Identities=14% Similarity=0.131 Sum_probs=213.9 Q ss_pred CccCCCcc---chhcccccccchhhhcccccCCccc--ccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeec Q lcl|NC_019527. 61 GVVPAGTT---PAVAMDSLCGPTYQFLNSAAGGLYA--ADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSK 135 (516) Q Consensus 61 gv~~~~~~---~~~a~ds~~~~~~~~~~~~~~~~~~--~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~ 135 (516) =..++|+. |++|-- .+.+.. ...+... .....+.+ -....|..++.+.+||+.+++++-+-.+.+.-. T Consensus 1 ~~~~~~~~~~~p~~~e~---~~~~~~---~~~~~~~~~~~~~~~~~-~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~~ 73 (518) T protein:vir:10 1 MLLANGQTLSAPAMAEL---SPQMQD---SYYYAPAVGMQLERQFS-LYGGIYKNQPWVRTVIAKRAQALARLPVKCMFT 73 (518) T ss_pred CcccCceeecCchhhhh---hhhhhc---ccccccccceecccccc-hhhHHHhhhHHHHHHHHHHHHhhccCceEEEEE Confidence 11222221 111100 000000 0000000 00000000 112457788999999999999999888877333 Q ss_pred cc-cchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccceeeEEeeccee Q lcl|NC_019527. 136 DR-TKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMW 213 (516) Q Consensus 136 ~~-~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~ 213 (516) .. ...+.....+..|...-........|.+.+... .++|.+++++.-++ .|.+.+|.+++|.+ T Consensus 74 ~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~---------------~G~~~~L~~l~p~~ 138 (518) T protein:vir:10 74 SGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK---------------SGTPEKLMPMHPSR 138 (518) T ss_pred cCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECC---------------CCcEEEEEEECCCc Confidence 22 222111122233333333334455666666655 56799988875422 24456788888888 Q ss_pred eccccccccccccccccCcceeEEe------e--eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHH Q lcl|NC_019527. 214 TSPSAYNALDPTAPDFYKPSTWWVL------G--REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTR 285 (516) Q Consensus 214 v~p~~~~~~dp~s~~yg~P~~y~v~------g--~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~ 285 (516) |++.... +. +. ..|.+. + ..+.++.||||.+.. +...+.|.|.++.+.+.|.....+. T Consensus 139 v~v~~~~--~~-----~~-~~y~~~~~~~~~~~~~~~~~~eViHir~~s------~dg~~~G~spi~~a~~~i~~~~a~~ 204 (518) T protein:vir:10 139 VAIKRNS--RT-----GR-YEYYFQAGAGVGTQLVSFADDEVVPIRFFN------PDGLERGLSLMESLKSTIFSEDSSR 204 (518) T ss_pred eEEEEcC--CC-----CE-EEEEEEecCCccceEEEecCCcEEEecCCC------CCcccccccHHHHHHHHHHHHHHHH Confidence 8753211 10 11 123322 1 356789999997542 1122469999999999999999999 Q ss_pred HHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCC--HHHHHHHHHH Q lcl|NC_019527. 286 QSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSG--LADLQSQSQE 361 (516) Q Consensus 286 ~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsg--l~d~~~~~~~ 361 (516) .....++.+.... +++++ +.++....+++.++++.......|.|..++..++.+|+.++.+..+ +-+...+..+ T Consensus 205 ~~~~~~f~ng~~p~gil~~~--~~ls~e~~~~~k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~ 282 (518) T protein:vir:10 205 NATAAMWKNAGRPNLVLRHE--KRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNRE 282 (518) T ss_pred HHHHHHHhcCCCccEEEecC--CCCCHHHHHHHHHHHHHHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHH Confidence 9999999987765 44443 3454443344555554444444555544433456788888765543 3445567778 Q ss_pred HHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEe--CCCCCCC Q lcl|NC_019527. 362 HMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKF--KSLWQTS 439 (516) Q Consensus 362 ~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f--~pL~~~s 439 (516) .||.+.+||-.+| |..-.+-.++.+.....||.. .|.|.+..|-..|-+..+-.....+.|+| ..|...| T Consensus 283 eIa~afgVPp~~l-g~~~~~t~sn~eq~~~~f~~~-------tL~P~l~~ie~~ln~~L~~~~~~~~~~~fd~~~llr~D 354 (518) T protein:vir:10 283 EVCGVYDIAPPIV-HILDRATFSNISAQMRAFYRD-------TMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPD 354 (518) T ss_pred HHHHHhCCCHHHh-ccCCCCCchhHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcccccCCceEEEechhhhccC Confidence 9999999998666 655444445555555555543 37788888777666543322223445555 4787778 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhcc--C---------CCCCChhhhc-cccccchh--cCCCCCCC Q lcl|NC_019527. 440 AKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDS--G---------WDNIDGDLEI-VQPEMFDD--DGADPYMP 505 (516) Q Consensus 440 ekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~--~---------~~~~d~~~e~-~~~e~~~~--e~~~~~~~ 505 (516) .++++ +++..++++|++|++|+|+.++..+.. + +.++...... ...+.... +..+.+.+ T Consensus 355 ~~~r~-------~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~ 427 (518) T protein:vir:10 355 WEAKS-------ESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVA 427 (518) T ss_pred HHHHH-------HHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccceecccccccccCCCCCCCCCCCCccccc Confidence 77764 456778999999999999998644321 1 0111100000 00000000 00000000 Q ss_pred C-----CCCCCCCCCC Q lcl|NC_019527. 506 D-----PDVLPGEEGS 516 (516) Q Consensus 506 ~-----~~~~~~~e~t 516 (516) + +...++.+.+ T Consensus 428 ~~~~~~~~~~~~~~~~ 443 (518) T protein:vir:10 428 SLDQSPPTSVPGLSPT 443 (518) T ss_pred cccccccccCCCCCcc Confidence 0 0111111111 No 24 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=99.92 E-value=2.7e-25 Score=155.29 Aligned_cols=400 Identities=14% Similarity=0.098 Sum_probs=217.8 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTL 121 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~ 121 (516) |.-- ..|.+......+ ...++. ... +. .+ .+.++... ....+..++.++++|+.+ T Consensus 1 Mg~f-----~~~~~~~~~~~~--------~~~~~~-~~~--~~----~~----~~~~~~~~-~~~~~~~~~~v~~~i~~i 55 (406) T protein:vir:95 1 MGLF-----DRWRRTKRKSKI--------RADTGY-VGL--FM----SG----EDVSFLVP-GYVRLSDNPEVRMAVHKI 55 (406) T ss_pred Ccch-----hhhccccccccc--------cccchh-hhh--hc----cC----cccCcccc-CHHHHhhcHHHHHHHHHH Confidence 2110 011111000000 000000 000 00 00 01111111 123456789999999999 Q ss_pred hHHHhhCCCeeeeccccc-hhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCccccccccc Q lcl|NC_019527. 122 STELTREGIEITSKDRTK-AKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIK 199 (516) Q Consensus 122 aed~~r~~~~i~~~~~~~-~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~ 199 (516) |+++.+..+.+.-..++. ..........|...-..+..+..|.+.+.+. .++|.+++++.+.. -. T Consensus 56 a~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~-------------~~ 122 (406) T protein:vir:95 56 ADLISSMTIYLMQNTEDGDIRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKY-------------TA 122 (406) T ss_pred HHhhccCceEEEEecCCcceeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEE-------------CC Confidence 999999999884333222 1122223344555555555666777666665 55666665554321 12 Q ss_pred ccceeeEEeecceeeccccccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHH Q lcl|NC_019527. 200 KGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVE 279 (516) Q Consensus 200 ~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~ 279 (516) .|.+.+|.+++|.+|++.... + | -.|.+.++.+.++.||||..... +...+.|.|.++.+.+.|. T Consensus 123 ~g~~~~l~~i~~~~v~~~~~~--~------~--~~~~~~~~~~~~~evih~~~~~~-----~~~~~~G~s~i~~~~~~i~ 187 (406) T protein:vir:95 123 DGLIDELVPLTPSKVNFLDTP--D------G--YQVLYGGQTFNYDEVLHFIYNPD-----PERPYIGRGYRVVLKDIAD 187 (406) T ss_pred CCcEEEEEEEcCceeEEEEcC--C------e--EEEEeccEEEchhHEEEeeccCC-----CCCCccccCHHHHHHHHHH Confidence 345667888998888753321 1 1 13455678899999999974321 2345679999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEe-cccC--CHHHH Q lcl|NC_019527. 280 NWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVN-TPLS--GLADL 355 (516) Q Consensus 280 ~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~-~~ls--gl~d~ 355 (516) ....+......++.+....-........++.+..+++.+++......-+|.+ ..++..+.++++++. .+.. .+.+. T Consensus 188 ~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~ 267 (406) T protein:vir:95 188 NLKQATATKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEA 267 (406) T ss_pred HHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhccccccCCceeecCCCccccccccCChhHHHHHHH Confidence 9999999999999887765333322334554444455555544444444544 445544556665542 3332 34466 Q ss_pred HHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCC Q lcl|NC_019527. 356 QSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSL 435 (516) Q Consensus 356 ~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL 435 (516) .....+.||.+.+||..+| |.. +..+....+||. ..|.|.++.|-..|-+..+....-.++|.++.| T Consensus 268 ~~~~~~~Ia~~fgVp~~~l-g~~-----~~~~~~~~~~~~-------~~l~P~~~~ie~~l~~~l~~~~~~~~~fd~~~l 334 (406) T protein:vir:95 268 VELDKRTVAGMFGVPAFLL-GIG-----EFNRDEYNNFIN-------STILPIAKGIEQELTRKLLISPDLYFKFNPRSL 334 (406) T ss_pred HHHHHHHHHHHhCCCHHHc-CCC-----CchHHHHHHHHH-------HHHHHHHHHHHHHHHHhcCCCCCcEEEeechhh Confidence 6788899999999998766 532 222334444443 458999999988887665543222355666778 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 436 WQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGADPYMPDPDVLPGEEG 515 (516) Q Consensus 436 ~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~~~~~~~~~~~~~~e~ 515 (516) ...|.+++++ ++..++++|++|++|+|+.++..+..+-..+-.. ....+.....+.....+++.+...+... T Consensus 335 ~~~d~~~~~~-------~~~~l~~~G~~t~NE~R~~~gl~p~~~gd~~~~~-~n~~~~~~~~~~~~~k~g~~~~~~~~~~ 406 (406) T protein:vir:95 335 YAYDLKELAE-------VGSNMYVRGIMEGNEVRDWLGLSPKEGLSELVIL-ENYIPLDKIGDQSKLKGGDNSGADGQTD 406 (406) T ss_pred hcCCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeec-cCccchhhcccccccCCCCCCCCCCCCC Confidence 7778877654 4566899999999999999855432110000000 0000000000111111111111111111 No 25 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=99.92 E-value=1.1e-24 Score=151.94 Aligned_cols=403 Identities=11% Similarity=0.061 Sum_probs=224.8 Q ss_pred HHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhh Q lcl|NC_019527. 36 RAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYR 115 (516) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r 115 (516) =.+...+-.+.... +.+..... .....+...... ...+... .++ +..-..+ .+++.+. T Consensus 1 MG~f~~lf~~~~~~----------~~~~~~~~-~~~~~~~~~~~~-------~~~~g~~--~~~-~v~~~~a-l~~~~v~ 58 (422) T protein:vir:13 1 MGFLRGLFNKKNNN----------DEKRSNYD-EDIGIDISDSNF-------WEKFGIK--LNF-SVRGKRA-LKENTVY 58 (422) T ss_pred CchhhhhhhccCCc----------cchhhhhh-hccccccCcchh-------hhhcccc--CCc-ccchhhh-hccHHHH Confidence 11222111111100 00000000 000000000000 0000000 000 1111122 3556788 Q ss_pred hhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccc Q lcl|NC_019527. 116 AFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILD 194 (516) Q Consensus 116 ~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld 194 (516) ++|+++++++.+..+.+.-..+...+ ......|...-.....+..|.+.+.+. .++|.|++++.-+. T Consensus 59 ~ci~~ia~~iA~lp~~~~~~~~~~~~--~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~---------- 126 (422) T protein:vir:13 59 VCTKIRAESIGKLSLKIYKDKEEYKE--HELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDR---------- 126 (422) T ss_pred HHHHHHHHhhhhCceEEEecCccccc--chHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---------- Confidence 99999999999999988554433222 122333444444555556677776666 55788888774321 Q ss_pred cccccccceeeEEeecceeeccccccccccccccccCcceeEEe---e--eEeccceEEEecCCcchhhhhhccCCCCch Q lcl|NC_019527. 195 PRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL---G--REMHASRLLTIITRPLPDMLKPAYNFSGIS 269 (516) Q Consensus 195 ~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~---g--~~iH~SRli~~~~~~~p~~~k~~~~~~G~S 269 (516) .|.+.+|.+++|.+|.+... .|.....++.+ .|.+. | ..++++++||+.... +...+.|.| T Consensus 127 -----~G~~~~L~~i~~~~v~~~~~--~~~~~~~~~~~-~y~~~~~~g~~~~~~~~eiih~~~~~------~~~~~~G~s 192 (422) T protein:vir:13 127 -----KGKIIGLYPINSDNVTKIID--DDNFLSSLSKV-WYVVTDKNGKEHKLLPDEMLHFIGDI------TLDGLIGIK 192 (422) T ss_pred -----CCcEEEEEEECCcceEEEEc--CCcceeccceE-EEEEEeCCCeEEEEcccceEEEcCCC------CCCCccccc Confidence 24466788999998875432 23322233332 34443 2 368999999998542 234567999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEe Q lcl|NC_019527. 270 MSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVN 346 (516) Q Consensus 270 ~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~ 346 (516) .++.+.+.+.....+......++.+.... +++++ +.++.+..+++.++++.......|.+ +++++ ++-+|++++ T Consensus 193 ~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~ 269 (422) T protein:vir:13 193 PLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYV--GDLDEKAKKIFKKEFESMSNGLENAHSISLLP-FGYQFQPIS 269 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC--CCCCHHHHHHHHHHHHHHhcCccccCCceecC-CCceeeecc Confidence 99999999999999999999999986543 34443 34544444556666665555545555 45554 457899888 Q ss_pred cccCCH--HHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc Q lcl|NC_019527. 347 TPLSGL--ADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI 424 (516) Q Consensus 347 ~~lsgl--~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~ 424 (516) .+..+. -+...+....||.+.+||..+|.+ ..++..++.+.....||. ..|.|.+..+-..+-+..+... T Consensus 270 ~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~-~~~~~~sn~e~~~~~f~~-------~~l~P~~~~ie~~l~~~Ll~~~ 341 (422) T protein:vir:13 270 LSMADAQFLENSKLTKRELAATFGMKSYHLND-LERATFNNLTEQQKDFYV-------TTLQSSLTVYEQEIQDKLFSQY 341 (422) T ss_pred CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCC-CCCCCcccHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhCChh Confidence 766543 355667778999999999876654 334444555655666553 3478888888877776554332 Q ss_pred --CCcceEEe--CCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccc----cch Q lcl|NC_019527. 425 --DDAITFKF--KSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPE----MFD 496 (516) Q Consensus 425 --~~d~~~~f--~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e----~~~ 496 (516) ..++.|+| +.|...|.+++++ +++.++++|++|++|+|+.++. ++++...+...+- ... T Consensus 342 ~~~~g~~i~fd~~~l~r~d~~~~~~-------~~~~~~~~G~~T~NE~R~~~gl------~p~~ggD~~~~~~n~~~l~~ 408 (422) T protein:vir:13 342 ETLQDVKAEFNVDTILRSDIKTRYE-------AYRIGIQGGFIEANEARRRENL------PPVEGGDRLLVNGNMIPIEM 408 (422) T ss_pred hhcCCceEEeechhhhcCCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCC------CCCCCcCeeeeccCccchhh Confidence 23555555 4787777777655 5566899999999999999854 3332211111110 011 Q ss_pred hcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 497 DDGADPYMPDPDVLPGEE 514 (516) Q Consensus 497 ~e~~~~~~~~~~~~~~~e 514 (516) .++..+.++ ++|++ T Consensus 409 ~~~~~~~~g----~~~g~ 422 (422) T protein:vir:13 409 AGEQYKKGG----EKGGK 422 (422) T ss_pred cccccccCC----CcCCC Confidence 111112222 22222 No 26 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=99.92 E-value=2.9e-24 Score=149.65 Aligned_cols=408 Identities=13% Similarity=0.074 Sum_probs=220.7 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTL 121 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~ 121 (516) |..-.. -+-.+.+.. .....+..++. ....+.+....+. . -......+++.+++||+.+ T Consensus 1 M~~~~~-~f~~~~r~~--------~~~~~~~~~~~--~~~~~~g~~~~~~---------~-v~~~~al~~~~v~~~i~~i 59 (429) T protein:vir:10 1 MDSVKK-FFNFEKRQT--------SQVIELNKDDE--KLLEWLGISPSTI---------S-VKGKNALKVATVFACIKIL 59 (429) T ss_pred Cchhhh-hhcccccCc--------ccccccCCChH--HHHHHhcCCCCcc---------e-echhhhhccHHHHHHHHHH Confidence 211110 000000100 00000100000 0001111000000 0 0112344688899999999 Q ss_pred hHHHhhCCCeeeeccccchh--hhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccc Q lcl|NC_019527. 122 STELTREGIEITSKDRTKAK--EMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTI 198 (516) Q Consensus 122 aed~~r~~~~i~~~~~~~~~--~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I 198 (516) |+++-+..+.+--..++..+ ........|...-....-+..|.+.+.+. .++|.+++++.-+. T Consensus 60 a~~ia~l~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-------------- 125 (429) T protein:vir:10 60 SESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR-------------- 125 (429) T ss_pred HHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC-------------- Confidence 99999998887443332211 11111222333333444455666666665 66888888875431 Q ss_pred cccceeeEEeecceeeccccccccccccccccCcceeEEe--e--eEeccceEEEecCCcchhhhhhccCCCCchHHHHH Q lcl|NC_019527. 199 KKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL--G--REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLA 274 (516) Q Consensus 199 ~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~--g--~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~ 274 (516) .|.+.+|.++++.+|++..... . ...++...+|.+. | +.++++.||||.... +...+.|.|.++.+ T Consensus 126 -~G~~~~L~~i~~~~v~v~~~~~--~-~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~------~~~~~~G~s~i~~~ 195 (429) T protein:vir:10 126 -KGKVQALWPIDASKVTVYIDDV--G-LLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGI------TLDGLVGVPTMEYL 195 (429) T ss_pred -CCcEEEEEEEcCceeEEEEcCc--c-cccccceEEEEEccCCeEEEEccccEEEecCCC------CCCCcccccHHHHH Confidence 2345678889888887532111 0 1111222233433 2 468999999996432 23456799999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEecccCCH- Q lcl|NC_019527. 275 QPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVNTPLSGL- 352 (516) Q Consensus 275 ~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~~~lsgl- 352 (516) .+.|.....+......++.+....-........++.+..+++.++++.......|.+ +.+++ ++-+++.++.+..+. T Consensus 196 ~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~~~~~d~q 274 (429) T protein:vir:10 196 KSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMP-VGYQFQPISLNMSDAQ 274 (429) T ss_pred HHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhccccccCceeecC-CCceEEEccCChhHHH Confidence 999999999999999998887654333333334554444556666655544445555 45554 457888887665443 Q ss_pred -HHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cCCcce Q lcl|NC_019527. 353 -ADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE--IDDAIT 429 (516) Q Consensus 353 -~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~--~~~d~~ 429 (516) -+......++||.+.+||..+| |...+|-.++.+.....|| +..|.|.+..+-..+-+..+.. ...++. T Consensus 275 ~~e~~~~~~~~Ia~~fgVP~~~l-g~~~~~~~sn~e~~~~~f~-------~~~l~P~~~~ie~~ln~kl~~~~~~~~g~~ 346 (429) T protein:vir:10 275 FLENTELTIRQIATAFGIKMHQL-NDLSKATLNNIEQQQQQFY-------TDTLQATLTMYEQEMTYKLFLDSELDKGFY 346 (429) T ss_pred HHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCcccHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhcChhhcCCCcE Confidence 3456677889999999998766 4444455555555555554 3457888888888776655432 234555 Q ss_pred EEeC--CCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccc----cchhcCCCCC Q lcl|NC_019527. 430 FKFK--SLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPE----MFDDDGADPY 503 (516) Q Consensus 430 ~~f~--pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e----~~~~e~~~~~ 503 (516) |+|+ .|...|.+++++ +++.++++|++|++|+|+.+...+ ++...+...+- .....+.... T Consensus 347 ~~fd~~~ll~~d~~~~~~-------~~~~~~~~G~~T~NE~R~~~gl~p------~~ggD~~~~~~n~~~~d~~~~~~~k 413 (429) T protein:vir:10 347 SKFNVDAILRADIKTRYE-------AYRTGIQGGFLKPNEARSKEDLPP------EAGGDRLLVNGNMLPIDMAGQAYLK 413 (429) T ss_pred EEeechhhhcCCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCCCC------CCCcCeeeecccccchhhccccccC Confidence 5554 788888888655 556789999999999999985443 32111111000 0000000111 Q ss_pred CCCCCCCCCCCCC Q lcl|NC_019527. 504 MPDPDVLPGEEGS 516 (516) Q Consensus 504 ~~~~~~~~~~e~t 516 (516) +++.+...+.++. T Consensus 414 ~g~~~~~~~~~~~ 426 (429) T protein:vir:10 414 GGDTNGEVSKEGN 426 (429) T ss_pred CCCCCCCCCCCCC Confidence 1111111111111 No 27 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=99.91 E-value=1.2e-23 Score=146.32 Aligned_cols=453 Identities=13% Similarity=0.094 Sum_probs=226.5 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCC----hhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhccccc Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARK----LAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSL 76 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~ 76 (516) |==|+|-.+- .......+.+..-+ ..++.-...+++........-|..|.. | .|+ T Consensus 5 ~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~--~----------~~~-- 63 (551) T protein:vir:80 5 LGLFESIRLV-------GVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVI--G----------SMS-- 63 (551) T ss_pred hhhHHHhhhc-------cCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccc--c----------cee-- Confidence 3333222100 01111111111110 001111111111111110001111110 0 000 Q ss_pred ccchhhhcccccCCcccccccCcccHH---HHHHHHhCchhhhhhhhhhHHHhh-----------CCCeeeecccc--ch Q lcl|NC_019527. 77 CGPTYQFLNSAAGGLYAADIQPFPGYQ---NLAALATRPEYRAFASTLSTELTR-----------EGIEITSKDRT--KA 140 (516) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~f~gy~---ll~~y~~~~i~r~iVd~~aed~~r-----------~~~~i~~~~~~--~~ 140 (516) ...++. ......+-+. ++..|+.|+++++||+++++.+.+ .+++|...+.+ .. T Consensus 64 ----------~~~~~~-~r~~~~~~~~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~ 132 (551) T protein:vir:80 64 ----------ANPGFK-TKPSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPT 132 (551) T ss_pred ----------cCcccc-cCccccChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccC Confidence 000100 1111122233 456788999999999999998765 34566544322 11 Q ss_pred hhhHHHHHHHHHHHHhcChh--------HHHHHHHH-hcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecc Q lcl|NC_019527. 141 KEMASKIKELEEACEYYGVM--------GIIQKAAE-HDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEP 211 (516) Q Consensus 141 ~~~~~~i~~i~~~~~~l~~~--------~~l~ea~~-~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~ 211 (516) ......++.++..+++.+.. ..|.+.+. ...++|.+++++..+. .|.+.+|.+++| T Consensus 133 ~~~~~~~~~i~~~l~~pn~~~~p~~~s~~~f~~~lv~dlll~Gnay~~i~rd~---------------~G~~~~L~~l~p 197 (551) T protein:vir:80 133 SHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNR---------------NQSMVRFVAKDP 197 (551) T ss_pred hhHHHHHHHHHHHHHhcCCCCCCccchHHHHHHHHHHHHHhcCCEEEEEEECC---------------CCcEEEEEEeCC Confidence 22233455677777777642 33555544 4467888887765432 234667889999 Q ss_pred eeeccccccccccccccccCccee-EE-ee---eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 212 MWTSPSAYNALDPTAPDFYKPSTW-WV-LG---REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQ 286 (516) Q Consensus 212 ~~v~p~~~~~~dp~s~~yg~P~~y-~v-~g---~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~ 286 (516) .+|.+...+.. -....+..| ++ .+ ..+.++.||||..++.++. ....+|.|.++.+.+.|.....+.. T Consensus 198 ~~V~v~~~~~g----~~~~~~~~y~~~~~g~~~~~~~~~eiiH~~~n~~~~~---~~~~~G~spi~~a~~~i~~~~a~~~ 270 (551) T protein:vir:80 198 TTIFFATTADG----KIPDNGNRFVQVIDQKIVATFNAREMAFAVRNPRSDI---YATGYGYPELEIALKQFIAHENTEA 270 (551) T ss_pred ceeEEEECCcc----ccccCceEEEEEeCCcEEEEEcccceEEecccCCCCc---ccccccccHHHHHHHHHHHHHHHHH Confidence 88876432111 000111222 22 22 3578899999987654432 2345699999999999999999999 Q ss_pred HHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceE-EEecCCcceeEEecccCC--HHHHHHHHHH Q lcl|NC_019527. 287 SVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLA-VMDFDSEDIVQVNTPLSG--LADLQSQSQE 361 (516) Q Consensus 287 ~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~-~id~~~e~~e~~~~~lsg--l~d~~~~~~~ 361 (516) ....++.+.... ++.++....++....+.+.+.++....+..|.|.+ ++..++-+|+.++.+..+ +-+......+ T Consensus 271 ~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~ 350 (551) T protein:vir:80 271 FNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLIN 350 (551) T ss_pred HHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCccccccCCCceEEEccCChhHHHHHHHHHHHHH Confidence 999999887643 34444333343333344555554444445566654 554444577777655444 3455667778 Q ss_pred HHHhhhcCCceeeecccccc-cccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCC Q lcl|NC_019527. 362 HMCSVSKIPAIKLTGISPSG-LNAS--SEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQT 438 (516) Q Consensus 362 ~iaaas~IP~t~L~G~sp~G-lnat--ge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~ 438 (516) .||.+.+||-.+| |....+ ..++ +..-..|+-.....+-+..|.|.+..+-..|-+..+......+.|+|+.+... T Consensus 351 ~Ia~aFgVPp~~l-G~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~~~~f~f~~~~~~ 429 (551) T protein:vir:80 351 VISALYGIDPAEI-NIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAEFGDKYTFQFVGGDIK 429 (551) T ss_pred HHHHHhcCCHHHc-CcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccCCceEEEeeccChh Confidence 9999999998655 432111 1110 00011222223334444568888888877776655444445789999988877 Q ss_pred CHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhc--cC---CCCCC-----hhh--hccccccc-------hhcC Q lcl|NC_019527. 439 SAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPD--SG---WDNID-----GDL--EIVQPEMF-------DDDG 499 (516) Q Consensus 439 sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~--~~---~~~~d-----~~~--e~~~~e~~-------~~e~ 499 (516) +.++++++. .++.+|++|++|+|+.++..+. .| +.++. ... ...+.+.. .... T Consensus 430 ~~~~~~~~~--------~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (551) T protein:vir:80 430 SELESVKIL--------AEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQT 501 (551) T ss_pred hHHHHHHHH--------HHHhcCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCcchhhhhhccccccCcC Confidence 776665432 3456799999999999876442 11 11110 000 00000000 0000 Q ss_pred CCCCCCCCCCCCCCCCC Q lcl|NC_019527. 500 ADPYMPDPDVLPGEEGS 516 (516) Q Consensus 500 ~~~~~~~~~~~~~~e~t 516 (516) ..+..+++..+|....+ T Consensus 502 ~~~~~~~~~~~p~~~~~ 518 (551) T protein:vir:80 502 GNRVSTDVEDIPDGKDT 518 (551) T ss_pred CCCCCCCCCCCCCcccc Confidence 00111111112222111 No 28 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=99.91 E-value=2.1e-24 Score=150.38 Aligned_cols=396 Identities=13% Similarity=0.049 Sum_probs=218.9 Q ss_pred hhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCc Q lcl|NC_019527. 33 AMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRP 112 (516) Q Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~ 112 (516) ++...+. +++... + ...+.-..+.+ + .+...... ..+ -...+.+++ T Consensus 1 ~~f~~~f----~r~~~~------~---------~~~~~~~~~~~-----~------~~~~~~~g---~~v-~~~~~l~~~ 46 (413) T protein:vir:48 1 MFFSGLF----QRKSDA------P---------VTTPAELAEAI-----G------LSYDTYTG---KRI-SSQRAMRLT 46 (413) T ss_pred Cccchhh----ccCccC------C---------ccchHHHHHhh-----h------cCcccccC---cee-chhhhhccH Confidence 1111111 000000 0 00000011110 0 00000000 000 113355688 Q ss_pred hhhhhhhhhhHHHhhCCCeeeeccccchhh--hHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCccc Q lcl|NC_019527. 113 EYRAFASTLSTELTREGIEITSKDRTKAKE--MASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSV 189 (516) Q Consensus 113 i~r~iVd~~aed~~r~~~~i~~~~~~~~~~--~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~ 189 (516) .+.++|+.+|+++-+..+.+....++.... .......|...-....-+..|.+.+.+. .++|.|++++.-++ T Consensus 47 ~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~----- 121 (413) T protein:vir:48 47 AVYSCVRVLAESVGMLPCSLYKISGTLKTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKAL----- 121 (413) T ss_pred HHHHHHHHHHHhhhhCceEEEEecCCcceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeCC----- Confidence 899999999999999998875443332211 1112233333444445555666666655 56788887764221 Q ss_pred CcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe---e--eEeccceEEEecCCcchhhhhhccC Q lcl|NC_019527. 190 PLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL---G--REMHASRLLTIITRPLPDMLKPAYN 264 (516) Q Consensus 190 Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~---g--~~iH~SRli~~~~~~~p~~~k~~~~ 264 (516) |.+.+|.++++.+|++..... +.+ .|++. | ..++++.|+|+.+..+ .. T Consensus 122 -----------g~~~~L~~l~~~~v~~~~~~~--------~~~-~y~~~~~~g~~~~~~~~evih~~~~~~-------d~ 174 (413) T protein:vir:48 122 -----------GEVVELLPIDPGCVEPKLNSQ--------WQP-VYQVTFPDGSVDVLTQDEIWHVRTLTL-------DG 174 (413) T ss_pred -----------CcEEEEEEEcCceEEEEEcCC--------ceE-EEEEEecCceEEEEccccEEEecCcCC-------CC Confidence 345568888888887532111 112 23332 2 4689999999976432 24 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeE Q lcl|NC_019527. 265 FSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQ 344 (516) Q Consensus 265 ~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~ 344 (516) .+|.|.++.+.+.|.....+......++.+...+-........++.+..+++.++++......+|.|..++..++.+|+. T Consensus 175 ~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~ 254 (413) T protein:vir:48 175 LVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDFEERHTGLGNAHRPMILEMGLDWKS 254 (413) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEe Confidence 57999999999999999999999999998877543333223344444444566655555444456554443344578998 Q ss_pred EecccCCH--HHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019527. 345 VNTPLSGL--ADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWG 422 (516) Q Consensus 345 ~~~~lsgl--~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g 422 (516) ++.+..+. .+........||.+.+||..+| |...++-.++.+.....||. ..|.|.++.+-+.|-+..+. T Consensus 255 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~n~e~~~~~f~~-------~~i~P~~~~ie~~l~~~L~~ 326 (413) T protein:vir:48 255 MALNAEDSQFLETRKFQLEEICRLFRVPLHMV-QNTDRATFNNIEELGLGFIN-------YSLVPYLTRIEQRINTGLVR 326 (413) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCcCCCcccHHHHHHHHHH-------HHHHHHHHHHHHHHHhhccC Confidence 88766554 4667788899999999998776 43334444555555555553 35778888887777664432 Q ss_pred CcC-Cc--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc-ccchhc Q lcl|NC_019527. 423 EID-DA--ITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP-EMFDDD 498 (516) Q Consensus 423 ~~~-~d--~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~-e~~~~e 498 (516) ... .+ ++|.+..|...|.+++++ ++++++++|++|++|+|+.++. ++++...+...+ ...... T Consensus 327 ~~~~~~~~~~fd~~~l~~~d~~~~~~-------~~~~~~~~g~~T~NE~R~~~g~------~p~~ggD~~~~~~n~~~~~ 393 (413) T protein:vir:48 327 ESKQGKFYAKFNAGALLRGDMKSRFE-------AYATGINWGIYSPNDCRDLEDM------NPRPGGDVYLTPMNMTTSP 393 (413) T ss_pred ccccCCeEEEEechhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCC------CCCCCcceeeccccccccc Confidence 211 13 445555787777777654 5567899999999999998844 333221111111 011111 Q ss_pred -CCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 499 -GADPYMPDPDVLPGEEGS 516 (516) Q Consensus 499 -~~~~~~~~~~~~~~~e~t 516 (516) ..+..+ .+.+..+.+.| T Consensus 394 ~~~~~~~-~~~~~~~~~~~ 411 (413) T protein:vir:48 394 SAGDDNG-KKKESGDADKT 411 (413) T ss_pred cccccCC-CCCCCCCcccc Confidence 111111 11122222222 No 29 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=99.91 E-value=4.8e-24 Score=148.40 Aligned_cols=396 Identities=12% Similarity=0.070 Sum_probs=222.0 Q ss_pred hhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHH-HHHHHHh Q lcl|NC_019527. 32 LAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQ-NLAALAT 110 (516) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~-ll~~y~~ 110 (516) |.+.-.+. ++... .+ ...+. .+ .+.+. .+.. .+.|.. ....+.+ T Consensus 1 Mg~f~~lf----~r~~~------~~----~~~~~----~~-~~~~~-----------~~~~-----~~~g~~v~~~~al~ 45 (414) T protein:vir:44 1 MVFFSGLF----QRKSD------AP----VTTPA----EL-ADAIG-----------LSYD-----TYTGKQISSQRAMR 45 (414) T ss_pred Cchhhhhh----ccCcc------Cc----ccchh----hH-hHhhc-----------cCcc-----ccCCceechhhhhc Confidence 22221111 11110 00 00000 01 11100 0000 011100 0123456 Q ss_pred CchhhhhhhhhhHHHhhCCCeeeeccccchh-h-hHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCc Q lcl|NC_019527. 111 RPEYRAFASTLSTELTREGIEITSKDRTKAK-E-MASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADV 187 (516) Q Consensus 111 ~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~-~-~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~ 187 (516) ++.+.++|+.+|+++-+..+.+...+++... . .......|............|.+.+.+. .++|.|++++.-++ T Consensus 46 ~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~~--- 122 (414) T protein:vir:44 46 LTAVFSCVRVLAESVGMLPCNLYHLNGSLKQRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKAF--- 122 (414) T ss_pred cHHHHHHHHHHHHHhccCceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeCC--- Confidence 8889999999999999999887543332211 1 1112233444555556666777777765 55788887763322 Q ss_pred ccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe-----eeEeccceEEEecCCcchhhhhhc Q lcl|NC_019527. 188 SVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL-----GREMHASRLLTIITRPLPDMLKPA 262 (516) Q Consensus 188 ~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~-----g~~iH~SRli~~~~~~~p~~~k~~ 262 (516) |.+.+|.+++|.+|++.... .+.+ .|.+. ...++++.|+||.+..+ T Consensus 123 -------------g~~~~L~~l~~~~v~~~~~~--------~~~~-~y~~~~~~g~~~~~~~~evih~~~~~~------- 173 (414) T protein:vir:44 123 -------------GEVAELLPVDPGCVVPKLNS--------SWEP-VYQVTFPDGSTDVLSQEDIWHVRTLTL------- 173 (414) T ss_pred -------------CcEEEEEEEcCceEEEEECC--------CCcE-EEEEEecCceEEEEccccEEEecCCCC------- Confidence 23456888888888753221 1222 33332 25689999999975432 Q ss_pred cCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcc Q lcl|NC_019527. 263 YNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSED 341 (516) Q Consensus 263 ~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~ 341 (516) ..+.|.|.++.+.+.|.....+......++.+.............++.+..+++.++++......+|.+ .++++ ++.+ T Consensus 174 d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~ 252 (414) T protein:vir:44 174 DGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDFEERHTGLGNAHRPMILE-MGLD 252 (414) T ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCce Confidence 345799999999999999999999999998887654333333344554444556666655544445555 45554 4578 Q ss_pred eeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 342 IVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLS 419 (516) Q Consensus 342 ~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s 419 (516) |+.++.+..+ +-+......+.||.+.+||..+| |..-++-.++.+.....||. ..|.|.++.+-..|-+. T Consensus 253 ~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l-~~~~~~t~~n~e~~~~~~~~-------~~l~P~~~~ie~~ln~~ 324 (414) T protein:vir:44 253 WKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMV-QNTDRATFNNIEELGLGFIN-------YSLVPYLTRIEQRINTG 324 (414) T ss_pred EEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCcccHHHHHHHHHH-------HHHHHHHHHHHHHHHhh Confidence 9888766554 34556677789999999999776 43333444555555555553 35788888888877765 Q ss_pred hCCCcC-Cc--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccccc-c Q lcl|NC_019527. 420 KWGEID-DA--ITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEM-F 495 (516) Q Consensus 420 ~~g~~~-~d--~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~-~ 495 (516) .+.... .. ++|.+..|...|.+++++ ++++++++|++|++|+|+.++.. +++.......+.. . T Consensus 325 L~~~~~~~~~~i~fd~~~ll~~d~~~~~~-------~~~~~~~~G~~t~NE~R~~~gl~------p~~ggD~~~~~~n~~ 391 (414) T protein:vir:44 325 LVRKSKQGVFYAKFNAGALLRGDMKSRFE-------AYATGINWGIYSPNDCRDLEDMN------PRPGGDVYLTPMNMT 391 (414) T ss_pred cCCccccCceEEEEechhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCCC------CCCCcceeccccccc Confidence 553332 23 345555788888888655 55668999999999999988543 3321111111110 0 Q ss_pred -hhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 496 -DDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 496 -~~e~~~~~~~~~~~~~~~e~t 516 (516) ........+.+.++....+.| T Consensus 392 ~~~~~~~~~~~~~~~~~~d~~~ 413 (414) T protein:vir:44 392 TKPSDGSKAGKQKDNANADETT 413 (414) T ss_pred ccCCccccCCCCCCCCCCCCCC Confidence 000111111222222223333 No 30 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.90 E-value=9.4e-23 Score=141.35 Aligned_cols=452 Identities=14% Similarity=0.084 Sum_probs=226.0 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHh---hcCCC-ccccccCCCCCCCccCCCccchhccccc Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSME---RRASD-AATKWAPPQLMPGVVPAGTTPAVAMDSL 76 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~-~~~~~~~~~~~~gv~~~~~~~~~a~ds~ 76 (516) |==|+|- .+. - -.....+++...+....-.+.+... .+..+ ...-+..| .+..+.+ T Consensus 1 ~~~~~~~--~~~----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~-------------~~~~~~~ 60 (547) T protein:vir:63 1 MGLFESI--RLA----G-VNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQP-------------VIGSMSA 60 (547) T ss_pred Cchhhhh--hhh----c-CCccccccccccccccchhhhhhhHHHHHHhhcccchhhhch-------------hhheeec Confidence 5545322 111 1 1112222333222111112222111 11111 00001111 1111111 Q ss_pred ccchhhhcccccCCcccccccCcccHH---HHHHHHhCchhhhhhhhhhHHHhhC-----------CCeeeecccc--ch Q lcl|NC_019527. 77 CGPTYQFLNSAAGGLYAADIQPFPGYQ---NLAALATRPEYRAFASTLSTELTRE-----------GIEITSKDRT--KA 140 (516) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~f~gy~---ll~~y~~~~i~r~iVd~~aed~~r~-----------~~~i~~~~~~--~~ 140 (516) . .++ .......+-|+ ++..|..++++++||+++++.+.+- +++|...+.+ .. T Consensus 61 ~-----------~g~-~~~~~~~~~~~l~~l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~ 128 (547) T protein:vir:63 61 N-----------PGF-KTKPSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPT 128 (547) T ss_pred c-----------ccc-ccCCccCChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccC Confidence 0 010 01112223344 4567889999999999999877642 3455443221 11 Q ss_pred hhhHHHHHHHHHHHHhcChh--------HHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccceeeEEeecc Q lcl|NC_019527. 141 KEMASKIKELEEACEYYGVM--------GIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEP 211 (516) Q Consensus 141 ~~~~~~i~~i~~~~~~l~~~--------~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~ 211 (516) .....+++.++..+.+.+.. ..|.+++..+ .++|.+++++.-+. .|.+.+|.+++| T Consensus 129 ~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~~~lv~d~ll~Gn~~~~i~rd~---------------~G~~~~L~~l~p 193 (547) T protein:vir:63 129 SHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNR---------------NQSMVRFVAKDP 193 (547) T ss_pred hhhHHHHHHHHHHHHhhCCCCCCccchHHHHHHHHHHHHHhhCCEEEEEEECC---------------CCcEEEEEEecC Confidence 22233455677777776542 3455555544 66788877764421 244667888999 Q ss_pred eeeccccccccccccccccCcc-eeEE-ee---eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 212 MWTSPSAYNALDPTAPDFYKPS-TWWV-LG---REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQ 286 (516) Q Consensus 212 ~~v~p~~~~~~dp~s~~yg~P~-~y~v-~g---~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~ 286 (516) ..|.+...+ |-.- ...+. ++++ .+ ..++++.||||..+++.+. .+..+|.|.++.+.+.|.....+.. T Consensus 194 ~~V~~~~~~--~g~~--~~~~~~y~~~~~~~~~~~~~~~eiih~r~n~~~~~---~~~~~G~Spi~~~~~~i~~~~~a~~ 266 (547) T protein:vir:63 194 TTIFFATTA--DGKI--PDNGNRFVQVIDQKIVATFNAREMAFAVRNPRSDI---YATGYGYPELEIALKQFIAHENTEA 266 (547) T ss_pred ceeEEEECC--cccc--ccCceEEEEEcCCcEEEEeccccEEEecccCCCCc---ccccccccHHHHHHHHHHHHHHHHH Confidence 888764321 1110 01111 2222 22 3578899999987765432 2345799999999999999999999 Q ss_pred HHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceE-EEecCCcceeEEecccCC--HHHHHHHHHH Q lcl|NC_019527. 287 SVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLA-VMDFDSEDIVQVNTPLSG--LADLQSQSQE 361 (516) Q Consensus 287 ~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~-~id~~~e~~e~~~~~lsg--l~d~~~~~~~ 361 (516) ....++.+.... +++++....++....+.+.+.++.....-.|.|.+ ++..++-+|+.++.+..+ +-+......+ T Consensus 267 ~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl~~~g~~~~~l~~~~~d~qfle~~~~~~~ 346 (547) T protein:vir:63 267 FNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLIN 346 (547) T ss_pred HHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCcccccccccccCCCceEEEcCCChhHHHHHHHHHHHHH Confidence 999999886643 34444333343333334444444443344566654 444444577777655443 3345666778 Q ss_pred HHHhhhcCCceeeecccccc-ccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCC Q lcl|NC_019527. 362 HMCSVSKIPAIKLTGISPSG-LNA--SSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQT 438 (516) Q Consensus 362 ~iaaas~IP~t~L~G~sp~G-lna--tge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~ 438 (516) .||.+.+||-.+| |....+ ..+ ++..-..|.-+....+-+..|.|.+..+-..|-...+......+.|+|+.+... T Consensus 347 ~Ia~afgVPP~~l-G~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~~~~ 425 (547) T protein:vir:63 347 VISALYGIDPAEI-NIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAEFGDKYTFQFVGGDIK 425 (547) T ss_pred HHHHHhCCCHHHc-CcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCceEEEeeccccc Confidence 9999999998666 432111 100 000001111111222233457888888877776655444445789999988888 Q ss_pred CHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhc--cC---CCCCC-----hhh--hccccc--------cchhc Q lcl|NC_019527. 439 SAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPD--SG---WDNID-----GDL--EIVQPE--------MFDDD 498 (516) Q Consensus 439 sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~--~~---~~~~d-----~~~--e~~~~e--------~~~~e 498 (516) +..+++++ ..++.+|++|++|+|+.+...+. .| +.++. ... +..+++ ..+.. T Consensus 426 ~~~~~~~~--------~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 497 (547) T protein:vir:63 426 SELESVKI--------LAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQT 497 (547) T ss_pred cHHHHHHH--------HHHHhCCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCCccccchhhcccccccc Confidence 87776542 23567899999999999876442 11 11110 000 000000 00000 Q ss_pred CCCCCCCCCCCCCCCC---------CC Q lcl|NC_019527. 499 GADPYMPDPDVLPGEE---------GS 516 (516) Q Consensus 499 ~~~~~~~~~~~~~~~e---------~t 516 (516) +. +..+++...|.+. +| T Consensus 498 ~~-~~~~~~~~~~~~~~~~~~~~~d~~ 523 (547) T protein:vir:63 498 GN-RVSTDVEDIPDGKDTTGDIGKDGQ 523 (547) T ss_pred CC-CCCCCCCCCCCCcccCCCcCcccc Confidence 00 0111111111111 11 No 31 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=99.89 E-value=3.9e-23 Score=143.42 Aligned_cols=399 Identities=12% Similarity=0.097 Sum_probs=221.1 Q ss_pred hhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCc Q lcl|NC_019527. 33 AMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRP 112 (516) Q Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~ 112 (516) |+. ..+-.++... .. . . .. .+......+ ++... .+.....- ..+.+++ T Consensus 1 m~~----~~~f~~~~~~-----------~~---~-~-~~-~~~~~~~~~-------~~~~~---~~~~~v~~-~~al~~~ 48 (416) T protein:vir:12 1 MLL----ERMFEKRSGS-----------SD---H-E-DG-FNNILLNMF-------GGRKT---ASGERVSE-SNSLVQP 48 (416) T ss_pred Ccc----chhcccccCc-----------cc---c-C-cc-chhHHHHhh-------cCccc---ccCceech-hhhhccH Confidence 111 1110011100 00 0 0 00 000000011 00000 01111111 1233567 Q ss_pred hhhhhhhhhhHHHhhCCCeeeeccccchhh--hHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCccc Q lcl|NC_019527. 113 EYRAFASTLSTELTREGIEITSKDRTKAKE--MASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSV 189 (516) Q Consensus 113 i~r~iVd~~aed~~r~~~~i~~~~~~~~~~--~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~ 189 (516) -+.++|+.+|+++.+..+.+--..++.... .......|...-..+..+..|.+.+... .++|.|++++..+. T Consensus 49 ~v~~~i~~Ia~~ia~l~~~~~~~~~~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~----- 123 (416) T protein:vir:12 49 DIFACVNVLSDDIAKLPIHTYKRTDGGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGS----- 123 (416) T ss_pred HHHHHHHHHHHhhhhCceEEEEecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC----- Confidence 778999999999999999874433322211 1122334555555566666777777766 55788888875432 Q ss_pred CcccccccccccceeeEEeecceeeccccccccccccccccCcceeEE--eee--EeccceEEEecCCcchhhhhhccCC Q lcl|NC_019527. 190 PLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV--LGR--EMHASRLLTIITRPLPDMLKPAYNF 265 (516) Q Consensus 190 Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v--~g~--~iH~SRli~~~~~~~p~~~k~~~~~ 265 (516) .|.+.+|.+++|.+|++..... +.+.+|++ .|. .+++++|+|+.+..+ ..+ T Consensus 124 ----------~G~~~~L~~l~~~~v~v~~~~~--------~~~~~~~~~~~g~~~~~~~~eiih~~~~~~-------~~~ 178 (416) T protein:vir:12 124 ----------HGYPEALFPLRPDYTNAYVHPT--------TGMLWYQTVLNGKAIELYDYEVLHFKGLST-------DGI 178 (416) T ss_pred ----------CCcEEEEEEECCcceEEEEeCC--------CcEEEEEEecCCeEEEecCccEEEecCcCC-------CCc Confidence 1345678889888887543211 11223443 343 678999999975432 346 Q ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCccee Q lcl|NC_019527. 266 SGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIV 343 (516) Q Consensus 266 ~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e 343 (516) .|.|.++.+.+.|.....+......++.+.... +++++ ..++....+++.++ |+...+..++.++++ +-+|+ T Consensus 179 ~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~--~~~~~e~~~~~~~~---~~~~~~~~~~~vl~~-g~~~~ 252 (416) T protein:vir:12 179 HGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVP--AFLDEKPKENVRKE---WKRVNKVENIAIIDY-GLEYQ 252 (416) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecC--CCCCHHHHHHHHHH---HHHHhcCCCeeecCC-CceEE Confidence 799999999999999999999999999986654 44443 23433333334443 444445566677765 47899 Q ss_pred EEecccCCH--HHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019527. 344 QVNTPLSGL--ADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKW 421 (516) Q Consensus 344 ~~~~~lsgl--~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~ 421 (516) +++.+..+. -+...+..++||.+.+||..+|.+. ..+-.++.+...+.||. ..|.|.+..+...|-+..+ T Consensus 253 ~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~-~~~t~sn~e~~~~~f~~-------~~l~P~~~~ie~~l~~~l~ 324 (416) T protein:vir:12 253 SISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNEL-DKATFSNIEHQSIEYVR-------NTLQPWIVNFEQELNVKLF 324 (416) T ss_pred EccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCc-cCCCcccHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhc Confidence 988776653 4677888899999999999877543 33444555555556653 3578888888887766544 Q ss_pred CCc--CCcceEE--eCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCC--Chhhhccccccc Q lcl|NC_019527. 422 GEI--DDAITFK--FKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNI--DGDLEIVQPEMF 495 (516) Q Consensus 422 g~~--~~d~~~~--f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~--d~~~e~~~~e~~ 495 (516) -.. ..++.|+ ++.|...|.++++ +++..++++|++|++|+|+.++..+.-+-+.+ ..+.... +... T Consensus 325 ~~~~~~~g~~i~fd~~~l~~~d~~~~~-------~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~n~~~~-~~~~ 396 (416) T protein:vir:12 325 LDHDQKSGHYVKFNIDSELRGDSKTQA-------EYLKTLHETGVLNKDEIRELLERNPIENGDKYISSLNYVFL-DFLE 396 (416) T ss_pred CchhhcCCceEEeechhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccc-cccc Confidence 322 2244455 5578778888765 45567899999999999999855442111000 0000000 0000 Q ss_pred hhcCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 496 DDDGADPYMPDPDVLPGEEG 515 (516) Q Consensus 496 ~~e~~~~~~~~~~~~~~~e~ 515 (516) +.......+...+.+++.|| T Consensus 397 ~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 397 EYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred hhhccccccccCCCCCcCCC Confidence 10111111111222233333 No 32 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=99.89 E-value=5.2e-23 Score=142.76 Aligned_cols=408 Identities=11% Similarity=0.086 Sum_probs=213.2 Q ss_pred HHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhh Q lcl|NC_019527. 38 VMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAF 117 (516) Q Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~i 117 (516) ++. .|.+. +.....+. .+. .+.....+.+.....++. ..+... -......+++-+.+| T Consensus 1 ~~~-----------~~~~~---~~~~~~~~--~~~-~~~~~~~~~~~~~~~~g~----~~~g~~-v~~~~al~~~~V~~~ 58 (454) T protein:vir:93 1 MWN-----------LLRRT---RKNQKSGR--DVR-EAGWTSLFQAVAEPFAGA----WQQGVK-ADPEAVLSFHAVFAC 58 (454) T ss_pred CCC-----------ccccC---cccccccc--ccc-chhhhhhhhhhhhhhcch----hhcCcc-cChHHhhccHHHHHH Confidence 111 11110 00000000 010 000001111110000000 000000 001223356678999 Q ss_pred hhhhhHHHhhCCCeeeeccccchh--hhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccc Q lcl|NC_019527. 118 ASTLSTELTREGIEITSKDRTKAK--EMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILD 194 (516) Q Consensus 118 Vd~~aed~~r~~~~i~~~~~~~~~--~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld 194 (516) |+++++++-+..+.+.-.+.+... .....+..|...-....-...|.+.+... .++|.+++++..++ T Consensus 59 v~~Ia~~iA~lp~~~~~~~~~g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~---------- 128 (454) T protein:vir:93 59 ISLISQDIAKMRLRLMQTDAQGIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNA---------- 128 (454) T ss_pred HHHHHHhhccCceEEEEeccCCccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECC---------- Confidence 999999999999988543322211 11111223333333344455677666655 66788888875432 Q ss_pred cccccccceeeEEeecceeeccccccccccccccccCcceeEEe---------eeEeccceEEEecCCcchhhhhhccCC Q lcl|NC_019527. 195 PRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL---------GREMHASRLLTIITRPLPDMLKPAYNF 265 (516) Q Consensus 195 ~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~---------g~~iH~SRli~~~~~~~p~~~k~~~~~ 265 (516) .|.+.+|.+++|..|++.... | |. -.|++. ...+.++.||||.... ..... T Consensus 129 -----~G~~~~L~~i~~~~v~v~~~~--~------g~-~~y~~~~~~~~~~~~~~~~~~~eViH~k~~~------~~~~~ 188 (454) T protein:vir:93 129 -----RGQIKELRILDWNRVEPLVAD--D------GE-VFYRITPDRNCGITEAVTVPAREVIHDRFNC------FFHPL 188 (454) T ss_pred -----CCcEEEEEEEcCcceEEEEcC--C------Cc-EEEEEEeccccccceeEEecCcceEEeccCC------CCCCc Confidence 245667888998888754221 1 11 123332 2357889999996432 23456 Q ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcce Q lcl|NC_019527. 266 SGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDI 342 (516) Q Consensus 266 ~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~ 342 (516) .|.|.++.+.+.|.....+......++.+.... +++++ ..++....+++.+.++..... .|.| +.++++ +-+| T Consensus 189 ~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--~~l~~e~~~~~~~~~~~~~~g-~n~g~~~vl~~-g~~~ 264 (454) T protein:vir:93 189 IGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIP--GSITEENAKKLKSNWDSGYTG-ENAGKTAILSN-GAKY 264 (454) T ss_pred eeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecC--CCCCHHHHHHHHHHHHHHhcc-cccCCceeccC-CceE Confidence 799999999999999999999999988886653 45554 345444445566666555444 3455 556654 5789 Q ss_pred eEEecccCCH--HHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 343 VQVNTPLSGL--ADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSK 420 (516) Q Consensus 343 e~~~~~lsgl--~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~ 420 (516) ++++.+..+. -+........||.+.+||..+| |...++..++.+...+.| -+..|.|.+..+-..+.+.. T Consensus 265 ~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~sn~e~~~~~f-------~~~~l~P~~~~ie~~ln~~L 336 (454) T protein:vir:93 265 NPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKI-GVGQPPSSDNVEALEQQY-------YSQCLQTLIESIELLLDEAL 336 (454) T ss_pred EEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCCcchhHHHHHHHH-------HHHHHHHHHHHHHHHHHHhh Confidence 8887665433 3455577789999999998655 654334333333333333 33457888888877776654 Q ss_pred CCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCC---------CChhhhccc Q lcl|NC_019527. 421 WGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDN---------IDGDLEIVQ 491 (516) Q Consensus 421 ~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~---------~d~~~e~~~ 491 (516) +-.....++|.++.|...|.+++++ ++.+++++|++|++|+|+.+...+..+-.. ++.-..... T Consensus 337 ~~~~~~~~~f~~~~ll~~D~~~r~~-------~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~~~~~~~~~~~~~~ 409 (454) T protein:vir:93 337 ETGENESTEFDVTTLLRMDSERRMK-------TLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYSLEALSRRDA 409 (454) T ss_pred cCCCCcEEEeechhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccCccchHhhhccCc Confidence 3111123455556787777777654 566789999999999999986554321110 000000000 Q ss_pred cc-cchhcCCCCCCCC--CCCCCCCCCC Q lcl|NC_019527. 492 PE-MFDDDGADPYMPD--PDVLPGEEGS 516 (516) Q Consensus 492 ~e-~~~~e~~~~~~~~--~~~~~~~e~t 516 (516) .+ .....+.+.+.++ ++..++.+.+ T Consensus 410 ~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 437 (454) T protein:vir:93 410 REDPFASSGKTASVPQAVAASDGNKAIT 437 (454) T ss_pred ccCCCCCCccCCCCCCCCCCCCCCCCcc Confidence 00 0000111000000 1011111111 No 33 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=99.89 E-value=6.2e-23 Score=142.32 Aligned_cols=414 Identities=12% Similarity=0.075 Sum_probs=211.8 Q ss_pred HHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHH-HHHHHhCchh Q lcl|NC_019527. 36 RAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQN-LAALATRPEY 114 (516) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~l-l~~y~~~~i~ 114 (516) =.++..+ +.++..+... +. ++ +..-..|.. .. ..++ .+-.|... .....+++-+ T Consensus 1 Mg~~~~l-------~~r~~~~~~~-~~--~~-~~~~~~~~~----~~----~~~~------~~~~g~~V~~~~al~~~~V 55 (457) T protein:vir:13 1 MGFWSAL-------FGRGHSPALD-GI--EA-RAWEPYDPS----IY----NLGA------VAASGETVTPHDALQVSAV 55 (457) T ss_pred Cchhhhh-------hccccccccc-cc--cc-ccccccchH----HH----hhcc------cccCCceechHHhhccHHH Confidence 0111111 0011111100 00 00 000000100 00 0000 00011110 1234456778 Q ss_pred hhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHh----cChhHHHHHHHHhcccceeeEEEEEecCCCcccC Q lcl|NC_019527. 115 RAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEY----YGVMGIIQKAAEHDCFFGRGQISINIKGADVSVP 190 (516) Q Consensus 115 r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~----l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~P 190 (516) .+||+.+|+++-+..+.+.-..++..+... -..+...+.. +...+-++..+.+..++|.++++|.-++ T Consensus 56 ~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~--~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~~------ 127 (457) T protein:vir:13 56 FASVRLLSETIATLPLSTYSKRGGSRKEIV--TPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQG------ 127 (457) T ss_pred HHHHHHHHHhhccCceEEEEecCCcccccc--cchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC------ Confidence 999999999999998887544332221110 1122222222 2223344444455567888888764322 Q ss_pred cccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe--e-----eEeccceEEEecCCcchhhhhhcc Q lcl|NC_019527. 191 LILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL--G-----REMHASRLLTIITRPLPDMLKPAY 263 (516) Q Consensus 191 l~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~--g-----~~iH~SRli~~~~~~~p~~~k~~~ 263 (516) |.+.+|.+++|..|++..... +... +.....|++. + ..+++++|||+.+.. +.. T Consensus 128 ----------g~~~~l~~l~p~~v~v~~~~~-~~~~--~~~~~~y~~~~~~~~~~~~~~~~~diih~~~~~------~~~ 188 (457) T protein:vir:13 128 ----------PNIVGLDVLDPTKIHVHMVMV-DGLR--RKVFEAYDIDADGNEVLLGWFTPRDVLHIPGMM------LPG 188 (457) T ss_pred ----------CcEEEEEEEccCceEEEEecC-CCcc--ceeEEEEEEecCCceeeEEeeCccceEEecCCC------CCC Confidence 233457788888777543211 1111 1111234333 1 347889999997542 122 Q ss_pred CCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCccee Q lcl|NC_019527. 264 NFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIV 343 (516) Q Consensus 264 ~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e 343 (516) .+.|.|.++.+.+.|.....+......++.+....-........++...-+++.++++......+|.+..++..++-+|+ T Consensus 189 ~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~ 268 (457) T protein:vir:13 189 DFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFS 268 (457) T ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEE Confidence 35799999999999999999999999999887765333333344544444556666655555555555444334557899 Q ss_pred EEecccCC--HHHHHHHHHHHHHhhhcCCceeeecccccccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 344 QVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNAS--SEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLS 419 (516) Q Consensus 344 ~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnat--ge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s 419 (516) .++.+..+ +-+.......+||.+.+||-.+| |....+-..+ -+.... ++-+..|.|.++.|...|.+. T Consensus 269 ~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~sn~eq~~~-------~f~~~tl~P~~~~ie~~ln~~ 340 (457) T protein:vir:13 269 KVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLI-SDATNSTSWGSGLAEQNI-------AFTMFSLRPWLERIEAGFNRL 340 (457) T ss_pred EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCcccccchHHHHHH-------HHHHHHHHHHHHHHHHHHHHh Confidence 88766554 34556678889999999998755 7654333211 122222 233345789888888877766 Q ss_pred hCCCcC-Cc--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhcc-C-----CCCC-----Ch Q lcl|NC_019527. 420 KWGEID-DA--ITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDS-G-----WDNI-----DG 485 (516) Q Consensus 420 ~~g~~~-~d--~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~-~-----~~~~-----d~ 485 (516) .+.... .. ++|.+..|...+.+++++. +.+++++|++|++|+|+.++..+-. | +-++ .. T Consensus 341 L~~~~~~~~~~i~fd~~~l~~~D~~~r~~~-------~~~~~~~G~~T~NE~R~~~gl~Pi~~g~~d~~~~~~n~~~~~~ 413 (457) T protein:vir:13 341 LFAETADRFRFVKFNLDEIKRGAPKERMEL-------WSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEVGE 413 (457) T ss_pred hcCccccCceeEEeechhhhccCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeeccccccccc Confidence 654332 23 4455558888888887554 5668999999999999998554321 1 1111 00 Q ss_pred hhhcc----ccccchhcCCCCCCCCCCCCCCCCC---------C Q lcl|NC_019527. 486 DLEIV----QPEMFDDDGADPYMPDPDVLPGEEG---------S 516 (516) Q Consensus 486 ~~e~~----~~e~~~~e~~~~~~~~~~~~~~~e~---------t 516 (516) ..+.. .+...+..+++...+++++.++.++ + T Consensus 414 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~~~ 457 (457) T protein:vir:13 414 EPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGATEEDDEDDA 457 (457) T ss_pred cccccccCCCCCCCCCccccCCCCCCCCCCccccCCCCcccccC Confidence 00000 0000000001111111111111111 1 No 34 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=99.89 E-value=6e-23 Score=142.42 Aligned_cols=396 Identities=11% Similarity=0.057 Sum_probs=220.1 Q ss_pred hhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCc Q lcl|NC_019527. 33 AMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRP 112 (516) Q Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~ 112 (516) |+...+++. ++......| .+ .++...+ ....+....--.. ..+++ T Consensus 1 m~~~~~~~~---~~~~~~~~~------~~-----------~~~~~~~--------------~~~~~g~~v~~~~-al~~~ 45 (419) T protein:vir:57 1 MFIPQFWKG---RPSENRVNW------QV-----------VPGGMRS--------------SSSQAGVIITPET-ALALS 45 (419) T ss_pred Ccchhhhcc---CCccccccc------cc-----------ccccccc--------------ccccCCceechHH-hhccH Confidence 222222211 111000000 00 0000000 0000011111112 23467 Q ss_pred hhhhhhhhhhHHHhhCCCeeeeccccch-h--hhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcc Q lcl|NC_019527. 113 EYRAFASTLSTELTREGIEITSKDRTKA-K--EMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVS 188 (516) Q Consensus 113 i~r~iVd~~aed~~r~~~~i~~~~~~~~-~--~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~ 188 (516) -++++|+.+|+++-+..+.+--..++.. + ........|........-...|.+.+... .++|.+++++.-++ T Consensus 46 ~v~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~---- 121 (419) T protein:vir:57 46 AVRACVTLLAESVAQLPCVLYRRTENGGREIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNG---- 121 (419) T ss_pred HHHHHHHHHHHhhccCceEEEEEcCCCceeccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---- Confidence 7999999999999998888733222221 1 11112233333444455556666665555 56788888874321 Q ss_pred cCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEE--eeeEeccceEEEecCCcchhhhhhccCCC Q lcl|NC_019527. 189 VPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV--LGREMHASRLLTIITRPLPDMLKPAYNFS 266 (516) Q Consensus 189 ~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v--~g~~iH~SRli~~~~~~~p~~~k~~~~~~ 266 (516) .|.+.+|.+++|.+|++.... | |.+ +|++ .++.++.+.|+|+.+.. ...++ T Consensus 122 -----------~G~~~~L~pl~~~~v~v~~~~--~------g~~-~y~~~~~~~~~~~~~vih~r~~~-------~d~~~ 174 (419) T protein:vir:57 122 -----------RGDITELIPINPHKVIVLKGP--D------GMP-YYDIPSIGEILPMRMVHHIKSFS-------LDGYI 174 (419) T ss_pred -----------CCcEEEEEEEcCcceEEEECC--C------ceE-EEEEcCCceEEchhhEEEecCcC-------CCCcc Confidence 245667899999888753211 1 222 4555 35678999999997543 23567 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeec--chhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcc Q lcl|NC_019527. 267 GISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTN--MAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSED 341 (516) Q Consensus 267 G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~--~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~ 341 (516) |.|.++.+...+.....+......++.+.... +++++ ....++...-+++.+.+......-.|.+ +.+++ ++.+ T Consensus 175 G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~-~g~~ 253 (419) T protein:vir:57 175 GTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQ-EGMT 253 (419) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHHhccccccccceecC-CCce Confidence 99999999999999999999999988886654 34442 1222222222334444433333333444 44554 4578 Q ss_pred eeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 342 IVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLS 419 (516) Q Consensus 342 ~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s 419 (516) |+.++.+... +-+......++||.+.+||..+| |...++-.++.|.....||+ ..|.|.++.+...|-+. T Consensus 254 ~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~sn~e~~~~~f~~-------~~l~P~~~~ie~~l~~~ 325 (419) T protein:vir:57 254 YKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMI-QDLQKSTNNNIEHQGLQYVI-------YTMLAILKRHESAMMRD 325 (419) T ss_pred EEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCccccHHHHHHHHHH-------HHHHHHHHHHHHHHHhh Confidence 8888766554 34566777789999999998766 44444555555656666654 34788888888777665 Q ss_pred hCCCcC-CcceEE--eCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc-ccc Q lcl|NC_019527. 420 KWGEID-DAITFK--FKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP-EMF 495 (516) Q Consensus 420 ~~g~~~-~d~~~~--f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~-e~~ 495 (516) .+.... .++.|+ +..|...|.+++++ ++..++++|++|++|+|+.++. ++++...+...+ ... T Consensus 326 ll~~~~~~~~~i~fd~~~ll~~d~~~~~~-------~~~~~~~~G~~T~NE~R~~~gl------~p~~ggD~~~~~~n~~ 392 (419) T protein:vir:57 326 LLLPSERRDFYIEFNVSSLLRGDQKSRYE-------SYALGRQWGWLSVNDIRRMENL------TPIPGGDKYLTPLNMV 392 (419) T ss_pred ccCccccCCeEEEEechhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCC------CCCCCcCeeeeccccc Confidence 543211 244555 45788888888765 4566899999999999999844 444221111111 011 Q ss_pred hhcC-CCCCCCCCCCCCCCCCC Q lcl|NC_019527. 496 DDDG-ADPYMPDPDVLPGEEGS 516 (516) Q Consensus 496 ~~e~-~~~~~~~~~~~~~~e~t 516 (516) +.+. .+...+.++.+++.++. T Consensus 393 ~~~~~~~~~~~~~~~~~~~~~~ 414 (419) T protein:vir:57 393 DSKALTGIGKATPQQLKDIEAI 414 (419) T ss_pred cccccccccCCCcccCcchhhh Confidence 1111 11112334455555554 No 35 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=99.89 E-value=4.4e-23 Score=143.14 Aligned_cols=393 Identities=14% Similarity=0.114 Sum_probs=210.8 Q ss_pred ccccccchhhhcccccCCcccccccCcccHHH-HHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHH Q lcl|NC_019527. 73 MDSLCGPTYQFLNSAAGGLYAADIQPFPGYQN-LAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELE 151 (516) Q Consensus 73 ~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~l-l~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~ 151 (516) |-+...+. ++ +..+.+..++.+ -+.|..++.+.++|+.+++++-+..+.+...+. +.........-|. T Consensus 1 ~~~~~~~~--------g~--~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~-~~~~~~~l~~lL~ 69 (723) T protein:vir:94 1 MTTFPSGA--------GG--WNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDG-ELDELHPLSQLWN 69 (723) T ss_pred CcccccCC--------Cc--cccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCC-ccchhhHHHHHHh Confidence 22221110 00 000111111111 245678899999999999999998888754322 2222222223333 Q ss_pred HHHHhcChhHHHHHHHHhcc-cceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeecccccccccccccccc Q lcl|NC_019527. 152 EACEYYGVMGIIQKAAEHDC-FFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFY 230 (516) Q Consensus 152 ~~~~~l~~~~~l~ea~~~~r-lyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg 230 (516) ..-....-...|.+.+.... ++|.+++++..++.+.. |....+.++.+..+........+.....+ T Consensus 70 ~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~------------g~p~~l~~l~~~~~~v~~~~~~~~~~~~~- 136 (723) T protein:vir:94 70 VMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPA------------GVPDEIWYVYDRVTTIVATRAADAVPQAQ- 136 (723) T ss_pred hCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccc------------cceeEEEEecCcceEEeecCCCccceeee- Confidence 33333445566777777654 67889988876554321 12234555555444332222222211111 Q ss_pred CcceeEE---eee--EeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCC--ceeeec Q lcl|NC_019527. 231 KPSTWWV---LGR--EMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSR--TFLKTN 303 (516) Q Consensus 231 ~P~~y~v---~g~--~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~--~v~k~~ 303 (516) ...|.+ .|. .++.+.||||.+.- +...+.|.|.++.+.+.|.....+......++.+... .+++++ T Consensus 137 -~~~y~~~~~~G~~~~~~~~dIiHir~~~------~~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~ 209 (723) T protein:vir:94 137 -IIGYVIERTDGVRVPVLADEMLWLRFSD------PYDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVNLG 209 (723) T ss_pred -eeEEEEEecCceeEEecccceEEecCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC Confidence 112333 343 57889999997542 2345689999999999999999999988888887654 344443 Q ss_pred chhhhcCccHHHHHHHHHHHHHhcCCcceEEEe-c---------CCcceeEEecccCC--HHHHHHHHHHHHHhhhcCCc Q lcl|NC_019527. 304 MAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMD-F---------DSEDIVQVNTPLSG--LADLQSQSQEHMCSVSKIPA 371 (516) Q Consensus 304 ~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id-~---------~~e~~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~ 371 (516) + ++....+++.++++.......|.|..++. + ++-+|+.++.+..+ +-+...+..+.||.+.+||- T Consensus 210 --~-l~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp 286 (723) T protein:vir:94 210 --D-MDEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRK 286 (723) T ss_pred --C-CCHHHHHHHHHHHHHHhhchhhcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCh Confidence 1 33333334444444333334565644432 2 23467766654333 23445666788999999998 Q ss_pred eeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCC--CCCCCHHHHHHHHHH Q lcl|NC_019527. 372 IKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKS--LWQTSAKEESEIRFN 449 (516) Q Consensus 372 t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~p--L~~~sekEkAei~~~ 449 (516) .+|.|.+ .+++.+.....||. ..|.|.++.|-..|-+..+...-.+++|+|+. |...|.++++ T Consensus 287 ~~i~~~s---t~sN~e~~~~~f~~-------~tL~P~~~~ie~~ln~~Ll~~~g~~~~~~f~~~~lLr~D~~~r~----- 351 (723) T protein:vir:94 287 DALLGGS---TYENQAEAKAAVWT-------ETLIPQMEVMASITDLQLLPDIGWTVEWDFNSVPALQEDLEAQA----- 351 (723) T ss_pred hHcCCCC---CcccHHHHHHHHHH-------HHHHHHHHHHHHHHhHhhcccccCceEEeecchhhhhcCHHHHH----- Confidence 8776643 22333444444543 44788888888877765553333467888885 5666766654 Q ss_pred HHHHHHHHHHcCCCCHHHHHHHHHhhhcc-CC-----CCCChhhhcc-ccccchhcC------------CCCCCCC---- Q lcl|NC_019527. 450 KAQEAQIYITNSVIDPSEARQQLSDDPDS-GW-----DNIDGDLEIV-QPEMFDDDG------------ADPYMPD---- 506 (516) Q Consensus 450 ~a~a~~~~~~~gvi~~~e~r~~l~~~~~~-~~-----~~~d~~~e~~-~~e~~~~e~------------~~~~~~~---- 506 (516) +++..++++|++|++|+|+.++..+.. |- .+........ .+.+...++ .+.|.++ T Consensus 352 --~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~~~~~p~~~~~a~~~~~~p~~~e~~~~~~~~~~~~~~~~p~~~~~~~ 429 (723) T protein:vir:94 352 --GRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQMTLTPYRAQFAPAPAPAPAVEEGAARMLALLERVAADRPLPELPVR 429 (723) T ss_pred --HHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeccccccccCCCCCCccchhhhHhhhhhccccccccCcCCCCCC Confidence 466778999999999999998654321 11 1110000000 000000000 0001100 Q ss_pred ------CCCCCCCCCC Q lcl|NC_019527. 507 ------PDVLPGEEGS 516 (516) Q Consensus 507 ------~~~~~~~e~t 516 (516) .++-|..+.| T Consensus 430 ~~~~~~~~~~~~~~~~ 445 (723) T protein:vir:94 430 ATTVLHHDPGPDPQQT 445 (723) T ss_pred CCCCCCCCcccCCchh Confidence 1111222222 No 36 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=99.89 E-value=8.3e-23 Score=141.65 Aligned_cols=412 Identities=11% Similarity=0.028 Sum_probs=210.1 Q ss_pred HHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhh Q lcl|NC_019527. 36 RAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYR 115 (516) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r 115 (516) =.++..+ ++ ...+|. ...+..... +....... ..++..+ +...+. ...+.+++.+. T Consensus 1 Mg~~~~l-~~------~~~~~~-----~~~~~~~~~---~~~~~~~~----~~~~~~~----~g~~v~-~~~al~~~~v~ 56 (457) T protein:vir:62 1 MGFWSAL-FG------RGHSPA-----LDAAEGRAW---EPYDPSIY----NLGATAS----SGERVT-PHDALQVSAVF 56 (457) T ss_pred Cchhhhh-hc------cccccc-----ccccccccc---ccchhhhh----hcccccc----CCceec-hHHhhccHHHH Confidence 0111111 00 000110 000100000 00000000 0000000 000000 12344578899 Q ss_pred hhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhc---ChhHHHHHHHHh-cccceeeEEEEEecCCCcccCc Q lcl|NC_019527. 116 AFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYY---GVMGIIQKAAEH-DCFFGRGQISINIKGADVSVPL 191 (516) Q Consensus 116 ~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l---~~~~~l~ea~~~-~rlyG~a~i~i~i~~~~~~~Pl 191 (516) ++|+++++++-+..+.+.-..++..+... -..+...+.+. .-+..|.+.+.+ ..++|.|++++.-++ T Consensus 57 ~~i~~ia~~iA~lp~~~~~~~~~~~~~~~--~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~~------- 127 (457) T protein:vir:62 57 ASVRLLSETIATLPLSTYSKRGGTRKEID--TPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWAG------- 127 (457) T ss_pred HHHHHHHHhHhhCceEEEEecCCcccccc--chHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCC------- Confidence 99999999999999988544332211110 01122222222 224444445444 566788888763221 Q ss_pred ccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe--e-----eEeccceEEEecCCcchhhhhhccC Q lcl|NC_019527. 192 ILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL--G-----REMHASRLLTIITRPLPDMLKPAYN 264 (516) Q Consensus 192 ~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~--g-----~~iH~SRli~~~~~~~p~~~k~~~~ 264 (516) |.+.+|.++.|..|+...... +... +..-..|.+. | ..++++.||||.+.. +... T Consensus 128 ---------g~~~~l~~l~p~~v~v~~~~~-~~~~--~~~~~~y~~~~~g~~~~~~~~~~~eiih~r~~~------~~~~ 189 (457) T protein:vir:62 128 ---------PNIAGLDVLDPTKIHVHMVMV-DGLR--RKVFEAYDIDADGNEVLLGWFTPRDVLHIPGMM------LPGD 189 (457) T ss_pred ---------CcEEEEEEEcCcceEEEEecc-CCcc--ceeEEEEEEccCCceeEEEeeCccceEEecCCC------CCCc Confidence 234567888888776532211 1111 1111123333 1 357899999997542 2223 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcc Q lcl|NC_019527. 265 FSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSED 341 (516) Q Consensus 265 ~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~ 341 (516) +.|.|.++.+.+.|.....+......++++.... +++++ ..++.+..+++.+.++.......|.+ +.+++ ++-+ T Consensus 190 ~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--~~ls~e~~~~~~~~~~~~~~G~~nag~~~vl~-~g~~ 266 (457) T protein:vir:62 190 FVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVP--GTMSEEGLARAREAWRAANSGVDNAHRVALLT-EGAK 266 (457) T ss_pred eecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcC--CCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCce Confidence 6799999999999999999999999998887664 34443 34544433445555554444444555 45555 4578 Q ss_pred eeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeecccccccc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 342 IVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQ 417 (516) Q Consensus 342 ~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~ 417 (516) |+.++.+..+ +-+.......+||.+.+||-.+| |....+-. +.-+.....||. ..|.|.++.+-..|- T Consensus 267 ~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~sn~eq~~~~f~~-------~~l~P~~~~ie~~ln 338 (457) T protein:vir:62 267 FSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLI-SDATNSTSWGSGLAEQNIAFTM-------FSLRPWLERIEAGFN 338 (457) T ss_pred EEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCcccccchHHHHHHHHHH-------HHHHHHHHHHHHHHH Confidence 9988766554 34566678889999999998655 76544332 212333334433 347888888877776 Q ss_pred HHhCCCcC-Cc--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccC------CCCCCh--h Q lcl|NC_019527. 418 LSKWGEID-DA--ITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSG------WDNIDG--D 486 (516) Q Consensus 418 ~s~~g~~~-~d--~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~------~~~~d~--~ 486 (516) +..+.... .. ++|.++.|...|.+++++. +.+++++|++|++|+|+.+...+-.+ +-++.. . T Consensus 339 ~~L~~~~~~~~~~i~fd~~~l~~~d~~~r~~~-------~~~~~~~G~~T~NE~R~~~gl~pi~~g~~D~~~~~~n~~~~ 411 (457) T protein:vir:62 339 RLLFAETADRFRFVKFNLDEIKRGAPKERMEL-------WSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEI 411 (457) T ss_pred hhhcCccccCceEEEeechhhhccCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccccccc Confidence 65543322 23 4455558888888887654 45689999999999999986533211 111100 0 Q ss_pred hhccccccchhc-------CCCCCCCC---CCCCCCCCCC Q lcl|NC_019527. 487 LEIVQPEMFDDD-------GADPYMPD---PDVLPGEEGS 516 (516) Q Consensus 487 ~e~~~~e~~~~e-------~~~~~~~~---~~~~~~~e~t 516 (516) ....+.+..... +++.+..+ .+..|.++.| T Consensus 412 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 451 (457) T protein:vir:62 412 GEEPEPEPAPAPPAIDPPAEEPADDEEPDNAEGDPDEGET 451 (457) T ss_pred cccccccccCCCccCCCCccCCCCCCCCCCCCCCCccccc Confidence 000000000000 00000111 1112222222 No 37 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=99.89 E-value=1.7e-22 Score=139.93 Aligned_cols=444 Identities=14% Similarity=0.082 Sum_probs=225.6 Q ss_pred hhhhHHHHh--HHhhcCCC---ccccccCCCCC----CCccCCCccchhcccccccchhhhcccccCCccc-ccccCccc Q lcl|NC_019527. 32 LAMRRAVMK--SMERRASD---AATKWAPPQLM----PGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYA-ADIQPFPG 101 (516) Q Consensus 32 ~~~~~~~~~--~~~~~~~~---~~~~~~~~~~~----~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~-~~~~~f~g 101 (516) +++...+-+ ++-.|+.. +..+|+-..+- ||+.. .+......|.+.+.. .|+.. .......+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~--------~g~~~~~~~~~~~~ 71 (535) T protein:vir:10 1 MAILKDLRNAFSLSNKKSTSYIELGDYDKDIVNKAIRPGRAS-ARDTVDGIDIADGNV--------AGQYSVASISDVLS 71 (535) T ss_pred ChhhHHHHHHHHhhhhhhhhhHHHhhhhHHHHHhhhhhhhhh-hhccccccccccCCc--------ccccccCccccccC Confidence 333333322 12222211 11222222111 12111 000011112111111 11110 01111223 Q ss_pred H-HHHHHHHhCchhhhhhhhhhHHHhh-----------CCCeeeecccc--chhhhHHHHHHHHHHHH----hcC----h Q lcl|NC_019527. 102 Y-QNLAALATRPEYRAFASTLSTELTR-----------EGIEITSKDRT--KAKEMASKIKELEEACE----YYG----V 159 (516) Q Consensus 102 y-~ll~~y~~~~i~r~iVd~~aed~~r-----------~~~~i~~~~~~--~~~~~~~~i~~i~~~~~----~l~----~ 159 (516) + ++++.|..++++++||++.++.++. .++.|...+.+ ...........+...+. .+. . T Consensus 72 ~~~l~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~ 151 (535) T protein:vir:10 72 TKKLLKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDT 151 (535) T ss_pred HHHHHHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHH Confidence 3 5788899999999999999988662 23444332211 11111111222333332 222 3 Q ss_pred hHHHHHHHH-hcccceee-EEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEE Q lcl|NC_019527. 160 MGIIQKAAE-HDCFFGRG-QISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV 237 (516) Q Consensus 160 ~~~l~ea~~-~~rlyG~a-~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v 237 (516) +..|.+.+. ...++||. ++++.-+ ..|.+.+|.+++|.+|.+.. |+..-+ +-+.+|++ T Consensus 152 ~~~~~~~lv~d~l~~~g~ay~~i~r~---------------~~G~~~~L~~l~p~~V~v~~----d~~~~~-~~~~~~~~ 211 (535) T protein:vir:10 152 FPRLLTKIINDMYVQDQINIERIFKN---------------DSNELDHFNAVDASKVVISY----SPRSKD-QPRKFEQF 211 (535) T ss_pred HHHHHHHHHHHHHhhCCceEEEEEEC---------------CCCcEEEEEEeCCceeEEEE----cCcccc-CceEEEEE Confidence 334554444 44667654 5555332 12446678899998887532 221111 11233443 Q ss_pred e----eeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecc--hhhhc Q lcl|NC_019527. 238 L----GREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNM--AQVLN 309 (516) Q Consensus 238 ~----g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~--~~~l~ 309 (516) . +..+.++.||||...+.++. ...++|+|.++.+.+.|.....+......++.+...+ +++++. ...++ T Consensus 212 ~~~~~~~~~~~~eiih~~~~~~~~~---~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls 288 (535) T protein:vir:10 212 VSETKSVKFSERNLTFINYWNLSDT---DRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQAN 288 (535) T ss_pred ecCceeEEECcccEEEEeccCCCCc---ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccC Confidence 3 34689999999987664432 2345699999999999999999999999999986653 555532 22333 Q ss_pred CccHHHHHHHHHHHHHhcCCcceE-EEecCCcceeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccc Q lcl|NC_019527. 310 GGEGGDVFDRVEMYVNMQSNLGLA-VMDFDSEDIVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASS 386 (516) Q Consensus 310 ~~~~~~l~~r~~~~~~~~sn~g~~-~id~~~e~~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatg 386 (516) ...-+.+.+.++....+.+|.+.+ ++.+++-+|+.++.+..+ +-+...+....||.+.+||-.+| |..-.+--+.. T Consensus 289 ~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~l-G~~~~at~sn~ 367 (535) T protein:vir:10 289 QMMLAGIRRQWTSQGSGLGGAWKIPILAAKDAKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEI-NFPNNGGSTGK 367 (535) T ss_pred HHHHHHHHHHHHHHhcCcccccccccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-ccccCcccccc Confidence 333334444444444444566654 554445677777665544 34445667789999999998666 66533332333 Q ss_pred h-HHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_019527. 387 E-GEIRSFYDDISSVQQ----SYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNS 461 (516) Q Consensus 387 e-~D~~~yyd~I~~~Qe----~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g 461 (516) + .....|.+.++..+. ..|.|.+..+-..|-+..+.....++.|+|+.|...+.++++++.+ . ...| T Consensus 368 ~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~~~~~~~f~f~~l~~~d~~~r~~~~~-------~-~~~g 439 (535) T protein:vir:10 368 SGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVINDKIMRYVDTDYRFSFTLGDAQDKLQEEQVWK-------L-KLAN 439 (535) T ss_pred hhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccCCeEEEEeccccccCHHHHHHHHH-------H-HHcC Confidence 3 234445555555444 3478888888777766555445567999999999999888766532 2 2357 Q ss_pred CCCHHHHHHHHHhhhccCCC----CC--------Chhhhccccccch--------h-----c--------C-CCCCCCCC Q lcl|NC_019527. 462 VIDPSEARQQLSDDPDSGWD----NI--------DGDLEIVQPEMFD--------D-----D--------G-ADPYMPDP 507 (516) Q Consensus 462 vi~~~e~r~~l~~~~~~~~~----~~--------d~~~e~~~~e~~~--------~-----e--------~-~~~~~~~~ 507 (516) ++|++|+|+.+...+.-+-. .+ ....+...++..+ . + + .+|.-+.+ T Consensus 440 ~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~ 519 (535) T protein:vir:10 440 GYFINEYRKDHGLKTVDGLDVPGFIGSAENFINATGFGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPKSPLP 519 (535) T ss_pred CCCHHHHHHHhCCCCCCCccccccccchhhcccccccccccCCCCCCCccccCCccccCcccccccccccCCCCCCCCCC Confidence 89999999998654321100 00 0000000000000 0 0 0 01110111 Q ss_pred CCCCCCCCC Q lcl|NC_019527. 508 DVLPGEEGS 516 (516) Q Consensus 508 ~~~~~~e~t 516 (516) .+..+.+.| T Consensus 520 ~~~~~~~~~ 528 (535) T protein:vir:10 520 KPSESDDVS 528 (535) T ss_pred cCCCCCccc Confidence 111111222 No 38 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=99.89 E-value=8.9e-23 Score=141.48 Aligned_cols=405 Identities=12% Similarity=0.109 Sum_probs=214.2 Q ss_pred CCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHH Q lcl|NC_019527. 26 QEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNL 105 (516) Q Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll 105 (516) -...|.+.+...+ + ..|.+|.+ + ..+.....+-.+. .... . +...+ .+.... .. T Consensus 1 ~~~~~~~~~~~~~-~----------~~~~~~~~---~-~~~~~~~~~~~~~---~~~~---~-~~~~s---~~g~~v-~~ 54 (432) T protein:vir:10 1 MPDEKKLGLLGQL-K----------AMFVPPDP---V-DIGGGQTFTPVNA---TARD---L-GIIIS---DTGAAV-NA 54 (432) T ss_pred CCCCcccchhhhh-H----------hhcCCccc---c-ccccccccccCcc---hhhh---h-ccccc---ccCccc-ch Confidence 0111111111100 1 11222211 0 0000000000000 0000 0 00000 000000 12 Q ss_pred HHHHhCchhhhhhhhhhHHHhhCCCeeeeccccch-h-hhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEe Q lcl|NC_019527. 106 AALATRPEYRAFASTLSTELTREGIEITSKDRTKA-K-EMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINI 182 (516) Q Consensus 106 ~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~-~-~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i 182 (516) ..+.+++.+.++|+.+++++-+..+.+.-.+.+.. + ........|..+-....-+..|.+.+... .|+|.|++++.. T Consensus 55 ~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~ 134 (432) T protein:vir:10 55 DAIMRLDAVAACVKLVSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVV 134 (432) T ss_pred hhhhcchHHHHHHHHHHHhhhhCceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 33557788999999999999999988743322211 1 11222333444455555666677766655 678999887644 Q ss_pred cCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEE---ee--eEeccceEEEecCCcchh Q lcl|NC_019527. 183 KGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV---LG--REMHASRLLTIITRPLPD 257 (516) Q Consensus 183 ~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v---~g--~~iH~SRli~~~~~~~p~ 257 (516) ++ |.+.+|.+++|.+|++.. |.. |.. .|++ .| ..++++.|+|+.+.. T Consensus 135 ~~----------------g~~~~L~~l~~~~v~v~~----~~~----g~~-~y~~~~~~g~~~~~~~~~iih~~~~~--- 186 (432) T protein:vir:10 135 TD----------------GRIESLQYLANDRLTITT----DTK----GNT-AYRYRRTDGQMIDIPKQQIWKIMGYS--- 186 (432) T ss_pred cC----------------CcEEEEEEEcCCceEEEE----cCC----CcE-EEEEEecCceEEEEcCccEEEecCCC--- Confidence 32 234568888888887542 111 222 2332 23 368999999997643 Q ss_pred hhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEE Q lcl|NC_019527. 258 MLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVM 335 (516) Q Consensus 258 ~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~i 335 (516) ...+.|.|.++.+.+.|.....+......++.+.... +++++ ..++.+..+++.++ +....+..++.++ T Consensus 187 ----~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~--~~l~~e~~~~~~~~---~~~~~nag~~~vl 257 (432) T protein:vir:10 187 ----LDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID--RFLTDDQYDSFAKK---VSGSVEAGRAPLL 257 (432) T ss_pred ----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC--CCCCHHHHHHHHHH---HhhhhhCCCceec Confidence 2345799999999999999999998888888876654 34443 33443332333333 3333333456666 Q ss_pred ecCCcceeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccc---hHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 336 DFDSEDIVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASS---EGEIRSFYDDISSVQQSYYFSPLD 410 (516) Q Consensus 336 d~~~e~~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatg---e~D~~~yyd~I~~~Qe~~l~p~l~ 410 (516) ++ +.+|++++.+..+ +-+.......+||.+.+||..+| |....|-.+++ |.....|| +..|.|.++ T Consensus 258 ~~-g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~l-g~~~~~t~~~~sn~e~~~~~f~-------~~tl~P~~~ 328 (432) T protein:vir:10 258 EG-GMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMI-GHSSAGTTSWGSGIESQQLGFL-------SMTLSPWLR 328 (432) T ss_pred CC-CceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHc-CCccCCcccccchHHHHHHHHH-------HHHHHHHHH Confidence 55 5789988876554 34556788889999999998655 65544444333 22233333 235778888 Q ss_pred HHHHHHHHHhCCCcC-CcceEEe--CCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh Q lcl|NC_019527. 411 TMLKVIQLSKWGEID-DAITFKF--KSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL 487 (516) Q Consensus 411 ~l~~~l~~s~~g~~~-~d~~~~f--~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~ 487 (516) .+...|-+..+.... ..+.|+| +.|...|.+++++ ++++++++|++|++|+|+.+. +++++.+. T Consensus 329 ~ie~~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~-------~~~~~~~~G~~T~NE~R~~~g------lppi~g~~ 395 (432) T protein:vir:10 329 RIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSS-------YYSQLVNNGLMTRDEAREIEG------LPKLGGNA 395 (432) T ss_pred HHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHH-------HHHHHHhCCCCCHHHHHHHhC------CCCCCCCc Confidence 887777665543322 2345555 4787788888655 556689999999999999984 44443221 Q ss_pred hcc-c-----c--ccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 488 EIV-Q-----P--EMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 488 e~~-~-----~--e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) ... . + ...+....++..++ ++..+.+.+ T Consensus 396 ~~~~~~~~~~pl~~~~~~~~~~~~~~~-~~~~~~~~~ 431 (432) T protein:vir:10 396 AVLTVQSAMVPLDSIGLQASPEPASGL-GNQQQDKVS 431 (432) T ss_pred ceEeecCcccchhhhcccCCCCCCCCC-CCccccccc Confidence 110 0 0 00000011111111 111111111 No 39 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=99.89 E-value=8e-23 Score=141.73 Aligned_cols=406 Identities=11% Similarity=0.096 Sum_probs=213.8 Q ss_pred CCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHH Q lcl|NC_019527. 26 QEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNL 105 (516) Q Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll 105 (516) -+.-+.|.++..+- ..|.++.. | ..+.....+--+. ... ..+...+ .+.... .. T Consensus 1 ~~~~~~mg~f~r~~-----------~~~~~~~~---~-~~~~~~~~~~~~~------~~~-~~~~~~~---~~g~~v-~~ 54 (432) T protein:vir:81 1 MPDEKKLGLFGQLK-----------AMFVPPDP---V-DIGGGQTFTPVNA------TAR-DLGIIIS---DTGAAV-NA 54 (432) T ss_pred CCchhhcchhhhhh-----------hhcccccc---c-ccccccccccCcc------chh-hhccccc---ccCccc-ch Confidence 11112222221110 01222210 0 0010000100000 000 0000000 001011 12 Q ss_pred HHHHhCchhhhhhhhhhHHHhhCCCeeeeccc-cchh-hhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEe Q lcl|NC_019527. 106 AALATRPEYRAFASTLSTELTREGIEITSKDR-TKAK-EMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINI 182 (516) Q Consensus 106 ~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~-~~~~-~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i 182 (516) ..+.+++.+.+||+.+|+++-+..+.+--..+ +..+ ........|...-........|.+.+... .++|.|++++.. T Consensus 55 ~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~ 134 (432) T protein:vir:81 55 DAIMRLDAVAACVKLVSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVV 134 (432) T ss_pred HhhhccHHHHHHHHHHHHhhhhCceeeEEecCCcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 33557788999999999999999988732222 1111 11122333444444455556677666655 667888877644 Q ss_pred cCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEE---ee--eEeccceEEEecCCcchh Q lcl|NC_019527. 183 KGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV---LG--REMHASRLLTIITRPLPD 257 (516) Q Consensus 183 ~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v---~g--~~iH~SRli~~~~~~~p~ 257 (516) ++ |.+..|.+++|.+|++.. |+. |.+ .|.+ .| ..++++.|+||.+.. T Consensus 135 ~~----------------g~~~~L~~l~~~~v~v~~----~~~----g~~-~y~~~~~~g~~~~~~~~~iih~r~~~--- 186 (432) T protein:vir:81 135 TD----------------GRIESLQYLANDRLTITT----DPK----GNT-AYRYRRTDGQMIDIPKQQIWKIMGYS--- 186 (432) T ss_pred cC----------------CcEEEEEEEcCCceEEEE----CCC----CcE-EEEEEecCceEEEEccccEEEecCCC--- Confidence 32 234567888888887532 221 221 2333 23 368999999997643 Q ss_pred hhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEE Q lcl|NC_019527. 258 MLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVM 335 (516) Q Consensus 258 ~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~i 335 (516) ...+.|+|.++.+.+.|.....+......++.+.... +++++ ..++.+.-+++.++ +....+..+++++ T Consensus 187 ----~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~--~~l~~e~~~~~~~~---~~~~~nag~~~vl 257 (432) T protein:vir:81 187 ----LDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID--RFLTDDQYDSFAKK---VSGSVEAGRAPLL 257 (432) T ss_pred ----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC--CCCCHHHHHHHHHH---HhhhhcCCCceec Confidence 2345799999999999999999999999888876654 44543 33433222233333 3333333446666 Q ss_pred ecCCcceeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccc---hHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 336 DFDSEDIVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASS---EGEIRSFYDDISSVQQSYYFSPLD 410 (516) Q Consensus 336 d~~~e~~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatg---e~D~~~yyd~I~~~Qe~~l~p~l~ 410 (516) ++ +.+|++++.+..+ +-+......++||.+.+||-.+| |....|-.+++ |.....||. ..|.|.+. T Consensus 258 ~~-g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~~sn~eq~~~~f~~-------~tl~P~~~ 328 (432) T protein:vir:81 258 EG-GMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMI-GHSSAGTTSWGSGIESQQLGFLT-------MTLSPWLR 328 (432) T ss_pred CC-CceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHc-CCcCCccccccchHHHHHHHHHH-------HHHHHHHH Confidence 65 5789988876654 34566788899999999998655 66554444333 333344432 35788888 Q ss_pred HHHHHHHHHhCCCcC-CcceEE--eCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh Q lcl|NC_019527. 411 TMLKVIQLSKWGEID-DAITFK--FKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL 487 (516) Q Consensus 411 ~l~~~l~~s~~g~~~-~d~~~~--f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~ 487 (516) .+-.-|-+..+.... ..+.|+ ++.|...|.++++ +++.+++++|++|++|+|+.+.. ++++.+. T Consensus 329 ~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~-------~~~~~~~~~G~~t~NE~R~~~gl------pp~~g~~ 395 (432) T protein:vir:81 329 RIEQSIALNLLSPAERRRYFADFDTSALLRADSAARS-------SYYSQLVNNGLMTRDEAREIEGL------PKLGGNA 395 (432) T ss_pred HHHHHHHhhccCccccCceEEEeechhhhccCHHHHH-------HHHHHHHhCCCCCHHHHHHHhCC------CCCCCCc Confidence 887777665543321 234444 4578888887764 45567899999999999999844 4443221 Q ss_pred hcc---cc----ccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 488 EIV---QP----EMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 488 e~~---~~----e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) +.. .. +.......+.++...++..+.+-+ T Consensus 396 ~~~~~~~~~~pl~~~~~~~~~~~~~~~~n~~~~~~~ 431 (432) T protein:vir:81 396 AVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKVS 431 (432) T ss_pred ceEeecCcccchhhhccCCCCCCCCCCCCccccccc Confidence 110 00 000000011111111111111111 No 40 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=99.89 E-value=3.6e-23 Score=143.63 Aligned_cols=392 Identities=13% Similarity=0.109 Sum_probs=212.5 Q ss_pred CCCccCCCccchhcccccccchh-----------hhcccc-cCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHh Q lcl|NC_019527. 59 MPGVVPAGTTPAVAMDSLCGPTY-----------QFLNSA-AGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELT 126 (516) Q Consensus 59 ~~gv~~~~~~~~~a~ds~~~~~~-----------~~~~~~-~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~ 126 (516) |||+-. .+.-+-|+=|..... .++... .....+.......+ +..+..++.+.+||+.+|.++- T Consensus 1 ~~~~~~--~~~~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~v~~cI~~ia~~ia 75 (413) T protein:vir:96 1 MPGVSE--IRKDKNLKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFFKELISDG---YTKLSDSPEVRMAVDCIADLVS 75 (413) T ss_pred CCccch--hhhhhcCCccccCCCcchhhhhhccccccccccccchhhHhhhccch---hHHHhhchHHHHHHHHHHHhhc Confidence 233211 011111111100000 000000 00000000000011 2235568999999999999999 Q ss_pred hCCCeeeecccc-chhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCccccccccccccee Q lcl|NC_019527. 127 REGIEITSKDRT-KAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLT 204 (516) Q Consensus 127 r~~~~i~~~~~~-~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~ 204 (516) +..+.+-..+++ ...........|........-+..|.+.+.+. .++|.+++++.-++. .+.+. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~--------------g~~~~ 141 (413) T protein:vir:96 76 NMTIQLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVS--------------GDKII 141 (413) T ss_pred cCceEEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC--------------CCceE Confidence 999988433322 22222222333333344444556666666655 567888888754321 12345 Q ss_pred eEEeecceeeccccccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHH Q lcl|NC_019527. 205 GFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRT 284 (516) Q Consensus 205 ~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~ 284 (516) .|.+++|.+|++... ... + --.|.+.++.+.++.||||...+- +...+.|.|.++.+.+.|.....+ T Consensus 142 ~L~~l~~~~v~~~~~----~~~--~--~y~~~~~~~~~~~~evih~k~~~~-----~~~~~~G~s~~~~~~~~i~~~~~~ 208 (413) T protein:vir:96 142 GLTPISPYKVTFNVS----DDD--L--DYSITFDNKEYDPSTLLHFVLNPS-----IERPFIGTGYKVALKDIVGNLKQA 208 (413) T ss_pred EEEEecCceeEEEEc----CCe--E--EEEEeecCcEEchhhEEEEeccCC-----CCCccccccHHHHHHHHHHHHHHH Confidence 688888888875321 111 0 012344577899999999975432 223457999999999999999999 Q ss_pred HHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcce-EEEecCCcceeEEe-cccCC--HHHHHHHHH Q lcl|NC_019527. 285 RQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGL-AVMDFDSEDIVQVN-TPLSG--LADLQSQSQ 360 (516) Q Consensus 285 ~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~-~~id~~~e~~e~~~-~~lsg--l~d~~~~~~ 360 (516) ......++.+....-........++.+..+++.++++.......|.|. .++..+..++..+. .+..+ +-+...... T Consensus 209 ~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~ 288 (413) T protein:vir:96 209 SVTKKGFMASEYMPNLIVSVDSDSDELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDK 288 (413) T ss_pred HHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcCccccCceeeecCCcccccccccCChhHHHHHHHHHHHH Confidence 999999999877643322223345444444555555544444455554 44544444444432 23322 334556777 Q ss_pred HHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEE--eCCCCCC Q lcl|NC_019527. 361 EHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFK--FKSLWQT 438 (516) Q Consensus 361 ~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~--f~pL~~~ 438 (516) ++||.+.+||..+| |.. ...+....+||. ..|.|.++.+.+.|-+..+ ++++.|+ ++.|... T Consensus 289 ~~Ia~~fgVP~~~l-g~~-----~~~~~~~~~~~~-------~~l~P~~~~ie~~ln~~ll---~~~~~~~fd~~~ll~~ 352 (413) T protein:vir:96 289 KTVAGIFGVPAFLL-GVG-----TYNKDEFNNFIN-------TKIMSIAQVIQQTYNKLIV---EEDMYFSLNPRSLYNY 352 (413) T ss_pred HHHHHHhCCCHHHc-CCC-----cchHHHHHHHHH-------HHHHHHHHHHHHHHHHhhC---CCCcEEEEechhhhcc Confidence 89999999999766 421 222333444443 3488999988888876554 3455555 4577777 Q ss_pred CHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccccc-chhcCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 439 SAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEM-FDDDGADPYMPDPDVLPGEEG 515 (516) Q Consensus 439 sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~-~~~e~~~~~~~~~~~~~~~e~ 515 (516) |.+++++ ++..++++|++|++|+|+.++ +++++...+...+-. ...+ ...+....+++|+ T Consensus 353 d~~~~~~-------~~~~~~~~G~~t~NE~R~~~g------~~p~~~gd~~~~~~n~~~~~----~~~~~~~~~~~dt 413 (413) T protein:vir:96 353 SLTEMVS-------AGAQMTQLNALRRNEFRNWVG------MPPDAEMDDLLVLENYLQQK----DLVNQKKLIQDET 413 (413) T ss_pred CHHHHHH-------HHHHHHhCCCcCHHHHHHHhC------CCCCCCcceeeecccccchh----hcccccCCCCCCC Confidence 7777654 556789999999999999884 444432111111100 0000 0111222333333 No 41 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=99.88 E-value=8.7e-23 Score=141.53 Aligned_cols=408 Identities=12% Similarity=0.081 Sum_probs=216.1 Q ss_pred HHhHHhhcCCCccccccCCCCCCCccCCCccchhc-ccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhh Q lcl|NC_019527. 38 VMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVA-MDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRA 116 (516) Q Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a-~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~ 116 (516) +.+++.+....+.. +++.. ...-+. +.+. .|+..-.. +. +..... ....+ -....+++-+.+ T Consensus 1 ~~~~l~~~~~~~~~--~~~~~---~~~~~~-~~~~~~~~~~~~~--~~-----g~~~~~-g~~v~---~~~al~~~~V~~ 63 (434) T protein:vir:43 1 MSKSLGKVLSSATS--APRSS---LFGWGG-KTIRLTDGAFWSQ--FL-----GRESSS-GKKVT---VDKAMKLSAVWA 63 (434) T ss_pred Cccchhhhhhhccc--ccchh---hhcccc-cccccCchHHHHH--Hh-----cCCccC-Cceec---hhhhhccHHHHH Confidence 33333222222111 01100 000011 1111 12111000 00 000000 00111 123456778899 Q ss_pred hhhhhhHHHhhCCCeeeecc-ccchh--hhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcc Q lcl|NC_019527. 117 FASTLSTELTREGIEITSKD-RTKAK--EMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLI 192 (516) Q Consensus 117 iVd~~aed~~r~~~~i~~~~-~~~~~--~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ 192 (516) ||+.+|+++-+..+.+--.+ ++... ........|...-....-...|.+.+... .++|.+++++.-++ T Consensus 64 ~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~~-------- 135 (434) T protein:vir:43 64 CVRLISTSVAGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRAA-------- 135 (434) T ss_pred HHHHHHHhhhhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCC-------- Confidence 99999999999988873322 22111 11122233333444444555666666655 66788887764322 Q ss_pred cccccccccceeeEEeecceeeccccccccccccccccCcceeEE--ee--eEeccceEEEecCCcchhhhhhccCCCCc Q lcl|NC_019527. 193 LDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV--LG--REMHASRLLTIITRPLPDMLKPAYNFSGI 268 (516) Q Consensus 193 ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v--~g--~~iH~SRli~~~~~~~p~~~k~~~~~~G~ 268 (516) |.+.+|.+++|.+|++.... | |...++.. .| +.++++.|||+.+.. .....|. T Consensus 136 --------G~~~~L~~l~p~~v~~~~~~--~------g~~~y~~~~~~g~~~~~~~~eVih~~~~~-------~dg~~G~ 192 (434) T protein:vir:43 136 --------GRPAALDFLLPSRVDLECDE--N------GRLKYFYTTKKGARREIERTNMLHIPAFT-------LDGRIGL 192 (434) T ss_pred --------CcEEEEEEEcCcceEEEEcC--C------CeEEEEEEecCceEEEEccccEEEecCcC-------CCCcccc Confidence 33456888999888754321 1 22333222 23 578999999997643 2345799 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEec Q lcl|NC_019527. 269 SMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVNT 347 (516) Q Consensus 269 S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~~ 347 (516) |.++.+.+.|.....+......++.+....-........++.+..+.+.+.++.. ....|.| +.++++ +.+|+.++. T Consensus 193 spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~r~~~~~~-~g~~nag~~~vl~~-g~~~~~l~~ 270 (434) T protein:vir:43 193 SAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRILQPAQREEFREYVKSV-SGAMNSGRSPVLEQ-GITPETIGI 270 (434) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCCCHHHHHHHHHHHHHh-cCccccCCccccCC-CceEEEccC Confidence 9999999999999999999999998866543222223345443334444444332 2234544 555654 578998887 Q ss_pred ccCC--HHHHHHHHHHHHHhhhcCCceeeecccccccc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019527. 348 PLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE 423 (516) Q Consensus 348 ~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~ 423 (516) +..+ +-+......++||.+.+||-.+| |....+-+ ++-+.....|+ ...|.|.+..|-..|.+..+.. T Consensus 271 ~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~s~~e~~~~~f~-------~~~L~P~~~~ie~~ln~kL~~~ 342 (434) T protein:vir:43 271 NPVDAQLLETREHGVIEICRWFGVPPWMI-GQTDKGSNWGTGLEQQMLAFL-------TFSISSITNQIQQCVNKRLLTA 342 (434) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCcCCccccchHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcCCh Confidence 6553 45677788899999999997655 65543332 11122233333 3457898888887776655432 Q ss_pred cC-CcceEEe--CCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCC---------CCCChhhhccc Q lcl|NC_019527. 424 ID-DAITFKF--KSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGW---------DNIDGDLEIVQ 491 (516) Q Consensus 424 ~~-~d~~~~f--~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~---------~~~d~~~e~~~ 491 (516) .. .++.|+| ..|...|.++++ +++.+++++|++|++|+|+.++..+.-+- -+++...+... T Consensus 343 ~~~~~~~~~fd~~~llr~d~~~r~-------~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~ 415 (434) T protein:vir:43 343 PERIRYYAEFSLEGFLKADSAGRA-------AWYSTMAQNGFMTRNEGRRKENLPELPGGDILTVQSNLVPIDQLGQSNK 415 (434) T ss_pred hhhcCceEEEechhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeeccCccchhhhhccCC Confidence 21 2444555 478778887764 55667899999999999999865443211 11111111111 Q ss_pred cccchhcCCCCCCCCCCCCCCC Q lcl|NC_019527. 492 PEMFDDDGADPYMPDPDVLPGE 513 (516) Q Consensus 492 ~e~~~~e~~~~~~~~~~~~~~~ 513 (516) . +......+...+.++|.+ T Consensus 416 ~---~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 416 S---QAVRAALMNWFSQPEPQE 434 (434) T ss_pred C---cchhhhhhccCCCCCCCC Confidence 1 111011111111122222 No 42 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=99.88 E-value=1.6e-22 Score=140.13 Aligned_cols=406 Identities=12% Similarity=0.097 Sum_probs=213.5 Q ss_pred CCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHH Q lcl|NC_019527. 26 QEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNL 105 (516) Q Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll 105 (516) -...|.+.+...+ + ..|.||.+ +-..+ ....+-.+. .... . +...+ .+.... .. T Consensus 1 ~~~~~~~g~~~~~-~----------~~~~~~~~---~~~~~-~~~~~~~~~---~~~~---~-~~~~~---~~g~~v-~~ 54 (432) T protein:vir:97 1 MPDEKKLGLLGQL-K----------AMFVPPDP---VDIGG-GQTFTPVNA---TARD---L-GIIIS---DTGAAV-NA 54 (432) T ss_pred CCCcccCchhhhh-H----------hhcCCccc---ccccc-ccccccCch---hhhh---h-ccccc---ccCccc-ch Confidence 0111111111100 0 11222211 00000 000100000 0000 0 00000 001000 12 Q ss_pred HHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchh--hhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEe Q lcl|NC_019527. 106 AALATRPEYRAFASTLSTELTREGIEITSKDRTKAK--EMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINI 182 (516) Q Consensus 106 ~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~--~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i 182 (516) ..+.+++.+.++|+.+++++-+..+.+--.+.+... ........|...-.....+..|.+.+... .++|.|++++.- T Consensus 55 ~~a~~~~aV~~~v~~Ia~~ia~lp~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~ 134 (432) T protein:vir:97 55 DAIMRLDAVAACVKLVSQAVAAMPLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVV 134 (432) T ss_pred HhhhcchHHHHHHHHHHHhhccCceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 345577889999999999999998887433222111 11122333444444555566677766655 667888877644 Q ss_pred cCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEE---ee--eEeccceEEEecCCcchh Q lcl|NC_019527. 183 KGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV---LG--REMHASRLLTIITRPLPD 257 (516) Q Consensus 183 ~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v---~g--~~iH~SRli~~~~~~~p~ 257 (516) ++ |.+.+|.+++|.+|++.. |. .|.+ .|++ .| ..++++.|+|+.+.. T Consensus 135 ~~----------------g~~~~L~~l~p~~v~v~~----~~----~g~~-~y~~~~~~g~~~~~~~~~iih~r~~~--- 186 (432) T protein:vir:97 135 TD----------------GRIESLQYLANDRLTITT----DT----KGNT-AYRYRRTDGQMIDIPRQQIWKIMGYS--- 186 (432) T ss_pred cC----------------CcEEEEEEEcCcceEEEE----cC----CCcE-EEEEEecCceEEEEccccEEEecCcC--- Confidence 32 234568888888887642 11 1222 2333 23 368999999997643 Q ss_pred hhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEE Q lcl|NC_019527. 258 MLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVM 335 (516) Q Consensus 258 ~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~i 335 (516) ...+.|.|.++.+.+.|.....+......++.+.... +++++ ..++.+..+.+.+++ ....+..++.++ T Consensus 187 ----~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~--~~l~~e~~~~~~~~~---~~~~nag~~~vl 257 (432) T protein:vir:97 187 ----LDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID--RFLTDDQYDSFSKKV---SGSVEAGRAPLL 257 (432) T ss_pred ----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecC--CCCCHHHHHHHHHHH---hhhhcCCCceec Confidence 2345799999999999999999999999998886664 45544 334433223333333 223333445666 Q ss_pred ecCCcceeEEecccCCH--HHHHHHHHHHHHhhhcCCceeeeccccccccccc---hHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 336 DFDSEDIVQVNTPLSGL--ADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASS---EGEIRSFYDDISSVQQSYYFSPLD 410 (516) Q Consensus 336 d~~~e~~e~~~~~lsgl--~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatg---e~D~~~yyd~I~~~Qe~~l~p~l~ 410 (516) +. +.+|+.++.+..+. -+...+..++||.+.+||..+| |....|-.+++ |.....|+ +..|.|.++ T Consensus 258 ~~-g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~~s~~e~~~~~f~-------~~tl~P~~~ 328 (432) T protein:vir:97 258 EG-GMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMI-GHSSAGTTSWGSGIESQQLGFL-------TMTLSPWLR 328 (432) T ss_pred CC-CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CCcCCcccccchhHHHHHHHHH-------HHHHHHHHH Confidence 54 57899988766543 4557788899999999998665 65443333222 22222232 234678888 Q ss_pred HHHHHHHHHhCCCcC-Cc--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh Q lcl|NC_019527. 411 TMLKVIQLSKWGEID-DA--ITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL 487 (516) Q Consensus 411 ~l~~~l~~s~~g~~~-~d--~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~ 487 (516) .+-..|-+..+.... .. ++|.++.|...|.+++++ ++.+++++|++|++|+|+.+.. ++++.+. T Consensus 329 ~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~-------~~~~~~~~G~~T~NE~R~~~gl------pp~~g~~ 395 (432) T protein:vir:97 329 RIEQSIALNLLTPAERRRYFADFDTSALLRADSAARSS-------YYSQLVNNGLMTRDEAREIEGL------PKLGGNA 395 (432) T ss_pred HHHHHHhhhccCccccCceEEEeechhhhccCHHHHHH-------HHHHHHhCCCCCHHHHHHHhCC------CCCCCCc Confidence 777777665543322 23 445555788888888655 5567899999999999999844 4443221 Q ss_pred hc-cc-----c-ccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 488 EI-VQ-----P-EMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 488 e~-~~-----~-e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) .. .. + +.......+.++....+..+.+.+ T Consensus 396 ~~~~~~~~~~pl~~~~~~~~~~~~~~~~~~~~~~~~ 431 (432) T protein:vir:97 396 AVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKVS 431 (432) T ss_pred ceEeecccccchhhhcccCCCCCCCCCCCccccccc Confidence 11 00 0 000001111111111111112222 No 43 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=99.88 E-value=8.9e-23 Score=141.48 Aligned_cols=444 Identities=10% Similarity=-0.005 Sum_probs=230.4 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) |+-+.++..+ +.++. ... .....++.-..+ ..|+-.....+++. T Consensus 3 ~~~~l~~~~~----------~~~~~-----~~~---~~~~~~~~~~~~----------~~~~~~~~~~~~~~-------- 46 (466) T protein:vir:81 3 LIDRLLSTRG----------AAPRM-----SID---DYAQMLNEFAFN----------GIGYGFGGGVPRIQ-------- 46 (466) T ss_pred hhHHHhhccC----------ccccc-----chh---hhhhhhhhhhcc----------ccccccccccHHHH-------- Confidence 2222222100 00000 000 000000000000 00110001111111 Q ss_pred hhhcccccCCcccccccCcccHHH-HHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhh-hHHHHHHHHHHHHhcC Q lcl|NC_019527. 81 YQFLNSAAGGLYAADIQPFPGYQN-LAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKE-MASKIKELEEACEYYG 158 (516) Q Consensus 81 ~~~~~~~~~~~~~~~~~~f~gy~l-l~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~-~~~~i~~i~~~~~~l~ 158 (516) .+.++.. ....+..|-.+ ...|.+++.++++|+.+++++-+..+.+.-..++.... ....+..|...-.... T Consensus 47 -~~~~g~~-----~~~~~~~g~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~ 120 (466) T protein:vir:81 47 -QTLAGPS-----TELAPDTFVGLATQAYQANGPVFACMLVRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGG 120 (466) T ss_pred -Hhhcccc-----ccccCccccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCC Confidence 0000000 00111111111 34577889999999999999999999886554332211 1112222333333444 Q ss_pred hhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEE Q lcl|NC_019527. 159 VMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV 237 (516) Q Consensus 159 ~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v 237 (516) ....|.+.+... .++|.|++++.-++.-. +. ....|....+.++.|..+.+.... |....-++ .|++ T Consensus 121 t~~~f~~~l~~~lll~Gnay~~i~r~~~g~---l~----~~~~g~~~~l~~l~~~~v~~~~~~--~~~~~~~y---~~~~ 188 (466) T protein:vir:81 121 TTQDMLSRMIQDADLAGNSYWTIVDGEFVR---MR----PDWVDVVVEERMVRGGRGELGGGQ--LGWRKVGY---LYTE 188 (466) T ss_pred CHHHHHHHHHHHHHhcCCeEEEEEecCccc---cc----cccCcceeEEEEecCcceEEEEcC--CCceEEEE---EEEe Confidence 556676666655 56888888875533211 11 123355677888988887754321 21111111 2333 Q ss_pred ee-------eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcC Q lcl|NC_019527. 238 LG-------REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNG 310 (516) Q Consensus 238 ~g-------~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~ 310 (516) .+ ..++++.||||.+... +...+.|.|.++.+.+.|.....+......++.+....-........++. T Consensus 189 ~~~~~~~~~~~~~~~dviHir~~~~-----~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~l~~ 263 (466) T protein:vir:81 189 GGRQSGNESVGFLAEDVVHFAPIPD-----PLASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHNPMADP 263 (466) T ss_pred cCcccccceeeeccccEEEEcCCCC-----cccccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCH Confidence 32 4688999999975431 23456799999999999999999999999999987765332222234544 Q ss_pred ccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccc-- Q lcl|NC_019527. 311 GEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASS-- 386 (516) Q Consensus 311 ~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatg-- 386 (516) +..+++.+.++.......|.|..++..++-+|+.++.+..+ +-+......++||.+.+||-. ++|.+..+-.+|. T Consensus 264 e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~-~lG~~~~~~~st~sn 342 (466) T protein:vir:81 264 AAVKKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPV-IVGLSEGLAAATYSN 342 (466) T ss_pred HHHHHHHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHH-HcccccCCCcccccc Confidence 44444555554443444565544433445789988766544 335567788999999999975 5566543333332 Q ss_pred -hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcC-CcceEEe--CCCCCCCHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019527. 387 -EGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEID-DAITFKF--KSLWQTSAKEESEIRFNKAQEAQIYITNSV 462 (516) Q Consensus 387 -e~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~-~d~~~~f--~pL~~~sekEkAei~~~~a~a~~~~~~~gv 462 (516) |.....|| +..|.|.+..+-..|-+..+..-. ..+.|+| .+|...+.++++++..++++.+..++++|+ T Consensus 343 ~eq~~~~f~-------~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~~~~g~ 415 (466) T protein:vir:81 343 YGQARRRLA-------DGTAHPLWQNLSGCIGHVMPDMGPDVRLWYDADDVPFLREDEKDAADIQKVRAETINTLITAGY 415 (466) T ss_pred HHHHHHHHH-------HHHHHHHHHHHHHHHHhhcCCcccCcceEEEecchhhhccCHHHHHHHHHHHHHHHHHHHHcCC Confidence 33334444 345788888887777654432111 1345555 478899999999999999999999999995 Q ss_pred CCHHHHHHHHHhhhccCCCC--CChhhhccccccchhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 463 IDPSEARQQLSDDPDSGWDN--IDGDLEIVQPEMFDDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 463 i~~~e~r~~l~~~~~~~~~~--~d~~~e~~~~e~~~~e~~~~~~~~~~~~~~~e 514 (516) +++|+|..+.....-.+.. +............+ .+.+. ....+...+++ T Consensus 416 -t~nE~r~~~~~gd~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~Gg~~ngn 466 (466) T protein:vir:81 416 -EPESVVAAVNSGDLRLLKHTGLTSVQLLPPGVSAS-ASSDT-PTSGGADDNGN 466 (466) T ss_pred -ChhhccccccCCccccccCCCcchhhhcccccccc-cCCCC-cccCCCCcCCC Confidence 9999997642111101111 11000100111100 00000 00111112222 No 44 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=99.88 E-value=1.9e-22 Score=139.71 Aligned_cols=370 Identities=12% Similarity=0.034 Sum_probs=194.1 Q ss_pred HHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchh-hhHHHHHHHHHHHHhcC-------------h-hHHHHHHHH Q lcl|NC_019527. 104 NLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAK-EMASKIKELEEACEYYG-------------V-MGIIQKAAE 168 (516) Q Consensus 104 ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~-~~~~~i~~i~~~~~~l~-------------~-~~~l~ea~~ 168 (516) |..+...|+.+++||+++++++.+-++.+....+.... .....++.+...+.... . .+.++..+. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 44444568999999999999999999998644322211 11223333333333222 1 233344555 Q ss_pred hcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeecccccc----------------ccccccccccC- Q lcl|NC_019527. 169 HDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYN----------------ALDPTAPDFYK- 231 (516) Q Consensus 169 ~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~----------------~~dp~s~~yg~- 231 (516) +..++|.+++++.-+.. |.+..|.++++.+|.+.... ..+....++.. T Consensus 81 ~l~l~Gn~~i~~~r~~~---------------G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 145 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTD---------------GTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGD 145 (467) T ss_pred HHHhcCCeEEEEEECCC---------------CcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccc Confidence 66778999988754321 22334555555554431100 00000000000 Q ss_pred --cceeEE------eeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eee Q lcl|NC_019527. 232 --PSTWWV------LGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLK 301 (516) Q Consensus 232 --P~~y~v------~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k 301 (516) +.++.+ ....++++.||||.... +...++|+|.+..+.+.|.....+......++.+.... +++ T Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~diih~r~~~------~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 219 (467) T protein:vir:31 146 LDPVFVDADDGSTGTSVSNPANELIFKRNHS------PLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAII 219 (467) T ss_pred eeeeeeeeccccccceeEeccccEEEecCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 011111 12457889999996532 34567899999999999999888888888888776653 333 Q ss_pred ecchhhhcCccHHHHHHHHHHHHH-----------hcCCcceEEEecCCcceeEEecccC----------CHHHHHHHHH Q lcl|NC_019527. 302 TNMAQVLNGGEGGDVFDRVEMYVN-----------MQSNLGLAVMDFDSEDIVQVNTPLS----------GLADLQSQSQ 360 (516) Q Consensus 302 ~~~~~~l~~~~~~~l~~r~~~~~~-----------~~sn~g~~~id~~~e~~e~~~~~ls----------gl~d~~~~~~ 360 (516) +. ...++.+..+.+.+.++.... ...|.+..++...+.++..+.+.+. .+.+...... T Consensus 220 ~~-~~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~ 298 (467) T protein:vir:31 220 VK-GAELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNE 298 (467) T ss_pred ec-CcCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHH Confidence 22 122333333334333322111 1223333332223344444433221 2234556677 Q ss_pred HHHHhhhcCCceeeeccccc-cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc--CCc--ceEEeCCC Q lcl|NC_019527. 361 EHMCSVSKIPAIKLTGISPS-GLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI--DDA--ITFKFKSL 435 (516) Q Consensus 361 ~~iaaas~IP~t~L~G~sp~-Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~--~~d--~~~~f~pL 435 (516) .+||++.+||..+| |.... ..+++.+.....|+. ..|.|.+..+-..|-...+... ..+ ++|.+..| T Consensus 299 ~~Ia~~fgVpp~~l-G~~~~~~~~s~~e~~~~~f~~-------~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l 370 (467) T protein:vir:31 299 HDILKVHDVPPVIA-GVVESGAFSTDAEEQRKEFAE-------ETIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKP 370 (467) T ss_pred HHHHHHhCCCHHHc-ccCCCCCcccCHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcchhhccCCceEEEecchh Confidence 88999999998655 76533 333334544555543 3478888888777665544221 223 56666789 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhc---------cccccchhcCCCCCCCC Q lcl|NC_019527. 436 WQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEI---------VQPEMFDDDGADPYMPD 506 (516) Q Consensus 436 ~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~---------~~~e~~~~e~~~~~~~~ 506 (516) ...+.++++++. ..++++|++|++|+|+.++.. ++.++.-. .....+.....+.+... T Consensus 371 ~~~d~~~~~~~~-------~~~~~~G~~T~NE~R~~~Gl~------pi~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (467) T protein:vir:31 371 DTKLQDVEIASQ-------RVQAMQGLLTVNELRDEFGFE------PFPEEHVYGGETLVAEVTGGSGPGGGIGDQIEQL 437 (467) T ss_pred hccCHHHHHHHH-------HHHHhCCCcCHHHHHHHhCCC------CCCcccccCCcccccccccccCCCCcccCcCCCC Confidence 888888876654 558999999999999998543 33211100 00000000000000000 Q ss_pred CCC----------------CCCCCCC Q lcl|NC_019527. 507 PDV----------------LPGEEGS 516 (516) Q Consensus 507 ~~~----------------~~~~e~t 516 (516) +++ .+-|+|. T Consensus 438 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 463 (467) T protein:vir:31 438 VEDRADEIIDSYQADLETEQLIEIGA 463 (467) T ss_pred CCCcccchHhhhhhccccchhhhhcc Confidence 000 0000000 No 45 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=99.88 E-value=6.4e-23 Score=142.25 Aligned_cols=407 Identities=12% Similarity=0.119 Sum_probs=215.2 Q ss_pred HhhcCCCccccccCCCC-CCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQL-MPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFAST 120 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~-~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~ 120 (516) |.....+.+..+.+... .-|+ +..+.|..+...+ ++.. ..+.... ....+.+++.+.++|+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~------~~s~~~~~~~~~~-------~~~~---~~~g~~v-~~~~al~~~~v~~ci~~ 63 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGV------PISLTDGSFWSAW-------GGMG---SSSGETV-TADSALQLSAVWSCVRL 63 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCC------cccCCchhHHHhh-------cccc---cCCCcee-chHhhhccHHHHHHHHH Confidence 11111111110000000 0000 0000111000000 0000 0000000 01234577889999999 Q ss_pred hhHHHhhCCCeeeeccccchh---hhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccc Q lcl|NC_019527. 121 LSTELTREGIEITSKDRTKAK---EMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPR 196 (516) Q Consensus 121 ~aed~~r~~~~i~~~~~~~~~---~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~ 196 (516) +++++-+..+.+...+++... ........|...-....-...|.+.+... .++|.+++++.-++ T Consensus 64 Ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~------------ 131 (437) T protein:vir:10 64 IAETIATLPLNLYQTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSA------------ 131 (437) T ss_pred HHHHHhhCceeEEEEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecC------------ Confidence 999999988887443322211 11112233444444455566677776666 66899988875432 Q ss_pred cccccceeeEEeecceeeccccccccccccccccCcceeEE---ee--eEeccceEEEecCCcchhhhhhccCCCCchHH Q lcl|NC_019527. 197 TIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV---LG--REMHASRLLTIITRPLPDMLKPAYNFSGISMS 271 (516) Q Consensus 197 ~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v---~g--~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~l 271 (516) |.+.+|.+++|.+|++.... + |.. .|++ .| ..+.++.||||.+..+ ..++|.|.+ T Consensus 132 ----g~~~~L~~l~p~~v~i~~~~--~------g~~-~y~~~~~~g~~~~~~~~dIih~r~~~~-------d~~~G~spi 191 (437) T protein:vir:10 132 ----GVLIGLELMLPQRTTVKRLT--S------GAL-QYTYRNVDGTVSTLAEDDVFHVRGFSL-------DGLMGLTPI 191 (437) T ss_pred ----CcEEEEEEEcCcceEEEECC--C------CeE-EEEEEecCceEEEEccccEEEecCcCC-------CCcccccHH Confidence 23456888888887753321 1 111 2222 23 3578999999975321 346799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEecc Q lcl|NC_019527. 272 QLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVNTP 348 (516) Q Consensus 272 e~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~~~ 348 (516) +.+.+.|.....+......++.+.... +++++ ..++.+..+++.++++.......|.| +++++ ++-+|++++.+ T Consensus 192 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--~~l~~e~~~~~~~~~~~~~~g~~nag~~~vl~-~g~~~~~l~~~ 268 (437) T protein:vir:10 192 QYAREVLGNSTAANKTSASVFRNGLRPSGVLSTD--QILQKEKRAEIRTDLAEQFGGAMQAGKTMVLE-AGMKYQAITMN 268 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC--CCCCHHHHHHHHHHHHHHhcCccccCcceecc-CCceEEeccCC Confidence 999999999999999999999886654 34443 44554444556555554444445555 45555 45788888766 Q ss_pred cCC--HHHHHHHHHHHHHhhhcCCceeeeccccccc--cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc Q lcl|NC_019527. 349 LSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGL--NASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI 424 (516) Q Consensus 349 lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Gl--natge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~ 424 (516) ..+ +-+........||.+.+||..+| |....+- ++.-+.....|| +..|.|.+..|...|-+..+... T Consensus 269 ~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~sn~e~~~~~f~-------~~tl~P~~~~ie~~l~~kll~~~ 340 (437) T protein:vir:10 269 PGDVQLLETRAFNIEEICRWYRVPPFMV-GHSEKSTSWGTGIEQQTLGFL-------TFTLRPWLTRIEQAARRSLLRPG 340 (437) T ss_pred hhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCcccccchHHHHHHHHH-------HHHHHHHHHHHHHHHHhhccCcc Confidence 543 45566677889999999998666 6543221 122233333443 34578888888887776554322 Q ss_pred C-Cc--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCC-C--Ch---hhhccccccc Q lcl|NC_019527. 425 D-DA--ITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDN-I--DG---DLEIVQPEMF 495 (516) Q Consensus 425 ~-~d--~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~-~--d~---~~e~~~~e~~ 495 (516) . .. ++|.+..|...|.+++++ ++..++++|++|++|+|+.+...+..|-.. + .. ..+...++.+ T Consensus 341 e~~~~~~~fd~~~ll~~d~~~r~~-------~~~~~~~~G~~T~NE~R~~~gl~pi~gg~~~~~~~~~~~~~~~~~~~~~ 413 (437) T protein:vir:10 341 ERDQFYAEFSVEGLLRADSAGRAA-------FYSTMTQNGLMTRDECRAKENLPPMGGNAAVLTVQSALLPIDKLGEHTT 413 (437) T ss_pred ccCceEEEEechhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCCCCCCCCcceEeecCcccchhhccCcCC Confidence 1 12 445556787778877765 456689999999999999985543221100 0 00 0000000000 Q ss_pred hhc----CCCCCCCCCCCCCCCCC Q lcl|NC_019527. 496 DDD----GADPYMPDPDVLPGEEG 515 (516) Q Consensus 496 ~~e----~~~~~~~~~~~~~~~e~ 515 (516) ... ......+++.....+|+ T Consensus 414 ~~~~~~~~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 414 ATAAQDALKAWLYQEEKTRATQER 437 (437) T ss_pred CcchhccccccCCCCCCCCccccC Confidence 000 00111112222222222 No 46 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=99.88 E-value=5.3e-23 Score=142.73 Aligned_cols=395 Identities=14% Similarity=0.141 Sum_probs=222.2 Q ss_pred hhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhC Q lcl|NC_019527. 32 LAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATR 111 (516) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~ 111 (516) |-++..+.+ . |.+. .....+. +.. ...+.+... .+ . ...-++ T Consensus 1 MG~~~~~~~-~----------~~~~---------~~~~~~~-~~~---~~~~~g~~~-------~~------~-~~al~~ 42 (411) T protein:vir:81 1 MGWWSRLTR-F----------FRPR---------NETVDMT-NPL---LLQWLGVDP-------DT------P-RNQLSE 42 (411) T ss_pred CchHHHHHh-h----------ccCc---------ccccccc-hHH---HHHHhcCcc-------cC------h-hhhhcc Confidence 222222111 0 1000 0000010 000 001110000 00 0 112256 Q ss_pred chhhhhhhhhhHHHhhCCCeeeeccccch-h-hhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcc Q lcl|NC_019527. 112 PEYRAFASTLSTELTREGIEITSKDRTKA-K-EMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVS 188 (516) Q Consensus 112 ~i~r~iVd~~aed~~r~~~~i~~~~~~~~-~-~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~ 188 (516) +-+.++|+.+|+++-+..+.+--.+++.. + ........|...-.....+..|.+.+.+. .++|.|++++..++ T Consensus 43 ~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~---- 118 (411) T protein:vir:81 43 ATYFACLKILSESLGKLPLKMYQKTERGIVKSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSG---- 118 (411) T ss_pred HHHHHHHHHHHHhHhhCceeEEEecCCceeeecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC---- Confidence 77899999999999999988843332211 1 11222333444444455666677666665 66788988875543 Q ss_pred cCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe----e--eEeccceEEEecCCcchhhhhhc Q lcl|NC_019527. 189 VPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL----G--REMHASRLLTIITRPLPDMLKPA 262 (516) Q Consensus 189 ~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~----g--~~iH~SRli~~~~~~~p~~~k~~ 262 (516) |.+.+|.++.|.+|++....... .... ...+|.+. | ..++++.||||.... +. T Consensus 119 ------------g~~~~l~~l~~~~v~~~~~~~~~-~~~~--~~~~~~~~~~~~g~~~~~~~~eiih~k~~~------~~ 177 (411) T protein:vir:81 119 ------------PQLQALWILPSQYVTIVVDDRGL-LGEK--NAIWYRYNDPYDGKMYVFRNDEILHFKTSV------TF 177 (411) T ss_pred ------------CceEEEEEECCceEEEEEcCccc-cccc--ceEEEEEEecCCceEEEEccccEEEEcCCC------CC Confidence 23456888888888764321110 0101 11123332 3 358999999997432 23 Q ss_pred cCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcce Q lcl|NC_019527. 263 YNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDI 342 (516) Q Consensus 263 ~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~ 342 (516) ..+.|.|.++.+.+.+.....+......++.+....-........++.+..+++.++++......+|.|..++..++-+| T Consensus 178 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~ 257 (411) T protein:vir:81 178 DGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDRLVKGFEQFANGSKNAGKIIPVPLGMKL 257 (411) T ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceE Confidence 45679999999999999999999999999988765543333334455544456666666655554555544443445789 Q ss_pred eEEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 343 VQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSK 420 (516) Q Consensus 343 e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~ 420 (516) ++++.+..+ +-+......++||.+.+||..+| |...++-.++.+.....||. ..|.|.++.+-+.+-+.. T Consensus 258 ~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~n~e~~~~~f~~-------~~l~P~~~~ie~~l~~~l 329 (411) T protein:vir:81 258 VPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQI-NDYEKSSYASAEAQNLAFYV-------DTLLYVLKQYEEEITYKI 329 (411) T ss_pred EEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCchhHHHHHHHHHH-------HHHHHHHHHHHHHHHhhc Confidence 988765543 34566778899999999998655 65555555555555555543 457898888888777655 Q ss_pred CCC--cCCcc--eEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc-ccc Q lcl|NC_019527. 421 WGE--IDDAI--TFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP-EMF 495 (516) Q Consensus 421 ~g~--~~~d~--~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~-e~~ 495 (516) +.. ...++ +|.+..|...|.++++ ++++.++++|++|++|+|+.++ +++++...+...+ ... T Consensus 330 l~~~~~~~~~~~~fd~~~ll~~d~~~~~-------~~~~~~~~~g~~t~NE~R~~~g------l~p~~ggD~~~~~~n~~ 396 (411) T protein:vir:81 330 LSNDLISQGHYFKFNVNVILRADIKTQM-------DSLSTAVQNGIMTPNEARDYLD------MPADDYGNNLMANGNYI 396 (411) T ss_pred CChhhcCCCcEEEeechhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhC------CCCCCCCCeeeeccCcc Confidence 432 23344 4555677777877764 5567789999999999999884 4444221111100 000 Q ss_pred hhcCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 496 DDDGADPYMPDPDVLPGEEG 515 (516) Q Consensus 496 ~~e~~~~~~~~~~~~~~~e~ 515 (516) ..+. ..+....|+|. T Consensus 397 pl~~-----~~~~~~kgGd~ 411 (411) T protein:vir:81 397 PLSM-----LGANYGKGGDS 411 (411) T ss_pred chhh-----hhhhhccCCCC Confidence 0000 00111222332 No 47 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=99.88 E-value=3.5e-22 Score=138.19 Aligned_cols=387 Identities=11% Similarity=0.018 Sum_probs=204.5 Q ss_pred ccCCCccchhcccccccchhhhcc-cccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccch Q lcl|NC_019527. 62 VVPAGTTPAVAMDSLCGPTYQFLN-SAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKA 140 (516) Q Consensus 62 v~~~~~~~~~a~ds~~~~~~~~~~-~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~ 140 (516) -.+++...++.. +..+....... ......+|. ..+|.|. ....+.+++.+.++|+.+|+++-+..+.+.-..+..+ T Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~~~~v-~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~~~~~~~ 77 (409) T protein:vir:93 1 MAKENIVTRIKK-KLIDNWIDQSTSKLYDFSPWK-NRSFWGV-INNTLETNETIFSAITKLSNSMASLPLKMYEDYKVVN 77 (409) T ss_pred CCccchhhhhhh-hhhhhhhcccccccccccccc-Ccccccc-chhhhhccHHHHHHHHHHHHhhhhCceeEeecccccc Confidence 011111111100 00000000000 000000000 0122222 1134668888999999999999998888754433222 Q ss_pred hhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccc Q lcl|NC_019527. 141 KEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAY 219 (516) Q Consensus 141 ~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~ 219 (516) . .....|...-....-+..|.+.+.+. .++|.|++++..++ .|.+..|.+++|.+|++... T Consensus 78 ~---~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~---------------~G~~~~L~~l~~~~v~~~~~ 139 (409) T protein:vir:93 78 T---EVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI---------------YHQPSKLFLLNPDVVEMLIE 139 (409) T ss_pred c---hHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECC---------------CCcEEEEEEEcCceeEEEEe Confidence 1 22233444445555566666666655 67888988875432 23456788888888775332 Q ss_pred cccccccccccCcceeEEe---e--eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 220 NALDPTAPDFYKPSTWWVL---G--REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDK 294 (516) Q Consensus 220 ~~~dp~s~~yg~P~~y~v~---g--~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~ 294 (516) .. +.+.+|.+. | ..++++.||||.+.. +...+.|.|.++.+.+.+.....+.... +... T Consensus 140 ~~--------~~~~~y~~~~~~g~~~~~~~~eVih~r~~~------~~~~~~G~s~i~~~~~~i~~~~~~~~~~--~~~~ 203 (409) T protein:vir:93 140 NQ--------SRELYYSIHAATGNKLIVHNMDMLHFKHIV------ASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEM 203 (409) T ss_pred CC--------CcEEEEEEEcCCceEEEEccccEEEeCCCC------CCCccccccHHHHHHHHHHHHHHHHHHH--HHhc Confidence 11 123345554 2 358999999997542 2245679999998888777665554442 2222 Q ss_pred hCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCc-ceEEEecCCcceeEEecccCC--HHHHHHHHHHHHHhhhcC Q lcl|NC_019527. 295 FSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNL-GLAVMDFDSEDIVQVNTPLSG--LADLQSQSQEHMCSVSKI 369 (516) Q Consensus 295 ~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~-g~~~id~~~e~~e~~~~~lsg--l~d~~~~~~~~iaaas~I 369 (516) .... +++.+ ..++. ++..+..+.|....+|. ++.+++ ++.+|++++.+... +-+......+.||.+.+| T Consensus 204 ~~~~~~i~~~~--~~l~~---e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgV 277 (409) T protein:vir:93 204 QKPDSFMLKYG--SNVGK---EKRQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSEDIVASENLTRERVANVFQL 277 (409) T ss_pred CCCCceEEecC--CCCCH---HHHHHHHHHHHHHhhcCCCeeecC-CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCC Confidence 2111 22222 22322 33333334444433444 455555 45788888765443 344555677899999999 Q ss_pred CceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cCCcceEEeC--CCCCCCHHHHHH Q lcl|NC_019527. 370 PAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE--IDDAITFKFK--SLWQTSAKEESE 445 (516) Q Consensus 370 P~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~--~~~d~~~~f~--pL~~~sekEkAe 445 (516) |-.+|.+.. .+-.++.|.....||.. .|.|.++.|...+-+..+.. ...++.|+|+ .|...|.+++++ T Consensus 278 Pp~~lg~~~-~~~~sn~e~~~~~f~~~-------~l~P~~~~ie~~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~ 349 (409) T protein:vir:93 278 PSVFLNARS-NTNFAKNEELNRFYLQH-------TLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAE 349 (409) T ss_pred CHHHhCCCC-CCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHH Confidence 988775432 33334455555556554 37898888887776655432 2334556654 677777777654 Q ss_pred HHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccc-----c-ccchhcCCCCCCCCCCCCCC Q lcl|NC_019527. 446 IRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQ-----P-EMFDDDGADPYMPDPDVLPG 512 (516) Q Consensus 446 i~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~-----~-e~~~~e~~~~~~~~~~~~~~ 512 (516) ++.+++++|++|++|+|+.++. ++++...+... + +...+......+++.+...+ T Consensus 350 -------~~~~~~~~G~~T~NE~R~~~g~------~p~~ggD~~~~~~n~~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 350 -------VYFKAVRSGYYTINDIREWEDL------PPVEGGDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred -------HHHHHHhCCCcCHHHHHHHhCC------CCCCCcCeeeecccccccccchhhcccccCCCCCcCCC Confidence 5566899999999999999844 44421111110 0 00011111112222222222 No 48 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=99.88 E-value=5.7e-22 Score=137.03 Aligned_cols=387 Identities=11% Similarity=0.021 Sum_probs=204.6 Q ss_pred ccCCCccchhc---ccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeecccc Q lcl|NC_019527. 62 VVPAGTTPAVA---MDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRT 138 (516) Q Consensus 62 v~~~~~~~~~a---~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~ 138 (516) ...++..+++- .+...+... ........|. ..+|.|. ....|.+++.+.++|+.+|+++-+..+.+.-..+. T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~---~~~~~~~~~~-~~~~~~v-~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~~ 75 (409) T protein:vir:94 1 MAKENIVTRIKKKLIDNWIDQSA---SKLYDFSPWK-NKSFWGV-INNTLETNETIFSAITKLSNSMASLPLKMYEDYKV 75 (409) T ss_pred CcccccchhhhhHHhhhhhcCCc---cccccccccc-Ccccccc-chhhhhccHHHHHHHHHHHHhhhhCceeEeecccc Confidence 01111111110 000000000 0000000000 1122222 12346678889999999999999999887543332 Q ss_pred chhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccc Q lcl|NC_019527. 139 KAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPS 217 (516) Q Consensus 139 ~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~ 217 (516) .+ ......|...-....-...|.+.+... .++|.+++++.-+. .|.+.+|.+++|.+|++. T Consensus 76 ~~---~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~---------------~G~~~~L~~l~~~~v~v~ 137 (409) T protein:vir:94 76 VN---TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI---------------YHQPSKLFLLNPDVVEML 137 (409) T ss_pred cc---hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---------------CCcEEEEEEEcCceeEEE Confidence 22 122233444455555667777666655 67899988875421 244567889999888754 Q ss_pred cccccccccccccCcceeEEe---e--eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 218 AYNALDPTAPDFYKPSTWWVL---G--REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLV 292 (516) Q Consensus 218 ~~~~~dp~s~~yg~P~~y~v~---g--~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll 292 (516) .... +.+-+|.+. | ..++++.|+||.+.. +.....|.|.+..+.+.+.....+.... +. T Consensus 138 ~~~~--------~~~~~y~~~~~~g~~~~~~~~dvih~r~~~------~~~~~~G~s~l~~~~~~i~~~~~~~~~~--~~ 201 (409) T protein:vir:94 138 IENQ--------SRELYYSIHAATGNKLIVHNMDMLHFKHIV------ASNMVQGISPIDVLKNTTDFDNAVRTFN--LT 201 (409) T ss_pred EeCC--------CcEEEEEEEcCCceEEEEccccEEEecCCC------CCCccccccHHHHHHHHHHHHHHHHHHH--HH Confidence 3211 223345553 3 357899999997532 2345679999998888777655554432 22 Q ss_pred HHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEecccC--CHHHHHHHHHHHHHhhhcC Q lcl|NC_019527. 293 DKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVNTPLS--GLADLQSQSQEHMCSVSKI 369 (516) Q Consensus 293 ~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~~~ls--gl~d~~~~~~~~iaaas~I 369 (516) .......+.......++. ++..+..+.|+...+|.| +++++ ++.+|+.++.+.. .+-+.......+||.+.+| T Consensus 202 ~~~~~~~~i~~~~~~l~~---e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgV 277 (409) T protein:vir:94 202 EMQKPDSFMLKYGSNVGK---EKRQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSEDIVASENLTRERVANVFQL 277 (409) T ss_pred hcCCCCeeEEecCCCCCH---HHHHHHHHHHHHHhhcCCCeeecC-CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCC Confidence 222111111111122322 333333344444444444 55555 4578888876544 3445566677899999999 Q ss_pred CceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc--CCcceEEeC--CCCCCCHHHHHH Q lcl|NC_019527. 370 PAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI--DDAITFKFK--SLWQTSAKEESE 445 (516) Q Consensus 370 P~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~--~~d~~~~f~--pL~~~sekEkAe 445 (516) |-.+|-+.. .+-.++.|.....|+.. .|.|.++.|...|-+..+... ..+..|+|+ .|...|.++++ T Consensus 278 Pp~~lg~~~-~~~~sn~e~~~~~f~~~-------~l~P~~~~ie~~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~- 348 (409) T protein:vir:94 278 PSVFLNARS-NTNFAKNEELNRFYLQH-------TLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQA- 348 (409) T ss_pred CHHHhCCCC-CCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHH- Confidence 988774432 23334445444555443 478999888888777665432 335556654 67777777765 Q ss_pred HHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccc-----c-ccchhcCCCCCCCCCCCCCC Q lcl|NC_019527. 446 IRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQ-----P-EMFDDDGADPYMPDPDVLPG 512 (516) Q Consensus 446 i~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~-----~-e~~~~e~~~~~~~~~~~~~~ 512 (516) +++..++++|++|++|+|+.++. ++++...+... + +...+......+++.+...+ T Consensus 349 ------~~~~~~~~~G~~T~NE~R~~~g~------~p~~ggD~~~~~~n~~~~~~~~~~~~~~kGG~~n~~e~ 409 (409) T protein:vir:94 349 ------EVYFKAVRSGYYTINDIREWEDL------PPVEGGDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred ------HHHHHHHhCCCcCHHHHHHHhCC------CCCCCcCeEeecccccccccchhhcccccCCCCCcCCC Confidence 45567899999999999999844 44321111110 0 00110111111222111111 No 49 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=99.87 E-value=7.4e-22 Score=136.42 Aligned_cols=395 Identities=11% Similarity=0.032 Sum_probs=214.5 Q ss_pred ccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCC Q lcl|NC_019527. 51 TKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGI 130 (516) Q Consensus 51 ~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~ 130 (516) -.|.+...-+.. ...++.++.....++...+.. + .... .....+++-+.+||+.+++++-+-.+ T Consensus 1 ~~~~r~~~~~~~-----~~~~~~~~~~~~~~g~~~s~~-~-------~~vt---~~~al~~~~v~~~v~~ia~~iA~lp~ 64 (419) T protein:vir:14 1 MFFSRQLLSNLG-----QTQMSAGGWVSALLGSSRSDS-G-------QVVT---PASALALTVLQNCVTLLAESIAQLPI 64 (419) T ss_pred Cccccccccccc-----ccccCcchhhHHhhcCCCccC-C-------cccc---hHHhhccHHHHHHHHHHHHhhccCce Confidence 223222111110 111222221111111100000 0 0011 12234567789999999999999888 Q ss_pred eeeeccccchhh--hHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccceeeEE Q lcl|NC_019527. 131 EITSKDRTKAKE--MASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFS 207 (516) Q Consensus 131 ~i~~~~~~~~~~--~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~ 207 (516) .+.-.++++... .......|...-....-...|.+.+.+. .++|.+++++.-++ .|.+.+|. T Consensus 65 ~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~---------------~G~~~~l~ 129 (419) T protein:vir:14 65 ELYERSGEDRKPATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDS---------------DGVIQGLY 129 (419) T ss_pred EEEEecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---------------CCcEEEEE Confidence 874433322211 1112233333444445566677776655 56788888874332 24466789 Q ss_pred eecceeeccccccccccccccccCcceeEEee-eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 208 NIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG-REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQ 286 (516) Q Consensus 208 v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g-~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~ 286 (516) +++|.+|++.... | +.+ .|++.+ ..++.+.|+|+.+.. ...++|.|.++.+.+.|.....+.. T Consensus 130 pl~~~~v~v~~~~--~------~~~-~y~~~~~~~~~~~~i~h~~~~~-------~dg~~G~s~i~~~~~~i~~~~~~~~ 193 (419) T protein:vir:14 130 PLDNEAVTVMRGS--D------LKP-VYRVRGSDPMPQRLVHHVRWMS-------INGYTGLSPVLLHANAIGHAQAIQQ 193 (419) T ss_pred EecCceEEEEECC--C------ceE-EEEEccCcccchhheeEecCcC-------CCCcccccHHHHHHHHHHHHHHHHH Confidence 9999888753211 1 222 455654 356778888886543 2356899999999999999999999 Q ss_pred HHHHHHHHhCCc--eeeecchhhhcCcc-H---HHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEecccCC--HHHHHH Q lcl|NC_019527. 287 SVSDLVDKFSRT--FLKTNMAQVLNGGE-G---GDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVNTPLSG--LADLQS 357 (516) Q Consensus 287 ~~~~Ll~~~~~~--v~k~~~~~~l~~~~-~---~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~~~lsg--l~d~~~ 357 (516) ....++.+.... +++++ ..+.... + +.+.++++......+|.+ +.++++ +-+|++++.+..+ +-+... T Consensus 194 ~~~~~f~ng~~p~gil~~~--~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~-g~~~~~l~~~~~d~q~~e~~~ 270 (419) T protein:vir:14 194 YAGKSFMNGTALSGVIERP--KDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQE-GMTFRPLSMTNVDAALIDALR 270 (419) T ss_pred HHHHHHhccCCccEEEEec--CCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCC-CceEEEccCChhhHHHHHHHH Confidence 999999886654 44442 2222111 2 223333433333334544 455654 4788888766543 345566 Q ss_pred HHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcC-CcceEEe--CC Q lcl|NC_019527. 358 QSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEID-DAITFKF--KS 434 (516) Q Consensus 358 ~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~-~d~~~~f--~p 434 (516) ...+.||.+.+||..+| |..-++-.++.|.....||.. .|.|.+..+-..|-+..+-... .++.|+| +. T Consensus 271 ~~~~~Ia~~fgVpp~~l-g~~~~~t~s~~E~~~~~f~~~-------~L~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~ 342 (419) T protein:vir:14 271 LSALDIARIYKIPAHMV-NELERATFSNIEHQSLQFVIY-------TLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAG 342 (419) T ss_pred HHHHHHHHHhCCCHHHh-cCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHhhhccCccccCCeEEEEechh Confidence 77889999999998776 433334444445455555443 3788888887777665442221 2445555 47 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc-ccch-hcCCCCCCCCCCCCCC Q lcl|NC_019527. 435 LWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP-EMFD-DDGADPYMPDPDVLPG 512 (516) Q Consensus 435 L~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~-e~~~-~e~~~~~~~~~~~~~~ 512 (516) |...|.++++ +++.+++++|++|++|+|+.++.. +++.-.....+ .... +.....+..++++.++ T Consensus 343 l~r~d~~~~~-------~~~~~~~~~G~~T~NE~R~~~gl~------p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~ 409 (419) T protein:vir:14 343 LLRGDQSSRY-------AAYAVGRQWGWLSINDIRRLENMP------PVKGGDIYLSPMNMVDASKPQQLPVGKSEPTKA 409 (419) T ss_pred hhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhCCC------CCCCcCeeeeccccccccccccccCCCCCCccc Confidence 7777777764 455668999999999999988443 33211111001 0000 0000111112222222 Q ss_pred CCCC Q lcl|NC_019527. 513 EEGS 516 (516) Q Consensus 513 ~e~t 516 (516) .++- T Consensus 410 ~~~e 413 (419) T protein:vir:14 410 AIDE 413 (419) T ss_pred cccc Confidence 2221 No 50 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=99.87 E-value=1.3e-21 Score=135.12 Aligned_cols=459 Identities=13% Similarity=0.074 Sum_probs=221.2 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCc-cccccCCCCCCCccCCCcc-chhcccccc- Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDA-ATKWAPPQLMPGVVPAGTT-PAVAMDSLC- 77 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~gv~~~~~~-~~~a~ds~~- 77 (516) |=.+.-|- +.-.....+. .+...--.|..+..+. +..+.+. +.. -.-|+++.. T Consensus 1 ~~~~~~~~----~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~ 56 (574) T protein:vir:80 1 MPKWLDKA----LGIEKSSIEE-----------TRNMENYKMHLREIDTNVVNNEPY---------SMESIEKGMNGKTT 56 (574) T ss_pred Ccchhhhh----hccchhhHHH-----------HHhhhhhccccchhhhhhhhccCC---------CHHHHHHhHhhhcc Confidence 32222221 1110000000 0000000011111100 0011110 000 001111110 Q ss_pred --cchhhhcccccCCccc----ccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhC-----------CCeeeecccc-- Q lcl|NC_019527. 78 --GPTYQFLNSAAGGLYA----ADIQPFPGYQNLAALATRPEYRAFASTLSTELTRE-----------GIEITSKDRT-- 138 (516) Q Consensus 78 --~~~~~~~~~~~~~~~~----~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~-----------~~~i~~~~~~-- 138 (516) ...+...-....++.. .+...+ ..+++.|..+++++++|++.++.+.++ +|+|.-.+.+ T Consensus 57 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~ 134 (574) T protein:vir:80 57 AYMQPIIGEMSVNPGYKTKPSIRNSQDL--HKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAE 134 (574) T ss_pred cccchhhhhccccccccCcCccCCcccH--HHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCC Confidence 0000000000000000 011111 356888999999999999998877643 4555433221 Q ss_pred chhhhHHHHHHHHHHHHhcC--------hhHHHHHHHHh-cccceeeEEEEEecCCCcccCcccccccccccceeeEEee Q lcl|NC_019527. 139 KAKEMASKIKELEEACEYYG--------VMGIIQKAAEH-DCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNI 209 (516) Q Consensus 139 ~~~~~~~~i~~i~~~~~~l~--------~~~~l~ea~~~-~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~ 209 (516) ....+......+...+.... -+..|.+.+.. ..++|.+++++.-+. .|.+.+|.++ T Consensus 135 ~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~---------------~G~~~~L~pl 199 (574) T protein:vir:80 135 PTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDK---------------DGNFIKFDTV 199 (574) T ss_pred ccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEECC---------------CCcEEEEEEE Confidence 11111122233444444321 12235555554 467899888765432 2446789999 Q ss_pred cceeeccccccccccccccccCcceeEEe-e---eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHH Q lcl|NC_019527. 210 EPMWTSPSAYNALDPTAPDFYKPSTWWVL-G---REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTR 285 (516) Q Consensus 210 d~~~v~p~~~~~~dp~s~~yg~P~~y~v~-g---~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~ 285 (516) +|..|.+...+ +.... ...+.+|++. + ..+.++.||||..++.++. ...++|+|.++.+.+.|.....+. T Consensus 200 ~p~~V~v~~d~--~~~~~-~~~~~y~~~~~g~~~~~~~~~eiih~~~~~~~~~---~~~~~G~spi~~a~~~i~~~~~a~ 273 (574) T protein:vir:80 200 DPTTIFLATNG--EGKLI-KNGERFVQVIDNRIVAKFNERELAFAVRNPRADI---EVGQYGYPELEIALKQFIAHENTE 273 (574) T ss_pred cCceeEEEEcC--ccccc-cCceEEEEEeCCceEEEEccccEEEEeccCCCCc---ccccccccHHHHHHHHHHHHHHHH Confidence 99998864321 11110 1113344443 2 3568899999987665442 234579999999999999999999 Q ss_pred HHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceE-EEecCCcceeEEecccCC--HHHHHHHHH Q lcl|NC_019527. 286 QSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLA-VMDFDSEDIVQVNTPLSG--LADLQSQSQ 360 (516) Q Consensus 286 ~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~-~id~~~e~~e~~~~~lsg--l~d~~~~~~ 360 (516) .....++.+.... ++++.....++....+.+.+.++.......|.|.+ ++..++-+|..++.+..+ +-+...+.. T Consensus 274 ~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s~~D~qfle~~~~~~ 353 (574) T protein:vir:80 274 VFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPSANDMQFEKWLNYLI 353 (574) T ss_pred HHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCChhHHHHHHHHHHHH Confidence 9999999987653 35554333344433334444444433334566643 443445677777655543 345566688 Q ss_pred HHHHhhhcCCceeeecccccc-ccccchH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCC Q lcl|NC_019527. 361 EHMCSVSKIPAIKLTGISPSG-LNASSEG--EIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQ 437 (516) Q Consensus 361 ~~iaaas~IP~t~L~G~sp~G-lnatge~--D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~ 437 (516) +.||.+.+||-.+| |....+ +.+++.+ ...+.-.....+.+..|.|.+..+-..|-+..+-.....+.|+|+.... T Consensus 354 ~~Ia~afgVPp~~l-G~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~f~~~d~ 432 (574) T protein:vir:80 354 NVISALYGIDPAEI-NFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAEFGEKYQFQFRGGDL 432 (574) T ss_pred HHHHHHhCCCHHHh-cccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCceEEEecccch Confidence 89999999998655 543222 2222211 1122223333455566889888888777665544444568889987654 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCC----C-----CChhhhcccccc----------chhc Q lcl|NC_019527. 438 TSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWD----N-----IDGDLEIVQPEM----------FDDD 498 (516) Q Consensus 438 ~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~----~-----~d~~~e~~~~e~----------~~~e 498 (516) .+..+++. ...++.+|++|++|+|+.++..+..+-. + ++........+. .... T Consensus 433 ~~~~~~~~--------~~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (574) T protein:vir:80 433 SAQLDKLK--------IIEQEGKVFRTVNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELS 504 (574) T ss_pred hhHHHHHH--------HHHHHhCCccCHHHHHHHhCCCCCCCCCEeeeccceeecccccccccCCccchhcccccccccc Confidence 44433332 1346789999999999998665432211 0 110000000000 0000 Q ss_pred CCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 499 GADPYMPDPDVLPGEEGS 516 (516) Q Consensus 499 ~~~~~~~~~~~~~~~e~t 516 (516) +.++.. ++.++|..+.+ T Consensus 505 ~~~~~~-~~~~~p~~~~~ 521 (574) T protein:vir:80 505 GGDVEQ-PEPEEPKDSQN 521 (574) T ss_pred CCCCCC-CCCCCCCCccc Confidence 011111 11111111111 No 51 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=99.87 E-value=5.8e-22 Score=136.99 Aligned_cols=398 Identities=11% Similarity=0.035 Sum_probs=213.5 Q ss_pred cccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCe Q lcl|NC_019527. 52 KWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIE 131 (516) Q Consensus 52 ~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~ 131 (516) .|-.+...... +.. ....++-....++...+. +........++ +++.+.+||+++|+++.+-.+. T Consensus 1 m~~~~~~~~~~---~~~-~~~~~~~~~~~~g~~~s~----------~~~~v~~~~al-~~~~v~~cv~~ia~~ia~lp~~ 65 (419) T protein:vir:80 1 MFFSRQLLSNL---GQT-QPGSGGWVSALLGSARSE----------AGQVVTPASAL-SLTVLQNCVTLLAESIAQLPVE 65 (419) T ss_pred CCccccccccc---CcC-CCCcchhhHHhhcccccc----------cCcccChHHhh-ccHHHHHHHHHHHHhhccCceE Confidence 12111000000 000 000011001111000000 00001111222 5778999999999999999988 Q ss_pred eeeccccchhh--hHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccceeeEEe Q lcl|NC_019527. 132 ITSKDRTKAKE--MASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSN 208 (516) Q Consensus 132 i~~~~~~~~~~--~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v 208 (516) +.-.+.+.... .......|...-....-...|.+.+.+. .++|.|++++.-++ .|.+.+|.+ T Consensus 66 ~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~---------------~G~~~~L~~ 130 (419) T protein:vir:80 66 LYERSGDDRKPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQ---------------DGVIQGLYP 130 (419) T ss_pred EEEecCCCcccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---------------CCcEEEEEE Confidence 75433322211 1112233444444455566777777766 56788888774321 245678999 Q ss_pred ecceeeccccccccccccccccCcceeEEee-eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 209 IEPMWTSPSAYNALDPTAPDFYKPSTWWVLG-REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQS 287 (516) Q Consensus 209 ~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g-~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~ 287 (516) ++|.+|++.... | +.+ .|++.+ ..++++.|+|+.... ...++|.|.++.+.+.|.....+... T Consensus 131 i~~~~v~i~~~~--~------~~~-~y~~~~~~~~~~~~i~h~~~~~-------~d~~~G~s~i~~~~~~i~~~~~~~~~ 194 (419) T protein:vir:80 131 LDNEAVTVMKGP--D------LKP-MYRVAGADPLPQRLVHHVRWMS-------INGYTGLSPVLLHANAIGHAQAIQQY 194 (419) T ss_pred ecCceEEEEECC--C------ceE-EEEEcCccccchhheEEecCCC-------CCCcccccHHHHHHHHHHHHHHHHHH Confidence 999988753211 1 222 456655 457888888887543 23567999999999999999999999 Q ss_pred HHHHHHHhCCc--eeeecc--hhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCC--HHHHHHHHHH Q lcl|NC_019527. 288 VSDLVDKFSRT--FLKTNM--AQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSG--LADLQSQSQE 361 (516) Q Consensus 288 ~~~Ll~~~~~~--v~k~~~--~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsg--l~d~~~~~~~ 361 (516) ...++.+.... +++++. ...++....+++.+.++......+|.|..++..++.+|+.++.+..+ +-+......+ T Consensus 195 ~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~ 274 (419) T protein:vir:80 195 AGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSAL 274 (419) T ss_pred HHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHH Confidence 99988886654 445431 11111111122333333333333454544433445788888766543 4566778889 Q ss_pred HHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-CCcceEEe--CCCCCC Q lcl|NC_019527. 362 HMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI-DDAITFKF--KSLWQT 438 (516) Q Consensus 362 ~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~-~~d~~~~f--~pL~~~ 438 (516) +||.+.+||..+| |..-++-.++.|.....||.. .|.|.+..+-..|-+..+-.. ..++.|+| ..|... T Consensus 275 ~Ia~~fgVPp~ll-g~~~~~t~~n~e~~~~~f~~~-------~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~~~ 346 (419) T protein:vir:80 275 DIARIYKIPAHMV-NELERATFSNIEHQSLQFVIY-------TLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRG 346 (419) T ss_pred HHHHHhCCCHHHh-cCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhcc Confidence 9999999998665 544344455555555555544 377888888777765444221 12444554 567777 Q ss_pred CHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccc-cchhcC-CCCCCCCCCCCCCCCCC Q lcl|NC_019527. 439 SAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPE-MFDDDG-ADPYMPDPDVLPGEEGS 516 (516) Q Consensus 439 sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e-~~~~e~-~~~~~~~~~~~~~~e~t 516 (516) |.+++++ ++.+++++|++|++|+|+.++. ++++.-.+...+- ...... .+.++.++++....-+. T Consensus 347 d~~~~~~-------~~~~~~~~G~~T~NE~R~~~g~------~p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~ 413 (419) T protein:vir:80 347 DQSSRYA-------AYAVGRQWGWLSINDIRRLENM------PPVKGGDIYLSPMNMVDASKPQPIPMGKTEPTKAALDE 413 (419) T ss_pred CHHHHHH-------HHHHHHhCCCcCHHHHHHHhCC------CCCCCcceeeeccccccccccccccCCCCCchhhhHHH Confidence 7777655 5566899999999999998744 4442111111110 000000 00011111111100000 No 52 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=99.87 E-value=4.1e-22 Score=137.85 Aligned_cols=393 Identities=15% Similarity=0.140 Sum_probs=207.4 Q ss_pred hhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhC Q lcl|NC_019527. 32 LAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATR 111 (516) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~ 111 (516) |.++ +-..+ ..++ .|. ...+. .....+....+..+|. .+..+ T Consensus 1 Mg~~----~~f~~-k~~~-------------~~~-----~~~~~------------~~~~~~~~~~~~~~~~---~~~~~ 42 (403) T protein:vir:80 1 MGLF----NFFRR-KTRS-------------EPT-----NAISW------------FLTQEAYDTLAIPGYT---RLSDN 42 (403) T ss_pred Cccc----ccccc-cccc-------------ccc-----chhhh------------hcccccccccccchhh---hhhhh Confidence 1111 00000 0000 000 00000 0000111111222332 24457 Q ss_pred chhhhhhhhhhHHHhhCCCeeeeccccc-hhhhHHHHHHHHHHHHhcChhHHHHHHHHhccc-c--eeeEEEEEecCCCc Q lcl|NC_019527. 112 PEYRAFASTLSTELTREGIEITSKDRTK-AKEMASKIKELEEACEYYGVMGIIQKAAEHDCF-F--GRGQISINIKGADV 187 (516) Q Consensus 112 ~i~r~iVd~~aed~~r~~~~i~~~~~~~-~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rl-y--G~a~i~i~i~~~~~ 187 (516) +.++++|+.+|+++-+-.+.+--..++. ..........|...-..+..+..|.+.+.+..+ . |.|++++.-+ T Consensus 43 ~~V~~~I~~ia~~iA~~p~~~~~~~~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~---- 118 (403) T protein:vir:80 43 PEVRMAVHKIAELISSMTIHLMQNTDNGDIRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYT---- 118 (403) T ss_pred HHHHHHHHHHHHhhhhCceEEEEecCCceeecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEc---- Confidence 8899999999999999888874332221 111222233344344444566678888777654 3 5566665332 Q ss_pred ccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCC Q lcl|NC_019527. 188 SVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSG 267 (516) Q Consensus 188 ~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G 267 (516) ..|.+..|.+++|..|++... .+ | ..+++.++.+-++.||||..... +...+.| T Consensus 119 -----------~~g~~~~L~~l~p~~v~~~~~--~~------g--~~~~y~~~~~~~~eiih~~~~~~-----~~~~~~G 172 (403) T protein:vir:80 119 -----------TSGLIDELIPLAPSKVSFVDT--DT------G--YQIWYQGKAYNYDEVLHFIVNPD-----PEKPYMG 172 (403) T ss_pred -----------CCCcEEEEEEEcCCeeEEEEc--CC------c--eEEEEeecccchhhEEEEeccCC-----CcCcccc Confidence 134566788888888875321 11 1 12334567788899999874332 2234569 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCc-ceEEEecCCcceeEEe Q lcl|NC_019527. 268 ISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNL-GLAVMDFDSEDIVQVN 346 (516) Q Consensus 268 ~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~-g~~~id~~~e~~e~~~ 346 (516) .|.++.+.+.|.....+......++.+....-........+.....++..+++........|. +..++.....+++++. T Consensus 173 ~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 252 (403) T protein:vir:80 173 RGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLEASEAGQPWIIPAELLDVEQVK 252 (403) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHHHHHHHHHHHhhhhhcCCeeeecccccccceec Confidence 999999999999999888888888887765433222223344433344444443222222333 4455544444555443 Q ss_pred -cccC--CHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019527. 347 -TPLS--GLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE 423 (516) Q Consensus 347 -~~ls--gl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~ 423 (516) .+.. .+.+.......+||.+.+||..+| |. +..++....+||. ..|.|.++.|-..|-+..+ T Consensus 253 ~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~-----~~~~~~~~~~f~~-------~~l~P~~~~ie~~l~~kll-- 317 (403) T protein:vir:80 253 PLSLKDLAIHETVELDKRTVAGIFGVPAFLL-GV-----GKYDKDEYNNFIN-------STILPIAKGIEQELTRKLL-- 317 (403) T ss_pred cCCHHHHHHHHHHHHhHHHHHHHhCCCHHHc-CC-----CCccHHHHHHHHH-------HHHHHHHHHHHHHHHHhcc-- Confidence 2332 334566677788999999998665 53 2222333445543 3588999999887776554 Q ss_pred cCCcceEEeC--CCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCCC Q lcl|NC_019527. 424 IDDAITFKFK--SLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGAD 501 (516) Q Consensus 424 ~~~d~~~~f~--pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~~ 501 (516) .+.++.|+|+ .|...|.+++++ ++.+++++|++|++|+|+.+...+..+-+.+-. .....+.....+... T Consensus 318 ~~~~~~~~f~~~~ll~~d~~~~~~-------~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd~~~~-~~n~~pl~~~~~~~~ 389 (403) T protein:vir:80 318 ISPDLYFKFNPRSLYAYDLKELAE-------VGSNMYVRGLMEGNEVRDWLGLSPKEGLSELVI-LENYIPLDKIGDQNK 389 (403) T ss_pred CCCCcEEEeechhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEee-cccccchhhccchhh Confidence 3456777775 677777777654 556689999999999999985443211110000 000101000011111 Q ss_pred CCCCCCCCCCCCCC Q lcl|NC_019527. 502 PYMPDPDVLPGEEG 515 (516) Q Consensus 502 ~~~~~~~~~~~~e~ 515 (516) ..+++.+...+... T Consensus 390 ~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 390 LKGGEKGGADGQTD 403 (403) T ss_pred ccCCCCCCCCCCCC Confidence 11222222222111 No 53 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=99.87 E-value=6.8e-22 Score=136.62 Aligned_cols=389 Identities=11% Similarity=0.038 Sum_probs=205.3 Q ss_pred hhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCc Q lcl|NC_019527. 33 AMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRP 112 (516) Q Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~ 112 (516) |.....+... ...-...+... . . ........|. ..+|.|+. ...+.+++ T Consensus 1 ~~~~~~~~~~---k~~~~~~~~~~---------~----~-------------~~~~~~~~~~-~~~~~~v~-~~~a~~~~ 49 (409) T protein:vir:96 1 MAKENIVTRI---KKKLIDNWIDQ---------S----A-------------SKLYDFSPWK-NKSFWGVI-NNTLETNE 49 (409) T ss_pred Cccccchhhh---hhHHhhhhhcc---------c----c-------------cccccccccc-Cccccccc-hhhHhhhH Confidence 1111111111 00000000000 0 0 0000000011 11233322 22466778 Q ss_pred hhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCc Q lcl|NC_019527. 113 EYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPL 191 (516) Q Consensus 113 i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl 191 (516) .+.++|+.+|+++-+..+.+.-..+..+ ......|...-....-...|.+.+.+. .++|.|++++.-+. T Consensus 50 ~V~~ci~~ia~~ia~lp~~~~~~~~~~~---~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~------- 119 (409) T protein:vir:96 50 TIFSAITKLSNSMASLPLKMYEDYKVVN---TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI------- 119 (409) T ss_pred HHHHHHHHHHHhhhhCceEEeecccccc---hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECC------- Confidence 8999999999999998888754332221 122333444445555566666666655 66888888875321 Q ss_pred ccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe---e--eEeccceEEEecCCcchhhhhhccCCC Q lcl|NC_019527. 192 ILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL---G--REMHASRLLTIITRPLPDMLKPAYNFS 266 (516) Q Consensus 192 ~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~---g--~~iH~SRli~~~~~~~p~~~k~~~~~~ 266 (516) .|.+.+|.+++|.+|+..... .+.+.+|.+. | ..+.++.||||.+.. +...+. T Consensus 120 --------~G~~~~L~~l~~~~v~v~~~~--------~~~~~~y~~~~~~g~~~~~~~~evih~r~~~------~~~~~~ 177 (409) T protein:vir:96 120 --------YHQPSKLFLLNPDVVEMLIEN--------QSRELYYSIHAATGNKLIVHNMDMLHFKHIV------ASNMVQ 177 (409) T ss_pred --------CCcEEEEEEEcCceeEEEEeC--------CCcEEEEEEEcCCceEEEEccccEEEeCCCC------CCCccc Confidence 234567888998888753321 1223344443 2 357889999997532 234567 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhCC-ceeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeE Q lcl|NC_019527. 267 GISMSQLAQPYVENWLRTRQSVSDLVDKFSR-TFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQ 344 (516) Q Consensus 267 G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~-~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~ 344 (516) |.|.++.+.+.+.....+.... +++... ..+.......++.+ +..+..+.|....+|.+ +++++ ++.+|+. T Consensus 178 G~s~l~~~~~~i~~~~~~~~~~---~~~~~~~~~~i~~~~~~l~~e---~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~ 250 (409) T protein:vir:96 178 GISPIDVLKNTTDFDNAVRTFN---LTEMQKPDSFMLKYGSNVSTE---KRQQVLEDFKQYYEENGGILFQE-PGVEIEP 250 (409) T ss_pred cccHHHHHHHHHHHHHHHHHHH---HHhcCCCceeEEecCCCCCHH---HHHHHHHHHHHHhhcCCCeeecC-CCceEEE Confidence 9999998887776555444332 222222 11111112233333 33333344444444544 55554 4578998 Q ss_pred EecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019527. 345 VNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWG 422 (516) Q Consensus 345 ~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g 422 (516) ++.+... +-+......++||.+.+||-.+|.+ ...+-.++.|.....||.. .|.|.++.|...|-+..+. T Consensus 251 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~-~~~~~~s~~e~~~~~f~~~-------~l~P~~~~ie~~l~~~Ll~ 322 (409) T protein:vir:96 251 LPKKYVSEDIVASENLTRERVANVFQLPSIFLNA-RSNTNFAKNEELNRFYLQH-------TLLPIVKQYEEEFNRKLLT 322 (409) T ss_pred cCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCC-CCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCC Confidence 8766553 3445666778999999999877643 3233334455555555543 4889999998888776654 Q ss_pred Cc--CCcceEEeC--CCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccc-----c- Q lcl|NC_019527. 423 EI--DDAITFKFK--SLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQ-----P- 492 (516) Q Consensus 423 ~~--~~d~~~~f~--pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~-----~- 492 (516) .. ..+..|+|+ .|...|.+++++ ++.+++++|++|++|+|+.++. ++++...+... + T Consensus 323 ~~~~~~g~~i~fd~~~ll~~d~~~~~e-------~~~~~~~~G~~T~NE~R~~~g~------~pi~ggD~~~~~~n~~~~ 389 (409) T protein:vir:96 323 KTDREKNRYFKFNVKSYLRADSATQAE-------VYFKAVRSGYYTINDIREWEDL------PPVEGGDKPLISGDLYPI 389 (409) T ss_pred cccccCcceEEeechhhhccCHHHHHH-------HHHHHHhCCCCCHHHHHHHhCC------CCCCCcceeeeccccccc Confidence 32 235566664 777777777654 5567899999999999999854 33321111110 0 Q ss_pred ccchhcCCCCCCCCCCCCCC Q lcl|NC_019527. 493 EMFDDDGADPYMPDPDVLPG 512 (516) Q Consensus 493 e~~~~e~~~~~~~~~~~~~~ 512 (516) +...+......+++.+...+ T Consensus 390 ~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:96 390 DTPLELRKSLKGGDKNVNES 409 (409) T ss_pred ccchhhcccccCCCCCcCCC Confidence 11111111112222222222 No 54 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=99.87 E-value=3.1e-22 Score=138.49 Aligned_cols=390 Identities=15% Similarity=0.124 Sum_probs=207.7 Q ss_pred CCccCCCccchhc-ccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeecccc Q lcl|NC_019527. 60 PGVVPAGTTPAVA-MDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRT 138 (516) Q Consensus 60 ~gv~~~~~~~~~a-~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~ 138 (516) =|..-..+....+ .+....+......+..+ .+...|.-..++ +++-+.++|+.+|+++-+..+.+...+.. T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~al-~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~ 72 (416) T protein:vir:45 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQG-------TKLRQYKDIEAI-RHSDIFTAVMMIASDLARMPIRVTVNGQI 72 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhccccc-------cCccccchhhhh-cchHHHHHHHHHHHhhccCceEEecCccc Confidence 0111101000000 00000000010000000 001111112222 33445678999999999988877543332 Q ss_pred chhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccc Q lcl|NC_019527. 139 KAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPS 217 (516) Q Consensus 139 ~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~ 217 (516) ..+ ......|...-..+.-...|.+++.+. .++|.|++++..++ .|.+.+|.+++|.+|++. T Consensus 73 ~~~--~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~---------------~G~~~~L~~i~~~~v~v~ 135 (416) T protein:vir:45 73 NYS--DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDK---------------TGEPMNLTFRKTSEIELK 135 (416) T ss_pred ccc--chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---------------CCcEEEEEEEcCceeEEE Confidence 211 111222333444445566777777766 56889888875431 244567888888888753 Q ss_pred cccccccccccccCcceeEE--e------eeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 218 AYNALDPTAPDFYKPSTWWV--L------GREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVS 289 (516) Q Consensus 218 ~~~~~dp~s~~yg~P~~y~v--~------g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~ 289 (516) .- +.|.+.++.. . .+.++++.|||+...+ ...+.|.|.++.+.+.|.....+..... T Consensus 136 ~~--------~~g~~~~~~~~~~~~~~~~~~~~~~~evihir~~~-------~d~~~G~s~i~~~~~~i~~~~~~~~~~~ 200 (416) T protein:vir:45 136 SD--------ARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYS-------LDGINGLSLLDTLSRTIESDNNGKDFLN 200 (416) T ss_pred EC--------CCccEEEEEEEecCCCceeEEEEccccEEEeccCC-------CCCccccCHHHHHHHHHHHHHHHHHHHH Confidence 21 1233333322 1 1468999999997543 2346799999999999999999999999 Q ss_pred HHHHHhCCc--eeeecchhhhcCcc-HHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEecccCC--HHHHHHHHHHHH Q lcl|NC_019527. 290 DLVDKFSRT--FLKTNMAQVLNGGE-GGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVNTPLSG--LADLQSQSQEHM 363 (516) Q Consensus 290 ~Ll~~~~~~--v~k~~~~~~l~~~~-~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~~~lsg--l~d~~~~~~~~i 363 (516) .++.+.... +++++ ..+.... .+++.++++.......|.| +++++ ++.+|+.++.+... +-+......+.| T Consensus 201 ~~f~ng~~~~gil~~~--~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~~I 277 (416) T protein:vir:45 201 NFLRNGTHAGGILKMK--GVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD-ESMTFDQLEVDTEVLKLIRENKSSTREI 277 (416) T ss_pred HHHhccCCCcEEEEeC--CCCCCHHHHHHHHHHHHHHhcCccccCceeecC-CCceeEeccCCHHHHHHHHHHHHHHHHH Confidence 998887653 44543 2332221 1234444444433334444 55565 45789988766543 445567777899 Q ss_pred HhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-CCcceEEeCCCCCCCHHH Q lcl|NC_019527. 364 CSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI-DDAITFKFKSLWQTSAKE 442 (516) Q Consensus 364 aaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~-~~d~~~~f~pL~~~sekE 442 (516) |.+.+||..+| |...++.+ ..+...+|. ..|.|.+..|-..+-+..+..- .-.|+|.++.|...|.++ T Consensus 278 a~~fgVPp~~l-g~~~~~~~---~~~~~~~~~-------~~l~P~~~~ie~~ln~~l~~~~~~~~~~f~~~~l~~~D~~~ 346 (416) T protein:vir:45 278 AGVFGIPLHKF-GIETANMS---ITDANLDYL-------STLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKT 346 (416) T ss_pred HHHhCCCHHHc-CCCCCCcc---HHHHHHHHH-------HHHHHHHHHHHHHHhhhccccccCceEEEechhhhccCHHH Confidence 99999998754 65544432 222222222 2478888888777765443221 124666677888888888 Q ss_pred HHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhh-------cccc-ccchhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 443 ESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLE-------IVQP-EMFDDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 443 kAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e-------~~~~-e~~~~e~~~~~~~~~~~~~~~e 514 (516) +++ ++..++++|++|++|+|+.+... +++.... ...+ +..+..............+|+| T Consensus 347 ~~~-------~~~~~~~~G~~T~NE~R~~~gl~------p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe 413 (416) T protein:vir:45 347 QAE-------IDKINIDSGKMNIDEIRQRDGLA------PIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGE 413 (416) T ss_pred HHH-------HHHHHHhCCCcCHHHHHHHhCCC------CCCCCCcceEeecccccccccccccCcccccccccccCCCC Confidence 654 55668999999999999998543 3321110 0000 1110000000000111112222 Q ss_pred CC Q lcl|NC_019527. 515 GS 516 (516) Q Consensus 515 ~t 516 (516) .= T Consensus 414 ~n 415 (416) T protein:vir:45 414 EN 415 (416) T ss_pred CC Confidence 11 No 55 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=99.87 E-value=3.1e-22 Score=138.49 Aligned_cols=390 Identities=15% Similarity=0.124 Sum_probs=207.7 Q ss_pred CCccCCCccchhc-ccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeecccc Q lcl|NC_019527. 60 PGVVPAGTTPAVA-MDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRT 138 (516) Q Consensus 60 ~gv~~~~~~~~~a-~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~ 138 (516) =|..-..+....+ .+....+......+..+ .+...|.-..++ +++-+.++|+.+|+++-+..+.+...+.. T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~al-~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~ 72 (416) T protein:vir:81 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQG-------TKLRQYKDIEAI-RHSDIFTAVMMIASDLARMPIRVTVNGQI 72 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhccccc-------cCccccchhhhh-cchHHHHHHHHHHHhhccCceEEecCccc Confidence 0111101000000 00000000010000000 001111112222 33445678999999999988877543332 Q ss_pred chhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccc Q lcl|NC_019527. 139 KAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPS 217 (516) Q Consensus 139 ~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~ 217 (516) ..+ ......|...-..+.-...|.+++.+. .++|.|++++..++ .|.+.+|.+++|.+|++. T Consensus 73 ~~~--~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~---------------~G~~~~L~~i~~~~v~v~ 135 (416) T protein:vir:81 73 NYS--DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDK---------------TGEPMNLTFRKTSEIELK 135 (416) T ss_pred ccc--chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---------------CCcEEEEEEEcCceeEEE Confidence 211 111222333444445566777777766 56889888875431 244567888888888753 Q ss_pred cccccccccccccCcceeEE--e------eeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 218 AYNALDPTAPDFYKPSTWWV--L------GREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVS 289 (516) Q Consensus 218 ~~~~~dp~s~~yg~P~~y~v--~------g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~ 289 (516) .- +.|.+.++.. . .+.++++.|||+...+ ...+.|.|.++.+.+.|.....+..... T Consensus 136 ~~--------~~g~~~~~~~~~~~~~~~~~~~~~~~evihir~~~-------~d~~~G~s~i~~~~~~i~~~~~~~~~~~ 200 (416) T protein:vir:81 136 SD--------ARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYS-------LDGINGLSLLDTLSRTIESDNNGKDFLN 200 (416) T ss_pred EC--------CCccEEEEEEEecCCCceeEEEEccccEEEeccCC-------CCCccccCHHHHHHHHHHHHHHHHHHHH Confidence 21 1233333322 1 1468999999997543 2346799999999999999999999999 Q ss_pred HHHHHhCCc--eeeecchhhhcCcc-HHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEecccCC--HHHHHHHHHHHH Q lcl|NC_019527. 290 DLVDKFSRT--FLKTNMAQVLNGGE-GGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVNTPLSG--LADLQSQSQEHM 363 (516) Q Consensus 290 ~Ll~~~~~~--v~k~~~~~~l~~~~-~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~~~lsg--l~d~~~~~~~~i 363 (516) .++.+.... +++++ ..+.... .+++.++++.......|.| +++++ ++.+|+.++.+... +-+......+.| T Consensus 201 ~~f~ng~~~~gil~~~--~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~~I 277 (416) T protein:vir:81 201 NFLRNGTHAGGILKMK--GVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD-ESMTFDQLEVDTEVLKLIRENKSSTREI 277 (416) T ss_pred HHHhccCCCcEEEEeC--CCCCCHHHHHHHHHHHHHHhcCccccCceeecC-CCceeEeccCCHHHHHHHHHHHHHHHHH Confidence 998887653 44543 2332221 1234444444433334444 55565 45789988766543 445567777899 Q ss_pred HhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-CCcceEEeCCCCCCCHHH Q lcl|NC_019527. 364 CSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI-DDAITFKFKSLWQTSAKE 442 (516) Q Consensus 364 aaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~-~~d~~~~f~pL~~~sekE 442 (516) |.+.+||..+| |...++.+ ..+...+|. ..|.|.+..|-..+-+..+..- .-.|+|.++.|...|.++ T Consensus 278 a~~fgVPp~~l-g~~~~~~~---~~~~~~~~~-------~~l~P~~~~ie~~ln~~l~~~~~~~~~~f~~~~l~~~D~~~ 346 (416) T protein:vir:81 278 AGVFGIPLHKF-GIETANMS---ITDANLDYL-------STLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKT 346 (416) T ss_pred HHHhCCCHHHc-CCCCCCcc---HHHHHHHHH-------HHHHHHHHHHHHHHhhhccccccCceEEEechhhhccCHHH Confidence 99999998754 65544432 222222222 2478888888777765443221 124666677888888888 Q ss_pred HHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhh-------cccc-ccchhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 443 ESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLE-------IVQP-EMFDDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 443 kAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e-------~~~~-e~~~~e~~~~~~~~~~~~~~~e 514 (516) +++ ++..++++|++|++|+|+.+... +++.... ...+ +..+..............+|+| T Consensus 347 ~~~-------~~~~~~~~G~~T~NE~R~~~gl~------p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe 413 (416) T protein:vir:81 347 QAE-------IDKINIDSGKMNIDEIRQRDGLA------PIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGE 413 (416) T ss_pred HHH-------HHHHHHhCCCcCHHHHHHHhCCC------CCCCCCcceEeecccccccccccccCcccccccccccCCCC Confidence 654 55668999999999999998543 3321110 0000 1110000000000111112222 Q ss_pred CC Q lcl|NC_019527. 515 GS 516 (516) Q Consensus 515 ~t 516 (516) .= T Consensus 414 ~n 415 (416) T protein:vir:81 414 EN 415 (416) T ss_pred CC Confidence 11 No 56 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=99.87 E-value=4.1e-22 Score=137.84 Aligned_cols=397 Identities=11% Similarity=0.094 Sum_probs=206.0 Q ss_pred HHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhh Q lcl|NC_019527. 36 RAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYR 115 (516) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r 115 (516) =.+...+..++.. +..++++ -+.+. ...+. ......-.+.+.|..++-++ T Consensus 1 Mg~~~~~~~~~~~-------~~~~~~~---------~~~~~----------~~~~~----~~~~~~~~~~~~~~~~~~v~ 50 (423) T protein:vir:81 1 MGFLQKLGLAPSV-------VATPEPI---------ELVGP----------IFESL----KLSTKNMTVEQIWEDQPHLR 50 (423) T ss_pred CchhHhhcccccc-------ccCcccc---------ccccc----------ccccc----ccccchhhHHHHHHhhhHHH Confidence 0111111000000 0000000 00000 00000 00001124566778899999 Q ss_pred hhhhhhhHHHhhCCCeee-eccccch-hhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcc Q lcl|NC_019527. 116 AFASTLSTELTREGIEIT-SKDRTKA-KEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLI 192 (516) Q Consensus 116 ~iVd~~aed~~r~~~~i~-~~~~~~~-~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ 192 (516) +||+.+++++-+-.+.+- ...++.. .....-+..|-..-..+..+..|.+++.+. .++|.+++++..+.... T Consensus 51 ~~i~~ia~~ia~lp~~~~~~~~dg~~~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~----- 125 (423) T protein:vir:81 51 TVTTFIARNVASLQLQAFERVEDGGRERVREGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVD----- 125 (423) T ss_pred HHHHHHHHhHhhCceEEEEEecCCceeeeccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcC----- Confidence 999999999999988872 2222211 111111222222333444666777666655 56788888875543211 Q ss_pred cccccccccceeeEEeecceeeccccccccccccccccCcceeEE------ee--eEeccceEEEecCCcchhhhhhccC Q lcl|NC_019527. 193 LDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV------LG--REMHASRLLTIITRPLPDMLKPAYN 264 (516) Q Consensus 193 ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v------~g--~~iH~SRli~~~~~~~p~~~k~~~~ 264 (516) +.+..+.++.+.++.+... .|. .+.. .|++ .| ..++++.|||+.+.. +... T Consensus 126 --------~~~~~l~p~~~~~v~~~~~--~~~----~~~~-~Y~~~~~~~~~g~~~~~~~~evih~r~~~------~~~~ 184 (423) T protein:vir:81 126 --------TPTLDIRPIPVSWVQRRAY--KDG----WGSL-DYIIIESGDNDGRSVKVPGERVIHRHGYN------PKTM 184 (423) T ss_pred --------cceEEEeecccceeeeeec--cCC----Ccce-EEEEEEecCCCceEEEEcccceEEecCCC------CCCc Confidence 1222344444444433221 121 1222 2222 23 357899999997432 2223 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchh---hhcCccHHHHHHHHHHHH-HhcCCcce-EEEec Q lcl|NC_019527. 265 FSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQ---VLNGGEGGDVFDRVEMYV-NMQSNLGL-AVMDF 337 (516) Q Consensus 265 ~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~---~l~~~~~~~l~~r~~~~~-~~~sn~g~-~~id~ 337 (516) +.|.|.++.+.+.|.....+......++.+.... +++++... .++....+++.++++... ...+|.|. ++++ T Consensus 185 ~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~- 263 (423) T protein:vir:81 185 KRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLE- 263 (423) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcceecC- Confidence 4699999999999999999999999998886543 45543221 222222233444443322 23355554 4554 Q ss_pred CCcceeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 338 DSEDIVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKV 415 (516) Q Consensus 338 ~~e~~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~ 415 (516) ++.+|+.++.+..+ +-+...+...+||.+.+||..+ +|...++-.++.|...+.||. ..|.|.+..+-+. T Consensus 264 ~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~-lg~~~~~t~sn~e~~~~~f~~-------~~L~P~~~~ie~~ 335 (423) T protein:vir:81 264 DGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTM-VGQLDNANYSNVREFRKALYG-------DNLGSWIRIIQDV 335 (423) T ss_pred CCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHH-hcCCCCCCcccHHHHHHHHHH-------HHHHHHHHHHHHH Confidence 45788888765543 2334556678899999999754 476544444444444445544 3477888887777 Q ss_pred HHHHhCCCc---CCcceEEe--CCCCCCCHHHHHHHHHHHHHHHHHHH-HcCCCCHHHHHHHHHhhhccCCCCCChhhhc Q lcl|NC_019527. 416 IQLSKWGEI---DDAITFKF--KSLWQTSAKEESEIRFNKAQEAQIYI-TNSVIDPSEARQQLSDDPDSGWDNIDGDLEI 489 (516) Q Consensus 416 l~~s~~g~~---~~d~~~~f--~pL~~~sekEkAei~~~~a~a~~~~~-~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~ 489 (516) +-+..+... ..++.|+| +.|...|-+++++. +++++ +.|++|++|+|+.+ ++++++.-.+. T Consensus 336 l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~-------~~~~l~~~G~~T~NE~R~~~------gl~p~~gGD~~ 402 (423) T protein:vir:81 336 MNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEI-------KRAAVGNVAWMTINEVRAMD------NLPSIDGGDDL 402 (423) T ss_pred HhhhhcCccccccCccEEEecchhhhccCHHHHHHH-------HHHHHhCCCCcCHHHHHHHh------CCCCCCCccee Confidence 766554332 23455555 57877787776654 34455 46999999999987 44444322222 Q ss_pred cccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 490 VQPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 490 ~~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) ..+.... ++..++....++.| T Consensus 403 ~~p~n~~------~~~~~~~~~~~~~t 423 (423) T protein:vir:81 403 ARPLNTE------FGDSEDAPGEEVET 423 (423) T ss_pred ecccccc------cCccCCCCCCCCCC Confidence 2221110 11111111222222 No 57 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=99.87 E-value=1.1e-21 Score=135.45 Aligned_cols=392 Identities=10% Similarity=0.003 Sum_probs=207.3 Q ss_pred HhhcCC-CccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhh Q lcl|NC_019527. 42 MERRAS-DAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFAST 120 (516) Q Consensus 42 ~~~~~~-~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~ 120 (516) |.+-.. .......++..-+-+.... ..+ .....+. ..+|.++ ....+.+++.+.++|+. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~--~~~----------------~~~~~~~-~~~~~~v-~~~~a~~~~~v~~~i~~ 60 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWIDQST--SKL----------------YDFSPWK-NRSFWGV-INNTLETNETIFSAITK 60 (412) T ss_pred CccchhhhhhhhhhhhHhhhhhcccc--ccc----------------ccccccC-Ccccccc-chhhhhccHHHHHHHHH Confidence 322211 1112222221111100000 000 0000000 1122222 12456688889999999 Q ss_pred hhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHh-cccceeeEEEEEecCCCcccCccccccccc Q lcl|NC_019527. 121 LSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEH-DCFFGRGQISINIKGADVSVPLILDPRTIK 199 (516) Q Consensus 121 ~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~-~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~ 199 (516) +|+++-+..+.+.-..+..+. .....|...-.....+..|.+.+.. -.++|.+++++.-+. T Consensus 61 ia~~iA~lp~~~~~~~~~~~~---~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~--------------- 122 (412) T protein:vir:26 61 LSNSMASLPLKMYEDYKVVNT---EVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI--------------- 122 (412) T ss_pred HHHhHhhCceeEeeccccccc---hHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECC--------------- Confidence 999999998887543332221 1223344445555566666665554 477898888875431 Q ss_pred ccceeeEEeecceeeccccccccccccccccCcceeEEee-----eEeccceEEEecCCcchhhhhhccCCCCchHHHHH Q lcl|NC_019527. 200 KGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG-----REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLA 274 (516) Q Consensus 200 ~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g-----~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~ 274 (516) .|.+..|.++.|.+|++..... +.+.+|++.. ..+.++.|+||.+.. +...+.|.|.++.+ T Consensus 123 ~G~~~~L~~l~~~~v~v~~~~~--------~~~~~y~~~~~~g~~~~~~~~evih~~~~~------~~~~~~G~s~i~~~ 188 (412) T protein:vir:26 123 YHQPSKLFLLNPDVVEMLIENQ--------SRELYYSIHAATGNKLIVHNMDMLHFKHIV------ASNMVQGISPIDVL 188 (412) T ss_pred CCcEEEEEEEcCceeEEEEeCC--------CcEEEEEEEcCCceEEEEccccEEEeCCCC------CCCCcccccHHHHH Confidence 2445678888888887543211 1233455542 358899999997542 23456799999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEecccCC-- Q lcl|NC_019527. 275 QPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVNTPLSG-- 351 (516) Q Consensus 275 ~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~~~lsg-- 351 (516) .+.+.....+.... +........+.......++. ++..+..+.|....+|.| +++++ ++.+|+.++.+... T Consensus 189 ~~~i~~~~a~~~~~--~~~~~~~~~~i~~~~~~l~~---e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~d~q 262 (412) T protein:vir:26 189 KNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNVGK---EKRQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSED 262 (412) T ss_pred HHHHHHHHHHHHHH--HHhcCCCCceEEecCCCCCH---HHHHHHHHHHHHHhhcCCCeeecC-CCceEEEcCCChhHHH Confidence 87777655554442 22222222111111222322 333333444444444555 45554 45788888765443 Q ss_pred HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc--CCcce Q lcl|NC_019527. 352 LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI--DDAIT 429 (516) Q Consensus 352 l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~--~~d~~ 429 (516) +-+......++||.+.+||-.+|.+. .++-.++.|...+.|+.. .|.|.++.|-+.|-+..+... ..+.. T Consensus 263 ~~e~~~~~~~~Ia~afgVPp~~lg~~-~~~~~sn~e~~~~~f~~~-------~l~P~~~~ie~~ln~kLl~~~~~~~~~~ 334 (412) T protein:vir:26 263 IVASENLTRERVANVFQLPSVFLNAR-SNTNFAKNEELNRFYLQH-------TLLPIVKQYEEEFNRKLLTKTDREKNRY 334 (412) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhCCC-CCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCCcccccCcce Confidence 34455567789999999998777543 233334455455555544 478998888887776554332 23444 Q ss_pred EE--eCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc------ccchhcCCC Q lcl|NC_019527. 430 FK--FKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP------EMFDDDGAD 501 (516) Q Consensus 430 ~~--f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~------e~~~~e~~~ 501 (516) |+ +..|...|.+++++ ++..++++|++|++|+|+.++. ++++...+..-. +...+.... T Consensus 335 ~~fd~~~l~~~d~~~~~~-------~~~~~~~~G~~t~NE~R~~~gl------~p~~ggD~~~~~~n~~~~~~~~~~~~~ 401 (412) T protein:vir:26 335 FKFNVKSYLRADSATQAE-------VYFKAVRSGYYTINDIREWEDL------PPVEGGDKPLISGDLYPIDTPLELRKS 401 (412) T ss_pred EEeechhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCC------CCCCCcCeeeecccccccccchhhccc Confidence 55 45787788887655 5566899999999999999844 444211111100 001111111 Q ss_pred CCCCCCCCCCC Q lcl|NC_019527. 502 PYMPDPDVLPG 512 (516) Q Consensus 502 ~~~~~~~~~~~ 512 (516) ..+++++...+ T Consensus 402 ~~gG~~n~~e~ 412 (412) T protein:vir:26 402 LKGGDKNVNES 412 (412) T ss_pred ccCCCCCcCCC Confidence 12222222222 No 58 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=99.87 E-value=3.5e-21 Score=132.75 Aligned_cols=463 Identities=13% Similarity=0.082 Sum_probs=214.3 Q ss_pred CCc-chhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCC--ccccccCCCCCCCccCCCccchh-ccccc Q lcl|NC_019527. 1 MWP-FDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASD--AATKWAPPQLMPGVVPAGTTPAV-AMDSL 76 (516) Q Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~gv~~~~~~~~~-a~ds~ 76 (516) |-. |++-...|-... ...+.+.++...+..+....-.. ....+.-- ..|..+.-..+.+ .||.. T Consensus 1 ~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~a~~~~~~~~~~~~ 68 (563) T protein:vir:95 1 MADLFKQFRLGKDYGN----------NSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKS--LYGQQQAYAEPFIEMMDTN 68 (563) T ss_pred Chhhhhhhhccccccc----------ccccceeeccCChhhhHhhhhccchhHHHHHhh--hccCCCcchhhhHhhhccc Confidence 221 211111111111 11111111111111111000000 00000000 0000000000100 23322 Q ss_pred ccchhhhcccccCCcccccccCccc---HHHHHHHHhCchhhhhhhhhhHHHhh-----------CCCeeeeccccc--h Q lcl|NC_019527. 77 CGPTYQFLNSAAGGLYAADIQPFPG---YQNLAALATRPEYRAFASTLSTELTR-----------EGIEITSKDRTK--A 140 (516) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~f~g---y~ll~~y~~~~i~r~iVd~~aed~~r-----------~~~~i~~~~~~~--~ 140 (516) .+ +.......-+. -.++++|..|+++++||++.++++.+ .++.|.....+. . T Consensus 69 ~~------------~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~ 136 (563) T protein:vir:95 69 PE------------FRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPG 136 (563) T ss_pred cc------------ccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcc Confidence 11 11110001112 25788999999999999999998774 234443322111 1 Q ss_pred hhhHHHHHHHHHHHHhcCh--------hHHHHHH-HHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecc Q lcl|NC_019527. 141 KEMASKIKELEEACEYYGV--------MGIIQKA-AEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEP 211 (516) Q Consensus 141 ~~~~~~i~~i~~~~~~l~~--------~~~l~ea-~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~ 211 (516) +.....+..++..+..++. +..|.+. +....++|.+++++.+.- -..|.+.+|.+++| T Consensus 137 ~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~r-------------d~~G~~~~L~pl~p 203 (563) T protein:vir:95 137 RKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNK-------------NNKTKLEKFIAVDP 203 (563) T ss_pred hhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEe-------------cCCCceEEEEEeCC Confidence 1112223444444433321 2234444 445578898888765421 12345678999999 Q ss_pred eeeccccccccccccccccC-cceeEE-eee---EeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 212 MWTSPSAYNALDPTAPDFYK-PSTWWV-LGR---EMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQ 286 (516) Q Consensus 212 ~~v~p~~~~~~dp~s~~yg~-P~~y~v-~g~---~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~ 286 (516) .+|++..... ..-|.. ..++++ .|. .+.++.+||+...+.++ .....+|+|.++.+.+.|.....+.. T Consensus 204 ~~V~v~~~~~----g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d---~~~~~~G~Spi~~a~~~i~~~~~~~~ 276 (563) T protein:vir:95 204 STIFYATDKK----GKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTE---LSSSGYGLSEVEIAMKEFIAYNNTES 276 (563) T ss_pred ceeEEEECCC----CceeccceeEEEEeCCceeEEecCcceEEEeccCCCC---cccCcccchHHHHHHHHHHHHHHHHH Confidence 9988643222 111222 223332 332 45677777665544332 12245799999999999999999999 Q ss_pred HHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceE-EEecCCcceeEEecccCC--HHHHHHHHHH Q lcl|NC_019527. 287 SVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLA-VMDFDSEDIVQVNTPLSG--LADLQSQSQE 361 (516) Q Consensus 287 ~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~-~id~~~e~~e~~~~~lsg--l~d~~~~~~~ 361 (516) ....++.+.... ++++.....++....+++.+.++.......|.|.+ ++-.++-+|+.++.+... +-+......+ T Consensus 277 ~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~ 356 (563) T protein:vir:95 277 FNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLIN 356 (563) T ss_pred HHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHH Confidence 999999986654 34444333343333334444444433344565543 433445788888766554 4466777889 Q ss_pred HHHhhhcCCceeeecccccc-ccccchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCC Q lcl|NC_019527. 362 HMCSVSKIPAIKLTGISPSG-LNASSEG---EIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQ 437 (516) Q Consensus 362 ~iaaas~IP~t~L~G~sp~G-lnatge~---D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~ 437 (516) .||.+.+||-.+| |....| ...+..+ ...+.-..-..+-+..|.|.+..+-..|-+..+-.....+.|+|. . T Consensus 357 ~Ia~afgVPp~~l-G~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~f~---r 432 (563) T protein:vir:95 357 IISALYGIDPAEI-GFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYGDKYTFQFV---G 432 (563) T ss_pred HHHHHhCCCHHHc-cccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcccccEEEec---c Confidence 9999999998666 654322 2111111 111111122233345578888877776655443222335666663 3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccC----CCCCCh--hh--hccc-cccchhc---------- Q lcl|NC_019527. 438 TSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSG----WDNIDG--DL--EIVQ-PEMFDDD---------- 498 (516) Q Consensus 438 ~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~----~~~~d~--~~--e~~~-~e~~~~e---------- 498 (516) .+.+.+++... ...++.+|++|++|+|+.++..+.-+ +.+... .. .... .+....+ T Consensus 433 ~D~~~~~e~~~-----~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (563) T protein:vir:95 433 GDTKSATDKLN-----ILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLL 507 (563) T ss_pred CCHHHHHHHHH-----HHHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhccccc Confidence 45555544322 23467899999999999986543221 111000 00 0000 0000000 Q ss_pred C--CCCCCCCCCCCCCCCCC Q lcl|NC_019527. 499 G--ADPYMPDPDVLPGEEGS 516 (516) Q Consensus 499 ~--~~~~~~~~~~~~~~e~t 516 (516) + .+.+..+++..+..+.+ T Consensus 508 ~~~~~~~~~~~~~~~~~~~~ 527 (563) T protein:vir:95 508 EGDNDDSEEGQSTDSSNDDK 527 (563) T ss_pred CCCCCCCCCCCCCCCCCCcc Confidence 0 00000011111111111 No 59 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=99.87 E-value=3.5e-21 Score=132.75 Aligned_cols=463 Identities=13% Similarity=0.082 Sum_probs=214.3 Q ss_pred CCc-chhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCC--ccccccCCCCCCCccCCCccchh-ccccc Q lcl|NC_019527. 1 MWP-FDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASD--AATKWAPPQLMPGVVPAGTTPAV-AMDSL 76 (516) Q Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~gv~~~~~~~~~-a~ds~ 76 (516) |-. |++-...|-... ...+.+.++...+..+....-.. ....+.-- ..|..+.-..+.+ .||.. T Consensus 1 ~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~a~~~~~~~~~~~~ 68 (563) T protein:vir:99 1 MADLFKQFRLGKDYGN----------NSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKS--LYGQQQAYAEPFIEMMDTN 68 (563) T ss_pred Chhhhhhhhccccccc----------ccccceeeccCChhhhHhhhhccchhHHHHHhh--hccCCCcchhhhHhhhccc Confidence 221 211111111111 11111111111111111000000 00000000 0000000000100 23322 Q ss_pred ccchhhhcccccCCcccccccCccc---HHHHHHHHhCchhhhhhhhhhHHHhh-----------CCCeeeeccccc--h Q lcl|NC_019527. 77 CGPTYQFLNSAAGGLYAADIQPFPG---YQNLAALATRPEYRAFASTLSTELTR-----------EGIEITSKDRTK--A 140 (516) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~f~g---y~ll~~y~~~~i~r~iVd~~aed~~r-----------~~~~i~~~~~~~--~ 140 (516) .+ +.......-+. -.++++|..|+++++||++.++++.+ .++.|.....+. . T Consensus 69 ~~------------~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~ 136 (563) T protein:vir:99 69 PE------------FRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPG 136 (563) T ss_pred cc------------ccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcc Confidence 11 11110001112 25788999999999999999998774 234443322111 1 Q ss_pred hhhHHHHHHHHHHHHhcCh--------hHHHHHH-HHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecc Q lcl|NC_019527. 141 KEMASKIKELEEACEYYGV--------MGIIQKA-AEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEP 211 (516) Q Consensus 141 ~~~~~~i~~i~~~~~~l~~--------~~~l~ea-~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~ 211 (516) +.....+..++..+..++. +..|.+. +....++|.+++++.+.- -..|.+.+|.+++| T Consensus 137 ~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~r-------------d~~G~~~~L~pl~p 203 (563) T protein:vir:99 137 RKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNK-------------NNKTKLEKFIAVDP 203 (563) T ss_pred hhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEe-------------cCCCceEEEEEeCC Confidence 1112223444444433321 2234444 445578898888765421 12345678999999 Q ss_pred eeeccccccccccccccccC-cceeEE-eee---EeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 212 MWTSPSAYNALDPTAPDFYK-PSTWWV-LGR---EMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQ 286 (516) Q Consensus 212 ~~v~p~~~~~~dp~s~~yg~-P~~y~v-~g~---~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~ 286 (516) .+|++..... ..-|.. ..++++ .|. .+.++.+||+...+.++ .....+|+|.++.+.+.|.....+.. T Consensus 204 ~~V~v~~~~~----g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d---~~~~~~G~Spi~~a~~~i~~~~~~~~ 276 (563) T protein:vir:99 204 STIFYATDKK----GKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTE---LSSSGYGLSEVEIAMKEFIAYNNTES 276 (563) T ss_pred ceeEEEECCC----CceeccceeEEEEeCCceeEEecCcceEEEeccCCCC---cccCcccchHHHHHHHHHHHHHHHHH Confidence 9988643222 111222 223332 332 45677777665544332 12245799999999999999999999 Q ss_pred HHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceE-EEecCCcceeEEecccCC--HHHHHHHHHH Q lcl|NC_019527. 287 SVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLA-VMDFDSEDIVQVNTPLSG--LADLQSQSQE 361 (516) Q Consensus 287 ~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~-~id~~~e~~e~~~~~lsg--l~d~~~~~~~ 361 (516) ....++.+.... ++++.....++....+++.+.++.......|.|.+ ++-.++-+|+.++.+... +-+......+ T Consensus 277 ~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~ 356 (563) T protein:vir:99 277 FNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLIN 356 (563) T ss_pred HHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHH Confidence 999999986654 34444333343333334444444433344565543 433445788888766554 4466777889 Q ss_pred HHHhhhcCCceeeecccccc-ccccchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCC Q lcl|NC_019527. 362 HMCSVSKIPAIKLTGISPSG-LNASSEG---EIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQ 437 (516) Q Consensus 362 ~iaaas~IP~t~L~G~sp~G-lnatge~---D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~ 437 (516) .||.+.+||-.+| |....| ...+..+ ...+.-..-..+-+..|.|.+..+-..|-+..+-.....+.|+|. . T Consensus 357 ~Ia~afgVPp~~l-G~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~f~---r 432 (563) T protein:vir:99 357 IISALYGIDPAEI-GFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYGDKYTFQFV---G 432 (563) T ss_pred HHHHHhCCCHHHc-cccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcccccEEEec---c Confidence 9999999998666 654322 2111111 111111122233345578888877776655443222335666663 3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccC----CCCCCh--hh--hccc-cccchhc---------- Q lcl|NC_019527. 438 TSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSG----WDNIDG--DL--EIVQ-PEMFDDD---------- 498 (516) Q Consensus 438 ~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~----~~~~d~--~~--e~~~-~e~~~~e---------- 498 (516) .+.+.+++... ...++.+|++|++|+|+.++..+.-+ +.+... .. .... .+....+ T Consensus 433 ~D~~~~~e~~~-----~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (563) T protein:vir:99 433 GDTKSATDKLN-----ILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLL 507 (563) T ss_pred CCHHHHHHHHH-----HHHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhccccc Confidence 45555544322 23467899999999999986543221 111000 00 0000 0000000 Q ss_pred C--CCCCCCCCCCCCCCCCC Q lcl|NC_019527. 499 G--ADPYMPDPDVLPGEEGS 516 (516) Q Consensus 499 ~--~~~~~~~~~~~~~~e~t 516 (516) + .+.+..+++..+..+.+ T Consensus 508 ~~~~~~~~~~~~~~~~~~~~ 527 (563) T protein:vir:99 508 EGDNDDSEEGQSTDSSNDDK 527 (563) T ss_pred CCCCCCCCCCCCCCCCCCcc Confidence 0 00000011111111111 No 60 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=99.87 E-value=3.2e-22 Score=138.41 Aligned_cols=384 Identities=11% Similarity=0.033 Sum_probs=207.5 Q ss_pred hhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhC Q lcl|NC_019527. 32 LAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATR 111 (516) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~ 111 (516) |.+. .... ...+.....|.. +.....++ ........ ..+.++ T Consensus 1 M~~f------------------~~~~-------~~~~~~~~~~~~------~~~~~~~~----~~~~~v~~---~~al~~ 42 (397) T protein:vir:38 1 MPLL------------------KLNK-------SHSQGFSLNDPD------WVNFLTGG----EAQKYVSA---DTALKN 42 (397) T ss_pred Ccch------------------hhhh-------cccCcccCCchh------hhhhhcCC----cCCceech---HHhhcc Confidence 1111 0000 000000000000 00000000 00001111 223468 Q ss_pred chhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccC Q lcl|NC_019527. 112 PEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVP 190 (516) Q Consensus 112 ~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~P 190 (516) +.+.++|+.+|+++.+..+... + .....|........-+..|.+++.+. .++|.|++++.-++ T Consensus 43 ~~V~~~v~~ia~~ia~~p~~~~--~--------~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~------ 106 (397) T protein:vir:38 43 SDIFSLIMQLSGDLAMVRYTSE--S--------DRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNT------ 106 (397) T ss_pred HHHHHHHHHHHHHHhhCccccc--c--------cHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECC------ Confidence 8899999999999987666432 1 11344555555555666777776666 55788888875432 Q ss_pred cccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe--------eeEeccceEEEecCCcchhhhhhc Q lcl|NC_019527. 191 LILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL--------GREMHASRLLTIITRPLPDMLKPA 262 (516) Q Consensus 191 l~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~--------g~~iH~SRli~~~~~~~p~~~k~~ 262 (516) .|.+.+|.+++|..|++..... |....|++. .+.+.++.||||.... +. T Consensus 107 ---------~g~~~~l~~l~~~~v~i~~~~~--------~~~~~y~~~~~~~~~~~~~~~~~~eiih~~~~~------~~ 163 (397) T protein:vir:38 107 ---------NGVDLSWEYLRPSQVQPMLLQD--------GSGLIYNINFDEPAIGYMENVPAADVIHIRLLS------KN 163 (397) T ss_pred ---------CCcEEEEEEEcCceeEEEEcCC--------CceEEEEEEeccccccceeEecCccEEEecCCC------CC Confidence 2345678888888877533211 122334442 1468899999997643 22 Q ss_pred cCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcce Q lcl|NC_019527. 263 YNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDI 342 (516) Q Consensus 263 ~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~ 342 (516) ...+|.|.++.+...|.....+......++.+....-........+..+..+++.++++......++.+.++++. +.+| T Consensus 164 ~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~-g~~~ 242 (397) T protein:vir:38 164 GGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVVIDA-LEDY 242 (397) T ss_pred CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecCC-CceE Confidence 235799999999999999999999999988887654333333334444444556666655544444444566654 5788 Q ss_pred eEEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 343 VQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSK 420 (516) Q Consensus 343 e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~ 420 (516) +.++.+... +.+..+...++||++.+||..+|-|.. +-+++.+ +...|| +..|.|.+..+...|-+.. T Consensus 243 ~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~--~~~~~~e-~~~~~~-------~~~l~P~~~~ie~~ln~~l 312 (397) T protein:vir:38 243 KPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQG--DQQSSIT-QISGQY-------AKSLNRYVQAIVGELNDKL 312 (397) T ss_pred EecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC--CcccHHH-HHHHHH-------HHHHHHHHHHHHHHHHHhc Confidence 888766543 446678888999999999998775532 1122222 334444 2357888888877776654 Q ss_pred CCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh--hccccccchhc Q lcl|NC_019527. 421 WGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL--EIVQPEMFDDD 498 (516) Q Consensus 421 ~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~--e~~~~e~~~~e 498 (516) +. ++.+.+..+...+.++ +++++.+++++|++|++|+|+.++..+..+-+-...+. ......... + T Consensus 313 ~~----~~~~~~~~~~~~d~~~-------~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~~~~~~~~~~-~ 380 (397) T protein:vir:38 313 HA----NISANIRFAIDAMGDQ-------YASTISSSVKGGTIAGNQARFILQNSGYLAKDLPDPEKEPQQAIQLIQQ-E 380 (397) T ss_pred cC----hhcccccccccCCHHH-------HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccccccccccccccccc-c Confidence 43 2333344455555544 45567779999999999999998654321111000000 000011111 1 Q ss_pred CCCCCCCCCCCCCCCCC Q lcl|NC_019527. 499 GADPYMPDPDVLPGEEG 515 (516) Q Consensus 499 ~~~~~~~~~~~~~~~e~ 515 (516) +.+..+.+.++..+... T Consensus 381 ~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 381 GGENDGNNSDERGSDPE 397 (397) T ss_pred cCCCCCCCCCCCCCCCC Confidence 11111111111111111 No 61 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=99.86 E-value=5.6e-21 Score=131.60 Aligned_cols=463 Identities=12% Similarity=0.084 Sum_probs=214.2 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchh-cccccccc Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAV-AMDSLCGP 79 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~-a~ds~~~~ 79 (516) |---.--.|.|..-. .++.+..-+.++.-.+-..+....+.+ +.....+ +++-+... T Consensus 1 ~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~a~~~ 58 (576) T protein:vir:96 1 MVTRLADIFKRLRLG-------RDYEDIIDTVPIDDGLQANIRNIEEKS---------------KELNKSLYGKQQAYAE 58 (576) T ss_pred ChhhHHHHHHHHhcc-------CccccchhhhhcccChhHHHHHhhhhh---------------hhhccccCCccchhhc Confidence 221111111110000 011111112111111111110000000 0000000 11111111 Q ss_pred hhhhcccccCC--cccccc-cCc--ccHHHHHHHHhCchhhhhhhhhhHHHhh-----------CCCeeeeccccch--h Q lcl|NC_019527. 80 TYQFLNSAAGG--LYAADI-QPF--PGYQNLAALATRPEYRAFASTLSTELTR-----------EGIEITSKDRTKA--K 141 (516) Q Consensus 80 ~~~~~~~~~~~--~~~~~~-~~f--~gy~ll~~y~~~~i~r~iVd~~aed~~r-----------~~~~i~~~~~~~~--~ 141 (516) ++. ...+++ +....+ -.+ ..-.++..|..++++++||+++++.+.+ -++.|...+.... + T Consensus 59 p~~--~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~ 136 (576) T protein:vir:96 59 PFL--EVMDTNPEFRTKRSYMKNSDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGK 136 (576) T ss_pred cee--eeeecCCCccccCcchhhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccch Confidence 100 001111 000000 001 1125678899999999999999998765 3445543322211 1 Q ss_pred hhHHHHHHHHHHHHhcCh--------hHHHHHHHHh-cccceeeEEEEEecCCCcccCcccccccccccceeeEEeecce Q lcl|NC_019527. 142 EMASKIKELEEACEYYGV--------MGIIQKAAEH-DCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPM 212 (516) Q Consensus 142 ~~~~~i~~i~~~~~~l~~--------~~~l~ea~~~-~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~ 212 (516) .....+..+...+..+.. +..|.+.+.+ ..+||.+++++..+- + ..|.+.+|.+++|. T Consensus 137 ~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~r-d------------~~g~~~~L~pl~p~ 203 (576) T protein:vir:96 137 KEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNK-K------------NATTMDKFIAVDPS 203 (576) T ss_pred hhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEEec-C------------CCCceEEEEEeCCc Confidence 112223444444443322 2345555554 577899998875421 1 12446678899999 Q ss_pred eeccccccccccccccccCccee-EEe-e---eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 213 WTSPSAYNALDPTAPDFYKPSTW-WVL-G---REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQS 287 (516) Q Consensus 213 ~v~p~~~~~~dp~s~~yg~P~~y-~v~-g---~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~ 287 (516) +|.+.... ....|..+..| ++. + ..+.++.+||+...+.++. ....+|+|.++.+.+.|.....+... T Consensus 204 ~V~v~~~~----dg~~~~~~~~~~~~~~~~~~~~~~~~dii~~~~~~~~d~---~~~~~G~Spi~~a~~~i~~~~~~~~~ 276 (576) T protein:vir:96 204 TIFYATDK----NGKIIKGGKRFVQVINKKVVASFTSREMAMGIRNPRTEL---SSSGYGLSEVEIAMKQFIAYNNTETF 276 (576) T ss_pred eeEEEECC----CCceeeeeeEEEEecCCceEEEecccceEEEeecCCCCc---ccCcccccHHHHHHHHHHHHHHHHHH Confidence 88864322 12223333333 222 2 3567788887766554432 12457999999999999999999999 Q ss_pred HHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcce-EEEecCCcceeEEecccCC--HHHHHHHHHHH Q lcl|NC_019527. 288 VSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGL-AVMDFDSEDIVQVNTPLSG--LADLQSQSQEH 362 (516) Q Consensus 288 ~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~-~~id~~~e~~e~~~~~lsg--l~d~~~~~~~~ 362 (516) ...++.+.... +++++....++.+.-+++.+.++.......|.+. .++..++.+|+.++.+... +.+......+. T Consensus 277 ~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d~qfle~~~~~~~~ 356 (576) T protein:vir:96 277 NDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNMTPTANDMQFEKWLTYLINI 356 (576) T ss_pred HHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEeccCChhhHHHHHHHHHhHHH Confidence 99999987653 4554433334333334455544444334445554 3443445788888765443 45667788899 Q ss_pred HHhhhcCCceeeeccccccc----cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCC Q lcl|NC_019527. 363 MCSVSKIPAIKLTGISPSGL----NASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQT 438 (516) Q Consensus 363 iaaas~IP~t~L~G~sp~Gl----natge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~ 438 (516) ||.+.+||..+| |....+- +.++...-.|.-..-.++-+..|.|.+..|-..|-+..+-....++.|+|.. . T Consensus 357 Ia~afgVPp~~l-G~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~f~r---~ 432 (576) T protein:vir:96 357 ISALYGIDPAEI-GFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHIISEYSDKYVFQFVG---G 432 (576) T ss_pred HHHHhCCCHHHc-cccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhccCceEEEecc---C Confidence 999999998766 6543221 1111111111112222233345778887777666554432333456777643 3 Q ss_pred CHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccC----CCCC-----Chhh--hccccccchh---------c Q lcl|NC_019527. 439 SAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSG----WDNI-----DGDL--EIVQPEMFDD---------D 498 (516) Q Consensus 439 sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~----~~~~-----d~~~--e~~~~e~~~~---------e 498 (516) +.+.+++.. +....+.+|++|++|+|+.+...+.-+ +.++ .... ...+.+..++ . T Consensus 433 d~~~~~e~~-----~~~~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~ 507 (576) T protein:vir:96 433 DTKSELDKI-----KILQEEVKTYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLN 507 (576) T ss_pred CHHHHHHHH-----HHHHHHhcCccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCCCCccccccccccccccC Confidence 444444322 122345579999999999986543211 1110 0000 0000000000 0 Q ss_pred CCCCCCCCC----CCCCCCCCC Q lcl|NC_019527. 499 GADPYMPDP----DVLPGEEGS 516 (516) Q Consensus 499 ~~~~~~~~~----~~~~~~e~t 516 (516) ..++..+.+ +...+.+++ T Consensus 508 ~~~~~~~~~~s~~~~~~g~~~~ 529 (576) T protein:vir:96 508 SPDDEEPQQESTEDKVDGRESN 529 (576) T ss_pred CCCCCCCCCCCCCCcccccccc Confidence 000000000 000111111 No 62 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=99.86 E-value=2e-21 Score=134.12 Aligned_cols=398 Identities=12% Similarity=0.064 Sum_probs=216.8 Q ss_pred cccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCe Q lcl|NC_019527. 52 KWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIE 131 (516) Q Consensus 52 ~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~ 131 (516) .|-+.. . .++...+...+....... ...... .. +.... ......+++-++++|+.+|+++-+..+. T Consensus 1 m~~~~~-----~-~~~~~~~s~~~~w~~~~~---~~~~~~-~~---~g~~v-t~~~al~~~~v~~~i~~Ia~~iA~lp~~ 66 (421) T protein:vir:10 1 MFIPQM-----F-EGKKRSVSGGGFWEAMLG---GVRSSH-SK---AGVMI-TPETALALSAVRACVTLLAESVAQLPVE 66 (421) T ss_pred CCCcch-----h-cccccccCcchhhHHHhh---hhccCc-cc---CCcee-chHHhhccHHHHHHHHHHHHhhccCceE Confidence 222221 1 111111211111000000 000000 00 00000 1123456778999999999999999988 Q ss_pred eeeccccchh---hhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccceeeEE Q lcl|NC_019527. 132 ITSKDRTKAK---EMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFS 207 (516) Q Consensus 132 i~~~~~~~~~---~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~ 207 (516) +.-.+.+... ........|........-...|.+.+... .++|.|++++.-++ .|.+.+|. T Consensus 67 ~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~---------------~G~~~~L~ 131 (421) T protein:vir:10 67 LYRRDKNGGRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDG---------------KGYPKELI 131 (421) T ss_pred EEEEcCCCceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC---------------CCcEEEEE Confidence 7433222211 11122333444444455566666665544 56788888875432 24456788 Q ss_pred eecceeeccccccccccccccccCcceeEE--eeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHH Q lcl|NC_019527. 208 NIEPMWTSPSAYNALDPTAPDFYKPSTWWV--LGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTR 285 (516) Q Consensus 208 v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v--~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~ 285 (516) ++.|.+|++.. |+ .|.+ +|++ .|+.++.+.|+|+.+.. ...+.|.|.++.+.+.+.....+. T Consensus 132 ~l~~~~v~v~~----~~----~g~~-~y~~~~~g~~~~~~eiih~~~~~-------~d~~~G~spi~~~~~~i~~~~~~~ 195 (421) T protein:vir:10 132 PINPKKVIVLK----GP----DGMP-YYEIPEIGETLPMRMMHHVKVFS-------LDGYIGSSPIQTNADVLGLNLAVE 195 (421) T ss_pred EecCceEEEEE----CC----CceE-EEEEcCCCcEEchhhEEEecCcC-------CCCcccccHHHHHHHHHHHHHHHH Confidence 99998888632 11 1222 3444 46789999999997653 234679999999999999999999 Q ss_pred HHHHHHHHHhCCc--eeeecc--hhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEecccCC--HHHHHHH Q lcl|NC_019527. 286 QSVSDLVDKFSRT--FLKTNM--AQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVNTPLSG--LADLQSQ 358 (516) Q Consensus 286 ~~~~~Ll~~~~~~--v~k~~~--~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~~~lsg--l~d~~~~ 358 (516) .....++.+.... +++++. ...++.+.-+++.++++.......|.+ +.+++. +-+|++++.+... +.+.... T Consensus 196 ~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~-g~~~~~l~~~~~d~q~~e~~~~ 274 (421) T protein:vir:10 196 EHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQE-GMSYKQMSQDNEKAQLLQSRQW 274 (421) T ss_pred HHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccccCcceecCC-CceEEecCCChhHHHHHHHHHH Confidence 9999998885543 455532 222222222334444443333334444 556654 5789988866543 3445667 Q ss_pred HHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-CCc--ceEEeCCC Q lcl|NC_019527. 359 SQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI-DDA--ITFKFKSL 435 (516) Q Consensus 359 ~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~-~~d--~~~~f~pL 435 (516) ..+.||.+.+||..+| |...++..++.|.....||. ..|.|.+..+-..|-+..+-.- ..+ ++|....| T Consensus 275 ~~~~Ia~~fgVPp~~l-g~~~~~t~sn~e~~~~~f~~-------~tl~P~~~~ie~~ln~kL~~~~~~~~~~v~fd~~~l 346 (421) T protein:vir:10 275 GVEEVCRLYKIPPHMV-QMLAKATNNNIEHQGLQFVM-------YTLLAWLKRHEGALQRDLLLPSERRDLYIEFNVSGL 346 (421) T ss_pred hHHHHHHHhCCCHHHc-CCCcCCccccHHHHHHHHHH-------HHHHHHHHHHHHHHhhhccCccccCCeEEEEechhh Confidence 8889999999997655 55544545555555555543 3577888888777766543221 124 45555578 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccc-------cccchhcCCCCCCCCCC Q lcl|NC_019527. 436 WQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQ-------PEMFDDDGADPYMPDPD 508 (516) Q Consensus 436 ~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~-------~e~~~~e~~~~~~~~~~ 508 (516) ...|.+++++ ++.+++++|++|++|+|+.++..+ ++...+... ......++.+....+.+ T Consensus 347 ~~~d~~~~~~-------~~~~~~~~G~~T~NE~R~~~gl~p------~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~e 413 (421) T protein:vir:10 347 LRGDQKSRYE-------SYALGRQWGWLSVNDIRRMENLPP------IAGGDKYLTPLNMVDSAQIIPGDKKPTAQQMAE 413 (421) T ss_pred hccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCCCC------CCCcceeeeccccccccccccCCCCcccccCcc Confidence 8888888765 556689999999999999985543 321111100 00111111100000001 Q ss_pred CCCCCCCC Q lcl|NC_019527. 509 VLPGEEGS 516 (516) Q Consensus 509 ~~~~~e~t 516 (516) ...=...| T Consensus 414 ~d~~~~~~ 421 (421) T protein:vir:10 414 IDTILSRT 421 (421) T ss_pred cccccccC Confidence 11111122 No 63 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=99.86 E-value=1.8e-21 Score=134.32 Aligned_cols=410 Identities=11% Similarity=0.043 Sum_probs=210.1 Q ss_pred hhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhC Q lcl|NC_019527. 32 LAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATR 111 (516) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~ 111 (516) |- +.....+++.. .--+.|..-|.+.+.........+++.+......... .+.. ..+...+ ......++ T Consensus 1 Mg----l~d~~r~~~~~--~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~-~~~~---~~~g~~v-~~~~al~~ 69 (431) T protein:vir:10 1 MG----LFDFIRREKQP--EAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEY-IRRG---ELNGGTG-RETRALRN 69 (431) T ss_pred Cc----chhhhhcCccc--ccccccccccccccccccccccccccccccchHHHHh-hccC---ccCccee-chhhhhcc Confidence 11 11111111100 0000011111110000000000111100000000000 0000 0000011 11233467 Q ss_pred chhhhhhhhhhHHHhhCCCeeeeccccchh-hhHHHHHHHHHHHHhcChhHHHHHHHHh-cccceeeEEEEEecCCCccc Q lcl|NC_019527. 112 PEYRAFASTLSTELTREGIEITSKDRTKAK-EMASKIKELEEACEYYGVMGIIQKAAEH-DCFFGRGQISINIKGADVSV 189 (516) Q Consensus 112 ~i~r~iVd~~aed~~r~~~~i~~~~~~~~~-~~~~~i~~i~~~~~~l~~~~~l~ea~~~-~rlyG~a~i~i~i~~~~~~~ 189 (516) +.+.++|+.+++++-+-.+.+.-.++.... ........|...-...--+..|.+.+.. -.++|.+++++..+++ T Consensus 70 ~~V~~ci~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g---- 145 (431) T protein:vir:10 70 MAVLRCVTLISGTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSGN---- 145 (431) T ss_pred HHHHHHHHHHHHhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC---- Confidence 889999999999999998887443322211 1112223344444445556666666554 4667889988754322 Q ss_pred CcccccccccccceeeEEeecceeeccccccccccccccccCcceeEE---ee--eEeccceEEEecCCcchhhhhhccC Q lcl|NC_019527. 190 PLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV---LG--REMHASRLLTIITRPLPDMLKPAYN 264 (516) Q Consensus 190 Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v---~g--~~iH~SRli~~~~~~~p~~~k~~~~ 264 (516) .+..|.+++|.+|++..... +.+ .|.+ .| ..+..+.||||.+..+ .. T Consensus 146 ------------~~~~L~pl~~~~v~~~~~~~--------~~~-~y~~~~~~g~~~~~~~~dViHir~~~~-------dg 197 (431) T protein:vir:10 146 ------------RPIRLIPMDRGSAKGRLTST--------WQI-VYDYTTPTGDKIELPAREVFHLRDLSI-------DG 197 (431) T ss_pred ------------ceEEEEEEcCceeEEEEcCC--------CeE-EEEEEeCCceEEEEchhhEEEecCcCC-------CC Confidence 23457788888877532111 122 2322 23 3588899999975321 34 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcce-EEEecCCcc Q lcl|NC_019527. 265 FSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGL-AVMDFDSED 341 (516) Q Consensus 265 ~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~-~~id~~~e~ 341 (516) +.|.|.++.+.+.|.....+......++.+.... +++++ ..++.+.-+++.++++......+|.|. .+++ ++-+ T Consensus 198 ~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--~~ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~-~g~~ 274 (431) T protein:vir:10 198 VSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVP--KELSDNAYGRMKASVQENHTGSENAGSWMLLE-EGAT 274 (431) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecC--CCCCHHHHHHHHHHHHHHhcCccccCCceecC-CCce Confidence 6799999999999999999999999999987664 44443 345544444455555444444456554 4554 4578 Q ss_pred eeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 342 IVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLS 419 (516) Q Consensus 342 ~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s 419 (516) |++++.+... +-+...+...+||.+.+||..+|-+ .-++..++.|.....|+ +..|.|.+..|-..|-+. T Consensus 275 ~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~-~~~~t~sn~eq~~~~f~-------~~tL~P~~~~ie~~ln~~ 346 (431) T protein:vir:10 275 AKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMM-DDTSWGSGIEQLAIFFI-------QYGLSHWFVSWEQAAARA 346 (431) T ss_pred EEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCC-CCCCccccHHHHHHHHH-------HHHHHHHHHHHHHHHHhh Confidence 8888765443 3345566678999999999987744 32333333343344443 345788888887777665 Q ss_pred hCCCc-CCc--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcC----CCCHHHHHHHHHhhhccCCCCCCh--hhhcc Q lcl|NC_019527. 420 KWGEI-DDA--ITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNS----VIDPSEARQQLSDDPDSGWDNIDG--DLEIV 490 (516) Q Consensus 420 ~~g~~-~~d--~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g----vi~~~e~r~~l~~~~~~~~~~~d~--~~e~~ 490 (516) .+... ..+ ++|.+..|...|.+++++.. .+++++| ++|++|+|+.+ ++++++. ..+.. T Consensus 347 Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~-------~~~~~~G~~~g~lT~NE~R~~~------gl~p~~~~~gD~~~ 413 (431) T protein:vir:10 347 FLPEKMLGQRQFKFNEGALLRGTLNDQAAFF-------SKALGAGGQSPWMKQNEVREML------DLPRADDPVADQLR 413 (431) T ss_pred ccChhhcCCceEEEechhhhccCHHHHHHHH-------HHHHhcccccCccCHHHHHHHh------CCCCCCCcccccee Confidence 54211 123 44555577777877776644 4456554 59999999998 4455532 11111 Q ss_pred ccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 491 QPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 491 ~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) .+-.. .+. ...++|.. +| T Consensus 414 ~p~n~------~~~-~~~~~~p~-~~ 431 (431) T protein:vir:10 414 NPMTQ------KQK-GSGDEPPA-TT 431 (431) T ss_pred ccccc------ccC-CCCCCCCC-CC Confidence 11100 000 11111211 12 No 64 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=99.86 E-value=1.8e-21 Score=134.35 Aligned_cols=402 Identities=10% Similarity=0.064 Sum_probs=212.6 Q ss_pred hhhHHHHhHHhhcCCCc-cccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhC Q lcl|NC_019527. 33 AMRRAVMKSMERRASDA-ATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATR 111 (516) Q Consensus 33 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~ 111 (516) +-.+.....+ ..+.. ...+..- ..+.+..-..++...+. +.+....+-.. =....+.++ T Consensus 1 ~~~~~~~~~~--~~~~g~~~~~~~~-------f~~~~~~~~~~~~~~~~----------~~~~~~~~~~~-v~~~~al~~ 60 (424) T protein:vir:18 1 MEEPKYTIDL--RTNNGWWARLKSW-------FVGGRLVTPNQGSQTGP----------VSAHGYLGDSS-INDERILQI 60 (424) T ss_pred CCCCcccccc--CCCCchHHHHHhh-------ccccccccccchhhccc----------ccccccccccc-ccHHHhhcc Confidence 0000000000 00000 0000000 00000000000000000 00000000000 011345667 Q ss_pred chhhhhhhhhhHHHhhCCCeeeeccccchhh----hHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCC Q lcl|NC_019527. 112 PEYRAFASTLSTELTREGIEITSKDRTKAKE----MASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGAD 186 (516) Q Consensus 112 ~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~----~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~ 186 (516) +.+.+||+.+++++-+..+.+--...++... .......|...-....-...|.+.+.+. .++|.+++++.-+ T Consensus 61 ~~v~~cv~~Ia~~iA~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~--- 137 (424) T protein:vir:18 61 STVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN--- 137 (424) T ss_pred HHHHHHHHHHHHhhccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEC--- Confidence 7889999999999999888874322222111 1112233444444444555666665554 6678888887432 Q ss_pred cccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEE--ee--eEeccceEEEecCCcchhhhhhc Q lcl|NC_019527. 187 VSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV--LG--REMHASRLLTIITRPLPDMLKPA 262 (516) Q Consensus 187 ~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v--~g--~~iH~SRli~~~~~~~p~~~k~~ 262 (516) ..|.+.+|.+++|.+|++... .+ +-.|++ .| ..++++.|||+.+.. . T Consensus 138 ------------~~G~~~~L~~l~~~~v~v~~~--~~--------~~~y~~~~~g~~~~~~~~eVihir~~~-------~ 188 (424) T protein:vir:18 138 ------------SAGDVISLLPLQSANMDVKLV--GK--------KVVYRYQRDSEYADFSQKEIFHLKGFG-------F 188 (424) T ss_pred ------------CCCcEEEEEEecCcceEEEEc--CC--------eEEEEEEeCCeEEEeccccEEEecCcC-------C Confidence 124456788999988875321 12 223333 33 478999999997532 1 Q ss_pred cCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCc Q lcl|NC_019527. 263 YNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSE 340 (516) Q Consensus 263 ~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e 340 (516) ....|.|.++.+.+.|.....+......++.+.... +++++. ..++....+.+.++++......+..++++++. +- T Consensus 189 dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~~l~~e~~~~~~~~~~~~~~~~nag~~~vl~~-g~ 266 (424) T protein:vir:18 189 TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE-KVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-GF 266 (424) T ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-cCCCHHHHHHHHHHHHHHhCCcccCCceeccC-Cc Confidence 346799999999999999999999999999887654 444431 22333333445555554433333334566654 57 Q ss_pred ceeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeecccccccc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 341 DIVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVI 416 (516) Q Consensus 341 ~~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l 416 (516) +|+.++.+..+ +-+...+..+.||.+.+||-.+| |...++-. +.-+.....|+ +..|.|.+..+...| T Consensus 267 ~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~sn~eq~~~~f~-------~~tl~P~~~~ie~~l 338 (424) T protein:vir:18 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLV-GDVEKSTSWGSGIEQQNLGFL-------QYTLQPYISRWENSI 338 (424) T ss_pred eEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHh-CCCCCcccccccHHHHHHHHH-------HHHHHHHHHHHHHHH Confidence 88888766543 34566777789999999997655 65543322 22233333343 346889999988887 Q ss_pred HHHhCCCcC-Cc--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc- Q lcl|NC_019527. 417 QLSKWGEID-DA--ITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP- 492 (516) Q Consensus 417 ~~s~~g~~~-~d--~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~- 492 (516) -+..+.... .+ ++|.+..|...|.+++++ ++.+++++|++|++|+|+.+. +++++...+...+ T Consensus 339 n~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~-------~~~~~~~~G~~T~NE~R~~~g------l~pi~ggD~~~~~~ 405 (424) T protein:vir:18 339 QRWLIPSKDVGRLHAEHNLDGLLRGDSASRAA-------FMKAMGESGLRTINEMRRTDN------MPPLPGGDVAMRQA 405 (424) T ss_pred HhhcCCccccCCeEEEEechhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhC------CCCCCCcCeeeecc Confidence 765543221 23 445556788888888755 455689999999999999884 4444221111100 Q ss_pred c-cc-hhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 493 E-MF-DDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 493 e-~~-~~e~~~~~~~~~~~~~~~e~t 516 (516) . .+ +..+. ..+|..++. T Consensus 406 n~~~l~~~~~-------~~~~~~n~a 424 (424) T protein:vir:18 406 QYVPITDLGT-------NKEPRNNGA 424 (424) T ss_pred Cccchhhhhc-------cCCccccCC Confidence 0 00 00000 111112222 No 65 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=99.86 E-value=1.9e-21 Score=134.20 Aligned_cols=411 Identities=13% Similarity=0.062 Sum_probs=206.0 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) .|||++|. .+ ...+.+...++. . T Consensus 77 ~~pfkkk~-------------------~~-------~~~d~f~~s~es----------------------~--------- 99 (945) T protein:vir:10 77 IVPYNHQE-------------------PP-------FKFNLFEYSPES----------------------L--------- 99 (945) T ss_pred cccccccc-------------------cc-------hhhhhhhccCcc----------------------c--------- Confidence 33332221 00 000111111100 0 Q ss_pred hhhcccccCCcccccccCcccHHH-HHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhh----hHHHHHHHHHHHH Q lcl|NC_019527. 81 YQFLNSAAGGLYAADIQPFPGYQN-LAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKE----MASKIKELEEACE 155 (516) Q Consensus 81 ~~~~~~~~~~~~~~~~~~f~gy~l-l~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~----~~~~i~~i~~~~~ 155 (516) .+.... .+...|.+-.. .+....+..+..+|+.+++++-+..+.+--..++.... +......+...++ T Consensus 100 -s~vtsl------s~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~ 172 (945) T protein:vir:10 100 -MYLPSI------SDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLE 172 (945) T ss_pred -eecccc------cCccceeeehhhhhhhhccHHHHHHHHHHHhhhccCceEEEEecccCcccccccccccchHHHHHHh Confidence 000000 00001111111 12234678899999999999999988873222111100 0000112233333 Q ss_pred hc-------ChhHHHHHHHH-hcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccc Q lcl|NC_019527. 156 YY-------GVMGIIQKAAE-HDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAP 227 (516) Q Consensus 156 ~l-------~~~~~l~ea~~-~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~ 227 (516) +. ..|+.|.+.+. .-.++|.+++++.-+ . .|.+.+|.+++|.+|++.... |. T Consensus 173 rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd-~--------------~G~ii~L~pLdPs~Vti~~dd--DG--- 232 (945) T protein:vir:10 173 RPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRD-E--------------QGNLVAITPVDGTTIKPILSE--DT--- 232 (945) T ss_pred CCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEEC-C--------------CCcEEEEEEECCcceEEEEcC--CC--- Confidence 33 34555666654 456788888877432 1 134567889999988764322 11 Q ss_pred cccCcceeE--Eee---eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhC-C--ce Q lcl|NC_019527. 228 DFYKPSTWW--VLG---REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFS-R--TF 299 (516) Q Consensus 228 ~yg~P~~y~--v~g---~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~-~--~v 299 (516) +....|. +.| ..++++.+|++....-++. ...+.|.|.++.+.+.+.....+....+.+..+.+ . .+ T Consensus 233 --~~~y~Yv~~idG~~~~~v~a~DvIlhirn~s~DG---~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGI 307 (945) T protein:vir:10 233 --GIVVGYVQEVDGAIVAHFDKRDVVLFRQNLTPDV---YMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGI 307 (945) T ss_pred --cEEEEEEEecCCceEEEecCCceEEEeccCCCCc---ccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceE Confidence 1111221 222 3567777766543332221 12345999999999999999999988888876533 2 24 Q ss_pred eeecch--------hhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCC--HHHHHHHHHHHHHhhhcC Q lcl|NC_019527. 300 LKTNMA--------QVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSG--LADLQSQSQEHMCSVSKI 369 (516) Q Consensus 300 ~k~~~~--------~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsg--l~d~~~~~~~~iaaas~I 369 (516) ++++.. ..++....+++.+.++....+ .|.|..++..++.++++++.+..+ +.+.......+||++.|| T Consensus 308 Lsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG-~NnG~piVLdeGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGV 386 (945) T protein:vir:10 308 LAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMG-DYTQVPILSGGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQV 386 (945) T ss_pred EEecCccccccccccccCHHHHHHHHHHHHHHhCC-cccccceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCC Confidence 554322 223333333455555444433 344443333445788888766554 345677777899999999 Q ss_pred CceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CcCCcceEEeCCCCCCCHHHHHHHHH Q lcl|NC_019527. 370 PAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWG-EIDDAITFKFKSLWQTSAKEESEIRF 448 (516) Q Consensus 370 P~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g-~~~~d~~~~f~pL~~~sekEkAei~~ 448 (516) |..+| |...++-.++.+.....|+. ..|.|.+..+-..|-+...- ....++.|+|+.+..++.+++ T Consensus 387 PP~lL-G~~e~st~SNiEqq~~~Fv~-------~tL~Pil~~IEqeLNrkLl~~~eg~~i~fdFd~ldl~D~ksr----- 453 (945) T protein:vir:10 387 SPQDV-GILEGSNKATAEVMASLTKA-------KGLEPLMATISKGFDEVVSEFRNEKDIKLWFKEDDLEKERDW----- 453 (945) T ss_pred CHHHc-ccCCCCCcchHHHHHHHHHH-------HHHHHHHHHHHHHHHHhccccccCceeEEEecchhccCHHHH----- Confidence 98777 54433333444545555543 23556666555554432210 112468999999887776654 Q ss_pred HHHHHHHHHHHcCCCCHHHHHHHHHhhhccC----CC------CCChhhhccccccc---hhcCCCCCCC---C---CCC Q lcl|NC_019527. 449 NKAQEAQIYITNSVIDPSEARQQLSDDPDSG----WD------NIDGDLEIVQPEMF---DDDGADPYMP---D---PDV 509 (516) Q Consensus 449 ~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~----~~------~~d~~~e~~~~e~~---~~e~~~~~~~---~---~~~ 509 (516) ++++..++++|++|++|+|+.++..+..| +. +.+..........+ .....+.+.+ + .++ T Consensus 454 --aEal~kli~sGiLTiNEvRe~lGLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns~ 531 (945) T protein:vir:10 454 --WNIIQGQLNTGFRSINEARMEKGLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDENSS 531 (945) T ss_pred --HHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccccccccCCCCcccccCCCCCCCCCCCCCCCCCC Confidence 45667789999999999999986544211 10 11100000000000 0000111111 0 111 Q ss_pred CCCCCCC Q lcl|NC_019527. 510 LPGEEGS 516 (516) Q Consensus 510 ~~~~e~t 516 (516) .|++.++ T Consensus 532 ~psE~kd 538 (945) T protein:vir:10 532 VPSEQKN 538 (945) T ss_pred CCCcccc Confidence 2222222 No 66 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=99.86 E-value=4.3e-22 Score=137.72 Aligned_cols=369 Identities=12% Similarity=0.064 Sum_probs=200.6 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTL 121 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~ 121 (516) |.--.. ..|...+...-. ...+++..... .++. ...+ =....+.+++.+.++|+.+ T Consensus 1 Mg~~~~---~~~~k~~~~~~~-------~~~~~~~~~~~-------~~~~----~~~~---v~~~~~l~~~~v~~~i~~i 56 (383) T protein:vir:10 1 MGLLTP---KNFSKRNAKNMV-------YPSNPAFFTTT-------VGGM----QLSY---VSALSALQNTNVYSVINRI 56 (383) T ss_pred CCcccc---cccccccccccc-------cccchhhhhhh-------ccCc----cccc---cchhHhhcchHHHHHHHHH Confidence 210000 000000000000 00000100000 0000 0000 0123355678899999999 Q ss_pred hHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccc Q lcl|NC_019527. 122 STELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKK 200 (516) Q Consensus 122 aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~ 200 (516) |+++.+..+++..... ..|-.....+..+..|.+.+.+. .++|.|++++.-+ . T Consensus 57 a~~ia~~~~~~~~~~~----------~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~---~------------- 110 (383) T protein:vir:10 57 ASDVSSAHFKTENTAT----------LNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ---N------------- 110 (383) T ss_pred HHhhccCceeecccch----------hhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC---c------------- Confidence 9999998887743211 11222233344556666555555 4588888876421 1 Q ss_pred cceeeEEeecceeeccccccccccccccccCcceeEEe------eeEeccceEEEecCCcchhhhhhccCCCCchHHHHH Q lcl|NC_019527. 201 GSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL------GREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLA 274 (516) Q Consensus 201 g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~------g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~ 274 (516) .++.++++..|..... .+ ...|.+. .+++++++|+||.+...+ ......|.|.++.+ T Consensus 111 ---~~~~p~~~~~v~~~~~--~~--------~~~~~~~~~~~~~~~~~~~~evih~r~~~~~----~~~~~~G~s~l~~~ 173 (383) T protein:vir:10 111 ---LEHIPNSDVQINYLPG--NM--------GIVYTVLESNDRPKMVLRQDQMLHFRLMPDP----QYRYLIGRSPLESL 173 (383) T ss_pred ---eeEeecCcceEEEEEc--CC--------ceEEEEEEcCCceEEEEcccceEEeccCCCC----cccccccccHHHHH Confidence 1244455555443211 11 1122221 356889999999753321 11235699999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcC-ccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEecccC Q lcl|NC_019527. 275 QPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNG-GEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVNTPLS 350 (516) Q Consensus 275 ~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~-~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~~~ls 350 (516) .+.|.....+......++.+.... +++++ ..+.. ...+.+.++++..... .|.+ +++++. +.+|+.++.+.. T Consensus 174 ~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~--~~~~~~e~~~~~~~~~~~~~~~-~n~~~~~vl~~-g~~~~~l~~~~~ 249 (383) T protein:vir:10 174 QNALNLDDKASKSNMSAMENQINPAGKLTIS--NYLSDGKDLESAREEFEKANTG-DNSGRLMVLPD-GFDYTQLEMKTD 249 (383) T ss_pred HHHHHHHHHHHHHHHHHHhccCCcceEEEeC--CCCCCHHHHHHHHHHHHHHhCc-cccCCccccCC-CceEEecCCChh Confidence 999999999999999999987764 33433 22322 2223344444433333 3444 556654 588999988776 Q ss_pred CHH---HHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCc Q lcl|NC_019527. 351 GLA---DLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDA 427 (516) Q Consensus 351 gl~---d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d 427 (516) ..+ ++.....++||.+.+||-.+|.+...++.+.+.-+..+.+| + ..|+|.++.|...+-+..++ .+ T Consensus 250 d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~~~~------~-~~l~P~~~~ie~~l~~~l~~---~~ 319 (383) T protein:vir:10 250 VFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATY------L-ANLNSYVNPIVDELRLKMNA---PD 319 (383) T ss_pred HHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHHHHH------H-HHHHHHHHHHHHHHHHhhCC---ce Confidence 654 45666689999999999887755433333322222223232 2 23789888887777665543 36 Q ss_pred ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCCCCCCCCC Q lcl|NC_019527. 428 ITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGADPYMPDP 507 (516) Q Consensus 428 ~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~~~~~~~~ 507 (516) ++|.+.+|...|.+++++ ++..++++|++|++|+|+.++. +++..... ++... ...+..+++. T Consensus 320 ~~f~~~~l~~~d~~~~~~-------~~~~~~~~G~~t~nE~R~~lg~------~p~~~~d~---~~~~~-~~~~~~gGd~ 382 (383) T protein:vir:10 320 LELDIKDMLDVDDSILIN-------QVSNLAKSGVLGAEQAQFILTR------SGFLPDNL---PEFKP-LTNETKGGDD 382 (383) T ss_pred EEeechhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCC------CcccCCcc---cccCC-CcccCCCCCC Confidence 889999999999988755 4567899999999999999844 33321110 00000 0011112222 Q ss_pred C Q lcl|NC_019527. 508 D 508 (516) Q Consensus 508 ~ 508 (516) + T Consensus 383 e 383 (383) T protein:vir:10 383 K 383 (383) T ss_pred C Confidence 2 No 67 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=99.85 E-value=1.3e-21 Score=135.16 Aligned_cols=387 Identities=13% Similarity=0.075 Sum_probs=209.6 Q ss_pred hhhhHHHHhH--H-hhcCCCc----------cccccCCCCCCCccCCCccchh-cccccccchhhhcccccCCccccccc Q lcl|NC_019527. 32 LAMRRAVMKS--M-ERRASDA----------ATKWAPPQLMPGVVPAGTTPAV-AMDSLCGPTYQFLNSAAGGLYAADIQ 97 (516) Q Consensus 32 ~~~~~~~~~~--~-~~~~~~~----------~~~~~~~~~~~gv~~~~~~~~~-a~ds~~~~~~~~~~~~~~~~~~~~~~ 97 (516) |.+|..+... + ..+++.. ...|..|.-++. ..+... +..+. + +....... . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~-~-------g~~~~~~~---~ 65 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPE----ARALPWIRPTAW-S-------GYPESWAT---P 65 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchh----hhhccccccccc-c-------cccccccc---c Confidence 3333222211 0 0111111 122322211111 100000 00000 0 00000000 0 Q ss_pred CcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeE Q lcl|NC_019527. 98 PFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQ 177 (516) Q Consensus 98 ~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~ 177 (516) +.. .--...+..++.+.++|+.+++.+-+..+.+.-.++..+ . ....+...-..+.-+..|.+.+.+..+.|+++ T Consensus 66 ~~~-~~t~~~~~~~~~v~acV~~Ia~~iA~lpl~~~~~~~~~~--~--~~~ll~~~PN~~~t~~~f~~~l~~~lllGnay 140 (409) T protein:vir:83 66 SWG-SAQDKLRTLIDVAWACIDLNASVLSSMPIYRMRNGRIID--S--VAWMSNPDPEVYTSWQEFAKQLFWDFQLGEAF 140 (409) T ss_pred Ccc-ccchhhHhhhHHHHHHHHHHHHhhccCceEEeeCCcccc--c--hhhhcccCCCCCCCHHHHHHHHHHHHhhCCcE Confidence 000 011244667788999999999999988887654332211 1 11223334445566778888888888889998 Q ss_pred EEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEeeeEeccceEEEecCCcchh Q lcl|NC_019527. 178 ISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPD 257 (516) Q Consensus 178 i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~ 257 (516) +++...+.+ |.+..|.+++|..|++... .| |. ..|++.+. ..++.|||+.... T Consensus 141 ~~~i~r~~~--------------G~~~~L~pl~p~~v~v~~~--~~------g~-~~y~~~~~-~~~~eiiHir~~~--- 193 (409) T protein:vir:83 141 VLPMAHGSD--------------GYPIRFRVVPPWLVNVELK--KG------AR-REYRIGGL-NVTDEILHIRYQG--- 193 (409) T ss_pred EEEEEECCC--------------CcEEEEEEECCcceEEEEc--CC------ce-EEEEEccc-cCccceEEeCCCC--- Confidence 876543322 3345688888888775321 11 11 24667654 3468899986532 Q ss_pred hhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcce-EE Q lcl|NC_019527. 258 MLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGL-AV 334 (516) Q Consensus 258 ~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~-~~ 334 (516) +...+.|.|.++.+...+.....+......++.+.... +++++ ..++.+.-+++.++++. .+..|.|. ++ T Consensus 194 ---~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga~p~gil~~~--~~ls~e~~~~~~~~~~~--~~~~nag~~~i 266 (409) T protein:vir:83 194 ---NTADAHGHGPLESAAPRQVVIGLLQKYVQNLAETGGVPLYWLGVE--RRLSETEAVDLMDRWIE--SRSKYAGHPAL 266 (409) T ss_pred ---CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEeecC--CCCCHHHHHHHHHHHHH--hhCCccCccce Confidence 23456799999999999999888888888888875543 33433 34544433445444432 22335553 45 Q ss_pred EecCCcceeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeecccccccccc---chHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 335 MDFDSEDIVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNAS---SEGEIRSFYDDISSVQQSYYFSPL 409 (516) Q Consensus 335 id~~~e~~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnat---ge~D~~~yyd~I~~~Qe~~l~p~l 409 (516) +.++.+.++.++.+..+ +-+...+...+||.+.+||. .|+|....+-++| -|.....| -+..|.|.+ T Consensus 267 l~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp-~llg~~~~~~~~tysn~eq~~~~f-------~~~tL~P~~ 338 (409) T protein:vir:83 267 VTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPP-FLVGLPGATGSLTYSNIEQLFSFH-------DRSSLRPKA 338 (409) T ss_pred ecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCH-HHccCCCCccccccccHHHHHHHH-------HHHHHHHHH Confidence 54443333445554433 23444566788999999996 5667543332222 23333333 344578888 Q ss_pred HHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhc Q lcl|NC_019527. 410 DTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEI 489 (516) Q Consensus 410 ~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~ 489 (516) .++-+.|.+..+-. ...++|.+..|...+.+++ +++++.++++|++|++|+|+.+...+..|- .+. T Consensus 339 ~~ie~~l~~~Ll~~-~~~~~f~~~~llr~d~~~r-------~~~~~~~~~~G~lT~NE~R~~~glpp~~gg------d~l 404 (409) T protein:vir:83 339 TAVMAALDRWALPS-PQHLELNRDDYTRPSLVER-------ATAYKIMIEAGVMEPNEARAMERLHSEAAA------VRL 404 (409) T ss_pred HHHHHHHHHhhCCC-CcEEEeehhhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhCCCCCCCC------ccc Confidence 88888776654311 2235555567777777765 556778899999999999998744332221 111 Q ss_pred ccccc Q lcl|NC_019527. 490 VQPEM 494 (516) Q Consensus 490 ~~~e~ 494 (516) +..+. T Consensus 405 ~~~gv 409 (409) T protein:vir:83 405 SGGGV 409 (409) T ss_pred CCCCC Confidence 11222 No 68 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=99.85 E-value=3e-21 Score=133.08 Aligned_cols=391 Identities=11% Similarity=0.052 Sum_probs=202.2 Q ss_pred hhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccch---hcccccccchhhhcccccCCcccccccCcccHHHHHHH Q lcl|NC_019527. 32 LAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPA---VAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAAL 108 (516) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~---~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y 108 (516) |.+.+ .. ..+..... +.+.+.+. ...+ +|--..++ T Consensus 1 m~~~~---------~~---------------~~~~~~~~~~~~~~~~~~~-------~~~g-----------~~~~~~Al 38 (417) T protein:vir:38 1 MKLFR---------GL---------------ATEVDPHWADHLLDSGVIP-------SFRG-----------GYLGISAL 38 (417) T ss_pred Ccccc---------cc---------------ccCCCccchhhhccccccc-------ccCC-----------ceechhhc Confidence 11110 00 00000000 00000000 0000 11111122 Q ss_pred HhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCc Q lcl|NC_019527. 109 ATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADV 187 (516) Q Consensus 109 ~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~ 187 (516) +++-+.+||+.+++++-+-.+.+.....+...+.......|........-...|.+.+... .++|.|++++.-++. T Consensus 39 -~~~~V~~cv~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~-- 115 (417) T protein:vir:38 39 -RNSDVLTAVSIVSGDVSRFPLVITDSSTDEVIDLANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPI-- 115 (417) T ss_pred -ccHHHHHHHHHHHHhhccCeeEEEEcCCcceeccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC-- Confidence 4555678999999999998888754333222221121222333334444555666665554 667889988754321 Q ss_pred ccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe------eeEeccceEEEecCCcchhhhhh Q lcl|NC_019527. 188 SVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL------GREMHASRLLTIITRPLPDMLKP 261 (516) Q Consensus 188 ~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~------g~~iH~SRli~~~~~~~p~~~k~ 261 (516) .|....|.++.|.+|.+.... . +. -.|++. ...++++.||||.+.. T Consensus 116 ------------g~~~~~l~~l~p~~v~v~~~~---~-----~~-~~y~~~~~~~~~~~~~~~~dviH~r~~~------- 167 (417) T protein:vir:38 116 ------------TNEPAMFEFYAPSQTQVDTSD---P-----DN-IIYRFTPYNSSMQKVCGFEDVIHWKFFS------- 167 (417) T ss_pred ------------CCEEEEEEEeCCceEEEEEcC---C-----Ce-EEEEEEEcCCcEEEEecCcceEEecCCC------- Confidence 123445777888877653221 1 11 123343 2346789999997643 Q ss_pred ccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCc Q lcl|NC_019527. 262 AYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSE 340 (516) Q Consensus 262 ~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e 340 (516) ...+.|+|.++.+.+.|.....+......++.+....-.-......++.+..+++.++++...... |.| .++++ ++. T Consensus 168 ~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~l~~e~~~~~~~~~~~~~~g~-n~g~~~vl~-~g~ 245 (417) T protein:vir:38 168 YDTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKESRLSAEARQKIREDFERAQAGA-DAGSPIIVD-ATM 245 (417) T ss_pred CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCHHHHHHHHHHHHHHhccc-ccCCceecc-CCc Confidence 234569999999999999999999999998888665433222234455554556666666554443 455 45555 457 Q ss_pred ceeEEecccCCH--HHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 341 DIVQVNTPLSGL--ADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQL 418 (516) Q Consensus 341 ~~e~~~~~lsgl--~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~ 418 (516) +|+.++.+..+. -+...+..+.||.+.+||..+| |.+ +-+++.+.....|+ +..|.|.+..+...|-+ T Consensus 246 ~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g~~--~~~s~~e~~~~~~~-------~~tl~P~~~~ie~~l~~ 315 (417) T protein:vir:38 246 DYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRL-AQN--SPNQSVKQLADDYI-------RNDLPFYFEPITSEFEL 315 (417) T ss_pred eEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHh-CCC--CcchhHHHHHHHHH-------HHHHHHHHHHHHHHHHh Confidence 899887665432 3345555688999999998776 533 23344444444444 34578888888777766 Q ss_pred HhCCCc-CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhcc-CCC-C--CChhh--hccc Q lcl|NC_019527. 419 SKWGEI-DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDS-GWD-N--IDGDL--EIVQ 491 (516) Q Consensus 419 s~~g~~-~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~-~~~-~--~d~~~--e~~~ 491 (516) ..+... ..++.|+|+. ..+... ....+..++++|++|++|+|+.+...+.. |.. . +..+. .... T Consensus 316 ~Ll~~~~~~~~~~~fd~-~~l~~~--------~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~ 386 (417) T protein:vir:38 316 KLLDDAQRHQYCIGFDT-KSVNGL--------PIADVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQK 386 (417) T ss_pred hhcChhhcccceEEech-hhhhHH--------HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccccccccc Confidence 554322 1356788852 112221 12235667899999999999998554321 100 0 00000 0000 Q ss_pred cccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 492 PEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 492 ~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) ++....+.....+++...+.+.+++ T Consensus 387 ~~~~~~~~~~~kgg~~~~~~~~~~~ 411 (417) T protein:vir:38 387 EAYQAEHAAELKGGDTNAKGNQNGS 411 (417) T ss_pred cccccccccccCCCCCCCCCCCcCC Confidence 0000001111112222222222222 No 69 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=99.85 E-value=5e-21 Score=131.88 Aligned_cols=403 Identities=11% Similarity=0.090 Sum_probs=215.0 Q ss_pred CCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHH Q lcl|NC_019527. 26 QEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNL 105 (516) Q Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll 105 (516) -+.|.- .|+-..++.. |.+ ..+....+.+ .....+..++... +....+.... -. T Consensus 1 ~~~~~~---------~~~~~~~~g~--~~~---~~~~~~~~~~-~~~~~~~~~~~~~----------~~~~~~~~~v-~~ 54 (424) T protein:vir:18 1 MEEPKY---------TIDLRTNNGW--WAR---LQSWFVGGRL-VTPNQGSQTGPVS----------AHGHLGDSSI-ND 54 (424) T ss_pred CCCCcc---------eEeecCCCch--HHH---HHhhhccccc-ccccccccccccc----------cccccccccc-cH Confidence 000000 0000000000 000 0000000000 0000000000000 0000000001 11 Q ss_pred HHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhh----hHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEE Q lcl|NC_019527. 106 AALATRPEYRAFASTLSTELTREGIEITSKDRTKAKE----MASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISI 180 (516) Q Consensus 106 ~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~----~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i 180 (516) ..+.+++.+.++|+.+++++-+..+.+--.+.+.... .......|...-........|.+.+... .++|.+++++ T Consensus 55 ~~al~~~~v~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i 134 (424) T protein:vir:18 55 ERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV 134 (424) T ss_pred HHhhccHHHHHHHHHHHHhhccCceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEE Confidence 3455677789999999999999998874322222111 1122233444444445566666666655 6678888887 Q ss_pred EecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCccee--EEee--eEeccceEEEecCCcch Q lcl|NC_019527. 181 NIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTW--WVLG--REMHASRLLTIITRPLP 256 (516) Q Consensus 181 ~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y--~v~g--~~iH~SRli~~~~~~~p 256 (516) .-+ ..|.+.+|.+++|..|++... .+ +-.| .+.| ..+.++.|||+.+.. T Consensus 135 ~r~---------------~~G~~~~L~pl~~~~V~v~~~--~~--------~~~y~~~~~g~~~~~~~~eIih~r~~~-- 187 (424) T protein:vir:18 135 DRN---------------SAGDVISLLPLQSANMDVKLV--GK--------KVVYRYQRDSEYADFSQKEIFHLKGFG-- 187 (424) T ss_pred EEC---------------CCCcEEEEEEecCcceEEEEc--CC--------eEEEEEEeCCeEEEeccccEEEecCcC-- Confidence 432 124466788999988875321 12 1233 3334 468899999997532 Q ss_pred hhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEE Q lcl|NC_019527. 257 DMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAV 334 (516) Q Consensus 257 ~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~ 334 (516) .....|.|.++.+.+.|.....+......++.+.... +++++. ..++....+.+.++++......+..++.+ T Consensus 188 -----~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~-~~l~~e~~~~~~~~~~~~~~g~nag~~~v 261 (424) T protein:vir:18 188 -----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE-KVLTEQQRSQVEENFKEIAGGPVKKRLWI 261 (424) T ss_pred -----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCC-cCCCHHHHHHHHHHHHHHhCCcccCCcee Confidence 1346799999999999999999999999999987654 444431 22333333456666655444433334566 Q ss_pred EecCCcceeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeecccccccc--ccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 335 MDFDSEDIVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSEGEIRSFYDDISSVQQSYYFSPLD 410 (516) Q Consensus 335 id~~~e~~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge~D~~~yyd~I~~~Qe~~l~p~l~ 410 (516) ++. +-+|++++.+..+ +-+...+..++||.+.+||-.+| |....+-. +..|.....|| +..|.|.+. T Consensus 262 l~~-g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~sn~eq~~~~f~-------~~tl~P~~~ 332 (424) T protein:vir:18 262 LEA-GFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLV-GDVEKSTSWGSGIEQQNLGFL-------QYTLQPYIS 332 (424) T ss_pred ccC-CceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCcccccccHHHHHHHHH-------HHHHHHHHH Confidence 654 5788888766443 34556777889999999998666 65433321 22233344443 346789999 Q ss_pred HHHHHHHHHhCCCcC-Cc--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh Q lcl|NC_019527. 411 TMLKVIQLSKWGEID-DA--ITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL 487 (516) Q Consensus 411 ~l~~~l~~s~~g~~~-~d--~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~ 487 (516) .+-..|-+..+.... .. ++|.++.|...|.+++++ ++.+++++|++|++|+|+.++ +++++... T Consensus 333 ~ie~~l~~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~-------~~~~~~~~G~~T~NE~R~~~g------l~pi~gGD 399 (424) T protein:vir:18 333 RWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAA-------FMKAMGEAGLRTINEMRRTDN------LPPLPGGD 399 (424) T ss_pred HHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhC------CCCCCCcC Confidence 988878765543322 23 445556788888888755 456689999999999999884 44443211 Q ss_pred hcccc-c-cc-hhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 488 EIVQP-E-MF-DDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 488 e~~~~-e-~~-~~e~~~~~~~~~~~~~~~e~t 516 (516) +...+ . .+ ++.+. ..+|..++. T Consensus 400 ~~~~~~n~~~l~~~~~-------~~~p~~~ga 424 (424) T protein:vir:18 400 VAMRQSQYVPITDLGT-------NKEPRNNGA 424 (424) T ss_pred eeeeccCccchHhhhc-------cCCCccCCC Confidence 11100 0 00 10000 111222222 No 70 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=99.85 E-value=3.4e-21 Score=132.79 Aligned_cols=407 Identities=14% Similarity=0.128 Sum_probs=207.5 Q ss_pred ccccCC----CCCCCccCCCccchhcccccccchh-hhcccccCC-------cccccccCcccHHHHHHHHhCchhhhhh Q lcl|NC_019527. 51 TKWAPP----QLMPGVVPAGTTPAVAMDSLCGPTY-QFLNSAAGG-------LYAADIQPFPGYQNLAALATRPEYRAFA 118 (516) Q Consensus 51 ~~~~~~----~~~~gv~~~~~~~~~a~ds~~~~~~-~~~~~~~~~-------~~~~~~~~f~gy~ll~~y~~~~i~r~iV 118 (516) -.|+.| .-..+..-+ +.-|.+.+.+.... +.......+ .......+...|--..++ +++-+.++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al-~~~~V~~cv 77 (441) T protein:vir:79 1 MHWYNTDCYFVDFKSRKQS--RKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAI-RHSDIFTAV 77 (441) T ss_pred CccccCccccccccccccc--hhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhh-ccHHHHHHH Confidence 222222 111111110 11122222211000 000000000 000000011112222233 344567799 Q ss_pred hhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHh-cccceeeEEEEEecCCCcccCccccccc Q lcl|NC_019527. 119 STLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEH-DCFFGRGQISINIKGADVSVPLILDPRT 197 (516) Q Consensus 119 d~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~-~rlyG~a~i~i~i~~~~~~~Pl~ld~~~ 197 (516) +.+|+++-+..+.+.-.+.... .......|..+-..+.-...|.+.+.+ -.++|.|++++.-++ T Consensus 78 ~~Ia~~iA~lp~~~~~~~~~~~--~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~------------- 142 (441) T protein:vir:79 78 MMIASDLARMPIRVTVNGQINY--SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDK------------- 142 (441) T ss_pred HHHHHhhccCceeeecCccccc--cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECC------------- Confidence 9999999998887754332211 111222333333444445566666555 477899988874421 Q ss_pred ccccceeeEEeecceeeccccccccccccccccCcceeE--Ee------eeEeccceEEEecCCcchhhhhhccCCCCch Q lcl|NC_019527. 198 IKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWW--VL------GREMHASRLLTIITRPLPDMLKPAYNFSGIS 269 (516) Q Consensus 198 I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~--v~------g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S 269 (516) .|.+.+|.+++|..|++.. |. .|.+.++. +. .+.++++.||||...+ ...+.|.| T Consensus 143 --~G~~~~L~~i~~~~v~v~~----d~----~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~~~-------~dg~~G~s 205 (441) T protein:vir:79 143 --TGEPMNLTFRKTSEIELKS----DA----RGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYS-------LDGINGLS 205 (441) T ss_pred --CCcEEEEEEEcCceeEEEE----CC----CccEEEEEEEeccCCceeEEEEccccEEEeccCC-------CCCccccC Confidence 2445678888888887532 11 22332221 11 1468899999997643 23457999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCcc-HHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEE Q lcl|NC_019527. 270 MSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGE-GGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQV 345 (516) Q Consensus 270 ~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~-~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~ 345 (516) .++.+.+.|.....+......++.+.... +++++ ..+..+. .+++.++++.......|.| +++++ ++.+|+.+ T Consensus 206 pl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~-~G~~~~~l 282 (441) T protein:vir:79 206 LLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK--GVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD-ESMTFDQL 282 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC--CCCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCceEEEc Confidence 99999999999999999999998886653 34443 3332221 1234444444333334444 45555 45789988 Q ss_pred ecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019527. 346 NTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE 423 (516) Q Consensus 346 ~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~ 423 (516) +.+... +-+........||.+.+||..+| |...++.+ ..+...+|. ..|.|.+..+-..|-+..+.. T Consensus 283 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~s---~~q~~~~~~-------~tl~P~~~~ie~eln~kl~~~ 351 (441) T protein:vir:79 283 EVDTEVLKLIRENKSSTREIAGVFGIPLHKF-GIETANMS---ITDANLDYL-------STLKPYITCVCAELNFKFNDE 351 (441) T ss_pred cCChhHHHHHHHHHHhHHHHHHHhCCCHHHc-CCCCCCcc---HHHHHHHHH-------HHHHHHHHHHHHHHhhhcccc Confidence 766543 44566777889999999998765 76544332 222222221 247788888877666543311 Q ss_pred c-CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh----hcccc-ccchh Q lcl|NC_019527. 424 I-DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL----EIVQP-EMFDD 497 (516) Q Consensus 424 ~-~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~----e~~~~-e~~~~ 497 (516) - .-.++|.++.|...|.++++ +++..++++|++|++|+|+.+...+..+- |.+. ....+ +..+. T Consensus 352 ~~~~~~~fd~~~llr~D~~~~~-------~~~~~~i~~G~~T~NE~R~~~gl~Pi~gg---d~~~~~~~~n~~~~~~~~~ 421 (441) T protein:vir:79 352 YVNREFKFDTTEIRVVDEKTQA-------EIDKINIDSGKMNIDEIRQRDGLAPIPGG---NGSIHRVDLNHVNIELVDE 421 (441) T ss_pred ccCceEEeechhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhCCCCCCCC---CcceEeecccccccccccc Confidence 1 11345555678777877764 45677899999999999999855432110 1000 00000 00000 Q ss_pred cCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 498 DGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 498 e~~~~~~~~~~~~~~~e~t 516 (516) .....+...+....|+|.- T Consensus 422 ~~~~~~~~~~~~~kgGe~~ 440 (441) T protein:vir:79 422 YQMNKSRATDKKLKGGEEN 440 (441) T ss_pred cccccccccccccCCCCCC Confidence 0000000011112222222 No 71 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=99.85 E-value=3.4e-21 Score=132.79 Aligned_cols=407 Identities=14% Similarity=0.128 Sum_probs=207.5 Q ss_pred ccccCC----CCCCCccCCCccchhcccccccchh-hhcccccCC-------cccccccCcccHHHHHHHHhCchhhhhh Q lcl|NC_019527. 51 TKWAPP----QLMPGVVPAGTTPAVAMDSLCGPTY-QFLNSAAGG-------LYAADIQPFPGYQNLAALATRPEYRAFA 118 (516) Q Consensus 51 ~~~~~~----~~~~gv~~~~~~~~~a~ds~~~~~~-~~~~~~~~~-------~~~~~~~~f~gy~ll~~y~~~~i~r~iV 118 (516) -.|+.| .-..+..-+ +.-|.+.+.+.... +.......+ .......+...|--..++ +++-+.++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al-~~~~V~~cv 77 (441) T protein:vir:94 1 MHWYNTDCYFVDFKSRKQS--RKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAI-RHSDIFTAV 77 (441) T ss_pred CccccCccccccccccccc--hhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhh-ccHHHHHHH Confidence 222222 111111110 11122222211000 000000000 000000011112222233 344567799 Q ss_pred hhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHh-cccceeeEEEEEecCCCcccCccccccc Q lcl|NC_019527. 119 STLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEH-DCFFGRGQISINIKGADVSVPLILDPRT 197 (516) Q Consensus 119 d~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~-~rlyG~a~i~i~i~~~~~~~Pl~ld~~~ 197 (516) +.+|+++-+..+.+.-.+.... .......|..+-..+.-...|.+.+.+ -.++|.|++++.-++ T Consensus 78 ~~Ia~~iA~lp~~~~~~~~~~~--~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~------------- 142 (441) T protein:vir:94 78 MMIASDLARMPIRVTVNGQINY--SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDK------------- 142 (441) T ss_pred HHHHHhhccCceeeecCccccc--cchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECC------------- Confidence 9999999998887754332211 111222333333444445566666555 477899988874421 Q ss_pred ccccceeeEEeecceeeccccccccccccccccCcceeE--Ee------eeEeccceEEEecCCcchhhhhhccCCCCch Q lcl|NC_019527. 198 IKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWW--VL------GREMHASRLLTIITRPLPDMLKPAYNFSGIS 269 (516) Q Consensus 198 I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~--v~------g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S 269 (516) .|.+.+|.+++|..|++.. |. .|.+.++. +. .+.++++.||||...+ ...+.|.| T Consensus 143 --~G~~~~L~~i~~~~v~v~~----d~----~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~~~-------~dg~~G~s 205 (441) T protein:vir:94 143 --TGEPMNLTFRKTSEIELKS----DA----RGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYS-------LDGINGLS 205 (441) T ss_pred --CCcEEEEEEEcCceeEEEE----CC----CccEEEEEEEeccCCceeEEEEccccEEEeccCC-------CCCccccC Confidence 2445678888888887532 11 22332221 11 1468899999997643 23457999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCcc-HHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEE Q lcl|NC_019527. 270 MSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGE-GGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQV 345 (516) Q Consensus 270 ~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~-~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~ 345 (516) .++.+.+.|.....+......++.+.... +++++ ..+..+. .+++.++++.......|.| +++++ ++.+|+.+ T Consensus 206 pl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~-~G~~~~~l 282 (441) T protein:vir:94 206 LLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK--GVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD-ESMTFDQL 282 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC--CCCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCceEEEc Confidence 99999999999999999999998886653 34443 3332221 1234444444333334444 45555 45789988 Q ss_pred ecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019527. 346 NTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE 423 (516) Q Consensus 346 ~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~ 423 (516) +.+... +-+........||.+.+||..+| |...++.+ ..+...+|. ..|.|.+..+-..|-+..+.. T Consensus 283 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~s---~~q~~~~~~-------~tl~P~~~~ie~eln~kl~~~ 351 (441) T protein:vir:94 283 EVDTEVLKLIRENKSSTREIAGVFGIPLHKF-GIETANMS---ITDANLDYL-------STLKPYITCVCAELNFKFNDE 351 (441) T ss_pred cCChhHHHHHHHHHHhHHHHHHHhCCCHHHc-CCCCCCcc---HHHHHHHHH-------HHHHHHHHHHHHHHhhhcccc Confidence 766543 44566777889999999998765 76544332 222222221 247788888877666543311 Q ss_pred c-CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh----hcccc-ccchh Q lcl|NC_019527. 424 I-DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL----EIVQP-EMFDD 497 (516) Q Consensus 424 ~-~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~----e~~~~-e~~~~ 497 (516) - .-.++|.++.|...|.++++ +++..++++|++|++|+|+.+...+..+- |.+. ....+ +..+. T Consensus 352 ~~~~~~~fd~~~llr~D~~~~~-------~~~~~~i~~G~~T~NE~R~~~gl~Pi~gg---d~~~~~~~~n~~~~~~~~~ 421 (441) T protein:vir:94 352 YVNREFKFDTTEIRVVDEKTQA-------EIDKINIDSGKMNIDEIRQRDGLAPIPGG---NGSIHRVDLNHVNIELVDE 421 (441) T ss_pred ccCceEEeechhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhCCCCCCCC---CcceEeecccccccccccc Confidence 1 11345555678777877764 45677899999999999999855432110 1000 00000 00000 Q ss_pred cCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 498 DGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 498 e~~~~~~~~~~~~~~~e~t 516 (516) .....+...+....|+|.- T Consensus 422 ~~~~~~~~~~~~~kgGe~~ 440 (441) T protein:vir:94 422 YQMNKSRATDKKLKGGEEN 440 (441) T ss_pred cccccccccccccCCCCCC Confidence 0000000011112222222 No 72 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=99.85 E-value=1e-20 Score=130.19 Aligned_cols=404 Identities=11% Similarity=0.066 Sum_probs=213.2 Q ss_pred ChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCccccc--ccCccc-HHHHHH Q lcl|NC_019527. 31 KLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAAD--IQPFPG-YQNLAA 107 (516) Q Consensus 31 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~--~~~f~g-y~ll~~ 107 (516) -..-+- +.|..| .|+ .+.-|+++.......-..+....+.+ .....| |=-... T Consensus 1 ~~~~~~--------------~~~~~~---~~~-------~~~~~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~ 56 (424) T protein:vir:45 1 MLYCWW--------------AHWLWP---EGG-------RVLLDALFRSKSLENPSTPITGDAVDTDGLFRADVYVSPET 56 (424) T ss_pred CeeEee--------------eceecC---cch-------hHHHHhhccccCCCCCccccchhhhhhhccccCCceechHH Confidence 000000 001111 111 01111111000000000000000000 000000 001133 Q ss_pred HHhCchhhhhhhhhhHHHhhCCCeeeeccccchhh--hHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecC Q lcl|NC_019527. 108 LATRPEYRAFASTLSTELTREGIEITSKDRTKAKE--MASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKG 184 (516) Q Consensus 108 y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~--~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~ 184 (516) +.+++-+.++|+.+|++.-+..+.+.-..++..+. .......|...-....-...|.+.+... .++|.+++++.-+. T Consensus 57 al~~~~v~~cv~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~ 136 (424) T protein:vir:45 57 AMKLAAVYSCIYVLSSSLAQMPLHVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNR 136 (424) T ss_pred hhccHHHHHHHHHHHHHHhhCceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcC Confidence 45677889999999999999998874333222211 1112233334444455556677766655 56788888774322 Q ss_pred CCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe----eeEeccceEEEecCCcchhhhh Q lcl|NC_019527. 185 ADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL----GREMHASRLLTIITRPLPDMLK 260 (516) Q Consensus 185 ~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~----g~~iH~SRli~~~~~~~p~~~k 260 (516) .|.+.+|.++.|..++.... . +. -.|++. ...++++.||||.+.. T Consensus 137 ---------------~G~~~~L~~l~~~~v~i~~~--~-------~~-~~y~~~~~~~~~~~~~~eVih~r~~~------ 185 (424) T protein:vir:45 137 ---------------RGEVISLDCCMPWETTLMNT--G-------GR-YTYGLYNEYGAFAISPDDMIHIRALG------ 185 (424) T ss_pred ---------------CCcEEEEEEecCceEEEEEc--C-------Ce-EEEEEEecCceEEECcccEEEecCcC------ Confidence 24455677888777764321 1 11 234443 2468999999997532 Q ss_pred hccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHh-cCCcc-eEEEecC Q lcl|NC_019527. 261 PAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNM-QSNLG-LAVMDFD 338 (516) Q Consensus 261 ~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~-~sn~g-~~~id~~ 338 (516) .....|.|.++.+.+.|.....+......++.+....-........++.+..+.+.+.++..... .+|.| +++++. T Consensus 186 -~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~~n~g~~~vl~~- 263 (424) T protein:vir:45 186 -NNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLNKESWGWLKDQWQKASQALRRQENKTMLLPA- 263 (424) T ss_pred -CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhccccccCCceeEcCC- Confidence 13467999999999999999999999999988876643322223345444444455555544333 34555 455554 Q ss_pred CcceeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 339 SEDIVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVI 416 (516) Q Consensus 339 ~e~~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l 416 (516) +-+|+.++.+..+ +-+......++||.+.+||-.+| |..-++-.++.|.....| -+..|.|.+..|.+.| T Consensus 264 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~sn~eq~~~~f-------~~~tL~P~~~~ie~~l 335 (424) T protein:vir:45 264 DLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMI-NDLEKATFSNISAQAIQF-------VRYTMMPWVTNWEQEL 335 (424) T ss_pred CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCcccHHHHHHHH-------HHHHHHHHHHHHHHHH Confidence 5788888766544 34667788889999999998766 444334444444333333 3345778888887777 Q ss_pred HHHhCCC--cCCcceEEe--CCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc Q lcl|NC_019527. 417 QLSKWGE--IDDAITFKF--KSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP 492 (516) Q Consensus 417 ~~s~~g~--~~~d~~~~f--~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~ 492 (516) -+..+.. ...++.|+| +.|...|.+++++ ++.+++++|++|++|+|+.+ |+++++...+...+ T Consensus 336 n~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~-------~~~~~~~~g~~T~NE~R~~~------gl~pi~ggD~~~~~ 402 (424) T protein:vir:45 336 NRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQ-------FYHFAITDGWMSRNEARAFE------DMNPVEGLDEMLVS 402 (424) T ss_pred HHhcCChhhhcCCcEEEeechhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHh------CCCCCCCcceeeec Confidence 6655432 223455554 4777777777655 55668999999999999987 44444321111111 Q ss_pred cc--chhcCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 493 EM--FDDDGADPYMPDPDVLPGEEG 515 (516) Q Consensus 493 e~--~~~e~~~~~~~~~~~~~~~e~ 515 (516) -. ...++ ..+.+.+.+.++. T Consensus 403 ~n~~~~~~~---~~~~~~~~~~~~~ 424 (424) T protein:vir:45 403 VNAANPAGD---FKPPKNDEGKTNE 424 (424) T ss_pred ccccccccc---cCCCCCCCCCCCC Confidence 00 00000 0111111111111 No 73 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=99.85 E-value=1.4e-21 Score=134.94 Aligned_cols=372 Identities=12% Similarity=0.061 Sum_probs=203.9 Q ss_pred hhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhC Q lcl|NC_019527. 32 LAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATR 111 (516) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~ 111 (516) |-+. . +.. |.++. .......++..... ...+ +.. .. +-....+.++ T Consensus 1 Mg~~----~---~~~------~~~~~--------~~~~~~~~~~~~~~---~~~~---~~~-~~------~v~~~~al~~ 46 (385) T protein:vir:10 1 MGLL----T---PRN------FNKRK--------AKNMVYPSNPAFFT---TTVG---GMQ-LS------YVSALSALQN 46 (385) T ss_pred Cccc----c---chh------ccccc--------ccccccccchhhhh---hhcc---ccC-cc------ccCHHHhhcc Confidence 1111 0 000 11110 00000111111000 0000 000 00 0012335567 Q ss_pred chhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcc-cceeeEEEEEecCCCcccC Q lcl|NC_019527. 112 PEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDC-FFGRGQISINIKGADVSVP 190 (516) Q Consensus 112 ~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~r-lyG~a~i~i~i~~~~~~~P 190 (516) +.++++|+.+|+++-+..+++.-.. ...|...-..+.....|.+.+.+.+ ++|.|++++.-+ . T Consensus 47 ~~v~~~i~~ia~~ia~~p~~v~~~~----------~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~---~--- 110 (385) T protein:vir:10 47 TNVYSVINRIASDVASAHFKTENTA----------TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ---N--- 110 (385) T ss_pred HHHHHHHHHHHHHHhhCceeeeccc----------hhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC---c--- Confidence 8899999999999999988874211 1223334445556777888777776 588888886421 1 Q ss_pred cccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe------eeEeccceEEEecCCcchhhhhhccC Q lcl|NC_019527. 191 LILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL------GREMHASRLLTIITRPLPDMLKPAYN 264 (516) Q Consensus 191 l~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~------g~~iH~SRli~~~~~~~p~~~k~~~~ 264 (516) .++.++++..|.+.. |... ..|++. ...++++.||||.+...+. ... T Consensus 111 -------------~~~~p~~~~~v~~~~----~~~~------~~~~~~~~~~~~~~~~~~~eiihik~~~~~~----~~~ 163 (385) T protein:vir:10 111 -------------LEHIPNSDVQINYLP----GNMG------IVYTVLESNDRPQMVLRQDQMLHFRLMPDPQ----YRY 163 (385) T ss_pred -------------eeEeecCCceEEEEE----cCCc------eEEEEEEcCCceEEEEccccEEEeccCCCCc----ccc Confidence 124455555554321 1111 122221 2468999999997543211 123 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcC-ccHHHHHHHHHHHHHhcCCcc-eEEEecCCcce Q lcl|NC_019527. 265 FSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNG-GEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDI 342 (516) Q Consensus 265 ~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~-~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~ 342 (516) ..|.|.++.+.+.+.....+......++.+....-........+.. ...+.+.++++..... .|.+ .++++ ++.+| T Consensus 164 ~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~~-~n~~~~~vl~-~g~~~ 241 (385) T protein:vir:10 164 LIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTG-DNSGRLMVLP-DGFDY 241 (385) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCc-cccCCccccC-CCceE Confidence 4699999999999999999999999998886654322222222322 2234455555544333 3444 45555 45789 Q ss_pred eEEecccCCHH---HHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 343 VQVNTPLSGLA---DLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLS 419 (516) Q Consensus 343 e~~~~~lsgl~---d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s 419 (516) +.++.+..... +......++||.+.+||..+|-+...++.+.+.-+..+.+|. ..|.|.++.+.+.|.+. T Consensus 242 ~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~~~~-------~~l~P~~~~ie~~l~~~ 314 (385) T protein:vir:10 242 TQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYL-------ANLNSYVNPIVDELRLK 314 (385) T ss_pred EecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHHHHH-------HHHHHHHHHHHHHHHHh Confidence 98887766544 445666789999999998766553333332222223333432 13788888888888765 Q ss_pred hCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCCh-hhhccccccchhc Q lcl|NC_019527. 420 KWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDG-DLEIVQPEMFDDD 498 (516) Q Consensus 420 ~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~-~~e~~~~e~~~~e 498 (516) .++ ++++|.+.+|...|.++++ +++++++++|++|++|+|+.+.. ++++. +........ T Consensus 315 l~~---~~~~f~~~~ll~~d~~~~~-------~~~~~~~~~G~~T~NE~R~~~g~------~p~p~~~~~~~~~~~---- 374 (385) T protein:vir:10 315 MNA---PDLELDIKDMLDVDDSALI-------NQVSNLAKSGVLGAEQAQFILTR------SGFLPDNLPEFKPLT---- 374 (385) T ss_pred hCC---ceEEeechhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhCC------CccCCCCCccccCcc---- Confidence 543 4688888899999988864 55667899999999999998844 33321 100000000 Q ss_pred CCCCCCCCCCCC Q lcl|NC_019527. 499 GADPYMPDPDVL 510 (516) Q Consensus 499 ~~~~~~~~~~~~ 510 (516) ....+++.++- T Consensus 375 -~~~~~g~~~dn 385 (385) T protein:vir:10 375 -TQVKGGDEGDN 385 (385) T ss_pred -cccCCCCCCCC Confidence 00111111111 No 74 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=99.84 E-value=4.9e-21 Score=131.92 Aligned_cols=389 Identities=13% Similarity=0.060 Sum_probs=201.9 Q ss_pred cccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCC Q lcl|NC_019527. 50 ATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREG 129 (516) Q Consensus 50 ~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~ 129 (516) -..|.+. ... .+..|-.... ..++... . .|--..++ +++-+.++|+.+|+++-+.. T Consensus 1 m~~f~~~---------~~~-~~~~~~~~~~-------~~~~~~~---~---~~~~~~Al-~~~~V~~~i~~Ia~~iA~lp 56 (406) T protein:vir:97 1 MSFFQPL---------GTS-KVSYDDYISS-------VLAGDVS---Q---KYLGVSAL-KNSDILTATSIIAGDIARFP 56 (406) T ss_pred Ccccccc---------CCC-CCCcchHHHH-------HhcCCCC---c---ccccchhh-ccHHHHHHHHHHHHhhhhCe Confidence 2222111 000 0000000000 0000000 0 01111122 34556779999999999988 Q ss_pred CeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHh-cccceeeEEEEEecCCCcccCcccccccccccceeeEEe Q lcl|NC_019527. 130 IEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEH-DCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSN 208 (516) Q Consensus 130 ~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~-~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v 208 (516) +.+...+.+... .......|+..-....-+..|.+.+.+ -.++|.|++++..++. .|.+..|.+ T Consensus 57 ~~~~~~~g~~~~-~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~--------------~g~~~~L~~ 121 (406) T protein:vir:97 57 LVKKDVNGDIIH-DEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPK--------------TNQALQFQF 121 (406) T ss_pred eEEEecCccccc-cchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCC--------------CCeEEEEEE Confidence 776544332211 111222233333344445556655544 4668889988754321 134456888 Q ss_pred ecceeeccccccccccccccccCcceeEEe----e--eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHH Q lcl|NC_019527. 209 IEPMWTSPSAYNALDPTAPDFYKPSTWWVL----G--REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWL 282 (516) Q Consensus 209 ~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~----g--~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~ 282 (516) ++|.+|++... + .+.+ .|++. + ..+.++.||||.+.. ...+.|+|.++.+.+.|.... T Consensus 122 i~p~~v~v~~~---~-----~~~~-~y~~~~~~~~~~~~~~~~evih~r~~~-------~dg~~G~spi~~~~~~i~~~~ 185 (406) T protein:vir:97 122 YRPSETTVEET---D-----NHEI-VYTFTDMLTAKQVKCFAHDVIHWKFFS-------HDTILGRSPLLSLGDEIDLQT 185 (406) T ss_pred ECCCeeEEEEc---C-----CceE-EEEEEecCCceEEEEccccEEEecCCC-------CCCcccccHHHHHHHHHHHHH Confidence 88888775321 1 1222 23332 2 357889999997532 223559999999999999988 Q ss_pred HHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEecccCCHH--HHHHHH Q lcl|NC_019527. 283 RTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVNTPLSGLA--DLQSQS 359 (516) Q Consensus 283 ~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~~~lsgl~--d~~~~~ 359 (516) .+......++.+....-+-......++....+++.++++..... +|.| ..+++. +.+|++++.+..... +...+. T Consensus 186 a~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g-~n~g~~~vl~~-g~~~~~l~~~~~d~q~le~~~~~ 263 (406) T protein:vir:97 186 GGINTLIKFFKDGFSSGILTMKGAQLSGDARQRARQEFEKMREG-SVGGSPLVFDS-TMEYTPLEIDTNVLQLITSNNFS 263 (406) T ss_pred HHHHHHHHHHhccCCCceEEecCCCCCHHHHHHHHHHHHHHhcc-cccCceeecCC-CceEEEccCCHHHHHHHHHHHhh Confidence 89998888887754432211112334444445566666555444 3444 455554 578988876544322 445556 Q ss_pred HHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-CCcceEEeCCCCCC Q lcl|NC_019527. 360 QEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI-DDAITFKFKSLWQT 438 (516) Q Consensus 360 ~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~-~~d~~~~f~pL~~~ 438 (516) ...||.+.+||-.+|.+.+ -+++.+.....||. ..|.|.+..|-+.+-+..+..- ...+.|+|+ +. . T Consensus 264 ~~~Ia~afgVPp~~lg~~~---~~~~~e~~~~~f~~-------~~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd-~~-~ 331 (406) T protein:vir:97 264 TAQIAKALRVPSYKLGVNS---PNQSVAQLMEDYVT-------NDLPFYFDAITSELGLKTLNDKDRRLYHIEFD-TR-S 331 (406) T ss_pred HHHHHHHhCCCHHHcCCCC---CcchHHHHHHHHHH-------HHHHHHHHHHHHHHhhhhcChhhccceeEEEe-cC-c Confidence 7889999999998774422 12333434444433 4478888888777765444221 134667775 11 1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccC------CCCCChhhhccccccchhcCCCCCCCCCCCCCC Q lcl|NC_019527. 439 SAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSG------WDNIDGDLEIVQPEMFDDDGADPYMPDPDVLPG 512 (516) Q Consensus 439 sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~------~~~~d~~~e~~~~e~~~~e~~~~~~~~~~~~~~ 512 (516) ..+..++++.+++++|++|++|+|+.++..+..+ +-+.........++..+.......+++.+.+.+ T Consensus 332 -------~~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~gg~~~~~~~ 404 (406) T protein:vir:97 332 -------VTGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDRYQSSLNYVFLDKKEEYQDKVGIKGKGGEVNAEED 404 (406) T ss_pred -------cchhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeEeeccCccchhcccccccccccccCCCCCCCCCC Confidence 1233456677889999999999999985543211 000000000000011111111222333333333 Q ss_pred CC Q lcl|NC_019527. 513 EE 514 (516) Q Consensus 513 ~e 514 (516) .+ T Consensus 405 ~~ 406 (406) T protein:vir:97 405 KS 406 (406) T ss_pred CC Confidence 33 No 75 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=99.84 E-value=1.1e-20 Score=129.99 Aligned_cols=406 Identities=14% Similarity=0.122 Sum_probs=206.8 Q ss_pred ccccCC----CCCCCccCCCccchhcccccccch-hhhcccccCC-------cccccccCcccHHHHHHHHhCchhhhhh Q lcl|NC_019527. 51 TKWAPP----QLMPGVVPAGTTPAVAMDSLCGPT-YQFLNSAAGG-------LYAADIQPFPGYQNLAALATRPEYRAFA 118 (516) Q Consensus 51 ~~~~~~----~~~~gv~~~~~~~~~a~ds~~~~~-~~~~~~~~~~-------~~~~~~~~f~gy~ll~~y~~~~i~r~iV 118 (516) -.|+.| .-..|..- + +.-|++.+.+.-. .+.......+ .......+..+|--..++ +++-+.++| T Consensus 1 ~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al-~~~~V~acv 77 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQ-S-RKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAI-RHSDIFTAV 77 (441) T ss_pred CceecCccceeccccccc-h-hhhhhccccccccccccccCCCcchHHHHHHhhcccccCccccchhhhh-ccHHHHHHH Confidence 122222 10111100 0 0011111211000 0000000000 000000001112222223 345567799 Q ss_pred hhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCccccccc Q lcl|NC_019527. 119 STLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRT 197 (516) Q Consensus 119 d~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~ 197 (516) +.+|+++-+..+.+.-.+.... .......|..+-....-...|.+++.+. .++|.|++++.-++ T Consensus 78 ~~Ia~~iA~lpl~~~~~~~~~~--~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~------------- 142 (441) T protein:vir:98 78 MMIASDLARMPIRVTVNGQINY--SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDK------------- 142 (441) T ss_pred HHHHHhhccCceEEecCCcccc--cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcC------------- Confidence 9999999998887754332221 1122233444444444555666666555 66799888875432 Q ss_pred ccccceeeEEeecceeeccccccccccccccccCcceeE--Ee------eeEeccceEEEecCCcchhhhhhccCCCCch Q lcl|NC_019527. 198 IKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWW--VL------GREMHASRLLTIITRPLPDMLKPAYNFSGIS 269 (516) Q Consensus 198 I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~--v~------g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S 269 (516) .|...+|.+++|.+|++..- . -|.+.++. +. .+.+.++.||||.... ...+.|.| T Consensus 143 --~G~~~~L~~i~~~~v~v~~~----~----~g~~~~~~~~~~~~~~~~~~~~~~~dviHir~~~-------~dg~~G~s 205 (441) T protein:vir:98 143 --TGEPMNLTFRKTSEIELKLD----A----RGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYS-------LDGINGLS 205 (441) T ss_pred --CCcEEEEEEEcCceeEEEEC----C----CCcEEEEEEEeccCcceeeEEEccccEEEeccCC-------CCCccccC Confidence 23456788888888875321 1 12332222 11 1457899999997542 23457999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCcc-HHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEE Q lcl|NC_019527. 270 MSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGE-GGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQV 345 (516) Q Consensus 270 ~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~-~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~ 345 (516) .++.+.+.|.....+......++.+.... +++++ ..+..+. .+.+.++++.......|.| +++++ ++.+|+.+ T Consensus 206 pi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~--~~~~~~e~~~~~~~~~~~~~~G~~nag~~~vl~-~g~~~~~l 282 (441) T protein:vir:98 206 LLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK--GVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD-ESMTFDQL 282 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC--CCCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCceEEEc Confidence 99999999999999999999998886653 34443 3333222 1234444444444434544 45555 45788888 Q ss_pred ecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019527. 346 NTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE 423 (516) Q Consensus 346 ~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~ 423 (516) +.+... +-+......++||.+.+||..+| |...++.+ -+ +...+| + ..|.|.+..+-..|-+..+.. T Consensus 283 ~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~l-g~~~~~~s--~~-q~~~~y--~-----~tl~P~~~~ie~~ln~~L~~~ 351 (441) T protein:vir:98 283 EVDTEVLKLIRENKSSTREIAGVFGIPLHKF-GIETANMS--IT-DANLDY--L-----STLKPYITCVCAELNFKFNDE 351 (441) T ss_pred cCChhHHHHHHHHHHhHHHHHHHhCCCHHHc-CCCCCCcc--HH-HHHHHH--H-----HHHHHHHHHHHHHHHhhcccc Confidence 766443 34556677789999999998766 65544332 12 222222 1 247788888877776544321 Q ss_pred cCCc--ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh----hcccc-ccch Q lcl|NC_019527. 424 IDDA--ITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL----EIVQP-EMFD 496 (516) Q Consensus 424 ~~~d--~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~----e~~~~-e~~~ 496 (516) ..+ |+|..+.|...|.++++ ++++.++++|++|++|+|+.+...+..+- |.+. ....+ +..+ T Consensus 352 -~~~~~~~fd~~~llr~d~~~~~-------~~~~~~~~~G~~T~NE~R~~~gl~pi~gG---d~~~~~~~~n~~~~~~~~ 420 (441) T protein:vir:98 352 -YVNREFKFDTTEIRVVDEKTQA-------EIDKINIDSGKMNIDEIRQRDGLAPIPGG---NGSIHRVDLNHVNIELVD 420 (441) T ss_pred -ccCceEEEechhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhCCCCCCCC---CcceEeeccccccccccc Confidence 223 44555577777777764 45677899999999999999854432110 0000 00000 0000 Q ss_pred hcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 497 DDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 497 ~e~~~~~~~~~~~~~~~e~t 516 (516) +.....+...+...+|+|.- T Consensus 421 ~~q~~~~~~~~~~~kgGe~n 440 (441) T protein:vir:98 421 EYQMNKSRATDKKLKGGEEN 440 (441) T ss_pred ccccccccccccccCCCCCC Confidence 00000000011112222222 No 76 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=99.83 E-value=1.2e-20 Score=129.82 Aligned_cols=377 Identities=8% Similarity=0.019 Sum_probs=192.2 Q ss_pred cccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHH Q lcl|NC_019527. 74 DSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEA 153 (516) Q Consensus 74 ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~ 153 (516) =+++...+...... ..+.+...+.+ -....|.+++.++++|+.+++++.+..+.+....... .......|... T Consensus 1 Mg~f~~lf~~~~~~---~~~~~~~~~~~-v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~---~~~~~~ll~~~ 73 (395) T protein:vir:95 1 MSILEKIFKTRKDI---TYMLDLDMIED-LSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQ---KNDVYYKLNIK 73 (395) T ss_pred CchhhhhhccCccc---cccccchhccc-cchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccc---cchHHHHHHhc Confidence 02222221111100 00111111111 1224567889999999999999999988875443222 22233445555 Q ss_pred HHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcc Q lcl|NC_019527. 154 CEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPS 233 (516) Q Consensus 154 ~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~ 233 (516) -..+..+..|.+++....++||.++++..++... .+++++.+.+... .+....+.... T Consensus 74 PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~-------------------~~~~~~~~~~~~~---~~~~~~~~~~~ 131 (395) T protein:vir:95 74 PNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKEL-------------------LIADSFYREEYAL---YDDIFKDVTVK 131 (395) T ss_pred cCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCe-------------------EecCCccceeEee---cCcceeEEEEc Confidence 5666778888888888888777776654433221 1111111111100 00000000000 Q ss_pred eeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce-eeecchhhhcCcc Q lcl|NC_019527. 234 TWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF-LKTNMAQVLNGGE 312 (516) Q Consensus 234 ~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v-~k~~~~~~l~~~~ 312 (516) .|.+ .+.+.++.|||+.... +....+|.|.++.+...+...... +........ +++. ...++... T Consensus 132 ~~~~-~~~~~~~evih~~~~~------~~~~~~G~spi~~~~~~~~~~~~~------~~~~~~~~gii~~~-~~~~~~e~ 197 (395) T protein:vir:95 132 DYTY-QRTFTMQEVIYLKYNN------NKVTHFVESLFEDYGKIFGRMIGA------QLKNYQIRGILKSA-SSAYDEKN 197 (395) T ss_pred Ccee-eeeeccccEEEEccCC------CCcccccchHHHHHHHHHHHHHHH------HHhcCCCceEEEeC-CCCCCHHH Confidence 1111 2467889999997532 233457999998877666543321 222333332 2332 12222222 Q ss_pred HHHHHHHHHHHHHhcCCcce-EEEecCCcceeEEecccCC-------HHHHHHHHHHHHHhhhcCCceeeeccccccccc Q lcl|NC_019527. 313 GGDVFDRVEMYVNMQSNLGL-AVMDFDSEDIVQVNTPLSG-------LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNA 384 (516) Q Consensus 313 ~~~l~~r~~~~~~~~sn~g~-~~id~~~e~~e~~~~~lsg-------l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glna 384 (516) .+++.+.++.......+.+. ++...++.+++.++.+..+ +-+......++||.+.+||-.+|-| -.+ T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~-----~~s 272 (395) T protein:vir:95 198 IEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG-----ETA 272 (395) T ss_pred HHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC-----ccc Confidence 23344444333333223333 3322345788888766543 3344456678899999999887732 234 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc--CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019527. 385 SSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI--DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSV 462 (516) Q Consensus 385 tge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~--~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gv 462 (516) +-+.....||. ..|.|.+..|-..+-+..++.. -..++|.++.|...|.+++++ ++..++++|+ T Consensus 273 n~e~~~~~~~~-------~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~-------~~~~~~~~G~ 338 (395) T protein:vir:95 273 DLEKNTLVFEK-------FCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAE-------AIDKLVSSGS 338 (395) T ss_pred CHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHH-------HHHHHHhCCC Confidence 44555666654 3478988888887777655432 235678888898888887655 4566899999 Q ss_pred CCHHHHHHHHHhhhcc-CCC-----CCCh-hhhccccccchhcCCCCCCCCCCCCCC Q lcl|NC_019527. 463 IDPSEARQQLSDDPDS-GWD-----NIDG-DLEIVQPEMFDDDGADPYMPDPDVLPG 512 (516) Q Consensus 463 i~~~e~r~~l~~~~~~-~~~-----~~d~-~~e~~~~e~~~~e~~~~~~~~~~~~~~ 512 (516) +|++|+|+.++..+.. ++. +.+. ..+..+.......+..+.++++++..+ T Consensus 339 lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:95 339 FTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred cCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccccccCCCCCCCCCC Confidence 9999999998543321 100 0000 000000111111111222222222211 No 77 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=99.83 E-value=1.2e-20 Score=129.82 Aligned_cols=377 Identities=8% Similarity=0.019 Sum_probs=192.2 Q ss_pred cccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHH Q lcl|NC_019527. 74 DSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEA 153 (516) Q Consensus 74 ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~ 153 (516) =+++...+...... ..+.+...+.+ -....|.+++.++++|+.+++++.+..+.+....... .......|... T Consensus 1 Mg~f~~lf~~~~~~---~~~~~~~~~~~-v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~---~~~~~~ll~~~ 73 (395) T protein:vir:10 1 MSILEKIFKTRKDI---TYMLDLDMIED-LSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQ---KNDVYYKLNIK 73 (395) T ss_pred CchhhhhhccCccc---cccccchhccc-cchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccc---cchHHHHHHhc Confidence 02222221111100 00111111111 1224567889999999999999999988875443222 22233445555 Q ss_pred HHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcc Q lcl|NC_019527. 154 CEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPS 233 (516) Q Consensus 154 ~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~ 233 (516) -..+..+..|.+++....++||.++++..++... .+++++.+.+... .+....+.... T Consensus 74 PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~-------------------~~~~~~~~~~~~~---~~~~~~~~~~~ 131 (395) T protein:vir:10 74 PNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKEL-------------------LIADSFYREEYAL---YDDIFKDVTVK 131 (395) T ss_pred cCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCe-------------------EecCCccceeEee---cCcceeEEEEc Confidence 5666778888888888888777776654433221 1111111111100 00000000000 Q ss_pred eeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce-eeecchhhhcCcc Q lcl|NC_019527. 234 TWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF-LKTNMAQVLNGGE 312 (516) Q Consensus 234 ~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v-~k~~~~~~l~~~~ 312 (516) .|.+ .+.+.++.|||+.... +....+|.|.++.+...+...... +........ +++. ...++... T Consensus 132 ~~~~-~~~~~~~evih~~~~~------~~~~~~G~spi~~~~~~~~~~~~~------~~~~~~~~gii~~~-~~~~~~e~ 197 (395) T protein:vir:10 132 DYTY-QRTFTMQEVIYLKYNN------NKVTHFVESLFEDYGKIFGRMIGA------QLKNYQIRGILKSA-SSAYDEKN 197 (395) T ss_pred Ccee-eeeeccccEEEEccCC------CCcccccchHHHHHHHHHHHHHHH------HHhcCCCceEEEeC-CCCCCHHH Confidence 1111 2467889999997532 233457999998877666543321 222333332 2332 12222222 Q ss_pred HHHHHHHHHHHHHhcCCcce-EEEecCCcceeEEecccCC-------HHHHHHHHHHHHHhhhcCCceeeeccccccccc Q lcl|NC_019527. 313 GGDVFDRVEMYVNMQSNLGL-AVMDFDSEDIVQVNTPLSG-------LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNA 384 (516) Q Consensus 313 ~~~l~~r~~~~~~~~sn~g~-~~id~~~e~~e~~~~~lsg-------l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glna 384 (516) .+++.+.++.......+.+. ++...++.+++.++.+..+ +-+......++||.+.+||-.+|-| -.+ T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~-----~~s 272 (395) T protein:vir:10 198 IEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG-----ETA 272 (395) T ss_pred HHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC-----ccc Confidence 23344444333333223333 3322345788888766543 3344456678899999999887732 234 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc--CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019527. 385 SSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI--DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSV 462 (516) Q Consensus 385 tge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~--~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gv 462 (516) +-+.....||. ..|.|.+..|-..+-+..++.. -..++|.++.|...|.+++++ ++..++++|+ T Consensus 273 n~e~~~~~~~~-------~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~-------~~~~~~~~G~ 338 (395) T protein:vir:10 273 DLEKNTLVFEK-------FCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAE-------AIDKLVSSGS 338 (395) T ss_pred CHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHH-------HHHHHHhCCC Confidence 44555666654 3478988888887777655432 235678888898888887655 4566899999 Q ss_pred CCHHHHHHHHHhhhcc-CCC-----CCCh-hhhccccccchhcCCCCCCCCCCCCCC Q lcl|NC_019527. 463 IDPSEARQQLSDDPDS-GWD-----NIDG-DLEIVQPEMFDDDGADPYMPDPDVLPG 512 (516) Q Consensus 463 i~~~e~r~~l~~~~~~-~~~-----~~d~-~~e~~~~e~~~~e~~~~~~~~~~~~~~ 512 (516) +|++|+|+.++..+.. ++. +.+. ..+..+.......+..+.++++++..+ T Consensus 339 lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 339 FTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred cCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccccccCCCCCCCCCC Confidence 9999999998543321 100 0000 000000111111111222222222211 No 78 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=99.83 E-value=1.2e-20 Score=129.82 Aligned_cols=377 Identities=8% Similarity=0.019 Sum_probs=192.2 Q ss_pred cccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHH Q lcl|NC_019527. 74 DSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEA 153 (516) Q Consensus 74 ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~ 153 (516) =+++...+...... ..+.+...+.+ -....|.+++.++++|+.+++++.+..+.+....... .......|... T Consensus 1 Mg~f~~lf~~~~~~---~~~~~~~~~~~-v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~---~~~~~~ll~~~ 73 (395) T protein:vir:10 1 MSILEKIFKTRKDI---TYMLDLDMIED-LSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQ---KNDVYYKLNIK 73 (395) T ss_pred CchhhhhhccCccc---cccccchhccc-cchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccc---cchHHHHHHhc Confidence 02222221111100 00111111111 1224567889999999999999999988875443222 22233445555 Q ss_pred HHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcc Q lcl|NC_019527. 154 CEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPS 233 (516) Q Consensus 154 ~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~ 233 (516) -..+..+..|.+++....++||.++++..++... .+++++.+.+... .+....+.... T Consensus 74 PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~-------------------~~~~~~~~~~~~~---~~~~~~~~~~~ 131 (395) T protein:vir:10 74 PNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKEL-------------------LIADSFYREEYAL---YDDIFKDVTVK 131 (395) T ss_pred cCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCe-------------------EecCCccceeEee---cCcceeEEEEc Confidence 5666778888888888888777776654433221 1111111111100 00000000000 Q ss_pred eeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce-eeecchhhhcCcc Q lcl|NC_019527. 234 TWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF-LKTNMAQVLNGGE 312 (516) Q Consensus 234 ~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v-~k~~~~~~l~~~~ 312 (516) .|.+ .+.+.++.|||+.... +....+|.|.++.+...+...... +........ +++. ...++... T Consensus 132 ~~~~-~~~~~~~evih~~~~~------~~~~~~G~spi~~~~~~~~~~~~~------~~~~~~~~gii~~~-~~~~~~e~ 197 (395) T protein:vir:10 132 DYTY-QRTFTMQEVIYLKYNN------NKVTHFVESLFEDYGKIFGRMIGA------QLKNYQIRGILKSA-SSAYDEKN 197 (395) T ss_pred Ccee-eeeeccccEEEEccCC------CCcccccchHHHHHHHHHHHHHHH------HHhcCCCceEEEeC-CCCCCHHH Confidence 1111 2467889999997532 233457999998877666543321 222333332 2332 12222222 Q ss_pred HHHHHHHHHHHHHhcCCcce-EEEecCCcceeEEecccCC-------HHHHHHHHHHHHHhhhcCCceeeeccccccccc Q lcl|NC_019527. 313 GGDVFDRVEMYVNMQSNLGL-AVMDFDSEDIVQVNTPLSG-------LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNA 384 (516) Q Consensus 313 ~~~l~~r~~~~~~~~sn~g~-~~id~~~e~~e~~~~~lsg-------l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glna 384 (516) .+++.+.++.......+.+. ++...++.+++.++.+..+ +-+......++||.+.+||-.+|-| -.+ T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~-----~~s 272 (395) T protein:vir:10 198 IEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG-----ETA 272 (395) T ss_pred HHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC-----ccc Confidence 23344444333333223333 3322345788888766543 3344456678899999999887732 234 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc--CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_019527. 385 SSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI--DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSV 462 (516) Q Consensus 385 tge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~--~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gv 462 (516) +-+.....||. ..|.|.+..|-..+-+..++.. -..++|.++.|...|.+++++ ++..++++|+ T Consensus 273 n~e~~~~~~~~-------~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~-------~~~~~~~~G~ 338 (395) T protein:vir:10 273 DLEKNTLVFEK-------FCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAE-------AIDKLVSSGS 338 (395) T ss_pred CHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHH-------HHHHHHhCCC Confidence 44555666654 3478988888887777655432 235678888898888887655 4566899999 Q ss_pred CCHHHHHHHHHhhhcc-CCC-----CCCh-hhhccccccchhcCCCCCCCCCCCCCC Q lcl|NC_019527. 463 IDPSEARQQLSDDPDS-GWD-----NIDG-DLEIVQPEMFDDDGADPYMPDPDVLPG 512 (516) Q Consensus 463 i~~~e~r~~l~~~~~~-~~~-----~~d~-~~e~~~~e~~~~e~~~~~~~~~~~~~~ 512 (516) +|++|+|+.++..+.. ++. +.+. ..+..+.......+..+.++++++..+ T Consensus 339 lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 339 FTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred cCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccccccCCCCCCCCCC Confidence 9999999998543321 100 0000 000000111111111222222222211 No 79 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=99.83 E-value=1.5e-20 Score=129.29 Aligned_cols=378 Identities=14% Similarity=0.139 Sum_probs=200.5 Q ss_pred CccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhh Q lcl|NC_019527. 48 DAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTR 127 (516) Q Consensus 48 ~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r 127 (516) =++..|--+.+-|| .+............+ +..+..-..|..++.+.++|+.+|+.+.+ T Consensus 1 mg~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~---~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~ 58 (403) T protein:vir:10 1 MGFKSWITEKLNPG-------------------QRIIRDMEPVSHRTN---RKPFTTGQAYSKIEILNRTANMVIDSAAE 58 (403) T ss_pred Ccchhhhhhccchh-------------------hhhhhcccccccccC---CcccccHHHHHHHHHHHHHHHHHHHHHhh Confidence 00000100000000 000000000000011 12222336677889999999999999999 Q ss_pred CCCeeeeccccchh----hhHHHHHHHHHHHHhcChhHHHHHHHHhcc-cceeeEEEEEecCCCcccCcccccccccccc Q lcl|NC_019527. 128 EGIEITSKDRTKAK----EMASKIKELEEACEYYGVMGIIQKAAEHDC-FFGRGQISINIKGADVSVPLILDPRTIKKGS 202 (516) Q Consensus 128 ~~~~i~~~~~~~~~----~~~~~i~~i~~~~~~l~~~~~l~ea~~~~r-lyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~ 202 (516) ..+.+......... .....-..|...-....-...|.+.+...+ ++|.|++++. +.. T Consensus 59 ~p~~v~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~--~~~---------------- 120 (403) T protein:vir:10 59 CSYTVGDKYNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWD--GTS---------------- 120 (403) T ss_pred CceeEeecccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEe--Cce---------------- Confidence 99988532211110 011111223333333445667777777665 5677776652 221 Q ss_pred eeeEEeecceeeccccccccccccccccCcceeEE--eeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHH Q lcl|NC_019527. 203 LTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV--LGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVEN 280 (516) Q Consensus 203 l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v--~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~ 280 (516) +..+.+..+..... .+ + ..++.+ .+..+.+++++||....+-. -+.....|.|.++.+.+.+.. T Consensus 121 ---l~~l~~~~~~v~~~--~~------~-~~~~~~~~~~~~~~~~eiih~~~~~~~~--~~~~~~~G~s~i~~~~~~i~~ 186 (403) T protein:vir:10 121 ---LYHVPAALMQVEAD--AN------K-FIKKFIFNNQINYRVDEIIFIKDNSYVC--GTNSQISGQSRVATVIDSLEK 186 (403) T ss_pred ---eEeecCcceEEEEc--CC------c-eEEEEEecCceeecccceEEeccccccc--CCCCCcccccHHHHHHHHHHH Confidence 11122221111000 00 0 011112 23446678899987543211 133556799999999999999 Q ss_pred HHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccC--C--HHH Q lcl|NC_019527. 281 WLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLS--G--LAD 354 (516) Q Consensus 281 ~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~ls--g--l~d 354 (516) ...+......++.+.... +++++ ..++.+.-+++.++++.......|.|..++..++-+|+.++.+.+ + +.+ T Consensus 187 ~~~~~~~~~~~f~ng~~~~gil~~~--~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e 264 (403) T protein:vir:10 187 RSKMLNFKEKFLDNGTVIGLILETD--EILNKKLRERKQEELQLDYNPSTGQSSVLILDGGMKAKPYSQISSFKDLDFKE 264 (403) T ss_pred HHHHHHHHHHHHhccCCcceEEEeC--CCCCHHHHHHHHHHHHHHhCCcccCcceeecCCCceeEEecccCCHHHHHHHH Confidence 999999998888776554 44443 445544444555555554444566665444344578888874433 3 345 Q ss_pred HHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCC Q lcl|NC_019527. 355 LQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKS 434 (516) Q Consensus 355 ~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~p 434 (516) ...+....||.+.+||..+| |. |-+++-+.....|+. ..|.|.+..|-+.+-+. +| ..+.|+++. T Consensus 265 ~~~~~~~~Ia~~fgVPp~~l-g~---~~~sn~e~~~~~f~~-------~tl~P~~~~ie~~l~~~-L~---~~~~~d~~~ 329 (403) T protein:vir:10 265 DIEGFNKSICLAFGVPQVLL-DG---GNNANIRPNIELFYY-------MTIIPMLNKLTSSLTFF-FG---YKITPNTKE 329 (403) T ss_pred HHHHHHHHHHHHhCCCHHHc-CC---CCCcCHHHHHHHHHH-------HHHHHHHHHHHHHHHHh-cC---ceeeeccch Confidence 56677789999999998655 63 334444555555553 44788888888777653 22 356666765 Q ss_pred C--CCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh--hcccc--------ccchhcCCCC Q lcl|NC_019527. 435 L--WQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL--EIVQP--------EMFDDDGADP 502 (516) Q Consensus 435 L--~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~--e~~~~--------e~~~~e~~~~ 502 (516) + ...+.+. +++++..++++|++|++|+|+.+ |++++++.. +..-+ .....++.+| T Consensus 330 ~~~l~~D~~~-------~~~~~~~~~~~G~lT~NE~R~~~------gl~pi~~~~~d~~~~p~n~~~~~~~~~~~e~~~~ 396 (403) T protein:vir:10 330 VAALTPDKEA-------EAKHLTSLVNNGIITGNEARSEL------NLEPLDDEQMNKIRIPANVAGSATGVSGQEGGRP 396 (403) T ss_pred hhhcccCHHH-------HHHHHHHHHhCCCcCHHHHHHHh------CCCCCCcccccccccccccccccccCCCCcCCCC Confidence 5 4445443 46677889999999999999998 445543111 01100 0011111112 Q ss_pred CCCCCCCCCCC Q lcl|NC_019527. 503 YMPDPDVLPGE 513 (516) Q Consensus 503 ~~~~~~~~~~~ 513 (516) ++. .+|+ T Consensus 397 ~~~----~~g~ 403 (403) T protein:vir:10 397 KGS----TEGD 403 (403) T ss_pred CCC----cCCC Confidence 121 2222 No 80 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=99.82 E-value=3.8e-20 Score=127.08 Aligned_cols=381 Identities=10% Similarity=0.080 Sum_probs=197.8 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHH-HHHHHHhCchhhhhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQ-NLAALATRPEYRAFAST 120 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~-ll~~y~~~~i~r~iVd~ 120 (516) |.-.. .|.+- ..+.+.+ +..+ .| ..++. .+..|-. ......+++.++++|+. T Consensus 1 MGl~~-----~~~~~-~~~~~~~---~~~~-~~------------~~~~~-----~~~~~~~vt~~~al~~~~v~~~i~~ 53 (394) T protein:vir:62 1 MGLRD-----RFSNY-LFKKAEK---RGYL-DN------------VLGKS-----IRYSGVYVTDSNILQSSDVYELLQD 53 (394) T ss_pred Cchhh-----hhhhh-ccCCCCc---hhhh-hh------------hhhcc-----cccCccccChhhhhccHHHHHHHHH Confidence 11000 00000 0000000 0000 00 00000 0001100 01223466889999999 Q ss_pred hhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCccccccccc Q lcl|NC_019527. 121 LSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIK 199 (516) Q Consensus 121 ~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~ 199 (516) +++++-+..+.+...+.... .....-.|...-........|.+.+.+. .++|.+++++. +.....+ T Consensus 54 Ia~~iA~lp~~v~~~~g~~~--~~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~--~~~~~~~--------- 120 (394) T protein:vir:62 54 ISNQMVLADIVVEDEFGNEI--KDDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILN--GAQIHLA--------- 120 (394) T ss_pred HHHhhcccceEEEcCCCccc--chhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEe--cceeecc--------- Confidence 99999999998875433221 1112223333333344455666666655 56788888763 2211100 Q ss_pred ccceeeEEeecceeeccccccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHH Q lcl|NC_019527. 200 KGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVE 279 (516) Q Consensus 200 ~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~ 279 (516) ..+. |.. |. +.+ .+|.+.++++.++.|+|+.+.. ...+.|.|.++.+.+.|. T Consensus 121 ----~~~~--------~~~----~~---~~~--~~~~~~~~~~~~~eiih~r~~~-------~d~~~G~s~~~~~~~~i~ 172 (394) T protein:vir:62 121 ----SNVF--------TEL----DD---NLV--EHFNIGGHEIPPCMIRHVKNIG-------ADHLRGKGILDLGRDTLE 172 (394) T ss_pred ----ccce--------EEE----CC---ceE--EEEeeCCEEechhheEEecCcC-------CCCccccChHHHHHHHHH Confidence 0011 100 00 111 2456678899999999997543 234679999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeecchhhhcCccH--HHHHHHHHHHHHhcCCcce-EEEecCCcceeEEecc--cCC--H Q lcl|NC_019527. 280 NWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEG--GDVFDRVEMYVNMQSNLGL-AVMDFDSEDIVQVNTP--LSG--L 352 (516) Q Consensus 280 ~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~--~~l~~r~~~~~~~~sn~g~-~~id~~~e~~e~~~~~--lsg--l 352 (516) ....+......++.+....-........++..+. +++.++++......+|.|. +++.. +.+++....+ ... + T Consensus 173 ~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~-g~~~~~~~l~~~~~d~q~ 251 (394) T protein:vir:62 173 GVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLESIDEARSVKMIPL-GKGYSIDTLKSPLDDEKT 251 (394) T ss_pred HHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhccccccCceeEeeC-CCceeEEecCCCcchHHH Confidence 9999999999999987655332222223332221 2344444433333355554 45544 4656644433 332 3 Q ss_pred HHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcC-CcceEE Q lcl|NC_019527. 353 ADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEID-DAITFK 431 (516) Q Consensus 353 ~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~-~d~~~~ 431 (516) -+.......+||.+.+||-.+|.+ . -++..+...+.||. ..|.|.+..|-..|-+..++.-. ..+.|+ T Consensus 252 ~e~~~~~~~~Ia~~fgVPp~~lg~-~---~~sn~e~~~~~~~~-------~~l~P~~~~ie~~l~~kll~~~~~~~~~~~ 320 (394) T protein:vir:62 252 LAYLNVYKKDLGKFLGINVDTYTE-L---IKEDIEKAMMYIHN-------KAVRPIMKNFEDHLSLLFYAQNSGKRIKFK 320 (394) T ss_pred HHHHHHHHHHHHHHhCCCHHHcCC-C---CCcCHHHHHHHHHH-------HHHHHHHHHHHHHHhhhhcCccccCceEEE Confidence 445567778999999999877743 2 12333444444443 34889988888877766554322 358899 Q ss_pred eCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccC--CCCCChhhhccccc-cchhcCCCCCCCCCC Q lcl|NC_019527. 432 FKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSG--WDNIDGDLEIVQPE-MFDDDGADPYMPDPD 508 (516) Q Consensus 432 f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~--~~~~d~~~e~~~~e-~~~~e~~~~~~~~~~ 508 (516) |+.+.-++..++ ++++.+++++|++|++|+|+.+...+..+ ...+-.. ....+. ..+.......+++ T Consensus 321 fd~~~~~~~~~~-------~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~gd~~~~~-~n~~~~~~~~~~~~~~kgge-- 390 (394) T protein:vir:62 321 INILDFVTYSNK-------TNIGYNLVRTAITSPDNVADMLGFPKQNTKESQAIYIS-NDVTEIGKKEATDGSLGGGE-- 390 (394) T ss_pred echhhhcCHHHH-------HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeecc-cccccccccccccccCCCCC-- Confidence 988877776554 34667899999999999999985443210 0000000 000000 0000111111111 Q ss_pred CCCCCC Q lcl|NC_019527. 509 VLPGEE 514 (516) Q Consensus 509 ~~~~~e 514 (516) .+++ T Consensus 391 --~~en 394 (394) T protein:vir:62 391 --ENEN 394 (394) T ss_pred --CCCC Confidence 1111 No 81 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=99.82 E-value=1.1e-19 Score=124.61 Aligned_cols=411 Identities=9% Similarity=0.005 Sum_probs=209.6 Q ss_pred HHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHH-HHHHHHhCchhhh Q lcl|NC_019527. 38 VMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQ-NLAALATRPEYRA 116 (516) Q Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~-ll~~y~~~~i~r~ 116 (516) +++-..+...+ .+-..+++ .. .+.+.....+ ..++..|-. ....+.+++.+.+ T Consensus 1 ~~~~~~~~~~~--------------------~~~~~~~~-~~--~~~~~~g~~~---~~~~~~~~~~~~~~a~~~~~v~~ 54 (460) T protein:vir:10 1 MANRIIRALRE--------------------LTGLDNKF-ND--AFIKYIGQTF---TKYDNNGKTYLEQGYNINPDVYS 54 (460) T ss_pred CchhHHHHHhh--------------------hhccCCCc-hH--HHHHhhcccc---CCCccchhhhhHHHHhcchHHHH Confidence 11111110000 00000010 00 1111000000 001122222 2344778889999 Q ss_pred hhhhhhHHHhhCCCeeeeccccchhhhH-----------------------------HHHHHHHHHHHhcChhHHHHHHH Q lcl|NC_019527. 117 FASTLSTELTREGIEITSKDRTKAKEMA-----------------------------SKIKELEEACEYYGVMGIIQKAA 167 (516) Q Consensus 117 iVd~~aed~~r~~~~i~~~~~~~~~~~~-----------------------------~~i~~i~~~~~~l~~~~~l~ea~ 167 (516) +|+.+++++-+..+.+.-...+...... .....|...-..+.-...|.+.+ T Consensus 55 ~v~~ia~~iA~lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~ 134 (460) T protein:vir:10 55 CISQMAAKTVAVPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLY 134 (460) T ss_pred HHHHHHHhhhhCceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHH Confidence 9999999999999988544333211100 00111222222233445555555 Q ss_pred H-hcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEE--ee--eEe Q lcl|NC_019527. 168 E-HDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV--LG--REM 242 (516) Q Consensus 168 ~-~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v--~g--~~i 242 (516) . .-.++|.|++++.-++. ....|.+.+|.+++|.+|++.......+....|.. ..|.+ .| ..+ T Consensus 135 ~~~lll~Gnay~~i~r~~~-----------~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~-~~~~~~~~g~~~~~ 202 (460) T protein:vir:10 135 KTYMRLNGNCYFYLMSPDD-----------GINAGVPSQMYVLPAHLIKIVLKDDINLLSTDSPI-KSYMLIQGDQFIEF 202 (460) T ss_pred HHHHhhcCCeEEEEEecCC-----------CccCceeEEEEEEcCceEEEEEcCCCceeeeeeee-eEEEEecCceeEEe Confidence 5 55678998888755332 12335567899999988886443322222212111 12222 22 468 Q ss_pred ccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHH Q lcl|NC_019527. 243 HASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEM 322 (516) Q Consensus 243 H~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~ 322 (516) .++.||||........ -....++|+|.++.+.+.|.....+......++.+....-+.......++.+..+++.++++. T Consensus 203 ~~~evih~r~~~~~~~-~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~ 281 (460) T protein:vir:10 203 NEDEVIHTKYANPNFD-LQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFIHGGSTGLTQPQADSLKQRLTE 281 (460) T ss_pred cccceEEEecCCCCcc-cccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceeeecCCCCCHHHHHHHHHHHHH Confidence 8999999975432211 112346799999999999999999999988888886544322222333444444455555555 Q ss_pred HHHhcCCcceEEEecCCcceeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccc--cccchHHHHHHHHHHH Q lcl|NC_019527. 323 YVNMQSNLGLAVMDFDSEDIVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGL--NASSEGEIRSFYDDIS 398 (516) Q Consensus 323 ~~~~~sn~g~~~id~~~e~~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Gl--natge~D~~~yyd~I~ 398 (516) ......|.|..++..++-+|+.++.+... +.+......++||.+.+||..+| |...+|- .++.|.....|+. T Consensus 282 ~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~sn~e~~~~~f~~--- 357 (460) T protein:vir:10 282 MDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKLL-NNNEGGGLNTGNLEEERKRVVT--- 357 (460) T ss_pred HhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCCccccHHHHHHHHHH--- Confidence 54444565544433445788888766443 34566777899999999998754 5443221 2333444444443 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCC--CcCCcceEEeC--CCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHh Q lcl|NC_019527. 399 SVQQSYYFSPLDTMLKVIQLSKWG--EIDDAITFKFK--SLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSD 474 (516) Q Consensus 399 ~~Qe~~l~p~l~~l~~~l~~s~~g--~~~~d~~~~f~--pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~ 474 (516) ..|.|.+..|-..+-+..+- +...++.|+|+ .|..+. + ...+...++++|++|++|+|+.+ T Consensus 358 ----~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~--~-------d~~~~~~~~~~g~~T~NE~R~~~-- 422 (460) T protein:vir:10 358 ----DNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEMQ--T-------DMVAMASWLNTIPVTPNEIRIAM-- 422 (460) T ss_pred ----HHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhHH--H-------HHHHHHHHHhCCCCCHHHHHHHh-- Confidence 34677777777666654432 23445666664 331111 1 11223447899999999999998 Q ss_pred hhccCCCCCChh-h-hcccc----ccchhcCCCCCCCCCCCC Q lcl|NC_019527. 475 DPDSGWDNIDGD-L-EIVQP----EMFDDDGADPYMPDPDVL 510 (516) Q Consensus 475 ~~~~~~~~~d~~-~-e~~~~----e~~~~e~~~~~~~~~~~~ 510 (516) |+++++++ . +...+ ...+.++...+..+...+ T Consensus 423 ----g~~pi~~~~gD~~~~~~n~~~~~~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 423 ----KYETLNQDGMDIVFMPSNKVRIDDVSNNLIDSAFNQNQ 460 (460) T ss_pred ----CCCCCCCCCCCeeeecccccchhhcccccCCCcccCCC Confidence 44554311 0 10000 000001110011111111 No 82 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=99.82 E-value=2.8e-20 Score=127.81 Aligned_cols=329 Identities=10% Similarity=0.033 Sum_probs=178.6 Q ss_pred HhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccce Q lcl|NC_019527. 125 LTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSL 203 (516) Q Consensus 125 ~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l 203 (516) +-+-.+.+.-..+..+ ......|........-+..|.+.+.+. .++|.+++++..+. .|.+ T Consensus 1 ia~lp~~~~~~~~~~~---~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~---------------~G~~ 62 (348) T protein:vir:93 1 MASLPLKMYEDYKVVN---TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI---------------YHQP 62 (348) T ss_pred CcccceEeEecCcCcc---cHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECC---------------CCcE Confidence 1122333322211111 111223333333344566666666654 66888888875432 2445 Q ss_pred eeEEeecceeeccccccccccccccccCcceeEEe---e--eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHH Q lcl|NC_019527. 204 TGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL---G--REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYV 278 (516) Q Consensus 204 ~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~---g--~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l 278 (516) ..|.++.|..++..... .+.+-.|.+. | ..++++.|+||.+.. +...++|.|.++.+.+.+ T Consensus 63 ~~L~~l~~~~v~~~~~~--------~~~~~~y~~~~~~g~~~~~~~~eiih~r~~~------~~~~~~G~s~~~~~~~~i 128 (348) T protein:vir:93 63 SKLFLLNPDVVEMLIEN--------QSRELYYSIHAATGNKLIVHNMDMLHFKHIV------ASNMVQGISPIDVLKNTT 128 (348) T ss_pred EEEEEEcCCceEEEEeC--------CCcEEEEEEEcCCCeEEEEccccEEEecCCC------CCCceeeccHHHHHHHHH Confidence 67888888877753211 1223334443 2 358899999997643 234567999999988887 Q ss_pred HHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEecccCC--HHHH Q lcl|NC_019527. 279 ENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVNTPLSG--LADL 355 (516) Q Consensus 279 ~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~~~lsg--l~d~ 355 (516) .....+.... +........+.......++.+..+++.++ |....+|.+ +++++ ++.+|+.++.+... +.+. T Consensus 129 ~~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~e~~~~~~~~---~~~~~~n~~~~~vl~-~g~~~~~l~~~~~d~q~~e~ 202 (348) T protein:vir:93 129 DFDNAVRTFN--LTEMQKPDSFMLKYGSNVSTEKRQQVLED---FKQYYEENGGILFQE-PGVEIEPLPKKYVSEDIVAS 202 (348) T ss_pred HHHHHHHHHH--HHhcCCCceeEEecCCCCCHHHHHHHHHH---HHHHhhcCCCeeecC-CCceEEEcCCChhHHHHHHH Confidence 7655554442 22222222222222233333333334444 444444555 45554 45789988766553 4455 Q ss_pred HHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc--CCc--ceEE Q lcl|NC_019527. 356 QSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI--DDA--ITFK 431 (516) Q Consensus 356 ~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~--~~d--~~~~ 431 (516) .......||.+.+||..+|.+ ..++-.++.|...+.|+.. .|.|.++.+.+.|-+..+-.. ..+ |+|. T Consensus 203 ~~~~~~~Ia~~fgVP~~~lg~-~~~~~~~~~e~~~~~~~~~-------~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd 274 (348) T protein:vir:93 203 ENLTRERVANVFQLPSIFLNA-RSNTNFAKNEELNRFYLQH-------TLLPIVKQYEEEFNRKLLTKTDREKNRYFKFN 274 (348) T ss_pred HHHHHHHHHHHhCCCHHHhCC-CCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhCCcccccCcceEEee Confidence 667788999999999877743 3333445555555555544 378988888887766544221 123 4455 Q ss_pred eCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccc-----c-ccchhcCCCCCCC Q lcl|NC_019527. 432 FKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQ-----P-EMFDDDGADPYMP 505 (516) Q Consensus 432 f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~-----~-e~~~~e~~~~~~~ 505 (516) ++.|...|.+++|+ ++.+++++|++|++|+|+.++. ++++...+..- + +...+......++ T Consensus 275 ~~~l~~~d~~~~a~-------~~~~~~~~G~~T~NE~R~~~g~------~p~~ggD~~~~~~n~~~~~~~~~~~~~~~gg 341 (348) T protein:vir:93 275 VKSYLRADSATQAE-------VYFKAVRSGYYTINDIREWEDL------PPVEGGDKPLISGDLYPIDTPLELRKSLKGG 341 (348) T ss_pred chhhhccCHHHHHH-------HHHHHHhCCCCCHHHHHHHhCC------CCCCCcCeEeecccccccccchhhcccccCC Confidence 66888888888665 5566899999999999999844 44422111010 0 0011011111222 Q ss_pred CCCCCCC Q lcl|NC_019527. 506 DPDVLPG 512 (516) Q Consensus 506 ~~~~~~~ 512 (516) +++...+ T Consensus 342 ~~n~~~~ 348 (348) T protein:vir:93 342 DKNVNES 348 (348) T ss_pred CCCcCCC Confidence 2222222 No 83 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=99.82 E-value=3.6e-19 Score=121.70 Aligned_cols=409 Identities=13% Similarity=0.084 Sum_probs=203.6 Q ss_pred hhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhh Q lcl|NC_019527. 43 ERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLS 122 (516) Q Consensus 43 ~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~a 122 (516) .+.....+.....|+.-.. .+..+.+..|.+ ..+++ .++.=-.|.++|+.+++++.||+.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~--------------~~~~~-pp~~~~~La~~~~~n~~v~scI~~ia 62 (540) T protein:vir:41 1 MFNYHLSIKSLEKYRAIKG---DTDSQALKEDRF--------------EEYVE-PKVHPLVLLSLLQVNPYHASACSIKA 62 (540) T ss_pred CCCcccChhhccchhhhhc---cccccccccCCC--------------Ccccc-CCCCHHHHHHHHHhcHHHHHHHHHHH Confidence 1222222222222211111 111112211111 01111 01111245678889999999999999 Q ss_pred HHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHH-hcccceeeEEEEEecCCCcccCccccccccccc Q lcl|NC_019527. 123 TELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAE-HDCFFGRGQISINIKGADVSVPLILDPRTIKKG 201 (516) Q Consensus 123 ed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~-~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g 201 (516) +++.+.++.+...+.... .+. .....-+..|.+.+. ...++|.|++++.-++ .| T Consensus 63 ~~ia~~~~~i~~~~~~~~--------~~l--pN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~---------------~G 117 (540) T protein:vir:41 63 NDILRTGYLIDGDDGGVE--------ELL--RACRPSFEFILLQALEDLQVFNYCTLEVVRDD---------------QG 117 (540) T ss_pred HHHhcCCceEecCccchh--------hhc--cCCCCCHHHHHHHHHHHHHhcCCeEEEEEECC---------------CC Confidence 999999999977654321 111 122223344555555 4566899998875432 13 Q ss_pred ceeeEEeecceeeccccc-----ccccc----ccccccCcceeEEe----eeEeccceEEEecCCcchhhhhhccCCCCc Q lcl|NC_019527. 202 SLTGFSNIEPMWTSPSAY-----NALDP----TAPDFYKPSTWWVL----GREMHASRLLTIITRPLPDMLKPAYNFSGI 268 (516) Q Consensus 202 ~l~~l~v~d~~~v~p~~~-----~~~dp----~s~~yg~P~~y~v~----g~~iH~SRli~~~~~~~p~~~k~~~~~~G~ 268 (516) .+.+|.++++.+|....- ...|. ....|+.+..+... ...+.++.||||.... +....+|+ T Consensus 118 ~~~~L~~i~~~~V~v~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~~------~~~~~~G~ 191 (540) T protein:vir:41 118 EPVRLDYIPAHTVRVHRDGSRYMQTWDGIHVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLPS------PICSYYGV 191 (540) T ss_pred cEEEEEEeCCcceEEeEcCceeEeeecCceeeeeecccccceeeccccccceeecccceEEecCCC------CCCCcccc Confidence 345566676666653210 01111 11112222111111 2357788999996542 34567899 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecc----hhhhcCccHHHHHHHHHHHHH-----hcCCcceEEEec Q lcl|NC_019527. 269 SMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNM----AQVLNGGEGGDVFDRVEMYVN-----MQSNLGLAVMDF 337 (516) Q Consensus 269 S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~----~~~l~~~~~~~l~~r~~~~~~-----~~sn~g~~~id~ 337 (516) |.+..+...+.....+......++++.... +++++. ...+.......+.++++..-. ...|.|..++.. T Consensus 192 Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe 271 (540) T protein:vir:41 192 PRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFS 271 (540) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEe Confidence 999999999999999999998888887654 334321 111222222334444443222 124555544421 Q ss_pred ------CCcceeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeecccccc-cc-ccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 338 ------DSEDIVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSG-LN-ASSEGEIRSFYDDISSVQQSYYFS 407 (516) Q Consensus 338 ------~~e~~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~G-ln-atge~D~~~yyd~I~~~Qe~~l~p 407 (516) ++-+|+.++.+..+ +-+......+.||++.+||..+| |...+| .| |+.+.....||. ..|.| T Consensus 272 ~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~l-G~~~~~~~n~sn~eq~~~~f~~-------~tL~P 343 (540) T protein:vir:41 272 IPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRL-GITDVGPLGGNFAEVARRTYYE-------SVVRP 343 (540) T ss_pred cCCCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHc-CcccCCCCCcccHHHHHHHHHH-------HHHHH Confidence 22356666544333 34567778889999999998755 765432 22 445555666654 34677 Q ss_pred HHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccC---CCCCC Q lcl|NC_019527. 408 PLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSG---WDNID 484 (516) Q Consensus 408 ~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~---~~~~d 484 (516) .++.|-..|-+..+-....++.|+|+...-+.. ++ +..+..++++|++|++|+|+.|...+..+ ..++. T Consensus 344 ~~~~ie~~ln~~L~~~~~~~~~i~f~~~~ll~~-D~-------~~~~~~lv~~G~lT~NE~Re~L~g~e~gdd~~l~p~n 415 (540) T protein:vir:41 344 QQEIVSSVLTDFIQLKLDPGARFVFNEEILMES-EF-------VHNYALLVQCGVLTPSEVREKLFGLDGGPDMFMVPSS 415 (540) T ss_pred HHHHHHHHHHHhhhhccCCceEEEecchhhcch-HH-------HHHHHHHHhCCCCCHHHHHHHhCcCcCCCcccccccc Confidence 777776666543322334578888875332221 21 22345678999999999998662111000 01110 Q ss_pred hhh------hcc--ccccchh----cCCCC-CCCCCCCCCCCCCC Q lcl|NC_019527. 485 GDL------EIV--QPEMFDD----DGADP-YMPDPDVLPGEEGS 516 (516) Q Consensus 485 ~~~------e~~--~~e~~~~----e~~~~-~~~~~~~~~~~e~t 516 (516) ... +.. .++..+. ...++ ...+..+....|.+ T Consensus 416 ~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~ 460 (540) T protein:vir:41 416 IGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLEDK 460 (540) T ss_pred cccccccccccccCCCCccccccccchhcccccCccccccccccc Confidence 000 000 0000000 00011 01111111111221 No 84 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=99.82 E-value=3e-20 Score=127.62 Aligned_cols=375 Identities=9% Similarity=-0.025 Sum_probs=204.2 Q ss_pred hhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhC Q lcl|NC_019527. 32 LAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATR 111 (516) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~ 111 (516) |.+. +...+.+.. +. ...+++.+.. ... .. ....+...+. ...+.++ T Consensus 1 M~~f----~~~~~~~~~------~~--------------~~~~~~~~~~-~~~--~~-----~~~~~~~~v~-~~~al~~ 47 (386) T protein:vir:49 1 MPIF----NITNLATES------PP--------------INQESFFDIA-DSD--FL-----ASLNSSEWVS-AENALKN 47 (386) T ss_pred Cchh----hhhccCCCC------cc--------------cchhhhhhhh-hcc--cc-----ccccCCceec-hhhhhcc Confidence 2222 111111100 00 0001100000 000 00 0000111111 1223457 Q ss_pred chhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcc-cceeeEEEEEecCCCcccC Q lcl|NC_019527. 112 PEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDC-FFGRGQISINIKGADVSVP 190 (516) Q Consensus 112 ~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~r-lyG~a~i~i~i~~~~~~~P 190 (516) +.+.++|+.+|+++-+..+++..... ..|......+.-...|.+.+.+.+ ++|.|++++..++. T Consensus 48 ~~v~~~i~~ia~~ia~~p~~~~~~~~----------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~----- 112 (386) T protein:vir:49 48 SDLFSIISQLSNDLATAKITTSRKQL----------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDN----- 112 (386) T ss_pred HHHHHHHHHHHHHhhhCceeeccchh----------hhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCC----- Confidence 78899999999999998888754321 234444555556677777777775 57888887754321 Q ss_pred cccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe--------eeEeccceEEEecCCcchhhhhhc Q lcl|NC_019527. 191 LILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL--------GREMHASRLLTIITRPLPDMLKPA 262 (516) Q Consensus 191 l~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~--------g~~iH~SRli~~~~~~~p~~~k~~ 262 (516) |.+.+|.+++|.+|++...... ..-.|.+. .+.++++.||||.... +. T Consensus 113 ----------g~~~~l~~i~~~~v~v~~~~~~--------~~~~y~~~~~~~~~~~~~~~~~~evih~~~~~------~~ 168 (386) T protein:vir:49 113 ----------GRDMKWEYLRPSQVSFNRLDNQ--------NGLYYNITFDDPHIAPKQHVPQNDILHFRLLS------VD 168 (386) T ss_pred ----------CcEEEEEEecCceeEEEEcCCC--------ceEEEEEEEcCccccceeEEccccEEEecCCC------CC Confidence 3345688888888775432211 11233332 2468899999997543 22 Q ss_pred cCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcce Q lcl|NC_019527. 263 YNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDI 342 (516) Q Consensus 263 ~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~ 342 (516) ..+.|.|.++.+.+.|.....+......++++....-......+.+..... .+..+.+..+..|.|..++..++.+| T Consensus 169 ~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~---~~~~~~~~~~~~n~g~~~vl~~g~~~ 245 (386) T protein:vir:49 169 GGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDFK---TKVSRSRQAMKQMQGGPLVLDDLEDF 245 (386) T ss_pred CccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHH---HHHHHHHHHhccCCCCceecCCCceE Confidence 335799999999999999999999999999987654333222233333222 22233444555666654444445789 Q ss_pred eEEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 343 VQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSK 420 (516) Q Consensus 343 e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~ 420 (516) +.++.+... +.+......++||.+.+||-.+| |.+.++. ++++ ..+.|| ...++|.++.+...+-+.. T Consensus 246 ~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~-~~~~-~~~~~~-------~~~i~~~l~~i~~~~~~~l 315 (386) T protein:vir:49 246 TPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIV-GGDGDQQ-SSLE-MIYNIY-------FKSVSRYLRPFVSEMSKKL 315 (386) T ss_pred EEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCcc-chHH-HHHHHH-------HHHHHHHHHHHHHHHHHHh Confidence 988766554 34567888899999999998776 4332222 2333 223332 3345666666655554432 Q ss_pred CCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCC Q lcl|NC_019527. 421 WGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGA 500 (516) Q Consensus 421 ~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~ 500 (516) . ..+.|....+...+.++++ .....++.+|++|++|+|+.|...+-. .+++..... ..... T Consensus 316 ~----~~~~~~~~~~~~~d~~~~~-------~~~~~l~~~g~~t~nE~r~~l~~~~~~-~~~~~~~~~---~~~~~---- 376 (386) T protein:vir:49 316 S----CEVDVDISPAVDPTGSNYI-------SLINSMVKSGTLAQNQGLYILQQAEIL-PKELPDGKN---PNRTS---- 376 (386) T ss_pred c----chhcccchhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHHhhCCCC-CCcCcchhc---cCCCC---- Confidence 2 2455666666666666654 445668999999999999998653311 111111000 00011 Q ss_pred CCCCCCCCCCC Q lcl|NC_019527. 501 DPYMPDPDVLP 511 (516) Q Consensus 501 ~~~~~~~~~~~ 511 (516) ..+++.+++. T Consensus 377 -~~gGd~~~~~ 386 (386) T protein:vir:49 377 -LKGGEINEQD 386 (386) T ss_pred -CCCCCCCCCC Confidence 1122222111 No 85 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=99.81 E-value=2e-19 Score=123.14 Aligned_cols=376 Identities=12% Similarity=0.074 Sum_probs=203.1 Q ss_pred hhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCch Q lcl|NC_019527. 34 MRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPE 113 (516) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i 113 (516) +.-.+.+-. ++...+.. -..+ .+..+.+..+. ........ +...+ ......+++. T Consensus 1 m~m~~~~~~-~~~~~~~~-~~~~---~~~~~~~~~~~---------~~~~~~~~----------~g~~v-~~~~al~~~~ 55 (392) T protein:vir:74 1 MILPILNFI-NQTNDPPE-AGSV---QSYFPDGNDAQ---------IMESLLGD----------NNEWV-SARAALRNSD 55 (392) T ss_pred Ccchhhhhh-hcccCccc-cccc---ccccccCchhh---------hhhhccCC----------CCccc-chhhhhcchH Confidence 111222211 11111000 0000 00000000000 00000000 00000 1122346788 Q ss_pred hhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcc Q lcl|NC_019527. 114 YRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLI 192 (516) Q Consensus 114 ~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ 192 (516) +++||+.+|+++-+-.+.+..... ..|...-.....+..|.+.+.+. .++|.|++++.-+. T Consensus 56 v~~~v~~ia~~ia~lp~~~~~~~~----------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~-------- 117 (392) T protein:vir:74 56 LFSIILQLSSDLAIVKINAEKKKN----------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNA-------- 117 (392) T ss_pred HHHHHHHHHHhhccCceeeccchh----------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECC-------- Confidence 999999999999888777643221 22333444445566777776655 66788888875422 Q ss_pred cccccccccceeeEEeecceeeccccccccccccccccCcceeEEee--------eEeccceEEEecCCcchhhhhhccC Q lcl|NC_019527. 193 LDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG--------REMHASRLLTIITRPLPDMLKPAYN 264 (516) Q Consensus 193 ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g--------~~iH~SRli~~~~~~~p~~~k~~~~ 264 (516) .|.+.+|.+++|.+|+..... ++....|++.. ..+++++||||....+ ... T Consensus 118 -------~G~~~~L~~i~~~~v~v~~~~--------~~~~~~y~~~~~~~~~~~~~~~~~~evih~~~~~~------~~~ 176 (392) T protein:vir:74 118 -------NGADMKWEYLRPSQVNTYYFE--------YENGMYYNITFDDPKIEPILQAPQSDLIHMKLLSI------DGG 176 (392) T ss_pred -------CCcEEEEEEEcCceeEEEEcC--------CCceEEEEEEecCCccceeEEEcCccEEEecCCCC------CCc Confidence 244667889998888754211 11122344431 3578899999976432 223 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcc Q lcl|NC_019527. 265 FSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSED 341 (516) Q Consensus 265 ~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~ 341 (516) +.|.|.++.+.+.|.....+......++.+.... +++++.. .... .++..+..+.+. ...|.| ..+++ ++.+ T Consensus 177 ~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~--~~~~-~~~~~~~~~~~~-~~~n~g~~~vl~-~g~~ 251 (392) T protein:vir:74 177 KTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGG--GLLS-DKDKASRSRSFM-KRSRSGGPVVLD-DLEE 251 (392) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC--CCch-HHHHHHHHHHHh-ccccCCCeeecC-CCce Confidence 5699999999999999999999999998887654 3444322 1111 222222233333 333444 45565 4578 Q ss_pred eeEEecccC--CHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 342 IVQVNTPLS--GLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLS 419 (516) Q Consensus 342 ~e~~~~~ls--gl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s 419 (516) |++++.+.. .+-+.......+||.+.+||..+| |.....- +..+... ++-+..|.|.+..+.+.+-+. T Consensus 252 ~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~--~~~e~~~-------~~~~~~l~p~~~~ie~~l~~~ 321 (392) T protein:vir:74 252 FTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYI-GGQGDQQ--SSIQQIS-------GMYASALNRYLRPAISELEYK 321 (392) T ss_pred EEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCcc--cHHHHHH-------HHHHHHHHHHHHHHHHHHHHh Confidence 998876543 345566777889999999998666 4331111 1122233 333455788888887777654 Q ss_pred hCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcC Q lcl|NC_019527. 420 KWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDG 499 (516) Q Consensus 420 ~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~ 499 (516) . .+++.+.+..+...+.++++ +.+..++.+|++|++|+|+.+.. .|+.+.+ . ...++ T Consensus 322 l----~~~~~~~~~~~~~~d~~~~~-------~~~~~l~~~g~~t~near~~~~~---~g~~pne--~-------r~~en 378 (392) T protein:vir:74 322 L----SDHISVNMRPAIDPLGDNYL-------STISTATRWGALAENQATFVLQE---AGYIPKD--L-------PAPEN 378 (392) T ss_pred c----cchhcccchhhhcCCHHHHH-------HHHHHHHhCCCcCHHHHHHHHHh---CCCCccc--c-------chhcC Confidence 3 24566767777777766654 45677899999999999998754 2333211 1 11111 Q ss_pred CCC-CCCCC-CCCC Q lcl|NC_019527. 500 ADP-YMPDP-DVLP 511 (516) Q Consensus 500 ~~~-~~~~~-~~~~ 511 (516) -+| ++++. ++-| T Consensus 379 l~~~~~Gd~~~p~p 392 (392) T protein:vir:74 379 TNKKTTGQSNEPVP 392 (392) T ss_pred CCCCCCCCCCCCCC Confidence 111 11111 1111 No 86 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=99.80 E-value=1e-18 Score=119.27 Aligned_cols=405 Identities=13% Similarity=0.112 Sum_probs=195.8 Q ss_pred hhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhh Q lcl|NC_019527. 43 ERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLS 122 (516) Q Consensus 43 ~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~a 122 (516) .+...-.+.-...|..... . .+..+. ......+ .+++ .++.--.+.++++.|+.+++||+.++ T Consensus 1 ~~~~~~~i~s~~~~~~i~~----~---~~~s~~--------~~~~~~~-~~~~-pp~~~~~la~l~~~n~~v~scI~~ia 63 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKR----E---EVESQA--------LGETRFE-EYVE-PKVNPLVLLSLLQVNPYHASACSIKA 63 (542) T ss_pred Cccccccccccccchhhhh----c---cccccc--------cccccCC-cccc-CCCCHHHHHHHHhhcHHHHHHHHHHH Confidence 1221111111111110000 0 000000 0000000 0111 12222356678889999999999999 Q ss_pred HHHhhCCCeeeeccccchhhhHHHHHHHHHHH-Hhc-ChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccc Q lcl|NC_019527. 123 TELTREGIEITSKDRTKAKEMASKIKELEEAC-EYY-GVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKK 200 (516) Q Consensus 123 ed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~-~~l-~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~ 200 (516) +++.+.++.+...... .+...+ ... ...+.+...+..-.++|.|++++..+. . T Consensus 64 ~~IA~l~~~~~~~~~~----------~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~---------------~ 118 (542) T protein:vir:41 64 NDIIRTGYILEGDDEG----------VVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDD---------------R 118 (542) T ss_pred HHHhhCceeeecccch----------hhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC---------------C Confidence 9999999998654321 111111 111 233334444556678899998875432 2 Q ss_pred cceeeEEeecceeeccccc-----cccccccccccC----ccee-EEe---eeEeccceEEEecCCcchhhhhhccCCCC Q lcl|NC_019527. 201 GSLTGFSNIEPMWTSPSAY-----NALDPTAPDFYK----PSTW-WVL---GREMHASRLLTIITRPLPDMLKPAYNFSG 267 (516) Q Consensus 201 g~l~~l~v~d~~~v~p~~~-----~~~dp~s~~yg~----P~~y-~v~---g~~iH~SRli~~~~~~~p~~~k~~~~~~G 267 (516) |.+.+|.++++.+|....- ...+.....|.+ +..+ ... +..+.++.||||.... +...++| T Consensus 119 G~~~~L~~l~~~~v~v~~d~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir~~~------~~~~~~G 192 (542) T protein:vir:41 119 GDPIRFEYIPSHTIRVHKDGSRYRQTWDGVNITHFKDYRYEGEINPETGEDQDSVGANELVFIHIPS------PVCSYYG 192 (542) T ss_pred CcEEEEEEEcCcceEEEEcCCeeEeeecCCcceeEEeecccccccccccccccccCcccEEEecCCC------CCCCccc Confidence 3455677777777664211 001111111111 1100 011 2346678899986542 3345689 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecc--hh------hhcCccHHHHHHHHHHH-HHhcCCcceEEEe Q lcl|NC_019527. 268 ISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNM--AQ------VLNGGEGGDVFDRVEMY-VNMQSNLGLAVMD 336 (516) Q Consensus 268 ~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~--~~------~l~~~~~~~l~~r~~~~-~~~~sn~g~~~id 336 (516) .|.++.+...|.....+......++.+.... ++++.. .. .++....+.+.+.++.. .....|.|..++. T Consensus 193 lspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL 272 (542) T protein:vir:41 193 VPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVF 272 (542) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEe Confidence 9999999999998888888888888876654 344321 11 11111222334444332 2223555544432 Q ss_pred c------CCcceeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeecccccc-cc-ccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 337 F------DSEDIVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSG-LN-ASSEGEIRSFYDDISSVQQSYYF 406 (516) Q Consensus 337 ~------~~e~~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~G-ln-atge~D~~~yyd~I~~~Qe~~l~ 406 (516) . ++-+|..++.+..+ +-+......++||++.+||..+| |...++ +| ++.|.....| .+..|. T Consensus 273 ~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~l-G~~~~~t~n~sn~Eq~~~~f-------~~~tL~ 344 (542) T protein:vir:41 273 SIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRL-GIADTGPLGGNFAEVTRRTY-------YESVVR 344 (542) T ss_pred eccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CcCCCcccccccHHHHHHHH-------HHHHHH Confidence 1 12245544433222 33445666789999999998765 665433 33 3344444444 345567 Q ss_pred HHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChh Q lcl|NC_019527. 407 SPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGD 486 (516) Q Consensus 407 p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~ 486 (516) |.++.|-..|-...+-....++.|+|+....+.. + ++..+..++++|++|++|+|+.|. ++++.++- T Consensus 345 P~~~~ie~~ln~~L~~~~~~~~~~~f~~~~ll~~-d-------~~~~~~~~v~~GilT~NE~Re~L~-----g~~pgdd~ 411 (542) T protein:vir:41 345 PQQNIISSILTDFFQVKFNPKTRFKFNDETLLES-D-------SVRNCALLVQSGVLTPAEARERLF-----GLDGGPDI 411 (542) T ss_pred HHHHHHHHHHHhhcccccCCceEEEecchhhcch-H-------HHHHHHHHHhCCCCCHHHHHHhhC-----CCCCCCcc Confidence 8887777766543222222467788875433321 1 122345689999999999998662 12222110 Q ss_pred ---------------hhcccc-ccchhcCCCCCCC---CC---CCCCCCCCC Q lcl|NC_019527. 487 ---------------LEIVQP-EMFDDDGADPYMP---DP---DVLPGEEGS 516 (516) Q Consensus 487 ---------------~e~~~~-e~~~~e~~~~~~~---~~---~~~~~~e~t 516 (516) ..+.+. +..+.+..+.... ++ +..+.++.. T Consensus 412 ~l~p~~~~~~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~ 463 (542) T protein:vir:41 412 FMVPSKGAAKSVKRQERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAEEKK 463 (542) T ss_pred ccccccccccccccCCcCCCCCchhhhhhcccccCccccccccccccchhhc Confidence 000010 0111011010000 00 000111111 No 87 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=99.78 E-value=1.1e-18 Score=118.98 Aligned_cols=377 Identities=11% Similarity=0.050 Sum_probs=203.0 Q ss_pred hhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCch Q lcl|NC_019527. 34 MRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPE 113 (516) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i 113 (516) +.-.+.+-. ++...+... ..+ .+ ...++..........+. +...+. ....-+++. T Consensus 1 m~m~~f~~~-~~~~~~~~~-~~~---~~---------~~~~~~~~~~~~~~~~~----------~~~~v~-~~~al~~~~ 55 (392) T protein:vir:10 1 MILPILNFI-NQTNDPPEV-GSV---QS---------YFPDGNDAQIMESLLGD----------NNEWVS-ARAALRNSD 55 (392) T ss_pred Ccchhhhhh-hcccccccc-ccc---cc---------ccccCchhhhhhhhcCC----------CCceec-hHHhhccHH Confidence 111122211 111110000 000 00 11111000000000000 000111 122235788 Q ss_pred hhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcc Q lcl|NC_019527. 114 YRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLI 192 (516) Q Consensus 114 ~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ 192 (516) +.++|+.+|+++-+..+++.-... ..|...-........|.+.+.+. .++|.+++++.-+. T Consensus 56 v~~~i~~ia~~ia~lp~~~~~~~~----------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~-------- 117 (392) T protein:vir:10 56 LFSIILQLSSDLAIVKINAEKKKN----------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNA-------- 117 (392) T ss_pred HHHHHHHHHHhhccCceeeccchh----------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECC-------- Confidence 999999999999888887653221 22333344444556676666655 66788888875432 Q ss_pred cccccccccceeeEEeecceeeccccccccccccccccCcceeEEee--------eEeccceEEEecCCcchhhhhhccC Q lcl|NC_019527. 193 LDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG--------REMHASRLLTIITRPLPDMLKPAYN 264 (516) Q Consensus 193 ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g--------~~iH~SRli~~~~~~~p~~~k~~~~ 264 (516) .|.+.+|.++++..|++.... .+....|++.. ..++++.|||+.+.. +... T Consensus 118 -------~g~~~~L~~l~~~~v~~~~~~--------~~~~~~y~~~~~~~~~~~~~~~~~~eiih~~~~~------~~~~ 176 (392) T protein:vir:10 118 -------NGADMKWEYLRPSQVNTYYFE--------YENGMYYNITFDDPKIEPILQAPQSDLIHMKLLS------IDGG 176 (392) T ss_pred -------CCcEEEEEEEcCceeEEEEcC--------CCceEEEEEEecCcccceeEEEccccEEEecCCC------CCCc Confidence 244667888988888754311 12233455531 358899999997643 2233 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcce Q lcl|NC_019527. 265 FSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDI 342 (516) Q Consensus 265 ~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~ 342 (516) +.|.|.++.+.+.|.....+......++.+.... +++++... ... +++..+..+.+....+..++++++. +.+| T Consensus 177 ~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~--~~~-~~~~~~~~~~~~~~~~~g~~~vl~~-g~~~ 252 (392) T protein:vir:10 177 KTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGG--LLS-DKDKASRSRSFMKRSRSGGPVVLDD-LEEF 252 (392) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC--Cch-HHHHHHHHHHHhccccCCCeeecCC-CceE Confidence 5799999999999999999999999998886654 44543221 111 2222222233333333334566654 5789 Q ss_pred eEEecccC--CHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 343 VQVNTPLS--GLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSK 420 (516) Q Consensus 343 e~~~~~ls--gl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~ 420 (516) ++++.+.. .+-+......++||.+.+||..+| |.+... .+..+....|| +..|.|.++.+.+.+-+.. T Consensus 253 ~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l-g~~~~~--~~~~~~~~~f~-------~~~l~P~~~~ie~~l~~~L 322 (392) T protein:vir:10 253 TALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYI-GGQGDQ--QSSIQQISGMY-------ASALNRYLRPAISELEYKL 322 (392) T ss_pred EEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCc--ccHHHHHHHHH-------HHHHHHHHHHHHHHHHHhc Confidence 98876544 345567777789999999998766 433111 12222333333 3457788888777666543 Q ss_pred CCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCC Q lcl|NC_019527. 421 WGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGA 500 (516) Q Consensus 421 ~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~ 500 (516) + +++.+....+...+.+++ ++.+..++..|++|++|+|+.+.. .|+.+.+ . ...++- T Consensus 323 ~----~~~~~d~~~~~~~d~~~~-------~~~~~~l~~~g~~t~nE~r~~l~~---~g~~p~e--~-------r~~e~l 379 (392) T protein:vir:10 323 S----DHISVNMRPAIDPLGDNY-------LSTISTATRWGALAENQATFVLQE---AGYIPKD--L-------PAPENT 379 (392) T ss_pred c----ccccccchhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHHHh---cCCCccc--c-------chhcCC Confidence 2 355666666666666555 445677899999999999998754 2343211 1 111111 Q ss_pred CC-CCCCC-CCCC Q lcl|NC_019527. 501 DP-YMPDP-DVLP 511 (516) Q Consensus 501 ~~-~~~~~-~~~~ 511 (516) +| ++++. ++.| T Consensus 380 ~~~~~Gd~~~p~p 392 (392) T protein:vir:10 380 NKKTTGQSNEPVP 392 (392) T ss_pred CCCCCCCCCCCCC Confidence 11 11111 1122 No 88 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=99.78 E-value=1.1e-18 Score=118.98 Aligned_cols=377 Identities=11% Similarity=0.050 Sum_probs=203.0 Q ss_pred hhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCch Q lcl|NC_019527. 34 MRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPE 113 (516) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i 113 (516) +.-.+.+-. ++...+... ..+ .+ ...++..........+. +...+. ....-+++. T Consensus 1 m~m~~f~~~-~~~~~~~~~-~~~---~~---------~~~~~~~~~~~~~~~~~----------~~~~v~-~~~al~~~~ 55 (392) T protein:vir:39 1 MILPILNFI-NQTNDPPEV-GSV---QS---------YFPDGNDAQIMESLLGD----------NNEWVS-ARAALRNSD 55 (392) T ss_pred Ccchhhhhh-hcccccccc-ccc---cc---------ccccCchhhhhhhhcCC----------CCceec-hHHhhccHH Confidence 111122211 111110000 000 00 11111000000000000 000111 122235788 Q ss_pred hhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcc Q lcl|NC_019527. 114 YRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLI 192 (516) Q Consensus 114 ~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ 192 (516) +.++|+.+|+++-+..+++.-... ..|...-........|.+.+.+. .++|.+++++.-+. T Consensus 56 v~~~i~~ia~~ia~lp~~~~~~~~----------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~-------- 117 (392) T protein:vir:39 56 LFSIILQLSSDLAIVKINAEKKKN----------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNA-------- 117 (392) T ss_pred HHHHHHHHHHhhccCceeeccchh----------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECC-------- Confidence 999999999999888887653221 22333344444556676666655 66788888875432 Q ss_pred cccccccccceeeEEeecceeeccccccccccccccccCcceeEEee--------eEeccceEEEecCCcchhhhhhccC Q lcl|NC_019527. 193 LDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG--------REMHASRLLTIITRPLPDMLKPAYN 264 (516) Q Consensus 193 ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g--------~~iH~SRli~~~~~~~p~~~k~~~~ 264 (516) .|.+.+|.++++..|++.... .+....|++.. ..++++.|||+.+.. +... T Consensus 118 -------~g~~~~L~~l~~~~v~~~~~~--------~~~~~~y~~~~~~~~~~~~~~~~~~eiih~~~~~------~~~~ 176 (392) T protein:vir:39 118 -------NGADMKWEYLRPSQVNTYYFE--------YENGMYYNITFDDPKIEPILQAPQSDLIHMKLLS------IDGG 176 (392) T ss_pred -------CCcEEEEEEEcCceeEEEEcC--------CCceEEEEEEecCcccceeEEEccccEEEecCCC------CCCc Confidence 244667888988888754311 12233455531 358899999997643 2233 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcce Q lcl|NC_019527. 265 FSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDI 342 (516) Q Consensus 265 ~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~ 342 (516) +.|.|.++.+.+.|.....+......++.+.... +++++... ... +++..+..+.+....+..++++++. +.+| T Consensus 177 ~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~--~~~-~~~~~~~~~~~~~~~~~g~~~vl~~-g~~~ 252 (392) T protein:vir:39 177 KTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGG--LLS-DKDKASRSRSFMKRSRSGGPVVLDD-LEEF 252 (392) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC--Cch-HHHHHHHHHHHhccccCCCeeecCC-CceE Confidence 5799999999999999999999999998886654 44543221 111 2222222233333333334566654 5789 Q ss_pred eEEecccC--CHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 343 VQVNTPLS--GLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSK 420 (516) Q Consensus 343 e~~~~~ls--gl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~ 420 (516) ++++.+.. .+-+......++||.+.+||..+| |.+... .+..+....|| +..|.|.++.+.+.+-+.. T Consensus 253 ~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l-g~~~~~--~~~~~~~~~f~-------~~~l~P~~~~ie~~l~~~L 322 (392) T protein:vir:39 253 TALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYI-GGQGDQ--QSSIQQISGMY-------ASALNRYLRPAISELEYKL 322 (392) T ss_pred EEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCc--ccHHHHHHHHH-------HHHHHHHHHHHHHHHHHhc Confidence 98876544 345567777789999999998766 433111 12222333333 3457788888777666543 Q ss_pred CCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCC Q lcl|NC_019527. 421 WGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGA 500 (516) Q Consensus 421 ~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~ 500 (516) + +++.+....+...+.+++ ++.+..++..|++|++|+|+.+.. .|+.+.+ . ...++- T Consensus 323 ~----~~~~~d~~~~~~~d~~~~-------~~~~~~l~~~g~~t~nE~r~~l~~---~g~~p~e--~-------r~~e~l 379 (392) T protein:vir:39 323 S----DHISVNMRPAIDPLGDNY-------LSTISTATRWGALAENQATFVLQE---AGYIPKD--L-------PAPENT 379 (392) T ss_pred c----ccccccchhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHHHh---cCCCccc--c-------chhcCC Confidence 2 355666666666666555 445677899999999999998754 2343211 1 111111 Q ss_pred CC-CCCCC-CCCC Q lcl|NC_019527. 501 DP-YMPDP-DVLP 511 (516) Q Consensus 501 ~~-~~~~~-~~~~ 511 (516) +| ++++. ++.| T Consensus 380 ~~~~~Gd~~~p~p 392 (392) T protein:vir:39 380 NKKTTGQSNEPVP 392 (392) T ss_pred CCCCCCCCCCCCC Confidence 11 11111 1122 No 89 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=99.78 E-value=2.9e-19 Score=122.20 Aligned_cols=348 Identities=14% Similarity=0.083 Sum_probs=185.1 Q ss_pred cccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccc-hhhh------HHH Q lcl|NC_019527. 74 DSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTK-AKEM------ASK 146 (516) Q Consensus 74 ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~-~~~~------~~~ 146 (516) =++++....... .....+..++..++....++.+..+++||+.+|+++-+-.+.+--....+ ..+. ... T Consensus 1 Mg~f~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l 76 (378) T protein:vir:94 1 MNLFGKVVSFSR----GKLNNDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDL 76 (378) T ss_pred CCccccchhccc----ccccCCcceeeeeccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCcccccccccccchH Confidence 034433322211 11112223445555566677888999999999999999888753222111 1100 111 Q ss_pred HHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccc Q lcl|NC_019527. 147 IKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPT 225 (516) Q Consensus 147 i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~ 225 (516) ...|+.+-...-....|.+.+.+. .++|.+++++..++.. |.+.++. |. T Consensus 77 ~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~--------------g~~~~l~--------p~-------- 126 (378) T protein:vir:94 77 DEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNT--------------GELLDLL--------FA-------- 126 (378) T ss_pred HHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCC--------------ceEEEEE--------ec-------- Confidence 122222222333455666666555 5678888887655421 2222221 10 Q ss_pred cccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCC-ceeeecc Q lcl|NC_019527. 226 APDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSR-TFLKTNM 304 (516) Q Consensus 226 s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~-~v~k~~~ 304 (516) -.+++++++.||||.+ +.+...|.|.++.+...+..+.+. ... .+++++ T Consensus 127 -----------~~~~~~~~~diiH~~~--------~~~~~~g~s~l~~~~~~i~~~~~~----------~~~~gil~~~- 176 (378) T protein:vir:94 127 -----------DDKKEYKPEELVRLTS--------PFYINEDTSILDNALASIQTKLEQ----------GKLRGLLKIN- 176 (378) T ss_pred -----------CCeeEeeeeeeEEecC--------cCCccchhHHHHHHHHHHHHHHhc----------ccccceeeeC- Confidence 0134577889999974 223446899998888777544321 111 234443 Q ss_pred hhhhcCccHHHHHHHHHHHH----HhcCCcceEEEecCCcceeEEecccCCHH-HHHHHHHHHHHhhhcCCceeeecccc Q lcl|NC_019527. 305 AQVLNGGEGGDVFDRVEMYV----NMQSNLGLAVMDFDSEDIVQVNTPLSGLA-DLQSQSQEHMCSVSKIPAIKLTGISP 379 (516) Q Consensus 305 ~~~l~~~~~~~l~~r~~~~~----~~~sn~g~~~id~~~e~~e~~~~~lsgl~-d~~~~~~~~iaaas~IP~t~L~G~sp 379 (516) ..++.....++.++++... ....+.++++++. +.+|++++.+....+ ....+...+||.+.+||..+|-|. T Consensus 177 -~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~-g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVP~~~l~~~-- 252 (378) T protein:vir:94 177 -AFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDN-KTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGT-- 252 (378) T ss_pred -CcCCHHHHHHHHHHHHHHHHHhhcccccccceecCC-CceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC-- Confidence 2233322233444443322 2222334666765 578998887655443 234566688999999999888431 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC----------cCCcceEEeCCCCCCCHHHHHHHHHH Q lcl|NC_019527. 380 SGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE----------IDDAITFKFKSLWQTSAKEESEIRFN 449 (516) Q Consensus 380 ~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~----------~~~d~~~~f~pL~~~sekEkAei~~~ 449 (516) ..+....+|| ...|.|.+..+-..+-+..+-. ...++.|++..|...|.+++++ T Consensus 253 -----~se~~~~~f~-------~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~---- 316 (378) T protein:vir:94 253 -----ASQEQQIYFY-------NSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELID---- 316 (378) T ss_pred -----hHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCHHHHHH---- Confidence 1233444444 3458888888877776554421 1124678888899888888755 Q ss_pred HHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhc-----ccc-ccchhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 450 KAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEI-----VQP-EMFDDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 450 ~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~-----~~~-e~~~~e~~~~~~~~~~~~~~~e 514 (516) ++.+++++|++|++|+|+.++.. +++.-.+. ..+ +...+.........++++++.+ T Consensus 317 ---~~~~~~~~G~~T~NE~R~~~gl~------p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 317 ---LYHENINGPIFTQNQLLVKMGEQ------PIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred ---HHHHHHhCCCcCHHHHHHHhCCC------CCCCCCeeeecccccccccchhhcCCcCCCCCCCCCCCC Confidence 45668999999999999998543 33211110 011 0000000011111222222222 No 90 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=99.78 E-value=2.1e-19 Score=122.99 Aligned_cols=350 Identities=15% Similarity=0.101 Sum_probs=181.4 Q ss_pred cccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccch-hh------hHHH Q lcl|NC_019527. 74 DSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKA-KE------MASK 146 (516) Q Consensus 74 ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~-~~------~~~~ 146 (516) =|+++....... +....+...+..|+-.+..+.+..+++||+.+|+++-+..+.+--....+. .+ .... T Consensus 1 Mg~f~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~~~~~~~l 76 (378) T protein:vir:16 1 MNLFGKVVSFSR----GKLNNDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDL 76 (378) T ss_pred Cccchhhhhhhc----ccccCCcceeeecccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccccccccccccccchH Confidence 033333222111 111122223344555555567788999999999999999887532221111 00 0111 Q ss_pred HHHHHHHHHhcChhHHHHHHHHhcc-cceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccc Q lcl|NC_019527. 147 IKELEEACEYYGVMGIIQKAAEHDC-FFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPT 225 (516) Q Consensus 147 i~~i~~~~~~l~~~~~l~ea~~~~r-lyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~ 225 (516) ...|+.+-........|.+.+.... ++|.+++++..++.. |.+.++. |.. T Consensus 77 ~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~--------------g~~~~l~--------~~~------- 127 (378) T protein:vir:16 77 DEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNT--------------GELLDLL--------FAD------- 127 (378) T ss_pred HHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCC--------------ceEEEEE--------ecC------- Confidence 1222222333345556666666555 478888887655421 2222221 110 Q ss_pred cccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCC-ceeeecc Q lcl|NC_019527. 226 APDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSR-TFLKTNM 304 (516) Q Consensus 226 s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~-~v~k~~~ 304 (516) .++++.++.||||.+ +.+...|.|.++.+...+..+.. .... .+++++ T Consensus 128 ------------~~~~~~~~diih~r~--------~~~~~~~~s~l~~~~~~i~~~~~----------~~~~~g~l~~~- 176 (378) T protein:vir:16 128 ------------DKKEYKPEELVRLTS--------PFYINEDTSILDNALASIQTKLE----------QGKLRGLLKIN- 176 (378) T ss_pred ------------CeeEecccceEEecC--------ccCccchhHHHHHHHHHHHHHHh----------cCccceeeEeC- Confidence 124567889999964 22334577887777666543221 1111 233432 Q ss_pred hhhhcCccHHHHHHHHHHHH----HhcCCcceEEEecCCcceeEEecccCCHH-HHHHHHHHHHHhhhcCCceeeecccc Q lcl|NC_019527. 305 AQVLNGGEGGDVFDRVEMYV----NMQSNLGLAVMDFDSEDIVQVNTPLSGLA-DLQSQSQEHMCSVSKIPAIKLTGISP 379 (516) Q Consensus 305 ~~~l~~~~~~~l~~r~~~~~----~~~sn~g~~~id~~~e~~e~~~~~lsgl~-d~~~~~~~~iaaas~IP~t~L~G~sp 379 (516) ..+......++.++++..- ...++.+++++++ +.+|+.++.+....+ ........+||.+.+||..+|.|. T Consensus 177 -~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~-g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~g~-- 252 (378) T protein:vir:16 177 -AFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDN-KTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGT-- 252 (378) T ss_pred -CcCCHHHHHHHHHHHHHHHHHhhcccccccceEcCC-CceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC-- Confidence 2233332333444443322 2223334666665 578998887654322 234666789999999999888442 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc----------CCcceEEeCCCCCCCHHHHHHHHHH Q lcl|NC_019527. 380 SGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI----------DDAITFKFKSLWQTSAKEESEIRFN 449 (516) Q Consensus 380 ~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~----------~~d~~~~f~pL~~~sekEkAei~~~ 449 (516) +.+.....|| ...|.|.+..+-..|-+..+... ..++.|+++.|...|.+++++. T Consensus 253 -----~~e~~~~~f~-------~~tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~--- 317 (378) T protein:vir:16 253 -----ASQEQQIYFY-------NSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDL--- 317 (378) T ss_pred -----chHHHHHHHH-------HHHHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHH--- Confidence 2243444443 34578888888777765544221 1246788889999999887664 Q ss_pred HHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc--ccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 450 KAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP--EMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 450 ~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~--e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) +..++++|++|++|+|+.++.. +++.-.+...+ -.+.+.-....+...+..++.|+- T Consensus 318 ----~~~~~~~G~~T~NE~R~~~g~~------p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~ 376 (378) T protein:vir:16 318 ----YHENINGPIFTQNQLLVKMGEQ------PIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDETN 376 (378) T ss_pred ----HHHHHhCCCcCHHHHHHHhCCC------CCCCCCeEeeccccccccchhhhcCccCCCCCCCCCC Confidence 4668999999999999998544 33211111000 000000000111111122222222 No 91 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=99.77 E-value=9e-19 Score=119.53 Aligned_cols=373 Identities=9% Similarity=-0.014 Sum_probs=205.5 Q ss_pred hhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhC Q lcl|NC_019527. 32 LAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATR 111 (516) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~ 111 (516) |.+... ... .. -.|+....+. ....|....+ .. .+.. +-....+.++ T Consensus 1 M~~f~~----~~~--~~----~~~~~~~~~~-------~~~~~~~~~~------~~---------~~~~-~v~~~~~~~~ 47 (386) T protein:vir:48 1 MPIFNI----TNL--AT----ESPPISQGGF-------FDITDPDFLS------TL---------NGSE-WVSAESALRN 47 (386) T ss_pred Cccccc----ccc--cc----cccccccccc-------cccccchhcc------cc---------cCCc-eechhhhhcc Confidence 222210 000 00 0000000000 0000000000 00 0000 1112335678 Q ss_pred chhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccC Q lcl|NC_019527. 112 PEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVP 190 (516) Q Consensus 112 ~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~P 190 (516) +.++++|+.+++++-+..+++.-.. ...|.........+..|.+.+.+. .++|.+++++..+. T Consensus 48 ~~v~~~i~~ia~~ia~~p~~~~~~~----------~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~------ 111 (386) T protein:vir:48 48 SDLFSIINQLSNDLATVKLTASRKQ----------LQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNE------ 111 (386) T ss_pred hHHHHHHHHHHHhhccCceeeccch----------hHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECC------ Confidence 9999999999999988888764211 123444444444566666666655 66788888875532 Q ss_pred cccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe--e------eEeccceEEEecCCcchhhhhhc Q lcl|NC_019527. 191 LILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL--G------REMHASRLLTIITRPLPDMLKPA 262 (516) Q Consensus 191 l~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~--g------~~iH~SRli~~~~~~~p~~~k~~ 262 (516) .|.+.+|.++++.+|+..... .+.+.+|+|. + +.+-++.||||.+.. +. T Consensus 112 ---------~g~~~~L~~l~~~~v~v~~~~--------~~~~~~y~~~~~~~~~~~~~~~~~~evih~~~~~------~~ 168 (386) T protein:vir:48 112 ---------NGRDMKWEYLRPSQVSFNRLD--------NKDGIYYNITFDDPRIPPKQHVPQGDVLHFKLLS------VD 168 (386) T ss_pred ---------CCcEEEEEEecCceeEEEEcC--------CCceEEEEEEecCccccceeEecCccEEEecCCC------CC Confidence 234557888888888753321 1233445542 1 346678999997543 22 Q ss_pred cCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcce-EEEecCCcc Q lcl|NC_019527. 263 YNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGL-AVMDFDSED 341 (516) Q Consensus 263 ~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~-~~id~~~e~ 341 (516) ..+.|.|.++.+.+.|.....+......++.+....-........++.+ +..+..+.+..+..|.+. .+++ ++.+ T Consensus 169 ~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e---~~~~~~~~~~~~~~n~g~~~vl~-~g~~ 244 (386) T protein:vir:48 169 GGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLLD---FKTKLSRSRQAMKQMQGGPLVLD-DLEE 244 (386) T ss_pred CceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHH---HHHHHHHHHHHhhcCCCCceecC-CCce Confidence 3357999999999999999999999999999866543332222233332 223333445555566655 4554 4578 Q ss_pred eeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 342 IVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLS 419 (516) Q Consensus 342 ~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s 419 (516) |+.++.+... +.+......++||.+.+||-.+| |.+ +-+++.+.....||. ..|.|.++.+-..|-+. T Consensus 245 ~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~--~~~~~~e~~~~~~~~-------~~l~P~~~~ie~~l~~~ 314 (386) T protein:vir:48 245 FTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVV-GGQ--GDQQSSLEMSLDLYN-------KAVSRYLRPFLSELSQK 314 (386) T ss_pred EEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCC--CCcccHHHHHHHHHH-------HHHHHHHHHHHHHHHHh Confidence 9988766554 45667788899999999998765 533 233445555555553 34788888887777665 Q ss_pred hCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCCh-hhhccccccchhc Q lcl|NC_019527. 420 KWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDG-DLEIVQPEMFDDD 498 (516) Q Consensus 420 ~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~-~~e~~~~e~~~~e 498 (516) .+. ++.+.+..+..++...+ +..+..++.+|++|++|+|+.+...+ +.. +....+. .. T Consensus 315 l~~----~~~~~~~~~~~~d~~~~-------~~~~~~l~~~g~~t~nE~r~~lg~~~------~~~~~~~~~~~----~~ 373 (386) T protein:vir:48 315 LSC----DVDADILPAVDPTGSNS-------VSRINSMVKSGTLAQNQGLYILQQAE------ILPKELPEGEN----PN 373 (386) T ss_pred hcc----hhhcchhhhhccChHHH-------HHHHHHHHhCCCcCHHHHHHHhhcCC------CCCccchhhcC----CC Confidence 442 33444444444444433 34456789999999999999985433 221 1110000 00 Q ss_pred CCCCCCCCCCCCC Q lcl|NC_019527. 499 GADPYMPDPDVLP 511 (516) Q Consensus 499 ~~~~~~~~~~~~~ 511 (516) ..+..++++++.. T Consensus 374 ~~~~~gGd~~~~~ 386 (386) T protein:vir:48 374 KTTLKGGEINGED 386 (386) T ss_pred CCccCCCCCCCCC Confidence 0111122222211 No 92 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=99.77 E-value=1.4e-18 Score=118.53 Aligned_cols=369 Identities=9% Similarity=-0.046 Sum_probs=199.6 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchh-cccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAV-AMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFAST 120 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~-a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~ 120 (516) |..-... +.+ .+....... ..++.... ...+...+ ....+.+++.+.++|+. T Consensus 1 Mg~f~~~----~~~-------~~~~~~~~~~~~~~~~~~---------------~~~~~~~v-~~~~~l~~~~v~~~i~~ 53 (382) T protein:vir:48 1 MPIFNLA----TES-------PPDNQGGFFDVVDSDFLA---------------SLKGNEWV-SAETALRNSDLFSIINQ 53 (382) T ss_pred Ccccccc----ccC-------Ccccccccccchhhhccc---------------cccCCccc-chHhhhccHHHHHHHHH Confidence 2110000 000 000000000 00110000 00000111 12334578889999999 Q ss_pred hhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCccccccccc Q lcl|NC_019527. 121 LSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIK 199 (516) Q Consensus 121 ~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~ 199 (516) +|+++-+..+++..... ..|......+.-+..|.+.+.+. .++|.|++++..+. T Consensus 54 ia~~ia~~~~~~~~~~~----------~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~--------------- 108 (382) T protein:vir:48 54 LSNDLATVKLITSRKKL----------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNE--------------- 108 (382) T ss_pred HHHhhccCceeeecchh----------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECC--------------- Confidence 99999988888764332 23455555555667777777766 55788888775432 Q ss_pred ccceeeEEeecceeeccccccccccccccccCcceeEEe--------eeEeccceEEEecCCcchhhhhhccCCCCchHH Q lcl|NC_019527. 200 KGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL--------GREMHASRLLTIITRPLPDMLKPAYNFSGISMS 271 (516) Q Consensus 200 ~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~--------g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~l 271 (516) .|.+.+|.+++|.+|+..... .+....|+|. .+.++++.||||.... +....+|.|.+ T Consensus 109 ~G~~~~l~~i~~~~v~v~~~~--------~~~~~~y~~~~~~~~~~~~~~~~~~evih~~~~~------~~~~~~G~s~l 174 (382) T protein:vir:48 109 NGRDMKWEYLRPSQVSFNRLD--------NKDGIYYNITFDDPRIPPKQHVPQNDVLHFRLLS------VDGGMTSVSPL 174 (382) T ss_pred CCcEEEEEEEcCceeEEEEcC--------CCCeEEEEEEecCccccceeEEcCccEEEecCCC------CCCccccccHH Confidence 134567888988888754321 1223345542 1357889999997542 22346799999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcce-EEEecCCcceeEEecccC Q lcl|NC_019527. 272 QLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGL-AVMDFDSEDIVQVNTPLS 350 (516) Q Consensus 272 e~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~-~~id~~~e~~e~~~~~ls 350 (516) +.+.+.|.....+......++.+....-........+..+.. .+-.+.+.....|.|. +++++ +.+|+.++.+.. T Consensus 175 ~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~---~~~~~~~~~~~~n~g~~~vl~~-g~~~~~l~~~~~ 250 (382) T protein:vir:48 175 MALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLDFK---TKLSRSRQAMKQMQGGPLVLDD-LEDFTPLEIKSN 250 (382) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHH---HHHHHHHHhhccCCCCeeEcCC-CceEEEccCChh Confidence 999999999999999999999987765333322233333322 2223334444455554 55654 578998886655 Q ss_pred C--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcc Q lcl|NC_019527. 351 G--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAI 428 (516) Q Consensus 351 g--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~ 428 (516) . +.+......+.||.+.+||..+| |.+..+ ++.+...+.|| +..|.|.+..+.+.+-+..+.....++ T Consensus 251 d~q~~e~~~~~~~~Ia~afgVp~~~l-g~~~~~--~~~~~~~~~~~-------~~~l~p~~~~i~~~l~~~l~~~~~~~~ 320 (382) T protein:vir:48 251 VSQLLKQADWTTGQFAKVYGIPDNVV-GGQGDQ--QSSLEMSSDLY-------SKAVSRYLRPFLSELSQKLSCDVDADI 320 (382) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCc--ccHHHHHHHHH-------HHHHHHHHHHHHHHHHHHhcChhhhhh Confidence 4 34667778899999999997666 543222 22222333333 455778888777777654432221111 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCCCCCCCCCC Q lcl|NC_019527. 429 TFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGADPYMPDPD 508 (516) Q Consensus 429 ~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~~~~~~~~~ 508 (516) .. ....+... .......++.+|+++++|+|+.|...+ +.. + +....+...+. ..+++++ T Consensus 321 ~~----~~~~~~~~-------~~~~~~~l~~~g~~t~~e~r~~l~~~g---~~~-~-~~~~~~~~~~~-----~~GGd~~ 379 (382) T protein:vir:48 321 FP----AVDPTGSN-------YISRINSLVKTGTLAQNQGLYILQQAE---ILP-K-ELPNGENPNST-----LKGGEED 379 (382) T ss_pred hh----hhccchhH-------HHHHHHHHhhcCccCHHHHHHHHhhCC---CCC-c-chhhhhcCCCC-----CCCCCCC Confidence 11 11122211 122345678999999999999885433 221 1 01000010011 1222222 Q ss_pred CCC Q lcl|NC_019527. 509 VLP 511 (516) Q Consensus 509 ~~~ 511 (516) +.. T Consensus 380 ~~~ 382 (382) T protein:vir:48 380 GQD 382 (382) T ss_pred CCC Confidence 222 No 93 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=99.76 E-value=2.1e-18 Score=117.53 Aligned_cols=367 Identities=10% Similarity=0.052 Sum_probs=181.6 Q ss_pred cccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHH Q lcl|NC_019527. 74 DSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEA 153 (516) Q Consensus 74 ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~ 153 (516) =|++.-.+.......... +...|. .-....|.++..++++|+.+++++.+..+.+.-.+.... ......|... T Consensus 1 Mg~f~~~f~~~~~~~~~~---~~~~~~-~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~~---~~l~~lL~~~ 73 (385) T protein:vir:95 1 MGLFDSVFKRHSELSWMY---DLEFLQ-DKSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNTKEK---GTLYYLLNVR 73 (385) T ss_pred CchhhhhhccCccccccc---chhhhh-ccchhhhhhhHHHHHHHHHHHHHHcccceeeeecCcccc---chHHHHHhcc Confidence 012211111100000000 000000 001234567788999999999999999998865433221 1122223333 Q ss_pred HHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCc Q lcl|NC_019527. 154 CEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKP 232 (516) Q Consensus 154 ~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P 232 (516) -....-+..|.+.+.+. .++|.|++++.-+++-+ .... +.+.....+.+.. ..+....+|+ T Consensus 74 PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~~~~~---~~~~-----------~~~~~~~~~~~~~--~~~~~~~~~~-- 135 (385) T protein:vir:95 74 PNRNQNAVDFWQKFIFKLIMDNEVLVVKNDEGHFF---VADD-----------FEKEDELGLYSHR--FTNVLVNDFE-- 135 (385) T ss_pred cCcCCCHHHHHHHHHHHHhhcCceEEEEecCCCee---eccc-----------ccccccccccccc--ceeeeecccc-- Confidence 34445556666665555 46788887764333211 0000 0000000000000 0011111111 Q ss_pred ceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCC-ceeeecchhhhcCc Q lcl|NC_019527. 233 STWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSR-TFLKTNMAQVLNGG 311 (516) Q Consensus 233 ~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~-~v~k~~~~~~l~~~ 311 (516) + .+.+-++.||||..... .....|.|.++.+.+.+.....+. .+..+. .+++++....++.+ T Consensus 136 ----~-~~~~~~~eiih~~~~~~------~~~~~G~s~~~~~~~~i~~~~~~~------~~~~~~~g~l~~~~~~~~~~e 198 (385) T protein:vir:95 136 ----F-KRVFTMDDVIYLKYNNQ------KLDAFSLGLFEDYGEIFGRMIDLQ------MLNNQIRGILKVDATKFYNKE 198 (385) T ss_pred ----e-eeeeccccEEEecCCCC------CcccccchHHHHHHHHHHHHHHHH------HhcCCCceEEEeCCccCCCHH Confidence 1 23456788999876432 223569999998888775433322 122222 33444433333333 Q ss_pred cHHHHHHHHHHHHHh-cCCcc-eEEEecCCcceeEEecccC--------CHHHHHHHHHHHHHhhhcCCceeeecccccc Q lcl|NC_019527. 312 EGGDVFDRVEMYVNM-QSNLG-LAVMDFDSEDIVQVNTPLS--------GLADLQSQSQEHMCSVSKIPAIKLTGISPSG 381 (516) Q Consensus 312 ~~~~l~~r~~~~~~~-~sn~g-~~~id~~~e~~e~~~~~ls--------gl~d~~~~~~~~iaaas~IP~t~L~G~sp~G 381 (516) ..+.+.++++..... .++.+ ++++++ +.+|+.++.... .+.+.......+||.+.+||..+|-| T Consensus 199 ~~~~~~~~~~~~~~g~~~~~~~i~~l~~-g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~----- 272 (385) T protein:vir:95 199 KQKELQAYIDTLFDAFQNNTIAVVPLTE-GLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLG----- 272 (385) T ss_pred HHHHHHHHHHHHhhhhhhcCCceEEcCC-CceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcC----- Confidence 233344444433222 23344 344554 578888865332 24456667778899999999988832 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcC---CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 382 LNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEID---DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYI 458 (516) Q Consensus 382 lnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~---~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~ 458 (516) -.++.+.....||. ..|.|.+..|...|-+..+.... -.+.|.++.|...|.+++++ ++.+++ T Consensus 273 ~~sn~e~~~~~~~~-------~~l~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~D~~~~~~-------~~~~~~ 338 (385) T protein:vir:95 273 EMADLEKTIESYLQ-------FCINPLLRKIEAELNSKFFYQDEYLNDDMHIKVVGIDKRDPLKLSE-------AIDKLV 338 (385) T ss_pred CCcCHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCChhhcccceEEEechhhhccCHHHHHH-------HHHHHH Confidence 23444444444443 44788888888877665544221 13566666888888887655 556689 Q ss_pred HcCCCCHHHHHHHHHhhhccCCCCCCh-hh-hcccc-ccchhcCCCCCCCCCCCC Q lcl|NC_019527. 459 TNSVIDPSEARQQLSDDPDSGWDNIDG-DL-EIVQP-EMFDDDGADPYMPDPDVL 510 (516) Q Consensus 459 ~~gvi~~~e~r~~l~~~~~~~~~~~d~-~~-e~~~~-e~~~~e~~~~~~~~~~~~ 510 (516) ++|++|++|+|+.+ |+++++. .. +...+ .....+ + ..+++.+++ T Consensus 339 ~~g~lt~NE~R~~~------g~~p~~~~~gd~~~~~~n~~~~~-~-~kgge~~~e 385 (385) T protein:vir:95 339 ASGTFTRNQVRIMT------GEEPADDPELDKFIITKNLQSAD-A-FKGGESNEE 385 (385) T ss_pred hCCCcCHHHHHHHh------CCCCCCCCCCceeeecccceecc-c-ccCCCCCCC Confidence 99999999999998 4444421 11 11000 000000 0 011111111 No 94 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=99.75 E-value=5.9e-19 Score=120.53 Aligned_cols=350 Identities=15% Similarity=0.091 Sum_probs=182.3 Q ss_pred cccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccch-hhh------HHH Q lcl|NC_019527. 74 DSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKA-KEM------ASK 146 (516) Q Consensus 74 ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~-~~~------~~~ 146 (516) =++++....... .....+...+..|+-.+.++.+..+.+||+.+|++..+..+.+--..+.+. .+. ... T Consensus 1 Mg~f~~~~~f~~----~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l 76 (378) T protein:vir:93 1 MNLFGKVVSFSR----GKLNNDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDL 76 (378) T ss_pred Cccchhhhhhhc----cccCCCcceeeecccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccccccccccccccchH Confidence 033333222111 111111222333444555667788999999999999999987633222211 110 111 Q ss_pred HHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccc Q lcl|NC_019527. 147 IKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPT 225 (516) Q Consensus 147 i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~ 225 (516) ...|+.+-...--...|.+.+.+. .++|.|++++..++.. |.+.++. |. T Consensus 77 ~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~--------------g~~~~l~--------~~-------- 126 (378) T protein:vir:93 77 DEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNT--------------GELLDLL--------FA-------- 126 (378) T ss_pred HHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCC--------------ceEEEEE--------ec-------- Confidence 122222223333455666665555 4578888887654321 2221121 11 Q ss_pred cccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCC-ceeeecc Q lcl|NC_019527. 226 APDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSR-TFLKTNM 304 (516) Q Consensus 226 s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~-~v~k~~~ 304 (516) -.++++.++.|+|+.+ +.+...|.|.++.+...+..+.. .... .+++++ T Consensus 127 -----------~~~~~~~~~diih~r~--------~~~~~~~~s~l~~~~~~i~~~~~----------~~~~~g~l~~~- 176 (378) T protein:vir:93 127 -----------DDKKEYKTEELVRLTS--------PFYINEDTSILDNALASIQTKLE----------QGKLRGLLKIN- 176 (378) T ss_pred -----------CCeeEeccceeEEecC--------ccccchhhHHHHHHHHHHHHHHh----------cCcccceeeeC- Confidence 0134677889999974 23445688888877776654322 1111 234443 Q ss_pred hhhhcCccHHHHHHHHHHH-H---HhcCCcceEEEecCCcceeEEecccCCHH-HHHHHHHHHHHhhhcCCceeeecccc Q lcl|NC_019527. 305 AQVLNGGEGGDVFDRVEMY-V---NMQSNLGLAVMDFDSEDIVQVNTPLSGLA-DLQSQSQEHMCSVSKIPAIKLTGISP 379 (516) Q Consensus 305 ~~~l~~~~~~~l~~r~~~~-~---~~~sn~g~~~id~~~e~~e~~~~~lsgl~-d~~~~~~~~iaaas~IP~t~L~G~sp 379 (516) ..+......++.++++.. . ...++.+++++++ +.+|+.++.+....+ +.......+||.+.+||..+|.|. T Consensus 177 -~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~-g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~g~-- 252 (378) T protein:vir:93 177 -AFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDN-KTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGT-- 252 (378) T ss_pred -CcCCHHHHHHHHHHHHHHHHHhhcccccccceEcCC-CceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC-- Confidence 223333333344444332 2 2223335666765 578998877655433 345667789999999999887441 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc----------CCcceEEeCCCCCCCHHHHHHHHHH Q lcl|NC_019527. 380 SGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI----------DDAITFKFKSLWQTSAKEESEIRFN 449 (516) Q Consensus 380 ~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~----------~~d~~~~f~pL~~~sekEkAei~~~ 449 (516) ..+....+|| ...|.|.+..+-..|-+..+... ..++.|.++.|...|.+++++ T Consensus 253 -----~~e~~~~~f~-------~~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~---- 316 (378) T protein:vir:93 253 -----ATQEQQIYFY-------NSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELID---- 316 (378) T ss_pred -----cHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHHHH---- Confidence 2233333333 34578888888777765444211 124678888899998888755 Q ss_pred HHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc--ccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 450 KAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP--EMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 450 ~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~--e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) ++.+++++|++|++|+|+.++.. +++.-.+...+ -.+.+.-.+......+..++.|+. T Consensus 317 ---~~~~~~~~G~~t~NE~R~~~gl~------p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~ 376 (378) T protein:vir:93 317 ---LYHENINGPIFTQNQLLVKMGEQ------PIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDETN 376 (378) T ss_pred ---HHHHHHhCCCcCHHHHHHHhCCC------CCCCCCeeeeccccccccchhhhcCccCCCCCCCCCC Confidence 45668999999999999998544 33211110000 000000000011111122222222 No 95 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=99.75 E-value=2.9e-17 Score=111.25 Aligned_cols=435 Identities=12% Similarity=0.066 Sum_probs=195.7 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccc----hhccccc Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTP----AVAMDSL 76 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~----~~a~ds~ 76 (516) |=.= .+.-.+.+-.+.+ +|+.++...+ .+..|.+ T Consensus 1 ~~~~---------------------------------------~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~ 38 (651) T protein:vir:99 1 MTDT---------------------------------------TGETQETKVHVEG---LGGEADLAKSPNSTQIPDHRI 38 (651) T ss_pred CCCc---------------------------------------cceeeeeEEEeec---ccccccccccccccccchhhh Confidence 0000 0011111111111 2222221110 1211111 Q ss_pred ccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccch-hhhHHHHHHHHHHHH Q lcl|NC_019527. 77 CGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKA-KEMASKIKELEEACE 155 (516) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~-~~~~~~i~~i~~~~~ 155 (516) .. ..+... .-+.+ -+|+.+..+|++.++||++.+++...-||++.-..+-+. +...++.++.+..++ T Consensus 39 ~~--------~~~~~~-p~~~~---~~L~~~~e~~~~~~~~i~~~~~~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 106 (651) T protein:vir:99 39 QS--------HNVGVN-PPYNP---DRLAAFLELNETLATGIRKKSRYEVGFGFDLVPAQGVDGDDASDAQREVARNFWR 106 (651) T ss_pred cc--------cCCCCC-CCCCH---HHHHHHHhcChHHHHHHHHHhhhhhccCceeeecccCCCCccchHHHHHHHHHhh Confidence 10 001110 00011 145555567999999999999999999999865332221 112222333333332 Q ss_pred h-----------cChh----HHHHHHHHhcccceeeEEEEEecCCCcccCcc---ccccccc--ccc------------- Q lcl|NC_019527. 156 Y-----------YGVM----GIIQKAAEHDCFFGRGQISINIKGADVSVPLI---LDPRTIK--KGS------------- 202 (516) Q Consensus 156 ~-----------l~~~----~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~---ld~~~I~--~g~------------- 202 (516) . ++.. ..+..++......|.+++=+..++. ..|+. +++..++ +.. T Consensus 107 ~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~--g~pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~ 184 (651) T protein:vir:99 107 GRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIE--GRPVGLAYVPARTVRVRRPQNRFDQPRHPEEGR 184 (651) T ss_pred ccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCc--cchhhhhhcChhheeeecccccccchhhhhhhc Confidence 2 1221 2233333333455666553322221 01110 0000000 000 Q ss_pred ---------------------eeeEEee-------------cceeeccccc-cccccccccccCcceeEEe------eeE Q lcl|NC_019527. 203 ---------------------LTGFSNI-------------EPMWTSPSAY-NALDPTAPDFYKPSTWWVL------GRE 241 (516) Q Consensus 203 ---------------------l~~l~v~-------------d~~~v~p~~~-~~~dp~s~~yg~P~~y~v~------g~~ 241 (516) ..++.++ .+..+..... .......+.+..+..+.+. ... T Consensus 185 ~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~ 264 (651) T protein:vir:99 185 YVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIRYREDEESEREPIFVDRETGDVTTGDANGLEN 264 (651) T ss_pred ccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEEeccCcceeeeeecccceeeeEEEcCCCceeE Confidence 0000000 0000000000 0000011122223333221 235 Q ss_pred eccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHH Q lcl|NC_019527. 242 MHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDR 319 (516) Q Consensus 242 iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r 319 (516) +.++.||||.... +.....|+|.++.+...|.....+......++.+.... ++++.. ..++.+..+++.+. T Consensus 265 ~~~~eViHir~~~------~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG~~p~gil~~~~-~~ls~e~~~~lr~~ 337 (651) T protein:vir:99 265 RPANELIFIPNPS------ILEDDYGVPDWVSAIRTISADEAAKDYNRDFFDNDTIPRMVIKVTG-GELSEESKRDLRQM 337 (651) T ss_pred ecccceEEecCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHHHHHH Confidence 6788999997532 23456799999999999999999999999998886653 344321 22433333344444 Q ss_pred HHHHHHhcCCcce-EEEecC----------CcceeEEecccC---CHHHHHHHHHHHHHhhhcCCceeeecccccccccc Q lcl|NC_019527. 320 VEMYVNMQSNLGL-AVMDFD----------SEDIVQVNTPLS---GLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNAS 385 (516) Q Consensus 320 ~~~~~~~~sn~g~-~~id~~----------~e~~e~~~~~ls---gl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnat 385 (516) |+...+|.|. +++..+ +-+|+.++...+ .+-+........||++.+||-.+| |...++-.|+ T Consensus 338 ---~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~eIa~afgVPp~~l-G~~~~~~~sn 413 (651) T protein:vir:99 338 ---LNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHEIAKVLEVPPVKI-GVTDSANRSN 413 (651) T ss_pred ---HHHHhccCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHHHHHHHHHhCCCHHHh-ccCCCCCccc Confidence 4444455554 444321 234555543332 233445667788999999997555 7665555566 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cCC--cceEEeCC--CCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 386 SEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE--IDD--AITFKFKS--LWQTSAKEESEIRFNKAQEAQIYIT 459 (516) Q Consensus 386 ge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~--~~~--d~~~~f~p--L~~~sekEkAei~~~~a~a~~~~~~ 459 (516) .|.....|+.. .|.|.+..+-..|-+..+.. ... .+.|+|+. |...+.+.+ ++++..+++ T Consensus 414 ~E~~~~~f~~~-------tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef~~~~llr~D~~~~-------~e~~~~~i~ 479 (651) T protein:vir:99 414 SDQQDKDFALE-------VIQPEQHTFAEWLYQIIHQQALGVTDWTIEYELRGADQPKQEAQLA-------EQRVRAMRL 479 (651) T ss_pred HHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCccccccCceEEEEeccchhhhccHHHH-------HHHHHHHHh Confidence 66666666443 47788887777666544321 112 34566654 665565554 556677899 Q ss_pred cCCCCHHHHHHHHHhhhccC-CCCCChh-hhccccccchhcCCCCC---CCCCCCCCCCCC-C Q lcl|NC_019527. 460 NSVIDPSEARQQLSDDPDSG-WDNIDGD-LEIVQPEMFDDDGADPY---MPDPDVLPGEEG-S 516 (516) Q Consensus 460 ~gvi~~~e~r~~l~~~~~~~-~~~~d~~-~e~~~~e~~~~e~~~~~---~~~~~~~~~~e~-t 516 (516) +|++|++|+|+.++..+..+ +.+.... ............++.+. .+++......+. + T Consensus 480 ~G~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~gge~~~~~~~~~~~~~~~~e~~~ 542 (651) T protein:vir:99 480 AGVGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAGGGETEAVHEPPEENKIGEREWDT 542 (651) T ss_pred CCCcCHHHHHHHhCCCCCCCccccccccccccccccccccCCCCcccccCccccccccchhhh Confidence 99999999999985433210 1000000 00000000000000000 000001111111 0 No 96 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=99.74 E-value=1.6e-17 Score=112.65 Aligned_cols=373 Identities=9% Similarity=0.006 Sum_probs=169.2 Q ss_pred cccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHH Q lcl|NC_019527. 74 DSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEA 153 (516) Q Consensus 74 ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~ 153 (516) =|.+.-. ..-+... ........+..+-....|.++..+.++|+.+|+++-+..+.+...+..... .......|... T Consensus 1 Mgl~d~~-~~~~~~~--~~~~~~~~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~~~~-~~~~~~lL~~~ 76 (395) T protein:vir:96 1 MGILDFF-SFKKSGT--LSDDDSGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKLTEN-QKDWLYWINTK 76 (395) T ss_pred Ccchhhh-cCCCCcc--ccccccccchhhhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCccccc-cchHHHHHhhc Confidence 0111110 0000000 000111112222233455677888999999999999999988654332211 11122223333 Q ss_pred HHhcChhHHHHHHHHhcc-cceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCc Q lcl|NC_019527. 154 CEYYGVMGIIQKAAEHDC-FFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKP 232 (516) Q Consensus 154 ~~~l~~~~~l~ea~~~~r-lyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P 232 (516) .........|.+.+.+.. ++|.+++++.-+..-. +. + .+ .+. ..+.|..+. ..... T Consensus 77 PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~~--~~--~-------~~---~~~--~~~~~~~~~--~v~~~----- 133 (395) T protein:vir:96 77 ANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGIY--VA--D-------AF---TQD--KKLSGNKFK--VSRVQ----- 133 (395) T ss_pred CCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCcee--cC--C-------cc---ccc--cccccceee--eeeec----- Confidence 333344556666655554 4677887764332211 00 0 00 000 001111000 00000 Q ss_pred ceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHH---H---HHHHHHHHHHHHhCCceeeecchh Q lcl|NC_019527. 233 STWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENW---L---RTRQSVSDLVDKFSRTFLKTNMAQ 306 (516) Q Consensus 233 ~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~---~---~~~~~~~~Ll~~~~~~v~k~~~~~ 306 (516) .|.+ .+.+.++.|+||.....+.. ..+.++++..-+.+... . .+...................... T Consensus 134 -~~~~-~~~~~~~dvih~k~~~~~~~------~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 205 (395) T protein:vir:96 134 -GQTY-EKIFTFDQVIYLKNDNSDLM------LKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQENSDG 205 (395) T ss_pred -ccee-eeEeccCceEEecccCCccc------cccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccccceeeccCc Confidence 0111 34578899999975432111 11222222211111111 1 111111122222211111111111 Q ss_pred hhcCccHHHHHHHHHHHH-HhcCCcceEE-EecCCcceeEEecccCCH--------HHHHHHHHHHHHhhhcCCceeeec Q lcl|NC_019527. 307 VLNGGEGGDVFDRVEMYV-NMQSNLGLAV-MDFDSEDIVQVNTPLSGL--------ADLQSQSQEHMCSVSKIPAIKLTG 376 (516) Q Consensus 307 ~l~~~~~~~l~~r~~~~~-~~~sn~g~~~-id~~~e~~e~~~~~lsgl--------~d~~~~~~~~iaaas~IP~t~L~G 376 (516) ....+...+.++.+. ....+.+..+ +++ +-+++.++.+..+. .++.....++||.+.|||..+|-| T Consensus 206 ---~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~-g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~~ 281 (395) T protein:vir:96 206 ---GRQPKSDKDFFKRTIEKIRTESVVGIPVTA-NTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHG 281 (395) T ss_pred ---hhhHHHHHHHHHHHHHHhhcCCcceEEccC-CceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC Confidence 111233344444433 2233444333 443 46788777654432 222334457899999999987732 Q ss_pred cccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cCCcceEEeCCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_019527. 377 ISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE--IDDAITFKFKSLWQTSAKEESEIRFNKAQEA 454 (516) Q Consensus 377 ~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~--~~~d~~~~f~pL~~~sekEkAei~~~~a~a~ 454 (516) -.++.+.....||. ..|.|.+..+-..|-+..+.. ...++.|.|+.|...|.+++++.. T Consensus 282 -----~~sn~e~~~~~f~~-------~~L~P~~~~ie~~l~~~Ll~~~e~~~~~~f~~~~l~~~d~~~~~~~~------- 342 (395) T protein:vir:96 282 -----DIADNQKNYELLLE-------GPIESLITNIVDGLEYAIFDKSETLEGSFIKVTGLKNYDLFSISSQA------- 342 (395) T ss_pred -----CCccHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCChhhhcCceeEeecchhccCHHHHHHHH------- Confidence 22344544555554 447888888888777655532 234678999999988888876654 Q ss_pred HHHHHcCCCCHHHHHHHHHhhhccCCCCCCh--hhhccccc-cchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 455 QIYITNSVIDPSEARQQLSDDPDSGWDNIDG--DLEIVQPE-MFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 455 ~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~--~~e~~~~e-~~~~e~~~~~~~~~~~~~~~e~t 516 (516) ..++++|++|++|+|+.+. +++++. ..+...+. ....+ ..+++ ..++.+| T Consensus 343 ~~~~~~G~~T~NE~R~~~g------l~pi~~~~gD~~~~~~N~~~~~---~~gge---~~~~~~~ 395 (395) T protein:vir:96 343 DKLISSGFVFIDEVREEIG------LPELPDGLGKVLYMTKNYESVL---ERGGE---VDEEVET 395 (395) T ss_pred HHHHhCCCcCHHHHHHHhC------CCCCCCCCCceeeecccceech---hccCC---CCCCCCC Confidence 5589999999999999984 444422 11111110 00000 01111 1111111 No 97 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=99.72 E-value=4.4e-18 Score=115.73 Aligned_cols=366 Identities=10% Similarity=-0.018 Sum_probs=181.9 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTL 121 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~ 121 (516) |.-=.........|+ ...++..+...... ..+ ..+. -|-....+.+++.+.+||+.+ T Consensus 1 Mglf~~~~~~~~~~~--------------~~~~~~~~~~~~~~------~~~--~~~~-~~v~~~~al~~~~V~~~i~~I 57 (384) T protein:vir:49 1 MPIFNITNLATESPP--------------SNQDSFFDITDPEF------LDA--LNGS-EWVSAETALKNSDLFSIISQL 57 (384) T ss_pred CccccccccCccccc--------------ccchhhccccchhh------ccc--ccCC-ceechhhhhccHHHHHHHHHH Confidence 211000000000111 10111100000000 000 0000 011123345678899999999 Q ss_pred hHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccc Q lcl|NC_019527. 122 STELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKK 200 (516) Q Consensus 122 aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~ 200 (516) ++++-+..+.+.-... ..|...-..+.....|.+.+... .++|.+++++..+. . T Consensus 58 a~~ia~l~~~~~~~~~----------~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~---------------~ 112 (384) T protein:vir:49 58 SNDLATAKITTSRKQL----------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNE---------------N 112 (384) T ss_pred HHHHhhCceeeecchh----------hhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECC---------------C Confidence 9999999888753221 23444455555677777777766 45788888775432 1 Q ss_pred cceeeEEeecceeeccccccccccccccccCcceeEEe--------eeEeccceEEEecCCcchhhhhhccCCCCchHHH Q lcl|NC_019527. 201 GSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL--------GREMHASRLLTIITRPLPDMLKPAYNFSGISMSQ 272 (516) Q Consensus 201 g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~--------g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le 272 (516) |.+.+|.+++|.+|++..... .+ ..+|++. .+.++++.|||+.+.. +.....|.|.++ T Consensus 113 g~~~~L~~l~~~~v~v~~~~~-------~~-~~~y~~~~~~~~~~~~~~~~~~eVih~~~~~------~~~~~~G~s~i~ 178 (384) T protein:vir:49 113 GRDMKWEYLRPSQVSFNRLDN-------QN-GLYYNITFDDPRIPPKQHVPQGDILHFRLLS------VDGGLTSVSPLM 178 (384) T ss_pred CcEEEEEEEcCceeEEEEcCC-------Cc-eEEEEEEecCccccceeEecCccEEEecCCC------CCCceeeccHHH Confidence 345678889988887643211 11 2234442 1468899999997643 222357999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccC Q lcl|NC_019527. 273 LAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLS 350 (516) Q Consensus 273 ~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~ls 350 (516) .+.+.|.....+......++.+.... +++++ ..+.. ++..+..........|.+-.++..++.+|+.++.+.. T Consensus 179 ~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~--~~~~~---~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~ 253 (384) T protein:vir:49 179 ALGRELNIQKASDKLTLNALKNALNANGILKIK--GGGLL---DFKTKQSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSN 253 (384) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC--CCCCh---HHHHHHHHHHHhcccCCccceecCCCceEEEccCChh Confidence 99999999999999999999986654 34443 22222 1222333333444556554444445688988876655 Q ss_pred C--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHhCCCc Q lcl|NC_019527. 351 G--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSY----YFSPLDTMLKVIQLSKWGEI 424 (516) Q Consensus 351 g--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~----l~p~l~~l~~~l~~s~~g~~ 424 (516) . +.+......++||.+.+||..+|.+ .. +-.++.+ .++...... ++|+++.+-..+....+ T Consensus 254 d~q~~e~~~~~~~~Ia~~fgVp~~~lg~-~~-~~~~~~~--------~~~~~~~~~i~~~l~pi~~~i~~~l~~~l~--- 320 (384) T protein:vir:49 254 VAQLLSQADWTTGQFAKVYGIPESVVGG-EG-DKQSSLE--------MIYNIYFKAVSRFLRPFVSELSKKLSCEVD--- 320 (384) T ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHhCC-CC-CccccHH--------HHHHHHHHHHHHHHHHHHHHHHHHhchhhh--- Confidence 4 3466778889999999999876644 32 2223332 222233333 44444444443322110 Q ss_pred CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCCC-CC Q lcl|NC_019527. 425 DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGAD-PY 503 (516) Q Consensus 425 ~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~~-~~ 503 (516) +.. +..++.+-..... -...++.+|+.+.+|+|+.+...+ +.. + |..+.++.+ -+ T Consensus 321 -----~~~------~~~~~~~~~~~~~-~~~~l~~~~~~t~~e~~~~l~~~g---~~~-n--------e~r~~~~~~p~~ 376 (384) T protein:vir:49 321 -----ADI------LPAVDPTGSNYIG-LINSMVKTGTLAQNQGLYVLQQAE---ILP-K--------DLPEGETDSTLK 376 (384) T ss_pred -----hhh------hhhhhccchHHHH-HHHHHhhcCcccHHHHHHHHhhCC---CCC-h--------hHHHHcCCCCCC Confidence 000 0000000000111 112345555666666655543321 111 0 011111111 11 Q ss_pred CCCCCCCC Q lcl|NC_019527. 504 MPDPDVLP 511 (516) Q Consensus 504 ~~~~~~~~ 511 (516) +++.+++= T Consensus 377 gGd~~~~~ 384 (384) T protein:vir:49 377 GGETNEQY 384 (384) T ss_pred CCCCCCCC Confidence 11111111 No 98 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=99.71 E-value=2.9e-17 Score=111.28 Aligned_cols=362 Identities=8% Similarity=0.033 Sum_probs=177.1 Q ss_pred cccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHH Q lcl|NC_019527. 74 DSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEA 153 (516) Q Consensus 74 ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~ 153 (516) =|+++..++.........+ ...|.. -....|.+++.+.++|+.+++++.+..+.+...+.... ......|... T Consensus 1 Mg~f~~l~~~~~~~~~~~~---~~~~~~-~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~---~~l~~ll~~~ 73 (376) T protein:vir:78 1 MGFFSELFKRNKEIEWMWD---LDFLED-KTTKVYLKKMALNTCVKHIARTIAKSDFRLKNGETSVR---DKLYYKLNIR 73 (376) T ss_pred CchhhhhhccCCccccccc---hhhccc-cchhhhhhhHHHHHHHHHHHHhhcccceeecccccccc---chHHHHHhhc Confidence 0222211111100000000 001100 01234556788999999999999999988854332211 1223334445 Q ss_pred HHhcChhHHHHHHHHhccc-ceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCc Q lcl|NC_019527. 154 CEYYGVMGIIQKAAEHDCF-FGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKP 232 (516) Q Consensus 154 ~~~l~~~~~l~ea~~~~rl-yG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P 232 (516) -.....+..|.+.+.+..+ +|.+++++..++.-. +..+.++.+..+.+..+. .....+|+. T Consensus 74 PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~---------------~~~~~~~~~~~~~~~~~~--~~~~~~~~~- 135 (376) T protein:vir:78 74 PNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFL---------------IADSYVRKEFAFFPDVFE--GVTVKDYRY- 135 (376) T ss_pred cccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCee---------------eccceeecccceeeeeee--eeeeeccee- Confidence 5556677777777776655 577777765444211 111222222222221110 111111111 Q ss_pred ceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHH-HHHHhCC-ceeeecchhhhcC Q lcl|NC_019527. 233 STWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSD-LVDKFSR-TFLKTNMAQVLNG 310 (516) Q Consensus 233 ~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~-Ll~~~~~-~v~k~~~~~~l~~ 310 (516) .+.+.++.|+||.....| |.+.+..+...... +...+.. ..++.+. ..+++.....++. T Consensus 136 ------~~~~~~~evih~~~~~~~----------~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 196 (376) T protein:vir:78 136 ------NRNFSMDDVIFLEYGNER----------LSAFTDGMFEDYGE---LFGKMIRAQMRNFQIRGAVNFKMAGVADK 196 (376) T ss_pred ------eeeeccccEEEeccCCCC----------chhhhhHHHHHHHH---HHHHHHHHHHhcCCCceeEEEccCCCCCH Confidence 234678899998643221 22222222222211 1111111 2223232 2333433344444 Q ss_pred ccHHHHHHHHHHHHHhcCCcc--eEEEecCCcceeEEecccCC-------HHHHHHHHHHHHHhhhcCCceeeecccccc Q lcl|NC_019527. 311 GEGGDVFDRVEMYVNMQSNLG--LAVMDFDSEDIVQVNTPLSG-------LADLQSQSQEHMCSVSKIPAIKLTGISPSG 381 (516) Q Consensus 311 ~~~~~l~~r~~~~~~~~sn~g--~~~id~~~e~~e~~~~~lsg-------l~d~~~~~~~~iaaas~IP~t~L~G~sp~G 381 (516) ...+.+.++++.......+.+ ++++++ +-+++.++.+... +-+........||.+.+||..+|-| T Consensus 197 e~~~~~~~~~~~~~~g~~~~~~~v~~l~~-g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~----- 270 (376) T protein:vir:78 197 DKQTKLQEYIDKVYASFNNNEIAIVPQLE-GFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHG----- 270 (376) T ss_pred HHHHHHHHHHHHHhccccccCcceEEcCC-CceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCC----- Confidence 333445555544433322333 333544 5788888876643 3444556677899999999987733 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_019527. 382 LNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNS 461 (516) Q Consensus 382 lnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g 461 (516) -.++-+.....||. ..|.|.+..+-..+-+..++...-.+.|.+..|...|.+++ ++++.+++++| T Consensus 271 ~~s~~e~~~~~f~~-------~~l~P~~~~ie~~l~~kll~~~~~~~~~~~~~ll~~d~~~~-------~~~~~~~~~~G 336 (376) T protein:vir:78 271 DMADLSNNMKAYME-------YCIDPLTKKLEDELNAKLFTFSEFLAGEHIKIIHKKDIIEN-------AEAVDKLVASG 336 (376) T ss_pred CCCCHHHHHHHHHH-------HHHHHHHHHHHHHHHhhhCCcccceecccchhhcccCHHHH-------HHHHHHHHhCC Confidence 22333444444443 34788888888888776665422234455566777777765 55667789999 Q ss_pred CCCHHHHHHHHHhhhccCCCCCChh--hhccccccchhcCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 462 VIDPSEARQQLSDDPDSGWDNIDGD--LEIVQPEMFDDDGADPYMPDPDVLPGEEG 515 (516) Q Consensus 462 vi~~~e~r~~l~~~~~~~~~~~d~~--~e~~~~e~~~~e~~~~~~~~~~~~~~~e~ 515 (516) ++|++|+|+.+ |+++++.. .+...+-. -.+-+..+++| T Consensus 337 ~~t~NE~R~~l------g~~p~~~g~~d~~~~~~n----------~~~~~~~~e~g 376 (376) T protein:vir:78 337 SFNRNEVRELL------GAERVDNPELDKYLITKN----------YQSADEGGEDG 376 (376) T ss_pred CcCHHHHHHHh------CCCCCCCCCCceeeeccC----------ceehhccccCC Confidence 99999999998 44444321 01110000 00011112222 No 99 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=99.70 E-value=1.3e-17 Score=113.14 Aligned_cols=352 Identities=14% Similarity=0.079 Sum_probs=172.4 Q ss_pred cccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccc-hh------hhHHH Q lcl|NC_019527. 74 DSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTK-AK------EMASK 146 (516) Q Consensus 74 ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~-~~------~~~~~ 146 (516) =+.++-...+.. ..... +..+...|.-....+.+..+.+||+.+|+++.+..+.+--....+ .. ..... T Consensus 1 M~if~~~~~~~~-~~~~~---~~~~~~~~~~~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l 76 (378) T protein:vir:94 1 MNLFGKVVSFSR-GKLNN---DTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDL 76 (378) T ss_pred CchhHHhHhhhh-ccccc---CcceeeeeecchhhhhhHHHHHHHHHHHHhHhhCceeeeeecccccccccccccccchH Confidence 011111111000 00000 000111111112233455688999999999999888652221111 10 01112 Q ss_pred HHHHHHHHHhcChhHHHHHHHHhcc-cceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccc Q lcl|NC_019527. 147 IKELEEACEYYGVMGIIQKAAEHDC-FFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPT 225 (516) Q Consensus 147 i~~i~~~~~~l~~~~~l~ea~~~~r-lyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~ 225 (516) ...|+.+-........|.+.+.+.. +.|.|+++.+.++. .|.+.++. T Consensus 77 ~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~--------------~g~~~~~~------------------ 124 (378) T protein:vir:94 77 DEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSE--------------TGELLDLL------------------ 124 (378) T ss_pred HHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCC--------------CCcEEEEE------------------ Confidence 2223333333445667777666665 56888887544321 12222211 Q ss_pred cccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCC-ceeeecc Q lcl|NC_019527. 226 APDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSR-TFLKTNM 304 (516) Q Consensus 226 s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~-~v~k~~~ 304 (516) |...++.+.++.|+|+.. +.+..-+.+.++.+...+....+ ..+. .+++++ T Consensus 125 ---------~~~~~~~~~~~dvih~~~--------~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~g~l~~~- 176 (378) T protein:vir:94 125 ---------FANDKKEYKPEELVRLTS--------PFYINEDTSILDNALASIQTKLE----------QGKLRGLLKIN- 176 (378) T ss_pred ---------EecCcEEechhceeeecC--------cCCcccchhHHHHHHHHHHHHHh----------hCCcccceeeC- Confidence 112345678889999863 22233456666666554433221 1121 233432 Q ss_pred hhhhcCccHHHHHHHHHH-HH---HhcCCcceEEEecCCcceeEEecccCCHH-HHHHHHHHHHHhhhcCCceeeecccc Q lcl|NC_019527. 305 AQVLNGGEGGDVFDRVEM-YV---NMQSNLGLAVMDFDSEDIVQVNTPLSGLA-DLQSQSQEHMCSVSKIPAIKLTGISP 379 (516) Q Consensus 305 ~~~l~~~~~~~l~~r~~~-~~---~~~sn~g~~~id~~~e~~e~~~~~lsgl~-d~~~~~~~~iaaas~IP~t~L~G~sp 379 (516) ..++.+...++.++++. +. ...++.+++++++ +.+|+.++.+...++ +.+.+...+||.+.|||..+|.|. T Consensus 177 -~~l~~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~~-g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgvPp~~l~g~-- 252 (378) T protein:vir:94 177 -AFLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDN-KTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGT-- 252 (378) T ss_pred -CcCCHHHHHHHHHHHHHHHHHhhcccccccceeccC-CceEEEccCChHHhhHHHHHHHHHHHHHHhCCCHHHhcCC-- Confidence 23333322334444433 22 2223345667765 588999987765443 334566789999999998888542 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC----------cCCcceEEeCCCCCCCHHHHHHHHHH Q lcl|NC_019527. 380 SGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE----------IDDAITFKFKSLWQTSAKEESEIRFN 449 (516) Q Consensus 380 ~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~----------~~~d~~~~f~pL~~~sekEkAei~~~ 449 (516) ..+.....||. ..|.|.+..+-..|-+..+-. ...++.|.++.|...|.+++++. T Consensus 253 -----~~e~~~~~f~~-------~tl~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~--- 317 (378) T protein:vir:94 253 -----ATQEQQIYFYN-------STIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDL--- 317 (378) T ss_pred -----chHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHH--- Confidence 12333344443 357888888877776544311 11246788889999999887664 Q ss_pred HHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc--ccchhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 450 KAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP--EMFDDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 450 ~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~--e~~~~e~~~~~~~~~~~~~~~e 514 (516) +.+++++|++|++|+|+.+...+..+...+-.. ....+ ...+.+..... ..++++.+.+ T Consensus 318 ----~~~~~~~G~~t~NE~R~~~g~~p~~ggd~~~~~-~n~~~~~~~~~~~~~~~~-~~~~~e~~n~ 378 (378) T protein:vir:94 318 ----YHENINGPIFTQNQLLVKMGEQPIEGGDVYIAN-LNAVAVKNLSDLQGNRKD-VTSTDETNNQ 378 (378) T ss_pred ----HHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeec-ccccchhcchhcccccCC-CCCCCCCCCC Confidence 466899999999999999855443221100000 00000 01111111100 0111111111 No 100 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=99.69 E-value=2.1e-16 Score=106.55 Aligned_cols=377 Identities=9% Similarity=-0.028 Sum_probs=173.0 Q ss_pred CCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccc Q lcl|NC_019527. 60 PGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTK 139 (516) Q Consensus 60 ~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~ 139 (516) =|+ ...+- +.++....+ .....+.....+ .+.....|.++..+.++|+.+++++.+..+.+...++.. T Consensus 1 Mg~-----~~~~~--~~~~~~~~~----~~~~~~~~~~~~-~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~~~ 68 (395) T protein:vir:40 1 MGF-----KSWVS--GFFNEEQRT----LNLTDTVWCSIP-SEKLKELSIKKWAIDSCANKIANTLSCAEVLTYEKGEEV 68 (395) T ss_pred Cch-----HHHHH--hhhcccccc----cccccchhhccc-cccchhhhhhhHHHHHHHHHHHHHHhhCceeeccCCccc Confidence 010 00000 000100000 011111111111 111223456778899999999999999888875433221 Q ss_pred hhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeecccc Q lcl|NC_019527. 140 AKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSA 218 (516) Q Consensus 140 ~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~ 218 (516) .......|...-..+.....|.+++... .|+|.|++++.-+ +.+ -|. . +.+.. ..+.+.. T Consensus 69 ---~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~-~~~-~~~-----~--------~~~~~-~~~~~~~ 129 (395) T protein:vir:40 69 ---RKKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDE-YIY-VAD-----S--------FTKND-KSLYENT 129 (395) T ss_pred ---cchHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecC-cee-ecC-----C--------ccccc-cccccce Confidence 1122222333333444456676665555 5578888776322 110 000 0 00000 0000000 Q ss_pred ccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCC- Q lcl|NC_019527. 219 YNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSR- 297 (516) Q Consensus 219 ~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~- 297 (516) + ..... ..|.+ .+.+.+++|+||..... ....++.++++.+...+... .....+..+. T Consensus 130 ~--~~v~~------~~~~~-~~~~~~~evih~r~~~~------~~~~~~~~l~~~~~~~~~~~------~~~~~~~~~~~ 188 (395) T protein:vir:40 130 Y--TEVTL------KDLTL-KKEFKESEVLHLTLNNE------SIKSIIDGFYLLYGDLLTAA------VNKYKKLNSRK 188 (395) T ss_pred e--eeeee------cCcee-eeeeccccEEEeecCCC------CccccchhHHHHHHHHHHHH------HHHHHhcCCCC Confidence 0 00000 01111 24577889999863221 11123334444333332221 1222223222 Q ss_pred ceeeecchhhhcCccHHHHHHHHHHHH-HhcCCcc-eEEEecCCcceeEEecccCCHH--HHH---HHHHHHHHhhhcCC Q lcl|NC_019527. 298 TFLKTNMAQVLNGGEGGDVFDRVEMYV-NMQSNLG-LAVMDFDSEDIVQVNTPLSGLA--DLQ---SQSQEHMCSVSKIP 370 (516) Q Consensus 298 ~v~k~~~~~~l~~~~~~~l~~r~~~~~-~~~sn~g-~~~id~~~e~~e~~~~~lsgl~--d~~---~~~~~~iaaas~IP 370 (516) .+++.+....++....+++.+.++... ...+|.+ +++++. +-+|+.++.+..... ++- +....+||.+.+|| T Consensus 189 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~-g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVP 267 (395) T protein:vir:40 189 IIVKLKAMFGQTPEAEEKLRLMLSERMKKFLAEGDSALPVED-GMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIP 267 (395) T ss_pred ceEEEecccCCCHHHHHHHHHHHHHHHHHhhccCCceeecCC-CceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCC Confidence 334443333343333344555554332 2234444 455554 578999987766532 222 22346899999999 Q ss_pred ceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc--CCc--ceEEeCCCCCCCHHHHHHH Q lcl|NC_019527. 371 AIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI--DDA--ITFKFKSLWQTSAKEESEI 446 (516) Q Consensus 371 ~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~--~~d--~~~~f~pL~~~sekEkAei 446 (516) ..+|-| -.++.+.....|| +..|.|.++.|-+.|-+..+... ..+ ++|.+..|...|.+++++ T Consensus 268 p~~l~~-----~~sn~e~~~~~f~-------~~~L~P~~~~ie~~l~~kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~- 334 (395) T protein:vir:40 268 LGLAKG-----DTVGLSEQVNSFL-------MFSINPIAEMFTDEGNRKFYGRDSVLERTYMKLDTTRIKVQDIQEIAS- 334 (395) T ss_pred HHHhcC-----CCcCHHHHHHHHH-------HHHHHHHHHHHHHHHHHhcCChhhhcCCceEEEechhhhccCHHHHHH- Confidence 987732 2233343333333 34578888888887777665432 223 445556888888888765 Q ss_pred HHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCCh-hhh-cccc-c-cc-hhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 447 RFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDG-DLE-IVQP-E-MF-DDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 447 ~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~-~~e-~~~~-e-~~-~~e~~~~~~~~~~~~~~~e~t 516 (516) ++.+++++|++|++|+|+.++ +++++. +.+ ...+ . .+ +..+....+++.++. ...+ T Consensus 335 ------~~~~~~~~G~~t~NE~R~~~g------~~pi~~~~gD~~~~~~n~~~~~~~~~~~kgge~~~~--~~~~ 395 (395) T protein:vir:40 335 ------SMDVLFHIGVNTIDDNLRMIG------REPVMSPETQERFVTKNYAPLGENEEDLKGGDINEN--KGDS 395 (395) T ss_pred ------HHHHHHhCCCCCHHHHHHHhC------CCCCCCCCCceeeeccccccccccccccCCCCCCCC--cCCC Confidence 456689999999999999984 444421 110 0000 0 00 000111112221111 1111 No 101 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=99.69 E-value=2.8e-17 Score=111.32 Aligned_cols=347 Identities=13% Similarity=0.083 Sum_probs=182.5 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTL 121 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~ 121 (516) |.-- -+|.. +.....+.. .......+. ..+........++ +++-+..+|+.+ T Consensus 1 M~~~-----~~f~~------------r~~~~~~~~-~~~~~~~~~---------~~~~~~v~~~~al-~~~av~~cv~~i 52 (359) T protein:vir:10 1 MSIL-----NPFER------------RSSITPNNY-YPFMVQNGS---------IVPNSLVDATEAL-KNSDLYAVTSLI 52 (359) T ss_pred Cccc-----chhhc------------cccCCCCcc-hhhhhcccc---------ccCCcccCHHHhh-cchHHHHHHHHH Confidence 1000 00100 000000000 000000000 0000001111222 345567899999 Q ss_pred hHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccc Q lcl|NC_019527. 122 STELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKK 200 (516) Q Consensus 122 aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~ 200 (516) ++++-+..+. ++ .....|...-....-...|.+.+... .++|.+++++.-++ . T Consensus 53 a~~ia~~p~~----~~-------~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~---------------~ 106 (359) T protein:vir:10 53 SSDIAGTRFI----GN-------QVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGD---------------N 106 (359) T ss_pred HHhhhcCccc----cc-------hHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECC---------------C Confidence 9988766552 11 12344544445455666676666654 56788888775322 1 Q ss_pred cceeeEEeecceeeccccccccccccccccCcceeEE------eeeEeccceEEEecCCcchhhhhhccCCCCchHHHHH Q lcl|NC_019527. 201 GSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV------LGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLA 274 (516) Q Consensus 201 g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v------~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~ 274 (516) |.+..+.++.+.++++... .+. -.|+| ....++++.|+||.....+. .+.....|.|.++.+ T Consensus 107 g~~~~l~~l~~~~v~i~~~--~~~--------~~y~~~~~~~~~~~~~~~~evih~~~~~~~~--~~~dg~~G~spi~~~ 174 (359) T protein:vir:10 107 SLMKELRLIPSNAITIDLT--DDT--------LTYEVNQFDDYPSAKYNASEMIHVKIMAYGV--DTLHNLVGHSPLESL 174 (359) T ss_pred CeEEEEEEeCCceEEEEEc--CCe--------EEEEEEecCCceEEEEcccceEEeccCCCCC--CccCccccccHHHHH Confidence 3345577777777664221 111 12222 24578999999997643211 122445799999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEecccCC Q lcl|NC_019527. 275 QPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVNTPLSG 351 (516) Q Consensus 275 ~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~~~lsg 351 (516) .+.+.....+......++.+...+ +++++. ..++.+..+.+.++++..... .|.| ..+++ ++.+|+.++.+... T Consensus 175 ~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~~l~~e~~~~~~~~~~~~~~~-~n~g~~~vl~-~g~~~~~l~~~~~d 251 (359) T protein:vir:10 175 TSEIGQQKEANRLSLSTLKGALNPTSVVKVPQ-GTLSSEAKDSIRKEFEKANGG-NNSGRVMVLD-QSADFSTVSINADV 251 (359) T ss_pred HHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCCHHHHHHHHHHHHHHhCc-cccCCceecC-CCcceeeecCCHHH Confidence 999999999999999988886643 445432 123343344566666554433 4555 45665 45788888765443 Q ss_pred H--HHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcce Q lcl|NC_019527. 352 L--ADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAIT 429 (516) Q Consensus 352 l--~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~ 429 (516) . -+......+.||.+.+||-.+|-|. +.-+++. +.+++.-...+.+.|+.+.+-|..... ..+. T Consensus 252 ~q~le~~~~~~~~Ia~~fgVPp~~lg~~--~~~~~~~--------~~~e~~~~~~l~~~l~p~~~~l~~~l~----~~~~ 317 (359) T protein:vir:10 252 ANYLNSMNWGRTQIAKAFGVSDSYLNGT--GDQQSSL--------DQIKDLYVNALNRFIEPLISELRIKCD----SSIG 317 (359) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhCCC--CcccccH--------HHHHHHHHHHHHHHHHHHHHHHHHHhh----hhhc Confidence 2 3556667789999999998776432 2222222 222322233344444444443332211 1222 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhcc Q lcl|NC_019527. 430 FKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDS 478 (516) Q Consensus 430 ~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~ 478 (516) +....+...+...+ ...+..++++|++|++|+|+.|...+-- T Consensus 318 ~~~~~~~~~d~~~~-------~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 318 VDMSPITDYSNSVF-------KADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred ccchhhhhcCHHHH-------HHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 33333333333222 2234568999999999999998554432 No 102 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=99.67 E-value=4.6e-17 Score=110.16 Aligned_cols=407 Identities=13% Similarity=0.067 Sum_probs=198.7 Q ss_pred CCCccCCCccchhc--ccccccchhhhcccccCCcccccccC--c-ccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeee Q lcl|NC_019527. 59 MPGVVPAGTTPAVA--MDSLCGPTYQFLNSAAGGLYAADIQP--F-PGYQNLAALATRPEYRAFASTLSTELTREGIEIT 133 (516) Q Consensus 59 ~~gv~~~~~~~~~a--~ds~~~~~~~~~~~~~~~~~~~~~~~--f-~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~ 133 (516) +...-|+..-..+. .+.. .........++-|-+.-...+ . .-|........+.++++|||..++-++-+|+++. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~-~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~ 79 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDG-MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) T ss_pred CCCCCHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCeecC Confidence 11111110000000 0000 000111111222211110000 0 0111122223456789999999999999999987 Q ss_pred eccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeeccee Q lcl|NC_019527. 134 SKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMW 213 (516) Q Consensus 134 ~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~ 213 (516) +.++.+.. +.+.+.+++-++.....++.+...+||.|++++..+... .| .+.+++|.+ T Consensus 80 ~~~d~~~~------~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg--~~--------------~i~~~~p~~ 137 (456) T protein:vir:79 80 GSADSDLA------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDG--TA--------------TITADSPET 137 (456) T ss_pred CCCCccHH------HHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCC--ce--------------EEEEeccce Confidence 76554322 345666777788888999999999999999877653221 01 022333333 Q ss_pred ecccccc------------ccccc-c----ccccCcceeEE--ee--------e--Eeccce------EEEecCCcchhh Q lcl|NC_019527. 214 TSPSAYN------------ALDPT-A----PDFYKPSTWWV--LG--------R--EMHASR------LLTIITRPLPDM 258 (516) Q Consensus 214 v~p~~~~------------~~dp~-s----~~yg~P~~y~v--~g--------~--~iH~SR------li~~~~~~~p~~ 258 (516) +.+..-. ..+.. . -.|..+..|.. .+ . ..+... ..++.+.+ |-. T Consensus 138 ~~~i~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-pvv 216 (456) T protein:vir:79 138 MVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPP-PVV 216 (456) T ss_pred eEEEEcCCCCCceEEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCce-eEE Confidence 3221100 00000 0 00111111100 00 0 000000 00111111 111 Q ss_pred hhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeee---cchhhhcCccHHHHHHHHHHHHHhcCCcce-EE Q lcl|NC_019527. 259 LKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKT---NMAQVLNGGEGGDVFDRVEMYVNMQSNLGL-AV 334 (516) Q Consensus 259 ~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~---~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~-~~ 334 (516) +..|-.|.|.++.+.+.+.+++++......-+..++...... +.........+ +.+...+..+...+. +. T Consensus 217 --~~~N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g----~~i~~~~~~~~~~~~~~~ 290 (456) T protein:vir:79 217 --VYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENG----NAIDYASIFEAAPGALWE 290 (456) T ss_pred --EecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccc----cccchhhhhhhhcccccc Confidence 123557999999999988888887655443333333222211 11110101111 111222222222333 33 Q ss_pred EecCCcceeEE-ecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHH---HHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 335 MDFDSEDIVQV-NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIR---SFYDDISSVQQSYYFSPLD 410 (516) Q Consensus 335 id~~~e~~e~~-~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~---~yyd~I~~~Qe~~l~p~l~ 410 (516) ++ ++.++.++ .+++++..+.++....+|++.+++|...|.|.+ -|.||+.=.. .....++.+| ..+++.|+ T Consensus 291 ~~-~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~---~N~Sg~Al~~~~~~l~~k~~~~~-~~f~~~l~ 365 (456) T protein:vir:79 291 LP-PGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS---ANQSAEGAHNIEKGFLFKCEDRL-SIAKIGLE 365 (456) T ss_pred CC-CCcceeeecccChHHHHHHHHHHHHHHHhhcCCChhHhcccc---cCcHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Confidence 43 34556554 466788999999999999999999998886643 2456664333 3334444444 56889999 Q ss_pred HHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcc Q lcl|NC_019527. 411 TMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIV 490 (516) Q Consensus 411 ~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~ 490 (516) +++++++.-.......++++.|.+....|..+.|+ ++++++++|+++...++..| ||... +++.. T Consensus 366 ~~~~l~~~~~g~~~~~~i~v~w~~~~~~s~~~~ad-------a~~kl~~~G~~~~~~~~~~l------g~~~~--~i~~~ 430 (456) T protein:vir:79 366 AILVKALQIEGESVEDTVDVSFESPDRVTLGEKYS-------AASLAKAAGESWASIRRNIL------NYNAD--QIKQD 430 (456) T ss_pred HHHHHHHHhcCCCccccceEEeCCCCCcCHHHHHH-------HHHHHHhcCCChHHHHHhcC------CCCHH--HHHHH Confidence 99998875443333347899999998888877654 55667888999887766554 34322 11111 Q ss_pred ccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 491 QPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 491 ~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) +.+...++ .+..+..+-..++++++ T Consensus 431 e~~r~~~e-~~~~~~~~~~~~~~~~~ 455 (456) T protein:vir:79 431 DLDRAREQ-ITLFAGNPVQRPQEDGS 455 (456) T ss_pred HHHHHHHH-HHHHhhhHhhcCCCCCC Confidence 11111111 11122233344555555 No 103 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=99.66 E-value=1.5e-16 Score=107.31 Aligned_cols=346 Identities=14% Similarity=0.093 Sum_probs=168.7 Q ss_pred cccccchhhhccc-ccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccch-h------hhHH Q lcl|NC_019527. 74 DSLCGPTYQFLNS-AAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKA-K------EMAS 145 (516) Q Consensus 74 ds~~~~~~~~~~~-~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~-~------~~~~ 145 (516) =+.++-....... ...+. .+..++.-......+..+++||+.+|++..+-.+.+--...++. . .... T Consensus 1 M~~f~k~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~ 75 (378) T protein:vir:85 1 MNLFGKVVSFSRGKLNNDT-----QRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSD 75 (378) T ss_pred CchhhhhhhhhhcccccCC-----cceeeeeccchhhhhHHHHHHHHHHHHhHhhCceeEEEEeccccccccccccccch Confidence 0222211100000 00000 01111111122346677899999999999998887643222111 0 0011 Q ss_pred HHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeecccccccccc Q lcl|NC_019527. 146 KIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDP 224 (516) Q Consensus 146 ~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp 224 (516) ....|+.+-........|.+.+... .++|.|+++++.++.. |.+.+... T Consensus 76 l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~--------------g~~~~~~~---------------- 125 (378) T protein:vir:85 76 LDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSET--------------GELLDLLF---------------- 125 (378) T ss_pred HHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCC--------------ceEEEEEe---------------- Confidence 1222322333334455666665544 5689999886554321 22211110 Q ss_pred ccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc-eeeec Q lcl|NC_019527. 225 TAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT-FLKTN 303 (516) Q Consensus 225 ~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~-v~k~~ 303 (516) ...++.+.++++||+.. +-+...+.+.++.+.+.+..+ +...... +++.+ T Consensus 126 -----------~~~~~~~~~~dvih~~~--------~~~~~~~~~~~~~a~~~~~~~----------~~~~~~~g~l~~~ 176 (378) T protein:vir:85 126 -----------ANDKKEYKPEELVRLVS--------PFYINEDTSILDNALASIQTK----------LEQGKLRGLLKIN 176 (378) T ss_pred -----------cCCCEEEcccceEEEec--------CcCccchhhHHHHHHHHHHHH----------HhcCCcceEEEeC Confidence 11234455677787752 111122344444444433221 1222222 33432 Q ss_pred chhhhcCccHHHHHHHHHHHH-H---hcCCcceEEEecCCcceeEEecccCCHH-HHHHHHHHHHHhhhcCCceeeeccc Q lcl|NC_019527. 304 MAQVLNGGEGGDVFDRVEMYV-N---MQSNLGLAVMDFDSEDIVQVNTPLSGLA-DLQSQSQEHMCSVSKIPAIKLTGIS 378 (516) Q Consensus 304 ~~~~l~~~~~~~l~~r~~~~~-~---~~sn~g~~~id~~~e~~e~~~~~lsgl~-d~~~~~~~~iaaas~IP~t~L~G~s 378 (516) ..++.+..+++.++++.+. . ...+.+++++++ +.+|++++.+...++ +.+++...+||.+.|||..+|.| T Consensus 177 --~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~-g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~~-- 251 (378) T protein:vir:85 177 --AFLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDN-KTEIVELKKDYSVLNKDEIELIKSELLTGYFMNENILLG-- 251 (378) T ss_pred --CcCCHHHHHHHHHHHHHHHHHhhcccccccceecCC-CceEEeccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcC-- Confidence 2344333344555554332 2 223334566665 588998877655443 23456667899999999988844 Q ss_pred cccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc----------CCcceEEeCCCCCCCHHHHHHHHH Q lcl|NC_019527. 379 PSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI----------DDAITFKFKSLWQTSAKEESEIRF 448 (516) Q Consensus 379 p~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~----------~~d~~~~f~pL~~~sekEkAei~~ 448 (516) +..+.....|| ...|.|.+..+-.-|-+..+-.. ..++.|+++.|...|.+++++ T Consensus 252 -----s~~e~~~~~f~-------~~tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~--- 316 (378) T protein:vir:85 252 -----TATQEQQIYFY-------NSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELID--- 316 (378) T ss_pred -----CchHHHHHHHH-------HHHHHHHHHHHHHHHHhhcCChhhhhhhhhccccceeeecchhhhhcCHHHHHH--- Confidence 12233333343 34588999888887765543211 124567777898888888755 Q ss_pred HHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhc-----ccc--ccchhcCCCCCCCCCCCCCCC Q lcl|NC_019527. 449 NKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEI-----VQP--EMFDDDGADPYMPDPDVLPGE 513 (516) Q Consensus 449 ~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~-----~~~--e~~~~e~~~~~~~~~~~~~~~ 513 (516) ++..++++|++|++|+|+.++.. +++.-... ..+ +..+.+.........++..++ T Consensus 317 ----~~~~~~~~G~~T~NE~R~~lgl~------p~~gGD~~~~~~N~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:85 317 ----LYHENINGPIFTQNQLLVKMGEQ------PIEGGDIYIANLNAVAVKNLSDLQGSRKDVASTDETNNQ 378 (378) T ss_pred ----HHHHHHhCCCcCHHHHHHHhCCC------CCCCCCeEeecccccccccchhhcCccCCCCCCCCCCCC Confidence 55668999999999999998544 33211110 000 111111111111111111111 No 104 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=99.66 E-value=1.6e-15 Score=101.68 Aligned_cols=376 Identities=9% Similarity=0.021 Sum_probs=169.6 Q ss_pred cccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHH Q lcl|NC_019527. 74 DSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEA 153 (516) Q Consensus 74 ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~ 153 (516) =|.++.. ..-.... ............-....|.+++.+.++|+.+|+++.+..+.+...+++... .......|... T Consensus 1 MGlf~~~-~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~~~-~~~~~~lL~~~ 76 (395) T protein:vir:98 1 MGILDFF-SFKKSGT--LSDDDSGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKLTEN-QKDWLYWINTK 76 (395) T ss_pred Ccchhhh-cCCCccc--ccccccchhhhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCcccc-cchHHHHHhhc Confidence 0111110 0000000 000001111111223445577889999999999999999988654432221 11222333333 Q ss_pred HHhcChhHHHHHHHHhcc-cceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCc Q lcl|NC_019527. 154 CEYYGVMGIIQKAAEHDC-FFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKP 232 (516) Q Consensus 154 ~~~l~~~~~l~ea~~~~r-lyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P 232 (516) -..+.....|.+.+.+.+ ++|.|++++.-++.-+ ++ + . +.+.. .+.+..+. .... T Consensus 77 PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~~---~~-~--~--------~~~~~--~~~~~~~~--~~~~------ 132 (395) T protein:vir:98 77 ANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGIY---VA-D--S--------FTQDK--KISGSQFK--VSRV------ 132 (395) T ss_pred CCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCcee---cC-C--c--------ccccc--cccCcccc--eeee------ Confidence 334445566666655554 5788887764432110 00 0 0 00000 00000000 0000 Q ss_pred ceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHH--HHHHhCCceeeecchhhhcC Q lcl|NC_019527. 233 STWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSD--LVDKFSRTFLKTNMAQVLNG 310 (516) Q Consensus 233 ~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~--Ll~~~~~~v~k~~~~~~l~~ 310 (516) ..|.+ .+.+-++.|+||..... .....+.++++..-+.+...........+ ...+................ T Consensus 133 ~~~~~-~~~~~~~evih~k~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 205 (395) T protein:vir:98 133 QGQTY-EKTFTFDQVIYLKNDNS------DLMSKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQENSDG 205 (395) T ss_pred cCcee-eeEecCccEEEecCCCC------CccccccchhhhHHHHHHHHHHHHHHHHHHHHhhccccccccccccccCCc Confidence 01111 23455678899874321 11122444555444433332222211111 11111111111111111111 Q ss_pred cc-HHHHHHHHHHHHHh-cCCcc-eEEEecCCcceeEEecccC--------CHHHHHHHHHHHHHhhhcCCceeeecccc Q lcl|NC_019527. 311 GE-GGDVFDRVEMYVNM-QSNLG-LAVMDFDSEDIVQVNTPLS--------GLADLQSQSQEHMCSVSKIPAIKLTGISP 379 (516) Q Consensus 311 ~~-~~~l~~r~~~~~~~-~sn~g-~~~id~~~e~~e~~~~~ls--------gl~d~~~~~~~~iaaas~IP~t~L~G~sp 379 (516) .. .+...+.++..... ..|.+ ++++.+ +-+++.++.... .+.++......+||.+.+||..+| | T Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~-g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l-~--- 280 (395) T protein:vir:98 206 GRQSKSDKDFFKRTVEKIRTESVVGIPVTA-NTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLL-H--- 280 (395) T ss_pred HHHHHHHHHHHHHHHhhhhcCCcceeecCC-CceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHh-c--- Confidence 11 11222223332222 22332 333433 457777764422 233445566678999999999877 3 Q ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 380 SGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE--IDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIY 457 (516) Q Consensus 380 ~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~--~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~ 457 (516) |-.++.+.....|| +..|.|.+..+-..|-+..+.. ...++.|+|+.|...|.+++++ ++.++ T Consensus 281 -~~~sn~e~~~~~f~-------~~tl~P~~~~ie~~l~~kll~~~~~~~g~~f~~~~l~~~d~~~~~~-------~~~~~ 345 (395) T protein:vir:98 281 -GDIADNQKNYELLL-------EGPIESLITNIVDGLEYAIFDKSETLQGSFIKVTGLKNYDLFSISN-------QADKL 345 (395) T ss_pred -CCcccHHHHHHHHH-------HHHHHHHHHHHHHHHHHhcCChhhhcCcceeeehhhhccCHHHHHH-------HHHHH Confidence 22233333334443 3568898888888777766543 2346789999999999887655 55668 Q ss_pred HHcCCCCHHHHHHHHHhhhccCCCCCChh--hhcccc-ccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 458 ITNSVIDPSEARQQLSDDPDSGWDNIDGD--LEIVQP-EMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 458 ~~~gvi~~~e~r~~l~~~~~~~~~~~d~~--~e~~~~-e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) +++|++|++|+|+.++ +++++.. .+.... ..... +..+++ ..++.+| T Consensus 346 ~~~G~~T~NE~R~~~g------~~Pi~~~~gD~~~~~~n~~~~---~~~gge---~~~~~~~ 395 (395) T protein:vir:98 346 ISSGFVFIDEVREEIG------LPELPDGLGKVLYMTKNYESV---LERGGE---VDEEVET 395 (395) T ss_pred HhCCCcCHHHHHHHhC------CCCCCCCCCceeeecccceec---ccccCC---CCCCCCC Confidence 9999999999999984 4444220 011000 00000 000111 1111111 No 105 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=99.65 E-value=1.6e-16 Score=107.22 Aligned_cols=406 Identities=15% Similarity=0.105 Sum_probs=199.4 Q ss_pred CCCccCCCccchh--cccccccchhhhcccccCCccccccc--Ccc-cHHHHHHHHhCchhhhhhhhhhHHHhhCCCeee Q lcl|NC_019527. 59 MPGVVPAGTTPAV--AMDSLCGPTYQFLNSAAGGLYAADIQ--PFP-GYQNLAALATRPEYRAFASTLSTELTREGIEIT 133 (516) Q Consensus 59 ~~gv~~~~~~~~~--a~ds~~~~~~~~~~~~~~~~~~~~~~--~f~-gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~ 133 (516) |.-.-|+..-..| ..+... ........++-|.+.-.+. .++ -+.....-..+.+++.|||..++-++-+|+.+. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~-~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~ 79 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGM-SRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecC Confidence 1000010100000 000000 0001111122221110000 010 111111123467789999999999999999987 Q ss_pred eccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeeccee Q lcl|NC_019527. 134 SKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMW 213 (516) Q Consensus 134 ~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~ 213 (516) +.++.+.. ..+.+.|++-++.....++.+...+||.|++++..+... .|. +++++|.+ T Consensus 80 ~~~d~~~~------~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g--~~~--------------i~~~~p~~ 137 (456) T protein:vir:10 80 GSADSDLA------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDG--TAT--------------ITADSPET 137 (456) T ss_pred CCCCcchH------HHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCC--ceE--------------EEEEccce Confidence 66544322 345666777788899999999999999999877653211 110 22333333 Q ss_pred eccccc------------ccccc-------------ccccccCcce--eEEeeeEe---ccc-----eEEEecCCcchhh Q lcl|NC_019527. 214 TSPSAY------------NALDP-------------TAPDFYKPST--WWVLGREM---HAS-----RLLTIITRPLPDM 258 (516) Q Consensus 214 v~p~~~------------~~~dp-------------~s~~yg~P~~--y~v~g~~i---H~S-----Rli~~~~~~~p~~ 258 (516) +.+..- ...++ ....|+.+.. +....... +.+ ...++-+.+ | + T Consensus 138 ~~~i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-p-v 215 (456) T protein:vir:10 138 MVVSVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPP-P-V 215 (456) T ss_pred eEEEEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCce-e-E Confidence 222100 00000 0001111000 00000000 000 001111111 1 1 Q ss_pred hhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCC--ceee-ecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEE Q lcl|NC_019527. 259 LKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSR--TFLK-TNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAV 334 (516) Q Consensus 259 ~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~--~v~k-~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~ 334 (516) -+..|-+|.|.++.+.+.+.+++++......-...++. .+++ ++.........+..+ .....+.. ..+ ++. T Consensus 216 -v~~~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~-~~~~~~~~---~~~~~~~ 290 (456) T protein:vir:10 216 -VVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAI-DYASIFEA---APGALWE 290 (456) T ss_pred -EEecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccccccc-chhhhhhh---hcccccc Confidence 12345689999999999999988887654433333332 2221 111111111111111 11222222 223 333 Q ss_pred EecCCcceeEEe-cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHH---HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 335 MDFDSEDIVQVN-TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEI---RSFYDDISSVQQSYYFSPLD 410 (516) Q Consensus 335 id~~~e~~e~~~-~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~---~~yyd~I~~~Qe~~l~p~l~ 410 (516) ++. +.++.+++ .++.+..+.++...++|++.+++|...|.|.+ -|.||+.=. ......++.+| ..+++.++ T Consensus 291 ~~~-~~~~~q~~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~---~N~Sg~Ai~~~~~~l~~k~~~~~-~~f~~~l~ 365 (456) T protein:vir:10 291 LPP-GVDIWESQANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS---ANQSAEGAHNIEKGFLFKCEDRL-SIAKIGLE 365 (456) T ss_pred CCC-CcceEEecccChhHHHHHHHHHHHHHHhccCCChHHhcccc---cChHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Confidence 443 45665554 56778888999999999999999988876643 245666432 23344444444 46788999 Q ss_pred HHHHHHHHHhCCCcC-CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhc Q lcl|NC_019527. 411 TMLKVIQLSKWGEID-DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEI 489 (516) Q Consensus 411 ~l~~~l~~s~~g~~~-~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~ 489 (516) +++++++.. .|..+ .++++.|.+....|..|.| +++.++.++|+++...+++.| ||.. .+.+. T Consensus 366 ~~~rl~~~~-~g~~~~~~~~v~w~~~~~~~~~~~a-------da~~kl~~~gi~~~~~~~~~l------g~~~--~~i~~ 429 (456) T protein:vir:10 366 AILVKALQI-EGESVEDTVDVSFESPDRVTLGEKY-------SAASLAKAAGESWASIRRNIL------NYNA--DQIKQ 429 (456) T ss_pred HHHHHHHHh-cCCCcccceeEEecCCCCcCHHHHH-------HHHHHHHHcCCChHHHHHhhC------CCCH--HHHHH Confidence 999987654 34443 4789999999989888764 555667888998887776654 3332 22221 Q ss_pred cccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 490 VQPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 490 ~~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) .+.+...++ ....+.++...|..+++ T Consensus 430 ~e~er~~~e-~~~~~~~~~~~~~~~~~ 455 (456) T protein:vir:10 430 DDLDRAREQ-ITLFAGNPVQRPQEDGS 455 (456) T ss_pred HHHHHHHHH-HHHHhhhhhhcCCCCCC Confidence 122211111 11223333444555555 No 106 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=99.65 E-value=1.6e-16 Score=107.22 Aligned_cols=406 Identities=15% Similarity=0.105 Sum_probs=199.4 Q ss_pred CCCccCCCccchh--cccccccchhhhcccccCCccccccc--Ccc-cHHHHHHHHhCchhhhhhhhhhHHHhhCCCeee Q lcl|NC_019527. 59 MPGVVPAGTTPAV--AMDSLCGPTYQFLNSAAGGLYAADIQ--PFP-GYQNLAALATRPEYRAFASTLSTELTREGIEIT 133 (516) Q Consensus 59 ~~gv~~~~~~~~~--a~ds~~~~~~~~~~~~~~~~~~~~~~--~f~-gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~ 133 (516) |.-.-|+..-..| ..+... ........++-|.+.-.+. .++ -+.....-..+.+++.|||..++-++-+|+.+. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~-~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~ 79 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGM-SRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecC Confidence 1000010100000 000000 0001111122221110000 010 111111123467789999999999999999987 Q ss_pred eccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeeccee Q lcl|NC_019527. 134 SKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMW 213 (516) Q Consensus 134 ~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~ 213 (516) +.++.+.. ..+.+.|++-++.....++.+...+||.|++++..+... .|. +++++|.+ T Consensus 80 ~~~d~~~~------~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g--~~~--------------i~~~~p~~ 137 (456) T protein:vir:10 80 GSADSDLA------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDG--TAT--------------ITADSPET 137 (456) T ss_pred CCCCcchH------HHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCC--ceE--------------EEEEccce Confidence 66544322 345666777788899999999999999999877653211 110 22333333 Q ss_pred eccccc------------ccccc-------------ccccccCcce--eEEeeeEe---ccc-----eEEEecCCcchhh Q lcl|NC_019527. 214 TSPSAY------------NALDP-------------TAPDFYKPST--WWVLGREM---HAS-----RLLTIITRPLPDM 258 (516) Q Consensus 214 v~p~~~------------~~~dp-------------~s~~yg~P~~--y~v~g~~i---H~S-----Rli~~~~~~~p~~ 258 (516) +.+..- ...++ ....|+.+.. +....... +.+ ...++-+.+ | + T Consensus 138 ~~~i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-p-v 215 (456) T protein:vir:10 138 MVVSVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPP-P-V 215 (456) T ss_pred eEEEEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCce-e-E Confidence 222100 00000 0001111000 00000000 000 001111111 1 1 Q ss_pred hhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCC--ceee-ecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEE Q lcl|NC_019527. 259 LKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSR--TFLK-TNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAV 334 (516) Q Consensus 259 ~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~--~v~k-~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~ 334 (516) -+..|-+|.|.++.+.+.+.+++++......-...++. .+++ ++.........+..+ .....+.. ..+ ++. T Consensus 216 -v~~~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~-~~~~~~~~---~~~~~~~ 290 (456) T protein:vir:10 216 -VVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAI-DYASIFEA---APGALWE 290 (456) T ss_pred -EEecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccccccc-chhhhhhh---hcccccc Confidence 12345689999999999999988887654433333332 2221 111111111111111 11222222 223 333 Q ss_pred EecCCcceeEEe-cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHH---HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 335 MDFDSEDIVQVN-TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEI---RSFYDDISSVQQSYYFSPLD 410 (516) Q Consensus 335 id~~~e~~e~~~-~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~---~~yyd~I~~~Qe~~l~p~l~ 410 (516) ++. +.++.+++ .++.+..+.++...++|++.+++|...|.|.+ -|.||+.=. ......++.+| ..+++.++ T Consensus 291 ~~~-~~~~~q~~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~---~N~Sg~Ai~~~~~~l~~k~~~~~-~~f~~~l~ 365 (456) T protein:vir:10 291 LPP-GVDIWESQANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS---ANQSAEGAHNIEKGFLFKCEDRL-SIAKIGLE 365 (456) T ss_pred CCC-CcceEEecccChhHHHHHHHHHHHHHHhccCCChHHhcccc---cChHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Confidence 443 45665554 56778888999999999999999988876643 245666432 23344444444 46788999 Q ss_pred HHHHHHHHHhCCCcC-CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhc Q lcl|NC_019527. 411 TMLKVIQLSKWGEID-DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEI 489 (516) Q Consensus 411 ~l~~~l~~s~~g~~~-~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~ 489 (516) +++++++.. .|..+ .++++.|.+....|..|.| +++.++.++|+++...+++.| ||.. .+.+. T Consensus 366 ~~~rl~~~~-~g~~~~~~~~v~w~~~~~~~~~~~a-------da~~kl~~~gi~~~~~~~~~l------g~~~--~~i~~ 429 (456) T protein:vir:10 366 AILVKALQI-EGESVEDTVDVSFESPDRVTLGEKY-------SAASLAKAAGESWASIRRNIL------NYNA--DQIKQ 429 (456) T ss_pred HHHHHHHHh-cCCCcccceeEEecCCCCcCHHHHH-------HHHHHHHHcCCChHHHHHhhC------CCCH--HHHHH Confidence 999987654 34443 4789999999989888764 555667888998887776654 3332 22221 Q ss_pred cccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 490 VQPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 490 ~~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) .+.+...++ ....+.++...|..+++ T Consensus 430 ~e~er~~~e-~~~~~~~~~~~~~~~~~ 455 (456) T protein:vir:10 430 DDLDRAREQ-ITLFAGNPVQRPQEDGS 455 (456) T ss_pred HHHHHHHHH-HHHHhhhhhhcCCCCCC Confidence 122211111 11223333444555555 No 107 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.58 E-value=8.8e-15 Score=97.65 Aligned_cols=374 Identities=9% Similarity=-0.059 Sum_probs=184.3 Q ss_pred ccccCc-ccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhccc Q lcl|NC_019527. 94 ADIQPF-PGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCF 172 (516) Q Consensus 94 ~~~~~f-~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rl 172 (516) +-..+. .-|-.++......+.++|||..++-+.=+||+.. +.++ -+.+.+.|++-++.....++.+.+.+ T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~~~--d~~~-------~~~~~~i~~~N~~d~~~~~~~~~a~i 71 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFCGLIANASVHRLLALGVTGP--DGEP-------DTRASRWWQANRLDSRQKLVWRMAMA 71 (434) T ss_pred CCCCCccHHHHHhhhhhhccchHHHHHHHHhhhccCceecC--CCch-------HHHHHHHHHhcChhHHHHHHHHHHhh Confidence 000111 1122222223456899999999998877887642 2211 13356667777888999999999999 Q ss_pred ceeeEEEEEecCCCc---ccCcccccccccccceeeEEeecceeeccccccccccccccccC------------------ Q lcl|NC_019527. 173 FGRGQISINIKGADV---SVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYK------------------ 231 (516) Q Consensus 173 yG~a~i~i~i~~~~~---~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~------------------ 231 (516) ||.|++++..+.... .+|.+ -|++++|.++.+..- ... ..+.++- T Consensus 72 ~G~ay~~v~~~~~~~~~~~~~~~------------~I~~~~p~~~~~i~D-~~~-~~~~~ai~~~~~~~~~~~~~~~~~~ 137 (434) T protein:vir:98 72 QSAGYMLVGAHPTRTEDNGRPSP------------LITMEHPSECIVEYD-PET-GEPLVGLKVWHNDIDGFGYARVFFD 137 (434) T ss_pred cCceEEEEecCCCcccccCCcee------------EEEEeccceeEEEEe-CCC-CceEEEEEEEEeccCCceEEEEEEe Confidence 999998886532211 11111 133444444432110 000 0011110 Q ss_pred -cceeEEe----e-------------eEeccceEEEecCCcch-hhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 232 -PSTWWVL----G-------------REMHASRLLTIITRPLP-DMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLV 292 (516) Q Consensus 232 -P~~y~v~----g-------------~~iH~SRli~~~~~~~p-~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll 292 (516) -.++... + ..+|...-+.|..-|+. ...++....+|.|.++.+.+.+.+++++........ T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g~sd~e~vi~liDa~~~~~s~~~~~~ 217 (434) T protein:vir:98 138 DTSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDPEPEFAGVLDIQDRVNLGILNRMAAS 217 (434) T ss_pred CcEEEEEEeeccccccccccccceecccccccccCCCCccceEEeccCCCcCcCCcchhhhHHHHHHHHHHHHHHHHHHH Confidence 0000000 0 00111111111111111 112223335799999999999999999987766555 Q ss_pred HHhCCceeee---cchhhhcCccHHHHHHHHHHHHHhcC-CcceEEEecCCcceeEEe-cccCCHHHHHHHHHHHHHhhh Q lcl|NC_019527. 293 DKFSRTFLKT---NMAQVLNGGEGGDVFDRVEMYVNMQS-NLGLAVMDFDSEDIVQVN-TPLSGLADLQSQSQEHMCSVS 367 (516) Q Consensus 293 ~~~~~~v~k~---~~~~~l~~~~~~~l~~r~~~~~~~~s-n~g~~~id~~~e~~e~~~-~~lsgl~d~~~~~~~~iaaas 367 (516) .-++.....+ ++.... .+.. ......+.... ...+.++.+++-++-+++ .++.+..+.+....++|++.+ T Consensus 218 ~~~a~p~~~i~G~~~~~~~-~~~~----~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~ 292 (434) T protein:vir:98 218 RFSGFRQKWIKGHKFAKRT-DPAT----GMTVVDQPFVPSPSAVWASEGENTQFGQLDATDLSGFLKEHASDVRDMLTIS 292 (434) T ss_pred HHhcchhhhhcCCCccccc-cccc----ccchhhhhhhccccccccCCCCCceEEEecCcchHHHHHHHHHHHHHHhccc Confidence 5444432221 111111 1111 11111111111 122344443333443432 345566777888899999999 Q ss_pred cCCceeeeccccccccccchHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcC-CcceEEeCCCCCCCHHHH Q lcl|NC_019527. 368 KIPAIKLTGISPSGLNASSEGE---IRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEID-DAITFKFKSLWQTSAKEE 443 (516) Q Consensus 368 ~IP~t~L~G~sp~Glnatge~D---~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~-~d~~~~f~pL~~~sekEk 443 (516) ++|...|.|. --|+||+.= .......++.+| ..++..+++++++++........ .++++.|.+-...|..+. T Consensus 293 ~~p~~~~~~~---~~n~Sg~Al~~~~~~l~~k~~~k~-~~f~~~l~~~~rl~~~~~g~~~~~~~~~v~w~~~~~~s~~~~ 368 (434) T protein:vir:98 293 QTPTYLYATD---LVNISADTIGALDILHVAKVREHI-ASFSEGLESVLALAAAQAGVPEDYTEAEVRWANPAHVTMAVK 368 (434) T ss_pred CCCHHHhccc---cCChHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCChhheeeeEEecCCCCCCHHHH Confidence 9998776553 124566643 333444455555 45788899998877655432221 368899999999998886 Q ss_pred HHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccch----------hcCCCCCCCCCCCCCCC Q lcl|NC_019527. 444 SEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFD----------DDGADPYMPDPDVLPGE 513 (516) Q Consensus 444 Aei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~----------~e~~~~~~~~~~~~~~~ 513 (516) |+.. +++.++|+ +.+.+++.| +++. .+++..+.+... ..+.++.+.++++.... T Consensus 369 ada~-------~kl~~~g~-~~e~~~~~l------g~~~--~e~~r~~~e~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 432 (434) T protein:vir:98 369 ADAA-------TKLKSIGY-PLDVIAEEL------DESP--ARVRRIVAGAASQALLAASLLPAPGAPSAGNVPDSGGAV 432 (434) T ss_pred HHHH-------HHHHhcCC-cHHHHHHhC------CCCH--HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCcccCCC Confidence 6654 55666664 444444433 2221 122211111111 11222233333333322 Q ss_pred CC Q lcl|NC_019527. 514 EG 515 (516) Q Consensus 514 e~ 515 (516) +| T Consensus 433 dg 434 (434) T protein:vir:98 433 DG 434 (434) T ss_pred CC Confidence 33 No 108 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.54 E-value=1.8e-14 Score=95.92 Aligned_cols=434 Identities=7% Similarity=0.018 Sum_probs=205.3 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCC--ccccccCCCCCCCccCCCccchhccccccc Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASD--AATKWAPPQLMPGVVPAGTTPAVAMDSLCG 78 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~ 78 (516) |+++.|-.+-+.+....-+...+..+- ....-..+......+..+ ....|+..... +.+.. ...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~---i~~~~---~~~~---- 67 (474) T protein:vir:97 1 MFNIIRMPWDKPYGEEVVEQLKPQFET---QEEMIVRLIDDHRKQLDKITVGQRYYDKDND---IVKQM---KKVD---- 67 (474) T ss_pred CcccccccCCCchhhHHHHhhhhcccC---HHHHHHHHHHHHHHHHHHHHHHHHHhccccc---hhccc---chhc---- Confidence 999987765555543322222222211 111111121111111110 01122222100 00000 0000 Q ss_pred chhhhcccccCCcccccccCcccHHHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhc Q lcl|NC_019527. 79 PTYQFLNSAAGGLYAADIQPFPGYQNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYY 157 (516) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l 157 (516) ..+ ..-.+. +-.+ .+.+++.||+..+.-++.+++++++.++.. .+.|+. +.+- T Consensus 68 ---------~~~------~~~~~~---~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~-------~~~l~~-~~~n 121 (474) T protein:vir:97 68 ---------VHG------NIDYDK---PDWRITTNFHQNLVDQKVSYVASKPVTYSCEDENV-------LKVIHD-VLDT 121 (474) T ss_pred ---------ccc------cccccc---CcceeecchHHHHHHHHHhhhhcCCceeccCcHHH-------HHHHHH-HHhc Confidence 000 000000 0011 357799999999999999999998765432 122333 2234 Q ss_pred ChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEE Q lcl|NC_019527. 158 GVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV 237 (516) Q Consensus 158 ~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v 237 (516) +....+.++.+....||.|++++.++... .+ .+.+++|..+.|.. ...+...+.++ -.+|.. T Consensus 122 ~~~~~~~e~~~~~~~~G~~~~~~~~d~~~---------------~~-~i~~~~p~~~~~v~-d~~~~~~~~~~-ir~~~~ 183 (474) T protein:vir:97 122 RWDNKLIDILTATSNKGIDWLQVYINENG---------------EM-KLFRVPAEQAIPIW-VDKEREELKSF-IRYYKF 183 (474) T ss_pred cHHHHHHHHHHHHhhcCceEEEEEecCCC---------------ee-EEEEEcccceEEEE-cCCCCCceEEE-EEEEEe Confidence 78889999999999999999888764321 11 13344454444331 00011111111 111111 Q ss_pred eee---E-eccceEEEecC-------------------------CcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 238 LGR---E-MHASRLLTIIT-------------------------RPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSV 288 (516) Q Consensus 238 ~g~---~-iH~SRli~~~~-------------------------~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~ 288 (516) .+. . +.+.++.+|.- ..+|-.. -.++-.|.|.++.+.+.+.+++.+.... T Consensus 184 ~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~-~~nn~~g~sd~e~v~~liDa~n~~~s~~ 262 (474) T protein:vir:97 184 NNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIA-FKNNPEEVSDIWMYKSIIDAIDKRLSDA 262 (474) T ss_pred cCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEE-ecCCcCCCCcHHHHHHHHHHHHHHHHHH Confidence 110 0 11112211110 0111111 1224579999999999999999998888 Q ss_pred HHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhh Q lcl|NC_019527. 289 SDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSV 366 (516) Q Consensus 289 ~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaa 366 (516) +.-+..++...+...... ..+.......+ + ..+++.++++ .+++.+ ..+.+++...++.+.+.|... T Consensus 263 ~~~~~~~~~~~lv~~g~~---~~~~~~~~~~~------~-~~~~i~~~~~-~~~~~l~~~~~~~~~~~~~~~l~~~I~~~ 331 (474) T protein:vir:97 263 QNMFDESVELIYILKGYE---GEDLEEFMRGL------K-YYKAINVDGD-GGVETIQVEVPVSSTKEYIDLMRVYIMEF 331 (474) T ss_pred HHHHHHhcCceeeeecCC---cccchhhhhhh------h-ccceeeccCC-CceeEEeecCCHHHHHHHHHHHHHHHHHH Confidence 877776666665543221 11111221111 1 2333444443 455554 456678888999999999999 Q ss_pred hcCCceeeeccccccccccchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhCCCcC-CcceEEeCCCCCCCHHHH Q lcl|NC_019527. 367 SKIPAIKLTGISPSGLNASSEGEIRSFYDDIS--SVQQSYYFSPLDTMLKVIQLSKWGEID-DAITFKFKSLWQTSAKEE 443 (516) Q Consensus 367 s~IP~t~L~G~sp~Glnatge~D~~~yyd~I~--~~Qe~~l~p~l~~l~~~l~~s~~g~~~-~d~~~~f~pL~~~sekEk 443 (516) +++|-.-. + +.+| |.||..=...|...+. ..++..++..+++++.+++.-.....+ .++++.|++-...+++|. T Consensus 332 s~~p~~~~-~-~~~~-n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~ 408 (474) T protein:vir:97 332 GQGVDFQT-D-KFGS-APSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQ 408 (474) T ss_pred hCccccCc-c-cccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCcccCHHHH Confidence 99996322 1 2222 4566532222322222 344567889999999887653322322 468999999888888887 Q ss_pred HHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhh--hccCCCCCChhhhccccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 444 SEIRFNKAQEAQIYITNSVIDPSEARQQLSDD--PDSGWDNIDGDLEIVQPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 444 Aei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~--~~~~~~~~d~~~e~~~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) |++. .++|++|.+.+.+.+..- +..-+..+..+.+..........+.......+++.++.+.+ T Consensus 409 a~~~----------~~~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 473 (474) T protein:vir:97 409 SQII----------AQSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQEGSNNKES 473 (474) T ss_pred HHHH----------HHcCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCCCCCcccCCCCccccc Confidence 7642 346889988887765210 00001111110000000111111111111111122222222 No 109 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.54 E-value=1.8e-14 Score=95.92 Aligned_cols=434 Identities=7% Similarity=0.018 Sum_probs=205.3 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCC--ccccccCCCCCCCccCCCccchhccccccc Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASD--AATKWAPPQLMPGVVPAGTTPAVAMDSLCG 78 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~ 78 (516) |+++.|-.+-+.+....-+...+..+- ....-..+......+..+ ....|+..... +.+.. ...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~---i~~~~---~~~~---- 67 (474) T protein:vir:94 1 MFNIIRMPWDKPYGEEVVEQLKPQFET---QEEMIVRLIDDHRKQLDKITVGQRYYDKDND---IVKQM---KKVD---- 67 (474) T ss_pred CcccccccCCCchhhHHHHhhhhcccC---HHHHHHHHHHHHHHHHHHHHHHHHHhccccc---hhccc---chhc---- Confidence 999987765555543322222222211 111111121111111110 01122222100 00000 0000 Q ss_pred chhhhcccccCCcccccccCcccHHHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhc Q lcl|NC_019527. 79 PTYQFLNSAAGGLYAADIQPFPGYQNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYY 157 (516) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l 157 (516) ..+ ..-.+. +-.+ .+.+++.||+..+.-++.+++++++.++.. .+.|+. +.+- T Consensus 68 ---------~~~------~~~~~~---~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~-------~~~l~~-~~~n 121 (474) T protein:vir:94 68 ---------VHG------NIDYDK---PDWRITTNFHQNLVDQKVSYVASKPVTYSCEDENV-------LKVIHD-VLDT 121 (474) T ss_pred ---------ccc------cccccc---CcceeecchHHHHHHHHHhhhhcCCceeccCcHHH-------HHHHHH-HHhc Confidence 000 000000 0011 357799999999999999999998765432 122333 2234 Q ss_pred ChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEE Q lcl|NC_019527. 158 GVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV 237 (516) Q Consensus 158 ~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v 237 (516) +....+.++.+....||.|++++.++... .+ .+.+++|..+.|.. ...+...+.++ -.+|.. T Consensus 122 ~~~~~~~e~~~~~~~~G~~~~~~~~d~~~---------------~~-~i~~~~p~~~~~v~-d~~~~~~~~~~-ir~~~~ 183 (474) T protein:vir:94 122 RWDNKLIDILTATSNKGIDWLQVYINENG---------------EM-KLFRVPAEQAIPIW-VDKEREELKSF-IRYYKF 183 (474) T ss_pred cHHHHHHHHHHHHhhcCceEEEEEecCCC---------------ee-EEEEEcccceEEEE-cCCCCCceEEE-EEEEEe Confidence 78889999999999999999888764321 11 13344454444331 00011111111 111111 Q ss_pred eee---E-eccceEEEecC-------------------------CcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 238 LGR---E-MHASRLLTIIT-------------------------RPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSV 288 (516) Q Consensus 238 ~g~---~-iH~SRli~~~~-------------------------~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~ 288 (516) .+. . +.+.++.+|.- ..+|-.. -.++-.|.|.++.+.+.+.+++.+.... T Consensus 184 ~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~-~~nn~~g~sd~e~v~~liDa~n~~~s~~ 262 (474) T protein:vir:94 184 NNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIA-FKNNPEEVSDIWMYKSIIDAIDKRLSDA 262 (474) T ss_pred cCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEE-ecCCcCCCCcHHHHHHHHHHHHHHHHHH Confidence 110 0 11112211110 0111111 1224579999999999999999998888 Q ss_pred HHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhh Q lcl|NC_019527. 289 SDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSV 366 (516) Q Consensus 289 ~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaa 366 (516) +.-+..++...+...... ..+.......+ + ..+++.++++ .+++.+ ..+.+++...++.+.+.|... T Consensus 263 ~~~~~~~~~~~lv~~g~~---~~~~~~~~~~~------~-~~~~i~~~~~-~~~~~l~~~~~~~~~~~~~~~l~~~I~~~ 331 (474) T protein:vir:94 263 QNMFDESVELIYILKGYE---GEDLEEFMRGL------K-YYKAINVDGD-GGVETIQVEVPVSSTKEYIDLMRVYIMEF 331 (474) T ss_pred HHHHHHhcCceeeeecCC---cccchhhhhhh------h-ccceeeccCC-CceeEEeecCCHHHHHHHHHHHHHHHHHH Confidence 877776666665543221 11111221111 1 2333444443 455554 456678888999999999999 Q ss_pred hcCCceeeeccccccccccchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhCCCcC-CcceEEeCCCCCCCHHHH Q lcl|NC_019527. 367 SKIPAIKLTGISPSGLNASSEGEIRSFYDDIS--SVQQSYYFSPLDTMLKVIQLSKWGEID-DAITFKFKSLWQTSAKEE 443 (516) Q Consensus 367 s~IP~t~L~G~sp~Glnatge~D~~~yyd~I~--~~Qe~~l~p~l~~l~~~l~~s~~g~~~-~d~~~~f~pL~~~sekEk 443 (516) +++|-.-. + +.+| |.||..=...|...+. ..++..++..+++++.+++.-.....+ .++++.|++-...+++|. T Consensus 332 s~~p~~~~-~-~~~~-n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~ 408 (474) T protein:vir:94 332 GQGVDFQT-D-KFGS-APSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQ 408 (474) T ss_pred hCccccCc-c-cccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCcccCHHHH Confidence 99996322 1 2222 4566532222322222 344567889999999887653322322 468999999888888887 Q ss_pred HHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhh--hccCCCCCChhhhccccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 444 SEIRFNKAQEAQIYITNSVIDPSEARQQLSDD--PDSGWDNIDGDLEIVQPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 444 Aei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~--~~~~~~~~d~~~e~~~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) |++. .++|++|.+.+.+.+..- +..-+..+..+.+..........+.......+++.++.+.+ T Consensus 409 a~~~----------~~~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 473 (474) T protein:vir:94 409 SQII----------AQSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQEGSNNKES 473 (474) T ss_pred HHHH----------HHcCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCCCCCcccCCCCccccc Confidence 7642 346889988887765210 00001111110000000111111111111111122222222 No 110 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=99.50 E-value=1.5e-12 Score=85.42 Aligned_cols=408 Identities=11% Similarity=0.059 Sum_probs=208.4 Q ss_pred cccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcc Q lcl|NC_019527. 13 VADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLY 92 (516) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~ 92 (516) +...+-.+-.-|++...... .+...+.-.... . ..| ..|..|....+-+ +.. + T Consensus 1 m~~~i~~~~g~p~~~~~~~~----~~~~~ia~~~~~-~---~~~--~~~~~~~~~~~iL----------r~~----~--- 53 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPDK----SLSSQIATRARS-I---DFF--ALGMYLPNPDPVL----------KAL----G--- 53 (491) T ss_pred CCCceeCCCCCccCcccCCh----HHHHHHHhhhcc-c---ccc--cccCCccchHHHH----------Hhc----C--- Confidence 21111111111111111011 111111100000 0 000 0122221111111 000 0 Q ss_pred cccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhccc Q lcl|NC_019527. 93 AADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCF 172 (516) Q Consensus 93 ~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rl 172 (516) -++.++..+....-+..++++...-++..-|.|...++++ ...+.+++.++++.+.+.+.+.+ .+.+ T Consensus 54 -------~~~~~y~~m~~D~~i~s~l~~Rk~av~~~~w~i~~~~~~~-----~~~e~v~e~l~~~~~~~~l~~~l-da~~ 120 (491) T protein:vir:10 54 -------KDIRVYRELRADAHVGGCVRRRKAAVKALEWGLDRGKAKS-----RVAKSIADVFADLDLSRIVTEML-DAVL 120 (491) T ss_pred -------CCHHHHHHHhhChHHHHHHHHHHHHHhCCCcEEecCCCCH-----HHHHHHHHHHhcCCHHHHHHHHH-Hhhh Confidence 0234444555788899999999988888888887654432 12456777888888877777776 6899 Q ss_pred ceeeEEEEEe--cCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEeeeEeccceEEEe Q lcl|NC_019527. 173 FGRGQISINI--KGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTI 250 (516) Q Consensus 173 yG~a~i~i~i--~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~ 250 (516) ||.++.=+.= +++. ..++.+...++.|+... +............+.|..+++.+.+++ T Consensus 121 ~G~s~~Ei~w~~~~g~--------------~~~~~l~~r~~~~f~~d------~~~~l~~~~~~~~~~g~~l~~~k~i~~ 180 (491) T protein:vir:10 121 YGYQPMEITWGKVGNY--------------IVPIDVVGKPADWFVYD------PENQLRFRSKDHWMQGEELPARKFLVP 180 (491) T ss_pred hcceeEEEEEeecCCe--------------eEEEEeeeecccceeec------cCCceEEecCCCCCCcceecCCCEEEE Confidence 9999864432 2211 12345666666665532 111111111122344667888888887 Q ss_pred cCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCc Q lcl|NC_019527. 251 ITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNL 330 (516) Q Consensus 251 ~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~ 330 (516) .... ...+.+|.|++..|+..+.--..+...-+.++.++++++..-+..... +.++..+-++.+..+.++ T Consensus 181 ~~~~------~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a---~~~ek~~l~~al~~~~~~- 250 (491) T protein:vir:10 181 RQEA------TYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSA---SDGEKNLLLDCLEDMVQD- 250 (491) T ss_pred EecC------CCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCCCC---CHHHHHHHHHHHHHHhcC- Confidence 7543 344678999999999998888888888889999999765433211111 122333334445555443 Q ss_pred ceEEEecCCcceeEEecccC-C----HHHHHHHHHHHHHhh-hcCCceeeeccccccccccchHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 331 GLAVMDFDSEDIVQVNTPLS-G----LADLQSQSQEHMCSV-SKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSY 404 (516) Q Consensus 331 g~~~id~~~e~~e~~~~~ls-g----l~d~~~~~~~~iaaa-s~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~ 404 (516) +..++.. +.+++.+...-+ | ...+++..-++|+-+ .|--+| ..+ +|-+|.|+--....-+.+++... . T Consensus 251 a~~viP~-~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlT---t~~-~gs~a~~~vh~~v~~di~~~D~~-~ 324 (491) T protein:vir:10 251 AVAVVPD-DSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQNQT---TEA-TSTRASAQAGLEVTDDIRDGDKA-V 324 (491) T ss_pred cEEEecC-CceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhhhcc---cCc-ccchhHHHHHHHHHHHHHHHHHH-H Confidence 4445544 478888876532 2 345677777777643 222111 111 34445555555566666666554 4 Q ss_pred HHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCC-CCHHHHHHHHHhhhccCCCCC Q lcl|NC_019527. 405 YFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSV-IDPSEARQQLSDDPDSGWDNI 483 (516) Q Consensus 405 l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gv-i~~~e~r~~l~~~~~~~~~~~ 483 (516) +...+++++.-++.-.||..+ ...|+|...- +..+..|++++++++.|+ ++.+.+++.++ ++.- T Consensus 325 i~~tln~li~~l~~~N~~~~~-~p~f~~~~~~--------e~~~~~a~~~~~L~~~G~~i~~~~i~e~~G------ip~~ 389 (491) T protein:vir:10 325 VSEAMNMLIRWICDLNFDGAD-RPVFDMWEQE--------QVDEIQAGRDQKLTQAGARFTPAYFKRAYN------LQDG 389 (491) T ss_pred HHHHHHHHHHHHHHhcCCCCC-cceEEecCcC--------chhHHHHHHHHHHHhCCCcCCHHHHHHHhC------CCCC Confidence 555667787777666666433 4567774321 333567888899999997 78888988873 3221 Q ss_pred ChhhhccccccchhcCCCCCCCCCCCCCCC--CCC Q lcl|NC_019527. 484 DGDLEIVQPEMFDDDGADPYMPDPDVLPGE--EGS 516 (516) Q Consensus 484 d~~~e~~~~e~~~~e~~~~~~~~~~~~~~~--e~t 516 (516) ..+.+.. + .......+. .+...+.. ... T Consensus 390 ~~~~~~~-~---~~~~~~~~~-~~~~~~~~~~~~~ 419 (491) T protein:vir:10 390 DLDERPL-P---VSAVDTVGA-ASFAEFEAPDQDA 419 (491) T ss_pred CcCcccc-c---cCCCCCccc-ccccccCCCCCCc Confidence 1111000 0 000000000 00000000 000 No 111 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.50 E-value=1e-13 Score=91.85 Aligned_cols=413 Identities=11% Similarity=0.088 Sum_probs=195.8 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhh----HHHHhHHhhcCCCc--cccccCCCCCCCccCCCccchhccc Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMR----RAVMKSMERRASDA--ATKWAPPQLMPGVVPAGTTPAVAMD 74 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~--~~~~~~~~~~~gv~~~~~~~~~a~d 74 (516) |-.| | ++...+..+. ..|.+....+..+- ...|+-... . +++ . T Consensus 1 ~~~~------------p--------~~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~--~-i~~-----~--- 49 (479) T protein:vir:99 1 MIDL------------P--------DEDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQ--E-VPD-----L--- 49 (479) T ss_pred CccC------------C--------cccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCC--c-ccc-----c--- Confidence 4333 1 1222222222 22222222211110 112221110 0 000 0 Q ss_pred ccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHH Q lcl|NC_019527. 75 SLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEAC 154 (516) Q Consensus 75 s~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~ 154 (516) . .... -.-+..+.....+...++||+..++-+.=+||++. +++.. ..+...| T Consensus 50 -----------------~-~~~~-~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~gf~~~--d~~~~-------~~~~~i~ 101 (479) T protein:vir:99 50 -----------------A-TRHK-NKEREVLQQLSRKPWMGLMVNSFAQQLIVDGYRKT--GTNEN-------AKGWDTW 101 (479) T ss_pred -----------------c-cccC-ChhHHHHHHHhhcCcHHHHHHHHHhhcccccccCC--Cchhh-------HHHHHHH Confidence 0 0000 00012233333456699999999998877777653 22111 2345556 Q ss_pred HhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeecccccccccccc---ccc-- Q lcl|NC_019527. 155 EYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTA---PDF-- 229 (516) Q Consensus 155 ~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s---~~y-- 229 (516) ++-++.....++.+.+.+||.|++++.- +.+.. | ..|. ..+++++|.++.+.. .|+.. +-| T Consensus 102 ~~N~~d~~~~~~~~~a~~~G~af~~v~~-~~~~~-----d----~~g~-~~i~~~~p~~~~~iy---dd~~~~~~~~~~~ 167 (479) T protein:vir:99 102 RLNQMDKQQFWLNRAVLTFGYAFIKVTS-GISPL-----D----GTTV-ARIKCIDPRDAFAIW---EDPYWDEWPKYLL 167 (479) T ss_pred HhcChhHHHHHHHHHHhhcCceEEEEec-CCCCc-----C----CCCc-eEEEEechhheEEEe---cCCcccceeeEEE Confidence 6667788889999999999999887642 21110 0 0011 125566666665532 11111 001 Q ss_pred ----------cCcceeEE----eee------Eecc-ce--EEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 230 ----------YKPSTWWV----LGR------EMHA-SR--LLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQ 286 (516) Q Consensus 230 ----------g~P~~y~v----~g~------~iH~-SR--li~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~ 286 (516) +.+..|.+ .+. .=|. .+ ++.|.++ +....+|.|.++.+.+.+.+++++.. T Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~-------~~~~~~g~sd~e~v~~liDa~~~~~s 240 (479) T protein:vir:99 168 ERQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNV-------MDLRGVCYGDVEPLVTVAKAIDKTGL 240 (479) T ss_pred eecCceeEEEEecceEEEEEecCCceeeccccccCCCCcceEEeecC-------CCcCcCCcchhHHHHHHHHHHHHHHH Confidence 00111000 000 0011 11 1112221 11124799999999999999998877 Q ss_pred HHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEe-cccCCHHHHHHHHHHHHHh Q lcl|NC_019527. 287 SVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVN-TPLSGLADLQSQSQEHMCS 365 (516) Q Consensus 287 ~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~-~~lsgl~d~~~~~~~~iaa 365 (516) .....+..++...+.+-... +........ ..+. ....++++..+++-++-+++ .++....+.++....+|++ T Consensus 241 ~~~~~~~~~a~p~~~i~G~~-~~~~~~~~~----~~~~--~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~ 313 (479) T protein:vir:99 241 DILLVQHHQSFQIRWATGLM-LPEGANADQ----EKMR--FAQESMLISQNEKASFGAIPAAPLDGLLNAYKESLLEFLA 313 (479) T ss_pred HHHHHHHHhhchhhhhcCCC-cccccccch----hccc--cccccceeecCCCceEEEecccchHHHHHHHHHHHHHHhc Confidence 76655555554433221111 111110000 0011 11233444444434444443 4456667778888899999 Q ss_pred hhcCCceeeeccccccccccchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhCCCcC---CcceEEeCCCCCCCH Q lcl|NC_019527. 366 VSKIPAIKLTGISPSGLNASSEGEIRSFYDDIS--SVQQSYYFSPLDTMLKVIQLSKWGEID---DAITFKFKSLWQTSA 440 (516) Q Consensus 366 as~IP~t~L~G~sp~Glnatge~D~~~yyd~I~--~~Qe~~l~p~l~~l~~~l~~s~~g~~~---~d~~~~f~pL~~~se 440 (516) .++||.. .||.+ | |+||+.=...+...+. ...+..+++.|++++++++....+..+ -++++.|.+...++. T Consensus 314 ~t~~p~~-~~g~~--~-n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~~~~~~~i~~~w~~~~~~s~ 389 (479) T protein:vir:99 314 LAQLPPH-IAGQI--V-NVAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRTEEATDLDFTITWQDVTIQSL 389 (479) T ss_pred cCCCCHH-Hcccc--c-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCCH Confidence 9999975 66753 2 4677754444443333 223356788999999988766554433 257889999888888 Q ss_pred HHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCC-h-----------------hhh------ccccccch Q lcl|NC_019527. 441 KEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNID-G-----------------DLE------IVQPEMFD 496 (516) Q Consensus 441 kEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d-~-----------------~~e------~~~~e~~~ 496 (516) .+.|+. +.+++++|+++.+.+.+.+- +++.-+ + ... ...+.... T Consensus 390 ~~~ad~-------~~kl~~ag~is~et~l~~l~-----gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (479) T protein:vir:99 390 AQFADA-------WAKMVESLKIPAEGVWDMIP-----NLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNG 457 (479) T ss_pred HHHHHH-------HHHHHhcCCCCHHHHHHhcC-----CCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCC Confidence 886554 45566677777766655441 111000 0 000 00000111 Q ss_pred hcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 497 DDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 497 ~e~~~~~~~~~~~~~~~e~t 516 (516) ..+..++.+.+++..+..+| T Consensus 458 ~~~~~~~~~~~~~~~~~~~~ 477 (479) T protein:vir:99 458 ATNMQQANNKTGEPASLNKS 477 (479) T ss_pred CCCCCCCCCCCcchhccCCC Confidence 11112222333444444444 No 112 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.47 E-value=1.3e-13 Score=91.29 Aligned_cols=439 Identities=11% Similarity=0.034 Sum_probs=206.0 Q ss_pred cccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccch-hcccccccchhhhccc-ccCC Q lcl|NC_019527. 13 VADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPA-VAMDSLCGPTYQFLNS-AAGG 90 (516) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~-~a~ds~~~~~~~~~~~-~~~~ 90 (516) ++. ++++......++... -|. -.. ..+.. -+.|++..+....... .+.. T Consensus 1 ~~r-------~~~~~~~~dr~i~~~-----------------~~~---~~~--~~~~~~~~y~aa~~~r~~~~w~~~~~~ 51 (505) T protein:vir:96 1 MKR-------AEKKPSLAQRMVNWA-----------------WYR---YVE--PQKNAARAFEAARRDRLGKAWLRRASR 51 (505) T ss_pred CCC-------Cccccchhhcccchh-----------------hhh---hHH--HHHHhhhhcccccCCCccccccCCCCC Confidence 100 000011000000000 000 000 00000 0111111110000000 0000 Q ss_pred cc-cccccC---cccHHHHHHHHhCchhhhhhhhhhHHHhh-CCCeeeeccccch-hhhHHHHHHHHHHHHhc------- Q lcl|NC_019527. 91 LY-AADIQP---FPGYQNLAALATRPEYRAFASTLSTELTR-EGIEITSKDRTKA-KEMASKIKELEEACEYY------- 157 (516) Q Consensus 91 ~~-~~~~~~---f~gy~ll~~y~~~~i~r~iVd~~aed~~r-~~~~i~~~~~~~~-~~~~~~i~~i~~~~~~l------- 157 (516) .. ..+... ...-.--.+|+-|++++.+|+......+= .|+.+....+... ....+.-++|+..|++. T Consensus 52 ~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D 131 (505) T protein:vir:96 52 LSADEEIYADLASLVQRAREQSINNPYAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCD 131 (505) T ss_pred CChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcc Confidence 00 000000 00011235677899999999999999994 7999876532211 00112334455555542 Q ss_pred -----ChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccc-cccc-------c Q lcl|NC_019527. 158 -----GVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAY-NALD-------P 224 (516) Q Consensus 158 -----~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~-~~~d-------p 224 (516) .+.+....+++.-..-|-+++.+... .....| + .|.+|+|.+|.-... ...+ . T Consensus 132 ~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~-~~~~~~------------~-~lqliepd~l~~~~n~~~~~~~~i~~GI 197 (505) T protein:vir:96 132 VTGRYHFVTLLHLWMETLARDGEVLVREHRG-YPNKWG------------Y-ALQILECDRLDLNYNADLQNGNRIRMSI 197 (505) T ss_pred eeccCCHHHHHHHHHHHHhhCCceEEEEeec-CCCCcc------------e-EEEEechhhcCCCCCcccCCcCeEEece Confidence 23333444555555667776655432 111111 1 145566665531100 0000 0 Q ss_pred ccccccCcceeEEee-----------------eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 225 TAPDFYKPSTWWVLG-----------------REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQS 287 (516) Q Consensus 225 ~s~~yg~P~~y~v~g-----------------~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~ 287 (516) .-..+|+|..|+|.. ..|..++|+|+-... ...+.-|+|.|-.++..|++++.-..+ T Consensus 198 e~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~~~~rvpa~~vlH~f~~~------r~gQ~RGis~lapvl~~l~~l~~y~da 271 (505) T protein:vir:96 198 ELDAWERPVAYHLLVNHPGDNSYCYHYAGQTYERVPADEIIHTFVPW------RPHQNRGIPWTHASMVELHHIGEYRKS 271 (505) T ss_pred EECCCCceEEEEEeecCCCccccccccccccccccCHhHhhhhhccc------CCccccCcchHHHHHHHHHHHhHHHHH Confidence 112478888888842 235556677665433 334556999999999999888777666 Q ss_pred HHHHHHH-hCC-ceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecc--cCCHHHHHHHHHHHH Q lcl|NC_019527. 288 VSDLVDK-FSR-TFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTP--LSGLADLQSQSQEHM 363 (516) Q Consensus 288 ~~~Ll~~-~~~-~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~--lsgl~d~~~~~~~~i 363 (516) .-.-..- +.+ -++|.+....- ......-..... . -.-|++.....+++++.++.+ -++..++...+...| T Consensus 272 el~~a~i~A~~a~fi~~~~~~~~-~~~~~~~~~~~~---~--l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~i 345 (505) T protein:vir:96 272 EMIAAELGAKKVGFYEQDPEAYD-QPPEDDQGEIVE---E--VEAGTYQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRGV 345 (505) T ss_pred HHHHHHHhhhheeeeecCCccCC-CccccccCcccc---c--cCCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHH Confidence 4442222 222 34455432211 111110001111 1 123444444567889888755 468899999999999 Q ss_pred HhhhcCCceeeeccccccc-cccchHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHhCCCcC--C---c--ceEE Q lcl|NC_019527. 364 CSVSKIPAIKLTGISPSGL-NASSEGEIRSFYDDISSVQQSYY----FSPLDTMLKVIQLSKWGEID--D---A--ITFK 431 (516) Q Consensus 364 aaas~IP~t~L~G~sp~Gl-natge~D~~~yyd~I~~~Qe~~l----~p~l~~l~~~l~~s~~g~~~--~---d--~~~~ 431 (516) |+.+|||.-.|.|- .++- =||.-..+..+...++.+|+.++ +|+.+.+++..+++ |.++ . + .... T Consensus 346 aaglgi~ye~lt~D-~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~--G~i~~p~~~~~~~~~~~ 422 (505) T protein:vir:96 346 AAGMGPAYNRLAHD-LEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLT--QALPLNMVDIDRLSQYA 422 (505) T ss_pred HhhcCCCHHHHhcc-cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc--CCcCCCCccchhhceee Confidence 99999999989884 3332 24555567778888888887654 45555555544433 5543 1 1 1234 Q ss_pred eCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc-ccc----hhcCCCCCCCC Q lcl|NC_019527. 432 FKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP-EMF----DDDGADPYMPD 506 (516) Q Consensus 432 f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~-e~~----~~e~~~~~~~~ 506 (516) |.+ .-..-.| -.|.+++....+++|+.|..++..+.+.+.+..+..+-.+.+..++ .+. .......+.++ T Consensus 423 w~~----p~~~~iD-P~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~ 497 (505) T protein:vir:96 423 FQP----RGWDWVD-PAKDSKAHSESIKNRTRSRSSIIRAAGDDPEDVFDEIAWEEQLMRDKGVNPTPPEQESKDATTDE 497 (505) T ss_pred ecc----CCccccC-hHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCC Confidence 432 1111111 1356777788999999999998776654443222222111110000 000 00001112222 Q ss_pred CCCCCCCC Q lcl|NC_019527. 507 PDVLPGEE 514 (516) Q Consensus 507 ~~~~~~~e 514 (516) +++.++.| T Consensus 498 ~~~~~~d~ 505 (505) T protein:vir:96 498 EDDSASDD 505 (505) T ss_pred CCCCCCCC Confidence 22333333 No 113 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.47 E-value=1.1e-13 Score=91.58 Aligned_cols=425 Identities=13% Similarity=0.055 Sum_probs=203.7 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchh--cccccccchhhhcccccCCcc-cccccC---cccHHHHHHHHhCchhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAV--AMDSLCGPTYQFLNSAAGGLY-AADIQP---FPGYQNLAALATRPEYR 115 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~--a~ds~~~~~~~~~~~~~~~~~-~~~~~~---f~gy~ll~~y~~~~i~r 115 (516) |+--+.. +.+|. +|.. ++.. +-|++..+. ++ ....+.. ..+..+ ...-..-.+++-|++++ T Consensus 1 m~~~~~~----~~a~~--~~~~----~~~~~~~y~aa~~~~-~~--~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~ 67 (495) T protein:vir:10 1 MNMTPSG----YQSLA--SGLL----VPVGASAYEGASGGH-RW--QDIGDYGPDTAVASGIQTLRARSHHNVRNNPWAT 67 (495) T ss_pred CCccccc----ccccc--hhhh----hHHHhhhhhccccCc-cc--CCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHH Confidence 2222221 22221 1110 1111 112211110 00 0000000 000000 00001124567899999 Q ss_pred hhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHh----------cChhHHHHHHHHhcccceeeEEEEEecCC Q lcl|NC_019527. 116 AFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEY----------YGVMGIIQKAAEHDCFFGRGQISINIKGA 185 (516) Q Consensus 116 ~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~----------l~~~~~l~ea~~~~rlyG~a~i~i~i~~~ 185 (516) .+|+......+=.|+......+++ . .-++|+..|++ +.+....+.+++....-|-+++.+..... T Consensus 68 ~av~~~~~~vVG~Gi~p~~~~~~~--~---~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~ 142 (495) T protein:vir:10 68 NAVATWVAAAVGNGLTPRWRMKEQ--E---LRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPL 142 (495) T ss_pred HHHHHHHHhhcCCCcccccCCchH--H---HHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeeccc Confidence 999999999998999887764432 1 22344444443 33444455556666666777766554321 Q ss_pred CcccCcccccccccccceeeEEeecceeec-cccccccccc---------cccccCcceeEEee---------------e Q lcl|NC_019527. 186 DVSVPLILDPRTIKKGSLTGFSNIEPMWTS-PSAYNALDPT---------APDFYKPSTWWVLG---------------R 240 (516) Q Consensus 186 ~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~-p~~~~~~dp~---------s~~yg~P~~y~v~g---------------~ 240 (516) ....+++ + .|.+|+|.+|. |.... .++. -..+|+|..|+|.. . T Consensus 143 ~~g~~~~----------~-~lqliepd~l~~~~~~~-~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~ 210 (495) T protein:vir:10 143 SEGLSVP----------L-QLQIIEPDMLASDIPDE-TLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTV 210 (495) T ss_pred CCCCccc----------e-EEEEechhhcCCCCCCC-CCCCCCEEEeceEECCCCceEEEEEeecCCCccccccccccee Confidence 1111111 1 25566666663 21110 0111 12478888888741 3 Q ss_pred EeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHH-HhCC-ceeeecchhhhcC-----ccH Q lcl|NC_019527. 241 EMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVD-KFSR-TFLKTNMAQVLNG-----GEG 313 (516) Q Consensus 241 ~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~-~~~~-~v~k~~~~~~l~~-----~~~ 313 (516) .|..++|+|+.. +- ..+.-|+|.+..+. .|++++.-..+.-.-.+ .+.+ -+++.+....... ... T Consensus 211 rvpA~~vlH~f~-~r------~gQ~RGis~la~i~-~l~~l~~y~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~ 282 (495) T protein:vir:10 211 WIKAEHVLHVTV-LT------VRSDAGAPWFQLLL-RLNELDQYEDAELVRKKTAALFAAFIQEATADSTGGPTIGQPKR 282 (495) T ss_pred eechhheEeccc-cC------CCcccCcchhHHHH-HHHHhhHHHHHHHHHHHHhhhheeeeecCCCccccccccCcccc Confidence 577888988853 22 23455888886654 46666655554222111 1222 2344432221111 011 Q ss_pred HHHHHHHHHHHHhcCCcceEEEecCCcceeEEecc--cCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHH Q lcl|NC_019527. 314 GDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTP--LSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIR 391 (516) Q Consensus 314 ~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~--lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~ 391 (516) ..-..+.. .-.-|++.....+++++.++.+ -++..++.......||+..|||.-.|.|--.+..=||.-..+. T Consensus 283 ~~~~~~~~-----~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~ 357 (495) T protein:vir:10 283 SKGGKRIT-----GLNPGTLQYLQPGQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLL 357 (495) T ss_pred ccCcccce-----ecCCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHH Confidence 10001111 1123444444567889988754 4689999999999999999999999988532222344555677 Q ss_pred HHHHHHHHHHHH-----HHHHHHHHHHHHHHHHhCCCcCCc-------ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 392 SFYDDISSVQQS-----YYFSPLDTMLKVIQLSKWGEIDDA-------ITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT 459 (516) Q Consensus 392 ~yyd~I~~~Qe~-----~l~p~l~~l~~~l~~s~~g~~~~d-------~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~ 459 (516) .+...+++.|.+ +++|+.+.+++...++.--.+|+. +..+|. ..-.+-.| -.|.+++....++ T Consensus 358 e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~----~p~~~~vD-P~Ke~~A~~~~i~ 432 (495) T protein:vir:10 358 EFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWR----TPRWEEVD-PLKKHLADLGDVR 432 (495) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccc----cCCccccC-hHHHHHHHHHHHH Confidence 788888888765 346677777776655422223332 123332 22111111 1356777888999 Q ss_pred cCCCCHHHHHHHHHhhhccCCCCCChhhhcccc-ccc-h----h--cCCCCCCCCCCCCCCCC Q lcl|NC_019527. 460 NSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP-EMF-D----D--DGADPYMPDPDVLPGEE 514 (516) Q Consensus 460 ~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~-e~~-~----~--e~~~~~~~~~~~~~~~e 514 (516) +|+.|..++..+.+.+.+..+..+..+.+..++ .+. + . .....+.+.+++..+.| T Consensus 433 ~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~p~~~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 433 AGFAPISDKQAERGYDMEELFDMISDANQLIDEYDLRLDSDPRYVNGSGAEQKSVMEAALNNE 495 (495) T ss_pred cCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCcCCCccCCCCCCCCCCCCCC Confidence 999999998776654443222222111110000 000 0 0 00000111111111111 No 114 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=99.47 E-value=2e-13 Score=90.17 Aligned_cols=429 Identities=8% Similarity=0.026 Sum_probs=202.3 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCC--ccccccCCCCCCCccCCCccchhccccccc Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASD--AATKWAPPQLMPGVVPAGTTPAVAMDSLCG 78 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~ 78 (516) |++--|.-|-+-+...-=+.+. .........-..+++....+..+ ....|+..... +....+ .++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~---i~~r~~---~~~---- 67 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEEVVEQLK---PQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDND---IVKQMK---KVD---- 67 (474) T ss_pred CcceeecCCCCchhhHHHHhhh---hccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCc---hhcccc---ccc---- Confidence 7775444332222111001110 11111111222333222111111 01122221100 000000 000 Q ss_pred chhhhcccccCCcccccccCcccHHHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhc Q lcl|NC_019527. 79 PTYQFLNSAAGGLYAADIQPFPGYQNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYY 157 (516) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l 157 (516) . .. ....+ ....+ .+.+++.||+..+.-++.+++++++.++.. .+.|+..++ = T Consensus 68 ---------~-~~-----~~~~~---~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~-------~~~l~~~~~-n 121 (474) T protein:vir:95 68 ---------V-YG-----NIDYD---KPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDESV-------LKIIHDVLD-T 121 (474) T ss_pred ---------c-cc-----ccccc---cccceeccchHHHHHHHHHhhhccCCceeccCchHH-------HHHHHHHHh-c Confidence 0 00 00000 01111 357899999999999999999998765432 123333333 3 Q ss_pred ChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEE Q lcl|NC_019527. 158 GVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV 237 (516) Q Consensus 158 ~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v 237 (516) ++...+.++.+....||.|++++.+++.. .+ .+.+++|..+.|..- ..+...+.+. -.+|.. T Consensus 122 ~~~~~~~e~~~~~~~~G~~~~~v~~d~~~---------------~~-~i~~~~p~~~~~v~d-~~~~~~~~~~-i~~~~~ 183 (474) T protein:vir:95 122 RWDNKLIDILTATSNKGIDWLQVYINENG---------------EM-KLFRVPAEQAIPIWV-DKEREELKSF-IRYYKF 183 (474) T ss_pred cHHHHHHHHHHHHhhcCcEEEEEEecCCC---------------ce-EEEEEcccceEEEEc-CCCCCceEEE-EEEEEE Confidence 67788999999999999999988765421 11 134445554444311 0000011111 111111 Q ss_pred eee---E-eccceEEEecC-------------------------CcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 238 LGR---E-MHASRLLTIIT-------------------------RPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSV 288 (516) Q Consensus 238 ~g~---~-iH~SRli~~~~-------------------------~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~ 288 (516) .+. . +.+.++.+|.. ..+|-+. -.++-.|.|.++.+.+.+.+++.+.... T Consensus 184 ~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~-~~nn~~g~sd~e~v~~liDa~d~~~S~~ 262 (474) T protein:vir:95 184 NNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIA-FKNNPEEVSDIWMYKSLIDAIDKRLSDA 262 (474) T ss_pred cCeeEEEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEe-ecCCCCCCCcHHHHHHHHHHHHHHHHHH Confidence 110 1 11122222210 0111111 1224569999999999999999998888 Q ss_pred HHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhh Q lcl|NC_019527. 289 SDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSV 366 (516) Q Consensus 289 ~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaa 366 (516) +.-+..++..++...... ..+.......+ ...+++.++++ .+++.+ ..+.+++...++.+.++|... T Consensus 263 ~~~~~~~~~p~lv~~g~~---~~~~~~~~~~~-------~~~~~i~~~~~-~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 331 (474) T protein:vir:95 263 QNMFDESVELIYILKGYE---GQDLEEFMRGL-------KYYKAINVDGD-GGVETIQVEVPVSSTKEYIDLMRAYIMEF 331 (474) T ss_pred HHHHHHhcCceeeeecCC---cccchhhhhhh-------hccceeeccCC-CceeEEeecCCHHHHHHHHHHHHHHHHHH Confidence 887777776665532211 11111221211 12334444443 445444 466778899999999999999 Q ss_pred hcCCceeeeccccccccccchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhCCCc-CCcceEEeCCCCCCCHHHH Q lcl|NC_019527. 367 SKIPAIKLTGISPSGLNASSEGEIRSFYDDIS--SVQQSYYFSPLDTMLKVIQLSKWGEI-DDAITFKFKSLWQTSAKEE 443 (516) Q Consensus 367 s~IP~t~L~G~sp~Glnatge~D~~~yyd~I~--~~Qe~~l~p~l~~l~~~l~~s~~g~~-~~d~~~~f~pL~~~sekEk 443 (516) +++|-. .++ +.+| |.||..=...|..... ...+..++..+++++++|..-..... ..++++.|++-...+++|. T Consensus 332 s~~p~~-~~~-~~~~-n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~d~~~i~v~f~~~~p~d~~e~ 408 (474) T protein:vir:95 332 GQGVDF-QTD-KFGS-APSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNLKMDVKDIEISFNFNRMMNDAEQ 408 (474) T ss_pred hCCccc-ccc-cccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCCCcCHHHH Confidence 999952 222 2222 4566543333333222 33345678889999988765433333 2478899999888888887 Q ss_pred HHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccccc-------chhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 444 SEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEM-------FDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 444 Aei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~-------~~~e~~~~~~~~~~~~~~~e~t 516 (516) |++. .++|+||.+.+...+... ...+...+....|. .......+...++++.++.+.+ T Consensus 409 a~~~----------~~~g~iS~et~i~~l~~v-----~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 473 (474) T protein:vir:95 409 SQII----------AQSQYLSRETLVKSSPLV-----DDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNDKES 473 (474) T ss_pred HHHH----------HhcCCCchHHHHHhCCCC-----CCHHHHHHHHHHHHHHHHhcccccccccCCCCcCCCCCccCCC Confidence 7642 345888887777654210 10011111111110 0000111111122222222222 No 115 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.45 E-value=1.4e-13 Score=91.04 Aligned_cols=436 Identities=8% Similarity=-0.013 Sum_probs=209.1 Q ss_pred CCCCCcc----CCCccchhc----ccccccchhhhcccccCCcc-ccccc-C--cccHHHHHHHHhCchhhhhhhhhhHH Q lcl|NC_019527. 57 QLMPGVV----PAGTTPAVA----MDSLCGPTYQFLNSAAGGLY-AADIQ-P--FPGYQNLAALATRPEYRAFASTLSTE 124 (516) Q Consensus 57 ~~~~gv~----~~~~~~~~a----~ds~~~~~~~~~~~~~~~~~-~~~~~-~--f~gy~ll~~y~~~~i~r~iVd~~aed 124 (516) .-+||+. ++..++... ++++.+.-.+..+-...... ..+.. . ...-.--.+++-|++++.+|+..... T Consensus 1 ~~~p~~~~~~~~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~n 80 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDH 80 (533) T ss_pred CCCchhhhhhcccccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Confidence 1112211 111111110 11000000000000000000 00000 0 00011234567899999999999999 Q ss_pred HhhCCCeeeecccc-----chhhhHHHHHHHHHHHHhc--------------ChhHHHHHHHHhcccceeeEEEEEecCC Q lcl|NC_019527. 125 LTREGIEITSKDRT-----KAKEMASKIKELEEACEYY--------------GVMGIIQKAAEHDCFFGRGQISINIKGA 185 (516) Q Consensus 125 ~~r~~~~i~~~~~~-----~~~~~~~~i~~i~~~~~~l--------------~~~~~l~ea~~~~rlyG~a~i~i~i~~~ 185 (516) .+=.|+.+...-+- +.+...++-++|+..|++. .+......+++....-|-+++.+.... T Consensus 81 vVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~- 159 (533) T protein:vir:34 81 IVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDT- 159 (533) T ss_pred hhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeeecc- Confidence 99999998774210 0011122334555555432 233444455555566677776654432 Q ss_pred CcccCcccccccccccceeeEEeecceeeccccccccc-------cccccccCcceeEEee-----------------eE Q lcl|NC_019527. 186 DVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALD-------PTAPDFYKPSTWWVLG-----------------RE 241 (516) Q Consensus 186 ~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~d-------p~s~~yg~P~~y~v~g-----------------~~ 241 (516) ....+++ + .|.+|+|.+|.-. ++..+ ..-..+|+|..|+|.. .. T Consensus 160 ~~g~~~~----------~-~lq~ie~d~l~~~-~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~ 227 (533) T protein:vir:34 160 SSSRLFR----------T-QFRMVSPKRISNP-NNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELP 227 (533) T ss_pred CCCCccc----------e-EEEEechhhcCCC-CCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeec Confidence 1111111 1 1555666555421 11000 1123467788887731 23 Q ss_pred eccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHH-hCC-ceeeecchh-----hhcCc--- Q lcl|NC_019527. 242 MHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDK-FSR-TFLKTNMAQ-----VLNGG--- 311 (516) Q Consensus 242 iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~-~~~-~v~k~~~~~-----~l~~~--- 311 (516) |+.++|||+-... ...+-.|+|.|-.++..|.+++.-..+--.-..- +.+ -++|.+... .+... T Consensus 228 v~a~~VlH~f~~~------r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~ 301 (533) T protein:vir:34 228 GGRASFIHVFEPV------EDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQ 301 (533) T ss_pred cChhHeeeecccc------CCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCcc Confidence 5667888875433 3344569999999999998887776654332221 222 334443211 11111 Q ss_pred -cHHHHHHHHHHHHHh------cCCcceEEEecCCcceeEEecc--cCCHHHHHHHHHHHHHhhhcCCceeeeccccccc Q lcl|NC_019527. 312 -EGGDVFDRVEMYVNM------QSNLGLAVMDFDSEDIVQVNTP--LSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGL 382 (516) Q Consensus 312 -~~~~l~~r~~~~~~~------~sn~g~~~id~~~e~~e~~~~~--lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gl 382 (516) ....+.......... .-.-|++.....+++++.++.+ -++..++...+...||+..|||.-.|.|--.... T Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~n 381 (533) T protein:vir:34 302 EQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMS 381 (533) T ss_pred cccccccccchhhhhccCcceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhccccc Confidence 111121111111100 0123444444566888888744 4689999999999999999999999988532333 Q ss_pred cccchHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHhCCCcCCcc------------eEEeCCCCCCCHHHHHHH Q lcl|NC_019527. 383 NASSEGEIRSFYDDISSVQQSYYF----SPLDTMLKVIQLSKWGEIDDAI------------TFKFKSLWQTSAKEESEI 446 (516) Q Consensus 383 natge~D~~~yyd~I~~~Qe~~l~----p~l~~l~~~l~~s~~g~~~~d~------------~~~f~pL~~~sekEkAei 446 (516) =||.-..+..+...++.+|...+. |+.+.+++..+++....+|... ...|. .....-.| T Consensus 382 YSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~----~p~~~~iD- 456 (533) T protein:vir:34 382 YSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWI----GSGRMAID- 456 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeec----cCCccccC- Confidence 355666778888899999976654 5555555544444322333211 23332 22222111 Q ss_pred HHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc-ccchhcC----CCCCCCCCCCCCCCCCC Q lcl|NC_019527. 447 RFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP-EMFDDDG----ADPYMPDPDVLPGEEGS 516 (516) Q Consensus 447 ~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~-e~~~~e~----~~~~~~~~~~~~~~e~t 516 (516) -.|.+++....+++|+.|..++..+.+.+.+..+..+..+.+..++ .+..... .....+.+++.++.+.+ T Consensus 457 P~Ke~~a~~~~i~~G~~s~~~~~a~~G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~~~~~~~~~~~ 531 (533) T protein:vir:34 457 GLKEVQEAVMLIEAGLSTYEKECAKRGDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSDSR 531 (533) T ss_pred hHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCCCCCCCcccCC Confidence 1456778888999999999998777655544333222111111100 0100000 00011122222222222 No 116 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.45 E-value=2.9e-13 Score=89.30 Aligned_cols=433 Identities=8% Similarity=0.009 Sum_probs=204.3 Q ss_pred hhcCCCccccccCCCCCCCcc-CCCccc--hh--cccccccchhhhcccccCCccc-cccc-C--cccHHHHHHHHhCch Q lcl|NC_019527. 43 ERRASDAATKWAPPQLMPGVV-PAGTTP--AV--AMDSLCGPTYQFLNSAAGGLYA-ADIQ-P--FPGYQNLAALATRPE 113 (516) Q Consensus 43 ~~~~~~~~~~~~~~~~~~gv~-~~~~~~--~~--a~ds~~~~~~~~~~~~~~~~~~-~~~~-~--f~gy~ll~~y~~~~i 113 (516) .+ .|.+. |++..+ .+ ++.++.+.-.+.++-.+..... .+.. . ...-.--.+++-|++ T Consensus 1 ~~--------------~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~ 66 (530) T protein:vir:38 1 MK--------------IPSLVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGY 66 (530) T ss_pred Cc--------------cceeecCccccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChH Confidence 00 00000 011000 00 0111000000000000000000 0000 0 000112345678999 Q ss_pred hhhhhhhhhHHHhhCCCeeeecccc-----chhhhHHHHHHHHHHHHh--------------cChhHHHHHHHHhcccce Q lcl|NC_019527. 114 YRAFASTLSTELTREGIEITSKDRT-----KAKEMASKIKELEEACEY--------------YGVMGIIQKAAEHDCFFG 174 (516) Q Consensus 114 ~r~iVd~~aed~~r~~~~i~~~~~~-----~~~~~~~~i~~i~~~~~~--------------l~~~~~l~ea~~~~rlyG 174 (516) ++++|+......+=.|+.+...-+- +.+.-.++-++|+..|++ +.+.+...-+++....-| T Consensus 67 a~~av~~~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dG 146 (530) T protein:vir:38 67 AANAVQLHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNG 146 (530) T ss_pred HHHHHHHHHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCC Confidence 9999999999999999998764210 011112334556666643 223344444555555667 Q ss_pred eeEEEEEecCCCcccCcccccccccccceeeEEeecceeecccccccc-c------cccccccCcceeEEee-------- Q lcl|NC_019527. 175 RGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNAL-D------PTAPDFYKPSTWWVLG-------- 239 (516) Q Consensus 175 ~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~-d------p~s~~yg~P~~y~v~g-------- 239 (516) -+++.+.... ....|+++ .|.+|++.+|.-. ++.. + ..-..+|+|..|+|.. T Consensus 147 E~~~~~~~~~-~~g~~~~~-----------~lq~ie~d~l~~~-~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~ 213 (530) T protein:vir:38 147 ELCVQATWDS-DSTRLFRT-----------QFKMVSPKRVSNP-NNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMA 213 (530) T ss_pred ceEEEeeecc-CCCCccce-----------EEEEechhhcCCC-CCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccc Confidence 7776654332 11112211 1455555554311 1000 0 0112467788777731 Q ss_pred ---------eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHH-hC-Cceeeecchh-- Q lcl|NC_019527. 240 ---------REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDK-FS-RTFLKTNMAQ-- 306 (516) Q Consensus 240 ---------~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~-~~-~~v~k~~~~~-- 306 (516) ..|+..+|||+-... ...+.-|+|.|-.++..|++++.-..+.-.-..- +. .-++|.+... T Consensus 214 ~~~~~~~~~~~v~a~~vlH~f~~~------r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~ 287 (530) T protein:vir:38 214 QNWTYIPRELPGGRPSFIHVFEPM------EDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQS 287 (530) T ss_pred cccceeeeeeccChhHeEeecccc------CCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccc Confidence 235556788876443 2345569999999999998887776654332222 22 2344543211 Q ss_pred ---hhcCccH----HHHH----HHHHHHHH--hcCCcceEEEecCCcceeEEecc--cCCHHHHHHHHHHHHHhhhcCCc Q lcl|NC_019527. 307 ---VLNGGEG----GDVF----DRVEMYVN--MQSNLGLAVMDFDSEDIVQVNTP--LSGLADLQSQSQEHMCSVSKIPA 371 (516) Q Consensus 307 ---~l~~~~~----~~l~----~r~~~~~~--~~sn~g~~~id~~~e~~e~~~~~--lsgl~d~~~~~~~~iaaas~IP~ 371 (516) .+...+. ..+. .+....+. ..-.-|++.....+++++.++.+ -++..++...+...||+..|||. T Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~y 367 (530) T protein:vir:38 288 AMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSY 367 (530) T ss_pred cccccccCCcccccccccccchhhhhcccccceeccCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCH Confidence 1111111 0111 11111000 01123444444556888888755 46889999999999999999999 Q ss_pred eeeeccccccc-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--hCCCc--CCcc------------eEEeC- Q lcl|NC_019527. 372 IKLTGISPSGL-NASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLS--KWGEI--DDAI------------TFKFK- 433 (516) Q Consensus 372 t~L~G~sp~Gl-natge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s--~~g~~--~~d~------------~~~f~- 433 (516) -.|.|- .++- =||.-..+..+...++++|+..+.+++..+.+..+.. .-|.+ |... ..+|. T Consensus 368 e~lt~D-~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~ 446 (530) T protein:vir:38 368 EQLSRN-YSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIG 446 (530) T ss_pred HHHhcc-cccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeec Confidence 999884 3333 3556667888889999999877655544444433221 12444 3211 12332 Q ss_pred -CCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccc-ccchhc-C-CCC----CCC Q lcl|NC_019527. 434 -SLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQP-EMFDDD-G-ADP----YMP 505 (516) Q Consensus 434 -pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~-e~~~~e-~-~~~----~~~ 505 (516) ..-..|. .|.+++....+++|+.|..++..+.+.+.+..+..+-.+.+..+. .+.... . .++ ..+ T Consensus 447 p~~~~iDP-------~Ke~~a~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~ 519 (530) T protein:vir:38 447 SGRMAIDG-------LKEVQEAVMLIEAGLSTYEKECAKRGDDYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAGVKKS 519 (530) T ss_pred CCccccCh-------HHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCCCCCC Confidence 2222222 456777788999999999998776655443332222111111000 010000 0 000 111 Q ss_pred CCCCCCCCCCC Q lcl|NC_019527. 506 DPDVLPGEEGS 516 (516) Q Consensus 506 ~~~~~~~~e~t 516 (516) +++++.+..++ T Consensus 520 ~~~~~d~~~~a 530 (530) T protein:vir:38 520 NEEEQDGARAA 530 (530) T ss_pred CCCCCCCCCCC Confidence 11111111112 No 117 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.45 E-value=6.9e-13 Score=87.25 Aligned_cols=415 Identities=14% Similarity=0.082 Sum_probs=208.2 Q ss_pred HHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcc--cccCCcc-cccccCcccHHHHHHH---- Q lcl|NC_019527. 36 RAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLN--SAAGGLY-AADIQPFPGYQNLAAL---- 108 (516) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~--~~~~~~~-~~~~~~f~gy~ll~~y---- 108 (516) -.+.+.+ . ++.. .++-++-... .++.......+... ..+-+.. ........-++-+..| T Consensus 1 ~~~~~~~----~----~~~~----~~~~~e~i~~--~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (474) T protein:vir:10 1 MTLYKLI----D----DIEA----QGILPKHIEA--LIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGG 66 (474) T ss_pred CchHHHH----h----hccc----cCCCHHHHHH--HHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcc Confidence 0000000 0 0000 1111100000 01111111111110 1111100 0000000000111111 Q ss_pred -----------H-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceee Q lcl|NC_019527. 109 -----------A-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRG 176 (516) Q Consensus 109 -----------~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a 176 (516) + .+.+++.||+..+.-++.+++++.+.++.+.++ ...+.|...+++-++.....++.+...+||.| T Consensus 67 ~~~~~~~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e--~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a 144 (474) T protein:vir:10 67 NVRRLDVSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNE--KLKKFITNFAIRNSVDDEDSEIGKMAAICGYG 144 (474) T ss_pred cccccccCcccccccchHHHHHHhHhhheeccceeEeeCCCCcchH--HHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeE Confidence 1 278999999999999999999998865543322 22345666677778999999999999999999 Q ss_pred EEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe----e-----eEe-ccce Q lcl|NC_019527. 177 QISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL----G-----REM-HASR 246 (516) Q Consensus 177 ~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~----g-----~~i-H~SR 246 (516) ++++.++... .+ .+.+++|.++.|..-+..++. ++ -.+|.+. + ..+ ...+ T Consensus 145 ~~~~~~d~~~---------------~~-~~~~i~p~~~~~v~d~~~~~~---~~-i~~~~~~~~~~~~~~~~~~~y~~~~ 204 (474) T protein:vir:10 145 ARLAYIDTNG---------------DI-RIKNIDPYNVIFVGDNILEPT---YS-LRYFYEKDDDNGTDYVYAEFYDNAY 204 (474) T ss_pred EEEEEeCCCC---------------ee-EEEEEcccceEEEEcCCCceE---EE-EEEEEEeeCCCceEEEEEEEEcCce Confidence 9988764321 11 144455555444321111111 00 0111110 0 000 1222 Q ss_pred EEEecCCc----------------chhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcC Q lcl|NC_019527. 247 LLTIITRP----------------LPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNG 310 (516) Q Consensus 247 li~~~~~~----------------~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~ 310 (516) +.+|.+.. +|-. .-.++.+|.|.++.+.+.+.+++.+....+..+..++..++.+.... + T Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~g~vPvv-~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~-~-- 280 (474) T protein:vir:10 205 YYVFRGEGIDALQEVGRYEHLFDYNPLF-GVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMG-M-- 280 (474) T ss_pred EEEEeecCCCcccccccccCCCCccceE-EecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCC-C-- Confidence 22222211 1111 12345679999999999999999999888888887777665542211 1 Q ss_pred ccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEe--cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchH Q lcl|NC_019527. 311 GEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEG 388 (516) Q Consensus 311 ~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~ 388 (516) .++.+ ..+ . ..|+..+..++.+++.+. .+.++....++.+.++|...+++|-.-. + +.+| |.||.. T Consensus 281 -~~~~~-~~~------~-~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~-~~~~-n~Sg~A 348 (474) T protein:vir:10 281 -SEEMI-QET------Q-KSGAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNS-D-EFNG-NVPIIG 348 (474) T ss_pred -Cchhh-hhh------h-hcceeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCccccc-c-cccc-cchHHH Confidence 11111 111 1 234444444556677665 4456788889999999999999996432 2 1223 567764 Q ss_pred HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHh----CCCcC---CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 389 EIRSFYDDI--SSVQQSYYFSPLDTMLKVIQLSK----WGEID---DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT 459 (516) Q Consensus 389 D~~~yyd~I--~~~Qe~~l~p~l~~l~~~l~~s~----~g~~~---~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~ 459 (516) =...|.... ...++..++..++++++++..-. .+..+ .++++.|.+-...+++|.|++..+. T Consensus 349 l~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl--------- 419 (474) T protein:vir:10 349 MKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL--------- 419 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH--------- Confidence 333333322 24445677888888888766421 11122 3689999999999999998876553 Q ss_pred cCCCCHHHHHHHHHhhhccCCCCCChhhhccccccch--hcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 460 NSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFD--DDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 460 ~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~--~e~~~~~~~~~~~~~~~e~t 516 (516) .|++|.+.+.+.+... ...+...+..+.|..+ ....+...++.++.++.+.| T Consensus 420 ~g~iS~et~~~~l~~v-----~d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~~~~s 473 (474) T protein:vir:10 420 KGQVSERTRLGQSQLV-----DDVDYELDEMEKESLEFNDKLPDIDEGDANDKSQNNQS 473 (474) T ss_pred hccCchHHHHHhCCCC-----CCHHHHHHHHHHHHHHHHhhcccccCCCcCCCCccccC Confidence 3888888887765211 1111111111111111 11122223333333333333 No 118 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.45 E-value=6.9e-13 Score=87.25 Aligned_cols=415 Identities=14% Similarity=0.082 Sum_probs=208.2 Q ss_pred HHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcc--cccCCcc-cccccCcccHHHHHHH---- Q lcl|NC_019527. 36 RAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLN--SAAGGLY-AADIQPFPGYQNLAAL---- 108 (516) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~--~~~~~~~-~~~~~~f~gy~ll~~y---- 108 (516) -.+.+.+ . ++.. .++-++-... .++.......+... ..+-+.. ........-++-+..| T Consensus 1 ~~~~~~~----~----~~~~----~~~~~e~i~~--~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (474) T protein:vir:94 1 MTLYKLI----D----DIEA----QGILPKHIEA--LIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGG 66 (474) T ss_pred CchHHHH----h----hccc----cCCCHHHHHH--HHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcc Confidence 0000000 0 0000 1111100000 01111111111110 1111100 0000000000111111 Q ss_pred -----------H-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceee Q lcl|NC_019527. 109 -----------A-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRG 176 (516) Q Consensus 109 -----------~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a 176 (516) + .+.+++.||+..+.-++.+++++.+.++.+.++ ...+.|...+++-++.....++.+...+||.| T Consensus 67 ~~~~~~~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e--~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a 144 (474) T protein:vir:94 67 NVRRLDVSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNE--KLKKFITNFAIRNSVDDEDSEIGKMAAICGYG 144 (474) T ss_pred cccccccCcccccccchHHHHHHhHhhheeccceeEeeCCCCcchH--HHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeE Confidence 1 278999999999999999999998865543322 22345666677778999999999999999999 Q ss_pred EEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe----e-----eEe-ccce Q lcl|NC_019527. 177 QISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL----G-----REM-HASR 246 (516) Q Consensus 177 ~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~----g-----~~i-H~SR 246 (516) ++++.++... .+ .+.+++|.++.|..-+..++. ++ -.+|.+. + ..+ ...+ T Consensus 145 ~~~~~~d~~~---------------~~-~~~~i~p~~~~~v~d~~~~~~---~~-i~~~~~~~~~~~~~~~~~~~y~~~~ 204 (474) T protein:vir:94 145 ARLAYIDTNG---------------DI-RIKNIDPYNVIFVGDNILEPT---YS-LRYFYEKDDDNGTDYVYAEFYDNAY 204 (474) T ss_pred EEEEEeCCCC---------------ee-EEEEEcccceEEEEcCCCceE---EE-EEEEEEeeCCCceEEEEEEEEcCce Confidence 9988764321 11 144455555444321111111 00 0111110 0 000 1222 Q ss_pred EEEecCCc----------------chhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcC Q lcl|NC_019527. 247 LLTIITRP----------------LPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNG 310 (516) Q Consensus 247 li~~~~~~----------------~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~ 310 (516) +.+|.+.. +|-. .-.++.+|.|.++.+.+.+.+++.+....+..+..++..++.+.... + T Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~g~vPvv-~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~-~-- 280 (474) T protein:vir:94 205 YYVFRGEGIDALQEVGRYEHLFDYNPLF-GVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMG-M-- 280 (474) T ss_pred EEEEeecCCCcccccccccCCCCccceE-EecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCC-C-- Confidence 22222211 1111 12345679999999999999999999888888887777665542211 1 Q ss_pred ccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEe--cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchH Q lcl|NC_019527. 311 GEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEG 388 (516) Q Consensus 311 ~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~ 388 (516) .++.+ ..+ . ..|+..+..++.+++.+. .+.++....++.+.++|...+++|-.-. + +.+| |.||.. T Consensus 281 -~~~~~-~~~------~-~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~-~~~~-n~Sg~A 348 (474) T protein:vir:94 281 -SEEMI-QET------Q-KSGAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNS-D-EFNG-NVPIIG 348 (474) T ss_pred -Cchhh-hhh------h-hcceeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCccccc-c-cccc-cchHHH Confidence 11111 111 1 234444444556677665 4456788889999999999999996432 2 1223 567764 Q ss_pred HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHh----CCCcC---CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 389 EIRSFYDDI--SSVQQSYYFSPLDTMLKVIQLSK----WGEID---DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT 459 (516) Q Consensus 389 D~~~yyd~I--~~~Qe~~l~p~l~~l~~~l~~s~----~g~~~---~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~ 459 (516) =...|.... ...++..++..++++++++..-. .+..+ .++++.|.+-...+++|.|++..+. T Consensus 349 l~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl--------- 419 (474) T protein:vir:94 349 MKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL--------- 419 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH--------- Confidence 333333322 24445677888888888766421 11122 3689999999999999998876553 Q ss_pred cCCCCHHHHHHHHHhhhccCCCCCChhhhccccccch--hcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 460 NSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFD--DDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 460 ~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~--~e~~~~~~~~~~~~~~~e~t 516 (516) .|++|.+.+.+.+... ...+...+..+.|..+ ....+...++.++.++.+.| T Consensus 420 ~g~iS~et~~~~l~~v-----~d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~~~~s 473 (474) T protein:vir:94 420 KGQVSERTRLGQSQLV-----DDVDYELDEMEKESLEFNDKLPDIDEGDANDKSQNNQS 473 (474) T ss_pred hccCchHHHHHhCCCC-----CCHHHHHHHHHHHHHHHHhhcccccCCCcCCCCccccC Confidence 3888888887765211 1111111111111111 11122223333333333333 No 119 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.44 E-value=1.1e-13 Score=91.63 Aligned_cols=429 Identities=12% Similarity=0.048 Sum_probs=208.3 Q ss_pred hhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcc---cccccchhhhcccccCCccccccc---CcccHHHH Q lcl|NC_019527. 32 LAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAM---DSLCGPTYQFLNSAAGGLYAADIQ---PFPGYQNL 105 (516) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~---ds~~~~~~~~~~~~~~~~~~~~~~---~f~gy~ll 105 (516) |.+.. +.+..+.|- -++ .-.+.+..+ |++.++. .............+.. ..+.-.-- T Consensus 1 mn~~d-----------r~i~~~sP~---~~~--~R~~ar~~~~~y~aa~~~r-~~~~~~~~~s~~~~~~~~~~~lr~RaR 63 (502) T protein:vir:79 1 MAILD-----------DVIGVFSPG---WKA--ARLRSRAVIQAYEAVKTTR-THKARRENRTADQLSQYGAVSLREQAR 63 (502) T ss_pred CchHh-----------hHHhhcChH---HHH--HHHhhHHHHhhccccCccc-ccCCCCCCCChHHHHHHHHHHHHHHHH Confidence 11111 111111110 000 000001111 1110000 0000000000000000 00000113 Q ss_pred HHHHhCchhhhhhhhhhHHHhh-CCCeeeeccccch-hhhHHHHHHHHHHHH----------hcChhHHHHHHHHhcccc Q lcl|NC_019527. 106 AALATRPEYRAFASTLSTELTR-EGIEITSKDRTKA-KEMASKIKELEEACE----------YYGVMGIIQKAAEHDCFF 173 (516) Q Consensus 106 ~~y~~~~i~r~iVd~~aed~~r-~~~~i~~~~~~~~-~~~~~~i~~i~~~~~----------~l~~~~~l~ea~~~~rly 173 (516) .+|+-|++++++|+....-.+= .|+.+...-.... ....+.-++|+..|+ ++.+.....-+++.-..- T Consensus 64 dl~rNn~~a~~av~~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~d 143 (502) T protein:vir:79 64 YLDNNHDLVIGVFDKLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRD 143 (502) T ss_pred HHHhcChHHHHHHHHHHHhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhC Confidence 5677899999999999999985 5777755432221 111223455666665 334444455566666667 Q ss_pred eeeEEEEEecC-CC--cccCcccccccccccceeeEEeecceeeccccccccc-----cccccccCcceeEEee------ Q lcl|NC_019527. 174 GRGQISINIKG-AD--VSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALD-----PTAPDFYKPSTWWVLG------ 239 (516) Q Consensus 174 G~a~i~i~i~~-~~--~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~d-----p~s~~yg~P~~y~v~g------ 239 (516) |-+++.+..+. .. ...++++ .|.+|+|.+|. ...+..+ ..-..+|+|..|+|.. T Consensus 144 GE~f~~~~~~~~~~~~~g~~~~l-----------~lq~iepd~l~-~~~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~ 211 (502) T protein:vir:79 144 GEVFAQMVSGRINSLTPSAGVHF-----------WLEALEPDFIP-MTSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSG 211 (502) T ss_pred CceEEEEeecccCccCCCcccce-----------EEEEecchhcC-CCCCCCCeeEeeeEECCCCceEEEEEeecCCCCC Confidence 77777665422 11 1122221 25566666653 1111100 0123578999998852 Q ss_pred -----eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHH-HhCC-ceeeecchhhhcCc- Q lcl|NC_019527. 240 -----REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVD-KFSR-TFLKTNMAQVLNGG- 311 (516) Q Consensus 240 -----~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~-~~~~-~v~k~~~~~~l~~~- 311 (516) ..|+.++|+|+-... ...+.-|+|.|-.++..|++++.-..+.-.-.. .+.+ -+++++........ T Consensus 212 ~~~~~~rvpA~~vlH~f~~~------r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~ 285 (502) T protein:vir:79 212 RQMETKEVDAERMLHLKFVR------RLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDG 285 (502) T ss_pred cccceeEechhheEEeeccc------CCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccccc Confidence 468889999886543 334556999999999999888877665433222 2222 23454332211110 Q ss_pred cHHHHHHHHHHHHHhcCCcceEE-EecCCcceeEEecc--cCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchH Q lcl|NC_019527. 312 EGGDVFDRVEMYVNMQSNLGLAV-MDFDSEDIVQVNTP--LSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEG 388 (516) Q Consensus 312 ~~~~l~~r~~~~~~~~sn~g~~~-id~~~e~~e~~~~~--lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~ 388 (516) .+..-..... .-..|+.+ ....+++++.++.+ -++..++.......||+.+|||.-.|.|-- ++.-||.-. T Consensus 286 ~~~~~~~~~~-----~l~pG~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~-s~nySs~R~ 359 (502) T protein:vir:79 286 NGSKENEREL-----TIQPGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNY-NGTYSAQRQ 359 (502) T ss_pred CCCCCccccc-----cccCCccccccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccc-cchHHHHHH Confidence 0000000000 01134432 23456888888744 468999999999999999999999999864 444455566 Q ss_pred HHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHhCCCcC--C------cceEEeC--CCCCCCHHHHHHHHHHHHHHH Q lcl|NC_019527. 389 EIRSFYDDISSVQQSYY----FSPLDTMLKVIQLSKWGEID--D------AITFKFK--SLWQTSAKEESEIRFNKAQEA 454 (516) Q Consensus 389 D~~~yyd~I~~~Qe~~l----~p~l~~l~~~l~~s~~g~~~--~------d~~~~f~--pL~~~sekEkAei~~~~a~a~ 454 (516) .+..+...++.+|+..+ +|+.+.+++..+++ |.++ . -+...|. ..-..|. .|.+++. T Consensus 360 ~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~--G~i~~p~~~~~~~~~~~~W~~p~~~~iDP-------~Ke~~a~ 430 (502) T protein:vir:79 360 ELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVAS--GVIRLPRDLDRSSLYTAVYSGPVMPWIDP-------VKEAEAW 430 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc--CCCCCCCCCCchhhcceeeecCCccccCh-------HHHHHHH Confidence 77888899999997544 44555555544443 5443 2 1233442 2222222 3466777 Q ss_pred HHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhc----------------cccccchhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 455 QIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEI----------------VQPEMFDDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 455 ~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~----------------~~~e~~~~e~~~~~~~~~~~~~~~e 514 (516) ...+++|+.|..++..+.+.+.+..+..+..+.+. ........+++ +++++.+.++ T Consensus 431 ~~~i~~Gl~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~e----~~~~~~~~e~ 502 (502) T protein:vir:79 431 KIQIRGGAATESDWVRAGGRNPDDVKRRRKAEIDENRKLDLVFDTDPASDKGGSSAATKRQE----PQHTDDQSEE 502 (502) T ss_pred HHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCC----CCCCCCCCCC Confidence 88899999999888766544433222211111000 00000000001 1111111222 No 120 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=99.44 E-value=5e-13 Score=88.03 Aligned_cols=434 Identities=14% Similarity=0.098 Sum_probs=205.2 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) |=-| |+|+|.- +.. + ...+.. ...-|.+|...+|..+=. .-+++.... T Consensus 1 ~~~~--~~w~~~d-------------e~~-----~---~~~~~~----~~~~~~~p~~~dG~s~i~----~~~~~~~~~- 48 (533) T protein:vir:58 1 MPSL--EKYKKLN-------------EAV-----N---FTNFLS----PMYGMGAPHGAGGSSMIP----INMYHPFAT- 48 (533) T ss_pred CCCc--chhhhhh-------------HHH-----H---HHHhhc----hhhcccCccCCCCCcccc----CCCCcchhh- Confidence 3333 2222110 000 0 000111 111245665555541100 011111000 Q ss_pred hhhcccccCCcccccccCcccHHHHHHHH----hCchhhhhhhhhhHHHhhC-----CCeeeeccccchhhhHHHHHHHH Q lcl|NC_019527. 81 YQFLNSAAGGLYAADIQPFPGYQNLAALA----TRPEYRAFASTLSTELTRE-----GIEITSKDRTKAKEMASKIKELE 151 (516) Q Consensus 81 ~~~~~~~~~~~~~~~~~~f~gy~ll~~y~----~~~i~r~iVd~~aed~~r~-----~~~i~~~~~~~~~~~~~~i~~i~ 151 (516) ...+.+++++ ..+..++|...|+ +|+.+..+|+.++++|+-+ .+++...+.+- + +.+. T Consensus 49 ~~~~~~~~gg------~~~n~~eLI~~YR~ma~~~pEVd~AideIvneaiv~d~~~~pV~v~l~~~e~----s---~~iK 115 (533) T protein:vir:58 49 AGYASRFYGG------IEFNRFFLYDMYDRMDYTDPLISTVLDIIADECTIPNENGNIVDVVTKDIEL----A---KAIL 115 (533) T ss_pred hhhhhhhhcc------ccccHHHHHHHHHHhhccCcchhhHHHhhhceeeEecCCCceeEeecccccc----c---HHHH Confidence 0111122221 2345678888885 4799999999999999742 22232222221 1 2222 Q ss_pred HH-HHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeecccccccccccccccc Q lcl|NC_019527. 152 EA-CEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFY 230 (516) Q Consensus 152 ~~-~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg 230 (516) .. .+-|++...-.+.+|.--++|..+.-.++++ .+++|+.|+.+||..+.....-.++ ..|+ T Consensus 116 ~kI~~lldf~~~~~~~fR~WYVDGriy~Hkiik~--------------~k~GI~elr~lDPr~i~~vr~~~t~---~eyy 178 (533) T protein:vir:58 116 SYLDYVINIEKNAYPIIRNMIKYGDMFLHILEKG--------------SDGTIEKFQVVSPYIFSKRYNPETD---TWYY 178 (533) T ss_pred HHHHHHhcchhhhhHHHHhhhhcceeEEEeccCC--------------cccchhhheecCCeeeEEEEeeccc---eEEE Confidence 22 2334677777777776667666554443331 1245666788888877654322122 1111 Q ss_pred Ccc--ee-EE---eeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc----ee Q lcl|NC_019527. 231 KPS--TW-WV---LGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT----FL 300 (516) Q Consensus 231 ~P~--~y-~v---~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~----v~ 300 (516) -.. +. .. .+.+|+++.|+++...- .......++|.|..+...+.+.--. ..|-++++.+.. |+ T Consensus 179 vy~~~~~~~~s~~~~~kI~~daI~y~~SGl-----~d~~~~~iisyLhkAiKp~NQLkmi--EDAlVIYRisRAPeRRvF 251 (533) T protein:vir:58 179 VITDVYRNVVSGYFNEDIPEEDVIHFSHKI-----DTNFFPYGRSYLESARAIWNQLRLM--EDALMLYRVVRSVDRRVF 251 (533) T ss_pred eecccccccccCccccccchhheeeeeecc-----ccCCCCceehhhhHHHHHHHHHHHH--HHHHHHHhhcCChhheEE Confidence 100 00 01 13689999999887532 2334567889999985555544332 234567776654 66 Q ss_pred eecchhhhcCccHHH----HHHHHHHHHHhcCCcceEEEecC--------------------CcceeEEe-cccCCHHHH Q lcl|NC_019527. 301 KTNMAQVLNGGEGGD----VFDRVEMYVNMQSNLGLAVMDFD--------------------SEDIVQVN-TPLSGLADL 355 (516) Q Consensus 301 k~~~~~~l~~~~~~~----l~~r~~~~~~~~sn~g~~~id~~--------------------~e~~e~~~-~~lsgl~d~ 355 (516) -+|+.++-.. ..++ ++.+++.-..+-+++|-+.-|.. +-+++++. -+|+.+ +- T Consensus 252 YIDVGNlpk~-KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpGg~lgem-eD 329 (533) T protein:vir:58 252 YVDVGNVPPD-KINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQGSKVDLA-ED 329 (533) T ss_pred EEeecCCCcc-CHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeeecCCCCCCcH-HH Confidence 6665553222 1122 22222111112222332211110 11344442 123334 44 Q ss_pred HHHHHHHHHhhhcCCceeeeccccccccccchH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-CCcceEEe Q lcl|NC_019527. 356 QSQSQEHMCSVSKIPAIKLTGISPSGLNASSEG--EIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI-DDAITFKF 432 (516) Q Consensus 356 ~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~--D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~-~~d~~~~f 432 (516) +.+|+..+-.+.++|.++|-.. .|++.+++= |.-.|..+|.+.|..+ ..++++- ++ + + |.+ +++|.|+| T Consensus 330 V~YF~kkLy~ALnVP~sRl~~e--~~fgr~~eItRDEiKF~KFI~rLR~rF-~~ll~~q--Li-l-k-~iit~eew~~~f 401 (533) T protein:vir:58 330 VEYMLNRLISALKVPKAFIGYE--GDVNAKNTLATQDIKFNNTIKRIQGFF-VEELERM--VR-M-N-KEFADQDFRLVM 401 (533) T ss_pred HHHHHHHHHHHhCCCeeecCCC--CCCccchhhhHHHHHHHHHHHHHHHHH-HHHHhcc--cc-c-c-cCcchhheeeee Confidence 6799999999999999999543 477777765 5666999999999764 4555442 12 2 2 333 67899999 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHH-----------Hhh----hccCCCCCChhhhccccccchh Q lcl|NC_019527. 433 KSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQL-----------SDD----PDSGWDNIDGDLEIVQPEMFDD 497 (516) Q Consensus 433 ~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l-----------~~~----~~~~~~~~d~~~e~~~~e~~~~ 497 (516) +.=.--+|-..+|+...+..+++.+- +.+.-+-+++.+ ... ..+.|..-+. ++...+...+. T Consensus 402 ~~Dn~f~ElKe~Eil~~Ri~~l~~~d--pyvgk~yi~k~ILr~tdei~~q~e~ie~E~~~~~~~~~~~-~~e~~~~~~~~ 478 (533) T protein:vir:58 402 NRSNSIVEGERFAVIEQRIGIAERLK--GWVREDWIYSNILQIPYDLKPQEEVAEAAGGGGLFDTGGF-GEETTPADFLG 478 (533) T ss_pred eccchHHHHHHHHHHHHHHHHHHHhc--chhhHHHHHHHHhcCChhhhHHHHHHHHhhcCCCCCCCCc-ccccCCcccCc Confidence 98888888888999888888776532 122222222221 111 1111211110 11111111111 Q ss_pred cCCCCCCCCCC-------CCCCCC------------------CC Q lcl|NC_019527. 498 DGADPYMPDPD-------VLPGEE------------------GS 516 (516) Q Consensus 498 e~~~~~~~~~~-------~~~~~e------------------~t 516 (516) +..+|-..+.+ .+|+++ |. T Consensus 479 ~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g~ 522 (533) T protein:vir:58 479 ERGSPIESPRGRTEFDFGTEGGEELGGELNLGGAFEEFEEETGG 522 (533) T ss_pred cccCcccCCCChhhHhcccCCcccccccccccccchhhhhhcCC Confidence 11111000000 000000 00 No 121 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=99.43 E-value=1.3e-12 Score=85.66 Aligned_cols=399 Identities=9% Similarity=-0.010 Sum_probs=194.1 Q ss_pred CCCCccCCCccchhcccccccchhhhcccccCCccc----ccccCcccHHHHHHHH-hCchhhhhhhhhhHHHhhCCCee Q lcl|NC_019527. 58 LMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYA----ADIQPFPGYQNLAALA-TRPEYRAFASTLSTELTREGIEI 132 (516) Q Consensus 58 ~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~----~~~~~f~gy~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i 132 (516) +..+-+- ...+++ .....++.|.+. .....-.+. .-.+ .+.+++.||+..+.-++.++.++ T Consensus 1 ~~~~~~~-~~~~r~----------~~l~~yy~g~~~~~~~~~~~~~~~~---~~~ki~~n~~~~ivd~~~~~l~g~~~~~ 66 (440) T protein:vir:95 1 MLAAFLG-SQKQRL----------AILASYAQGDNFSILSGHRRLDDEK---ADYRVRHKWGGYISSFATGYVIGNPVSI 66 (440) T ss_pred ChhhHHH-HHHHHH----------HHHHHHhccCCcccccccccccccC---CcceeecchHHHHHHhhhhheeccCceE Confidence 1111100 111111 111112211110 000000010 0111 46888999999999999999999 Q ss_pred eeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecce Q lcl|NC_019527. 133 TSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPM 212 (516) Q Consensus 133 ~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~ 212 (516) ++.++++.+ ....|...+++-++...+.++.+...+||.|++++.++... .| .+.+++|. T Consensus 67 ~~~~~~~~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~--~~--------------~i~~~~p~ 126 (440) T protein:vir:95 67 GVMEGGSAD----QLSTIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDK--VD--------------RVVLISPL 126 (440) T ss_pred eeCCCccHH----HHHHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCC--ce--------------EEEEEccc Confidence 876654322 24557777778889999999999999999999988764321 01 13333444 Q ss_pred eeccccccccccccccccCcceeEEeee---Ee-ccceEEEecC------------------CcchhhhhhccCCCCchH Q lcl|NC_019527. 213 WTSPSAYNALDPTAPDFYKPSTWWVLGR---EM-HASRLLTIIT------------------RPLPDMLKPAYNFSGISM 270 (516) Q Consensus 213 ~v~p~~~~~~dp~s~~yg~P~~y~v~g~---~i-H~SRli~~~~------------------~~~p~~~k~~~~~~G~S~ 270 (516) ++.|..-.. ....+.|. -.+|..... .| -+.++.++.. ..+|-.. -.++-+|.|. T Consensus 127 ~~~~~~d~~-~~~~~~~~-i~~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~-~~n~~~g~sd 203 (440) T protein:vir:95 127 EMFVIRDLT-VEQNIIAA-VHLPIYADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVE-WWNNRFRMGD 203 (440) T ss_pred ceEEEEcCC-CCCceEEE-EEEEEecCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEE-eeCCCCCCCc Confidence 433321000 00001110 000100000 00 0111111100 0011111 1234579999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEE--ecc Q lcl|NC_019527. 271 SQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQV--NTP 348 (516) Q Consensus 271 le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~ 348 (516) ++.+.+.+.+++.+....+.-+..++...+.+.........+++....-...-.............+++.+++.+ +.+ T Consensus 204 ~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~ 283 (440) T protein:vir:95 204 YESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQYD 283 (440) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhhhhhccceecccccccccCCCCcceeEEeecCC Confidence 999999999999999888888887777666543321111111222111111000000011011111222334444 456 Q ss_pred cCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHh---CC Q lcl|NC_019527. 349 LSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFY---DDISSVQQSYYFSPLDTMLKVIQLSK---WG 422 (516) Q Consensus 349 lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yy---d~I~~~Qe~~l~p~l~~l~~~l~~s~---~g 422 (516) .+++...++.+.+.|...+++|-.-+ +. .+| |.||+.=...|. ..++. ++..++..+++++++++.-. .| T Consensus 284 ~~~~~~~~~~l~~~i~~~s~~p~~~~-~~-~~~-n~Sg~Al~~~~~~l~~k~~~-k~~~~~~~l~~~~~li~~~~~~~~~ 359 (440) T protein:vir:95 284 VNGTEAYKNRLANDIHRFSRIPNLDD-DR-FNS-TSSGIALLYKMIGLEQVRKD-KETYFTKALRRRYELISNIHKAING 359 (440) T ss_pred HHHHHHHHHHHHHHHHHHhCCccccc-cc-ccc-cchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcCC Confidence 67888999999999999999996433 21 122 456664222222 23333 34567888888888765432 12 Q ss_pred -Cc-CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCCh-h-hhccccccc-h- Q lcl|NC_019527. 423 -EI-DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDG-D-LEIVQPEMF-D- 496 (516) Q Consensus 423 -~~-~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~-~-~e~~~~e~~-~- 496 (516) .. ..++++.|++-...++++.|++..+. .|+|+.+.+.+.| +.++. + .+....|.. + T Consensus 360 ~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl---------~g~iS~et~~~~l--------~~~d~~~E~~ri~~E~~~~~ 422 (440) T protein:vir:95 360 PVIEANKLTFTFHPNIPQDVWTEIKAYIEA---------GGEISQETLMENA--------SFTDYKTEHSRILKQGGSSD 422 (440) T ss_pred cccccccceEEeCCCCCCCHHHHHHHHHHH---------hccCcHHHHHHhC--------CCCCcHHHHHHHHHHHHHhh Confidence 22 24789999999999999988876542 4788877776654 12221 1 111111100 0 Q ss_pred hcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 497 DDGADPYMPDPDVLPGEE 514 (516) Q Consensus 497 ~e~~~~~~~~~~~~~~~e 514 (516) .+.....+...+...+.| T Consensus 423 ~~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 423 LEIGQIVGDADVGQADTE 440 (440) T ss_pred hhHHhhccCCCCCCcCCC Confidence 000000011111111111 No 122 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=99.43 E-value=4.1e-13 Score=88.49 Aligned_cols=439 Identities=12% Similarity=0.111 Sum_probs=207.4 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccc-cccc Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDS-LCGP 79 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds-~~~~ 79 (516) ||-+-+..|+|.... ..+.+.+..... ++.++++. .... T Consensus 3 ~~~~ik~~~~~~~~~--------------------~~~~~~~~~i~d--------------------~~~i~~~~~~~~~ 42 (505) T protein:vir:79 3 FWDTLKNLFRKGSAA--------------------VGMTKSLGQIID--------------------DPRINLPADEVER 42 (505) T ss_pred hHHHHHHHHHHhhhh--------------------hcchhhhhhhhc--------------------ccCCCCCHHHHHH Confidence 555544333221110 000011111000 11122211 0000 Q ss_pred hhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcCh Q lcl|NC_019527. 80 TYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGV 159 (516) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~ 159 (516) ...+-.-+.+..++...+---|.+.-..+....+++.||+..|+-++.+...|+..++.. -+.|++.+++-++ T Consensus 43 i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i~~~~A~ll~~e~~~i~~~d~~~-------~e~l~~i~~~n~f 115 (505) T protein:vir:79 43 IARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLASAKLASLIFNEQCQVTVSDETA-------NDFLDDVFQQNDF 115 (505) T ss_pred HHHHHHHhcCCCccccccccCCCccccceeecchHHHHHHHHHhhhcCCCceeecCChHH-------HHHHHHHHHhccH Confidence 111111111111111111000111111233447899999999999999998888765322 2457777888789 Q ss_pred hHHHHHHHHhcccceeeEEEEEecCCCc----ccCcccccccccccceeeEEeecceeecccccccccccccccc-Ccc- Q lcl|NC_019527. 160 MGIIQKAAEHDCFFGRGQISINIKGADV----SVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFY-KPS- 233 (516) Q Consensus 160 ~~~l~ea~~~~rlyG~a~i~i~i~~~~~----~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg-~P~- 233 (516) +..+.+++..+..+|++++-+.++++.+ ..|-.+-|.....+.+..+..+.++..... ....|+ .-+ T Consensus 116 ~~~~~~~~e~a~a~G~~~~k~~~D~~~~~i~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~-------~~~~~yt~lE~ 188 (505) T protein:vir:79 116 YTTFEEKLEEWIALGSGCVRPYVDSGKIKLAWATADQVYPLQADTNQVNELAIASRTTEVEN-------HRTIYYTLLEF 188 (505) T ss_pred HHHHHHHHHHHhhcCCeEEEEEEeCCceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEEecC-------CcceEEEEEEE Confidence 9999999999999999988887765432 111110010111222333333332221110 000011 011 Q ss_pred ------eeEEe------------eeEeccceE---------EEecCCcchhh--------h-hhccCCCCchHHHHHHHH Q lcl|NC_019527. 234 ------TWWVL------------GREMHASRL---------LTIITRPLPDM--------L-KPAYNFSGISMSQLAQPY 277 (516) Q Consensus 234 ------~y~v~------------g~~iH~SRl---------i~~~~~~~p~~--------~-k~~~~~~G~S~le~~~~~ 277 (516) .|+|. |..|.-+.+ ..+.|-+.|.+ . ......+|+|++..+.+. T Consensus 189 h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~ 268 (505) T protein:vir:79 189 HQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTV 268 (505) T ss_pred EEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHH Confidence 12222 111111110 11112111110 0 111245799999999999 Q ss_pred HHHHHHHHHHHHHHHHHhCCceeee-cchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEeccc--CCHHH Q lcl|NC_019527. 278 VENWLRTRQSVSDLVDKFSRTFLKT-NMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPL--SGLAD 354 (516) Q Consensus 278 l~~~~~~~~~~~~Ll~~~~~~v~k~-~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~l--sgl~d 354 (516) +..++.+......-+......++-- .+......+.+.........+.....-+..+..++.+..+++++.++ ....+ T Consensus 269 id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~~~~~~fd~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~ 348 (505) T protein:vir:79 269 IDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQASETHPPMFDPDETVYQAMYGDASEVGFHDATSPIRVADYQA 348 (505) T ss_pred HHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCcccccccccCCCccceeeeeccCCCCCCceEEecccCCHHHHHH Confidence 9999988877776555544443331 11111111111110000000110000111122233345677777655 45667 Q ss_pred HHHHHHHHHHhhhcCCceeeeccccccccccchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------- Q lcl|NC_019527. 355 LQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEG---EIRSFYDDISSVQQSYYFSPLDTMLKVIQLSK----------- 420 (516) Q Consensus 355 ~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~---D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~----------- 420 (516) .++.+.++|+..+|++..- ||...+|. .|+.. ....-|.++..+|. .++..|+.|+..++... T Consensus 349 ~l~~~l~~i~~~~g~s~~~-~~~~~~~~-~TAtei~s~~~~l~~t~~~~~~-~~~~al~~li~~i~~~~~~~~~~~~g~~ 425 (505) T protein:vir:79 349 TMDFFLREFENQTGLSQGT-FTTSPSGI-QTATEVVTNNSQTYQTRSSYIT-QVEKTIKALTYAILELASVPSFYADGQA 425 (505) T ss_pred HHHHHHHHHHHHhCCChhh-cCCCcccc-chHHHHHHHHhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcccccccc Confidence 7888888999999997653 45444554 34432 23445777777764 46778888877776421 Q ss_pred --CCCcCC-cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh-hccccccch Q lcl|NC_019527. 421 --WGEIDD-AITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL-EIVQPEMFD 496 (516) Q Consensus 421 --~g~~~~-d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~-e~~~~e~~~ 496 (516) .+.+++ +++|.|++-...++.+.++.. .+++++|+++.++++... .+++++. +..-+++ . T Consensus 426 ~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~-------~~~v~~Gi~s~e~~l~~~--------~~~~eeea~~el~ri-~ 489 (505) T protein:vir:79 426 RWTGDVDSLDITINFNDGVFVDQESKRAAD-------LQAVQAQVMPKKQFLMRN--------YGLDEEEADEWLAQI-D 489 (505) T ss_pred cccCCCCceeEEEEeCCCCCCCHHHHHHHH-------HHHHHcCCCCHHHHHHhc--------CCCChHHHHHHHHHH-H Confidence 123333 789999998888877765443 557889999998877553 3343222 1111111 1 Q ss_pred hcCCCCCCCCCCCCCCC Q lcl|NC_019527. 497 DDGADPYMPDPDVLPGE 513 (516) Q Consensus 497 ~e~~~~~~~~~~~~~~~ 513 (516) +|. ....|+..+..|+ T Consensus 490 ~E~-~~~~p~~~~~gg~ 505 (505) T protein:vir:79 490 AEN-STAEPEFNQFGGD 505 (505) T ss_pred Hhc-cccCCCchhccCC Confidence 111 1122333333333 No 123 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.43 E-value=7.6e-12 Score=81.54 Aligned_cols=450 Identities=13% Similarity=0.061 Sum_probs=196.1 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchh---c--ccccccchhhhcccccCCcccccccC-cccHHHHHHHHhCchhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAV---A--MDSLCGPTYQFLNSAAGGLYAADIQP-FPGYQNLAALATRPEYR 115 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~---a--~ds~~~~~~~~~~~~~~~~~~~~~~~-f~gy~ll~~y~~~~i~r 115 (516) |......+.+ -+-..+|+ ++.....+ . .+.. .........++-|.+.....+ -..-.+-....-..+.+ T Consensus 1 ~~~~~~~~~~---~~~~~~~l-~~~e~~~i~~L~~~~~~~-~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~ 75 (504) T protein:vir:99 1 MTEETTSASK---FTFRIPEL-NDDVVDKVNGLYQQLVDR-TPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSA 75 (504) T ss_pred CCccCCcccc---cccccCCC-CHHHHHHHHHHHHHHHHH-hHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHH Confidence 2111111111 11112342 22221111 0 0000 011111112222221111110 01122333334456689 Q ss_pred hhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCc--cc Q lcl|NC_019527. 116 AFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPL--IL 193 (516) Q Consensus 116 ~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl--~l 193 (516) +||+..++-+.=+||.+....+. -..+.+.|++=++.....++.+...+||.|++++.-++..-..|+ .+ T Consensus 76 ~iVd~~a~rl~~~Gf~~~d~~~~--------~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~ 147 (504) T protein:vir:99 76 KAVDTLARRCNLESFVWPDGDYG--------SIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVK 147 (504) T ss_pred HHHHHHHhhhccceeeCCCCChh--------hHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEe Confidence 99999999887789876432211 134666677777888899999999999999988754321111111 01 Q ss_pred ccccccccceeeEEeeccee--ecc-cccccccc----ccccccCcce-eEEe---eeEeccceEEEecCCcchhh-hh- Q lcl|NC_019527. 194 DPRTIKKGSLTGFSNIEPMW--TSP-SAYNALDP----TAPDFYKPST-WWVL---GREMHASRLLTIITRPLPDM-LK- 260 (516) Q Consensus 194 d~~~I~~g~l~~l~v~d~~~--v~p-~~~~~~dp----~s~~yg~P~~-y~v~---g~~iH~SRli~~~~~~~p~~-~k- 260 (516) +|+. -+.++|+.. +.. ..+...|. ..-.++.|.. |.+. +......+.-+.-|-|+..+ .+ T Consensus 148 sP~~-------~~~iyD~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~gvPvV~~~n~~ 220 (504) T protein:vir:99 148 SAMQ-------ATGEWNSRRNAMDSLLSITSRDAEGHPTGIALYEDGVTVTADMDDDGDWHADVRTHKLGVPVEVLPYKP 220 (504) T ss_pred ccce-------eEEEEeCCCCceeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCceeeeccccCCCCcceEEecccc Confidence 1111 122222211 000 00000111 1111222221 2211 11111111111112222111 11 Q ss_pred hccCCCCchHH-HHHHHHHHHHHHHHHHHHHHHHHhCCceee---ecchhhhcCccH---HHHHHHHHHHHHhcCCcceE Q lcl|NC_019527. 261 PAYNFSGISMS-QLAQPYVENWLRTRQSVSDLVDKFSRTFLK---TNMAQVLNGGEG---GDVFDRVEMYVNMQSNLGLA 333 (516) Q Consensus 261 ~~~~~~G~S~l-e~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k---~~~~~~l~~~~~---~~l~~r~~~~~~~~sn~g~~ 333 (516) .....+|.|.+ +.+++.+.++.++......-..-++.+... .+.... ...++ ..+..++..+...-.+.... T Consensus 221 ~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~-~~~d~~~~~~~~~~~~~i~~~~~~~~~~ 299 (504) T protein:vir:99 221 REDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAKNF-RNKDGSMKPAWQIALARVFALPDDEDEP 299 (504) T ss_pred cCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCcccc-ccccccccchhhhhhhhhhcCCCccccc Confidence 11335688876 467777777777766554434434433222 111111 11111 12222222222221111111 Q ss_pred EEecCCcceeEEe-cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHH---HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 334 VMDFDSEDIVQVN-TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGE---IRSFYDDISSVQQSYYFSPL 409 (516) Q Consensus 334 ~id~~~e~~e~~~-~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D---~~~yyd~I~~~Qe~~l~p~l 409 (516) +..+..-++.+++ +++.++.+.+.....+||+.++||.. -||.....-|+||+.= .......++.+|+ .+...+ T Consensus 300 ~~~~~~~~~~q~~~~~l~~~~~~l~~~i~~~a~~t~~P~~-~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~-~f~~~l 377 (504) T protein:vir:99 300 DAARARADVKQFPASSPQPHIEMLEQIAMMFSGETSIPVE-SLGFSNRANPTSADAYIASREDLIAEAEGATD-DWSPAF 377 (504) T ss_pred cccCccceeeecCCCChHHHHHHHHHHHHHHHhhhCCCHH-HhcccccccccHHHHHHHHHHHHHHHHHHHHH-HHHHHH Confidence 1111122344443 44667778888899999999999964 5565533445777653 3344455555554 568889 Q ss_pred HHHHHHHHHHhC--CCcCC---cceEEeCCCCCCCHHHHHHHHHHHHHHHH-------HHHH-cCCCCHHHHHHHHHhhh Q lcl|NC_019527. 410 DTMLKVIQLSKW--GEIDD---AITFKFKSLWQTSAKEESEIRFNKAQEAQ-------IYIT-NSVIDPSEARQQLSDDP 476 (516) Q Consensus 410 ~~l~~~l~~s~~--g~~~~---d~~~~f~pL~~~sekEkAei~~~~a~a~~-------~~~~-~gvi~~~e~r~~l~~~~ 476 (516) ++++++++.... +..++ ++++.|.+....|..+.|+...|.+++.. .+.. .| ++++|+.+...... T Consensus 378 ~~~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg-~~~~ei~r~~~e~~ 456 (504) T protein:vir:99 378 RRSMIRALAIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLG-LTPQQAKRALAERR 456 (504) T ss_pred HHHHHHHHHHhcCCCccccccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcC-CCHHHHHHHHHHHH Confidence 998887655433 23343 46789999999999999988777666532 1222 24 36666654332111 Q ss_pred c-cC---CCCC-Chhh-hccccccchhcCCCCCC---CCCCCCCCCCC Q lcl|NC_019527. 477 D-SG---WDNI-DGDL-EIVQPEMFDDDGADPYM---PDPDVLPGEEG 515 (516) Q Consensus 477 ~-~~---~~~~-d~~~-e~~~~e~~~~e~~~~~~---~~~~~~~~~e~ 515 (516) . .. ...+ +.+. ........++...++++ +..+..|..+| T Consensus 457 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~a~~~~~~~~~~p~~~~ 504 (504) T protein:vir:99 457 RASSVSIIEALNRRQQEAATAGEDQDQGAGEPPANEPPAALGRPTLVG 504 (504) T ss_pred HHhhHHHHHHHhcccCCCCCCCCCCCcCCCCCCCCCCCccCCCcccCC Confidence 0 00 0000 0000 00000001111111111 23345666666 No 124 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=99.41 E-value=1.2e-12 Score=86.01 Aligned_cols=404 Identities=7% Similarity=-0.010 Sum_probs=194.1 Q ss_pred CCCc-cCCCccchh-ccccccc--------------------chhhhcccccCCccccccc---Cccc---HHHHHHHH- Q lcl|NC_019527. 59 MPGV-VPAGTTPAV-AMDSLCG--------------------PTYQFLNSAAGGLYAADIQ---PFPG---YQNLAALA- 109 (516) Q Consensus 59 ~~gv-~~~~~~~~~-a~ds~~~--------------------~~~~~~~~~~~~~~~~~~~---~f~g---y~ll~~y~- 109 (516) |.-+ .|-+..-.- -+++... ........++.|-+.--.. .... +...+-.+ T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki 80 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDWRM 80 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcccccccccccccccccccee Confidence 2222 111100000 0011000 0000011111111100000 0000 00000011 Q ss_pred hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCccc Q lcl|NC_019527. 110 TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSV 189 (516) Q Consensus 110 ~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~ 189 (516) .+.+++.||+..+.-++.+++++++.++.. .+.|...++ -++...+.++.+....||.|++++..+... T Consensus 81 ~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~-------~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g--- 149 (478) T protein:vir:10 81 YTNYHQNLVDQKVAYAVANPVTFGVDNDKA-------LKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYVDEEG--- 149 (478) T ss_pred ccchHHHHHHHHHhhhccCCeeeecCChHH-------HHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEEEEecCCC--- Confidence 357889999999999999999998765432 233444443 478899999999999999999887664321 Q ss_pred CcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEeee---Eec-cceEEEecC------------- Q lcl|NC_019527. 190 PLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGR---EMH-ASRLLTIIT------------- 252 (516) Q Consensus 190 Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~---~iH-~SRli~~~~------------- 252 (516) .+ .+.+++|..+.|..-. .....+.++ -.+|.+.+. .+| +.++.++.. T Consensus 150 ------------~~-~~~~~~p~~~~~i~d~-~~~~~~~~~-v~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~ 214 (478) T protein:vir:10 150 ------------EF-KTFRVPAEQAVPIWTN-KERDELQAF-IRVYELDGAERVEYWTKDDVTYYELKEGQLIPDFYRSD 214 (478) T ss_pred ------------ee-EEEEEcccceEEEEcC-CCCCceEEE-EEEEEecCceEEEEEeCCeEEEEEEcCCeeeccccccc Confidence 11 1444555554443110 001111111 111222111 111 112211100 Q ss_pred ----------------CcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHH Q lcl|NC_019527. 253 ----------------RPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDV 316 (516) Q Consensus 253 ----------------~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l 316 (516) ..+|-+ .-.++.+|.|.++.+.+.+.+++.+....+.-+..++..++....... .+.... T Consensus 215 ~~~~~~~~~~~~~~~~~~vPvv-~~~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~---~~~~~~ 290 (478) T protein:vir:10 215 DHIQPHYYQGNKLMSWGRVPFI-PFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEG---EDMKDF 290 (478) T ss_pred cccccceecccccccCCccceE-EeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCc---cccchh Confidence 011111 112456799999999999999999988888887777776665432211 111111 Q ss_pred HHHHHHHHHhcCCcceEEEec-CCccee--EEecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHH Q lcl|NC_019527. 317 FDRVEMYVNMQSNLGLAVMDF-DSEDIV--QVNTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSF 393 (516) Q Consensus 317 ~~r~~~~~~~~sn~g~~~id~-~~e~~e--~~~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~y 393 (516) ...+. ..+++.+.+ ++.+.+ +...+.+++...++.+.+.|...+++|-.-.-+ - |-|.||..=...| T Consensus 291 ~~~~~-------~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~-~~n~Sg~Al~~~~ 360 (478) T protein:vir:10 291 MHNLK-------YYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDK--F-GNSPSGIALKFMY 360 (478) T ss_pred hhhhh-------hcceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccc--c-ccccHHHHHHHHH Confidence 11111 122333322 223344 445677889999999999999999999643211 1 2256776433333 Q ss_pred HHHHH--HHHHHHHHHHHHHHHHHHHHHhCCCcC-CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHH Q lcl|NC_019527. 394 YDDIS--SVQQSYYFSPLDTMLKVIQLSKWGEID-DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQ 470 (516) Q Consensus 394 yd~I~--~~Qe~~l~p~l~~l~~~l~~s~~g~~~-~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~ 470 (516) ..... ...+..+...++++++++..-..+..+ .++++.|++-...++++.|++..+ + +|+||.+.+.+ T Consensus 361 ~~l~~k~~~~~~~~~~~l~~~~~li~~~~g~~~~~~~i~i~f~~~~p~d~~e~a~~~~k-------l--~g~iS~et~~~ 431 (478) T protein:vir:10 361 SNLDLKANKLKNKTLTALQELLQYIIDFYRLDVKVQDIEITFNFNVMVNELENSQIAMN-------S--TGLLSKETILS 431 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEecCCCCCCHHHHHHHHHH-------H--hCCCChHHHHH Confidence 33322 444566888899998887654433332 478999999999999998776543 2 57788777766 Q ss_pred HHHhhhccCCCCCChhhhccccccchhcC--CC-CCCCCCCCCCCCCCC Q lcl|NC_019527. 471 QLSDDPDSGWDNIDGDLEIVQPEMFDDDG--AD-PYMPDPDVLPGEEGS 516 (516) Q Consensus 471 ~l~~~~~~~~~~~d~~~e~~~~e~~~~e~--~~-~~~~~~~~~~~~e~t 516 (516) .+... ...+.+.+....|..+... .+ ..+...+.+++.+.. T Consensus 432 ~l~~v-----~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 475 (478) T protein:vir:10 432 NHAWV-----EDPVAEMERIEQENIELNQQLPDIEEGLNGEQQRQSENN 475 (478) T ss_pred hCCCC-----CCHHHHHHHHHHHHHHHHhhccccccccCCCCCCCCCCC Confidence 65210 1111111111111000000 00 000011111111111 No 125 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.40 E-value=8.9e-13 Score=86.65 Aligned_cols=420 Identities=7% Similarity=-0.030 Sum_probs=192.3 Q ss_pred hhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhccccccc--chhhhcccccCCcccccccC---cccH---HHH Q lcl|NC_019527. 34 MRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCG--PTYQFLNSAAGGLYAADIQP---FPGY---QNL 105 (516) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~--~~~~~~~~~~~~~~~~~~~~---f~gy---~ll 105 (516) +.+++.-..+....- |..+.. +-.....-..+ .+.... ........++-|-+.--... +..+ ... T Consensus 1 ~~~~~~~~~~~~~~~----~~~~~~--~~~~~~~i~~~-i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~ 73 (472) T protein:vir:93 1 MYPSQPTQTEIFDAI----VRTNNK--PETLEEMIVRY-IKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLK 73 (472) T ss_pred CCCCCCcchhhhhce----eeecCc--hhhHHHHHHHH-HHHHHHHHHHHHHHHHHhccccccccccchhhccccccccc Confidence 111111111111110 111100 00000000000 000000 00000111111111000000 0000 011 Q ss_pred HHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecC Q lcl|NC_019527. 106 AALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKG 184 (516) Q Consensus 106 ~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~ 184 (516) .-.+ .+.+++.||+..+.-++.+++++.+.+++. .+.|+..++. ++...+.++.+....||.|++++..++ T Consensus 74 ~~~ri~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~-------~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~~~v~~d~ 145 (472) T protein:vir:93 74 PDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEV-------VKRIDEVLGN-RFDDKLHSVLTGASNKGIEWLHPYLDE 145 (472) T ss_pred cccccccchHHHHHHHHhhhhcccCeeeccCChHH-------HHHHHHHHhc-cHHHHHHHHHHHHhhcCeEEEEEEECC Confidence 1111 358899999999999999999998765432 2334444443 788889999999999999998887643 Q ss_pred CCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee---eEe-ccceEEEec--------- Q lcl|NC_019527. 185 ADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG---REM-HASRLLTII--------- 251 (516) Q Consensus 185 ~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g---~~i-H~SRli~~~--------- 251 (516) .. .+ .+.+++|.++.|..-+ .....+.++ -.+|.... ..+ .+.++.+|. T Consensus 146 d~---------------~~-~i~~~~p~~~~~i~d~-~~~~~~~~~-ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 207 (472) T protein:vir:93 146 EG---------------EF-KLFRVPAEQGIPIWTD-KEHEELEAF-IRMYKLENETKVEYWDKVTVNYYVYENGSLIPD 207 (472) T ss_pred CC---------------ce-EEEEEcccceEEEEcC-CCCCceEEE-EEEEEeecceeEEEEecCeEEEEEEecCeeeec Confidence 21 01 1334444444432100 000001110 00010000 000 001111110 Q ss_pred -------------C---CcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHH Q lcl|NC_019527. 252 -------------T---RPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGD 315 (516) Q Consensus 252 -------------~---~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~ 315 (516) . ..+|-. .-.++-+|.|.++.+.+.+.+++.+....+.-+..++..++...... ..+... T Consensus 208 ~~~~~~~~~~~~~~~~~~~vPvv-~~~nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~---~~~~~~ 283 (472) T protein:vir:93 208 YSNNLENSKTHFSTGSWGKIPFI-PFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPE 283 (472) T ss_pred ccccccccccccccCCCCCcceE-EecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCC---cccchh Confidence 0 001111 11234579999999999999999988888887777777666543211 111112 Q ss_pred HHHHHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHH Q lcl|NC_019527. 316 VFDRVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSF 393 (516) Q Consensus 316 l~~r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~y 393 (516) ..+.+. ..+++.++. +.+.+.+ ..+.+++...++.+.+.|+..+++|-.-+ + +- |-|.||+.=...| T Consensus 284 ~~~~~~-------~~~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~-~~-~~n~Sg~Al~~~~ 352 (472) T protein:vir:93 284 FKRLLR-------YYGAIKVSD-NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSS-D-KF-GSAPSGVALEFLY 352 (472) T ss_pred hHHHHh-------hccccccCC-CCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCc-c-cc-ccCchHHHHHHHH Confidence 222111 223333333 3445544 56677889999999999999999996432 2 11 2245665422223 Q ss_pred HHHHH--HHHHHHHHHHHHHHHHHHHHHhCCCcC--CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHH Q lcl|NC_019527. 394 YDDIS--SVQQSYYFSPLDTMLKVIQLSKWGEID--DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEAR 469 (516) Q Consensus 394 yd~I~--~~Qe~~l~p~l~~l~~~l~~s~~g~~~--~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r 469 (516) ...+. ..++..+...+++++++++. .+|... .++++.|++-...+.++.+++..+. +|++|.+.+. T Consensus 353 ~~l~~ka~~~~~~~~~~l~~~~~li~~-~~~~~~~~~~i~v~f~~~~p~~~~~~~~~~~k~---------~giis~et~l 422 (472) T protein:vir:93 353 TNLNLKADKLARKAKVAIQELLWFVFE-HFDIKGEHKDVDISFNYNKVANTELQVQTAQQS---------MGIVSHETVL 422 (472) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-HhCCCcccceeeEEeCCCCCCCHHHHHHHHHHH---------hccCchHHHH Confidence 22222 33456678899999988764 445433 4788999999999999887766553 3677776666 Q ss_pred HHHHhhhccCCCCCChhhhcccccc-------chhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 470 QQLSDDPDSGWDNIDGDLEIVQPEM-------FDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 470 ~~l~~~~~~~~~~~d~~~e~~~~e~-------~~~e~~~~~~~~~~~~~~~e~t 516 (516) +.+... +..+...+....|. .+..+..+...++++.++.+.+ T Consensus 423 ~~l~~~-----~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 471 (472) T protein:vir:93 423 ENHPFV-----EDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNNKES 471 (472) T ss_pred HhCCCC-----CCHHHHHHHHHHHHHHHHHhccCcCcccCCCCCCCCCCCcccC Confidence 554110 10011111111110 0000111111111222222222 No 126 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=99.39 E-value=2.3e-12 Score=84.38 Aligned_cols=387 Identities=9% Similarity=0.012 Sum_probs=194.9 Q ss_pred cccccc---------------chhhhcccccCCcccccccCcccH------HH--------HHHHH-hCchhhhhhhhhh Q lcl|NC_019527. 73 MDSLCG---------------PTYQFLNSAAGGLYAADIQPFPGY------QN--------LAALA-TRPEYRAFASTLS 122 (516) Q Consensus 73 ~ds~~~---------------~~~~~~~~~~~~~~~~~~~~f~gy------~l--------l~~y~-~~~i~r~iVd~~a 122 (516) ||--.- ........++.|.+.--...+.-+ .. ..-.+ .+.+++.||+..+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 432100 000111112212110000000000 00 00011 3578999999999 Q ss_pred HHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccc Q lcl|NC_019527. 123 TELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGS 202 (516) Q Consensus 123 ed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~ 202 (516) .-++.+.+++++.++.. .+.|+..++ =++.....++.+....||.|++++.++..+. . T Consensus 81 ~yl~G~p~~~~~~~~~~-------~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g--------------~ 138 (471) T protein:vir:10 81 AYALTYPPTFDVDDKKV-------NDMIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDASDN--------------S 138 (471) T ss_pred hhhcccCceeccCChHH-------HHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeCCCC--------------e Confidence 99999999998765422 123333333 3678889999999999999999987764321 1 Q ss_pred eeeEEeecceeeccccccccccccccccCcceeEEe----ee-----Ee-ccceEEEecC-------------------- Q lcl|NC_019527. 203 LTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL----GR-----EM-HASRLLTIIT-------------------- 252 (516) Q Consensus 203 l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~----g~-----~i-H~SRli~~~~-------------------- 252 (516) + .+.+++|.++.|..-...+ ..+.++ -.+|... +. .| -..++.+|.. T Consensus 139 ~-~~~~~~p~~~~~i~d~~~~-~~~~~~-ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~ 215 (471) T protein:vir:10 139 F-RYACVDSKEVIPIYSKSLD-KKSIGV-LRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDT 215 (471) T ss_pred e-EEEEEcccceEEEEcCCCC-CceEEE-EEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCccccccccccccccccc Confidence 1 1334444444432211000 001100 0111110 00 00 0011111110 Q ss_pred ---------------CcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHH Q lcl|NC_019527. 253 ---------------RPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVF 317 (516) Q Consensus 253 ---------------~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~ 317 (516) ..+|-. .-.++-.|.|.++.+.+.+.+++.+....+..+..++..++....... ....... T Consensus 216 ~~~~~~~~~~~~~~~g~iPvv-~~~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~---~~~~~~~ 291 (471) T protein:vir:10 216 MNGDRSSDNSFKHDFGLVPFI-PFKNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGG---QDKQEFL 291 (471) T ss_pred ccccccccccccCCCCceeEE-EeccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCc---cccchhH Confidence 001111 012344689999999999999999888888888777776665432211 1111111 Q ss_pred HHHHHHHHhcCCcceEEEecC----Cccee--EEecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHH Q lcl|NC_019527. 318 DRVEMYVNMQSNLGLAVMDFD----SEDIV--QVNTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIR 391 (516) Q Consensus 318 ~r~~~~~~~~sn~g~~~id~~----~e~~e--~~~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~ 391 (516) .. + + ..++..+... +.+++ +...+.+++...++.+.+.|...+++|-.-..+ .| |+||.. ++ T Consensus 292 ~~---~---~-~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~---~g-n~Sg~A-lk 359 (471) T protein:vir:10 292 ED---L---K-RYKMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDK---LG-NSSGVA-LK 359 (471) T ss_pred HH---h---h-cCCeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCccc---cc-CccHHH-HH Confidence 11 1 1 1222222211 12344 445667889999999999999999999643322 13 566754 43 Q ss_pred HHHHHH---HHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHH Q lcl|NC_019527. 392 SFYDDI---SSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEA 468 (516) Q Consensus 392 ~yyd~I---~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~ 468 (516) .-+..+ ....+..++..++++++++..-....-..++++.|++....+++|.+++..+ + +|+||.+.+ T Consensus 360 ~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~k-------l--~g~iS~et~ 430 (471) T protein:vir:10 360 FLYSLLELKAGNMETQFRSGYATLVKMILKHLGLSDKLKIKQTWTRNSINNDTEMAQVVST-------L--ATITSRENV 430 (471) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEEeCCCCCCCHHHHHHHHHH-------H--hccCchHHH Confidence 333222 2344567888899998887653322224579999999999999998886544 2 578998888 Q ss_pred HHHHHhhhccCCCCCChhhhccccccchhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 469 RQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 469 r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~~~~~~~~~~~~~~e 514 (516) .+.+.. ...-+.+.+....|.....+..+...+-++.+..| T Consensus 431 ~~~~p~-----v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~e~~ 471 (471) T protein:vir:10 431 AKSNPI-----VEDWQDELRLQKAEQEGRSEKLYDMEEVEHESEVE 471 (471) T ss_pred HHhCCC-----CCCHHHHHHHHHHHHHHHHhcccccCCCCCccccC Confidence 766511 11111112222222122222223333333333333 No 127 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=99.39 E-value=8.1e-12 Score=81.40 Aligned_cols=409 Identities=9% Similarity=-0.019 Sum_probs=196.4 Q ss_pred ccccccCCCCCCC--ccC--CCc---cc----hh-cccccccchhhhcccccCCcccccccC-c--c---cHHHHHHHH- Q lcl|NC_019527. 49 AATKWAPPQLMPG--VVP--AGT---TP----AV-AMDSLCGPTYQFLNSAAGGLYAADIQP-F--P---GYQNLAALA- 109 (516) Q Consensus 49 ~~~~~~~~~~~~g--v~~--~~~---~~----~~-a~ds~~~~~~~~~~~~~~~~~~~~~~~-f--~---gy~ll~~y~- 109 (516) -..-|+|-..... ++- ... .. .+ ...............++.|-+.--... . . .....+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki 80 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRM 80 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccccccccccccccccc Confidence 1111222110000 000 000 00 00 000000011111112222221000000 0 0 000111122 Q ss_pred hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCccc Q lcl|NC_019527. 110 TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSV 189 (516) Q Consensus 110 ~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~ 189 (516) .+.+++.||+..+.-++.+++.+++.++.. .+.|...++. ++...+.++.+....||.+++++.++... T Consensus 81 ~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~-------~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~~~v~~d~~~--- 149 (468) T protein:vir:96 81 YTNYHQNLVDQKVAYAVANPVTYGTEDEKS-------LKTIQEVLNH-KWDDKLVDILTAASNKGVEWIQPYVDEQG--- 149 (468) T ss_pred ccchHHHHHHHHHhhhccCCceeccCChHH-------HHHHHHHHhc-CHHHHHHHHHHHHhhcCeEEEEEEEcCCC--- Confidence 368999999999999999999998765432 2334444432 67888899999999999999887764321 Q ss_pred CcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee---eEe-ccceEEEecC------------- Q lcl|NC_019527. 190 PLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG---REM-HASRLLTIIT------------- 252 (516) Q Consensus 190 Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g---~~i-H~SRli~~~~------------- 252 (516) .+ .+.+++|..+.|.. ...+...+.++ -.+|.+.+ ..+ .+.++.++.. T Consensus 150 ------------~~-~i~~~~p~~~~~v~-~~~~~~~~~~~-ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (468) T protein:vir:96 150 ------------EF-KTFRVPAEQAIPIW-TNKERDELKAF-IRLYELDGGERVEYWTANDVTFYELKDGQLIPDYYQGE 214 (468) T ss_pred ------------ce-EEEEEcccceEEEE-cCCCCCceEEE-EEEEEecCceEEEEEeCCeEEEEEEcCCceeecccccc Confidence 11 13444554444321 00000011111 01111111 011 1112221110 Q ss_pred ----------------CcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHH Q lcl|NC_019527. 253 ----------------RPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDV 316 (516) Q Consensus 253 ----------------~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l 316 (516) ..+|-.. -.++-.|.|.++.+.+.+.+++.+....+..+..++..++...... ..+.... T Consensus 215 ~~~~~~~~~~~~~~~~~~iPvv~-~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~---~~~~~~~ 290 (468) T protein:vir:96 215 EHVQAHYYVGNKSMSWNRVPFIP-FKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYE---GEDLEEF 290 (468) T ss_pred cccccceeeccccccCCcccEEE-ecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC---ccccchh Confidence 0011110 1124569999999999999999998888887777776665543211 1111122 Q ss_pred HHHHHHHHHhcCCcceEEEecC-CcceeEEe--cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHH Q lcl|NC_019527. 317 FDRVEMYVNMQSNLGLAVMDFD-SEDIVQVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSF 393 (516) Q Consensus 317 ~~r~~~~~~~~sn~g~~~id~~-~e~~e~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~y 393 (516) ...+ ...+++.++++ +.+++.++ .+.++....++.+.++|...+++|-.- ++ +. |-|.||+.=...| T Consensus 291 ~~~~-------~~~~~i~~~~d~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~-~~-~~-~~n~Sg~Alk~~~ 360 (468) T protein:vir:96 291 MYNL-------KYYKAINVDGDGSGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQ-QD-KF-GNSPSGIALKFMY 360 (468) T ss_pred hhhh-------hcCceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccc-cc-cc-ccchHHHHHHHHH Confidence 1111 12344445433 23455554 455688889999999999999999632 22 22 3356776433333 Q ss_pred HHHHH--HHHHHHHHHHHHHHHHHHHHHhCCCc-CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHH Q lcl|NC_019527. 394 YDDIS--SVQQSYYFSPLDTMLKVIQLSKWGEI-DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQ 470 (516) Q Consensus 394 yd~I~--~~Qe~~l~p~l~~l~~~l~~s~~g~~-~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~ 470 (516) ..... ...+..++..++++++++..-..... +.++.+.|++-...+++|.|++. .++|++|.+.+.+ T Consensus 361 ~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~d~~~i~i~f~~~~p~d~~e~a~~~----------~~~g~iS~et~i~ 430 (468) T protein:vir:96 361 SNLDLKANKLKNKTLTALQELLQYIIDFYKLSIKVQDVEITFNFNVMVNELEQSQIG----------VNSQYLSKETVVT 430 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCCcCHHHHHHHH----------HhcCCCchHHHHH Confidence 32221 33456688999999988765433333 24789999998888888876642 3469999888876 Q ss_pred HHHhhhccCCCCCChhhhccccccchhcCC-CCCCCCCCCCCC Q lcl|NC_019527. 471 QLSDDPDSGWDNIDGDLEIVQPEMFDDDGA-DPYMPDPDVLPG 512 (516) Q Consensus 471 ~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~-~~~~~~~~~~~~ 512 (516) .+.. ....+.+.+....|..+.... +.-.+..+++|. T Consensus 431 ~l~~-----v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 468 (468) T protein:vir:96 431 NHPW-----VDDPVAEMERIDQEELALPSIEEGLNGKENNEPT 468 (468) T ss_pred hCCC-----CCCHHHHHHHHHHHHHHHHHHhhccCCCCCCCCC Confidence 6511 111111222222221111111 112222223332 No 128 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=99.39 E-value=6.5e-13 Score=87.39 Aligned_cols=267 Identities=10% Similarity=0.003 Sum_probs=151.8 Q ss_pred HhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccccce Q lcl|NC_019527. 125 LTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKKGSL 203 (516) Q Consensus 125 ~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l 203 (516) +-+..+.+...++... ......|...-.....+..|.+.+.+. .++|-|++++..+. .|.+ T Consensus 1 ia~l~~~~~~~~~~~~---~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~---------------~G~~ 62 (278) T protein:vir:78 1 MASLPLKMYEDYKVVN---TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI---------------YHQP 62 (278) T ss_pred CccceeEEEecCcccc---cHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECC---------------CCcE Confidence 2233333322222111 122233333333444455566666655 55688877765421 2446 Q ss_pred eeEEeecceeeccccccccccccccccCcceeEEe---e--eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHH Q lcl|NC_019527. 204 TGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL---G--REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYV 278 (516) Q Consensus 204 ~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~---g--~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l 278 (516) .+|.+++|.+|+..... .+.+.+|++. | ..+.++.|+||.... +...+.|.|.+..+...+ T Consensus 63 ~~l~~l~~~~v~v~~~~--------~~~~~~y~~~~~~g~~~~~~~~evih~~~~~------~~~~~~G~s~~~~~~~~i 128 (278) T protein:vir:78 63 SKLFLLNPDVVEMLIEN--------QSRELYYSIHAATGNKLIVHNMDMLHFKHIV------ASNMVQGISPIDVLKNTT 128 (278) T ss_pred EEEEEECCceeEEEEcC--------CCceEEEEEEcCCceEEEEccccEEEECCCC------CCCCeeeccHHHHHHHHH Confidence 67999999988753211 2234455553 2 468899999996532 234567999999999988 Q ss_pred HHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEecccCCH--HHH Q lcl|NC_019527. 279 ENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVNTPLSGL--ADL 355 (516) Q Consensus 279 ~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~~~lsgl--~d~ 355 (516) .....+.........+....+++.+ ..++.+..+++.++++ ...+|.| +++++ ++.++++++.+..+. .+. T Consensus 129 ~~~~~~~~~~~~~~~~~~~~i~~~~--~~l~~e~~~~~~~~~~---~~~~~~g~~~vl~-~g~~~~~l~~~~~d~~~~e~ 202 (278) T protein:vir:78 129 DFDNAVRTFNLTEMQKPDSFMLKYG--SNVGKEKRQQVLEDFK---QYYEENGGILFQE-PGVEIEPLPKKYVSEDIVAS 202 (278) T ss_pred HHHHHHHHHHHHHhcCCCcEEEEeC--CCCCHHHHHHHHHHHH---HHhccCCCceecC-CCceEEEccCChhHHHHHHH Confidence 8777666553221111111222322 2233333334444443 4444555 55554 457899888766544 356 Q ss_pred HHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc--CCcceEEeC Q lcl|NC_019527. 356 QSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI--DDAITFKFK 433 (516) Q Consensus 356 ~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~--~~d~~~~f~ 433 (516) .....+.||.+.|||. .++|...++-.++.+.....||.. .|.|.++.+...|-+..+.+. ..++.|+|+ T Consensus 203 ~~~~~~~Ia~~fgVpp-~~lg~~~~~~~sn~~~~~~~~~~~-------~l~P~~~~i~~~ln~~L~~~~e~~~g~~~~f~ 274 (278) T protein:vir:78 203 ENLTRERVANVFQLPS-VFLNARSNTNFAKNEELNRFYLQH-------TLLPIVKQYEEEFNRKLLTKTDREKIGILNLT 274 (278) T ss_pred HHHHHHHHHHHhCCCH-HHhCCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCChhHhcCCceEEEe Confidence 6778899999999996 455766555555555555555544 378999888888877665432 234566665 Q ss_pred CCCCC Q lcl|NC_019527. 434 SLWQT 438 (516) Q Consensus 434 pL~~~ 438 (516) +..+ T Consensus 275 -~~~l 278 (278) T protein:vir:78 275 -LNLI 278 (278) T ss_pred -cccC Confidence 2222 No 129 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.39 E-value=4.2e-12 Score=82.95 Aligned_cols=419 Identities=11% Similarity=0.052 Sum_probs=198.8 Q ss_pred HhHHhhcCC--CccccccCCCCCCCccCCCccchh-cccccccchhhhcccccCCcccccccCcccHHHHHHHH-hCchh Q lcl|NC_019527. 39 MKSMERRAS--DAATKWAPPQLMPGVVPAGTTPAV-AMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALA-TRPEY 114 (516) Q Consensus 39 ~~~~~~~~~--~~~~~~~~~~~~~gv~~~~~~~~~-a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~-~~~i~ 114 (516) .+.+...+. .+...|.-|.- ....++.....+ .+-.-..+.......++-|-+.--.... ....+-++ .+.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~~--~~~~~~~ki~~n~~ 77 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFPKG-EKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKILTAPE--KETGADNRIVVNSA 77 (470) T ss_pred CccccCCcccccCCceEEeCCC-CCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccccCcc--cccCCcceeecchH Confidence 222222221 11112222210 000000000000 0000000111111112222110000000 00001111 24689 Q ss_pred hhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccc Q lcl|NC_019527. 115 RAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILD 194 (516) Q Consensus 115 r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld 194 (516) ++||+..+.-.+.+++++++.++... .+.|...+++-++...+.++.+...+||.|++++.++... T Consensus 78 ~~Ivd~~~~~l~g~p~~~~~~~d~~~------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg-------- 143 (470) T protein:vir:99 78 KYVVDVYNGYFCGIEPKLALLNDSSK------IDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDA-------- 143 (470) T ss_pred HHHHHHHhhhhccCCeeEeeCCchhH------HHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCC-------- Confidence 99999999999999999988654332 2456777888889999999999999999999888764321 Q ss_pred cccccccceeeEEeecceeeccccccccccccccccCcceeEEee-------e-EeccceEEEecCC------------- Q lcl|NC_019527. 195 PRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG-------R-EMHASRLLTIITR------------- 253 (516) Q Consensus 195 ~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g-------~-~iH~SRli~~~~~------------- 253 (516) .+ .+.+++|..+.|..-+..+. .+.++ -.+|...+ . -+.+.++.++... T Consensus 144 -------~~-~i~~~~p~~~~~i~d~~~~~-~~~~~-vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (470) T protein:vir:99 144 -------RP-HLMYSSPNHAFIIYDDTVQR-QPLAF-VHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAI 213 (470) T ss_pred -------eE-EEEEEccceeEEEEcCCCCc-ceEEE-EEEEEEecCCeeEEEEEEEecCeEEEEEecccccccccccccc Confidence 11 14445555554431110000 01111 00111100 0 0111222222110 Q ss_pred ----cchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCC Q lcl|NC_019527. 254 ----PLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSN 329 (516) Q Consensus 254 ----~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn 329 (516) .+|-.. -.++-+|.|.++.+.+.+.+++.+....+..+..++...+.......-....++.+ . .... T Consensus 214 ~~~g~vPvv~-~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~----~---~~~~- 284 (470) T protein:vir:99 214 NPYGLVPAVE-FFENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGNPK----F---DFKN- 284 (470) T ss_pred cCCCccceEe-ecCCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccchh----h---hhhh- Confidence 011110 12345799999999999999999888888777777766665432221111111111 1 1111 Q ss_pred cceEEEe----cCCcceeEEe--cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHH---HHHHHHH Q lcl|NC_019527. 330 LGLAVMD----FDSEDIVQVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSF---YDDISSV 400 (516) Q Consensus 330 ~g~~~id----~~~e~~e~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~y---yd~I~~~ 400 (516) .++..+. .++.+++.++ .+.+++...++.+.+.|+..+++|-. .++.. +| |.||..=...| ...++ . T Consensus 285 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~-~~~~~-~~-n~Sg~Ai~~~~~~l~~k~~-~ 360 (470) T protein:vir:99 285 NRVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNI-QDKNF-AG-NSSGVALQYKLFAMKNKAD-S 360 (470) T ss_pred cceeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccc-ccccc-cc-CchHHHHHHHHHHHHHHHH-H Confidence 2222222 1223455554 45567888999999999999999953 33321 22 45665422222 22333 3 Q ss_pred HHHHHHHHHHHHHHHHHHHh--CCCcC---CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhh Q lcl|NC_019527. 401 QQSYYFSPLDTMLKVIQLSK--WGEID---DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDD 475 (516) Q Consensus 401 Qe~~l~p~l~~l~~~l~~s~--~g~~~---~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~ 475 (516) ++..++..|++++++++... .+..+ .++++.|++-...++.|.|++..+. .|+|+.+.+.+.+ T Consensus 361 ~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl---------~giis~et~l~~l--- 428 (470) T protein:vir:99 361 KERKFDKSLMQLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNAKNA---------EGIVSKKTQLGMI--- 428 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHHHHH---------hccCCHHHHHHhC--- Confidence 34667888999888775432 22222 3789999999999999988866543 2677776666554 Q ss_pred hccCCCCCChh--hhccccccchh------cCCCCCCCCCCCCCCCC Q lcl|NC_019527. 476 PDSGWDNIDGD--LEIVQPEMFDD------DGADPYMPDPDVLPGEE 514 (516) Q Consensus 476 ~~~~~~~~d~~--~e~~~~e~~~~------e~~~~~~~~~~~~~~~e 514 (516) +.++.+ .+....|..+. ...+.+..+.++..+++ T Consensus 429 -----~~vd~~~E~eri~~E~~~~~~~~~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 429 -----PDIEPDAEMKQIAKEKADAIKQTQQLSMPIDILKRDNNAEEE 470 (470) T ss_pred -----CCCCHHHHHHHHHHHHHHHHHHHHhhcCCCCcCCCCCCccCC Confidence 122211 11111111000 00001111111111111 No 130 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=99.37 E-value=4.9e-11 Score=77.13 Aligned_cols=405 Identities=11% Similarity=0.079 Sum_probs=205.2 Q ss_pred cccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcc Q lcl|NC_019527. 13 VADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLY 92 (516) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~ 92 (516) +...+--+-.-+++......+.... +... .+....+ +. .|+.|+. .+-+ +.. T Consensus 1 ~~~~i~~~~g~~~~~~~~~~~~~~~-ia~~----~~~~~~~-~~---~~~~p~~-~~il----------~~~-------- 52 (491) T protein:vir:79 1 MSKGLWVSPTEFVKFGEPDKSLSSQ-IATR----ARSIDFF-AL---GMYLPNP-DPVL----------KAL-------- 52 (491) T ss_pred CCCeeeCCCCCcccccccchhHHHH-Hhhh----ccccccc-cc---cccCcch-hHHH----------hhc-------- Confidence 2222211111111111111111111 0000 0000000 00 1111111 0000 000 Q ss_pred cccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhccc Q lcl|NC_019527. 93 AADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCF 172 (516) Q Consensus 93 ~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rl 172 (516) .-.++++..+....-+..++++...-.+..-|+|...++++. ..+.+++.++++.+.+.+.+.+ .+.+ T Consensus 53 ------~~~~~~y~~m~~D~~i~s~l~~Rk~av~~~~w~i~~~~~~~~-----~a~~i~e~l~~~~~~~~i~~~l-da~~ 120 (491) T protein:vir:79 53 ------GKDIRVYRELRADAHVGGCVRRRKAAVKALEWGLDRGKAKSR-----VAKSIADVFADLDLSRIATEML-DAVL 120 (491) T ss_pred ------cCCHHHHHHHhhChHHHHHHHHHHHHHhCCCcEEecCCCCHH-----HHHHHHHHHhcCCHHHHHHHHH-Hhhh Confidence 012455555567888999999999888888888876554321 2356778888888777666665 6899 Q ss_pred ceeeEEEEE--ecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEeeeEeccceEEEe Q lcl|NC_019527. 173 FGRGQISIN--IKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTI 250 (516) Q Consensus 173 yG~a~i~i~--i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~ 250 (516) ||.++.=+. .+++. ..++.|...+|.|+... +..............|..+.+-+.+++ T Consensus 121 ~G~s~~Ei~w~~~~g~--------------~~~~~l~~r~~~~f~~d------~~~~l~l~~~~~~~~g~~lp~~k~i~~ 180 (491) T protein:vir:79 121 YGYQPMEITWGKVGNY--------------IVPIDVVGKPADWFVYD------PENQLRFRSKEHWVQGEELPARKFLVP 180 (491) T ss_pred hcceeEEEEEeecCCe--------------eeEEeeeeecccceeec------cCCceEEeecCCCCCceeecCCCeEEE Confidence 999986443 32221 12334555666655421 111001111111234566777777877 Q ss_pred cCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce--eeecchhhhcCccHHHHHHHHHHHHHhcC Q lcl|NC_019527. 251 ITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF--LKTNMAQVLNGGEGGDVFDRVEMYVNMQS 328 (516) Q Consensus 251 ~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v--~k~~~~~~l~~~~~~~l~~r~~~~~~~~s 328 (516) .... ...+.+|.|++..|+....--..+...-+.++.++++++ .|++-. ..+++..+-++.+..+.+ T Consensus 181 ~~~~------~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~-----a~~~ek~~l~~al~~~~~ 249 (491) T protein:vir:79 181 RQEA------TYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRS-----ASDAETNLLLDRLEDMVQ 249 (491) T ss_pred EecC------CCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCC-----CCHHHHHHHHHHHHHHhc Confidence 6542 334578999999999888777777888889999999765 444311 112233333445555544 Q ss_pred CcceEEEecCCcceeEEecc-cCC----HHHHHHHHHHHHHhhhcCCceeeeccc----cccccccchHHHHHHHHHHHH Q lcl|NC_019527. 329 NLGLAVMDFDSEDIVQVNTP-LSG----LADLQSQSQEHMCSVSKIPAIKLTGIS----PSGLNASSEGEIRSFYDDISS 399 (516) Q Consensus 329 n~g~~~id~~~e~~e~~~~~-lsg----l~d~~~~~~~~iaaas~IP~t~L~G~s----p~Glnatge~D~~~yyd~I~~ 399 (516) + +..++. ++.+++.+... -+| ...+++..-++|+-+. +|++ -+|-+|.|+--.....+.+++ T Consensus 250 ~-a~~viP-~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i-------LGqtlTt~~~gs~a~~~vh~~v~~~i~~~ 320 (491) T protein:vir:79 250 D-AVAVIP-DDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIAL-------LGQNQTTEATSTRASAQAGLEVTDDIRDG 320 (491) T ss_pred C-eEEEec-CCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHH-------hhhhhccCcccchhhHHHHHHHHHHHHHH Confidence 4 344444 35788888765 333 3456776667776432 3333 234445555556666666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCC-CCHHHHHHHHHhhhcc Q lcl|NC_019527. 400 VQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSV-IDPSEARQQLSDDPDS 478 (516) Q Consensus 400 ~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gv-i~~~e~r~~l~~~~~~ 478 (516) -... +...+++++.-++.-.|+.. +.+.|.|... | ++.+..|++++.+++.|+ ++.+.+++.+ T Consensus 321 D~~~-i~~tln~li~~l~~~N~~~~-~~p~f~~~e~------e--e~~~~~a~~~~~L~~~G~~i~~~~~~e~~------ 384 (491) T protein:vir:79 321 DKAI-VVEAMNMLIRWICDLNFDGA-ARPVFDMWEQ------E--QVDEIQAGRDEKLTRAGARFTPAYFKRAY------ 384 (491) T ss_pred HHHH-HHHHHHHHHHHHHHhcCCCC-CcceEeecCc------C--chhHHHHHHHHHHHhCCCccCHHHHHHHh------ Confidence 6544 44556677777766566533 2455655432 1 334567888899999997 7989998887 Q ss_pred CCCCCChhhhccccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 479 GWDNIDGDLEIVQPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 479 ~~~~~d~~~e~~~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) |++.-..+.+.. + .......+..+..........+ T Consensus 385 Gip~~~~~e~~~-~--~~~~~~~~~~~~~~~~~~~~~~ 419 (491) T protein:vir:79 385 NLQDGDLDERPL-P--VSAVDAVGAASFAEFEAPDQDA 419 (491) T ss_pred CCCCCCCCcccc-C--cCcccccccccccccCCCCCcc Confidence 333211111100 0 0000000111111111111111 No 131 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.37 E-value=1.1e-11 Score=80.76 Aligned_cols=428 Identities=10% Similarity=0.028 Sum_probs=186.0 Q ss_pred hhcCCCccccccCCCCCCCc-cCCCcc-c----hhcccc--c---ccchhhhcccccCCcccccccC--c-ccHHHHHHH Q lcl|NC_019527. 43 ERRASDAATKWAPPQLMPGV-VPAGTT-P----AVAMDS--L---CGPTYQFLNSAAGGLYAADIQP--F-PGYQNLAAL 108 (516) Q Consensus 43 ~~~~~~~~~~~~~~~~~~gv-~~~~~~-~----~~a~ds--~---~~~~~~~~~~~~~~~~~~~~~~--f-~gy~ll~~y 108 (516) .-.+-+.+. =+|+ .++ +|+..- . .+..+= . ..........++.|-+.....+ . .-|...+.- T Consensus 1 ~~~~~~~~~-~~~~---~~~~~p~~~~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~ 76 (501) T protein:vir:25 1 MTVPVDVIA-DAPA---ADVEFPEDSMSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKL 76 (501) T ss_pred Ccccchhhh-ccCc---ccccCCcccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhh Confidence 111111111 1111 233 343320 0 001000 0 0001111112222211100000 0 112222222 Q ss_pred HhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcc Q lcl|NC_019527. 109 ATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVS 188 (516) Q Consensus 109 ~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~ 188 (516) ..+.+.+.||+..++-+.-+||.+...+ +. +.+...+++=++.....++.+...+||.|++++..+... T Consensus 77 ~v~n~~~~ivd~~a~~l~~~gf~~~d~~--~~-------~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~-- 145 (501) T protein:vir:25 77 SVKNVLSLVRDSFAQNLSVVGYRNALAK--EN-------DPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEG-- 145 (501) T ss_pred hhcChHHHHHHHHHhhhcccceecCCcc--ch-------HHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCC-- Confidence 3457899999999998877887654222 11 235556676678888889999999999999877543211 Q ss_pred cCcccccccccccceeeEEeecceeecccccc-ccc--cc-------------c---ccccCcce-eEEe--ee------ Q lcl|NC_019527. 189 VPLILDPRTIKKGSLTGFSNIEPMWTSPSAYN-ALD--PT-------------A---PDFYKPST-WWVL--GR------ 240 (516) Q Consensus 189 ~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~-~~d--p~-------------s---~~yg~P~~-y~v~--g~------ 240 (516) | .+++++|.++.+.... ..+ |. . -.++.|.. |.+. +. T Consensus 146 -~--------------~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~ 210 (501) T protein:vir:25 146 -P--------------VFRTRSPRQILAVYADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAG 210 (501) T ss_pred -C--------------eEEEeccccEEEEEecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceeeeecc Confidence 1 1334444444322100 000 00 0 00111111 1100 00 Q ss_pred ----------------Eeccc-eEEEecCCcch-hhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeee Q lcl|NC_019527. 241 ----------------EMHAS-RLLTIITRPLP-DMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKT 302 (516) Q Consensus 241 ----------------~iH~S-Rli~~~~~~~p-~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~ 302 (516) ..|+. .-..|..-|+. ...++..+.+|.|.++.+++.+.+++++.........-++.+...+ T Consensus 211 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i 290 (501) T protein:vir:25 211 GGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVI 290 (501) T ss_pred ccccccccccccccccccccccccCCccceeeEeccCccccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHH Confidence 00000 00000000110 1112223567999999999888888888777555444444432211 Q ss_pred cchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEe-cccCCHHHHHHHHHHHHHhhhcCCceeeecccccc Q lcl|NC_019527. 303 NMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVN-TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSG 381 (516) Q Consensus 303 ~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~-~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~G 381 (516) .+.+.+... .+... ...++++.+++-++-+++ .++.+..+.++...++||+.+++|...|.|.+ T Consensus 291 ------~G~~~~~~~----~~~~~--~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~--- 355 (501) T protein:vir:25 291 ------SGWTGSKAE----VLKAS--ALRVWTFEDPEVKAQAFPPASVEPYNLILEEMLQHVAMVAQISPAQVTGKM--- 355 (501) T ss_pred ------hCCCCCccc----hhhhc--ccceeccCCCCceEEEecccChHHHHHHHHHHHHHHHhhcCCChhhhcccc--- Confidence 111111111 11111 112334433333444443 34566777888999999999999987776542 Q ss_pred ccccchHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcC---CcceEEeCCCCCCCHHHHHHHHHHHHHH-H Q lcl|NC_019527. 382 LNASSEGEIR---SFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEID---DAITFKFKSLWQTSAKEESEIRFNKAQE-A 454 (516) Q Consensus 382 lnatge~D~~---~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~---~d~~~~f~pL~~~sekEkAei~~~~a~a-~ 454 (516) -|.||+.=.. ..-..++.+| ..+...+++++++++.-..+..+ .++++.|.+....|.++.|+...|.+++ + T Consensus 356 ~N~Sg~Al~~~~~~l~~ka~~k~-~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gi 434 (501) T protein:vir:25 356 INVSAEALAAAEANQQRKLAAKR-ESFGESWEQLLRLAAEMDDDPDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGI 434 (501) T ss_pred CChHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCC Confidence 2456664332 3334444444 45788899999887766654432 3688999999999999988777654433 1 Q ss_pred --HHHH-HcCCCCHHHHHHHHHhhhccCCCCCChhhhccc--cccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 455 --QIYI-TNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQ--PEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 455 --~~~~-~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~--~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) ..++ ..--+++.++.+...........++-....... +..+...++.+...+++...+..|+ T Consensus 435 s~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 435 PIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGGVNGNGGA 501 (501) T ss_pred CHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCCCccccccccCCCCCCC Confidence 1111 112234444322221110000000000000000 0000000000000011111112222 No 132 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=99.36 E-value=1.2e-11 Score=80.45 Aligned_cols=406 Identities=8% Similarity=0.002 Sum_probs=190.8 Q ss_pred CccccccCCCCCCCccCCCc--cchhcccccc---------------cchhhhcccccCCcccccccCcc------cHHH Q lcl|NC_019527. 48 DAATKWAPPQLMPGVVPAGT--TPAVAMDSLC---------------GPTYQFLNSAAGGLYAADIQPFP------GYQN 104 (516) Q Consensus 48 ~~~~~~~~~~~~~gv~~~~~--~~~~a~ds~~---------------~~~~~~~~~~~~~~~~~~~~~f~------gy~l 104 (516) =+...| |.. +|-.. -..+.-.+.. -........++.|.+.--...+. .+.. T Consensus 1 ~~~~~~--~~~----~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~ 74 (478) T protein:vir:10 1 MISINW--PWD----KPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDET 74 (478) T ss_pred Cccccc--cCC----chhhhHHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhcccccccc Confidence 000100 100 00000 0000000000 00000011111111100000000 0000 Q ss_pred HHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEec Q lcl|NC_019527. 105 LAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIK 183 (516) Q Consensus 105 l~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~ 183 (516) .+-.+ .+.+++.||+..+.-++.+++++++.++.. .+.|...++ =++...+.++.+....||.+++++.++ T Consensus 75 ~~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~~~~~-------~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d 146 (478) T protein:vir:10 75 KPDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKA-------LKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYVD 146 (478) T ss_pred cccceeccchHHHHHHHHhhhhcccCceeecCChHH-------HHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEec Confidence 01111 267899999999999999999998765432 233444443 267888899999999999999888765 Q ss_pred CCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee---eEec-cceEEEecCC------ Q lcl|NC_019527. 184 GADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG---REMH-ASRLLTIITR------ 253 (516) Q Consensus 184 ~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g---~~iH-~SRli~~~~~------ 253 (516) ... .+ .+.+++|..+.|..- ..+...+.++ -.+|.+.+ ..+| +.++.++... T Consensus 147 ~~~---------------~~-~~~~~~p~~~~~v~d-~~~~~~~~~~-ir~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~ 208 (478) T protein:vir:10 147 EEG---------------EF-KTFRVPAEQAVPIWT-NKERDELQAF-IRVYELDGAERVEYWTKDDVTFYELKEGQLIP 208 (478) T ss_pred CCC---------------ce-EEEEEcccceEEEEc-CCCCCceEEE-EEEEeeeCceEEEEEeCCcEEEEEecCCeeec Confidence 321 11 144455555444311 0000111111 01122211 1111 2232222110 Q ss_pred -----------------------cchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcC Q lcl|NC_019527. 254 -----------------------PLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNG 310 (516) Q Consensus 254 -----------------------~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~ 310 (516) .+|-+ .-.++-.|.|.++.+.+.+.+++.+....+.-+..++...+....... T Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~g~vPvv-~~~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~--- 284 (478) T protein:vir:10 209 DFYRSEDHIQPHYYQGNKLMSWGRVPFI-PFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEG--- 284 (478) T ss_pred cccccccccccceecccccccCCcceEE-EeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCc--- Confidence 01111 112345699999999999999999988888877777766655432211 Q ss_pred ccHHHHHHHHHHHHHhcCCcceEEEec-CCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccch Q lcl|NC_019527. 311 GEGGDVFDRVEMYVNMQSNLGLAVMDF-DSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSE 387 (516) Q Consensus 311 ~~~~~l~~r~~~~~~~~sn~g~~~id~-~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge 387 (516) .+.......+. ..+++.+++ ++.+++.+ +.+.+++...++.+.+.|...+++|-.-. +. - |-|.||. T Consensus 285 ~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~-~-~~n~Sg~ 354 (478) T protein:vir:10 285 EDMKDFMHNLK-------YYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQ-DK-F-GNSPSGI 354 (478) T ss_pred ccccchhhhhh-------hCceeEecCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCc-cc-c-ccchHHH Confidence 11111111111 123333432 23344444 45667888999999999999999996422 21 1 2255665 Q ss_pred HHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhCCCcC-CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCC Q lcl|NC_019527. 388 GEIRSFYDDIS--SVQQSYYFSPLDTMLKVIQLSKWGEID-DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVID 464 (516) Q Consensus 388 ~D~~~yyd~I~--~~Qe~~l~p~l~~l~~~l~~s~~g~~~-~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~ 464 (516) .=...|..... ...+..++..+++++++++.-..+..+ .++++.|++-...+++|.|++..+ + +|++| T Consensus 355 Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~d~~~i~i~f~~~~p~~~~e~~~~~~~-------~--~g~iS 425 (478) T protein:vir:10 355 ALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDVRVQDIEITFNFNVMVNELENSQIAMN-------S--TGLLS 425 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEeCCCCCCCHHHHHHHHHH-------H--hCCCC Confidence 32222322222 333566788999999888654433333 379999999998899998776543 2 57777 Q ss_pred HHHHHHHHHhhhccCCCCCChhhhccccccchh-------cC--CCCCCCCCCCCCCCC Q lcl|NC_019527. 465 PSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDD-------DG--ADPYMPDPDVLPGEE 514 (516) Q Consensus 465 ~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~-------e~--~~~~~~~~~~~~~~e 514 (516) .+.+.+.+.. ....+.+.+....|..+. .+ .++....+++ .++| T Consensus 426 ~et~i~~~~~-----v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~d-~~~e 478 (478) T protein:vir:10 426 KETILGNHSW-----VQDPVAEMERIEQENIELNQQLPDIEEGLNDEQQRQSED-NQSE 478 (478) T ss_pred hHHHHHhCCC-----CCCHHHHHHHHHHHHHHHHHhccccCCCCcccccccCcC-CCCC Confidence 7666654411 000011111111110000 00 0000001111 1111 No 133 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=99.34 E-value=3.2e-11 Score=78.13 Aligned_cols=410 Identities=9% Similarity=-0.024 Sum_probs=195.2 Q ss_pred CccccccCCCCCCCccCCCccchhcccc-------------c--ccchhhhcccccCCcccccccC------cccHHHHH Q lcl|NC_019527. 48 DAATKWAPPQLMPGVVPAGTTPAVAMDS-------------L--CGPTYQFLNSAAGGLYAADIQP------FPGYQNLA 106 (516) Q Consensus 48 ~~~~~~~~~~~~~gv~~~~~~~~~a~ds-------------~--~~~~~~~~~~~~~~~~~~~~~~------f~gy~ll~ 106 (516) -+...| |=-.+....+ -..+--.. . .-........++.|-+.-.... .......+ T Consensus 1 ~~~~~~-~~~~~~~~~~---~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~ 76 (474) T protein:vir:96 1 MIVIFW-PNEKPYHERV---VEQIKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKP 76 (474) T ss_pred Ceeecc-CCCchhhhhH---HHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhccccccccccc Confidence 111111 1100000000 00000000 0 0000000001111110000000 00000001 Q ss_pred HHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCC Q lcl|NC_019527. 107 ALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGA 185 (516) Q Consensus 107 ~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~ 185 (516) -.+ .+++++.||+..+.-.+.+++.+++.++.. .+.|...++. +......++.+....||.|++++.++.. T Consensus 77 ~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~-------~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~~~~y~d~~ 148 (474) T protein:vir:96 77 DWRMFTNYHQNLVDQKVAYAVANPVTFSSDDDKS-------LKTIQEVLNH-KWDDKLVDILTAASNKGIEWLQPYIDEN 148 (474) T ss_pred chhcccchHHHHHHhhhhhhcccCceeecCchHH-------HHHHHHHHhc-CHHHHHHHHHHHHHhcCeeEEEEEecCC Confidence 111 358889999999999999999998765432 2334444432 6778888899999999999988776432 Q ss_pred CcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee---eEec-cceEEEecC--------- Q lcl|NC_019527. 186 DVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG---REMH-ASRLLTIIT--------- 252 (516) Q Consensus 186 ~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g---~~iH-~SRli~~~~--------- 252 (516) . .+ .+.+++|..+.|..-+. ....+.++ -.+|...+ ..+| ..++.++.. T Consensus 149 ~---------------~~-~i~~~~p~~~~~v~d~~-~~~~~~~~-vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~ 210 (474) T protein:vir:96 149 G---------------EF-KTFRVPAEQAIPIWTNK-ERDTLKAF-IRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDY 210 (474) T ss_pred C---------------ce-EEEEEcccceEEEEcCC-CCCceEEE-EEEEeecCceEEEEEeCCeEEEEEecCCceeecc Confidence 1 11 14444554444421100 00001100 01111111 0011 011111100 Q ss_pred --------------------CcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCcc Q lcl|NC_019527. 253 --------------------RPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGE 312 (516) Q Consensus 253 --------------------~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~ 312 (516) ..+|-.. -.++-.|.|.++.+.+.+.+++.+....+.-+..++..++....... .+ T Consensus 211 ~~~~~~~~~~~~~~~~~~~~g~iPvv~-~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~---~~ 286 (474) T protein:vir:96 211 YHGEEHIQSHYYVGNKRVSWGRVPFIP-FKNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEG---QD 286 (474) T ss_pred ccccccccccccccccccCCCceeEEE-eccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCc---cc Confidence 0011110 12345689999999999999999999988888887777665422111 11 Q ss_pred HHHHHHHHHHHHHhcCCcceEEEecCCcceeEEe--cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHH Q lcl|NC_019527. 313 GGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEI 390 (516) Q Consensus 313 ~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~ 390 (516) ...... ..+ ..+++.+++++.+++.++ .+.++....++...++|...+++|-.- ++ .- |-|.||..=. T Consensus 287 ~~~~~~------~~~-~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~~-~~-~~n~Sg~Al~ 356 (474) T protein:vir:96 287 LDEFMR------NLK-YYKAINVDGDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQ-QD-KF-GNSPSGIALK 356 (474) T ss_pred ccchhh------hhh-cCceEEecCCCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCcccc-cc-cc-ccccHHHHHH Confidence 111111 111 234555555555666654 566789999999999999999999643 22 11 3356666433 Q ss_pred HHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhCCCc-CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHH Q lcl|NC_019527. 391 RSFYDDIS--SVQQSYYFSPLDTMLKVIQLSKWGEI-DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSE 467 (516) Q Consensus 391 ~~yyd~I~--~~Qe~~l~p~l~~l~~~l~~s~~g~~-~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e 467 (516) ..|...+. ...+..++..+++++++|..-..... +.++.+.|++-...+++|.+++ +.++|++|.+. T Consensus 357 ~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~i~i~f~~~~p~~~~e~~~~----------~~~ag~iS~et 426 (474) T protein:vir:96 357 FMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLNIKVQDVEITFNFNVMVNELEQSQI----------GVQSQYLSKET 426 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCCCcCHHHHHHH----------HHhcCCCchHH Confidence 33333322 44456788899999987765433222 2478999999988888887764 23579999988 Q ss_pred HHHHHHhhhccCCCCCChhhhccccccchhcC-CCC--CCCCCCCCCCCCCC Q lcl|NC_019527. 468 ARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDG-ADP--YMPDPDVLPGEEGS 516 (516) Q Consensus 468 ~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~-~~~--~~~~~~~~~~~e~t 516 (516) +++.+.. ...-+...+....|..+... .++ ...+...+.+++.| T Consensus 427 ~~~~~~~-----v~d~~~E~~ri~~E~~e~~~~~~~~~~~~~~~~~d~~~e~ 473 (474) T protein:vir:96 427 VVTNHPW-----VDDPVAELERIEQDNIDFNKQLPPLEGDANGRAQDNESET 473 (474) T ss_pred HHHhCCC-----CCCHHHHHHHHHHHHHHHHhcccccccccccccCCCcccC Confidence 8876521 01111111111111111000 000 00011112222233 No 134 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.34 E-value=2.6e-12 Score=84.14 Aligned_cols=444 Identities=10% Similarity=0.026 Sum_probs=205.9 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcc-cccCCccccccc---CcccHHHHHHHHhCchhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLN-SAAGGLYAADIQ---PFPGYQNLAALATRPEYRAF 117 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~-~~~~~~~~~~~~---~f~gy~ll~~y~~~~i~r~i 117 (516) |.+...++.....+.. .+.... ...-+.+++...-....+ .........+.. ....-..-.+++.|++++.+ T Consensus 1 m~~~~~r~~~~~a~~~---~~~~~~-~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~a 76 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGR---PEQSAS-LGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGA 76 (553) T ss_pred Ccchhhhhhccccccc---chhhhh-hhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHH Confidence 2222222222111110 000000 000011111000000000 000000000000 00011123567789999999 Q ss_pred hhhhhHHHhhCCCeeeecccc------chhhhHHHHHHHHHHHHhc--------------ChhHHHHHHHHhcccceeeE Q lcl|NC_019527. 118 ASTLSTELTREGIEITSKDRT------KAKEMASKIKELEEACEYY--------------GVMGIIQKAAEHDCFFGRGQ 177 (516) Q Consensus 118 Vd~~aed~~r~~~~i~~~~~~------~~~~~~~~i~~i~~~~~~l--------------~~~~~l~ea~~~~rlyG~a~ 177 (516) |+......+=.|+.+...-+. +.+....+-++|+..|++. .+......+++.....|-++ T Consensus 77 v~~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~ 156 (553) T protein:vir:63 77 VGYQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVL 156 (553) T ss_pred HHHHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceE Confidence 999999999999998765211 1112223345566555533 23333445555556667776 Q ss_pred EEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccc-------cccccccCcceeEEee----------- Q lcl|NC_019527. 178 ISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALD-------PTAPDFYKPSTWWVLG----------- 239 (516) Q Consensus 178 i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~d-------p~s~~yg~P~~y~v~g----------- 239 (516) +.+... .....|.+ + .|.+|+|..|... ++..+ ..-..+|+|..|+|.. T Consensus 157 ~~~~~~-~~~~~~~~----------~-~lq~ie~drl~~~-~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~ 223 (553) T protein:vir:63 157 ATAEWD-RAANRPYA----------T-CFQMVSTDRLSNP-YQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAP 223 (553) T ss_pred EEeeec-cCCCCccc----------c-eEEEechhhcCCC-CCCCCCCeeEeeeEECCCCceEEEEeeccCCCccccccc Confidence 665442 11111111 1 1455666655321 11000 0112467888887731 Q ss_pred -----------eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHH-HhC-Cceeeecchh Q lcl|NC_019527. 240 -----------REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVD-KFS-RTFLKTNMAQ 306 (516) Q Consensus 240 -----------~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~-~~~-~~v~k~~~~~ 306 (516) ..|+.++|||+-... ...+.-|+|.|-.++..|++++.-..+.-.-.. .+. ..+++.+... T Consensus 224 ~~~~~~r~~~~~~v~a~~vlH~f~~~------r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~~ 297 (553) T protein:vir:63 224 DMYKWKFVQQSKPWGRRQVIHILEPR------EPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELPP 297 (553) T ss_pred cccceeeeccccccChhHheeccccc------CCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCh Confidence 246778888775433 234456999999999999888776665433222 222 2344543211 Q ss_pred -----hhcCccHHH-----HHHHHHHH-------HHhcCCcceEEEecCCcceeEEecc--cCCHHHHHHHHHHHHHhhh Q lcl|NC_019527. 307 -----VLNGGEGGD-----VFDRVEMY-------VNMQSNLGLAVMDFDSEDIVQVNTP--LSGLADLQSQSQEHMCSVS 367 (516) Q Consensus 307 -----~l~~~~~~~-----l~~r~~~~-------~~~~sn~g~~~id~~~e~~e~~~~~--lsgl~d~~~~~~~~iaaas 367 (516) .+..+.+.. ........ ....-.-|++.....+++++.++.+ -++..++.......||+.. T Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaagl 377 (553) T protein:vir:63 298 EFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAF 377 (553) T ss_pred hhhhhhcccccccccccccccccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhc Confidence 111111000 00000000 0001123455544556888888755 4689999999999999999 Q ss_pred cCCceeeeccccccccccchHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHhCCCcCCcc--------------- Q lcl|NC_019527. 368 KIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYY----FSPLDTMLKVIQLSKWGEIDDAI--------------- 428 (516) Q Consensus 368 ~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l----~p~l~~l~~~l~~s~~g~~~~d~--------------- 428 (516) |||.-.|.|--....=||.-..+..+...++.+|+.++ +|+.+.+++..+++.--.+|..+ T Consensus 378 Gi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~ 457 (553) T protein:vir:63 378 GMSYEEFTRDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALS 457 (553) T ss_pred CCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhh Confidence 99999998853222224555667778888888887554 44555555444433222233321 Q ss_pred eEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh----------------hcccc Q lcl|NC_019527. 429 TFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL----------------EIVQP 492 (516) Q Consensus 429 ~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~----------------e~~~~ 492 (516) ..+|. ..-.+-.| -.|.+++....+++|+.|..++..+.+.+.+..+..+-.+. ..... T Consensus 458 ~~~w~----~p~~~~iD-P~Ke~~A~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~ 532 (553) T protein:vir:63 458 KCEWI----GASQGQID-QLKETQAAVMRIDAGLSTYEREIARLGGDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDG 532 (553) T ss_pred ceeee----cCCccccC-hHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCC Confidence 12222 22222111 14567788888999999998887665444322221111100 00000 Q ss_pred ccchhcCCCCCCCCCCCCCCC Q lcl|NC_019527. 493 EMFDDDGADPYMPDPDVLPGE 513 (516) Q Consensus 493 e~~~~e~~~~~~~~~~~~~~~ 513 (516) .....++++.+..+...+.++ T Consensus 533 ~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 533 RDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred cccCCCCCCCCCCCCcccccC Confidence 111111111111111111111 No 135 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.33 E-value=6.7e-12 Score=81.85 Aligned_cols=406 Identities=12% Similarity=0.104 Sum_probs=178.5 Q ss_pred hhhHHH-HhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHH-Hh Q lcl|NC_019527. 33 AMRRAV-MKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAAL-AT 110 (516) Q Consensus 33 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y-~~ 110 (516) +-...- +..+.++-.. ..+++ .....++-|-+.-...+-.-.+-++.. .. T Consensus 1 ~~t~~~~i~~L~~~~~~------------------~~~r~----------~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~ 52 (480) T protein:vir:78 1 MTTYHEHVERLQGLLAR------------------DLPNL----------LEAEAYRNGTRRLKTIGIGAPPELAYLDVQ 52 (480) T ss_pred CCCHHHHHHHHHHHHHH------------------HHHHH----------HHHHHHHhccccccccccccchhHhhhhhh Confidence 111000 0111000000 00000 001111111110000000000111222 24 Q ss_pred CchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCC-Cccc Q lcl|NC_019527. 111 RPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGA-DVSV 189 (516) Q Consensus 111 ~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~-~~~~ 189 (516) +.++++||+..++-+.=+|+.+..+.+ . .+.|...|++-++.....++.+.+.+||.|++++..... +.+. T Consensus 53 ~n~~~~ivd~~~~~l~~~g~~~~~d~~--~------~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~ 124 (480) T protein:vir:78 53 PGWVATYLRTLSDRLDIEGFRISEDSE--G------LEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDP 124 (480) T ss_pred cchHHHHHHHHHhhhccCceecCCCch--h------HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCC Confidence 567899999999988888886543221 1 245667777778899999999999999999988754211 1110 Q ss_pred CcccccccccccceeeEEeecceeecccccccccccc-----------ccccCcceeEE--eee---------------- Q lcl|NC_019527. 190 PLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTA-----------PDFYKPSTWWV--LGR---------------- 240 (516) Q Consensus 190 Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s-----------~~yg~P~~y~v--~g~---------------- 240 (516) .|.. .+.+++|.++.+..-. ..... .+.+.+.++.+ .+. T Consensus 125 ----------~g~~-~i~~~~p~~~~~~~D~-~~~~~~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~ 192 (480) T protein:vir:78 125 ----------AGIP-LIRVESPLYMYAELDP-RNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVV 192 (480) T ss_pred ----------CCee-EEEEEcccceEEEEcC-CCccceEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCcccccc Confidence 0111 1444555554432100 00000 01111111111 000 Q ss_pred ---Ee-cc-c--eEEEecCCcchhhhhhccCCCCchHHHH-HHHHHHHHHHHHHHHHHHHHHhCCceeee-cch-hhhcC Q lcl|NC_019527. 241 ---EM-HA-S--RLLTIITRPLPDMLKPAYNFSGISMSQL-AQPYVENWLRTRQSVSDLVDKFSRTFLKT-NMA-QVLNG 310 (516) Q Consensus 241 ---~i-H~-S--Rli~~~~~~~p~~~k~~~~~~G~S~le~-~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~-~~~-~~l~~ 310 (516) .+ |. . -|+.|.++ ......+|.|.++. +.+.+.+++++.......+..++...+.+ +.. ..... T Consensus 193 ~~~~~~~~~g~vPvv~f~n~------~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~ 266 (480) T protein:vir:78 193 DGDVIKHGLGVVPVVPLTND------PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTN 266 (480) T ss_pred ccccccCCCCCcceEEeecc------cccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhhcCCcccccc Confidence 00 10 0 11112111 11233689998875 77777888887777666555555433221 110 00000 Q ss_pred ccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEe-cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHH Q lcl|NC_019527. 311 GEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVN-TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGE 389 (516) Q Consensus 311 ~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~-~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D 389 (516) ..... .+..+ +..+..+.++.-++.+++ .++....+.++....+|++.++||..-|-| ... -++||+.- T Consensus 267 ~~~~~------~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~-~~~-n~~Sg~Al 336 (480) T protein:vir:78 267 DGENT------TLDIY--YGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSS-SSE-NPASAEAI 336 (480) T ss_pred ccccc------hhhhh--hhhhccCCCCCceEEecCccCHHHHHHHHHHHHHHHhcccCCChHHhcc-ccC-cchHHHHH Confidence 00000 11111 111222333223343333 234456667778888999999999877744 322 23567654 Q ss_pred HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhCCCcCC---cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcC--C Q lcl|NC_019527. 390 IRSFYDDIS--SVQQSYYFSPLDTMLKVIQLSKWGEIDD---AITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNS--V 462 (516) Q Consensus 390 ~~~yyd~I~--~~Qe~~l~p~l~~l~~~l~~s~~g~~~~---d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g--v 462 (516) ...+...+. ..++..+.+.|.+++++++.-..+..+. ++++.|.+...++..+.|+...+. +++| + T Consensus 337 k~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g~~~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl-------~~~g~~~ 409 (480) T protein:vir:78 337 IATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKL-------YANGQGP 409 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHH-------HHhcccc Confidence 433433333 2334567888999999887665544433 578899999999999877765543 3333 4 Q ss_pred CCHHHHHHHHHhhhccCCCCCChh-hhc---------------cccccchhcCCCCCCCC----CCCC-CCCCCC Q lcl|NC_019527. 463 IDPSEARQQLSDDPDSGWDNIDGD-LEI---------------VQPEMFDDDGADPYMPD----PDVL-PGEEGS 516 (516) Q Consensus 463 i~~~e~r~~l~~~~~~~~~~~d~~-~e~---------------~~~e~~~~e~~~~~~~~----~~~~-~~~e~t 516 (516) ++.+.+++.| +|..-..+ ++. ..++..+. ..++.+++ .+.. .|-..| T Consensus 410 ~s~et~~~~l------g~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 477 (480) T protein:vir:78 410 IPKEQARIDL------GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADA-TPKPTVTETKTETQTSPSGFNRT 477 (480) T ss_pred CCHHHHHhcC------CCCHhHHHHHHHHHHHHHHHHHHHhhccccccCCC-CCCCCCCCCCCccccccCCCCcc Confidence 4554444433 11110000 000 00000000 00011111 0011 111111 No 136 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.33 E-value=3.2e-11 Score=78.14 Aligned_cols=404 Identities=11% Similarity=0.068 Sum_probs=194.3 Q ss_pred CCCccCCCccchhcccccccchhhhcccccC-Cc-ccccccCcccHHHHHHHHh-------------------------- Q lcl|NC_019527. 59 MPGVVPAGTTPAVAMDSLCGPTYQFLNSAAG-GL-YAADIQPFPGYQNLAALAT-------------------------- 110 (516) Q Consensus 59 ~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~-~~-~~~~~~~f~gy~ll~~y~~-------------------------- 110 (516) |-...|=+....+..++.......-...... .. ...+......|+.+..|+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~ 80 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKT 80 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccccccccccc Confidence 3333333322122222211100000000000 00 0000000112333333331 Q ss_pred -----CchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCC Q lcl|NC_019527. 111 -----RPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGA 185 (516) Q Consensus 111 -----~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~ 185 (516) +.+++.||+..+.-++.+++++++.++... +.|+ .+.+=++...+.++.+....||.|++++.++.. T Consensus 81 ~~ri~~n~~~~ivd~~~~yl~g~~~~~~~~d~~~~-------~~l~-~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d 152 (503) T protein:vir:59 81 NNRTSHAWHKLFVDQKTQYLVGEPVTFTSDNKTLL-------EYVN-ELADDDFDDILNETVKNMSNKGIEYWHPFVDEE 152 (503) T ss_pred cceeecchHHHHHHHHHhhhhcCCeeeccCcHHHH-------HHHH-HHHhcCHHHHHHHHHHHHhhCCeEEEEEeecCC Confidence 467899999999999999999987654221 2222 233347888899999999999999998877542 Q ss_pred CcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe---e-----eEec-cceEEEecCC--- Q lcl|NC_019527. 186 DVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL---G-----REMH-ASRLLTIITR--- 253 (516) Q Consensus 186 ~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~---g-----~~iH-~SRli~~~~~--- 253 (516) . .+ .+.+++|..+.|..-+. ....+.++ -.+|... + ..|| +.++.+|... T Consensus 153 g---------------~~-~i~~~~p~~~~~i~d~~-~~~~~~~~-ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~ 214 (503) T protein:vir:59 153 G---------------EF-DYVIFPAEEMIVVYKDN-TRRDILFA-LRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGV 214 (503) T ss_pred C---------------ce-EEEEEccceeEEEEeCC-CCCceEEE-EEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCc Confidence 1 11 14455555544421110 11111111 0111110 0 0111 1222222110 Q ss_pred --------------------------cchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhh Q lcl|NC_019527. 254 --------------------------PLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQV 307 (516) Q Consensus 254 --------------------------~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~ 307 (516) .+|-. .-.++-+|.|.++.+.+.+.+++.+....+.-+..++..++...... T Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv-~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~- 292 (503) T protein:vir:59 215 YQMDYSYGENNPRPHMTKGGQAIGWGRVPII-PFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYD- 292 (503) T ss_pred ccccccccccccccceeecceeccCCccceE-EecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCC- Confidence 01111 11234579999999999999999998888887777777776543211 Q ss_pred hcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeecccccccccc Q lcl|NC_019527. 308 LNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNAS 385 (516) Q Consensus 308 l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnat 385 (516) ..+.......+. ..+++.++++ .+.+.+ +.+.+++...++.+.+.|...+.+|-.-. + .-+| |.| T Consensus 293 --~~~~~~~~~~~~-------~~~~~~~~~~-~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~-~~~~-~~S 359 (503) T protein:vir:59 293 --GENPKEFTANLR-------YHSVIKVSGD-GGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSP-E-TIGG-GAT 359 (503) T ss_pred --ccccchhhhhhh-------cccceeccCC-CcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCc-c-cccc-ccc Confidence 111112222111 1233344433 445444 45667888899999999988888885322 1 1112 455 Q ss_pred chHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH---hCCCc-C--CcceEEeCCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 386 SEGEIRSF---YDDISSVQQSYYFSPLDTMLKVIQLS---KWGEI-D--DAITFKFKSLWQTSAKEESEIRFNKAQEAQI 456 (516) Q Consensus 386 ge~D~~~y---yd~I~~~Qe~~l~p~l~~l~~~l~~s---~~g~~-~--~d~~~~f~pL~~~sekEkAei~~~~a~a~~~ 456 (516) |..=...| ...++. .+..++..|++++++++.- ..+.. + .++++.|++-...+.++.++. +.+ T Consensus 360 g~Ai~~~~~~l~~k~~~-~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~~~~-------~~k 431 (503) T protein:vir:59 360 GPALENLYALLDLKANM-AERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRIQNDSEIVQS-------LVQ 431 (503) T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCCCCCHHHHHHH-------HHH Confidence 55322222 333333 3456788888887776432 11211 2 368999999999999886554 455 Q ss_pred HHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccccc----------chhcCCCCCCCCC-----CCCCCCCCC Q lcl|NC_019527. 457 YITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEM----------FDDDGADPYMPDP-----DVLPGEEGS 516 (516) Q Consensus 457 ~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~----------~~~e~~~~~~~~~-----~~~~~~e~t 516 (516) ++++|++|.+.+.+.+..- +....+.+....|. .+.+.......+. +++.+++|= T Consensus 432 l~~~GiiS~et~l~~l~~v-----~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 501 (503) T protein:vir:59 432 GVTGGIMSKETAVARNPFV-----QDPEEELARIEEEMNQYAEMQGNLLDDEGGDDDLEEDDPNAGAAESGGAGQ 501 (503) T ss_pred HHhCCCCchHHHHHhCCCC-----CCHHHHHHHHHHHHHHHHhhhccccCccCCCCCCCcCCCCCCcccCCCCCC Confidence 7888888887777664210 00001111111000 0000000000111 111111111 No 137 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=99.31 E-value=5.3e-12 Score=82.41 Aligned_cols=432 Identities=14% Similarity=0.061 Sum_probs=194.8 Q ss_pred CCc----chhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhccc-c Q lcl|NC_019527. 1 MWP----FDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMD-S 75 (516) Q Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~d-s 75 (516) ||- +-|..++ ++.- . ++++. . -.+..++++ . T Consensus 1 m~~~~~~~~~~~~~-~~~~-------------------~----~~~~~-----------~---------~~~~~i~~~~~ 36 (499) T protein:vir:80 1 MINQIIAGVKGVMR-RMGL-------------------L----KSLKD-----------V---------TDHKKVNANDE 36 (499) T ss_pred ChhHHHHHHHHHHH-Hhcc-------------------c----cchhh-----------h---------hcCCCCcCCHH Confidence 443 2221111 1100 0 00000 0 000111110 0 Q ss_pred cccchhhhcccccCCcc----cccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHH Q lcl|NC_019527. 76 LCGPTYQFLNSAAGGLY----AADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELE 151 (516) Q Consensus 76 ~~~~~~~~~~~~~~~~~----~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~ 151 (516) +... .......+.|.+ ....+ ..+...-.......+++.+|+..|+-++.+..+|++.++.. -+.|+ T Consensus 37 ~~~~-i~~~~~~Y~g~~~~~~~~~~~-~~~~~~~~~~~s~n~~~~iv~~~a~~l~~ep~~i~~~d~~~-------~e~l~ 107 (499) T protein:vir:80 37 DYKY-IDMWKRLYQGNYAEWHNLNYE-HNGNPVNRRQLSMNLPKVTAKYMSKLLFNEKVKINIDDETA-------EEFVL 107 (499) T ss_pred HHHH-HHHHHHHhcCCcchhhccccc-cCCCccccceeecchHHHHHHHHHHhhhCCcceEeeCCHHH-------HHHHH Confidence 0000 011111111110 00000 01111111223458899999999999999999998865422 24567 Q ss_pred HHHHhcChhHHHHHHHHhcccceeeEEEEEecCC-C-------ccc--CcccccccccccceeeEEeecceeeccccccc Q lcl|NC_019527. 152 EACEYYGVMGIIQKAAEHDCFFGRGQISINIKGA-D-------VSV--PLILDPRTIKKGSLTGFSNIEPMWTSPSAYNA 221 (516) Q Consensus 152 ~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~-~-------~~~--Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~ 221 (516) ..++.-++...+++++..+..+|++++.+.++.. . .++ |+.-+ .+.+..+..+.++......+.. T Consensus 108 ~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~a~~~~Pi~~d-----~~~~~~~~f~~~~~~~~~~y~~ 182 (499) T protein:vir:80 108 NVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADCMYPLSND-----SENVDECLIANSFHKNNKYYKL 182 (499) T ss_pred HHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEEEEEEcCCceEEEEec-----CCCeEEEEEEEEEeecCeEEEE Confidence 7777778999999999999999999998887642 1 111 22111 1223333222222111000000 Q ss_pred ccccccccc-CcceeEEe------------eeEec---------c-------ceE--EEecCCcchhhhhhccCCCCchH Q lcl|NC_019527. 222 LDPTAPDFY-KPSTWWVL------------GREMH---------A-------SRL--LTIITRPLPDMLKPAYNFSGISM 270 (516) Q Consensus 222 ~dp~s~~yg-~P~~y~v~------------g~~iH---------~-------SRl--i~~~~~~~p~~~k~~~~~~G~S~ 270 (516) . ..-...+ +-..|+|. |..|- + +|. ++|.. +.+.. ......+|+|+ T Consensus 183 l-E~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~-~~~N~-~~~~splG~S~ 259 (499) T protein:vir:80 183 L-EWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKP-NIANN-KNLTSPLGISV 259 (499) T ss_pred E-EEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCCCccceEeecC-Ccccc-ccCCCccCCch Confidence 0 0000000 00012221 11110 0 111 01110 11100 01123569999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCceee-ecchhhhcCccHHHHHHHHHHHHHhcCCcceEE-EecC-CcceeEEec Q lcl|NC_019527. 271 SQLAQPYVENWLRTRQSVSDLVDKFSRTFLK-TNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAV-MDFD-SEDIVQVNT 347 (516) Q Consensus 271 le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k-~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~-id~~-~e~~e~~~~ 347 (516) +..+.+.+..++.+......-+......++- ..+.......+++.. ..+.....-+.... .+++ ...++.++. T Consensus 260 ~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~g~~~----~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 335 (499) T protein:vir:80 260 YANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTT----QYFDSTDEAFFLYQGEQDDNGKAIKDISV 335 (499) T ss_pred HhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCCCCCcc----cCCCcccceeeEeeccCCCCcCceeEecC Confidence 9999999999999988877666554444432 122221211111111 00111000011111 1111 123666654 Q ss_pred cc--CCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--h Q lcl|NC_019527. 348 PL--SGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGE---IRSFYDDISSVQQSYYFSPLDTMLKVIQLS--K 420 (516) Q Consensus 348 ~l--sgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D---~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s--~ 420 (516) .+ ....+.++.+.+++...+|++... ||...+|. .|+..= ...-+.++..+| ..++..|++|+..|... . T Consensus 336 ~ir~e~~~~~l~~~l~~i~~~~g~s~~~-fg~~~~g~-~TAtei~s~~~~l~~~~~~~~-~~~~~~l~~l~~~il~~~~~ 412 (499) T protein:vir:80 336 EIRSTEFIESINAMLRIYAMQVGLSAGT-FTFDENGL-KTATEVVSEKSETYQTKNSHS-QLIEQGIKEMIVSILEVGKL 412 (499) T ss_pred cCChHHHHHHHHHHHHHHHHhcCCChhh-cCCCcccc-hhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Confidence 44 345677888888999999998654 55555564 344332 233355566555 45688888887776542 1 Q ss_pred ----CCCc--CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh-hccccc Q lcl|NC_019527. 421 ----WGEI--DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL-EIVQPE 493 (516) Q Consensus 421 ----~g~~--~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~-e~~~~e 493 (516) .|.. +.+++|.|++-...++.+.++. ..+++.+|+++.+.++..+ .+.+++. +..-++ T Consensus 413 ~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~-------~~~~~~~Gi~S~et~l~~~--------~~~~d~ea~~el~~ 477 (499) T protein:vir:80 413 IKAYDGDTVELDTITVDFDDSIAQDEDTTINR-------YTTAKNQGMIPLKIALQRA--------WNITEAEADEWAEM 477 (499) T ss_pred hccccCCCCCccceEEEeCCCCCCCHHHHHHH-------HHHHHHcCCCCHHHHHhhc--------CCCChHHHHHHHHH Confidence 1222 3479999999888888775554 4557889999988776542 2222211 111111 Q ss_pred cchhcCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 494 MFDDDGADPYMPDPDVLPGEEG 515 (516) Q Consensus 494 ~~~~e~~~~~~~~~~~~~~~e~ 515 (516) ..++.....+.+++...-|++- T Consensus 478 i~~E~~~~~~~~d~~g~~ge~e 499 (499) T protein:vir:80 478 LAKEKQAEIPNNDMTGIFGEEE 499 (499) T ss_pred HHHHhhcCCCCCCccccCCCCC Confidence 1111111111122222111111 No 138 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=99.31 E-value=2.2e-11 Score=78.97 Aligned_cols=408 Identities=9% Similarity=0.022 Sum_probs=193.4 Q ss_pred cccCCCCCCCccCCCccchhccc----------ccccchhhhcccccCCcccccccCc----ccHHHHHHH-----HhCc Q lcl|NC_019527. 52 KWAPPQLMPGVVPAGTTPAVAMD----------SLCGPTYQFLNSAAGGLYAADIQPF----PGYQNLAAL-----ATRP 112 (516) Q Consensus 52 ~~~~~~~~~gv~~~~~~~~~a~d----------s~~~~~~~~~~~~~~~~~~~~~~~f----~gy~ll~~y-----~~~~ 112 (516) ...-+..+.+++=.....-.+.+ ............++.|.+.-..... .+......+ -.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~ 80 (479) T protein:vir:79 1 MLNIYISETDLIKVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINN 80 (479) T ss_pred CCCceecccceEeeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecc Confidence 22223223333110000000100 0000001111111111110000000 000000001 1267 Q ss_pred hhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcc Q lcl|NC_019527. 113 EYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLI 192 (516) Q Consensus 113 i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ 192 (516) +++.||+..+.-++.+++++++.++.. ..+.+.+.+=++...+.++.+...+||.+++++.++... T Consensus 81 ~~~~Ivd~~~~~l~g~p~~~~~~~~~~--------~~~~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~------ 146 (479) T protein:vir:79 81 YHKLLVDQKVGYSVGNPIVFNADDDNL--------TKLLNDLLGEEFDDTITELYLNASNKGVEWLHPYINRKG------ 146 (479) T ss_pred hHHHHHHHHHhhhhcCCceeccCCHHH--------HHHHHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCC------ Confidence 799999999999999999998765432 122233434478899999999999999999888764321 Q ss_pred cccccccccceeeEEeecceeeccccccccccccccccCcceeEEe---e-----eEec-cceEEEecCC---------- Q lcl|NC_019527. 193 LDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL---G-----REMH-ASRLLTIITR---------- 253 (516) Q Consensus 193 ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~---g-----~~iH-~SRli~~~~~---------- 253 (516) .+ .+.+++|..+.|..-. .....+.++ -.+|.+. + ..+| +.++.+|... T Consensus 147 ---------~~-~i~~~~p~~~~~v~d~-~~~~~~~~~-ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~ 214 (479) T protein:vir:79 147 ---------EF-KYVIIPAEEAIPIWDS-KRQRELVAF-IRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLY 214 (479) T ss_pred ---------ce-EEEEEccceeEEEEeC-CCCCceEEE-EEEEEEeecCCceEEEEEEEeCCcEEEEEecCCcccccccc Confidence 11 1444555554443110 000001111 0011110 0 0011 1122111100 Q ss_pred ------------------------cchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhc Q lcl|NC_019527. 254 ------------------------PLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLN 309 (516) Q Consensus 254 ------------------------~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~ 309 (516) .+|-. .-.++-+|.|.++.+.+.+.+++.+....+.-+..++...+...... T Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~vPvv-~~~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~--- 290 (479) T protein:vir:79 215 DEYGKMTDIQEGHFRINNKEQGWGKVPFI-PFKNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYP--- 290 (479) T ss_pred cccccccccccccccccccccCCCcccEE-EecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCC--- Confidence 00111 11234579999999999999999998888877777766655432211 Q ss_pred CccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccch Q lcl|NC_019527. 310 GGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSE 387 (516) Q Consensus 310 ~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge 387 (516) .....+....++ ..++..+++ +.+++.+ +.+.+++...++.+.+.|...+.+|..-.-+ .| |+||+ T Consensus 291 ~~~~~~~~~~~~-------~~~~i~~~~-~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~g-n~Sg~ 358 (479) T protein:vir:79 291 GTSLQEFIDNIR-------YYKSIKVDG-GGGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQN---TG-DKSGV 358 (479) T ss_pred ccccccchhhhh-------hccceecCC-CCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccccccc---cc-chhHH Confidence 111111111111 122333333 3555554 4556678889999999999999999643322 13 45665 Q ss_pred HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH---hCCC-c-CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019527. 388 GEIRSFYDD--ISSVQQSYYFSPLDTMLKVIQLS---KWGE-I-DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITN 460 (516) Q Consensus 388 ~D~~~yyd~--I~~~Qe~~l~p~l~~l~~~l~~s---~~g~-~-~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~ 460 (516) .=...|... .....+..++..+++++++++.- ..+. + +.+++|.|++-...++++.|++..+ + . T Consensus 359 Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~k-------l--~ 429 (479) T protein:vir:79 359 ALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNKSYDYKTVQITFNHSMIINEAEKIDMAAK-------S--T 429 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHHHHH-------H--h Confidence 322222222 12334556788888888877542 1221 2 2478999999999999998776544 2 4 Q ss_pred CCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 461 SVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 461 gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) |++|.+.+.+.+.. .+..+.+.+....|..+..+.....++. ..+..+.| T Consensus 430 g~iS~et~l~~l~~-----v~d~~~E~~ri~~E~~~~~~~~~~~~~~-~~~~~~e~ 479 (479) T protein:vir:79 430 GIVSDETIVSNHPW-----VEDVNDELERLKKQEDTQKEYDDLIPNN-QDGVIDET 479 (479) T ss_pred ccCcHHHHHHhCCC-----CCCHHHHHHHHHHHHHHHHHHHhccCcc-cCCCcCcC Confidence 88998888766511 1111112222222211111111111111 12222222 No 139 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.31 E-value=9.9e-12 Score=80.91 Aligned_cols=379 Identities=13% Similarity=0.023 Sum_probs=186.0 Q ss_pred hhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCc-ccHHHHHHHH-hC Q lcl|NC_019527. 34 MRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPF-PGYQNLAALA-TR 111 (516) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f-~gy~ll~~y~-~~ 111 (516) +-...+..+.++-.+- .........++-|-+.....+- .--++..+++ .. T Consensus 1 m~~~~i~~L~~~~~~~----------------------------~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~ 52 (422) T protein:vir:97 1 MNYMGMGYLRRKLALF----------------------------KTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVL 52 (422) T ss_pred CChHHHHHHHHHHHHH----------------------------HHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhc Confidence 2212222111111000 0000001112212111100100 0012223333 23 Q ss_pred chhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCc Q lcl|NC_019527. 112 PEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPL 191 (516) Q Consensus 112 ~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl 191 (516) ...++||+..++-+.=.||+.. +. .+.+.|.+-++.....++.+-+.+||.|++++.-+..+ ..|. T Consensus 53 nw~~~~Vd~~a~rl~~~Gf~~~-----d~--------~l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~-~~p~ 118 (422) T protein:vir:97 53 EWTAKGVDSLADRIIFREFTND-----DF--------NAWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAED-GLPK 118 (422) T ss_pred chhHHHHHHHHhccccceeeCC-----ch--------hHHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCC-CeeE Confidence 5679999999986666777531 11 24556667778888889999999999999988543222 2231 Q ss_pred ccccccccccceeeEEeecceeeccc----------cc--cccc----cccccccCccee-EEe--ee---Eeccc---e Q lcl|NC_019527. 192 ILDPRTIKKGSLTGFSNIEPMWTSPS----------AY--NALD----PTAPDFYKPSTW-WVL--GR---EMHAS---R 246 (516) Q Consensus 192 ~ld~~~I~~g~l~~l~v~d~~~v~p~----------~~--~~~d----p~s~~yg~P~~y-~v~--g~---~iH~S---R 246 (516) +++++|.++... .+ ...| +..-.|+.+..+ .+. +. .-|+- = T Consensus 119 --------------i~~~sp~~~~~i~D~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 184 (422) T protein:vir:97 119 --------------MQVIEASKATGILDPTTFLLTEGYAILESDSNGNPTLEAYFTDKDIWYYPKKGKPYNIKNPTGHPL 184 (422) T ss_pred --------------EEEechhhEEEEEeCCCCcceeeEEEEEecCCCcEEEEEEEcCceEEEEcCCCccccccCCCCCcc Confidence 222333332211 00 0000 001111111111 110 00 01111 1 Q ss_pred EEEecCCcchhhhhhccCCCCchHH-HHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHH Q lcl|NC_019527. 247 LLTIITRPLPDMLKPAYNFSGISMS-QLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVN 325 (516) Q Consensus 247 li~~~~~~~p~~~k~~~~~~G~S~l-e~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~ 325 (516) ++.|.+++ .....+|.|.+ +.++..+.++.++......-..-++.+...+-... ..+...+.+..++. T Consensus 185 vv~~~n~~------~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d-~d~~~~~~~~~~~~---- 253 (422) T protein:vir:97 185 LVPIIHRP------DAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMD-PDAKPMEKWRATVS---- 253 (422) T ss_pred eEEecccC------CCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccC-cccccCchhhhhhh---- Confidence 22222221 22345899977 77888888888877665444444444333221111 00000111222221 Q ss_pred hcCCcceEEEecC--Cc--ceeEEe-cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchH---HHHHHHHHH Q lcl|NC_019527. 326 MQSNLGLAVMDFD--SE--DIVQVN-TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEG---EIRSFYDDI 397 (516) Q Consensus 326 ~~sn~g~~~id~~--~e--~~e~~~-~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~---D~~~yyd~I 397 (516) .++.+..+ ++ ++.+++ .++.+..+.+.....++|+.++||..-|-|.+ . .++|++. ........+ T Consensus 254 -----~i~~~~~de~~~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~-~-NpsSa~Ai~a~~~~L~~ka 326 (422) T protein:vir:97 254 -----TLLEISKDEDGDKPTVGQFTTASMAPFMEHLKMYASLFAGGSGLTLDDLGFPS-D-NPSSVESIKAAHENLRAAG 326 (422) T ss_pred -----hhhccCCCCCCCcceeeecCCCChhHHHHHHHHHHHHHhcccCCCHHHhcccc-C-chhHHHHHHHHHHHHHHHH Confidence 12333221 11 344443 56777888899999999999999987665533 2 2266664 344455566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCC--cCC---cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHc--CCCCHHHHHH Q lcl|NC_019527. 398 SSVQQSYYFSPLDTMLKVIQLSKWGE--IDD---AITFKFKSLWQTSAKEESEIRFNKAQEAQIYITN--SVIDPSEARQ 470 (516) Q Consensus 398 ~~~Qe~~l~p~l~~l~~~l~~s~~g~--~~~---d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~--gvi~~~e~r~ 470 (516) +.+|+ .+...++++.++++.-..+. .++ ++.+.|.|....+..+.|+ .|+++.+++++ |+.+.+.+++ T Consensus 327 ~~k~~-~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~----~aDa~~Kl~~a~~~~~~~~~~~~ 401 (422) T protein:vir:97 327 RKAQR-SFSSGFLNVAYIAVCLRDEFPYLRNQFMDTVIKWEPLFEADANMLTL----VGDGAIKLNQAIPGFMDADVIRD 401 (422) T ss_pred HHHHH-HHHHHHHHHHHHHHHHhcCCcccchhhccceEEEccCCCCChHHHHH----HHHHHHHHHhhccccccHHHHHH Confidence 66665 46888999888766543332 233 4789999888777666555 45778888888 6788888888 Q ss_pred HHHhhhccCCCCCChhhhccccccchhcC Q lcl|NC_019527. 471 QLSDDPDSGWDNIDGDLEIVQPEMFDDDG 499 (516) Q Consensus 471 ~l~~~~~~~~~~~d~~~e~~~~e~~~~e~ 499 (516) .| ||+..+......+....+ + T Consensus 402 ~l------g~~~~~~~~~~~~~~~~d--~ 422 (422) T protein:vir:97 402 LT------GVKGADKPIPAITEVTTD--G 422 (422) T ss_pred Hc------CCCchhHHHHHHHhhhcc--C Confidence 87 665443333222222111 1 No 140 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=99.30 E-value=1.4e-11 Score=80.04 Aligned_cols=408 Identities=9% Similarity=0.013 Sum_probs=197.1 Q ss_pred HhhcCCCccccccCCCCCCCccCCCcc--chhccccc-------------cc--chhhhcccccCCcccccccCcc---- Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTT--PAVAMDSL-------------CG--PTYQFLNSAAGGLYAADIQPFP---- 100 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~--~~~a~ds~-------------~~--~~~~~~~~~~~~~~~~~~~~f~---- 100 (516) |..+-+. |- -.|.+.- ..+--|+. .. ........++.|-+.-....+- T Consensus 1 ~~~~~~~-------~~----~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~ 69 (474) T protein:vir:96 1 MINIIRM-------PW----DKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLH 69 (474) T ss_pred CcccccC-------CC----CCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhc Confidence 2222221 10 0111110 00100110 00 0001111122221100000000 Q ss_pred --cHHHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeE Q lcl|NC_019527. 101 --GYQNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQ 177 (516) Q Consensus 101 --gy~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~ 177 (516) .+...+-.+ .+.+++.||+..+.-++.+++++++.++... +.|+..++ =+....+.++.+....||.|+ T Consensus 70 ~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~-------~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~ 141 (474) T protein:vir:96 70 GNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVL-------DVIHQVLD-TRWDNKLIDILTAASNKGIDW 141 (474) T ss_pred ccccccccccccccchHHHHHHhhhhhhcccCceeccCChHHH-------HHHHHHHh-ccHHHHHHHHHHHHhhCCeEE Confidence 000001111 3588899999999999999999987654321 23333332 368888999999999999999 Q ss_pred EEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEeee---Eec-cceEEEecCC Q lcl|NC_019527. 178 ISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGR---EMH-ASRLLTIITR 253 (516) Q Consensus 178 i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~---~iH-~SRli~~~~~ 253 (516) +++.++... .+ .+.+++|..+.|..- ..+...+.++ -.+|.+.+. .|| +.++.+|... T Consensus 142 ~~~~~d~~~---------------~~-~i~~~~p~~~~~v~d-~~~~~~~~a~-ir~~~~~~~~~~~vy~~~~i~~~~~~ 203 (474) T protein:vir:96 142 LQVYINEDG---------------EL-KLFRVPAEQAIPIWT-DKEREQLNAF-IRIFTFNGETKVEYWTAETVTYYVYE 203 (474) T ss_pred EEeeeCCCC---------------ce-EEEEEcccceEEEEc-CCCCCceEEE-EEEEeecCeeEEEEEeCCeEEEEEEc Confidence 988774321 11 144556655554321 1111111111 122222221 122 2333333210 Q ss_pred -------------------------cchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhh Q lcl|NC_019527. 254 -------------------------PLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVL 308 (516) Q Consensus 254 -------------------------~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l 308 (516) .+|-. .-.++-.|.|.++.+.+.+.+++.+....+.-+..++..++...... T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv-~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~-- 280 (474) T protein:vir:96 204 NGGLIPDFYYGDEHIQTHFSTGSWERVPFI-AFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYE-- 280 (474) T ss_pred CCceeeccccccccccCcccccCCCccceE-EecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCC-- Confidence 01111 11234468999999999999999988888887777776665432111 Q ss_pred cCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccc Q lcl|NC_019527. 309 NGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASS 386 (516) Q Consensus 309 ~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatg 386 (516) ..+.......+ + ..+++.+++ +.+++.+ +.+.+++...++.+.++|...+++|-.-.-+ -+| |.|| T Consensus 281 -~~~~~~~~~~~------~-~~~~i~~~~-~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~--~~~-n~Sg 348 (474) T protein:vir:96 281 -GEDLSEFMEGL------K-YYKAINVSS-DGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDK--FGS-ATSG 348 (474) T ss_pred -cccccchhhhh------h-ccceeeccC-CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccc--ccc-ccHH Confidence 11111111111 1 122333333 3445544 5667789999999999999999999643322 122 4566 Q ss_pred hHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhCCCc-CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCC Q lcl|NC_019527. 387 EGEIRSFYDDIS--SVQQSYYFSPLDTMLKVIQLSKWGEI-DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVI 463 (516) Q Consensus 387 e~D~~~yyd~I~--~~Qe~~l~p~l~~l~~~l~~s~~g~~-~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi 463 (516) ..=...|..... ..++..++..++++++++..-..... ..++++.|++-...++.|.|++. .++|+| T Consensus 349 ~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~~----------~~~gii 418 (474) T protein:vir:96 349 IALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQIG----------AQSQYL 418 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHHH----------HHcCCC Confidence 532222222222 34556788999999998865432233 24789999999888888887753 235899 Q ss_pred CHHHHHHHHHhhhccCCCCCChhhhccccccchhc-C-CCCCCCCC---CCCCCCCCC Q lcl|NC_019527. 464 DPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDD-G-ADPYMPDP---DVLPGEEGS 516 (516) Q Consensus 464 ~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e-~-~~~~~~~~---~~~~~~e~t 516 (516) |.+.++..+... ...+...+....|..+.. . .......+ .+..+.+++ T Consensus 419 S~et~~~~lp~v-----~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:96 419 SKETLVRHHPWV-----DDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENN 471 (474) T ss_pred ChHHHHHhCCCC-----CCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCcc Confidence 988887765210 000111111111100000 0 00000011 111111111 No 141 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=99.30 E-value=1.4e-11 Score=80.04 Aligned_cols=408 Identities=9% Similarity=0.013 Sum_probs=197.1 Q ss_pred HhhcCCCccccccCCCCCCCccCCCcc--chhccccc-------------cc--chhhhcccccCCcccccccCcc---- Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTT--PAVAMDSL-------------CG--PTYQFLNSAAGGLYAADIQPFP---- 100 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~--~~~a~ds~-------------~~--~~~~~~~~~~~~~~~~~~~~f~---- 100 (516) |..+-+. |- -.|.+.- ..+--|+. .. ........++.|-+.-....+- T Consensus 1 ~~~~~~~-------~~----~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~ 69 (474) T protein:vir:95 1 MINIIRM-------PW----DKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLH 69 (474) T ss_pred CcccccC-------CC----CCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhc Confidence 2222221 10 0111110 00100110 00 0001111122221100000000 Q ss_pred --cHHHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeE Q lcl|NC_019527. 101 --GYQNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQ 177 (516) Q Consensus 101 --gy~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~ 177 (516) .+...+-.+ .+.+++.||+..+.-++.+++++++.++... +.|+..++ =+....+.++.+....||.|+ T Consensus 70 ~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~-------~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~ 141 (474) T protein:vir:95 70 GNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKVL-------DVIHQVLD-TRWDNKLIDILTAASNKGIDW 141 (474) T ss_pred ccccccccccccccchHHHHHHhhhhhhcccCceeccCChHHH-------HHHHHHHh-ccHHHHHHHHHHHHhhCCeEE Confidence 000001111 3588899999999999999999987654321 23333332 368888999999999999999 Q ss_pred EEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEeee---Eec-cceEEEecCC Q lcl|NC_019527. 178 ISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGR---EMH-ASRLLTIITR 253 (516) Q Consensus 178 i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~---~iH-~SRli~~~~~ 253 (516) +++.++... .+ .+.+++|..+.|..- ..+...+.++ -.+|.+.+. .|| +.++.+|... T Consensus 142 ~~~~~d~~~---------------~~-~i~~~~p~~~~~v~d-~~~~~~~~a~-ir~~~~~~~~~~~vy~~~~i~~~~~~ 203 (474) T protein:vir:95 142 LQVYINEDG---------------EL-KLFRVPAEQAIPIWT-DKEREQLNAF-IRIFTFNGETKVEYWTAETVTYYVYE 203 (474) T ss_pred EEeeeCCCC---------------ce-EEEEEcccceEEEEc-CCCCCceEEE-EEEEeecCeeEEEEEeCCeEEEEEEc Confidence 988774321 11 144556655554321 1111111111 122222221 122 2333333210 Q ss_pred -------------------------cchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhh Q lcl|NC_019527. 254 -------------------------PLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVL 308 (516) Q Consensus 254 -------------------------~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l 308 (516) .+|-. .-.++-.|.|.++.+.+.+.+++.+....+.-+..++..++...... T Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv-~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~-- 280 (474) T protein:vir:95 204 NGGLIPDFYYGDEHIQTHFSTGSWERVPFI-AFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYE-- 280 (474) T ss_pred CCceeeccccccccccCcccccCCCccceE-EecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCC-- Confidence 01111 11234468999999999999999988888887777776665432111 Q ss_pred cCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccc Q lcl|NC_019527. 309 NGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASS 386 (516) Q Consensus 309 ~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatg 386 (516) ..+.......+ + ..+++.+++ +.+++.+ +.+.+++...++.+.++|...+++|-.-.-+ -+| |.|| T Consensus 281 -~~~~~~~~~~~------~-~~~~i~~~~-~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~--~~~-n~Sg 348 (474) T protein:vir:95 281 -GEDLSEFMEGL------K-YYKAINVSS-DGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDK--FGS-ATSG 348 (474) T ss_pred -cccccchhhhh------h-ccceeeccC-CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccc--ccc-ccHH Confidence 11111111111 1 122333333 3445544 5667789999999999999999999643322 122 4566 Q ss_pred hHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhCCCc-CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCC Q lcl|NC_019527. 387 EGEIRSFYDDIS--SVQQSYYFSPLDTMLKVIQLSKWGEI-DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVI 463 (516) Q Consensus 387 e~D~~~yyd~I~--~~Qe~~l~p~l~~l~~~l~~s~~g~~-~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi 463 (516) ..=...|..... ..++..++..++++++++..-..... ..++++.|++-...++.|.|++. .++|+| T Consensus 349 ~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~~----------~~~gii 418 (474) T protein:vir:95 349 IALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQIG----------AQSQYL 418 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHHH----------HHcCCC Confidence 532222222222 34556788999999998865432233 24789999999888888887753 235899 Q ss_pred CHHHHHHHHHhhhccCCCCCChhhhccccccchhc-C-CCCCCCCC---CCCCCCCCC Q lcl|NC_019527. 464 DPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDD-G-ADPYMPDP---DVLPGEEGS 516 (516) Q Consensus 464 ~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e-~-~~~~~~~~---~~~~~~e~t 516 (516) |.+.++..+... ...+...+....|..+.. . .......+ .+..+.+++ T Consensus 419 S~et~~~~lp~v-----~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:95 419 SKETLVRHHPWV-----DDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENN 471 (474) T ss_pred ChHHHHHhCCCC-----CCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCcc Confidence 988887765210 000111111111100000 0 00000011 111111111 No 142 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=99.29 E-value=4e-11 Score=77.59 Aligned_cols=433 Identities=13% Similarity=0.095 Sum_probs=200.9 Q ss_pred CChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhccccc--ccchhhhcccccCCcccccccCcccHHHHHH Q lcl|NC_019527. 30 RKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSL--CGPTYQFLNSAAGGLYAADIQPFPGYQNLAA 107 (516) Q Consensus 30 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~--~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~ 107 (516) ....+.+.+...+......... . + .+-. ..........++.|.+.--......- ..+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~-----------------~-~-i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~-~~~~ 60 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPNIEAIN-----------------Y-A-IRELQNRKKRLDKLSDYYNGKQEIEKHEFDNA-TVEA 60 (499) T ss_pred CccchhhhHHhhhhcCCHHHHH-----------------H-H-HHHHHHHHHHHHHHHHHhccccchhcCCcCcC-CCCc Confidence 2222222222221111000000 0 0 0000 00000111111111110000000000 0011 Q ss_pred HH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCC Q lcl|NC_019527. 108 LA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGAD 186 (516) Q Consensus 108 y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~ 186 (516) .+ .+.+++.||+..+.-++.+.+.+++.++.. .+.|...+++-++...+.++.+...+||.|++++.++... T Consensus 61 ~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~~~~~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g 133 (499) T protein:vir:10 61 ANVMVNHAKYITDMNVGFMTGNPVKYVAEKGKN-------IDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTD 133 (499) T ss_pred ceeecchHHHHHHHHhhhhcccCceeecCChhH-------HHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecccc Confidence 11 357899999999999999999998765432 3457777788889999999999999999999988775432 Q ss_pred --cccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe---------eeEec-cceEEEecCC- Q lcl|NC_019527. 187 --VSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL---------GREMH-ASRLLTIITR- 253 (516) Q Consensus 187 --~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~---------g~~iH-~SRli~~~~~- 253 (516) +........ .+....-..+.+++|..+.|..-+..+ ..+.++ -.+|.+. -..|| +.++.+|... T Consensus 134 ~~~~~~~~~~~-~~~~~~~~~~~~v~p~~~~~v~~d~~~-~~~~~~-i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~ 210 (499) T protein:vir:10 134 PISVRDELGNE-KLTPNTELKIEVIDPRATVVVCDDTVE-HDPLFA-VFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKT 210 (499) T ss_pred ccccccccccc-ccccccceEEEEEcccceEEEecCCCC-cceEEE-EEEEEEeecCCCceEEEEEEEeCCeEEEEEecC Confidence 111110010 111112223677888777654221111 001111 0111110 01111 2333333211 Q ss_pred -------------------cchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHH Q lcl|NC_019527. 254 -------------------PLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGG 314 (516) Q Consensus 254 -------------------~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~ 314 (516) .+|-. .-.++.+|.|.++.+.+.+.+++.+....+..+..++..++....... .. +. T Consensus 211 ~~~~~~~~~~~~~~~~~~g~vPvv-~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~-~~-~~- 286 (499) T protein:vir:10 211 TMEVSANDPIVYDGENLFGAVPII-EFRNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGL-GD-DK- 286 (499) T ss_pred CccccCcceecccccCCCCccceE-EecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcc-cc-cc- Confidence 01111 112346789999999999999999888888877777766655432111 11 11 Q ss_pred HHHHHHHHHHHhcCCcceEEEe-cCCcceeEEe--cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHH Q lcl|NC_019527. 315 DVFDRVEMYVNMQSNLGLAVMD-FDSEDIVQVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIR 391 (516) Q Consensus 315 ~l~~r~~~~~~~~sn~g~~~id-~~~e~~e~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~ 391 (516) .....+. ..++.+++ .++.+++.++ .+.+++...++.+.+.|...+++|..- ++ +- +-|.||..=.. T Consensus 287 ~~~~~~~-------~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~-~~-~~-~gn~Sg~Al~~ 356 (499) T protein:vir:10 287 DDIQRLK-------RGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMN-DE-KF-MGNVSGEAMKF 356 (499) T ss_pred chhhhhh-------hcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCC-ch-hh-cccchHHHHHH Confidence 1111111 11122222 2234455554 456788899999999999999999532 22 11 12445653222 Q ss_pred HHHHHH--HHHHHHHHHHHHHHHHHHHHHHh--CCCc--CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCH Q lcl|NC_019527. 392 SFYDDI--SSVQQSYYFSPLDTMLKVIQLSK--WGEI--DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDP 465 (516) Q Consensus 392 ~yyd~I--~~~Qe~~l~p~l~~l~~~l~~s~--~g~~--~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~ 465 (516) .|.... .+..+..+++.+++++++++.-. -|.- ..++++.|++-...++.+.|++..+. .|+||. T Consensus 357 ~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl---------~g~iS~ 427 (499) T protein:vir:10 357 KLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKGANDDASGCKISLVANIPSNLSDVVNNVKNA---------DGIIPR 427 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHHH---------hccCCh Confidence 232222 23345678888999888877532 1221 23789999999999999988877653 356666 Q ss_pred HHHHHHHHhh--hccCCCCCChhh--------hccccccchhcCCCCCCCCCCCCCCCCC-C Q lcl|NC_019527. 466 SEARQQLSDD--PDSGWDNIDGDL--------EIVQPEMFDDDGADPYMPDPDVLPGEEG-S 516 (516) Q Consensus 466 ~e~r~~l~~~--~~~~~~~~d~~~--------e~~~~e~~~~e~~~~~~~~~~~~~~~e~-t 516 (516) +.+.+.+... ++.-...|.++. +....+.++..+ +....+....++.++ | T Consensus 428 et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 488 (499) T protein:vir:10 428 KYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQEALRGQDPDRLE-LEDKQDDSSENDKEAGS 488 (499) T ss_pred HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCC-CCCCCcccCCCCCCCcc Confidence 5555443110 000000000000 000000000000 000000001111111 1 No 143 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=99.29 E-value=3.4e-11 Score=78.01 Aligned_cols=427 Identities=7% Similarity=-0.005 Sum_probs=196.6 Q ss_pred hhhHHHHhHHhhcCC-Cccccc--cCCCC--CCCccCCCccchhccc-------cccc--chhhhcccccCCcccccccC Q lcl|NC_019527. 33 AMRRAVMKSMERRAS-DAATKW--APPQL--MPGVVPAGTTPAVAMD-------SLCG--PTYQFLNSAAGGLYAADIQP 98 (516) Q Consensus 33 ~~~~~~~~~~~~~~~-~~~~~~--~~~~~--~~gv~~~~~~~~~a~d-------s~~~--~~~~~~~~~~~~~~~~~~~~ 98 (516) +..-+++....+..- ..-.+| -|-.+ ..+++--......+-+ .... ........++.|-+.--... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~ 80 (492) T protein:vir:94 1 MQFIQLISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEP 80 (492) T ss_pred ChHHHHHHHHHHHHhcCCceeecCccchhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 333344333221111 111111 11000 0000000000000000 0000 01111112222211000000 Q ss_pred c------ccHHHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcc Q lcl|NC_019527. 99 F------PGYQNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDC 171 (516) Q Consensus 99 f------~gy~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~r 171 (516) . ..+....-.+ .+.+++.||+..+.-++.+++++++.++.. .+.|+..++ =++...+.++.+... T Consensus 81 ~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~~~~d~~~-------~~~l~~~~~-n~~~~~~~~~~~~a~ 152 (492) T protein:vir:94 81 KPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEV-------VKRIDEVLG-NRFDDKLHSVLTGAS 152 (492) T ss_pred ccccccccccccccccccccchHHHHHHHHHhhhcccCceeccCchHH-------HHHHHHHHh-ccHHHHHHHHHHHHh Confidence 0 0011111112 368899999999999999999998765432 233443333 267888899999999 Q ss_pred cceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee---eEe-ccceE Q lcl|NC_019527. 172 FFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG---REM-HASRL 247 (516) Q Consensus 172 lyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g---~~i-H~SRl 247 (516) .||.|++++..+... .+ .+.+++|..+.|..-. .....+.++ -.+|.+.. ..+ .+.++ T Consensus 153 ~~G~a~~~v~~d~dg---------------~~-~~~~~~p~~~~~v~d~-~~~~~~~a~-ir~~~~~~~~~~~~y~~~~v 214 (492) T protein:vir:94 153 NKGIEWLHPYLDEEG---------------EF-KLFRVPAEQGIPIWTD-KEHEELEAF-IRMYKLENETKVEYWDKVTV 214 (492) T ss_pred hCCeEEEEEEecCCC---------------ce-EEEEEcccceEEEEcC-CCCCceEEE-EEEEeeccceeEEEEecCeE Confidence 999999888764321 11 1444555544432100 000001110 00111100 000 11111 Q ss_pred EEec-------------------------CCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeee Q lcl|NC_019527. 248 LTII-------------------------TRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKT 302 (516) Q Consensus 248 i~~~-------------------------~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~ 302 (516) .+|. -..+|-.. -.++-+|.|.++.+.+.+.+++.+....+..+..++..++.. T Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~-~~nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~ 293 (492) T protein:vir:94 215 NYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIP-FKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL 293 (492) T ss_pred EEEEEecCeeeeccccccccccccccccCCCccceEE-ecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeee Confidence 1110 00111111 122457999999999999999999888888887777776654 Q ss_pred cchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeE--EecccCCHHHHHHHHHHHHHhhhcCCceeeeccccc Q lcl|NC_019527. 303 NMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQ--VNTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPS 380 (516) Q Consensus 303 ~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~--~~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~ 380 (516) .... ..+.....+.+. ..++..++. +.+.+. .+.+.+++...++.+.+.|...+++|-.-. + +-+ T Consensus 294 ~g~~---~~~~~~~~~~~~-------~~~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~-~~~ 360 (492) T protein:vir:94 294 KNYD---DQELPEFKRLLR-------YYGAIKVSD-NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSS-D-KFG 360 (492) T ss_pred ecCC---cccchhhHHHHh-------hccceecCC-CCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCc-c-ccc Confidence 3211 111112222111 122233333 344444 456677889999999999999999996322 2 222 Q ss_pred cccccchHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcC--CcceEEeCCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_019527. 381 GLNASSEGEIRSF---YDDISSVQQSYYFSPLDTMLKVIQLSKWGEID--DAITFKFKSLWQTSAKEESEIRFNKAQEAQ 455 (516) Q Consensus 381 Glnatge~D~~~y---yd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~--~d~~~~f~pL~~~sekEkAei~~~~a~a~~ 455 (516) | |.||+.=...| ...+ ..++..++..+++++++++. .+|... .++.+.|++-...++++.+++..+ T Consensus 361 ~-n~Sg~Al~~~~~~l~~k~-~~k~~~f~~~l~~~~~li~~-~~~~~~~~~~i~v~f~~~~p~~~~e~~~~~~k------ 431 (492) T protein:vir:94 361 S-APSGVALEFLYTNLNLKA-DKLARKAKVAIQELLWFVFE-HFDIKGEHKDVDISFNYNKVANTELQVQTAQQ------ 431 (492) T ss_pred c-CchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH-HhcCCcccceeeEEecCCCCCCHHHHHHHHHH------ Confidence 2 45665322222 2233 34456678899999988765 344332 478999999999999998776554 Q ss_pred HHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchh--cCCCCCCCCCCCCCC----CCCC Q lcl|NC_019527. 456 IYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDD--DGADPYMPDPDVLPG----EEGS 516 (516) Q Consensus 456 ~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~--e~~~~~~~~~~~~~~----~e~t 516 (516) + .|++|.+.+.+.+... +..+.+.+....|..+. ...+.....++..++ ++++ T Consensus 432 -l--~giiS~et~~~~l~~v-----~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 490 (492) T protein:vir:94 432 -S--MGIVSHETVLENHPFV-----EDLQAELERIEQEQMEYNKQLPNLDDGGADSAQQQERSNNKE 490 (492) T ss_pred -H--hccCchHHHHHhCCCC-----CCHHHHHHHHHHHHHHHHhhccccccccCCCCccccCCcccc Confidence 2 3788877776665110 11111111111110000 000000111111111 1111 No 144 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=99.27 E-value=4.4e-11 Score=77.39 Aligned_cols=414 Identities=7% Similarity=-0.018 Sum_probs=191.0 Q ss_pred HhhcCC-CccccccCCCCCCCccCCCc-cchhcccccc--c---------------chhhhcccccCCcccccccCc--- Q lcl|NC_019527. 42 MERRAS-DAATKWAPPQLMPGVVPAGT-TPAVAMDSLC--G---------------PTYQFLNSAAGGLYAADIQPF--- 99 (516) Q Consensus 42 ~~~~~~-~~~~~~~~~~~~~gv~~~~~-~~~~a~ds~~--~---------------~~~~~~~~~~~~~~~~~~~~f--- 99 (516) |.+..- ..-.+|.-- -..... ...+-+|.-. . ........++.|-+.--.... T Consensus 1 ~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~ 75 (483) T protein:vir:12 1 MAQALIKGGNILYPSQ-----PTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVD 75 (483) T ss_pred CccchhcCCceeecCc-----chhhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccc Confidence 111110 111111100 000000 0001111000 0 000000011111100000000 Q ss_pred ccH---HHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhccccee Q lcl|NC_019527. 100 PGY---QNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGR 175 (516) Q Consensus 100 ~gy---~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~ 175 (516) .++ ...+..+ .+.+++.||+..+.-++.+++++++.++.. .+.|+..++. +....+.++.+....||. T Consensus 76 ~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~~~d~~~-------~~~l~~~~~n-~~~~~~~~~~~~~~~~G~ 147 (483) T protein:vir:12 76 ATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEV-------VKRIDEVLGN-RFDDKLHSVLTGASNKGI 147 (483) T ss_pred ccccccccccccccccchHHHHHHHHhhhhcccCceeccCChHH-------HHHHHHHHhc-cHHHHHHHHHHHHhhCCe Confidence 000 0000011 268899999999999999999998765432 2334443332 678888899999999999 Q ss_pred eEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee---eEe-ccceEEEec Q lcl|NC_019527. 176 GQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG---REM-HASRLLTII 251 (516) Q Consensus 176 a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g---~~i-H~SRli~~~ 251 (516) |++++.++... .+ .+.+++|..+.|..-+. ....+.++ -.+|.... ..+ .+.++.++. T Consensus 148 ~y~~v~~d~d~---------------~~-~i~~~~p~~~~~v~d~~-~~~~~~~~-ir~~~~~~~~~~~~y~~~~v~~~~ 209 (483) T protein:vir:12 148 EWLHPYLDEEG---------------EF-KLFRVPAEQGIPIWTDK-EHEELEAF-IRMYKLENETKVEYWDKVTVNYYV 209 (483) T ss_pred EEEEEEEcCCC---------------ce-EEEEEcccceEEEEcCC-CCCceEEE-EEEEEeecceEEEEEecCeEEEEE Confidence 99888765321 11 14445555444321100 00000110 00111110 111 111222211 Q ss_pred --C-----------------------CcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchh Q lcl|NC_019527. 252 --T-----------------------RPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQ 306 (516) Q Consensus 252 --~-----------------------~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~ 306 (516) + ..+|-.. -.++-+|.|.++.+.+.+.+++.+....+.-+..++..++...... T Consensus 210 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~-~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~ 288 (483) T protein:vir:12 210 YENGSLIPDYSNNLENSKTHFSTGSWGKIPFIP-FKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD 288 (483) T ss_pred EeCCeeeecccccccccccccccCCCCccceEE-ecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC Confidence 0 0111111 1124579999999999999999988888887777776665542211 Q ss_pred hhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeE--EecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccc Q lcl|NC_019527. 307 VLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQ--VNTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNA 384 (516) Q Consensus 307 ~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~--~~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glna 384 (516) ..+.......++ ..+++.++. +.+++. ...+.+++...++.+.+.|...+++|-.-+ + .-+| |. T Consensus 289 ---~~~~~~~~~~~~-------~~~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~-~~~~-n~ 354 (483) T protein:vir:12 289 ---DQELPEFKRLLR-------YYGAIKVSD-NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSS-D-KFGS-AP 354 (483) T ss_pred ---cccchhHHHhhh-------hccccccCC-CCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCc-c-cccc-Cc Confidence 111112222121 122232333 244444 456677889999999999999999996422 2 2222 55 Q ss_pred cchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhCCCcC--CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019527. 385 SSEGEIRSFYDDIS--SVQQSYYFSPLDTMLKVIQLSKWGEID--DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITN 460 (516) Q Consensus 385 tge~D~~~yyd~I~--~~Qe~~l~p~l~~l~~~l~~s~~g~~~--~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~ 460 (516) ||+.=...|...+. ..++..++..+++++++++. .+|... .++++.|++-...+.++.|++..+ + . T Consensus 355 Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~-~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k-------l--~ 424 (483) T protein:vir:12 355 SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE-HFDIKGEHKDVDISFNYNKVANTELQVQTAQQ-------S--M 424 (483) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhcCCCccceeeEEeCCCCCCCHHHHHHHHHH-------H--h Confidence 66642222322222 44456788999999988765 344332 478999999999999998876554 2 4 Q ss_pred CCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhc--CCCCCCCCCCC-----CCCCCCC Q lcl|NC_019527. 461 SVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDD--GADPYMPDPDV-----LPGEEGS 516 (516) Q Consensus 461 gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e--~~~~~~~~~~~-----~~~~e~t 516 (516) |+||.+.+.+.+... ...+.+.+....|..+.. ..+.....++. .++.+.+ T Consensus 425 GiiS~et~~~~~~~v-----~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~e~ 482 (483) T protein:vir:12 425 GIVSHETVLENHPFV-----EDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNNKES 482 (483) T ss_pred ccCchHHHHHhCCCC-----CCHHHHHHHHHHHHHHHHhhcccccccccCCcccCCCCCcccC Confidence 778877776654210 000111111111100000 00011111111 1111111 No 145 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.27 E-value=1.4e-10 Score=74.61 Aligned_cols=419 Identities=13% Similarity=0.098 Sum_probs=181.7 Q ss_pred cccccccccCCCcCCCCCChhhhHHHHhHHhhcCCC--ccccccCCCCCCCccCCCccchhcccccccchhhhcccccCC Q lcl|NC_019527. 13 VADKLADAARAEEQEKARKLAMRRAVMKSMERRASD--AATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGG 90 (516) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~ 90 (516) ++... +.............+.+....+..+ -...|+-.. + .+ T Consensus 1 ~~~~~------~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~----------~-~i------------------- 44 (484) T protein:vir:77 1 MTSPL------QKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESE----------R-RP------------------- 44 (484) T ss_pred CCCcc------cccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------c-cc------------------- Confidence 22211 1111111112222233322222111 011121110 0 00 Q ss_pred cccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc Q lcl|NC_019527. 91 LYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD 170 (516) Q Consensus 91 ~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~ 170 (516) .........++-.....+.++++||+..++-+.=+|+.+...++ . -+.+...+++-++.....++.+.. T Consensus 45 ---~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~~--~------~~~l~~i~~~N~~d~~~~~~~~~a 113 (484) T protein:vir:77 45 ---DAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQELEGFRLGGADK--A------DEQLWDWWQANDLDIESTLGHTDS 113 (484) T ss_pred ---hhcccccchhHHhhhhhcCcHHHHHHHHHhhhccCceecCCcch--h------HHHHHHHHHhcCHhHHHHHHHHHH Confidence 00000111222233345678899999999988778887643211 1 244677788888899999999999 Q ss_pred ccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccc-cccc-------cccccCcce-------- Q lcl|NC_019527. 171 CFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNA-LDPT-------APDFYKPST-------- 234 (516) Q Consensus 171 rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~-~dp~-------s~~yg~P~~-------- 234 (516) .+||.|++++..+... ..+.. .. ..-.|++++|.++.+..-.. .++. ..+.++... T Consensus 114 ~~~G~a~~~v~~~~~~-~~~~~-~~------~~~~i~~~~p~~~~~~~D~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~ 185 (484) T protein:vir:77 114 LVHGRSYITISKPDPN-IDPGV-DP------EVPIIRVEPPTNLYAQIDPRTRQVMRAIRAIEDEEGNEVIGATLYLPNN 185 (484) T ss_pred hhcCceEEEEecCCCC-ccccc-cc------ccceEEEeccceeEEEecCCCCceEEEEEEEEeecCCcEEEEEEEecCe Confidence 9999999887653321 11000 00 00013333333333211000 0000 001111111 Q ss_pred -eEE---ee------eEeccc-eE--EEecCCcchhhhhhccCCCCchHHHH-HHHHHHHHHHHHHHHHHHHHHhCCcee Q lcl|NC_019527. 235 -WWV---LG------REMHAS-RL--LTIITRPLPDMLKPAYNFSGISMSQL-AQPYVENWLRTRQSVSDLVDKFSRTFL 300 (516) Q Consensus 235 -y~v---~g------~~iH~S-Rl--i~~~~~~~p~~~k~~~~~~G~S~le~-~~~~l~~~~~~~~~~~~Ll~~~~~~v~ 300 (516) |++ .| ..=|+- +| +.|.++ ......+|.|.++. +...+.+++++.......+..++.... T Consensus 186 ~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~------~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~ 259 (484) T protein:vir:77 186 TVIWNREDGQWVQVANVAHNLEMVPVIPIPNR------TRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQR 259 (484) T ss_pred EEEEEecCCceEeeccccCCCCCcceEEeccc------cccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHH Confidence 111 01 001221 11 122211 12234589998874 666667777776654444443433222 Q ss_pred e---ecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEe-cccCCHHHHHHHHHHHHHhhhcCCceeeec Q lcl|NC_019527. 301 K---TNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVN-TPLSGLADLQSQSQEHMCSVSKIPAIKLTG 376 (516) Q Consensus 301 k---~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~-~~lsgl~d~~~~~~~~iaaas~IP~t~L~G 376 (516) . .+... +....+. ....+... ...++++.+++-++.+++ .++.+..+.+.....+||+++++|..-|-| T Consensus 260 ~i~G~~~~~-~~~~~~~----~~~~~~~~--~~~~~~~~~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~ 332 (484) T protein:vir:77 260 LLFGVKGEE-LGVDPET----GQTLFDAY--LARILAFEDHESKAQQFSAAELRNFVDALDALDRKAAAYTGLPPYYLSF 332 (484) T ss_pred HHhCCCcch-hcccccc----cchhhhhh--hhhhcccCCCCceeEeecCCChHHHHHHHHHHHHHHhcccCCCHHHhcc Confidence 1 11111 1111100 01111111 112233433333444443 334556677888889999999999877744 Q ss_pred cccccccccchHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhCCC-cC---CcceEEeCCCCCCCHHHHHHHHHH Q lcl|NC_019527. 377 ISPSGLNASSEGEIRSFY---DDISSVQQSYYFSPLDTMLKVIQLSKWGE-ID---DAITFKFKSLWQTSAKEESEIRFN 449 (516) Q Consensus 377 ~sp~Glnatge~D~~~yy---d~I~~~Qe~~l~p~l~~l~~~l~~s~~g~-~~---~d~~~~f~pL~~~sekEkAei~~~ 449 (516) +..+ ++||+.=...+. ..++.+| ..+.+.+++++.++.....+. .+ .++++.|.+....|.++.|+...| T Consensus 333 -~~~n-~~Sg~Al~~~~~~l~~ka~~k~-~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~k 409 (484) T protein:vir:77 333 -SSEN-PASAEAIRSSESRLVKTVERKN-KIFGGAWEQAMRVAYKVMNGGDIPPEYYRMESIWRDPSTPTYAAKADAATK 409 (484) T ss_pred -ccCc-chHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCCCCcccccccceEEecCCCCCCHHHHHHHHHH Confidence 3222 256654333333 3344444 456788898888877654332 22 257899999999999988766554 Q ss_pred HHHHHHHHHHcC--CCCHHHHHHHHHhhhccCCCCCChh-hhccccc--------------c-----chhcCCCCCCCCC Q lcl|NC_019527. 450 KAQEAQIYITNS--VIDPSEARQQLSDDPDSGWDNIDGD-LEIVQPE--------------M-----FDDDGADPYMPDP 507 (516) Q Consensus 450 ~a~a~~~~~~~g--vi~~~e~r~~l~~~~~~~~~~~d~~-~e~~~~e--------------~-----~~~e~~~~~~~~~ 507 (516) ++++| +++.+.+++.| +|..-+.+ .+...++ . .+.+..+++.+.+ T Consensus 410 -------l~~~g~gi~s~et~~~~l------~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (484) T protein:vir:77 410 -------LYNNGQGVIPKERARIDM------GYSITEREEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDNPETPEPQP 476 (484) T ss_pred -------HHhccCCCCCHHHHHhcC------CCChhHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCCCCcccccC Confidence 44443 55555554443 11111000 0000000 0 0000011111111 Q ss_pred CCCCCCCC Q lcl|NC_019527. 508 DVLPGEEG 515 (516) Q Consensus 508 ~~~~~~e~ 515 (516) +...+..+ T Consensus 477 ~~~~~~~~ 484 (484) T protein:vir:77 477 NPAEEAAA 484 (484) T ss_pred CCccccCC Confidence 11111111 No 146 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=99.26 E-value=4e-11 Score=77.61 Aligned_cols=432 Identities=13% Similarity=0.069 Sum_probs=204.7 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) ||-..+..|+|.... . ...+++..... ++.++++.-.-.. T Consensus 3 ~~~~~k~~~~~~~~~-~-------------------~~~~~~~~~~~--------------------~~~i~~~~~~~~r 42 (508) T protein:vir:15 3 LIQRIKDLFWKGAAA-T-------------------GVTGSLSKITD--------------------DPRISIDPDEYVR 42 (508) T ss_pred hHHHHHHHHHHHHHH-h-------------------ccccchHHhhc--------------------ccccccCHHHHHH Confidence 333322221111100 0 00001111111 1223333211111 Q ss_pred hhhcc-cccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcCh Q lcl|NC_019527. 81 YQFLN-SAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGV 159 (516) Q Consensus 81 ~~~~~-~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~ 159 (516) ..... -+.+..++..++...|-+.-..+..-.+++.||+..|+-++-+..+|+..+++.. -+.|++.++.-++ T Consensus 43 i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i~~~~A~lv~~e~~~i~v~~~~~~------~e~l~~il~~n~f 116 (508) T protein:vir:15 43 IQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTAARRIASVVFNEKAEIHVKDNNEA------DKFLNDVLEDNDF 116 (508) T ss_pred HHHHHHHhcCCCcccccccCCCCccccceeecchHHHHHHHHHhhhhCCCceEEeCCchHH------HHHHHHHHHhccH Confidence 11111 1222222222222222222222334588999999999999999988886543221 1346777888889 Q ss_pred hHHHHHHHHhcccceeeEEEEEecCCCc-------ccCcccccccccccceeeEEeecceeecccccccccccccccc-C Q lcl|NC_019527. 160 MGIIQKAAEHDCFFGRGQISINIKGADV-------SVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFY-K 231 (516) Q Consensus 160 ~~~l~ea~~~~rlyG~a~i~i~i~~~~~-------~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg-~ 231 (516) +..+++++..+..+|++++-+.++++.. +.-+++. ...+.+..+..+.+...... ..-.|+ . T Consensus 117 ~~~~~~~~e~a~a~G~~~~k~~~d~~~~~i~~v~ad~~~P~~---~d~~~~~~~af~~~~~~~~~-------~~~~~yt~ 186 (508) T protein:vir:15 117 KNKFEEALEKGVALGGFAMRPYIDGNHIKIAWVRADQFYPLQ---SNTNDISEAAIASRTQRTES-------NQTKYYTL 186 (508) T ss_pred HHHHHHHHHHHhhcCceEEEEEEeCCeeEEEEEcCCeeEEEE---EcCCCeEEEEEEEEEEeecC-------CCceEEEE Confidence 9999999999999999998877765421 1111111 11122322322222211000 000000 0 Q ss_pred c--------ceeEEe------------eeEeccce---------EEEecCCcchh--hhh-------hccCCCCchHHHH Q lcl|NC_019527. 232 P--------STWWVL------------GREMHASR---------LLTIITRPLPD--MLK-------PAYNFSGISMSQL 273 (516) Q Consensus 232 P--------~~y~v~------------g~~iH~SR---------li~~~~~~~p~--~~k-------~~~~~~G~S~le~ 273 (516) - ..|+|. |..|.-+. ...+.|-+-|. +.+ ..+..+|+|++.. T Consensus 187 lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~ 266 (508) T protein:vir:15 187 LEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDN 266 (508) T ss_pred EEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCcceEecCCCcceeEEecCCccccccCCCCcCCchHhh Confidence 0 011111 11111010 11122211111 000 1134679999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceE-EEe---cCCcceeEEeccc Q lcl|NC_019527. 274 AQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLA-VMD---FDSEDIVQVNTPL 349 (516) Q Consensus 274 ~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~-~id---~~~e~~e~~~~~l 349 (516) +.+.+..++.+......-+......++--. .++..+.. .. . .+. .+.... .+. ..+..+++++.++ T Consensus 267 ~~~lid~lD~~~s~~~~e~~~~~~~i~v~~--~~l~~d~~-~~-~---~~~---~~~~~~~~~~~~~~~~~~i~~~~~~i 336 (508) T protein:vir:15 267 AKHVLDDINDTHDQFIWEIRLGQKHIAVQP--GMLRFDDE-HK-P---TFD---TEQNVYVGVLSDDNNGLGVKDMTTPI 336 (508) T ss_pred hHHHHHHHHHHHHHHHHHHHhcccceeech--HHhcCCCC-Cc-c---ccC---CCCeeEEeccCCCCCCCceeEeeccc Confidence 999999999988888776654444443321 11211111 00 0 011 111111 111 1123477666554 Q ss_pred --CCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---- Q lcl|NC_019527. 350 --SGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEG---EIRSFYDDISSVQQSYYFSPLDTMLKVIQLSK---- 420 (516) Q Consensus 350 --sgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~---D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~---- 420 (516) ....+.++.+.+.+...+|++.. -||...+|. .|+.. ..+.-|.++..+|. .++..|+.++..|+... T Consensus 337 r~e~~~~~~~~~l~~~~~~~gls~~-~f~~~~~~~-~TAtei~s~~~~~~~t~~~~~~-~~~~al~~lv~~il~l~~~~~ 413 (508) T protein:vir:15 337 RTVQYKDAIDHFIKEFEVQIGLSTG-TFSYSNDGV-KTATEVVSNNSMTYQTRSSYLT-MVEKAIDELCQSIFELANAGA 413 (508) T ss_pred ChHHHHHHHHHHHHHHHHHhCCCch-hcccccCcc-ccHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhc Confidence 34677788888888888888754 445554554 35543 24455677777664 57888888877765432 Q ss_pred ---CC---------CcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh- Q lcl|NC_019527. 421 ---WG---------EIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL- 487 (516) Q Consensus 421 ---~g---------~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~- 487 (516) .| ..+.+++|.|++-...+..++++ ...+++.+|+++.++++..+ .+++++. T Consensus 414 ~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~-------~~~~~v~aGi~s~e~~i~~~--------~g~~deea 478 (508) T protein:vir:15 414 LFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLE-------EDAKVLAIGALSKQTFLQRN--------YGMTDEQA 478 (508) T ss_pred cccccccccccccccCCcceEEEeCCCCCCCHHHHHH-------HHHHHHhcCCCCHHHHHHhc--------CCCChHHH Confidence 11 11346889999988888776544 34557889999998877542 2333221 Q ss_pred hcccccc-chhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 488 EIVQPEM-FDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 488 e~~~~e~-~~~e~~~~~~~~~~~~~~~e~t 516 (516) +..-++. .+....++.++..+...|.+|= T Consensus 479 ~~el~ri~~E~~~~~~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 479 AEELAKIQSEAPTDTFEGGRSAILNGGDGE 508 (508) T ss_pred HHHHHHHHHhccccCccccccccCCCCCCC Confidence 1111111 1111222333322222222222 No 147 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=99.26 E-value=5e-11 Score=77.05 Aligned_cols=401 Identities=15% Similarity=0.123 Sum_probs=197.5 Q ss_pred ccccCCCCCCCccCCCccchhcccc---c------ccchhhhcccccCCcccc--cccCcccHHHHHHHHhCchhhhhhh Q lcl|NC_019527. 51 TKWAPPQLMPGVVPAGTTPAVAMDS---L------CGPTYQFLNSAAGGLYAA--DIQPFPGYQNLAALATRPEYRAFAS 119 (516) Q Consensus 51 ~~~~~~~~~~gv~~~~~~~~~a~ds---~------~~~~~~~~~~~~~~~~~~--~~~~f~gy~ll~~y~~~~i~r~iVd 119 (516) -.|.||.+.. .|... -++.+- + ..........++.+-+.- ......+....++ .+.+++.||+ T Consensus 1 ~~~~~~~~~~--~~~~~--~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ki--~~n~~~~ivd 74 (452) T protein:vir:36 1 MKYKPPKLMT--FSKDE--PITVEVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPAKDSWKPDNRL--AVNFTKYIVD 74 (452) T ss_pred CcccCceeEE--cCCcc--CCCHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCccccccCcccee--ecchHHHHHH Confidence 3455543211 11111 111110 0 000111111222221100 0001111221111 3578999999 Q ss_pred hhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCccccccccc Q lcl|NC_019527. 120 TLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIK 199 (516) Q Consensus 120 ~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~ 199 (516) ..+.-++.+++.+++.++.. .+.|+..++.-++...+.++.+....||.|++++..+... T Consensus 75 ~~~~~l~g~~~~~~~~d~~~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g------------- 134 (452) T protein:vir:36 75 TFTGYFNGIPVKKSHSDKEI-------LTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDT------------- 134 (452) T ss_pred HHhhhhcccCceeecCChhH-------HHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCC------------- Confidence 99999999999998765432 2456777777789999999999999999999888764321 Q ss_pred ccceeeEEeecceeeccccccccccccccccCcceeEEe-e---eEe-ccceEEEecCC---------------cchhhh Q lcl|NC_019527. 200 KGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL-G---REM-HASRLLTIITR---------------PLPDML 259 (516) Q Consensus 200 ~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~-g---~~i-H~SRli~~~~~---------------~~p~~~ 259 (516) .+ .+.+++|..+.|..-... ...+-|+. .+|.-. + ..| -+.++.++... .+|-. T Consensus 135 --~~-~i~~~~p~~~~~v~d~~~-~~~~~~~i-~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~iPvv- 208 (452) T protein:vir:36 135 --QT-NVVYNSPENMFMVYDDTV-KQEPLFAV-RYGVDEDKKLQGEVYTLLETIKISGENDEISFGEGTYNPYPDLPVV- 208 (452) T ss_pred --ee-EEEEEcccceEEEEcCCC-CCceEEEE-EEEEecCceEEEEEEecCeEEEEEEcCCceEEecceeccCCcccEE- Confidence 11 133444444443211000 00111110 000000 0 000 01122211110 01111 Q ss_pred hhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCC Q lcl|NC_019527. 260 KPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDS 339 (516) Q Consensus 260 k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~ 339 (516) .-.++-.|.|.++.+.+.+.+++.+....+..+..++..++....... .. +. ...+ +. .++..+..++ T Consensus 209 ~~~n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~-~~---~~-~~~~------~~-~~~~~~~~~~ 276 (452) T protein:vir:36 209 EFYFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAV-EE---ED-LKNI------RS-NRVINYYADG 276 (452) T ss_pred EecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCc-Cc---hh-hhhh------hh-cceEEecCCC Confidence 112234699999999999999999999988888877777665432211 11 11 1111 11 2233332221 Q ss_pred ----cceeEEe--cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHH---HHHHHHHHHHHHHHHHH Q lcl|NC_019527. 340 ----EDIVQVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSF---YDDISSVQQSYYFSPLD 410 (516) Q Consensus 340 ----e~~e~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~y---yd~I~~~Qe~~l~p~l~ 410 (516) .+++.+. .+.+++...++.+.++|...+++|-. -++.. | |+||+.=...| ...+..+ +..++..++ T Consensus 277 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~-~~~~~--g-n~Sg~Al~~~~~~l~~k~~~~-~~~~~~~l~ 351 (452) T protein:vir:36 277 EGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANI-SDESF--G-SSSGVSLAYKLQAMSNLALSF-QRKFQSSLN 351 (452) T ss_pred CccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCcccc-Ccccc--c-CCcHHHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 2344443 55678888999999999999999963 33322 3 56776432323 2333333 455788888 Q ss_pred HHHHHHHHHh--CCCcC--CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChh Q lcl|NC_019527. 411 TMLKVIQLSK--WGEID--DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGD 486 (516) Q Consensus 411 ~l~~~l~~s~--~g~~~--~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~ 486 (516) +++++++.-. .|... .++++.|++-...++++.|++..+. +|+||.+.+.+.+.. .+..+.+ T Consensus 352 ~~~~li~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~a~~~~k~---------~g~iS~et~~~~~~~-----~~d~~~E 417 (452) T protein:vir:36 352 SRYKLFCELSTNVSNKDSWKDIEYTFTRNEPKDIKEQAETANIL---------MGITSQETALSVISV-----IPDVQAE 417 (452) T ss_pred HHHHHHHHHHhccCCccccccceEEeCCCCCcCHHHHHHHHHHH---------hccCChHHHHHhCCC-----CCCHHHH Confidence 8888766422 23222 4789999999999999988876543 477887777665411 0111111 Q ss_pred hhccccccch-------hcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 487 LEIVQPEMFD-------DDGADPYMPDPDVLPGEE 514 (516) Q Consensus 487 ~e~~~~e~~~-------~e~~~~~~~~~~~~~~~e 514 (516) .+....|..+ .+..++...+..+..+.| T Consensus 418 ~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~e 452 (452) T protein:vir:36 418 MEKIKKEEASTAIFDKDKQPSEKGTDTVVSETNEE 452 (452) T ss_pred HHHHHHHHHHHHHHHhhccCCCCcccccCccccCC Confidence 1111111100 001111111111111111 No 148 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=99.26 E-value=7.5e-12 Score=81.57 Aligned_cols=431 Identities=14% Similarity=0.074 Sum_probs=201.6 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) ||...|+.++|.. ..-..++......+ +.++++...-.. T Consensus 3 ~~~~~k~~~~~~~---------------------~~~~~~~~~~~~~~--------------------~~i~~~~~~~~~ 41 (500) T protein:vir:98 3 VIQKIKNLVTRSK---------------------YVMTTQSLTNITDH--------------------PKIAISKLEYDR 41 (500) T ss_pred hHHHHHHHHHHHH---------------------HHhhcchhhhhhcc--------------------ccccCCHHHHHH Confidence 6666554433110 00001111111111 112221111011 Q ss_pred hhhcccccC-CcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcCh Q lcl|NC_019527. 81 YQFLNSAAG-GLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGV 159 (516) Q Consensus 81 ~~~~~~~~~-~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~ 159 (516) .......+. .+++..+....+-..-.-+....+++.+|+..|+-++.+..+|+..++.. -+.|++.++.-++ T Consensus 42 i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~A~lv~~e~~~i~~~d~~~-------~~~l~~il~~n~f 114 (500) T protein:vir:98 42 ITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAAKKIASLVFNEQAEIKVDDDAA-------NEFISETLKNDRF 114 (500) T ss_pred HHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHHHHHhhhhcCCcceEecCChHH-------HHHHHHHHhhccH Confidence 111111111 11111111111111111233448899999999999999998888765322 2457777888889 Q ss_pred hHHHHHHHHhcccceeeEEEEEecCCCcc----c-----CcccccccccccceeeEEeecceeecccccccccccccccc Q lcl|NC_019527. 160 MGIIQKAAEHDCFFGRGQISINIKGADVS----V-----PLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFY 230 (516) Q Consensus 160 ~~~l~ea~~~~rlyG~a~i~i~i~~~~~~----~-----Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg 230 (516) +..+.+++..+..+|++++-+.+++..+. . |+..+..++ ....++-++. . . .+ ....|| T Consensus 115 ~~~~~~~~e~a~a~G~~~~k~~~d~~~~~I~~v~ad~~~P~~~d~~~~-----~~~a~~~~~~-~-~-~~----~~~~~y 182 (500) T protein:vir:98 115 NKNFERYLESCLALGGLAMRPYVDGDKVRVAFVQAPVFLPLQSNTQDV-----SSAAVVIKSV-K-T-IN----GKEVYY 182 (500) T ss_pred HHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeeEEEEEcCCCe-----EEEEEEEEEe-e-e-ec----CCceEE Confidence 99999999999999999888777664321 1 211111111 1111111000 0 0 00 000010 Q ss_pred -Ccc--------eeEEe------------eeEe-----c----cceEEEecCCc--------chhhhhhccCCCCchHHH Q lcl|NC_019527. 231 -KPS--------TWWVL------------GREM-----H----ASRLLTIITRP--------LPDMLKPAYNFSGISMSQ 272 (516) Q Consensus 231 -~P~--------~y~v~------------g~~i-----H----~SRli~~~~~~--------~p~~~k~~~~~~G~S~le 272 (516) +-+ .|+|. |..| + +.-.+.-..+| .+.. ......+|+|++. T Consensus 183 t~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~-~~~~sp~G~S~~~ 261 (500) T protein:vir:98 183 TLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNN-KDINSPLGLSIFD 261 (500) T ss_pred EEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEecCCcccc-ccCCCccCCchhh Confidence 000 11121 1111 1 11111111111 1110 0112457999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCceeee-cchhh-hcCccHHHHHHHHHHHHHhcCCcceEEEe---cCCcceeEEec Q lcl|NC_019527. 273 LAQPYVENWLRTRQSVSDLVDKFSRTFLKT-NMAQV-LNGGEGGDVFDRVEMYVNMQSNLGLAVMD---FDSEDIVQVNT 347 (516) Q Consensus 273 ~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~-~~~~~-l~~~~~~~l~~r~~~~~~~~sn~g~~~id---~~~e~~e~~~~ 347 (516) .+.+.+..++.+......-+......++-- .+... ....+++.+... .+.. +...-..++ +++..++.++. T Consensus 262 ~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~--~~d~--~~~~~~~~~~~~~~~~~i~~~~~ 337 (500) T protein:vir:98 262 NAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPRP--RFES--DQNVYIRMGGRDLDSSAIQDLTT 337 (500) T ss_pred hhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCCc--ccCC--CcceEEEcCCCCCcCcceeEecc Confidence 999999999999888887666544444321 11110 001111111000 0000 000001111 22345777665 Q ss_pred cc--CCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-- Q lcl|NC_019527. 348 PL--SGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEG---EIRSFYDDISSVQQSYYFSPLDTMLKVIQLSK-- 420 (516) Q Consensus 348 ~l--sgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~---D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~-- 420 (516) ++ ..+...++.+.+.++..+|++...| |...+|. .|+.+ ....-|.++..+|. .++..|+.|+..|+... T Consensus 338 ~ir~e~~~~~l~~~l~~i~~~~gls~~~~-~~~~~g~-~TAtei~s~~~~~~~t~~~~~~-~~~~al~~lv~~il~~~~~ 414 (500) T protein:vir:98 338 PIRADDYIKAINEGLSLFEMQIGVSAGLF-SFDGKSM-KTATEIVSENSDTYQMRNSIVA-LVEQSLKELVISIFEIAKA 414 (500) T ss_pred ccChHHHHHHHHHHHHHHHHHhCCCcccc-ccCcCcc-ccHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 54 3467778888889999999876544 4444454 34443 23455677777774 46888888888776431 Q ss_pred ---C-CCcC--CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh-hccccc Q lcl|NC_019527. 421 ---W-GEID--DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL-EIVQPE 493 (516) Q Consensus 421 ---~-g~~~--~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~-e~~~~e 493 (516) . |.++ .+++|.|++-...+..+.++. +.+++.+|+++..+++..+ .+.++++ +..-++ T Consensus 415 ~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~-------~~~~v~aGi~s~~~~i~~~--------~g~~eeea~~~l~~ 479 (500) T protein:vir:98 415 YDLYQSEVPSMDNISISLDDGVFTDRDAELDY-------WIKVVNAGFGTREMAIQKV--------LNVTEEKAQEIAAE 479 (500) T ss_pred HhhcCCCCCCCcceEEEeCCCCCCCHHHHHHH-------HHHHHHcCCCCHHHHHHhc--------CCCCHHHHHHHHHH Confidence 1 2333 368899998777777665443 4567899999999887543 2233221 111111 Q ss_pred cchhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 494 MFDDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 494 ~~~~e~~~~~~~~~~~~~~~e 514 (516) ..++...+...++++.-+-+| T Consensus 480 i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 480 INTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred HHHhccccCCCCCccccccCC Confidence 111111111112222222222 No 149 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=99.26 E-value=7.5e-12 Score=81.57 Aligned_cols=431 Identities=14% Similarity=0.074 Sum_probs=201.6 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) ||...|+.++|.. ..-..++......+ +.++++...-.. T Consensus 3 ~~~~~k~~~~~~~---------------------~~~~~~~~~~~~~~--------------------~~i~~~~~~~~~ 41 (500) T protein:vir:30 3 VIQKIKNLVTRSK---------------------YVMTTQSLTNITDH--------------------PKIAISKLEYDR 41 (500) T ss_pred hHHHHHHHHHHHH---------------------HHhhcchhhhhhcc--------------------ccccCCHHHHHH Confidence 6666554433110 00001111111111 112221111011 Q ss_pred hhhcccccC-CcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcCh Q lcl|NC_019527. 81 YQFLNSAAG-GLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGV 159 (516) Q Consensus 81 ~~~~~~~~~-~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~ 159 (516) .......+. .+++..+....+-..-.-+....+++.+|+..|+-++.+..+|+..++.. -+.|++.++.-++ T Consensus 42 i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~A~lv~~e~~~i~~~d~~~-------~~~l~~il~~n~f 114 (500) T protein:vir:30 42 ITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAAKKIASLVFNEQAEIKVDDDAA-------NEFISETLKNDRF 114 (500) T ss_pred HHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHHHHHhhhhcCCcceEecCChHH-------HHHHHHHHhhccH Confidence 111111111 11111111111111111233448899999999999999998888765322 2457777888889 Q ss_pred hHHHHHHHHhcccceeeEEEEEecCCCcc----c-----CcccccccccccceeeEEeecceeecccccccccccccccc Q lcl|NC_019527. 160 MGIIQKAAEHDCFFGRGQISINIKGADVS----V-----PLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFY 230 (516) Q Consensus 160 ~~~l~ea~~~~rlyG~a~i~i~i~~~~~~----~-----Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg 230 (516) +..+.+++..+..+|++++-+.+++..+. . |+..+..++ ....++-++. . . .+ ....|| T Consensus 115 ~~~~~~~~e~a~a~G~~~~k~~~d~~~~~I~~v~ad~~~P~~~d~~~~-----~~~a~~~~~~-~-~-~~----~~~~~y 182 (500) T protein:vir:30 115 NKNFERYLESCLALGGLAMRPYVDGDKVRVAFVQAPVFLPLQSNTQDV-----SSAAVVIKSV-K-T-IN----GKEVYY 182 (500) T ss_pred HHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeeEEEEEcCCCe-----EEEEEEEEEe-e-e-ec----CCceEE Confidence 99999999999999999888777664321 1 211111111 1111111000 0 0 00 000010 Q ss_pred -Ccc--------eeEEe------------eeEe-----c----cceEEEecCCc--------chhhhhhccCCCCchHHH Q lcl|NC_019527. 231 -KPS--------TWWVL------------GREM-----H----ASRLLTIITRP--------LPDMLKPAYNFSGISMSQ 272 (516) Q Consensus 231 -~P~--------~y~v~------------g~~i-----H----~SRli~~~~~~--------~p~~~k~~~~~~G~S~le 272 (516) +-+ .|+|. |..| + +.-.+.-..+| .+.. ......+|+|++. T Consensus 183 t~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~-~~~~sp~G~S~~~ 261 (500) T protein:vir:30 183 TLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNN-KDINSPLGLSIFD 261 (500) T ss_pred EEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEecCCcccc-ccCCCccCCchhh Confidence 000 11121 1111 1 11111111111 1110 0112457999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCceeee-cchhh-hcCccHHHHHHHHHHHHHhcCCcceEEEe---cCCcceeEEec Q lcl|NC_019527. 273 LAQPYVENWLRTRQSVSDLVDKFSRTFLKT-NMAQV-LNGGEGGDVFDRVEMYVNMQSNLGLAVMD---FDSEDIVQVNT 347 (516) Q Consensus 273 ~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~-~~~~~-l~~~~~~~l~~r~~~~~~~~sn~g~~~id---~~~e~~e~~~~ 347 (516) .+.+.+..++.+......-+......++-- .+... ....+++.+... .+.. +...-..++ +++..++.++. T Consensus 262 ~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~--~~d~--~~~~~~~~~~~~~~~~~i~~~~~ 337 (500) T protein:vir:30 262 NAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPRP--RFES--DQNVYIRMGGRDLDSSAIQDLTT 337 (500) T ss_pred hhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCCc--ccCC--CcceEEEcCCCCCcCcceeEecc Confidence 999999999999888887666544444321 11110 001111111000 0000 000001111 22345777665 Q ss_pred cc--CCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-- Q lcl|NC_019527. 348 PL--SGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEG---EIRSFYDDISSVQQSYYFSPLDTMLKVIQLSK-- 420 (516) Q Consensus 348 ~l--sgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~---D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~-- 420 (516) ++ ..+...++.+.+.++..+|++...| |...+|. .|+.+ ....-|.++..+|. .++..|+.|+..|+... T Consensus 338 ~ir~e~~~~~l~~~l~~i~~~~gls~~~~-~~~~~g~-~TAtei~s~~~~~~~t~~~~~~-~~~~al~~lv~~il~~~~~ 414 (500) T protein:vir:30 338 PIRADDYIKAINEGLSLFEMQIGVSAGLF-SFDGKSM-KTATEIVSENSDTYQMRNSIVA-LVEQSLKELVISIFEIAKA 414 (500) T ss_pred ccChHHHHHHHHHHHHHHHHHhCCCcccc-ccCcCcc-ccHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Confidence 54 3467778888889999999876544 4444454 34443 23455677777774 46888888888776431 Q ss_pred ---C-CCcC--CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh-hccccc Q lcl|NC_019527. 421 ---W-GEID--DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL-EIVQPE 493 (516) Q Consensus 421 ---~-g~~~--~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~-e~~~~e 493 (516) . |.++ .+++|.|++-...+..+.++. +.+++.+|+++..+++..+ .+.++++ +..-++ T Consensus 415 ~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~-------~~~~v~aGi~s~~~~i~~~--------~g~~eeea~~~l~~ 479 (500) T protein:vir:30 415 YDLYQSEVPSMDNISISLDDGVFTDRDAELDY-------WIKVVNAGFGTREMAIQKV--------LNVTEEKAQEIAAE 479 (500) T ss_pred HhhcCCCCCCCcceEEEeCCCCCCCHHHHHHH-------HHHHHHcCCCCHHHHHHhc--------CCCCHHHHHHHHHH Confidence 1 2333 368899998777777665443 4567899999999887543 2233221 111111 Q ss_pred cchhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 494 MFDDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 494 ~~~~e~~~~~~~~~~~~~~~e 514 (516) ..++...+...++++.-+-+| T Consensus 480 i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 480 INTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred HHHhccccCCCCCccccccCC Confidence 111111111112222222222 No 150 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=99.25 E-value=4.3e-11 Score=77.41 Aligned_cols=412 Identities=13% Similarity=0.054 Sum_probs=196.2 Q ss_pred cccccCCCCCCCc--cCCCccc-h-hc------cccc-----cc-------chhhhcccccCCcccc------cccCccc Q lcl|NC_019527. 50 ATKWAPPQLMPGV--VPAGTTP-A-VA------MDSL-----CG-------PTYQFLNSAAGGLYAA------DIQPFPG 101 (516) Q Consensus 50 ~~~~~~~~~~~gv--~~~~~~~-~-~a------~ds~-----~~-------~~~~~~~~~~~~~~~~------~~~~f~g 101 (516) ..-| +|.-+ +-+-... . +. ++.. .. .........+-|.+.. ......+ T Consensus 1 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~ 76 (481) T protein:vir:10 1 MTVY----TINNINTKFSPLANDDFVVSDLAELLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGD 76 (481) T ss_pred CeeE----eeehhchhcccccCceeeeecchhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccc Confidence 0000 01000 0000000 0 00 0000 00 0000000111110000 0000000 Q ss_pred HHHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEE Q lcl|NC_019527. 102 YQNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISI 180 (516) Q Consensus 102 y~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i 180 (516) + .-.+ .+++++.||+..+.-++.+++++++.++.. .+.|...+++-++...+.++.+...+||.|++++ T Consensus 77 ~---~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~ 146 (481) T protein:vir:10 77 K---ADHRAVHNYAKYVSRFIVGYLTGNPITITHQDNQT-------NDKIIELNDLNDADEVNSDLALNLSIYGRAYEIV 146 (481) T ss_pred c---ccceeecchHHHHHHHHHhhhccCCceEecCChhH-------HHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEE Confidence 0 0011 457889999999999999999998865433 2457777888889999999999999999999888 Q ss_pred EecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee--------eEec-cceEEEec Q lcl|NC_019527. 181 NIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG--------REMH-ASRLLTII 251 (516) Q Consensus 181 ~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g--------~~iH-~SRli~~~ 251 (516) .++... .+ .+.+++|.++.|..-+. ....+.++ -.+|.... ..|+ +.++.++. T Consensus 147 ~~d~dg---------------~~-~i~~~~p~~~~~v~d~~-~~~~~~~~-i~~~~~~~~~~~~~~~~~~y~~~~i~~~~ 208 (481) T protein:vir:10 147 YRDFED---------------RD-TFKVLDPKSTFVVYDQT-LDKKVVAG-VRYFEKQDKDKVPVQHVEVYTTDKIYYIE 208 (481) T ss_pred EeCCCC---------------eE-EEEEEcccceEEEEcCC-CCCceEEE-EEEEEEeeCCCceEEEEEEEecCeEEEEE Confidence 764321 11 14455555554431100 00011111 00111100 0111 12222221 Q ss_pred CC---------------cchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHH Q lcl|NC_019527. 252 TR---------------PLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDV 316 (516) Q Consensus 252 ~~---------------~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l 316 (516) .. .+|-. .-.++-+|.|.++.+.+.+.+++++....+.-+..++..++...............+ T Consensus 209 ~~~~~~~~~~~~~~~~g~vPvv-~~~n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 287 (481) T protein:vir:10 209 IKGGTYHRVEEVEHYYNDVPII-EYLNDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKAF 287 (481) T ss_pred ecCCceeecccccccCCceeEE-EeecCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCccchhhh Confidence 10 11111 112345799999999999999999988888777777777665432111111111111 Q ss_pred HHHHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHH--- Q lcl|NC_019527. 317 FDRVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIR--- 391 (516) Q Consensus 317 ~~r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~--- 391 (516) .. -..+... .... .....++.+++.+ +.+.+++...++.+.+.|...+++|-. .+|. .+ -|.||+.=.. T Consensus 288 ~~-~~~~~~~-~~~~-~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~-~~~~-~~-~n~Sg~Al~~~~~ 361 (481) T protein:vir:10 288 RD-ANMIHLE-PGTN-ANGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDL-NDEQ-FS-GVQSGESMKYKLF 361 (481) T ss_pred hh-ccceecc-cccc-ccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc-cccc-cc-cccHHHHHHHHHH Confidence 11 0000000 0000 0111222344444 445577899999999999999999963 4442 22 2456653222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHh--CCCcC---CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHH Q lcl|NC_019527. 392 SFYDDISSVQQSYYFSPLDTMLKVIQLSK--WGEID---DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPS 466 (516) Q Consensus 392 ~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~--~g~~~---~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~ 466 (516) .-...++. ++..++..+++++++++.-. -+..+ .++++.|++-...++++.|++..+. .|+||.+ T Consensus 362 ~l~~k~~~-~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl---------~g~is~e 431 (481) T protein:vir:10 362 GLEQVRAI-KERLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFTPNLPKSMMESINAFNAL---------SGGVSES 431 (481) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccCCCccccceeeEEeCCCCCcCHHHHHHHHHHH---------hccCChH Confidence 22333444 34668888999888776431 11111 3689999999999999988766442 3788887 Q ss_pred HHHHHHHhhhccCCCCCChhhhccccccchh----c--CCCCCCCCCCCCCCCCC Q lcl|NC_019527. 467 EARQQLSDDPDSGWDNIDGDLEIVQPEMFDD----D--GADPYMPDPDVLPGEEG 515 (516) Q Consensus 467 e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~----e--~~~~~~~~~~~~~~~e~ 515 (516) .+.+.|.. .....++.+....|..+. . +.....++.++..+++| T Consensus 432 t~~~~l~~-----i~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~dd~~g 481 (481) T protein:vir:10 432 TRLSLLDF-----IDNPKEELEKMQEEEAQREKQADKRGYGEAFENHLNVDDSNG 481 (481) T ss_pred HHHHhCCC-----CCCHHHHHHHHHHHHHHHHhhhhhccCCccCCCCCCCCCCCC Confidence 77665521 011111111111111100 0 01112222333344444 No 151 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=99.25 E-value=5.8e-11 Score=76.70 Aligned_cols=435 Identities=14% Similarity=0.142 Sum_probs=200.8 Q ss_pred hhHHHH-hHHhh-cCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHH-- Q lcl|NC_019527. 34 MRRAVM-KSMER-RASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALA-- 109 (516) Q Consensus 34 ~~~~~~-~~~~~-~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~-- 109 (516) +..+|. =++++ +..+..+.++||...+|..+ +.+. ++++.....+..-...++|...|+ T Consensus 1 ~~~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~------~~~~-----------~~~g~~~~~e~~~~~~~eLI~~YR~m 63 (537) T protein:vir:10 1 MAQQLFGFSLQRAKKVPKGPSFVQKDSLDGSQP------IVGG-----------GYFGYSVDFDGTIRNDHELITRYREM 63 (537) T ss_pred CccccccceeecccccccCCcccCCCcccccce------eecc-----------cccccccccccccchHHHHHHHHHHH Confidence 111121 11111 11222234444433333211 1111 111111112222234688988886 Q ss_pred -hCchhhhhhhhhhHHHhhC-----CCeeeeccccchhhhHHH-HHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEe Q lcl|NC_019527. 110 -TRPEYRAFASTLSTELTRE-----GIEITSKDRTKAKEMASK-IKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINI 182 (516) Q Consensus 110 -~~~i~r~iVd~~aed~~r~-----~~~i~~~~~~~~~~~~~~-i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i 182 (516) .++++..+|+.+++||+-+ .+.+...+.+.++...++ ..+++..++-|++...-.+.+|.--+.|.-+.-..| T Consensus 64 a~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fhKii 143 (537) T protein:vir:10 64 VLNPECDSAVDDVVNETICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFFHKVI 143 (537) T ss_pred hhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEE Confidence 6899999999999999743 223333332222211111 133444444557777777777776676766555556 Q ss_pred cCCCcccCcccccccccccceeeEEeecceeecccccc---------ccccccc-------cc-cCcceeEEe---eeEe Q lcl|NC_019527. 183 KGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYN---------ALDPTAP-------DF-YKPSTWWVL---GREM 242 (516) Q Consensus 183 ~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~---------~~dp~s~-------~y-g~P~~y~v~---g~~i 242 (516) +..++.. +|+.|+.+||..+...... ..+-... .| |.|..+..+ +.+| T Consensus 144 d~k~pk~------------GI~ELr~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI 211 (537) T protein:vir:10 144 DPKKPRQ------------GLVELRYVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKI 211 (537) T ss_pred eCCCccc------------cceeeeeeCCccceeeEeecccCCccceEEecceeeeecccceeeeccccccccCCCceec Confidence 6554432 2444555555554332221 0011110 01 223222221 3466 Q ss_pred ccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhC----CceeeecchhhhcCccHHHHHH Q lcl|NC_019527. 243 HASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFS----RTFLKTNMAQVLNGGEGGDVFD 318 (516) Q Consensus 243 H~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~----~~v~k~~~~~~l~~~~~~~l~~ 318 (516) +.+=+. |+..-+ ........+|.|+.+...+.+.-....+ -++++.+ -.|+-+|+.++-.. ..++..+ T Consensus 212 ~~dAI~-y~hSGl----~d~n~~~i~syLhkAiKp~NQLkm~EDA--lVIYRitRAPeRRvFYIDVGnLPk~-KAeqYlr 283 (537) T protein:vir:10 212 APDSIA-YCHSGI----QDLNKNMVLSHLHKAIKAVNQLRMIEDS--LVIYRLSRAPERRIFYIDVGNLPKN-KAEQYLR 283 (537) T ss_pred cHhhee-eecccc----eeCCCCeeeeeehhhhHHHHhhHHHHhh--HHHHhhhccccceEEEEecCCCCch-hHHHHHH Confidence 664433 332222 2233456788888887777765544433 3445444 34555554443211 1111111 Q ss_pred HHHHHHHhcCCcceEEEecCC--------------------------cceeEEe--cccCCHHHHHHHHHHHHHhhhcCC Q lcl|NC_019527. 319 RVEMYVNMQSNLGLAVMDFDS--------------------------EDIVQVN--TPLSGLADLQSQSQEHMCSVSKIP 370 (516) Q Consensus 319 r~~~~~~~~sn~g~~~id~~~--------------------------e~~e~~~--~~lsgl~d~~~~~~~~iaaas~IP 370 (516) ..+++++ +-++-|+.+ -+++++. -+|+.++|+ .+|+..+-.+.++| T Consensus 284 --~iM~k~K---NklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnlgem~DV-~YF~kKLy~aLnVP 357 (537) T protein:vir:10 284 --EVMGRYR---NKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDV-KYFQKKLYKALNVP 357 (537) T ss_pred --HHHHhcc---ceEEEeccCceecccchhhhhhhhhcccccCCCcccceeeccccCCcChHHHH-HHHHHHHHHHhCCC Confidence 1111111 011111111 1344442 235556554 58999999999999 Q ss_pred ceeeeccccccccccchH----HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH------HhCCCcCCcceEEeCCCCCCC Q lcl|NC_019527. 371 AIKLTGISPSGLNASSEG----EIRSFYDDISSVQQSYYFSPLDTMLKVI-QL------SKWGEIDDAITFKFKSLWQTS 439 (516) Q Consensus 371 ~t~L~G~sp~Glnatge~----D~~~yyd~I~~~Qe~~l~p~l~~l~~~l-~~------s~~g~~~~d~~~~f~pL~~~s 439 (516) .++|-. .+|+|-+.-+ |.-.|..+|.+.|..+ ..++..+++.= .+ ..|-.+-+++.|+|..=.-.+ T Consensus 358 ~SRl~~--e~~f~~Gr~~EItRDEiKF~KFI~RLR~rF-s~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ 434 (537) T protein:vir:10 358 SSRLET--ETTFNIGRAAEITRDEVKFQKFIARLRKRF-SELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFT 434 (537) T ss_pred ccccCC--CCcccccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHH Confidence 999943 3576543222 5567999999998654 45555554421 11 123334457889999888888 Q ss_pred HHHHHHHHHHHHHHHHHHHH-cC-CCCHHHHHHHHHh---------------h-hccCCCCCChhhhccccccchhcCCC Q lcl|NC_019527. 440 AKEESEIRFNKAQEAQIYIT-NS-VIDPSEARQQLSD---------------D-PDSGWDNIDGDLEIVQPEMFDDDGAD 501 (516) Q Consensus 440 ekEkAei~~~~a~a~~~~~~-~g-vi~~~e~r~~l~~---------------~-~~~~~~~~d~~~e~~~~e~~~~e~~~ 501 (516) |-..+|+...+.++++.+-. .| .++.+-+++.+-. . .+..|..-+ +++.-+....+.+--+ T Consensus 435 ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~-~~~~~~~~~~~~~~~~ 513 (537) T protein:vir:10 435 ELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQ-AMQAMEMGIGDEEPVP 513 (537) T ss_pred HHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCcc-cccccccCCCCcccCC Confidence 88899999988888876521 11 3455545443211 1 111222110 0000000000000001 Q ss_pred CCCCC----------CCCCCCCCC Q lcl|NC_019527. 502 PYMPD----------PDVLPGEEG 515 (516) Q Consensus 502 ~~~~~----------~~~~~~~e~ 515 (516) +.+.+ |....++|= T Consensus 514 ~~~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:10 514 EGGEEPQTDPNSAVSPADQKRGEL 537 (537) T ss_pred CCCCCcccCCccCCCCCCccCCCC Confidence 11111 111112222 No 152 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=99.25 E-value=3.3e-10 Score=72.59 Aligned_cols=415 Identities=9% Similarity=0.005 Sum_probs=196.4 Q ss_pred cccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHH-HhCchhhhhhhhhhHHHhhC Q lcl|NC_019527. 50 ATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAAL-ATRPEYRAFASTLSTELTRE 128 (516) Q Consensus 50 ~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y-~~~~i~r~iVd~~aed~~r~ 128 (516) -..-..|.+|...++......++ +++ +.... .- ...+.+..-+..+...+ ....-+..++++...-+++. T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~-~~~--~~~~~---~e---~~~~lr~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~ 71 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVV-DGW--TVWDP---FE---QTPELQWPQSVAVYSRMDNEDSRVTSLLEAISLPIRST 71 (469) T ss_pred CCCcccCCCCccchhhhhhcccc-cch--hhccc---cc---cccccccccchHHHHHHHhhChHHHHHHHHHHHHHhcC Confidence 11111222232222221111111 100 00000 00 00111111234444444 46899999999999888888 Q ss_pred CCeeeeccccchhhhHHHHHHHHHHHHh-----------------cChhHHHHHHHHhcccceeeEEEEE--ecCCCccc Q lcl|NC_019527. 129 GIEITSKDRTKAKEMASKIKELEEACEY-----------------YGVMGIIQKAAEHDCFFGRGQISIN--IKGADVSV 189 (516) Q Consensus 129 ~~~i~~~~~~~~~~~~~~i~~i~~~~~~-----------------l~~~~~l~ea~~~~rlyG~a~i~i~--i~~~~~~~ 189 (516) .|+|...++++. . .+.+.+.++. ....+.|.+.+-....||.++.=++ .++...+ T Consensus 72 ~w~v~p~~~~~e--~---~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~d- 145 (469) T protein:vir:10 72 PWRIRANGASDE--V---TEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPD- 145 (469) T ss_pred CceEecCCCCHH--H---HHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCC- Confidence 888876544321 1 1223332222 1234566666777888999986443 2211100 Q ss_pred CcccccccccccceeeEEeecce-----eeccc----cccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhh Q lcl|NC_019527. 190 PLILDPRTIKKGSLTGFSNIEPM-----WTSPS----AYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLK 260 (516) Q Consensus 190 Pl~ld~~~I~~g~l~~l~v~d~~-----~v~p~----~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k 260 (516) .. -.++.|....+. .+.+. ......+..+.-+.+..+...+..+++.+.|++.... T Consensus 146 G~---------~~~~~l~~rp~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~------ 210 (469) T protein:vir:10 146 GR---------FWLRKLAPRPQWTISKFNVAPDGGLESIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNK------ 210 (469) T ss_pred Cc---------eeeeeeeecCcccceeeeeccCCceeeeeecCcccccccccccCCCCccccccCcEEEEEecC------ Confidence 00 001112111111 11110 0111111111111111122335678888888887543 Q ss_pred hccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce--eeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecC Q lcl|NC_019527. 261 PAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF--LKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFD 338 (516) Q Consensus 261 ~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v--~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~ 338 (516) ...+.+|.|++..|+....--......-+.++.++++++ .|++-. .++++...-++++..++.+....++..+ T Consensus 211 ~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~-----a~~~ek~~l~~a~~~~~~g~~a~~iip~ 285 (469) T protein:vir:10 211 RPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSA-----TDEDEVRKMAALARSVRGGINAGVGLAQ 285 (469) T ss_pred CCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCC-----CCHHHHHHHHHHHHHHhcCCceEEEccC Confidence 345678999999999987666667778888999988654 444311 1222333333444444433333333344 Q ss_pred CcceeEEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 339 SEDIVQVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVI 416 (516) Q Consensus 339 ~e~~e~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l 416 (516) +.+++.++.+-++ ...+++..-++|+-+.=-. | |..++-+|-.|.|+.-.....+.+++.......-+.+.|+.-| T Consensus 286 ~~~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~-t-lTs~~~gGS~a~~~vh~ev~~d~~~sDa~~i~~tln~~li~~l 363 (469) T protein:vir:10 286 GQILELLGVSGNLPDIRRAIEGHDRSIALSGLAH-F-LNLDGKGGSYALASVLEDPFTQAVHAYATSICRIANQHIIEDL 363 (469) T ss_pred CceEEEeecCCCchHHHHHHHHHHHHHHHHHhcc-c-ccccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5788988865332 3446666666776443211 1 2222223444556656666777777766554433445688877 Q ss_pred HHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCC-----CHHHHHHHHHhhhccCCCCCChhhhccc Q lcl|NC_019527. 417 QLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVI-----DPSEARQQLSDDPDSGWDNIDGDLEIVQ 491 (516) Q Consensus 417 ~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi-----~~~e~r~~l~~~~~~~~~~~d~~~e~~~ 491 (516) +.-.||....-..|+|...-. ..+..|++++.++++|++ +.+.+++.+ |++.-....+... T Consensus 364 ~~lN~g~~~~~P~~~~~~~e~--------~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~------gip~~~~~~~~~~ 429 (469) T protein:vir:10 364 VDINFGVDTPAPVLTFDPIGS--------RQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRF------NLPSELNDTPSAE 429 (469) T ss_pred HHhcCCCCCCccEEEecCCCC--------cHHHHHHHHHHHHhcCCccCccccHHHHHHHh------CCCCCCCCccccc Confidence 776677544445788864321 123457888999999995 445566665 3332221111111 Q ss_pred cccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 492 PEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 492 ~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) +..+......++++ +...+....| T Consensus 430 ~~~~~~~~~~~~~~-~~~~~~~~~~ 453 (469) T protein:vir:10 430 PEEPAAVPNQSAAP-ARTRSSGNAD 453 (469) T ss_pred chhcccCCCCCccc-cccCCCCCcc Confidence 11111111111111 1111122222 No 153 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.24 E-value=1.9e-12 Score=84.81 Aligned_cols=434 Identities=12% Similarity=0.107 Sum_probs=202.6 Q ss_pred hhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCc-cchhcc---cccccchhhhcccccCCcccccccC---cccHHH Q lcl|NC_019527. 32 LAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGT-TPAVAM---DSLCGPTYQFLNSAAGGLYAADIQP---FPGYQN 104 (516) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~-~~~~a~---ds~~~~~~~~~~~~~~~~~~~~~~~---f~gy~l 104 (516) |.+.. +.+..|.|.. ++ .+ +.+.++ |++..+..........+ ...+... ...-.- T Consensus 1 Mn~iD-----------r~i~~~sP~~---a~---~R~~ar~~~~~y~aa~~~r~~~~~~~~~s-~~~~i~~~~~~lr~Ra 62 (548) T protein:vir:95 1 MNLID-----------RLLEPLAPEL---VA---RRLAAREAIQAYEAARPGRTHKAKRQPLG-ADTSLQKSAVSMREQC 62 (548) T ss_pred CchHH-----------hHhhhcchHH---HH---HHHHhHHHhccccccCccccccccCCCCC-hHHHHHHHHHHHHHHH Confidence 11111 1111111110 00 00 000000 00000000000000000 0000000 000112 Q ss_pred HHHHHhCchhhhhhhhhhHHHhh-CCCeeeecccc-chhhhHHHHHHHHHHHHhc----------ChhHHHHHHHHhccc Q lcl|NC_019527. 105 LAALATRPEYRAFASTLSTELTR-EGIEITSKDRT-KAKEMASKIKELEEACEYY----------GVMGIIQKAAEHDCF 172 (516) Q Consensus 105 l~~y~~~~i~r~iVd~~aed~~r-~~~~i~~~~~~-~~~~~~~~i~~i~~~~~~l----------~~~~~l~ea~~~~rl 172 (516) -.+++-|++++.+|+...+..+= .|+.+...--. +.+...++-++|+..|++. .+....+.+++.... T Consensus 63 RdL~rNn~~a~~av~~~~~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~ 142 (548) T protein:vir:95 63 RKLDEDHDLVTGLLDRLEERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLR 142 (548) T ss_pred HHHHhcChHHHHHHHHHHHhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHh Confidence 35677899999999999998884 56666542111 1111123334555555433 355555556666666 Q ss_pred ceeeEEEEEecC-CC--cccCcccccccccccceeeEEeecceeeccccccccc------cccccccCcceeEEee---- Q lcl|NC_019527. 173 FGRGQISINIKG-AD--VSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALD------PTAPDFYKPSTWWVLG---- 239 (516) Q Consensus 173 yG~a~i~i~i~~-~~--~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~d------p~s~~yg~P~~y~v~g---- 239 (516) .|-+++.+..+. .+ ...++++ .|.+|+|.+|.-. .+..+ ..-..+|+|..|+|.. T Consensus 143 dGE~f~~~~~~~~~~~~~g~~~~~-----------~lqliepd~l~~~-~~~~~~~i~~GIE~D~~Grp~aY~i~~~hPg 210 (548) T protein:vir:95 143 DGEGLAQKLMGRVPNYTFATSVPF-----------ALELLEPDYLPFS-YNNLSKGIVQGIERDTWRRKRAYHLLKDHPG 210 (548) T ss_pred CCceEEEeeecccccccCCcccce-----------EEEEechhhcCCC-CCCCCCceeeeeEECCCCceEEEEEeecCCC Confidence 777776654422 11 1111111 1444555554210 00000 0112467888887741 Q ss_pred -----------eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHH-HhCC-ceeeecchh Q lcl|NC_019527. 240 -----------REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVD-KFSR-TFLKTNMAQ 306 (516) Q Consensus 240 -----------~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~-~~~~-~v~k~~~~~ 306 (516) ..|..++|+|+-... ...+.-|+|.|-.++..|++++.-..+--.-.+ .+.+ -+++.+... T Consensus 211 d~~~~~~~~~~~rvpA~~VlHif~~~------r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~ 284 (548) T protein:vir:95 211 NLQTLGGSLAVKRVEAERIIHIAYRK------RIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPD 284 (548) T ss_pred cccccccccceeeechhHheeccccc------CCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCc Confidence 357778888875432 334556999999999999888776665433222 1222 334543322 Q ss_pred hhcCccHHHHHHHHHHHHHhcCCcceEE-EecCCcceeEEecc--cCCHHHHHHHHHHHHHhhhcCCceeeecccccccc Q lcl|NC_019527. 307 VLNGGEGGDVFDRVEMYVNMQSNLGLAV-MDFDSEDIVQVNTP--LSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLN 383 (516) Q Consensus 307 ~l~~~~~~~l~~r~~~~~~~~sn~g~~~-id~~~e~~e~~~~~--lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln 383 (516) .........-... ...+ .-|+.+ ....+++++.++.+ -++..++...+...||+.+|||.-.|.|-. ++.- T Consensus 285 ~~~~~~~~~~~~~---~~~~--~pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~-s~nY 358 (548) T protein:vir:95 285 SYTVEPGKDRKNR---TIPI--APGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAY-DGTY 358 (548) T ss_pred cccCCCCcccccc---cccc--cCCccccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccc-chhH Confidence 2211111100000 0011 134432 22456888888754 468999999999999999999999999875 4544 Q ss_pred ccchHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHhCCCc--CC------cceEEeCC--CCCCCHHHHHHHHHH Q lcl|NC_019527. 384 ASSEGEIRSFYDDISSVQQSYY----FSPLDTMLKVIQLSKWGEI--DD------AITFKFKS--LWQTSAKEESEIRFN 449 (516) Q Consensus 384 atge~D~~~yyd~I~~~Qe~~l----~p~l~~l~~~l~~s~~g~~--~~------d~~~~f~p--L~~~sekEkAei~~~ 449 (516) ||.-..+..+...++.+|+..+ +|+.+.+++..+++ |.+ |. -+..+|-+ .-..+. .| T Consensus 359 SS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~--G~i~lP~~~~~~~~~~~~W~~P~~~~iDP-------~K 429 (548) T protein:vir:95 359 SAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLA--RKERLPADVDHRTLYAAVYQGPVMPWINP-------MH 429 (548) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc--CCcCCCCCCCchhheeeeeecCCccccCh-------HH Confidence 5666677788888888887654 44555555444433 444 32 13455532 111222 35 Q ss_pred HHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh--------------------hcccc-ccchhc---------- Q lcl|NC_019527. 450 KAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL--------------------EIVQP-EMFDDD---------- 498 (516) Q Consensus 450 ~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~--------------------e~~~~-e~~~~e---------- 498 (516) .+++....+++|+.|..++..+.+.+.+..+..+-.+. ....+ +..+.+ T Consensus 430 ea~A~~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (548) T protein:vir:95 430 EANAWELLVKAGFADEAEVARARGRDPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTA 509 (548) T ss_pred HHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCchhhhcccccccccc Confidence 67777888999999988876554333221111110000 00000 000000 Q ss_pred -------CCCCC-CCCC-CCCCCCCCC Q lcl|NC_019527. 499 -------GADPY-MPDP-DVLPGEEGS 516 (516) Q Consensus 499 -------~~~~~-~~~~-~~~~~~e~t 516 (516) +..+. .+-| -+.++++.| T Consensus 510 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 536 (548) T protein:vir:95 510 DEARELVNRYGAGLPVPGPDFPNESNN 536 (548) T ss_pred chhHHhhccCCCCCcCCCCCCCccccc Confidence 00000 0001 112333333 No 154 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.24 E-value=1.1e-10 Score=75.15 Aligned_cols=419 Identities=12% Similarity=0.111 Sum_probs=182.3 Q ss_pred cCCCcCCC--CCChhhhH-HHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCccccccc Q lcl|NC_019527. 21 ARAEEQEK--ARKLAMRR-AVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQ 97 (516) Q Consensus 21 ~~~~~~~~--~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~ 97 (516) ..-++.-. .-+..... .+.+....+..+ .+....++-|-+.-... T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~r--------------------------------~~~~~~Yy~G~~~i~~~ 48 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDSTQN--------------------------------LKTNTSYYEAERRPEAI 48 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH--------------------------------HHHHHHHHhcCCcchhc Confidence 11111111 11111111 111111111110 00111111111110000 Q ss_pred C-cccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceee Q lcl|NC_019527. 98 P-FPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRG 176 (516) Q Consensus 98 ~-f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a 176 (516) + ...-.+-.....+.++++||+..++-+.=+|+.+.. +++.+ +.+++.|++-++.....++.+.+.+||.| T Consensus 49 ~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~--~~~~~------~~~~~i~~~N~~d~~~~~~~~~a~i~G~a 120 (485) T protein:vir:10 49 GVTVPIQMQSLLAHVGYPRLYVDSIAERQAVEGFRFGD--ADEAD------EELWQWWQANNLDIEAPLGYTDAYVHGRS 120 (485) T ss_pred CCCCChhhhhhhhhcCcHHHHHHHHHhhhcccceecCC--CchhH------HHHHHHHHhcCHhHHHHHHHHHHhhcCce Confidence 0 000011111223467899999999988777776532 22111 34566677778889999999999999999 Q ss_pred EEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccc-c---------------ccccccCcce-eEEe- Q lcl|NC_019527. 177 QISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALD-P---------------TAPDFYKPST-WWVL- 238 (516) Q Consensus 177 ~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~d-p---------------~s~~yg~P~~-y~v~- 238 (516) ++++..+... .++..+ ++. -.|++++|.++.+..-...+ + ..-.++.+.. |.+. T Consensus 121 y~~v~~~e~~--~~~~~~-----~~~-~~i~~~~p~~~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~ 192 (485) T protein:vir:10 121 YITISRPDPQ--IDLGWD-----PNT-PIIRVEPPTRMYAEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYR 192 (485) T ss_pred EEEEeeCCcc--cccccC-----CCe-eEEEEEccceeEEEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEE Confidence 9887654321 111000 111 12445555554432100000 0 0001111111 1110 Q ss_pred --e------eEeccc---eEEEecCCcchhhhhhccCCCCchHHHH-HHHHHHHHHHHHHHHHHHHHHhCCceeee---c Q lcl|NC_019527. 239 --G------REMHAS---RLLTIITRPLPDMLKPAYNFSGISMSQL-AQPYVENWLRTRQSVSDLVDKFSRTFLKT---N 303 (516) Q Consensus 239 --g------~~iH~S---Rli~~~~~~~p~~~k~~~~~~G~S~le~-~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~---~ 303 (516) + ..=|+- -|+.|.+++ .....||.|.++. +...+.+++++.......+..++.+...+ + T Consensus 193 ~~~~~~~~~~~~~~~g~vPvv~~~n~~------~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~ 266 (485) T protein:vir:10 193 VENEWQEWFNNPHGLGVVPVVPIPNRT------RLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIK 266 (485) T ss_pred cCCceEEeccccCCCCcccEEEecccc------ccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCC Confidence 0 000211 112222211 1223579998875 66666777777665554444444433221 1 Q ss_pred chhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEe-cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccc Q lcl|NC_019527. 304 MAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVN-TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGL 382 (516) Q Consensus 304 ~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~-~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gl 382 (516) ... ....++.. ...+... ...++++.+++-++-+++ .++.+..+.++....+||+.+++|...|-| +..+ T Consensus 267 ~~~-~~~~~~~~----~~~~~~~--~~~i~~~~~~d~k~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~-~~~n- 337 (485) T protein:vir:10 267 PEE-IGVDPETG----QTLFDAY--LARILAFEDAEGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLST-AADN- 337 (485) T ss_pred ccc-cccccccc----chhhhhc--ccceeccCCCCceEEeecccchHHHHHHHHHHHHHHhcccCCCHHHhcc-ccCc- Confidence 111 10011100 0111111 112333333323444443 234556677777888999999999887644 3221 Q ss_pred cccchHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CcC---CcceEEeCCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_019527. 383 NASSEGEI---RSFYDDISSVQQSYYFSPLDTMLKVIQLSKWG-EID---DAITFKFKSLWQTSAKEESEIRFNKAQEAQ 455 (516) Q Consensus 383 natge~D~---~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g-~~~---~d~~~~f~pL~~~sekEkAei~~~~a~a~~ 455 (516) ++||+.=. ......++.+| ..+.+.+++++++++.-..+ ..+ .++++.|.+....|.+|.|+... T Consensus 338 ~~Sg~Al~~~~~~l~~k~~~k~-~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~------- 409 (485) T protein:vir:10 338 PASAEAIRAAESRLIKKVERKN-SIFGGAWEEAMRLAYRMMKGGDVPPDMLRMETVWRDPSTPTYAAKADAAS------- 409 (485) T ss_pred hhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCCCCCcccceeeeEEecCCCCCCHHHHHHHHH------- Confidence 25665433 33333444444 45788899998877653322 222 26789999998999988766554 Q ss_pred HHHHcC--CCCHHHHHHHHHhhhccCCCCCChh-hhc------------------cccccchhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 456 IYITNS--VIDPSEARQQLSDDPDSGWDNIDGD-LEI------------------VQPEMFDDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 456 ~~~~~g--vi~~~e~r~~l~~~~~~~~~~~d~~-~e~------------------~~~e~~~~e~~~~~~~~~~~~~~~e 514 (516) +++++| +++.+.+++.| ||..-+.+ ++. ..+...+..+..+....+....+++ T Consensus 410 kl~~ag~~~~s~et~~~~l------g~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 483 (485) T protein:vir:10 410 KLYNGGTGVIPRERARKDM------GYSIAEREEMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPAPAPKPAALESGGD 483 (485) T ss_pred HHHhccccCCCHHHHHHhC------CCCHhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCccccccCcCCCCCCC Confidence 455544 66766666554 22111100 000 0000011111111111122233344 Q ss_pred CC Q lcl|NC_019527. 515 GS 516 (516) Q Consensus 515 ~t 516 (516) ++ T Consensus 484 ~~ 485 (485) T protein:vir:10 484 AA 485 (485) T ss_pred CC Confidence 44 No 155 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=99.24 E-value=1.5e-11 Score=79.89 Aligned_cols=433 Identities=14% Similarity=0.063 Sum_probs=191.3 Q ss_pred hhHHHHhHH---hhcCC-CccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCccc--c-cccCcccHHHHH Q lcl|NC_019527. 34 MRRAVMKSM---ERRAS-DAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYA--A-DIQPFPGYQNLA 106 (516) Q Consensus 34 ~~~~~~~~~---~~~~~-~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~--~-~~~~f~gy~ll~ 106 (516) +...+.+.. -++-. ... .....--+++.- .+.... ........+.|.+. . ......|.+.-. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~--------~~~~~~-~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~ 69 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKA--LKDVKDHKKVNA--------NDEDYK-YIDMWKRLYQGHYAEWHNLNYEHNGNPVNR 69 (496) T ss_pred ChhHHHHHHHHHHHHhccchh--hHHHHhcCCCcC--------CHHHHH-HHHHHHHHhcCCCchhhcchhccCCCcccc Confidence 222333222 12110 000 000000011100 000000 11111112222110 0 000001111111 Q ss_pred HHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCC- Q lcl|NC_019527. 107 ALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGA- 185 (516) Q Consensus 107 ~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~- 185 (516) ......+++.||+..|.-++-+...|++.++.. .+.|+..++.-++...+++++.....+|++++.+.++.. T Consensus 70 ~~~~~n~~k~i~~~~a~~l~~~p~~i~~~d~~~-------~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~ 142 (496) T protein:vir:38 70 RQLSMNLPKVTAKYMSKLLFNEKVKINIDDKAA-------EEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNK 142 (496) T ss_pred ceeecchHHHHHHHHhhhhhCCcceEeeCChHH-------HHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCC Confidence 122358899999999999999999998865322 345677777778999999999999999999988877632 Q ss_pred Cc-------cc--CcccccccccccceeeEEeecceeecccccc------------------ccccccccccCcceeEE- Q lcl|NC_019527. 186 DV-------SV--PLILDPRTIKKGSLTGFSNIEPMWTSPSAYN------------------ALDPTAPDFYKPSTWWV- 237 (516) Q Consensus 186 ~~-------~~--Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~------------------~~dp~s~~yg~P~~y~v- 237 (516) .+ +. |+.-+ .+.+..+.-+..+......+. .....+...|.|--+.- T Consensus 143 ~~~i~~v~~~~~~P~~~~-----~~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~ 217 (496) T protein:vir:38 143 NVKVSFATADCMYPLSND-----SENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLL 217 (496) T ss_pred cEEEEEEcccceEEEEec-----CCcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCcccccccc Confidence 11 11 22111 122222221211111100000 00000011111100000 Q ss_pred ----ee-eEec-cceEEE-ecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceee-ecchhhhc Q lcl|NC_019527. 238 ----LG-REMH-ASRLLT-IITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLK-TNMAQVLN 309 (516) Q Consensus 238 ----~g-~~iH-~SRli~-~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k-~~~~~~l~ 309 (516) .. ..++ -+|+.+ +...+.+.. ......+|+|+++.+.+.+..++.+......-+......++- ..+...+. T Consensus 218 ~~~~~~~~~~~~~~~~~f~~~~~~~~N~-~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~~ 296 (496) T protein:vir:38 218 FDDIEPVVPLPDFTRPTFIYIKPNIANN-KNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAV 296 (496) T ss_pred ccccccceeecCCCcceEEEecCCcccc-cccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhccC Confidence 00 0000 022211 111111110 112345799999999999999998877766555443333322 11111111 Q ss_pred CccHHHHHHHHHHHHHhcCCcce-EEEecC----CcceeEEeccc--CCHHHHHHHHHHHHHhhhcCCceeeeccccccc Q lcl|NC_019527. 310 GGEGGDVFDRVEMYVNMQSNLGL-AVMDFD----SEDIVQVNTPL--SGLADLQSQSQEHMCSVSKIPAIKLTGISPSGL 382 (516) Q Consensus 310 ~~~~~~l~~r~~~~~~~~sn~g~-~~id~~----~e~~e~~~~~l--sgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gl 382 (516) ...+..+ ..+ ..+... .++.+. ...++.++.++ ......++...+.++..+|+|... ||...+|. T Consensus 297 ~~~g~~~----~~~---~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~-f~~~~~g~ 368 (496) T protein:vir:38 297 NLDGSTT----QYF---DSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGT-FTFDENGL 368 (496) T ss_pred CCCCccc----cCC---CCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhh-cCCCcccc Confidence 2111111 001 111111 111111 12466665444 345677888889999999998754 56555564 Q ss_pred cccchHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----h-CCCc--CCcceEEeCCCCCCCHHHHHHHHHHHH Q lcl|NC_019527. 383 NASSEGE---IRSFYDDISSVQQSYYFSPLDTMLKVIQLS-----K-WGEI--DDAITFKFKSLWQTSAKEESEIRFNKA 451 (516) Q Consensus 383 natge~D---~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s-----~-~g~~--~~d~~~~f~pL~~~sekEkAei~~~~a 451 (516) +|+..= ....+.++..+| ..++..|++++..++.. . .|.. +.+++|.|+.-...++.+.++.. T Consensus 369 -~tAtei~~~~~~l~~~~~~~~-~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~---- 442 (496) T protein:vir:38 369 -KTATEVVSEKSETYQTKNSHS-QLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRY---- 442 (496) T ss_pred -chHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHH---- Confidence 344322 222344455544 45677777776666532 1 1222 34689999998888888765544 Q ss_pred HHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCCh-hhhccccccchhcCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 452 QEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDG-DLEIVQPEMFDDDGADPYMPDPDVLPGEEG 515 (516) Q Consensus 452 ~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~-~~e~~~~e~~~~e~~~~~~~~~~~~~~~e~ 515 (516) .+++.+|++|.+.++..+ .+.++ +.+..-+...++.....+..+.....+++- T Consensus 443 ---~~~~~~GiiS~et~l~~~--------~~~~d~ea~~el~ri~~E~~~~~~~~d~~~~~~~~e 496 (496) T protein:vir:38 443 ---TNAKNQGMIPLKIALQRA--------WNITEAEADEWAEMLAKEKQAEMPNNDMNGIFGEEE 496 (496) T ss_pred ---HHHHhcCCCCHHHHHHhc--------CCCChHHHHHHHHHHHHhhhccCccccccCCCCCCC Confidence 456788999988776542 22221 111000111111111111111111111111 No 156 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.23 E-value=5.2e-11 Score=76.94 Aligned_cols=409 Identities=13% Similarity=0.105 Sum_probs=181.6 Q ss_pred hhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHH-Hh Q lcl|NC_019527. 32 LAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAAL-AT 110 (516) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y-~~ 110 (516) |.--..++..+..+-.... +++ .....++-|-+.-...+-.-.+-.+.+ .. T Consensus 1 ~~t~~d~i~~L~~~~~~~~------------------~r~----------~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~ 52 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDL------------------PNL----------LEAEAYRNGTRRLKTIGIGAPPELAYLDVQ 52 (480) T ss_pred CCCHHHHHHHHHHHHHHHH------------------HHH----------HHHHHHHhccccchhcccccchhhhhhhhh Confidence 3333444433322111000 000 000001111000000000000111112 24 Q ss_pred CchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccC Q lcl|NC_019527. 111 RPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVP 190 (516) Q Consensus 111 ~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~P 190 (516) +.++++||+..++-+.=+|+.+..+ .+. .+.|...|++-++.....++.+...+||.|++++.- +.... T Consensus 53 ~n~~~~ivd~~~~~l~~~g~~~~~d--~~~------~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~-~~~~~-- 121 (480) T protein:vir:78 53 PGWVATYLRTLSDRLDIEGFRISED--SEG------LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSH-PDVES-- 121 (480) T ss_pred cchHHHHHHHHHhhhccCceecCCC--chh------HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeec-Ccccc-- Confidence 5778999999999887778765321 111 244666777778899999999999999999987753 11000 Q ss_pred cccccccccccceeeEEeecceeecccccccc--ccc--------cccccCcceeEE--eee------------------ Q lcl|NC_019527. 191 LILDPRTIKKGSLTGFSNIEPMWTSPSAYNAL--DPT--------APDFYKPSTWWV--LGR------------------ 240 (516) Q Consensus 191 l~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~--dp~--------s~~yg~P~~y~v--~g~------------------ 240 (516) .+ ..|.. .+.+++|.++.+..-... .+. ..+.+.+.++.+ .+. T Consensus 122 --~d----~~~~~-~i~~~~p~~~~~i~D~~~~~~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~ 194 (480) T protein:vir:78 122 --GD----PAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDG 194 (480) T ss_pred --CC----CCCee-EEEEEcccceEEEEcCCCccceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccc Confidence 00 00111 244455555443211000 000 001111111110 000 Q ss_pred -Ee-cc-c--eEEEecCCcchhhhhhccCCCCchHHHH-HHHHHHHHHHHHHHHHHHHHHhCCceeee-c--chhhhcCc Q lcl|NC_019527. 241 -EM-HA-S--RLLTIITRPLPDMLKPAYNFSGISMSQL-AQPYVENWLRTRQSVSDLVDKFSRTFLKT-N--MAQVLNGG 311 (516) Q Consensus 241 -~i-H~-S--Rli~~~~~~~p~~~k~~~~~~G~S~le~-~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~-~--~~~~l~~~ 311 (516) .+ |. - -|+.|.++ ......+|.|.++. +.+.+.+++++.......+..++...+.+ + ....... T Consensus 195 ~~~~~~~g~vPvv~f~n~------~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~- 267 (480) T protein:vir:78 195 DVIKHGLGVVPVVPLTND------PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTND- 267 (480) T ss_pred cccccCCCCcceEEeecc------cccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhCCCccccccc- Confidence 00 10 0 11112111 11234579998875 77888888888877766665555443322 1 1111111 Q ss_pred cHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEe-cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHH Q lcl|NC_019527. 312 EGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVN-TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEI 390 (516) Q Consensus 312 ~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~-~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~ 390 (516) .... .+... +..+..+.+++-++-+++ .++.+..+.+.....++++.+++|...|-| ++.. ++||+.=. T Consensus 268 ~~~~------~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~-~~~n-~~Sg~Al~ 337 (480) T protein:vir:78 268 GENT------TLDIY--YGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSS-SSEN-PASAEAII 337 (480) T ss_pred cccc------hhhhh--hhhhccCCCCCceEEecCccCHHHHHHHHHHHHHHHhcccCCCHHHhcc-ccCc-hhHHHHHH Confidence 0101 11111 111223332222333332 234456667778888999999999866633 3221 25665432 Q ss_pred HHHHHHH--HHHHHHHHHHHHHHHHHHHHHHhCCCcCC---cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcC--CC Q lcl|NC_019527. 391 RSFYDDI--SSVQQSYYFSPLDTMLKVIQLSKWGEIDD---AITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNS--VI 463 (516) Q Consensus 391 ~~yyd~I--~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~---d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g--vi 463 (516) ..+...+ ...++..++..|.+++++++.-..+..+. ++++.|.+-...+..+.|+...+. +++| ++ T Consensus 338 ~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl-------~~~g~~~~ 410 (480) T protein:vir:78 338 ATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKL-------YANGQGPI 410 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHH-------HHhcccCC Confidence 2232222 23344567889999999887765555433 588999999999999877765543 3333 44 Q ss_pred CHHHHHHHHHhhhccCCCCCChh-hhcc---cc-----------ccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 464 DPSEARQQLSDDPDSGWDNIDGD-LEIV---QP-----------EMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 464 ~~~e~r~~l~~~~~~~~~~~d~~-~e~~---~~-----------e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) +.+.+++.| +|..-+.+ ++.. +. +...+...++.+++.+++.+...+ T Consensus 411 s~et~~~~l------g~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (480) T protein:vir:78 411 PKEQARIDL------GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPS 472 (480) T ss_pred CHHHHHhcC------CCCHhHHHHHHHHHHHHHHHHHHHhhccccCCCccccCCCCCCCCCccCCCcc Confidence 444444433 11110000 0000 00 000000111222222222222222 No 157 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=99.23 E-value=7.3e-11 Score=76.16 Aligned_cols=393 Identities=12% Similarity=0.059 Sum_probs=193.4 Q ss_pred cCCCccchhc--ccccccchhhhcccccCCcccccccCcccHHHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccc Q lcl|NC_019527. 63 VPAGTTPAVA--MDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTK 139 (516) Q Consensus 63 ~~~~~~~~~a--~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~ 139 (516) +....-..+. .+... ..+.....++-|-+.--.. ..-.+..+..+ .+.+++.||+..+.-++.+++++++.++.. T Consensus 1 l~~~~l~~~i~~~~~~~-~r~~~l~~yy~g~~~il~~-~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~ 78 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFN-LSYSAYKQLYEGDHAILQQ-KQKEQYKPDNRLVVNFAKYIVDTFNGYFIGVPVQTSHENKQV 78 (429) T ss_pred CCHHHHHHHHHHHHHHH-HHHHHHHHHhccccccccc-cccccCCCcceeecchHHHHHHHHhhhhcccCceeecCChHH Confidence 1101000010 00000 0111111122111100000 00000001111 457899999999999999999998865432 Q ss_pred hhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccc Q lcl|NC_019527. 140 AKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAY 219 (516) Q Consensus 140 ~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~ 219 (516) .+.|+..+++-++...+.++.+....||.|++++..+... .+ .+.+++|.++.|..- T Consensus 79 -------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g---------------~~-~~~~~~p~~~~~v~d 135 (429) T protein:vir:98 79 -------SNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENA---------------EA-GITYLTPLEAFIVYD 135 (429) T ss_pred -------HHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCC---------------cE-EEEEEcccceEEEEe Confidence 2346667777789999999999999999999888664321 11 144555555554321 Q ss_pred cccccccccccCcceeEEeee-----EeccceEEEec---------C------CcchhhhhhccCCCCchHHHHHHHHHH Q lcl|NC_019527. 220 NALDPTAPDFYKPSTWWVLGR-----EMHASRLLTII---------T------RPLPDMLKPAYNFSGISMSQLAQPYVE 279 (516) Q Consensus 220 ~~~dp~s~~yg~P~~y~v~g~-----~iH~SRli~~~---------~------~~~p~~~k~~~~~~G~S~le~~~~~l~ 279 (516) ...+ ..+.++- .+|...+. .+-..++..+. . ..+|-. .-.++.+|.|.++.+.+.+. T Consensus 136 d~~~-~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv-~~~n~~~g~sd~e~v~~liD 212 (429) T protein:vir:98 136 DSIR-QKPLFAV-RYFYNKGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMI-EYVENEERQSLLASVVTLIN 212 (429) T ss_pred CCCC-CceEEEE-EEEEecCceEEEEEEeCceEEEEEecCCceEecccccccCCccceE-EecCCCCCCCcHHHHHHHHH Confidence 1111 0111110 01111000 00011111110 0 001111 11235689999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecC---CcceeEE--ecccCCHHH Q lcl|NC_019527. 280 NWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFD---SEDIVQV--NTPLSGLAD 354 (516) Q Consensus 280 ~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~---~e~~e~~--~~~lsgl~d 354 (516) +++.+....+..+..++..++...... .+++. .+.+ +. .+++.+..+ +.+++.+ ..+.+++.. T Consensus 213 ~~d~~~s~~~~~~~~~~~p~~~i~g~~----~~~~~-~~~~------~~-~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 280 (429) T protein:vir:98 213 AFNKAISEKANDVEYFADAYLKILGAE----LDDET-LKSL------RD-TRIINLKDTDAQQLTVEFLQKPDADATQEH 280 (429) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecCC----CCcch-hhhH------hh-CceeeccCCCCCCcceeEEeecCCHHHHHH Confidence 999998888887777777665542211 11111 1111 11 233333221 1234434 566677888 Q ss_pred HHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHh---CCCc-CCc Q lcl|NC_019527. 355 LQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFY---DDISSVQQSYYFSPLDTMLKVIQLSK---WGEI-DDA 427 (516) Q Consensus 355 ~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yy---d~I~~~Qe~~l~p~l~~l~~~l~~s~---~g~~-~~d 427 (516) .++.+.+.|...+++|..- ++. .| |+||+.=...+. ..++.+ +..++..+++++++++.-. .+.. ..+ T Consensus 281 ~~~~l~~~i~~~s~~p~~~-~~~--~g-n~Sg~Al~~~~~~l~~k~~~~-~~~~~~~l~~~~~li~~~~~~~~~~~d~~~ 355 (429) T protein:vir:98 281 LLDRLENLIFRTAMVANIS-DES--FG-TASGIALRYRLQAMDNLAKTK-ERKFMSGMNRRYKLIASYPTSKIGPKDWIG 355 (429) T ss_pred HHHHHHHHHHHHhCccccC-ccc--cc-cchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhccCCCcccccc Confidence 8999999999999999632 221 12 456653323232 233333 4567888888888776532 1111 236 Q ss_pred ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchh-cCCCCCCCC Q lcl|NC_019527. 428 ITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDD-DGADPYMPD 506 (516) Q Consensus 428 ~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~-e~~~~~~~~ 506 (516) ++++|++....++++.|++..+. +|++|.+.+.+.|.. .+.-+.+.+....|..+. +........ T Consensus 356 i~v~f~~~~p~~~~~~a~~~~kl---------~g~is~et~~~~l~~-----v~d~~~E~~ri~~E~~~~~~~~~~~~~~ 421 (429) T protein:vir:98 356 IKYKFTRNLPANLLEESQIAGNL---------AGIVSEETQVGVLSI-----VENPQKEIERKNSDKSTLISRQAGGLNG 421 (429) T ss_pred ceEEeCCCCCcCHHHHHHHHHHH---------hccCchHHHHHhCCC-----CCCHHHHHHHHHHHHHHHHHHHHhhhcC Confidence 89999999999999988766543 478887777665511 010011111111111111 110111111 Q ss_pred CCCCCCCC Q lcl|NC_019527. 507 PDVLPGEE 514 (516) Q Consensus 507 ~~~~~~~e 514 (516) .+.+...| T Consensus 422 ~~~~~~~~ 429 (429) T protein:vir:98 422 QNTTTILE 429 (429) T ss_pred CCCCCCCC Confidence 11111122 No 158 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.23 E-value=4e-10 Score=72.12 Aligned_cols=430 Identities=12% Similarity=0.066 Sum_probs=197.7 Q ss_pred hhHHHH-hH-HhhcC------CCccccccCCCCCCCccCCCccchhccccc-------------ccchhhhcccccCCcc Q lcl|NC_019527. 34 MRRAVM-KS-MERRA------SDAATKWAPPQLMPGVVPAGTTPAVAMDSL-------------CGPTYQFLNSAAGGLY 92 (516) Q Consensus 34 ~~~~~~-~~-~~~~~------~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~-------------~~~~~~~~~~~~~~~~ 92 (516) +...+. .+ ..... ..+-..|--+. ...+++|.. ......-...++.|-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~ 70 (501) T protein:vir:96 1 MEQTLFTDSTGQERVLNLRFHRESRIRYRADN----------LEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGEN 70 (501) T ss_pred CceeeeeecccceeccccccchhHHhhhcccc----------cccccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 111111 01 11100 01111121111 011222210 0011111112222211 Q ss_pred c--ccccCcccHHHHHHH-HhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHh Q lcl|NC_019527. 93 A--ADIQPFPGYQNLAAL-ATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEH 169 (516) Q Consensus 93 ~--~~~~~f~gy~ll~~y-~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~ 169 (516) . ......... ..... ..+.+++.||+..+.-++.+++++++.++++.+. ..+.|...+++-++...+.++.+. T Consensus 71 ~~i~~~~~~~~~-~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~~~~~~~---~~~~l~~~~~~n~~~~~~~~~~~~ 146 (501) T protein:vir:96 71 HDVLKSGRRKDN-EMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNDDNSQ---NDDAIKRIGRINDLDSLNRTLIRD 146 (501) T ss_pred CcccCccccCcc-ccccceeecchHHHHHHHHhhhhcccCeeEeeCCccchhH---HHHHHHHHHHhcCHHHHHHHHHHH Confidence 0 000001110 01111 2568899999999999999999999887655432 234566677777899999999999 Q ss_pred cccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee-------eEe Q lcl|NC_019527. 170 DCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG-------REM 242 (516) Q Consensus 170 ~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g-------~~i 242 (516) ..+||.|++++..+... .+ .+.+++|.++.|..-+. ....+.++- .+|.+.. ..| T Consensus 147 ~~~~G~a~~~v~~dedg---------------~~-~i~~~~p~~~~~v~d~~-~~~~~~~~v-~~~~~~~~~~~~~~~~v 208 (501) T protein:vir:96 147 LSQTGRAYEVIYRSEYD---------------ET-RIKRLSPLETFVIYDNS-LEDNSIAAV-RYYNRGTLQSAKDVVEI 208 (501) T ss_pred HhhcCeEEEEEEEcCCC---------------ce-EEEEEccceeEEEEcCC-CCCceEEEE-EEEEeecCCCcEEEEEE Confidence 99999999888764321 11 14455555555432110 001111110 1111100 011 Q ss_pred -ccceEEEecCC--------------cchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhh Q lcl|NC_019527. 243 -HASRLLTIITR--------------PLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQV 307 (516) Q Consensus 243 -H~SRli~~~~~--------------~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~ 307 (516) .+.++.++... .+|-. .-.++-+|.|.++.+.+.+.+++.+....+.-+..++..++....... T Consensus 209 yt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv-~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~ 287 (501) T protein:vir:96 209 YTDEHIYTLDASDDFNEISVTTHAFGTVPIT-EYLNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLA 287 (501) T ss_pred EcCCcEEEEeeCCCceeccccccCCCccceE-EecCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccc Confidence 12233333211 01111 112345799999999999999999998888888777777665432211 Q ss_pred hcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeecccccccccc Q lcl|NC_019527. 308 LNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNAS 385 (516) Q Consensus 308 l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnat 385 (516) ...+....-.+....+.. ...+.......+.+++.+ +.+.+++...++.+.+.|...+++|-.-+-+. |-|.| T Consensus 288 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~---~~n~S 362 (501) T protein:vir:96 288 LPKGMQASDMKRTRLMQL--KPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNF---SGNTS 362 (501) T ss_pred cCcccchhhhhhcCeeee--cccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccc---cccch Confidence 111111111111111110 001111111112234443 45667899999999999999999996444221 22456 Q ss_pred chHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHh--CC--Cc--CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 386 SEGEIRSF---YDDISSVQQSYYFSPLDTMLKVIQLSK--WG--EI--DDAITFKFKSLWQTSAKEESEIRFNKAQEAQI 456 (516) Q Consensus 386 ge~D~~~y---yd~I~~~Qe~~l~p~l~~l~~~l~~s~--~g--~~--~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~ 456 (516) |..=...| ...+ ..++..++..+++++++++.-. .+ .- ..++++.|++-...+.++.|++..+.+ T Consensus 363 g~Al~~~~~~l~~ka-~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~----- 436 (501) T protein:vir:96 363 GEALKYKLFGLDQDR-VDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLG----- 436 (501) T ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh----- Confidence 65322222 2233 3445677888888887765421 11 11 136889999999999999888765543 Q ss_pred HHHcCCCCHHHHHHHH-------------HhhhccC-CCCCChhhhccccccchhcCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 457 YITNSVIDPSEARQQL-------------SDDPDSG-WDNIDGDLEIVQPEMFDDDGADPYMPDPDVLPGEEG 515 (516) Q Consensus 457 ~~~~gvi~~~e~r~~l-------------~~~~~~~-~~~~d~~~e~~~~e~~~~e~~~~~~~~~~~~~~~e~ 515 (516) |+||.+.+.+.+ .+..... ......+......+..+ +.++. +.++..++.- T Consensus 437 ----g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~-~~~e~---~~d~~e~~~~ 501 (501) T protein:vir:96 437 ----GQVSQETALSLSGLVESPNEELDKINKEMSEIDFKGYSNDFNEHVGKYTD-EVKET---HTDDFEREYE 501 (501) T ss_pred ----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHhhccccccchhhcccccCC-cCCCC---CCCccccccC Confidence 455554444433 2111000 00000000000000000 00000 0000000000 No 159 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.22 E-value=1.2e-10 Score=75.01 Aligned_cols=435 Identities=10% Similarity=0.057 Sum_probs=196.2 Q ss_pred HHhHHhhcCCCcc---ccccCCCCCCCccC-CCccchhccc------------ccccchhhhcccccCCccccccc-Ccc Q lcl|NC_019527. 38 VMKSMERRASDAA---TKWAPPQLMPGVVP-AGTTPAVAMD------------SLCGPTYQFLNSAAGGLYAADIQ-PFP 100 (516) Q Consensus 38 ~~~~~~~~~~~~~---~~~~~~~~~~gv~~-~~~~~~~a~d------------s~~~~~~~~~~~~~~~~~~~~~~-~f~ 100 (516) +.+.-++...... ..|.-|.-...+-. .+...-..++ ........-...++.|.+.--.. ... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~ 80 (512) T protein:vir:97 1 MLKANEFETDTDLRENRNYLFNDEANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR 80 (512) T ss_pred CccceeccCceeeeeCceeeeccccccccccCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc Confidence 1111111111000 11222211111100 0000000000 00011111111222222110000 000 Q ss_pred cHHHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEE Q lcl|NC_019527. 101 GYQNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQIS 179 (516) Q Consensus 101 gy~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~ 179 (516) ....-...+ .+.+++.||+..+.-++.+++.+++.++.. .+.|+..+++-++...+.++.+...+||.|+++ T Consensus 81 ~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~-------~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~ 153 (512) T protein:vir:97 81 KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDV-------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEL 153 (512) T ss_pred cccccCcceeecchHHHHHHHHhhhhcccCceeccCChHH-------HHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEE Confidence 000000011 357788999999999999999998765432 245777777778999999999999999999998 Q ss_pred EEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe-----------eeE-eccceE Q lcl|NC_019527. 180 INIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL-----------GRE-MHASRL 247 (516) Q Consensus 180 i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~-----------g~~-iH~SRl 247 (516) +..+... .+ .+.+++|..+.|..-+ .....+.++ ..+|.+. -.. +.+.++ T Consensus 154 vy~ded~---------------~~-~i~~~~p~~~~~iyd~-~~~~~~~~~-vr~~~~~~~~~~~~~~~~~~~vyt~~~i 215 (512) T protein:vir:97 154 MIRNQDD---------------ET-RLYKSDAMSTFVIYDN-TIERNSIAG-VRYLRTKPIDKTDEDEVFTVDLFTSHGV 215 (512) T ss_pred EEeCCCC---------------ce-EEEEEcccceEEEEcC-CCCCceEEE-EEEEEeeeccccccceEEEEEEEeCCcE Confidence 8774321 01 1344444444432110 000011111 0111110 000 112222 Q ss_pred EEecCCc--------------------chhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhh Q lcl|NC_019527. 248 LTIITRP--------------------LPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQV 307 (516) Q Consensus 248 i~~~~~~--------------------~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~ 307 (516) .+|.... +|-. .-.++-+|.|.++.+.+.+.+++.+....+.-+..++..++....... T Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv-~~~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~ 294 (512) T protein:vir:97 216 YRYLTSRTNGLKLTPRENGFESHSFERMPIT-EFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN 294 (512) T ss_pred EEEEecCCCcccccccccccccccCcccceE-eecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcc Confidence 2222110 1111 112345799999999999999999988888877777766665422111 Q ss_pred hcCccHHHHHHHHHHHHHhc--CCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeecccccccc Q lcl|NC_019527. 308 LNGGEGGDVFDRVEMYVNMQ--SNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLN 383 (516) Q Consensus 308 l~~~~~~~l~~r~~~~~~~~--sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln 383 (516) .................... .+.+...-..++.+++.+ +.+.++....++.+.+.|...+++|-.-. +. -+| | T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~-~~-~~g-n 371 (512) T protein:vir:97 295 LDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD-DN-FSG-T 371 (512) T ss_pred CCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCc-cc-ccc-c Confidence 11111111100000000000 011111111122334444 45678899999999999999999997533 21 122 4 Q ss_pred ccchHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH--hCCCc--C---CcceEEeCCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_019527. 384 ASSEGEIRSFYDD--ISSVQQSYYFSPLDTMLKVIQLS--KWGEI--D---DAITFKFKSLWQTSAKEESEIRFNKAQEA 454 (516) Q Consensus 384 atge~D~~~yyd~--I~~~Qe~~l~p~l~~l~~~l~~s--~~g~~--~---~d~~~~f~pL~~~sekEkAei~~~~a~a~ 454 (516) .||+.=...|... -...++..++..|++++++++.. ..+.. + .++++.|++-...+..+.+++..+. T Consensus 372 ~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl---- 447 (512) T protein:vir:97 372 QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS---- 447 (512) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHH---- Confidence 5666322222222 12344567888999988887543 12222 2 2688999999999999988765543 Q ss_pred HHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccc----h---hcCCCCCCC-------CCCCCCCCCC Q lcl|NC_019527. 455 QIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMF----D---DDGADPYMP-------DPDVLPGEEG 515 (516) Q Consensus 455 ~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~----~---~e~~~~~~~-------~~~~~~~~e~ 515 (516) .|+||.+.+.+.|.. .+....+++....|.. . ....++.+. +.++...++. T Consensus 448 -----~giiS~et~~~~l~~-----v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 448 -----GGKISQTTLMSLFSF-----FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 512 (512) T ss_pred -----hccCchHHHHHhCCC-----CCCHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCccccccccC Confidence 367777666655411 0110111111111100 0 000111110 0111111111 No 160 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=99.22 E-value=1.6e-10 Score=74.24 Aligned_cols=440 Identities=11% Similarity=0.011 Sum_probs=195.5 Q ss_pred hhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhC Q lcl|NC_019527. 32 LAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATR 111 (516) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~ 111 (516) |-+.-.+-+-+ ..|..-.+ ++- +...-.....-..+....+.+..+.+..|....+ ++- ....... T Consensus 1 ~~~~~~~~~~i--------~~w~~~~~-~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~-~~~--~~~~~~~ 66 (518) T protein:vir:78 1 MGVWSVMTRFI--------KGWLNGKP-NGS--EPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYV-PTV--HDKLMNS 66 (518) T ss_pred CcchhhHHHHH--------HHhhcCCC-Ccc--chhccHHHhhhcccchhhhhhhhhhhhhcccCCC-Ccc--ccccccC Confidence 22222221111 11221100 000 0000000000000000011111111111211110 111 1122255 Q ss_pred chhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcc--- Q lcl|NC_019527. 112 PEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVS--- 188 (516) Q Consensus 112 ~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~--- 188 (516) ++++.||+..|+-++.+..+|+..+.+..+++ ..-+.|+..++..+++..+.+++..+...|++++-+.++++.+. T Consensus 67 ~l~~~i~~~~A~ll~~e~~~i~v~~~~~~d~e-~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~i~~ 145 (518) T protein:vir:78 67 GTGNEIVVVAAEYISGKPLSIDVTGVNGSKDE-NLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINILNGRPSISV 145 (518) T ss_pred ChHHHHHHHHHHhhcCCCceEEecCccccCcH-HHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEECCeeEEEE Confidence 78999999999999998877765332222211 12355777888889999999999999988888865555443211 Q ss_pred -cCcccccccccccceeeEEeecceeeccc---ccc---ccccc-----cccccCcceeEEeeeEe-------------- Q lcl|NC_019527. 189 -VPLILDPRTIKKGSLTGFSNIEPMWTSPS---AYN---ALDPT-----APDFYKPSTWWVLGREM-------------- 242 (516) Q Consensus 189 -~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~---~~~---~~dp~-----s~~yg~P~~y~v~g~~i-------------- 242 (516) .+-.+-|.. ..|.+..+.-+... .... .|. ...+. .-.|+ .|.|...-+ T Consensus 146 v~ad~~~P~~-~~g~~~~~~f~~~~-~~~~k~~~y~~lE~he~~~~~~~~~~~~---~~~I~n~ly~~~~~~~v~~~~~~ 220 (518) T protein:vir:78 146 HSSSQFWIDF-KNNEPFRFNFFEEI-PTSNKADIYYLVESREIKQWDKEGKKLS---GGFVTYSVIKIDGDKTTPISAER 220 (518) T ss_pred EcCCeeEEEe-ecCcEEEEEEEEEe-ecCCcceeEEEEEeeccccccceeeccc---ceeEEEEEeeecCcccccccccc Confidence 110011100 11222222211100 0000 000 00000 00000 011111000 Q ss_pred ---------ccce----EEEecCCcchh--hh-------hhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCcee Q lcl|NC_019527. 243 ---------HASR----LLTIITRPLPD--ML-------KPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFL 300 (516) Q Consensus 243 ---------H~SR----li~~~~~~~p~--~~-------k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~ 300 (516) ++.. ..+..+.+.|. +. +.....+|+|++..+.+.+..++.+......-+......++ T Consensus 221 ~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~ 300 (518) T protein:vir:78 221 LPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEKTKTKIA 300 (518) T ss_pred cccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHhCCceee Confidence 0000 00111111111 00 11123579999999999999999999888877766554444 Q ss_pred ee-cchhh-hcCccHHHHHHHHHHHHHhcCCcceE-EEe----cC---CcceeEEecccC--CHHHHHHHHHHHHHhhhc Q lcl|NC_019527. 301 KT-NMAQV-LNGGEGGDVFDRVEMYVNMQSNLGLA-VMD----FD---SEDIVQVNTPLS--GLADLQSQSQEHMCSVSK 368 (516) Q Consensus 301 k~-~~~~~-l~~~~~~~l~~r~~~~~~~~sn~g~~-~id----~~---~e~~e~~~~~ls--gl~d~~~~~~~~iaaas~ 368 (516) -- .+... ...++.... ..+.. ..... .+. .. .+.+++++..+- .....++.+...+...+| T Consensus 301 v~~~~l~~~~~~~~~~~~----~~fd~---~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G 373 (518) T protein:vir:78 301 ASERMFRKKVNKSTDKEE----WSMNV---DEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSG 373 (518) T ss_pred echhHhccCCCCCCCccc----cccCC---CCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhC Confidence 32 12111 111111100 00100 11111 111 11 123666665543 456667778888888888 Q ss_pred CCceeeeccccccccccchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC---C---C--c--CCcceEEeCCC Q lcl|NC_019527. 369 IPAIKLTGISPSGLNASSEG---EIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKW---G---E--I--DDAITFKFKSL 435 (516) Q Consensus 369 IP~t~L~G~sp~Glnatge~---D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~---g---~--~--~~d~~~~f~pL 435 (516) ++..- ||.+. | ..|+.+ ....-|.++..+| ..++..|++|+..++.... + . . +.+++|.|++. T Consensus 374 ~s~~t-fg~~~-~-~~TATei~s~~~~~~~t~~~~~-~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~ 449 (518) T protein:vir:78 374 YNPAT-FNLGN-R-EVKATEIWSLQDATVRKIEKKK-RLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDP 449 (518) T ss_pred CChhh-cCccc-c-cccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCC Confidence 87654 46542 3 234432 2333567777777 4578888888777654321 1 1 1 23689999999 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 436 WQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGADPYMPDPDVLPGEEG 515 (516) Q Consensus 436 ~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~~~~~~~~~~~~~~e~ 515 (516) ...|+++++++. +.++.+|+++.+++.+.+ +.+.+++....+=+...+|...-..++|++..+.+. T Consensus 450 i~~D~~~~~~~~-------~~~v~aGimS~e~~i~~~-------~~~~~deea~~e~~ri~~E~~~~~~~~p~~~~g~~~ 515 (518) T protein:vir:78 450 MSVNLNELSSTL-------NNMNSALAMSVEEKVKLI-------HPKWEDEEIQAEVKRIYLENAIGEVPDPEAIGGMET 515 (518) T ss_pred CCCCHHHHHHHH-------HHHHhcCCCCHHHHHHHh-------CCCCCHHHHHHHHHHHHHHhcccCCCCCccccCCCC Confidence 999988877654 457888999998876654 233332211111111111211111222222222221 Q ss_pred C Q lcl|NC_019527. 516 S 516 (516) Q Consensus 516 t 516 (516) - T Consensus 516 ~ 516 (518) T protein:vir:78 516 K 516 (518) T ss_pred C Confidence 1 No 161 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=99.21 E-value=3.2e-11 Score=78.14 Aligned_cols=406 Identities=13% Similarity=0.045 Sum_probs=192.8 Q ss_pred hhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHH----hhcCCCc--cccccCCCCCCCccCCCccchhcccccccc Q lcl|NC_019527. 6 RKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSM----ERRASDA--ATKWAPPQLMPGVVPAGTTPAVAMDSLCGP 79 (516) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~--~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~ 79 (516) -| .+.+.+=.-|....+...++..+ ..+..+- ...|+... +..+.. T Consensus 1 ~~------------~~~~~~~~~p~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~----------~~i~~~------ 52 (453) T protein:vir:39 1 MK------------YKPPKLMTFPKDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGI----------MAIDAE------ 52 (453) T ss_pred Ce------------ecCCcceEcCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhcc----------CchhcC------ Confidence 00 00000000011111111111111 1110000 01111110 000000 Q ss_pred hhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcCh Q lcl|NC_019527. 80 TYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGV 159 (516) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~ 159 (516) ... -.+-...++ .+.+++.||+..+.-++.+++.+++.+++. .+.|...+++-++ T Consensus 53 ---------------~~~-~~~~~~~ki--~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~-------~~~l~~i~~~N~~ 107 (453) T protein:vir:39 53 ---------------PTK-DLWKPDNRL--TVNFTKYIVDTFTGYFNGIPVKKSHSDKET-------LSKLQEFDNLNDM 107 (453) T ss_pred ---------------CCc-cccCcccee--ecchHHHHHHHHhhhhcccCceeccCChHH-------HHHHHHHHHhcCh Confidence 000 000000011 246799999999999999999998765432 3557788888899 Q ss_pred hHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee Q lcl|NC_019527. 160 MGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG 239 (516) Q Consensus 160 ~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g 239 (516) ...+.++.+.+..||.|++++..+... .+ .+.+++|.++.|..-+..+ ..+.|+. .+|...+ T Consensus 108 ~~~~~~~~~~~~~~G~~~~~v~~d~~g---------------~~-~i~~~~p~~~~~v~d~~~~-~~~~~~i-r~~~~~~ 169 (453) T protein:vir:39 108 EDEESELAKMACIYGRAFELLYQNEET---------------QT-NVIYNTPENMFMVYDDTIK-QEPLFAV-RYGYDDD 169 (453) T ss_pred hHHHHHHHHHHhhcCeEEEEEEecCCC---------------ce-EEEEEcccceEEEecCCCC-CeEEEEE-EEEEeCC Confidence 999999999999999999887664321 11 1444455554443211000 0111110 0110000 Q ss_pred e----Ee-ccceEEEecCCc---------------chhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce Q lcl|NC_019527. 240 R----EM-HASRLLTIITRP---------------LPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF 299 (516) Q Consensus 240 ~----~i-H~SRli~~~~~~---------------~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v 299 (516) . .+ -+.++.+|.... +|-.. -.++-+|.|.++.+.+.+.+++.+....+.-+..++..+ T Consensus 170 ~~~~~~~yt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~-~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~ 248 (453) T protein:vir:39 170 YKLYGEVYTKETTYALNGTMGFYNMTEQAPNPFDDLPVVE-FYFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQY 248 (453) T ss_pred eEEEEEEEeCCeEEEEEecCCceeeecccccCCCceeEEE-ecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCce Confidence 0 01 122222222111 11110 123457999999999999999999888887777777665 Q ss_pred eeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEe-----cCCcceeE--EecccCCHHHHHHHHHHHHHhhhcCCce Q lcl|NC_019527. 300 LKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMD-----FDSEDIVQ--VNTPLSGLADLQSQSQEHMCSVSKIPAI 372 (516) Q Consensus 300 ~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id-----~~~e~~e~--~~~~lsgl~d~~~~~~~~iaaas~IP~t 372 (516) +...... +... .+. .+. . .+++.+. .++.++.. .+.+.+++...++.+.+.|...+++|-. T Consensus 249 ~~~~g~~-~~~~---~~~-~~~------~-~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~ 316 (453) T protein:vir:39 249 LTFLGAA-VEEE---DLK-NIR------S-NRVINYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANI 316 (453) T ss_pred eeeecCC-CCch---hhh-hhh------h-cceeeecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc Confidence 5432211 1111 111 111 1 1111111 12233444 4456678888999999999999999953 Q ss_pred eeeccccccccccchHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHh--CCCc--CCcceEEeCCCCCCCHHHHHH Q lcl|NC_019527. 373 KLTGISPSGLNASSEGEIRSFY---DDISSVQQSYYFSPLDTMLKVIQLSK--WGEI--DDAITFKFKSLWQTSAKEESE 445 (516) Q Consensus 373 ~L~G~sp~Glnatge~D~~~yy---d~I~~~Qe~~l~p~l~~l~~~l~~s~--~g~~--~~d~~~~f~pL~~~sekEkAe 445 (516) -. +. -| |+||+.=...+. ..+..+| ..+...+++++++++.-. .|.. ..+++|.|++-...+.++.|+ T Consensus 317 ~~-~~--~g-n~Sg~Al~~~~~~l~~ka~~~~-~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~ 391 (453) T protein:vir:39 317 SD-ES--FG-SSSGVSLAYKLQAMSNLALSFQ-RKFQSSLNSRYKLYCELSTNVSNKEAWKDIEYTFTRNEPKDIKEQAE 391 (453) T ss_pred cc-cc--cc-CChHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCcCHHHHHH Confidence 22 21 12 456664323232 3343333 456778888887765432 2222 247899999999999999888 Q ss_pred HHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccc-------hhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 446 IRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMF-------DDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 446 i~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~-------~~e~~~~~~~~~~~~~~~e 514 (516) +..+. +|+|+.+.+.+.+.. .+....+.+....|.. +.+....+..+..+..++| T Consensus 392 ~~~kl---------~g~is~et~l~~l~~-----v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 392 TANIL---------MGITSQETALSVISV-----IPDVQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVPETNEE 453 (453) T ss_pred HHHHH---------hccCChHHHHHhCCC-----CCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCcCCC Confidence 66543 467777777665411 0100111111111100 1111111111222222333 No 162 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=99.21 E-value=3.2e-10 Score=72.63 Aligned_cols=424 Identities=10% Similarity=-0.000 Sum_probs=183.9 Q ss_pred cccccCCCCCCCccCCCccchhc-ccccc---cchhhhcccccCCcccccccCcccHHHHHHH-HhCchhhhhhhhhhHH Q lcl|NC_019527. 50 ATKWAPPQLMPGVVPAGTTPAVA-MDSLC---GPTYQFLNSAAGGLYAADIQPFPGYQNLAAL-ATRPEYRAFASTLSTE 124 (516) Q Consensus 50 ~~~~~~~~~~~gv~~~~~~~~~a-~ds~~---~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y-~~~~i~r~iVd~~aed 124 (516) ..+-.-+ ..||+-.+. ...+. +-... .........++-|.+.....+-.--+-++.+ .-....++|||..++- T Consensus 1 ~~~~~~~-~~~gl~~~~-~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~r 78 (474) T protein:vir:81 1 MIQQQTV-RIPSLSNDE-NALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALARR 78 (474) T ss_pred CcCCCcC-cCCCCChhH-HHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHhh Confidence 1111111 123432221 11110 00000 0001111122222221111110000112233 2457789999999998 Q ss_pred HhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCccccccccccccee Q lcl|NC_019527. 125 LTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLT 204 (516) Q Consensus 125 ~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~ 204 (516) +.=+||.+....+ +. ..+.+.|.+-++.....++.+-+.+||-|++++.-++.....|+ T Consensus 79 l~~~Gf~~~d~~~-~~-------~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~------------- 137 (474) T protein:vir:81 79 CNLEGFVWPDGDL-DS-------LGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEAL------------- 137 (474) T ss_pred hcccceECCCCCc-cc-------hHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeE------------- Confidence 8889998643222 11 23566777788888889999999999999988754322111111 Q ss_pred eEEeecceeeccc-----------c-cccccc----ccccccCcce-eEEe---e-eEeccceEEEecCCcc-hhhhh-h Q lcl|NC_019527. 205 GFSNIEPMWTSPS-----------A-YNALDP----TAPDFYKPST-WWVL---G-REMHASRLLTIITRPL-PDMLK-P 261 (516) Q Consensus 205 ~l~v~d~~~v~p~-----------~-~~~~dp----~s~~yg~P~~-y~v~---g-~~iH~SRli~~~~~~~-p~~~k-~ 261 (516) +++++|.+++.. . ....|. ..-.++.|.. |++. + ......+.-|--|.|+ |...+ . T Consensus 138 -i~~~sp~~~~~~~D~~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvPvV~~~n~~~ 216 (474) T protein:vir:81 138 -IHVKDASEATGEWNRRRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVPAQVLPYKPA 216 (474) T ss_pred -EEEeccceEEEEEeCCCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcceEEeccccc Confidence 222222222210 0 000111 0111111111 1111 0 0001111111112221 11111 1 Q ss_pred ccCCCCchHH-HHHHHHHHHHHHHHHHHHHHHHHhCCce---eeecchhhhcCccH---HHHHHHHHHHHHhcCCcceEE Q lcl|NC_019527. 262 AYNFSGISMS-QLAQPYVENWLRTRQSVSDLVDKFSRTF---LKTNMAQVLNGGEG---GDVFDRVEMYVNMQSNLGLAV 334 (516) Q Consensus 262 ~~~~~G~S~l-e~~~~~l~~~~~~~~~~~~Ll~~~~~~v---~k~~~~~~l~~~~~---~~l~~r~~~~~~~~sn~g~~~ 334 (516) ....+|.|.+ +.++....++.++.-...--..-++.+. +..+... ....++ ..+...+..+...-.+....+ T Consensus 217 ~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~-~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~ 295 (474) T protein:vir:81 217 PKRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESA-LKNADGTIKSVWEARLGRIKGLPDDADADI 295 (474) T ss_pred ccCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhh-cccccccccchhhhhHHHHhcCCCcccccc Confidence 2233688866 6777777777777665443333333332 2222111 111111 123333333322222211111 Q ss_pred EecCCcceeEEe-cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHH---HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 335 MDFDSEDIVQVN-TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGE---IRSFYDDISSVQQSYYFSPLD 410 (516) Q Consensus 335 id~~~e~~e~~~-~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D---~~~yyd~I~~~Qe~~l~p~l~ 410 (516) .....-++.+++ +++.++.+.+.....+||+.++||..-| |...-..++|++.= .......++.+|+ .+...++ T Consensus 296 ~~~~~~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~~l-G~~~~~np~SaeAi~a~~~~l~~kae~k~~-~fg~~l~ 373 (474) T protein:vir:81 296 PQLARADVKQFPAASPDAHWSDINGLAKLFAREASLPDTAV-AISGLSNPTSAESYDASQYELIAEAEGAVD-DFTPALR 373 (474) T ss_pred cccccccccccCCCChhHHHHHHHHHHHHHHhhhCCCHHHh-cccccccccHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Confidence 111112445543 6677888899999999999999998655 42211223566543 3344555666664 4688899 Q ss_pred HHHHHHHHHhCCC----cCC---cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCC--CCHHHHHHHHHhhhccCCC Q lcl|NC_019527. 411 TMLKVIQLSKWGE----IDD---AITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSV--IDPSEARQQLSDDPDSGWD 481 (516) Q Consensus 411 ~l~~~l~~s~~g~----~~~---d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gv--i~~~e~r~~l~~~~~~~~~ 481 (516) +++++.+.-..+. +++ .+++.|.+...+|..++|+.. .+++++|. .+.+-.++.+ ||+ T Consensus 374 ~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~-------~Kl~~a~~~~~~~~~~~~~l------g~t 440 (474) T protein:vir:81 374 KAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAG-------MKQLAAVPWLAETEVGLELI------GLT 440 (474) T ss_pred HHHHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHH-------HHHHhcccCCCcHHHHHhhc------CCC Confidence 9988776655442 222 467899999999998875544 55566552 2233334333 222 Q ss_pred CCChhhhccccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 482 NIDGDLEIVQPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 482 ~~d~~~e~~~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) .- +++..+.+....+..++ ...--+..++.+| T Consensus 441 ~~--~i~~~~~~~~~~~~~~~-~~~l~~~~~~~~~ 472 (474) T protein:vir:81 441 PQ--QARRAMADKRRVQGRGT-LQALIDRSNNGAT 472 (474) T ss_pred HH--HHHHHHHHHHHHhHHHH-HHHHHhcCCCCCC Confidence 11 11111111111110000 0000000011111 No 163 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=99.20 E-value=1.9e-10 Score=73.92 Aligned_cols=450 Identities=12% Similarity=0.086 Sum_probs=213.6 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) ||+-++- .+. +++++.+.. +.+.......+++|..-+|... + +.-.++. T Consensus 1 ~~~~l~~--~~~---------~~~~d~~~~------------~e~~~~~~~s~~~p~~~dGa~~------i--~~~~~~~ 49 (521) T protein:vir:65 1 MFSRLKM--LAR---------WADFDNDKY------------EEQIKDKAESIAAPKNNDGATE------V--EINDNSP 49 (521) T ss_pred Cccchhh--hhh---------ccCchhhHH------------HhhhccCCCcccCCCCCCCcee------e--cccCCcc Confidence 6654321 110 111111111 1122222334566654444321 1 1000111 Q ss_pred hhhcccccCCcccccccCcccHHHHHHHH---hCchhhhhhhhhhHHHhh-----CCCeeeeccccchhhhHHH-HHHHH Q lcl|NC_019527. 81 YQFLNSAAGGLYAADIQPFPGYQNLAALA---TRPEYRAFASTLSTELTR-----EGIEITSKDRTKAKEMASK-IKELE 151 (516) Q Consensus 81 ~~~~~~~~~~~~~~~~~~f~gy~ll~~y~---~~~i~r~iVd~~aed~~r-----~~~~i~~~~~~~~~~~~~~-i~~i~ 151 (516) .++++++++.+...+.+....++|...|+ .++++..+|+.+++||+- ..+++...+.+-.+...++ ..+++ T Consensus 50 ~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~ 129 (521) T protein:vir:65 50 ASSWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFK 129 (521) T ss_pred ccccccceeeeccccchhhhHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHH Confidence 12222232222223333445688888886 699999999999999973 2333333322222211111 23344 Q ss_pred HHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeecccccccc-cc-c---- Q lcl|NC_019527. 152 EACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNAL-DP-T---- 225 (516) Q Consensus 152 ~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~-dp-~---- 225 (516) ..++-|++...-.+.+|.--+.|.-+.-..++ .++. .+|+.++.+||..+........ ++ . T Consensus 130 ~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid-~~pk------------~GI~ELr~lDPr~i~~vr~i~k~~~~~~~v~ 196 (521) T protein:vir:65 130 DLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIG-KNPK------------DGIVELRQLDPRNLEYVREIITEDTPEGKIY 196 (521) T ss_pred HHHHHhccchhhhHHHhhhhhcceeEEEEEEc-CCcc------------ccceeeeeeCCcceeeeeeecccccCCccee Confidence 44445577777777777777777766666666 4432 2345566666665554322111 00 0 Q ss_pred ---cc-cccCc--ceeEEee--------eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 226 ---AP-DFYKP--STWWVLG--------REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDL 291 (516) Q Consensus 226 ---s~-~yg~P--~~y~v~g--------~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~L 291 (516) .. .+|.| ..|...| .+|+.+=+...+..- .+...+.=+|.|+.+...+.+.-....+ -+ T Consensus 197 ~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~hSGl-----~d~~~~~i~syLhkAiKp~NQLkm~EDA--lV 269 (521) T protein:vir:65 197 KATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYAHSGL-----MDCDDKYIIGYLHRAVKPANQLKLLEDA--MV 269 (521) T ss_pred cceeeeeeeecCCcceeccceeecCCcceeechhheeeeeccc-----eeCCCCeeeecchhhhHhHHhhHHHHhh--HH Confidence 00 01111 1122222 345554444333221 1222333357787777666665444332 34 Q ss_pred HHHhC----CceeeecchhhhcCccHHHHHHHHHHHHHhcC------Ccce--------EEE---------ecCCcceeE Q lcl|NC_019527. 292 VDKFS----RTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQS------NLGL--------AVM---------DFDSEDIVQ 344 (516) Q Consensus 292 l~~~~----~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~s------n~g~--------~~i---------d~~~e~~e~ 344 (516) +++.+ -.|+=+|+.++-.. ..++..+- .++.+++ .+|- .++ ++.+-++++ T Consensus 270 IYRitRAPeRRvFYIDvGnlPk~-KAeqYl~~--im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItT 346 (521) T protein:vir:65 270 VYRITRAPERRVFFIDTGNMNNR-KAAQHMNS--VAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTT 346 (521) T ss_pred HHhhhccccceEEEEecCCCCch-hHHHHHHH--HHHhcCceeEeecccccccccccccchhhhhcccccCCCCccceee Confidence 45444 34555554443211 11121111 1111111 0110 000 011123444 Q ss_pred Ee--cccCCHHHHHHHHHHHHHhhhcCCceeeecccccccc--ccch--HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH Q lcl|NC_019527. 345 VN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSE--GEIRSFYDDISSVQQSYYFSPLDTMLKV-IQ 417 (516) Q Consensus 345 ~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge--~D~~~yyd~I~~~Qe~~l~p~l~~l~~~-l~ 417 (516) +. -+|+.++|+ ..|+..+-.+.++|.++|-.++.+|+| .++| -|.-.|..+|.+.|..+ ..++..+++. |. T Consensus 347 LpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF~KFI~rLR~rF-s~lf~~~L~~qLi 424 (521) T protein:vir:65 347 LPGASGMSDIDDI-RYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTLQSQF-SEVLRDPLKYNLI 424 (521) T ss_pred cccCCCcChHHHH-HHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhh Confidence 43 245556554 589999999999999999555556664 2332 15567999999998654 4555555442 11 Q ss_pred HH------hCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH-cC-CCCHHHHHHHHHhhhccCCCCCChhhhc Q lcl|NC_019527. 418 LS------KWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT-NS-VIDPSEARQQLSDDPDSGWDNIDGDLEI 489 (516) Q Consensus 418 ~s------~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~-~g-vi~~~e~r~~l~~~~~~~~~~~d~~~e~ 489 (516) +. .|-.+-+.+.|+|..=.--+|-..+|+...+.++++.+-. .| .+|.+-+++.+-... |+++.. T Consensus 425 lKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~k~ILr~t-------Deei~~ 497 (521) T protein:vir:65 425 LKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYT-------DDQMDT 497 (521) T ss_pred hhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccC-------HHHHHH Confidence 11 1222334688999988888899999999999988888643 23 568888877653321 233332 Q ss_pred cccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 490 VQPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 490 ~~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) ....+.+ |..++--++|++ .+++= T Consensus 498 ~~k~I~~-E~~~~~~~~p~~--~~~~f 521 (521) T protein:vir:65 498 EKKQIEE-EANDPRFKQTPD--EIEDF 521 (521) T ss_pred HHHHHHH-hhhCCCCCCCcc--cccCC Confidence 2222222 111111111111 11111 No 164 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.20 E-value=1.5e-10 Score=74.51 Aligned_cols=428 Identities=7% Similarity=-0.001 Sum_probs=196.0 Q ss_pred hhhHHHHhHHhhcCC-Ccc--ccccCCCC--CCCcc-----CCCccchh--ccccccc--chhhhcccccCCcccccccC Q lcl|NC_019527. 33 AMRRAVMKSMERRAS-DAA--TKWAPPQL--MPGVV-----PAGTTPAV--AMDSLCG--PTYQFLNSAAGGLYAADIQP 98 (516) Q Consensus 33 ~~~~~~~~~~~~~~~-~~~--~~~~~~~~--~~gv~-----~~~~~~~~--a~ds~~~--~~~~~~~~~~~~~~~~~~~~ 98 (516) +..-+++....+..- ..- -+|-|--+ -.+++ .+.....+ ..+.... ........++-|-+.-.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~ 80 (492) T protein:vir:97 1 MQFIQLISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEP 80 (492) T ss_pred ChHHHHHHHHHHHHhcCCceeeccchhhhhHhhhcccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccc Confidence 223333332221110 000 01111000 00000 00000000 0000000 00111112222211000000 Q ss_pred ---ccc---HHHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcc Q lcl|NC_019527. 99 ---FPG---YQNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDC 171 (516) Q Consensus 99 ---f~g---y~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~r 171 (516) ..+ +...+..+ .+.+++.||+..+.-++.+++++++.++.. .+.|+..++ =+....+.++.+... T Consensus 81 ~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~-------~~~l~~~~~-n~~~~~~~~~~~~~~ 152 (492) T protein:vir:97 81 KPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEV-------VKRIDEVLG-NRFDDKLHSVLTGAS 152 (492) T ss_pred ccccccccccccccccccccchHHHHHHHHhhhhcccCceeccCchHH-------HHHHHHHHh-ccHHHHHHHHHHHHh Confidence 001 11111111 368899999999999999999997765432 233444443 367788889999999 Q ss_pred cceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee---eEe-ccceE Q lcl|NC_019527. 172 FFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG---REM-HASRL 247 (516) Q Consensus 172 lyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g---~~i-H~SRl 247 (516) .||.|++++..+... .+ .+.+++|..+.|..-+. ....+.++ -.+|.... ..+ .+.++ T Consensus 153 ~~G~a~~~v~~d~dg---------------~~-~~~~~~p~~~~~i~d~~-~~~~~~~~-vr~~~~~~~~~~~~y~~~~v 214 (492) T protein:vir:97 153 NKGIEWLHPYLDEEG---------------EF-KLFRVPAEQGIPIWTDK-EHEELEAF-IRMYKLENETKVEYWDKVTV 214 (492) T ss_pred hcCeEEEEEEecCCC---------------ce-EEEEEcccceEEEEcCC-CCCceEEE-EEEEeeccceeEEEEecCeE Confidence 999999888764321 11 14445555444421100 00001111 00111110 000 11111 Q ss_pred EEec----------------------C---CcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeee Q lcl|NC_019527. 248 LTII----------------------T---RPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKT 302 (516) Q Consensus 248 i~~~----------------------~---~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~ 302 (516) .+|. . ..+|-.. -.++-+|.|.++.+.+.+.+++.+....+..+..++..++.. T Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~-~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~ 293 (492) T protein:vir:97 215 NYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIP-FKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL 293 (492) T ss_pred EEEEEecCeeeecccccccccccccccCCCCCcceEE-ecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeee Confidence 1110 0 0111111 122456999999999999999998888888778777776654 Q ss_pred cchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeeccccc Q lcl|NC_019527. 303 NMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPS 380 (516) Q Consensus 303 ~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~ 380 (516) .... ..+.....+.+. ..+++.++. +.+.+.+ +.+.+++...++.+.+.|...+++|-.-+ + +-+ T Consensus 294 ~g~~---~~~~~~~~~~~~-------~~~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~-~-~~~ 360 (492) T protein:vir:97 294 KNYD---DQELPEFKRLLR-------YYGAIKVSD-NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSS-D-KFG 360 (492) T ss_pred ecCC---cccchhHHHHHh-------hccceecCC-CCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCc-c-ccc Confidence 3211 111112211111 122333333 3444544 45667889999999999999999996432 1 112 Q ss_pred cccccchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhCCCcC--CcceEEeCCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 381 GLNASSEGEIRSFYDDIS--SVQQSYYFSPLDTMLKVIQLSKWGEID--DAITFKFKSLWQTSAKEESEIRFNKAQEAQI 456 (516) Q Consensus 381 Glnatge~D~~~yyd~I~--~~Qe~~l~p~l~~l~~~l~~s~~g~~~--~d~~~~f~pL~~~sekEkAei~~~~a~a~~~ 456 (516) | |.||+.=...|..... ...+..++..+++++++++.. +|..+ .++++.|++-...+++|.|++..+. T Consensus 361 ~-n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~-~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl------ 432 (492) T protein:vir:97 361 S-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEH-FDIKGEHKDVDISFNYNKVANTELQVQTAQQS------ 432 (492) T ss_pred c-CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hcCCcccceeeEEecCCCCCCHHHHHHHHHHH------ Confidence 2 4566642222322222 344566788999999887653 34332 4788999999999999987766553 Q ss_pred HHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccc-------hhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 457 YITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMF-------DDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 457 ~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~-------~~e~~~~~~~~~~~~~~~e~t 516 (516) .|+||.+.+.+.+... +..+.+.+....|.. +..+.......+++.++.+.. T Consensus 433 ---~G~iS~et~l~~l~~v-----~d~~~Eleri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (492) T protein:vir:97 433 ---MGIVSHETVLENHPFV-----EDLQAELERIEQEQTEYNKQLPNLDDGGADSAQQQERSNNKES 491 (492) T ss_pred ---hccCchHHHHHhCCCC-----CCHHHHHHHHHHHHHHHHHhhhccccCCCCCCccccccccccc Confidence 3777777666654110 100111111111100 000000001111111111111 No 165 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=99.20 E-value=1.4e-10 Score=74.57 Aligned_cols=394 Identities=8% Similarity=0.004 Sum_probs=197.7 Q ss_pred cCCCc-cchhcccccccchhhhcccccCCcccccccC---ccc--H-HHHHHHH-hCchhhhhhhhhhHHHhhCCCeeee Q lcl|NC_019527. 63 VPAGT-TPAVAMDSLCGPTYQFLNSAAGGLYAADIQP---FPG--Y-QNLAALA-TRPEYRAFASTLSTELTREGIEITS 134 (516) Q Consensus 63 ~~~~~-~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~---f~g--y-~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~ 134 (516) +.... ..-+...............++.|-+.-.... ... . .....++ .+.+++.||+..+.-++.+.+++++ T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~~ 80 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLFDI 80 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheecccceeec Confidence 11110 1101000000111111222232221100000 000 0 0011122 3788999999999999999999977 Q ss_pred ccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceee Q lcl|NC_019527. 135 KDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWT 214 (516) Q Consensus 135 ~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v 214 (516) .++.+.. +.++. +.+=++.....++.+....||.|++++.++...... ...+|.++ +.+++|.++ T Consensus 81 ~~~~~~~------~~~~~-~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~-------~~~~~~~~-~~~i~p~~~ 145 (451) T protein:vir:10 81 DNNKELN------EKVTD-VLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGE-------QVTNQTFK-YGVVNTEEI 145 (451) T ss_pred CCcHHHH------HHHHH-HhccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccc-------ccccccee-EEEEcccce Confidence 6543321 22332 323367888889999999999999988775432111 11123332 666777777 Q ss_pred ccccccccccccccccCcceeEE-------------eeeEec-cceEEEecCC-------------------cchhhhhh Q lcl|NC_019527. 215 SPSAYNALDPTAPDFYKPSTWWV-------------LGREMH-ASRLLTIITR-------------------PLPDMLKP 261 (516) Q Consensus 215 ~p~~~~~~dp~s~~yg~P~~y~v-------------~g~~iH-~SRli~~~~~-------------------~~p~~~k~ 261 (516) .|...+..+ ..+.+.- .+|.. .-..++ ..++..+... .+|-. .- T Consensus 146 ~~vydd~~~-~~~~~~i-r~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv-~~ 222 (451) T protein:vir:10 146 IPIYRNGIE-RELEAVI-RYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFV-EF 222 (451) T ss_pred EEEEcCCCC-CceEEEE-EEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEE-Ee Confidence 654321110 0111110 01110 000111 1222222110 00111 01 Q ss_pred ccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEec---- Q lcl|NC_019527. 262 AYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDF---- 337 (516) Q Consensus 262 ~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~---- 337 (516) .++-.|.|.++.+.+.+.+++.+....+..+..++..+++..... .....+....+. ..++..+.. T Consensus 223 ~nn~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~~-------~~~~i~~~~~~~~ 292 (451) T protein:vir:10 223 SNNIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFG---GEDTSEFLKELK-------RYKTIKTETDSEG 292 (451) T ss_pred ccCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC---cccchhhHHHHh-------hCCeEEecCcCCc Confidence 123458899999999999999998888887777777666543211 111112222211 122333321 Q ss_pred CCcceeE--EecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHH--HHHHHHHHHHHHHHHH Q lcl|NC_019527. 338 DSEDIVQ--VNTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDI--SSVQQSYYFSPLDTML 413 (516) Q Consensus 338 ~~e~~e~--~~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I--~~~Qe~~l~p~l~~l~ 413 (516) ++.+.+. ...+.++....++.+.+.|...+++|-.-. + . .| |+||..=...|.... ...++..++..+++++ T Consensus 293 ~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~-~-~g-n~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~ 368 (451) T protein:vir:10 293 DSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQQDT-E-N-FG-NASGVALKFFYRKLELKSGLLETEFRTSFDKLI 368 (451) T ss_pred cCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCcccccc-c-c-cc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1223444 445677889999999999999999996321 1 1 13 567763223332221 2344566788999999 Q ss_pred HHHHHHhCCCc-CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCC-hhhh--c Q lcl|NC_019527. 414 KVIQLSKWGEI-DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNID-GDLE--I 489 (516) Q Consensus 414 ~~l~~s~~g~~-~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d-~~~e--~ 489 (516) ++++.- +|.. +.++++.|++-...+++|.+++..+ + .|+||.+.+...+ +.++ .+.+ . T Consensus 369 ~li~~~-~~~~d~~~i~i~f~~~~p~n~~e~~~~~~k-------l--~g~iS~et~~~~~--------p~v~d~~~e~~~ 430 (451) T protein:vir:10 369 KAILYF-LGVTDYKKIQQTYTRNMMSNDLEDADIATK-------S--VGIIPTKIILRHH--------PWVDDVEEAEKL 430 (451) T ss_pred HHHHHH-hCCCCccceeEEecCCCCCCHHHHHHHHHH-------H--hccCchHHHHHhC--------CCCCCHHHHHHH Confidence 888753 3443 4689999999999999998876655 3 3788888877664 2222 1111 1 Q ss_pred cccccc-h---hcCCCCCCCC Q lcl|NC_019527. 490 VQPEMF-D---DDGADPYMPD 506 (516) Q Consensus 490 ~~~e~~-~---~e~~~~~~~~ 506 (516) ..++.. . ..+.-++.++ T Consensus 431 ~~ee~~~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 431 YLEEKKIQASKVSDDYNNFTE 451 (451) T ss_pred HHHHHHHHHHHHHhhcCCCCC Confidence 100000 0 0011111222 No 166 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=99.19 E-value=5e-10 Score=71.59 Aligned_cols=412 Identities=11% Similarity=0.070 Sum_probs=189.6 Q ss_pred CCCCCccCCCccchhcccccccc------------hhhhcccccCCccccc-----ccCcccHHHHH----HHH-hCchh Q lcl|NC_019527. 57 QLMPGVVPAGTTPAVAMDSLCGP------------TYQFLNSAAGGLYAAD-----IQPFPGYQNLA----ALA-TRPEY 114 (516) Q Consensus 57 ~~~~gv~~~~~~~~~a~ds~~~~------------~~~~~~~~~~~~~~~~-----~~~f~gy~ll~----~y~-~~~i~ 114 (516) ++++=. ... -.....+... ....+..++.|-+.-- ....-+..... -.+ .+.++ T Consensus 1 ~~~~~~--~~~--~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~ 76 (537) T protein:vir:78 1 MTSPLL--NKP--IDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFF 76 (537) T ss_pred CCcccc--ccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchH Confidence 222111 000 0011111100 0001111222211000 00000010000 011 35788 Q ss_pred hhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccc Q lcl|NC_019527. 115 RAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILD 194 (516) Q Consensus 115 r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld 194 (516) +.||+..+.-++-+.+++++.++... +..+.|+..++ =+....+.+..+....||.|+.++-++... T Consensus 77 k~Ivd~~~~yl~G~Pv~~~~~d~~~~----e~~~~l~~~~~-~~~~~~~~el~~~~s~~G~ay~~~y~de~~-------- 143 (537) T protein:vir:78 77 TELVDQLAQYLLSNGVEVKVKDEDNT----QLDEILQEYFD-EDFQATIDTLVTNASKKGFEGIFARTTSEG-------- 143 (537) T ss_pred HHHHHHHhhhhcccCceeecCcchhH----HHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeeEEEeeecCCC-------- Confidence 99999999999999999988655332 22333443332 355667788888899999999888775432 Q ss_pred cccccccceeeEEeecceeeccccccccccc--cc--------------------cccCcc---eeEEee---------- Q lcl|NC_019527. 195 PRTIKKGSLTGFSNIEPMWTSPSAYNALDPT--AP--------------------DFYKPS---TWWVLG---------- 239 (516) Q Consensus 195 ~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~--s~--------------------~yg~P~---~y~v~g---------- 239 (516) .++ +.+++|.++.|..-+..+.. .. +++.+. +|.-.+ T Consensus 144 -------~~~-~~~i~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~ 215 (537) T protein:vir:78 144 -------KLK-FQTVDGLTLIPVFDDYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLD 215 (537) T ss_pred -------ceE-EEEEccceeEEEEcCCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCccccccccc Confidence 011 23333333333211000000 00 000000 000000 Q ss_pred eEeccceEEEe---cC-------------------CcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019527. 240 REMHASRLLTI---IT-------------------RPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSR 297 (516) Q Consensus 240 ~~iH~SRli~~---~~-------------------~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~ 297 (516) ..+....+-++ .. ..+|-+ .-.++-.|.|.++.+...+.+|+......+.-+..++. T Consensus 216 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv-~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~ 294 (537) T protein:vir:78 216 EAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQ-LLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSE 294 (537) T ss_pred ccccccccceeeeccccccccccccccccccccCCcceeEE-EeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcC Confidence 00000000000 00 001111 01234468899999999999999999999998888887 Q ss_pred ceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEe--cccCCHHHHHHHHHHHHHhhhcCCceeee Q lcl|NC_019527. 298 TFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLT 375 (516) Q Consensus 298 ~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~ 375 (516) .++.+..... .+..+..+.+ + ..++..+++++.+++.++ .+..+....++.+.+.|-..+..|-+ T Consensus 295 ~ilvi~g~~~---~~~~~~~~~l------~-~~~~i~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~--- 361 (537) T protein:vir:78 295 AIYVVKGFSG---DSTDKLRQNI------K-AKKMIGVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNS--- 361 (537) T ss_pred ceeeeecCCC---ccchhHHHHH------h-hcCceeecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCC--- Confidence 7776532211 1111222211 1 234445555555555554 55567888899988888877776653 Q ss_pred ccccccccccchHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHh--CC--Cc-CCcceEEeCCCCCCCHHHHHHHH Q lcl|NC_019527. 376 GISPSGLNASSEGEIRSFYDDI---SSVQQSYYFSPLDTMLKVIQLSK--WG--EI-DDAITFKFKSLWQTSAKEESEIR 447 (516) Q Consensus 376 G~sp~Glnatge~D~~~yyd~I---~~~Qe~~l~p~l~~l~~~l~~s~--~g--~~-~~d~~~~f~pL~~~sekEkAei~ 447 (516) ...-.| |+||. .++.-|..+ ....+..++..|++++++|+... -| .. ..++.|.|++-...+++|.|++. T Consensus 362 ~~~~~g-n~SGv-Alk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~ 439 (537) T protein:vir:78 362 TAVGDG-NVTNV-VIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATTR 439 (537) T ss_pred cccccc-CCcHH-HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHHH Confidence 222223 46665 443332222 23445667888888888776432 22 12 24789999999999999988766 Q ss_pred HHHHHHHHHHHHcCCCCHHHHHHHHH------------hhhccCCCCCC----hhhhccccccchh-c------------ Q lcl|NC_019527. 448 FNKAQEAQIYITNSVIDPSEARQQLS------------DDPDSGWDNID----GDLEIVQPEMFDD-D------------ 498 (516) Q Consensus 448 ~~~a~a~~~~~~~gvi~~~e~r~~l~------------~~~~~~~~~~d----~~~e~~~~e~~~~-e------------ 498 (516) .+ +++.|++|.+.+.+.+. ...+-.+..+. +..+...+..++. + T Consensus 440 ~~-------l~~~giiS~eT~l~~~p~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (537) T protein:vir:78 440 KT-------EAETEALKIGNIMTVAPRIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQP 512 (537) T ss_pred HH-------HHhcCcchHHHHHHhCCCCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCC Confidence 54 45666666655544331 11000111111 0000000000000 0 Q ss_pred --CCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 499 --GADPYMPDPDVLPGEEGS 516 (516) Q Consensus 499 --~~~~~~~~~~~~~~~e~t 516 (516) +.+.+.++|..-|..+-+ T Consensus 513 ~~d~~~~~~~~~~~~~~~~~ 532 (537) T protein:vir:78 513 PVDPNQPVADPNVVPPTDPN 532 (537) T ss_pred CCCccCCCCCCCCCCCCCCc Confidence 000111122222222222 No 167 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=99.18 E-value=1.3e-10 Score=74.85 Aligned_cols=444 Identities=12% Similarity=0.026 Sum_probs=199.9 Q ss_pred CCCChhhhHHHHhHHh--hcCCCccccccCCCCCCCccCCCc---cchh-cccccccchhhhcccccCCccc-c-cccCc Q lcl|NC_019527. 28 KARKLAMRRAVMKSME--RRASDAATKWAPPQLMPGVVPAGT---TPAV-AMDSLCGPTYQFLNSAAGGLYA-A-DIQPF 99 (516) Q Consensus 28 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~gv~~~~~---~~~~-a~ds~~~~~~~~~~~~~~~~~~-~-~~~~f 99 (516) -..+.-.-..+..-.- +-...+-..|--+..-.-. .+.. ...+ .+.........-...++.|-.. . ..... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~ 79 (501) T protein:vir:27 1 MEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELM-VNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRR 79 (501) T ss_pred CCceeEEeccchhhhhhcccChhHHHhhccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCcc Confidence 0001111101110000 0011111222222111100 0000 0000 0000000111111122222110 0 00001 Q ss_pred ccHHHHHHH-HhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEE Q lcl|NC_019527. 100 PGYQNLAAL-ATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQI 178 (516) Q Consensus 100 ~gy~ll~~y-~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i 178 (516) ... ..... ..+.+++.||+..+.-++.+++++++.+++..+ ...+.|...+++-++...+.++.+...+||.|++ T Consensus 80 ~~~-~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~---~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~ 155 (501) T protein:vir:27 80 KDR-EMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNDNNS---QNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYE 155 (501) T ss_pred Ccc-ccccceeccchHHHHHHHHhhhhcccCeeEecCCccchH---HHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEE Confidence 000 00111 246889999999999999999999987765443 2234566677788999999999999999999999 Q ss_pred EEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee-------eEec-cceEEEe Q lcl|NC_019527. 179 SINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG-------REMH-ASRLLTI 250 (516) Q Consensus 179 ~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g-------~~iH-~SRli~~ 250 (516) ++..+... . + .+.+++|.++.|..-+. ....+.++ -.+|.+.. ..|+ +.++.+| T Consensus 156 ~vy~ded~--~-------------~-~i~~~~p~~~~~v~d~~-~~~~~~~~-ir~~~~~~~~~~~~~~~vyt~~~v~~~ 217 (501) T protein:vir:27 156 VIYRNEYD--E-------------T-RIKRLNPLETFVIYDNS-LEDNSIAA-VRYYNRGTLQNAKDVVEIYTNEHIYTL 217 (501) T ss_pred EEEeCCCC--c-------------e-EEEEEccceeEEEecCC-CCCceEEE-EEEEEeeecCCcEEEEEEEeCCeEEEE Confidence 88775321 1 1 14445555555432110 00111111 01111100 0111 2222222 Q ss_pred cCC--------------cchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHH Q lcl|NC_019527. 251 ITR--------------PLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDV 316 (516) Q Consensus 251 ~~~--------------~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l 316 (516) ... .+|-.. -.++-+|.|.++.+.+.+.+++.+....+.-+..++..++.+........+....- T Consensus 218 ~~~~~~~~~~~~~~~~g~vPvv~-~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~ 296 (501) T protein:vir:27 218 DASDDFNEISVTTHAFGTVPITE-FLNNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASD 296 (501) T ss_pred EeCCceeeccccccCCCcccEEE-ecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhh Confidence 211 111111 12345799999999999999999998888878877777666532211111111111 Q ss_pred HHHHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHH Q lcl|NC_019527. 317 FDRVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFY 394 (516) Q Consensus 317 ~~r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yy 394 (516) .+....+... ..+.......+.+++.+ +.+.+++...++.+.+.|...+++|-.-. +. . +-|.||..=...|. T Consensus 297 ~~~~~~~~~~--~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~-~-~~n~Sg~Al~~~~~ 371 (501) T protein:vir:27 297 MKRTRLMQLK--PPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSD-TN-F-SGNTSGEALKYKLF 371 (501) T ss_pred hhhcCceeec--ccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCc-cc-c-ccCchHHHHHHHHH Confidence 1111111100 00100001112244444 44556889999999999999999996433 21 1 22456664222221 Q ss_pred H--HHHHHHHHHHHHHHHHHHHHHHHHh--CCCc----CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHH Q lcl|NC_019527. 395 D--DISSVQQSYYFSPLDTMLKVIQLSK--WGEI----DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPS 466 (516) Q Consensus 395 d--~I~~~Qe~~l~p~l~~l~~~l~~s~--~g~~----~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~ 466 (516) . .-.+.++..++..|++++++++.-. .+.. ..++++.|++-...+.+|.|++..+.+ |+|+.+ T Consensus 372 ~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl~---------g~iS~e 442 (501) T protein:vir:27 372 GLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLG---------GQVSQE 442 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh---------ccCcHH Confidence 1 2224455678888999888776532 1211 135889999999999999988766543 455554 Q ss_pred HHHHHHHhhhccCCCCCChhhhccccc------------cchh----cCCCCCCCCCCCCCCCC Q lcl|NC_019527. 467 EARQQLSDDPDSGWDNIDGDLEIVQPE------------MFDD----DGADPYMPDPDVLPGEE 514 (516) Q Consensus 467 e~r~~l~~~~~~~~~~~d~~~e~~~~e------------~~~~----e~~~~~~~~~~~~~~~e 514 (516) .+.+.+.. ....+.+.+....| ..+. .+..+.....+.++..| T Consensus 443 t~l~~l~~-----v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d~~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 443 TALSLSGL-----VESPNEELDKINKEVSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFERAYE 501 (501) T ss_pred HHHHhCCC-----CCCHHHHHHHHHHHHHhhhHhhhcCccccccccccCCCCCCccccccccCC Confidence 44443210 00000000000000 0000 00000000111111111 No 168 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=99.18 E-value=9.3e-12 Score=81.06 Aligned_cols=336 Identities=10% Similarity=0.056 Sum_probs=155.2 Q ss_pred HhhcCCCccccccCCCCCCC---ccCCCc-------------cchhcccccccchhhhcccccCCcccccccCcccHHHH Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPG---VVPAGT-------------TPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNL 105 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~g---v~~~~~-------------~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll 105 (516) |.++..+...+..++..... ..++.. .+.-.+|.. ..+.+..-...+..+....++ +.|+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fg~p~~~~~~~--~~~~~~~~~~~~~~~~~pi~~--~~la 76 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPTEHHTDRAAQAEVFSFGDPVEVLDRR--ELLDYVECMRMGQWYEPPMPW--DGLA 76 (368) T ss_pred CCccccccchhccCcccccccccCcchhhccccCceEEEEcCCceeecchh--hHHHHHHHHhccchhccCcCH--HHHH Confidence 21111111111111100000 000000 000001110 000001001111110111111 1233 Q ss_pred HHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCC Q lcl|NC_019527. 106 AALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGA 185 (516) Q Consensus 106 ~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~ 185 (516) .+|+.++....++..-.. ++...+ .-..+--+..|..++..-.++|.|++++..+. T Consensus 77 ~~~~~~~~h~~~~~~~~n-~l~l~~----------------------~Pn~~~t~~~f~~l~~d~ll~Gnay~~~~r~~- 132 (368) T protein:vir:79 77 RSFRAAAHHSSAVYVKRN-ILVSTF----------------------IPHPLLSRATFERLVLDWQVFGNAYLERRENV- 132 (368) T ss_pred HHHhhccccchhhhhhcc-hhhhhc----------------------CCCcCCCHHHHHHHHHHHhhcCCeEEEEEEcC- Confidence 444433322222211100 111111 01111123345555555578899998875532 Q ss_pred CcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe----eeEeccceEEEecCCcchhhhhh Q lcl|NC_019527. 186 DVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL----GREMHASRLLTIITRPLPDMLKP 261 (516) Q Consensus 186 ~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~----g~~iH~SRli~~~~~~~p~~~k~ 261 (516) .|.+.+|.++.+.+|.... |. . .+|++. ...+.+..|||+.... + T Consensus 133 --------------~G~~~~L~~l~~~~v~~~~----~~--~-----~~~~~~~~~~~~~~~~~dIihir~~~------~ 181 (368) T protein:vir:79 133 --------------LGGTIRLDTPLAKYVRRGL----DL--N-----TYFFVQNWQQPYTFAAGSVFHLQEPD------I 181 (368) T ss_pred --------------CCCEEEEEEeCcccceeec----cC--C-----EEEEEecCCeEEEEccccEEEecCCC------C Confidence 2344567777776665321 11 0 122222 2457889999997532 2 Q ss_pred ccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecch-hhhcCccHHHHHHHHHHHHHhcCCcceEEEecC-- Q lcl|NC_019527. 262 AYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMA-QVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFD-- 338 (516) Q Consensus 262 ~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~-~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~-- 338 (516) ....+|+|.++.+...+..-..+......++.+....-.-..+. ..++.++.+.+.+.++. .....|.+.+++... T Consensus 182 ~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~~~l~~e~~~~lk~~~~~-~~G~~N~g~~~vl~~~g 260 (368) T protein:vir:79 182 NQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMTDAAQKQEDVDTLREAMKS-AKGPGNFRNLFMYAPNG 260 (368) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHHHH-hcCCcccCceeEecCCC Confidence 33568999999999999888888888788877766443222111 23444444556666654 234456665444321 Q ss_pred -CcceeEEecccCC----HHHHHHHHHHHHHhhhcCCceeeeccccccc--cccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 339 -SEDIVQVNTPLSG----LADLQSQSQEHMCSVSKIPAIKLTGISPSGL--NASSEGEIRSFYDDISSVQQSYYFSPLDT 411 (516) Q Consensus 339 -~e~~e~~~~~lsg----l~d~~~~~~~~iaaas~IP~t~L~G~sp~Gl--natge~D~~~yyd~I~~~Qe~~l~p~l~~ 411 (516) .+.++...++.+. +-++.....++||++.+||- .|+|..+.+. .++-+...+.| -++.|.|++++ T Consensus 261 ~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp-~llGi~~~~t~~~sn~e~~~~~f-------~~~~l~Pl~~~ 332 (368) T protein:vir:79 261 KKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPP-QLMGIIPNNTGGFGDVEKAAMVF-------ARNEVKPLQDR 332 (368) T ss_pred CccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCH-HHccccCCCCCccccHHHHHHHH-------HHHHHHHHHHH Confidence 2334444444432 23455666789999999997 5668765432 13334344444 33557888777 Q ss_pred HHHHHHHHhCCCcCCcceEEeCC--CCCCCHHHHHHHHHHHH Q lcl|NC_019527. 412 MLKVIQLSKWGEIDDAITFKFKS--LWQTSAKEESEIRFNKA 451 (516) Q Consensus 412 l~~~l~~s~~g~~~~d~~~~f~p--L~~~sekEkAei~~~~a 451 (516) +.++.- ..| . ..|.|++ |...+.+.+|+...+-| T Consensus 333 ie~ln~--~l~---~-e~~rF~~~~l~~~D~~a~a~~~~rsa 368 (368) T protein:vir:79 333 LLAIND--WIG---D-EVVRFAPYALGGHDQPAAAPGGQRSA 368 (368) T ss_pred HHHHHh--ccC---c-ceeeechhHhhcccccccCCcccccC Confidence 754321 111 1 2356665 77778888777665554 No 169 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.17 E-value=2.1e-10 Score=73.68 Aligned_cols=442 Identities=10% Similarity=0.011 Sum_probs=194.7 Q ss_pred CCChhhhHHHHhHHhhcCCCccccccCC-CCCCCccCC----C--ccchh-cccccccchhhhcccccCCcccccc-cCc Q lcl|NC_019527. 29 ARKLAMRRAVMKSMERRASDAATKWAPP-QLMPGVVPA----G--TTPAV-AMDSLCGPTYQFLNSAAGGLYAADI-QPF 99 (516) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~gv~~~----~--~~~~~-a~ds~~~~~~~~~~~~~~~~~~~~~-~~f 99 (516) --+..-.+.-+...+.... -+...+.- -..++..=+ . ....+ .+.............++.|.+.--. ... T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~ 79 (511) T protein:vir:93 1 MLKVNEFETDTDLRGNINY-LFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred Cccccchhhhhhhhhhhhh-hhhhhhCCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCc Confidence 0000000011111111000 01000000 000010000 0 00000 0000000001111112222111000 000 Q ss_pred ccHHHHHHH-HhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEE Q lcl|NC_019527. 100 PGYQNLAAL-ATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQI 178 (516) Q Consensus 100 ~gy~ll~~y-~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i 178 (516) ......+.. -.+.+++.||+..+.-++-+++.+++.++.. .+.|+..+++-++.....++.+...+||.|++ T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~ 152 (511) T protein:vir:93 80 RKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDV-------LEVIEAFNDLNDVESHNRSLGLDLSIYGKAYE 152 (511) T ss_pred CcccccCcceeecchHHHHHHHHhhhhcccCeeeccCChHH-------HHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEE Confidence 000000001 1347789999999999999999998765432 24567777778899999999999999999999 Q ss_pred EEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe-----------eeEe-ccce Q lcl|NC_019527. 179 SINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL-----------GREM-HASR 246 (516) Q Consensus 179 ~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~-----------g~~i-H~SR 246 (516) ++..+... . + .+.+++|.++.|..-+. ....+-++ ..+|.+. -..| .+.+ T Consensus 153 ~vy~de~~--~-------------~-~i~~~~p~~~~~vydd~-~~~~~~~~-vr~~~~~~~~~~~~~~~~~~~iyt~~~ 214 (511) T protein:vir:93 153 LMIRNQDD--E-------------T-RLYKSDAMSTFVIYDNT-IERNSIAG-VRYLRTKPIDKTDEDEVFTVDLFTSHG 214 (511) T ss_pred EEEeCCCC--c-------------e-EEEEEccceeEEEEcCC-CCCceEEE-EEEEEeeeccccccceEEEEEEEeCCc Confidence 88764321 0 1 13445555544431100 00011110 0111110 0011 1223 Q ss_pred EEEecCCc--------------------chhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchh Q lcl|NC_019527. 247 LLTIITRP--------------------LPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQ 306 (516) Q Consensus 247 li~~~~~~--------------------~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~ 306 (516) +.+|.... +|-. .-.++-+|.|.++.+.+.+.+++.+....+..+..++..++...... T Consensus 215 i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv-~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~ 293 (511) T protein:vir:93 215 VYRYLTSRTNGLKLTPRENGFESHSFERMPIT-EFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL 293 (511) T ss_pred EEEEEecCCCccccccccccccccCCCccceE-EecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCc Confidence 33321110 1111 11234579999999999999999999888888887776665532211 Q ss_pred hhcCccHHHHHH-HHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeecccccccc Q lcl|NC_019527. 307 VLNGGEGGDVFD-RVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLN 383 (516) Q Consensus 307 ~l~~~~~~~l~~-r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln 383 (516) ...........+ ++-.........+..+-..++.++..+ +.+.+++...++.+.+.|...+.+|-.-. + ..+| | T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~-~-~~~~-n 370 (511) T protein:vir:93 294 NLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD-D-NFSG-T 370 (511) T ss_pred ccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccc-c-cccc-c Confidence 111111111000 000000000000000111122344444 45677899999999999999999997533 2 1122 4 Q ss_pred ccchHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH--hCCCc--C---CcceEEeCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019527. 384 ASSEGEIRSF---YDDISSVQQSYYFSPLDTMLKVIQLS--KWGEI--D---DAITFKFKSLWQTSAKEESEIRFNKAQE 453 (516) Q Consensus 384 atge~D~~~y---yd~I~~~Qe~~l~p~l~~l~~~l~~s--~~g~~--~---~d~~~~f~pL~~~sekEkAei~~~~a~a 453 (516) .||..=...| ...+ ..++..++..|++++++|+.. .-+.. + .++++.|++-...+.+|.+++..+. T Consensus 371 ~Sg~Al~~~~~~l~~k~-~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--- 446 (511) T protein:vir:93 371 QSGEAMKYKLFGLEQRT-KTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--- 446 (511) T ss_pred chHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHH--- Confidence 5665322222 2233 344567888999988887642 11222 2 2579999999999999988765442 Q ss_pred HHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccch-------hcCCCC-------CCCCCCCCCCCCC Q lcl|NC_019527. 454 AQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFD-------DDGADP-------YMPDPDVLPGEEG 515 (516) Q Consensus 454 ~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~-------~e~~~~-------~~~~~~~~~~~e~ 515 (516) .|+||.+.+.+.+.. .+..+++++....|..+ ....++ +..+.++...++. T Consensus 447 ------~g~iS~et~~~~l~~-----v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 447 ------GGKISQTTLMSLFSF-----FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred ------hccCchHHHHHhCCC-----CCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCcccccccccC Confidence 467777666655411 01001111111111000 000011 1111111111111 No 170 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=99.17 E-value=1.3e-10 Score=74.74 Aligned_cols=409 Identities=13% Similarity=0.046 Sum_probs=194.7 Q ss_pred CCCccCCCccchhccccc-cc-----chhhhcccccCCccccc--------ccCc---------ccHHHHHHHH------ Q lcl|NC_019527. 59 MPGVVPAGTTPAVAMDSL-CG-----PTYQFLNSAAGGLYAAD--------IQPF---------PGYQNLAALA------ 109 (516) Q Consensus 59 ~~gv~~~~~~~~~a~ds~-~~-----~~~~~~~~~~~~~~~~~--------~~~f---------~gy~ll~~y~------ 109 (516) |--+ ..--+|. |. -..+-++..+..-...+ ...| +.|+-+..|+ T Consensus 1 ~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~ 73 (502) T protein:vir:48 1 MMEQ-------TLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHD 73 (502) T ss_pred Ccee-------EEEEecchhHHHhhcccChhHHhhhcccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc Confidence 1000 0001111 00 00011111111000000 0001 0122122222 Q ss_pred -----------------hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhccc Q lcl|NC_019527. 110 -----------------TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCF 172 (516) Q Consensus 110 -----------------~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rl 172 (516) .+.+++.||+..+.-++.+++.+++.++.+.+ ...+.|...+++-++...+.++.+.... T Consensus 74 i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~---~~~~~l~~~~~~N~~~~~~~~~~~~~~~ 150 (502) T protein:vir:48 74 VLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNEDNS---QNDDAIKRIGRINDIDTHNRNLIRDLSQ 150 (502) T ss_pred ccccccccccccccceeecchHHHHHHHHhhhhcccCeeEecCCccchh---HHHHHHHHHHhhcCHhHHHHHHHHHHhh Confidence 23677889999999999999999987765543 2234566677777899999999999999 Q ss_pred ceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe----e---eEec-c Q lcl|NC_019527. 173 FGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL----G---REMH-A 244 (516) Q Consensus 173 yG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~----g---~~iH-~ 244 (516) ||.|++++..+... .+ .+.+++|.++.|..-...+ ..+.++- .+|.+. + ..|| + T Consensus 151 ~G~a~~~v~~dedg---------------~~-~i~~~~p~~~~~vydd~~~-~~~~~~i-r~~~~~~~~~~~~~~~iyt~ 212 (502) T protein:vir:48 151 TGRAYEVIYRSEYD---------------ET-RIKRLSPLETFVIYDNSLE-DNSIAAV-RYYNRGTLQNAKDVVEIYTN 212 (502) T ss_pred cCeEEEEEEeCCCC---------------ce-EEEEEcccceEEEEcCCCC-CceEEEE-EEEEEeecCCcEEEEEEEeC Confidence 99999888764321 11 1444555555443211000 0011110 011110 0 0111 1 Q ss_pred ceEEEecCC--------------cchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcC Q lcl|NC_019527. 245 SRLLTIITR--------------PLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNG 310 (516) Q Consensus 245 SRli~~~~~--------------~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~ 310 (516) .++.++... .+|-.. -.++..|.|.++.+.+.+.+++.+....+.-+..++..++.......... T Consensus 213 ~~i~~~~~~~~~~~~~~~~~~~g~vPvv~-~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~ 291 (502) T protein:vir:48 213 QHIYTLDASDSFNEISVTPHAFGTVPITE-FLNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQ 291 (502) T ss_pred CeEEEEEeCCceeeccceecCCCccceEE-ecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccc Confidence 222222211 112111 12355799999999999999999999988888888877665432221111 Q ss_pred ccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchH Q lcl|NC_019527. 311 GEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEG 388 (516) Q Consensus 311 ~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~ 388 (516) +....-.+...++..... +..-...++-+++.+ +.+..++...++.+.++|...+++|-.-+ +.. +| |.||+. T Consensus 292 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~-~~~-~~-n~Sg~A 366 (502) T protein:vir:48 292 GMQASDMKRTRLMQLKPP--KSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSD-NHF-SG-NASGEA 366 (502) T ss_pred ccchhhhhhcceeecccc--ccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCc-ccc-cc-CchHHH Confidence 111111111111111000 000001122344444 45567888999999999999999996433 211 22 456664 Q ss_pred HHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHh--CCCc----CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 389 EIRSF---YDDISSVQQSYYFSPLDTMLKVIQLSK--WGEI----DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT 459 (516) Q Consensus 389 D~~~y---yd~I~~~Qe~~l~p~l~~l~~~l~~s~--~g~~----~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~ 459 (516) =...| ...+ ..++..++..+++++++++... .+.. ..++++.|++-...+.+|.|++..+. T Consensus 367 lk~~~~~l~~k~-~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--------- 436 (502) T protein:vir:48 367 LKYKLFGLDQDR-VDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNLPKSLYEQVSILNDL--------- 436 (502) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHH--------- Confidence 22222 2223 3345678888998888876532 1221 13588999999999999988766543 Q ss_pred cCCCCHHHHHHHHHhhhccCCCCCChhhhccccccch---------hcCCCCCCCCC-CCCCCCCCC Q lcl|NC_019527. 460 NSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFD---------DDGADPYMPDP-DVLPGEEGS 516 (516) Q Consensus 460 ~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~---------~e~~~~~~~~~-~~~~~~e~t 516 (516) +|+|+.+.+.+.+.. ....+.+.+....|..+ ..+....+.+. .+.+..++. T Consensus 437 ~g~iS~et~l~~l~~-----v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~~~~~~~ 498 (502) T protein:vir:48 437 GGQVSQETALSLSGL-----VENPTEELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKETHTDDFE 498 (502) T ss_pred hccCcHHHHHHhCCC-----CCCHHHHHHHHHHHHHhhhhhcccccccccccccCCCccCCCCcCcC Confidence 356666555554410 00000111110000000 00000011111 111111111 No 171 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=99.16 E-value=3.9e-10 Score=72.17 Aligned_cols=450 Identities=12% Similarity=0.084 Sum_probs=212.8 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) ||+-++- .+.+ ..++... .+.+.......++||...+|. ..-++-.+.. T Consensus 1 ~~~~l~~--~~~~---------~~~~~~~------------~~~~~~~~~~s~~~P~~~dGa--------~~i~~~~~~~ 49 (521) T protein:vir:81 1 MFSRLKM--LARW---------ADFDNDK------------YEEQIKDKAESIAAPKNNDGA--------TEVEINDNLP 49 (521) T ss_pred Ccchhhh--hHhh---------cCchhhh------------HHhhhccCccccccCCCCCCc--------eEecccCCCc Confidence 6654321 1000 0010000 011112223345666555554 1111111111 Q ss_pred hhhcccccCCcccccccCcccHHHHHHHH---hCchhhhhhhhhhHHHhh-----CCCeeeeccccchhhhHHH-HHHHH Q lcl|NC_019527. 81 YQFLNSAAGGLYAADIQPFPGYQNLAALA---TRPEYRAFASTLSTELTR-----EGIEITSKDRTKAKEMASK-IKELE 151 (516) Q Consensus 81 ~~~~~~~~~~~~~~~~~~f~gy~ll~~y~---~~~i~r~iVd~~aed~~r-----~~~~i~~~~~~~~~~~~~~-i~~i~ 151 (516) ...++++...+...+.+....++|...|+ .++++..+|+.+++||+- ..+++...+.+-.+...++ ..+++ T Consensus 50 ~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~ 129 (521) T protein:vir:81 50 ASAWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFK 129 (521) T ss_pred ceeecceeeeecccccchhhHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHH Confidence 12222222222222333445688888886 699999999999999973 2333333322222211111 13344 Q ss_pred HHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccc-c-ccc-- Q lcl|NC_019527. 152 EACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALD-P-TAP-- 227 (516) Q Consensus 152 ~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~d-p-~s~-- 227 (516) ..++-|++...-.+.+|.--+.|.-+.-..++ .++. .+|+.++.+||..+........+ . .-. T Consensus 130 ~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid-~~pk------------~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~ 196 (521) T protein:vir:81 130 DLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIG-KNPK------------DGIVELRQLDPRNLEYVREIITEDTPEGKIY 196 (521) T ss_pred HHHHHhccchhhhHHHhhhhhcceEEEEEEEc-CCcc------------ccceeeeeeCCcceeeeeeecccccCcccee Confidence 44444577777777777777777766666666 3332 23455666666655443221110 0 000 Q ss_pred ------cccCcc--eeEEee--------eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 228 ------DFYKPS--TWWVLG--------REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDL 291 (516) Q Consensus 228 ------~yg~P~--~y~v~g--------~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~L 291 (516) .+|.|. .|...| .+|+.+=+...+.. +.+...+.=+|.|+.+...+.+.-....+ -+ T Consensus 197 ~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~hSG-----l~d~~~~~i~syLhkAiKp~NQLkm~EDA--lV 269 (521) T protein:vir:81 197 KATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYAHSG-----LMDCDDKYIIGYLHRAVKPANQLKLLEDA--MV 269 (521) T ss_pred cceeeeeeeecCCccccccceeecCCcceeechhheeeeecc-----ceeCCCCeeeecchhhhHhHHhhHHHHhh--HH Confidence 011111 122222 34555444333221 11222233357787777666665444332 34 Q ss_pred HHHhC----CceeeecchhhhcCccHHHHHHHHHHHHHhcC------Ccce--------EEE---------ecCCcceeE Q lcl|NC_019527. 292 VDKFS----RTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQS------NLGL--------AVM---------DFDSEDIVQ 344 (516) Q Consensus 292 l~~~~----~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~s------n~g~--------~~i---------d~~~e~~e~ 344 (516) +++.+ -.|+=+|+.++-.. ..++..+- .++.+++ .+|- .++ ++.+-++++ T Consensus 270 IYRitRAPeRRvFYIDvGnlpk~-KAeqYl~~--im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItT 346 (521) T protein:vir:81 270 VYRITRAPERRVFFIDTGNMNNR-KAAQHMNS--VAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTT 346 (521) T ss_pred HHhhhccccceEEEEecCCCCch-hHHHHHHH--HHHhcCceeEeecccccccccccccchhhhhcccccCCCcccceee Confidence 45443 34555554443211 11121111 1111111 0110 000 011123444 Q ss_pred Ee--cccCCHHHHHHHHHHHHHhhhcCCceeeecccccccc--ccch--HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH Q lcl|NC_019527. 345 VN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSE--GEIRSFYDDISSVQQSYYFSPLDTMLKV-IQ 417 (516) Q Consensus 345 ~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge--~D~~~yyd~I~~~Qe~~l~p~l~~l~~~-l~ 417 (516) +. -+|+.++|+ ..|+..+-.+.++|.++|-.++.+|+| .++| -|.-.|..+|.+.|..+ ..++..+++. |. T Consensus 347 LpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rF-s~lf~~~L~~qLi 424 (521) T protein:vir:81 347 LPGASGMSDIDDI-RYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTRQSQF-SEVLRDPLKYNLI 424 (521) T ss_pred cccCCCCChHHHH-HHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhh Confidence 43 245556554 589999999999999999655556764 3333 15567999999998654 4555555442 11 Q ss_pred HH------hCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH-cC-CCCHHHHHHHHHhhhccCCCCCChhhhc Q lcl|NC_019527. 418 LS------KWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT-NS-VIDPSEARQQLSDDPDSGWDNIDGDLEI 489 (516) Q Consensus 418 ~s------~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~-~g-vi~~~e~r~~l~~~~~~~~~~~d~~~e~ 489 (516) +. .|-.+-+.+.|+|..=.--+|-..+|+...+.++++.+-. .| .++.+-+++.+-... |+++.. T Consensus 425 lKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~t-------Deei~~ 497 (521) T protein:vir:81 425 LKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYT-------DDQMDT 497 (521) T ss_pred hhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccC-------HHHHHH Confidence 11 1222334688999988888899999999999988887643 23 568888877653322 233332 Q ss_pred cccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 490 VQPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 490 ~~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) ....+.+ |..++--++|++ ++++= T Consensus 498 ~~k~I~~-E~~~~~~~~p~~--~~~~f 521 (521) T protein:vir:81 498 EKKQIEE-EANDPRFKQTPD--EIEDF 521 (521) T ss_pred HHHHHHH-HhhCCCCCCCcc--cccCC Confidence 2222222 111111111111 11111 No 172 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.15 E-value=6.5e-10 Score=70.95 Aligned_cols=422 Identities=13% Similarity=0.103 Sum_probs=179.0 Q ss_pred cCCCcC---CCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCccccccc Q lcl|NC_019527. 21 ARAEEQ---EKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQ 97 (516) Q Consensus 21 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~ 97 (516) -..++. ....+......|++....+..+ + +....++-|-+.-... T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~~~~~~r----------------------~----------~~~~~YY~G~~~i~~~ 48 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAFEDQNQN----------------------L----------RSNTSYYEAERRPEAI 48 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHHHHHHHH----------------------H----------HHHHHHHhccCchhhc Confidence 111111 0111111111222222211111 0 0000111111100000 Q ss_pred C-cccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceee Q lcl|NC_019527. 98 P-FPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRG 176 (516) Q Consensus 98 ~-f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a 176 (516) + ..--.+-.....+.++++|||..++-+.-+|+.+....+ . -+.+++.+++-++.....++.+...+||.| T Consensus 49 ~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~~--~------~~~l~~i~~~N~~d~~~~~~~~~a~i~G~a 120 (485) T protein:vir:24 49 GVTVPVQMQSLLAHVGYPRLYVDSIAERQAVEGFRLGDADE--A------DEELWQWWQANNLDIEAPLGYTDAYVHGRS 120 (485) T ss_pred CcccchhhhhhhhccchHHHHHHHHhhhhccCceecCCCch--h------HHHHHHHHHhcChhHHHHHHHHHHhhcCce Confidence 0 000011112234567899999999988778887543221 1 133566677767888899999999999999 Q ss_pred EEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccc-----------cc-----cccccccCcc-eeEE-- Q lcl|NC_019527. 177 QISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNA-----------LD-----PTAPDFYKPS-TWWV-- 237 (516) Q Consensus 177 ~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~-----------~d-----p~s~~yg~P~-~y~v-- 237 (516) ++++..+..... ... .++.. .+++++|.++.+..-.. .+ ...-.++.+. .|++ T Consensus 121 y~~v~~~~~~~~--~~~-----~~~~~-~i~~~~p~~~~~i~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~ 192 (485) T protein:vir:24 121 YITISRPDPQID--LGW-----DPNVP-LIRVEPPTRMYAEIDPRIGRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFR 192 (485) T ss_pred EEEEecCCcccc--ccc-----CCCcc-eEEEeccceeEEEeeCCcCceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEe Confidence 998866432110 000 00111 13344444443211000 00 0000001111 1111 Q ss_pred -eee------Eeccce---EEEecCCcchhhhhhccCCCCchHHHH-HHHHHHHHHHHHHHHHHHHHHhCCceeeecchh Q lcl|NC_019527. 238 -LGR------EMHASR---LLTIITRPLPDMLKPAYNFSGISMSQL-AQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQ 306 (516) Q Consensus 238 -~g~------~iH~SR---li~~~~~~~p~~~k~~~~~~G~S~le~-~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~ 306 (516) .+. .-|+-- |+.|.+++ .....+|.|.++. +.+.+.+++++.......+..++...+.+.... T Consensus 193 ~~~~~~~~~~~~h~~g~vPvv~f~n~~------~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~ 266 (485) T protein:vir:24 193 AEGEWVEWFSDPHGLGAVPVVPLPNRT------RLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIK 266 (485) T ss_pred cCCceEeecccccCCCcccEEEeccCc------ccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCC Confidence 110 112211 12222211 1223579998874 667777788887766555555554433221110 Q ss_pred --hhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEe-cccCCHHHHHHHHHHHHHhhhcCCceeeecccccccc Q lcl|NC_019527. 307 --VLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVN-TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLN 383 (516) Q Consensus 307 --~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~-~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln 383 (516) .....++.. ...+.. .+..++++.+++-++-+++ .++.+..+.+.....++++.+++|..-| |.+... + T Consensus 267 ~~~~~~~~~~~----~~~~~~--~~~~i~~~~~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~f-g~~~~n-~ 338 (485) T protein:vir:24 267 PEEIGVDPETG----QTLFDA--YLARILAFEDAEGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYL-STAADN-P 338 (485) T ss_pred ccccccccccc----cchhhh--cccceeccCCCCceEEeecccchHHHHHHHHHHHHHHhcccCCCHHHh-ccccCc-c Confidence 000001100 111111 1222333333223343332 2334455667777888999999998666 433221 2 Q ss_pred ccchHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHhC-CCcC---CcceEEeCCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 384 ASSEGEIRSF---YDDISSVQQSYYFSPLDTMLKVIQLSKW-GEID---DAITFKFKSLWQTSAKEESEIRFNKAQEAQI 456 (516) Q Consensus 384 atge~D~~~y---yd~I~~~Qe~~l~p~l~~l~~~l~~s~~-g~~~---~d~~~~f~pL~~~sekEkAei~~~~a~a~~~ 456 (516) +||+.=...+ -..++. ++..+.+.+++++.+++.... +..+ .++++.|.+-...|..+.|+...+.+++. T Consensus 339 ~Sg~Al~~~~~~l~~ka~~-~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g-- 415 (485) T protein:vir:24 339 ASAEAIRAAESRLIKKVER-KNAIFGGAWEEAMRLAYRLMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNG-- 415 (485) T ss_pred hHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcCCCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcc-- Confidence 5665322222 222333 345678899999988765432 2222 36889999999999999877665543321 Q ss_pred HHHcCCCCHHHHHHHHHhhhccCCCCCCh-hhh---------------ccccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 457 YITNSVIDPSEARQQLSDDPDSGWDNIDG-DLE---------------IVQPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 457 ~~~~gvi~~~e~r~~l~~~~~~~~~~~d~-~~e---------------~~~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) .|+++.+.+++.| +|..-+. .++ ..........+.+.+..+.++++..++. T Consensus 416 ---~~~~s~et~~~~l------~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~ 482 (485) T protein:vir:24 416 ---QGVIPRERARKDM------GYSIAEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPTPAPKPQPAIEGG 482 (485) T ss_pred ---cccCCHHHHHhhC------CCCHhHHHHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCCCCCCCccCCCCC Confidence 1245554444333 1111000 000 0000000000111111112222222222 No 173 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=99.15 E-value=5.6e-10 Score=71.32 Aligned_cols=368 Identities=13% Similarity=0.017 Sum_probs=182.8 Q ss_pred hhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccH----HHHHHHH Q lcl|NC_019527. 34 MRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGY----QNLAALA 109 (516) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy----~ll~~y~ 109 (516) +...++..+.++-.+-. ........++-|-+.. ...|. ++...|+ T Consensus 1 ~~~~~i~~L~~~~~~~~----------------------------~r~~~~~~yY~g~~~~---~~~~~~~p~~~~~~~~ 49 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHK----------------------------RRAEMRYDQYAMKYVD---RFKGITIPQALSQQYR 49 (409) T ss_pred CCHHHHHHHHHHHHHHh----------------------------HHHHHHHHHhcccCch---hhcChhhhHHHHHHHh Confidence 33333333321111000 0000000111111110 11111 1233343 Q ss_pred -hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcc Q lcl|NC_019527. 110 -TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVS 188 (516) Q Consensus 110 -~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~ 188 (516) -..+.++||+..++-+.=+||+. . + ..+++-|.+-++.....++.+-+.+||-|++++.-+ .+. T Consensus 50 ~v~nw~~~iVds~a~rl~~~Gf~~--~---d--------~~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~-~dg- 114 (409) T protein:vir:94 50 SILGWCAKGVDSLADRLVFREFEN--D---D--------FTVNEIFEENNPDIFFDSAVLSSLIASCSFTYISKG-END- 114 (409) T ss_pred hhcchhHHHHHHhHhhcccCcccC--C---c--------hHHHHHHHhcChhHHHHHHHHHHHHhcceeEEEecC-CCC- Confidence 33678999999998776677752 1 1 125566777778888889999999999999887532 221 Q ss_pred cCc--ccccccccccceeeEEeecceeeccc---cccccc----cccccccCcce-eE---Eeee---Eeccc---eEEE Q lcl|NC_019527. 189 VPL--ILDPRTIKKGSLTGFSNIEPMWTSPS---AYNALD----PTAPDFYKPST-WW---VLGR---EMHAS---RLLT 249 (516) Q Consensus 189 ~Pl--~ld~~~I~~g~l~~l~v~d~~~v~p~---~~~~~d----p~s~~yg~P~~-y~---v~g~---~iH~S---Rli~ 249 (516) +|. .++|. ..+.++||..=.|. .++..| +..-.++.|.. |+ ..+. .-|+- =+++ T Consensus 115 ~~~i~~~sp~-------~~~~i~D~~~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~ 187 (409) T protein:vir:94 115 AVRLQVIEAV-------NATGIIDPITGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGHPLLVP 187 (409) T ss_pred ceEEEEeccc-------eEEEEEecCCCceeeeEEEEEecCCCceEEEEEEecCcEEEEEecCceeEeeeCCCCCcceEE Confidence 221 01111 11222232110000 011100 00111111111 11 1110 11221 1223 Q ss_pred ecCCcchhhhhhccCCCCchHH-HHHHHHHHHHHHHHHHHHHHHHHhCCceeee-cchhhhcCccHHHHHHHHHHHHHhc Q lcl|NC_019527. 250 IITRPLPDMLKPAYNFSGISMS-QLAQPYVENWLRTRQSVSDLVDKFSRTFLKT-NMAQVLNGGEGGDVFDRVEMYVNMQ 327 (516) Q Consensus 250 ~~~~~~p~~~k~~~~~~G~S~l-e~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~-~~~~~l~~~~~~~l~~r~~~~~~~~ 327 (516) |.+++ .....+|.|.+ +.++....++.++......-..-++.+...+ ++... +.....+...+.. T Consensus 188 f~n~~------~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d--~~~~~~~~~~~~~----- 254 (409) T protein:vir:94 188 IIHRP------DAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDD--AEPMETWKATVSS----- 254 (409) T ss_pred ecccc------ccccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCC--CcccchhhhhHHH----- Confidence 33221 12346899977 6788888888888766554444444443332 21110 0011122222221 Q ss_pred CCcceEEEe----cCCcceeEEe-cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHH---HHHHHHHHH Q lcl|NC_019527. 328 SNLGLAVMD----FDSEDIVQVN-TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEI---RSFYDDISS 399 (516) Q Consensus 328 sn~g~~~id----~~~e~~e~~~-~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~---~~yyd~I~~ 399 (516) +..+. +++-++.+++ +++.++-+.+.....++|+.++||..-|-|.+ . .++|++.=. ......++. T Consensus 255 ----i~~~~~d~dg~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~-~-NpsSa~Al~a~~~~L~~~a~~ 328 (409) T protein:vir:94 255 ----MLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVS-D-NPSSVEAIKASHENLRLAGRK 328 (409) T ss_pred ----hhcCCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHhcccc-C-chhHHHHHHHHHHHHHHHHHH Confidence 22221 1112344443 46677788899999999999999987775543 1 236666432 233345555 Q ss_pred HHHHHHHHHHHHHHHHHHHHhCC--CcCC---cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcC--CCCHHHHHHHH Q lcl|NC_019527. 400 VQQSYYFSPLDTMLKVIQLSKWG--EIDD---AITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNS--VIDPSEARQQL 472 (516) Q Consensus 400 ~Qe~~l~p~l~~l~~~l~~s~~g--~~~~---d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g--vi~~~e~r~~l 472 (516) +|+ .+...++++.++++.-..+ ..++ ++++.|.|+...+..+.|+ .|+++.+++++| +.+.+.+++.| T Consensus 329 k~~-~fg~~~~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~~~~~~~~a~----~aDa~~Kl~~ag~~~~~~~~~~~~l 403 (409) T protein:vir:94 329 AQR-SLGAGLLNVAYLAACLRDDAPYLREQFRKTKPKWEPLFEADASMLSL----IGDGAIKLNQAIPEFINKDTIRDLT 403 (409) T ss_pred HHH-HHHHHHHHHHHHHHHHhCCCCccccccccceEEeccCCCcchHHHHH----HHHHHHHHHHhcccccchhHHHHHc Confidence 554 4678888888876544333 3344 4688999887777666654 568889999998 55667788777 Q ss_pred HhhhccCCCCCC Q lcl|NC_019527. 473 SDDPDSGWDNID 484 (516) Q Consensus 473 ~~~~~~~~~~~d 484 (516) ||+.-| T Consensus 404 ------G~~~~d 409 (409) T protein:vir:94 404 ------GIEGGE 409 (409) T ss_pred ------CCCCCC Confidence 555433 No 174 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.15 E-value=1.1e-09 Score=69.76 Aligned_cols=419 Identities=12% Similarity=0.069 Sum_probs=180.6 Q ss_pred cCCCcCC---CCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccc- Q lcl|NC_019527. 21 ARAEEQE---KARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADI- 96 (516) Q Consensus 21 ~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~- 96 (516) .+.+.+. .-....+...+.+....+..+- .....++-|-+.-.. T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~~~~~~~~r~--------------------------------~~l~~YY~G~~~i~~~ 48 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMISAFEDASKDL--------------------------------ASNTSYYDAERRPEAI 48 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHHHHHHHHHHH--------------------------------HHHHHHhcccCcchhc Confidence 1222211 1111112222323322222110 000011111110000 Q ss_pred cCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceee Q lcl|NC_019527. 97 QPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRG 176 (516) Q Consensus 97 ~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a 176 (516) ....-..+-.....+.+.++|||..++-+.=.|+.+....+. -+.+.+.+++-++.....++.+...+||.| T Consensus 49 ~~~~~~~~~~~~~v~n~~~~iVd~~~~~l~~~g~~~~~~~~~--------~~~~~~i~~~N~~d~~~~~~~~~a~~~G~a 120 (486) T protein:vir:42 49 GVTVPREMQQLLAHVGYPRLYVDSVAERQAVEGFRLGDADEA--------DEELWQWWQANNLDIEAPLGYTDAYVHGRS 120 (486) T ss_pred ccccchhHhhhhhccchHHHHHHHHHhhhcccceecCCCchh--------HHHHHHHHHhcChhHHHHHHHHHHhhcCce Confidence 001112222222334568999999998887788875432211 133566677778888999999999999999 Q ss_pred EEEEEecCCC-cccCcccccccccccceeeEEeecceeeccccc------------ccc-cc---ccccccCcc-eeEEe Q lcl|NC_019527. 177 QISINIKGAD-VSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAY------------NAL-DP---TAPDFYKPS-TWWVL 238 (516) Q Consensus 177 ~i~i~i~~~~-~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~------------~~~-dp---~s~~yg~P~-~y~v~ 238 (516) ++++..+... ...+. + ....+++++|.++.+..- ... +. ..-.++.|. .|++. T Consensus 121 y~~v~~~e~~~~~~~~--------~-~~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~ 191 (486) T protein:vir:42 121 FITISKPDPQLDLGWD--------Q-NVPIIRVEPPTRMHAEIDPRINRVSKAIRVAYDKEGNEIQAATLYTPMETIGWF 191 (486) T ss_pred EEEEecCCcccccccC--------C-CeeEEEEecccceEEEEeCCCCCeEEEEEEEEecCCCeEEEEEEEcCCcEEEEE Confidence 9888654211 10000 0 011244455544443210 000 00 000111111 11110 Q ss_pred ---ee------Eeccc---eEEEecCCcchhhhhhccCCCCchHHHH-HHHHHHHHHHHHHHHHHHHHHhCCceeee--- Q lcl|NC_019527. 239 ---GR------EMHAS---RLLTIITRPLPDMLKPAYNFSGISMSQL-AQPYVENWLRTRQSVSDLVDKFSRTFLKT--- 302 (516) Q Consensus 239 ---g~------~iH~S---Rli~~~~~~~p~~~k~~~~~~G~S~le~-~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~--- 302 (516) +. .-|+- -|+.|.++ ......+|.|.++. +...+.+++++.........-++.....+ T Consensus 192 ~~~~~~~~~~~~~h~~g~vPvv~~~n~------~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~ 265 (486) T protein:vir:42 192 RADGEWAEWFNVPHGLGVVPVVPLPNR------TRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGI 265 (486) T ss_pred ecCCcEEeecceecCCCCceEEEeccc------cccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcC Confidence 11 11221 11112211 11233579999875 66667777777665554444444332221 Q ss_pred cchhhhcCccHHHHHHHHHHHHHhcCCcc-eEEEecCCcceeEEe-cccCCHHHHHHHHHHHHHhhhcCCceeeeccccc Q lcl|NC_019527. 303 NMAQVLNGGEGGDVFDRVEMYVNMQSNLG-LAVMDFDSEDIVQVN-TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPS 380 (516) Q Consensus 303 ~~~~~l~~~~~~~l~~r~~~~~~~~sn~g-~~~id~~~e~~e~~~-~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~ 380 (516) +... ....++.. ...+.. ..+ ++++.++.-++.+++ .++....+.+.....++|+.+++|...|-| ++. T Consensus 266 ~~~~-~~~~~~~~----~~~~~~---~~~~~~~~~~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~-~~~ 336 (486) T protein:vir:42 266 KPEE-IGVDSETG----QTLFDA---YLARILAFEDAEGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLST-AAD 336 (486) T ss_pred Cccc-cccccccc----cchhhh---hhchhcccCCCCceEEeecccCHHHHHHHHHHHHHHHhcccCCCHHHhcc-ccC Confidence 1111 11011000 011111 112 223322223343332 234455667777788899999999876644 322 Q ss_pred cccccchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhCCC-cC---CcceEEeCCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_019527. 381 GLNASSEGEIRSFYDDIS--SVQQSYYFSPLDTMLKVIQLSKWGE-ID---DAITFKFKSLWQTSAKEESEIRFNKAQEA 454 (516) Q Consensus 381 Glnatge~D~~~yyd~I~--~~Qe~~l~p~l~~l~~~l~~s~~g~-~~---~d~~~~f~pL~~~sekEkAei~~~~a~a~ 454 (516) . ++||+.=...+...+. ..++..+++.|+++++++.....+. .+ .++++.|.+-...|..+.|+...|.+++. T Consensus 337 n-~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~ 415 (486) T protein:vir:42 337 N-PASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNG 415 (486) T ss_pred c-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcc Confidence 2 2566643333333222 2334567889999998876654432 22 36889999999999998877665543321 Q ss_pred HHHHHcCCCCHHHHHHHHHhhhccCCCCCChh-hhcccc---------------ccchhcCCCC----CCCCC-CCCCCC Q lcl|NC_019527. 455 QIYITNSVIDPSEARQQLSDDPDSGWDNIDGD-LEIVQP---------------EMFDDDGADP----YMPDP-DVLPGE 513 (516) Q Consensus 455 ~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~-~e~~~~---------------e~~~~e~~~~----~~~~~-~~~~~~ 513 (516) .|+++.+.+++.+ +|..-+.+ ++...+ ......+.+. +.+++ ....|+ T Consensus 416 -----~g~~s~et~~~~l------g~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (486) T protein:vir:42 416 -----QGVIPRERARIDM------GYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQPAIESSGG 484 (486) T ss_pred -----cCCCCHHHHHhcC------CCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCCcccCCCCC Confidence 1455555544433 11111000 000000 0000000011 11111 122222 Q ss_pred CC Q lcl|NC_019527. 514 EG 515 (516) Q Consensus 514 e~ 515 (516) .+ T Consensus 485 ~~ 486 (486) T protein:vir:42 485 DA 486 (486) T ss_pred CC Confidence 22 No 175 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=99.14 E-value=1e-09 Score=69.87 Aligned_cols=439 Identities=10% Similarity=0.091 Sum_probs=203.9 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) ||.||.|- +... .+.+.......++||..-+|... + .+..+++ T Consensus 6 lf~f~~~~-----------------d~~~------------~~~~~~~~~~s~~~p~~~DGa~~------i--~~~~~~~ 48 (516) T protein:vir:10 6 LFKFWDRV-----------------DQNE------------YDERLKQGHESIATPKKDDGATE------I--EAREGES 48 (516) T ss_pred hcccccch-----------------hhHH------------HHhhhcCCCCcccCCCCccCcee------e--ecCcccc Confidence 66665441 0000 11112222334666665555421 1 1111111 Q ss_pred hhhcccccCCcccccccCcccHHHHHHHH---hCchhhhhhhhhhHHHhhC-----CCeeeeccccchhhhHHHHHHHHH Q lcl|NC_019527. 81 YQFLNSAAGGLYAADIQPFPGYQNLAALA---TRPEYRAFASTLSTELTRE-----GIEITSKDRTKAKEMASKIKELEE 152 (516) Q Consensus 81 ~~~~~~~~~~~~~~~~~~f~gy~ll~~y~---~~~i~r~iVd~~aed~~r~-----~~~i~~~~~~~~~~~~~~i~~i~~ 152 (516) ..+++.+.+...+...-..++|...|+ .++.+..+|+.+++||+-+ .+++...+-+-. ...-++|.+ T Consensus 49 --~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~~pEvd~Av~eIvneaiv~d~~~~pV~l~l~~~e~s---~sik~kI~e 123 (516) T protein:vir:10 49 --SYNALMQQFFGIDNNISGTKDLINTYRQLTNNPEVERAVANIVNEAVVYEKGHKVVSLDLDDTEFS---SSIKDKILE 123 (516) T ss_pred --cccceeeeeecccCccccHHHHHHHHHHhhhccchhHHHHHhhcceeEecCCCceEEEEecccccc---hHHHHHHHH Confidence 111222111112222234578888886 6999999999999999732 233322222111 122233444 Q ss_pred HHH----hcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccc--- Q lcl|NC_019527. 153 ACE----YYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPT--- 225 (516) Q Consensus 153 ~~~----~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~--- 225 (516) +++ -|++...-.+.+|.--+.|.-+.--.++ ++. .+|+.++.+||..+....+...... T Consensus 124 eF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid--~~k------------~GI~elr~lDPr~i~~vR~i~~~~~~~~ 189 (516) T protein:vir:10 124 EFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIMP--NPK------------EGIVELRRLDPRHVEYYREIVTSDVGGT 189 (516) T ss_pred HHHHHHHHhccchhhhHHHHhhhhcceEEEEEEec--Ccc------------cceeeeeeeCCcceeeEEeeecccCcch Confidence 444 4466666666666555555544332343 211 2344555566655544322111000 Q ss_pred ------cc--cccC-cceeEEeee--------EeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 226 ------AP--DFYK-PSTWWVLGR--------EMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSV 288 (516) Q Consensus 226 ------s~--~yg~-P~~y~v~g~--------~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~ 288 (516) .. .|.. -.+|-++|+ +|+.+=+...+..- .+.....=+|.|+.+...+.+.-....+ T Consensus 190 ~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~daI~y~hSGl-----~d~~~~~i~syLhkAiKp~NQLkm~EDA- 263 (516) T protein:vir:10 190 SVVKGYREFFVYTTGNEGYAYNGRLFEPNTRIKIPRSAIVYAHSGL-----QDCSDRGIVGYLHNAVKPANQLKLLEDA- 263 (516) T ss_pred hhhhceeeeeeeecCccceeccccccCCCCceecchhheeeeecCc-----ccCCCCceeceehhhhHhHHhhHHHHhh- Confidence 00 0110 122333333 34443333322111 1111122257777776666655443332 Q ss_pred HHHHHHhC----CceeeecchhhhcCccHHHHH-HHHHHHH---HhcCCcceE--------------EE---ecCCccee Q lcl|NC_019527. 289 SDLVDKFS----RTFLKTNMAQVLNGGEGGDVF-DRVEMYV---NMQSNLGLA--------------VM---DFDSEDIV 343 (516) Q Consensus 289 ~~Ll~~~~----~~v~k~~~~~~l~~~~~~~l~-~r~~~~~---~~~sn~g~~--------------~i---d~~~e~~e 343 (516) -++++.+ -.|+=+|+.++-.. ..++.. .-+..+. .+-.++|-+ +- ++.+-+++ T Consensus 264 -lVIYRitRAPeRRvFYIDVGnLPk~-KAeqYl~~iM~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 341 (516) T protein:vir:10 264 -LVIYRITRAPERRVFYIDVGNMPNR-KATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVT 341 (516) T ss_pred -HHHHhhhccccceEEEEecCCCCch-hHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCccccee Confidence 3444433 24444454433211 111111 1011000 011111110 00 01112344 Q ss_pred EEe--cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccc--cccchH--HHHHHHHHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_019527. 344 QVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGL--NASSEG--EIRSFYDDISSVQQSYYFSPLDTMLKV-I 416 (516) Q Consensus 344 ~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gl--natge~--D~~~yyd~I~~~Qe~~l~p~l~~l~~~-l 416 (516) ++. -+|+.++|+ ..|+..+-.+.++|.++|-..+...+ +.++|= |.-.|..+|.+.|..+ ..++..+++. | T Consensus 342 TLpGgqnlgem~DV-~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rF-s~lF~~~L~~qL 419 (516) T protein:vir:10 342 SLPGAQTMGEMDDV-RWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRKFIVQLQHNF-EEIFLDPLKTNL 419 (516) T ss_pred eccccCCcChHHHH-HHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHh Confidence 442 235556554 58999999999999999976654444 334332 4567999999998654 3444444332 2 Q ss_pred HH------HhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH--HcCCCCHHHHHHHHHhhhccCCCCCChhhh Q lcl|NC_019527. 417 QL------SKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYI--TNSVIDPSEARQQLSDDPDSGWDNIDGDLE 488 (516) Q Consensus 417 ~~------s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~--~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e 488 (516) .+ ..|-.+-+++.|+|..=.--+|-..+|+...+.++++.+- -...++.+-+++.+-... |+++. T Consensus 420 ilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~t-------Deei~ 492 (516) T protein:vir:10 420 IYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNILQMT-------DEQIA 492 (516) T ss_pred hhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCC-------HhHHH Confidence 21 2233444578899998888889999999999999888874 245788888887753322 22222 Q ss_pred ccccccchhcCCCCCCCCCCCCCCC Q lcl|NC_019527. 489 IVQPEMFDDDGADPYMPDPDVLPGE 513 (516) Q Consensus 489 ~~~~e~~~~e~~~~~~~~~~~~~~~ 513 (516) ..+..+. .|..++--++|+++.+= T Consensus 493 ~~~k~I~-~E~~~~~~~~p~~e~~f 516 (516) T protein:vir:10 493 QEEKQIE-KEANVKRFQNPENEDDF 516 (516) T ss_pred HHHHHHH-HhhhCCCCCCCCccccC Confidence 2222222 22222212222222111 No 176 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=99.13 E-value=6.7e-10 Score=70.87 Aligned_cols=360 Identities=13% Similarity=0.038 Sum_probs=181.0 Q ss_pred hhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccH----HHHHHHH Q lcl|NC_019527. 34 MRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGY----QNLAALA 109 (516) Q Consensus 34 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy----~ll~~y~ 109 (516) +...++..+.++-.+-. ........++-|-+.. ...|. ++...|+ T Consensus 1 ~~~~~i~~L~~~~~~~~----------------------------~r~~~~~~yY~g~~~~---~~~~~~~p~~~~~~~~ 49 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHK----------------------------RRAEMRYEQYAMKHVD---RFKGITIPQALSQQYR 49 (409) T ss_pred CCHHHHHHHHHHHHHHh----------------------------HHHHHHHHHHhccCch---hhcchhhhHHHHHHHh Confidence 33333333322111100 0000000111111110 01111 2223343 Q ss_pred -hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcc Q lcl|NC_019527. 110 -TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVS 188 (516) Q Consensus 110 -~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~ 188 (516) .....++|||..++-+.=+||+. . + ..+.+-|.+-++.....++.+-+.+||-|++++.- +.+. T Consensus 50 ~v~nw~~~iVds~a~rl~~~Gf~~--~---d--------~~l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~-~~dg- 114 (409) T protein:vir:16 50 SILGWCAKGVDSLADRLVFREFEN--D---D--------FTVNEIFEENNPDIFFDSTVLSALIASCSFTYISK-GEND- 114 (409) T ss_pred hhcChhHHHHHHhHhhcccccccC--c---c--------hHHHHHHHhcChhHHHHHHHHHHHHhCceeEEEec-CCCC- Confidence 34678999999998777678752 1 1 12556677778888899999999999999988753 2221 Q ss_pred cCcccccccccccceeeEEeecceeeccc----------cccccccc------cccccCcce--eEEee-e----Eeccc Q lcl|NC_019527. 189 VPLILDPRTIKKGSLTGFSNIEPMWTSPS----------AYNALDPT------APDFYKPST--WWVLG-R----EMHAS 245 (516) Q Consensus 189 ~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~----------~~~~~dp~------s~~yg~P~~--y~v~g-~----~iH~S 245 (516) .|. +++++|.+++.. .+..++.. .-.++.|.. +.+.. . .-|+- T Consensus 115 ~~~--------------i~~~sP~~~~~i~D~~~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (409) T protein:vir:16 115 AVR--------------LQVIEATNATGIIDPITGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPT 180 (409) T ss_pred ceE--------------EEEEcccceEEEeecccccceeeeEEEEecCCCceEEEEEEecCcEEEEEecCccccceecCC Confidence 121 222333332210 01111100 001111110 11110 0 11322 Q ss_pred e---EEEecCCcchhhhhhccCCCCchHH-HHHHHHHHHHHHHHHHHHHHHHHhCCceeee-cchhhhcCccHHHHHHHH Q lcl|NC_019527. 246 R---LLTIITRPLPDMLKPAYNFSGISMS-QLAQPYVENWLRTRQSVSDLVDKFSRTFLKT-NMAQVLNGGEGGDVFDRV 320 (516) Q Consensus 246 R---li~~~~~~~p~~~k~~~~~~G~S~l-e~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~-~~~~~l~~~~~~~l~~r~ 320 (516) - +++|.+++ .....+|.|-+ +.++....++.++.....--..-++.+...+ ++... +.....+...+ T Consensus 181 g~vPvV~f~n~~------~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d--~~~~~~~~~~~ 252 (409) T protein:vir:16 181 GNPLLVPIIHRP------DAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDD--AEPMETWKATV 252 (409) T ss_pred CCcceEEecccc------cccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCC--CCccchhhhhh Confidence 1 33333322 12345799976 6788888888877766444444444433322 22110 00111122211 Q ss_pred HHHHHhcCCcceEEEe----cCCcceeEEe-cccCCHHHHHHHHHHHHHhhhcCCceeeecccccccc-ccchH---HHH Q lcl|NC_019527. 321 EMYVNMQSNLGLAVMD----FDSEDIVQVN-TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLN-ASSEG---EIR 391 (516) Q Consensus 321 ~~~~~~~sn~g~~~id----~~~e~~e~~~-~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln-atge~---D~~ 391 (516) .. +..+. ++.-++.+++ .++.+..+.+.....++|+.++||..-|-|.+ -| +|++. ... T Consensus 253 ~~---------i~~~~~d~~g~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~---~NpsSa~Ai~a~~~ 320 (409) T protein:vir:16 253 SS---------MLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVS---DNPSSVEAIKASHE 320 (409) T ss_pred hH---------hhccCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHccccc---CchhHHHHHHHHHH Confidence 11 22221 1112344443 55778888899999999999999988775543 23 55553 233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CcCC---cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCC--CC Q lcl|NC_019527. 392 SFYDDISSVQQSYYFSPLDTMLKVIQLSKWG--EIDD---AITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSV--ID 464 (516) Q Consensus 392 ~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g--~~~~---d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gv--i~ 464 (516) .....++.+|+ .+...++++.++++....+ ..++ ++++.|.|+..++....|+ .|+++.+++++|. .. T Consensus 321 ~L~~ka~~k~~-~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~v~W~~~~~~~~~s~a~----~aDa~~Kl~~a~~~~~~ 395 (409) T protein:vir:16 321 NLRLAGRKAQR-SLGAGLLNVAYLAACLRDDVPYLREQFSKTKPKWEPLFEADASMLSL----IGDGAIKLNQAIPEFIN 395 (409) T ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCccchhhccceEEecCCCCcchhhHHH----HHHHHHHHHhhcccccc Confidence 44456666654 4688899988877655443 3443 4689999887666544433 6788888898873 34 Q ss_pred HHHHHHHHHhhhccCCCCCC Q lcl|NC_019527. 465 PSEARQQLSDDPDSGWDNID 484 (516) Q Consensus 465 ~~e~r~~l~~~~~~~~~~~d 484 (516) .+.+++.+ ||+.-| T Consensus 396 ~~v~~~~~------g~~~~d 409 (409) T protein:vir:16 396 KDTIRDLT------GIKGAE 409 (409) T ss_pred hhHHHHhc------cCCCCC Confidence 45566666 454433 No 177 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.13 E-value=5.3e-10 Score=71.42 Aligned_cols=438 Identities=10% Similarity=0.030 Sum_probs=194.3 Q ss_pred CCChhhhHHHHhHHhhcCCCccccccC-CCCCCCccCCCccchhcccc-----c-------ccchhhhcccccCCccccc Q lcl|NC_019527. 29 ARKLAMRRAVMKSMERRASDAATKWAP-PQLMPGVVPAGTTPAVAMDS-----L-------CGPTYQFLNSAAGGLYAAD 95 (516) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~gv~~~~~~~~~a~ds-----~-------~~~~~~~~~~~~~~~~~~~ 95 (516) -....-.++-+.+....... +...+. -...++. . ....++. + ......-...++.|.+.-- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~~~n~~~~~~~~----e-~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~ 74 (511) T protein:vir:99 1 MLKVNEFETDTDLRGNINYL-FNDEANVVYTYDGT----E-SDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNL 74 (511) T ss_pred Cccccchhhhhhhhhhhhhh-hhhhhCCccccchh----h-hhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccc Confidence 00000011111111111000 000000 0000010 0 0000110 0 0000111111111111100 Q ss_pred cc-CcccHHHHHHH-HhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccc Q lcl|NC_019527. 96 IQ-PFPGYQNLAAL-ATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFF 173 (516) Q Consensus 96 ~~-~f~gy~ll~~y-~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rly 173 (516) .. ........... -.+.+++.||+..+.-++.+.+.+++.++.. .+.|+..+++-++...+.++.+...+| T Consensus 75 ~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~-------~~~l~~~~~~n~~~~~~~~~~~~~~i~ 147 (511) T protein:vir:99 75 VELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDV-------LEAIEAFNDLNDVESHNRSLGLDLSIY 147 (511) T ss_pred cccCcccccccCcceeecchHHHHHHHHHhhhcccCceeecCchHH-------HHHHHHHHhhcCHhHHHHHHHHHHHhc Confidence 00 00000000001 1246778999999998999999998765432 356777788888999999999999999 Q ss_pred eeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe-----------eeEe Q lcl|NC_019527. 174 GRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL-----------GREM 242 (516) Q Consensus 174 G~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~-----------g~~i 242 (516) |.|++++..+... .+ .+.+++|..+.|..-.. ....+-++ -.+|.+. -..| T Consensus 148 G~a~~~vy~ded~---------------~~-~i~~~~p~~~~~vyd~~-~~~~~~~~-vr~~~~~~~~~~~~~~~~~~~v 209 (511) T protein:vir:99 148 GKAYELMIRNQDD---------------ET-RLYKSDAMSTFVIYDNT-IERNSIAG-VRYLRTKPIDKTDEDEVFTVDL 209 (511) T ss_pred CeeEEEEEeCCCC---------------ce-EEEEEccceeEEEEcCC-CCCceEEE-EEEEEeeecccCccceEEEEEE Confidence 9999988764321 11 14455555555432110 00111111 1112111 0011 Q ss_pred -ccceEEEecCC--------------------cchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceee Q lcl|NC_019527. 243 -HASRLLTIITR--------------------PLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLK 301 (516) Q Consensus 243 -H~SRli~~~~~--------------------~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k 301 (516) .+.++.+|... .+|-. .-.++-+|.|.++.+.+.+.+++.+....+..+..++..++. T Consensus 210 yt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv-~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv 288 (511) T protein:vir:99 210 FTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPIT-EFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLL 288 (511) T ss_pred EeCCcEEEEEecCCccccccccccccccCCCCccceE-EecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhh Confidence 12233333211 01111 112345799999999999999999998888877777665544 Q ss_pred ecchhhhcCccHHHHHH-HHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeeccc Q lcl|NC_019527. 302 TNMAQVLNGGEGGDVFD-RVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGIS 378 (516) Q Consensus 302 ~~~~~~l~~~~~~~l~~-r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~s 378 (516) .........+......+ ++-.........+...-..++-+++.+ +.+.+++...++.+.+.|...+.+|-.-.-+. T Consensus 289 ~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~- 367 (511) T protein:vir:99 289 IKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNF- 367 (511) T ss_pred hccCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc- Confidence 32111111111111111 000000000001111111122345544 44566888999999999999999997533221 Q ss_pred cccccccchHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHh--CCCc--C---CcceEEeCCCCCCCHHHHHHHHH Q lcl|NC_019527. 379 PSGLNASSEGEIRSFY---DDISSVQQSYYFSPLDTMLKVIQLSK--WGEI--D---DAITFKFKSLWQTSAKEESEIRF 448 (516) Q Consensus 379 p~Glnatge~D~~~yy---d~I~~~Qe~~l~p~l~~l~~~l~~s~--~g~~--~---~d~~~~f~pL~~~sekEkAei~~ 448 (516) +| |.||..=...|. ..+ ..++..++..|++++++++.-. .+.. + .+++|.|++-...+.+|.+++.. T Consensus 368 -~g-n~Sg~Alk~~~~~l~~ka-~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~ 444 (511) T protein:vir:99 368 -SG-TQSGEAMKYKLFGLEQRT-KTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYI 444 (511) T ss_pred -cc-cchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHH Confidence 22 456653222222 222 3445677888998888775421 2221 2 26899999998899999888665 Q ss_pred HHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccccc----chh---cCCCCCC-CCCCCCCCCCCC Q lcl|NC_019527. 449 NKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEM----FDD---DGADPYM-PDPDVLPGEEGS 516 (516) Q Consensus 449 ~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~----~~~---e~~~~~~-~~~~~~~~~e~t 516 (516) +. .|+||.+.+.+.+.. .+..+.+.+....|. ... ...++.. .+.+..+.++.+ T Consensus 445 kl---------~GiiS~et~l~~l~~-----v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (511) T protein:vir:99 445 DS---------GGKISQTTLMSLFSF-----FQDPELEVKKIEEDEKESIKKAQKNMYQDPRNINDDEQDDSTKDS 506 (511) T ss_pred HH---------hccCCHHHHHHhCCC-----CCCHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCCCCCCCCcCc Confidence 43 266666666555411 000001111000000 000 0000000 001111111111 No 178 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=99.12 E-value=1.6e-09 Score=68.80 Aligned_cols=420 Identities=9% Similarity=0.003 Sum_probs=197.7 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) |=-+. .. -.-|++......+....+ ..+ .+. | ..-+..|.-|...+..+ T Consensus 1 ~~~~~--------d~-----~g~p~~~~~~~~~~~~~~-~~~----~~~---~-~~~~~~gltp~~l~~il--------- 49 (528) T protein:vir:10 1 MAAIV--------DI-----YGNPLRTQQLRKQQTAHL-AGL----AKE---F-ANHPAKGLTPAKLAHIL--------- 49 (528) T ss_pred CCeeE--------CC-----CCCccccccccchhhhhh-hhh----hhh---h-cccCCCCCCHHHHHHHH--------- Confidence 11100 00 000000000000000000 000 000 0 00011233222211111 Q ss_pred hhhcccccCCcccccccCccc-HHHHHHH-HhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcC Q lcl|NC_019527. 81 YQFLNSAAGGLYAADIQPFPG-YQNLAAL-ATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYG 158 (516) Q Consensus 81 ~~~~~~~~~~~~~~~~~~f~g-y~ll~~y-~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~ 158 (516) +.+. .| .+.. ++|...+ .+..-+..++++...-++..-|.|....++..+ .....+.+++.+.++. T Consensus 50 -~~a~---~g-------d~~~~~~L~~~m~e~D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~-~~~~a~~v~~~l~~~~ 117 (528) T protein:vir:10 50 -IEAE---QG-------HLQAQAELFMDMEERDAHLFAEMSKRKRAVLGLDWTIEPPRNASAA-EKADAEYLHELLLDLE 117 (528) T ss_pred -Hhhh---CC-------CHHHHHHHHHHHHhhChHHHHHHHHHHHHHhcCCceEecCCCCCHH-HHHHHHHHHHHHhCCc Confidence 1111 10 1111 2333333 368889999999999999888888664332221 1233455677777775 Q ss_pred hhHHHHHHHHhcccceeeEEEEE--ecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeE Q lcl|NC_019527. 159 VMGIIQKAAEHDCFFGRGQISIN--IKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWW 236 (516) Q Consensus 159 ~~~~l~ea~~~~rlyG~a~i~i~--i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~ 236 (516) -++.+..-+-.+.+||.+++=+. .+++.+ .++.+...++.|+...... ...+ .-..-. T Consensus 118 ~f~~~i~~~lda~~~G~s~~Ei~w~~~~g~~--------------~~~~~~~r~~~~f~~~~~~-----~~~l-~~~~~~ 177 (528) T protein:vir:10 118 GIEDLMLDCMDGVGHGYSAIELDWSLQGREW--------------LPQAFDHRPQSWFQLNPDD-----QDEL-RLRDNS 177 (528) T ss_pred cHHHHHHHHHhhhhhcceeEEEEEeecCCce--------------eEEEeeeecccceeeccCC-----CcEE-eccCCC Confidence 45566666677999999986543 332221 2334555555554321110 0000 000112 Q ss_pred EeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCcee--eecchhhhcCccHH Q lcl|NC_019527. 237 VLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFL--KTNMAQVLNGGEGG 314 (516) Q Consensus 237 v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~--k~~~~~~l~~~~~~ 314 (516) +.|..+++-+.+++.... ...+.+|.+++..|+-...--..+...-+.++.++++++. |++-. .+.+ T Consensus 178 ~~g~~l~~~k~iv~~~~~------~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~-----a~~~ 246 (528) T protein:vir:10 178 IAGEVLQPFGWIMHKPRS------RSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPPG-----TPDE 246 (528) T ss_pred CCceeecCCCeEEEeecC------CCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCC-----CCHH Confidence 345667777777766532 2345569999999998887777777778889999997654 44311 1222 Q ss_pred HHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCH---HHHHHHHHHHHHhhhcCCceeeecccc--------cccc Q lcl|NC_019527. 315 DVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGL---ADLQSQSQEHMCSVSKIPAIKLTGISP--------SGLN 383 (516) Q Consensus 315 ~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl---~d~~~~~~~~iaaas~IP~t~L~G~sp--------~Gln 383 (516) +...-++.+..+.++ +..++.. +.+++.++.+=++. ..+++..-..|+-+ ++|++- +|-+ T Consensus 247 ek~~L~~al~~i~~~-~~~iiP~-~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~-------iLGqtlTs~~~~g~~gS~ 317 (528) T protein:vir:10 247 EKVTLLRAVTGLGHA-AAGIIPE-SMSIDFQEASKGSAEPFMAMMRWCDDSMSKA-------ILGGTLTSQTSESGGGAY 317 (528) T ss_pred HHHHHHHHHHHHhhC-cEEEecC-CceeEEeecCCCChhHHHHHHHHHHHHHHHH-------Hhhhhhhccccccccchh Confidence 333444445554443 3444544 47899888653433 34566666666633 234331 1233 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-CC--cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019527. 384 ASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI-DD--AITFKFKSLWQTSAKEESEIRFNKAQEAQIYITN 460 (516) Q Consensus 384 atge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~-~~--d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~ 460 (516) |-|+.-.....+.+++-......-+.+.|+.-++.-.||.. +. --.|+|..--..+- +..|++++.+++. T Consensus 318 Alg~vh~~v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl-------~~~a~~~~~L~~~ 390 (528) T protein:vir:10 318 ALGQVHNEVRHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLKDRADL-------AAMATSLPPLVKL 390 (528) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCCcccH-------HHHHHHHHHHHhC Confidence 33333445556666665554433444557777776666532 21 23566654432222 3467788889999 Q ss_pred CC-CCHHHHHHHHHhhhccCCCCCChhhhccccccchh--cCCCCCCCCCCCC--CCCCCC Q lcl|NC_019527. 461 SV-IDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDD--DGADPYMPDPDVL--PGEEGS 516 (516) Q Consensus 461 gv-i~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~--e~~~~~~~~~~~~--~~~e~t 516 (516) |+ |+.+++++.++ ++.-..+++...+..... .....+....... .....+ T Consensus 391 G~~i~~~~i~e~~g------ip~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (528) T protein:vir:10 391 GVQVPVNWVQEQLG------IPLPANGEAVLGDQAGAGIAQLSRRPGPRIAALAQVIGPRY 445 (528) T ss_pred CCCCCHHHHHHHhC------CCCCCCCcccccCCCcccccccCcccccccccccccccccc Confidence 98 89999999883 322111111100100000 0000000000000 000000 No 179 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=99.12 E-value=3e-10 Score=72.82 Aligned_cols=328 Identities=8% Similarity=-0.027 Sum_probs=160.3 Q ss_pred hhHHHHhHHhh---cCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCc-ccccccCcccHHHHHHHH Q lcl|NC_019527. 34 MRRAVMKSMER---RASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGL-YAADIQPFPGYQNLAALA 109 (516) Q Consensus 34 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~-~~~~~~~f~gy~ll~~y~ 109 (516) +..++.+.... .+..++. |- .-|.. + +|.. ..+.+..-...+. .+++ .+..-..|..+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-~~-------~~p~~----~-~~~~--~~~~~~~~~~~~~~~~~e-pp~~~~~La~l~~ 64 (348) T protein:vir:26 1 MTEQLIHSHTTDGTESKSVYS-FD-------PNPEP----V-DTNS--WMTRYCELFYNDFDDYWE-PPISLKGLAEIAN 64 (348) T ss_pred CCccccchhhccccCCceEEE-ec-------CCCee----e-cCcc--hHHHHHHHHhcCCCcccc-CCCCHHHHHHHHh Confidence 11111111100 0011111 11 00111 0 1110 0112222222111 1111 1222345677777 Q ss_pred hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCccc Q lcl|NC_019527. 110 TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSV 189 (516) Q Consensus 110 ~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~ 189 (516) .|+....+|.......++ ++. +. + +--+..|.+++..-.++|.|++++..+. T Consensus 65 ~n~~h~~~i~~k~N~l~~-~~~-Pn---~------------------~~t~~~f~~~~~d~ll~Gnay~~~~rn~----- 116 (348) T protein:vir:26 65 ANGYHGSLLKARANYVAG-RFM-NG---G------------------GLPMYKMNSACWDYFGLGMSAFVKIRSY----- 116 (348) T ss_pred hhhhhhhhHhhhhhHHhh-ccc-CC---C------------------CCCHHHHHHHHHHHHhcCCeEEEEEEcC----- Confidence 777777777666554333 221 00 0 0012234445545567899998875432 Q ss_pred CcccccccccccceeeEEeecceeeccccccccccccccccCcceeEE--ee--eEeccceEEEecCCcchhhhhhccCC Q lcl|NC_019527. 190 PLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV--LG--REMHASRLLTIITRPLPDMLKPAYNF 265 (516) Q Consensus 190 Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v--~g--~~iH~SRli~~~~~~~p~~~k~~~~~ 265 (516) .|.+..|.++.+.+|.... |. + +|++ .+ +.++++.|+||.+.. +.... T Consensus 117 ----------~G~~~~L~~l~~~~v~~~~----d~---~-----~~~~~~~g~~~~f~~~dIiHir~~~------~~~~~ 168 (348) T protein:vir:26 117 ----------LKNVIALEPLPMVHMRKRK----NG---D-----FVQLLRNNEQKVFKAKDVIFIPQYD------PQQQI 168 (348) T ss_pred ----------CCcEEEEEEecCceeEeee----cC---c-----EEEEEecCeEEEEcCccEEEEcCCC------CCCCc Confidence 1334567777777665421 11 1 2333 22 467899999997532 23456 Q ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecc-hhhhcCccHHHHHHHHHHHHHhcCCcceEEEe---cCCcc Q lcl|NC_019527. 266 SGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNM-AQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMD---FDSED 341 (516) Q Consensus 266 ~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~-~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id---~~~e~ 341 (516) +|+|.+..+...+..-..+.........+....-+-..+ ...++.++.+++.+.++.. .+..|.+.+++. ++.+. T Consensus 169 ~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~~~ls~e~~~~lk~~~~~~-~G~~n~~~~~vl~~~g~~~G 247 (348) T protein:vir:26 169 YGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYATDPNLSEADEKALKEKIASS-KGIGNFRSMFVNIPNGKEKG 247 (348) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHh-cCcccccceeEEcCCCCccc Confidence 799999999998887777777777777665543222211 1234444455566666553 344555544443 11233 Q ss_pred eeEEecccCCH----HHHHHHHHHHHHhhhcCCceeeecccccc--ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 342 IVQVNTPLSGL----ADLQSQSQEHMCSVSKIPAIKLTGISPSG--LNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKV 415 (516) Q Consensus 342 ~e~~~~~lsgl----~d~~~~~~~~iaaas~IP~t~L~G~sp~G--lnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~ 415 (516) ++...++.+.- -++.....+.||++.+||-. |+|+.+.+ -.++-+...+.|+ +..|.|.++.+.+. T Consensus 248 i~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~-llGi~~~~~~~~sn~e~~~~~f~-------~~~l~P~~~~ie~~ 319 (348) T protein:vir:26 248 IQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAG-MGGMLPQQGANVPDPLKVSQVYD-------FYEVIPVCKRFMDA 319 (348) T ss_pred eeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHH-HccccCCCCCccccHHHHHHHHH-------HHHHHHHHHHHHHH Confidence 44444444433 33445556789999999975 66765442 1233344444444 35588888888776 Q ss_pred HHHHhCCCcCCcceEEe--CCCCCCCHHHHHHH Q lcl|NC_019527. 416 IQLSKWGEIDDAITFKF--KSLWQTSAKEESEI 446 (516) Q Consensus 416 l~~s~~g~~~~d~~~~f--~pL~~~sekEkAei 446 (516) |-.. ++ +++++.|+| ++...-++ ++-+ T Consensus 320 ln~~-l~-~~~~~~~~fdl~~~~e~~~--~~a~ 348 (348) T protein:vir:26 320 VNND-PE-IPDNLKLKFNLNPGVESAN--GSAV 348 (348) T ss_pred Hhhh-hC-CCCccEEEEecCcccccch--hhcC Confidence 6532 22 455555554 44322222 2222 No 180 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=99.12 E-value=1.8e-10 Score=74.03 Aligned_cols=450 Identities=12% Similarity=0.053 Sum_probs=202.0 Q ss_pred hhhhHHHHhHH---hhcCCCccccccCCCCCCCccCCCccchhccc-ccccchhhhcccccCCcccccccCcccHHHHHH Q lcl|NC_019527. 32 LAMRRAVMKSM---ERRASDAATKWAPPQLMPGVVPAGTTPAVAMD-SLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAA 107 (516) Q Consensus 32 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~d-s~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~ 107 (516) |.+...+-+-+ ..+... ..+.....++.++++ .+......+..-+.+..++.....-.|...-.. T Consensus 1 m~~~~~ik~~~~~~~~~~~~-----------~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~ 69 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSG-----------QTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERD 69 (517) T ss_pred CchHHHHHHHHHHHHHHhcc-----------cchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccc Confidence 33332221111 111100 000000011122222 112222222222333333322211111111112 Q ss_pred HHhCchhhhhhhhhhHHHhhCCCeeeeccccchh----hhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEec Q lcl|NC_019527. 108 LATRPEYRAFASTLSTELTREGIEITSKDRTKAK----EMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIK 183 (516) Q Consensus 108 y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~----~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~ 183 (516) ..+-.++++||...|+-++.+.-+|+..+....+ .....-+.|++.++.-++...+.+++......|++++-+.++ T Consensus 70 ~~sl~~~~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d 149 (517) T protein:vir:98 70 YMTLNLRKLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVD 149 (517) T ss_pred eeecCcHHHHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEe Confidence 2234889999999999999888787765433211 111122447777777789999999999999999988877665 Q ss_pred CCCc-------cc--CcccccccccccceeeEEeecceeeccccc---cccccccccccCcceeEEeee----------- Q lcl|NC_019527. 184 GADV-------SV--PLILDPRTIKKGSLTGFSNIEPMWTSPSAY---NALDPTAPDFYKPSTWWVLGR----------- 240 (516) Q Consensus 184 ~~~~-------~~--Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~---~~~dp~s~~yg~P~~y~v~g~----------- 240 (516) ++.. +. |+..+..++..+.|.......... ....| ....+..-.++. ..|+|... T Consensus 150 ~~~~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~-~~~~Yt~lE~H~~~~~~~~~-~~y~I~n~ly~s~~~~~lG 227 (517) T protein:vir:98 150 NGEIEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGN-KTVYYTLLEFHEWEKTEEGE-SLYVITNELYKSDNEGEIG 227 (517) T ss_pred CCeeEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecC-CceEEEEEEEEecCceeccC-CcEEEEEEEEecCCCcccc Confidence 5321 11 222221122111111100000000 00000 000000000000 12333221 Q ss_pred -Ee-----cc--ceEEEecCCcchhh--hh-------hccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceee-e Q lcl|NC_019527. 241 -EM-----HA--SRLLTIITRPLPDM--LK-------PAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLK-T 302 (516) Q Consensus 241 -~i-----H~--SRli~~~~~~~p~~--~k-------~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k-~ 302 (516) +| ++ .--..+.|-+-|.. .+ .....+|+|++..+.+.++..+.+......-+......++- . T Consensus 228 ~~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~i~vp~ 307 (517) T protein:vir:98 228 KRIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRTVFVSD 307 (517) T ss_pred ccccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcceecCh Confidence 11 10 00112222222211 11 11246799999999999999998777666544444333322 1 Q ss_pred cchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEeccc--CCHHHHHHHHHHHHHhhhcCCceeeeccccc Q lcl|NC_019527. 303 NMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPL--SGLADLQSQSQEHMCSVSKIPAIKLTGISPS 380 (516) Q Consensus 303 ~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~l--sgl~d~~~~~~~~iaaas~IP~t~L~G~sp~ 380 (516) .+...-...++...- ..+..-..-+..+-.+.++..++.++..+ ......++.+.+.|+..+|++.. -||.... T Consensus 308 ~~l~~~~~~~g~~~~---~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~-t~~~~~~ 383 (517) T protein:vir:98 308 VMLRTVPDESGMPPP---QVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVG-TFSFDGR 383 (517) T ss_pred hhhccccCCCCcccC---CCCCcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcc-ccccccc Confidence 111000011100000 00000000001111112223455555444 34667788888999999999865 4466656 Q ss_pred cccccchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hC-CCcC--CcceEEeCCCCCCCHHHHHHHHHH Q lcl|NC_019527. 381 GLNASSEG---EIRSFYDDISSVQQSYYFSPLDTMLKVIQLS-----KW-GEID--DAITFKFKSLWQTSAKEESEIRFN 449 (516) Q Consensus 381 Glnatge~---D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s-----~~-g~~~--~d~~~~f~pL~~~sekEkAei~~~ 449 (516) |. .|+.+ ..+.-|.+++++|. .++..|++++..++.- .+ |.++ .+++|.|.+-...|.++.++.. T Consensus 384 ~~-kTATEi~s~~~~~~~t~~~~~~-~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~-- 459 (517) T protein:vir:98 384 SM-KTATEIVSENDLTYRTRNDHVY-EVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFY-- 459 (517) T ss_pred cc-ccHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHH-- Confidence 65 34432 33456678887775 4788899988877532 12 3333 3689999998888887766544 Q ss_pred HHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh-hccccccchh-cCCCC---CCCCCCCCCCCCC Q lcl|NC_019527. 450 KAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL-EIVQPEMFDD-DGADP---YMPDPDVLPGEEG 515 (516) Q Consensus 450 ~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~-e~~~~e~~~~-e~~~~---~~~~~~~~~~~e~ 515 (516) .+++.+|+++..+++..+ | +++++. +..-++..++ .+.+| .....+..+|.+- T Consensus 460 -----~~~v~aG~ms~~~~i~~~-------~-g~~eeeA~~e~~~i~~E~~~~~~~~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 460 -----GQAKTFGFIPTVEAIQRI-------F-KVPKKTAEQWLEEIRKDQIELDPVTISQRAQKRMFGDEE 517 (517) T ss_pred -----HHHHhcCCCCHHHHHHHh-------C-CCChHHHHHHHHHHHHhccccCCCCccccccCCCCCCCC Confidence 457889999998887664 2 233221 1111111110 01111 1111222222222 No 181 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=99.12 E-value=1.1e-09 Score=69.75 Aligned_cols=363 Identities=13% Similarity=0.026 Sum_probs=179.4 Q ss_pred ccccccchhhhcccccCCcccccccCcccH----HHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHH Q lcl|NC_019527. 73 MDSLCGPTYQFLNSAAGGLYAADIQPFPGY----QNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKI 147 (516) Q Consensus 73 ~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy----~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i 147 (516) +|..... ......++.|-+.. ...|. ++-..++ -....++|||..++-+.=+||+. .++ T Consensus 1 l~~~~~r-~~~~~~yY~g~~~~---~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~~--~d~---------- 64 (410) T protein:vir:95 1 MNLYQSR-VNLRYKHYAMQHYE---APTGITIPAHIRAKYQAVLGWAAKGVDSLADRLIFRAFAN--DDF---------- 64 (410) T ss_pred CCcchhh-HHHHHHHhcCCCCc---cccchhccHHHHhHHHhhcchhHHHHHHhHhhhccccccC--CCc---------- Confidence 2221111 11111122222111 11222 1222333 34778999999999777788752 111 Q ss_pred HHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeecccc--------- Q lcl|NC_019527. 148 KELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSA--------- 218 (516) Q Consensus 148 ~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~--------- 218 (516) .+.+-|.+-++.....++.+-+.+||-|++++.- +.+. +|. +++++|.+++... T Consensus 65 -~l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~-~~d~-~~~--------------i~~~sP~~~~~i~Dp~~~~~~~ 127 (410) T protein:vir:95 65 -NVTEIFDRNNPDIFFDSAILSALIGSCSFVYISK-GEDD-EVR--------------LQVIESSNATGVIDPITGLLVE 127 (410) T ss_pred -hHHHHHhhcChHHHHHHHHHHHHHhCceeEEEec-CCCC-ceE--------------EEEEcccceEEEEeCCCCceEE Confidence 2455677778888899999999999999988743 2221 121 3333343333110 Q ss_pred ---cccc----ccccccccCcc-eeEEee----eEe-ccc---eEEEecCCcchhhhhhccCCCCchHH-HHHHHHHHHH Q lcl|NC_019527. 219 ---YNAL----DPTAPDFYKPS-TWWVLG----REM-HAS---RLLTIITRPLPDMLKPAYNFSGISMS-QLAQPYVENW 281 (516) Q Consensus 219 ---~~~~----dp~s~~yg~P~-~y~v~g----~~i-H~S---Rli~~~~~~~p~~~k~~~~~~G~S~l-e~~~~~l~~~ 281 (516) +... .+..-.++.|. .|++.. ..+ |+= -+++|.+++ .....+|.|.+ +.++..+.++ T Consensus 128 al~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~------~l~~~~G~s~I~~~v~~l~da~ 201 (410) T protein:vir:95 128 GYAVLARDDYNRPTLEAYFEPNATHFIPKDGEPYSVTNETGIPLLVPVIHRP------DAVRPFGRSRITRAGMYYQKYA 201 (410) T ss_pred EEEEEEecCCCeEEEEEEEeCCcEEEEeeCCccccccCCCCCcceEEecccc------cCCccCCccccchhHHHHHHHH Confidence 0000 00111111111 111110 001 111 112232221 12345799965 7788888888 Q ss_pred HHHHHHHHHHHHHhCCceeee-cchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCC----cceeEEe-cccCCHHHH Q lcl|NC_019527. 282 LRTRQSVSDLVDKFSRTFLKT-NMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDS----EDIVQVN-TPLSGLADL 355 (516) Q Consensus 282 ~~~~~~~~~Ll~~~~~~v~k~-~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~----e~~e~~~-~~lsgl~d~ 355 (516) .++.....--..-++.+...+ ++.. .+...+.+...+ ..+..+..++ -++.+++ .++.+..+. T Consensus 202 ~r~~~~~~~~~e~~a~pqr~i~G~d~--d~~~~~~~~~~~---------~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~ 270 (410) T protein:vir:95 202 KRTLERADITAEFYSWPQKYILGLDP--DAEPMEKWKATV---------SSLLTISSSDKGVKPSVGQFTTASMSPFTEQ 270 (410) T ss_pred HHHHHHHHHHHHHhcchhheeeccCC--CCCcCchhhhhh---------hhheeccCCCCCCcceEEecCCCChHHHHHH Confidence 887766544444444443332 2111 000011111111 1133333221 1343443 466777788 Q ss_pred HHHHHHHHHhhhcCCceeeeccccccccccchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cCC---c Q lcl|NC_019527. 356 QSQSQEHMCSVSKIPAIKLTGISPSGLNASSEG---EIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGE--IDD---A 427 (516) Q Consensus 356 ~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~---D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~--~~~---d 427 (516) +.....+||+.++||..-|-|.+ -.++|++. ........++.+|+. +...++++.++.+.-..+. .++ + T Consensus 271 l~~l~~~~a~~s~lP~~~lg~~~--~NpsSa~Al~a~~~~L~~ka~~k~~~-fg~~l~~~~rla~~i~~~~~~~~~~~~~ 347 (410) T protein:vir:95 271 LRTAAAGFAGEMGLTLDDLGFVS--DNPSSVEAIKASHENLRLAGRKAQRS-LGAGLLNVAYVAACLRDEFRYTRSQFVR 347 (410) T ss_pred HHHHHHHHhhhcCCCHHHhcccc--CchhHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCCcccccce Confidence 89999999999999988775543 12256654 344556666676654 5788888888765443332 333 3 Q ss_pred ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHc--CCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCC Q lcl|NC_019527. 428 ITFKFKSLWQTSAKEESEIRFNKAQEAQIYITN--SVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGA 500 (516) Q Consensus 428 ~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~--gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~ 500 (516) +.+.|.|+..++....| ..|+++.+++++ |+++.+.+++.| ||+.- ++.....+....-+. T Consensus 348 ~~v~W~p~~d~~~~s~a----~~aDa~~Kl~~a~~g~~~~~~~~~~l------g~~~~--~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 348 TAVKWEPLFEADANTMT----MIGDGVVKLNQALPGYINAETIRDLT------GIAGD--MSAKPVVSEGGSNGE 410 (410) T ss_pred eeEEeeecCCcchhhHH----HHHHHHHHHHHhccCCccHHHHHHhc------CCChH--HHHHHHHHHHHhCCC Confidence 57889876555433332 256677777777 677777777776 44321 111100111111111 No 182 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=99.11 E-value=7.6e-10 Score=70.58 Aligned_cols=442 Identities=10% Similarity=0.007 Sum_probs=196.2 Q ss_pred CCChhhhHHHHhHHhhcCCCccccccCC-CCCCCccCC----C--ccchh-cccccccchhhhcccccCCccccccc-Cc Q lcl|NC_019527. 29 ARKLAMRRAVMKSMERRASDAATKWAPP-QLMPGVVPA----G--TTPAV-AMDSLCGPTYQFLNSAAGGLYAADIQ-PF 99 (516) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~gv~~~----~--~~~~~-a~ds~~~~~~~~~~~~~~~~~~~~~~-~f 99 (516) --...-.+.-+...+.... -+...+.- -..++..=+ . ....+ .+.........-...++.|.+.--.. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~ 79 (511) T protein:vir:96 1 MLKVNEFETDTDLRGNINY-LFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred Cccccchhhhhhhhhhhhh-hhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCc Confidence 0000000011111111000 00000000 000010000 0 00000 00000000011111122111110000 00 Q ss_pred ccHHHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEE Q lcl|NC_019527. 100 PGYQNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQI 178 (516) Q Consensus 100 ~gy~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i 178 (516) .....-+..+ .+.+++.||+..+.-++.+.+.+++.++.. .+.|+..++.-++...+.++.+...+||.|++ T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~-------~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~ 152 (511) T protein:vir:96 80 RKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDV-------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYE 152 (511) T ss_pred CcccccCcceeecchHHHHHHHHHhhhccCCceeecCchHH-------HHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEE Confidence 0000000011 246788999999999999999998765532 35577778888899999999999999999999 Q ss_pred EEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe-----------eeE-eccce Q lcl|NC_019527. 179 SINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL-----------GRE-MHASR 246 (516) Q Consensus 179 ~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~-----------g~~-iH~SR 246 (516) ++..+... .+ .+.+++|.++.|..-+. ....+-++ -.+|.+. -.. +.+.+ T Consensus 153 ~vy~ded~---------------~~-~i~~~~p~~~~~vydd~-~~~~~~~~-vr~~~~~~~d~~~~~~~~~~~iyt~~~ 214 (511) T protein:vir:96 153 LMIRNQDD---------------ET-RLYKSDAMSTFVIYDNT-IERNSIAG-VRYLRTKPIDKTDEDEVFTVDLFTSHG 214 (511) T ss_pred EEEeCCCC---------------ce-EEEEEccceeEEEEcCC-CCCceEEE-EEEEEeeeccccccceEEEEEEEeCCc Confidence 88764321 01 14445555554432111 11111111 0111110 001 11223 Q ss_pred EEEecCC--------------------cchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchh Q lcl|NC_019527. 247 LLTIITR--------------------PLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQ 306 (516) Q Consensus 247 li~~~~~--------------------~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~ 306 (516) +.+|... .+|-.. -.++-+|.|.++.+.+.+.+++.+....+..+..++..++...... T Consensus 215 i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~-~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~ 293 (511) T protein:vir:96 215 VYRYLTSRTNGLKLTPRENGFESHSFERMPITE-FSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL 293 (511) T ss_pred EEEEEecCCCcccccccccccccccCCceeeEE-ecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCc Confidence 3332211 011110 1224579999999999999999999998888887777665542211 Q ss_pred hhcCccHHHHH-HHHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeecccccccc Q lcl|NC_019527. 307 VLNGGEGGDVF-DRVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLN 383 (516) Q Consensus 307 ~l~~~~~~~l~-~r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln 383 (516) ........... .++..........+...-..++-++..+ ..+.+++...++.+.+.|...+++|-.-.-+. +| | T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~--~~-n 370 (511) T protein:vir:96 294 NLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNF--SG-T 370 (511) T ss_pred cCCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc--cc-c Confidence 11111111100 0110000000000000101112334444 45667899999999999999999997533221 22 4 Q ss_pred ccchHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHh--CCCc--C---CcceEEeCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019527. 384 ASSEGEIRSF---YDDISSVQQSYYFSPLDTMLKVIQLSK--WGEI--D---DAITFKFKSLWQTSAKEESEIRFNKAQE 453 (516) Q Consensus 384 atge~D~~~y---yd~I~~~Qe~~l~p~l~~l~~~l~~s~--~g~~--~---~d~~~~f~pL~~~sekEkAei~~~~a~a 453 (516) .||..=...| ...+ ..++..++..+++++++|+... -+.. + .++++.|++-...+.++.+++..+ T Consensus 371 ~Sg~Al~~~~~~l~~k~-~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~k---- 445 (511) T protein:vir:96 371 QSGEAMKYKLFGLEQRT-KTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYID---- 445 (511) T ss_pred chHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHH---- Confidence 5665422222 2223 3345667888988888776421 1222 1 268999999999999998876544 Q ss_pred HHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccccc--------------chhcCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 454 AQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEM--------------FDDDGADPYMPDPDVLPGEEG 515 (516) Q Consensus 454 ~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~--------------~~~e~~~~~~~~~~~~~~~e~ 515 (516) + .|++|.+.+.+.+.. .+..+.+++....|. ++..+.+.+..+.++.+.++. T Consensus 446 ---l--~G~iS~et~l~~l~~-----v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 446 ---S--GGKISQTTLMSLFSF-----FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred ---H--hccCChHHHHHhCCC-----CCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCcccccccccC Confidence 2 477777777665411 010011111111110 000001111111111111111 No 183 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=99.11 E-value=3.9e-10 Score=72.19 Aligned_cols=398 Identities=9% Similarity=-0.002 Sum_probs=192.3 Q ss_pred ccCCCccchh----cccccccchhhhcccccCCcccc-cccCc----ccHHH-----HHHHH-hCchhhhhhhhhhHHHh Q lcl|NC_019527. 62 VVPAGTTPAV----AMDSLCGPTYQFLNSAAGGLYAA-DIQPF----PGYQN-----LAALA-TRPEYRAFASTLSTELT 126 (516) Q Consensus 62 v~~~~~~~~~----a~ds~~~~~~~~~~~~~~~~~~~-~~~~f----~gy~l-----l~~y~-~~~i~r~iVd~~aed~~ 126 (516) ..-+....-+ .+-...-..+.....++.|-+.- ....+ .+... ..-.+ .+.+++.||+..+.-++ T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 80 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVA 80 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhee Confidence 1000000000 00000000011111122221100 00000 00000 00011 46889999999999999 Q ss_pred hCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeE Q lcl|NC_019527. 127 REGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGF 206 (516) Q Consensus 127 r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l 206 (516) -+++++++.++... +.|...++. +....+.++.+....||.+++++.++... .+ .+ T Consensus 81 G~p~~~~~~d~~~~-------~~l~~~~~~-~~~~~~~~l~~~~~~~G~a~~~~y~d~~~---------------~~-~~ 136 (470) T protein:vir:10 81 SVFPDIDVGKDADN-------KKIIDVLGD-DRALTLNGLLVDSSNAGRAWLHYWIDEDG---------------NF-RY 136 (470) T ss_pred ccceeeecCchHHH-------HHHHHHHhh-hHHHHHHHHHHHHhhcCeeEEEEEecCCC---------------ce-EE Confidence 99999987654322 334444443 56677888889999999999988775321 11 13 Q ss_pred EeecceeeccccccccccccccccCcceeEEe---------eeEec-cceEEEecC------------------------ Q lcl|NC_019527. 207 SNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL---------GREMH-ASRLLTIIT------------------------ 252 (516) Q Consensus 207 ~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~---------g~~iH-~SRli~~~~------------------------ 252 (516) .+++|.++.|..-.. +...+.++ -.+|... -..+| ..++.++.. T Consensus 137 ~~~~p~~~~~v~d~~-~~~~~~a~-ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (470) T protein:vir:10 137 GIIQPDQITPIYATT-LDNKLLGI-LRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYE 214 (470) T ss_pred EEEcccceEEEEcCC-CCCceEEE-EEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccc Confidence 334444444321100 00000000 0001000 00111 011111100 Q ss_pred -----------CcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHH Q lcl|NC_019527. 253 -----------RPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVE 321 (516) Q Consensus 253 -----------~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~ 321 (516) ..+|-.. -.++-.|.|.++.+.+.+.+++.+....+.-+..++..++....... .+..+... T Consensus 215 ~~~~~~~~~~~g~vPvv~-~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~---~~~~~~~~--- 287 (470) T protein:vir:10 215 TGQSNTLKHNFGRVPFIE-FSKNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGG---ADLHQFMN--- 287 (470) T ss_pred cccccccccCCCeeeEEE-eecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCc---cccchhhh--- Confidence 0011110 11234689999999999999999999999888888877766532211 11112211 Q ss_pred HHHHhcCCcceEEEec----CCccee--EEecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHH Q lcl|NC_019527. 322 MYVNMQSNLGLAVMDF----DSEDIV--QVNTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYD 395 (516) Q Consensus 322 ~~~~~~sn~g~~~id~----~~e~~e--~~~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd 395 (516) ..+. .++..+.. ++-+++ +...+.++....++.+.++|...+++|-.-..+ .| |+||..=...|.. T Consensus 288 ---~~~~-~~~i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~---~g-n~Sg~Alk~~~~~ 359 (470) T protein:vir:10 288 ---DLRK-YKSIKINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFE---SS-NASGVAIKMLYSH 359 (470) T ss_pred ---hhhh-cCeEeccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCccc---cc-cchHHHHHHHHHH Confidence 1111 22222221 122344 444666788899999999999999999643222 23 5677643333333 Q ss_pred HH--HHHHHHHHHHHHHHHHHHHHHHhC-CCc-CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHH Q lcl|NC_019527. 396 DI--SSVQQSYYFSPLDTMLKVIQLSKW-GEI-DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQ 471 (516) Q Consensus 396 ~I--~~~Qe~~l~p~l~~l~~~l~~s~~-g~~-~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~ 471 (516) .. .+..+..++..|++++++|..-.. +.. ..++++.|++-...+++|.|++..+. +|++|.+.+.+. T Consensus 360 l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~---------~g~iS~et~l~~ 430 (470) T protein:vir:10 360 LELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV---------ANYSSKEAVAKA 430 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccceeeEEeccCCCCCHHHHHHHHHHH---------hccCcHHHHHHh Confidence 32 244566788999999988865322 222 24789999999999999998876442 578888777765 Q ss_pred HHhhhccCCCCCChhhhccccccchhc--CCCCCCCCCCCCCCCC Q lcl|NC_019527. 472 LSDDPDSGWDNIDGDLEIVQPEMFDDD--GADPYMPDPDVLPGEE 514 (516) Q Consensus 472 l~~~~~~~~~~~d~~~e~~~~e~~~~e--~~~~~~~~~~~~~~~e 514 (516) +.. ....++..+....|..+.. ....+..+.+...+++ T Consensus 431 ~p~-----v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~dde~ 470 (470) T protein:vir:10 431 NPI-----VDDWQQELKDLAKDKEENDPYSNQADELNGKGVNDEQ 470 (470) T ss_pred CCC-----CCCHHHHHHHHHHHHHHHHHhhccccccCCCCCCCCC Confidence 411 0111111111111111110 0111112222222233 No 184 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=99.10 E-value=7.9e-10 Score=70.48 Aligned_cols=438 Identities=11% Similarity=0.075 Sum_probs=208.2 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) ||.||.|. . +... .+.-.. ...-++||..-+| +..-++..+.. T Consensus 8 lf~f~~k~-----------------~----------e~~~-~~~~~~-~~~s~~~p~~~dG--------a~~I~~~~~~~ 50 (521) T protein:vir:10 8 LLQPWMKD-----------------D----------EKRV-QSDLSD-RIDSFAVPDTADG--------AIEVDKQIDTT 50 (521) T ss_pred Hhhhhhhh-----------------h----------hhHH-hhhhcc-CccccccccCCCC--------ceeeccCCCcc Confidence 22222211 0 0000 000000 1112333432222 22223322222 Q ss_pred hhhcccccCCcccccccCc--ccHHHHHHHH---hCchhhhhhhhhhHHHhhCC-----CeeeeccccchhhhHHH-HHH Q lcl|NC_019527. 81 YQFLNSAAGGLYAADIQPF--PGYQNLAALA---TRPEYRAFASTLSTELTREG-----IEITSKDRTKAKEMASK-IKE 149 (516) Q Consensus 81 ~~~~~~~~~~~~~~~~~~f--~gy~ll~~y~---~~~i~r~iVd~~aed~~r~~-----~~i~~~~~~~~~~~~~~-i~~ 149 (516) +. .++..+.+ .+.+++ ..++|...|+ .++++..+|+.+++||+-+- +.+...+-+.++...++ ..+ T Consensus 51 ~~-~~~~~~~~--~~~~~~~~n~~eLI~~YR~ma~~pEvd~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~ee 127 (521) T protein:vir:10 51 AP-KTAIVQSV--LGYAPKIQNTKDLINQYRSLSKYHEVDNAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREE 127 (521) T ss_pred cc-ccchhhhh--hccccccchHHHHHHHHHHHhhccchhhHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHH Confidence 11 11111111 222334 4678888886 69999999999999997432 33333332222221111 233 Q ss_pred HHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccc---------c Q lcl|NC_019527. 150 LEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAY---------N 220 (516) Q Consensus 150 i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~---------~ 220 (516) ++..++-|++...-.+.+|.--+.|.-+.-..|+..++.. +|+.++.+||..+..+.. + T Consensus 128 F~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~------------GI~Elr~lDPr~i~~vr~i~k~~~~~~~ 195 (521) T protein:vir:10 128 FRTILKLLKFEREGKRHFRRWYVDSRIYFHKMIDPARPKD------------GIKELRLLDPRNVEYYRVNLKSNENGND 195 (521) T ss_pred HHHHHHHhccchhhhHHHhhheeeeeEEEEEEeeCCCccc------------cceeeeeeCCcceeeeeeecCCCCCcch Confidence 4444444567777777777666666644444455444322 234455555554432211 0 Q ss_pred ccccccccc-cC---cceeEEe-----eeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 221 ALDPTAPDF-YK---PSTWWVL-----GREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDL 291 (516) Q Consensus 221 ~~dp~s~~y-g~---P~~y~v~-----g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~L 291 (516) ...-....| |. +.+|-.+ +.+|+.+-+.+.+.. +........+|.|+.+...+.+.-....+ -+ T Consensus 196 v~~~~~e~f~Y~~~~~~~~~~~g~~~~~vkI~~daI~y~hSG-----L~d~~~~~i~syLhkAiKp~NQLkm~EDA--lV 268 (521) T protein:vir:10 196 VYKGVKEFFTYGATEDNRYNISGNSNNLVQIPIDAIVYSHSG-----KVDIDGKTIVGYLHNVIKPANQLKMLEDA--MV 268 (521) T ss_pred hhccceeeeeeccCCCceecCCCCCCcceeechhheeeeccc-----ceeCCCCceeccchhhhHhHHhhHHHHhh--HH Confidence 000000000 11 1233333 245777554444322 22344566888998888777765444433 34 Q ss_pred HHHhC----CceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecC--------------------------Ccc Q lcl|NC_019527. 292 VDKFS----RTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFD--------------------------SED 341 (516) Q Consensus 292 l~~~~----~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~--------------------------~e~ 341 (516) +++.+ -.|+-+|+.++-.. ..++..+-+ ++.++ +-++-|+. +-+ T Consensus 269 IYRitRAPeRRvFYIDvGnlpk~-KAeqYl~~i--M~k~k---NklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTE 342 (521) T protein:vir:10 269 IYRITRAPERRVFYIDVGTMPNK-KATQHLNNV--MQGLK---NRVVYDSSTGKVKNSSNNLAMTEDYWLMRRDGKATTE 342 (521) T ss_pred HHhhhccccceEEEEecCCCCch-hHHHHHHHH--HHhcC---ceEEEeccCceeccchhhhhhHhhhcccccCCCCccc Confidence 45444 34555554443211 111211111 11111 01111111 113 Q ss_pred eeEEe--cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchH----HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 342 IVQVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEG----EIRSFYDDISSVQQSYYFSPLDTMLKV 415 (516) Q Consensus 342 ~e~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~----D~~~yyd~I~~~Qe~~l~p~l~~l~~~ 415 (516) ++++. -+|+.++|+ ..|+..+-.+.++|.++|-.+ .+|+|-+..+ |.-.|..+|.+.|..+ ..++..+++. T Consensus 343 I~TLpggqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e-~~~f~~Gr~~EItRDEikF~KFI~rLR~rF-s~~f~~~L~~ 419 (521) T protein:vir:10 343 VSTLPGAQSMGEMDDV-RWFNRKLYESMKIPLSRLPQE-GAGVTFGAGNDITRDELQFTKYIRGLQQQF-EPIFLNPLRT 419 (521) T ss_pred eeeccccCCcChHHHH-HHHHHHHHHHhCCCccccCCC-CCceecccccchhHHHHHHHHHHHHHHHHH-HHHHHHHHHH Confidence 44442 235556554 589999999999999998544 3455433322 4567999999998654 4555555442 Q ss_pred H-HH------HhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH---cC-CCCHHHHHHHHHhhhccCCCCCC Q lcl|NC_019527. 416 I-QL------SKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT---NS-VIDPSEARQQLSDDPDSGWDNID 484 (516) Q Consensus 416 l-~~------s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~---~g-vi~~~e~r~~l~~~~~~~~~~~d 484 (516) = .+ ..|-.+-+++.|+|..=.--+|-..+|+...+.++++.+-- .| .++.+-+++.+-... | T Consensus 420 qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~dyi~k~ILr~t-------D 492 (521) T protein:vir:10 420 NLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHEYVMKNILRMS-------D 492 (521) T ss_pred hhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchHHHHHHHhcCC-------H Confidence 1 11 12333445788999988888899999999999998888733 33 678888887653322 2 Q ss_pred hhhhccccccchhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 485 GDLEIVQPEMFDDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 485 ~~~e~~~~e~~~~e~~~~~~~~~~~~~~~e 514 (516) +++...+..+.++. .++--++|+++.+.= T Consensus 493 eeik~~~k~I~~E~-~~~~~~~p~~e~~df 521 (521) T protein:vir:10 493 EDIKTEREKIDGEL-KDSVYKNPEDPMEEF 521 (521) T ss_pred hHHHHHHHHHHHhh-hCCCCCCCcchhhcC Confidence 23322222222211 111111111110000 No 185 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=99.10 E-value=1.9e-10 Score=73.88 Aligned_cols=335 Identities=10% Similarity=0.066 Sum_probs=160.9 Q ss_pred chhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccc-cccCCCCCCCccCCCccchhcccccccchhh Q lcl|NC_019527. 4 FDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAAT-KWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQ 82 (516) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~ 82 (516) -+||+ ++... .....+ +..... ..+..+.+ -|--|++ | +|+. .... T Consensus 1 ~~~~~--~~~~~----------~~~~~~-----~~~~~~-~~~~~~~~~~~~~p~~---v----------~~~~--~~~~ 47 (351) T protein:vir:78 1 MSKRR--SRAPR----------TFAAAP-----NPSAGS-AAPARAEVFTFDDPTP---V----------MNRA--EILD 47 (351) T ss_pred CCCCC--CCCCC----------CCCCCC-----chhhhh-cccceeEEEEcCCcee---e----------cCcc--hhhh Confidence 22111 11000 000000 000000 00111100 1111110 1 1111 0111 Q ss_pred hcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHH Q lcl|NC_019527. 83 FLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGI 162 (516) Q Consensus 83 ~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~ 162 (516) +......+. +++ .++.-..|..+++.++....+|-.-+.+.++ ++.- . + +--+.. T Consensus 48 ~~~~~~~~~-~~~-pp~~~~~la~~~~~~~~h~~~l~~k~n~l~~-~~~P-n---~------------------~~t~~~ 102 (351) T protein:vir:78 48 YVECWSNGE-WFE-PPVSFAGLAKSFRASTHHSSALFFKANVLAS-TFRP-H---R------------------WLSRHA 102 (351) T ss_pred hhhhhccCc-eec-CCCCHHHHHHHHhhhHhhhhhhhhhhhHHhh-cccC-C---C------------------CCCHHH Confidence 111122221 111 1122234677777777777777665543333 2210 0 0 001223 Q ss_pred HHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe--e- Q lcl|NC_019527. 163 IQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL--G- 239 (516) Q Consensus 163 l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~--g- 239 (516) |.+++....+||.+++++..++ .|.+.+|.++.+.++..... .| .+|++. + T Consensus 103 f~~~~~d~ll~Gnay~~~~rn~---------------~G~~~~L~pl~~~~v~~~~~--~~---------~~~~~~~~~~ 156 (351) T protein:vir:78 103 FERWALDFLTFGNGYLERRRNM---------------VGGTLRLEPALAKYVRRKAD--FS---------GFVYVNGWQE 156 (351) T ss_pred HHHHHHHHHhcCCeEEEEEECC---------------CCCEEEEEEecCcceEEeee--CC---------eEEEEecCCe Confidence 5555555678999998876532 13345577777666654221 01 123332 1 Q ss_pred -eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce--eeecchhhhcCccHHHH Q lcl|NC_019527. 240 -REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF--LKTNMAQVLNGGEGGDV 316 (516) Q Consensus 240 -~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v--~k~~~~~~l~~~~~~~l 316 (516) ..+.+..|||+.... +....+|+|.+..+...+..-..+......+..+....- +++. ...++.++.+.+ T Consensus 157 ~~~~~~~eVihir~~~------~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pggIl~~~-~~~ls~e~~~~l 229 (351) T protein:vir:78 157 RHEFAPDSVFQLVRPD------INQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMT-DAAQKQDDVDNM 229 (351) T ss_pred EEEEccccEEEEcCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec-CCCCCHHHHHHH Confidence 346788899986532 234568999999999999888888877777777766432 3322 123555555667 Q ss_pred HHHHHHHHHhcCCcceEEEecC---CcceeEEecccCC----HHHHHHHHHHHHHhhhcCCceeeecccccccc--ccch Q lcl|NC_019527. 317 FDRVEMYVNMQSNLGLAVMDFD---SEDIVQVNTPLSG----LADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSE 387 (516) Q Consensus 317 ~~r~~~~~~~~sn~g~~~id~~---~e~~e~~~~~lsg----l~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge 387 (516) .+.++.. .+..|.+.+++... .+.++...++.+. +-++.....++||++.+||-. |+|+.+.+-. ++-+ T Consensus 230 r~~~~~~-~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~-llGi~~~~t~~~sn~e 307 (351) T protein:vir:78 230 RDALKNA-KGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQ-LLGIVPSNSGGFGTPD 307 (351) T ss_pred HHHHHHh-cCcccccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhcccCCCCCCcccHH Confidence 7766643 45566665554321 2334444444433 334456667789999999975 5587654321 2223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHH Q lcl|NC_019527. 388 GEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEES 444 (516) Q Consensus 388 ~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkA 444 (516) ...+ .+.++.|.|.++++.++.- . ++.+ .|+|++---+.-.++| T Consensus 308 ~~~~-------~f~~~~l~P~~~~iee~n~--~---l~~~-~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 308 TAAR-------VFGRNEIRPLQARFAELND--W---LGDE-VVRFDDYEIPPAPVAA 351 (351) T ss_pred HHHH-------HHHHHHHHHHHHHHHHHHh--h---cCcc-ceecChhhhccccccC Confidence 2333 3344567787777765332 1 2223 2677654333333333 No 186 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=99.09 E-value=2e-09 Score=68.30 Aligned_cols=441 Identities=11% Similarity=0.126 Sum_probs=207.7 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) |+.||+|-=.... +.........++||..-+|... +. +..+. T Consensus 12 ~~~~~~~~d~~~~-----------------------------~~~~~~~~~s~~~p~~~dGa~~------i~--~~~~~- 53 (524) T protein:vir:98 12 FFKNFAREDEIEL-----------------------------EQQLKNDTGSVAPPKNNDGAYE------IE--TDLNN- 53 (524) T ss_pred HhhhhhhhhhhhH-----------------------------hhhhcCCcccccCCCCCCCcee------ec--CCCCc- Confidence 5555555211000 0111122233556654444311 11 10000 Q ss_pred hhhcccccCCcccccccCcccHHHHHHHH---hCchhhhhhhhhhHHHhh-----CCCeeeeccccchhhhHHH-HHHHH Q lcl|NC_019527. 81 YQFLNSAAGGLYAADIQPFPGYQNLAALA---TRPEYRAFASTLSTELTR-----EGIEITSKDRTKAKEMASK-IKELE 151 (516) Q Consensus 81 ~~~~~~~~~~~~~~~~~~f~gy~ll~~y~---~~~i~r~iVd~~aed~~r-----~~~~i~~~~~~~~~~~~~~-i~~i~ 151 (516) +.+++.+...+...+..--..++|...|+ .++++..+|+.+++||+- ..+++...+.+-.+...++ ..+++ T Consensus 54 ~~~~g~~~~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~ 133 (524) T protein:vir:98 54 QKYAGVFQQFYSGQDPAIQNKEQLINTYRGIMSYPEVENAVSEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFD 133 (524) T ss_pred ceecceeeeeccccccccchHHHHHHHHHHHhhccchhhHHHhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHH Confidence 11111111111111222224678888886 699999999999999973 2333333222221211111 23344 Q ss_pred HHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeecccccc----------- Q lcl|NC_019527. 152 EACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYN----------- 220 (516) Q Consensus 152 ~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~----------- 220 (516) ..++-|++...-.+.+|.--+.|.-+.-..++. ++. + +|+.++.+||..+...... T Consensus 134 ~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~-~~~-----------k-GI~ELr~lDPr~i~~vr~~~~~~~~~~~~v 200 (524) T protein:vir:98 134 NVLNIYDFDNMGARLFRDWYVDSRIYFHKIMHK-DES-----------K-GIRELRQLDPRCMELIRESITETLDGGVKV 200 (524) T ss_pred HHHHHhccchhhhHHHhhhhhcceeEEEEEEcC-CCC-----------c-ceeeeeeeCCccceeeeeccccccccchhh Confidence 444455777777777777777777666666652 211 2 2555666666655432111 Q ss_pred --------ccccccccc-cCcceeEEe-eeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 221 --------ALDPTAPDF-YKPSTWWVL-GREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSD 290 (516) Q Consensus 221 --------~~dp~s~~y-g~P~~y~v~-g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~ 290 (516) ..+|...+| +-+.++..+ +.+||.+-+.+.+..-++ . +..=+|.|+.+...+.+.-....+ - T Consensus 201 ~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAIvy~hSGL~d-~-----~~~iisyLhkAiKp~NQLkm~EDA--l 272 (524) T protein:vir:98 201 FRGYREFFVYSAPKAGYTYNGQIYQANQKIKIPRSAIVYAHSGLED-C-----SNNIIGYLHRAVKPANQLRLLEDA--M 272 (524) T ss_pred ccceeeeeeeccCCCccccccceecCCCceeechhheeeeccCccc-C-----CCCeeeehhHhhHhHHhhHHHHhh--H Confidence 111111111 011111111 235677666655443221 1 111257888877777665444333 3 Q ss_pred HHHHhC----CceeeecchhhhcCccHHHHHHHHHHHHHhc------CCcce-------E-------EE---ecCCccee Q lcl|NC_019527. 291 LVDKFS----RTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQ------SNLGL-------A-------VM---DFDSEDIV 343 (516) Q Consensus 291 Ll~~~~----~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~------sn~g~-------~-------~i---d~~~e~~e 343 (516) ++++.+ -.|+-+|+.++-.. ..++..+ ..++.++ ..+|- + +- ++.+-+++ T Consensus 273 VIYRitRAPeRRvFYIDvGnlPk~-KAeqYl~--~im~k~kNklvYDa~TGevrddrk~msMlEDyWLpRReGgrgTEIt 349 (524) T protein:vir:98 273 VIYRITRAPERRVFYIDVGQMGGN-KATQYVN--NIAQGLKNRVVYDARTGTVKNQQNNLSMTEDYWLMRRDGKAITEVS 349 (524) T ss_pred HHHhhhccccceEEEEecCCCCch-hHHHHHH--HHHHhcCceeEeeccCceeeccccccchhhhhcccccCCCCcccee Confidence 455544 34555554443211 1112111 1111111 01110 0 00 01112344 Q ss_pred EEe--cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccc--cch--HHHHHHHHHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_019527. 344 QVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNA--SSE--GEIRSFYDDISSVQQSYYFSPLDTMLKV-I 416 (516) Q Consensus 344 ~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glna--tge--~D~~~yyd~I~~~Qe~~l~p~l~~l~~~-l 416 (516) ++. -+|+.++|+ ..|+..+-.+.++|.++|-- +.+|+|- ++| -|.-.|..+|.+.|..+ ..++..+++. | T Consensus 350 TLpggqnlgem~DV-~YF~kkLy~aLnVP~sRl~~-~~~~f~~Gr~~EItRDEiKF~KFI~rLR~rF-s~lf~~~L~~qL 426 (524) T protein:vir:98 350 TLPGGQNFSDMDDI-KWFNRKLYEALRVPLSRMPR-DDGGMQIGGGGEITRDELKFSKFIRTLQIQF-SPVLSDPLKTNL 426 (524) T ss_pred eccccCCcChHHHH-HHHHHHHHHHhCCCceeccC-CCCccccccccchhHHHHHHHHHHHHHHHHH-HHHHHHHHHHhh Confidence 442 235556554 58999999999999999931 2234433 222 15567999999998654 4555554442 1 Q ss_pred HHH------hCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH-cC-CCCHHHHHHHHHhhhccCCCCCChhhh Q lcl|NC_019527. 417 QLS------KWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT-NS-VIDPSEARQQLSDDPDSGWDNIDGDLE 488 (516) Q Consensus 417 ~~s------~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~-~g-vi~~~e~r~~l~~~~~~~~~~~d~~~e 488 (516) .+. .|-.+-+.+.|+|..=.--+|-..+|+...+.++++.+-. .| .++.+-+++.+-... |+++. T Consensus 427 ilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~t-------Deei~ 499 (524) T protein:vir:98 427 IAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKYIMKEILRMS-------DEDID 499 (524) T ss_pred hhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHHHhccC-------HHHHH Confidence 111 1222334688999988888899999999999888887654 33 678788877653221 23332 Q ss_pred ccccccchhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 489 IVQPEMFDDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 489 ~~~~e~~~~e~~~~~~~~~~~~~~~e 514 (516) ..+..+ ++|..++-.++|+++.+.= T Consensus 500 ~~~k~I-~~E~k~~~~~~p~~e~~~f 524 (524) T protein:vir:98 500 EQAKLI-EEESKEERFKNPEAEEENF 524 (524) T ss_pred HHHHHH-HHHHhCCCCcCCccccccC Confidence 211111 1111111111111111111 No 187 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=99.09 E-value=2.4e-09 Score=67.80 Aligned_cols=432 Identities=14% Similarity=0.146 Sum_probs=196.5 Q ss_pred hHHHH-hHHhhc-CCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHH--- Q lcl|NC_019527. 35 RRAVM-KSMERR-ASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALA--- 109 (516) Q Consensus 35 ~~~~~-~~~~~~-~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~--- 109 (516) -.+|. =+++++ ......-|+||...+|..+ +. + +++.+.....+..--..++|...|+ T Consensus 1 m~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~------i~--~---------~~~~~~~~~~e~~~~~~~eLI~~YR~ma 63 (533) T protein:vir:10 1 MSQLFGFSLERAKKAPKGPSFVQKDNLDGSQP------VS--G---------GGYYGYTVDFDGQVRNEYQLISRYREMV 63 (533) T ss_pred CccccccccccccccccCCCCCCCCcccccce------ee--c---------ccccceeeecccccchHHHHHHHHHHHh Confidence 01111 111111 1111223455544433321 10 0 1111111112223234689988886 Q ss_pred hCchhhhhhhhhhHHHhhCC-----CeeeeccccchhhhHHHHHHHHHHHH----hcChhHHHHHHHHhcccceeeEEEE Q lcl|NC_019527. 110 TRPEYRAFASTLSTELTREG-----IEITSKDRTKAKEMASKIKELEEACE----YYGVMGIIQKAAEHDCFFGRGQISI 180 (516) Q Consensus 110 ~~~i~r~iVd~~aed~~r~~-----~~i~~~~~~~~~~~~~~i~~i~~~~~----~l~~~~~l~ea~~~~rlyG~a~i~i 180 (516) .++++..+|+.+++||+-+- +.+...+.+-++ ..-++|.++++ -|++...-.+.+|.--+.|.-+.-. T Consensus 64 ~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~---~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHk 140 (533) T protein:vir:10 64 LQPECDSAVDDIVNETICGNFDDVPVSVELSNLKVSD---KIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHK 140 (533) T ss_pred hccchhhHHHHhhcceeeecCCCceEEEEecccccch---HHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEE Confidence 68999999999999997422 233332222222 11233444444 4455666555555555555444334 Q ss_pred EecCCCcccCcccccccccccceeeEEeecceeeccccccc---ccc------ccccccC--------cceeEE---eee Q lcl|NC_019527. 181 NIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNA---LDP------TAPDFYK--------PSTWWV---LGR 240 (516) Q Consensus 181 ~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~---~dp------~s~~yg~--------P~~y~v---~g~ 240 (516) .|+..++. .+|+.|+.+||..+.+...-. .|. ....|+. |..+.. ++. T Consensus 141 iid~~~pk------------~GI~ELr~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~~~~~~v 208 (533) T protein:vir:10 141 VIDPDNPQ------------GGLIELRYIDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKNSTTQGL 208 (533) T ss_pred EecCCCcc------------ccceeeeeccccceeeeeeeeccCCCccceeecchhhhccceeeeeeccccccccCCCce Confidence 45444332 234455666666555432110 000 0111222 222211 234 Q ss_pred EeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhC----CceeeecchhhhcCccHHHH Q lcl|NC_019527. 241 EMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFS----RTFLKTNMAQVLNGGEGGDV 316 (516) Q Consensus 241 ~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~----~~v~k~~~~~~l~~~~~~~l 316 (516) +|+.+=+...+..- .+.....=+|.|+.+...+.+.-....+ -++++.+ -.|+-+|+.++-.. ..++. T Consensus 209 kI~~dAI~y~hSGl-----~d~~~~~i~syLhkAiKp~NQLkm~EDA--lVIYRitRAPeRRvFYIDVGnLPk~-KAeqY 280 (533) T protein:vir:10 209 KIAPDSICYVHSGI-----MDLNKNMTLSHLHKAIKAVNQLRMIEDS--LVIYRLSRAPERRIFYIDVGNLPKN-KAEQY 280 (533) T ss_pred ecchhheeeeeccc-----eeCCCCceeccchHhHHHHHhhHHHHhh--HHHHhhhccccceEEEEecCCCCch-hHHHH Confidence 57765444433221 1222233357777777666665444332 3444443 34555554443211 11111 Q ss_pred HHHHHHHHHhcCCcceEEEecC--------------------------CcceeEEe--cccCCHHHHHHHHHHHHHhhhc Q lcl|NC_019527. 317 FDRVEMYVNMQSNLGLAVMDFD--------------------------SEDIVQVN--TPLSGLADLQSQSQEHMCSVSK 368 (516) Q Consensus 317 ~~r~~~~~~~~sn~g~~~id~~--------------------------~e~~e~~~--~~lsgl~d~~~~~~~~iaaas~ 368 (516) .+ ..+++++ +-++-|+. +-+++++. -+|+.++|+ .+|+..+-.+.+ T Consensus 281 lr--~iM~k~K---NklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV-~YF~kKLY~aLn 354 (533) T protein:vir:10 281 LR--EVMGRYR---NKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEITTLPGGQNLGELEDV-KYFQKKLYKSLN 354 (533) T ss_pred HH--HHHHhcc---ceEEEeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcChHHHH-HHHHHHHHHHhC Confidence 11 1111111 01111111 11344442 235556554 589999999999 Q ss_pred CCceeeeccccccccccchH----HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH------HhCCCcCCcceEEeCCCCC Q lcl|NC_019527. 369 IPAIKLTGISPSGLNASSEG----EIRSFYDDISSVQQSYYFSPLDTMLKVI-QL------SKWGEIDDAITFKFKSLWQ 437 (516) Q Consensus 369 IP~t~L~G~sp~Glnatge~----D~~~yyd~I~~~Qe~~l~p~l~~l~~~l-~~------s~~g~~~~d~~~~f~pL~~ 437 (516) +|.++|-. .+|+|.+.-+ |.-.|..+|.+.|..+ ..++..+++.= .+ ..|-.+-+++.|+|..=.- T Consensus 355 VP~SRl~~--e~~f~~Gr~~EItRDEiKF~KFI~RLR~rF-s~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~ 431 (533) T protein:vir:10 355 VPGSRLET--ETTFNVGRAAEITRDEVKFQKFVARLRKRF-SELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNY 431 (533) T ss_pred CCccccCC--CCcccccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecch Confidence 99999943 3576653222 5567999999998654 45555554421 11 1233344578899988888 Q ss_pred CCHHHHHHHHHHHHHHHHHHHH--cCCCCHHHHHHHH------------H---hh-hccCCCCCChhhhccc-cccchhc Q lcl|NC_019527. 438 TSAKEESEIRFNKAQEAQIYIT--NSVIDPSEARQQL------------S---DD-PDSGWDNIDGDLEIVQ-PEMFDDD 498 (516) Q Consensus 438 ~sekEkAei~~~~a~a~~~~~~--~gvi~~~e~r~~l------------~---~~-~~~~~~~~d~~~e~~~-~e~~~~e 498 (516) .+|-..+|+...+.++++.+-. .-.+|.+-+++.+ . .. .+..|..-+.+++... .-.++.+ T Consensus 432 f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~ 511 (533) T protein:vir:10 432 FAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVEMKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAG 511 (533) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcC Confidence 8888899999988888876521 1133444444332 1 11 1222332222222111 1111111 Q ss_pred C-----CCCCCCCCCCCCCCCC Q lcl|NC_019527. 499 G-----ADPYMPDPDVLPGEEG 515 (516) Q Consensus 499 ~-----~~~~~~~~~~~~~~e~ 515 (516) + ..|.+|+|+.+..-+= T Consensus 512 ~~~~~~~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:10 512 GAPAEEVAPEGPDPSDERKAEF 533 (533) T ss_pred CcccccCCCCCCCcchhhccCC Confidence 1 1122222222222111 No 188 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=99.09 E-value=1.1e-09 Score=69.73 Aligned_cols=433 Identities=14% Similarity=0.132 Sum_probs=196.0 Q ss_pred hHHHHh-HH--hhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHH-- Q lcl|NC_019527. 35 RRAVMK-SM--ERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALA-- 109 (516) Q Consensus 35 ~~~~~~-~~--~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~-- 109 (516) -.++.. ++ +.+.......++||..-+|+.+ + . .+++++.+...+...+..++|...|+ T Consensus 1 m~~lfgf~~~~~~~~~~~~~s~~~p~~ddg~~~------~--------~---~~g~~~~~~~~~~~~~~~~eLI~~YR~m 63 (558) T protein:vir:10 1 MAKLFGFSIEETQKKSTSIISPVPKNNEDGVDN------F--------I---SSGFYGQYVDIEGAYRSEYDLIRRYREM 63 (558) T ss_pred CcchhcchhhhhhhhccCCccccCCCccccccc------e--------e---ccceeeeeecccchhhhHHHHHHHHHHH Confidence 011111 11 1122222334555544443300 1 1 01111111122334456788988886 Q ss_pred -hCchhhhhhhhhhHHHhhC-----CCeeeeccccchhhhHHHHHHHHHHHH----hcChhHHHHHHHHhcccceeeEEE Q lcl|NC_019527. 110 -TRPEYRAFASTLSTELTRE-----GIEITSKDRTKAKEMASKIKELEEACE----YYGVMGIIQKAAEHDCFFGRGQIS 179 (516) Q Consensus 110 -~~~i~r~iVd~~aed~~r~-----~~~i~~~~~~~~~~~~~~i~~i~~~~~----~l~~~~~l~ea~~~~rlyG~a~i~ 179 (516) .++++..+|+.+++||+-+ .+.+..++-+.++ ..-++|.++++ -|++...-.+.+|.--+.|.-+.- T Consensus 64 a~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~---~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfH 140 (558) T protein:vir:10 64 ALHPEADGAIEDVVNEAIVSDLYDSPVEVELSNLNASN---TLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYL 140 (558) T ss_pred hhccchhhHHHHhhcceeEecCCCceEEEEecccCcch---HHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEE Confidence 6999999999999999742 2233332222211 22234444444 456677777777776677766555 Q ss_pred EEecCCCcccCcccccccccccceeeEEeecceeecccccc------------ccc-------cccccc--cCccee--- Q lcl|NC_019527. 180 INIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYN------------ALD-------PTAPDF--YKPSTW--- 235 (516) Q Consensus 180 i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~------------~~d-------p~s~~y--g~P~~y--- 235 (516) ..|+..++.. +|+.++.+||..+...... ..+ |.-..| |.|... T Consensus 141 Kiid~k~pk~------------GI~ELr~lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~ 208 (558) T protein:vir:10 141 KVIDTKNPQE------------GIQDLRYIDPLKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPT 208 (558) T ss_pred EEEeCCCccc------------cceeeeeeCcccceeeeeeccccccccceeeeecccceeeccceeEeeeecCCccccc Confidence 5566554432 2334444444444321110 000 000011 112111 Q ss_pred ----EEe---eeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhC----Cceeeecc Q lcl|NC_019527. 236 ----WVL---GREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFS----RTFLKTNM 304 (516) Q Consensus 236 ----~v~---g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~----~~v~k~~~ 304 (516) ++. +.+|+.+=+...+. -+ .......=.|.|+.+...+.+.-....+ -++++.+ ..|+-+|+ T Consensus 209 ~~~~~~~~~~~vkI~~dAI~y~hS-GL----~d~~~~~i~syLhkAIKp~NQLkmlEDA--lVIYRitRAPERRvFYIDV 281 (558) T protein:vir:10 209 GMVGQMGGKNSIKIAKDSITMCTS-GL----VDRNKNRVLSYLHKAIKALNQLRMIEDS--LVIYRLSRAPERRIFYIDV 281 (558) T ss_pred ccceeecCCCceeechhheeeecc-cc----eecCCCeeeecchHhhHhHHhhHHHHhh--HHHHhhhccccceEEEEec Confidence 111 13444443332221 11 1112223356777766666554433332 3444433 24444454 Q ss_pred hhhhcCccHHHHHHHHHHHHHhcCCcceEEEecC--------------------------CcceeEEe--cccCCHHHHH Q lcl|NC_019527. 305 AQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFD--------------------------SEDIVQVN--TPLSGLADLQ 356 (516) Q Consensus 305 ~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~--------------------------~e~~e~~~--~~lsgl~d~~ 356 (516) .++-.. ..++..+ ..+++++ +-++-|+. +-+++++. -+|+.++|+ T Consensus 282 GnLPk~-KAeqYlr--~iM~k~K---NklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnLgem~DV- 354 (558) T protein:vir:10 282 GNLPKV-KAEQYLK--EVMSRYR---NKLVYDANTGEVRDDRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGELSDV- 354 (558) T ss_pred CCCCch-hHHHHHH--HHHHhcc---ceEEEeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcchHHHH- Confidence 433211 1111111 0111111 01111111 11344432 245566665 Q ss_pred HHHHHHHHhhhcCCceeeeccccccccccchH----HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH------HhCCCcC Q lcl|NC_019527. 357 SQSQEHMCSVSKIPAIKLTGISPSGLNASSEG----EIRSFYDDISSVQQSYYFSPLDTMLKVI-QL------SKWGEID 425 (516) Q Consensus 357 ~~~~~~iaaas~IP~t~L~G~sp~Glnatge~----D~~~yyd~I~~~Qe~~l~p~l~~l~~~l-~~------s~~g~~~ 425 (516) .+|+..+-.+.++|.++|-.+ +|+|-+.-+ |.-.|..+|.+.|..+ ..++..+++.= .+ ..|-.+- T Consensus 355 ~YF~kKLy~aLnVP~SRl~~e--~~f~~Gr~~EItRDEiKF~KFI~RLR~rF-s~lF~~~Lk~qLilKgiit~eeW~~i~ 431 (558) T protein:vir:10 355 DYFQKKLYRALGVPESRIAAE--GGFNLGRSSEILRDELKFAKFVGRLRKRF-AAMFNDMLKTQLVLKNIVTPEDWKTME 431 (558) T ss_pred HHHHHHHHHHhCCCccccCCC--CcccccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhhccCCCHHHHHHHh Confidence 589999999999999999544 466553332 4567999999998654 45555554421 11 1233334 Q ss_pred CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH-cC-CCCHHHHHHHHHh---------------h-hccCCCCCChhh Q lcl|NC_019527. 426 DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT-NS-VIDPSEARQQLSD---------------D-PDSGWDNIDGDL 487 (516) Q Consensus 426 ~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~-~g-vi~~~e~r~~l~~---------------~-~~~~~~~~d~~~ 487 (516) +++.|+|..=.-.+|-..+|+...+.++++.+-. .| .+|.+-+++.+-. . .+..|..-++.. T Consensus 432 ~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~ 511 (558) T protein:vir:10 432 DHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQID 511 (558) T ss_pred hcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccC Confidence 5788999988888898999999998888877532 12 3455555443211 1 111222211100 Q ss_pred hccccccchh-cCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 488 EIVQPEMFDD-DGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 488 e~~~~e~~~~-e~~~~~~~~~~~~~~~e~t 516 (516) ..+..-++.. +...+..+.+-..|..++| T Consensus 512 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 541 (558) T protein:vir:10 512 PITGEPLPQEGDPAMEGMGEQPVDPDLEAQ 541 (558) T ss_pred hhhccccCccCCchhccCCCCCcccccccc Confidence 0100111110 1011111111122223333 No 189 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=99.08 E-value=4.1e-10 Score=72.05 Aligned_cols=343 Identities=11% Similarity=0.076 Sum_probs=160.7 Q ss_pred cccccccccCCCc---------------CCCCCChhhhHHHHh-HHh-hcCCCccc-cccCCCCCCCccCCCccchhccc Q lcl|NC_019527. 13 VADKLADAARAEE---------------QEKARKLAMRRAVMK-SME-RRASDAAT-KWAPPQLMPGVVPAGTTPAVAMD 74 (516) Q Consensus 13 ~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~-~~~-~~~~~~~~-~~~~~~~~~gv~~~~~~~~~a~d 74 (516) +... .+++.++. +++.+......+... ... ..+..+.+ -|--|+ . + +| T Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~fg~p~---~---------v-~~ 66 (376) T protein:vir:10 1 MPAR-DRPRAARRRRHSFIFIHGVLRMSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPT---P---------V-MN 66 (376) T ss_pred CCCC-ccchhhhhhcccchhhcccccchhccCCCcccchhhhhHhhhccCcceeEEEEcCCce---e---------c-cC Confidence 0000 00111110 011110000000000 000 00000000 010010 0 0 11 Q ss_pred ccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHH Q lcl|NC_019527. 75 SLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEAC 154 (516) Q Consensus 75 s~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~ 154 (516) +. ....+......+. +++ .++.-+.|..+++.++....+|..-+.+.++ ++.= . + T Consensus 67 ~~--~~~~~~~~~~~~~-~~~-pp~~~~~La~~~~~~~~h~s~l~~k~n~l~~-~~~P-n---p---------------- 121 (376) T protein:vir:10 67 RA--EILDYVECWSNGE-WFE-PPVSFAGLAKSFRASTHHSSALFFKANVLAS-TFRP-H---R---------------- 121 (376) T ss_pred cc--hhhhhhhhhhcCc-eec-CCCCHHHHHHHHhhhHHhhhhHHHHhHHHHh-ccCC-C---C---------------- Confidence 10 0111111122221 121 1222345778888888888887776665443 2210 0 0 Q ss_pred HhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcce Q lcl|NC_019527. 155 EYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPST 234 (516) Q Consensus 155 ~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~ 234 (516) +--+..|.+++....+||.|++++..+. .|.+.+|.++.+.+|.... |+. .+ T Consensus 122 --~lT~~~f~~~v~d~ll~Gnay~~~~rn~---------------~G~~~~L~pl~~~~vr~~~----d~~-------~~ 173 (376) T protein:vir:10 122 --WLSRHAFERWALDFLTFGNGYLERRRNM---------------VGGTLRLEPALAKYVRRKA----DFN-------GF 173 (376) T ss_pred --CCCHHHHHHHHHHHHhcCCeEEEEEECC---------------CCCEEEEEEeCCcceEEEe----eCC-------eE Confidence 0012224555555567899998875432 2445567777777765421 111 12 Q ss_pred eEEe--e--eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce--eeecchhhh Q lcl|NC_019527. 235 WWVL--G--REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF--LKTNMAQVL 308 (516) Q Consensus 235 y~v~--g--~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v--~k~~~~~~l 308 (516) |++. + ..+.++.|||+.... +....+|+|.++.+...+..-..+......+..+....- +.+. ...+ T Consensus 174 ~~~~~~~~~~~~~~~eViHir~~~------~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa~pggIl~~~-d~~l 246 (376) T protein:vir:10 174 VYVNGWQERHEFEPDSVFQLVRPD------INQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMT-DAAQ 246 (376) T ss_pred EEEEcCCeEEEEccccEEEecCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec-CCCC Confidence 3332 1 246678899986532 234568999999999999887777777777777765432 2221 1234 Q ss_pred cCccHHHHHHHHHHHHHhcCCcceEEEecC---CcceeEEecccCC----HHHHHHHHHHHHHhhhcCCceeeecccccc Q lcl|NC_019527. 309 NGGEGGDVFDRVEMYVNMQSNLGLAVMDFD---SEDIVQVNTPLSG----LADLQSQSQEHMCSVSKIPAIKLTGISPSG 381 (516) Q Consensus 309 ~~~~~~~l~~r~~~~~~~~sn~g~~~id~~---~e~~e~~~~~lsg----l~d~~~~~~~~iaaas~IP~t~L~G~sp~G 381 (516) +.++.+.+.+.++.. .+..|.+.+++... .+.++...++.+. +-++.....+.||++.+||- .|+|+.+.+ T Consensus 247 ~~e~~~~lr~~~~~~-~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp-~llGi~~~~ 324 (376) T protein:vir:10 247 KQDDVDNMRDALKNA-KGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPP-QLLGIVPSN 324 (376) T ss_pred CHHHHHHHHHHHHHh-cCccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCH-HHhcccCCC Confidence 544445566666542 34556555544321 2334444444332 23345566788999999997 577887643 Q ss_pred c--cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHH Q lcl|NC_019527. 382 L--NASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEES 444 (516) Q Consensus 382 l--natge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkA 444 (516) - .++-|...+.| -++.|.|.++++.++. . . +..+ .|+|++-.-+.-.++| T Consensus 325 t~~~sn~eq~~~~f-------~~~~L~Pl~~~ieeln-~-~---L~~~-~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 325 SGGFGTPDTAARVF-------GRNEIRPLQARFAELN-D-W---LGEE-VVRFDDYEIPPAPVAA 376 (376) T ss_pred CCCcccHHHHHHHH-------HHHHHHHHHHHHHHHH-h-h---cccc-ccccChhHhhcccccC Confidence 2 13333333433 3355778777776532 1 1 1222 2666653333333333 No 190 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=99.08 E-value=5.1e-10 Score=71.54 Aligned_cols=416 Identities=11% Similarity=0.064 Sum_probs=191.9 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchh--cccccccchhhhcccccCCcccccccC--cccHHHHHHHH-hCchhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAV--AMDSLCGPTYQFLNSAAGGLYAADIQP--FPGYQNLAALA-TRPEYRA 116 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~--a~ds~~~~~~~~~~~~~~~~~~~~~~~--f~gy~ll~~y~-~~~i~r~ 116 (516) |.-.|.. .|.-|.. ..... ..-..+ .++.-. ........++.|.+.-.... ..+-. ..+ .+.+++. T Consensus 1 ~~~~~~~---~~~~~~~-~~~~~-~~i~~~i~~~~~~~-~r~~~~~~yy~g~~~i~~~~~~~~~~~---~~ki~~n~~~~ 71 (453) T protein:vir:73 1 MNLKPIK---LMTYSRD-EEITD-KVVNDFMKKHQEEV-ERYEYLGNMYKGIMEISSQKAKDSWKP---DNRLTNNFAKY 71 (453) T ss_pred Cccccce---eeecccc-ccCCH-HHHHHHHHHHHHHH-HHHHHHHHHhccccchhcCCCCCccCc---cceeecchHHH Confidence 2111111 1111110 00000 000011 111100 11111112222221110000 01111 112 3578999 Q ss_pred hhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccc Q lcl|NC_019527. 117 FASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPR 196 (516) Q Consensus 117 iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~ 196 (516) ||+..+.-++.+++++++.++.. .+.|+..++.-++...+.++.+....||.|++++..+... . T Consensus 72 ivd~~~~~l~g~~~~~~~~d~~~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~--~------- 135 (453) T protein:vir:73 72 IVDTFVGYFNGIPIKKTHDDKSV-------LEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNEST--E------- 135 (453) T ss_pred HHHHhhhhhcccCceeecCChHH-------HHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCC--c------- Confidence 99999999999999998765432 2456667777789999999999999999999888764321 0 Q ss_pred cccccceeeEEeecceeeccccccccccccccccCcceeEE--eee---E-eccceEEEecCC---------------cc Q lcl|NC_019527. 197 TIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV--LGR---E-MHASRLLTIITR---------------PL 255 (516) Q Consensus 197 ~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v--~g~---~-iH~SRli~~~~~---------------~~ 255 (516) + .+.+++|.++.|..-...+. .+.|.. .|.. .+. . +-+.++.+|.+. .+ T Consensus 136 ------~-~i~~~~p~~~~~v~dd~~~~-~~~~~i--~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~v 205 (453) T protein:vir:73 136 ------S-EVIYCSPLNVFMVYDDSIKQ-KPLFAV--YYGFDEEGNLSGTVYTLLETISITGKAGEVKFGESTYNVYSDL 205 (453) T ss_pred ------e-EEEEEcccceEEEEeCCCCc-eeEEEE--EEEEecCceEEEEEEeCCeEEEEEecCCceEEccceeccCCce Confidence 1 12333443333321100000 000000 0000 000 0 011122222111 01 Q ss_pred hhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEE Q lcl|NC_019527. 256 PDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVM 335 (516) Q Consensus 256 p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~i 335 (516) |-. .-.++-+|.|.++.+.+.+.+++.+....+..+..++...+...... +.......+. +...+.......+.... T Consensus 206 Pvv-~~~n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~-~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 282 (453) T protein:vir:73 206 PIV-EYNFNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAE-VDEEDAKNIK-DNRLINFFDKNSNGQGT 282 (453) T ss_pred eEE-EecCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-CCchhhhccc-ccccccccccccccccc Confidence 111 11234579999999999999999999888887777776655442211 1111111111 11111111000001111 Q ss_pred ecCCcceeE--EecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHH--HHHHHHHHHHHHH Q lcl|NC_019527. 336 DFDSEDIVQ--VNTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDIS--SVQQSYYFSPLDT 411 (516) Q Consensus 336 d~~~e~~e~--~~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~--~~Qe~~l~p~l~~ 411 (516) ...+.+++. ...+.+++...++.+.+.|...+++|-.-. +. .| |+||+.=...|...+. +.++..++..+++ T Consensus 283 ~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~--~g-n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~ 358 (453) T protein:vir:73 283 NAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANISD-EN--FG-NSSGVALAYKLQAMSNLALSFQRKFQSALNR 358 (453) T ss_pred cccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcccCc-cc--cc-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122233444 445667888899999999999999996322 21 12 4566533233332222 3334567788888 Q ss_pred HHHHHHHHh--CCCc--CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh Q lcl|NC_019527. 412 MLKVIQLSK--WGEI--DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL 487 (516) Q Consensus 412 l~~~l~~s~--~g~~--~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~ 487 (516) ++++++.-. .|.. ..++++.|++-...++++.|++..+ +. |+++.+.+.+.+.. .+..+.+. T Consensus 359 ~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k-------~~--giis~et~~~~~~~-----~~d~~~E~ 424 (453) T protein:vir:73 359 RYSLWSSLSTNASNKDAWKDIEYTFTRNEPKDIKEQAETANI-------LK--GITSEETALSVISV-----IPDVQAEM 424 (453) T ss_pred HHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHH-------Hh--ccCcHHHHHHhCCC-----CCCHHHHH Confidence 888775432 2222 2478999999999999998876544 22 77887766655411 11111111 Q ss_pred hccccccchh---cCCCCCCCCCCCCCCC Q lcl|NC_019527. 488 EIVQPEMFDD---DGADPYMPDPDVLPGE 513 (516) Q Consensus 488 e~~~~e~~~~---e~~~~~~~~~~~~~~~ 513 (516) +....|..+. +........+....+. T Consensus 425 ~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 453 (453) T protein:vir:73 425 EKIKKKKLLQLSLTRTSNLVRMKQMRGNL 453 (453) T ss_pred HHHHHHHHHHHHHHHhccCCcchhhhcCC Confidence 1111111110 0000001111111112 No 191 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=99.07 E-value=1.3e-09 Score=69.39 Aligned_cols=415 Identities=11% Similarity=0.056 Sum_probs=182.9 Q ss_pred cCCCc--cchhccc---ccc---cchhhhcccccCCcccccccC-cccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeee Q lcl|NC_019527. 63 VPAGT--TPAVAMD---SLC---GPTYQFLNSAAGGLYAADIQP-FPGYQNLAALATRPEYRAFASTLSTELTREGIEIT 133 (516) Q Consensus 63 ~~~~~--~~~~a~d---s~~---~~~~~~~~~~~~~~~~~~~~~-f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~ 133 (516) +++.. .+.-..+ ... .........++-|-+.-...+ ...-.+-.....+.++++||+..++-+.=+|+.+. T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~~Gf~~~ 80 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQNELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYVDAIAERQELEGFRIP 80 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHHHHHHHhhhccceecc Confidence 11111 0000000 000 000111111222211100000 00111111223567889999999987777888775 Q ss_pred eccccch--hhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecc Q lcl|NC_019527. 134 SKDRTKA--KEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEP 211 (516) Q Consensus 134 ~~~~~~~--~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~ 211 (516) ....... ....+....+...|++-++.....++.+...+||.|++++..+..... .. + ..+ ...|++++| T Consensus 81 ~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~--~~-~----~~~-~~~i~~~~p 152 (488) T protein:vir:23 81 SANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVD--FD-V----DPE-VPLIRVEPP 152 (488) T ss_pred CCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccc--cC-C----CCC-cceEEEecc Confidence 4321110 001122345677788888999999999999999999988765321110 00 0 000 112445555 Q ss_pred eeeccccccccccccccc----------cCcce---------eEEe---ee------Eeccce---EEEecCCcchhhhh Q lcl|NC_019527. 212 MWTSPSAYNALDPTAPDF----------YKPST---------WWVL---GR------EMHASR---LLTIITRPLPDMLK 260 (516) Q Consensus 212 ~~v~p~~~~~~dp~s~~y----------g~P~~---------y~v~---g~------~iH~SR---li~~~~~~~p~~~k 260 (516) ..+.+..-.. .. .+.| +.... |++. |. .-|+=- |+.|.++ . T Consensus 153 ~~~~~~~d~~-~~-~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~------~ 224 (488) T protein:vir:23 153 TALYAEVDPR-TR-KVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPVIPISNR------T 224 (488) T ss_pred ceeEEEEecC-CC-ceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcceEEeccc------c Confidence 5544321000 00 0111 11100 0000 00 001110 1112111 1 Q ss_pred hccCCCCchHHHH-HHHHHHHHHHHHHHHHHHHHHhCCceeee---cchhhhcCccHHHHHHHHHHHHHhcCCcceEEEe Q lcl|NC_019527. 261 PAYNFSGISMSQL-AQPYVENWLRTRQSVSDLVDKFSRTFLKT---NMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMD 336 (516) Q Consensus 261 ~~~~~~G~S~le~-~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~---~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id 336 (516) .....+|.|.++. +.+.+.+++++.......+.-++...+.+ +... ....++ .....+....+ .++++. T Consensus 225 ~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~-~~~~~~----~~~~~~~~~~~--~v~~~~ 297 (488) T protein:vir:23 225 RLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEE-LGINAE----TGQRMFDAYMA--RILAFE 297 (488) T ss_pred ccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCccc-cccccc----ccchhhhhhhh--hhccCC Confidence 1234579998864 55656667777766555544444332221 1111 100110 00111111111 122222 Q ss_pred cCCcceeEEec---ccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHH--HHHHHHHHHHHHH Q lcl|NC_019527. 337 FDSEDIVQVNT---PLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDIS--SVQQSYYFSPLDT 411 (516) Q Consensus 337 ~~~e~~e~~~~---~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~--~~Qe~~l~p~l~~ 411 (516) ++++.+..+. ++.+..+.+.....+|++.+++|..-| |.+..+ ++||+.=...+...+. ..++..+...|.+ T Consensus 298 -~g~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~-g~~~~n-~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~ 374 (488) T protein:vir:23 298 -GGEGAHAEQFSAAELRNFVDALDALDRKAASYSGLPPQYL-SSSSDN-PASAEAIKAAESRLVKKVERKNKIFGGAWEQ 374 (488) T ss_pred -CCCCceeEecCCCChHHHHHHHHHHHHHHhcccCCCHHHh-ccccCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2333333333 445566777777889999999998666 433222 2566543333332222 3334567888999 Q ss_pred HHHHHHHHhCCC-cC---CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcC--CCCHHHHHHHHHhhhccCCCCCCh Q lcl|NC_019527. 412 MLKVIQLSKWGE-ID---DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNS--VIDPSEARQQLSDDPDSGWDNIDG 485 (516) Q Consensus 412 l~~~l~~s~~g~-~~---~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g--vi~~~e~r~~l~~~~~~~~~~~d~ 485 (516) ++.+++.-..|. .+ .++++.|.+-...|..+.|+...| ++++| +++.+.+++.|. |..-+. T Consensus 375 ~~~l~~~~~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~k-------l~~~g~~~~s~et~~~~l~------~~~d~~ 441 (488) T protein:vir:23 375 AMRLAYKMVKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAK-------LFANGAGLIPRERGWVDMG------YTIVER 441 (488) T ss_pred HHHHHHHHhcCCCcchhhccceEEecCCCCCCHHHHHHHHHH-------HHhcccccCCHHHHHHhCC------CCchHH Confidence 999887544332 22 368899999999999998776544 45544 667766666552 211110 Q ss_pred -hhhcc-ccccc-----------hh-cCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 486 -DLEIV-QPEMF-----------DD-DGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 486 -~~e~~-~~e~~-----------~~-e~~~~~~~~~~~~~~~e~t 516 (516) .++.. +.+.. .. +...++..+..+.+.+|.. T Consensus 442 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 486 (488) T protein:vir:23 442 EQMRQWLEQDQKQGLGLIGSLYGASTPEGKPGEAPVGEPPAPEPD 486 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCcccCCCCCCCCCCCCCCC Confidence 00000 00000 00 0000111112223333333 No 192 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=99.05 E-value=5e-10 Score=71.58 Aligned_cols=442 Identities=15% Similarity=0.132 Sum_probs=193.5 Q ss_pred hhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccc-cccchhhhcccccCCcccccccCcccHHHHHHHHh Q lcl|NC_019527. 32 LAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDS-LCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALAT 110 (516) Q Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds-~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~ 110 (516) |.+...+-+-+.+-...-..+ +. +-+ ..++.++++. +......+-.-+.+.++...+..-.|...-..+.. T Consensus 1 m~~~~~~k~~~~k~~~~~~~~---~~---~~i--~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~s 72 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTS---NL---NSI--LEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNH 72 (522) T ss_pred CchHHHHHHHHHHHHHHhhcc---cc---hhc--cccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhccccee Confidence 222211111111000000000 00 000 0011111111 11111111111111111110111112222223344 Q ss_pred CchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCC---- Q lcl|NC_019527. 111 RPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGAD---- 186 (516) Q Consensus 111 ~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~---- 186 (516) -.+++.||+..|+-++.+...|+..++.. -+.|++.++.-++...+.+++..+...||+++-+.++++. T Consensus 73 lnl~~~i~~~~A~lv~~e~~~i~v~d~~~-------~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~~~~i~ 145 (522) T protein:vir:47 73 LPIARTASKKIASLVYNEQATITTKNEIL-------QKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYIDGDKVRVA 145 (522) T ss_pred cchHHHHHHHHhhhhcCCcceeecCChHH-------HHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEcCCceEEE Confidence 58999999999999999999888765322 2467778888889999999999999999888777776542 Q ss_pred ---ccc--Ccccccccccccce------------eeEEeecce-eeccccccccccccccccCcceeEEe---------- Q lcl|NC_019527. 187 ---VSV--PLILDPRTIKKGSL------------TGFSNIEPM-WTSPSAYNALDPTAPDFYKPSTWWVL---------- 238 (516) Q Consensus 187 ---~~~--Pl~ld~~~I~~g~l------------~~l~v~d~~-~v~p~~~~~~dp~s~~yg~P~~y~v~---------- 238 (516) .+. |+..+...+..+.+ ++.+.+..+ |+.. ....+...-. +. .|.|. T Consensus 146 ~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~--~~~~~~~~~~-~~--~~~I~n~ly~~~~~~ 220 (522) T protein:vir:47 146 FIQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTA--DGQETGSTND-KK--YYRITNELYRSDVND 220 (522) T ss_pred EEcCCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeeccc--cccccccccc-CC--ceEEEEEEeecCCCc Confidence 122 33222222211111 011111111 0000 0000000000 00 12221 Q ss_pred --eeEe-----------cc-------ceEE-EecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019527. 239 --GREM-----------HA-------SRLL-TIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSR 297 (516) Q Consensus 239 --g~~i-----------H~-------SRli-~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~ 297 (516) |.+| ++ +|.+ .+.-.+.|.- ......+|+|++..+.+.+...+.+......=+..... T Consensus 221 ~lG~~v~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~-~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~ 299 (522) T protein:vir:47 221 VLGQRVNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNN-KDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQR 299 (522) T ss_pred ccCccccccccccccCCCCceEeCCCCcceEEEecCCcccc-cccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccc Confidence 1111 11 1111 0111111111 11134679999999999999998877665544443333 Q ss_pred ceeee-cchhhhcCc-cHH-HHHHHHHHHH-HhcCCcceEEEecCCcceeEEeccc--CCHHHHHHHHHHHHHhhhcCCc Q lcl|NC_019527. 298 TFLKT-NMAQVLNGG-EGG-DVFDRVEMYV-NMQSNLGLAVMDFDSEDIVQVNTPL--SGLADLQSQSQEHMCSVSKIPA 371 (516) Q Consensus 298 ~v~k~-~~~~~l~~~-~~~-~l~~r~~~~~-~~~sn~g~~~id~~~e~~e~~~~~l--sgl~d~~~~~~~~iaaas~IP~ 371 (516) .++-- .+......+ ++. .....+..-. .++. .+.. ++++..++.++..+ ..+...+..+.+.|+-.+++.. T Consensus 300 ~i~v~~~~l~~~~~~~~g~~~~~~~fd~~~~~f~~-~~~~--~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~ 376 (522) T protein:vir:47 300 RVIVPEHLTQRQYQRPDGTIDFRPRFDVEQNVYMQ-IGGS--SMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSS 376 (522) T ss_pred eeecchHHhccCCCCCCcccccccccCcccceEee-cCCC--CCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCc Confidence 33221 111111011 111 0000111000 0111 0110 12224466665544 3456677777777888888765 Q ss_pred eeeeccccccccccchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----C-CCc--CCcceEEeCCCCCCCH Q lcl|NC_019527. 372 IKLTGISPSGLNASSEG---EIRSFYDDISSVQQSYYFSPLDTMLKVIQLSK-----W-GEI--DDAITFKFKSLWQTSA 440 (516) Q Consensus 372 t~L~G~sp~Glnatge~---D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~-----~-g~~--~~d~~~~f~pL~~~se 440 (516) . -||...+|. .|+.+ ..+.-|.+++.+|. .++..|++|+..|+... + |.+ +.+++|.|++-...+. T Consensus 377 ~-tf~~~~~~~-kTAtEi~s~~~~~~~t~~~~~~-~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~ 453 (522) T protein:vir:47 377 G-MFTFDGQGM-KTATEIVSENSDTYQMRSSIVA-LVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDR 453 (522) T ss_pred c-ccCcccccc-ccHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCH Confidence 4 334444444 23332 23445677777774 57888888887776432 1 222 2468899999888887 Q ss_pred HHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhh-hccccccchh------cCCCC-CCCCC-CCCC Q lcl|NC_019527. 441 KEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDL-EIVQPEMFDD------DGADP-YMPDP-DVLP 511 (516) Q Consensus 441 kEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~-e~~~~e~~~~------e~~~~-~~~~~-~~~~ 511 (516) .++++.. .+++.+|+++..+++..+ .+++++. +..-+...++ ++.+. ++.++ .... T Consensus 454 ~~~~~~~-------~~~v~aG~~s~e~~i~~~--------~g~~eeea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 518 (522) T protein:vir:47 454 HAELDYW-------AKMVAAGFSTKKRAIGKT--------LNISGVEAEKELNAINSELLPMNDAELAIYGMHDQNEEKA 518 (522) T ss_pred HHHHHHH-------HHHHhcCCCCHHHHHHhc--------CCCChHHHHHHHHHHHHhhccCCCCCCCCCCCCCcccccC Confidence 6655444 557889999998877653 2232221 1000111000 00111 11111 2233 Q ss_pred CCCC Q lcl|NC_019527. 512 GEEG 515 (516) Q Consensus 512 ~~e~ 515 (516) +++| T Consensus 519 d~~~ 522 (522) T protein:vir:47 519 DDKG 522 (522) T ss_pred CCCC Confidence 3333 No 193 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=99.05 E-value=7.7e-10 Score=70.55 Aligned_cols=436 Identities=13% Similarity=0.149 Sum_probs=189.5 Q ss_pred hHHHHh-HHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHH---h Q lcl|NC_019527. 35 RRAVMK-SMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALA---T 110 (516) Q Consensus 35 ~~~~~~-~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~---~ 110 (516) -.+|.. +.+++......-+.||..-.|+ ..++ + +.+++.....++ ......++|...|+ . T Consensus 1 m~~lfgf~i~~~~~~~~~S~vpp~~~~~~------~~i~--~---g~~g~~v~~~g~-----~~~~n~~eLI~~YR~ma~ 64 (564) T protein:vir:10 1 MSQLFGFLINEKEGQKGQSPVPPNDEASV------STVA--G---GYFGTYVDTSGG-----QNSRNEYELIRRYRDMSL 64 (564) T ss_pred CcchhcceeeeeccCCCCCcccCCcCCCh------hhhh--c---cccceeeecccc-----cchhhHHHHHHHHHHHhh Confidence 011110 1111111111223333221111 0111 0 011111111111 11235678888886 6 Q ss_pred CchhhhhhhhhhHHHhh-----CCCeeeeccccchhhhHHH-HHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecC Q lcl|NC_019527. 111 RPEYRAFASTLSTELTR-----EGIEITSKDRTKAKEMASK-IKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKG 184 (516) Q Consensus 111 ~~i~r~iVd~~aed~~r-----~~~~i~~~~~~~~~~~~~~-i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~ 184 (516) ++++..+|+.+++||+- .-+++...+.+-.+...++ ..+++..++-|++...-.+.+|.--+.|.-+.-..|+. T Consensus 65 ~pEVd~Av~eIVneaIv~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~ 144 (564) T protein:vir:10 65 HPEVDSAIDEIVNEFVVNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDL 144 (564) T ss_pred ccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeC Confidence 99999999999999873 2333333322222211111 13333344444666666666665555555444444554 Q ss_pred CCcccCcccccccccccceeeEEeecceeecccccccccc--c----------cccccC-cceeEEee------------ Q lcl|NC_019527. 185 ADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDP--T----------APDFYK-PSTWWVLG------------ 239 (516) Q Consensus 185 ~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp--~----------s~~yg~-P~~y~v~g------------ 239 (516) .++. +| |+.|+.+||..+........++ . .-+|+. +++|.+++ T Consensus 145 ~~pk-----------~G-I~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~ 212 (564) T protein:vir:10 145 DNPK-----------KG-ILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTG 212 (564) T ss_pred CChh-----------hh-hhhhhhhcccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCccccccc Confidence 4332 22 4455556665544432211111 0 112222 33443331 Q ss_pred ---------eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhC----Cceeeecchh Q lcl|NC_019527. 240 ---------REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFS----RTFLKTNMAQ 306 (516) Q Consensus 240 ---------~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~----~~v~k~~~~~ 306 (516) .+||.+-+.+.+.. +.......=+|.|+.+...+.+.-....+ -++++.+ ..|+-+|+.+ T Consensus 213 ~~~~~~~~~ikI~~daI~y~hSG-----L~d~~~~~i~gyLhkAIKp~NQLkmlEDA--lVIYRitRAPeRRvFYIDVGn 285 (564) T protein:vir:10 213 SMDWSNQEGIKIASDAIAQSTSG-----LMDLNKKMTLSFLHKAIKSLNQLRMIEDS--LVIYRLSRAPERRIFYIDVGN 285 (564) T ss_pred ccccccccceeechhhcceeccc-----ceeCCCCceeccchhhhHhHHhhHHHHhh--HHHHhhhccccceEEEEecCC Confidence 24444443333221 11223333456777776666655443332 3444443 2445455443 Q ss_pred hhcCccHHHHHHHHHHHHHhcCCcceEEEecC--------------------------CcceeEEe--cccCCHHHHHHH Q lcl|NC_019527. 307 VLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFD--------------------------SEDIVQVN--TPLSGLADLQSQ 358 (516) Q Consensus 307 ~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~--------------------------~e~~e~~~--~~lsgl~d~~~~ 358 (516) +-.. ..++..+ ..+++++ +-++-|+. +-+++++. -+|+.++|+ .+ T Consensus 286 LPk~-KAeqYlr--~iM~k~K---NklVYDa~TGevrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV-~Y 358 (564) T protein:vir:10 286 LPKV-KAEQYLR--DVMSRYR---NKLVYDGQTGEIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKDV-EY 358 (564) T ss_pred CCch-hHHHHHH--HHHHhcC---ceEEEeccCceecccchhhhhHhhhcccccCCCcccceeeccccCCcchHHHH-HH Confidence 3211 1111111 1111111 01111111 11344442 245666665 58 Q ss_pred HHHHHHhhhcCCceeeecccccccc--ccch--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH------HhCCCcCCc Q lcl|NC_019527. 359 SQEHMCSVSKIPAIKLTGISPSGLN--ASSE--GEIRSFYDDISSVQQSYYFSPLDTMLKVI-QL------SKWGEIDDA 427 (516) Q Consensus 359 ~~~~iaaas~IP~t~L~G~sp~Gln--atge--~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l-~~------s~~g~~~~d 427 (516) |+..+-.+.++|.++|-.+ .+|+| .++| -|.-.|..+|.+.|..+ ..++..+++.= .+ ..|-.+-++ T Consensus 359 F~kKLY~aLnVP~SRl~~e-~~~f~~Gr~~EItRDEiKF~KFI~RLR~rF-s~lF~~~Lk~qLiLKgiit~eeW~~i~~~ 436 (564) T protein:vir:10 359 FKKKLYNSLNLPPSRLTDD-NKAFNLGKSTEILRDELKFTKFIGRLRKRF-AQLFHDILKTQLILKGIITPEDWDDMEEH 436 (564) T ss_pred HHHHHHHHhCCCcccccCC-CceeecccccchhHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhhccCCCHHHHHHHhhc Confidence 9999999999999999654 34554 3332 14567999999998654 45555554421 11 123334457 Q ss_pred ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH--cCCCCHHHHHHHH------------Hh---hh-ccCCCCCChhhhc Q lcl|NC_019527. 428 ITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT--NSVIDPSEARQQL------------SD---DP-DSGWDNIDGDLEI 489 (516) Q Consensus 428 ~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~--~gvi~~~e~r~~l------------~~---~~-~~~~~~~d~~~e~ 489 (516) +.|+|..=.-.+|-..+|+...+.++++.+-. .-.+|.+-+++.+ .+ .. +..|. +++++. T Consensus 437 I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~--~P~e~~ 514 (564) T protein:vir:10 437 IQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAI--DPIQVN 514 (564) T ss_pred ceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCC--Cchhhh Confidence 88999888888888889999888888776521 1123444443322 11 11 11121 111111 Q ss_pred cc-----------cccchhcCCCCCCCCC-------CCCCCC---CCC Q lcl|NC_019527. 490 VQ-----------PEMFDDDGADPYMPDP-------DVLPGE---EGS 516 (516) Q Consensus 490 ~~-----------~e~~~~e~~~~~~~~~-------~~~~~~---e~t 516 (516) .- +++....+..++.+++ +..++. +.+ T Consensus 515 ~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 562 (564) T protein:vir:10 515 MLDDMEKQNQAFAPELQAAQDDLAAEREIKKLNSAPKPPPSQQSKSQS 562 (564) T ss_pred cCCCccCCCCcCCcchhhhccccccccChhhhccCCCCCCCCCCcCcC Confidence 00 1111111111111111 111111 111 No 194 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=99.03 E-value=4.9e-09 Score=66.15 Aligned_cols=439 Identities=10% Similarity=0.095 Sum_probs=202.2 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) ||.||-|- ++.. .+.+.......++||..-+|... +. +..+. T Consensus 6 lf~f~~~~-----------------d~~~------------~~~~~~~~~~s~~~p~~~dGa~~------i~--~~~~~- 47 (516) T protein:vir:10 6 LFKFWDRV-----------------DQNE------------YDERLKLGHESIATPKKDDGATE------IE--TREGE- 47 (516) T ss_pred hcccccch-----------------hhhH------------HhhhhcCCcCcccCCCCCCCcee------ee--cCCCc- Confidence 55553321 0000 11111222334566655444310 10 00000 Q ss_pred hhhcccccCCcccccccCcccHHHHHHHH---hCchhhhhhhhhhHHHhh-----CCCeeeeccccchhhhHHHHHHHHH Q lcl|NC_019527. 81 YQFLNSAAGGLYAADIQPFPGYQNLAALA---TRPEYRAFASTLSTELTR-----EGIEITSKDRTKAKEMASKIKELEE 152 (516) Q Consensus 81 ~~~~~~~~~~~~~~~~~~f~gy~ll~~y~---~~~i~r~iVd~~aed~~r-----~~~~i~~~~~~~~~~~~~~i~~i~~ 152 (516) ..++++++.+...+...-..++|...|+ .++++..+|+.+++||+- +.+++...+-+-.+ ..-++|.+ T Consensus 48 -~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~---~ik~kI~e 123 (516) T protein:vir:10 48 -ATYNAVMQQFFGIDNNISGTKDLINTYRQLINNPEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGS---NVKEKILE 123 (516) T ss_pred -ccccceeeeeeccccccchHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecccCcch---HHHHHHHH Confidence 1111222111112222225678888886 699999999999999973 23333332222212 11233444 Q ss_pred HHH----hcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccc-cccccc Q lcl|NC_019527. 153 ACE----YYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNA-LDPTAP 227 (516) Q Consensus 153 ~~~----~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~-~dp~s~ 227 (516) +++ -|++...-.+.+|.--+.|.-+.--.++ ++. .+|+.++.+||..+....+.. .|..+. T Consensus 124 eF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid--~~k------------~GI~Elr~lDPr~i~~vR~i~~~~~~~~ 189 (516) T protein:vir:10 124 EFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMP--NPK------------KGIAELRRLDPRFMEYYREIVTSDIGGT 189 (516) T ss_pred HHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec--Ccc------------ccceeeeeeCCcceeeEeeecccccccc Confidence 444 4466666666666555555544332343 211 124445555555444322210 011000 Q ss_pred -------cc--cCc--ceeEEee--------eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 228 -------DF--YKP--STWWVLG--------REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSV 288 (516) Q Consensus 228 -------~y--g~P--~~y~v~g--------~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~ 288 (516) .| |.| ..|.+.| .+|+.+=+.+.+.. +.....+.=+|.|+.+...+.+.-....+ T Consensus 190 ~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSG-----L~d~~~~~i~syLhkAiKp~NQLkm~EDA- 263 (516) T protein:vir:10 190 TIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSG-----LMDCSDRGIIGYLHNAVKPANQLKLLEDA- 263 (516) T ss_pred hhhhhhhheeeeccCccccccccceeCCCcceeechhheeeeccc-----ceeCCCCceeeeehhhhHhHHhhHHHHhh- Confidence 00 000 1222333 34555543333221 11222222366777766666554433332 Q ss_pred HHHHHHhC----CceeeecchhhhcCccHHHHH-HHHHHHH---HhcCCcceE--------------EE---ecCCccee Q lcl|NC_019527. 289 SDLVDKFS----RTFLKTNMAQVLNGGEGGDVF-DRVEMYV---NMQSNLGLA--------------VM---DFDSEDIV 343 (516) Q Consensus 289 ~~Ll~~~~----~~v~k~~~~~~l~~~~~~~l~-~r~~~~~---~~~sn~g~~--------------~i---d~~~e~~e 343 (516) -++++.+ -.|+=+|+.++-.. ..++.. .-+..+. .+-.++|-+ +- ++.+-+++ T Consensus 264 -lVIYRitRAPeRRvFYIDvGnlPk~-KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 341 (516) T protein:vir:10 264 -MVIYRITRAPERRVFYIDVGNMNNR-KATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVS 341 (516) T ss_pred -HHHHhhhccccceEEEEecCCCCch-hHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCcccee Confidence 3444433 23444444433211 111111 1011000 001111110 00 01112344 Q ss_pred EEe--cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccc--cccchH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_019527. 344 QVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGL--NASSEG--EIRSFYDDISSVQQSYYFSPLDTMLKVI- 416 (516) Q Consensus 344 ~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gl--natge~--D~~~yyd~I~~~Qe~~l~p~l~~l~~~l- 416 (516) ++. -+|+.++|+ ..|+..+-.+.++|.++|-..+...+ +.++|= |.-.|..+|.+.|..+ ..++..+++.= T Consensus 342 TLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rF-s~lf~~~L~~qL 419 (516) T protein:vir:10 342 SLPGAQTMGDMDDV-RWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDF-EEIFLDPLKTNL 419 (516) T ss_pred eccccCCcChHHHH-HHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHhh Confidence 442 235556554 58999999999999999976654444 333332 5567999999998654 45555554421 Q ss_pred HH------HhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH--HcCCCCHHHHHHHHHhhhccCCCCCChhhh Q lcl|NC_019527. 417 QL------SKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYI--TNSVIDPSEARQQLSDDPDSGWDNIDGDLE 488 (516) Q Consensus 417 ~~------s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~--~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e 488 (516) .+ ..|-.+-+++.|+|..=.--+|-..+|+...+.++++.+- -...++.+-+++.+-... |+++. T Consensus 420 ilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~t-------Deei~ 492 (516) T protein:vir:10 420 IYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMT-------EEQIA 492 (516) T ss_pred hhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCC-------HhhHH Confidence 11 1233344578899998888889999999999999888874 345788888887753322 22222 Q ss_pred ccccccchhcCCCCCCCCCCCCCCC Q lcl|NC_019527. 489 IVQPEMFDDDGADPYMPDPDVLPGE 513 (516) Q Consensus 489 ~~~~e~~~~e~~~~~~~~~~~~~~~ 513 (516) ..+..+.+ |..++--.+|+++.+= T Consensus 493 ~e~k~I~~-E~~~~~~~~p~~~~~f 516 (516) T protein:vir:10 493 QEEKQIEQ-EAGIKRFQNPENEDDF 516 (516) T ss_pred HHHHHHHH-hhhCCCCCCCCccccC Confidence 22222221 2222211222222111 No 195 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=99.03 E-value=4.9e-09 Score=66.15 Aligned_cols=439 Identities=10% Similarity=0.095 Sum_probs=202.2 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) ||.||-|- ++.. .+.+.......++||..-+|... +. +..+. T Consensus 6 lf~f~~~~-----------------d~~~------------~~~~~~~~~~s~~~p~~~dGa~~------i~--~~~~~- 47 (516) T protein:vir:10 6 LFKFWDRV-----------------DQNE------------YDERLKLGHESIATPKKDDGATE------IE--TREGE- 47 (516) T ss_pred hcccccch-----------------hhhH------------HhhhhcCCcCcccCCCCCCCcee------ee--cCCCc- Confidence 55553321 0000 11111222334566655444310 10 00000 Q ss_pred hhhcccccCCcccccccCcccHHHHHHHH---hCchhhhhhhhhhHHHhh-----CCCeeeeccccchhhhHHHHHHHHH Q lcl|NC_019527. 81 YQFLNSAAGGLYAADIQPFPGYQNLAALA---TRPEYRAFASTLSTELTR-----EGIEITSKDRTKAKEMASKIKELEE 152 (516) Q Consensus 81 ~~~~~~~~~~~~~~~~~~f~gy~ll~~y~---~~~i~r~iVd~~aed~~r-----~~~~i~~~~~~~~~~~~~~i~~i~~ 152 (516) ..++++++.+...+...-..++|...|+ .++++..+|+.+++||+- +.+++...+-+-.+ ..-++|.+ T Consensus 48 -~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~---~ik~kI~e 123 (516) T protein:vir:10 48 -ATYNAVMQQFFGIDNNISGTKDLINTYRQLINNPEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGS---NVKEKILE 123 (516) T ss_pred -ccccceeeeeeccccccchHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecccCcch---HHHHHHHH Confidence 1111222111112222225678888886 699999999999999973 23333332222212 11233444 Q ss_pred HHH----hcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccc-cccccc Q lcl|NC_019527. 153 ACE----YYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNA-LDPTAP 227 (516) Q Consensus 153 ~~~----~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~-~dp~s~ 227 (516) +++ -|++...-.+.+|.--+.|.-+.--.++ ++. .+|+.++.+||..+....+.. .|..+. T Consensus 124 eF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid--~~k------------~GI~Elr~lDPr~i~~vR~i~~~~~~~~ 189 (516) T protein:vir:10 124 EFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMP--NPK------------KGIAELRRLDPRFMEYYREIVTSDIGGT 189 (516) T ss_pred HHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec--Ccc------------ccceeeeeeCCcceeeEeeecccccccc Confidence 444 4466666666666555555544332343 211 124445555555444322210 011000 Q ss_pred -------cc--cCc--ceeEEee--------eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 228 -------DF--YKP--STWWVLG--------REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSV 288 (516) Q Consensus 228 -------~y--g~P--~~y~v~g--------~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~ 288 (516) .| |.| ..|.+.| .+|+.+=+.+.+.. +.....+.=+|.|+.+...+.+.-....+ T Consensus 190 ~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSG-----L~d~~~~~i~syLhkAiKp~NQLkm~EDA- 263 (516) T protein:vir:10 190 TIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSG-----LMDCSDRGIIGYLHNAVKPANQLKLLEDA- 263 (516) T ss_pred hhhhhhhheeeeccCccccccccceeCCCcceeechhheeeeccc-----ceeCCCCceeeeehhhhHhHHhhHHHHhh- Confidence 00 000 1222333 34555543333221 11222222366777766666554433332 Q ss_pred HHHHHHhC----CceeeecchhhhcCccHHHHH-HHHHHHH---HhcCCcceE--------------EE---ecCCccee Q lcl|NC_019527. 289 SDLVDKFS----RTFLKTNMAQVLNGGEGGDVF-DRVEMYV---NMQSNLGLA--------------VM---DFDSEDIV 343 (516) Q Consensus 289 ~~Ll~~~~----~~v~k~~~~~~l~~~~~~~l~-~r~~~~~---~~~sn~g~~--------------~i---d~~~e~~e 343 (516) -++++.+ -.|+=+|+.++-.. ..++.. .-+..+. .+-.++|-+ +- ++.+-+++ T Consensus 264 -lVIYRitRAPeRRvFYIDvGnlPk~-KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 341 (516) T protein:vir:10 264 -MVIYRITRAPERRVFYIDVGNMNNR-KATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVS 341 (516) T ss_pred -HHHHhhhccccceEEEEecCCCCch-hHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCcccee Confidence 3444433 23444444433211 111111 1011000 001111110 00 01112344 Q ss_pred EEe--cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccc--cccchH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_019527. 344 QVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGL--NASSEG--EIRSFYDDISSVQQSYYFSPLDTMLKVI- 416 (516) Q Consensus 344 ~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gl--natge~--D~~~yyd~I~~~Qe~~l~p~l~~l~~~l- 416 (516) ++. -+|+.++|+ ..|+..+-.+.++|.++|-..+...+ +.++|= |.-.|..+|.+.|..+ ..++..+++.= T Consensus 342 TLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rF-s~lf~~~L~~qL 419 (516) T protein:vir:10 342 SLPGAQTMGDMDDV-RWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDF-EEIFLDPLKTNL 419 (516) T ss_pred eccccCCcChHHHH-HHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHhh Confidence 442 235556554 58999999999999999976654444 333332 5567999999998654 45555554421 Q ss_pred HH------HhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHH--HcCCCCHHHHHHHHHhhhccCCCCCChhhh Q lcl|NC_019527. 417 QL------SKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYI--TNSVIDPSEARQQLSDDPDSGWDNIDGDLE 488 (516) Q Consensus 417 ~~------s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~--~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e 488 (516) .+ ..|-.+-+++.|+|..=.--+|-..+|+...+.++++.+- -...++.+-+++.+-... |+++. T Consensus 420 ilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~t-------Deei~ 492 (516) T protein:vir:10 420 IYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMT-------EEQIA 492 (516) T ss_pred hhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCC-------HhhHH Confidence 11 1233344578899998888889999999999999888874 345788888887753322 22222 Q ss_pred ccccccchhcCCCCCCCCCCCCCCC Q lcl|NC_019527. 489 IVQPEMFDDDGADPYMPDPDVLPGE 513 (516) Q Consensus 489 ~~~~e~~~~e~~~~~~~~~~~~~~~ 513 (516) ..+..+.+ |..++--.+|+++.+= T Consensus 493 ~e~k~I~~-E~~~~~~~~p~~~~~f 516 (516) T protein:vir:10 493 QEEKQIEQ-EAGIKRFQNPENEDDF 516 (516) T ss_pred HHHHHHHH-hhhCCCCCCCCccccC Confidence 22222221 2222211222222111 No 196 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=99.01 E-value=9.4e-10 Score=70.08 Aligned_cols=335 Identities=10% Similarity=0.067 Sum_probs=160.8 Q ss_pred chhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccc-cccCCCCCCCccCCCccchhcccccccchhh Q lcl|NC_019527. 4 FDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAAT-KWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQ 82 (516) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~ 82 (516) -+||| ++... .....+ +..... ..+..+.+ -|--|++ | +|+. .... T Consensus 1 ~~~~~--~~~~~----------~~~~~~-----~~~~~~-~~~~~~~~~~~~~p~~---v----------~~~~--~~~~ 47 (351) T protein:vir:79 1 MSKRR--SRAPR----------TFAAAP-----NPSAGS-AAPARAEVFTFDDPTP---V----------MNRA--EILD 47 (351) T ss_pred CCCCC--CCCCC----------CCCCCC-----chhhhh-cccceeEEEEcCCcee---e----------cCcc--hhhh Confidence 22111 11000 000000 000000 00111100 1111210 1 1111 0011 Q ss_pred hcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHH Q lcl|NC_019527. 83 FLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGI 162 (516) Q Consensus 83 ~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~ 162 (516) +......+. +++ .++.-..|..+++.++....+|-.-+.+.++ ++.- + + +--+.. T Consensus 48 ~~~~~~~~~-~~~-pp~~~~~la~~~~~~~~h~~~l~~k~n~l~~-~~~P---n-p------------------~~t~~~ 102 (351) T protein:vir:79 48 YVECWSNGE-WFE-PPVSFAGLAKSFRASTHHSSALFFKANVLAS-TFRP---H-R------------------WLSRHA 102 (351) T ss_pred hhhhhhcCc-eec-CCCCHHHHHHHHhhhHhhhhhhhhhhhHHhh-cccC---C-C------------------CCCHHH Confidence 111112221 111 1122234677777787777777665553333 2210 0 0 001222 Q ss_pred HHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe--e- Q lcl|NC_019527. 163 IQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL--G- 239 (516) Q Consensus 163 l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~--g- 239 (516) |..++....+||.|++++..++ .|.+.+|.++.+.++..... .+ .+|++. + T Consensus 103 f~~~v~d~ll~Gnay~~~~r~~---------------~G~~~~L~~l~~~~v~~~~~--~~---------~~~~~~~~g~ 156 (351) T protein:vir:79 103 FERWALDFLTFGNGYLERRRNM---------------VGGTLRLEPALAKYVRRKAD--FS---------GFVYVNGWQE 156 (351) T ss_pred HHHHHHHHHhcCCeEEEEEECC---------------CCCEEEEEEeCCcceeeeec--CC---------eEEEEecCce Confidence 4444444567899998875532 23455677777777664221 01 123332 2 Q ss_pred -eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce--eeecchhhhcCccHHHH Q lcl|NC_019527. 240 -REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF--LKTNMAQVLNGGEGGDV 316 (516) Q Consensus 240 -~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v--~k~~~~~~l~~~~~~~l 316 (516) ..+.+..|||+.... +....+|+|.++.+...+..-..+......+..+....- +++. ...++.++.+.+ T Consensus 157 ~~~~~~~eIihir~~~------~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~il~~~-~~~ls~e~~~~l 229 (351) T protein:vir:79 157 RHEFEPDSVFQLVRPD------INQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMT-DAAQKQDDVDNM 229 (351) T ss_pred EEEEcCccEEEeCCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec-CCCCCHHHHHHH Confidence 246678999986432 234567999999999999888888877777877766532 2322 123555555667 Q ss_pred HHHHHHHHHhcCCcceEEEecC---CcceeEEecccCC----HHHHHHHHHHHHHhhhcCCceeeecccccccc--ccch Q lcl|NC_019527. 317 FDRVEMYVNMQSNLGLAVMDFD---SEDIVQVNTPLSG----LADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSE 387 (516) Q Consensus 317 ~~r~~~~~~~~sn~g~~~id~~---~e~~e~~~~~lsg----l~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge 387 (516) .+.++.. .+.+|.+.+++... .+.++...++.+. +-++.....+.||++.+||-. |+|+.+.+-. ++-| T Consensus 230 k~~~~~~-~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~-llGi~~~~t~~~~n~e 307 (351) T protein:vir:79 230 RDALKNA-KGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQ-LLGIVPSNSGGFGTPD 307 (351) T ss_pred HHHHHHh-cCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhcccCCCCCCcccHH Confidence 7777653 44566665554321 2334444444443 334556677889999999975 5587654321 3334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHH Q lcl|NC_019527. 388 GEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEES 444 (516) Q Consensus 388 ~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkA 444 (516) ...+.|+ ++.|.|.++++.++. . .+| .+ .++|++.--+....+| T Consensus 308 ~~~~~f~-------~~~l~Pl~~~ie~ln-~-~lg---~~-~~~F~~~~llr~d~~a 351 (351) T protein:vir:79 308 TAARVFG-------RNEIRPLQARFAELN-D-WLG---DE-VVTFDDYEIPPAPVAA 351 (351) T ss_pred HHHHHHH-------HHHHHHHHHHHHHHH-h-hcC---cc-eeeeChhhhccccccC Confidence 3444443 345677777665532 1 112 22 2677764333333333 No 197 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=99.00 E-value=6.2e-09 Score=65.59 Aligned_cols=436 Identities=10% Similarity=0.020 Sum_probs=193.6 Q ss_pred CCChhhhHHHHhHHhhcCCCccccccCCCCCCCc--cCCCccchhcccc-----cc-------cchhhhcccccCCcccc Q lcl|NC_019527. 29 ARKLAMRRAVMKSMERRASDAATKWAPPQLMPGV--VPAGTTPAVAMDS-----LC-------GPTYQFLNSAAGGLYAA 94 (516) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv--~~~~~~~~~a~ds-----~~-------~~~~~~~~~~~~~~~~~ 94 (516) --...-.+.-+...+.... .|.+- ..++ .+... ....++. +. .....-...++.|.+.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~----~~~~~--~n~~~~~~~~~-~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i 73 (511) T protein:vir:10 1 MLKVNEFETDTDLRGNINY----LFNDE--ANVVYTYDGTE-SDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKN 73 (511) T ss_pred Cccccchhhhhhhhhhhhh----hhhhh--hcCCccCchhh-hhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcc Confidence 0000000011111110000 01110 0010 00000 0011110 00 00011111222221110 Q ss_pred ccc-CcccHHHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhccc Q lcl|NC_019527. 95 DIQ-PFPGYQNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCF 172 (516) Q Consensus 95 ~~~-~f~gy~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rl 172 (516) -.. ........+..+ .+.+++.||+..+.-++.+++.+++.++.. .+.|...+++-++.....++.+...+ T Consensus 74 ~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~-------~~~l~~~~~~n~~~~~~~~~~~~~~i 146 (511) T protein:vir:10 74 LVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDV-------LEAIEAFNDLNDVESHNRSLGLDLSI 146 (511) T ss_pred ccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCchHH-------HHHHHHHHhhcCHHHHHHHHHHHHHh Confidence 000 000000000011 357789999999999999999998765432 24577777777899999999999999 Q ss_pred ceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe-----------eeE Q lcl|NC_019527. 173 FGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL-----------GRE 241 (516) Q Consensus 173 yG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~-----------g~~ 241 (516) ||.|++++.++... .+ .+.+++|.++.|..-+. ....+.++ ..+|.+. -.. T Consensus 147 ~G~ay~~vy~dedg---------------~~-~i~~~~p~~~~~vydd~-~~~~~~~~-vr~~~~~~~d~~~~~~~~~~~ 208 (511) T protein:vir:10 147 YGKAYEIMIRNQDD---------------ET-RLYKSDAMSTFVIYDNT-IERNSIAG-VRYLRTKPIDKTDEDEVFTVD 208 (511) T ss_pred cCeeEEEEEeCCCC---------------ce-EEEEEccceeEEEEcCC-CCCceEEE-EEEEEeeecccCccceEEEEE Confidence 99999888764321 01 13444555444432110 00011111 0111110 000 Q ss_pred -eccceEEEecCC--------------------cchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCcee Q lcl|NC_019527. 242 -MHASRLLTIITR--------------------PLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFL 300 (516) Q Consensus 242 -iH~SRli~~~~~--------------------~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~ 300 (516) +.+.++.+|... .+|-. .-.++-+|.|.++.+.+.+.+++.+....+..+..++..++ T Consensus 209 iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv-~f~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l 287 (511) T protein:vir:10 209 LFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPIT-EFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAML 287 (511) T ss_pred EEeCCcEEEEEecCCCcccccccccccccccCcceeEE-EecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCcee Confidence 111222222110 01111 01224579999999999999999998888888887777665 Q ss_pred eecchhhhcCccHHHHHH-HHHHHHHhcCCcceEEEecCCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeecc Q lcl|NC_019527. 301 KTNMAQVLNGGEGGDVFD-RVEMYVNMQSNLGLAVMDFDSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGI 377 (516) Q Consensus 301 k~~~~~~l~~~~~~~l~~-r~~~~~~~~sn~g~~~id~~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~ 377 (516) .................+ ++-.........+...-..++.+++.+ +.+.+++...++.+.+.|...+.+|-.-.-+ T Consensus 288 v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~- 366 (511) T protein:vir:10 288 LIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN- 366 (511) T ss_pred eeeccccCCchhhccchhccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc- Confidence 542211111111111111 000000000000000111122344444 4566789999999999999999999743322 Q ss_pred ccccccccchHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHh--CCCc--C---CcceEEeCCCCCCCHHHHHHHH Q lcl|NC_019527. 378 SPSGLNASSEGEIRSF---YDDISSVQQSYYFSPLDTMLKVIQLSK--WGEI--D---DAITFKFKSLWQTSAKEESEIR 447 (516) Q Consensus 378 sp~Glnatge~D~~~y---yd~I~~~Qe~~l~p~l~~l~~~l~~s~--~g~~--~---~d~~~~f~pL~~~sekEkAei~ 447 (516) .+| |.||..=...| ...+ ..++..++..|++++++++... .+.+ + .+++|.|++-...+.++.+++. T Consensus 367 -~~~-n~Sg~Al~~~~~~l~~k~-~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~ 443 (511) T protein:vir:10 367 -FSG-TQSGEAMKYKLFGLEQRT-KTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAY 443 (511) T ss_pred -ccc-cchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHH Confidence 122 45665422222 2223 3345677888999888775421 1221 2 2689999999999999988766 Q ss_pred HHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccc--------------hhcCCCCCCCCCCCCCCC Q lcl|NC_019527. 448 FNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMF--------------DDDGADPYMPDPDVLPGE 513 (516) Q Consensus 448 ~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~--------------~~e~~~~~~~~~~~~~~~ 513 (516) .+. .|+||.+.+.+.+.. .+....+++....|.. +..+.+.+..+.++.+.+ T Consensus 444 ~kl---------~G~iS~et~~~~l~~-----v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (511) T protein:vir:10 444 IDS---------GGKISQTTLMSLFSF-----FQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDK 509 (511) T ss_pred HHH---------hccCcHHHHHHhCCC-----CCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCcccCcccc Confidence 543 266776666555411 0000011111111000 000000111111111111 Q ss_pred CC Q lcl|NC_019527. 514 EG 515 (516) Q Consensus 514 e~ 515 (516) +. T Consensus 510 ~~ 511 (511) T protein:vir:10 510 KE 511 (511) T ss_pred cC Confidence 11 No 198 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=98.99 E-value=1.6e-09 Score=68.83 Aligned_cols=325 Identities=9% Similarity=0.026 Sum_probs=161.1 Q ss_pred hhhhHHHHhHHh--hcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccC-CcccccccCcccHHHHHHH Q lcl|NC_019527. 32 LAMRRAVMKSME--RRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAG-GLYAADIQPFPGYQNLAAL 108 (516) Q Consensus 32 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~-~~~~~~~~~f~gy~ll~~y 108 (516) |..+++..+... ..+... .-|--+.+.+ .+ ...+..-... ...+++ .+..-..|..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~f~~~~~~~------------~~-----~~~y~~~~~~~~~~~~e-pp~~~~~la~l~ 61 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPIND-RTFSLNEISA------------SP-----ALDYVGIGFDENYNCYL-PPVNRHALAKLP 61 (345) T ss_pred CCCCccccchhhcccCccee-EEeecCCccc------------cc-----chhhhhhhhcCCccccC-CCCCHHHHHHHh Confidence 111111111100 001111 0111111110 00 0111111110 011111 112223456666 Q ss_pred HhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcc Q lcl|NC_019527. 109 ATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVS 188 (516) Q Consensus 109 ~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~ 188 (516) +.++....+|..-....+ .++. +.. +--+..|..++....+||.|++++..++ T Consensus 62 ~~~~~h~~~i~~k~n~l~-~~~~-Pn~---------------------~lt~~~f~~~~~d~ll~Gnay~~~~rn~---- 114 (345) T protein:vir:37 62 HQNAQHGGILHSRANMVS-SLYE-GGK---------------------ALSRMDMRALCLNLIQFGDVGLLKVRNG---- 114 (345) T ss_pred hcccccccceeeechHHH-hhcc-CCC---------------------CCCHHHHHHHHHHHHhcCCeEEEEEEcC---- Confidence 666666666654433222 2221 100 0012224445445568899998875432 Q ss_pred cCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe--e--eEeccceEEEecCCcchhhhhhccC Q lcl|NC_019527. 189 VPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL--G--REMHASRLLTIITRPLPDMLKPAYN 264 (516) Q Consensus 189 ~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~--g--~~iH~SRli~~~~~~~p~~~k~~~~ 264 (516) .|.+..|.++.+.+|.... | ...|+....+.+. | ..+.+..|||+.+.. +... T Consensus 115 -----------~G~~~~L~pl~~~~vr~~~----d--~~~~~~~~~~~~~~~g~~~~~~~~dVihir~~~------~~~~ 171 (345) T protein:vir:37 115 -----------FGQVVRLVPLSSLYLRVRK----D--GGYSYLMKKSLYDTAQEIYRYDAKDIIFIKLYD------PMQQ 171 (345) T ss_pred -----------CCcEEEEEEEcCceeEEEE----e--CCeeEEEEEeEecCCceEEEEccccEEEecCCC------CCCC Confidence 2345567777777665321 2 1223333333322 2 356778999986532 2345 Q ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce--eeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEe-c--CC Q lcl|NC_019527. 265 FSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF--LKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMD-F--DS 339 (516) Q Consensus 265 ~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v--~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id-~--~~ 339 (516) .+|+|.+..+...+..-..+......+..+....- +.+. ...++.++.+.+.+.++.. ....|.+.+++. . .. T Consensus 172 ~~Gls~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~-d~~l~~e~~~~lk~~~~~~-~g~~n~~~~~i~~p~g~~ 249 (345) T protein:vir:37 172 VYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYST-DPDLTEEMEEEIARKISES-KGVGNFRSMFVNIANGHP 249 (345) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEec-CCCCCHHHHHHHHHHHHHh-cCcccccceEEEcCCCcc Confidence 67999999999998887777777777777655432 2321 1234444555666666553 344555554442 1 12 Q ss_pred cceeEEecccCCH----HHHHHHHHHHHHhhhcCCceeeecccccccc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 340 EDIVQVNTPLSGL----ADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSEGEIRSFYDDISSVQQSYYFSPLDTML 413 (516) Q Consensus 340 e~~e~~~~~lsgl----~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge~D~~~yyd~I~~~Qe~~l~p~l~~l~ 413 (516) +.++...++.+.- -++.....+.||++.+||-. |+|..+.+-. ++-|...+.|+ ++.|.|.+++|. T Consensus 250 ~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~-llGi~~~~~~~~~~~e~~~~~f~-------~~~l~P~~~~ie 321 (345) T protein:vir:37 250 DGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAG-LSGIIPTNTGGLGDPLKYREVYH-------YDEVMPLQEIIA 321 (345) T ss_pred cceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhCccCCCCCCcccHHHHHHHHH-------HHHHHHHHHHHH Confidence 3344443333322 23444677899999999975 5687654322 22333334443 456889888888 Q ss_pred HHHHHHhCCCcCCcceEEeCC--CCC Q lcl|NC_019527. 414 KVIQLSKWGEIDDAITFKFKS--LWQ 437 (516) Q Consensus 414 ~~l~~s~~g~~~~d~~~~f~p--L~~ 437 (516) +.+-+ +.+++.+..+.|++ |.. T Consensus 322 ~~ln~--~~~~~~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 322 ETINQ--DPEIKNLLKIKFREQNFAK 345 (345) T ss_pred HHhhh--hccCCCcceEEecchhhcC Confidence 87743 33566778888874 433 No 199 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=98.99 E-value=7.2e-09 Score=65.24 Aligned_cols=419 Identities=10% Similarity=0.039 Sum_probs=192.7 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) |= +-+..-.+ |+............+ ... .+. | +.-+..|.-|...+..+ T Consensus 1 ~~--------~~~d~~g~-----p~~~~~~~~~~~~~~-~~~----~~~---~-~~~~~~gltp~~l~~iL--------- 49 (526) T protein:vir:99 1 MA--------QIVDVYGN-----PIRTQQLREPQTSRL-AGL----AKE---F-AQHPAKGLTPAKLARIL--------- 49 (526) T ss_pred CC--------eeECCCCC-----ccccccccchhhhhh-hhh----hhh---h-cccCcCCCCHHHHHHHH--------- Confidence 10 00000000 000000000000000 000 000 0 00011222221111111 Q ss_pred hhhcccccCCcccccccCccc-HHHHHHH-HhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcC Q lcl|NC_019527. 81 YQFLNSAAGGLYAADIQPFPG-YQNLAAL-ATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYG 158 (516) Q Consensus 81 ~~~~~~~~~~~~~~~~~~f~g-y~ll~~y-~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~ 158 (516) +.+ ..| .+.. ++|...+ .+..-+..++.+...-.+..-|.|....++..++ ....+.+++.+.++. T Consensus 50 -r~a---~~g-------d~~~~~~L~e~m~e~D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~~-~~~a~~v~~~l~~~~ 117 (526) T protein:vir:99 50 -VEA---EQG-------NLQAQAELFMDMEERDAHLFAEMSKRKRAILGLDWAVEPPRNASAAE-KADADYLHELLLDLE 117 (526) T ss_pred -Hhh---hCC-------CHHHHHHHHHHHHhhChHHHHHHHHHHHHHhCCCceEecCCCCCHHH-HHHHHHHHHHHhccc Confidence 111 100 0111 2333223 3688899999999988888888886543322111 123355667777764 Q ss_pred -hhHHHHHHHHhcccceeeEEEEE--ecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCccee Q lcl|NC_019527. 159 -VMGIIQKAAEHDCFFGRGQISIN--IKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTW 235 (516) Q Consensus 159 -~~~~l~ea~~~~rlyG~a~i~i~--i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y 235 (516) +.+.+.+++ .+.+||.|+.=+. .+++.+ .++.+...++.|..-. +..-....-..- T Consensus 118 ~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~--------------~~~~l~~r~~~~f~~~------~~~~~~l~~~~~ 176 (526) T protein:vir:99 118 GLEDLLLDAL-DGIGHGYSCIELEWALQGREW--------------MPLAFHHRPQSWFQLN------PEDQNELRLRDN 176 (526) T ss_pred CHHHHHHHHH-HhhhhcceeEEEEEeecCCce--------------eEEEeeeecccceeec------cCCCcEEEecCC Confidence 555555444 6999999986553 222221 2234555555544311 100000000001 Q ss_pred EEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCcee--eecchhhhcCccH Q lcl|NC_019527. 236 WVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFL--KTNMAQVLNGGEG 313 (516) Q Consensus 236 ~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~--k~~~~~~l~~~~~ 313 (516) ...|..+++.+.+++.... ...+.+|.+++..|+-...--..+...-+.++.++++++. |++-. ... T Consensus 177 ~~~g~~l~~~k~i~~~~~~------~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~-----a~~ 245 (526) T protein:vir:99 177 SPAGEALQPFGWIIHRPRA------RSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPG-----TAD 245 (526) T ss_pred CCCceeecCCCeEEEeecC------CcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCC-----CCH Confidence 1235667777777665532 3346689999999988777666677777789999997654 44311 122 Q ss_pred HHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCH---HHHHHHHHHHHHhhhcCCceeeeccc-c-------ccc Q lcl|NC_019527. 314 GDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGL---ADLQSQSQEHMCSVSKIPAIKLTGIS-P-------SGL 382 (516) Q Consensus 314 ~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl---~d~~~~~~~~iaaas~IP~t~L~G~s-p-------~Gl 382 (516) ++...-++.+..+.++ +..++. ++.+++.+..+=++. ..+++..-++|+-+ ++|++ + +|- T Consensus 246 ~ek~~L~~av~~i~~d-~~~iiP-~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~-------iLGqtlTs~~~~g~~gS 316 (526) T protein:vir:99 246 EEKATLLRAVTGLGHA-AAGIIP-ETMAIDFQQAAQGSSEPFLAMMRQSEDAISKA-------VLGGTLTSTTSQSGGGA 316 (526) T ss_pred HHHHHHHHHHHHHhhC-cEEEec-CCceeEEeecCCCCHHHHHHHHHHHHHHHHHH-------HhhhhhccccccCcchh Confidence 3334444555555444 444454 457899888653443 34556566666543 23443 1 123 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-CC--cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 383 NASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEI-DD--AITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT 459 (516) Q Consensus 383 natge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~-~~--d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~ 459 (516) +|-|+.-.....+.+++-......-+.+.|+..++.-.|+.. +. --.|+|..--..+- +..|+++..+++ T Consensus 317 ~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl-------~~~a~~~~~L~~ 389 (526) T protein:vir:99 317 FALGQVHNEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADI-------TSMAQSIPALVN 389 (526) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccH-------HHHHHHHHHHHh Confidence 333443445555666665554433444557777776666643 21 23566654333332 346778888999 Q ss_pred cCC-CCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCCCCCCCC-----CCCCC--CCCCC Q lcl|NC_019527. 460 NSV-IDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGADPYMPD-----PDVLP--GEEGS 516 (516) Q Consensus 460 ~gv-i~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~~~~~~~-----~~~~~--~~e~t 516 (516) .|+ |+.+++++.++ ++.-..+++...+..........++.. ....+ ....+ T Consensus 390 ~G~~i~~~~i~e~~G------ip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (526) T protein:vir:99 390 VGLEIPSAWVYDKLG------IPQPAKNEPVLRSAAQPAILSRQHGQRVAALATIVGPRYGDQQA 448 (526) T ss_pred CCCccCHHHHHHHhC------CCCCCCcccccCCCCCCcccccccccccccccccccccCcchhh Confidence 997 89999999873 322211111111100000000000000 00000 00000 No 200 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=98.99 E-value=4.5e-09 Score=66.34 Aligned_cols=445 Identities=9% Similarity=0.074 Sum_probs=204.7 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhccccc-ccc Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSL-CGP 79 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~-~~~ 79 (516) |++||-| +.+.. ..+.. . ....-++||..-+|... . -...++ .++ T Consensus 8 ~~~~w~~-----------------~de~~--------~~~~~---~-~~~~S~~~p~~~Dga~e-~----~~~~~~~a~~ 53 (524) T protein:vir:72 8 LFAPWAK-----------------MDERN--------FKDQE---K-EDLVSITAPKLDDGARE-F----EVSSNEAASP 53 (524) T ss_pred Hhhcccc-----------------Ccchh--------hhhhh---c-cCCccccCccCCCCcee-e----eecccccccc Confidence 3333222 11110 00111 1 11122455554444311 0 000011 011 Q ss_pred hhhhcccccCCcccccccCcccHHHHHHHH---hCchhhhhhhhhhHHHhh-----CCCeeeeccccchhhhHH-HHHHH Q lcl|NC_019527. 80 TYQFLNSAAGGLYAADIQPFPGYQNLAALA---TRPEYRAFASTLSTELTR-----EGIEITSKDRTKAKEMAS-KIKEL 150 (516) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~---~~~i~r~iVd~~aed~~r-----~~~~i~~~~~~~~~~~~~-~i~~i 150 (516) ..+......++ .+.+-...++|...|+ .++++..+|+.+++||+- +.+++...+-+-.+...+ ...++ T Consensus 54 ~~g~~~~~~g~---~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF 130 (524) T protein:vir:72 54 YNAAFQTIFGS---YEPGMKTTRELIDTYRNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEF 130 (524) T ss_pred cceeeeehhcc---cccccchHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHH Confidence 11111111111 1122235678888886 699999999999999973 233333322221121111 12334 Q ss_pred HHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeecccccc---------- Q lcl|NC_019527. 151 EEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYN---------- 220 (516) Q Consensus 151 ~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~---------- 220 (516) +..++-|++...-.+.+|.--+.|.-+.-..|+..++.. +|+.++.+||..+..+..- T Consensus 131 ~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~------------GI~Elr~lDPr~i~~vr~i~~~~~~~~~v 198 (524) T protein:vir:72 131 SDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKE------------GIKELRRLDPRQVQYVREIITETEAGTKI 198 (524) T ss_pred HHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccc------------cceeeeeeCCccceeeeeeccCCCccchh Confidence 444445577777777777766777665555566555432 2344555555544332111 Q ss_pred --------ccccccccc-cCcceeEE-eeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 221 --------ALDPTAPDF-YKPSTWWV-LGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSD 290 (516) Q Consensus 221 --------~~dp~s~~y-g~P~~y~v-~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~ 290 (516) ..+|....| +.+..|.- ++.+||.+=+.+.+..- .+.....=+|.|+++...+.+.-....+ - T Consensus 199 i~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dAI~y~hSGL-----~d~~~~~i~gyLhkAiKp~NQLkmlEDA--l 271 (524) T protein:vir:72 199 VKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAAVVYAHSGL-----VDCCGKNIIGYLHRAVKPANQLKLLEDA--V 271 (524) T ss_pred hcchhhheeeccCccccccCccccCCCcceecchhheeeeeccc-----eeCCCCceeccchhhhHhHHhhhHHHhh--H Confidence 111111111 11111111 12356655544333221 1222223356777766666554433332 2 Q ss_pred HHHHhC----CceeeecchhhhcCccHHHHHHHHHHHHHh------cCCcceE--------------EE---ecCCccee Q lcl|NC_019527. 291 LVDKFS----RTFLKTNMAQVLNGGEGGDVFDRVEMYVNM------QSNLGLA--------------VM---DFDSEDIV 343 (516) Q Consensus 291 Ll~~~~----~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~------~sn~g~~--------------~i---d~~~e~~e 343 (516) ++++.+ -.|+=+|+.++-.. ..++..+- .++.+ -.++|-+ +- ++.+-+++ T Consensus 272 VIYRitRAPeRRvFYIDvGnlPk~-KAeqYl~~--im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 348 (524) T protein:vir:72 272 VIYRITRAPDRRVWYVDTGNMPAR-KAAEHMQH--VMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVD 348 (524) T ss_pred HHHhhhccccceEEEEecCCCCch-hHHHHHHH--HHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCccccee Confidence 444433 24444444433211 11111110 11111 1111110 00 01112344 Q ss_pred EEe--cccCCHHHHHHHHHHHHHhhhcCCceeeecccccccc--ccch--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_019527. 344 QVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSE--GEIRSFYDDISSVQQSYYFSPLDTMLKVI- 416 (516) Q Consensus 344 ~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge--~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l- 416 (516) ++. -+|+.++|+ ..|+..+-.+.++|.++|-+.+++|+| .++| -|.-.|..+|.+.|..+ ..++..+++.= T Consensus 349 TLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rF-s~~f~~~Lk~qL 426 (524) T protein:vir:72 349 TLPGADNTGNMEDI-RWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKF-EEVFLDPLKTNL 426 (524) T ss_pred eccccCCcChHHHH-HHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHhh Confidence 442 245556554 589999999999999999777777776 3332 15567999999998654 45555554421 Q ss_pred HH------HhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH-cC-CCCHHHHHHHHHhhhccCCCCCChhhh Q lcl|NC_019527. 417 QL------SKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT-NS-VIDPSEARQQLSDDPDSGWDNIDGDLE 488 (516) Q Consensus 417 ~~------s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~-~g-vi~~~e~r~~l~~~~~~~~~~~d~~~e 488 (516) .+ ..|-.+-+++.|+|..=.--+|-..+|+...+.++++.+-. .| .++.+-+++.+-... |++++ T Consensus 427 ilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~t-------Deei~ 499 (524) T protein:vir:72 427 LLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMT-------DEEIE 499 (524) T ss_pred hhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccC-------HHHHH Confidence 11 12333345788999988888899999999999888887643 22 567777876653321 22332 Q ss_pred ccccccchhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 489 IVQPEMFDDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 489 ~~~~e~~~~e~~~~~~~~~~~~~~~e 514 (516) .....+.+ |..++-.++|+++.+.= T Consensus 500 ~~~k~I~~-E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:72 500 QEAKQIEE-ESKEARFQDPDQEQEDF 524 (524) T ss_pred HHHHHHHH-HhhcCCCCCCchhhhcC Confidence 22222211 11111111111110000 No 201 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=98.99 E-value=4.7e-09 Score=66.23 Aligned_cols=445 Identities=9% Similarity=0.074 Sum_probs=204.3 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhccccc-ccc Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSL-CGP 79 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~-~~~ 79 (516) |++||-| +.+.. ..+.. . ....-++||..-+|... . -...++ .++ T Consensus 8 ~~~~w~~-----------------~de~~--------~~~~~---~-~~~~S~~~p~~~Dga~e-~----~~~~~~~a~~ 53 (524) T protein:vir:10 8 LFAPWAK-----------------MDERN--------FKDQE---K-EDLVSITAPKLDDGARE-F----EVSSNEAASP 53 (524) T ss_pred Hhhcccc-----------------Ccchh--------hhhhh---c-cCCccccCccCCCCcee-e----eecccccccc Confidence 3333222 11110 00111 1 11122455554444311 0 000011 011 Q ss_pred hhhhcccccCCcccccccCcccHHHHHHHH---hCchhhhhhhhhhHHHhh-----CCCeeeeccccchhhhHH-HHHHH Q lcl|NC_019527. 80 TYQFLNSAAGGLYAADIQPFPGYQNLAALA---TRPEYRAFASTLSTELTR-----EGIEITSKDRTKAKEMAS-KIKEL 150 (516) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~---~~~i~r~iVd~~aed~~r-----~~~~i~~~~~~~~~~~~~-~i~~i 150 (516) ..+......++ .+.+-...++|...|+ .++++..+|+.+++||+- +.+++...+-+-.+...+ ...++ T Consensus 54 ~~g~~~~~~g~---~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF 130 (524) T protein:vir:10 54 YNAAFQTIFGS---YEPGMKTTRELIDTYRNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEF 130 (524) T ss_pred cceeeeehhcc---cccccchHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHH Confidence 11111111111 1122235678888886 699999999999999973 233333322221121111 12334 Q ss_pred HHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeecccccc---------- Q lcl|NC_019527. 151 EEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYN---------- 220 (516) Q Consensus 151 ~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~---------- 220 (516) +..++-|++...-.+.+|.--+.|.-+.-..|+..++.. +|+.++.+||..+..+..- T Consensus 131 ~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~------------GI~Elr~lDPr~i~~vr~i~~~~~~~~~v 198 (524) T protein:vir:10 131 NDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKE------------GIKELRRLDPRQVQYVREIITETEAGTKI 198 (524) T ss_pred HHHHHHhccchhhhHHHhhheeeeEEEEEEEeeCCCccc------------cceeeeeeCCccceeeeeeccCCCccchh Confidence 444445577777777777766767665555566554332 2344555555544332111 Q ss_pred --------ccccccccc-cCcceeEE-eeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 221 --------ALDPTAPDF-YKPSTWWV-LGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSD 290 (516) Q Consensus 221 --------~~dp~s~~y-g~P~~y~v-~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~ 290 (516) ..+|....| +.+..|.- ++.+||.+=+.+.+..- .+.....=+|.|+++...+.+.-....+ - T Consensus 199 i~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dAI~y~hSGL-----~d~~~~~i~gyLhkAiKp~NQLkmlEDA--l 271 (524) T protein:vir:10 199 VKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAAIVYAHSGL-----VDCCGKNIIGYLHRAVKPANQLKLLEDA--V 271 (524) T ss_pred hcchhhheeeccCccccccCccccCCCcceecchhheeeeeccc-----eeCCCCceeccchhhhHHHHhhhHHHhh--H Confidence 011111111 11111111 12356655544333221 1222223356777766666554433332 2 Q ss_pred HHHHhC----CceeeecchhhhcCccHHHHHHHHHHHHHh------cCCcceE--------------EE---ecCCccee Q lcl|NC_019527. 291 LVDKFS----RTFLKTNMAQVLNGGEGGDVFDRVEMYVNM------QSNLGLA--------------VM---DFDSEDIV 343 (516) Q Consensus 291 Ll~~~~----~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~------~sn~g~~--------------~i---d~~~e~~e 343 (516) ++++.+ -.|+=+|+.++-.. ..++..+- .++.+ -.++|-+ +- ++.+-+++ T Consensus 272 VIYRitRAPeRRvFYIDvGnlPk~-KAeqYl~~--im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 348 (524) T protein:vir:10 272 VIYRITRAPDRRVWYVDTGNMPAR-KAAEHMQH--VMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVD 348 (524) T ss_pred HHHhhhccccceEEEEecCCCCch-hHHHHHHH--HHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCccccee Confidence 444433 24444444433211 11111110 11111 1111110 00 01112344 Q ss_pred EEe--cccCCHHHHHHHHHHHHHhhhcCCceeeecccccccc--ccch--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_019527. 344 QVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSE--GEIRSFYDDISSVQQSYYFSPLDTMLKVI- 416 (516) Q Consensus 344 ~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge--~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l- 416 (516) ++. -+|+.++|+ ..|+..+-.+.++|.++|-+.+++|+| .++| -|.-.|..+|.+.|..+ ..++..+++.= T Consensus 349 TLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rF-s~~f~~~Lk~qL 426 (524) T protein:vir:10 349 TLPGADNTGNMEDV-RWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKF-EEVFLDPLKTNL 426 (524) T ss_pred eccccCCcChHHHH-HHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHhh Confidence 442 245556554 589999999999999999777777776 3332 15567999999998654 45555554421 Q ss_pred HH------HhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH-cC-CCCHHHHHHHHHhhhccCCCCCChhhh Q lcl|NC_019527. 417 QL------SKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT-NS-VIDPSEARQQLSDDPDSGWDNIDGDLE 488 (516) Q Consensus 417 ~~------s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~-~g-vi~~~e~r~~l~~~~~~~~~~~d~~~e 488 (516) .+ ..|-.+-+++.|+|..=.--+|-..+|+...+.++++.+-. .| .++.+-+++.+-... |++++ T Consensus 427 ilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~t-------Deei~ 499 (524) T protein:vir:10 427 LLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMT-------DEEIE 499 (524) T ss_pred hhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccC-------HHHHH Confidence 11 12333345788999988888899999999999888887643 22 567777876653321 22332 Q ss_pred ccccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 489 IVQPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 489 ~~~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) .....+.+ |..++-.++|+++.. .= T Consensus 500 ~~~k~I~~-E~k~~~~~~~~~~~~--~f 524 (524) T protein:vir:10 500 QEAKQIEE-ESKEARFQDPDQEQE--DF 524 (524) T ss_pred HHHHHHHH-HhhcCCCCCCchhhh--cC Confidence 22222211 111111111111100 00 No 202 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=98.97 E-value=2.7e-09 Score=67.58 Aligned_cols=327 Identities=10% Similarity=0.046 Sum_probs=159.0 Q ss_pred hhhhHHHHh--HHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhccccc-CCcccccccCcccHHHHHHH Q lcl|NC_019527. 32 LAMRRAVMK--SMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAA-GGLYAADIQPFPGYQNLAAL 108 (516) Q Consensus 32 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~-~~~~~~~~~~f~gy~ll~~y 108 (516) |..+.+..+ .....+..+.+ |-.|.+. | .+ ...+..-.. ....+++ .+..-..|..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~----~--------~~-----~~~y~~~~~~~~~~~~e-pp~~~~~la~~~ 61 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRT-FSLSEIT----A--------SP-----ALDYVGIGFDENYNCYL-PPVNRHALAKLP 61 (345) T ss_pred CCccccccchhhhcCCCceEEE-eecCCcc----c--------ch-----hhcccceeeecCCcccc-CCCCHHHHHHHh Confidence 111100000 00011111111 1111100 0 00 111111110 0111111 122234567777 Q ss_pred HhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcc Q lcl|NC_019527. 109 ATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVS 188 (516) Q Consensus 109 ~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~ 188 (516) +.|+....+|..-+...+ .++. +.. +--+..|.+++....+||.|++++..++ T Consensus 62 ~~~~~h~~~i~~k~n~l~-~~~~-Pn~---------------------~~t~~~f~~~v~d~ll~Gnay~~i~rn~---- 114 (345) T protein:vir:37 62 HQNAQHGGILHSRANMVS-ATYE-GGK---------------------ALSKMEMRALCLNLIQFGDVGLLKVRNG---- 114 (345) T ss_pred hcchhhcchhhhhhhHHh-hccC-CCC---------------------CCCHHHHHHHHHHHHhcCCeEEEEEECC---- Confidence 777777777665554332 2331 110 0012334445555567899999886532 Q ss_pred cCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee--eEeccceEEEecCCcchhhhhhccCCC Q lcl|NC_019527. 189 VPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG--REMHASRLLTIITRPLPDMLKPAYNFS 266 (516) Q Consensus 189 ~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g--~~iH~SRli~~~~~~~p~~~k~~~~~~ 266 (516) .|.+..|.++.+.+|.-.. |.....+.....+...+ ..+.++.|+||.... +....+ T Consensus 115 -----------~G~~~~L~pl~~~~vr~~~----d~~~~~~~~~~~~~~~g~~~~~~~~eViHir~~~------~~~~~~ 173 (345) T protein:vir:37 115 -----------FGQVVRLVPLSSLYLRVHK----DGGYSYLMKKSLYDTAQEIYRYDAKDIIFIKLYD------PMQQVY 173 (345) T ss_pred -----------CCCEEEEEEecCceeEEee----cCCeeEEEeeeeeccCceEEEEccccEEEEcCCC------CCCCcc Confidence 2345567777776665211 11100011111111112 356789999997532 234567 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce--eeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEe-cC--Ccc Q lcl|NC_019527. 267 GISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF--LKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMD-FD--SED 341 (516) Q Consensus 267 G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v--~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id-~~--~e~ 341 (516) |+|.+..+...+..-..+......+..+....- +.+. ...++.++.+.+.+.++... ...|.+.+++. .+ .+. T Consensus 174 Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t-~~~l~~e~~~~lk~~~~~~~-g~~n~~~~~i~~~~g~~~G 251 (345) T protein:vir:37 174 GSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYST-DPDLTEEMEEEIARKISESK-GVGNFRSMFVNIAGGHPDG 251 (345) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC-CCCCCHHHHHHHHHHHHHhc-CccccCceeEecCCCCccc Confidence 999999988888777777777777777655432 2221 12355444555666665543 33454444332 21 233 Q ss_pred eeEEecccCCH----HHHHHHHHHHHHhhhcCCceeeecccccccc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 342 IVQVNTPLSGL----ADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSEGEIRSFYDDISSVQQSYYFSPLDTMLKV 415 (516) Q Consensus 342 ~e~~~~~lsgl----~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~ 415 (516) ++...++.+.- -++.....+.||++.+||-. |.|..+.+.+ ++-+...+.| -+..|.|.++++.+. T Consensus 252 ~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~-liGi~~~~t~~~s~~e~~~~~f-------~~~~l~P~~~~ie~~ 323 (345) T protein:vir:37 252 LKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAG-LSGIIPTNTGGLGDPLKYREVY-------HYDEVMPLQEIIAET 323 (345) T ss_pred eeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhccccCCCCCcccHHHHHHHH-------HHHHHHHHHHHHHHH Confidence 44444444432 33456667889999999974 5587654321 3333344444 345678888888777 Q ss_pred HHHHhCCCcCCcceEEeCC--CCC Q lcl|NC_019527. 416 IQLSKWGEIDDAITFKFKS--LWQ 437 (516) Q Consensus 416 l~~s~~g~~~~d~~~~f~p--L~~ 437 (516) +-+ ..+++.+..|.|++ |.. T Consensus 324 ln~--~~e~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 324 INQ--DPEIKNLLKIKFREQNFAK 345 (345) T ss_pred hhh--hhccCCcceEEECchhhcC Confidence 653 33456677888875 433 No 203 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=98.97 E-value=1.9e-09 Score=68.40 Aligned_cols=427 Identities=8% Similarity=0.001 Sum_probs=189.4 Q ss_pred cccccccccCCCcCCCCCChhhhHHHHhHHhhcC-C--CccccccCCCCCCCccCCCccchhcccccccchhhhcccccC Q lcl|NC_019527. 13 VADKLADAARAEEQEKARKLAMRRAVMKSMERRA-S--DAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAG 89 (516) Q Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~ 89 (516) +...-= -.-.....-+......++.....+. . +....|+.... .-+- +. ... T Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~----------~i~~---------~~---~~~ 55 (489) T protein:vir:99 1 MLQEDF---EAIDYESKLWIDQLKNYISRFKAEQLERLKELKRYYLGDN----------NIKY---------RP---AKT 55 (489) T ss_pred CCccce---eeeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC----------cccc---------cc---ccc Confidence 000000 0000011111111112222221111 0 11111221110 0000 00 000 Q ss_pred CcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHh Q lcl|NC_019527. 90 GLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEH 169 (516) Q Consensus 90 ~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~ 169 (516) ....+.. + -.+++++.||+..+.-++.+++++++.++.. .+.|+..++.-++...+.++.+. T Consensus 56 ------~~~~~~~---k--i~~n~~~~iv~~~~~~l~g~~~~~~~~d~~~-------~~~l~~~~~~n~~~~~~~~~~~~ 117 (489) T protein:vir:99 56 ------DKYAADN---R--IASDFAKYITVFEQGYMLGVPVEYKNENKDL-------QAAIDLMSVRNNEDYHNVKIKTD 117 (489) T ss_pred ------cccCCcc---e--eecchHHHHHHHHhhhhccCCceeecCChhH-------HHHHHHHHhhcChhHHHHHHHHH Confidence 0000000 0 1357899999999999999999998765432 23456666677888899999999 Q ss_pred cccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee--------eE Q lcl|NC_019527. 170 DCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG--------RE 241 (516) Q Consensus 170 ~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g--------~~ 241 (516) ..+||.|++++.+....... +.+ .+.+++|..+.|..- ......+.|+- .+|.+.. .. T Consensus 118 ~~~~G~~~~~v~~~~~~d~~-----------~~~-~i~~~~p~~~~~v~d-d~~~~~~~~~i-~~~~~~~~~~~~~~~~~ 183 (489) T protein:vir:99 118 LSIYGRAYELLTVEKIDDKK-----------TEV-KLYQLPAEQTFVIYD-DTYQRNSLMAV-HFYDIDYGSGKRKQIIK 183 (489) T ss_pred HhhCCeEEEEEeeccCcCCC-----------cce-EEEEEcccceEEEEc-CCCCCceEEEE-EEEEEecCCCceEEEEE Confidence 99999999887653211000 111 144555555554321 00011111110 0111110 01 Q ss_pred -eccceEEEecCCc------------------chhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeee Q lcl|NC_019527. 242 -MHASRLLTIITRP------------------LPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKT 302 (516) Q Consensus 242 -iH~SRli~~~~~~------------------~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~ 302 (516) +.+.++.+|.... +|-. .-.++-+|.|.++.+.+.+.+++.+....+.-+..++..++.. T Consensus 184 ~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPvv-~~~n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i 262 (489) T protein:vir:99 184 AYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPVN-EYANNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVI 262 (489) T ss_pred EEeCCcEEEEEecCCCcccceecccccccCCceeEE-EeecCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhh Confidence 1122333332110 1111 0123456899999999999999888877776665555444332 Q ss_pred cchhhhcCccHHHHHHHHHHHH-------HhcCCcceEEEecC------Cc--ceeEEecccCCHHHHHHHHHHHHHhhh Q lcl|NC_019527. 303 NMAQVLNGGEGGDVFDRVEMYV-------NMQSNLGLAVMDFD------SE--DIVQVNTPLSGLADLQSQSQEHMCSVS 367 (516) Q Consensus 303 ~~~~~l~~~~~~~l~~r~~~~~-------~~~sn~g~~~id~~------~e--~~e~~~~~lsgl~d~~~~~~~~iaaas 367 (516) .... +...+..+.......-. .......+..++.. +. .+-....+.+++...++.+.+.|...+ T Consensus 263 ~g~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s 341 (489) T protein:vir:99 263 AGNA-YTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFT 341 (489) T ss_pred ccCC-cccccchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHh Confidence 2111 11111111111111000 00001111222111 11 233344566788889999999999999 Q ss_pred cCCceeeeccccccccccchHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHh---CCCcC-----CcceEEeCCCC Q lcl|NC_019527. 368 KIPAIKLTGISPSGLNASSEGEIRSF---YDDISSVQQSYYFSPLDTMLKVIQLSK---WGEID-----DAITFKFKSLW 436 (516) Q Consensus 368 ~IP~t~L~G~sp~Glnatge~D~~~y---yd~I~~~Qe~~l~p~l~~l~~~l~~s~---~g~~~-----~d~~~~f~pL~ 436 (516) ++|-.-.- +.+| |+||..=...+ ...+..+ +..++..+++++++++.-. .+... .+++|.|++-. T Consensus 342 ~~p~~~~~--~~~~-n~Sg~Al~~~~~~l~~k~~~k-~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~ 417 (489) T protein:vir:99 342 FTPDTQDM--KFSG-VQSGESMKYKLMASDNYREKQ-ERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNL 417 (489) T ss_pred CCcccccc--cccc-cchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCC Confidence 99964322 2223 56666422222 3334443 3567888888887765421 22222 36899999998 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChh--hhccccccchhcC-CCC-CCCCC-CCCC Q lcl|NC_019527. 437 QTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGD--LEIVQPEMFDDDG-ADP-YMPDP-DVLP 511 (516) Q Consensus 437 ~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~--~e~~~~e~~~~e~-~~~-~~~~~-~~~~ 511 (516) ..+..+.+++..+ + .|+++.+.+.+.+. ++..-+.. .+....|...... .++ ..++. ++.. T Consensus 418 p~d~~~~~~~~~k-------l--~giis~et~~~~l~-----~v~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 483 (489) T protein:vir:99 418 PQNDNEIVTAAQN-------L--YGIVSDQTIFEILN-----TVTGVDAEAELKRLKEEADKKQSLPEPRLVGDASGQEE 483 (489) T ss_pred CcCHHHHHHHHHH-------H--hccCCHHHHHHhcC-----CCCchhHHHHHHHHHHHHHHHhccccccccCCCCCCcC Confidence 8899888776544 2 37888877776541 11111111 1111111100000 000 01111 1111 Q ss_pred CCCCC Q lcl|NC_019527. 512 GEEGS 516 (516) Q Consensus 512 ~~e~t 516 (516) +.+.. T Consensus 484 ~~~~~ 488 (489) T protein:vir:99 484 PTAEK 488 (489) T ss_pred CCCCC Confidence 11111 No 204 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=98.97 E-value=2.5e-09 Score=67.76 Aligned_cols=390 Identities=10% Similarity=-0.019 Sum_probs=172.2 Q ss_pred cCCCccchh---ccccc-ccchhhhcccccCCcccccccCc-ccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccc Q lcl|NC_019527. 63 VPAGTTPAV---AMDSL-CGPTYQFLNSAAGGLYAADIQPF-PGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDR 137 (516) Q Consensus 63 ~~~~~~~~~---a~ds~-~~~~~~~~~~~~~~~~~~~~~~f-~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~ 137 (516) +.+.....+ ..--. .....+....++-|-+.-...+. ..-++-.....+.++++||+..++-+.=.|++.. ++ T Consensus 1 ~~~~~~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~g~~~~--d~ 78 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLDWLGWTNG--DG 78 (441) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhccccccCC--Ch Confidence 111111111 00000 00011111122222221111111 1111111223567899999999987765666421 11 Q ss_pred cchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccc Q lcl|NC_019527. 138 TKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPS 217 (516) Q Consensus 138 ~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~ 217 (516) +.++..++.-++...+.++.+...+||.|++++..+... .| .+.+++|.++.+. T Consensus 79 ----------~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g--~~--------------~i~~~~p~~~~~i 132 (441) T protein:vir:80 79 ----------YGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGDG--TV--------------SVRPQSPKNCTGK 132 (441) T ss_pred ----------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCC--ce--------------EEEEEccceEEEE Confidence 235566777789999999999999999999887654321 11 1233333333321 Q ss_pred cccccccc----------cc------cccCcceeEE--e--------eeEeccc-e--EEEecCCcchhhhhhccCCCCc Q lcl|NC_019527. 218 AYNALDPT----------AP------DFYKPSTWWV--L--------GREMHAS-R--LLTIITRPLPDMLKPAYNFSGI 268 (516) Q Consensus 218 ~~~~~dp~----------s~------~yg~P~~y~v--~--------g~~iH~S-R--li~~~~~~~p~~~k~~~~~~G~ 268 (516) .-...... .. -|.....|++ . ...-|+- + |++|.++ ......||. T Consensus 133 ~d~~~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~------~~~~~~~G~ 206 (441) T protein:vir:80 133 FSADGSRLDAGLVVQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNR------RRTSRIDGR 206 (441) T ss_pred EeCCCCceeEEEEEEEEecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeecc------ccCCccCCc Confidence 10000000 00 0111111111 0 0011211 1 1112111 122345799 Q ss_pred hHHH-HHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEec--CCcceeEE Q lcl|NC_019527. 269 SMSQ-LAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDF--DSEDIVQV 345 (516) Q Consensus 269 S~le-~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~--~~e~~e~~ 345 (516) |.+. .+.+.+.+++.+.......+..++...+.+.... +........... -.++..+++ +++..+.. T Consensus 207 s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~-~~~~~~~~~~~~---------~~~i~~~~~~~~~~~~~~~ 276 (441) T protein:vir:80 207 SEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVS-ADEFSQPGWVLS---------MASVWAVDKDDDGDTPNVG 276 (441) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeeecCC-ccccccchhhhc---------ccccccCCCCCCCCcceeE Confidence 9775 4667777777777766666665655544332111 111111110000 111222222 22223333 Q ss_pred ecccCC---HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 346 NTPLSG---LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDIS--SVQQSYYFSPLDTMLKVIQLSK 420 (516) Q Consensus 346 ~~~lsg---l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~--~~Qe~~l~p~l~~l~~~l~~s~ 420 (516) +.+-++ ..+.+.....++++.++||...| |.++. .++||+.=...+...+. .+++..+.+.|.+++++++.-. T Consensus 277 ~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~-g~~~~-~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~ 354 (441) T protein:vir:80 277 SFPVNSPTPYSDQMRLLAQLTAGEAAVPERYF-GFITS-NPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKAL 354 (441) T ss_pred ecCccchHHHHHHHHHHHHHHhcccCCCHHHh-ccCCC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 333334 44556666789999999997555 54432 23566643222322222 3445667888999888776543 Q ss_pred C--CCcC---CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCC--HHHHHHHHHhhhccCCCCCChhhhccccc Q lcl|NC_019527. 421 W--GEID---DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVID--PSEARQQLSDDPDSGWDNIDGDLEIVQPE 493 (516) Q Consensus 421 ~--g~~~---~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~--~~e~r~~l~~~~~~~~~~~d~~~e~~~~e 493 (516) . +..+ .++++.|++....+.+|.|+.. .+++++|++. .+.++..+ ++. +.+.+..+.+ T Consensus 355 ~~~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~-------~kl~~~g~~~~s~~~~~~~l------~~~--~~e~~~~~~e 419 (441) T protein:vir:80 355 DSRVDEADFFGDVGLRWRDASTPTRAATADAV-------TKLVGAGILPADSRTVLEML------GLD--DVQVEAVMRH 419 (441) T ss_pred cCCCcccccceeeeEEeCCCCCcCHHHHHHHH-------HHHHhcCcccccHHHHHHhC------CCC--HHHHHHHHHH Confidence 2 2222 3678999999999998876654 5567777653 22333322 222 1122211111 Q ss_pred cchhcCC--CCCCCCCCCCCCC Q lcl|NC_019527. 494 MFDDDGA--DPYMPDPDVLPGE 513 (516) Q Consensus 494 ~~~~e~~--~~~~~~~~~~~~~ 513 (516) ..+.++. ...+..+..+..+ T Consensus 420 ~~e~~~~~~~~~~~~~~~~~~~ 441 (441) T protein:vir:80 420 RAESSDPLAVLAGAISRQTNEV 441 (441) T ss_pred HHHHHHHHHHHhhhhhcccccC Confidence 1111110 0011111111111 No 205 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=98.95 E-value=1e-08 Score=64.38 Aligned_cols=398 Identities=9% Similarity=-0.018 Sum_probs=192.0 Q ss_pred cCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHH Q lcl|NC_019527. 25 EQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQN 104 (516) Q Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~l 104 (516) |+++.....+. +.. ...+....|. .|..+.. +.+ .+.+. + -.+.+ T Consensus 1 v~~~~l~~e~a-----t~~-~~~d~~~~~~-----~~l~~~~--~~i---------l~~a~---~----------g~~~~ 45 (488) T protein:vir:99 1 MEKPALGREIA-----TSG-DGRDITRPFI-----SGLQVPN--DSI---------LQRRG---G----------NDLRV 45 (488) T ss_pred CCccchhHHHH-----HHH-hhhhhhcccc-----CCCCCCC--hHH---------HHhhc---c----------CCHHH Confidence 22222222211 000 0011111111 1111101 011 00000 0 01233 Q ss_pred HHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEe-- Q lcl|NC_019527. 105 LAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINI-- 182 (516) Q Consensus 105 l~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i-- 182 (516) ...+....-+..++++...-+++.-|+|...+++.. ..+..+.+++.++++.+.+.+.+++ .+.+||.|+.=+.- T Consensus 46 y~~l~~D~~i~s~l~~rk~av~~~~w~i~p~~~~~~--~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~ 122 (488) T protein:vir:99 46 YEEILSDAQVKTVWGQRQLAVVSREWKVEAGGDRPI--DQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGR 122 (488) T ss_pred HHHHhhChHHHHHHHHHHHHHhcCCceEEcCCCChH--HHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEee Confidence 344456888999999999999999999976544322 1233466778888888777777666 68999999865432 Q ss_pred cCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEeeeEec-cce-EEEecCCcchhhhh Q lcl|NC_019527. 183 KGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMH-ASR-LLTIITRPLPDMLK 260 (516) Q Consensus 183 ~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH-~SR-li~~~~~~~p~~~k 260 (516) +++.+ .++.|...++.|+..... .............|..++ +-. +++.... T Consensus 123 ~~g~~--------------~~~~l~~r~~~~f~~d~~------~~l~~~~~~~~~~g~~lp~~~~~i~~~~~~------- 175 (488) T protein:vir:99 123 DDRYI--------------TLEAIKVRNRRRFRYDQD------GGLRLLTPNNMFEGEPCPAPYFWHFSTGAD------- 175 (488) T ss_pred cCCee--------------eEeeeeeecccceeecCC------CceEEeccCCCCCccccccCceEEEEeecC------- Confidence 22211 233455555555442110 000000000111233342 212 2222111 Q ss_pred hccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCc Q lcl|NC_019527. 261 PAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSE 340 (516) Q Consensus 261 ~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e 340 (516) ...+.+|.|++..|+....--..+...-+.++.++++++..-+.... +.+.++..+-++.+..+.+ .+..++.. +. T Consensus 176 ~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~--~a~~~ek~~l~~av~~~~~-~~~~viP~-~~ 251 (488) T protein:vir:99 176 NDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDK--TATPEDKAKLLAALHAIQT-DSAIIMPA-GM 251 (488) T ss_pred CCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCC--CCCHHHHHHHHHHHHHHhc-CcEEEecC-Cc Confidence 12356899999999998776677777888889999987653322111 1112222333344444444 34445544 47 Q ss_pred ceeEEecccCCH---HHHHHHHHHHHHhh-hcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 341 DIVQVNTPLSGL---ADLQSQSQEHMCSV-SKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVI 416 (516) Q Consensus 341 ~~e~~~~~lsgl---~d~~~~~~~~iaaa-s~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l 416 (516) +++.++..=++. ..+++..-++|+-+ .|= | |.++.-+|-+|.|+.-.....+.+++-......-+.+.|+..+ T Consensus 252 ~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLGq--t-lts~~~~Gs~a~~~vh~~v~~d~~~aDa~~i~~tln~~li~~l 328 (488) T protein:vir:99 252 QAELLEAGRSGTADYKTLHDTMDATIAKVGLGQ--V-ASTQGTPGRLGNDDLQADVRLDLVKADADLICESFNLGPARWL 328 (488) T ss_pred eeEEeecCCCChHHHHHHHHHHHHHHHHHHhhh--h-hcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 788887653443 44666666777644 222 1 2233323445666666677777777777654444445577766 Q ss_pred HHHhCCCc-CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHc-CC-CCHHHHHHHHHhhhccCCCCCChhhhccccc Q lcl|NC_019527. 417 QLSKWGEI-DDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITN-SV-IDPSEARQQLSDDPDSGWDNIDGDLEIVQPE 493 (516) Q Consensus 417 ~~s~~g~~-~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~-gv-i~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e 493 (516) +.-.|+.. ++ .|.|......+.+ ..|+++.++++. |+ ++.+.+++.+ |++.-....+...+ T Consensus 329 ~~~N~~~~~~p--~~~~~~~e~edl~-------~~a~~~~~l~~~~G~~i~~~~i~e~~------Gip~~~~~~~~~~~- 392 (488) T protein:vir:99 329 TEWNFPGAQPP--RVYRVIEEPEDIT-------AKAERDEKVFRMSGFRPTRGYVQETY------GVEVESTQAEATAP- 392 (488) T ss_pred HHhCcCCcCCc--eeEecCCCcccHH-------HHHHHHHHHHhhcCCCCCHHHHHHHc------CCCCcccccccccC- Confidence 66555422 22 3444433333333 346667778886 75 7888898887 33321100000000 Q ss_pred cchhcCCCCCCC-CCCCCCCCCCC Q lcl|NC_019527. 494 MFDDDGADPYMP-DPDVLPGEEGS 516 (516) Q Consensus 494 ~~~~e~~~~~~~-~~~~~~~~e~t 516 (516) ...... +....++..+. T Consensus 393 ------~~~~~~~~~~~~~~~~~~ 410 (488) T protein:vir:99 393 ------TPSTEFAEGDQPSDPAAA 410 (488) T ss_pred ------CCcccCCCCCCCCCchHH Confidence 000000 01111111111 No 206 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=98.94 E-value=4.9e-09 Score=66.13 Aligned_cols=325 Identities=12% Similarity=0.078 Sum_probs=154.6 Q ss_pred cchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhh Q lcl|NC_019527. 3 PFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQ 82 (516) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~ 82 (516) =.|||+.+ ..+.. ... ..+..++ .|--|++ | +|+. .... T Consensus 1 m~~~~~~~---~~~~~-------~~~---------------~~~~~~~-~~~~p~~---~----------~~~~--~~~~ 39 (340) T protein:vir:98 1 MSKRKPRK---AVAMT-------ASA---------------PQKMEAF-TFGEPVP---V----------LDKR--DILD 39 (340) T ss_pred CCCCCCCc---ccccc-------ccC---------------ccceeEE-EcCCcee---e----------cCcc--hhhh Confidence 11222100 00000 000 0000111 1211211 1 1111 0011 Q ss_pred hcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHH Q lcl|NC_019527. 83 FLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGI 162 (516) Q Consensus 83 ~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~ 162 (516) +......+.++....++ ..|..+++.++.-..+|...+.+..+ ++.- . +. + .+.. T Consensus 40 ~~~~~~~~~~~~pp~~~--~~la~l~~a~~~h~s~i~~k~n~l~~-~~~P-n---~~-------l-----------t~~~ 94 (340) T protein:vir:98 40 YVECISNGKWYEPPVSF--SGLAKSLRSAVHHSSPIYVKRNVLAS-TYIP-H---PL-------L-----------SRQD 94 (340) T ss_pred hhhhhhcCceecCCCCH--HHHHHHHHhccccchhhhhhhhHHhh-ccCC-C---CC-------C-----------CHHH Confidence 11111122111222233 23666677777666666655544333 2210 0 00 0 1122 Q ss_pred HHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee--- Q lcl|NC_019527. 163 IQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG--- 239 (516) Q Consensus 163 l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g--- 239 (516) |..++....++|.|++.+..+. .|.+..|.++.+.++.... +. . .+|++.+ T Consensus 95 f~~~~~d~ll~Gnay~~~~rn~---------------~G~~~~L~pl~~~~vr~~~----~~--~-----~~~~~~~~~~ 148 (340) T protein:vir:98 95 FSRFALDYLVFGNAFLEQRHSV---------------TGQLIKLLTSPAKYTRRGV----DD--S-----VFWFVENFTQ 148 (340) T ss_pred HHHHHHHHHhcCCeEEEEEECC---------------CCcEEEEEEeCCceEEEcc----cC--c-----EEEEEecCCe Confidence 4444444567899998875432 1344557777776665321 11 1 2344442 Q ss_pred -eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce--eeecchhhhcCccHHHH Q lcl|NC_019527. 240 -REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF--LKTNMAQVLNGGEGGDV 316 (516) Q Consensus 240 -~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v--~k~~~~~~l~~~~~~~l 316 (516) ..+++..|+||.... +....+|+|.+..+...+..-..+......+..+....- +.+. ...++.++.+.+ T Consensus 149 ~~~~~~~eViHir~~~------~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~il~~~-~~~ls~e~~~~l 221 (340) T protein:vir:98 149 PHEFAPDTVFHLLEPD------INQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMYVT-DPAQSATDVESL 221 (340) T ss_pred EEEEccccEEEEcCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec-CCCCCHHHHHHH Confidence 367889999997532 234568999999999888877777777777777655432 2222 123555555566 Q ss_pred HHHHHHHHHhcCCcceEEEecC---CcceeEEecccCC----HHHHHHHHHHHHHhhhcCCceeeecccccccc--ccch Q lcl|NC_019527. 317 FDRVEMYVNMQSNLGLAVMDFD---SEDIVQVNTPLSG----LADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSE 387 (516) Q Consensus 317 ~~r~~~~~~~~sn~g~~~id~~---~e~~e~~~~~lsg----l~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge 387 (516) .+.++.. .+..|.+.+++... .+.++...++.+. +-++.....+.||++.+||-. |+|..+.+.. ++-+ T Consensus 222 k~~~~~~-~G~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~-llGi~~~~t~~~sn~e 299 (340) T protein:vir:98 222 RDAMRNS-KGLGNFKNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQ-LMGGKPENIGSLGDVE 299 (340) T ss_pred HHHHHHh-cCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHH-HhcccCCCCCccccHH Confidence 6666643 44556555444321 2334444444433 334566777899999999975 6677553221 2333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHH Q lcl|NC_019527. 388 GEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAK 441 (516) Q Consensus 388 ~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sek 441 (516) ...+.| -++.|.|.++++.++. .. . ..++ |+|++-.-++.. T Consensus 300 ~~~~~f-------~~~~l~Pl~~~iee~n-~~-L---~~e~-~rF~~~~l~~~d 340 (340) T protein:vir:98 300 KVAKVF-------VRNELSPLQDRFREVN-DW-L---GMEV-IRFKEYTLDNPE 340 (340) T ss_pred HHHHHH-------HHHHHHHHHHHHHHHH-hc-c---cccc-cccCccccccCC Confidence 333333 3456788888776532 21 1 1221 456543322222 No 207 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=98.94 E-value=4.6e-09 Score=66.30 Aligned_cols=442 Identities=11% Similarity=0.061 Sum_probs=191.4 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCC------Cccchh-cc Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPA------GTTPAV-AM 73 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~------~~~~~~-a~ 73 (516) |-+- .+|.+ ..-.+..+..........-.. | ++..-+ -....+ .+ T Consensus 1 ~~~~--~~~~~-------------------~~~~~~~~~~~~~~~~n~~~~-~------~~~e~~~~~~~~~i~~~i~~~ 52 (511) T protein:vir:78 1 MLKV--NEFET-------------------DTDLRGNINYLFNDEANVVYT-Y------DGTESDLLQNVNEVSKYIEHH 52 (511) T ss_pred Cccc--cchhh-------------------hhhhhhhhhhhhhhhhCCccc-c------cchhhhhhcCHHHHHHHHHHH Confidence 2111 11110 000000000011111110000 0 000000 000000 00 Q ss_pred cccccchhhhcccccCCccccccc-CcccHHHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHH Q lcl|NC_019527. 74 DSLCGPTYQFLNSAAGGLYAADIQ-PFPGYQNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELE 151 (516) Q Consensus 74 ds~~~~~~~~~~~~~~~~~~~~~~-~f~gy~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~ 151 (516) .......+.-...++.|.+.--.. ...........+ .+.+++.||+..+.-++.+.+.+++.++.. .+.|. T Consensus 53 ~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~-------~~~l~ 125 (511) T protein:vir:78 53 MDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDV-------LEAIE 125 (511) T ss_pred HHhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCchHH-------HHHHH Confidence 000000000011111111100000 000000000011 347789999999999999999998765432 35577 Q ss_pred HHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccC Q lcl|NC_019527. 152 EACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYK 231 (516) Q Consensus 152 ~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~ 231 (516) ..+++-++.....++.+...+||.|++++..+... .+ .+.+++|.++.|..-+.. ...+-++- T Consensus 126 ~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg---------------~~-~i~~~~p~~~~~v~dd~~-~~~~~~~v 188 (511) T protein:vir:78 126 AFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD---------------ET-RLYKSDAMSTFIIYDNTV-ERNSIAGV 188 (511) T ss_pred HHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCC---------------ce-EEEEEcccceEEEEcCCC-CCceEEEE Confidence 77888889999999999999999999988764321 01 133444444443321100 00111110 Q ss_pred cceeEEe-----------eeEe-ccceEEEecCC--------------------cchhhhhhccCCCCchHHHHHHHHHH Q lcl|NC_019527. 232 PSTWWVL-----------GREM-HASRLLTIITR--------------------PLPDMLKPAYNFSGISMSQLAQPYVE 279 (516) Q Consensus 232 P~~y~v~-----------g~~i-H~SRli~~~~~--------------------~~p~~~k~~~~~~G~S~le~~~~~l~ 279 (516) .+|.+. -..| .+.++.+|... .+|-. .-.++-+|.|.++.+.+.+. T Consensus 189 -r~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv-~~~n~~~g~gd~e~v~~liD 266 (511) T protein:vir:78 189 -RYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPIT-EFSNNERRKGDYEKVITLID 266 (511) T ss_pred -EEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceE-EecCCCCCCCchhhhHHHHH Confidence 111110 0011 12233332110 01111 11234579999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHH-HHHHHHHh--cCCcceEEEecCCcceeE--EecccCCHHH Q lcl|NC_019527. 280 NWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFD-RVEMYVNM--QSNLGLAVMDFDSEDIVQ--VNTPLSGLAD 354 (516) Q Consensus 280 ~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~-r~~~~~~~--~sn~g~~~id~~~e~~e~--~~~~lsgl~d 354 (516) +++.+....+.-+..++..++.+...............+ ++-..... ....+.-. .++.+++. .+.+.+++.. T Consensus 267 a~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~l~~~~~~~~~e~ 344 (511) T protein:vir:78 267 LYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRET--EGSVDGGYIYKQYDVQGTEA 344 (511) T ss_pred HHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccC--CCCcceeEEeecCCHHHHHH Confidence 999988888877776666655432211111111111100 00000000 00000001 11223333 4456678999 Q ss_pred HHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHh--CCCc--C--- Q lcl|NC_019527. 355 LQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDI--SSVQQSYYFSPLDTMLKVIQLSK--WGEI--D--- 425 (516) Q Consensus 355 ~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I--~~~Qe~~l~p~l~~l~~~l~~s~--~g~~--~--- 425 (516) .++.+.++|...+++|-.-. +. .+| |.||..=...|.... ...++..++..+++++++++.-. .+.. + T Consensus 345 ~~~~L~~~I~~~s~~P~~~~-~~-~~~-n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~ 421 (511) T protein:vir:78 345 YKDRLNSDIHMFTNTPNMKD-DN-FSG-TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF 421 (511) T ss_pred HHHHHHHHHHHHhCCccccc-cc-ccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc Confidence 99999999999999997433 22 122 456664222232222 23345667888888888765421 1221 2 Q ss_pred CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccccc----------- Q lcl|NC_019527. 426 DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEM----------- 494 (516) Q Consensus 426 ~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~----------- 494 (516) .++++.|++-...+.++.+++..+. .|++|.+.+.+.+.. .+..+..++....|. T Consensus 422 ~~i~~~f~~~~p~n~~e~~d~~~kl---------~G~iS~et~l~~l~~-----v~d~~~El~ri~~E~~~~~~~~~~~~ 487 (511) T protein:vir:78 422 NTVRYVYNRNLPKSLIEELKAYIDS---------GGKISQTTLMSLFSF-----FQDPELEVKKIEEDEKESIKKAQKGI 487 (511) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHH---------hccCChHHHHHhCCC-----CCCHHHHHHHHHHHHHHHHHHHhhcc Confidence 2689999999999999988765443 266776666554411 010001111111110 Q ss_pred ---chhcCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 495 ---FDDDGADPYMPDPDVLPGEEG 515 (516) Q Consensus 495 ---~~~e~~~~~~~~~~~~~~~e~ 515 (516) ++..+.+.+..+.++...++. T Consensus 488 ~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 488 YKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred ccCCCCCCCCCCCCCccCcccccC Confidence 000001111111111111111 No 208 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=98.94 E-value=4.6e-09 Score=66.30 Aligned_cols=442 Identities=11% Similarity=0.061 Sum_probs=191.4 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCC------Cccchh-cc Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPA------GTTPAV-AM 73 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~------~~~~~~-a~ 73 (516) |-+- .+|.+ ..-.+..+..........-.. | ++..-+ -....+ .+ T Consensus 1 ~~~~--~~~~~-------------------~~~~~~~~~~~~~~~~n~~~~-~------~~~e~~~~~~~~~i~~~i~~~ 52 (511) T protein:vir:96 1 MLKV--NEFET-------------------DTDLRGNINYLFNDEANVVYT-Y------DGTESDLLQNVNEVSKYIEHH 52 (511) T ss_pred Cccc--cchhh-------------------hhhhhhhhhhhhhhhhCCccc-c------cchhhhhhcCHHHHHHHHHHH Confidence 2111 11110 000000000011111110000 0 000000 000000 00 Q ss_pred cccccchhhhcccccCCccccccc-CcccHHHHHHHH-hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHH Q lcl|NC_019527. 74 DSLCGPTYQFLNSAAGGLYAADIQ-PFPGYQNLAALA-TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELE 151 (516) Q Consensus 74 ds~~~~~~~~~~~~~~~~~~~~~~-~f~gy~ll~~y~-~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~ 151 (516) .......+.-...++.|.+.--.. ...........+ .+.+++.||+..+.-++.+.+.+++.++.. .+.|. T Consensus 53 ~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~-------~~~l~ 125 (511) T protein:vir:96 53 MDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDV-------LEAIE 125 (511) T ss_pred HHhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCchHH-------HHHHH Confidence 000000000011111111100000 000000000011 347789999999999999999998765432 35577 Q ss_pred HHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccC Q lcl|NC_019527. 152 EACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYK 231 (516) Q Consensus 152 ~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~ 231 (516) ..+++-++.....++.+...+||.|++++..+... .+ .+.+++|.++.|..-+.. ...+-++- T Consensus 126 ~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg---------------~~-~i~~~~p~~~~~v~dd~~-~~~~~~~v 188 (511) T protein:vir:96 126 AFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDD---------------ET-RLYKSDAMSTFIIYDNTV-ERNSIAGV 188 (511) T ss_pred HHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCC---------------ce-EEEEEcccceEEEEcCCC-CCceEEEE Confidence 77888889999999999999999999988764321 01 133444444443321100 00111110 Q ss_pred cceeEEe-----------eeEe-ccceEEEecCC--------------------cchhhhhhccCCCCchHHHHHHHHHH Q lcl|NC_019527. 232 PSTWWVL-----------GREM-HASRLLTIITR--------------------PLPDMLKPAYNFSGISMSQLAQPYVE 279 (516) Q Consensus 232 P~~y~v~-----------g~~i-H~SRli~~~~~--------------------~~p~~~k~~~~~~G~S~le~~~~~l~ 279 (516) .+|.+. -..| .+.++.+|... .+|-. .-.++-+|.|.++.+.+.+. T Consensus 189 -r~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv-~~~n~~~g~gd~e~v~~liD 266 (511) T protein:vir:96 189 -RYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPIT-EFSNNERRKGDYEKVITLID 266 (511) T ss_pred -EEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceE-EecCCCCCCCchhhhHHHHH Confidence 111110 0011 12233332110 01111 11234579999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHH-HHHHHHHh--cCCcceEEEecCCcceeE--EecccCCHHH Q lcl|NC_019527. 280 NWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFD-RVEMYVNM--QSNLGLAVMDFDSEDIVQ--VNTPLSGLAD 354 (516) Q Consensus 280 ~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~-r~~~~~~~--~sn~g~~~id~~~e~~e~--~~~~lsgl~d 354 (516) +++.+....+.-+..++..++.+...............+ ++-..... ....+.-. .++.+++. .+.+.+++.. T Consensus 267 a~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~l~~~~~~~~~e~ 344 (511) T protein:vir:96 267 LYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRET--EGSVDGGYIYKQYDVQGTEA 344 (511) T ss_pred HHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccC--CCCcceeEEeecCCHHHHHH Confidence 999988888877776666655432211111111111100 00000000 00000001 11223333 4456678999 Q ss_pred HHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHh--CCCc--C--- Q lcl|NC_019527. 355 LQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDI--SSVQQSYYFSPLDTMLKVIQLSK--WGEI--D--- 425 (516) Q Consensus 355 ~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I--~~~Qe~~l~p~l~~l~~~l~~s~--~g~~--~--- 425 (516) .++.+.++|...+++|-.-. +. .+| |.||..=...|.... ...++..++..+++++++++.-. .+.. + T Consensus 345 ~~~~L~~~I~~~s~~P~~~~-~~-~~~-n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~ 421 (511) T protein:vir:96 345 YKDRLNSDIHMFTNTPNMKD-DN-FSG-TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF 421 (511) T ss_pred HHHHHHHHHHHHhCCccccc-cc-ccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc Confidence 99999999999999997433 22 122 456664222232222 23345667888888888765421 1221 2 Q ss_pred CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhcccccc----------- Q lcl|NC_019527. 426 DAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEM----------- 494 (516) Q Consensus 426 ~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~----------- 494 (516) .++++.|++-...+.++.+++..+. .|++|.+.+.+.+.. .+..+..++....|. T Consensus 422 ~~i~~~f~~~~p~n~~e~~d~~~kl---------~G~iS~et~l~~l~~-----v~d~~~El~ri~~E~~~~~~~~~~~~ 487 (511) T protein:vir:96 422 NTVRYVYNRNLPKSLIEELKAYIDS---------GGKISQTTLMSLFSF-----FQDPELEVKKIEEDEKESIKKAQKGI 487 (511) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHH---------hccCChHHHHHhCCC-----CCCHHHHHHHHHHHHHHHHHHHhhcc Confidence 2689999999999999988765443 266776666554411 010001111111110 Q ss_pred ---chhcCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 495 ---FDDDGADPYMPDPDVLPGEEG 515 (516) Q Consensus 495 ---~~~e~~~~~~~~~~~~~~~e~ 515 (516) ++..+.+.+..+.++...++. T Consensus 488 ~~~~~~~~~~~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 488 YKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred ccCCCCCCCCCCCCCccCcccccC Confidence 000001111111111111111 No 209 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=98.93 E-value=8.3e-09 Score=64.90 Aligned_cols=442 Identities=10% Similarity=0.077 Sum_probs=203.0 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) ||.||-|- .++ ...+.-... ..-++||..-+|. ..-++..+.+ T Consensus 10 lf~f~~~~-----------------de~-----------~~~~~~~~~-~~S~~~p~~~dGa--------~~I~~~~~~~ 52 (524) T protein:vir:10 10 FLKPWANE-----------------DEK-----------EYKQQINNN-LESVTAPKLDDGA--------REIETQEQNI 52 (524) T ss_pred Hhhhhhcc-----------------hhh-----------hhhhhhccC-CCccccCCCCCCc--------eeeccCcccc Confidence 33332220 000 000000111 1124445433332 2222221111 Q ss_pred h--hhcccccCCcccccccCcccHHHHHHHH---hCchhhhhhhhhhHHHhhC-----CCeeeeccccchhhhHH-HHHH Q lcl|NC_019527. 81 Y--QFLNSAAGGLYAADIQPFPGYQNLAALA---TRPEYRAFASTLSTELTRE-----GIEITSKDRTKAKEMAS-KIKE 149 (516) Q Consensus 81 ~--~~~~~~~~~~~~~~~~~f~gy~ll~~y~---~~~i~r~iVd~~aed~~r~-----~~~i~~~~~~~~~~~~~-~i~~ 149 (516) . +.+++.+++. +......++|...|+ .++++..+|+.+++||+-+ .+++...+-+-.+...+ ...+ T Consensus 53 ~~~~~~q~~y~~~---e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~ee 129 (524) T protein:vir:10 53 PYNALMQQMFGSN---EPEVKNTRELIDTYRNLMNNYEVDNAVQEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAE 129 (524) T ss_pred cchhhhhhhhhcc---cchhhhHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHH Confidence 0 0111112111 222234678888886 6999999999999999732 23333322221111111 1133 Q ss_pred HHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeecccccc--------- Q lcl|NC_019527. 150 LEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYN--------- 220 (516) Q Consensus 150 i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~--------- 220 (516) ++..++-|++...-.+.+|.--+.|.-+.-..|+..++.. +|+.++.+||..+..+... T Consensus 130 F~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~------------GI~Elr~lDPr~i~~vr~i~~~~~~~~~ 197 (524) T protein:vir:10 130 FSEVLNLLNFQRKGTDHFQRWYVDSRIFFHKIINPKKMKD------------GVQELRRLDPRQVQYIREIVTRMEDGVK 197 (524) T ss_pred HHHHHHHhccchhhhHHHhhheeeceEEEEEEeeCCCccc------------cceeeeeeCCccceeeeeecccCcccch Confidence 3444444567777777777666666644444455444322 2334444444444321111 Q ss_pred ---------ccccccccccC-cceeEE-eeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 221 ---------ALDPTAPDFYK-PSTWWV-LGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVS 289 (516) Q Consensus 221 ---------~~dp~s~~yg~-P~~y~v-~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~ 289 (516) ..+|..+.|.- +..|.- ++.+|+.+-+.+.+..-+ +.....=+|.|+.+...+.+.-....+ T Consensus 198 vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~~dAIvy~~SGL~-----d~~~~~i~syLhkAiKp~NQLkm~EDA-- 270 (524) T protein:vir:10 198 IVDGYREFFVYDTGHESYCADGRIYSAGTKVKIPRAAVVYAHSGLL-----DCCGKNIIGYLQRAIKPANQLKLMEDA-- 270 (524) T ss_pred hhcchhhheeecCCCcccccCcceecCCcceecchhheeeeccCcc-----cCCCCceeccchHhhHHHHhhHHHHhh-- Confidence 11222222111 111111 134677777666543322 222223357777777666655443332 Q ss_pred HHHHHhC----CceeeecchhhhcCccHHHHHHHHHHHHHhc------CCcceE--------------EE---ecCCcce Q lcl|NC_019527. 290 DLVDKFS----RTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQ------SNLGLA--------------VM---DFDSEDI 342 (516) Q Consensus 290 ~Ll~~~~----~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~------sn~g~~--------------~i---d~~~e~~ 342 (516) -++++.+ -.|+=+|+.++-.. ..++..+- .++.++ ..+|-+ +- ++.+-++ T Consensus 271 lVIYRitRAPeRRvFYIDVGnlPk~-KAeqYl~~--im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEI 347 (524) T protein:vir:10 271 MVIYRITRAPDRRVFYIDTGNMPSR-KAAAQMQH--IMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEV 347 (524) T ss_pred HHHHhhhccccceEEEEecCCCCch-hHHHHHHH--HHHhcCceeEEeccCCeeccchhhhhhHhhhcccccCCCCccce Confidence 3444443 23444444433211 11111110 111111 011100 00 0111234 Q ss_pred eEEe--cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccc--cch--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 343 VQVN--TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNA--SSE--GEIRSFYDDISSVQQSYYFSPLDTMLKVI 416 (516) Q Consensus 343 e~~~--~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glna--tge--~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l 416 (516) +++. -+|+.++|+ ..|+..+-.+.++|.++|-..+++|+|- ++| -|.-.|..+|.+.|..+ ..++..+++.= T Consensus 348 tTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EItRDEiKF~KFI~rLR~rF-s~lf~~~L~~q 425 (524) T protein:vir:10 348 DTMPGATGMSDMDDV-LYFRTALYRALRIPESRIPSESNSGVMFDAGTAITRDELKFAKWIRQLQNKF-EEIFLDPLKTN 425 (524) T ss_pred eeccccCCcChHHHH-HHHHHHHHHHhCCCchhccCCCCccccccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHh Confidence 4442 235556554 5899999999999999996555566543 332 14567999999998654 45555554421 Q ss_pred -HH------HhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH-cC-CCCHHHHHHHHHhhhccCCCCCChhh Q lcl|NC_019527. 417 -QL------SKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT-NS-VIDPSEARQQLSDDPDSGWDNIDGDL 487 (516) Q Consensus 417 -~~------s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~-~g-vi~~~e~r~~l~~~~~~~~~~~d~~~ 487 (516) .+ ..|-.+-+++.|+|..=.--+|-..+|+...+.++++.+-. .| .++.+-+++.+-... |+++ T Consensus 426 LilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~t-------Deei 498 (524) T protein:vir:10 426 LILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYISHQTAMKDFLQMT-------DEEI 498 (524) T ss_pred hhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccC-------HHHH Confidence 11 12333345788999988888899999999999888887643 22 567777876653321 2233 Q ss_pred hccccccchhcCCCCCCCCCCCCCCCC Q lcl|NC_019527. 488 EIVQPEMFDDDGADPYMPDPDVLPGEE 514 (516) Q Consensus 488 e~~~~e~~~~e~~~~~~~~~~~~~~~e 514 (516) ......+.+ |..++-.++|+++.+.= T Consensus 499 ~~~~k~I~~-E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 499 NQEAKQIEE-ESKEARFQNPDEEEEDF 524 (524) T ss_pred HHHHHHHHH-HhhcCCCCCCChhhhcC Confidence 222222211 11111111111111000 No 210 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=98.91 E-value=6.6e-09 Score=65.42 Aligned_cols=328 Identities=13% Similarity=0.115 Sum_probs=160.0 Q ss_pred chhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCc-cccccCCCCCCCccCCCccchhcccccccchhh Q lcl|NC_019527. 4 FDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDA-ATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQ 82 (516) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~ 82 (516) -.||| ...++ ++ . +......... .-.|--|++ | +|+.- ... T Consensus 1 ~~~~~-----~~~~~---~~-------------~--~~~~~~~~~~~~~~f~~p~~---v----------~~~~~--~~~ 42 (344) T protein:vir:20 1 MSKKK-----GKTPQ---PA-------------A--KTMTASGPKMEAFTFGEPVP---V----------LDRRD--ILD 42 (344) T ss_pred CCccc-----CCCCc---ch-------------h--hhhhccCCceEEEEcCCceE---e----------cCcch--hhh Confidence 11111 00000 00 0 0000001111 111222211 1 11110 011 Q ss_pred hcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHH Q lcl|NC_019527. 83 FLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGI 162 (516) Q Consensus 83 ~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~ 162 (516) +..-...+. +++ .++.-..|..+++.++....+|...+....+ ++. +. +. + -+.. T Consensus 43 ~~~~~~~~~-~~~-pp~~~~~la~~~~a~~~h~~~i~~k~n~l~~-~~~-Pn---~~-------l-----------t~~~ 97 (344) T protein:vir:20 43 YVECISNGR-WYE-PPVSFTGLAKSLRAAVHHSSPIYVKRNILAS-TFI-PH---PW-------L-----------SQQD 97 (344) T ss_pred hhhhhhcCc-eec-CCCCHHHHHHHHhhhhhhCccceehhhhHHH-hcc-CC---CC-------C-----------CHHH Confidence 111111121 111 1233345777777777666666555543322 221 00 00 0 1122 Q ss_pred HHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee--- Q lcl|NC_019527. 163 IQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG--- 239 (516) Q Consensus 163 l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g--- 239 (516) |+.++..-.++|.|++.+..+. .|.+..|.++.+.++.... +. + .+|++.+ T Consensus 98 f~~~~~d~ll~Gnay~~i~rn~---------------~G~~~~L~pl~~~~vr~~~----~~---~----~~~~~~~~~~ 151 (344) T protein:vir:20 98 FSRFVLDFLVFGNAFLEKRYST---------------TGKVIRLETSPAKYTRRGV----EE---D----VYWWVPSFNE 151 (344) T ss_pred HHHHHHHHHhcCCeEEEEEECC---------------CCcEEEEEEcCCceeEeee----cC---C----EEEEEccCCe Confidence 4333434467899998874421 2345567777666655321 11 0 1344432 Q ss_pred -eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce--eeecchhhhcCccHHHH Q lcl|NC_019527. 240 -REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF--LKTNMAQVLNGGEGGDV 316 (516) Q Consensus 240 -~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v--~k~~~~~~l~~~~~~~l 316 (516) ..+.+..|||+.... +....+|+|.+..+...+..-..+......+..+.+..- +.+. ...++.++.+.+ T Consensus 152 ~~~~~~~eIiHir~~~------~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~-d~~l~~e~~~~i 224 (344) T protein:vir:20 152 PTAFAPGSVFHLLEPD------INQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVT-DAVQDRNDIEML 224 (344) T ss_pred EEEEcCccEEEeCCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec-CcCCCHHHHHHH Confidence 457788999987532 234567999999999999888888877777777766532 3322 123444444556 Q ss_pred HHHHHHHHHhcCCcceEEEecC---CcceeEEecccCCH----HHHHHHHHHHHHhhhcCCceeeecccccccc--ccch Q lcl|NC_019527. 317 FDRVEMYVNMQSNLGLAVMDFD---SEDIVQVNTPLSGL----ADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSE 387 (516) Q Consensus 317 ~~r~~~~~~~~sn~g~~~id~~---~e~~e~~~~~lsgl----~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge 387 (516) .++++.. ...+|...+++... .+.++...++.+.. -++-....+.||++.+||-. |+|..+.+.. ++-+ T Consensus 225 k~~~~~~-~g~~n~r~l~l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~-llGi~~~~t~~~~n~e 302 (344) T protein:vir:20 225 RENMVKS-KGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQ-LMGGKPENVGSLGDIE 302 (344) T ss_pred HHHHHHh-cCCCCccceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHH-HhccCCCCCCccccHH Confidence 6666553 23444444455322 23345444444433 33456667889999999986 5576554221 2233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCH Q lcl|NC_019527. 388 GEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSA 440 (516) Q Consensus 388 ~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~se 440 (516) ...+.| -++.|.|.++++.++.- ..| .+.|+|.++.|..-++ T Consensus 303 ~~~~~f-------~~~~l~P~~~~~e~in~--~lg--~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 303 KVAKVF-------VRNELIPLQDRIREING--WLG--QEVIRFKNYSLDTDND 344 (344) T ss_pred HHHHHH-------HHHHHHHHHHHHHHHHH--hcC--CcccccCccccccCCC Confidence 333333 33557787777765332 223 2457888888877666 No 211 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=98.90 E-value=4.2e-09 Score=66.53 Aligned_cols=331 Identities=11% Similarity=0.074 Sum_probs=156.1 Q ss_pred chhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhh Q lcl|NC_019527. 4 FDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQF 83 (516) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~ 83 (516) -+||+ ++... ++ .+ .....+..++. |--|+ -.+|+. ....+ T Consensus 1 m~~~~--~~~~~--------------~~--~~-----~~~~~~~~~~~-~~~p~-------------~~~~~~--~~~~~ 41 (346) T protein:vir:10 1 MKKQL--RKNLT--------------QN--DR-----LQPQAQTEIFS-FGDPI-------------PVLDRA--DILNY 41 (346) T ss_pred CCccc--CCCCC--------------cc--cc-----cccccCeEEEe-cCCcc-------------eecCch--hHHHH Confidence 11110 00000 00 00 00000000000 10111 112221 01111 Q ss_pred cccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHH Q lcl|NC_019527. 84 LNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGII 163 (516) Q Consensus 84 ~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l 163 (516) ..-......|++ .++.-..|+.+++.++.-..++..-+....+ .+..+ ..+--+..| T Consensus 42 ~~~~~~~~~~~~-pp~~~~~la~l~~~~~~h~~~i~~k~n~l~~-l~~~P---------------------n~~~t~~~f 98 (346) T protein:vir:10 42 LECSAMYEKWYN-PPMSFDGLAKSLRSSTHHESAIITKANILLS-TCEVD---------------------SRYLSRRDL 98 (346) T ss_pred HHHhhcCCceEe-cCCCHHHHHHHHHhhhhcchhhhhhhhhHHH-HHhCC---------------------CCCCCHHHH Confidence 111100111121 1333355777777766554444433322111 11111 111123445 Q ss_pred HHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee--eE Q lcl|NC_019527. 164 QKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG--RE 241 (516) Q Consensus 164 ~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g--~~ 241 (516) .+++..-.++|.|++.+..+. .|.+..|.++.+.+|..... .|.. ++ ..+.+.| .. T Consensus 99 ~~~~~d~ll~Gnay~~i~r~~---------------~G~~~~L~pl~~~~v~~~~~--~~~~---~~--~~~~~~g~~~~ 156 (346) T protein:vir:10 99 SSFVKDYLVFGNAYFEVVRNR---------------LGQVQRIESPLAKYVRKGLE--AGQF---YY--VPQRFDHQEHE 156 (346) T ss_pred HHHHHHHHhcCCeEEEEEEcC---------------CCcEEEEEEecCCceEEEEc--CCeE---EE--EEEccCCeEEE Confidence 555555567999998875422 23455677888877764221 1111 01 1122233 35 Q ss_pred eccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce--eeecchhhhcCccHHHHHHH Q lcl|NC_019527. 242 MHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF--LKTNMAQVLNGGEGGDVFDR 319 (516) Q Consensus 242 iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v--~k~~~~~~l~~~~~~~l~~r 319 (516) +-++.|||+.... +....+|+|.+..+...+..-..+......+..+....- +++. ...++.++.+.+.+. T Consensus 157 ~~~~dIih~r~~~------~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~-d~~l~~e~~~~i~~~ 229 (346) T protein:vir:10 157 FAKGSIYHLLEPD------INQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMS-DASQKQEDVENIRQQ 229 (346) T ss_pred EecccEEEecCCC------CCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC-CCCCCHHHHHHHHHH Confidence 6778999997543 223457999999999999988888888888877765432 2221 123444444556666 Q ss_pred HHHHHHhcCCcceEE-EecC--CcceeEEecccCCHH----HHHHHHHHHHHhhhcCCceeeeccccccc--cccchHHH Q lcl|NC_019527. 320 VEMYVNMQSNLGLAV-MDFD--SEDIVQVNTPLSGLA----DLQSQSQEHMCSVSKIPAIKLTGISPSGL--NASSEGEI 390 (516) Q Consensus 320 ~~~~~~~~sn~g~~~-id~~--~e~~e~~~~~lsgl~----d~~~~~~~~iaaas~IP~t~L~G~sp~Gl--natge~D~ 390 (516) ++.. ....|.+..+ +..+ .+.++....+.+.-+ ++.....++||++.+||-. |+|..+++- .++-+... T Consensus 230 ~~~~-~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~-llG~~~~~~~~~s~~e~~~ 307 (346) T protein:vir:10 230 LKQS-KGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQ-LMGIIPNNTGGFGNVADAA 307 (346) T ss_pred HHHh-cCccccCceeEecCCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhcccCCCCCCcccHHHHH Confidence 6543 3445555444 4322 233444444433332 3455667889999999986 558765432 13334344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCC--CCCCCH Q lcl|NC_019527. 391 RSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKS--LWQTSA 440 (516) Q Consensus 391 ~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~p--L~~~se 440 (516) +.|+ +..|.|.+++|.++.- .+| .+ .|+|++ |..-|| T Consensus 308 ~~f~-------~~~l~P~~~~iee~n~--~L~---~e-~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 308 EVFF-------ITEIEPLQERLKEFNQ--WLG---QE-VIKFKPSKLLQRTQ 346 (346) T ss_pred HHHH-------HHHHHHHHHHHHHHHh--hcc---cc-eeeechhhhcccCC Confidence 4443 4567888887765332 122 22 356665 555555 No 212 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=98.89 E-value=1.7e-08 Score=63.18 Aligned_cols=447 Identities=12% Similarity=0.077 Sum_probs=202.2 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccch Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPT 80 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~ 80 (516) ||.||-|---+ ...... .....-++||..-+|... -..+.. ..+.+. T Consensus 8 lf~f~~~~de~--------------------------~~~~~~---~~~~~S~~~p~~dDGa~~-i~~~~~---~~~~~~ 54 (523) T protein:vir:68 8 LFAPWAKMDER--------------------------DYKDQE---KENLESITSPKLDDGAKE-YEVSEN---EAQQTY 54 (523) T ss_pred hhhhhhhhhhh--------------------------hhhhhh---hccCCCccccCCCCccee-eecccc---cccccc Confidence 44443321000 000000 111122455554444200 000000 000001 Q ss_pred hhhcccccCCcccccccCcccHHHHHHHH---hCchhhhhhhhhhHHHhhCC-----CeeeeccccchhhhHHH-HHHHH Q lcl|NC_019527. 81 YQFLNSAAGGLYAADIQPFPGYQNLAALA---TRPEYRAFASTLSTELTREG-----IEITSKDRTKAKEMASK-IKELE 151 (516) Q Consensus 81 ~~~~~~~~~~~~~~~~~~f~gy~ll~~y~---~~~i~r~iVd~~aed~~r~~-----~~i~~~~~~~~~~~~~~-i~~i~ 151 (516) .+..+..+++. +..-...++|...|+ .++++..+|+.+++||+-+- +.+...+.+-.+...++ ..+++ T Consensus 55 ~~~~q~~y~~~---e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~ 131 (523) T protein:vir:68 55 NAMFQRMFGSQ---EPGLKSTRELIDTYRNLMTNYEVDNAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFN 131 (523) T ss_pred chhhhhhhhcc---ccccchHHHHHHHHHHHhhccchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHH Confidence 11111112111 111235678888886 69999999999999997432 33333332222211111 23344 Q ss_pred HHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeecccccc---------cc Q lcl|NC_019527. 152 EACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYN---------AL 222 (516) Q Consensus 152 ~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~---------~~ 222 (516) ..++-|++...-.+.+|.--+.|.-+.-..++..++.. +|+.++.+||..|..+..- .. T Consensus 132 ~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~------------GI~Elr~lDPr~i~~vr~i~~~~~~g~~vi 199 (523) T protein:vir:68 132 EVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKE------------GIKELRRLDPRQVQYVREVITTTEAGVKIV 199 (523) T ss_pred HHHHHhccchhhhHHHHhheeeeEEEEEEEeeCCCccc------------cceeeeeeCCcceeEEEeecCCCCcchhhh Confidence 44445577777777777666767665555566554332 3444555555555332110 00 Q ss_pred cccccc-ccCcce--eEEe--------eeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 223 DPTAPD-FYKPST--WWVL--------GREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDL 291 (516) Q Consensus 223 dp~s~~-yg~P~~--y~v~--------g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~L 291 (516) +-.... .|.|.. |..+ +.+|+.+=+.+.+..- .+.....=+|.|+++...+.+.-....+ -+ T Consensus 200 ~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL-----~d~~~~~i~gyLhkAiKp~NQLkmlEDA--lV 272 (523) T protein:vir:68 200 KGYKEYFIYDTSHESYACDGRIYEAGTKIKIPKAAIVYAHSGL-----VDCCGKNIIGYLHRAIKPANQLKLLEDA--VV 272 (523) T ss_pred hhhhhheeeccccccccccccccCCCcceecchhheeeeeccc-----eeCCCCceeccchhhhHHHHhhHHHHhh--HH Confidence 000000 111111 1122 2356655544333221 1222223356777776666655433332 34 Q ss_pred HHHhC----CceeeecchhhhcCccHHHHHH-HHHHHHH---hcCCcceE--------------EE---ecCCcceeEEe Q lcl|NC_019527. 292 VDKFS----RTFLKTNMAQVLNGGEGGDVFD-RVEMYVN---MQSNLGLA--------------VM---DFDSEDIVQVN 346 (516) Q Consensus 292 l~~~~----~~v~k~~~~~~l~~~~~~~l~~-r~~~~~~---~~sn~g~~--------------~i---d~~~e~~e~~~ 346 (516) +++.+ -.|+=+|+.++-.. ..++..+ -+..+.. +-..+|-+ +- ++.+-+++++. T Consensus 273 IYRitRAPeRRvFYIDvGnlPk~-KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLp 351 (523) T protein:vir:68 273 IYRITRAPDRRVWYVDTGNMPSR-KAAEHMQHVMNTMKNRIAYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLP 351 (523) T ss_pred HHhhhccccceEEEEecCCCCch-hHHHHHHHHHHhhcceeEEeccCCeeccchhhhhhHhhhcccccCCCcccceeecc Confidence 44433 24454454433211 1111111 1111000 00111110 00 01112344442 Q ss_pred --cccCCHHHHHHHHHHHHHhhhcCCceeeecccccccc--ccch--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH- Q lcl|NC_019527. 347 --TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSE--GEIRSFYDDISSVQQSYYFSPLDTMLKVI-QL- 418 (516) Q Consensus 347 --~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge--~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l-~~- 418 (516) -+|+.++|+ ..|+..+-.+.++|.++|-+++ +|+| .++| -|.-.|..+|.+.|..+ ..++..+++.= .+ T Consensus 352 Ggqnlgem~DV-~YF~kkLy~aLnVP~sRl~~~~-~~f~~Gr~~EItRDEikF~KFI~rLR~rF-s~lf~~~Lk~qLilK 428 (523) T protein:vir:68 352 GADNTGNMEDV-RWFRNALYMALRIPITRIPSDQ-GGIQFDAGTSITRDELSFGKFIRELQHKF-EEIFLDPLKTNLILK 428 (523) T ss_pred ccCCcChHHHH-HHHHHHHHHHhCCcceeecCCC-cceecccccchhHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhhc Confidence 245556554 5899999999999999996653 5565 3332 15567999999998654 45555554421 11 Q ss_pred -----HhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH-cC-CCCHHHHHHHHHhhhccCCCCCChhhhccc Q lcl|NC_019527. 419 -----SKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT-NS-VIDPSEARQQLSDDPDSGWDNIDGDLEIVQ 491 (516) Q Consensus 419 -----s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~-~g-vi~~~e~r~~l~~~~~~~~~~~d~~~e~~~ 491 (516) ..|-.+-+++.|+|..=.--+|-..+|+...+.++++.+-. .| .++.+-+++.+-... |++++... T Consensus 429 giit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~t-------Deei~~~~ 501 (523) T protein:vir:68 429 GIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKYISHRTAMKDILQMS-------DEEIEQEA 501 (523) T ss_pred cCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccC-------HHHHHHHH Confidence 12333345788999988888899999999999888887643 22 567777876653321 22332222 Q ss_pred cccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 492 PEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 492 ~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) ..+.+ |..++-.++|+++.. .= T Consensus 502 kqI~~-E~k~~~~~~p~~e~~--~f 523 (523) T protein:vir:68 502 KQIEE-ESKEARFQDPDQEQE--DF 523 (523) T ss_pred HHHHH-HhhcCCCCCCchhhh--cC Confidence 22211 111111111111100 00 No 213 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=98.86 E-value=2.5e-08 Score=62.31 Aligned_cols=419 Identities=10% Similarity=0.032 Sum_probs=191.8 Q ss_pred cCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcc Q lcl|NC_019527. 21 ARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFP 100 (516) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~ 100 (516) .+-....--+... ...+ ...+...-.......+.-+..|.-|.. ++. +.+.+ -.| .+. T Consensus 1 ~~~~~d~~g~p~~-~~~~-~~~~~~~~~~~~~~~~~~~~~gltp~~----l~~------il~~a---~~g-------d~~ 58 (526) T protein:vir:79 1 MAQIVDVYGNPIR-PQQL-REPQTSRLAGLAKEFAQHPAKGLTPAK----LAR------ILVEA---EQG-------NLQ 58 (526) T ss_pred CCeeeCCCCCccC-cccc-chhhhhhhhhhhhhcccCCCCCcCHHH----HHH------HHHHh---hCC-------CHH Confidence 1111110000000 0000 000000000000000000112221111 110 11111 000 011 Q ss_pred -cHHHHHHH-HhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcC-hhHHHHHHHHhcccceeeE Q lcl|NC_019527. 101 -GYQNLAAL-ATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYG-VMGIIQKAAEHDCFFGRGQ 177 (516) Q Consensus 101 -gy~ll~~y-~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~-~~~~l~ea~~~~rlyG~a~ 177 (516) .++|.... ....-+..++.+.-.-.+..-|.|....++..++ ....+.+++.+.++. +.+.+. -+-.+.+||.|+ T Consensus 59 ~~~~L~edm~e~D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~~-~~~a~~v~~~l~~~~~~~~~i~-~~ldA~~~G~s~ 136 (526) T protein:vir:79 59 AQAELFMDMEERDAHLFAEMSKRKRAILGLDWAVEPPRNASAAE-KADADYLHELLLDLEGLEDLLL-DALDGIGHGYSC 136 (526) T ss_pred HHHHHHHHHHhhChHHHHHHHHHHHHHhCCCceEecCCCCChHH-HHHHHHHHHHHhcccCHHHHHH-HHHhhhhhccee Confidence 12333332 3678888899998888888788886543322211 123355677777764 444444 444599999998 Q ss_pred EEEE--ecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEeeeEeccceEEEecCCcc Q lcl|NC_019527. 178 ISIN--IKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPL 255 (516) Q Consensus 178 i~i~--i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~ 255 (516) .=+. .+++.+ .++.|...++.|..- |+..-....-..-...|..+++-+.+++.... T Consensus 137 ~Ei~w~~~~g~~--------------~~~~l~~r~~~~F~~------~~~~~~~l~~~~~~~~g~~l~~~k~iv~~~~~- 195 (526) T protein:vir:79 137 IELEWALQGREW--------------MPLAFHHRPQSWFQL------NPEDQNELRLRDNSPAGEALQPFGWIIHRPRA- 195 (526) T ss_pred EEEEEeecCCce--------------eEEEeeeecccceEe------ccCCCcEEEecCCCCCceeecCCceEEEeecC- Confidence 6553 332211 123344444544331 11000000000011235567777677665432 Q ss_pred hhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCcee--eecchhhhcCccHHHHHHHHHHHHHhcCCcceE Q lcl|NC_019527. 256 PDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFL--KTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLA 333 (516) Q Consensus 256 p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~--k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~ 333 (516) ...+.+|.+++..|+-...--..+...-+.++.++++++. |++-. ...++...-++.+..+.++ +.. T Consensus 196 -----~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~~-----a~~~ek~~L~~av~~i~~d-a~~ 264 (526) T protein:vir:79 196 -----RSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPG-----TADEEKATLLRAVTGLGHA-AAG 264 (526) T ss_pred -----CcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCC-----CCHHHHHHHHHHHHHHhcC-cEE Confidence 3345679999999988776666677777789999997654 44311 1223334444555555443 444 Q ss_pred EEecCCcceeEEecccCCH---HHHHHHHHHHHHhhhcCCceeeeccc-c-------ccccccchHHHHHHHHHHHHHHH Q lcl|NC_019527. 334 VMDFDSEDIVQVNTPLSGL---ADLQSQSQEHMCSVSKIPAIKLTGIS-P-------SGLNASSEGEIRSFYDDISSVQQ 402 (516) Q Consensus 334 ~id~~~e~~e~~~~~lsgl---~d~~~~~~~~iaaas~IP~t~L~G~s-p-------~Glnatge~D~~~yyd~I~~~Qe 402 (516) ++. ++.+++.++.+=++. ..+++..-..|+-+ ++|++ + +|-+|-|+.-.....+.+++-.. T Consensus 265 iiP-~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~-------iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~ 336 (526) T protein:vir:79 265 IIP-ETMAIDFQQAAQGSSEPFLAMMRQSEDAISKA-------VLGGTLTSTTSQSGGGAFALGQVHNEVRHDILASDAR 336 (526) T ss_pred Eec-CCceeEEeecCCCCHHHHHHHHHHHHHHHHHH-------HhhhhhccccccCcchhhhhHHHHHHHHHHHHHHHHH Confidence 554 457899988654443 34556566666543 23433 1 12333344445556666666665 Q ss_pred HHHHHHHHHHHHHHHHHhCCCc-CC--cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCC-CCHHHHHHHHHhhhcc Q lcl|NC_019527. 403 SYYFSPLDTMLKVIQLSKWGEI-DD--AITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSV-IDPSEARQQLSDDPDS 478 (516) Q Consensus 403 ~~l~p~l~~l~~~l~~s~~g~~-~~--d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gv-i~~~e~r~~l~~~~~~ 478 (516) ....-+.+.|+..++.-.||.. +. --+|+|..--..+- +..|++++.+++.|+ |+.+.+++.++ T Consensus 337 ~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl-------~~~a~~~~~L~~~G~~i~~~~i~e~~g----- 404 (526) T protein:vir:79 337 QLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADI-------TSMAQSIPALVNVGLEIPSAWVYDKLG----- 404 (526) T ss_pred HHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccH-------HHHHHHHHHHHhCCCcCCHHHHHHHhC----- Confidence 4444444568887777666643 21 13455544322222 346778888999997 89999999873 Q ss_pred CCCCCChhhhccccccchhc--CCCCCCCC-----CCCCCCCCCC Q lcl|NC_019527. 479 GWDNIDGDLEIVQPEMFDDD--GADPYMPD-----PDVLPGEEGS 516 (516) Q Consensus 479 ~~~~~d~~~e~~~~e~~~~e--~~~~~~~~-----~~~~~~~e~t 516 (516) ++...++.+...+...... ...+.... .........+ T Consensus 405 -ip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (526) T protein:vir:79 405 -IPQPAKNEPVLRPAAQPAILSRQHGQRVAALATIVGPRYGDQQA 448 (526) T ss_pred -CCCCCCchhhccccCCccccccccccccccccccccccCchhhH Confidence 3222111111111100000 00000000 0000011111 No 214 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=98.85 E-value=1.5e-08 Score=63.43 Aligned_cols=328 Identities=12% Similarity=0.102 Sum_probs=157.7 Q ss_pred cchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCcc-ccccCCCCCCCccCCCccchhcccccccchh Q lcl|NC_019527. 3 PFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAA-TKWAPPQLMPGVVPAGTTPAVAMDSLCGPTY 81 (516) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~ 81 (516) =+||| +. ..+. .. +.....+.... --|--|++ | +|+. ... T Consensus 1 m~~~~------~~------------~~~~--~~----~~~~~~~~~~~~~~f~~p~~---v----------~~~~--~~~ 41 (344) T protein:vir:60 1 MSKKK------GK------------TLQP--AA----KKMTASAPKMEAFTFGEPVP---V----------LDRR--DIL 41 (344) T ss_pred CCccc------CC------------CCCc--hH----HhhcCCcCcEEEEEcCCcee---e----------cCCc--chh Confidence 11111 00 0000 00 00000011110 11222211 1 1111 011 Q ss_pred hhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhH Q lcl|NC_019527. 82 QFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMG 161 (516) Q Consensus 82 ~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~ 161 (516) .+..-...+. +++ .++.-..|..+++.++....+|..-+....+ ++. .. + +--+. T Consensus 42 ~~~~~~~~~~-~~~-pp~~~~~la~~~~a~~~h~~~i~~k~n~l~~-~~~--Pn--~------------------~~t~~ 96 (344) T protein:vir:60 42 DYVECISNGR-WYE-PPISFTGLAKSLRAAVHHSSPIYVKRNILAS-TFI--PH--P------------------WLSQQ 96 (344) T ss_pred HHHHhhhcCc-ccc-CCCCHHHHHHHHHhhhhhccchhhhhhHHHh-hcc--CC--C------------------CCCHH Confidence 2222122221 111 1232344666666666555555554443322 221 00 0 00112 Q ss_pred HHHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee-- Q lcl|NC_019527. 162 IIQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG-- 239 (516) Q Consensus 162 ~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g-- 239 (516) .|+.++..-.+||.|++.+..++ .|.+..|.++.+.+|..... .| .+|+|.+ T Consensus 97 ~f~~~~~d~ll~Gnay~~i~rn~---------------~G~~~~L~~l~~~~vr~~~~--~~---------~~~~v~~~~ 150 (344) T protein:vir:60 97 DFSRFVLDFLVFGNAFLEKRYST---------------TGKVIRLETSPAKYTRRGVE--ED---------VYWWVPSFN 150 (344) T ss_pred HHHHHHHHHHhcCCeEEEEEECC---------------CCcEEEEEEcCcceEEEeec--CC---------eEEEEccCC Confidence 24433434467899998875432 13345566776666653211 11 1344432 Q ss_pred --eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHH Q lcl|NC_019527. 240 --REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGD 315 (516) Q Consensus 240 --~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~ 315 (516) ..+.+..|||+.... +....+|+|.++.+...+..-..+......+..+.... ++++. ...++.++.+. T Consensus 151 ~~~~~~~~eIiHir~~~------~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~-~~~ls~e~~~~ 223 (344) T protein:vir:60 151 EPTAFAPGSVFHLLEPD------INQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVT-DAVQDRNDIEM 223 (344) T ss_pred eEEEEcCccEEEEcCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec-CcCCCHHHHHH Confidence 356788899987532 23456799999999999988888887777777776653 33332 12355544555 Q ss_pred HHHHHHHHHHhcCCcceEEEecC---CcceeEEecccCCH----HHHHHHHHHHHHhhhcCCceeeecccccccc--ccc Q lcl|NC_019527. 316 VFDRVEMYVNMQSNLGLAVMDFD---SEDIVQVNTPLSGL----ADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASS 386 (516) Q Consensus 316 l~~r~~~~~~~~sn~g~~~id~~---~e~~e~~~~~lsgl----~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atg 386 (516) +.+.++... ..+|...+++... .+.++...++.+.- -++.....+.||++.+||-. |+|..+.+.. ++. T Consensus 224 ik~~~~~~~-g~~~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~-llGi~~~~t~~~~n~ 301 (344) T protein:vir:60 224 LRENMVKSK-GRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQ-LMGGKPENVGSLGDI 301 (344) T ss_pred HHHHHHHhc-CCCCCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHH-HhcccCCCCCccccH Confidence 666665532 3444444555322 23344444444433 33455777899999999986 5576554321 233 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCH Q lcl|NC_019527. 387 EGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSA 440 (516) Q Consensus 387 e~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~se 440 (516) +...+.| .++.|.|+++++.++.. ..|. +.|+|.+..|..-+. T Consensus 302 e~~~~~f-------~~~~L~Pl~~~~e~ln~--~lg~--~~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 302 EKVAKVF-------VRNELIPLQDRIREING--WLGQ--EVIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHH-------HHHHHHHHHHHHHHHHH--hcCC--cccccCccccCCCCC Confidence 3333333 34557888777765332 1221 346677766665555 No 215 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=98.83 E-value=3.4e-08 Score=61.53 Aligned_cols=420 Identities=11% Similarity=0.076 Sum_probs=189.7 Q ss_pred ccccccCCCCCCCccCCCccchhccc----------ccccchhhhcccccCCcccc--cccCcccHHHHHHHH-hCchhh Q lcl|NC_019527. 49 AATKWAPPQLMPGVVPAGTTPAVAMD----------SLCGPTYQFLNSAAGGLYAA--DIQPFPGYQNLAALA-TRPEYR 115 (516) Q Consensus 49 ~~~~~~~~~~~~gv~~~~~~~~~a~d----------s~~~~~~~~~~~~~~~~~~~--~~~~f~gy~ll~~y~-~~~i~r 115 (516) -.-.+....-...+.|++.. .++-+ ........-...++.|-+.- .......-......+ .+++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~ 79 (506) T protein:vir:94 1 MDYDLTEHKQANLIYQESLE-NLTPNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAK 79 (506) T ss_pred CCcchhhhhcceeecccchh-cCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccCCcceeecchHH Confidence 00001111112222333211 11100 00000001111111111100 000000000000111 357899 Q ss_pred hhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCccccc Q lcl|NC_019527. 116 AFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPLILDP 195 (516) Q Consensus 116 ~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~ 195 (516) .||+..+.-++-+++++++.++.. .+.|+..++.-++...+.++.+...+||.|++++.++... . T Consensus 80 ~Iv~~~~~~l~G~p~~~~~~d~~~-------~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~--~------ 144 (506) T protein:vir:94 80 YIADFQTSYSVGNPINVKLPDDGS-------NSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDN--E------ 144 (506) T ss_pred HHHHHhhhhhcccCceeecCcchH-------HHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCC--e------ Confidence 999999999999999998876543 2456777777788999999999999999999988775321 0 Q ss_pred ccccccceeeEEeecceeeccccccccccccccccCcceeEEe------------eeEe-ccceEEEecCCcc------- Q lcl|NC_019527. 196 RTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL------------GREM-HASRLLTIITRPL------- 255 (516) Q Consensus 196 ~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~------------g~~i-H~SRli~~~~~~~------- 255 (516) + .+.+++|.++.|..-+..+ ..+-++ ..+|.+. -..+ -..++.++.+... T Consensus 145 -------~-~i~~~~p~~~~~v~dd~~~-~~~~~~-v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~ 214 (506) T protein:vir:94 145 -------E-HLAKLDPLDTFVIYSTDVD-PKPIMA-VRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMGKMQVD 214 (506) T ss_pred -------e-EEEEEcccceEEEecCCCC-CceEEE-EEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCccceecc Confidence 1 1344555554442211000 011111 0011100 0001 1112222222111 Q ss_pred --------hhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeec-----------chhhhcCccHH-- Q lcl|NC_019527. 256 --------PDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTN-----------MAQVLNGGEGG-- 314 (516) Q Consensus 256 --------p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~-----------~~~~l~~~~~~-- 314 (516) |-.. -.++-.|.|.++.+.+.+.+++.+....+.-+..++..++.+. +...+...+.. T Consensus 215 ~~~~~g~vPvv~-~~n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~ 293 (506) T protein:vir:94 215 TTKPITTFPVVE-FKNSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAM 293 (506) T ss_pred ccccCCccceEE-ecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccc Confidence 1110 1122347888888888888888877666655544443322211 01111110000 Q ss_pred --HHHHHHHHHHHhcCCcceEEEec--------CCcceeEE--ecccCCHHHHHHHHHHHHHhhhcCCceeeeccccccc Q lcl|NC_019527. 315 --DVFDRVEMYVNMQSNLGLAVMDF--------DSEDIVQV--NTPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGL 382 (516) Q Consensus 315 --~l~~r~~~~~~~~sn~g~~~id~--------~~e~~e~~--~~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Gl 382 (516) ............+.+. ++.+.. .+.+++.+ +.+.+++...++.+.+.|...+++|-.-. + +- +- T Consensus 294 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~-~~-~~ 369 (506) T protein:vir:94 294 AKLAKDKLELIKEMKDAN-MLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTD-E-NF-AS 369 (506) T ss_pred cccccchhHHHhhhhhcC-eeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCcccccc-c-cc-cc Confidence 0011122222222222 222211 11234433 45677889999999999999999996322 1 11 22 Q ss_pred cccchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHh---CCCcC---CcceEEeCCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_019527. 383 NASSEGEIRSFYDDIS--SVQQSYYFSPLDTMLKVIQLSK---WGEID---DAITFKFKSLWQTSAKEESEIRFNKAQEA 454 (516) Q Consensus 383 natge~D~~~yyd~I~--~~Qe~~l~p~l~~l~~~l~~s~---~g~~~---~d~~~~f~pL~~~sekEkAei~~~~a~a~ 454 (516) |.||..=...|..... ...+..++..++++++++.... .+..+ .+++|.|++-...+++|.|++..+. T Consensus 370 n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl---- 445 (506) T protein:vir:94 370 NSSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQA---- 445 (506) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHH---- Confidence 4566533333333222 3345567888888887765421 22222 2578999999999999998865542 Q ss_pred HHHHHcCCCCHHHHHHHHHhhhccCCCCCC---hhhhccccccc-hhcCC-----CCCCCCCCCCCCCCCC Q lcl|NC_019527. 455 QIYITNSVIDPSEARQQLSDDPDSGWDNID---GDLEIVQPEMF-DDDGA-----DPYMPDPDVLPGEEGS 516 (516) Q Consensus 455 ~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d---~~~e~~~~e~~-~~e~~-----~~~~~~~~~~~~~e~t 516 (516) .|+||.+.++..+ +.++ .+.+....|.. .++.. .....++++..+++.- T Consensus 446 -----~g~iS~et~~~~l--------p~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (506) T protein:vir:94 446 -----GATLPQKYLYQQL--------PGVTNPQDIVDMMKEQSANGDYSFDQNGVISNDGQTNTTATQTDE 503 (506) T ss_pred -----hccCChHHHHHhC--------CCCCCHHHHHHHHHHHHHHHhhcchhhcCCCcccCcccccccccc Confidence 4788888877665 2221 11111111110 01111 1111111111111111 No 216 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=98.82 E-value=3.7e-08 Score=61.33 Aligned_cols=444 Identities=11% Similarity=0.111 Sum_probs=206.5 Q ss_pred CcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchh Q lcl|NC_019527. 2 WPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTY 81 (516) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~ 81 (516) |.||+|- +++. .+.+......-++||..-+|..- + .+..+.. T Consensus 1 ~~~w~~~-----------------de~~------------~~~~~~~~~~S~~~p~~~DGa~~------i--~~~~~~~- 42 (511) T protein:vir:56 1 MKFWTKE-----------------EEQD------------IQKIEKNPVRSFSAPDNVDGAKE------I--HTNLLAP- 42 (511) T ss_pred CCCccch-----------------hhhh------------hhhhccCCcccccCCCCCCCceE------E--ecccccc- Confidence 5554442 0100 01111122223566655555310 1 1110000 Q ss_pred hhcccccCCcccccccCcccHHHHHHHH---hCchhhhhhhhhhHHHhh-----CCCeeeeccccchhhhHH-HHHHHHH Q lcl|NC_019527. 82 QFLNSAAGGLYAADIQPFPGYQNLAALA---TRPEYRAFASTLSTELTR-----EGIEITSKDRTKAKEMAS-KIKELEE 152 (516) Q Consensus 82 ~~~~~~~~~~~~~~~~~f~gy~ll~~y~---~~~i~r~iVd~~aed~~r-----~~~~i~~~~~~~~~~~~~-~i~~i~~ 152 (516) .+++.+.+..-..+ ..+..-+|...|+ .++++..+|+.+++||+- ..+++...+-+-.+...+ ...+++. T Consensus 43 ~~~g~~~~~~~~~~-~~~~~~eLI~~YR~ma~~pEvd~Av~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~ 121 (511) T protein:vir:56 43 QLGHAIIPSDAQSE-GTIPVKELIKSYRALAEYHEVDDAIQEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDR 121 (511) T ss_pred eecceecccccccc-CccchHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHH Confidence 01111111111111 1122247877776 689999999999999973 223333322221111111 1233344 Q ss_pred HHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcccCc-cccccccccc------ceeeEEeecceeeccccccccccc Q lcl|NC_019527. 153 ACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVSVPL-ILDPRTIKKG------SLTGFSNIEPMWTSPSAYNALDPT 225 (516) Q Consensus 153 ~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl-~ld~~~I~~g------~l~~l~v~d~~~v~p~~~~~~dp~ 225 (516) .++-|++...-.+.+|.--+.|.-+.-..++..+--..| .|||..|++. .+.+..++... ..+-..+|. T Consensus 122 Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~k~GI~eLr~lDPr~i~~vr~i~~~~~~~~~v~~~~----~ey~~Y~~~ 197 (511) T protein:vir:56 122 VVSLLQMRKHGYKWFRKWYVDSRIYFHKILDKDNNIIELRPLNPMKMELVREIQKETIDGVEVVKGT----LEYYVYKQS 197 (511) T ss_pred HHHHhccchhhhHHHhhhhhcceEEEEEEeccccceeehhhcCcccchhhhhhhcccccccccccce----eeeeEecCC Confidence 444456677777777666666664443445543321111 2444433221 01111111100 001111222 Q ss_pred cccccCcceeEE-----eeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhC---- Q lcl|NC_019527. 226 APDFYKPSTWWV-----LGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFS---- 296 (516) Q Consensus 226 s~~yg~P~~y~v-----~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~---- 296 (516) + |..|..+.. ++.+|+.+-+.+.+..-+ + -...++..+|.|+.+...+.+.-....+ .++++.+ T Consensus 198 ~--~~~~~~~~~~~~~~~~vkI~~daI~y~hSGL~-d--~~~~~g~i~syLhkAiKp~NQLkm~EDA--lVIYRitRAPe 270 (511) T protein:vir:56 198 D--YKMPSWMSATNRAQTSFRIPKDAIVFAHSGLM-R--GCADDPYIIGYLDRAIKPANQLKMLEDA--LVIYRLARAPE 270 (511) T ss_pred C--cccCcccccccccccceeechhheeeecccce-e--ccCCCCeeeccchhhhHHHHhhHHHHhh--HHHHhhhcccc Confidence 2 223333332 235677776654432210 0 1234556789999888777765544433 3445443 Q ss_pred CceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecC--------------------------CcceeEEe--cc Q lcl|NC_019527. 297 RTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFD--------------------------SEDIVQVN--TP 348 (516) Q Consensus 297 ~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~--------------------------~e~~e~~~--~~ 348 (516) -.|+=+|+.++-.. ..++..+-+ ++.++ +-++-|+. +-+++++. -+ T Consensus 271 RRvFYIDVGnLPk~-KAeqYl~~i--M~k~k---NklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqn 344 (511) T protein:vir:56 271 RRVFYVDVGNLPTQ-KAQQYVNGI--MQNVK---NRVVYDTQTGQVKNTTNAMSMLEDYYLPRREGSKGTEVSTLPGGQS 344 (511) T ss_pred ceEEEEecCCCCch-hHHHHHHHH--HHhcC---ceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCC Confidence 34555554443211 111211111 11111 01111111 11344442 23 Q ss_pred cCCHHHHHHHHHHHHHhhhcCCceeeecc-ccccccc--cch--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH---- Q lcl|NC_019527. 349 LSGLADLQSQSQEHMCSVSKIPAIKLTGI-SPSGLNA--SSE--GEIRSFYDDISSVQQSYYFSPLDTMLKVI-QL---- 418 (516) Q Consensus 349 lsgl~d~~~~~~~~iaaas~IP~t~L~G~-sp~Glna--tge--~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l-~~---- 418 (516) |+.++|+ ..|+..+-.+.++|.++|-.. +.+|+|- ++| -|.-.|..+|.+.|..+ ..++..+++.= .+ T Consensus 345 lgem~DV-~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~KFI~RLR~rF-s~lF~~~Lk~qLilKgii 422 (511) T protein:vir:56 345 LGDIEDV-LYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKFTKFVKRLQTKF-ETVITDPLKHQLIVNNII 422 (511) T ss_pred cChHHHH-HHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhhccCC Confidence 5556554 589999999999999999743 3456652 222 15567999999998654 45555554421 11 Q ss_pred --HhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHc-C-CCCHHHHHHHHHhhhccCCCCCChhhhcccccc Q lcl|NC_019527. 419 --SKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITN-S-VIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEM 494 (516) Q Consensus 419 --s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~-g-vi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~ 494 (516) ..|-.+-+++.|+|..=.--+|-..+|+...+.++++.+-.= | .+|.+-+++.+-.. .|+++......+ T Consensus 423 t~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ILr~-------tDeei~~~~k~I 495 (511) T protein:vir:56 423 TEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKNILRL-------SDDQITAMQSEI 495 (511) T ss_pred CHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHHHHhcc-------CHHHHHHHHHHH Confidence 123333457889999888888999999999998888876331 2 56888887765332 233333322222 Q ss_pred chhcCCCCCCCCCCCC Q lcl|NC_019527. 495 FDDDGADPYMPDPDVL 510 (516) Q Consensus 495 ~~~e~~~~~~~~~~~~ 510 (516) .++..++--...+++. T Consensus 496 ~~E~k~~~~~~~e~~f 511 (511) T protein:vir:56 496 DEEETNPRFQQDDQGF 511 (511) T ss_pred HHhhcCCCCCCcccCC Confidence 2222221111111222 No 217 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=98.78 E-value=4.8e-08 Score=60.69 Aligned_cols=327 Identities=11% Similarity=0.097 Sum_probs=154.1 Q ss_pred chhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCc-cccccCCCCCCCccCCCccchhcccccccchhh Q lcl|NC_019527. 4 FDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDA-ATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQ 82 (516) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~ 82 (516) -.||+ ...++ ...+......... .-.|--|++ |.... .+ .. T Consensus 1 ~~~~~-----~~~~~------------------~~~~~~~~~~~~~~~~~~~~p~~---v~~~~---~~---------~~ 42 (344) T protein:vir:56 1 MSKKK-----GKTPQ------------------PAAKTMTASAPKMEAFTFGEPVP---VLDRR---DI---------LD 42 (344) T ss_pred CCCCC-----CCCCc------------------hhhHHhhcCCCceEEEEcCCcee---ecCcc---hh---------hh Confidence 11111 00000 0001111111111 111222211 11111 01 11 Q ss_pred hcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHH Q lcl|NC_019527. 83 FLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGI 162 (516) Q Consensus 83 ~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~ 162 (516) +..-...+.++--..+| ..|..+++.|+....+|...+...++ ++. +. + +--+.. T Consensus 43 ~~~~~~~~~~~~pp~~~--~~la~~~~a~~~h~s~i~~k~n~l~~-~~~-Pn---p------------------~~t~~~ 97 (344) T protein:vir:56 43 YVECISNGRWYEPPVSF--TGLAKSLRAAVHHSSPIYVKRNILAS-TFI-PH---P------------------WLSQQD 97 (344) T ss_pred HHHhhhcCccccCCCCH--HHHHHHHhhhhhhCccceehhhhHHh-hcC-CC---C------------------CCCHHH Confidence 11111222211111223 34677777776666666655543322 221 00 0 001122 Q ss_pred HHHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee--- Q lcl|NC_019527. 163 IQKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG--- 239 (516) Q Consensus 163 l~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g--- 239 (516) |..++..-.++|.|++.+..+. .|.+.+|.++.+.+|..... .+ .+|++.+ T Consensus 98 f~~~~~d~ll~Gnay~~~~rn~---------------~G~~~~L~pl~~~~v~~~~~--~~---------~~~~~~~~g~ 151 (344) T protein:vir:56 98 FSRFVLDFLVFGNAFLEKRYST---------------TGKVIRLETSPAKYTRRGVE--ED---------VYWWVPSFNE 151 (344) T ss_pred HHHHHHHHHhcCCeEEEEEECC---------------CCcEEEEEEeCCceeEEeec--CC---------EEEEEecCCe Confidence 4333334457899998875431 23455677777766653211 11 1344432 Q ss_pred -eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce--eeecchhhhcCccHHHH Q lcl|NC_019527. 240 -REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF--LKTNMAQVLNGGEGGDV 316 (516) Q Consensus 240 -~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v--~k~~~~~~l~~~~~~~l 316 (516) ..+.+..|||+.+.. +....+|+|.+..+...+..-..+......+..+....- +.+. ...++.++.+.+ T Consensus 152 ~~~~~~~dIiHir~~~------~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~-d~~ls~e~~~~l 224 (344) T protein:vir:56 152 PTAFAPGSVFHLLEPD------INQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVT-DAVQDRNDIEML 224 (344) T ss_pred EEEEcCccEEEECCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec-CCCCCHHHHHHH Confidence 357789999997532 234567999999999999888888888888887755433 2322 123554444556 Q ss_pred HHHHHHHHHhcCCcceEEEecC---CcceeEEecccCCH----HHHHHHHHHHHHhhhcCCceeeecccccccc--ccch Q lcl|NC_019527. 317 FDRVEMYVNMQSNLGLAVMDFD---SEDIVQVNTPLSGL----ADLQSQSQEHMCSVSKIPAIKLTGISPSGLN--ASSE 387 (516) Q Consensus 317 ~~r~~~~~~~~sn~g~~~id~~---~e~~e~~~~~lsgl----~d~~~~~~~~iaaas~IP~t~L~G~sp~Gln--atge 387 (516) .+.++.. ...+|...+++... .+.++...++.+.- -++.....+.||++.+||-. |+|..+.+.. ++-+ T Consensus 225 k~~~~~~-~g~~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~-llGi~~~~t~~~~n~e 302 (344) T protein:vir:56 225 RENMVKS-KGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQ-LMGGKPENVGSLGDIE 302 (344) T ss_pred HHHHHHh-cCCCCccceEEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHH-HhccCCCCCCccccHH Confidence 6666543 23445455555432 13344444444433 34566667889999999985 5576654322 2233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCC-CHH Q lcl|NC_019527. 388 GEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQT-SAK 441 (516) Q Consensus 388 ~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~-sek 441 (516) ...+.| -+..|.|.++++.++.. ..+.+ .+.|++..-. ++. T Consensus 303 q~~~~f-------~~~tL~Pl~~~ie~~n~-~l~~~-----~~~F~~y~l~~~~~ 344 (344) T protein:vir:56 303 KVAKVF-------VRNELIPLQDRIREING-WIGQE-----VIRFKNYSLDTDNG 344 (344) T ss_pred HHHHHH-------HHHHHHHHHHHHHHHHh-hhccc-----cccCCCccccccCC Confidence 333333 34567787776655322 22211 1445443211 111 No 218 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=98.76 E-value=1.2e-09 Score=69.57 Aligned_cols=239 Identities=12% Similarity=0.074 Sum_probs=133.4 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTL 121 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~ 121 (516) |.- |..+.. ....+.. .... +.... ..+.. + .+...|- ...+.+++-+..+|+.+ T Consensus 1 Mgl--------F~~~~~-r~~~~~~--~~~~--~~~~~----~~~~~-~------~~~~~v~-~~~al~~~~v~~~i~~i 55 (251) T protein:vir:46 1 MGI--------FYKNEK-RDLQYNE--DDLQ--MMVQT----LPSFQ-G------TKLRQYK-DIEAIRHSDIFTAVMMI 55 (251) T ss_pred CCc--------cccccc-cccCCCc--cchh--hhhhh----hcccc-C------cCcceec-hhhhhccHHHHHHHHHH Confidence 210 100000 0000000 0000 00000 00000 0 0011111 12233566788999999 Q ss_pred hHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhc-ccceeeEEEEEecCCCcccCcccccccccc Q lcl|NC_019527. 122 STELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHD-CFFGRGQISINIKGADVSVPLILDPRTIKK 200 (516) Q Consensus 122 aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~-rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~ 200 (516) ++++-+..+++....+... .......|........-...|.+++... .++|.|++++.-++ . T Consensus 56 a~~iA~lp~~~~~~~~~~~--~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~---------------~ 118 (251) T protein:vir:46 56 ASDLARMPIRVTVNGQINY--SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDK---------------T 118 (251) T ss_pred HHhHhhCceEEeeCccccc--cchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECC---------------C Confidence 9999999998865443322 2233344555555555566777777766 66788888875432 2 Q ss_pred cceeeEEeecceeeccccccccccccccccCcceeEE------ee--eEeccceEEEecCCcchhhhhhccCCCCchHHH Q lcl|NC_019527. 201 GSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWV------LG--REMHASRLLTIITRPLPDMLKPAYNFSGISMSQ 272 (516) Q Consensus 201 g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v------~g--~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le 272 (516) |.+.+|.+++|.+|++..-. .|.+.++.. .| ..+.++.||||.+..+ ..+.|.|.++ T Consensus 119 G~~~~L~~i~~~~v~v~~~~--------~g~~~~~~~~~~~~~~g~~~~~~~~diiH~r~~~~-------dg~~G~spi~ 183 (251) T protein:vir:46 119 GEPMNLTFRKTSEIELKSDA--------RGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSL-------DGINGLSLLD 183 (251) T ss_pred CcEEEEEEECCceEEEEECC--------CCcEEEEEEEeccCCcceeEEECCccEEEecCcCC-------CCeeecCHHH Confidence 44667899999988753211 122322211 12 4688999999986432 3468999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCcc-HHHHHHHHHHHHHhcCCcceEEEecCCc Q lcl|NC_019527. 273 LAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGE-GGDVFDRVEMYVNMQSNLGLAVMDFDSE 340 (516) Q Consensus 273 ~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~-~~~l~~r~~~~~~~~sn~g~~~id~~~e 340 (516) .+.+.|.....+......++.+.... +++++ ..+..++ .+.+.++++......+|.|.+.+.++ | T Consensus 184 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~--~~l~~~e~~~~~~~~~~~~~~g~~n~g~~~~gm~-~ 251 (251) T protein:vir:46 184 TLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK--GVLDNKKARDRAREEFPKVLVELNKLGKLSYSMN-Q 251 (251) T ss_pred HHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeC--CCCCCHHHHHHHHHHHHHHhcCcccccccccccC-C Confidence 99999999999999999999987654 44554 3343332 23455556655555567777666553 3 No 219 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=98.75 E-value=5e-08 Score=60.62 Aligned_cols=320 Identities=8% Similarity=0.014 Sum_probs=154.3 Q ss_pred hhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcc Q lcl|NC_019527. 6 RKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLN 85 (516) Q Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~ 85 (516) =+|.+ +++.+ + ....+..++. |--|++ | +|+. ..+.+.. T Consensus 1 m~~~~---~~~~~------------~----------~~~~~~~~~~-~~~p~~---~----------~~~~--~~~~~~~ 39 (337) T protein:vir:78 1 MTKRQ---QQPAQ------------A----------AASSPRPSVV-FSMPEA---I----------DPTA--WMTDYTG 39 (337) T ss_pred CCCcc---cCccc------------c----------cccCceeEEE-ecCccc---c----------cCcc--hhHhhhh Confidence 01000 00000 0 0000011122 111110 0 1111 0111111 Q ss_pred cccCC-cccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHH Q lcl|NC_019527. 86 SAAGG-LYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQ 164 (516) Q Consensus 86 ~~~~~-~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ 164 (516) -.... ..+++ .+..-..|..+++.++.-+.++..-+.. +..++... ++.+. T Consensus 40 ~~~~~~~~~~~-pP~~~~~La~l~~~~~~h~~~L~~k~N~-~~~~f~~~--------------------------~~~~~ 91 (337) T protein:vir:78 40 VFYNPYGEYYQ-PPIDRKGLAKVARANAHHGAILMARRNM-VAGRFTNQ--------------------------RATIT 91 (337) T ss_pred hhhccCcceec-CCCCHHHHHHHhhcchhhhhHHHhhhcc-ccccCcCc--------------------------HHHHH Confidence 11110 11111 1233355777888777777776665542 22233211 01233 Q ss_pred HHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEe---eeE Q lcl|NC_019527. 165 KAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVL---GRE 241 (516) Q Consensus 165 ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~---g~~ 241 (516) .++..-.+||.|++++..++ .|.+.+|.++++.+|.... | . +| +|... ... T Consensus 92 ~~~~d~ll~GNay~~~~rn~---------------~G~~~~L~pl~~~~v~~~~----d--~-~~----~~~~~~~~~~~ 145 (337) T protein:vir:78 92 AFVHNYLQFGDGGLLKLRNS---------------FGQVVGLHPLSSVYLRRRE----D--G-CF----VYLQQGKPNLI 145 (337) T ss_pred HHHHHHHhhCCeEEEEEECC---------------CCcEEEEEEeCCceeEeee----C--C-eE----EEEEcCCceEE Confidence 33434467899998875532 1334456667666554221 1 0 11 12122 235 Q ss_pred eccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecch-hhhcCccHHHHHHHH Q lcl|NC_019527. 242 MHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMA-QVLNGGEGGDVFDRV 320 (516) Q Consensus 242 iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~-~~l~~~~~~~l~~r~ 320 (516) +.++.|+|+.+.. +....+|+|.++.+...+..-..+......+..+....-.-..+. ..++.++.+.+.+.+ T Consensus 146 ~~~~eIiHik~~~------~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~~l~~e~~~~lk~~~ 219 (337) T protein:vir:78 146 YRPDDVIWLAQYD------PEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATDPNMDDDTEEEMKEMI 219 (337) T ss_pred ECCccEEEECCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHH Confidence 6678899987532 233567999999999999888888888888877755432221111 224444445566666 Q ss_pred HHHHHhcCCcceEEEe-c--CCcceeEEecccCCHH----HHHHHHHHHHHhhhcCCceeeecccccccccc---chHHH Q lcl|NC_019527. 321 EMYVNMQSNLGLAVMD-F--DSEDIVQVNTPLSGLA----DLQSQSQEHMCSVSKIPAIKLTGISPSGLNAS---SEGEI 390 (516) Q Consensus 321 ~~~~~~~sn~g~~~id-~--~~e~~e~~~~~lsgl~----d~~~~~~~~iaaas~IP~t~L~G~sp~Glnat---ge~D~ 390 (516) +.. ....|.+.+++. . ..+.++...++.+..+ ++.....+.||++.+||-. |+|+.+.+-.+| -|... T Consensus 220 ~~~-~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~-llGi~~~~~~~~~~n~e~~~ 297 (337) T protein:vir:78 220 ANS-KGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPA-LAGIIPTNGGGGLGDPEKYD 297 (337) T ss_pred HHh-cCcccccceEEEcCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHH-HcccccCCCcCccccHHHHH Confidence 543 344566654443 1 1233444444444332 3455667899999999985 557765443222 23223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCc--ceEEeCCCCCC Q lcl|NC_019527. 391 RSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDA--ITFKFKSLWQT 438 (516) Q Consensus 391 ~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d--~~~~f~pL~~~ 438 (516) . .+-+..|.|.++++.+.+-+... +.. +.|+|+.=-.+ T Consensus 298 ~-------~f~~~~L~P~~~~ie~~~n~~ll---~~~~~~~f~~~~~~~~ 337 (337) T protein:vir:78 298 A-------TYARNEVLPLCELVQDAINSAGL---PRALWVTFRETIGAAV 337 (337) T ss_pred H-------HHHHHHHHHHHHHHHHHHhhhcC---ChhhceeccccccccC Confidence 3 33445688988888887754322 222 34555432222 No 220 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=98.73 E-value=7.8e-08 Score=59.56 Aligned_cols=333 Identities=10% Similarity=0.042 Sum_probs=152.2 Q ss_pred chhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhh Q lcl|NC_019527. 4 FDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQF 83 (516) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~ 83 (516) -+||+ +++.+...+.+..+..+.++.+ .....++. |--|++ | +|.. ....+ T Consensus 1 m~~~~-~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~-~~~p~~---v----------~~~~--~~~~y 51 (350) T protein:vir:11 1 MSKRR-SHRRQQPVTVQSAQEGEFIPRQ------------GGRAEAFT-FGDPMP---V----------LDGR--GILDY 51 (350) T ss_pred CCccc-cCCCcCccccCCcchhhhcccc------------ccceEEEE-eCCcee---e----------cCcc--hhhHH Confidence 22221 1111111111111111111100 00001111 111210 1 1110 00111 Q ss_pred cccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHH Q lcl|NC_019527. 84 LNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGII 163 (516) Q Consensus 84 ~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l 163 (516) ..-...+.......++. .|..+++.++....+|..-.....+ ++. +. .+--+..| T Consensus 52 ~~~~~~~~~~~pp~~~~--~la~~~~~~~~h~~~l~~k~n~l~~-~~~-Pn---------------------~~~t~~~f 106 (350) T protein:vir:11 52 LECWPNGRWYEPPLSME--GLAKSVGSSVYLQSGLKFKRNMLAK-TFI-PH---------------------RLLSRATF 106 (350) T ss_pred HHHhhcCccccCCCCHH--HHHHHHhhhhhhccchhhhhhhhhh-ccc-CC---------------------CCCCHHHH Confidence 11111121111112232 3556666666655555544433222 111 00 01112234 Q ss_pred HHHHHhcccceeeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEee---- Q lcl|NC_019527. 164 QKAAEHDCFFGRGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLG---- 239 (516) Q Consensus 164 ~ea~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g---- 239 (516) ..++....+||.|++++..+. .|.+.+|.++++.+|..... .+ . +|++.+ T Consensus 107 ~~~v~d~ll~Gnay~~~~rn~---------------~G~~~~L~~l~~~~vr~~~~--~~----~-----~~~~~~~~~~ 160 (350) T protein:vir:11 107 EQFSLDWLTFGSAYLEQPRSR---------------LGTRMPLQAPLAKYMRRGTD--LE----T-----FYQVRSWKDE 160 (350) T ss_pred HHHHHHHHhcCCeEEEEEEcC---------------CCCEEEEEEeCCceeEeeec--CC----e-----EEEEeeCCeE Confidence 445555567999998875432 23455677777777664221 11 1 344432 Q ss_pred eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHH Q lcl|NC_019527. 240 REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVF 317 (516) Q Consensus 240 ~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~ 317 (516) ..+.++.|||+.... +....+|+|.+..+...+..-..+......+..+.... ++++.. ..++.++.+.+. T Consensus 161 ~~~~~~eVihir~~~------~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~-~~ls~e~~~~l~ 233 (350) T protein:vir:11 161 HEFEKGSVIQLREAD------INQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSHAGFILYMTD-AAQNEEDIDALR 233 (350) T ss_pred EEECcccEEEeCCCC------CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-CCCCHHHHHHHH Confidence 367789999997532 23346799999999999988777777777777776543 233321 235554555566 Q ss_pred HHHHHHHHhcCCcceEEEecC---CcceeEEecccCC----HHHHHHHHHHHHHhhhcCCceeeeccccccc--cccchH Q lcl|NC_019527. 318 DRVEMYVNMQSNLGLAVMDFD---SEDIVQVNTPLSG----LADLQSQSQEHMCSVSKIPAIKLTGISPSGL--NASSEG 388 (516) Q Consensus 318 ~r~~~~~~~~sn~g~~~id~~---~e~~e~~~~~lsg----l~d~~~~~~~~iaaas~IP~t~L~G~sp~Gl--natge~ 388 (516) +.++.. ....|.+.+++... .+.++...++.+. +-++.....++||++.+||-. |+|+.+.+- .++-+. T Consensus 234 ~~~~~~-~G~~N~~~~~v~~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~-llGi~~~~t~~~sn~e~ 311 (350) T protein:vir:11 234 TALKTA-KGPGNFRNLFVYAPNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQ-LMGVVPQNAGGFGSISD 311 (350) T ss_pred HHHHHh-cCccccCceeeecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhcccCCCCCCcCCHHH Confidence 666542 34456665554321 2334444333332 334555677889999999975 667654422 123343 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCC--CCCC Q lcl|NC_019527. 389 EIRSFYDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKS--LWQT 438 (516) Q Consensus 389 D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~p--L~~~ 438 (516) ..+.|+ ++.|.|.++.+.++.- ..|. +- +.|++ |-.+ T Consensus 312 ~~~~f~-------~~~L~P~~~~ie~ln~--~l~~--~~--~~F~~~~~~~l 350 (350) T protein:vir:11 312 AAAVWA-------SLELAPMQTRLQQVNE--MIGE--EV--VRFAQFDAPGL 350 (350) T ss_pred HHHHHH-------HHHHHHHHHHHHHHHh--hcCc--cc--cccCcccccCC Confidence 334443 2446777776655321 1221 11 22332 1111 No 221 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=98.54 E-value=3.2e-07 Score=56.19 Aligned_cols=305 Identities=10% Similarity=-0.009 Sum_probs=138.1 Q ss_pred eeEEEEEecCCCcccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeE--E---eeeEeccceEEE Q lcl|NC_019527. 175 RGQISINIKGADVSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWW--V---LGREMHASRLLT 249 (516) Q Consensus 175 ~a~i~i~i~~~~~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~--v---~g~~iH~SRli~ 249 (516) -.=++...+++. ..++.|...++.++. .+..++ +-+.-...+ + .+..+++.+.|+ T Consensus 1 v~Eivw~~~~g~--------------~~~~~l~~r~~~~~~---~f~~~~---~~~l~~~~~~~~~g~~~~~lp~~kfi~ 60 (355) T protein:vir:78 1 MFEQVYRIENGR--------------ARLGKLAWRPPRTIS---RFDVAP---DGGLVAIEQWGVFGKATVRIPVDRLVV 60 (355) T ss_pred CeEEEEEeeCCe--------------EEEeeeeecCcccee---eeeecc---CCceeEEEecCCCCCCcceeccCCEEE Confidence 000111111111 112223333332111 000010 101000000 1 234677777777 Q ss_pred ecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHh--CCceeeecchhhhcCcc--------HHHHHHH Q lcl|NC_019527. 250 IITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKF--SRTFLKTNMAQVLNGGE--------GGDVFDR 319 (516) Q Consensus 250 ~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~--~~~v~k~~~~~~l~~~~--------~~~l~~r 319 (516) +.... ...+.+|.+++..||-.+.--......-+.++.++ .+.+.+..........+ ....... T Consensus 61 ~~~~~------~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l 134 (355) T protein:vir:78 61 FVNER------EGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEG 134 (355) T ss_pred EEeCC------CCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHH Confidence 65432 34567899999999998877777788888899988 56676653211111111 1111222 Q ss_pred HHHHHHhcCCcceEEEecCCcceeEEeccc--CCHHHHHHHHHHHHHhhhcCCceeeeccccc-cccccchHHHHHHHHH Q lcl|NC_019527. 320 VEMYVNMQSNLGLAVMDFDSEDIVQVNTPL--SGLADLQSQSQEHMCSVSKIPAIKLTGISPS-GLNASSEGEIRSFYDD 396 (516) Q Consensus 320 ~~~~~~~~sn~g~~~id~~~e~~e~~~~~l--sgl~d~~~~~~~~iaaas~IP~t~L~G~sp~-Glnatge~D~~~yyd~ 396 (516) .++.....++....++..++.+++.++..- ++...+++..-++|+-+.--. |.-.+.+.+ |-+|-|+.-.....+. T Consensus 135 ~~~~~~i~~g~~a~~iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~iLGq-tlTs~~~~~gGS~Alg~vh~~v~~~~ 213 (355) T protein:vir:78 135 LQLAKEFRAGEAAGGYIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAVLAH-FLTLGGDKSTGSYALGDTFASFFTGS 213 (355) T ss_pred HHHHHHhhCCcceeEeecCCceEEEeecCCCcccHHHHHHHHHHHHHHHHhhh-hhccccCCccchhhHHHHHHHHHHHH Confidence 333333344433333334457888886542 245667777777776543221 111111111 3334456556677777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHH-HHHHHHhh Q lcl|NC_019527. 397 ISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSE-ARQQLSDD 475 (516) Q Consensus 397 I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e-~r~~l~~~ 475 (516) +++-......-+.+.|+.-|+.-.||..+.--.|+|...- + ..++.|++++.+++.|++.+++ ..+.++. T Consensus 214 ~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~P~~~~~~~~---~-----~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e- 284 (355) T protein:vir:78 214 LNAVMKHIADVTQQHVVEDLVDQNWGPEEPAPRLVPAQLG---K-----EQPVTAEAIRALVECGAFTADPELEKDLRA- 284 (355) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCcC---h-----hHHHHHHHHHHHHhCCCccccHHHHHHHHH- Confidence 7777765444444567777776667665444567775432 1 1134678889999999876543 2333332 Q ss_pred hccCCCCCChhhh-ccccccchhcCC------------CCCCCCCCCCCCCCCC Q lcl|NC_019527. 476 PDSGWDNIDGDLE-IVQPEMFDDDGA------------DPYMPDPDVLPGEEGS 516 (516) Q Consensus 476 ~~~~~~~~d~~~e-~~~~e~~~~e~~------------~~~~~~~~~~~~~e~t 516 (516) ..|++.-.+..+ ............ +.+...+.+.+..+.+ T Consensus 285 -~~gip~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~a~~~~~~~ 337 (355) T protein:vir:78 285 -RYGLPAPAERDDGADAAAAKAAGRRRAKRLPGQRQGAALPSRSPRADPPRRRG 337 (355) T ss_pred -HhCCCCCCCCCcccCCccccccccccccccCCccccccccccCCCCCChhhhH Confidence 113321111111 000000000000 0000011111111111 No 222 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=98.46 E-value=5.7e-07 Score=54.82 Aligned_cols=428 Identities=8% Similarity=-0.019 Sum_probs=185.8 Q ss_pred CccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccc----cCcccHHHHHHHHhCchhhhhhhhhhH Q lcl|NC_019527. 48 DAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADI----QPFPGYQNLAALATRPEYRAFASTLST 123 (516) Q Consensus 48 ~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~----~~f~gy~ll~~y~~~~i~r~iVd~~ae 123 (516) -|.. -....|+-| .+||..+ +.+....++..+-+. +.--++.+........-+..++++.-. T Consensus 1 ~~~~----~~~~~gl~p----~rl~~i~------~~~~~~~~~~~~~~~~~~Lr~~~~~~ly~~m~~D~hi~s~l~~Rk~ 66 (488) T protein:vir:95 1 MADI----TETQESLPP----FRMGEVG------SLGLKVKNGRIYEEPRQALRFPESIKTFQLMMRDPAVAASVNIIKM 66 (488) T ss_pred CCCc----cccCCCCCH----HHHHHHH------HHhhccccchhhccchhhhcccchHHHHHHHhhChHHHHHHHHHHH Confidence 0000 011123222 1244322 111111111111111 111134454444568889999999999 Q ss_pred HHhhCCCeeeeccccchhhh-HHHHHHHHHHHHhcCh-h-HHHHHHHHhcccceeeEEEEEecCCCccc-Ccc--ccccc Q lcl|NC_019527. 124 ELTREGIEITSKDRTKAKEM-ASKIKELEEACEYYGV-M-GIIQKAAEHDCFFGRGQISINIKGADVSV-PLI--LDPRT 197 (516) Q Consensus 124 d~~r~~~~i~~~~~~~~~~~-~~~i~~i~~~~~~l~~-~-~~l~ea~~~~rlyG~a~i~i~i~~~~~~~-Pl~--ld~~~ 197 (516) -.+..-|.|...++.+.+.+ .+..+.++..++.+.. + +.+.+++ .+.+||.|+.=+.=+...... +.. .+... T Consensus 67 av~~~~w~v~p~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~ 145 (488) T protein:vir:95 67 FVRKVNWRFVPPKGKEQDPKMLERADFFNSLMDDMEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGL 145 (488) T ss_pred HHhcCCceEecCCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHHH-HhhcccceeeeeeeeccccccccccccccCCe Confidence 88888888875443322211 1223446666766653 3 4455554 689999998544322111100 000 00000 Q ss_pred ccccceeeEEeecce---e--eccccc----cccccccccccCcceeEE---eeeEeccceEEEecCCcchhhhhhccCC Q lcl|NC_019527. 198 IKKGSLTGFSNIEPM---W--TSPSAY----NALDPTAPDFYKPSTWWV---LGREMHASRLLTIITRPLPDMLKPAYNF 265 (516) Q Consensus 198 I~~g~l~~l~v~d~~---~--v~p~~~----~~~dp~s~~yg~P~~y~v---~g~~iH~SRli~~~~~~~p~~~k~~~~~ 265 (516) + .++.|.+..++ | ..+... ...++....+..+..|.. .+..|.+.+.+.+.... ...+. T Consensus 146 ~---~~~~i~~Rpq~~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~------~~g~p 216 (488) T protein:vir:95 146 I---GWAKLPIRNQSTLDKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDD------EYGNP 216 (488) T ss_pred e---eeeeeeecCcccccceeeccCCCceeecccccccccccccccccccccccccccccceEEEeecC------CCCcc Confidence 1 11222222221 1 111100 001111112222333322 24567777877766432 34567 Q ss_pred CCchHHHHHHHHHHHHHHHHHHHHHHHHHhC--Cceeeecch--hhhcCccHHHHHHHHHHHH-HhcC--CcceEEEecC Q lcl|NC_019527. 266 SGISMSQLAQPYVENWLRTRQSVSDLVDKFS--RTFLKTNMA--QVLNGGEGGDVFDRVEMYV-NMQS--NLGLAVMDFD 338 (516) Q Consensus 266 ~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~--~~v~k~~~~--~~l~~~~~~~l~~r~~~~~-~~~s--n~g~~~id~~ 338 (516) +|.+++..|+-...--.....--+.++.++. +.+.+..-. .--+..+...+.+.+..+. .... -.|+.+-.+- T Consensus 217 ~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~ 296 (488) T protein:vir:95 217 EGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWPRYI 296 (488) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhheeecccc Confidence 7999999998877655566666677777754 455554211 1111111222333333222 2111 1233322111 Q ss_pred Cc-------ceeEEecccC---CHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 339 SE-------DIVQVNTPLS---GLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSP 408 (516) Q Consensus 339 ~e-------~~e~~~~~ls---gl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~ 408 (516) .. +++.++..=+ ....+++..-++|+-+.--. |.-.+..-+|-+|.|+.-.....+.+++-......-+ T Consensus 297 ~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLGq-tLT~~~~~~Gs~Al~~vh~ev~~~i~~aDa~~i~~tl 375 (488) T protein:vir:95 297 DPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMSD-VLAMGQSKYGSFSLADSKTSLLAMSVDILLKQIKNVI 375 (488) T ss_pred ccccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHHhcc-ccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 1222222212 24557777777777543111 1111112235555566666777777777665544455 Q ss_pred HHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCH-----HHHHHHHHhhhccCCCCC Q lcl|NC_019527. 409 LDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDP-----SEARQQLSDDPDSGWDNI 483 (516) Q Consensus 409 l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~-----~e~r~~l~~~~~~~~~~~ 483 (516) .+.|+.-|+.-.||....--+|+|...-..+ .++.|++++.++++|+.-+ +.+|+.+ |++.- T Consensus 376 n~~li~~l~~~Nfg~~~~~P~~~~~~~e~~D-------l~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~------gip~~ 442 (488) T protein:vir:95 376 NRDLVAQTYALNMWDDEEHVQITYDDIETPD-------LEAIGSYIQKTVAVGALEVDKELSNKLREHI------GLPPA 442 (488) T ss_pred HHHHHHHHHHhcCCCCCCccEEEecCcChhh-------HHHHHHHHHHHHhCCCccccHHHHHHHHHHh------CCCCC Confidence 5567776666667654433467775442222 2356788888999998755 2345444 33321 Q ss_pred ChhhhccccccchhcCCCCCCCCCC--------CCCCCCCC Q lcl|NC_019527. 484 DGDLEIVQPEMFDDDGADPYMPDPD--------VLPGEEGS 516 (516) Q Consensus 484 d~~~e~~~~e~~~~e~~~~~~~~~~--------~~~~~e~t 516 (516) ..+++...+..+. ..+..+... ..+..+.+ T Consensus 443 ~~~e~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (488) T protein:vir:95 443 DESQPVSEKLSPN---SQSRSGDGYKTAGEGTAKTPSAKDP 480 (488) T ss_pred CCCccccccCCCC---CCCCCCcccCCCcccCCcccccccc Confidence 1111110010000 001111110 11111111 No 223 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=98.40 E-value=8.4e-07 Score=53.90 Aligned_cols=413 Identities=9% Similarity=-0.023 Sum_probs=190.2 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchhcc--c-------ccccchh--hhcccccCCcccccccCcccH-HH-HHHH Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAVAM--D-------SLCGPTY--QFLNSAAGGLYAADIQPFPGY-QN-LAAL 108 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~--d-------s~~~~~~--~~~~~~~~~~~~~~~~~f~gy-~l-l~~y 108 (516) |.+... ++-.|...++. .....+.++. + +++++.. .....+.. ..+..| +| +.+. T Consensus 1 m~~~~d----~~g~p~~~~~~-~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~-------gd~~~~~~L~~dm~ 68 (512) T protein:vir:19 1 MGRILD----ISGQPFDFDDE-MQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAER-------GDLTAQADLAFDME 68 (512) T ss_pred CcceeC----CCCCccccccc-cccccchhcccchhhccccccCCCHHHHHHHHHHhhC-------CCHHHHHHHHHHHH Confidence 100000 00011100000 0000011110 0 0000000 00000110 112222 22 2333 Q ss_pred HhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEE--ecCCC Q lcl|NC_019527. 109 ATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISIN--IKGAD 186 (516) Q Consensus 109 ~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~--i~~~~ 186 (516) .+..-+..++++.-.-.+..-|.|....++..+ .....+.+++.+..+.-++.+..-+-.+.+||.+++=+. .+++. T Consensus 69 ~~D~hi~s~l~~Rk~av~~~~w~I~p~~~~~~~-~~~~a~~v~~~l~~~~~f~~~~~~lldA~~~G~s~~Ei~w~~~~g~ 147 (512) T protein:vir:19 69 EKDTHLFSELSKRRLAIQALEWRIAPARDASAQ-EKKDADMLNEYLHDAAWFEDALFDAGDAILKGYSMQEIEWGWLGKM 147 (512) T ss_pred hhChHHHHHHHHHHHHHhCCCceEecCCCCCHH-HHHHHHHHHHHHhcCCCHHHHHHHHHhhhhhcceeeeeEeeeeCCc Confidence 467888888998888888878888654332211 112334566677666434444445557999999985442 22322 Q ss_pred cccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCC Q lcl|NC_019527. 187 VSVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFS 266 (516) Q Consensus 187 ~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~ 266 (516) + .++.|...+|.|.........-. .+.. ..+.|..+++...+++.... ...+.+ T Consensus 148 ~--------------~~~~~~~r~~~~f~~~~~~~~~l---r~~~---~~~~G~~l~~~k~i~~~~~~------~~g~p~ 201 (512) T protein:vir:19 148 R--------------VPVALHHRDPALFCANPDNLNEL---RLRD---ASYHGLELQPFGWFMHRAKS------RTGYVG 201 (512) T ss_pred e--------------eeeeeeeeccccceeccCCCcEE---EecC---CCCCceeecCCceEEEeccC------CCCCcc Confidence 1 23345556666654221100000 0000 01235567776666665543 234567 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhCCcee--eecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeE Q lcl|NC_019527. 267 GISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFL--KTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQ 344 (516) Q Consensus 267 G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~--k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~ 344 (516) |.+++..|+-...--..+...-+.++.++++++. |++.. ...++....++.+..+.++ +..++. ++.+++. T Consensus 202 g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~-----a~~~ek~~L~~al~~~~~~-a~~iiP-~~~~ie~ 274 (512) T protein:vir:19 202 TNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPTG-----STNREKATLMQAVMDIGRR-AGGIIP-MGMTLDF 274 (512) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCCC-----CCHHHHHHHHHHHHHHhhC-cEEEec-CCceEEE Confidence 9999999998888888888888899999997654 44311 1222333444555555444 344444 3478888 Q ss_pred EecccCCH---HHHHHHHHHHHHhh-hcCCceeeeccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 345 VNTPLSGL---ADLQSQSQEHMCSV-SKIPAIKLTGISPSGLNASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLSK 420 (516) Q Consensus 345 ~~~~lsgl---~d~~~~~~~~iaaa-s~IP~t~L~G~sp~Glnatge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~ 420 (516) +.+.=++. ..+++..-..|+-+ .|-=+|- +..-+|-+|.|+.-.....+.+++-......-+.+.|+.-++.-. T Consensus 275 ~ea~~~~~~~y~~li~~~d~~Isk~iLGqtlTs--~~g~~Gs~a~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N 352 (512) T protein:vir:19 275 QSAADGQSDPFMAMIGWAEKAISKAILGGTLTT--EAGDKGARSLGEVHDEVRREIRNADVGQLARSINRDLIYPLLALN 352 (512) T ss_pred eecCCCCHHHHHHHHHHHHHHHHHHHhhhhhcc--cccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Confidence 87643333 34566666667643 2221111 111124445555556666677777666544444456888777666 Q ss_pred CCCc-CC--cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCC-CCHHHHHHHHHhhhccCCCCCChhh--hcccccc Q lcl|NC_019527. 421 WGEI-DD--AITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSV-IDPSEARQQLSDDPDSGWDNIDGDL--EIVQPEM 494 (516) Q Consensus 421 ~g~~-~~--d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gv-i~~~e~r~~l~~~~~~~~~~~d~~~--e~~~~e~ 494 (516) ||.. +. --.|+|...-..+- ++.++++..+. .|+ |+.+.+++.++ ++.-..++ ....+.. T Consensus 353 ~~~~~~~~~~p~~~f~~~e~eDl-------~~~a~~~~~l~-~G~~i~~~~i~e~~G------ip~~~~~e~~~~~~~~~ 418 (512) T protein:vir:19 353 SDSTIDINRLPGIVFDTSEAGDI-------TALSDAIPKLA-AGMRIPVSWIQEKLH------IPQPVGDEAVFTIQPVV 418 (512) T ss_pred CCCCCCccccceEEecCCChhhH-------HHHHHHHHHHh-cCCCCCHHHHHHHhC------CCCCCCccccccCCCcc Confidence 6532 21 12456644322222 34555556655 564 79999999884 22111111 0000000 Q ss_pred chhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 495 FDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 495 ~~~e~~~~~~~~~~~~~~~e~t 516 (516) +...............+..+-. T Consensus 419 ~~~~~~~~~~~~~~~~~~~~~~ 440 (512) T protein:vir:19 419 PDNGSQKEAALSAEDIPQEDDI 440 (512) T ss_pred ccccccccccccccCCCchhhH Confidence 0000000000000000000000 No 224 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=98.30 E-value=2.1e-07 Score=57.17 Aligned_cols=208 Identities=12% Similarity=0.063 Sum_probs=113.0 Q ss_pred cccccccccccceeeEEeecceeeccccccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchH Q lcl|NC_019527. 191 LILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISM 270 (516) Q Consensus 191 l~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~ 270 (516) ++ .-..|.+.+.... ...+. .| ....++++.|+||.+.. +....+|+|. T Consensus 1 ~r----~~~dg~~~y~~~~----------~~~~~----~g-------~~~~~~~~eilH~r~~~------~~~~~~Glsp 49 (219) T protein:vir:98 1 MR----VCKDGNYKYLMKK----------SLYDT----KS-------EIYEYNKNDVIFIKLYD------PMQQVYGSPD 49 (219) T ss_pred Cc----eeecCeEEEEEec----------ceecC----Cc-------eeEEeccccEEEecCCC------CCCCcceecH Confidence 11 1122222221100 00000 01 13468899999997632 2345679999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCce--eeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEec-----CCccee Q lcl|NC_019527. 271 SQLAQPYVENWLRTRQSVSDLVDKFSRTF--LKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDF-----DSEDIV 343 (516) Q Consensus 271 le~~~~~l~~~~~~~~~~~~Ll~~~~~~v--~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~-----~~e~~e 343 (516) ++.+...+.....+......+..+...+- +.+.. ..|+.+.-+.+.+.++.. ....|.+.+++.. ++-+|+ T Consensus 50 i~~a~~~i~~~~aa~~~~~~~f~Ng~~p~gil~~~~-~~l~~e~~~~~~~~~~~~-~g~~n~~~~~l~~~gg~~~G~~~~ 127 (219) T protein:vir:98 50 YVGGITSALLNSDATIFRRRYYSNGAHMGFILYSTD-PDMTEEMEDEIAERIRDS-KGVGNFRSMFVNIAGGHPDGLKVI 127 (219) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEEeCC-CCCCHHHHHHHHHHHHHh-cCcccccceeEecCCCCccceeEE Confidence 99999888877777777777777765543 33321 234444444555555442 3344544444431 223566 Q ss_pred EEecccCC--HHHHHHHHHHHHHhhhcCCceeeeccccccc--cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 344 QVNTPLSG--LADLQSQSQEHMCSVSKIPAIKLTGISPSGL--NASSEGEIRSFYDDISSVQQSYYFSPLDTMLKVIQLS 419 (516) Q Consensus 344 ~~~~~lsg--l~d~~~~~~~~iaaas~IP~t~L~G~sp~Gl--natge~D~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s 419 (516) .++.+..+ +-+........||.+.+||-.+| |....+- .++-|.... .+.+..|.|.++++...|... T Consensus 128 ~~~~~~~d~qfle~rk~~~~eIa~~fgVPp~~l-G~~~~~~~~~sn~eq~~~-------~f~~~tL~P~~~~ie~~ln~~ 199 (219) T protein:vir:98 128 PIGDTGQKDEFANIKNISAQDVLTSHRFPPGLS-GIIPVNTAGLGDPLKIRE-------AYQADEVLPLQEIIAESINSD 199 (219) T ss_pred EccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHc-ccccCCCCCccCHHHHHH-------HHHHHHHHHHHHHHHHHhhhh Confidence 66554433 33445555789999999998754 6543221 123333333 334566889998888877542 Q ss_pred hCCCcCCcceEEeCCCCCCCHHH Q lcl|NC_019527. 420 KWGEIDDAITFKFKSLWQTSAKE 442 (516) Q Consensus 420 ~~g~~~~d~~~~f~pL~~~sekE 442 (516) + .+|.+..+.|+.-. .+++- T Consensus 200 -~-~~~~~~~~~F~~~~-~~d~~ 219 (219) T protein:vir:98 200 -Y-EIKSALKVNFKQPE-KRDKN 219 (219) T ss_pred -h-cCCCccEEeecCcc-cccCC Confidence 2 35778888886322 22222 No 225 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=98.26 E-value=2e-06 Score=51.86 Aligned_cols=396 Identities=13% Similarity=0.075 Sum_probs=193.7 Q ss_pred hHHhhcCCCccccccCCCCCCCccCCCccc-hhcccccccchhhhcccccCCccc-ccccCcccHHHHHHH----HhCch Q lcl|NC_019527. 40 KSMERRASDAATKWAPPQLMPGVVPAGTTP-AVAMDSLCGPTYQFLNSAAGGLYA-ADIQPFPGYQNLAAL----ATRPE 113 (516) Q Consensus 40 ~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~-~~a~ds~~~~~~~~~~~~~~~~~~-~~~~~f~gy~ll~~y----~~~~i 113 (516) -.|+. +.+| .|.-.+. +-++|+. +-+.++.+ .+. ....+..-|+-+.+| +..+- T Consensus 1 ~~~~~-------~~~p-------~~~~~~~~~~~~~~~-~~~~g~~~-----~D~~lr~~gg~~~~~~~l~~~m~e~D~~ 60 (446) T protein:vir:98 1 MNMEV-------RNAP-------TPAIRRRTIYAMEHL-GLATSYLS-----EDGGYKRAGKPTYQQLSAWDEAAQTEPI 60 (446) T ss_pred Ccccc-------cCCC-------chhhhhhhhhccccc-hhhcccCC-----cchHhhhcCCChHHHHHHHHHHHhcchH Confidence 00111 1111 0000000 0111111 00000110 000 000112224444554 45789 Q ss_pred hhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEe--cCCCcccCc Q lcl|NC_019527. 114 YRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINI--KGADVSVPL 191 (516) Q Consensus 114 ~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i--~~~~~~~Pl 191 (516) +..++++...-.++.-|+|...++ +..+.+++.+..+.+...+.. +-.+..||.++.=+.- .++. ..|. T Consensus 61 v~s~l~~Rk~av~~~~w~V~p~~~-------~~a~~v~~~l~~~~~~~~~~~-~ldai~~G~s~~Eivw~~~~g~-~~p~ 131 (446) T protein:vir:98 61 IAQGLDSIALSVLNKVGPYQHGDK-------RIKKFIDDQLRNRAKTWISHC-VKSIMTYGFSLSEQIYAHGARD-NMPA 131 (446) T ss_pred HHHHHHHHHHHhhcCCceecCccH-------HHHHHHHHHHhhcCchhHHHH-HHHHHhhCceeeeEEEeecccc-cccc Confidence 999999999888888888875422 123456777777776555544 4456669999854432 2221 2232 Q ss_pred ccccccccccceeeEEeecceeecccccc-------------c--cccccc-cccCcceeEE---eeeEeccceEEEecC Q lcl|NC_019527. 192 ILDPRTIKKGSLTGFSNIEPMWTSPSAYN-------------A--LDPTAP-DFYKPSTWWV---LGREMHASRLLTIIT 252 (516) Q Consensus 192 ~ld~~~I~~g~l~~l~v~d~~~v~p~~~~-------------~--~dp~s~-~yg~P~~y~v---~g~~iH~SRli~~~~ 252 (516) .+.. + +.++......|+...... . ..|..+ .++.|..+.- .+..|...+++++.. T Consensus 132 ~~~d-----~-~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi~~~~ 205 (446) T protein:vir:98 132 TVLD-----D-IVNYHPLQVMLIANDNGRIVDGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRLFINY 205 (446) T ss_pred hhhc-----c-ccccccccceeeeccCCccccccccchhhcccccccCcccchhhhhhhhcccCcccccccccceEEEEe Confidence 2211 0 112222322232211000 0 000000 1122222111 134577778887765 Q ss_pred CcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce--eeecch-h--hhcCccHHH----HHH-HHHH Q lcl|NC_019527. 253 RPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF--LKTNMA-Q--VLNGGEGGD----VFD-RVEM 322 (516) Q Consensus 253 ~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v--~k~~~~-~--~l~~~~~~~----l~~-r~~~ 322 (516) .. ...+.+|.|++..||-...=-..+..--+.++.++++++ .|..-. . .....+..+ ..+ -++. T Consensus 206 ~~------~~~~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~a 279 (446) T protein:vir:98 206 NT------KGNNPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDA 279 (446) T ss_pred cC------CCCCccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHH Confidence 42 345678999999999888777777778888999988654 454311 1 111111111 111 2333 Q ss_pred HHHhcCCcceEEE----ecCCcceeEEecccCC---HHHHHHHHHHHHHhhhcCCceeeeccccc--cccccchHHHHHH Q lcl|NC_019527. 323 YVNMQSNLGLAVM----DFDSEDIVQVNTPLSG---LADLQSQSQEHMCSVSKIPAIKLTGISPS--GLNASSEGEIRSF 393 (516) Q Consensus 323 ~~~~~sn~g~~~i----d~~~e~~e~~~~~lsg---l~d~~~~~~~~iaaas~IP~t~L~G~sp~--Glnatge~D~~~y 393 (516) +....+ .+..++ +-++-+++.++..-++ ...+++..-.+||-+.--.. ..+|++.+ |-+|-|+.....+ T Consensus 280 v~~~~~-da~~ii~~~~~P~g~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~-Ltl~~~~~~~GS~ala~vh~~V~ 357 (446) T protein:vir:98 280 LRRLST-DSGLVLTQLSKEQPVQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPN-LLVQNRETTFGTGRASEIQLELF 357 (446) T ss_pred HHhccc-cceeeeecccCCCCceEEeeccccCChhhHHHHHHHHHHHHHHHHhccc-ccccccccccchhhhHHHHHHHH Confidence 333223 233333 2334568777665443 46678888888987765553 23355432 4445566666777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEE-eCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCH---HHHH Q lcl|NC_019527. 394 YDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFK-FKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDP---SEAR 469 (516) Q Consensus 394 yd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~-f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~---~e~r 469 (516) .+.+++-.+....-+.+.|+.-|+.-.||....-.... ..|-....+ ..+ .++.|++++.+++.|++.+ +.+| T Consensus 358 ~d~~~aDa~~i~~tln~~Li~~l~~lNf~~~~~~~~~~~~~~~~~~~e--~eD-l~~~a~~~~~L~~~G~~~p~~~~~ir 434 (446) T protein:vir:98 358 DGKINSIFDTVIHAFTEQVIGNLIRLNFDPALYPLASNTGYITRLPGR--ATD-LAALVEAIKQMHDMGFLVDGDKDHIR 434 (446) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccceeccCC--hhh-HHHHHHHHHHHHhCCccccccHHHHH Confidence 78888887765555556788777766666432111110 111112222 223 3457999999999998653 3477 Q ss_pred HHHHhhhccCCCCCChhhhcc Q lcl|NC_019527. 470 QQLSDDPDSGWDNIDGDLEIV 490 (516) Q Consensus 470 ~~l~~~~~~~~~~~d~~~e~~ 490 (516) +.+ |++.-+ +.+ T Consensus 435 e~~------giP~~~---~~~ 446 (446) T protein:vir:98 435 SIT------GLPDAI---SST 446 (446) T ss_pred HHh------CcCCCC---CCC Confidence 776 333222 222 No 226 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=98.24 E-value=2.1e-06 Score=51.70 Aligned_cols=432 Identities=14% Similarity=0.054 Sum_probs=185.3 Q ss_pred Hh--hcCCCccccccCCC-CCCCccCCCccchhcccccccchhhhcccccCCcccccccC-----cccHHHHHHHHhCch Q lcl|NC_019527. 42 ME--RRASDAATKWAPPQ-LMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQP-----FPGYQNLAALATRPE 113 (516) Q Consensus 42 ~~--~~~~~~~~~~~~~~-~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~-----f~gy~ll~~y~~~~i 113 (516) |. +++-+...+|.+.. ..|.++|+..+.+++ +++..-.++.+.. ..+.+ -.|++. =-.++- T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~-------aY~l~~~~y~n~~-~~~~~~lrg~~~~~~r---~~~~ps 69 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLA-------SYRLYEDMYLTNT-SDYQVILRGGDEGDQR---PIYVPN 69 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHH-------HHHHHHHHhcCch-hheeeecCCccccccc---eeeehh Confidence 21 22222223333321 234555555555554 2222111111110 00000 001100 001122 Q ss_pred hhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCc--ccCc Q lcl|NC_019527. 114 YRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADV--SVPL 191 (516) Q Consensus 114 ~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~--~~Pl 191 (516) ++++|+.+-. .+-.|.+......+ ...-..++.-+++=++..++.++-+|.-+-|.++..+.-|.... ..|- T Consensus 70 ~~~~~~~~~~-~~~~g~~~~~~~~~-----e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~ 143 (527) T protein:vir:10 70 GEKLIEAKMR-FLGQGLKWEFSKKD-----AKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLS 143 (527) T ss_pred hHHhhCCcce-eeccCccccccchh-----HHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCce Confidence 3455544421 12233332221111 11224455566666788899999999999998887766543221 0000 Q ss_pred --cccccc-------ccccceeeEEeecceeeccccc---c----------cccccccc-ccC----cceeEE------- Q lcl|NC_019527. 192 --ILDPRT-------IKKGSLTGFSNIEPMWTSPSAY---N----------ALDPTAPD-FYK----PSTWWV------- 237 (516) Q Consensus 192 --~ld~~~-------I~~g~l~~l~v~d~~~v~p~~~---~----------~~dp~s~~-yg~----P~~y~v------- 237 (516) .+||.. -..+.+.++...+-|+.-...- . -.|...|- .|. -..|.+ T Consensus 144 v~~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~ 223 (527) T protein:vir:10 144 LHEVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRP 223 (527) T ss_pred EeecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeecccccccc Confidence 011111 1122344444443332211100 0 00111111 111 111221 Q ss_pred -------------eeeEeccc-------eEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019527. 238 -------------LGREMHAS-------RLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSR 297 (516) Q Consensus 238 -------------~g~~iH~S-------Rli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~ 297 (516) .+..+|+. =++||.+. .+.+..||.|-|+.+..-+....++..-..-++.-..+ T Consensus 224 e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~------p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~ 297 (527) T protein:vir:10 224 ESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGH------PIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGL 297 (527) T ss_pred ccccchhhhhhhcCceeeecccCCCCccceEeecCC------CccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCC Confidence 11223322 12233222 24456899999998777776666665544444444556 Q ss_pred ceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEec--ccCCHHHHHHHHHHHHHhhhcCCceeee Q lcl|NC_019527. 298 TFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNT--PLSGLADLQSQSQEHMCSVSKIPAIKLT 375 (516) Q Consensus 298 ~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~--~lsgl~d~~~~~~~~iaaas~IP~t~L~ 375 (516) .++.+........ .++. +...=.-|.++-.+++-++..++. .++++.+.++.+++.|+..+++|.+-+ T Consensus 298 Pi~~~tg~~~vd~-~G~~--------~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~- 367 (527) T protein:vir:10 298 GFYATDSAPPRDS-RGNM--------VPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAV- 367 (527) T ss_pred ceeeecccccccc-cCCc--------CccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeee- Confidence 6666644433321 1110 001112334443355677877775 456788889999999999999998655 Q ss_pred ccccccccccchHHHHHHHHHHHHHHHHH--HHHHHHHHHH-HH--H---HHhCCCcC----CcceEEeCCCCCCCHHHH Q lcl|NC_019527. 376 GISPSGLNASSEGEIRSFYDDISSVQQSY--YFSPLDTMLK-VI--Q---LSKWGEID----DAITFKFKSLWQTSAKEE 443 (516) Q Consensus 376 G~sp~Glnatge~D~~~yyd~I~~~Qe~~--l~p~l~~l~~-~l--~---~s~~g~~~----~d~~~~f~pL~~~sekEk 443 (516) |.--.+-+.||-.=.-.+--.+++.|+.. ++-++.++.. .+ + +..++.-+ -.+.+.|-|....++++. T Consensus 368 G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~av 447 (527) T protein:vir:10 368 GVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKR 447 (527) T ss_pred ccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHH Confidence 52211223334322222333444444443 2333332211 11 0 11222222 156899999977777665 Q ss_pred HHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccc-----hhc--------CCCCCCCCCCCC Q lcl|NC_019527. 444 SEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMF-----DDD--------GADPYMPDPDVL 510 (516) Q Consensus 444 Aei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~-----~~e--------~~~~~~~~~~~~ 510 (516) .+ ....++++|+++.+-+.++|.+-+ ++..-+.+.+...++.. ..+ -.+..+. ++++ T Consensus 448 ie-------~v~tL~~aGi~S~~tAv~~L~~~~--g~eD~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~-~~~~ 517 (527) T protein:vir:10 448 FN-------QLLQLWEAGLIPAKKLTEELSKIM--GFELTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGI-PDEE 517 (527) T ss_pred HH-------HHHHHHHcCchhHHHHHHHHHhcc--CCCChHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCC-CCCC Confidence 44 446689999999999998885422 22221111111111100 000 0011111 1111 Q ss_pred CCCCCC Q lcl|NC_019527. 511 PGEEGS 516 (516) Q Consensus 511 ~~~e~t 516 (516) ++.-+- T Consensus 518 ~d~~~~ 523 (527) T protein:vir:10 518 DDQALN 523 (527) T ss_pred cccccC Confidence 111111 No 227 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=98.24 E-value=2.2e-06 Score=51.62 Aligned_cols=432 Identities=14% Similarity=0.050 Sum_probs=185.2 Q ss_pred Hh--hcCCCccccccCCC-CCCCccCCCccchhcccccccchhhhcccccCCcccccccC-----cccHHHHHHHHhCch Q lcl|NC_019527. 42 ME--RRASDAATKWAPPQ-LMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQP-----FPGYQNLAALATRPE 113 (516) Q Consensus 42 ~~--~~~~~~~~~~~~~~-~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~-----f~gy~ll~~y~~~~i 113 (516) |. +++-+...+|.+.. ..|.++|+..+.+++ +++..-.++.+.. ..+.+ -.|++. =-.++- T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~-------aY~l~~~~y~n~~-~~~~~~lrg~~~~~~r---~~~~ps 69 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLA-------SYRLYEDMYLTNT-SDYQVILRGGDEGDQR---PIYVPN 69 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHH-------HHHHHHHHhcCch-hheeeecCCccccccc---eeeehh Confidence 21 22222223333321 234555555555554 2222111111110 00000 001100 001122 Q ss_pred hhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCc--ccCc Q lcl|NC_019527. 114 YRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADV--SVPL 191 (516) Q Consensus 114 ~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~--~~Pl 191 (516) ++++|+.+-. .+-.|.+......+ ...-..++.-+++=++..++.++-+|.-+-|.++..+.-|.... ..|- T Consensus 70 ~~~~~~~~~~-~~~~g~~~~~~~~~-----e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~ 143 (527) T protein:vir:10 70 GEKLIEAKMR-FLGQGLKWEFSKKD-----AKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLS 143 (527) T ss_pred hHHhhCCcce-eeccCccccccchh-----HHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCce Confidence 3455544421 12233332221111 11224455566666788899999999999998887766543221 0000 Q ss_pred --cccccc-------ccccceeeEEeecceeeccccc---c----------cccccccc-ccC----cceeEE------- Q lcl|NC_019527. 192 --ILDPRT-------IKKGSLTGFSNIEPMWTSPSAY---N----------ALDPTAPD-FYK----PSTWWV------- 237 (516) Q Consensus 192 --~ld~~~-------I~~g~l~~l~v~d~~~v~p~~~---~----------~~dp~s~~-yg~----P~~y~v------- 237 (516) .+||.. -..+.+.++...+-|+.-...- . -.|...|- .|. -..|.+ T Consensus 144 v~~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~ 223 (527) T protein:vir:10 144 LHEVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRP 223 (527) T ss_pred EeecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeecccccccc Confidence 011111 1122344444443332211100 0 00111111 111 111221 Q ss_pred -------------eeeEeccc-------eEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_019527. 238 -------------LGREMHAS-------RLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSR 297 (516) Q Consensus 238 -------------~g~~iH~S-------Rli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~ 297 (516) .+..+|+. =++||.+. .+.+..||.|-|+.+..-+....++..-..-++.-..+ T Consensus 224 e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~------p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~ 297 (527) T protein:vir:10 224 ESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGH------PIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGL 297 (527) T ss_pred ccccchhhhhhhcCceeeecccCCCCccceEeecCC------CccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCC Confidence 11223322 12233222 24456899999998777776666665544444444556 Q ss_pred ceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEec--ccCCHHHHHHHHHHHHHhhhcCCceeee Q lcl|NC_019527. 298 TFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNT--PLSGLADLQSQSQEHMCSVSKIPAIKLT 375 (516) Q Consensus 298 ~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~--~lsgl~d~~~~~~~~iaaas~IP~t~L~ 375 (516) .++.+........ .++. +...=.-|.++-.+++-++..++. .++++.+.++.+++.|+..+++|.+-+ T Consensus 298 Pi~~~tg~~~vd~-~G~~--------~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~- 367 (527) T protein:vir:10 298 GFYATDSAPPRDS-RGNM--------VPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAV- 367 (527) T ss_pred ceeeecccccccc-cCCc--------CccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeee- Confidence 6666644433321 1110 001112334443355677877775 456788889999999999999999655 Q ss_pred ccccccccccchHHHHHHHHHHHHHHHHH--HHHHHHHHHH-HH--H---HHhCCCcC----CcceEEeCCCCCCCHHHH Q lcl|NC_019527. 376 GISPSGLNASSEGEIRSFYDDISSVQQSY--YFSPLDTMLK-VI--Q---LSKWGEID----DAITFKFKSLWQTSAKEE 443 (516) Q Consensus 376 G~sp~Glnatge~D~~~yyd~I~~~Qe~~--l~p~l~~l~~-~l--~---~s~~g~~~----~d~~~~f~pL~~~sekEk 443 (516) |.--.+-+.||-.=.-.+--.+++.|+.. ++-++.++.. .+ + +..++.-+ -.+.+.|.|....++++. T Consensus 368 G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~av 447 (527) T protein:vir:10 368 GVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKR 447 (527) T ss_pred ccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHH Confidence 52211223334322222333444444443 2333332211 11 0 11222222 156899999977776665 Q ss_pred HHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccc-----hhc--------CCCCCCCCCCCC Q lcl|NC_019527. 444 SEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMF-----DDD--------GADPYMPDPDVL 510 (516) Q Consensus 444 Aei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~-----~~e--------~~~~~~~~~~~~ 510 (516) .+ ....++++|+++.+-+.++|.+-+ ++..-+.+.....++.. ..+ -.+..+. ++++ T Consensus 448 ie-------~v~tL~~aGiiS~etAv~~L~~~~--g~eD~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~-~~~~ 517 (527) T protein:vir:10 448 FA-------QLLELWEAGLIPAKKLTEELSKIM--GFELTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGI-PDEE 517 (527) T ss_pred HH-------HHHHHHHcCchhHHHHHHHHHhcc--CCCchHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCC-CCCC Confidence 44 446689999999999988885422 22221111111111100 000 0011111 1111 Q ss_pred CCCCCC Q lcl|NC_019527. 511 PGEEGS 516 (516) Q Consensus 511 ~~~e~t 516 (516) ++.-+- T Consensus 518 ~d~~~~ 523 (527) T protein:vir:10 518 DDQALN 523 (527) T ss_pred cccccC Confidence 111111 No 228 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=97.96 E-value=9.2e-06 Score=48.19 Aligned_cols=486 Identities=15% Similarity=0.109 Sum_probs=189.0 Q ss_pred CCcchhhhhhhhcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCcc--ccccCCCCCCCc-cCCCccchhcccccc Q lcl|NC_019527. 1 MWPFDRKKFKREVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAA--TKWAPPQLMPGV-VPAGTTPAVAMDSLC 77 (516) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~gv-~~~~~~~~~a~ds~~ 77 (516) |=.- |+....+.++.+-+.. +...|+ .-+..++..+.-... .+--.|....+. ++..-.+..+.|++. T Consensus 1 ~~~~--~~~~~~~~~~~~~~~~---~~~~~~----~~~~~~~~~~~~~~~~~s~~g~p~~~~~~~~~~~~~~t~~~D~~~ 71 (569) T protein:vir:10 1 MADN--KITLSSVRKALAGVFK---DNGERD----NILLSALAVHGGSGYLFSRAGAPVQLSGFLGGKPGDSGMAGDGLV 71 (569) T ss_pred CCcc--hhHHHHHHHHHhhhhh---cCCccc----hhhhhhheeecCcceEEeecCcchhhhhhhccCccccchhhhhHH Confidence 2211 1111111111110000 000000 000011100000000 000001111111 001111123333332 Q ss_pred cchhhhcccccCCcccccccCcccHHHH---HHHHhCchhhhhhhhhhHHHh----hCCCe--eeec---cccchhhhHH Q lcl|NC_019527. 78 GPTYQFLNSAAGGLYAADIQPFPGYQNL---AALATRPEYRAFASTLSTELT----REGIE--ITSK---DRTKAKEMAS 145 (516) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~f~gy~ll---~~y~~~~i~r~iVd~~aed~~----r~~~~--i~~~---~~~~~~~~~~ 145 (516) .+.....+.-- -+....|++ ......+.+..+.++...-++ +.|-. |+.. ++.+.+. T Consensus 72 ~g~~~~~~~~~--------~pr~R~qiY~~~eeM~~~p~Ia~AlniHVtaALggde~TGd~vfI~p~~~~~~a~~da--- 140 (569) T protein:vir:10 72 DGSRFIFDEVQ--------LPEDRLQRYPLLEEMAVYSTIATALNIHITHALSFDKKTGQTFSIVPVHNGNDSDYDA--- 140 (569) T ss_pred HHHHHHhhhcc--------CchhHHHHHHHHHHHhcCchhhhhhhhhhheeecccccccceEEEEeecCCCCCcchH--- Confidence 22100000000 012233333 333456777777777665555 22333 3221 1111111 Q ss_pred HHHHHHHHHHh-cC--hhHHHHHHHHhcccceeeEEEEEecCCC---------cccCcccccccccccceeeEEeeccee Q lcl|NC_019527. 146 KIKELEEACEY-YG--VMGIIQKAAEHDCFFGRGQISINIKGAD---------VSVPLILDPRTIKKGSLTGFSNIEPMW 213 (516) Q Consensus 146 ~i~~i~~~~~~-l~--~~~~l~ea~~~~rlyG~a~i~i~i~~~~---------~~~Pl~ld~~~I~~g~l~~l~v~d~~~ 213 (516) -+++.+++.. +. +.+.+-...++.-.||.||+=|-.+.+. ...|-.+-| --..|...||...++.. T Consensus 141 -akai~~el~~dl~~~iNr~~~~lA~~~~aFGdsYaRiY~~~~~GV~dl~~s~yt~PsfIqp-FE~g~~tvGF~~~~~~~ 218 (569) T protein:vir:10 141 -AQALCGELMNDIGRTINKEVAGWAFIMSVFGVAYVRPYAKEGIGITSFECSYYTLPSFIKE-FEVSGNLAGFSGDYLKD 218 (569) T ss_pred -HHHHHHHHHHHHHHHHHHHhhHHHHHHHhhhhhheeeeccCCceeEEEEecccccccccch-hhhcCceEEeecccCCc Confidence 1233333332 22 4555556778888899998776665432 222322211 00223445554444333 Q ss_pred eccccccccccc-cccccCcceeEEee-eEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_019527. 214 TSPSAYNALDPT-APDFYKPSTWWVLG-REMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSD- 290 (516) Q Consensus 214 v~p~~~~~~dp~-s~~yg~P~~y~v~g-~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~- 290 (516) .+..... .+|- --.+-.|.+-.+.+ +.||.-..-.+--.++++....--.-.|-|+|+.+++...+...+..+... T Consensus 219 ~~~ti~~-l~p~qm~rmKmPrm~~i~q~~~v~~g~~~~~L~~d~~~~~Pi~psn~GgSFL~~ae~pf~~l~~Al~sL~~q 297 (569) T protein:vir:10 219 ASGKMVF-ADPWAIIPMKIPYWRPKSNLMPVHTGHKAYSLLDNPEERTPIETQNYGTSLLEYAYEPYMNLRSAIRSLKAT 297 (569) T ss_pred cccceee-echhhhhhhcccceeeccccchhhhhhhheeecccccccccccchhhhhHHHHHHHhHHHHHHHHHHhccch Confidence 2211110 0110 00122332212222 233332221111122333322223346889999999998888777766322 Q ss_pred -HHHHhCCceeeec---chhhhcCc---cHHHHHHHHHH-HHH-hcCCcce------EE-EecCCc-----ceeEEeccc Q lcl|NC_019527. 291 -LVDKFSRTFLKTN---MAQVLNGG---EGGDVFDRVEM-YVN-MQSNLGL------AV-MDFDSE-----DIVQVNTPL 349 (516) Q Consensus 291 -Ll~~~~~~v~k~~---~~~~l~~~---~~~~l~~r~~~-~~~-~~sn~g~------~~-id~~~e-----~~e~~~~~l 349 (516) ++-.....++..+ |...-... .-.++.+|... +.. .++...+ ++ +-++.. +..+...+. T Consensus 298 ri~dSv~~~~Itlnm~gM~p~qr~~y~r~lt~~LKr~~d~ie~a~~gg~~~~~~~~H~LPv~gekq~~~tvDt~~~~A~~ 377 (569) T protein:vir:10 298 RFNASKIDRIIGLAMNSLDPVKAADYSRTITQTLKRAADLMERRARGANNMPTVTNTLLPIMGDGKGQMTIDTQTIQADI 377 (569) T ss_pred hhHHHHHhHHhhccccCCCHHHHhHHHHHHHHHHHHHHHHHHHHhccCccccccceeeeeeecCccccccccccccccCc Confidence 2222222222221 11110000 01223333222 111 1222111 11 112211 233444556 Q ss_pred CCHHHHHHHHHHHHHhhhcCCceeee--ccccccccccchHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHhCC Q lcl|NC_019527. 350 SGLADLQSQSQEHMCSVSKIPAIKLT--GISPSGLNASSEGEIRSFYDDISSVQQSY-----YFSPLDTMLKVIQLSKWG 422 (516) Q Consensus 350 sgl~d~~~~~~~~iaaas~IP~t~L~--G~sp~Glnatge~D~~~yyd~I~~~Qe~~-----l~p~l~~l~~~l~~s~~g 422 (516) .|++|++ +...++||+.||-.+.|- -+-.+||+.+|- |.-.+++.++.. +...+++++++-+-.++| T Consensus 378 ~gIEdvM-~~~R~LagaLGlD~SMlGwAD~LsGGLGeGG~-----frtSaQaa~RS~~iRqa~~e~in~iidiH~~fKYg 451 (569) T protein:vir:10 378 NGIEDIL-TYMRQLAAALGLDYTLLGWADQMSGGLGEGGF-----LRTAIQAAMRASWIQQGVEEFIQRAIDIHLAFKYG 451 (569) T ss_pred ccHHHHH-HHHHHHHhhhccchhHhhHHHHhcccccccHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcC Confidence 6787766 677889999999988661 223678876664 444444444332 355677888888888888 Q ss_pred CcCC----cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH-------cCCCCHHH-HHHHHHhhhccCCCC-CChhh-h Q lcl|NC_019527. 423 EIDD----AITFKFKSLWQTSAKEESEIRFNKAQEAQIYIT-------NSVIDPSE-ARQQLSDDPDSGWDN-IDGDL-E 488 (516) Q Consensus 423 ~~~~----d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~-------~gvi~~~e-~r~~l~~~~~~~~~~-~d~~~-e 488 (516) ++.+ -|.|+|++..+.=+.|..+....++.+..+.+| +.++-.+| +...+-.+.. ++.. +.+.. . T Consensus 452 evf~~~drP~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~Q~la~l~e~n~Lg~de~~m~y~l~d~~-~~De~~~e~l~a 530 (569) T protein:vir:10 452 KVYPEGDRPYKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDAFKRYLFSDVL-EIDEKISEALVN 530 (569) T ss_pred cccCCCCcceEEEeccchHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccHHHHHHHHHHHh-hcchhHHHHHHh Confidence 8742 399999999888788877777666666655443 33333333 2333322221 1111 11111 0 Q ss_pred ccccccchhcC------CCCCCC----CCCCCCCCCCC Q lcl|NC_019527. 489 IVQPEMFDDDG------ADPYMP----DPDVLPGEEGS 516 (516) Q Consensus 489 ~~~~e~~~~e~------~~~~~~----~~~~~~~~e~t 516 (516) ...+...|++- ..||.. -+...+.+.+. T Consensus 531 e~~akp~DEe~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 568 (569) T protein:vir:10 531 ELKAKSEDDDHLMDSIIKTPPQELAQILESVFKEGNDN 568 (569) T ss_pred hcCCCcchhHHHHHHHhcCChHHHHHHHHHHhhccCCC Confidence 00011111110 011100 00000000000 No 229 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=97.95 E-value=9.6e-06 Score=48.09 Aligned_cols=417 Identities=11% Similarity=0.056 Sum_probs=174.4 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCccccc----ccCcccHHHHHHHHhCchhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAAD----IQPFPGYQNLAALATRPEYRAF 117 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~----~~~f~gy~ll~~y~~~~i~r~i 117 (516) |.++...... -+|. +-++.+. ..+.+++ ......+..+.|....+ .+.--+..+...+....-+..+ T Consensus 1 m~k~~~k~~~-~~~~--~~~~~~~-~~~~~~~-----~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m~~D~hi~s~ 71 (448) T protein:vir:79 1 MAKRGRKPKE-LVPG--PGSIDPS-DVPKLEG-----ASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKMLSDGTVKNA 71 (448) T ss_pred CCCCCCCCcc-ccCc--ccccccc-cchhhhh-----hhhhhcccccccccccchhHhhccccchHHHHHHhhChHHHHH Confidence 3222222110 0110 0001000 0111110 00011111111111000 1111123444445568889999 Q ss_pred hhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHhcC------hhHHHHHHHHhcccceeeEEEEE--e-cCCCcc Q lcl|NC_019527. 118 ASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEYYG------VMGIIQKAAEHDCFFGRGQISIN--I-KGADVS 188 (516) Q Consensus 118 Vd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~------~~~~l~ea~~~~rlyG~a~i~i~--i-~~~~~~ 188 (516) +++...-+++.-|.|...+++..+ .+..+.+.+.++... -+..+...+-.+.+||.+++=+. . .++.+ T Consensus 72 l~~Rk~av~~~~w~v~p~~~~~~~--~~~ae~v~~~l~~~~~~~~~~~f~~~~~~~lda~~~G~s~~Eivw~~~~~g~~- 148 (448) T protein:vir:79 72 LNYIFGRIRSAKWYVEPASTDPED--IAIAAFIHAQLGIDDASVGKYPFGRLFAIYENAYIYGMAAGEIVLTLGADGKL- 148 (448) T ss_pred HHHHHHHHhcCCceEecCCCCHHH--HHHHHHHHHHhhhhhhhhccCCHHHHHHHHHHhhhhcceeEEEEeeecCCCce- Confidence 999999889888888754333221 112223333333211 13344455666899999985443 2 12211 Q ss_pred cCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCc Q lcl|NC_019527. 189 VPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGI 268 (516) Q Consensus 189 ~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~ 268 (516) .|..+.+. ..-.+..| .+++..- .......++.. +. .....+..+...+++++... ...+.+|. T Consensus 149 ~~~~l~~r--~~~~~~~f-~~~~d~~-l~~~~~~~~~~---~~--~~~~~~~~lP~~~~i~~~~~-------~~g~p~g~ 212 (448) T protein:vir:79 149 ILDKIVPI--HPFNIDEV-LYDEEGG-PKALKLSGEVK---GG--SQFVSGLEIPIWKTVVFLHN-------DDGSFTGQ 212 (448) T ss_pred eccccccc--CCccccce-eeecCCc-eEEeecCCccc---cc--ccCCCccccccceEEEEecC-------ccCCcccc Confidence 11111100 00001111 1111100 00000001100 00 01112344555666666432 12356799 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--eeeecchhhhcCccHHHHHHHHHHHHHhcC-CcceEEEecCCcceeEE Q lcl|NC_019527. 269 SMSQLAQPYVENWLRTRQSVSDLVDKFSRT--FLKTNMAQVLNGGEGGDVFDRVEMYVNMQS-NLGLAVMDFDSEDIVQV 345 (516) Q Consensus 269 S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~--v~k~~~~~~l~~~~~~~l~~r~~~~~~~~s-n~g~~~id~~~e~~e~~ 345 (516) +++..|+-...--..+...-+.++.+++++ +.|.+-. . ..+.+....-.+++...+. .....++. ++.+++.+ T Consensus 213 gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~g--a-~~~~~~~~~l~~av~~i~~g~~a~~iiP-~~~~ie~~ 288 (448) T protein:vir:79 213 SALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKS--V-RQGTKQWEAAKEIVKNFVQKPRHGIILP-DDWKFDTV 288 (448) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCC--C-CcCHHHHHHHHHHHHHHhcCCceEEEec-CCceEEEE Confidence 999999988777777777778999999966 4454311 1 1222233333344444432 23334444 45788888 Q ss_pred ecccC--CHHHHHHHHHHHHHhhhcCCceeeeccc-cc----cccccchHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 346 NTPLS--GLADLQSQSQEHMCSVSKIPAIKLTGIS-PS----GLNASSEGEI-RSFYDDISSVQQSYYFSPLDTMLKVIQ 417 (516) Q Consensus 346 ~~~ls--gl~d~~~~~~~~iaaas~IP~t~L~G~s-p~----Glnatge~D~-~~yyd~I~~~Qe~~l~p~l~~l~~~l~ 417 (516) +..-+ ...++++..-++|+-+ ++|++ ++ |-.+.+.++. ....+.+++-.+....-+.+.|+.-|+ T Consensus 289 ea~~~~~~~~~~i~~~d~~Isk~-------iLGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~ 361 (448) T protein:vir:79 289 DLKSAMPDAIPYLTYHDAGIARA-------LGIDFNTVQLNMGVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPKLV 361 (448) T ss_pred ecCCCcccHHHHHHHHHHHHHHH-------HhhhhhccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 76532 3455666666666633 23432 11 2211111121 223344444433333333345666555 Q ss_pred HHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchh Q lcl|NC_019527. 418 LSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDD 497 (516) Q Consensus 418 ~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~ 497 (516) .-.||...+--.|.|...- ..|+ ++.|+++.++++.+-+..+-+++.+ +.+. ....++... T Consensus 362 ~lNfg~~~~~P~~~f~~~e------~~Dl-~~~a~~~~~l~~~~~~~~~~~~~~~------~~p~------~~~~~~~~a 422 (448) T protein:vir:79 362 LPNWPSATRFPRLTFEMEE------RNDF-SAAANLMGMLINAVKDSEDIPTELK------ALID------ALPSKMRRA 422 (448) T ss_pred HhcCCCcCCCcEEEecCCC------hHHH-HHHHHHhhhhhccchhhHHHHHHhh------cCCC------CCCCccccc Confidence 5556644322356664221 1122 3357777778776543333334332 1110 111111111 Q ss_pred cCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 498 DGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 498 e~~~~~~~~~~~~~~~e~t 516 (516) ....++..+....|+.+++ T Consensus 423 ~~~~~~~~~~~~~~~~~~~ 441 (448) T protein:vir:79 423 LGVVDEVREAVRQPADSRY 441 (448) T ss_pred cCCCCcccccccCCccccc Confidence 1111222233345555555 No 230 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=97.65 E-value=3.3e-05 Score=45.17 Aligned_cols=419 Identities=11% Similarity=0.060 Sum_probs=164.7 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCccccc----ccCcccHHHHHHHHhCchhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAAD----IQPFPGYQNLAALATRPEYRAF 117 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~----~~~f~gy~ll~~y~~~~i~r~i 117 (516) |.++....... .| .+|.+-....+.+++ ......+..+.|....+ .+.--+..+...+....-+..+ T Consensus 1 m~kk~~k~~~~-~~---~~~~~~~~~~~~~~~-----~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m~~D~hi~s~ 71 (448) T protein:vir:77 1 MAKRGRKPKEL-VP---GPGSIDPSDVPKLEG-----ASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKMLSDGTVKNA 71 (448) T ss_pred CCCCCCCCccc-CC---cccccchhhhhhhcc-----chhhhcccccccccccchhHhhccccchHHHHHHhhChHHHHH Confidence 32222221000 00 011100000001110 00000111111111000 0111123444444568888899 Q ss_pred hhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHH-------hcChhHHHHHHHHhcccceeeEEEEE--e-cCCCc Q lcl|NC_019527. 118 ASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACE-------YYGVMGIIQKAAEHDCFFGRGQISIN--I-KGADV 187 (516) Q Consensus 118 Vd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~-------~l~~~~~l~ea~~~~rlyG~a~i~i~--i-~~~~~ 187 (516) +++...-+++.-|.|...+++..+. +..+.+++.+. ++.+.+.+.+ +-.+.+||.|++=+. . .++.+ T Consensus 72 l~~Rk~av~~~~w~v~p~~~~~~d~--~~ae~v~~~l~~~~~~~~~~~f~~~i~~-~lda~~~G~s~~Eivw~~~~dg~~ 148 (448) T protein:vir:77 72 LNYIFGRIRSAKWYVEPASTDPEDI--AIAAFIHAQLGIDDASVGKYPFGRLFAI-YENAYIYGMAAGEIVLTLGADGKL 148 (448) T ss_pred HHHHHHHHhcCCceEecCCCCHHHH--HHHHHHHHHhhchhhhhccCCHHHHHHH-HHHhhhhcceeEEEEEeecCCCce Confidence 9999988888888886533322211 11122333332 2344444544 457999999985332 2 12211 Q ss_pred ccCcccccccccccceeeEEeecceeeccccccccccccccccCcceeEEeeeEeccceEEEecCCcchhhhhhccCCCC Q lcl|NC_019527. 188 SVPLILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFYKPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSG 267 (516) Q Consensus 188 ~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G 267 (516) .|..+.+. ..-.+..| .+++..-- ......++. -+.+ ....+..+...+++++... ...+.+| T Consensus 149 -~~~~l~~r--~~~~~~~f-~~~~~~~l-~~~~~~~~~---~~~~--~~~~~~~lP~~~~i~~~~~-------~~g~p~g 211 (448) T protein:vir:77 149 -ILDKIVPI--HPFNIDEV-LYDEEGGP-KALKLSGEV---KGGS--QFVNGLEIPIWKTVVFLHN-------DDGSFTG 211 (448) T ss_pred -eecccccc--CCCcccee-eeecCCce-EEEecCCcc---cccc--cCCCccccccceEEEEecC-------CcCCccc Confidence 11111100 00011111 11111000 000000100 0000 0112334555666665432 2235689 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhCCce--eeecchhhhcCccHHHHHHHHHHHHHhcC-CcceEEEecCCcceeE Q lcl|NC_019527. 268 ISMSQLAQPYVENWLRTRQSVSDLVDKFSRTF--LKTNMAQVLNGGEGGDVFDRVEMYVNMQS-NLGLAVMDFDSEDIVQ 344 (516) Q Consensus 268 ~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v--~k~~~~~~l~~~~~~~l~~r~~~~~~~~s-n~g~~~id~~~e~~e~ 344 (516) .+++..|+-...--..+...-+.++.++++++ .|.+- .+..+.+....-++++...+. ..+..++. ++.+++. T Consensus 212 ~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~---ga~~~~~~~~~l~~av~~i~~g~~a~~iiP-~g~~ie~ 287 (448) T protein:vir:77 212 QSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPK---SVRQGTKQWEAAKEIVKNFVQKPRHGIILP-DDWKFDT 287 (448) T ss_pred chHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCC---CCCCCHHHHHHHHHHHHHHhcCCceEEEec-CCceEEE Confidence 99999999877666667777788999988764 44421 111222333333344443332 22334444 4578888 Q ss_pred EecccC--CHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019527. 345 VNTPLS--GLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEI-RSFYDDISSVQQSYYFSPLDTMLKVIQLSKW 421 (516) Q Consensus 345 ~~~~ls--gl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~-~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~~ 421 (516) ++..-+ ...++++..-.+|+-+.--. |.-.+ +-+|..+.+.++. ....+.+++--+....-+.+.|+.-++.-.| T Consensus 288 ~ea~~~~~~~~~~i~~~d~~Isk~iLGq-tlTs~-~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lNf 365 (448) T protein:vir:77 288 VDLKSAMPDAIPYLTYHDAGIARALGID-FNTVQ-LNMGVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNW 365 (448) T ss_pred EecCCCccCHHHHHHHHHHHHHHHHhcc-ccccc-cccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 876532 34556666666776433111 11111 1122222122222 2333444444433334344457665555556 Q ss_pred CCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhcCCC Q lcl|NC_019527. 422 GEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDDGAD 501 (516) Q Consensus 422 g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e~~~ 501 (516) |...+--.|.|...-..+-+ +.|+.+..+++ .+++.+.-. ...++..+............+ T Consensus 366 g~~~~~P~~~f~~~e~eDl~-------~~a~~~~~l~~-------~~~~~~~ip-----~~~~~~~~~~~~~~~~~~~~~ 426 (448) T protein:vir:77 366 PGATRFPRLTFEMEERNDFS-------AAANLMGMLIN-------AVKDSEDIP-----TELKALIDALPSKMRRALGVV 426 (448) T ss_pred CCCCCCCEEEecCCChhhHH-------HHHHHhHHHHH-------HHHHHhcCC-----ccCCcCCCCCchhcccccCCC Confidence 64432235666533222222 34555555542 244443211 111111111111110000011 Q ss_pred CCCCCCCCCCCCCCC Q lcl|NC_019527. 502 PYMPDPDVLPGEEGS 516 (516) Q Consensus 502 ~~~~~~~~~~~~e~t 516 (516) + +++.+....-+| T Consensus 427 ~--~~~~~~~~~~~~ 439 (448) T protein:vir:77 427 D--EVREAVRQPADS 439 (448) T ss_pred C--CCCchhhcchhh Confidence 1 111111112222 No 231 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=97.36 E-value=8.5e-05 Score=42.91 Aligned_cols=439 Identities=11% Similarity=0.094 Sum_probs=171.0 Q ss_pred HhhcCCCccccccCCCCCCCccCCCccchh-cccccccchhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhh Q lcl|NC_019527. 42 MERRASDAATKWAPPQLMPGVVPAGTTPAV-AMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFAST 120 (516) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~-a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~ 120 (516) |.. -..+|.|++. ..|++....+ ++|-..-.+++..-.++.+..+...--..|=. ..--..+-+|++|++ T Consensus 1 m~~----~~~q~~p~~~---~fp~~~a~wV~~~D~~RlaaY~ly~d~y~n~~~el~~il~G~d--r~~~~~ps~r~~V~~ 71 (563) T protein:vir:74 1 MPY----NHKQYDPAKP---FLRGGDDNIVDENDKNRVRAYDLYENIYLNSAETLKLVLRGDD--SVPILMPSGRKIVEA 71 (563) T ss_pred CCc----cccccCCCcc---cccccccccCCHHHHHHHHHHHHHHHhhcCchhhhhhhcCCCc--eeeeccchHHHHHHH Confidence 211 1234444432 1222222222 23322222233222222222111000000000 000012347899999 Q ss_pred hhHHHhhCCCeeeeccccchhhhHHHH-HHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCCcc-cCc---cccc Q lcl|NC_019527. 121 LSTELTREGIEITSKDRTKAKEMASKI-KELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGADVS-VPL---ILDP 195 (516) Q Consensus 121 ~aed~~r~~~~i~~~~~~~~~~~~~~i-~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~-~Pl---~ld~ 195 (516) .. --+-.+.+|.......++...+.+ +.|+.-.++=++..++.++.+|+-+-|.++..+.-|..... +=+ .+|| T Consensus 72 ~~-~~Lg~~~~~~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R~rv~~vDP 150 (563) T protein:vir:74 72 VH-RFLGVGFDYLVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGERISVDEVDP 150 (563) T ss_pred HH-HhcCCCcEEecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCCceEeecCC Confidence 55 566888888654443332222222 34455555668889999999999999988877654431110 000 0111 Q ss_pred ccc----cccceeeEEeec---ceeeccc------------ccccccccccc-c-cCcceeEE----------------e Q lcl|NC_019527. 196 RTI----KKGSLTGFSNIE---PMWTSPS------------AYNALDPTAPD-F-YKPSTWWV----------------L 238 (516) Q Consensus 196 ~~I----~~g~l~~l~v~d---~~~v~p~------------~~~~~dp~s~~-y-g~P~~y~v----------------~ 238 (516) ..| .++...++..++ .|+.-.. ..++..+.-.. | ..-+.|.. . T Consensus 151 ~~~fp~~dpd~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~~~~~~~~~~ 230 (563) T protein:vir:74 151 RQIFLIEDGSTVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGAISDEQARRK 230 (563) T ss_pred ceeeeccCCCCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccccccCccchhhhccc Confidence 100 111111221111 1110000 00000000000 0 00011111 0 Q ss_pred eeEeccceE---------------EEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeec Q lcl|NC_019527. 239 GREMHASRL---------------LTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTN 303 (516) Q Consensus 239 g~~iH~SRl---------------i~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~ 303 (516) ...+|-+|. +||.+- .+.+.+||.|.|..+..-+....++.--.+-.+.-.++.++.+. T Consensus 231 ~~~~~~~~d~e~~~LP~pi~~iPiv~~~ti------p~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~ 304 (563) T protein:vir:74 231 EQVRSAQHDEEEEELPEPISQLPLYRWRNK------PPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTN 304 (563) T ss_pred chhhhhhhhchhhhccccccCccEEEcCCC------CCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEec Confidence 111232222 223222 24556899999987776666555554444444444556666654 Q ss_pred chhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCC-c--ceeEEec--ccCCHHHHHHHHHH-HHHhhhcCCceeeecc Q lcl|NC_019527. 304 MAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDS-E--DIVQVNT--PLSGLADLQSQSQE-HMCSVSKIPAIKLTGI 377 (516) Q Consensus 304 ~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~-e--~~e~~~~--~lsgl~d~~~~~~~-~iaaas~IP~t~L~G~ 377 (516) .+.--.... .++. -|+- .-|.++=.+++ + -++.++. +|+++..=++.+.. -++..+++|.+-+ |. T Consensus 305 ~~~p~d~~~-g~~~----~w~v---gpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~-G~ 375 (563) T protein:vir:74 305 ASAPVDPNT-GELT----DWNI---GPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAI-GR 375 (563) T ss_pred ccccccccc-cccc----cccc---CCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceee-cc Confidence 322111100 0111 1111 12222211221 1 2444433 23444444665555 5788899998654 52 Q ss_pred ccccccccc---hHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHH---Hh-----CCCc--CC--cceEEeCC Q lcl|NC_019527. 378 SPSGLNASS---EGEIRSFYDDISSVQQSYYFSPLDT--------MLKVIQL---SK-----WGEI--DD--AITFKFKS 434 (516) Q Consensus 378 sp~Glnatg---e~D~~~yyd~I~~~Qe~~l~p~l~~--------l~~~l~~---s~-----~g~~--~~--d~~~~f~p 434 (516) --.|-.-|| +-.+.-.-..|+.+. ..+.-.+.+ +++.+.+ .. +|.- |. .+++.|.| T Consensus 376 vD~~~~~SGiALeL~L~PL~a~~~ek~-l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p 454 (563) T protein:vir:74 376 VDVTSAESGISLELQLKPLLAANEEKE-LEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFAD 454 (563) T ss_pred cccccccchhhhhhhhhHHHHhhhhhH-HHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCC Confidence 211222222 222222222222221 112222222 2222212 11 1211 11 35788999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCC--hhhhccccccc-h-----hcCCCCCC-- Q lcl|NC_019527. 435 LWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNID--GDLEIVQPEMF-D-----DDGADPYM-- 504 (516) Q Consensus 435 L~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d--~~~e~~~~e~~-~-----~e~~~~~~-- 504 (516) +...+..+. .+-...++++|+|+.+.+.++|.+. ||+--| ...+..+.+.. + .+..++.+ T Consensus 455 ~~P~d~~~v-------v~~~~tl~~aGiiSretAv~~L~~~---g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~ 524 (563) T protein:vir:74 455 PMPVNKTQV-------TQDTLLLQQAHLILRKMAVAKLRSI---GWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLS 524 (563) T ss_pred CCCccHHHH-------HHHHHHHHHcCchhHHHHHHHHHhC---CCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccce Confidence 876655543 2344678999999999999988653 333222 11111111100 0 01011100 Q ss_pred -CCCCCCCCCCCC Q lcl|NC_019527. 505 -PDPDVLPGEEGS 516 (516) Q Consensus 505 -~~~~~~~~~e~t 516 (516) .....-+.++.- T Consensus 525 a~~~~g~~~~~~d 537 (563) T protein:vir:74 525 AMDNGGAGEQQFD 537 (563) T ss_pred ecccCCCCccccc Confidence 011111111111 No 232 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=94.92 E-value=0.0032 Score=34.28 Aligned_cols=417 Identities=13% Similarity=0.078 Sum_probs=165.6 Q ss_pred CCCccCCCc---cch--------hcccccccc--hhhhcccccCCcccccccCcccHHH-HHHHHhCchhhhhhhhhhHH Q lcl|NC_019527. 59 MPGVVPAGT---TPA--------VAMDSLCGP--TYQFLNSAAGGLYAADIQPFPGYQN-LAALATRPEYRAFASTLSTE 124 (516) Q Consensus 59 ~~gv~~~~~---~~~--------~a~ds~~~~--~~~~~~~~~~~~~~~~~~~f~gy~l-l~~y~~~~i~r~iVd~~aed 124 (516) |+-+-|+.+ ++. -.++..+++ ..+.++..+..-+.. .+-..|+. ++.=.....++++|+..+-- T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~--E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~ 78 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQE--ETDKGYQERLASAVLLNMVEQTLDTLSGK 78 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCC--CCHHHHHHHHhcccCCChHHHHHHHHhhh Confidence 111111111 111 112222221 112222222211111 11122321 11112357788899999988 Q ss_pred HhhCCCeeeeccccchhhhHHHHHHHHHHHH--hcChhHHHHHHHHhcccceeeEEEEEecCCCc---c----------- Q lcl|NC_019527. 125 LTREGIEITSKDRTKAKEMASKIKELEEACE--YYGVMGIIQKAAEHDCFFGRGQISINIKGADV---S----------- 188 (516) Q Consensus 125 ~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~--~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~~---~----------- 188 (516) ++|+..++....-+ ..+..|..-.+ -.++...++.++.....||.+++++....... . T Consensus 79 vf~k~p~~~~~~p~------~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~ 152 (513) T protein:vir:97 79 PFSEPIKLNEDVPK------AIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRRE 152 (513) T ss_pred hhhcCcccCcCchH------HHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhh Confidence 89988876432111 11111211112 23688999999999999999999987532110 0 Q ss_pred --cCc--ccccccccccc---eeeEEeecceeeccccccccccccccccC----------cceeEEe-----------ee Q lcl|NC_019527. 189 --VPL--ILDPRTIKKGS---LTGFSNIEPMWTSPSAYNALDPTAPDFYK----------PSTWWVL-----------GR 240 (516) Q Consensus 189 --~Pl--~ld~~~I~~g~---l~~l~v~d~~~v~p~~~~~~dp~s~~yg~----------P~~y~v~-----------g~ 240 (516) .|- .+.++.|-... +.|..++.-.-+.-... ..|. |+. |..|+|- +. T Consensus 153 ~~rPy~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~-~~Dg----f~~~~~~q~rvL~~g~~~v~r~~~~~~~~~~e~ 227 (513) T protein:vir:97 153 GLRPYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYM-EQDG----FAEVCKRRIRVLEPGLVQLWEPVKKSNAQKEEW 227 (513) T ss_pred ccCceEEEecHhhhcCcceeccCcceeeeeEEEEEEEe-ecCC----CcceEEEEEEEEeCceEEEEEeecCCCccccce Confidence 111 01122221111 01111111100000000 1122 221 1122221 00 Q ss_pred EeccceEEEecCCcc-hhhhhhccCCCCc-hHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHH Q lcl|NC_019527. 241 EMHASRLLTIITRPL-PDMLKPAYNFSGI-SMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFD 318 (516) Q Consensus 241 ~iH~SRli~~~~~~~-p~~~k~~~~~~G~-S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~ 318 (516) .+|.+.-..+.--|+ +.......-.-|. +++..+.=.+..|..... --++++..+++++-+.... ...... T Consensus 228 ~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd-~~~il~~~~~P~l~~~G~~---~~~~~~--- 300 (513) T protein:vir:97 228 ALADEWATGLNYVPLVTFYADRQGFMMGKPPLLDLAHLNVAHWQSASD-QRHILTVSRFPILACSGAS---GEDSDP--- 300 (513) T ss_pred EEecCCCCcCCceeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhh-HHHHHHhcccceeeeecCC---cCCCCc--- Confidence 111111111100011 0011111111233 355666666666644433 3457777777776553211 111010 Q ss_pred HHHHHHHhcCCcceEEEecCCcceeEEecccCCHHH---HHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHH-- Q lcl|NC_019527. 319 RVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGLAD---LQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSF-- 393 (516) Q Consensus 319 r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl~d---~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~y-- 393 (516) ........+.+...+.++..++.+-+++.. .++...++|..+-- +|+..++ | +.|++.-...+ T Consensus 301 ------i~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga----~ll~~~~-~-~~Ta~a~~~~~~~ 368 (513) T protein:vir:97 301 ------VVVGPNKVLYNPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGA----EFLKRKT-G-GQTATARALDSAE 368 (513) T ss_pred ------eEeeccccccCCCCCCcceeeccCchhHHHHHHHHHHHHHHHHHHHH----HhhccCC-c-cccHHHHHHHHHH Confidence 011222223343334567777766666644 44444455543332 2222232 2 35555433333 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHH Q lcl|NC_019527. 394 -YDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQL 472 (516) Q Consensus 394 -yd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l 472 (516) +..++++.. .+...++++++++..= .|.-+++.+|+.++-+....-+.. ..++...+++.|.|+.+..++.| T Consensus 369 ~~S~L~~~a~-~le~al~~~l~~~a~w-lg~~~~~~~v~in~dF~~~~~~~~-----~~~al~~a~~~G~is~~t~~~~L 441 (513) T protein:vir:97 369 ATSDLSAMTG-LFEDALAQALDITADW-LRLGPNGGTVELVKDYDLEEMDAP-----GLQALQVAREKRDISRKTYLNGL 441 (513) T ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHH-hCCCCCccEEEeccccCcccCCHH-----HHHHHHHHHhCCCCCHHHHHHHH Confidence 344444443 3677788888776542 243345677776654433222211 23455667888888888888887 Q ss_pred HhhhccCCCCCChhh--hccccccchh----------------------------cCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 473 SDDPDSGWDNIDGDL--EIVQPEMFDD----------------------------DGADPYMPDPDVLPGEEG 515 (516) Q Consensus 473 ~~~~~~~~~~~d~~~--e~~~~e~~~~----------------------------e~~~~~~~~~~~~~~~e~ 515 (516) ++...-. +.++.+. +...+++.+. ...+.++++-+..|++|. T Consensus 442 ~r~gvl~-~d~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) T protein:vir:97 442 RLRGVLP-EDFDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGGEGGEGGEGGGNPGGES 513 (513) T ss_pred HhccCCC-ccCCHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCC Confidence 6532211 2222111 0000000000 000111111222222222 No 233 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=94.08 E-value=0.0054 Score=33.00 Aligned_cols=395 Identities=14% Similarity=0.051 Sum_probs=166.2 Q ss_pred CCCccCCCccchh--------cccccccch--hhhcccccCCcccccccCcccHHHHHHHH----hCchhhhhhhhhhHH Q lcl|NC_019527. 59 MPGVVPAGTTPAV--------AMDSLCGPT--YQFLNSAAGGLYAADIQPFPGYQNLAALA----TRPEYRAFASTLSTE 124 (516) Q Consensus 59 ~~gv~~~~~~~~~--------a~ds~~~~~--~~~~~~~~~~~~~~~~~~f~gy~ll~~y~----~~~i~r~iVd~~aed 124 (516) |+ | +..++.. .++..+++- .+.++.-+..-+. .-+-..| +.|. ....++++|+..+-- T Consensus 1 m~-V--~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~--~E~~~~Y---~~rl~rA~~~n~~~~t~~~~~G~ 72 (452) T protein:vir:94 1 MP-I--ETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLS--GQTDDMY---NAYKQRALFYSITSKTLSALSGM 72 (452) T ss_pred CC-C--CCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCC--CCCHHHH---HHHHhhccCCchHHHHHHHHhch Confidence 22 2 1112211 111112211 1111111111110 1111223 3332 247788999999988 Q ss_pred HhhCCCeeeeccccchhhhHHHHHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecC--CCc-------ccCccccc Q lcl|NC_019527. 125 LTREGIEITSKDRTKAKEMASKIKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKG--ADV-------SVPLILDP 195 (516) Q Consensus 125 ~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~--~~~-------~~Pl~ld~ 195 (516) ++|+..++.... .+..+..-..-.++.+.++.++.....||.++++++..- ..+ .+=++.+. T Consensus 73 vf~k~p~~~~p~---------~l~~~~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii~W~~ 143 (452) T protein:vir:94 73 VLDQPPVITHPD---------AMSKYFEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENILNWEE 143 (452) T ss_pred hhcCCceecccH---------HHHHHHhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhcCccc Confidence 899998874321 123332223345789999999999999999999987532 111 00011111 Q ss_pred ccccccceeeEEeecceeecccccccccccccc----cc----CcceeEEe------ee--EeccceEEEecCCc---ch Q lcl|NC_019527. 196 RTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPD----FY----KPSTWWVL------GR--EMHASRLLTIITRP---LP 256 (516) Q Consensus 196 ~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~----yg----~P~~y~v~------g~--~iH~SRli~~~~~~---~p 256 (516) +.+ |.+. ++++--..+. ....|..+.. |- .|-.|+|. +. .++.-......+.+ +| T Consensus 144 ~~~--g~l~-~v~lre~~~~---~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP 217 (452) T protein:vir:94 144 DED--GRLL-MVVLREFYTV---RDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQNVGVTMDYIP 217 (452) T ss_pred ccc--CCee-EEEEEEEEEE---ecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecCCCcccceeE Confidence 122 2222 1111100000 0001111110 10 02222221 00 01111111111111 11 Q ss_pred hh---hhhccCCCCch-HHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcce Q lcl|NC_019527. 257 DM---LKPAYNFSGIS-MSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGL 332 (516) Q Consensus 257 ~~---~k~~~~~~G~S-~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~ 332 (516) .. ........|.| ++..+.=.+..|..... --++++...++++-+.... ... . ...+.... T Consensus 218 ~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~sd-~~~~l~~~~~P~l~~~g~~---~~~--~---------i~iG~~~~ 282 (452) T protein:vir:94 218 FFCITPSGLSMTPAKPPMIDIVDINYSHYRTSAD-LEHGRHFTGLPTPWITGAE---SQS--T---------MHIGSTKA 282 (452) T ss_pred EEEEcCCCCCCCCCccchHHHHHHHHHHhcchhH-HHHHHHHcccceeEeecCc---CCC--c---------eEeccccc Confidence 11 11111123444 44555566666666555 4567777777776542211 111 0 11122222 Q ss_pred EEEecCCcceeEEecccCCHH---HHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHH---HHHHHHHHHHHHH Q lcl|NC_019527. 333 AVMDFDSEDIVQVNTPLSGLA---DLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSF---YDDISSVQQSYYF 406 (516) Q Consensus 333 ~~id~~~e~~e~~~~~lsgl~---d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~y---yd~I~~~Qe~~l~ 406 (516) +.+...+.++..++.+-+++. +.++...+++..+-. ..|.++ +.| +.|++.-...+ +..+.++- ..+. T Consensus 283 ~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~~m~~~Ga---~ll~~~-~~~-~~s~ea~~~~~~~~~s~L~~~a-~~~e 356 (452) T protein:vir:94 283 WVIPEVAAKVGFLEFTGQGLQSLEKALSEKQAQLASLSA---RLIDNS-TRG-SEATETVKLRYMSETASLKSVT-RAVE 356 (452) T ss_pred ccCCCCCCcceEEccCchhHHHHHHHHHHHHHHHHHHHH---HhhccC-CCc-chHHHHHHHHHHHhhHHHHHHH-HHHH Confidence 334432456777776666663 444445555433221 123332 222 33454332222 23333333 3356 Q ss_pred HHHHHHHHHHHHHhCCCcCCcceEEeCC---CCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCC Q lcl|NC_019527. 407 SPLDTMLKVIQLSKWGEIDDAITFKFKS---LWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNI 483 (516) Q Consensus 407 p~l~~l~~~l~~s~~g~~~~d~~~~f~p---L~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~ 483 (516) ..++++++++.. |...+.+..|+-+. ...++.. ..++...+++.|.|+.+..++.|+.... -.. T Consensus 357 ~al~~~l~~~a~--w~g~~~~~~v~~n~dF~~~~~~~~--------~~~al~~~~~~G~is~~t~~~~L~~~gv---l~~ 423 (452) T protein:vir:94 357 ALLNKAYSCIMD--MESMGGTLNIKLNSAFLDSKLTAA--------ELKAWVEAYLSGGISKEIYIHALKVGKV---LPP 423 (452) T ss_pred HHHHHHHHHHHH--HcCCCCceEEEeccccccccCCHH--------HHHHHHHHHhcCCCcHHHHHHHHHhCCC---CCC Confidence 778888886654 33344555555442 2223332 3344567799999999999999977432 122 Q ss_pred ChhhhccccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 484 DGDLEIVQPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 484 d~~~e~~~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) +.+.+...+| .+.+++.+...|+--+| T Consensus 424 ~~e~~~i~~E------~~~~~~~~~~~~~~~~~ 450 (452) T protein:vir:94 424 PGESMGVIPD------PPAPEPSPSNTPPNPSS 450 (452) T ss_pred ccCHHHHHHH------hhccCcccCCCCCCCcc Confidence 2111111111 11122222222222222 No 234 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=94.02 E-value=0.0056 Score=32.92 Aligned_cols=414 Identities=12% Similarity=0.078 Sum_probs=168.2 Q ss_pred CCCccCCCccchh--------cccccccchh--hhcccccCCcccccccCcccHHHHHHHHh----CchhhhhhhhhhHH Q lcl|NC_019527. 59 MPGVVPAGTTPAV--------AMDSLCGPTY--QFLNSAAGGLYAADIQPFPGYQNLAALAT----RPEYRAFASTLSTE 124 (516) Q Consensus 59 ~~gv~~~~~~~~~--------a~ds~~~~~~--~~~~~~~~~~~~~~~~~f~gy~ll~~y~~----~~i~r~iVd~~aed 124 (516) ||.| ...++.. .++..+++-. +.++.-+..-...+..+--+=+..+.|.+ ....+++|+...-- T Consensus 1 m~~V--~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~ 78 (501) T protein:vir:95 1 MPNV--SFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQ 78 (501) T ss_pred CCCC--CCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhh Confidence 4444 1222221 1222222111 11111121111111111111122344432 36677788888878 Q ss_pred HhhCCCeeeeccccchhhhHHHHHHHHHHHH--hcChhHHHHHHHHhcccceeeEEEEEecCC--C-c-c---------c Q lcl|NC_019527. 125 LTREGIEITSKDRTKAKEMASKIKELEEACE--YYGVMGIIQKAAEHDCFFGRGQISINIKGA--D-V-S---------V 189 (516) Q Consensus 125 ~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~--~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~--~-~-~---------~ 189 (516) ++|+..++.. + ..++.|..-.+ -.++...++.++.....||.++|++..... . . + . T Consensus 79 vf~k~p~~~~---p------~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~r 149 (501) T protein:vir:95 79 VFMRDPVVKV---P------ALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRIR 149 (501) T ss_pred hhcCCcceeC---c------HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccCC Confidence 8888887732 1 12333332222 226899999999999999999999875211 1 0 0 1 Q ss_pred Cc--ccccccc--------c-ccceeeEEeecceeeccccccccccccc--------------cccCcceeEEee----- Q lcl|NC_019527. 190 PL--ILDPRTI--------K-KGSLTGFSNIEPMWTSPSAYNALDPTAP--------------DFYKPSTWWVLG----- 239 (516) Q Consensus 190 Pl--~ld~~~I--------~-~g~l~~l~v~d~~~v~p~~~~~~dp~s~--------------~yg~P~~y~v~g----- 239 (516) |- .+.++.| . ...|..++.-+.. ....|+++. .++.-+.|+-.. T Consensus 150 Py~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~------~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~ 223 (501) T protein:vir:95 150 PTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETW------CAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKAD 223 (501) T ss_pred cEEEEecHhhhcCcceeccCCceeeeEEEEEEEE------eecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccC Confidence 21 0111111 1 0012111111111 011111111 111111121110 Q ss_pred -eE------eccceEE--EecCCcc---hhhh-hhccCCC--C-chHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeec Q lcl|NC_019527. 240 -RE------MHASRLL--TIITRPL---PDML-KPAYNFS--G-ISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTN 303 (516) Q Consensus 240 -~~------iH~SRli--~~~~~~~---p~~~-k~~~~~~--G-~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~ 303 (516) .. .+.+... .-.+..+ |... -...+++ | -+++..+.=.|.+|..... --++++..+++++-+. T Consensus 224 ~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~lni~hy~~ssd-~~~~l~~~~~P~l~i~ 302 (501) T protein:vir:95 224 GSKIPKGNYQQYVVYKPTDAQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASLNMAHYRNSAD-YEESCYIVGQPTPVLI 302 (501) T ss_pred cceecCCcccccceeeeeccCCCcCCeeeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhH-HHHHHHHcccceeeee Confidence 00 0000000 0011111 1110 0112222 2 3466666666777766554 4457888888776542 Q ss_pred -chhh-hcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCH-HHHHHHHHHHHHhhhcCCceeeeccccc Q lcl|NC_019527. 304 -MAQV-LNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGL-ADLQSQSQEHMCSVSKIPAIKLTGISPS 380 (516) Q Consensus 304 -~~~~-l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl-~d~~~~~~~~iaaas~IP~t~L~G~sp~ 380 (516) +... ........ + . ......+.+. ++.++..+..+-+++ ...++...++|..+-- +|+-++. T Consensus 303 G~~~~~~~~~~~~~-------i-~-~G~~~~~~lP-~~~~~~~ie~~~~~i~~~~l~~l~~~m~~~Ga----~ll~~~~- 367 (501) T protein:vir:95 303 GLTEEWVTNVLKGS-------V-N-FGSRGGIPLP-VGADAKLLQASENTMLKEAMDTKERQMVALGA----KLVEQKE- 367 (501) T ss_pred CCcccccccCCCCc-------e-e-ecccccccCC-CCCceeEEecChhhHHHHHHHHHHHHHHHHHH----hhccCCc- Confidence 2111 00000000 0 0 1111222333 345677776655555 3345444455443321 1222221 Q ss_pred cccccchHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 381 GLNASSEGEIRSF---YDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIY 457 (516) Q Consensus 381 Glnatge~D~~~y---yd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~ 457 (516) -+.|++.-...+ +..+.++- ..+...++++++++..= .|..+.+.+|+.++-+....-+ ...+++...+ T Consensus 368 -~~~Ta~~~~~~~~~~~S~L~~~a-~~le~al~~~l~~~a~w-~g~~~~~~~v~i~~df~~~~~~-----~~~~~al~~~ 439 (501) T protein:vir:95 368 -VQRTATEAELEAASEGSTLSSAT-KNVSAAFEWALKWAARW-VGQADSGVKFELNTDFDIARMT-----PDERRSLVEE 439 (501) T ss_pred -cchhHHHHHHHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHH-cCCCCCceEEEEecccccccCC-----HHHHHHHHHH Confidence 223444332222 22333332 34577788888866542 3555666777766654332222 2235667788 Q ss_pred HHcCCCCHHHHHHHHHhhhccCCCCCCh-hhhcc-ccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 458 ITNSVIDPSEARQQLSDDPDSGWDNIDG-DLEIV-QPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 458 ~~~gvi~~~e~r~~l~~~~~~~~~~~d~-~~e~~-~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) ++.|.|+.++.++.|+.. ++...+. +++.. +++..+..-.+.++..+.+..|++.- T Consensus 440 ~~~G~is~~t~~~~L~~~---~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~~~ 497 (501) T protein:vir:95 440 WQKGAITFEEMRTGLRKA---GVATEDDSKAKEKIAKDTAEAMALATPANVPGDGSGGDNV 497 (501) T ss_pred HhCCCCcHHHHHHHHHhC---CCCChhHHHHHHHHHhhhcCcccccccCCCCCCCcccccc Confidence 999999999999999763 2222211 11110 11100000011112222222222221 No 235 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=90.41 E-value=0.021 Score=29.78 Aligned_cols=441 Identities=10% Similarity=0.041 Sum_probs=163.5 Q ss_pred Ccchhhhhhhhccccccccc-CCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccch--------hc Q lcl|NC_019527. 2 WPFDRKKFKREVADKLADAA-RAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPA--------VA 72 (516) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~--------~a 72 (516) ---||....|+++.+.-.+. .||+.+-+..|+ .| ...++. -. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~---------------------------dV--~~~hp~y~a~~~~W~~ 51 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLIPPQAPPTSGLGPSLP---------------------------NV--GYQRVEFGEMLPKWRK 51 (535) T ss_pred CCcchhhhhhhhhhhcccCCCCcCCCCCCCCCC---------------------------CC--CcCCHHHHHHHHHHHH Confidence 12223333333333221111 111111111111 11 011111 01 Q ss_pred ccccccch--hhhcccccCCccccccc---CcccHHH-HHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHH Q lcl|NC_019527. 73 MDSLCGPT--YQFLNSAAGGLYAADIQ---PFPGYQN-LAALATRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASK 146 (516) Q Consensus 73 ~ds~~~~~--~~~~~~~~~~~~~~~~~---~f~gy~l-l~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~ 146 (516) ++..+++. .+.++.-+..-...+-. +-..|+- ++.=....+.+++|+..+--++|+..++... .. T Consensus 52 ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk~p~~~~p---------~~ 122 (535) T protein:vir:80 52 IMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMMGQVFSRDPIRQLP---------PA 122 (535) T ss_pred HHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHhchhhcCCcceecc---------HH Confidence 22222221 11111112111111000 1111321 1111234778888888888888988766321 12 Q ss_pred HHHHHHHHHh--cChhHHHHHHHHhcccceeeEEEEEecC-CCc----------ccCcc--ccccc--------cc-ccc Q lcl|NC_019527. 147 IKELEEACEY--YGVMGIIQKAAEHDCFFGRGQISINIKG-ADV----------SVPLI--LDPRT--------IK-KGS 202 (516) Q Consensus 147 i~~i~~~~~~--l~~~~~l~ea~~~~rlyG~a~i~i~i~~-~~~----------~~Pl~--ld~~~--------I~-~g~ 202 (516) ++.|..-.+. .++.+.++.++.....||.++|++.... +.. ..|-- +.++. |. ... T Consensus 123 l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~ 202 (535) T protein:vir:80 123 LEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSV 202 (535) T ss_pred HHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcCCCcEEEEechhhccCccccccCCccc Confidence 3333322222 2689999999999999999999987521 110 01210 11111 11 111 Q ss_pred eeeEEeecceeeccccccccccccccc--------------cCcceeEEeee--E-eccceEEEe--cCCcc---hhh-h Q lcl|NC_019527. 203 LTGFSNIEPMWTSPSAYNALDPTAPDF--------------YKPSTWWVLGR--E-MHASRLLTI--ITRPL---PDM-L 259 (516) Q Consensus 203 l~~l~v~d~~~v~p~~~~~~dp~s~~y--------------g~P~~y~v~g~--~-iH~SRli~~--~~~~~---p~~-~ 259 (516) |..++..+.... ..|.++... ++...|...+. . .-.++.... .+..+ |-. . T Consensus 203 Lt~v~lrE~~~~------~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~ 276 (535) T protein:vir:80 203 ISLVVIQENVLA------QDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNGNPFKEIPFQFI 276 (535) T ss_pred eeEEEEEEEEEe------cCCCcccceeEEEEEEEecCCceEEEEEEEeecCCccccccceeecccCCCcccCeeEEEEe Confidence 222222221111 111111111 11111111110 0 000000000 00111 111 0 Q ss_pred hhccCCC--Cc-hHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeec-chhhhcCccHHHHHHHHHHHHHhcCCcceEEE Q lcl|NC_019527. 260 KPAYNFS--GI-SMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTN-MAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVM 335 (516) Q Consensus 260 k~~~~~~--G~-S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~-~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~i 335 (516) -...+.+ |. +++..+.=.|..|..... --++++..+++++-+. +.+....+.... .- ........+.+ T Consensus 277 ~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd-~~~il~~~~~P~l~i~G~~~~~~~~~~~~--~~-----i~iG~~~~~~l 348 (535) T protein:vir:80 277 GPLDNNADIDHPPLLDLCEVNIGHYRNSAD-YEEMAFVAGQPTAFFTGLTKDWVEDVFKD--FK-----VHLGSRAIIPL 348 (535) T ss_pred ecCCCCCCCCccchHHHHHHHHHHhhchhH-HHHHHHHhcCceeeeecCchhhhhcCCCC--cc-----eEecCcccccC Confidence 0111222 33 455666666777766555 4457788887766542 222110000000 00 00111122223 Q ss_pred ecCCcceeEEecccCCHH-HHHHHHHHHHHhhhcCCceeeeccccccccccchHHH---HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 336 DFDSEDIVQVNTPLSGLA-DLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEI---RSFYDDISSVQQSYYFSPLDT 411 (516) Q Consensus 336 d~~~e~~e~~~~~lsgl~-d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~---~~yyd~I~~~Qe~~l~p~l~~ 411 (516) . ++.++..+...-+++. +.++...++++..-...+ .+++++. |..... ..=+..+.++- ..+...+++ T Consensus 349 P-~~~~~~~~e~~~~~~a~~~l~~~e~qM~~lGa~ll----~~~~~~~--Ta~~a~~~~~~~~S~L~~~a-~~le~al~~ 420 (535) T protein:vir:80 349 P-QGATAGILQITPNSVPFEAMTHKESQMIAMGANLL----VKSGGNR--TFGEAQQEEASEQSILSACT-KNVSMAFRK 420 (535) T ss_pred C-CCCCcceeeeccchhHHHHHHHHHHHHHHHHHHhh----ccCcccc--cHHHHHHHHHHHhHHHHHHH-HHHHHHHHH Confidence 2 3344544444445543 234444444444332222 2222222 222111 11123333332 346777888 Q ss_pred HHHHHHHHhCCCc-C-CcceEEeCC---CCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChh Q lcl|NC_019527. 412 MLKVIQLSKWGEI-D-DAITFKFKS---LWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGD 486 (516) Q Consensus 412 l~~~l~~s~~g~~-~-~d~~~~f~p---L~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~ 486 (516) +++++..= .|.. + .++.|.-+. .-.++..+ .++...+++.|.|+.+..++.|+....-. +..+.+ T Consensus 421 aL~~~A~w-~G~~~~~~~~~i~~n~dF~~~~ld~~~--------~~all~~~~~G~Is~et~~~~L~r~gvl~-~~~~~e 490 (535) T protein:vir:80 421 ALRWANQF-QTGIVNDETVEYNLNTDFPAARLTPNE--------RAELILEWQQGAITFKEMRAGLRRAGVAS-EDDAKA 490 (535) T ss_pred HHHHHHHH-cCCccCCCceEEEeccccccccCCHHH--------HHHHHHHHhcCCCCHHHHHHHHHhCCCCC-cccchH Confidence 88866543 2432 2 345555442 22333333 45566788999999999999986642110 111111 Q ss_pred hhc--cccccchhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 487 LEI--VQPEMFDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 487 ~e~--~~~e~~~~e~~~~~~~~~~~~~~~e~t 516 (516) .+. .+.|..+ ....++.. ..++.++| T Consensus 491 ee~~ri~~E~~~---~~~~~g~~-~d~~~~g~ 518 (535) T protein:vir:80 491 ETEGKATVEFIA---KTAAAGKV-GDAASGGT 518 (535) T ss_pred HHHHHHHhhhhh---ccccCCCC-CCCCCCCC Confidence 111 1111100 00111111 11122222 No 236 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=66.39 E-value=0.27 Score=23.72 Aligned_cols=419 Identities=12% Similarity=0.084 Sum_probs=162.7 Q ss_pred HHhHHhhcCCCccccccCCCCCCCccCCCccchh--------cccccccchhhhcccccCCcccccccCcccHHHHHHHH Q lcl|NC_019527. 38 VMKSMERRASDAATKWAPPQLMPGVVPAGTTPAV--------AMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNLAALA 109 (516) Q Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~--------a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~ 109 (516) ++ .....+..+ +..++.. .++..+.+-........ +..... .+.+-...+.|. T Consensus 1 ~~---~~~~~~~~V-------------~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~-yl~~~~--~~~~e~~Y~~rl 61 (489) T protein:vir:78 1 ML---TENGQGSGV-------------KTKHREWLHYAPKWQKVRHALAGELVSYLRNV-GLNEPD--KAYGEARQAEYE 61 (489) T ss_pred Cc---cCCCccCCC-------------CccCHHHHHHHHHHHHHHHHhcCcccccccCC-CCCCCC--CCCChHHHHHHH Confidence 00 000000001 1111110 12222222111111111 111110 122222223332 Q ss_pred ----hCchhhhhhhhhhHHHhhCCCeeeeccccchhhhHHHHHHHHHHHHh--cChhHHHHHHHHhcccceeeEEEEEec Q lcl|NC_019527. 110 ----TRPEYRAFASTLSTELTREGIEITSKDRTKAKEMASKIKELEEACEY--YGVMGIIQKAAEHDCFFGRGQISINIK 183 (516) Q Consensus 110 ----~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~~~~~~i~~i~~~~~~--l~~~~~l~ea~~~~rlyG~a~i~i~i~ 183 (516) .....+++++..+--++|+..++... ..++.|..-.+. .++...++.++.....||.+++++... T Consensus 62 ~rA~~~n~~~~tl~~l~G~vfrk~p~~~~p---------~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P 132 (489) T protein:vir:78 62 AGGIVYNFTRRTLSGMVGSVMRKEPEINIP---------KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAP 132 (489) T ss_pred hccccCChHHHHHHHHhchhhcCCcceecc---------HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeC Confidence 34778889998888889998877321 123333332222 368899999999999999999998763 Q ss_pred CCCc----------ccCc--ccccccccc------c---ceeeEEeecceeec-c-cc--------ccccccccccccCc Q lcl|NC_019527. 184 GADV----------SVPL--ILDPRTIKK------G---SLTGFSNIEPMWTS-P-SA--------YNALDPTAPDFYKP 232 (516) Q Consensus 184 ~~~~----------~~Pl--~ld~~~I~~------g---~l~~l~v~d~~~v~-p-~~--------~~~~dp~s~~yg~P 232 (516) -... ..|- .+.++.|-. | .|..++.-+...+. + +. +...++.....++- T Consensus 133 ~~~~~T~ade~~~~~rPy~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~ 212 (489) T protein:vir:78 133 ETGAATAAEQNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQ 212 (489) T ss_pred CCCCcCHHHHHHhcCCcEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEE Confidence 2110 0121 111222211 1 11111111111110 0 00 00111111111122 Q ss_pred ceeEEe--e------eEe-ccceEEEecCCcchhh-hhhccCCC--Cc-hHHHHHHHHHHHHHHHHHHHHHHHHHhCCce Q lcl|NC_019527. 233 STWWVL--G------REM-HASRLLTIITRPLPDM-LKPAYNFS--GI-SMSQLAQPYVENWLRTRQSVSDLVDKFSRTF 299 (516) Q Consensus 233 ~~y~v~--g------~~i-H~SRli~~~~~~~p~~-~k~~~~~~--G~-S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v 299 (516) +.|+.. | .++ |.+.-..+ ..+|-. .-...+.+ |. +++..+.=.|.+|..... --++++..++++ T Consensus 213 ~~~r~~~~g~~~~~~~~~~~~~g~~~l--~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd-~~~~l~~~~~P~ 289 (489) T protein:vir:78 213 RLFRFDAEGGAQEDVVEIYPDLGESLR--GVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSAD-NEESSFVVGQPT 289 (489) T ss_pred EEEEeecCCcccceeeEEeccCCCCcc--CeeeEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhH-HHHHHHHcccce Confidence 222211 1 011 11110000 001111 00112233 32 356666777777777665 456888888888 Q ss_pred eeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEecccCCH-HHHHHHHHHHHHh-hhcCCceeeecc Q lcl|NC_019527. 300 LKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNTPLSGL-ADLQSQSQEHMCS-VSKIPAIKLTGI 377 (516) Q Consensus 300 ~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~~lsgl-~d~~~~~~~~iaa-as~IP~t~L~G~ 377 (516) +-+......+........++ -...-++.+. .+- .+.++..++..-+++ ...++...+++.. .+.+ +.+ T Consensus 290 l~i~G~d~~~~~~~~~~~~~---~i~~g~~~~~-~lp-~~~~~~~ie~~~~~~~r~~l~~le~qm~~lGa~l-----~~~ 359 (489) T protein:vir:78 290 LFIYPGENLTPQAFKEANPN---GIKFGSRRGH-NLG-YGGSAQLIQAGENNLARQNMLDKEQQAIQIGAQL-----ITP 359 (489) T ss_pred eeeecCccCCcccccccCcc---ceeeCCcccc-cCC-CCCCcceeccCcchHHHHHHHHHHHHHHHHhhhh-----ccC Confidence 76533222211110000000 0001122222 222 234455555444444 3334333444432 2332 221 Q ss_pred ccccccccchHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-CCcceEEeCC---CCCCCHHHHHHHHHHH Q lcl|NC_019527. 378 SPSGLNASSEGEIRSF---YDDISSVQQSYYFSPLDTMLKVIQLSKWGEI-DDAITFKFKS---LWQTSAKEESEIRFNK 450 (516) Q Consensus 378 sp~Glnatge~D~~~y---yd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~-~~d~~~~f~p---L~~~sekEkAei~~~~ 450 (516) ++ +-|++.-...+ +..++++- ..+...++++++++..= .|.- +.+..|.-|. ...++..+ T Consensus 360 --~~-~~Ta~~~~~~~~~~~S~L~~~a-~~~e~al~~~l~~~a~w-~G~~~~~~~~i~~n~dF~~~~~d~~~-------- 426 (489) T protein:vir:78 360 --TQ-QITAQSARIQRGADTSVMATIA-RNVSQAYTDALRWVAVM-LGKPEDTEVEFRLNMDFFLEPMTAQD-------- 426 (489) T ss_pred --Cc-chhHHHHHHHHHHhhHHHHHHH-HHHHHHHHHHHHHHHHH-cCCCCCCceEEEeecccCcccCCHHH-------- Confidence 12 34444332222 22333332 33567788888777643 2432 3344443332 22333332 Q ss_pred HHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChhhhccccccchhc---CCCCCCCCCCCCCCCCC Q lcl|NC_019527. 451 AQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGDLEIVQPEMFDDD---GADPYMPDPDVLPGEEG 515 (516) Q Consensus 451 a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~~e~~~~e~~~~e---~~~~~~~~~~~~~~~e~ 515 (516) .++...+++.|.|+.+..++.|+.... .+..++++ ++++.++. +...++.-|+..-+.|. T Consensus 427 ~~al~~~~~~G~is~~t~~~~L~~~gv--~d~~~e~~---~~ei~~~~~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 427 RAAWMADINAGLLPATAYYAALRKAGV--TDWTDADI---KDAVADQPLPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred HHHHHHHHhcCCCCHHHHHHHHHhCCC--CCccHHHH---HHHHhhcCCCcccCCcccCCCCcccccC Confidence 455566789999999999999976322 12222222 22222111 00011111111111122 No 237 >protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741 Probab=48.27 E-value=0.67 Score=21.52 Aligned_cols=425 Identities=16% Similarity=0.173 Sum_probs=166.5 Q ss_pred cccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhccccccc-chhhhcccccCCc-ccc Q lcl|NC_019527. 17 LADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCG-PTYQFLNSAAGGL-YAA 94 (516) Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~-~~~~~~~~~~~~~-~~~ 94 (516) -+.+..=++.+. |.|-.|...+ +|+.=|..- ..-+...-. ++. ... T Consensus 1 ~~a~~~lr~~rr------------------------------pkg~~~a~~r-~L~aAs~~~~dpg~~~~~~-~g~~~~~ 48 (631) T protein:vir:10 1 MAATQSLRLVRR------------------------------PKGGRPAPSR-ALTAASQPLPDPSQVFSKS-TGISRNS 48 (631) T ss_pred CCcccceeeeec------------------------------CCCCCccchh-hhhhhhccccchhhhhhhh-cCCcccc Confidence 000000001111 2333332322 222111110 000000000 111 011 Q ss_pred cccCcccHHHHH--HHHhCchhhhhhhhhhHHHhhCCCeeeecccc---------chhhhHHHHHHHHHHHH--hcChhH Q lcl|NC_019527. 95 DIQPFPGYQNLA--ALATRPEYRAFASTLSTELTREGIEITSKDRT---------KAKEMASKIKELEEACE--YYGVMG 161 (516) Q Consensus 95 ~~~~f~gy~ll~--~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~---------~~~~~~~~i~~i~~~~~--~l~~~~ 161 (516) + +|..+ .|..=++.|-.|.-.+.-|-|+-.-..-.+.+ +++-..++++++...+- .++..+ T Consensus 49 ~------WQ~eAW~~~d~v~Elry~vgW~~~s~sr~rL~as~idpDtg~ptg~iee~~~~~~~v~~~~~~i~gG~lgQ~~ 122 (631) T protein:vir:10 49 D------WQTDAWEAVDLVGELRYYVGWRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAA 122 (631) T ss_pred h------hhHHHHHHHHhhhhHHHHhhhhhhhhceeeeEeeeeccCCCCCccccccCCchhHHHHHHHHhcCCCcchHHH Confidence 1 22221 22222455555555555444443333222211 01111133333333211 457788 Q ss_pred HHHHHHHhcccceeeEEEEEecCCC--cccCcccccccccccceeeEEeecceeeccc----cccccccccccccCccee Q lcl|NC_019527. 162 IIQKAAEHDCFFGRGQISINIKGAD--VSVPLILDPRTIKKGSLTGFSNIEPMWTSPS----AYNALDPTAPDFYKPSTW 235 (516) Q Consensus 162 ~l~ea~~~~rlyG~a~i~i~i~~~~--~~~Pl~ld~~~I~~g~l~~l~v~d~~~v~p~----~~~~~dp~s~~yg~P~~y 235 (516) .++.....--+-|.++|++....++ ..+|- ..|+.- ....++++..+.-. .....-|. |.+-.| T Consensus 123 llkrl~~~ltV~GE~wiv~l~~p~~~~~~~pd----~~~r~~--~~W~~vt~~ei~~~~~g~g~~v~lp~----g~~h~~ 192 (631) T protein:vir:10 123 LTKRVVECLTVPGELWIVILTRPVKGAPAQPD----GSVRTR--QEWYAVSKEEIKKSNKGSGTNIVLPT----GEEHEF 192 (631) T ss_pred HHHHHHhheecccceEEEEEeccCcCCCCCcc----cccccc--cceeeccHHHHhcccCcccceeecCC----CCccce Confidence 8888888888899999988764432 12221 011110 11222222222100 00000110 111111 Q ss_pred EEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHH----hCCceeeecch----h- Q lcl|NC_019527. 236 WVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDK----FSRTFLKTNMA----Q- 306 (516) Q Consensus 236 ~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~----~~~~v~k~~~~----~- 306 (516) . .+. -+++-+=+| ++.+...-.|.+..|.+.|+-...+...+..-.+. .++.++--.+. . T Consensus 193 ~-~~~-----D~l~RiW~P-----~prr~~e~dSpvra~l~~l~Ei~~~t~~i~aaakSRl~gnGvlflP~els~P~~~~ 261 (631) T protein:vir:10 193 V-KGT-----DIIFRVWIP-----KPRKASEPDSPVRAVLDSIREIVRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQG 261 (631) T ss_pred e-cCC-----ceEEEeeCC-----CcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCceeEeccccccCCCCC Confidence 1 111 112111122 34555667888888888888877776665543322 22322221111 0 Q ss_pred ---hh--------cC-ccHHHHHHH----HHHHHHhcCCcce---EEEecCCcceeEEe-ccc-CCHHHH----HHHHHH Q lcl|NC_019527. 307 ---VL--------NG-GEGGDVFDR----VEMYVNMQSNLGL---AVMDFDSEDIVQVN-TPL-SGLADL----QSQSQE 361 (516) Q Consensus 307 ---~l--------~~-~~~~~l~~r----~~~~~~~~sn~g~---~~id~~~e~~e~~~-~~l-sgl~d~----~~~~~~ 361 (516) .. .+ .-..+|..- ......-.+.... +++-..+|-++.++ ..| +.++++ -+-.-. T Consensus 262 ~~~~~~g~~v~~~~g~pa~~~l~~~l~q~a~tai~De~S~aA~vPii~~~p~E~i~~i~hlkf~~ei~e~aiktR~daI~ 341 (631) T protein:vir:10 262 PVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGVPGEQIKDVKHIRFDNEITEVAIKTRNDAIA 341 (631) T ss_pred CCCCcCCccCCccccchhHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCeeEEeecCchhHHHHhhHHHHHH Confidence 00 00 001122221 1111111222111 22333445444443 223 356554 334445 Q ss_pred HHHhhhcCCceeeeccccccc----cccchHHHHH----HH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEe Q lcl|NC_019527. 362 HMCSVSKIPAIKLTGISPSGL----NASSEGEIRS----FY-DDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKF 432 (516) Q Consensus 362 ~iaaas~IP~t~L~G~sp~Gl----natge~D~~~----yy-d~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f 432 (516) .+|...+||--+|+|.-..+. =.-+++|++- +- =-|+++=.++|||.|+.. |. +++=++.| T Consensus 342 RlA~glDi~pE~LLGlGsd~NHWsAWqI~dedVrlHI~P~l~lic~AlT~q~Lrp~Le~e---------Gv-Dp~kYvvW 411 (631) T protein:vir:10 342 RLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALTDQILRVTLARE---------GI-DPSKYVVW 411 (631) T ss_pred HHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHhhHHHHHHHHh---------CC-CHHHhEee Confidence 799999999999999710110 0124555443 21 223455555555555432 44 44333555 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChh-------------------------- Q lcl|NC_019527. 433 KSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGD-------------------------- 486 (516) Q Consensus 433 ~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~-------------------------- 486 (516) -+-.+++.+- .+.+.+..+++.|+|+.+..|++|+-..+.+|.--+.+ T Consensus 412 ~DaS~Lt~dP------dr~deA~qa~drGAIt~eAlrk~lGf~eDd~yd~~t~e~~~~~a~~av~~dpaLip~lApl~~~ 485 (631) T protein:vir:10 412 YDPSQLTIDP------DKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQDAVSKDPTLIPMLAPLIAG 485 (631) T ss_pred ecCcccccCC------CCcHHHHHHHHcCCcCHHHHHHHhcCchhcccCcCchHHHHHHHHHHhhcccCcchhhHHHHHH Confidence 5444443211 12233455799999999999999987777666521100 Q ss_pred ----hhcccccc-----chhcCCCCCCCCCCCCCCCCCC Q lcl|NC_019527. 487 ----LEIVQPEM-----FDDDGADPYMPDPDVLPGEEGS 516 (516) Q Consensus 487 ----~e~~~~e~-----~~~e~~~~~~~~~~~~~~~e~t 516 (516) .+...+.. .++++++.++.+-.++|+.+++ T Consensus 486 ~~~~v~~P~~~a~~~~g~ed~~~~~~~~~g~~epdt~d~ 524 (631) T protein:vir:10 486 VLKQIEFPQQQAIDSGGNEDTSDADDLDDGEQEPDTEDD 524 (631) T ss_pred HhhhccCCCCCCCCCCCCCccccccccccCCCCCCCCCC Confidence 00000000 0011111112222222222222 No 238 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=39.37 E-value=1 Score=20.53 Aligned_cols=420 Identities=13% Similarity=0.056 Sum_probs=175.1 Q ss_pred hhHHHHhHHhhcCCCc--------cccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccccccCcccHHHH Q lcl|NC_019527. 34 MRRAVMKSMERRASDA--------ATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAADIQPFPGYQNL 105 (516) Q Consensus 34 ~~~~~~~~~~~~~~~~--------~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~f~gy~ll 105 (516) ++ +....+-.+.++. ...|.-|.+. ..|.+.... ..... . . | T Consensus 1 m~-~~~~~l~~k~~R~~~e~~w~e~a~~~lP~~~--~~~~~~~~~---------------~~~~~---~---~---~--- 50 (514) T protein:vir:80 1 MR-QQASAMWAEYRDSTAIRKAEDFAKFTIASLM--VDPLDKTHQ---------------AEVVE---Y---D---F--- 50 (514) T ss_pred Cc-cchHHHHHHhhcchHHHHHHHHHHHhccccc--CCCCCCccc---------------ccccc---c---c---c--- Confidence 33 2222221122221 1222223211 101110000 00000 0 0 0 Q ss_pred HHHHhCchhhhhhhhhhHHH-------hhCCCeeeeccccch---------hhhHHHH----HHHHHHHHhcChhHHHHH Q lcl|NC_019527. 106 AALATRPEYRAFASTLSTEL-------TREGIEITSKDRTKA---------KEMASKI----KELEEACEYYGVMGIIQK 165 (516) Q Consensus 106 ~~y~~~~i~r~iVd~~aed~-------~r~~~~i~~~~~~~~---------~~~~~~i----~~i~~~~~~l~~~~~l~e 165 (516) ...+-+++++.|.-+ -+.||.+...++... .....++ +.+...+.+-++...+.+ T Consensus 51 -----dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~ 125 (514) T protein:vir:80 51 -----QSAGAFLVNNLTAKLALTLFPPGRPSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHR 125 (514) T ss_pred -----chhHHHHHHHHHHHHHhhhcCCCCcccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHH Confidence 111122222222222 257888865422110 0112222 235556667799999999 Q ss_pred HHHhcccceeeEEEEEecCCCcccCcccccccc---cccceeeEEeecceeecccc----cccc---ccccc-cccCcce Q lcl|NC_019527. 166 AAEHDCFFGRGQISINIKGADVSVPLILDPRTI---KKGSLTGFSNIEPMWTSPSA----YNAL---DPTAP-DFYKPST 234 (516) Q Consensus 166 a~~~~rlyG~a~i~i~i~~~~~~~Pl~ld~~~I---~~g~l~~l~v~d~~~v~p~~----~~~~---dp~s~-~yg~P~~ 234 (516) ++..--.||.+.+|+.-+.... .-.++..-.| ..|.+.. ++-+.+++... +... +.... .+.+-+. T Consensus 126 ~~~~L~~~G~a~l~~~~~~~~~-~~~pl~~y~v~~d~~G~v~~--i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v 202 (514) T protein:vir:80 126 ILKLLVVTGNALFYREPGTGKM-LVWTMQSYTVRRTSHGDPAV--VVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDL 202 (514) T ss_pred HHHHHHhHCeEEEEEecCCCcE-EEEEcCeEEEeeCCCcCeEE--EEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEE Confidence 9999999999888864322111 1111111011 1122211 22222222110 0000 00000 0111111 Q ss_pred eEEe----e-----eEec----cceEEEecC---Ccchh----hhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 235 WWVL----G-----REMH----ASRLLTIIT---RPLPD----MLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDK 294 (516) Q Consensus 235 y~v~----g-----~~iH----~SRli~~~~---~~~p~----~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~ 294 (516) |... . .-|| -.++..-.+ ...|+ +.+....-||.|..+.++..++............... T Consensus 203 ~~~v~~~~~~~~~~~sv~~e~~g~~i~~es~y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~ 282 (514) T protein:vir:80 203 YTVIEWQPTPNGKRCAVWHELEGKRVGPESSYPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFE 282 (514) T ss_pred EEEEEeecCCCCeEEEEEEeccceeecccCccccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1110 0 0122 223322222 12343 2344555789999999999999999888887776665 Q ss_pred hCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEec----ccCCHHHHHHHHHHHHHhhhcCC Q lcl|NC_019527. 295 FSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVNT----PLSGLADLQSQSQEHMCSVSKIP 370 (516) Q Consensus 295 ~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~~----~lsgl~d~~~~~~~~iaaas~IP 370 (516) +.-..+..+ +++ +.+.. .+.. ...|.. +.+..+++..+.. +|.-+...++...+.|.-++ T Consensus 283 a~~~~~~v~-------~~g--~~~~~-~l~~--~~~g~~-v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF--- 346 (514) T protein:vir:80 283 ALSLLNLVD-------EAK--GGAVD-DYRD--AETGDF-VPGQVGSVASYERGDYNKIAQASASVESIVMRLNRAF--- 346 (514) T ss_pred hcCCCceeC-------ccc--ccchh-hhcc--cCCcee-ecCCCccceeeecCcccchHHHHHHHHHHHHHHHHHH--- Confidence 554443321 111 11111 1111 112222 3344456655543 24445567777788887664 Q ss_pred ceeeeccccccccccchHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHhCCCcC---C-cceEEeC-CCCC Q lcl|NC_019527. 371 AIKLTGISPSGLNASSEGEIRSFYDD--------ISSVQQSYYFSPLDTMLKVIQLSKWGEID---D-AITFKFK-SLWQ 437 (516) Q Consensus 371 ~t~L~G~sp~Glnatge~D~~~yyd~--------I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~---~-d~~~~f~-pL~~ 437 (516) .++.....|-+=|.+ +++.=+.. +...|..+|.|++++.+.++.+...|.+| + -+.+++- +|.. T Consensus 347 --ml~~~~rd~~rvTAt-EV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~g~lP~~p~~l~~~~~vs~la~ 423 (514) T protein:vir:80 347 --MYTGQVRDAERVTVE-EIRTVAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNGGMLLGIAQGVYRPSIITGIPA 423 (514) T ss_pred --hhhccCCCCCCCCHH-HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCchhhcceeeecHHH Confidence 233322233322443 33322222 34578889999999999998876666654 3 2444443 3444 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHc-----CCCCHHHHHHHHHhhhccCCC--CC--Chhhhccccccchh--------c-C Q lcl|NC_019527. 438 TSAKEESEIRFNKAQEAQIYITN-----SVIDPSEARQQLSDDPDSGWD--NI--DGDLEIVQPEMFDD--------D-G 499 (516) Q Consensus 438 ~sekEkAei~~~~a~a~~~~~~~-----gvi~~~e~r~~l~~~~~~~~~--~~--d~~~e~~~~e~~~~--------e-~ 499 (516) +.-...++--...++.+..+.+. -.|+.+++.+.+...- |.+ .+ +++......+.... + . T Consensus 424 l~r~~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~--Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~ 501 (514) T protein:vir:80 424 LTRNIETANILRATQEASAIVPALVQLSKRFDPEKLVERIFANN--SVDLSTLSKDPDVVAAEAEQEAALAQQQLDVASG 501 (514) T ss_pred HHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHh--CCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44444444444445555555443 2478888887775432 222 11 11111111111100 0 0 Q ss_pred CCCCCCCCCCCCC Q lcl|NC_019527. 500 ADPYMPDPDVLPG 512 (516) Q Consensus 500 ~~~~~~~~~~~~~ 512 (516) ..-.+...+-+|+ T Consensus 502 ~~~~~~~~~~~~~ 514 (514) T protein:vir:80 502 ALAAETSAGVLTS 514 (514) T ss_pred HHHHhhhccccCC Confidence 0001111222232 No 239 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=28.35 E-value=1.8 Score=19.24 Aligned_cols=442 Identities=10% Similarity=0.026 Sum_probs=182.6 Q ss_pred cCCCcCCCCCChhhhHHHHhHHhhcCCC------ccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCcccc Q lcl|NC_019527. 21 ARAEEQEKARKLAMRRAVMKSMERRASD------AATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGLYAA 94 (516) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~ 94 (516) +...+.......- ..+.+..+...+.. ....|.-|.+.++ |+..+. .. ....+ T Consensus 1 m~~~~~~~~~~~~-~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~------------~~~~~~-------~~-~~~~~ 59 (535) T protein:vir:33 1 MADSKRTGLGEDG-AKATYDRLTNDRRAYETRAENCAQYTIPSLFPK------------ESDNES-------TD-YTTPW 59 (535) T ss_pred CChhhhhccChhH-HHHHHHHHHHHhhHHHHHHHHHHHHhcccccCC------------CCCccc-------cc-ccccc Confidence 3332222222211 12334444332221 1223333432211 110000 00 00001 Q ss_pred cccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCCeeeeccccchh---------hhH----HHHHHHHHHHHhcChhH Q lcl|NC_019527. 95 DIQPFPGYQNLAALATRPEYRAFASTLSTELTREGIEITSKDRTKAK---------EMA----SKIKELEEACEYYGVMG 161 (516) Q Consensus 95 ~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~~~~---------~~~----~~i~~i~~~~~~l~~~~ 161 (516) +.+ |-..+..++ +.+...+. |+ +.||.+...+....+ ... ...+.+...+.+-++.. T Consensus 60 dst---~~~a~~~La-a~l~~~lt--P~----~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~ 129 (535) T protein:vir:33 60 QAV---GARGLNNLA-SKLMLALF--PM----QSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRV 129 (535) T ss_pred ccc---HHHHHHHHH-HHHHHhhc--CC----CcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHH Confidence 111 112222221 12222222 43 468888654422111 111 12244556677779999 Q ss_pred HHHHHHHhcccceeeEEEEEecCCCc--ccCcccccccc---cccceeeEEeecceeeccc----ccccc---cc-cccc Q lcl|NC_019527. 162 IIQKAAEHDCFFGRGQISINIKGADV--SVPLILDPRTI---KKGSLTGFSNIEPMWTSPS----AYNAL---DP-TAPD 228 (516) Q Consensus 162 ~l~ea~~~~rlyG~a~i~i~i~~~~~--~~Pl~ld~~~I---~~g~l~~l~v~d~~~v~p~----~~~~~---dp-~s~~ 228 (516) .+.++++.-.+||.+.+++.-+.+.. -..+++..-.| ..|.+.. ++-+..++.. .+... +. .... T Consensus 130 ~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl~~~~v~~d~~G~vd~--i~r~~~~t~~ql~~~~~~~~~~~~~~k~~ 207 (535) T protein:vir:33 130 TLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRLSSYVVQRDAYGNVLQ--IVTRDQIAFGALPEDVRSAVEKSGGEKKM 207 (535) T ss_pred HHHHHHHHHHhhCceeEEeecCCCCceeeEEEEcCeeEEeeCCCCCeeE--EEeeEeecHHHHHHHhhhhhccccccccc Confidence 99999999999999988875332211 11112211111 2233322 2222222210 01000 00 0001 Q ss_pred ccCcceeEEe------e-eEecc----ceEEEecC----Ccchh----hhhhccCCCCchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 229 FYKPSTWWVL------G-REMHA----SRLLTIIT----RPLPD----MLKPAYNFSGISMSQLAQPYVENWLRTRQSVS 289 (516) Q Consensus 229 yg~P~~y~v~------g-~~iH~----SRli~~~~----~~~p~----~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~ 289 (516) +..++.|... + ..+|+ .++.-..+ +..|+ +.+....-||.|..+.++..++.......... T Consensus 208 ~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l 287 (535) T protein:vir:33 208 DEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIV 287 (535) T ss_pred ccCCeEEEEEEeeCCCCcEEEEEEEeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHH Confidence 1122222211 0 11111 11100001 11222 33445557899999999999999999999998 Q ss_pred HHHHHhCCceeeecchhhhcCccHHHHHHHHHHHHHhcCCcceEEEecCCcceeEEe----cccCCHHHHHHHHHHHHHh Q lcl|NC_019527. 290 DLVDKFSRTFLKTNMAQVLNGGEGGDVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVN----TPLSGLADLQSQSQEHMCS 365 (516) Q Consensus 290 ~Ll~~~~~~v~k~~~~~~l~~~~~~~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~----~~lsgl~d~~~~~~~~iaa 365 (516) ..+..+.-..+..+-.. +.++... . ....|. ++.+..+++..+. .+|.-....++...+.|.- T Consensus 288 ~~~~~~~~p~~lv~~~g---------~~~~~~~-~--~~~~g~-~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~ 354 (535) T protein:vir:33 288 KMSMISAKVIGLVNPAG---------ITQPRRL-T--KAQTGD-FVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSY 354 (535) T ss_pred HHHHHHhcCceeecccc---------ccchhhc-c--cCCcee-eecCCcccceeeecccccchhHHHHHHHHHHHHHHH Confidence 88887765554432111 1111111 1 112222 3333445555553 2355566777788888877 Q ss_pred hhcCCceeeeccccccccccchHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCcCC-cceEEeCCCC Q lcl|NC_019527. 366 VSKIPAIKLTGISPSGLNASSEGE-------IRSFYDDISSVQQSYYFSPLDTMLKVIQLSK-WGEIDD-AITFKFKSLW 436 (516) Q Consensus 366 as~IP~t~L~G~sp~Glnatge~D-------~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~-~g~~~~-d~~~~f~pL~ 436 (516) ++=+- .+++ ..+-+=|..+= ....--.+..+|...|.|++++++.++.+.. +-.+|+ .++++|.+.. T Consensus 355 af~~~---~~~~-~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~L 430 (535) T protein:vir:33 355 AFMLN---SAVQ-RTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGL 430 (535) T ss_pred HHhhh---hccc-CCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHH Confidence 65111 1111 11222233321 2233344556788899999999999887642 222333 5777776432 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHc-----C-CCCHHHHHHHHHhhhccCCCC--C---Chhhhccccccchhc------- Q lcl|NC_019527. 437 QTSAKEESEIRFNKAQEAQIYITN-----S-VIDPSEARQQLSDDPDSGWDN--I---DGDLEIVQPEMFDDD------- 498 (516) Q Consensus 437 ~~sekEkAei~~~~a~a~~~~~~~-----g-vi~~~e~r~~l~~~~~~~~~~--~---d~~~e~~~~e~~~~e------- 498 (516) ....|..-..+..+.+..+.+. . .|+.+++.+.+...- |.+. + +++......+..... T Consensus 431 --a~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~--Gvp~~~i~~~~ee~~~~~~q~~~~~~~~~~~~ 506 (535) T protein:vir:33 431 --EAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAI--GIDTSGILLTDEQKQALMMQDAAQTGVENAAA 506 (535) T ss_pred --HHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHc--CCCHhHhcCCHHHHHHHHHHHHHHHHHHHHHH Confidence 2233322222222333333222 1 367777776665421 2221 1 111111111100000 Q ss_pred --C---CCC-----CC-CCCCCCCCCCCC Q lcl|NC_019527. 499 --G---ADP-----YM-PDPDVLPGEEGS 516 (516) Q Consensus 499 --~---~~~-----~~-~~~~~~~~~e~t 516 (516) + ..+ ++ -.-.+.-|.++| T Consensus 507 ~~g~~~~~~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 507 AGGAGVGALATSSPEAMQGAAAKAGLNAT 535 (535) T ss_pred hhhhhhcchhhcCChhHHHHHHhccCCCC Confidence 0 000 00 011223445555 No 240 >protein:vir:4073 Length: 279 # NCBI annotation: minor structural protein # Family: family:all:11744 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043552;genbank:gi:9628686;genbank:GeneID:1261159 Probab=27.51 E-value=1.8 Score=19.14 Aligned_cols=245 Identities=18% Similarity=0.190 Sum_probs=90.9 Q ss_pred ccCcccccccccccceeeEEeecceee--c------cccccccccccccccCc-ceeEEeeeE---------------ec Q lcl|NC_019527. 188 SVPLILDPRTIKKGSLTGFSNIEPMWT--S------PSAYNALDPTAPDFYKP-STWWVLGRE---------------MH 243 (516) Q Consensus 188 ~~Pl~ld~~~I~~g~l~~l~v~d~~~v--~------p~~~~~~dp~s~~yg~P-~~y~v~g~~---------------iH 243 (516) ..-++++. +-...++.-|.|-||..- - .+.+.+.|-....-.+- -+|-.+|++ |+ T Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (279) T protein:vir:40 1 MSLFNLSR-RAEDVSFSTFTVQDPTTDLLLGKLLGLVSYFDNVDYSEASKLEDLFYWALQGKEVYRVWYGGFKYYAQRVN 79 (279) T ss_pred Ccccccch-hhcccceeeeeecCcchhHHHHHHHHHHHHhhcccchhhhhhhhhhhhhhccceeehhhhhhHHHHHhhcC Confidence 11111211 112223333333333210 0 01111111110000000 122233322 22 Q ss_pred cceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHH------HHHHHHHHHH-HHHHhCCc-eeeecchhhhcCccHHH Q lcl|NC_019527. 244 ASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVEN------WLRTRQSVSD-LVDKFSRT-FLKTNMAQVLNGGEGGD 315 (516) Q Consensus 244 ~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~------~~~~~~~~~~-Ll~~~~~~-v~k~~~~~~l~~~~~~~ 315 (516) .+..-+..-.+ ......-.+.-.++++..+-.+.. ++-+.++++. |=...+++ ++|++....+..- -++ T Consensus 80 ~d~fn~~vr~~--~~~~vtVP~~Dv~IieNPlv~v~~ee~~kM~~la~nai~~KLD~~~qIk~fIKTd~d~glee~-kek 156 (279) T protein:vir:40 80 ADQFNIVVREP--NRREVTIRTNDYEMLLNPFYGANPQRFGVMFGMASNGIGRRLDSQAQIKIYWKTKVSSGLKEV-WDR 156 (279) T ss_pred cchhhhheecC--CcceeEeecchhhhhhcchheeccchhhHHHHHHHhhhhhhhcccceeeeEEecCcchhHHHH-HHH Confidence 22111110000 000000111222233222111000 0001112222 21223343 4566543222111 123 Q ss_pred HHHHHHHHHHhcCC-cceEEEecCCcceeEEecccCC-HHHHHHHHHHHHHhhhcCCceeeeccccccccccchHHHHHH Q lcl|NC_019527. 316 VFDRVEMYVNMQSN-LGLAVMDFDSEDIVQVNTPLSG-LADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGEIRSF 393 (516) Q Consensus 316 l~~r~~~~~~~~sn-~g~~~id~~~e~~e~~~~~lsg-l~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D~~~y 393 (516) .+.|++.+..+-.+ .|+..++.+ |++.+++-+.++ +.+-++....++....+||-.+|.|++ .|..+.+| T Consensus 157 aR~rIk~mlalAk~~nGityid~~-ddItQL~kDYStslk~die~lkS~l~Sq~GinekIL~GsA-------tE~q~iAy 228 (279) T protein:vir:40 157 IRERLTQQQQLAREFNGVSVIGSD-DDIKQIQPDYSGSLQNDANLAIEIALSEYGMPRELLYGQS-------NEVTIIAF 228 (279) T ss_pred HHHHHHHHHHHHHhcCCeeeecCC-ceeEeeccccccccHHHHHHHHHHHHhhcCCchhhccccC-------chhhhhhH Confidence 44455444443333 688888875 999999988875 566788899999999999999999964 46677878 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHH Q lcl|NC_019527. 394 YDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDDAITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLS 473 (516) Q Consensus 394 yd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~ 473 (516) |.+ .+.|+|+.+.+-|.++. ++-+.|- +-|.+ .|+|......+.- T Consensus 229 y~r-------tVePILkQyek~liY~~------E~fv~y~---ttta~------------------gg~~~s~~~~~~~- 273 (279) T protein:vir:40 229 AIQ-------KVLPLLKQHDKNIIFNQ------ENFVAYI---STTAK------------------GGAIESKSSKRDS- 273 (279) T ss_pred HHh-------hHHHHHHHhcccccchh------hhhhhhh---eeccc------------------CcccccccccccC- Confidence 754 35576666555333221 1111110 00000 0000000000000 Q ss_pred hhhccCC Q lcl|NC_019527. 474 DDPDSGW 480 (516) Q Consensus 474 ~~~~~~~ 480 (516) .+.+.- T Consensus 274 -~~~~~~ 279 (279) T protein:vir:40 274 -EPVGND 279 (279) T ss_pred -CCCCCC Confidence 000000 No 241 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=23.54 E-value=2.3 Score=18.61 Aligned_cols=420 Identities=10% Similarity=-0.001 Sum_probs=168.1 Q ss_pred cCCCccchhccccc-------cc---c-------hhhhcccccCCcccccccCcccHHHHHHHHhCchhhhhhhhhhHHH Q lcl|NC_019527. 63 VPAGTTPAVAMDSL-------CG---P-------TYQFLNSAAGGLYAADIQPFPGYQNLAALATRPEYRAFASTLSTEL 125 (516) Q Consensus 63 ~~~~~~~~~a~ds~-------~~---~-------~~~~~~~~~~~~~~~~~~~f~gy~ll~~y~~~~i~r~iVd~~aed~ 125 (516) -.+ +-.+|-... .+ . ...+............. +...... ....+-.++++.|..+ T Consensus 1 ~~~--~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~----~~~~~~~--~dst~~~a~~~Las~l 72 (522) T protein:vir:94 1 MAE--REGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNS----STEYTTP--WQAVGARCLNNLAAKL 72 (522) T ss_pred Ccc--cchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcc----ccccccc--ccccHHHHHHHHHHHH Confidence 111 111211110 00 0 00000000000000000 0000000 0112222222222222 Q ss_pred h------hCCCeeeecccc---------chhhhHHH----HHHHHHHHHhcChhHHHHHHHHhcccceeeEEEEEecCCC Q lcl|NC_019527. 126 T------REGIEITSKDRT---------KAKEMASK----IKELEEACEYYGVMGIIQKAAEHDCFFGRGQISINIKGAD 186 (516) Q Consensus 126 ~------r~~~~i~~~~~~---------~~~~~~~~----i~~i~~~~~~l~~~~~l~ea~~~~rlyG~a~i~i~i~~~~ 186 (516) + +.||.+...+.. .......+ .+.+...+.+-++...+.++++.--.||.+.+++.-+..+ T Consensus 73 ~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~ 152 (522) T protein:vir:94 73 MLALFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQG 152 (522) T ss_pred HhhcCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCCC Confidence 2 479998754311 11111122 2445556667799999999999988999998887543221 Q ss_pred -c----ccCcccccccc---cccceeeEEeecceeecc-----cccc--ccccccccccCcceeE-Ee----eeEeccc- Q lcl|NC_019527. 187 -V----SVPLILDPRTI---KKGSLTGFSNIEPMWTSP-----SAYN--ALDPTAPDFYKPSTWW-VL----GREMHAS- 245 (516) Q Consensus 187 -~----~~Pl~ld~~~I---~~g~l~~l~v~d~~~v~p-----~~~~--~~dp~s~~yg~P~~y~-v~----g~~iH~S- 245 (516) . ..|+. .-.+ ..|.+..+ +-+..++. .... ..|...|+ ..-+.|. |. ...+|++ T Consensus 153 ~~~~~~~~pl~--~y~v~~d~~G~vd~i--~r~~~~~~~~l~~~~~~~~~~~~~~p~-~~v~v~~~v~~~~~~~~~~~~~ 227 (522) T protein:vir:94 153 TYSPMRMYRLV--SYVVQRDAFGNILQI--VTIDKVAFSALPEDVKSQLNADDYEPD-TELEVYTHIYRQDDEYLRYEEV 227 (522) T ss_pred ceeeEEEEEcc--eEEEeeCCCcCeEEE--eeeeeccHHhcchHHHHHHhcccCCcc-ceEEEEEEEEeeCCceeEEeec Confidence 1 11221 1011 11222111 11111111 0000 01111110 1111111 10 0111211 Q ss_pred --eEE-EecC----Ccchh----hhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeecchhhhcCccHH Q lcl|NC_019527. 246 --RLL-TIIT----RPLPD----MLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDKFSRTFLKTNMAQVLNGGEGG 314 (516) Q Consensus 246 --Rli-~~~~----~~~p~----~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~~~~~v~k~~~~~~l~~~~~~ 314 (516) ..+ --.+ ...|+ +.+....-||.|..+.++..++............+..+.-..+..+-...+ T Consensus 228 ~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~------ 301 (522) T protein:vir:94 228 EGIEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGIT------ 301 (522) T ss_pred cCceecccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccc------ Confidence 111 1111 11222 344555678999999999999999999999998888877666544211111 Q ss_pred HHHHHHHHHHHhcCCcceEEEecCCcceeEEe----cccCCHHHHHHHHHHHHHhhhcCCceeeeccccccccccchHH- Q lcl|NC_019527. 315 DVFDRVEMYVNMQSNLGLAVMDFDSEDIVQVN----TPLSGLADLQSQSQEHMCSVSKIPAIKLTGISPSGLNASSEGE- 389 (516) Q Consensus 315 ~l~~r~~~~~~~~sn~g~~~id~~~e~~e~~~----~~lsgl~d~~~~~~~~iaaas~IP~t~L~G~sp~Glnatge~D- 389 (516) +.... .....|. ++.+..+++..+. .+|.-....++...+.|..++=+- .+++ ..+-+=|..+= T Consensus 302 ---~~~~~---~~~~~g~-~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~---~~~~-~~~~r~TAtEV~ 370 (522) T protein:vir:94 302 ---QPRRL---NKAATGE-FVAGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLN---SAVQ-RNAERVTAEEIR 370 (522) T ss_pred ---cchhe---eccCCce-eecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhh---hhcc-CCCccccHHHHH Confidence 11100 1112222 2334445555443 235556677777778887766221 1221 22222233321 Q ss_pred ------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCcCC-cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHc- Q lcl|NC_019527. 390 ------IRSFYDDISSVQQSYYFSPLDTMLKVIQLSK-WGEIDD-AITFKFKSLWQTSAKEESEIRFNKAQEAQIYITN- 460 (516) Q Consensus 390 ------~~~yyd~I~~~Qe~~l~p~l~~l~~~l~~s~-~g~~~~-d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~- 460 (516) ....--.+..+|..+|.|++++++.++.+.. +-.+|+ .+++++.+. +....|+.-..+..+.++.+.+. T Consensus 371 ~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~v~~~s~--La~~qr~~~~~~l~~~~~~ia~l~ 448 (522) T protein:vir:94 371 YVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTG--LEALGRGQDLEKLTQAVNMMTGLQ 448 (522) T ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcccEEeeEecH--HHHHHHHHHHHHHHHHHHHHHhcc Confidence 2223344556778889999999999887643 223443 467776543 22333322222222222222221 Q ss_pred -----CCCCHHHHHHHHHhhhccCC--CCC---Chhhhccccccchhc----CCCC-CCCCCCCCCCCCCC Q lcl|NC_019527. 461 -----SVIDPSEARQQLSDDPDSGW--DNI---DGDLEIVQPEMFDDD----GADP-YMPDPDVLPGEEGS 516 (516) Q Consensus 461 -----gvi~~~e~r~~l~~~~~~~~--~~~---d~~~e~~~~e~~~~e----~~~~-~~~~~~~~~~~e~t 516 (516) --|+.+++.+.+...- |. +.+ +++......+....+ .... .+......+.+.++ T Consensus 449 P~~~~~~id~d~~~~~~a~~~--Gv~~~~ivr~~ee~~~~~~q~~~~~~~~~~~~~~~~~~~a~~~~~~~~ 517 (522) T protein:vir:94 449 PLSQDPDINLPTLKLRLLNAL--GIDTAGLLLTQDEKIQRMAEQSSQQAVVQGASAAGANMGAAVGQGAGE 517 (522) T ss_pred chhhhhcCCHHHHHHHHHHHc--CCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccch Confidence 1367777776665431 22 111 111111110000000 0000 00000001111111 No 242 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=20.25 E-value=2.8 Score=18.13 Aligned_cols=419 Identities=12% Similarity=0.123 Sum_probs=164.0 Q ss_pred hcccccccccCCCcCCCCCChhhhHHHHhHHhhcCCCccccccCCCCCCCccCCCccchhcccccccchhhhcccccCCc Q lcl|NC_019527. 12 EVADKLADAARAEEQEKARKLAMRRAVMKSMERRASDAATKWAPPQLMPGVVPAGTTPAVAMDSLCGPTYQFLNSAAGGL 91 (516) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~~~~~~a~ds~~~~~~~~~~~~~~~~ 91 (516) -.....+....|+..+ ...+++++...-+. ..-||+.+.+. +| .. T Consensus 1 ma~~~lr~~rrpk~~p---~~~r~~al~aas~~------------i~~p~~~~~ks----~~----------------~~ 45 (629) T protein:vir:86 1 MAPTSLRIVRRPKSEP---VSTRQRALVAASQP------------VENPGKAFRKA----MG----------------SS 45 (629) T ss_pred CCccceeeeecCCCCC---hhhhhhhhhhhhhc------------cccccchhhhh----cC----------------CC Confidence 0111111111111100 00111222221111 11122222110 00 00 Q ss_pred ccccccCcccHHHH--HHHHhCchhhhhhhhhhHHHhhCCCeeeecccc---------chhhhHHHHHHHHHHHH--hcC Q lcl|NC_019527. 92 YAADIQPFPGYQNL--AALATRPEYRAFASTLSTELTREGIEITSKDRT---------KAKEMASKIKELEEACE--YYG 158 (516) Q Consensus 92 ~~~~~~~f~gy~ll--~~y~~~~i~r~iVd~~aed~~r~~~~i~~~~~~---------~~~~~~~~i~~i~~~~~--~l~ 158 (516) ...+ +|.. +.|..-++.|-.|.-.+.-|-|+-.-..-.+.+ +.+-...+++++-..+- .++ T Consensus 46 ~~~~------WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~rL~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lg 119 (629) T protein:vir:86 46 TRTD------WQEDAWKAYDAVGELRYYVGWRSSSASRVRLIASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALG 119 (629) T ss_pred chhh------hhHHHHHHHHhhhhHHHHhhhhhhhhceeeeEeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhh Confidence 0001 1221 112223444444444444343333322222100 01111223333333222 446 Q ss_pred hhHHHHHHHHhcccceeeEEEEEecCCCcc--cCc------ccccccccccceeeEEeecceeecccccccccccccccc Q lcl|NC_019527. 159 VMGIIQKAAEHDCFFGRGQISINIKGADVS--VPL------ILDPRTIKKGSLTGFSNIEPMWTSPSAYNALDPTAPDFY 230 (516) Q Consensus 159 ~~~~l~ea~~~~rlyG~a~i~i~i~~~~~~--~Pl------~ld~~~I~~g~l~~l~v~d~~~v~p~~~~~~dp~s~~yg 230 (516) ..+.++.+...--+-|.++|++....++.. .+- -++++.|+ .+..+.. ..-|+ | T Consensus 120 qa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~~eW~~vt~~ei~-~~~~~~~----------------i~lP~-g 181 (629) T protein:vir:86 120 QAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPVPEWLALTPEEVR-ASEKKTI----------------IELPT-G 181 (629) T ss_pred HHHHHHHHHhheecccceEEEEeecCCCccCCCCcchhhheeechHHhh-hccCcee----------------eEcCC-C Confidence 788888888888999999998876432211 110 01111121 1111110 00111 1 Q ss_pred CcceeEEeeeEeccceEEEecCCcchhhhhhccCCCCchHHHHHHHHHHHHHHHHHHHHHHHHH----hCCceeee--cc Q lcl|NC_019527. 231 KPSTWWVLGREMHASRLLTIITRPLPDMLKPAYNFSGISMSQLAQPYVENWLRTRQSVSDLVDK----FSRTFLKT--NM 304 (516) Q Consensus 231 ~P~~y~v~g~~iH~SRli~~~~~~~p~~~k~~~~~~G~S~le~~~~~l~~~~~~~~~~~~Ll~~----~~~~v~k~--~~ 304 (516) .+ .+.+.+.. =|+++ =+| ++.+...-.|.+..|.+.|+-...+...+..-.+. .++.++-- .+ T Consensus 182 ~~-~e~~~~~d----~l~Ri-W~P-----~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaakSRL~gnGvlflP~e~sl 250 (629) T protein:vir:86 182 DK-HEFRDGLD----GMFRV-WNP-----RARRAREPDSPVRANLDSLKEIVRTTKTIANASKSRLIGNGVVFVPHEMSL 250 (629) T ss_pred Cc-ceeeCCCc----eEEEe-eCC-----CcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHHHHhhCceeeeccCccc Confidence 11 11122221 11111 111 34555667888888888888777776665443322 22322211 11 Q ss_pred hh----------------hhcCccHHHHHHHHHHH----HHhcCCcce---EEEecCCcceeEEe-ccc-CCHHHHH--- Q lcl|NC_019527. 305 AQ----------------VLNGGEGGDVFDRVEMY----VNMQSNLGL---AVMDFDSEDIVQVN-TPL-SGLADLQ--- 356 (516) Q Consensus 305 ~~----------------~l~~~~~~~l~~r~~~~----~~~~sn~g~---~~id~~~e~~e~~~-~~l-sgl~d~~--- 356 (516) -. .+..+-..+|..-+... ..-.+.... +++-..+|-++.++ ..| +.++++. T Consensus 251 P~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~i~~i~hlkf~~ei~e~aikt 330 (629) T protein:vir:86 251 PSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGELIKNVTHLKFDNQVTEVAIKT 330 (629) T ss_pred CccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCeeEEeecCchhHHHHhh Confidence 11 11111112333332211 111221111 23333445444443 223 3566543 Q ss_pred -HHHHHHHHhhhcCCceeeeccccccc----cccchHHHHH----H-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCC Q lcl|NC_019527. 357 -SQSQEHMCSVSKIPAIKLTGISPSGL----NASSEGEIRS----F-YDDISSVQQSYYFSPLDTMLKVIQLSKWGEIDD 426 (516) Q Consensus 357 -~~~~~~iaaas~IP~t~L~G~sp~Gl----natge~D~~~----y-yd~I~~~Qe~~l~p~l~~l~~~l~~s~~g~~~~ 426 (516) +-.-..+|...+||--+|+|.-..+. =.-+++|++- + -=.|+++-.++|+|.|+.. |. ++ T Consensus 331 R~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp~Le~e---------Gi-Dp 400 (629) T protein:vir:86 331 RNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQVLRTVLMRE---------GI-DP 400 (629) T ss_pred HHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHhhHHHHHHHHh---------CC-CH Confidence 34445799999999999999710110 0124555443 1 1223444455555555432 44 44 Q ss_pred cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhhhccCCCCCChh--------hhccccc----- Q lcl|NC_019527. 427 AITFKFKSLWQTSAKEESEIRFNKAQEAQIYITNSVIDPSEARQQLSDDPDSGWDNIDGD--------LEIVQPE----- 493 (516) Q Consensus 427 d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~gvi~~~e~r~~l~~~~~~~~~~~d~~--------~e~~~~e----- 493 (516) +=++.|-+-.+++..- .+.+.+..+++.|+|+.+..|++|+-..+.||.-...+ .-...++ T Consensus 401 ~kYvvW~DaS~Lt~dP------d~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~Li~~~ 474 (629) T protein:vir:86 401 NAYVVWHDASQLTVDP------DKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPNLLPTL 474 (629) T ss_pred HHhEeeecCcccccCC------CCcHHHHHHHHcCCcCHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcchhhhh Confidence 3335555444443211 12233455799999999999999988887777422111 0000111 Q ss_pred -----------cch--------------hc-CCCCCCCCCCCC--CCCCCC Q lcl|NC_019527. 494 -----------MFD--------------DD-GADPYMPDPDVL--PGEEGS 516 (516) Q Consensus 494 -----------~~~--------------~e-~~~~~~~~~~~~--~~~e~t 516 (516) .+. +| ...+.+.+++++ ++..++ T Consensus 475 a~l~~~~a~~~~P~~~~~~pp~~e~~~~dE~sga~~~~ep~te~d~~~~~a 525 (629) T protein:vir:86 475 AVLIPELADVEFPTPTVALPPAEEQDGDEEASGASRREEPDTEDDAGTDDS 525 (629) T ss_pred hhhhhhhcccccCccCCCCCccccCCCcccccCCCcCCCCCCCCCCccccc Confidence 000 00 011122222222 222222 Done!